I was recently researching text mining and language processing techniques for extracting job skills from job postings and resume data. The input is a free-text corpus, and the expected output is the set of desired skills for a given job profile.
I decided to document all my research as a paper, with the technical details that might be useful to anyone researching a similar problem. So here it is:
Direct link to the paper: https://confusedcoders.com/wp-content/uploads/2019/09/Job-Skills-extraction-with-LSTM-and-Word-Embeddings-Nikita-Sharma.pdf
The results of the exercise were very promising, and I was able to extend the model to various job categories. The technique is also able to identify new and emerging skill sets rather than being limited to a known set of skills.
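To make the "not limited to a known set of skills" point a little more concrete, here is a minimal sketch of one common way to frame the problem: token-level sequence labelling, where a model tags each word and skill phrases are read off the tags. This framing, the `B-SKILL`/`I-SKILL`/`O` tag names, and the `decode_skills` helper are my assumptions for illustration and may differ from the setup in the paper. The tags below are hand-written stand-ins for what a trained tagger (for example, an LSTM over word embeddings) might predict.

```python
from typing import List


def decode_skills(tokens: List[str], tags: List[str]) -> List[str]:
    """Group contiguous B-SKILL/I-SKILL tokens into skill phrases.

    Hypothetical helper: the tag scheme is an assumption, not taken
    from the paper.
    """
    skills: List[str] = []
    current: List[str] = []
    for token, tag in zip(tokens, tags):
        if tag == "B-SKILL":
            # A new skill starts; close any phrase still open.
            if current:
                skills.append(" ".join(current))
            current = [token]
        elif tag == "I-SKILL" and current:
            # Continue the phrase that is currently open.
            current.append(token)
        else:
            # Any other tag (or a stray I-SKILL) ends the open phrase.
            if current:
                skills.append(" ".join(current))
            current = []
    if current:
        skills.append(" ".join(current))
    return skills


# Hand-written example; a real model would predict the tags.
tokens = "Experience with Python and machine learning required".split()
tags = ["O", "O", "B-SKILL", "O", "B-SKILL", "I-SKILL", "O"]
print(decode_skills(tokens, tags))  # ['Python', 'machine learning']
```

Because the phrases come from the predicted tags rather than from a lookup table, a tagger of this kind can in principle surface a skill it has never seen listed, as long as the surrounding context looks like a skill mention.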
Sample of skills extracted from a Software Engineering job post:
Same model extended to a Civil Engineering job post:
Check out my portfolio here: https://confusedcoders.com/nikita-sharma-greenhorn-data-science-student
I am a greenhorn Data Science student with an interest in finding patterns in data. My language of choice is Python, and I am starting to get my hands dirty with R.
I blog on Medium.com and ConfusedCoders.com. I share my code on Github.com.