Comparative Analysis of Encoder-Based NER and Large Language Models for Skill Extraction from Russian Job Vacancies (2407.19816v2)

Published 29 Jul 2024 in cs.CL

Abstract: The labor market is undergoing rapid changes, with increasing demands on job seekers and a surge in job openings. Identifying essential skills and competencies from job descriptions is challenging due to varying employer requirements and the omission of key skills. This study addresses these challenges by comparing traditional encoder-based Named Entity Recognition (NER) methods with LLMs for extracting skills from Russian job vacancies. Using a labeled dataset of 4,000 job vacancies for training and 1,472 for testing, the performance of both approaches is evaluated. Results indicate that traditional NER models, especially the fine-tuned DeepPavlov RuBERT NER model, outperform LLMs across various metrics, including accuracy, precision, recall, and inference time. The findings suggest that traditional NER models provide more effective and efficient solutions for skill extraction, enhancing job requirement clarity and aiding job seekers in aligning their qualifications with employer expectations. This research contributes to the field of NLP and its application in the labor market, particularly in non-English contexts.
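The comparison of NER models and LLMs on precision and recall implies an entity-level evaluation of the extracted skill spans. As a minimal illustration (not the paper's actual evaluation code), exact-match precision, recall, and F1 over predicted versus gold skill spans can be computed as follows; the span format and example data are assumptions for the sketch:

```python
# Minimal sketch of entity-level evaluation for skill extraction.
# Each span is a (start, end, label) tuple; exact-match scoring only.
# Not the paper's code -- an illustration of the metric type reported.

def span_prf(gold, pred):
    """Return exact-match precision, recall, and F1 over entity spans."""
    gold, pred = set(gold), set(pred)
    tp = len(gold & pred)  # spans predicted exactly right
    precision = tp / len(pred) if pred else 0.0
    recall = tp / len(gold) if gold else 0.0
    f1 = (2 * precision * recall / (precision + recall)
          if precision + recall else 0.0)
    return precision, recall, f1

# Hypothetical gold annotations vs. model output for one vacancy
gold = [(0, 6, "SKILL"), (10, 16, "SKILL"), (20, 30, "SKILL")]
pred = [(0, 6, "SKILL"), (10, 16, "SKILL"), (40, 45, "SKILL")]
p, r, f = span_prf(gold, pred)
print(round(p, 2), round(r, 2), round(f, 2))  # → 0.67 0.67 0.67
```

Exact-match scoring is strict: a predicted span with one token of boundary error counts as both a false positive and a false negative, which is one reason inference-time-cheap encoder models can still dominate on these metrics.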

Authors (7)
  1. Nikita Matkin (2 papers)
  2. Aleksei Smirnov (2 papers)
  3. Mikhail Usanin (1 paper)
  4. Egor Ivanov (1 paper)
  5. Kirill Sobyanin (1 paper)
  6. Sofiia Paklina (1 paper)
  7. Petr Parshakov (3 papers)