Papers
Topics
Authors
Recent
Gemini 2.5 Flash
Gemini 2.5 Flash
97 tokens/sec
GPT-4o
53 tokens/sec
Gemini 2.5 Pro Pro
43 tokens/sec
o3 Pro
4 tokens/sec
GPT-4.1 Pro
47 tokens/sec
DeepSeek R1 via Azure Pro
28 tokens/sec
2000 character limit reached

Leveraging the Inherent Hierarchy of Vacancy Titles for Automated Job Ontology Expansion (2004.02814v1)

Published 6 Apr 2020 in cs.CL and cs.LG

Abstract: Machine learning plays an ever-bigger part in online recruitment, powering intelligent matchmaking and job recommendations across many of the world's largest job platforms. However, the main text is rarely enough to fully understand a job posting: more often than not, much of the required information is condensed into the job title. Several organised efforts have been made to map job titles onto a hand-made knowledge base as to provide this information, but these only cover around 60\% of online vacancies. We introduce a novel, purely data-driven approach towards the detection of new job titles. Our method is conceptually simple, extremely efficient and competitive with traditional NER-based approaches. Although the standalone application of our method does not outperform a finetuned BERT model, it can be applied as a preprocessing step as well, substantially boosting accuracy across several architectures.

User Edit Pencil Streamline Icon: https://streamlinehq.com
Authors (3)
  1. Jeroen Van Hautte (9 papers)
  2. Vincent Schelstraete (2 papers)
  3. Mikaƫl Wornoo (1 paper)
Citations (4)

Summary

We haven't generated a summary for this paper yet.