VacancySBERT: the approach for representation of titles and skills for semantic similarity search in the recruitment domain

07/31/2023
by   Maiia Bocharova, et al.
0

The paper focuses on deep learning semantic search algorithms applied in the HR domain. The aim of the article is developing a novel approach to training a Siamese network to link the skills mentioned in the job ad with the title. It has been shown that the title normalization process can be based either on classification or similarity comparison approaches. While classification algorithms strive to classify a sample into predefined set of categories, similarity search algorithms take a more flexible approach, since they are designed to find samples that are similar to a given query sample, without requiring pre-defined classes and labels. In this article semantic similarity search to find candidates for title normalization has been used. A pre-trained language model has been adapted while teaching it to match titles and skills based on co-occurrence information. For the purpose of this research fifty billion title-descriptions pairs had been collected for training the model and thirty three thousand title-description-normalized title triplets, where normalized job title was picked up manually by job ad creator for testing purposes. As baselines FastText, BERT, SentenceBert and JobBert have been used. As a metric of the accuracy of the designed algorithm is Recall in top one, five and ten model's suggestions. It has been shown that the novel training objective lets it achieve significant improvement in comparison to other generic and specific text encoders. Two settings with treating titles as standalone strings, and with included skills as additional features during inference have been used and the results have been compared in this article. Improvements by 10 VacancySBERT (with skills) respectively. The benchmark has been developed as open-source to foster further research in the area.

READ FULL TEXT
research
09/20/2021

JobBERT: Understanding Job Titles through Skills

Job titles form a cornerstone of today's human resources (HR) processes....
research
07/01/2022

Learning Job Titles Similarity from Noisy Skill Labels

Measuring semantic similarity between job titles is an essential functio...
research
12/21/2021

Predicting Job Titles from Job Descriptions with Multi-label Text Classification

Finding a suitable job and hunting for eligible candidates are important...
research
06/11/2019

Innovating HR Using an Expert System for Recruiting IT Specialists -- ESRIT

One of the most rapidly evolving and dynamic business sector is the IT d...
research
03/31/2023

JobHam-place with smart recommend job options and candidate filtering options

Due to the increasing number of graduates, many applicants experience th...
research
03/21/2022

Semantic Similarity Computing for Scientific Academic Conferences fused with domain features

Aiming at the problem that the current general-purpose semantic text sim...
research
04/17/2023

SkillGPT: a RESTful API service for skill extraction and standardization using a Large Language Model

We present SkillGPT, a tool for skill extraction and standardization (SE...

Please sign up or login with your details

Forgot password? Click here to reset