Enhancing the Demand for Labour survey by including skills from online job advertisements using model-assisted calibration

by Maciej Beręsewicz, et al.

In this article we describe an enhancement to the Demand for Labour (DL) survey conducted by Statistics Poland that incorporates skills obtained from online job advertisements. The main goal is to provide estimates of the demand for skills (competences), which are missing from the DL survey. To achieve this, we apply a data integration approach that combines traditional calibration with a LASSO-assisted approach to correct representation error in the online data. Because we lack access to unit-level data from the DL survey, we use estimated population totals and propose a bootstrap approach that accounts for the uncertainty of the totals reported by Statistics Poland. We show that the LASSO-assisted calibration estimator outperforms traditional calibration in terms of standard errors and reduces representation bias in skills observed in online job ads. Our empirical results show that online data significantly overestimate interpersonal, managerial and self-organization skills while underestimating technical and physical skills. This is mainly due to the under-representation of occupations categorised as Craft and Related Trades Workers and Plant and Machine Operators and Assemblers.
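
To illustrate the idea of calibrating online job ads to known totals, here is a minimal sketch of plain linear (chi-square distance) calibration in Python. It is not the LASSO-assisted estimator or the bootstrap procedure described above. The function name `linear_calibration`, the toy data, the population totals and the skill indicator are all assumptions made up for illustration.

```python
import numpy as np

def linear_calibration(d, X, totals):
    """Return weights w = d * (1 + X @ lam) such that X.T @ w equals totals."""
    d = np.asarray(d, dtype=float)
    X = np.asarray(X, dtype=float)
    totals = np.asarray(totals, dtype=float)
    # Weighted cross-product matrix: sum_i d_i * x_i x_i^T
    M = X.T @ (d[:, None] * X)
    # Gap between the target totals and the totals implied by the initial weights
    gap = totals - X.T @ d
    lam = np.linalg.solve(M, gap)
    return d * (1.0 + X @ lam)

# Toy data (made up): 6 ads; columns are an intercept and an indicator
# for a hypothetical occupation group that is over-represented online.
X = np.array([[1, 1], [1, 1], [1, 1], [1, 1], [1, 0], [1, 0]], dtype=float)
d = np.ones(6)                      # initial weights (assumed equal)
totals = np.array([100.0, 40.0])    # assumed population size and group total

w = linear_calibration(d, X, totals)

# Hypothetical indicator of whether each ad mentions a given skill
skill = np.array([1, 0, 1, 1, 0, 0], dtype=float)
naive_share = skill.mean()
calibrated_share = (w @ skill) / w.sum()
print(naive_share, calibrated_share)
```

In this toy example the skill appears only in ads from the over-represented group, so the unweighted share exceeds the calibrated one. In the setting described above, the totals themselves are estimates, which is why their uncertainty has to be propagated separately.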


Adaptively selecting occupations to detect skill shortages from online job ads

This research develops a data-driven method to generate sets of highly s...

Estimating the number of entities with vacancies using administrative and online data

In this article we describe a study aimed at estimating job vacancy stat...

Practical Skills Demand Forecasting via Representation Learning of Temporal Dynamics

Rapid technological innovation threatens to leave much of the global wor...

Job Transitions in a Time of Automation and Labor Market Crises

Job security can never be taken for granted, especially in times of rapi...

What Skills do IT Companies look for in New Developers? A Study with Stack Overflow Jobs

Context: There is a growing demand for information on how IT companies l...

Survey data integration for regression analysis using model calibration

We consider regression analysis in the context of data integration. To c...