Survey vs Scraped Data: Comparing Time Series Properties of Web and Survey Vacancy Data

Loading...
Thumbnail Image
Date
2019
Volume
8
Issue
1
Journal
Series Titel
Book Title
Publisher
Berlin : Springer Nature
Abstract

This paper studies the relationship between a vacancy population obtained from web crawling and vacancies in the economy inferred by a National Statistics Office (NSO) using a traditional method. We compare the time series properties of samples obtained between 2007 and 2014 by Statistics Netherlands and by a web scraping company. We find that the web and NSO vacancy data present similar time series properties, suggesting that both time series are generated by the same underlying phenomenon: the real number of new vacancies in the economy. We conclude that, in our case study, web-sourced data are able to capture aggregate economic activity in the labor market.

Description
Keywords
data collection, Labor demand, statistical inference, time series, vacancies, web crawling
Citation
De Pedraza, P., Visintin, S., Tijdens, K., & Kismihók, G. (2019). Survey vs Scraped Data: Comparing Time Series Properties of Web and Survey Vacancy Data. 8(1). https://doi.org//10.2478/izajole-2019-0004
License
CC BY 4.0 Unported