Word Embeddings Quantify 100 Years of Gender and Ethnic Stereotypes

11/22/2017
by   Nikhil Garg, et al.
0

Word embeddings use vectors to represent words such that the geometry between vectors captures semantic relationship between the words. In this paper, we develop a framework to demonstrate how the temporal dynamics of the embedding can be leveraged to quantify changes in stereotypes and attitudes toward women and ethnic minorities in the 20th and 21st centuries in the United States. We integrate word embeddings trained on 100 years of text data with the U.S. Census to show that changes in the embedding track closely with demographic and occupation shifts over time. The embedding captures global social shifts -- e.g., the women's movement in the 1960s and Asian immigration into the U.S -- and also illuminates how specific adjectives and occupations became more closely associated with certain populations over time. Our framework for temporal analysis of word embedding opens up a powerful new intersection between machine learning and quantitative social science.

READ FULL TEXT

page 6

page 7

page 26

page 28

page 29

page 32

page 33

research
07/21/2016

Man is to Computer Programmer as Woman is to Homemaker? Debiasing Word Embeddings

The blind application of machine learning runs the risk of amplifying bi...
research
05/29/2018

Unsupervised detection of diachronic word sense evolution

Most words have several senses and connotations which evolve in time due...
research
05/16/2019

Tracing cultural diachronic semantic shifts in Russian using word embeddings: test sets and baselines

The paper introduces manually annotated test sets for the task of tracin...
research
02/15/2021

How COVID-19 Is Changing Our Language : Detecting Semantic Shift in Twitter Word Embeddings

Words are malleable objects, influenced by events that are reflected in ...
research
03/12/2021

Abolitionist Networks: Modeling Language Change in Nineteenth-Century Activist Newspapers

The abolitionist movement of the nineteenth-century United States remain...
research
08/27/2021

Opinions are Made to be Changed: Temporally Adaptive Stance Classification

Given the rapidly evolving nature of social media and people's views, wo...
research
06/04/2019

Tracing Antisemitic Language Through Diachronic Embedding Projections: France 1789-1914

We investigate some aspects of the history of antisemitism in France, on...

Please sign up or login with your details

Forgot password? Click here to reset