Democratizing Machine Learning for Interdisciplinary Scholars: Report on Organizing the NLP+CSS Online Tutorial Series

11/29/2022
by   Ian Stewart, et al.
0

Many scientific fields – including biology, health, education, and the social sciences – use machine learning (ML) to help them analyze data at an unprecedented scale. However, ML researchers who develop advanced methods rarely provide detailed tutorials showing how to apply these methods. Existing tutorials are often costly to participants, presume extensive programming knowledge, and are not tailored to specific application fields. In an attempt to democratize ML methods, we organized a year-long, free, online tutorial series targeted at teaching advanced natural language processing (NLP) methods to computational social science (CSS) scholars. Two organizers worked with fifteen subject matter experts to develop one-hour presentations with hands-on Python code for a range of ML methods and use cases, from data pre-processing to analyzing temporal variation of language change. Although live participation was more limited than expected, a comparison of pre- and post-tutorial surveys showed an increase in participants' perceived knowledge of almost one point on a 7-point Likert scale. Furthermore, participants asked thoughtful questions during tutorials and engaged readily with tutorial content afterwards, as demonstrated by 10K total views of posted tutorial recordings. In this report, we summarize our organizational efforts and distill five principles for democratizing ML+X tutorials. We hope future organizers improve upon these principles and continue to lower barriers to developing ML skills for researchers of all fields.

READ FULL TEXT
research
05/28/2021

Natural Language Processing 4 All (NLP4All): A New Online Platform for Teaching and Learning NLP Concepts

Natural Language Processing offers new insights into language data acros...
research
06/20/2021

Machine learning in the social and health sciences

The uptake of machine learning (ML) approaches in the social and health ...
research
05/04/2023

ExeKGLib: Knowledge Graphs-Empowered Machine Learning Analytics

Many machine learning (ML) libraries are accessible online for ML practi...
research
11/04/2022

NLP Inspired Training Mechanics For Modeling Transient Dynamics

In recent years, Machine learning (ML) techniques developed for Natural ...
research
11/15/2022

Searching for Carriers of the Diffuse Interstellar Bands Across Disciplines, using Natural Language Processing

The explosion of scientific publications overloads researchers with info...
research
05/05/2022

Interactive Model Cards: A Human-Centered Approach to Model Documentation

Deep learning models for natural language processing (NLP) are increasin...
research
11/26/2020

Automatic coding of students' writing via Contrastive Representation Learning in the Wasserstein space

Qualitative analysis of verbal data is of central importance in the lear...

Please sign up or login with your details

Forgot password? Click here to reset