SIGHT: A Large Annotated Dataset on Student Insights Gathered from Higher Education Transcripts

06/15/2023
by Rose E. Wang, et al.

Lectures are a learning experience for both students and teachers. Students learn from teachers about the subject material, while teachers learn from students about how to refine their instruction. However, online student feedback is unstructured and abundant, making it challenging for teachers to learn and improve. We take a step towards tackling this challenge. First, we contribute a dataset for studying this problem: SIGHT is a large dataset of 288 math lecture transcripts and 15,784 comments collected from the Massachusetts Institute of Technology OpenCourseWare (MIT OCW) YouTube channel. Second, we develop a rubric for categorizing feedback types using qualitative analysis. Qualitative analysis methods are powerful in uncovering domain-specific insights; however, they are costly to apply to large data sources. To overcome this challenge, we propose a set of best practices for using large language models (LLMs) to cheaply classify the comments at scale. We observe a striking correlation between the model's and humans' annotations: categories with consistent human annotations (>0.9 inter-rater reliability, IRR) also display higher human-model agreement (>0.7), while categories with less consistent human annotations (0.7-0.8 IRR) correspondingly demonstrate lower human-model agreement (0.3-0.5). These techniques uncover useful student feedback from thousands of comments, costing around $0.002 per comment. We conclude by discussing exciting future directions on using online student feedback and improving automated annotation techniques for qualitative research.
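To make the agreement and cost figures above concrete, here is a minimal sketch of how the agreement between two annotators (for example, one human and one model) on a single binary category could be scored, and how the per-comment cost scales to the full dataset. The abstract does not say which agreement statistic is used, so Cohen's kappa is an assumption here, and the function names and toy labels are hypothetical rather than taken from the paper.

```python
from collections import Counter


def cohen_kappa(labels_a, labels_b):
    """Cohen's kappa for two equal-length label sequences.

    Assumption: kappa stands in for the agreement score; the abstract
    does not name the statistic it reports.
    """
    if len(labels_a) != len(labels_b) or not labels_a:
        raise ValueError("need two non-empty sequences of equal length")
    n = len(labels_a)
    # Fraction of items on which the two annotators agree.
    observed = sum(a == b for a, b in zip(labels_a, labels_b)) / n
    # Agreement expected by chance, from each annotator's label frequencies.
    count_a = Counter(labels_a)
    count_b = Counter(labels_b)
    expected = sum(count_a[k] * count_b[k] for k in count_a) / (n * n)
    if expected == 1.0:
        # Both annotators used a single identical label throughout.
        return 1.0
    return (observed - expected) / (1 - expected)


def estimated_cost(num_comments, cost_per_comment=0.002):
    """Total annotation cost in dollars at a flat per-comment rate."""
    return num_comments * cost_per_comment


# Hypothetical toy labels for one category (1 = comment belongs to it).
human = [1, 0, 1, 1, 0, 0, 1, 0]
model = [1, 0, 1, 0, 0, 0, 1, 1]

print(f"kappa: {cohen_kappa(human, model):.2f}")
print(f"cost for 15,784 comments: ${estimated_cost(15784):.2f}")
```

At the quoted rate of around $0.002 per comment, annotating all 15,784 comments once would cost roughly $32.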

research
05/22/2015

Learning Program Embeddings to Propagate Feedback on Student Code

Providing feedback, both assessing final work and giving hints to stuck ...
research
05/25/2023

Pro-f-quiz: increasing the PROductivity of Feedback through activating QUIZzes

Feedback beyond the grade is an important part of the learning process. ...
research
09/06/2021

Hocalarim: Mining Turkish Student Reviews

We introduce Hocalarim (MyProfessors), the largest student review datase...
research
06/11/2022

A Decomposition-Based Approach for Evaluating Inter-Annotator Disagreement in Narrative Analysis

In this work, we explore sources of inter-annotator disagreement in narr...
research
09/05/2018

Zero Shot Learning for Code Education: Rubric Sampling with Deep Learning Inference

In modern computer science education, massive open online courses (MOOCs...
research
01/23/2021

Are Top School Students More Critical of Their Professors? Mining Comments on RateMyProfessor.com

Student reviews and comments on RateMyProfessor.com reflect realistic le...
research
05/31/2021

Supporting Cognitive and Emotional Empathic Writing of Students

We present an annotation approach to capturing emotional and cognitive e...
