Effective Feature Learning with Unsupervised Learning for Improving the Predictive Models in Massive Open Online Courses

12/12/2018
by   Mucong Ding, et al.
0

The effectiveness of learning in massive open online courses (MOOCs) can be significantly enhanced by introducing personalized intervention schemes which rely on building predictive models of student learning behaviors such as some engagement or performance indicators. A major challenge that has to be addressed when building such models is to design handcrafted features that are effective for the prediction task at hand. In this paper, we make the first attempt to solve the feature learning problem by taking the unsupervised learning approach to learn a compact representation of the raw features with a large degree of redundancy. Specifically, in order to capture the underlying learning patterns in the content domain and the temporal nature of the clickstream data, we train a modified auto-encoder (AE) combined with the long short-term memory (LSTM) network to obtain a fixed-length embedding for each input sequence. When compared with the original features, the new features that correspond to the embedding obtained by the modified LSTM-AE are not only more parsimonious but also more discriminative for our prediction task. Using simple supervised learning models, the learned features can improve the prediction accuracy by up to 17 overfitting to the dominant low-performing group of students, specifically in the task of predicting students' performance. Our approach is generic in the sense that it is not restricted to a specific supervised learning model nor a specific prediction task for MOOC learning analytics.

READ FULL TEXT
research
09/11/2018

Time Series Analysis of Clickstream Logs from Online Courses

Due to the rapidly rising popularity of Massive Open Online Courses (MOO...
research
01/23/2018

Hybrid Gradient Boosting Trees and NeuralNetworks for Forecasting Operating Room Data

Time series data constitutes a distinct and growing problem in machine l...
research
01/23/2018

Hybrid Gradient Boosting Trees and Neural Networks for Forecasting Operating Room Data

Time series data constitutes a distinct and growing problem in machine l...
research
02/16/2015

Unsupervised Learning of Video Representations using LSTMs

We use multilayer Long Short Term Memory (LSTM) networks to learn repres...
research
03/30/2019

EE-AE: An Exclusivity Enhanced Unsupervised Feature Learning Approach

Unsupervised learning is becoming more and more important recently. As o...
research
12/12/2018

Transfer Learning using Representation Learning in Massive Open Online Courses

In a Massive Open Online Course (MOOC), predictive models of student beh...
research
03/30/2020

Machine Learning String Standard Models

We study machine learning of phenomenologically relevant properties of s...

Please sign up or login with your details

Forgot password? Click here to reset