Skip to main content

Computer Science Department, MS Thesis Presentation, Pichayut Liamthong " Text Representations of Math Tutorial Videos for Clustering, Retrieval, and Learning Gain Prediction"

Thursday, March 25, 2021
1:00 pm to 2:00 pm


Pichayut (Ter) Liamthong

MS Student

WPI – Computer Science                                                                                          


 Thursday, March 25, 2021

 Time: 1:00 – 2:30 pm

Zoom Link:


Advisor: Prof. Jacob Whitehill

Reader: Prof. Joseph Beck


With the goal of making vast collections of open educational resources (YouTube, Khan Academy, etc.) more useful to learners, we explored how automatically extractable text representations of math tutorial videos can help to categorize the videos, search through them for specific content, and predict the individual learning gains of students who watch them. In particular,

(1) we devised novel text representations, based on the output of an automatic speech recognition system, that consider the frequency of different tokens (symbols, equations, etc.) as well as their proximity from each other in the transcript. Unsupervised learning experiments, conducted on 208 videos that explain 18 math problems about logarithms show that the clustering accuracy of our proposed methods reaches 85%, surpassing that of standard TF-IDF features (78% using log normalization).

(2) In a video search setting, the proposed text features can significantly reduce the number of videos and amount of time (up to 88% decrease in number of videos to watch in our study) that users need to spend looking for desired content in large video collections. Finally,

(3) in an experiment on Mechanical Turk with n=541 participants who watched a randomly assigned tutorial video, the proposed text features and their multiplicative interactions with students' prior knowledge provide a statistically significant benefit to predicting individual learning gains.