Přehled o publikaci
2024
E2Vec: Feature Embedding with Temporal Information for Analyzing Student Actions in E-Book Systems
MIYAZAKI, Yuma; Valdemar ŠVÁBENSKÝ; Yuta TANIGUCHI; Fumiya OKUBO; Tsubasa MINEMATSU et al.Basic information
Original name
E2Vec: Feature Embedding with Temporal Information for Analyzing Student Actions in E-Book Systems
Authors
MIYAZAKI, Yuma; Valdemar ŠVÁBENSKÝ; Yuta TANIGUCHI; Fumiya OKUBO; Tsubasa MINEMATSU and Atsushi SHIMADA
Edition
17th International Conference on Educational Data Mining (EDM 2024), p. 434-442, 9 pp. 2024
Publisher
International Educational Data Mining Society
Other information
Language
English
Type of outcome
Proceedings paper
Country of publisher
United States of America
Confidentiality degree
is not subject to a state or trade secret
Publication form
electronic version available online
References:
Marked to be transferred to RIV
No
Organization
Repository – Repository
Keywords in English
feature representation; fastText; digital textbooks; e-book EventStream; at-risk prediction; educational data mining
Changed: 16/7/2024 05:11, RNDr. Daniel Jakubík
Abstract
In the original language
Digital textbook (e-book) systems record student interactions with textbooks as a sequence of events called EventStream data. In the past, researchers extracted meaningful features from EventStream, and utilized them as inputs for downstream tasks such as grade prediction and modeling of student behavior. Previous research evaluated models that mainly used statistical-based features derived from EventStream logs, such as the number of operation types or access frequencies. While these features are useful for providing certain insights, they lack temporal information that captures fine-grained differences in learning behaviors among different students. This study proposes E2Vec, a novel feature representation method based on word embeddings. The proposed method regards operation logs and their time intervals for each student as a string sequence of characters and generates a student vector of learning activity features that incorporates time information. We applied fastText to generate an embedding vector for each of 305 students in a dataset from two years of computer science courses. Then, we investigated the effectiveness of E2Vec in an at-risk detection task, demonstrating potential for generalizability and performance.