D 2024

E2Vec: Feature Embedding with Temporal Information for Analyzing Student Actions in E-Book Systems

MIYAZAKI, Yuma; Valdemar ŠVÁBENSKÝ; Yuta TANIGUCHI; Fumiya OKUBO; Tsubasa MINEMATSU et al.

Basic information

Original name

E2Vec: Feature Embedding with Temporal Information for Analyzing Student Actions in E-Book Systems

Authors

MIYAZAKI, Yuma; Valdemar ŠVÁBENSKÝ; Yuta TANIGUCHI; Fumiya OKUBO; Tsubasa MINEMATSU and Atsushi SHIMADA

Edition

17th International Conference on Educational Data Mining (EDM 2024), p. 434-442, 9 pp. 2024

Publisher

International Educational Data Mining Society

Other information

Language

English

Type of outcome

Proceedings paper

Country of publisher

United States of America

Confidentiality degree

is not subject to a state or trade secret

Publication form

electronic version available online

References:

Marked to be transferred to RIV

No

Organization

Repository – Repository

Keywords in English

feature representation; fastText; digital textbooks; e-book EventStream; at-risk prediction; educational data mining
Changed: 16/7/2024 05:11, RNDr. Daniel Jakubík

Abstract

In the original language

Digital textbook (e-book) systems record student interactions with textbooks as a sequence of events called EventStream data. In the past, researchers extracted meaningful features from EventStream, and utilized them as inputs for downstream tasks such as grade prediction and modeling of student behavior. Previous research evaluated models that mainly used statistical-based features derived from EventStream logs, such as the number of operation types or access frequencies. While these features are useful for providing certain insights, they lack temporal information that captures fine-grained differences in learning behaviors among different students. This study proposes E2Vec, a novel feature representation method based on word embeddings. The proposed method regards operation logs and their time intervals for each student as a string sequence of characters and generates a student vector of learning activity features that incorporates time information. We applied fastText to generate an embedding vector for each of 305 students in a dataset from two years of computer science courses. Then, we investigated the effectiveness of E2Vec in an at-risk detection task, demonstrating potential for generalizability and performance.

Files attached