J 2026

Open Datasets in Learning Analytics: Trends, Challenges, and Best PRACTICE

ŠVÁBENSKÝ, Valdemar; Brendan FLANAGAN; Erwin Daniel LÓPEZ ZAPATA and Atsushi SHIMADA

Basic information

Original name

Open Datasets in Learning Analytics: Trends, Challenges, and Best PRACTICE

Authors

ŠVÁBENSKÝ, Valdemar; Brendan FLANAGAN; Erwin Daniel LÓPEZ ZAPATA and Atsushi SHIMADA

Edition

ACM Transactions on Knowledge Discovery from Data, ACM, 2026, 1556-4681

Other information

Language

English

Type of outcome

Article in a journal

Country of publisher

United States of America

Confidentiality degree

is not subject to a state or trade secret

References:

Marked to be transferred to RIV

No

Organization

Fakulta informatiky – Repository – Repository

Keywords in English

open data; public data; data sharing; data management; open science; learning analytics; educational data mining; artificial intelligence in education; AI in education; systematic literature review; systematic mapping study; survey

Links

GN25-15839I, research and development project.
Changed: 27/2/2026 00:51, RNDr. Daniel Jakubík

Abstract

In the original language

br /\>Results and contributions: Our study presents the most comprehensive collection and analysis of open educational datasets to date, along with the most detailed categorization. Of the 172 datasets identified, 143 were not captured in any prior survey of open data in learning analytics. We provide insights into the datasets’ context, analytical methods, use, and other properties. Based on this survey, we summarize the current gaps in the field. Furthermore, we list practical recommendations, advice, and 8-item guidelines under the acronym PRACTICE with a checklist to help researchers publish their data. Lastly, we share our original dataset: an annotated inventory detailing the discovered datasets and the corresponding publications. We hope these findings will support further adoption of open data practices in learning analytics communities and beyond.