D 2018

MIaS: Math-Aware Retrieval in Digital Mathematical Libraries

SOJKA, Petr; Michal RŮŽIČKA and Vít NOVOTNÝ

Basic information

Original name

MIaS: Math-Aware Retrieval in Digital Mathematical Libraries

Authors

SOJKA, Petr; Michal RŮŽIČKA and Vít NOVOTNÝ

Edition

Torino, Italy, Proceedings of the 27th ACM International Conference on Information and Knowledge Management (CIKM '18), p. 1923-1926, 4 pp. 2018

Publisher

Association for Computing Machinery

Other information

Language

English

Type of outcome

Proceedings paper

Country of publisher

Italy

Confidentiality degree

is not subject to a state or trade secret

Publication form

electronic version available online

References:

URL, URL

Marked to be transferred to RIV

Yes

RIV identification code

RIV/00216224:14330/18:00100679

Organization

Fakulta informatiky – Repository – Repository

ISBN

978-1-4503-6014-2

UT WoS

000455712300261

EID Scopus

2-s2.0-85058006184

Keywords (in Czech)

vyhledávání matematiky; DML; EuDML; digitální matematické knihovny

Keywords in English

Math Information Retrieval; DML; EuDML; Digital Mathematical Libraries

Links

MUNI/A/1213/2017, interní kód Repo. 1ET200190513, research and development project. 250503, interní kód Repo.
Changed: 6/9/2020 04:24, RNDr. Daniel Jakubík

Abstract

In the original language

Digital mathematical libraries (DMLs) such as arXiv, Numdam, and EuDML contain mainly documents from STEM fields, where mathematical formulae are often more important than text for understanding. Conventional information retrieval (IR) systems are unable to represent formulae and they are therefore ill-suited for math information retrieval (MIR). To fill the gap, we have developed, and open-sourced the MIaS MIR system. MIaS is based on the full-text search engine Apache Lucene. On top of text retrieval, MIaS also incorporates a set of tools for preprocessing mathematical formulae. We describe the design of the system and present speed, and quality evaluation results. We show that MIaS is both efficient, and effective, as evidenced by our victory in the NTCIR-11 Math-2 task.
Displayed: 6/5/2026 18:22