D 2018

Annotated Corpus of Czech Case Law for Reference Recognition Tasks

HARAŠTA, Jakub; Jaromír ŠAVELKA; František KASL; Adéla KOTKOVÁ; Pavel LOUTOCKÝ et. al.

Basic information

Original name

Annotated Corpus of Czech Case Law for Reference Recognition Tasks

Authors

HARAŠTA, Jakub (203 Czech Republic, belonging to the institution); Jaromír ŠAVELKA (203 Czech Republic); František KASL (203 Czech Republic, belonging to the institution); Adéla KOTKOVÁ (203 Czech Republic, belonging to the institution); Pavel LOUTOCKÝ (203 Czech Republic, belonging to the institution); Jakub MÍŠEK (203 Czech Republic, belonging to the institution); Daniela PROCHÁZKOVÁ (203 Czech Republic, belonging to the institution); Helena PULLMANNOVÁ (203 Czech Republic, belonging to the institution); Petr SEMENIŠÍN (203 Czech Republic, belonging to the institution); Tamara ŠEJNOVÁ (203 Czech Republic, belonging to the institution); Nikola ŠIMKOVÁ (703 Slovakia, belonging to the institution); Michal VOSINEK (203 Czech Republic, belonging to the institution); Lucie ZAVADILOVÁ (203 Czech Republic, belonging to the institution) and Jan ZIBNER (203 Czech Republic, belonging to the institution)

Edition

Cham, Text, Speech, and Dialogue: 21st International Conference, p. 239-250, 12 pp. 2018

Publisher

Springer Nature Switzerland AG

Other information

Language

English

Type of outcome

Proceedings paper

Country of publisher

Switzerland

Confidentiality degree

is not subject to a state or trade secret

Publication form

electronic version available online

References:

URL URL

RIV identification code

RIV/00216224:14220/18:00101155

Organization

Právnická fakulta – Repository – Repository

ISBN

978-3-030-00793-5

ISSN

UT WoS

000611532300026

EID Scopus

2-s2.0-85053907376

Keywords (in Czech)

rozpoznávání referencí; dataset; právní texty; manuální anotace

Keywords in English

Reference recognition; dataset; legal texts; manual annotation

Links

GA17-20645S, research and development project.
Changed: 23/3/2021 01:44, RNDr. Daniel Jakubík

Abstract

V originále

We describe an annotated corpus of 350 decisions of Czech top-tier courts which was gathered for a project assessing the relevance of court decisions in Czech law. We describe two layers of processing of the corpus; every decision was annotated by two trained annotators and then manually adjudicated by one trained curator to solve possible disagreements between annotators. This corpus was developed as training and testing material for reference recognition tasks which will be further used for research on assessment of legal importance. However, the overall shortage of available research corpora of annotated legal texts, particularly in Czech language, leads us to believe that other research teams may find it useful.
Displayed: 6/7/2025 17:22