SOJKA, Petr and Ondřej SOJKA. New Czechoslovak Hyphenation Patterns, Word Lists, and Workflow. TUGboat. Portland, OR 97208-2311, U.S.A: TUG, 2021, vol. 42, No 2, p. 152-158. ISSN 0896-3207. Available from: https://dx.doi.org/10.47397/tb/42-2/tb131sojka-czech.
Other formats:   BibTeX LaTeX RIS
Basic information
Original name New Czechoslovak Hyphenation Patterns, Word Lists, and Workflow
Authors SOJKA, Petr (203 Czech Republic, guarantor, belonging to the institution) and Ondřej SOJKA (203 Czech Republic, belonging to the institution).
Edition TUGboat, Portland, OR 97208-2311, U.S.A, TUG, 2021, 0896-3207.
Other information
Original language English
Type of outcome Article in a journal
Country of publisher United States of America
Confidentiality degree is not subject to a state or trade secret
WWW URL URL URL URL URL
RIV identification code RIV/00216224:14330/21:00122189
Organization Fakulta informatiky – Repository – Repository
Doi http://dx.doi.org/10.47397/tb/42-2/tb131sojka-czech
Keywords (in Czech) dělení slov; generování vzorů; databáze slov; vícejazyčná sazba; slabičné algoritmy; patgen; soutěživé vzory
Keywords in English hyphenation; pattern generation; word list database; multilingual typesetting; syllabification algorithms; patgen; competing patterns
Links MUNI/A/1573/2020, interní kód Repo.
Changed by Changed by: RNDr. Daniel Jakubík, učo 139797. Changed: 6/9/2023 04:54.
Abstract
/tt\> for the generation of the new Czechoslovak hyphenation patterns that cover both Czech and Slovak languages. We show that developing universal, up-to-date, high-coverage and high-generalization hyphenation patterns is feasible, generated from semi-automatically prepared word lists from actual language usage. We evaluate the new approach and argue that the new Czechoslovak hyphenation patterns bring significant coverage and generalization improvements, and space savings. We share all the data, word lists, and workflow for reproducibility and usage.
Type Name Uploaded/Created by Uploaded/Created Rights
New_Czechoslovak_Hyphenation_Patterns__Word_Lists__and_Workflow__TUG_2021__1_.pdf Licence Creative Commons  File version 31/8/2021

Properties

Name
New_Czechoslovak_Hyphenation_Patterns__Word_Lists__and_Workflow__TUG_2021__1_.pdf
Address within IS
https://repozitar.cz/auth/repo/45487/1133669/
Address for the users outside IS
https://repozitar.cz/repo/45487/1133669/
Address within Manager
https://repozitar.cz/auth/repo/45487/1133669/?info
Address within Manager for the users outside IS
https://repozitar.cz/repo/45487/1133669/?info
Uploaded/Created
Tue 31/8/2021 02:19

Rights

Right to read
  • anyone on the Internet
Right to upload
 
Right to administer:
  • a concrete person Mgr. Lucie Vařechová, uco 106253
  • a concrete person RNDr. Daniel Jakubík, uco 139797
  • a concrete person Mgr. Jolana Surýnková, uco 220973
Attributes
 
Print
Add to clipboard Displayed: 19/5/2024 08:18