Corpus-based language comparison: From morphology to dependencies and beyond

Autori

DOI:

https://doi.org/10.21165/el.v54i1.4032

Abstract

We provide an overview of the Universal Dependencies multilingual corpus collection, its current status and numerous extensions, such as the UNER annotation of named entities or the CorefUD annotation of coreference and anaphora. We discuss the utility of the data in several areas of Digital Humanities, with a particular focus on comparative linguistics and typology.
Keywords: annotated corpus; treebank; morphology; syntax; typology.

Downloads

I dati di download non sono ancora disponibili.

Pubblicato

2025-12-17

Come citare

Zeman, D. (2025). Corpus-based language comparison: From morphology to dependencies and beyond. Estudos Linguísticos (São Paulo. 1978), 54(1), 259–275. https://doi.org/10.21165/el.v54i1.4032

Fascicolo

Sezione

Artigos