Corpus-based language comparison: From morphology to dependencies and beyond

Authors

DOI:

https://doi.org/10.21165/el.v54i1.4032

Abstract

We provide an overview of the Universal Dependencies multilingual corpus collection, its current status and numerous extensions, such as the UNER annotation of named entities or the CorefUD annotation of coreference and anaphora. We discuss the utility of the data in several areas of Digital Humanities, with a particular focus on comparative linguistics and typology.
Keywords: annotated corpus; treebank; morphology; syntax; typology.

Downloads

Download data is not yet available.

Published

2025-12-17

How to Cite

Zeman, D. (2025). Corpus-based language comparison: From morphology to dependencies and beyond. Estudos Linguísticos (São Paulo. 1978), 54(1), 259–275. https://doi.org/10.21165/el.v54i1.4032

Issue

Section

Artigos