Dataset

Universal Dependencies 2.5

Zeman, D. (Creator), Nivre, J. (Creator), Bouma, G. (Creator), Noord, van, G. (Creator), LINDAT/CLARIN, 1-Nov-2019

Dataset

Description

Universal Dependencies is a project that seeks to develop cross-linguistically consistent treebank annotation for many languages, with the goal of facilitating multilingual parser development, cross-lingual learning, and parsing research from a language typology perspective. The annotation scheme is based on (universal) Stanford dependencies (de Marneffe et al., 2006, 2008, 2014), Google universal part-of-speech tags (Petrov et al., 2012), and the Interset interlingua for morphosyntactic tagsets (Zeman, 2008).
Full list of authors can be found in the data repository.
Date made available1-Nov-2019
PublisherLINDAT/CLARIN
Geographical coverageGlobal
Access to the dataset Open
Contact researchdata@rug.nl

    Keywords on Datasets

  • universal dependencies, syntax, treebank, multilingual
Related Publications
  1. Expletives in Universal Dependency Treebanks

    Bouma, G., Nivre, J., Ovrelid, L., Haug, D., Hajic, J. & Solberg, P. E., 1-Nov-2018.

    Research output: Contribution to conferencePaperAcademic

View all (1) »

ID: 109565471