Building a Corpus of Indefinite Uses Annotated with Fine-grained Semantic Functions

Aloni, M., van Cranenburgh, A., Fernández, R. & Sznajder, M., May-2012, The eighth international conference on Language Resources and Evaluation (LREC). Istanbul, Turkey: European Language Resources Association (ELRA), p. 1511-1515 5 p.

Research output: Chapter in Book/Report/Conference proceedingConference contributionAcademicpeer-review

Natural languages possess a wealth of indefinite forms that typically differ in distribution and interpretation. Although formal semanticists have strived to develop precise meaning representations for different indefinite functions, to date there has hardly been any corpus work on the topic. In this paper, we present the results of a small corpus study where English indefinite forms any and some were labelled with fine-grained semantic functions well-motivated by typological studies. We developed annotation guidelines that could be used by non-expert annotators and calculated inter-annotator agreement amongst several coders. The results show that the annotation task is hard, with agreement scores ranging from 52% to 62% depending on the number of functions considered, but also that each of the independent annotations is in accordance with theoretical predictions regarding the possible distributions
of indefinite functions. The resulting annotated corpus is available upon request and can be accessed through a searchable online database
Original languageEnglish
Title of host publicationThe eighth international conference on Language Resources and Evaluation (LREC)
Place of PublicationIstanbul, Turkey
PublisherEuropean Language Resources Association (ELRA)
Number of pages5
Publication statusPublished - May-2012
Externally publishedYes

Download statistics

No data available

ID: 109827155