PIE Corpus

Haagsma, H. (Creator), University of Groningen, 17-Oct-2017


  • Hessel Haagsma (Creator)
  • Johan Bos (Contributor)
  • Barbara Plank (Contributor)


An evaluation corpus for the automatic detection of potentially idiomatic expressions (PIEs), based on the British National Corpus (BNC). This repository consists of six JSON files holding the annotations.
Date made available: 17-Oct-2017
Publisher: University of Groningen
Date of data production: 1-Sep-2017 - 17-Oct-2017
Access to the dataset: Open
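
Since the annotations are distributed as JSON files, a first step for most users will be to load them and check what each file holds. The following is a minimal sketch only: the function names `load_annotation_files` and `summarize` are hypothetical, and nothing is assumed about the internal structure of the files beyond their being valid JSON.

```python
import json
from pathlib import Path


def load_annotation_files(directory):
    """Load every *.json file in `directory`, keyed by file name."""
    annotations = {}
    for path in sorted(Path(directory).glob("*.json")):
        with path.open(encoding="utf-8") as handle:
            annotations[path.name] = json.load(handle)
    return annotations


def summarize(annotations):
    """Return (file name, top-level type name, number of entries) per file."""
    rows = []
    for name, data in annotations.items():
        # len() works for lists and dicts; any other top-level value counts as 1.
        size = len(data) if isinstance(data, (list, dict)) else 1
        rows.append((name, type(data).__name__, size))
    return rows
```

Calling `summarize(load_annotation_files("pie_corpus"))`, where `pie_corpus` is a placeholder for wherever the files were downloaded to, gives a quick overview of each file before any schema-specific processing is written.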

Keywords on Datasets

  • idiomatic expressions, British National Corpus
Related Publications
  1. The Parallel Meaning Bank: Towards a Multilingual Corpus of Translations Annotated with Compositional Meaning Representations

    Abzianidze, L., Bjerva, J., Evang, K., Haagsma, H., van Noord, R., Ludmann, P., Nguyen, D-D. & Bos, J., 2017, Proceedings of the 15th Conference of the European Chapter of the Association for Computational Linguistics: Volume 2, Short Papers. p. 242-247 (6 p.)

    Research output: Chapter in Book/Report/Conference proceeding; Conference contribution; Academic; peer-review


ID: 74541592