Skip to ContentSkip to Navigation
Research Center for Language and Cognition (CLCG) Research Computational Linguistics Research

Publications

Below you will find a list of highlighted publications of our members, starting from 2018. For more detailed overviews please take a look at the individual member pages.

2023

Making more of little data: Improving low-resource automatic speech recognition using data augmentation

Bartelds, M., San, N., McDonnell, B., Jurafsky, D., & Wieling, M. ACL 2023 [Paper]

WikiBio: a Semantic Resource for the Intersectional Analysis of Biographical Events

Stranisci, M., [...], and Caselli, T. ACL 2023 [Paper]

RAMP: Retrieval and Attribute-Marking Enhanced Prompting for Attribute-Controlled Translation

Gabriele Sarti et al. ACL 2023 [Paper]

Investigating interoperable event corpora: limitations of reusability of resources and portability of models

Caselli, T., Bos, J. Lang Resources & Evaluation 2023 [Paper]

Objective speech outcomes after surgical treatment for oral cancer: An acoustic analysis of a spontaneous speech corpus containing 32.850 tokens

Tienkamp, T. B., van Son, R.J.J.H., & Halpern, B. M. (2023). Journal of Communication Disorders 2023 [Paper]

Predicting citations in Dutch case law with natural language processing

Schepers, I., Medvedeva, M., Bruijn, M., Vols, M., Wieling, M. (2023). Artificial Intelligence and Law 2023 [Paper]

SPRAAKLAB: a mobile laboratory for collecting speech production data

Wieling, M., Rebernik, T., & Jacobi, J. (2023). 20th International Congress of Phonetic Sciences [Paper]

Communication Drives the Emergence of Language Universals in Neural Agents: Evidence from the Word-order/Case-marking Trade-off

Yuchen Lian, Arianna Bisazza, and Tessa Verhoef. TACL 2023 [Paper]

2022

Subword-Delimited Downsampling for Better Character-Level Translation

Lukas Edman, Antonio Toral and Gertjan van Noord. Findings of EMNLP 2022 [Paper]

DivEMT: Neural Machine Translation Post-Editing Effort Across Typologically Diverse Languages

Gabriele Sarti, Arianna Bisazza, Ana Guerberof and Antonio Toral. EMNLP 2022 [Paper]

Hyper-X: A Unified Hypernetwork for Multi-Task Multilingual Transfer

Ahmet Üstün, Arianna Bisazza, Gosse Bouma, Gertjan van Noord and Sebastian Ruder. EMNLP 2022 [Paper]

When does Parameter-Efficient Transfer Learning Work for Machine Translation?

Ahmet Üstün and Asa Cooper Stickland. EMNLP 2022 [Paper]

Specificity ratings for Italian data

Marianna Marcella Bolognesi and Tommaso Caselli. Behavior Research Methods [Paper]

Dead or Murdered? Predicting Responsibility Perception in Femicide News Reports

Gosse Minnema, Sara Gemelli, Chiara Zanchi, Tommaso Caselli and Malvina Nissim. AACL 2022. Best Paper Award [Paper]

Transparent Semantic Parsing with Universal Dependencies Using Graph Transformations

Wessel Poelman, Rik van Noord and Johan Bos. COLING 2022 [Paper]

Multi-Figurative Language Generation

Huiyuan Lai and Malvina Nissim. COLING 2022 [Paper]

How about Time? Probing a Multilingual Language Model for Temporal Relations

Tommaso Caselli, Irene Dini, Felice Dell’Orletta.
COLING 2022. Outstanding Paper Award [Paper]

Visual content analysis of visitors’ engagement with an instagrammable exhibition

Rhee, Pianzola, Choi, Hyung and Hwang. Museum Management and Curatorship [Paper]

Quantifying Language Variation Acoustically with Few Resources

Martijn Bartelds and Martijn Wieling. NAACL 2022 [Paper]

Neural representations for modeling variation in speech

Martijn Bartelds et. al. Journal of Phonetics [Paper]

UDapter: Typology-based Language Adapters for Multilingual Dependency Parsing and Sequence Labeling

Ahmet Üstün, Arianna Bisazza, Gosse Bouma and Gertjan van Noord. CL Journal 2022 [Paper]

Make the Best of Cross-lingual Transfer: Evidence from POS Tagging with over 100 Languages

Wietse de Vries, Martijn Wieling and Malvina Nissim. ACL 2022 [Paper]

Multilingual pre-training with Language and Task Adaptation for Multilingual Text Style Transfer

Huiyuan Lai, Antonio Toral and Malvina Nissim. ACL 2022 [Paper]

Rethinking the field of automatic prediction of court decisions

Masha Medvedeva, Martijn Wieling and Michel Vols. Artificial Intelligence and Law [Paper]

2021

Cognitive Benefits of Learning Additional Languages in Old Adulthood? Insights from an Intensive Longitudinal Intervention Study

Kliesch et al. (including Martijn Wieling). Applied Linguistics 2021 [Paper]

Accuracy Assessment of Two Electromagnetic Articulographs: Northern Digital Inc. WAVE and Northern Digital Inc. VOX

Teja Rebernik, Jidde Jacobi, Mark Tiede and Martijn Wieling. JSLHR 2021 [Paper]

Multilingual Unsupervised Neural Machine Translation with Denoising Adapters

Ahmet Üstün, Alexandre Berard, Laurent Besacier and Matthias Gallé. EMNLP 2021 [Paper]

The Effect of Efficient Messaging and Input Variability on Neural-Agent Iterated Language Learning

Yuchen Lian, Arianna Bisazza, Tessa Verhoef. EMNLP 2021 [Paper]

Generic resources are what you need: Style transfer tasks without task-specific parallel training data

Huiyuan Lai, Antonio Toral and Malvina Nissim. EMNLP 2021 [Paper]

Fighting the COVID-19 Infodemic: Modeling the Perspective of Journalists, Fact-Checkers, Social Media Platforms, Policy Makers, and the Society

F. Alam et al. (including Tommaso Caselli). EMNLP Findings 2021 [Paper]

Finding Narratives in News Flows: The Temporal Dimension of News Stories

Blanca Calvo Figueras, Tommaso Caselli, Marcel Broersma. Digital Humanities Quarterly (DHQ) 2021 [Paper]

On the Difficulty of Translating Free-Order Case-Marking Languages

Arianna Bisazza, Ahmet Üstün and Stephan Sportel. TACL 2021 [Paper]

Thank you BART! Rewarding Pre-Trained Models Improves Formality Style Transfer

Huiyuan Lai, Antonio Toral and Malvina Nissim. ACL 2021 [Paper]

Input Representations for Parsing Discourse Representation Structures: Comparing English with Chinese

Chunliu Wang, Rik van Noord, Arianna Bisazza and Johan Bos. ACL 2021 [Paper]

Adapting Monolingual Models: Data can be Scarce when Language Similarity is High

Wietse de Vries, Martijn Bartelds, Malvina Nissim and Martijn Wieling. ACL Findings 2021 [Paper]

As Good as New. How to Successfully Recycle English GPT-2 to Make Models for Other Languages

Wietse de Vries, Malvina Nissim. ACL Findings 2021 [Paper]

Prevalence of internalizing disorders, symptoms, and traits across age using advanced nonlinear models

Van Loo et al. (including Martijn Wieling). Psychological Medicine 2021 [Paper]

Identifying communicative functions in discourse with content types. Lang Resources & Evaluation

Tommaso Caselli, R. Sprugnoli and Giovanni Moretti. Lang Resources & Evaluation 2021 [Paper]

Universal Discourse Representation Structure Parsing

Jiangming Liu, Shay B Cohen, Mirella Lapata and Johan Bos. CL 2021 [Paper]

A review of data collection practices using electromagnetic articulography

Teja Rebernik, Jidde Jacobi, Roel Jonkers, Aude Noiray and Martijn Wieling. Laboratory Phonology 2021 [Paper]

Massive Choice, Ample Tasks (MACHAMP): A Toolkit for Multi-task Learning in NLP

Rob van der Goot, Ahmet Üstün, Alan Ramponi, Ibrahim Sharaf and Barbara Plank. EACL 2021 Demo Track - Outstanding Paper Award [Paper]

From Masked Language Modeling to Translation: Non-English Auxiliary Tasks Improve Zero-shot Spoken Language Understanding

Rob van der Goot et al. (including Ahmet Üstün). NAACL 2021 [Paper]

2020

UDapter: Language Adaptation for Truly Universal Dependency Parsing

Ahmet Üstün, Arianna Bisazza, Gosse Bouma and Gertjan van Noord. EMNLP 2020 [Paper]

Character-level representations still improve DRS parsing in the age of BERT

Rik van Noord, Antonio Toral and Johan Bos. EMNLP 2020 [Paper]

What's so special about BERT's layers? A closer look at the NLP pipeline in monolingual and multilingual models

Wietse de Vries, Andreas van Cranenburgh and Malvina Nissim. EMNLP Findings 2020 [Paper]

A New Acoustic-based Pronunciation Distance Measure

Martijn Bartelds, Caitlin Richter, Mark Liberman and Martijn Wieling. Frontiers in AI [Paper]

Embarrassingly Simple Unsupervised Aspect Extraction

Stéphan Tulkens and Andreas van Cranenburgh. ACL 2020 [Paper]

MAGPIE: A Large Corpus of Potentially Idiomatic Expressions

Hessel Haagsma, Johan Bos and Malvina Nissim. LREC 2020 [Paper]

A Set of Recommendations for Assessing Human–Machine Parity in Language Translation

Samuel Läubli, Sheila Castilho, Graham Neubig, Rico Sennrich, Qinlan Shen and Antonio Toral. JAIR 2020 [Paper]

On abstraction: decoupling conceptual concreteness and categorical specificity

Marianna Bolognesi, Christian Burgers and Tommaso Caselli. Cognitive Processing 2020 [Paper]

Fair Is Better than Sensational: Man Is to Doctor as Woman Is to Doctor

Malvina Nissim, Rik van Noord and Rob van der Goot. CL Squib 2020 [Paper]

2019

Using Machine Learning to Predict Decisions of the European Court of Human Rights

Masha Medvedeva, Michel Vols and Martijn Wieling. AI and Law Journal 2019 [Paper]

Linguistic Information in Neural Semantic Parsing with Multiple Encoders

Rik van Noord, Antonio Toral and Johan Bos. IWCS 2019 [Paper]

Post-editese: an Exacerbated Translationese

Antonio Toral, MT-Summit 2019. Best Paper Award [Paper]

The Effect of Translationese in Machine Translation Test Sets

Mike Zhang and Antonio Toral. WMT 2019 [Paper]

You Write like You Eat: Stylistic Variation as a Predictor of Social Stratification

Angelo Basile, Albert Gatt and Malvina Nissim. ACL 2019 [Paper]

Cross-Lingual Word Embeddings for Morphologically Rich Languages

Ahmet Üstün, Gosse Bouma and Gertjan van Noord. RANLP 2019 [Paper]

Vector space explorations of literary language

Andreas van Cranenburgh, Karina van Dalen-Oskam and Joris van Zundert. LRE 2019 [Paper]

2018

Discourse Semantics with Information Structure

Noortje Venhuizen, Johan Bos, Petra Hendriks and Harm Brouwer. Journal of Semantics 2018 [Paper]

Analyzing dynamic phonetic data using generalized additive mixed modeling: a tutorial focusing on articulatory differences between L1 and L2 speakers of English

Martijn Wieling. Journal of Phonetics 2018 [Paper]

TopiCS in Cognitive Science:  Special Issue:Miscommunication

Patrick G. T. Healey, Jan P. de Ruiter and Gregory J. Mills [Papers]

Post-editing Effort of a Novel with Statistical and Neural Machine Translation

Antonio Toral, Martijn Wieling and Andy Way. Frontiers in Digital Humanities 2018 [Paper]

Bleaching Text: Abstract Features for Cross-lingual Gender Prediction

Rob van der Goot, Nikola Ljubešić, Ian Matroos, Malvina Nissim and Barbara Plank. ACL 2018 [Paper]

Reproducibility in Computational Linguistics: Are We Willing to Share?

Martijn Wieling, Josine Rawee and Gertjan van Noord. Computational Linguistics 2018 [Paper]

Modeling Input Uncertainty in Neural Network Dependency Parsing

Rob van der Goot and Gertjan van Noord. EMNLP 2018 [Paper]

What can we learn from Semantic Tagging?

Mostafa Abdou, Artur Kulmizev, Vinit Ravishankar, Lasha Abzianidze and Johan Bos. EMNLP 2018 [Paper]

Exploring Neural Methods for Parsing Discourse Representation Structures

Rik van Noord, Lasha Abzianidze, Antonio Toral and Johan Bos. TACL 2018 [Paper]

Last modified:05 July 2023 3.38 p.m.