Skip to ContentSkip to Navigation
Over ons Praktische zaken Waar vindt u ons R.I.K. (Rik) van Noord, PhD

Publicaties

Quality Beyond A Glance: Revealing Large Quality Differences Between Web-Crawled Parallel Corpora

Do Language Models Care About Text Quality? Evaluating Web-Crawled Corpora Across 11 Languages

Exploring Self-Supervised Speech Representations for Cross-lingual Acoustic-to-Articulatory Inversion

Gaining More Insight into Neural Semantic Parsing with Challenging Benchmarks

Language Models on a Diet: Cost-Efficient Development of Encoders for Closely-Related Languages via Additional Pretraining

Towards Tailored Recovery of Lexical Diversity in Literary Machine Translation

Automatic Discrimination of Human and Neural Machine Translation in Multilingual Scenarios

Automatic Discrimination of Human and Neural Machine Translation: A Study with Multiple Pre-Trained Models and Longer Context

Building Domain-specific Corpora from the Web: the Case of European Digital Service Infrastructures

MaCoCu: Massive collection and curation of monolingual and bilingual data: focus on under-resourced languages