Wessel Poelman

I work as a PhD student at KU Leuven, where I am part of LAGoM under the supervision of Miryam de Lhoneux. My research asks a simple question: how do we know that multilingual NLP systems are improving for the reasons we think they are?

Before coming to Belgium, I worked at the Technical University of Munich as a research associate, completed a master’s in Information Science at the University of Groningen, and worked as a machine learning engineer at Web-IQ. More background is available on the About page or in my CV.

KU Leuven Profile · Google Scholar · Semantic Scholar · GitHub

Contact: wessel.poelman [at] kuleuven.be or contact [at] wesselpoelman.nl

Research

My work focuses on multilingual natural language processing: language sampling, tokenization, modeling choices, and evaluation. I am particularly interested in how linguistic differences between languages influence both model behavior and the claims we make about multilingual systems.

Current themes:

Evaluation and experimentation methodology for multilingual NLP (1, 2, 3)
Interaction of language characteristics and language modeling (4, 5, 6)
Tooling for multilingual language technology (7, 8)

If this sounds interesting, feel free to reach out!

Selected Publications

QQ: A Language Metadata Toolkit for Multilingual NLP

Wessel Poelman, Yiyi Chen & Miryam de Lhoneux.

Preprint

GitHub • Try the explorer!

Form and Meaning in Intrinsic Multilingual Evaluations

Wessel Poelman & Miryam de Lhoneux.

EACL 2026 • Oral Presentation

Confounding Factors in Relating Model Performance to Morphology

Wessel Poelman*, Thomas Bauwens* & Miryam de Lhoneux.

EMNLP 2025 • Oral Presentation

The Roles of English in Evaluating Multilingual Language Models

Wessel Poelman & Miryam de Lhoneux.

NoDaLiDa 2025

What is ''Typological Diversity'' in NLP?

Esther Ploeger*, Wessel Poelman*, Miryam de Lhoneux & Johannes Bjerva.

EMNLP 2024 • Oral Presentation

* shared first authorship

For a complete list, see my publications page.

Recent News

Jun 2026. Joining Barbara Plank at LMU for a three-month research stay.
Apr 2026. Paper on multilingual Wikipedia data quality accepted at ACL.
Mar 2026. Two papers at EACL.
Dec 2025. Research visit in Copenhagen, with talks at Aalborg University and ITU.
Nov 2025. Oral presentation at EMNLP.