Research Group Hinrich Schütze

Hinrich Schütze

Prof. Dr.

Core PI

Computational Linguistics

Hinrich Schütze holds the Chair of Computational Linguistics at LMU Munich.

His primary focus is linguistically informed neural NLP. His team brings a deep understanding of language to its research and holds that learning is key to successful NLP, just as human language capabilities are based on learning. The group's research areas are representation learning, multilinguality, machine learning for low-resource scenarios, cognitively motivated deep learning, linguistically informed deep learning (especially for morphology), digital humanities, and the intersection of NLP and robotics.

Team members @MCML

PostDocs

Yunpu Ma

Dr.

→ Group Volker Tresp
Database Systems, Data Mining and AI
→ Co-Group Hinrich Schütze

Philipp Wicke

Dr.

→ Group Hinrich Schütze
Computational Linguistics

Axel Wisiorek

Dr.

→ Group Hinrich Schütze
Computational Linguistics

PhD Students

Sebastian Gerstner

→ Group Hinrich Schütze
Computational Linguistics

Ahmad Dawar Hakimi

→ Group Hinrich Schütze
Computational Linguistics

Lea Hirlimann

→ Group Hinrich Schütze
Computational Linguistics

Ayyoob Imani

→ Group Hinrich Schütze
Computational Linguistics

Amir Hossein Kargaran

→ Group Hinrich Schütze
Computational Linguistics

Molly Kennedy

→ Group Hinrich Schütze
Computational Linguistics

Yihong Liu

→ Group Hinrich Schütze
Computational Linguistics

Yuetian Lu

→ Group Hinrich Schütze
Computational Linguistics

Chunlan Ma

→ Group Hinrich Schütze
Computational Linguistics

Ali Modarressi

→ Group Hinrich Schütze
Computational Linguistics

Ercong Nie

→ Group Hinrich Schütze
Computational Linguistics

Jonas Rohweder

→ Group Hinrich Schütze
Computational Linguistics

Leonor Veloso

→ Group Hinrich Schütze
Computational Linguistics

Mingyang Wang

→ Group Hinrich Schütze
Computational Linguistics

Yujun Wang

→ Group Hinrich Schütze
Computational Linguistics

Haotian Ye

→ Group Hinrich Schütze
Computational Linguistics

Shengqiang Zhang

→ Group Hinrich Schütze
Computational Linguistics

Recent News @MCML

20.05.2026

MCML Members Recognized in Research.com 2026 Computer Science Ranking

Three Researchers Secure Top 10 Spots in Germany

06.05.2026

MemAgents Workshop at ICLR 2026

International Workshop Co-Organized by Hinrich Schütze, Yunpu Ma and Ercong Nie

22.04.2026

MCML at ICLR 2026

40 Accepted Papers (33 Main and 7 Workshops)

01.04.2026

Leonie Weissweiler Joins ScaDS.AI Dresden/Leipzig as Principal Investigator

Research in Natural Language Processing and Computational Linguistics

Publications @MCML

2026

[202]

J. Bi • D. Yan • Y. Wang • W. Huang • H. Chen • G. Wan • M. Ye • X. Xiao • H. Schütze • V. Tresp • Y. Ma
The Geometry of Reasoning: Self-Evaluation via Layerwise Trajectory Evolution.
ICML 2026 - 43rd International Conference on Machine Learning. Seoul, South Korea, Jul 06-11, 2026. To be published. URL

[201]

Y. Liu • R. Zhao • H. Schütze • M. A. Hedderich
Large Reasoning Models Are (Not Yet) Multilingual Latent Reasoners.
ACL 2026 - 64th Annual Meeting of the Association for Computational Linguistics. San Diego, CA, USA, Jul 02-07, 2026. To be published. Preprint available. arXiv

[200]

Y. Liu • J. Yu • M. Wang • Y. Zhang • E. Nie • S. Feng • D. Wang • K. Song • H. Schütze
SAD: A Large-Scale Strategic Argumentative Dialogue Dataset.
ACL 2026 - 64th Annual Meeting of the Association for Computational Linguistics. San Diego, CA, USA, Jul 02-07, 2026. To be published. Preprint available. arXiv

[199]

R. Zhao • Y. Liu • L. Altinger • H. Schütze • M. A. Hedderich
Evaluating Robustness of Large Language Models Against Multilingual Typographical Errors.
ACL 2026 - 64th Annual Meeting of the Association for Computational Linguistics. San Diego, CA, USA, Jul 02-07, 2026. To be published. Preprint available. arXiv GitHub

[198]

M. Li • E. Nie • H. Huang • X. Lv • G. Zhou
Dual-layer prompt ensembles: Leveraging system- and user-level instructions for robust LLM-based query expansion and rank fusion.
Information Fusion 131.104160. Jul. 2026. DOI

[197]

A. A. Sefat • A. H. Kargaran • F. Yvon • H. Schütze
GlotWeb: Web Indexing for Minority Languages.
WWW 2026 - ACM Web Conference. Dubai, United Arab Emirates, Jun 29-Jul 03, 2026. DOI GitHub

[196]

M. Fayyaz • A. Modarressi • H. Deilamsalehy • F. Dernoncourt • R. Rossi • T. Bui • H. Schütze • N. Peng
Steering MoE LLMs via Expert (De)Activation.
ICLR 2026 - 14th International Conference on Learning Representations. Rio de Janeiro, Brazil, Apr 23-27, 2026. To be published. Preprint available. arXiv GitHub

[195]

J. Lan • Z. Liu • U. Schlegel • R. Zhao • Y. Liu • H. Schütze • M. A. Hedderich • T. Seidl
Human Uncertainty-Aware Data Selection and Automatic Labeling in Visual Question Answering.
ICLR 2026 - 14th International Conference on Learning Representations. Rio de Janeiro, Brazil, Apr 23-27, 2026. To be published. Preprint available. arXiv

[194]

A. D. Hakimi • L. Hirlimann • I. Augenstein • H. Schütze
Do We Still Need Humans in the Loop? Comparing Human and LLM Annotation in Active Learning for Hostility Detection.
Preprint (Apr. 2026). arXiv

[193]

A. H. Kargaran • N. Nikeghbal • J. Diesner • F. Yvon • H. Schütze
GlotOCR Bench: OCR Models Still Struggle Beyond a Handful of Unicode Scripts.
Preprint (Apr. 2026). arXiv GitHub

[192]

P. H. L. de Araujo • M. A. Hedderich • A. Modarressi • H. Schütze • B. Roth
Persistent Personas? Role-Playing, Instruction Following, and Safety in Extended Interactions.
EACL 2026 - 19th Conference of the European Chapter of the Association for Computational Linguistics. Rabat, Morocco, Mar 24-29, 2026. DOI

[191]

Y. Liu • Y. Ma • Y. Lu • S. Chen • Z. Ding • V. Tresp
Parameter-Efficient Routed Fine-Tuning: Mixture-of-Experts Demands Mixture of Adaptation Modules.
Findings @EACL 2026 - Findings of the 19th Conference of the European Chapter of the Association for Computational Linguistics. Rabat, Morocco, Mar 24-29, 2026. DOI

[190]

R. Zhao • Y. Liu • H. Schütze • M. A. Hedderich
A Comprehensive Evaluation of Multilingual Chain-of-Thought Reasoning: Performance, Consistency, and Faithfulness Across Languages.
Findings @EACL 2026 - Findings of the 19th Conference of the European Chapter of the Association for Computational Linguistics. Rabat, Morocco, Mar 24-29, 2026. DOI

[189]

Y. Cao • M. Wang • H. Schütze
The Anatomy of an Edit: Mechanism-Guided Activation Steering for Knowledge Editing.
Preprint (Mar. 2026). arXiv

[188]

Y. Veitsman • Y. Liu • H. Schütze
Why Better Cross-Lingual Alignment Fails for Better Cross-Lingual Transfer: Case of Encoders.
Preprint (Mar. 2026). arXiv

[187]

S. Gerstner • H. Schütze
GLUScope: A Tool for Analyzing GLU Neurons in Transformer Language Models.
Preprint (Feb. 2026). arXiv

[186]

M. Li • E. Nie • S. Zhao • T. Chen • H. Huang • G. Zhou
Automatic In-Domain Exemplar Construction and LLM-Based Refinement of Multi-LLM Expansions for Query Expansion.
Preprint (Feb. 2026). arXiv

[185]

Z. S. Taghavi • A. Modarressi • H. Schütze • A. Marfurt
With Argus Eyes: Assessing Retrieval Gaps via Uncertainty Scoring to Detect and Remedy Retrieval Blind Spots.
Preprint (Feb. 2026). arXiv

[184]

Y. Wang • Aniri • J. Bi • S. Pirk • Y. Ma
ASCD: Attention-Steerable Contrastive Decoding for Reducing Hallucination in MLLM.
AAAI 2026 - 40th Conference on Artificial Intelligence. Singapore, Jan 20-27, 2026. DOI

[183]

Y. Liu • B. Xiong • H. Schütze
Evaluating Contextually Mediated Factual Recall in Multilingual Large Language Models.
Preprint (Jan. 2026). arXiv

[182]

Y. Liu • X. Li • M. Zhao • S. Zhang • Z. Wang • Q. Li • S. Feng • F. Ren • D. Wang • H. Schütze
High-Rank Structured Modulation for Parameter-Efficient Fine-Tuning.
Preprint (Jan. 2026). arXiv

[181]

Y. Lu • Y. Liu • H. Schütze
Relational Linearity is a Predictor of Hallucinations.
Preprint (Jan. 2026). arXiv

[180]

Q. Wang • V. Bach Nguyen • Y. Liu • F. Splitt • N. Feldhus • C. Seifert • H. Schütze • S. Möller • V. Schmitt
Parallel Universes, Parallel Languages: A Comprehensive Study on LLM-based Multilingual Counterfactual Example Generation.
Preprint (Jan. 2026). arXiv

[179]

Z. Wang • Y. Liu • M. Wang • E. Nie • D. Chen • Z. Zhao • S. Feng • D. Wang • X. Yang • Y. Zhang • H. Schütze
PlaM: Training-Free Plateau-Guided Model Merging for Better Visual Grounding in MLLMs.
Preprint (Jan. 2026). arXiv URL

[178]

Y. Xia • D. Ulmer • T. Blevins • Y. Liu • H. Schütze • B. Roth
Calibration Is Not Enough: Evaluating Confidence Estimation Under Language Variations.
Preprint (Jan. 2026). arXiv

[177]

S. Yan • X. Yang • Z. Huang • E. Nie • Z. Ding • Z. Li • X. Ma • J. Bi • K. Kersting • J. Z. Pan • H. Schütze • V. Tresp • Y. Ma
Memory-R1: Enhancing Large Language Model Agents to Manage and Utilize Memories via Reinforcement Learning.
Preprint (Jan. 2026). arXiv

2025

[176]

E. Nie • S. Yuan • B. Ma • H. Schmid • M. Färber • F. Kreuter • H. Schütze
Decomposed Prompting: Probing Multilingual Linguistic Structure Knowledge in Large Language Models.
Findings @IJCNLP 2025 - Findings of the 14th International Joint Conference on Natural Language Processing. Mumbai, India, Dec 20-24, 2025. URL

[175]

S. Yuan • E. Nie • L. Kouba • H. Schmid • H. Schütze • M. Färber
LLM in the Loop: Creating the ParaDeHate Dataset for Hate Speech Detoxification.
Findings @IJCNLP 2025 - Findings of the 14th International Joint Conference on Natural Language Processing. Mumbai, India, Dec 20-24, 2025. URL

[174]

I. Ziegler • A. Köksal • D. Elliott • H. Schütze
CRAFT Your Dataset: Task-Specific Synthetic Dataset Generation Through Corpus Retrieval and Augmentation.
Transactions of the Association for Computational Linguistics 13. Dec. 2025. DOI

[173]

Z. Cai • W. Hua • K. Li • Y. Ma • E. Nie • H. Schütze • K. Stanczak • M. E. Taylor
ICLR 2026 Workshop on Memory for LLM-Based Agentic Systems (MemAgents).
Preprint (Dec. 2025). URL

[172]

X. Wang • M. Wang • Y. Liu • H. Schütze • B. Plank
Refusal Direction is Universal Across Safety-Aligned Languages.
NeurIPS 2025 - 39th Conference on Neural Information Processing Systems. San Diego, CA, USA, Nov 30-Dec 07, 2025. URL

[171]

L. Veloso • L. Hirlimann • P. Wicke • H. Schütze
SLAyiNG: Towards Queer Language Processing.
QueerInAI @NeurIPS 2025 - Queer in AI Workshop at the 39th Conference on Neural Information Processing Systems. San Diego, CA, USA, Nov 30-Dec 07, 2025. arXiv

[170]

P. Mondorf • M. Wang • S. Gerstner • A. D. Hakimi • Y. Liu • L. Veloso • S. Zhou • H. Schütze • B. Plank
BlackboxNLP-2025 MIB Shared Task: Exploring Ensemble Strategies for Circuit Localization Methods.
BlackboxNLP @EMNLP 2025 - 8th Workshop on Analyzing and Interpreting Neural Networks for NLP at the Conference on Empirical Methods in Natural Language Processing. Suzhou, China, Nov 04-09, 2025. DOI

[169]

Y. Liu • R. Chen • L. Hirlimann • A. D. Hakimi • M. Wang • A. H. Kargaran • S. Rothe • F. Yvon • H. Schütze
On Relation-Specific Neurons in Large Language Models.
EMNLP 2025 - Conference on Empirical Methods in Natural Language Processing. Suzhou, China, Nov 04-09, 2025. DOI GitHub

[168]

N. Nikeghbal • A. H. Kargaran • J. Diesner
CoBia: Constructed Conversations Can Trigger Otherwise Concealed Societal Biases in LLMs.
EMNLP 2025 - Conference on Empirical Methods in Natural Language Processing. Suzhou, China, Nov 04-09, 2025. DOI GitHub

[167]

Z. Peng • X. Yin • R. Qian • P. Lin • Y. Liu • H. Zhang • C. Ying • Y. Luo
SolEval: Benchmarking Large Language Models for Repository-level Solidity Smart Contract Generation.
EMNLP 2025 - Conference on Empirical Methods in Natural Language Processing. Suzhou, China, Nov 04-09, 2025. DOI GitHub

[166]

Z. S. Taghavi • A. Modarressi • Y. Ma • H. Schütze
ImpliRet: Benchmarking the Implicit Fact Retrieval Challenge.
EMNLP 2025 - Conference on Empirical Methods in Natural Language Processing. Suzhou, China, Nov 04-09, 2025. DOI GitHub

[165]

M. Wang • L. Lange • H. Adel • Y. Ma • J. Strötgen • H. Schütze
Language Mixing in Reasoning Language Models: Patterns, Impact, and Internal Causes.
EMNLP 2025 - Conference on Empirical Methods in Natural Language Processing. Suzhou, China, Nov 04-09, 2025. DOI

[164]

C. Wu • B. Ma • Y. Liu • Z. Zhang • N. Deng • Y. Li • B. Chen • Y. Zhang • Y. Xue • B. Plank
M-ABSA: A Multilingual Dataset for Aspect-Based Sentiment Analysis.
EMNLP 2025 - Conference on Empirical Methods in Natural Language Processing. Suzhou, China, Nov 04-09, 2025. DOI

[163]

Y. Liu • M. Wang • A. H. Kargaran • F. Körner • E. Nie • B. Plank • F. Yvon • H. Schütze
Tracing Multilingual Factual Knowledge Acquisition in Pretraining.
Findings @EMNLP 2025 - Findings of the Conference on Empirical Methods in Natural Language Processing. Suzhou, China, Nov 04-09, 2025. DOI GitHub

[162]

E. Nie • H. Schmid • H. Schütze
Mechanistic Understanding and Mitigation of Language Confusion in English-Centric Large Language Models.
Findings @EMNLP 2025 - Findings of the Conference on Empirical Methods in Natural Language Processing. Suzhou, China, Nov 04-09, 2025. DOI

[161]

R. Zhao • A. Köksal • A. Modarressi • M. A. Hedderich • H. Schütze
Do We Know What LLMs Don't Know? A Study of Consistency in Knowledge Probing.
Findings @EMNLP 2025 - Findings of the Conference on Empirical Methods in Natural Language Processing. Suzhou, China, Nov 04-09, 2025. DOI

[160]

C. Ma • Y. Liu • H. Ye • H. Schütze
Exploring the Role of Transliteration in In-Context Learning for Low-resource Languages Written in Non-Latin Scripts.
MRL @EMNLP 2025 - 5th Multilingual Representation Learning Workshop at the Conference on Empirical Methods in Natural Language Processing. Suzhou, China, Nov 04-09, 2025. DOI

[159]

H. Ye • A. Wisiorek • A. Maronikolakis • Ö. Alaçam • H. Schütze
A Federated Approach to Few-Shot Hate Speech Detection for Marginalized Communities.
MRL @EMNLP 2025 - 5th Multilingual Representation Learning Workshop at the Conference on Empirical Methods in Natural Language Processing. Suzhou, China, Nov 04-09, 2025. DOI

[158]

P. Lin
Massively multilingual language modeling and adaptation.
Dissertation LMU München. Oct. 2025. DOI

[157]

L. Weissweiler • A. Köksal • H. Schütze
Hybrid Human-LLM Corpus Construction and LLM Evaluation for the Caused-Motion Construction.
The Northern European Journal of Language Technology 11.1. Oct. 2025. DOI

[156]

Y. Liu • M. Wang • F. Yvon • H. Schütze
On the Entity-Level Alignment in Crosslingual Consistency.
Preprint (Oct. 2025). arXiv

[155]

S. Dutta • T. Kaufmann • G. Glavaš • I. Habernal • K. Kersting • F. Kreuter • M. Mezini • I. Gurevych • E. Hüllermeier • H. Schütze
Problem Solving Through Human-AI Preference-Based Cooperation.
Computational Linguistics 51.4. Sep. 2025. DOI

[154]

M. Li • M. Luo • T. Lv • Y. Zhang • S. Zhao • E. Nie • G. Zhou
A Survey of Long-Document Retrieval in the PLM and LLM Era.
Preprint (Sep. 2025). arXiv

[153]

M. Li • X. Lv • J. Zou • T. Chen • C. Zhang • S. An • E. Nie • G. Zhou
Query Expansion in the Age of Pre-trained and Large Language Models: A Comprehensive Survey.
Preprint (Sep. 2025). arXiv

[152]

N. Kassner
Consistency and completeness of knowledge acquired by language models.
Dissertation LMU München. Aug. 2025. DOI

[151]

A. Köksal • M. Thaler • A. Imani • A. Üstün • A. Korhonen • H. Schütze
MURI: High-Quality Instruction Tuning Datasets for Low-Resource Languages via Reverse Instructions.
Transactions of the Association for Computational Linguistics 13. Aug. 2025. DOI GitHub

[150]

H. Yang • J. Lan • Y. Liu • H. Schütze • T. Seidl
Enhancing Robustness of Autoregressive Language Models against Orthographic Attacks via Pixel-based Approach.
Preprint (Aug. 2025). arXiv

[149]

J. Bi • Y. Wang • H. Chen • X. Xiao • A. Hecker • V. Tresp • Y. Ma
LLaVA Steering: Visual Instruction Tuning with 500x Fewer Parameters through Modality Linear Representation-Steering.
ACL 2025 - 63rd Annual Meeting of the Association for Computational Linguistics. Vienna, Austria, Jul 27-Aug 01, 2025. DOI

[148]

M. Fayyaz • A. Modarressi • H. Schütze • N. Peng
Collapse of Dense Retrievers: Short, Early, and Literal Biases Outranking Factual Evidence.
ACL 2025 - 63rd Annual Meeting of the Association for Computational Linguistics. Vienna, Austria, Jul 27-Aug 01, 2025. DOI

[147]

Y. Liu • H. Ye • C. Ma • M. Wang • H. Schütze
LangSAMP: Language-Script Aware Multilingual Pretraining.
ACL 2025 - 63rd Annual Meeting of the Association for Computational Linguistics. Vienna, Austria, Jul 27-Aug 01, 2025. DOI GitHub

[146]

E. Nie • B. Shao • Z. Ding • M. Wang • H. Schmid • H. Schütze
BMIKE-53: Investigating Cross-Lingual Knowledge Editing with In-Context Learning.
ACL 2025 - 63rd Annual Meeting of the Association for Computational Linguistics. Vienna, Austria, Jul 27-Aug 01, 2025. DOI GitHub

[145]

R. Pei • Y. Liu • P. Lin • F. Yvon • H. Schütze
Understanding In-Context Machine Translation for Low-Resource Languages: A Case Study on Manchu.
ACL 2025 - 63rd Annual Meeting of the Association for Computational Linguistics. Vienna, Austria, Jul 27-Aug 01, 2025. DOI

[144]

M. Wang • H. Adel • L. Lange • Y. Liu • E. Nie • J. Strötgen • H. Schütze
Lost in Multilinguality: Dissecting Cross-lingual Factual Inconsistency in Transformer Language Models.
ACL 2025 - 63rd Annual Meeting of the Association for Computational Linguistics. Vienna, Austria, Jul 27-Aug 01, 2025. DOI

[143]

I. Bueno • A. Bavaresco • J. M. Cunha • P. Wicke
Testing Spatial Intuitions of Humans and Large Language and Multimodal Models in Analogies.
Analogy-Angle II @ACL 2025 - 2nd Workshop on Analogical Abstraction in Cognition, Perception, and Language at the 63rd Annual Meeting of the Association for Computational Linguistics. Vienna, Austria, Jul 27-Aug 01, 2025. DOI

[142]

L. Hagström • E. Nie • R. Halifa • H. Schmid • R. Johansson • A. Junge
Language Model Re-rankers are Fooled by Lexical Similarities.
FEVER @ACL 2025 - 8th Fact Extraction and VERification Workshop at the 63rd Annual Meeting of the Association for Computational Linguistics. Vienna, Austria, Jul 27-Aug 01, 2025. DOI

[141]

A. D. Hakimi • A. Modarressi • P. Wicke • H. Schütze
Time Course MechInterp: Analyzing the Evolution of Components and Knowledge in Large Language Models.
Findings @ACL 2025 - Findings of the 63rd Annual Meeting of the Association for Computational Linguistics. Vienna, Austria, Jul 27-Aug 01, 2025. DOI

[140]

L. He • E. Nie • H. Schmid • H. Schütze • N. Mesgarani • J. Brennan
Large Language Models as Neurolinguistic Subjects: Discrepancy between Performance and Competence.
Findings @ACL 2025 - Findings of the 63rd Annual Meeting of the Association for Computational Linguistics. Vienna, Austria, Jul 27-Aug 01, 2025. DOI

[139]

A. H. Kargaran • Y. Liu • F. Yvon • H. Schütze
How Programming Concepts and Neurons Are Shared in Code Language Models.
Findings @ACL 2025 - Findings of the 63rd Annual Meeting of the Association for Computational Linguistics. Vienna, Austria, Jul 27-Aug 01, 2025. DOI GitHub

[138]

A. H. Kargaran • A. Modarressi • N. Nikeghbal • J. Diesner • F. Yvon • H. Schütze
MEXA: Multilingual Evaluation of English-Centric LLMs via Cross-Lingual Alignment.
Findings @ACL 2025 - Findings of the 63rd Annual Meeting of the Association for Computational Linguistics. Vienna, Austria, Jul 27-Aug 01, 2025. DOI

[137]

M. Wang • A. Stoll • L. Lange • H. Adel • H. Schütze • J. Strötgen
Bring Your Own Knowledge: A Survey of Methods for LLM Knowledge Expansion.
L2M2 @ACL 2025 - 1st Workshop on Large Language Model Memorization at the 63rd Annual Meeting of the Association for Computational Linguistics. Vienna, Austria, Jul 27-Aug 01, 2025. DOI

[136]

T. Lindenbauer • G. Groh • H. Schütze
From Knowledge to Noise: CTIM-Rover and the Pitfalls of Episodic Memory in Software Engineering Agents.
REALM @ACL 2025 - 1st Workshop for Research on Agent Language Models at the 63rd Annual Meeting of the Association for Computational Linguistics. Vienna, Austria, Jul 27-Aug 01, 2025. DOI

[135]

L. He • E. Nie • S. S. Dindar • A. Firoozi • A. Florea • V. Nguyen • C. Puffay • R. Shimizu • H. Ye • J. Brennan • H. Schmid • H. Schütze • N. Mesgarani
XCOMPS: A Multilingual Benchmark of Conceptual Minimal Pairs.
SIGTYP @ACL 2025 - 7th Workshop on Research in Computational Linguistic Typology and Multilingual NLP at the 63rd Annual Meeting of the Association for Computational Linguistics. Vienna, Austria, Jul 27-Aug 01, 2025. DOI

[134]

P. Lin • M. Thaler • D. Goschala • A. H. Kargaran • Y. Liu • A. Martins • H. Schütze
Construction-Based Reduction of Translationese for Low-Resource Languages: A Pilot Study on Bavarian.
SIGTYP @ACL 2025 - 7th Workshop on Research in Computational Linguistic Typology and Multilingual NLP at the 63rd Annual Meeting of the Association for Computational Linguistics. Vienna, Austria, Jul 27-Aug 01, 2025. DOI

[133]

Q. Feng • Y. Liu • H. Schütze
Your Pretrained Model Tells the Difficulty Itself: A Self-Adaptive Curriculum Learning Paradigm for Natural Language Understanding.
SRW @ACL 2025 - Student Research Workshop at the 63rd Annual Meeting of the Association for Computational Linguistics. Vienna, Austria, Jul 27-Aug 01, 2025. DOI

[132]

E. Özeren • Y. Liu • H. Schütze
HYPEROFA: Expanding LLM Vocabulary to New Languages via Hypernetwork-Based Embedding Initialization.
SRW @ACL 2025 - Student Research Workshop at the 63rd Annual Meeting of the Association for Computational Linguistics. Vienna, Austria, Jul 27-Aug 01, 2025. DOI

[131]

A. Modarressi • H. Deilamsalehy • F. Dernoncourt • T. Bui • R. A. Rossi • S. Yoon • H. Schütze
NoLiMa: Long-Context Evaluation Beyond Literal Matching.
ICML 2025 - 42nd International Conference on Machine Learning. Vancouver, Canada, Jul 13-19, 2025. URL URL

[130]

H. Ye
Multilinguality and inclusive language technologies for low-resource languages.
Dissertation LMU München. Jul. 2025. DOI

[129]

S. Yuan • E. Nie • B. Ma • M. Färber
Why Lift so Heavy? Slimming Large Language Models by Cutting Off the Layers.
IJCNN 2025 - International Joint Conference on Neural Networks. Rome, Italy, Jun 30-Jul 05, 2025. DOI

[128]

P. Wicke • M. M. Bolognesi
Red and blue language: Word choices in the Trump and Harris 2024 presidential debate.
PLOS One 20.6. Jun. 2025. DOI GitHub

[127]

C. Chan • Y. Yim • H. Zeng • Z. Zou • X. Cheng • Z. Sun • Z. Deng • K. Chung • Y. Ao • Y. Fan • C. Jiayang • E. Nie • G. Y. Wong • H. Schmid • H. Schütze • S. See • Y. Song
XToM: Exploring the Multilingual Theory of Mind for Large Language Models.
Preprint (Jun. 2025). arXiv

[126]

S. Yuan • E. Nie • Y. Sun • C. Zhao • W. LaCroix • M. Färber
Beyond Over-Refusal: Scenario-Based Diagnostics and Post-Hoc Mitigation for Exaggerated Refusals in LLMs.
Preprint (Jun. 2025). arXiv

[125]

S. Yuan • E. Nie • M. Tawfelis • H. Schmid • H. Schütze • M. Färber
Hateful Person or Hateful Model? Investigating the Role of Personas in Hate Speech Detection by Large Language Models.
Preprint (Jun. 2025). arXiv

[124]

L. K. Senel
Exploring the frontiers of word understanding and language model evaluation in NLP.
Dissertation LMU München. May 2025. DOI

[123]

J. Bi • D. Yan • Y. Wang • W. Huang • H. Chen • G. Wan • M. Ye • X. Xiao • H. Schütze • V. Tresp • Y. Ma
CoT-Kinetics: A Theoretical Modeling Assessing LRM Reasoning Process.
Preprint (May 2025). arXiv

[122]

S. Gerstner • H. Schütze
Understanding Gated Neurons in Transformers from Their Input-Output Functionality.
Preprint (May 2025). arXiv

[121]

Y. Liu • X. Xu • E. Nie • Z. Wang • S. Feng • D. Wang • Q. Li • H. Schütze
Look Within or Look Beyond? A Theoretical Comparison Between Parameter-Efficient and Full Fine-Tuning.
Preprint (May 2025). arXiv GitHub

[120]

Q. Wang • M. Wang • N. Feldhus • S. Ostermann • Y. Cao • H. Schütze • S. Möller • V. Schmitt
Through a Compressed Lens: Investigating the Impact of Quantization on LLM Explainability and Interpretability.
Preprint (May 2025). arXiv

[119]

Z. Wang • X. Xu • Y. Liu • Y. Zhang • P. Lin • S. Feng • X. Yang • D. Wang • H. Schütze
Why Do More Experts Fail? A Theoretical Analysis of Model Merging.
Preprint (May 2025). arXiv GitHub

[118]

P. Lin • A. F. T. Martins • H. Schütze
XAMPLER: Learning to Retrieve Cross-Lingual In-Context Examples.
Findings @NAACL 2025 - Findings of the Annual Conference of the North American Chapter of the Association for Computational Linguistics. Albuquerque, NM, USA, Apr 29-May 04, 2025. DOI GitHub

[117]

C. Ma • A. ImaniGooghari • H. Ye • R. Pei • E. Asgari • H. Schütze
Taxi1500: A Dataset for Multilingual Text Classification in 1500 Languages.
NAACL 2025 - Annual Conference of the North American Chapter of the Association for Computational Linguistics. Albuquerque, NM, USA, Apr 29-May 04, 2025. DOI

[116]

J. Yu • Y. Zhang • B. Wang • P. Lin • Y. Liu • S. Feng
SSMLoRA: Enhancing Low-Rank Adaptation with State Space Model.
NAACL 2025 - Annual Conference of the North American Chapter of the Association for Computational Linguistics. Albuquerque, NM, USA, Apr 29-May 04, 2025. DOI GitHub

[115]

P. Lin • A. F. T. Martins • H. Schütze
A Recipe of Parallel Corpora Exploitation for Multilingual Large Language Models.
Findings @NAACL 2025 - Findings of the Annual Conference of the North American Chapter of the Association for Computational Linguistics. Albuquerque, NM, USA, Apr 29-May 04, 2025. DOI

[114]

I. d. S. Bueno Júnior • H. Ye • A. Wisiorek • H. Schütze
Privacy-Preserving Federated Learning for Hate Speech Detection.
SRW @NAACL 2025 - Student Research Workshop at the Annual Conference of the North American Chapter of the Association for Computational Linguistics. Albuquerque, NM, USA, Apr 29-May 04, 2025. DOI

[113]

A. Modarressi • A. Köksal • A. Imani • M. Fayyaz • H. Schütze
MemLLM: Finetuning LLMs to Use An Explicit Read-Write Memory.
NFAM @ICLR 2025 - Workshop on New Frontiers in Associative Memories at the 13th International Conference on Learning Representations. Singapore, Apr 24-28, 2025. URL

[112]

A. Modarressi • A. Köksal • A. Imani • M. Fayyaz • H. Schütze
MemLLM: Finetuning LLMs to Use Explicit Read-Write Memory.
Transactions on Machine Learning Research. Apr. 2025. URL GitHub

[111]

J. Bi • Aniri • Y. Wang • D. Yan • W. Huang • Z. Jin • X. Ma • S. Yan • A. Hecker • M. Ye • X. Xiao • H. Schütze • V. Tresp • Y. Ma
PRISM: Self-Pruning Intrinsic Selection Method for Training-Free Multimodal Data Selection.
Preprint (Feb. 2025). arXiv GitHub

[110]

Y. Liu • C. Ma • H. Ye • H. Schütze
TransMI: A Framework to Create Strong Baselines from Multilingual Pretrained Language Models for Transliterated Data.
COLING 2025 - The 31st International Conference on Computational Linguistics. Abu Dhabi, United Arab Emirates, Jan 19-24, 2025. URL GitHub

[109]

Y. Liu • M. Wang • A. H. Kargaran • A. Imani • O. Xhelili • H. Ye • C. Ma • F. Yvon • H. Schütze
How Transliterations Improve Crosslingual Alignment.
COLING 2025 - The 31st International Conference on Computational Linguistics. Abu Dhabi, United Arab Emirates, Jan 19-24, 2025. URL

2024

[108]

A. H. Kargaran • F. Yvon • H. Schütze
GlotCC: An Open Broad-Coverage CommonCrawl Corpus and Pipeline for Minority Languages.
NeurIPS 2024 - 38th Conference on Neural Information Processing Systems. Vancouver, Canada, Dec 10-15, 2024. DOI

[107]

Y. Liu • Y. Zhang • Q. Li • T. Liu • S. Feng • D. Wang • Y. Zhang • H. Schütze
HiFT: A Hierarchical Full Parameter Fine-Tuning Strategy.
EMNLP 2024 - Conference on Empirical Methods in Natural Language Processing. Miami, FL, USA, Nov 12-16, 2024. DOI

[106]

A. Köksal • T. Schick • A. Korhonen • H. Schütze
LongForm: Effective Instruction Tuning with Reverse Instructions.
Findings @EMNLP 2024 - Findings of the Conference on Empirical Methods in Natural Language Processing. Miami, FL, USA, Nov 12-16, 2024. DOI GitHub

[105]

A. Modarressi • A. Köksal • H. Schütze
Consistent Document-Level Relation Extraction via Counterfactuals.
Findings @EMNLP 2024 - Findings of the Conference on Empirical Methods in Natural Language Processing. Miami, FL, USA, Nov 12-16, 2024. DOI

[104]

M. Wang • L. Lange • H. Adel • J. Strötgen • H. Schütze
Better Call SAUL: Fluent and Consistent Language Model Editing with Generation Regularization.
Findings @EMNLP 2024 - Findings of the Conference on Empirical Methods in Natural Language Processing. Miami, FL, USA, Nov 12-16, 2024. DOI

[103]

O. Xhelili • Y. Liu • H. Schütze
Breaking the Script Barrier in Multilingual Pre-Trained Language Models with Transliteration-Based Post-Training Alignment.
Findings @EMNLP 2024 - Findings of the Conference on Empirical Methods in Natural Language Processing. Miami, FL, USA, Nov 12-16, 2024. DOI GitHub

[102]

A. Yüksel • A. Köksal • L. K. Senel • A. Korhonen • H. Schütze
TurkishMMLU: Measuring Massive Multitask Language Understanding in Turkish.
Findings @EMNLP 2024 - Findings of the Conference on Empirical Methods in Natural Language Processing. Miami, FL, USA, Nov 12-16, 2024. DOI GitHub

[101]

R. Zhao • A. Köksal • Y. Liu • L. Weissweiler • A. Korhonen • H. Schütze
SynthEval: Hybrid Behavioral Testing of NLP Models with Synthetic CheckLists.
Findings @EMNLP 2024 - Findings of the Conference on Empirical Methods in Natural Language Processing. Miami, FL, USA, Nov 12-16, 2024. DOI GitHub

[100]

V. Hofmann • L. Weissweiler • D. Mortensen • H. Schütze • J. Pierrehumbert
Derivational Morphology Reveals Analogical Generalization in Large Language Models.
Preprint (Nov. 2024). arXiv

[99]

M. Thaler • A. Köksal • A. Leidinger • A. Korhonen • H. Schütze
How far can bias go? -- Tracing bias from pretraining data to alignment.
Preprint (Nov. 2024). arXiv

[98]

Y. Liu • F. Shi • D. Wang • Y. Zhang • H. Schütze
ChatZero: Zero-Shot Cross-Lingual Dialogue Generation via Pseudo-Target Language.
ECAI 2024 - 27th European Conference on Artificial Intelligence. Santiago de Compostela, Spain, Oct 19-24, 2024. DOI

[97]

Y. Liu • E. Nie • S. Feng • Z. Hua • Z. Ding • D. Wang • Y. Zhang • H. Schütze
A Unified Data Augmentation Framework for Low-Resource Multi-Domain Dialogue Generation.
ECML-PKDD 2024 - European Conference on Machine Learning and Principles and Practice of Knowledge Discovery in Databases. Vilnius, Lithuania, Sep 09-13, 2024. DOI GitHub

[96]

S. Ji • Z. Li • J. Paavola • P. Lin • P. Chen • D. O'Brien • H. Luo • H. Schütze • J. Tiedemann • B. Haddow
EMMA-500: Enhancing Massively Multilingual Adaptation of Large Language Models.
Preprint (Sep. 2024). arXiv

[95]

V. Blaschke • C. Purschke • H. Schütze • B. Plank
What Do Dialect Speakers Want? A Survey of Attitudes Towards Language Technology for German Dialects.
ACL 2024 - 62nd Annual Meeting of the Association for Computational Linguistics. Bangkok, Thailand, Aug 11-16, 2024. DOI

[94]

A. H. Kargaran • F. Yvon • H. Schütze
MaskLID: Code-Switching Language Identification through Iterative Masking.
ACL 2024 - 62nd Annual Meeting of the Association for Computational Linguistics. Bangkok, Thailand, Aug 11-16, 2024. DOI GitHub

[93]

Y. Liu • C. Ma • H. Ye • H. Schütze
TransliCo: A Contrastive Learning Framework to Address the Script Barrier in Multilingual Pretrained Language Models.
ACL 2024 - 62nd Annual Meeting of the Association for Computational Linguistics. Bangkok, Thailand, Aug 11-16, 2024. DOI

[92]

L. K. Senel • B. Fetahu • D. Yoshida • Z. Chen • G. Castellucci • N. Vedula • J. I. Choi • S. Malmasi
Generative Explore-Exploit: Training-free Optimization of Generative Recommender Systems using LLM Optimizers.
ACL 2024 - 62nd Annual Meeting of the Association for Computational Linguistics. Bangkok, Thailand, Aug 11-16, 2024. DOI

[91]

P. Wicke • L. Wachowiak
Exploring Spatial Schema Intuitions in Large Language and Vision Models.
Findings @ACL 2024 - Findings of the 62nd Annual Meeting of the Association for Computational Linguistics. Bangkok, Thailand, Aug 11-16, 2024. DOI GitHub

[90]

S. Yuan • E. Nie • M. Färber • H. Schmid • H. Schütze
GNNAVI: Navigating the Information Flow in Large Language Models by Graph Neural Network.
Findings @ACL 2024 - Findings of the 62nd Annual Meeting of the Association for Computational Linguistics. Bangkok, Thailand, Aug 11-16, 2024. DOI

[89]

M. Zhang • V. Gautam • M. Wang • J. Alabi • X. Shen • D. Klakow • M. Mosbach
The Impact of Demonstrations on Multilingual In-Context Learning: A Multidimensional Analysis.
Findings @ACL 2024 - Findings of the 62nd Annual Meeting of the Association for Computational Linguistics. Bangkok, Thailand, Aug 11-16, 2024. DOI

[88]

M. Wang • H. Adel • L. Lange • J. Strötgen • H. Schütze
Learn it or Leave it: Module Composition and Pruning for Continual Learning.
RepL4NLP-2024 @ACL 2024 - 9th Workshop on Representation Learning for NLP at the 62nd Annual Meeting of the Association for Computational Linguistics. Bangkok, Thailand, Aug 11-16, 2024. URL

[87]

A. Yüksel • A. Köksal • L. K. Senel • A. Korhonen • H. Schütze
TurkishMMLU: Measuring Massive Multitask Language Understanding in Turkish.
SIGTURK @ACL 2024 - 1st Workshop on Natural Language Processing for Turkic Languages at the 62nd Annual Meeting of the Association for Computational Linguistics. Bangkok, Thailand, Aug 11-16, 2024. Invited Talk. arXiv GitHub

[86]

M. Aßenmacher • A. Stephan • L. Weissweiler • E. Çano • I. Ziegler • M. Härttrich • B. Bischl • B. Roth • C. Heumann • H. Schütze
Collaborative Development of Modular Open Source Educational Resources for Natural Language Processing.
TeachingNLP @ACL 2024 - 6th Workshop on Teaching NLP at the 62nd Annual Meeting of the Association for Computational Linguistics. Bangkok, Thailand, Aug 11-16, 2024. URL

[85]

P. Wicke • L. Hirlimann • J. M. Cunha
Using Analogical Reasoning to Prompt LLMs for their Intuitions of Abstract Spatial Schemas.
Analogy-ANGLE 2024 @IJCAI 2024 - 1st Workshop on Analogical Abstraction in Cognition, Perception, and Language at the 33rd International Joint Conference on Artificial Intelligence. Jeju, Korea, Aug 03-09, 2024. PDF

[84]

L. Weissweiler
Computational approaches to construction grammar and morphology.
Dissertation LMU München. Jul. 2024. DOI

[83]

Y. Liu • P. Lin • M. Wang • H. Schütze
OFA: A Framework of Initializing Unseen Subword Embeddings for Efficient Large-scale Multilingual Continued Pretraining.
Findings @NAACL 2024 - Findings of the Annual Conference of the North American Chapter of the Association for Computational Linguistics. Mexico City, Mexico, Jun 16-21, 2024. DOI

[82]

H. Ye • Y. Liu • C. Ma • H. Schütze
MoSECroT: Model Stitching with Static Word Embeddings for Crosslingual Zero-shot Transfer.
Insights from Negative Results @NAACL 2024 - 5th Workshop on Insights from Negative Results in NLP at the Annual Conference of the North American Chapter of the Association for Computational Linguistics. Mexico City, Mexico, Jun 16-21, 2024. DOI

[81]

M. Wang • H. Adel • L. Lange • J. Strötgen • H. Schütze
Rehearsal-Free Modular and Compositional Continual Learning for Language Models.
NAACL 2024 - Annual Conference of the North American Chapter of the Association for Computational Linguistics. Mexico City, Mexico, Jun 16-21, 2024. DOI

[80]

H. Chen • J. Büssing • D. Rügamer • E. Nie
Leveraging (Sentence) Transformer Models with Contrastive Learning for Identifying Machine-Generated Text.
SemEval @NAACL 2024 - 18th International Workshop on Semantic Evaluation at the Annual Conference of the North American Chapter of the Association for Computational Linguistics. Mexico City, Mexico, Jun 16-21, 2024. DOI

[79]

L. Hirlimann • S. Zhang • H. Schütze • P. Wicke
Robustness Testing of Multi-Modal Models in Varied Home Environments for Assistive Robots.
Preprint (Jun. 2024). arXiv

[78]

V. Blaschke • B. Kovačić • S. Peng • H. Schütze • B. Plank
MaiBaam: A Multi-Dialectal Bavarian Universal Dependency Treebank.
LREC-COLING 2024 - Joint International Conference on Computational Linguistics, Language Resources and Evaluation. Torino, Italy, May 20-25, 2024. URL

[77]

A. H. Kargaran • F. Yvon • H. Schütze
GlotScript: A Resource and Tool for Low Resource Writing System Identification.
LREC-COLING 2024 - Joint International Conference on Computational Linguistics, Language Resources and Evaluation. Torino, Italy, May 20-25, 2024. URL GitHub

[76]

A. Köksal • S. Severini • H. Schütze
SilverAlign: MT-Based Silver Data Algorithm for Evaluating Word Alignment.
LREC-COLING 2024 - Joint International Conference on Computational Linguistics, Language Resources and Evaluation. Torino, Italy, May 20-25, 2024. URL

[75]

D. R. Mortensen • V. Izrailevitch • Y. Xiao • H. Schütze • L. Weissweiler
Verbing Weirds Language (Models): Evaluation of English Zero-Derivation in Five LLMs.
LREC-COLING 2024 - Joint International Conference on Computational Linguistics, Language Resources and Evaluation. Torino, Italy, May 20-25, 2024. URL

[74]

L. Weissweiler • N. Böbel • K. Guiller • S. Herrera • W. Scivetti • A. Lorenzi • N. Melnik • A. Bhatia • H. Schütze • L. Levin • A. Zeldes • J. Nivre • W. Croft • N. Schneider
UCxn: Typologically Informed Annotation of Constructions Atop Universal Dependencies.
LREC-COLING 2024 - Joint International Conference on Computational Linguistics, Language Resources and Evaluation. Torino, Italy, May 20-25, 2024. URL

[73]

S. Zhou • L. Weissweiler • T. He • H. Schütze • D. R. Mortensen • L. Levin
Constructions Are So Difficult That Even Large Language Models Get Them Right for the Wrong Reasons.
LREC-COLING 2024 - Joint International Conference on Computational Linguistics, Language Resources and Evaluation. Torino, Italy, May 20-25, 2024. URL

[72]

A. Modarressi • A. Imani • M. Fayyaz • H. Schütze
RET-LLM: Towards a General Read-Write Memory for Large Language Models.
AGI @ICLR 2024 - Workshop on Artificial General Intelligence at the 12th International Conference on Learning Representations. Vienna, Austria, May 07-11, 2024. arXiv

[71]

V. Steinborn
Multilingual and multimodal bias probing and mitigation in natural language processing.
Dissertation LMU München. Apr. 2024. DOI

[70]

P. Lin • S. Ji • J. Tiedemann • A. F. T. Martins • H. Schütze
MaLA-500: Massive Language Adaptation of Large Language Models.
Preprint (Apr. 2024). arXiv GitHub

[69]

A. Maronikolakis • A. Köksal • H. Schütze
Sociocultural knowledge is needed for selection of shots in hate speech detection tasks.
LT-EDI 2024 - 4th Workshop on Language Technology for Equality, Diversity, Inclusion. St. Julian’s, Malta, Mar 21, 2024. URL

[68]

B. Ma • E. Nie • S. Yuan • H. Schmid • M. Färber • F. Kreuter • H. Schütze
ToPro: Token-Level Prompt Decomposition for Cross-Lingual Sequence Labeling Tasks.
EACL 2024 - 18th Conference of the European Chapter of the Association for Computational Linguistics. St. Julian’s, Malta, Mar 17-22, 2024. DOI

[67]

L. K. Senel • B. Ebing • K. Baghirova • H. Schütze • G. Glavaš
Kardeş-NLU: Transfer to Low-Resource Languages with Big Brother’s Help – A Benchmark and Evaluation for Turkic Languages.
EACL 2024 - 18th Conference of the European Chapter of the Association for Computational Linguistics. St. Julian’s, Malta, Mar 17-22, 2024. Outstanding Paper Award. DOI

[66]

P. Lin • C. Hu • Z. Zhang • A. Martins • H. Schütze
mPLM-Sim: Better Cross-Lingual Similarity and Transfer in Multilingual Pretrained Language Models.
Findings @EACL 2024 - Findings of the 18th Conference of the European Chapter of the Association for Computational Linguistics. St. Julian’s, Malta, Mar 17-22, 2024. URL

[65]

L. Weissweiler • A. Köksal • H. Schütze
Hybrid Human-LLM Corpus Construction and LLM Evaluation for Rare Linguistic Phenomena.
Preprint (Mar. 2024). arXiv

[64]

P. Wicke
Probing Language Models' Gesture Understanding for Enhanced Human-AI Interaction.
Preprint (Jan. 2024). arXiv

2023

[63]

X. Li • E. Nie • S. Liang
From Classification to Generation: Insights into Crosslingual Retrieval Augmented ICL.
Instruction Tuning and Instruction Following @NeurIPS 2023 - Workshop Instruction Tuning and Instruction Following at the 37th Conference on Neural Information Processing Systems. New Orleans, LA, USA, Dec 10-16, 2023. URL

[62]

S. Zhang • P. Wicke • L. K. Senel • L. Figueredo • A. Naceri • S. Haddadin • B. Plank • H. Schütze
LoHoRavens: A Long-Horizon Language-Conditioned Benchmark for Robotic Tabletop Manipulation.
Robot Learning @NeurIPS 2023 - 6th Robot Learning Workshop: Pretraining, Fine-Tuning, and Generalization with Large Scale Models at the 37th Conference on Neural Information Processing Systems. New Orleans, LA, USA, Dec 10-16, 2023. URL

[61]

X. Li • E. Nie • S. Liang
Crosslingual Retrieval Augmented In-context Learning for Bangla.
BLP 2023 - 1st Workshop on Bangla Language Processing. Singapore, Dec 07, 2023. DOI

[60]

Z. Zhang • H. Yang • B. Ma • D. Rügamer • E. Nie
Baby’s CoThought: Leveraging Large Language Models for Enhanced Reasoning in Compact Models.
BabyLM Challenge @CoNLL 2023 - 1st BabyLM Challenge at the 27th Conference on Computational Natural Language Learning. Singapore, Dec 06-10, 2023. DOI GitHub

[59]

N. Kassner • O. Tafjord • A. Sabharwal • K. Richardson • H. Schütze • P. Clark
Language Models with Rationality.
EMNLP 2023 - Conference on Empirical Methods in Natural Language Processing. Singapore, Dec 06-10, 2023. DOI

[58]

M. Wang • H. Adel • L. Lange • J. Strötgen • H. Schütze
GradSim: Gradient-Based Language Grouping for Effective Multilingual Training.
EMNLP 2023 - Conference on Empirical Methods in Natural Language Processing. Singapore, Dec 06-10, 2023. DOI

[57]

L. Weissweiler • V. Hofmann • A. Kantharuban • A. Cai • R. Dutt • A. Hengle • A. Kabra • A. Kulkarni • A. Vijayakumar • H. Yu • H. Schütze • K. Oflazer • D. Mortensen
Counting the Bugs in ChatGPT's Wugs: A Multilingual Investigation into the Morphological Capabilities of a Large Language Model.
EMNLP 2023 - Conference on Empirical Methods in Natural Language Processing. Singapore, Dec 06-10, 2023. DOI

[56]

A. H. Kargaran • A. Imani • F. Yvon • H. Schütze
GlotLID: Language Identification for Low-Resource Languages.
Findings @EMNLP 2023 - Findings of the Conference on Empirical Methods in Natural Language Processing. Singapore, Dec 06-10, 2023. DOI GitHub

[55]

A. Köksal • T. Schick • H. Schütze
MEAL: Stable and Active Learning for Few-Shot Prompting.
Findings @EMNLP 2023 - Findings of the Conference on Empirical Methods in Natural Language Processing. Singapore, Dec 06-10, 2023. DOI GitHub

[54]

A. Köksal • O. Yalcin • A. Akbiyik • M. T. Kilavuz • A. Korhonen • H. Schütze
Language-Agnostic Bias Detection in Language Models with Bias Probing.
Findings @EMNLP 2023 - Findings of the Conference on Empirical Methods in Natural Language Processing. Singapore, Dec 06-10, 2023. DOI GitHub

[53]

Y. Liu • H. Ye • L. Weissweiler • R. Pei • H. Schütze
Crosslingual Transfer Learning for Low-Resource Languages Based on Multilingual Colexification Graphs.
Findings @EMNLP 2023 - Findings of the Conference on Empirical Methods in Natural Language Processing. Singapore, Dec 06-10, 2023. DOI

[52]

E. Nie • H. Schmid • H. Schütze
Unleashing the Multilingual Encoder Potential: Boosting Zero-Shot Performance via Probability Calibration.
Findings @EMNLP 2023 - Findings of the Conference on Empirical Methods in Natural Language Processing. Singapore, Dec 06-10, 2023. DOI

[51]

V. Hangya • S. Severini • R. Ralev • A. Fraser • H. Schütze
Multilingual Word Embeddings for Low-Resource Languages using Anchors and a Chain of Related Languages.
MRL @EMNLP 2023 - 3rd Workshop on Multi-lingual Representation Learning at the Conference on Empirical Methods in Natural Language Processing. Singapore, Dec 06-10, 2023. DOI

[50]

A. Köksal • R. Aksitov • C.-C. Chang
Hallucination Augmented Recitations for Language Models.
Preprint (Nov. 2023). arXiv

[49]

L. Weissweiler • V. Hofmann • A. Köksal • H. Schütze
Explaining pretrained language models' understanding of linguistic structures using construction grammar.
Frontiers in Artificial Intelligence 6. Oct. 2023. DOI

[48]

B. Ma • E. Nie • H. Schmid • H. Schütze
Is Prompt-Based Finetuning Always Better than Vanilla Finetuning? Insights from Cross-Lingual Language Understanding.
KONVENS 2023 - 19th Conference on Natural Language Processing. Ingolstadt, Germany, Sep 18-22, 2023. URL

[47]

A. Maronikolakis • P. O’Grady • H. Schütze • M. Lyra
Improving Few-Shot Learning with Multilingual Transfer and Monte Carlo Training Set Selection.
LSD 2023 - CLASP Conference on Learning with Small Data. Gothenburg, Sweden, Sep 11-12, 2023. URL

[46]

E. Nie • H. Schmid • H. Schütze
Cross-Lingual Constituency Parsing for Middle High German: A Delexicalized Approach.
ALP @RANLP 2023 - 1st Workshop on Ancient Language Processing at the Conference on Recent Advances in Natural Language Processing. Varna, Bulgaria, Sep 08, 2023. URL

[45]

A. Imani • P. Lin • A. H. Kargaran • S. Severini • M. J. Sabet • N. Kassner • C. Ma • H. Schmid • A. Martins • F. Yvon • H. Schütze
Glot500: Scaling Multilingual Corpora and Language Models to 500 Languages.
ACL 2023 - 61st Annual Meeting of the Association for Computational Linguistics. Toronto, Canada, Jul 09-14, 2023. DOI GitHub

[44]

Y. Liu • H. Ye • L. Weissweiler • P. Wicke • R. Pei • R. Zangenfeind • H. Schütze
A Crosslingual Investigation of Conceptualization in 1335 Languages.
ACL 2023 - 61st Annual Meeting of the Association for Computational Linguistics. Toronto, Canada, Jul 09-14, 2023. DOI

[43]

Y. Liu • S. Feng • D. Wang • Y. Zhang • H. Schütze
PVGRU: Generating Diverse and Relevant Dialogue Responses via Pseudo-Variational Mechanism.
ACL 2023 - 61st Annual Meeting of the Association for Computational Linguistics. Toronto, Canada, Jul 09-14, 2023. DOI

[42]

A. Modarressi • M. Fayyaz • E. Aghazadeh • Y. Yaghoobzadeh • M. T. Pilehvar
DecompX: Explaining Transformers Decisions by Propagating Token Decomposition.
ACL 2023 - 61st Annual Meeting of the Association for Computational Linguistics. Toronto, Canada, Jul 09-14, 2023. DOI GitHub

[41]

X. Wang • L. Weissweiler • H. Schütze • B. Plank
How to Distill your BERT: An Empirical Study on the Impact of Weight Initialisation and Distillation Objectives.
ACL 2023 - 61st Annual Meeting of the Association for Computational Linguistics. Toronto, Canada, Jul 09-14, 2023. DOI

[40]

Z. Han • R. Liao • J. Gu • Y. Zhang • Z. Ding • Y. Gu • H. Köppl • H. Schütze • V. Tresp
ECOLA: Enhancing Temporal Knowledge Embeddings with Contextualized Language Representations.
Findings @ACL 2023 - Findings of the 61st Annual Meeting of the Association for Computational Linguistics. Toronto, Canada, Jul 09-14, 2023. DOI

[39]

E. Nie • S. Liang • H. Schmid • H. Schütze
Cross-Lingual Retrieval Augmented Prompt for Low-Resource Languages.
Findings @ACL 2023 - Findings of the 61st Annual Meeting of the Association for Computational Linguistics. Toronto, Canada, Jul 09-14, 2023. DOI

[38]

P. Wicke
LMs stand their Ground: Investigating the Effect of Embodiment in Figurative Language Interpretation by Language Models.
Findings @ACL 2023 - Findings of the 61st Annual Meeting of the Association for Computational Linguistics. Toronto, Canada, Jul 09-14, 2023. DOI

[37]

Y. Liu • A. Chronopoulou • H. Schütze • A. Fraser
On the Copying Problem of Unsupervised NMT: A Training Schedule with a Language Discriminator Loss.
IWSLT 2023 - 20th International Conference on Spoken Language Translation. Toronto, Canada, Jul 09-14, 2023. DOI

[36]

P. Wicke • L. K. Senel • S. Zhang • L. Figueredo • A. Naceri • S. Haddadin • H. Schütze
Towards Language-Based Modulation of Assistive Robots through Multimodal Models.
Geriatronics Summit 2023 - 2nd Geriatronics Summit. Garmisch-Partenkirchen, Germany, Jul 02-03, 2023. arXiv

[35]

V. Steinborn • A. Maronikolakis • H. Schütze
Politeness Stereotypes and Attack Vectors: Gender Stereotypes in Japanese and Korean Language Models.
Preprint (Jun. 2023). arXiv

[34]

V. Blaschke • H. Schütze • B. Plank
A Survey of Corpora for Germanic Low-Resource Languages and Dialects.
NoDaLiDa 2023 - 24th Nordic Conference on Computational Linguistics. Tórshavn, Faroe Islands, May 22-24, 2023. URL

[33]

V. Blaschke • H. Schütze • B. Plank
Does Manipulating Tokenization Aid Cross-Lingual Transfer? A Study on POS Tagging for Non-Standardized Languages.
VarDial @EACL 2023 - 10th Workshop on NLP for Similar Languages, Varieties and Dialects at the 17th Conference of the European Chapter of the Association for Computational Linguistics. Dubrovnik, Croatia, May 02-06, 2023. DOI

[32]

Y. Liu • S. Feng • D. Wang • Y. Zhang • H. Schütze
Evaluate What You Can't Evaluate: Unassessable Quality for Generated Response.
Preprint (May 2023). arXiv

[31]

H. Ye • Y. Liu • H. Schütze
A study of conceptual language similarity: comparison and evaluation.
Preprint (May 2023). arXiv

[30]

L. Weissweiler • T. He • N. Otani • D. R. Mortensen • L. Levin • H. Schütze
Construction Grammar Provides Unique Insight into Neural Language Models.
GURT 2023 - Georgetown University Round Table on Linguistics. Washington D.C., USA, Mar 09-12, 2023. URL

2022

[29]

I. Ziegler • B. Ma • E. Nie • B. Bischl • D. Rügamer • B. Schubert • E. Dorigatti
What cleaves? Is proteasomal cleavage prediction reaching a ceiling?
LMRL @NeurIPS 2022 - Workshop on Learning Meaningful Representations of Life at the 36th Conference on Neural Information Processing Systems. New Orleans, LA, USA, Nov 28-Dec 09, 2022. URL

[28]

J. Li • M. Zhao • Y. Xie • A. Maronikolakis • P. Pu • H. Schütze
This joke is [MASK]: Recognizing Humor and Offense with Prompting.
TL4NLP @NeurIPS 2022 - 1st Transfer Learning for Natural Language Processing Workshop at the 36th Conference on Neural Information Processing Systems. New Orleans, LA, USA, Nov 28-Dec 09, 2022. URL

[27]

A. Imani • S. Severini • M. J. Sabet • F. Yvon • H. Schütze
Graph-Based Multilingual Label Propagation for Low-Resource Part-of-Speech Tagging.
EMNLP 2022 - Conference on Empirical Methods in Natural Language Processing. Abu Dhabi, United Arab Emirates, Nov 07-11, 2022. DOI

[26]

L. Weissweiler • V. Hofmann • A. Köksal • H. Schütze
The better your Syntax, the better your Semantics? Probing Pretrained Language Models for the English Comparative Correlative.
EMNLP 2022 - Conference on Empirical Methods in Natural Language Processing. Abu Dhabi, United Arab Emirates, Nov 07-11, 2022. DOI

[25]

P. Lin • J. Wang • H. Schütze • W. Li
Modeling Content-Emotion Duality via Disentanglement for Empathetic Conversation.
Preprint (Sep. 2022). arXiv

[24]

M. J. Sabet
Multilingual representations and models for improved low-resource language processing.
Dissertation LMU München. Jul. 2022. DOI

[23]

A. Maronikolakis • P. Baader • H. Schütze
Analyzing Hate Speech Data along Racial, Gender and Intersectional Axes.
GeBNLP 2022 - 4th Workshop on Gender Bias in Natural Language Processing. Seattle, WA, USA, Jul 15, 2022. DOI

[22]

S. Yuan • A. Maronikolakis • H. Schütze
Separating Hate Speech and Offensive Language Classes via Adversarial Debiasing.
WOAH 2022 - 6th Workshop on Online Abuse and Harms. Seattle, WA, USA, Jul 14, 2022. DOI

[21]

S. Severini • V. Hangya • M. J. Sabet • A. Fraser • H. Schütze
Don’t Forget Cheap Training Signals Before Building Unsupervised Bilingual Word Embeddings.
BUCC @LREC 2022 - 15th Workshop on Building and Using Comparable Corpora at the 13th International Conference on Language Resources and Evaluation. Marseille, France, Jun 21-23, 2022. URL

[20]

S. Severini • A. Imani • P. Dufter • H. Schütze
Towards a Broad Coverage Named Entity Resource: A Data-Efficient Approach for Many Diverse Languages.
LREC 2022 - 13th International Conference on Language Resources and Evaluation. Marseille, France, Jun 21-23, 2022. URL

[19]

V. Steinborn • P. Dufter • H. Jabbar • H. Schütze
An Information-Theoretic Approach and Dataset for Probing Gender Stereotypes in Multilingual Masked Language Models.
Findings @NAACL 2022 - Findings of the Annual Conference of the North American Chapter of the Association for Computational Linguistics. Seattle, WA, USA, Jun 10-15, 2022. DOI

[18]

M. Zhao • F. Mi • Y. Wang • M. Li • X. Jiang • Q. Liu • H. Schütze
LMTurk: Few-Shot Learners as Crowdsourcing Workers in a Language-Model-as-a-Service Framework.
Findings @NAACL 2022 - Findings of the Annual Conference of the North American Chapter of the Association for Computational Linguistics. Seattle, WA, USA, Jun 10-15, 2022. DOI

[17]

L. Weissweiler • V. Hofmann • M. J. Sabet • H. Schütze
CaMEL: Case Marker Extraction without Labels.
ACL 2022 - 60th Annual Meeting of the Association for Computational Linguistics. Dublin, Ireland, May 22-27, 2022. DOI

[16]

A. Imani • L. K. Senel • M. J. Sabet • F. Yvon • H. Schütze
Graph Neural Networks for Multiparallel Word Alignment.
Findings @ACL 2022 - Findings of the 60th Annual Meeting of the Association for Computational Linguistics. Dublin, Ireland, May 22-27, 2022. DOI

[15]

S. Sharifzadeh • S. M. Baharlou • M. Schmitt • H. Schütze • V. Tresp
Improving Scene Graph Classification by Exploiting Knowledge from Texts.
AAAI 2022 - 36th Conference on Artificial Intelligence. Virtual, Feb 22-Mar 01, 2022. DOI

2021

[14]

Y. Elazar • N. Kassner • S. Ravfogel • A. Ravichander • E. Hovy • H. Schütze • Y. Goldberg
Measuring and Improving Consistency in Pretrained Language Models.
Transactions of the Association for Computational Linguistics 9. Dec. 2021. DOI

[13]

A. Imani • M. J. Sabet • L. K. Senel • P. Dufter • F. Yvon • H. Schütze
Graph Algorithms for Multiparallel Word Alignment.
EMNLP 2021 - Conference on Empirical Methods in Natural Language Processing. Punta Cana, Dominican Republic, Nov 07-11, 2021. DOI

[12]

N. Kassner • O. Tafjord • H. Schütze • P. Clark
BeliefBank: Adding Memory to a Pre-Trained Language Model for a Systematic Notion of Belief.
EMNLP 2021 - Conference on Empirical Methods in Natural Language Processing. Punta Cana, Dominican Republic, Nov 07-11, 2021. DOI

[11]

M. Mozes • M. Schmitt • V. Golkov • H. Schütze • D. Cremers
Scene Graph Generation for Better Image Captioning?
Preprint (Sep. 2021). arXiv

[10]

A. Imani • M. J. Sabet • P. Dufter • M. Cysou • H. Schütze
ParCourE: A Parallel Corpus Explorer for a Massively Multilingual Corpus.
ACL 2021 - 59th Annual Meeting of the Association for Computational Linguistics. Bangkok, Thailand, Aug 01-06, 2021. DOI

[9]

P. Dufter • N. Kassner • H. Schütze
Static Embeddings as Efficient Knowledge Bases?
NAACL 2021 - Annual Conference of the North American Chapter of the Association for Computational Linguistics. Virtual, Jun 06-11, 2021. DOI

[8]

N. Kassner • P. Dufter • H. Schütze
Multilingual LAMA: Investigating Knowledge in Multilingual Pretrained Language Models.
EACL 2021 - 16th Conference of the European Chapter of the Association for Computational Linguistics. Virtual, Apr 19-23, 2021. DOI

2020

[7]

E. Asgari • M. J. Sabet • P. Dufter • C. Ringlstetter • H. Schütze
Subword Sampling for Low Resource Word Alignment.
Preprint (Dec. 2020). arXiv

[6]

N. Kassner • B. Krojer • H. Schütze
Are Pretrained Language Models Symbolic Reasoners over Knowledge?
CoNLL 2020 - 24th Conference on Computational Natural Language Learning. Virtual, Nov 19-20, 2020. DOI

[5]

N. Kassner • H. Schütze
BERT-kNN: Adding a kNN Search Component to Pretrained Language Models for Better QA.
Findings @EMNLP 2020 - Findings of the Conference on Empirical Methods in Natural Language Processing. Virtual, Nov 16-20, 2020. DOI

[4]

M. J. Sabet • P. Dufter • F. Yvon • H. Schütze
SimAlign: High Quality Word Alignments without Parallel Training Data using Static and Contextualized Embeddings.
Findings @EMNLP 2020 - Findings of the Conference on Empirical Methods in Natural Language Processing. Virtual, Nov 16-20, 2020. DOI

[3]

N. Kassner • H. Schütze
Negated and Misprimed Probes for Pretrained Language Models: Birds Can Talk, But Cannot Fly.
ACL 2020 - 58th Annual Meeting of the Association for Computational Linguistics. Virtual, Jul 05-10, 2020. DOI

[2]

A. Beyer • G. Kauermann • H. Schütze
Embedding Space Correlation as a Measure of Domain Similarity.
LREC 2020 - 12th International Conference on Language Resources and Evaluation. Marseille, France, May 13-15, 2020. URL

[1]

J. Jungmaier • N. Kassner • B. Roth
Dirichlet-Smoothed Word Embeddings for Low-Resource Settings.
LREC 2020 - 12th International Conference on Language Resources and Evaluation. Marseille, France, May 13-15, 2020. URL


2024-12-27 - Last modified: 2026-05-20