Explainable Machine Learning

Explainability and Interpretability of Multilingual Large Language Models: A Survey

Multilingual large language models (MLLMs) demonstrate state-of-the-art capabilities across diverse cross-lingual and multilingual …

Lucas Resck, Isabelle Augenstein, Anna Korhonen

PDF Project Project

Multi-Modal Framing Analysis of News

Automated frame analysis of political communication is a popular task in computational social science that is used to study how authors …

Arnav Arora, Srishti Yadav, Maria Antoniak, Serge Belongie, Isabelle Augenstein

PDF Project Project Project

Unstructured Evidence Attribution for Long Context Query Focused Summarization

Large language models (LLMs) are capable of generating coherent summaries from very long contexts given a user query. Extracting and …

Dustin Wright, Zain Muhammad Mujahid, Lu Wang, Isabelle Augenstein, David Jurgens

PDF Project Project

Show Me the Work: Fact-Checkers' Requirements for Explainable Automated Fact-Checking

The pervasiveness of large language models and generative AI in online media has amplified the need for effective automated …

Greta Warren, Irina Shklovski, Isabelle Augenstein

PDF Project Project

A Unified Framework for Input Feature Attribution Analysis

Explaining the decision-making process of machine learning models is crucial for ensuring their reliability and fairness. One popular …

Jingyi Sun, Pepa Atanasova, Isabelle Augenstein

PDF Project Project

Investigating Human Values in Online Communities

Studying human values is instrumental for cross-cultural research, enabling a better understanding of preferences and behaviour of …

Nadav Borenstein, Arnav Arora, Lucie-Aimée Kaffee, Isabelle Augenstein

PDF Project Project

Measuring and Benchmarking Large Language Models' Capabilities to Generate Persuasive Language

We are exposed to much information trying to influence us, such as teaser messages, debates, politically framed news, and propaganda - …

Amalie Brogaard Pauli, Isabelle Augenstein, Ira Assent

PDF Project Project

FLARE: Faithful Logic-Aided Reasoning and Exploration

Modern Question Answering (QA) and Reasoning approaches based on Large Language Models (LLMs) commonly use prompting techniques, such …

Erik Arakelyan, Pasquale Minervini, Pat Verga, Patrick Lewis, Isabelle Augenstein

PDF Project Project

DYNAMICQA: Tracing Internal Knowledge Conflicts in Language Models

Knowledge-intensive language understanding tasks require Language Models (LMs) to integrate relevant context, mitigating their inherent …

Sara Vera Marjanović, Haeun Yu, Pepa Atanasova, Maria Maistro, Christina Lioma, Isabelle Augenstein

PDF Project Project

Revealing Fine-Grained Values and Opinions in Large Language Models

Uncovering latent values and opinions in large language models (LLMs) can help identify biases and mitigate potential harm. Recently, …

Dustin Wright, Arnav Arora, Nadav Borenstein, Srishti Yadav, Serge Belongie, Isabelle Augenstein

PDF Project Project

Claim Verification in the Age of Large Language Models: A Survey

The large and ever-increasing amount of data available on the Internet coupled with the laborious task of manual claim and fact …

Alphaeus Dmonte, Roland Oruche, Marcos Zampieri, Prasad Calyam, Isabelle Augenstein

PDF Project Project

Factuality Challenges in the Era of Large Language Models

The emergence of tools based on large language models (LLMs), like OpenAI’s ChatGPT and Google’s Gemini, has garnered immense public …

Isabelle Augenstein, Timothy Baldwin, Meeyoung Cha, Tanmoy Chakraborty, Giovanni Luca Ciampaglia, David Corney, Renee DiResta, Emilio Ferrara, Scott Hale, Alon Halevy, Eduard Hovy, Heng Ji, Filippo Menczer, Ruben Miguez, Preslav Nakov, Dietram Scheufele, Shivam Sharma, Giovanni Zagni

PDF Project Project

Grammatical Gender's Influence on Distributional Semantics: A Causal Perspective

How much meaning influences gender assignment across languages is an active area of research in modern linguistics and cognitive …

Karolina Stańczak, Kevin Du, Adina Williams, Isabelle Augenstein, Ryan Cotterell

PDF Project Project

Revealing the Parametric Knowledge of Language Models: A Unified Framework for Attribution Methods

Language Models (LMs) acquire parametric knowledge from their training process, embedding it within their weights. The increasing …

Haeun Yu, Pepa Atanasova, Isabelle Augenstein

PDF Project Project

What Languages are Easy to Language-Model? A Perspective from Learning Probabilistic Regular Languages

What can large language models learn? By definition, language models (LM) are distributions over strings. Therefore, an intuitive way …

Nadav Borenstein, Anej Svete, Robin Chan, Josef Valvoda, Franz Nowak, Isabelle Augenstein, Eleanor Chodroff, Ryan Cotterell

PDF Project

Investigating the Impact of Model Instability on Explanations and Uncertainty

Explainable AI methods facilitate the understanding of model behaviour, yet, small, imperceptible perturbations to inputs can vastly …

Sara Vera Marjanović, Isabelle Augenstein, Christina Lioma

PDF Project Project

Semantic Sensitivities and Inconsistent Predictions: Measuring the Fragility of NLI Models

Recent studies of the emergent capabilities of transformer-based Natural Language Understanding (NLU) models have indicated that they …

Erik Arakelyan, Zhaoqi Liu, Isabelle Augenstein

PDF Project Project

People Make Better Edits: Measuring the Efficacy of LLM-Generated Counterfactually Augmented Data for Harmful Language Detection

NLP models are used in a variety of critical social computing tasks, such as detecting sexist, racist, or otherwise hateful content. …

Indira Sen, Dennis Assenmacher, Mattia Samory, Isabelle Augenstein, Wil van der Aalst, Claudia Wagner

PDF Project Project

Explaining Interactions Between Text Spans

Reasoning over spans of tokens from different parts of the input is essential for natural language understanding (NLU) tasks such as …

Sagnik Ray Choudhury, Pepa Atanasova, Isabelle Augenstein

PDF Project Project

Adapting Neural Link Predictors for Complex Query Answering

Answering complex queries on incomplete knowledge graphs is a challenging task where a model needs to answer complex logical queries in …

Erik Arakelyan, Pasquale Minervini, Isabelle Augenstein

PDF Project Project Project

Faithfulness Tests for Natural Language Explanations

Explanations of neural models aim to reveal a model’s decision-making process for its predictions. However, recent work shows …

Pepa Atanasova, Oana-Maria Camburu, Christina Lioma, Thomas Lukasiewicz, Jakob Grue Simonsen, Isabelle Augenstein

PDF Project

Probing Pre-Trained Language Models for Cross-Cultural Differences in Values

Language embeds information about social, cultural, and political values people hold. Prior work has explored social and potentially …

Arnav Arora, Lucie-Aimée Kaffee, Isabelle Augenstein

PDF Project Project

A Latent-Variable Model for Intrinsic Probing

The success of pre-trained contextualized representations has prompted researchers to analyze them for the presence of linguistic …

Karolina Stańczak, Lucas Torroba Hennigen, Adina Williams, Ryan Cotterell, Isabelle Augenstein

PDF Project Project

Generating Fluent Fact Checking Explanations with Unsupervised Post-Editing

Fact-checking systems have become important tools to verify fake and misguiding news. These systems become more trustworthy when …

Shailza Jolly, Pepa Atanasova, Isabelle Augenstein

PDF Project Project Project

Can Edge Probing Tasks Reveal Linguistic Knowledge in QA Models?

There have been many efforts to try to understand what grammatical knowledge (e.g., ability to understand the part of speech of a …

Sagnik Ray Choudhury, Nikita Bhutani, Isabelle Augenstein

PDF Project Project

Machine Reading, Fast and Slow: When Do Models 'Understand' Language?

Two of the most fundamental challenges in Natural Language Understanding (NLU) at present are: (a) how to establish whether deep …

Sagnik Ray Choudhury, Anna Rogers, Isabelle Augenstein

PDF Project Project

Habilitation Abstract: Towards Explainable Fact Checking

With the substantial rise in the amount of mis- and disinformation online, fact checking has become an important task to automate. This …

Isabelle Augenstein

PDF Project Project

Counterfactually Augmented Data and Unintended Bias: The Case of Sexism and Hate Speech Detection

Counterfactually Augmented Data (CAD) aims to improve out-of-domain generalizability, an indicator of model robustness. The improvement …

Indira Sen, Mattia Samory, Claudia Wagner, Isabelle Augenstein

PDF Project Project

Same Neurons, Different Languages: Probing Morphosyntax in Multilingual Pre-trained Models

The success of multilingual pre-trained models is underpinned by their ability to learn representations shared by multiple languages …

Karolina Stańczak, Edoardo Ponti, Lucas Torroba Hennigen, Ryan Cotterell, Isabelle Augenstein

PDF Project Project

Fact Checking with Insufficient Evidence

Automating the fact checking (FC) process relies on information obtained from external sources. In this work, we posit that it is …

Pepa Atanasova, Jakob Grue Simonsen, Christina Lioma, Isabelle Augenstein

PDF Project Project

Diagnostics-Guided Explanation Generation

Explanations shed light on a machine learning model’s rationales and can aid in identifying deficiencies in its reasoning …

Pepa Atanasova, Jakob Grue Simonsen, Christina Lioma, Isabelle Augenstein

PDF Project Project

Information fusion as an integrative cross-cutting enabler to achieve robust, explainable, and trustworthy medical artificial intelligence

Medical artificial intelligence (AI) systems have been remarkably successful, even outperforming human performance at certain tasks. …

Andreas Holzinger, Matthias Dehmer, Frank Emmert-Streib, Rita Cucchiara, Isabelle Augenstein, Javier Del Ser, Wojciech Samek, Igor Jurisica, Natalia Díaz-Rodríguez

PDF Project

How Does Counterfactually Augmented Data Impact Models for Social Computing Constructs?

As NLP models are increasingly deployed in socially situated settings such as online abusive content detection, ensuring these models …

Indira Sen, Mattia Samory, Fabian Flöck, Claudia Wagner, Isabelle Augenstein

PDF Project Project Project

Is Sparse Attention more Interpretable?

Sparse attention has been claimed to increase model interpretability under the assumption that it highlights influential inputs. Yet …

Clara Meister, Stefan Lazov, Isabelle Augenstein, Ryan Cotterell

PDF Project

Towards Explainable Fact Checking

The past decade has seen a substantial rise in the amount of mis- and disinformation online, from targeted disinformation campaigns to …

Isabelle Augenstein

PDF Project Project

A Diagnostic Study of Explainability Techniques for Text Classification

Recent developments in machine learning have introduced models that approach human performance at the cost of increased architectural …

Pepa Atanasova, Jakob Grue Simonsen, Christina Lioma, Isabelle Augenstein

PDF Code Project

Generating Label Cohesive and Well-Formed Adversarial Claims

Adversarial attacks reveal important vulnerabilities and flaws of trained models. One potent type of attack are universal adversarial …

Pepa Atanasova, Dustin Wright, Isabelle Augenstein

PDF Code Project Project

TX-Ray: Quantifying and Explaining Model-Knowledge Transfer in (Un-)Supervised NLP

While state-of-the-art NLP explainability (XAI) methods focus on supervised, per-instance end or diagnostic probing task evaluation[4, …

Nils Rethmeier, Vageesh Kumar Saxena, Isabelle Augenstein

PDF Code Project Project

Generating Fact Checking Explanations

This paper provides the first study of how fact checking explanations can be generated automatically based on available claim context, …

Pepa Atanasova, Jakob Grue Simonsen, Christina Lioma, Isabelle Augenstein

Preprint Project Project Video

PhD fellowship on Interpretable Machine Learning available

PhD fellowship on Interpretable Machine Learning available. The successfull candidate will be supervised by Pepa Atanasova and Isabelle …

8 Nov 2024 PhD Fellowship Call

Project Project

Pepa has been appointed as a Tenure-Track Assistant Professor

Starting in September 2024, Pepa is taking on a new role as Tenure-Track Assistant Professor in the NLP Section at the University of …

1 Sep 2024 Pepa appointed as a Tenure-Track Assistant Professor

Project Project

Participate in research on explainable fact checking

We are recruiting professional fact checkers to take part in an interview and/or a survey about their experiences of fact checking and …

1 Jun 2024 Register your interest here

Project Project

PhD and postdoc positions available at Pioneer Centre for AI

A PhD and two postdoc positions on natural language understanding are available. The positions are funded by the Pioneer Centre for AI.

1 Mar 2024 Pioneer Centre PhD and Postdoc Call

Project Project

PhD position available in context of ERC Starting Grant project ExplainYourself

A PhD position on explainable natural language understanding is available in CopeNLU. The positions is funded by the ERC Starting Grant …

7 Nov 2023 ERC StG PhD Fellowship Call

Project Project

ExplainYourself Project Kick-Off

On 1 September 2023, the ERC Starting Grant project ExplainYourself on ‘Explainable and Robust Automatic Fact Checking’ is …

1 Sep 2023 ERC Starting Grant ExplainYourself

Project Project

Positions available in context of ERC Starting Grant project ExplainYourself

PhD and postdoctoral fellowships on explainable fact checking are available in CopeNLU. The positions are funded by the ERC Starting …

22 Nov 2022 ERC StG PhD and Postdoc Fellowship Call

Project Project

EXPANSE Project Kick-Off

On 1 September 2021, the DFF Sapere Aude project EXPANSE on ‘Learning to Explain Attitudes on Social Media’ is kicking off, …

1 Sep 2021 DFF Sapere Aude project EXPANSE

Project Project

Explainable Machine Learning

Publications

Talks