Research

A Definition of Good Explanations and the Challenges Explaining LLM Outputs

This arXiv paper proposes a formal definition of what constitutes a good explanation, drawing on counterfactual reasoning while also accounting for the prior beliefs of the person receiving the explanation. The authors apply this framework to AI explainability and argue that it shows why LLM outputs are particularly hard to explain satisfactorily, a foundational problem for any deployment context where accountability matters. The work is philosophical in orientation but has practical consequences for how transparency requirements are framed.

Read full story at cs.AI updates on arXiv.org →

Research

Reinforcement Learning Towards Broadly and Persistently Beneficial Models

Researchers have published findings suggesting that reinforcement learning on carefully constructed datasets of benefici...

Research

Commemorating 70 Years of Artificial Intelligence

IEEE Spectrum marks seventy years since the Dartmouth workshop formally named artificial intelligence as a field, offeri...

Research

Diffusion Language Models: An Experimental Analysis

Researchers present a systematic evaluation of eight diffusion language models across eight benchmarks covering reasonin...