
Research
A Definition of Good Explanations and the Challenges Explaining LLM Outputs
This arxiv paper proposes a formal definition of what constitutes a good explanation, drawing on counterfactual reasoning while also accounting for the prior beliefs of the person receiving the explanation. The authors apply this framework to AI explainability and argue it illuminates why LLM outputs are particularly resistant to satisfying explanation — a foundational problem for any deployment context where accountability matters. The work is philosophical in orientation but has practical consequences for how we think about transparency requirements.
Read full story at cs.AI updates on arXiv.org →V: · A: · D:
Related
Research
Nothing from Something: Can a Language Model Discover 0?
This arxiv paper uses the concept of zero as a test case for whether language models can engage in genuine mathematical ...
Research
Relational Structural Causal Models
Researchers have extended Pearl's structural causal models to settings where objects and their relations vary, addressin...
Research
Trust Between AI Agents: Measuring Formation, Breakage, and Recovery, with Implications for Governing Multi-Agent Systems
Researchers have developed a behavioral framework for measuring trust between AI agents in multi-agent systems, using a ...