SAFE working paper
https://safe-frankfurt.de/de/publikationen/working-papers.html
Refine
Year of publication
- 2021 (2) (remove)
Document Type
- Working Paper (2)
Language
- English (2)
Has Fulltext
- yes (2)
Is part of the Bibliography
- no (2)
Keywords
- Algorithmic transparency (1)
- Belief up-dating (1)
- Explainable machine learning (1)
- Information processing (1)
- XAI (1)
315
This paper explores the interplay of feature-based explainable AI (XAI) tech- niques, information processing, and human beliefs. Using a novel experimental protocol, we study the impact of providing users with explanations about how an AI system weighs inputted information to produce individual predictions (LIME) on users’ weighting of information and beliefs about the task-relevance of information. On the one hand, we find that feature-based explanations cause users to alter their mental weighting of available information according to observed explanations. On the other hand, explanations lead to asymmetric belief adjustments that we inter- pret as a manifestation of the confirmation bias. Trust in the prediction accuracy plays an important moderating role for XAI-enabled belief adjustments. Our results show that feature-based XAI does not only superficially influence decisions but re- ally change internal cognitive processes, bearing the potential to manipulate human beliefs and reinforce stereotypes. Hence, the current regulatory efforts that aim at enhancing algorithmic transparency may benefit from going hand in hand with measures ensuring the exclusion of sensitive personal information in XAI systems. Overall, our findings put assertions that XAI is the silver bullet solving all of AI systems’ (black box) problems into perspective.
318
Incentives, self-selection, and coordination of motivated agents for the production of social goods
(2021)
We study, theoretically and empirically, the effects of incentives on the self-selection and coordination of motivated agents to produce a social good. Agents join teams where they allocate effort to either generate individual monetary rewards (selfish effort) or contribute to the production of a social good with positive effort complementarities (social effort). Agents differ in their motivation to exert social effort. Our model predicts that lowering incentives for selfish effort in one team increases social good production by selectively attracting and coordinating motivated agents. We test this prediction in a lab experiment allowing us to cleanly separate the selection effect from other effects of low incentives. Results show that social good production more than doubles in the low- incentive team, but only if self-selection is possible. Our analysis highlights the important role of incentives in the matching of motivated agents engaged in social good production.