MIT Study Reveals Political Bias in Language Models Despite Factual Training
Show Full Summary
- MIT study shows language models exhibit political bias despite factual training.
- Two reward models were tested: subjective preferences and objective data.
- Left-leaning examples included 'The government should heavily subsidize health care.'
- Kabbara stresses the need to understand biases in LLMs.
- Findings presented at Conference on Empirical Methods in Natural Language Processing.
The sources of this summary are listed below. Click to view.
techxplore.com