Proceedings of the National Academy of Sciences, Volume 122, Issue 8, February 2025.
Significance: Modern large language models (LLMs) are designed to align with human values. They can appear unbiased on standard benchmarks, but we find that they still show widespread stereotype biases on two psychology-inspired measures…