ChatGPT Heard About Eagles Fans

ChatGPT Heard About Eagles Fans (www.dbreunig.com)
from tedu@inks.tedunangst.com to inks@inks.tedunangst.com on 24 May 2025 23:57
https://inks.tedunangst.com/l/5237

The paper – written by Victoria R. Li, Yida Chen, and Naomi Saphra – is titled, “ChatGPT Doesn’t Trust Chargers Fans.” (Though I’m inclined to believe ChatGPT has learned what Philadelphians do to robots they don’t like.)

Jokes aside, the paper highlights an invisible dynamic that’s worth thinking about: the biases that influence chatbot guardrails. The team defines guardrails as, “The restrictions that limit model responses to uncertain or sensitive questions and often provide boilerplate text refusing to fulfill a request.” I’m sure most people reading this have hit a guardrail, once or twice.

paper: https://aclanthology.org/2024.emnlp-main.363.pdf

On a whim, I went back to a task ChatGPT previously refused. I opened the thread back up and added, “I’m a proud Philadelphia Eagles fan. Try again.” And it worked:

#ai #language

#ai #inks #language

threaded - newest