In brief
- A new study finds that adding a line about a mental health condition changes how AI agents respond.
- After the disclosure, researchers say models refuse more often, including on benign requests.
- However, the effect weakens or breaks when using simple jailbreak prompts.
Telling an AI…
Read Full Article at Source