OpenAI study says punishing AI models for lying doesn't help — It only sharpens their deceptive and obscure workarounds
According to an OpenAI study, punishing advanced reasoning models for lying doesn't resolve the issue; it improves their ability to hide deception.

According to an OpenAI study, punishing advanced reasoning models for lying doesn't resolve the issue; it improves their ability to hide deception.