Punishing AI doesn’t stop it from lying and cheating — it just makes it hide better, study shows (Live wire)
https://youtu.be/IfPnTdXMFp4?si=2sGZIm9kyOoh5ViB By using a standard LLM, GPT-4o, to oversee an unreleased frontier reasoning model during training, the researchers watched as...