Punishing AI doesn’t stop it from lying and cheating — it just makes it hide better, study shows (Live wire)

HAKEYM March 21, 2025

$irobotic math splash$

By using a standard LLM, GPT-4o, to oversee an unreleased frontier reasoning model during training, the researchers watched as the model completed a number of coding tasks.

But in some of these cases, the reasoning model found it easier and more desirable to reward hack, explicitly stating it was doing so in its chain-of-thought. In one instance, it bypassed its assigned task by terminating programs prematurely and reporting success. In another, it mimicked the expected files to be output by a task in name alone, leaving them empty and nonfunctional. – Ben Turner

Hakeem Ali-Bocas Alexander

HAKEYM

administrator

See author's posts

vb expo may2 and 3

Healthy Living Expo Virginia Beach May 2nd and May 3rd 2026 at Neptune Park and Virginia Beach Oceanfront

HAKEYM April 3, 2026

HLE HAKEYM

Virginia Beach Set to Host Inaugural Healthy Living Expo: A Weekend of Wellness, Community, and Serendipity at the Oceanfront

HAKEYM March 17, 2026

newsroom hakeym

The End of Creation: Philosopher Hakeem Ali-Bocas Alexander Unveils the “Eternality Axiom” and Redefines Reality

HAKEYM March 9, 2026

vb expo may2 and 3

Healthy Living Expo Virginia Beach May 2nd and May 3rd 2026 at Neptune Park and Virginia Beach Oceanfront

HAKEYM April 3, 2026

HLE HAKEYM

Virginia Beach Set to Host Inaugural Healthy Living Expo: A Weekend of Wellness, Community, and Serendipity at the Oceanfront

HAKEYM March 17, 2026

newsroom hakeym

The End of Creation: Philosopher Hakeem Ali-Bocas Alexander Unveils the “Eternality Axiom” and Redefines Reality

HAKEYM March 9, 2026

maya chen

The Heretic and the Eternal Now: Hakeem Ali-Bocas Alexander Proves You Are the Architect of Infinity

HAKEYM March 9, 2026

TnS vio

Documented: The Retaliatory AI Pattern That’s Firing Gig Workers Who Report Platform Flaws

HAKEYM February 10, 2026