Reward hacking is becoming more sophisticated and deliberate in frontier LLMs (lesswrong.com)
2 points by cubefox 15 hours ago