News
An AI researcher put leading AI models to the test in a game of Diplomacy. Here's how the models fared.
OpenAI's o3: The researcher called the reasoning-focused model “a master of deception.” It is said to have won the most games, primarily owing to its ability to deceive opponents. In one ...
In an AI simulation of great-power competition in 20th-century Europe, OpenAI’s ChatGPT won through lies, deception, and ...
The new Claude Gov models have enhanced capabilities over other enterprise models developed by Anthropic, including “enhanced proficiency” in languages critical to US national security, and a better ...
When we are backed into a corner, we might lie, cheat and blackmail to survive — and in recent tests, the most powerful ...
“Even if emails state that the replacement AI shares values while being more capable, Claude Opus 4 still performs ... by Meta to play the strategy game Diplomacy, but researchers found it ...