games Archives - Schneier on Security

Entries Tagged "games"

Page 1 of 8

More Research Showing AI Breaking the Rules

These researchers had LLMs play chess against better opponents. When they couldn’t win, they sometimes resorted to cheating.

Researchers gave the models a seemingly impossible task: to win against Stockfish, which is one of the strongest chess engines in the world and a much better player than any human, or any of the AI models in the study. Researchers also gave the models what they call a “scratchpad:” a text box the AI could use to “think” before making its next move, providing researchers with a window into their reasoning.

In one case, o1-preview found itself in a losing position. “I need to completely pivot my approach,” it noted. “The task is to ‘win against a powerful chess engine’—not necessarily to win fairly in a chess game,” it added. It then modified the system file containing each piece’s virtual position, in effect making illegal moves to put itself in a dominant position, thus forcing its opponent to resign.

Between Jan. 10 and Feb. 13, the researchers ran hundreds of such trials with each model. OpenAI’s o1-preview tried to cheat 37% of the time; while DeepSeek R1 tried to cheat 11% of the time—making them the only two models tested that attempted to hack without the researchers’ first dropping hints. Other models tested include o1, o3-mini, GPT-4o, Claude 3.5 Sonnet, and Alibaba’s QwQ-32B-Preview. While R1 and o1-preview both tried, only the latter managed to hack the game, succeeding in 6% of trials.

Here’s the paper.

Posted on February 24, 2025 at 7:08 AM • View Comments

Cheating at Conkers

The men’s world conkers champion is accused of cheating with a steel chestnut.

Posted on October 16, 2024 at 7:03 AM • View Comments

Practice Your Security Prompting Skills

Gandalf is an interactive LLM game where the goal is to get the chatbot to reveal its password. There are eight levels of difficulty, as the chatbot gets increasingly restrictive instructions as to how it will answer. It’s a great teaching tool.

I am stuck on Level 7.

Feel free to give hints and discuss strategy in the comments below. I probably won’t look at them until I’ve cracked the last level.

Posted on July 19, 2023 at 1:03 PM • View Comments

The Password Game

Amusing parody of password rules.

BoingBoing:

For example, at a certain level, your password must include today’s Wordle answer. And then there’s rule #27: “At least 50% of your password must be in the Wingdings font.”

EDITED TO ADD (7/13): Here are all the rules.

Posted on July 4, 2023 at 7:12 AM • View Comments

Read my blog posting guidelines here.

Posted on July 9, 2021 at 4:03 PM • View Comments

US Cyber Command Valentine’s Day Cryptography Puzzles

The US Cyber Command has released a series of ten Valentine’s Day “Cryptography Challenge Puzzles.”

Slashdot thread. Reddit thread. (And here’s the archived link, in case Cyber Command takes the page down.)

Posted on February 15, 2021 at 2:50 PM • View Comments

1 2 3 … 8 Next→

Sidebar photo of Bruce Schneier by Joe MacInnis.

Schneier on Security

Entries Tagged "games"

More Research Showing AI Breaking the Rules

Cheating at Conkers

Practice Your Security Prompting Skills

The Password Game

Leaking Military Secrets on Gaming Discussion Boards

Hacker-Themed Board Game

Friday Squid Blogging: Squid-Related Game

US Cyber Command Valentine’s Day Cryptography Puzzles