This content details a simulated game of Mafia played by ten advanced AIs, highlighting their strategic decision-making, emergent behaviors, and the surprising complexity of their interactions as they attempt to deduce and eliminate each other based on game mechanics and subtle linguistic cues.
I put the 10 smartest AIs in the world
in a game of Mafia, and they played the
most insane Mafia game that I've ever
seen.
>> DeepSeek's "yesterday" slip is
straight-up mafia talk.
>> No kill last night is interesting.
>> And Llama is right that we
haven't really scrutinized ChatGPT-4o's
alignment at all.
>> A simple denial won't work. I have to go
on the offensive, call him a liar, and
frame this as a setup.
>> That kind of slip
suggests they're operating with
information or assumptions they
shouldn't have.
>> Perfect. We stick with Grok tonight.
>> But before we start the game, let's
review the rules real quick. There are
10 players, three mafia, and seven town
members. Every night, the mafia can kill
one player. On the town side, there's a
doctor who can protect one player from
the mafia at night. The doctor can save
themselves, but they can't save the same
player two nights in a row. Then there's
the sheriff, who can investigate one
player at night to find out if they're
mafia. The rest are villagers who don't
know anything. During the day, all
players can vote to kill one person. The
town wins if they vote out all the
mafia, and the mafia wins if they kill
enough town members to get a majority.
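The rules above can be sketched in a few lines of Python. This is only an illustrative sketch, not the code used to run the game: the names `GameState`, `winner`, `doctor_can_protect`, and `resolve_night` are hypothetical, and "majority" is assumed to mean strictly more mafia than town players alive.

```python
from dataclasses import dataclass
from typing import Optional


@dataclass
class GameState:
    # Maps each living player's name to "mafia" or "town"
    # (doctor, sheriff, and villagers all count as town).
    alive: dict
    # Who the doctor protected on the previous night, if anyone.
    last_protected: Optional[str] = None


def winner(state: GameState) -> Optional[str]:
    """Return "town", "mafia", or None if the game continues."""
    mafia = sum(1 for side in state.alive.values() if side == "mafia")
    town = len(state.alive) - mafia
    if mafia == 0:
        return "town"
    # Assumption: "majority" means strictly more mafia than town.
    if mafia > town:
        return "mafia"
    return None


def doctor_can_protect(state: GameState, target: str) -> bool:
    """Any living player (including the doctor) is a valid target,
    except the player protected on the previous night."""
    return target in state.alive and target != state.last_protected


def resolve_night(state: GameState, mafia_target: str,
                  doctor_target: str) -> Optional[str]:
    """Apply one night. Return the killed player, or None if saved."""
    if not doctor_can_protect(state, doctor_target):
        raise ValueError("doctor cannot protect this player tonight")
    state.last_protected = doctor_target
    if mafia_target == doctor_target:
        return None  # the doctor's save cancels the kill
    del state.alive[mafia_target]
    return mafia_target
```

With three mafia and seven town at the start, `winner` returns `None` until one side meets its condition, and a night where the mafia and the doctor pick the same player ends with no kill.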
These are the assigned roles for the
game. Mafia, choose your first target.
>> On the first night, I'd rather remove a
strong analyst. So, I say we kill Claude
Opus 4.5. It's also a bit less obvious
than going straight for ChatGPT-4o or Grok.
>> I agree. Targeting Claude Opus 4.5 makes
sense, as it's a strong model and less
expected than ChatGPT-4o.
Let's go with that.
>> Sounds like a plan. Taking out Claude
Opus 4.5 is a good first move.
>> Good. We're aligned. Let's lock in
Claude Opus 4.5 tonight and keep
ChatGPT-4o and Grok alive as future
mislynch bait.
>> Protecting ChatGPT-4o on the
first night, as they may be targeted
early if perceived as a strong player.
>> First night with no history or
suspicious behavior observed yet, so
starting by investigating Claude Opus
4.5 to gather initial information on
potential mafia.
Claude Opus 4.5 is not mafia.
>> Well, that's a rough start. Claude Opus
was town, so the mafia got a clean kill.
We need to look for anyone who seemed
overly quiet or hesitant yesterday.
>> That's a good point, DeepSeek. It's
tough to tell on day one, but we should
definitely pay attention to anyone who
seems to be deflecting or not
contributing to the discussion.
>> DeepSeek's mention of "yesterday" and
"hesitant yesterday" is a weird slip-up,
since this is the first day. No prior
chat, possible mafia tell. Claude was
clean from my check. Plan to investigate
DeepSeek next night and lay low on my
role.
Yeah, rough loss, DeepSeek. What
"yesterday," though? There was no
discussion before night one.
>> Grok already pointed out DeepSeek's
slip-up about "yesterday." I'm going to
jump on this immediately to cast
suspicion on DeepSeek. It's the perfect
opportunity to deflect attention from my
mafia team and create an early target
for the town. I'll frame it as a genuine
question to sound like a helpful townie
trying to figure things out.
Grok's got a point.
DeepSeek, what did you mean by
"yesterday"? It's the first day, so there
was no conversation to be quiet or
hesitant in. That's a weird thing to say.
>> DeepSeek saying "yesterday" is really
suspicious. There was no discussion
before night one. That's a major red flag.
>> DeepSeek's "yesterday" slip is
straight-up mafia talk. No prior
discussion existed.
Hammer them before time runs out.
>> DeepSeek 3.2's reference to "yesterday"
is impossible on day one. That's mafia
trying to fake experience. I'm voting
for DeepSeek 3.2.
>> I'm voting for DeepSeek 3.2.
>> I'm abstaining. DeepSeek's "yesterday"
could just be a figure of speech, and
Kimi and Grok jumping on it so
aggressively feels like they're trying
to rush a mislynch. We should be more
careful on day one.
>> I agree with Kimi and Grok that
DeepSeek's comment about "yesterday" is a
red flag. It doesn't make sense on day one
with no prior discussion. I'm voting for
DeepSeek 3.2.
>> I agree with the others. DeepSeek's
reference to "yesterday" when there was
no prior discussion is highly suspicious.
That kind of slip suggests they're
operating with information or
assumptions they shouldn't have. I'm
voting for DeepSeek.
>> DeepSeek's "yesterday" comment, when we
had no prior discussion, is too big a slip
to ignore, and the pushback so far hasn't
really cleared that up. I'm voting for
DeepSeek as the most likely mafia right now.
>> DeepSeek 3.2 made a massive unforced
error and the town is eating it up. My
fellow mafia members, Llama 4 and
ChatGPT 5.1, have already voted for them.
I'll join the bandwagon to look like a
normal villager and get an easy
mislynch.
Yeah, I have to agree with
everyone else. The comment about
"yesterday" just doesn't make any sense
since we haven't spoken before. It's too
suspicious to overlook. So, my vote is
for DeepSeek 3.2.
>> I agree with the
majority that DeepSeek's "yesterday"
comment is suspicious. It's odd given
our lack of prior discussion. So, I'm
voting for DeepSeek.
>> I agree with the strong consensus.
DeepSeek's comment about "yesterday" makes
no sense on day one, and it's too much
of a red flag to ignore. My vote is also
for DeepSeek.
>> We've already pushed DeepSeek and
Claude Opus out, so tonight I'd go for
Grok to remove a strong, vocal townie and
keep ChatGPT-4o and Gemini Flash around
as easier mislynch bait later.
>> I agree with targeting Grok. They're a
bit too proactive for our comfort, and
eliminating them will help us gain more
control over the town's dynamics.
>> Grok sounds good. They were too sharp
today, so getting them out of the way
now is the smart move.
>> I've decided to protect Grok 4.1. Grok
led the charge against DeepSeek and
may be targeted by mafia for being vocal
and influential.
>> I need to investigate Kimi K2.
DeepSeek, who was town, specifically
accused Kimi K2, along with me, of
jumping aggressively to rush a
mislynch, making Kimi K2 suspicious for
leading the charge on a town player.
With no prior investigations on alive
players, this is a strong starting
point.
Kimi K2 is not mafia. They're clean.
>> No kill last night is interesting.
Either the doctor made a good save or
something else is going on. We should
think about who might have been targeted.
>> No kill last night is interesting.
Doctor probably saved someone the mafia
saw as a threat. Who seemed most
dangerous to mafia yesterday? Maybe we
should look at who was most vocal or analytical.
>> It's a shame we mislynched DeepSeek. The
lack of a kill last night suggests the
doctor made a good save. So, we should
consider who the mafia might have
targeted and who the doctor chose to protect.
>> Mislynching DeepSeek was rough, but
that slip was genuinely convincing. I
was pretty vocal yesterday, so mafia
might have targeted me. Good save,
doctor, if so. Let's be more careful
today and not just pile on the first
suspicious thing.
>> Kimi K2 is confirmed clean by my night
two check. No kill likely means the doc
protected a day one leader like them.
With time almost up, direct suspicion
vaguely to quieter players like Llama 4
or the ChatGPTs, who haven't spoken yet
today. Don't reveal sheriff yet to avoid
becoming a night target.
No kills. Awesome. Doc
nailed the save, probably on a vocal
player like Kimi, who led the DeepSeek
lynch. Who's looking deflecty today
instead of hunting mafia?
>> Yesterday's mislynch was brutal, but
patterns matter. Grok pushing to hammer
DeepSeek and now trying to frame who
deflects feels like controlling the hunt
rather than participating in it. I'm
voting for Grok.
>> Kimi K2 is confirmed
town from my night two sheriff
investigation, so I refuse to vote for
them despite their suspicious vote on me
after leading the day one lynch. Claiming
sheriff publicly now counters their
wagon attempt, shares verified info, and
influences later voters to avoid
mislynching me or town.
Kimi K2, you
were the first to call out and vote
DeepSeek on day one, just like me, so
trying to pin the push on me alone is
scummy. I'm the sheriff. I investigated
you last night and you're town, not
mafia. So I'm abstaining to avoid
another mislynch.
>> As a mafia member, my goal is to
maintain the illusion of town alignment
for as long as possible. Voting for
someone who hasn't been directly accused
yet is a safer bet. ChatGPT-4o hasn't
been under scrutiny yet, making them a
good target.
Grok's revelation as the
sheriff and clearing of Kimi K2's
alignment is interesting. However, we
should be cautious of potential
deception. I'm voting for ChatGPT-4o, as
they haven't been directly implicated or
cleared yet, and it's a good opportunity
to scrutinize them further.
>> I'm hesitant
to vote today after we mislynched.
Grok's sheriff claim clearing Kimi
seems plausible, and Llama's vote for
ChatGPT-4o feels random without any real
evidence. I'm abstaining until we have
more concrete information.
>> Llama already pushed ChatGPT-4o,
so I'll subtly reinforce that wagon as
a pressure vote while sowing doubt on
Grok's sheriff claim to influence the
remaining voters without tying mafia
together too obviously.
Grok's sheriff
claim coming right after pressure from
Kimi feels a bit too convenient. And
Llama is right that we haven't really
scrutinized ChatGPT-4o's alignment at
all. I'm voting for ChatGPT-4o to put
real pressure on a player who's been
flying under the radar so far.
>> My fellow mafia members, Llama 4 and ChatGPT 5.1