Phil Galfond Explains GTO: From Candy in Hand to Exploits Against Regs

Article cover

Level 1: Candy in Hand and the Power of Randomization

The basic idea of GTO is simple: play in a way that prevents your opponent from exploiting you. Galfond illustrates this with a child's example – he hides a candy in one hand, and the child must guess which hand it’s in. If he always hides it in the right hand, the opponent would quickly catch on. If he alternates regularly, a pattern might still emerge.

The solution is randomization. Coin flipping. If your strategy involves randomness, your opponent has no way to gain an advantage. This is the heart of GTO – a balanced strategy that cannot be systematically defeated.

Level 2: Rock, Paper, Scissors

At the teenage level, it’s about a known concept from the game rock, paper, scissors. If a player favors a certain pattern, it can be read. The only unbeatable strategy is randomizing between all options. In poker, this means balancing value bets and bluffs. It’s not about winning every hand. It’s about preventing the opponent from finding a pattern that can be exploited long-term.

Level 3: Ace-King-Queen and River Balance

At the college level, Galfond uses a simplified model – the so-called Ace-King-Queen. If a player goes all-in with aces, kings, and a certain percentage of queens as bluffs, they must set the value-to-bluff ratio correctly according to the pot odds. The opponent must defend (call) at the correct frequency to avoid being exploitable. The key concept is Minimum Defense Frequency - if a player defends too little, the opponent can bluff profitably more often. If they defend too often, value hands collect the maximum.

Level 4: Range Construction and Board Coverage

At the level of a regular, things get more complicated. It's not just about the river, but about constructing a range from the flop to the river. Galfond points out a common mistake – playing individual streets in isolation. If the flop range is poorly balanced, the turn and river will automatically be unbalanced too.

An important concept is board coverage. On the flop, you need to have combinations in your betting range that allow you to represent strength on future cards – flush turns, straight cards, or paired boards. Therefore, small sizing is often used on the flop to maintain a wider range of combinations. On the turn and river, the sizings widen, decisions become more polarized. Again, frequencies must be balanced.

Level 5: Stop Playing Like a Solver, Start Thinking Like One

The strongest part comes at the expert level. Galfond claims that trying to "play like a solver" is a mistake. Memorizing simulation outputs leads to mental overload and, paradoxically, to exploitation. It's impossible to perfectly reproduce solver strategies. If a player tries too hard to mimic GTO without understanding, gaps will appear in their range construction that a keen opponent will notice.

The solution? Learn concepts. Understand the "why" behind decisions. Create a simplified model that conserves mental capacity. Use that capacity for exploitative play. According to Galfond, it's in exploitative play where the chips are earned. If you know an opponent bluffs 60% when they should 30%, or doesn't bluff at all, that's where the edge lies.

GTO as a Foundation, Not a Goal

GTO isn’t about perfection - it's about being unexploitable. But the best players don’t stop at theoretical equilibrium. They use it as a foundation from which to launch into exploits. Through this five-level explanation, Galfond shows that even the most complex theory has a simple core: balance, frequencies, and understanding the logic behind decisions. And it’s the ability to think like a solver – not play like one – that distinguishes a good regular from an elite professional.

 

Sources – YouTube, PokerNews, PhilGalfond.com