If Your Math Agent Can't Check Itself, It's a Poet

Most math “reasoning” demos are storytelling.

They sound convincing. They look sophisticated.

And then you do the one thing Olympiads—and reality—require:

you check the result.

If your math agent can’t do exact arithmetic, validate constraints, and run sanity checks…

it’s not a solver.

It’s a poet wearing a lab coat.
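Here is a minimal sketch of those three habits in Python: exact arithmetic, constraint validation, and a sanity check. The toy problem (positive integers x <= y with x + y = 10 and x * y = 21) and the helper names `exact_sum` and `check_candidate` are assumptions made up for illustration, not part of any particular agent.

```python
from fractions import Fraction


def exact_sum(terms):
    """Add decimal strings as exact rationals instead of binary floats."""
    return sum((Fraction(t) for t in terms), Fraction(0))


def check_candidate(x, y):
    """Validate a claimed answer to the hypothetical toy problem:
    positive integers x <= y with x + y = 10 and x * y = 21.
    Returns one named pass/fail result per constraint."""
    return {
        "is_integer": isinstance(x, int) and isinstance(y, int),
        "positive": x > 0 and y > 0,
        "ordered": x <= y,
        "sum": x + y == 10,
        "product": x * y == 21,
    }


# Sanity check: floats drift, exact rationals do not.
print(0.1 + 0.2 == 0.3)                               # False
print(exact_sum(["0.1", "0.2"]) == Fraction("0.3"))   # True

# A wrong candidate fails a named constraint instead of "sounding right".
print(check_candidate(3, 7))
print(check_candidate(4, 6))
```

Each check returns a named result, so a failed answer tells you which constraint broke rather than leaving you to reread the explanation.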

The “reasoning” failure mode isn’t that models can’t think.

It’s that they produce un-auditable chains with no witnesses:

Humans get fooled by long explanations. Math doesn’t care.

Not just plausible.

In practice, that means:

If you want reliable math, build a Solver.

Use this every time:

This turns “reasoning” into something you can audit.
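One way to picture a witness: the answer ships with a small piece of evidence that anyone can verify without trusting the explanation. The sketch below assumes two toy claims chosen for illustration ("n is composite" and "g is the gcd of a and b"). The function names are hypothetical.

```python
def verify_composite(n, witness):
    """Accept 'n is composite' only if the witness is a nontrivial divisor of n."""
    return 1 < witness < n and n % witness == 0


def verify_gcd(a, b, g, s, t):
    """Accept 'g = gcd(a, b)' only with a Bezout certificate (s, t).

    g divides both a and b, so g is a common divisor.
    a*s + b*t == g, so every common divisor of a and b divides g.
    Together (with g > 0) these force g to be the greatest common divisor.
    """
    return g > 0 and a % g == 0 and b % g == 0 and a * s + b * t == g


# Claim: 91 is composite. Witness: 7.
print(verify_composite(91, 7))           # True
# Claim: 97 is composite. Witness: 7. Rejected.
print(verify_composite(97, 7))           # False

# Claim: gcd(240, 46) = 2. Witness: 240*(-9) + 46*47 = 2.
print(verify_gcd(240, 46, 2, -9, 47))    # True
# Same witness cannot support a wrong claim.
print(verify_gcd(240, 46, 1, -9, 47))    # False
```

The verifier is a few lines long and does no searching. Finding the witness may be hard; checking it is not.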

Olympiad problems punish:

Checks catch all of those.

And checks are cheap.

That’s the punchline.
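To make "cheap" concrete, here is a rough sketch that cross-checks a claimed closed form against brute force on small cases. The sum-of-squares formula is a standard identity; the deliberately wrong variant and the helper `first_counterexample` are assumptions added for illustration. Agreement on small cases is a sanity check, not a proof.

```python
def brute_force_sum_of_squares(n):
    """Reference answer: add 1^2 + 2^2 + ... + n^2 directly."""
    return sum(k * k for k in range(1, n + 1))


def claimed_formula(n):
    """Closed form under test: n(n+1)(2n+1)/6."""
    return n * (n + 1) * (2 * n + 1) // 6


def wrong_formula(n):
    """A deliberately shifted variant, to show the check catching it."""
    return n * (n - 1) * (2 * n - 1) // 6


def first_counterexample(formula, reference, upto=50):
    """Return the smallest n in 0..upto where the two disagree, else None."""
    for n in range(upto + 1):
        if formula(n) != reference(n):
            return n
    return None


print(first_counterexample(claimed_formula, brute_force_sum_of_squares))  # None
print(first_counterexample(wrong_formula, brute_force_sum_of_squares))    # 1
```

The wrong formula fails at the very first nontrivial case, which is the point: a loop over small inputs costs almost nothing and catches the kind of slip a long explanation can hide.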

Give me any hard problem you care about (contest / interview / real project).

I’ll reply with:

Comment CHECKS and I’ll drop a one-page “Witness-First + Checks” card you can reuse forever.

In the Stillwater Tower, this is Floor 3: Proving.

Because the goal isn’t to “sound like you reasoned.”

The goal is to produce an answer that survives a skeptic.

Math agents don’t win by talking smarter. They win by checking harder.

— Phuc Vinh Truong