🧩 The Reset Button — When AI Hits a Wall Again: ARC-AGI-3

NewBits Media
Mar 30
2 min read

NewBits Digest feature image for article on ARC-AGI-3, highlighting how the new benchmark resets the scoreboard for AI reasoning and adaptability.

This week, François Chollet’s ARC Prize Foundation dropped ARC-AGI-3—and it did something we’ve now seen a few times in this race:

It reset the scoreboard to near zero.

⚡ The Headline

Humans: 100%

AI models: still below 1%

Let that contrast sit.

Even the most advanced systems:

Gemini Pro → 0.37%

GPT-5.4 High → 0.26%

Opus 4.6 → 0.25%

Grok-4.20 → 0%

After billions invested and massive progress:

👉 Back to square one.

🎮 What Makes ARC-AGI-3 Different

This isn’t a knowledge test.

It’s a thinking test.

Agents are dropped into game-like environments with:

No instructions

No prior examples

No stated goals

They must:

Infer rules

Discover objectives

Build strategies

From scratch.

This is closer to how humans actually reason—and far from how most AI systems operate today.

🚀 Why This Keeps Happening

We’ve seen this pattern before:

ARC-AGI-2 scores started in the low single digits

Labs poured time and resources into optimizing for the benchmark

Progress then climbed dramatically, with top systems eventually pushing above 50%

Then ARC-AGI-3 arrives:

👉 And everything breaks again.

This is not failure.

This is stress testing the limits of intelligence itself.

🧠 The Real Question

Is AI actually learning to reason—

Or just getting better at:

Pattern matching

Scale brute force

Benchmark optimization

ARC-AGI-3 is designed specifically to expose that difference.

🌍 Zoom Out — The Bigger Arc

This is how the path toward more advanced AI unfolds:

Breakthrough

Optimization

Plateau

New benchmark

Reset

Repeat

Each cycle:

Raises the ceiling

Exposes new gaps

Forces deeper capability

🎯 What This Means

There’s a powerful parallel here:

The most advanced systems in the world just got humbled overnight.

And yet, if history holds:

👉 They’ll climb from below 1% to meaningful performance faster than many expect.

Because:

Resources will flood in

Focus will sharpen

Systems will adapt

🔑 Why It’s Important

Because this is the nature of real progress.

Not smooth.

Not linear.

But step-function leaps followed by hard resets.

ARC-AGI-3 reminds us of something critical:

We are still early.

But each reset is happening faster, with more intensity, and with higher stakes.

And eventually, one of these resets won’t just bounce back.

It will break through.

Enjoyed this article?

Stay ahead of the curve by subscribing to NewBits Digest, our weekly newsletter featuring curated AI stories, insights, and original content—from foundational concepts to the bleeding edge.

👉 Subscribe to NewBits Digest

👉 Register or Login at newbits.ai to like, comment, and join the conversation.

Want to explore more?

AI Solutions Directory: Discover AI models, tools & platforms.
AI Ed: Learn through our podcast series, From Bits to Breakthroughs.
AI Hub: Engage across our community and social platforms.

And remember, “It’s all about the bits…especially the new bits.”