Why I’m excited about the Hierarchical Reasoning Model


Since the arrival of ChatGPT, I’ve felt that progress in AI has been largely incremental. Of course, even ChatGPT was, and is, mostly just a scaled-up Transformer, the architecture introduced by Vaswani et al. in 2017.

The belief that AI is advancing relatively slowly is a radical and unpopular opinion these days, and it has been frustrating to defend that view while most of the Internet talks excitedly about the latest large language models (LLMs).

But in my opinion the new Hierarchical Reasoning Model (HRM) by Wang et al. is a genuine leap forward. It is the first model that seems to have the ability to think. In addition, and perhaps because of this capability, the model is also extremely efficient in terms of required training samples, trainable parameter count, and computational resources such as memory, because it uses relatively local gradient propagation rather than Backpropagation Through Time (BPTT) or an equivalent.
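To make the BPTT point concrete, here is a minimal PyTorch sketch of the general idea of avoiding Backpropagation Through Time: a small recurrent block is iterated for many steps, but gradients are propagated only through the final step instead of being unrolled through the whole trajectory, so memory cost stays constant in the number of steps. The names `TinyRecurrentCell` and `run_without_bptt` are my own illustrative inventions; this is only loosely in the spirit of the paper’s local-gradient scheme, not a reproduction of the authors’ architecture.

```python
import torch
import torch.nn as nn

class TinyRecurrentCell(nn.Module):
    """A toy recurrent 'reasoning' cell, iterated for several steps."""
    def __init__(self, dim: int):
        super().__init__()
        self.update = nn.Linear(2 * dim, dim)

    def forward(self, state: torch.Tensor, x: torch.Tensor) -> torch.Tensor:
        return torch.tanh(self.update(torch.cat([state, x], dim=-1)))

def run_without_bptt(cell: nn.Module, x: torch.Tensor, steps: int) -> torch.Tensor:
    state = torch.zeros_like(x)
    # Iterate most steps without tracking gradients ...
    with torch.no_grad():
        for _ in range(steps - 1):
            state = cell(state, x)
    # ... and back-propagate only through the final step (a one-step
    # approximation, rather than unrolling the whole trajectory as BPTT would).
    return cell(state.detach(), x)

if __name__ == "__main__":
    dim = 16
    cell = TinyRecurrentCell(dim)
    x = torch.randn(4, dim)
    out = run_without_bptt(cell, x, steps=8)
    loss = out.pow(2).mean()
    loss.backward()  # memory use does not grow with `steps`
    print(loss.item())
```

The trade-off is that gradients from intermediate steps are discarded, which is exactly what keeps memory flat compared with full BPTT.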

Researchers have been trying for a while now to build models with stronger reasoning capabilities. Some improvement has come from Chain-of-Thought (CoT) prompting and related techniques that induce LLMs to analyze a problem systematically while generating their response. Researchers have also tried adding “thinking tokens” to encourage more deliberative output, and have designed “agentic AI architectures” that aim to let AIs tackle a broad range of user problems more independently. However, agentic architectures almost always involve human engineers inventing a way to break broad problems down into specific sub-problems that the AI will find more manageable, helping it remain task-focused. Who is doing the thinking here?
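As a toy illustration of what CoT prompting amounts to (my own example, not taken from any paper), the same question can be posed directly or with an instruction to reason step by step; the actual model call is left abstract, since any chat-style LLM API would slot in.

```python
# Hypothetical example: a direct prompt versus a Chain-of-Thought prompt.
question = "A train travels 60 km in 45 minutes. What is its average speed in km/h?"

direct_prompt = f"{question}\nAnswer:"

cot_prompt = (
    f"{question}\n"
    "Let's think step by step, then state the final answer on its own line."
)

print(direct_prompt)
print("---")
print(cot_prompt)
```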
