Continuing our walk through Moonshot AI’s technical paper for Kimi 2, let’s move on from how Moonshine rephrased data during pre-training and look

How Kimi 2 Became One of the Best Tool-Using Models

submited by

Style Pass

2025-07-30 21:30:41

Continuing our walk through Moonshot AI’s technical paper for Kimi 2, let’s move on from how Moonshine rephrased data during pre-training and look at how they taught Kimi how to use tools better in agentic workflows.

I like MCP, but I don’t like that it made everyone think tool use is plug-and-play. Yes, MCP standardizes how we express and add tool functions to our contexts (and remember, it’s one of many), but we still have to provide our model with instructions for when and how to use the tools available to it.

Connecting your model to random MCPs and then giving it a task is like giving someone a drill and teaching them how it works, then asking them to fix your sink. Is the drill relevant in this scenario? If it’s not, why was it given to me? It’s a classic case of context confusion.

The dominant, general pattern for instructing a model with tools is called ReAct, which stands for “reason” and “action.” ReAct isn’t fancy; it’s just a prompting technique. We provide a model with tools, then instruct it to reason about whether it should use these tools to accomplish a given task.

How Kimi 2 Became One of the Best Tool-Using Models

Leave a Comment

Related Posts

Recent Posts

Computer Science > Distributed, Parallel, and Cluster Computing

Search code, repositories, users, issues, pull requests...

Explore the Complete History of Blockchain Technology

A shocking record: Lightning bolt stretched 515 miles, crossed three states

On Sanctuary Cities, It's Trump vs. the 10th Amendment

Announcement – ConfigServer Services

Using a fan can make older adults hotter in a dry heat

Yale Environment 360

What Searching For Aliens Reveals About Ourselves

Achieving the lowest operating voltage yet for white organic LEDs

Some low-grade prostate cancers carry higher risks than biopsy suggests

Amap launches low-altitude navigation with 3D digital mapping in Shenzhen

QUIC for the kernel [LWN.net]

Honey, AI Capex Keeps Eating ... Everything

Maybe the Fastest Disk Usage Program on macOS

Computers and the older generation

Perchlorate brine formation from frost at the Viking 2 landing site

URSA - Leaderless & Stateless Kafka Streaming at 95% Lower Cost - StreamNative

Why I think AGI IS right around the corner

Do LLMs consider security? an empirical study on responses to programming questions