OpenAI releases 'gpt-oss-120b' and 'gpt-oss-20b', two SOTA open language models under the Apache 2.0 license. Both are 128K-context models that outperform similarly sized open models on reasoning, tool use, and agentic tasks. You can now run & fine-tune them locally with Unsloth!
Trained with RL, gpt-oss-120b rivals o4-mini and gpt-oss-20b rivals o3-mini. Both excel at function calling and CoT reasoning, surpassing o1 and GPT-4o.
OpenAI released a standalone parsing and tokenization library called Harmony, which tokenizes conversations into OpenAI's preferred format for gpt-oss. The official OpenAI cookbook article provides many more details on how to use the Harmony library.
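To give a feel for what Harmony produces, here is a minimal plain-Python sketch of the rendered chat format the library targets. The special tokens (`<|start|>`, `<|message|>`, `<|end|>`) follow OpenAI's published gpt-oss format; the real tokenization should of course be done by the Harmony library itself, and this helper is only an illustration:

```python
# Sketch of the Harmony text format (illustrative, not the real library).
# Each message is wrapped as <|start|>{role}<|message|>{content}<|end|>,
# and the prompt ends with an open assistant turn for generation.
def render_harmony(messages):
    out = ""
    for m in messages:
        out += f"<|start|>{m['role']}<|message|>{m['content']}<|end|>"
    # Generation prompt: leave the assistant turn open for the model.
    return out + "<|start|>assistant"

prompt = render_harmony([
    {"role": "user", "content": "What is 1+1?"},
])
print(prompt)
# <|start|>user<|message|>What is 1+1?<|end|><|start|>assistant
```

Note that real Harmony renders richer structure than this (system/developer messages, reasoning channels such as `analysis` and `final`), which is exactly why exact template fidelity matters.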
Inference engines generally rely on the jinja chat template rather than the Harmony package, and after comparing the template's output against Harmony's directly, we found some issues. In the comparison below, the top is the correct rendering from Harmony, and the bottom is what the current jinja chat template produces. There are quite a few differences!
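A comparison like this can be reproduced by diffing the two rendered prompts. The strings below are illustrative placeholders standing in for the Harmony-rendered prompt and the jinja-template-rendered prompt, not the actual discrepancies we found:

```python
import difflib

# Placeholder strings: in practice these would come from Harmony's
# renderer and from applying the model's jinja chat template.
harmony_render = "<|start|>user<|message|>Hi<|end|><|start|>assistant"
jinja_render = "<|start|>user<|message|>Hi<|end|>\n<|start|>assistant"

# ndiff marks lines unique to each side with '-' and '+'.
for line in difflib.ndiff([harmony_render], [jinja_render]):
    print(line)
```

Any `-`/`+` pair in the output flags a divergence between the two templates, such as a stray newline before the generation prompt.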
We also made some functions that let you use OpenAI's Harmony library directly, without a jinja chat template, if you desire - you can simply pass in normal conversations like below:
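The input is just a standard OpenAI-style messages list. The helper name in the last comment is hypothetical - it stands in for whichever Unsloth function renders the conversation through Harmony - but the conversation format itself is the ordinary one:

```python
# A normal conversation in the standard messages-list format: a list of
# dicts with "role" and "content" keys.
messages = [
    {"role": "system", "content": "You are a helpful assistant."},
    {"role": "user", "content": "Solve x^2 + 4x = 2."},
]

# Hypothetical helper name, shown only to illustrate the call shape:
# prompt_ids = encode_conversation_with_harmony(messages)
```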