Structured outputs: don’t put the cart before the horse

Submitted by

Style Pass

2024-11-10 12:00:03

Not long ago, we couldn’t reliably ask LLMs to provide a response using a specific format. Building tools that used LLM outputs was painful.

Eventually, first through function calling and then through structured outputs, we could instruct LLMs to respond in specific formats. Reliably extracting information from LLM outputs stopped being a problem.

But then I started noticing that structured outputs were not always the silver bullet people think they are. Defining response formats adds a sort of safety net, and people often forget that underneath, they’re still dealing with an LLM. Setting up a Pydantic model for your API calls is not the same as setting up a Pydantic model for your LLM outputs.

Suppose I have a physical, solid, equilateral triangle, and I make two cuts. The two cuts are from two parallel lines, and both cuts pass through the interior of the triangle. How many pieces are there after the cuts? Think step by step, and then put your answer in bold as a single integer (for example, 0). If you don’t know, guess.
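The prompt above asks for the final answer in bold as a single integer, so a free-text reply still has to be parsed before it can be scored. Here is a minimal sketch of that step, assuming the model writes bold as Markdown `**...**`. The function name `extract_bold_integer` and the sample reply are made up for illustration.

```python
import re
from typing import Optional

# Matches a Markdown-bold integer such as **3** (assumes ** is used for bold).
BOLD_INT = re.compile(r"\*\*\s*(-?\d+)\s*\*\*")


def extract_bold_integer(reply: str) -> Optional[int]:
    """Return the last bold integer in the reply, or None if there is none."""
    matches = BOLD_INT.findall(reply)
    # Take the last match: step-by-step reasoning may bold intermediate numbers.
    return int(matches[-1]) if matches else None


# Hypothetical model reply, written by hand for this example.
sample_reply = "Two parallel cuts through the interior give three strips. **3**"
print(extract_bold_integer(sample_reply))
```

Taking the last match rather than the first is a design choice: the prompt asks the model to think step by step before answering, so the final bold number is the most likely candidate for the answer.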

Do you think that there will be a difference in performance between ResponseFormatA and ResponseFormatB? If so, which one do you think will perform better?
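The definitions of `ResponseFormatA` and `ResponseFormatB` are not included in this excerpt. As a guess based on the title, the sketch below assumes both formats hold the same two fields and differ only in their order: reasoning first versus answer first. The field names are assumptions, and standard-library dataclasses stand in for Pydantic models so the example runs without extra dependencies.

```python
from dataclasses import dataclass, fields
from typing import List


@dataclass
class ResponseFormatA:
    # Assumed layout: the reasoning is written before the answer.
    reasoning: str
    answer: str


@dataclass
class ResponseFormatB:
    # Assumed layout: the answer is written before the reasoning.
    answer: str
    reasoning: str


def field_order(fmt: type) -> List[str]:
    """Return the field names of a dataclass in declaration order."""
    return [f.name for f in fields(fmt)]


print(field_order(ResponseFormatA))
print(field_order(ResponseFormatB))
```

If the structured-output API emits keys in the order the schema declares them, which is my understanding of how such APIs typically behave, then under this assumed layout the second format would make the model commit to an answer before it has produced any reasoning text.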
