GPU Memory Snapshots: Supercharging Sub-second Startup

submitted by

Style Pass

2025-07-31 16:30:06

At Modal, we’re obsessed with cold start latency. Earlier this year, we introduced memory snapshots to slash startup times by more than half. Today, we’re thrilled to announce the next evolution: GPU memory snapshots—bringing the same checkpoint/restore magic to GPU-accelerated workloads.

Our distributed file system uses a series of caches to keep the files most popular across Modal users directly in worker memory. For example, if one program imports torch, another program benefits because the torch files are already in the worker cache. This has a substantial impact on performance: reads are usually 3-5x faster than downloading the files without a cache.
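To make the caching idea concrete, here is a minimal sketch of a worker-level read-through cache. It is an illustration only, not Modal's file system: `WorkerFileCache`, `fake_download`, and the file path are hypothetical names, and a plain dictionary stands in for the real series of caches.

```python
from typing import Callable, Dict


class WorkerFileCache:
    """Toy in-memory cache shared by every program on one worker."""

    def __init__(self, fetch: Callable[[str], bytes]) -> None:
        self._fetch = fetch  # slow path, e.g. a network download
        self._files: Dict[str, bytes] = {}
        self.hits = 0
        self.misses = 0

    def read(self, path: str) -> bytes:
        # Serve from worker memory when another program already loaded the file.
        if path in self._files:
            self.hits += 1
            return self._files[path]
        # Otherwise take the slow path once and remember the result.
        self.misses += 1
        data = self._fetch(path)
        self._files[path] = data
        return data


def fake_download(path: str) -> bytes:
    # Hypothetical stand-in for fetching a file from remote storage.
    return f"contents of {path}".encode()


cache = WorkerFileCache(fake_download)
first = cache.read("torch/__init__.py")   # program A: miss, triggers the download
second = cache.read("torch/__init__.py")  # program B: hit, served from memory
```

In this sketch only the first read pays the download cost; every later read of the same path, from any program on the worker, is served from memory.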

The lifecycle of a Modal Function has two main stages: a container cold boot, then running inputs. A cold boot most commonly involves two things: downloading your program files and reading your program into memory.

Reading a program into memory and starting up a Function takes time—sometimes a lot of time! What if we could take the memory representation of your program and save it into an image? That could save time by skipping reading files and re-creating your program in memory on every cold boot.
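The snapshot idea can be sketched in a few lines of Python. This is an analogy under stated assumptions, not how Modal implements snapshots: it serializes a single object with `pickle` rather than checkpointing a whole process, and `build_state`, `cold_boot`, and the snapshot file name are hypothetical.

```python
import pickle
import tempfile
from pathlib import Path
from typing import Tuple


def build_state() -> dict:
    # Hypothetical expensive start-up work (imports, parsing, model loading).
    return {"weights": [i * 0.5 for i in range(1000)], "ready": True}


def cold_boot(snapshot: Path) -> Tuple[dict, bool]:
    """Return (state, restored); restored is True when a snapshot was used."""
    if snapshot.exists():
        # Fast path: load the saved in-memory representation directly.
        return pickle.loads(snapshot.read_bytes()), True
    # Slow path: rebuild the state from scratch, then save it for next time.
    state = build_state()
    snapshot.write_bytes(pickle.dumps(state))
    return state, False


snapshot_path = Path(tempfile.mkdtemp()) / "state.snapshot"
first_state, first_restored = cold_boot(snapshot_path)    # builds and saves
second_state, second_restored = cold_boot(snapshot_path)  # restores from disk
```

The first boot pays the full initialization cost and writes the snapshot; the second boot skips `build_state` entirely and ends up with the same state.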
