The RTen machine learning runtime - a 2024 retrospective

submited by
Style Pass
2025-01-17 10:00:04

RTen is a machine learning runtime for Rust that I have been working on since mid 2022. It allows you to take models that have been trained in Python, using a framework such as PyTorch or Keras and run them in a Rust project.

This includes many pre-trained models that are available on places such as Hugging Face. Models are first exported to ONNX which is a portable exchange format for models, then converted to a FlatBuffers-based format (.rten) which can be loaded more efficiently at runtime.

The project started out as a runtime for the specific use case of Ocrs, an OCR library and CLI tool. Since then it has grown to be a much more general purpose ONNX runtime that can run a wide range of models.

This is the largest open source project I have created from scratch, clocking in at around 60K lines of Rust (thanks tokei!). Along the way I have learned a lot about writing high performance number-crunching code, as well as how Rust’s approach to memory and thread safety fares in this context. I plan to do a separate blog post about that.

There are several machine learning runtimes available for Rust. These include Tract, Candle and Burn. There are also wrappers around established C++ runtimes such as Ort (ONNX Runtime).

Leave a Comment