Nov. 26, 2024: AMD today announced the release of ROCm Version 6.3 open-source platform, introducing tools and optimizations for AI, ML and HPC worklo

AMD Releases ROCm Version 6.3

submited by
Style Pass
2024-11-27 15:00:07

Nov. 26, 2024: AMD today announced the release of ROCm Version 6.3 open-source platform, introducing tools and optimizations for AI, ML and HPC workloads on AMD Instinct GPU accelerators.

ROCm 6.3 is engineered for a range of organizations, from AI startups to HPC-driven industries, and is designed to enhance developer productivity

Features of this release include SGLang integration for AI inferencing, a re-engineered FlashAttention-2 for AI training and inference, the introduction of multi-node Fast Fourier Transform (FFT) for HPC workflows and other features:

GenAI is transforming industries, but deploying large models often means grappling with latency, throughput, and resource utilization challenges. Enter SGLang, a new runtime supported by ROCm 6.3, purpose-built for optimizing inference of cutting-edge generative models such as LLMs and VLMs on AMD Instinct GPUs.

Whether you’re building customer-facing AI solutions or scaling AI workloads in the cloud, SGLang delivers the performance and ease-of-use needed to meet enterprise demands. Discover the powerful features of SGLang and learn how to seamlessly set up and run models on AMD Instinct GPU accelerators here > Get started now!

Leave a Comment