Introducing Stable Audio 2.0

Submitted by
Style Pass
2024-04-03 12:30:06

Stable Audio 2.0 sets a new standard in AI-generated audio, producing high-quality, full tracks with coherent musical structure up to three minutes in length at 44.1 kHz stereo.

The new model introduces audio-to-audio generation by allowing users to upload and transform samples using natural language prompts.

Stable Audio 2.0 was trained exclusively on a licensed dataset from the AudioSparx music library, honoring opt-out requests and ensuring fair compensation for creators.

Today, we are pleased to introduce Stable Audio 2.0. From a single natural language prompt, this model generates high-quality, full tracks with coherent musical structure, up to three minutes long, at 44.1 kHz stereo.

The new model goes beyond text-to-audio to include audio-to-audio capabilities. Users can now upload audio samples and, through natural language prompts, transform these samples into a wide array of sounds. This update also expands sound effect generation and style transfer, providing artists and musicians more flexibility, control, and an elevated creative process.

Stable Audio 2.0 builds upon Stable Audio 1.0, which debuted in September 2023 as the first commercially viable AI music generation tool capable of producing high-quality 44.1 kHz music, leveraging latent diffusion technology. It has since been named one of TIME’s Best Inventions of 2023.
