Stability AI released Stable Audio 3.0, marking a shift toward open-source audio generation. Three of the new models come with publicly available weights, allowing developers to run and modify them locally without relying on proprietary APIs.

The system generates music tracks up to six minutes long, a substantial increase from previous versions. Stability AI trained all models exclusively on licensed data, addressing longstanding concerns about copyright in generative audio. The company built Stable Audio 3.0 to support remixing and fine-tuning, giving users direct control over outputs.

Open weights matter here. Releasing model parameters publicly lets researchers inspect how the system works, spot bias or errors, and adapt it for specific use cases. It also distributes power away from any single company controlling audio generation. Competitors like OpenAI's Jukebox and Google's MusicLM largely operate as closed services.

The six-minute duration breaks a real limitation in earlier audio generators. Most previous systems maxed out at 30 seconds or two minutes. Longer tracks enable practical music production workflows. Musicians can now generate song-length compositions rather than short loops.

Training exclusively on licensed data signals Stability AI's attempt to navigate copyright friction that has plagued generative AI. Using properly licensed music reduces legal exposure and sidesteps complaints from artists about unauthorized data scraping. This approach comes as music industry organizations increasingly push back against AI companies that trained on copyrighted material without permission.

The remix and tuning features indicate Stable Audio 3.0 targets creative professionals, not just casual users. These tools let musicians integrate AI generation into actual production pipelines rather than treating it as a novelty.

Stability AI positions this release as democratizing audio generation. Open weights lower barriers to entry for researchers and smaller developers. It also lets the community identify and fix problems faster than proprietary models allow. The combination of longer outputs