Post
505
๐ New NVIDIA paper: Audio-SDS ๐
We adapt Score Distillation Sampling (SDS), originally developed for text-to-3D generation, to audio diffusion models, allowing us to reuse large pretrained models for new text-guided parametric audio tasks such as source separation, physically informed impact synthesis, and more.
๐ Project Page: https://research.nvidia.com/labs/toronto-ai/Audio-SDS/
๐ Full Paper: https://arxiv.org/abs/2505.04621
Check out more from NVIDIAโs Spatial Intelligence Lab here: https://research.nvidia.com/labs/toronto-ai/
This project was led by the great work of Jessie Richter-Powell, along with Antonio Torralba.
Notably, we find a new and exciting use case for Stable Audio Open ๐
We adapt Score Distillation Sampling (SDS), originally developed for text-to-3D generation, to audio diffusion models, allowing us to reuse large pretrained models for new text-guided parametric audio tasks such as source separation, physically informed impact synthesis, and more.
๐ Project Page: https://research.nvidia.com/labs/toronto-ai/Audio-SDS/
๐ Full Paper: https://arxiv.org/abs/2505.04621
Check out more from NVIDIAโs Spatial Intelligence Lab here: https://research.nvidia.com/labs/toronto-ai/
This project was led by the great work of Jessie Richter-Powell, along with Antonio Torralba.
Notably, we find a new and exciting use case for Stable Audio Open ๐