nvidia/audio-flamingo-2
Audio-Text-to-Text
arxiv: 2503.03983
arxiv: 2402.01831
arxiv: 2204.14198
License: mit
What visual model would you use in tandem? Distallignation?
1
#2 opened 9 days ago by TimeLordRaps