Hugging Face's logo Hugging Face
  • Models
  • Datasets
  • Spaces
  • Posts
  • Docs
  • Enterprise
  • Pricing

  • Log In
  • Sign Up
RobinsAIWorld 's Collections
Coaching Tools
LLM Craft
Robin's Tools
3D
Video
Image Editing
Upscalers
Audio
Splats
Coders
Content Production

Audio

updated Feb 17
Upvote
-

  • Running on Zero
    900
    900

    Whisper Turbo

    🤯

    Transcribe audio or YouTube videos to text


  • Sleeping
    3
    3

    Pdf2audio

    📚

    Transform text into engaging podcast dialogue


  • Running on T4
    1.08k
    1.08k

    Open NotebookLM

    🎙

    Personalised Podcasts For All - Available in 13 Languages


  • Running on Zero
    757
    757

    Video Dubbing (SoniTranslate)

    🌍

    Video Dubbing with Open Source Projects


  • Running
    1
    1

    Whisper Timestamped

    🕒

    In-browser speech recognition w/ word-level timestamps


  • Running on T4
    2.3k
    2.3k

    Bark

    🐶

    Generate realistic audio from text


  • Running
    157
    157

    Whisper Large V3 Turbo WebGPU

    🚀

    ML-powered speech recognition directly in your browser


  • Running on L40S
    2.24k
    2.24k

    Whisper

    📉

    Transcribe audio from microphone, files, or YouTube


  • Running on Zero
    98
    98

    Giant Music Transformer

    🦖

    Fast multi-instrumental music transformer


  • Running
    6
    6

    Kokoro TTS

    ❤

    Upgraded to v1.0!


  • Build error
    44
    44

    XTTS Voice Clone on CPU

    🚀

    Generate audio by cloning a voice


  • Running on Zero
    92
    92

    Voice Clone

    🎥

    Voice Clone Multilingual TTS


  • Running on Zero
    2.6k
    2.6k

    Kokoro TTS

    ❤

    Upgraded to v1.0!


  • Running
    1
    1

    Whisper.cpp WASM

    📉

    Transcribe audio files or live microphone input to text

Upvote
-
  • Collection guide
  • Browse collections
Company
TOS Privacy About Jobs
Website
Models Datasets Spaces Pricing Docs