1 10 65

babycommando

AI & ML interests

ai!

Recent Activity

liked a Space 12 days ago

Xenova/kokoro-web

liked a model 14 days ago

microsoft/bitnet-b1.58-2B-4T

liked a model 21 days ago

nvidia/GR00T-N1-2B

View all activity

Organizations

babycommando's activity

liked a Space 12 days ago

Kokoro Web

🗣

ML-powered speech synthesis directly in your browser

liked a model 14 days ago

microsoft/bitnet-b1.58-2B-4T

Text Generation • Updated 14 days ago • 89.9k • 1k

liked a model 21 days ago

nvidia/GR00T-N1-2B

Robotics • Updated Mar 18 • 4.11k • 298

liked a Space 22 days ago

1.32k

Dia 1.6B

👯

Generate realistic dialogue from a script, using Dia!

liked a model 22 days ago

maxhirez/ShorNet

Graph Machine Learning • Updated 26 days ago • 1

upvoted a collection about 1 month ago

Llasa

Collection

TTS foundation model compatible with Llama framework (160k hours tokenized speech data released) • 11 items • Updated 4 days ago • 18

liked 2 models about 1 month ago

OuteAI/Llama-OuteTTS-1.0-1B-ONNX

Text-to-Speech • Updated Apr 7 • 53 • 8

unsloth/Llama-4-Scout-17B-16E-Instruct

Image-Text-to-Text • Updated 22 days ago • 6.06k • 55

upvoted a paper 8 months ago

NVLM: Open Frontier-Class Multimodal LLMs

Paper • 2409.11402 • Published Sep 17, 2024 • 75

liked 2 models 9 months ago

yukiarimo/yuna-ai-v2

Text Generation • Updated Sep 21, 2024 • 146 • 5

Qwen/Qwen2-Audio-7B-Instruct

Audio-Text-to-Text • Updated Jan 12 • 122k • 434

upvoted a paper 10 months ago

Learning to Manipulate Anywhere: A Visual Generalizable Framework For Reinforcement Learning

Paper • 2407.15815 • Published Jul 22, 2024 • 14

liked a model 10 months ago

PatronusAI/Llama-3-Patronus-Lynx-8B-Instruct

Text Generation • Updated Jul 22, 2024 • 343 • 42

upvoted a paper 10 months ago

Wavelets Are All You Need for Autoregressive Image Generation

Paper • 2406.19997 • Published Jun 28, 2024 • 32

liked a model 10 months ago

internlm/internlm-xcomposer2d5-7b

Visual Question Answering • Updated Jul 22, 2024 • 2.52k • 204

upvoted a paper 10 months ago

InternLM-XComposer-2.5: A Versatile Large Vision Language Model Supporting Long-Contextual Input and Output

Paper • 2407.03320 • Published Jul 3, 2024 • 96

liked 2 Spaces 11 months ago

759

Florence 2

📉

Analyze images to generate captions, detect objects, or perform OCR

544

AuraSR-v2

😻

Upscale images to x4

upvoted a paper 11 months ago

Cambrian-1: A Fully Open, Vision-Centric Exploration of Multimodal LLMs

Paper • 2406.16860 • Published Jun 24, 2024 • 61