hysts

AI & ML interests

Computer Vision

Recent Activity

new activity about 10 hours ago

Chaerin5/FoundHand:ZERO gpu and PyTorch mismatch

new activity 1 day ago

hysts-duplicates/Unique3D:🚩Report: It doesn't work to duplicate the space

updated a Space 1 day ago

hysts-duplicates/Unique3D

hysts's activity

upvoted an article 17 days ago

Article

Hugging Face and Cloudflare Partner to Make Real-Time Speech and Video Seamless with FastRTC

19 days ago

• 21

upvoted an article about 1 month ago

Article

Introducing Gradio's new Dataframe!

Mar 24

• 23

upvoted 2 articles about 2 months ago

Article

Welcome Gemma 3: Google's all new multimodal, multilingual, long context open LLM

Mar 12

• 400

Article

A Deepdive into Aya Vision: Advancing the Frontier of Multilingual Multimodality

Mar 4

• 74

upvoted an article 2 months ago

Article

FastRTC: The Real-Time Communication Library for Python

Feb 25

• 159

upvoted 4 articles 3 months ago

Article

Open-source DeepResearch – Freeing our search agents

Feb 4

• 1.23k

Article

The AI tools for Art Newsletter - Issue 1

Jan 31

• 77

Article

Open-R1: a fully open reproduction of DeepSeek-R1

Jan 28

• 846

Article

Welcome to Inference Providers on the Hub 🔥

Jan 28

• 480

upvoted a collection 7 months ago

Moshi v0.1 Release

MLX, Candle & PyTorch model checkpoints released as part of the Moshi release from Kyutai. Run inference via: https://github.com/kyutai-labs/moshi • 15 items • Updated 9 days ago • 228

upvoted a paper 11 months ago

An Introduction to Vision-Language Modeling

Paper • 2405.17247 • Published May 27, 2024 • 90

upvoted a paper about 1 year ago

LongRoPE: Extending LLM Context Window Beyond 2 Million Tokens

Paper • 2402.13753 • Published Feb 21, 2024 • 116

upvoted 4 papers over 1 year ago

ChatAnything: Facetime Chat with LLM-Enhanced Personas

Paper • 2311.06772 • Published Nov 12, 2023 • 35

Music ControlNet: Multiple Time-varying Controls for Music Generation

Paper • 2311.07069 • Published Nov 13, 2023 • 45

Q-Instruct: Improving Low-level Visual Abilities for Multi-modality Foundation Models

Paper • 2311.06783 • Published Nov 12, 2023 • 28

I2VGen-XL: High-Quality Image-to-Video Synthesis via Cascaded Diffusion Models

Paper • 2311.04145 • Published Nov 7, 2023 • 35