Unchun Yang's picture

Unchun Yang

ucyang

·

https://ucyang.com/

AI & ML interests

None yet

Recent Activity

liked a model about 19 hours ago

microsoft/bitnet-b1.58-2B-4T-bf16

upvoted a paper about 19 hours ago

BitNet b1.58 2B4T Technical Report

upvoted a paper about 19 hours ago

ReTool: Reinforcement Learning for Strategic Tool Use in LLMs

View all activity

Organizations

ucyang's activity

upvoted 2 papers about 19 hours ago

BitNet b1.58 2B4T Technical Report

Paper • 2504.12285 • Published 4 days ago • 58

ReTool: Reinforcement Learning for Strategic Tool Use in LLMs

Paper • 2504.11536 • Published 5 days ago • 53

upvoted 2 papers 3 days ago

Nemotron-H: A Family of Accurate and Efficient Hybrid Mamba-Transformer Models

Paper • 2504.03624 • Published 16 days ago • 13

OpenCodeReasoning: Advancing Data Distillation for Competitive Coding

Paper • 2504.01943 • Published 18 days ago • 13

upvoted 3 collections 3 days ago

OpenCodeReasoning

Reasoning data for supervised finetuning of LLMs to advance data distillation for competitive coding • 5 items • Updated 6 days ago • 7

Nemotron-H

Mamba-Transformer hybrid models • 5 items • Updated 6 days ago • 17

Open-Sora 2.0

3 items • Updated Mar 12 • 12

upvoted a collection 6 days ago

GLM-4-0414

GLM-4-0414 series model • 8 items • Updated 6 days ago • 102

upvoted a collection 11 days ago

Orpheus Multilingual Research Release

Beta Release of multilingual models. • 12 items • Updated 10 days ago • 76

upvoted 2 collections 12 days ago

Cogito v1 Preview

5 items • Updated 13 days ago • 101

OuteTTS 1.0

3 items • Updated 13 days ago • 4

upvoted a paper 13 days ago

SmolVLM: Redefining small and efficient multimodal models

Paper • 2504.05299 • Published 13 days ago • 163

upvoted a collection 14 days ago

Llama 4

Meta's new Llama 4 multimodal models, Scout & Maverick. Includes Dynamic GGUFs, 16-bit & Dynamic 4-bit uploads. Run & fine-tune them with Unsloth! • 15 items • Updated 4 days ago • 43

upvoted 2 articles 15 days ago

Article

Xet is on the Hub

Mar 18

• 47

Article

Welcome Llama 4 Maverick & Scout on Hugging Face!

16 days ago

• 140

upvoted a collection 15 days ago

Llama 4

Llama 4 release • 10 items • Updated 15 days ago • 438

upvoted a paper 16 days ago

ReSearch: Learning to Reason with Search for LLMs via Reinforcement Learning

Paper • 2503.19470 • Published 27 days ago • 17

upvoted an article 17 days ago

Article

You could have designed state of the art positional encoding

Nov 25, 2024

• 230

upvoted a collection 17 days ago

Gemma 3 QAT

Quantization Aware Trained (QAT) Gemma 3 checkpoints. The model preserves similar quality as half precision while using 3x less memory • 15 items • Updated 3 days ago • 147