Nguyen Van Thanh

NguyenVanThanhHust

AI & ML interests

None yet

Recent Activity

upvoted a paper 3 days ago

Medusa: Simple LLM Inference Acceleration Framework with Multiple Decoding Heads

upvoted a paper 3 days ago

Mixtral of Experts

upvoted a paper 3 days ago

Blending Is All You Need: Cheaper, Better Alternative to Trillion-Parameters LLM

View all activity

Organizations

None yet

NguyenVanThanhHust's activity

upvoted 7 papers 3 days ago

Medusa: Simple LLM Inference Acceleration Framework with Multiple Decoding Heads

Paper • 2401.10774 • Published Jan 19, 2024 • 57

Mixtral of Experts

Paper • 2401.04088 • Published Jan 8, 2024 • 160

Blending Is All You Need: Cheaper, Better Alternative to Trillion-Parameters LLM

Paper • 2401.02994 • Published Jan 4, 2024 • 52

upvoted an article 3 days ago

Article

makeMoE: Implement a Sparse Mixture of Experts Language Model from Scratch

•

May 7, 2024

• 80

upvoted an article 7 days ago

Article

Welcome FalconMamba: The first strong attention-free 7B model

and 5 others •

Aug 12, 2024

• 112

upvoted an article 9 days ago

Article

A Gentle Introduction to 8-bit Matrix Multiplication for transformers at scale using transformers, accelerate and bitsandbytes

and 1 other •

Aug 17, 2022

• 91

upvoted a paper 12 days ago

LIMA: Less Is More for Alignment

Paper • 2305.11206 • Published May 18, 2023 • 26

upvoted an article 18 days ago

Article

Optimizing your LLM in production

•

Sep 15, 2023

• 18

upvoted a paper 28 days ago

MVD^2: Efficient Multiview 3D Reconstruction for Multiview Diffusion

Paper • 2402.14253 • Published Feb 22, 2024 • 7

upvoted an article 28 days ago

Article

Introduction to ggml

and 2 others •

Aug 13, 2024

• 202

upvoted 6 papers about 1 month ago

Large Language Models as Markov Chains

Paper • 2410.02724 • Published Oct 3, 2024 • 34

BEDLAM: A Synthetic Dataset of Bodies Exhibiting Detailed Lifelike Animated Motion

Paper • 2306.16940 • Published Jun 29, 2023 • 6

Mobile-Agent: Autonomous Multi-Modal Mobile Device Agent with Visual Perception

Paper • 2401.16158 • Published Jan 29, 2024 • 21

Multi-Track Timeline Control for Text-Driven 3D Human Motion Generation

Paper • 2401.08559 • Published Jan 16, 2024 • 9

Trillion 7B Technical Report

Paper • 2504.15431 • Published Apr 21 • 35

Tina: Tiny Reasoning Models via LoRA

Paper • 2504.15777 • Published Apr 22 • 55