chenhao

chenhaodev

dreamclinger

AI & ML interests

Note: see https://gist.github.com/chenhaodev for LLM performance on medical tasks.

Recent Activity

liked a Space 1 day ago

ariG23498/qwen-od

liked a dataset 3 days ago

ncbi/MedCalc-Bench-v1.0

liked a Space 6 days ago

prithivMLmods/Multimodal-OCR

Organizations

None yet

chenhaodev's activity

upvoted an article 18 days ago

How to Build an MCP Server with Gradio

Article • and 1 other • 22 days ago • 109

upvoted a paper 20 days ago

SmolLM2: When Smol Goes Big -- Data-Centric Training of a Small Language Model

Paper • 2502.02737 • Published Feb 4 • 229

upvoted a collection 21 days ago

GraphRAG Papers

Collection

Research relating graphs and GenAI. For discussion, find dedicated threads on https://discord.gg/graphrag • 47 items • Updated 14 days ago • 34

upvoted 3 articles 2 months ago

Train 400x faster Static Embedding Models with Sentence Transformers

Article • Jan 15 • 179

Welcome Gemma 3: Google's all new multimodal, multilingual, long context open LLM

Article • and 3 others • Mar 12 • 417

Finally, a Replacement for BERT: Introducing ModernBERT

Article • and 14 others • Dec 19, 2024 • 632

upvoted 2 papers 3 months ago

LightRAG: Simple and Fast Retrieval-Augmented Generation

Paper • 2410.05779 • Published Oct 8, 2024 • 4

StructRAG: Boosting Knowledge Intensive Reasoning of LLMs via Inference-time Hybrid Information Structurization

Paper • 2410.08815 • Published Oct 11, 2024 • 50

upvoted 3 articles 4 months ago

Open-source DeepResearch – Freeing our search agents

Article • and 4 others • Feb 4 • 1.25k

MiniMax-01 is Now Open-Source: Scaling Lightning Attention for the AI Agent Era

Article • Jan 15 • 44

Halo: Open Source Health Tracking with Wearables

Article • Nov 19, 2024 • 110

upvoted 2 papers 6 months ago

MedAlign: A Clinician-Generated Dataset for Instruction Following with Electronic Medical Records

Paper • 2308.14089 • Published Aug 27, 2023 • 30

EHRCon: Dataset for Checking Consistency between Unstructured Notes and Structured Tables in Electronic Health Records

Paper • 2406.16341 • Published Jun 24, 2024 • 13

upvoted 2 articles 6 months ago

MedEmbed: Fine-Tuned Embedding Models for Medical / Clinical IR

Article • Oct 20, 2024 • 43

Better RAG 1: Advanced Basics

Article • Mar 14, 2024 • 28

upvoted a paper 6 months ago

MagicQuill: An Intelligent Interactive Image Editing System

Paper • 2411.09703 • Published Nov 14, 2024 • 78

upvoted an article 6 months ago

Fine-tune Llama 3.1 Ultra-Efficiently with Unsloth

Article • Jul 29, 2024 • 320

upvoted a paper 6 months ago

Simple and Scalable Strategies to Continually Pre-train Large Language Models

Paper • 2403.08763 • Published Mar 13, 2024 • 52

upvoted 2 articles 7 months ago

🦙⚗️ Using Llama3 and distilabel to build fine-tuning datasets

Article • Jun 4, 2024 • 78

How OpenGPT 4o works

Article • Jul 17, 2024 • 38