We just shipped a blog post covering the latest in vision language models, including GUI agents, agentic VLMs, omni models, multimodal RAG, video LMs, smol models ..and more! https://huggingface.co/blog/vlms-2025
Seed-Coder has been released: a model family designed for coding tasks, featuring base, instruct, and reasoning variants at an 8B parameter scale, developed by the ByteDance Seed team. Unlike traditional open-source LLMs that rely on human-crafted rules or annotated data to curate code pretraining datasets, Seed-Coder introduces a model-centric data pipeline. The pipeline processes raw data from GitHub and web archives into four categories: file-level code, repository-level code, GitHub commits, and code-related web data. A quality-filter LLM evaluates code for readability, modularity, clarity, and reusability, removing the lowest-scoring 10% to create a 6-trillion-token dataset covering 89 programming languages. Models: ByteDance-Seed/seed-coder-680de32c15ead6555c75b0e4 GitHub: https://github.com/ByteDance-Seed/Seed-Coder/tree/master Paper: https://github.com/ByteDance-Seed/Seed-Coder/blob/master/Seed-Coder.pdf
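The filtering step can be sketched as a simple score-and-drop loop. This is a hypothetical illustration, not Seed-Coder's actual pipeline: the real quality scores come from a trained LLM judge, whereas `quality_score` here is a toy heuristic stand-in.

```python
# Hypothetical sketch of model-centric quality filtering:
# score each code sample, then drop the lowest-scoring 10%.

def quality_score(code: str) -> float:
    """Stand-in for the LLM quality judgment (readability, modularity,
    clarity, reusability). Toy heuristic: fraction of commented lines."""
    lines = code.splitlines() or [""]
    commented = sum(1 for ln in lines if ln.strip().startswith("#"))
    return commented / len(lines)

def filter_bottom_decile(corpus: list[str]) -> list[str]:
    # Sort samples by ascending quality score and cut the bottom 10%.
    scored = sorted(corpus, key=quality_score)
    cutoff = len(scored) // 10
    return scored[cutoff:]

# Toy corpus: half the samples have a comment, half do not.
corpus = [f"# doc\nx = {i}" if i % 2 else f"x = {i}" for i in range(20)]
kept = filter_bottom_decile(corpus)
print(len(kept))  # 18
```

In the paper's setting this scoring is done by a model rather than a heuristic, which is the point of the "model-centric" pipeline: the filter's notion of quality is learned, not hand-written.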