Our EMOVA paper has been accepted by CVPR 2025, and we are glad to release all resources, including code (training & inference), datasets (training & evaluation), and checkpoints (EMOVA-3B/7B/72B)!
EMOVA is a novel end-to-end omni-modal LLM that can see, hear, and speak. Given omni-modal (i.e., textual, visual, and speech) inputs, EMOVA can generate both textual and speech responses with vivid emotional control by utilizing a speech decoder and a style controller.
EMOVA Highlights
- State-of-the-art omni-modality: EMOVA achieves results comparable to the state of the art on both vision-language and speech benchmarks simultaneously.
- Device adaptation: our codebase supports training and inference on both NVIDIA GPUs (e.g., A800 & H20) and Ascend NPUs (e.g., 910B3)!
- Modular design: we integrate multiple implementations of the vision encoder, vision projector, and language model, including the most recent DeepSeekMoE-tiny!
We just crossed 1,500,000 public models on Hugging Face (and 500k spaces, 330k datasets, 50k papers). One new repository is created every 15 seconds. Congratulations all!
Reacted to tomaarsen's post, 3 months ago:
A consortium of 18 European companies, labs, and universities has banded together to launch EuroBERT! It's a state-of-the-art multilingual encoder for 15 European languages, designed to be fine-tuned for retrieval, classification, etc.
- 15 languages: English, French, German, Spanish, Chinese, Italian, Russian, Polish, Portuguese, Japanese, Vietnamese, Dutch, Arabic, Turkish, Hindi
- 3 model sizes: 210M, 610M, and 2.1B parameters - very useful sizes in my opinion
- Sequence length of 8192 tokens! Nice to see these higher sequence lengths for encoders becoming more common.
- Architecture based on Llama, but with bi-directional (non-causal) attention to turn it into an encoder. Flash Attention 2 is supported.
- A new Pareto frontier (stronger *and* smaller) for multilingual encoder models
- Evaluated against mDeBERTa, mGTE, and XLM-RoBERTa for retrieval, classification, and regression (after fine-tuning for each task separately): EuroBERT punches way above its weight.
- Detailed paper with all details, incl. data: FineWeb for English and CulturaX for multilingual data, The Stack v2 and Proof-Pile-2 for code.
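The decoder-to-encoder change mentioned above (swapping Llama's causal attention for bi-directional attention) can be illustrated with plain attention masks. This is a toy sketch of the general idea, not EuroBERT's actual implementation; the helper names and NumPy usage are mine:

```python
import numpy as np

def causal_mask(seq_len):
    """Lower-triangular mask used by decoder-only models like Llama:
    position i may only attend to positions j <= i (no peeking ahead)."""
    return np.tril(np.ones((seq_len, seq_len), dtype=bool))

def bidirectional_mask(seq_len, padding=None):
    """Full mask used by encoders: every (non-padding) position may
    attend to every other (non-padding) position, past and future."""
    mask = np.ones((seq_len, seq_len), dtype=bool)
    if padding is not None:
        # Optionally block attention to/from padding positions.
        mask &= padding[None, :] & padding[:, None]
    return mask

# For a 4-token sequence: the causal mask allows only the lower
# triangle (10 query/key pairs), the bidirectional mask allows all 16.
print(causal_mask(4).sum())         # 10
print(bidirectional_mask(4).sum())  # 16
```

The only architectural difference sketched here is which query/key pairs are allowed; everything else (the Llama block structure) stays the same, which is why an encoder can reuse a decoder design so directly.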
The next step is for researchers to build upon the 3 EuroBERT base models and publish strong retrieval, zero-shot classification, etc. models for all to use. I'm very much looking forward to it!
Duality is super excited to announce that our Kaggle competition, the Synthetic-to-Real Object Detection Challenge, is LIVE! Want to master AI training, learn industry-proven synthetic data workflows, and compete for public recognition and cash prizes?
Compete to build the top-performing model capable of detecting real-world objects, trained entirely on synthetic data. Master these industry-proven methods for faster, more targeted, and more diverse dataset creation, and set yourself apart by unlocking today's most exciting AI opportunities.
Ready to test your skills?
The Challenge
Train an object detection model using synthetic images created with Falcon, Duality AI's cutting-edge digital twin simulation software, then evaluate your model on real-world imagery.
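Object-detection challenges are typically scored with metrics built on Intersection-over-Union (IoU) between predicted and ground-truth boxes; the post doesn't state this competition's exact metric, so the following is only a minimal sketch of IoU itself, with box format and function name chosen by me:

```python
def iou(box_a, box_b):
    """Intersection-over-Union of two axis-aligned boxes
    given as (x1, y1, x2, y2) corner coordinates."""
    # Intersection rectangle (empty if the boxes don't overlap).
    ix1, iy1 = max(box_a[0], box_b[0]), max(box_a[1], box_b[1])
    ix2, iy2 = min(box_a[2], box_b[2]), min(box_a[3], box_b[3])
    inter = max(0.0, ix2 - ix1) * max(0.0, iy2 - iy1)

    area_a = (box_a[2] - box_a[0]) * (box_a[3] - box_a[1])
    area_b = (box_b[2] - box_b[0]) * (box_b[3] - box_b[1])
    union = area_a + area_b - inter
    return inter / union if union > 0 else 0.0

# Two 10x10 boxes overlapping in a 5x10 strip: IoU = 50 / 150 = 1/3.
print(iou((0, 0, 10, 10), (5, 0, 15, 10)))
```

A prediction is usually counted as correct when its IoU with a ground-truth box exceeds some threshold (0.5 is a common choice), which is how aggregate metrics like mAP are built up.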
Win Cash Prizes & Recognition
- Earn cash and public shout-outs from the Duality AI accounts.
Enhance Your Portfolio
- Demonstrate your real-world AI and ML expertise in object detection to prospective employers and collaborators.
Expand Your Network
- Engage, compete, and collaborate with fellow ML engineers, researchers, and students.
Extremely bullish on @CohereForAI's Aya Vision (8B & 32B) - new SOTA open-weight VLMs
- 8B wins up to 81% of the time in its class, better than Gemini Flash
- 32B beats Llama 3.2 90B!
- Covers 23 languages; excels in image captioning, VQA & more
- Integrated into transformers from day 0!
Just achieved 25m 59s of research with plain ChatGPT! Had it doing a complete internet search in just ONE call, visiting 443 websites! Hard to beat, huh? PROMPT IN COMMENTS. Check out the massive article created by the prompt: https://huggingface.co/blog/luigi12345/automating-lead-generation-with-ai
Exciting news, everyone! I've just released **Thespis-Llama-3.1-8B**, a new language model designed for enhanced roleplaying!
It's built on Llama-3.1 and fine-tuned with a focus on Theory of Mind reasoning to create more believable and engaging characters. It even learned a few tricks on its own, like adding in-character thought processes!
Give it a try and let me know what you think! I'm especially interested in feedback on how well the characters stay in role and whether the responses feel natural. Looking forward to seeing what amazing stories you create!