DeathGodlike

DeathGodlike

AI & ML interests

None yet

Recent Activity

liked a model about 1 month ago
turboderp/TinyLlama-1B-exl2
liked a model about 2 months ago
sleepdeprived3/Gemma3-T4
liked a model about 2 months ago
sleepdeprived3/Mistral-V3-Tekken-T4
View all activity

Organizations

None yet

DeathGodlike's activity

reacted to KaiChen1998's post with πŸ”₯ 3 months ago
view post
Post
4865
πŸ“’ Our EMOVA paper has been accepted by CVPR 2025, and we are glad to release all resources, including code (training & inference), datasets (training & evaluation), and checkpoints (EMOVA-3B/7B/72B)!

πŸ€— EMOVA is a novel end-to-end omni-modal LLM that can see, hear and speak. Given omni-modal (i.e., textual, visual and speech) inputs, EMOVA can generate both textual and speech responses with vivid emotional controls by utilizing the speech decoder and a style controller.

✨ EMOVA Highlights
βœ… State-of-the-art omni-modality: EMOVA achieves SoTA comparable results on both vision-language and speech benchmarks simultaneously.
βœ… Device adaptation: our codebase supports training/inference on both NVIDIA GPUs (e.g., A800 & H20) and Ascend NPUs (e.g., 910B3)!
βœ… Modular design: we integrate multiple implementations of vision encoder, vision projector, and language model, even including the most recent DeepSeekMoE-tiny!

πŸ”₯ You are all welcome to try and star!
- Project page: https://emova-ollm.github.io/
- Github: https://github.com/emova-ollm/EMOVA
- Demo: Emova-ollm/EMOVA-demo
reacted to clem's post with ❀️ 3 months ago
view post
Post
4684
We just crossed 1,500,000 public models on Hugging Face (and 500k spaces, 330k datasets, 50k papers). One new repository is created every 15 seconds. Congratulations all!
Β·
reacted to tomaarsen's post with πŸš€πŸ”₯ 3 months ago
view post
Post
6775
An assembly of 18 European companies, labs, and universities have banded together to launch πŸ‡ͺπŸ‡Ί EuroBERT! It's a state-of-the-art multilingual encoder for 15 European languages, designed to be finetuned for retrieval, classification, etc.

πŸ‡ͺπŸ‡Ί 15 Languages: English, French, German, Spanish, Chinese, Italian, Russian, Polish, Portuguese, Japanese, Vietnamese, Dutch, Arabic, Turkish, Hindi
3️⃣ 3 model sizes: 210M, 610M, and 2.1B parameters - very very useful sizes in my opinion
➑️ Sequence length of 8192 tokens! Nice to see these higher sequence lengths for encoders becoming more common.
βš™οΈ Architecture based on Llama, but with bi-directional (non-causal) attention to turn it into an encoder. Flash Attention 2 is supported.
πŸ”₯ A new Pareto frontier (stronger *and* smaller) for multilingual encoder models
πŸ“Š Evaluated against mDeBERTa, mGTE, XLM-RoBERTa for Retrieval, Classification, and Regression (after finetuning for each task separately): EuroBERT punches way above its weight.
πŸ“ Detailed paper with all details, incl. data: FineWeb for English and CulturaX for multilingual data, The Stack v2 and Proof-Pile-2 for code.

Check out the release blogpost here: https://huggingface.co/blog/EuroBERT/release
* EuroBERT/EuroBERT-210m
* EuroBERT/EuroBERT-610m
* EuroBERT/EuroBERT-2.1B

The next step is for researchers to build upon the 3 EuroBERT base models and publish strong retrieval, zero-shot classification, etc. models for all to use. I'm very much looking forward to it!
  • 1 reply
Β·
reacted to DualityAI-RebekahBogdanoff's post with πŸš€ 3 months ago
view post
Post
2854
πŸš€ Duality is super excited to announce that our Kaggle competition is LIVE! Synthetic-to-Real Object Detection Challenge is LIVE! 🚦
Want to master AI training, learn industry-proven synthetic data workflows, and compete for public recognition and cash prizes?

πŸ‘‰ Join our Synthetic-to-Real Object Detection Challenge on Kaggle! https://www.kaggle.com/competitions/synthetic-2-real-object-detection-challenge/overview

Compete to build the top-performing model capable of detecting real-world objectsβ€”trained entirely on synthetic data. Master these industry-proven methods for faster, more targeted, and diverse dataset creation, and set yourself apart, unlocking today's most exciting AI opportunities.

Ready to test your skills?

πŸ† The Challenge

Train an object detection model using synthetic images created with Falconβ€”Duality AI's cutting-edge digital twin simulation softwareβ€”then evaluate your model on real-world imagery.

The Twist?
πŸ“ˆ Boost your model’s accuracy by creating and refining your own custom synthetic datasets using Falcon! Get access to the tools and double the data by following this link and creating a free account-
https://falcon.duality.ai/secure/documentation/ex-1-objdetection?sidebarMode=learn

Win Cash Prizes & Recognition
πŸ”Ή Earn cash and public shout-outs from the Duality AI accounts
Enhance Your Portfolio
πŸ”Ή Demonstrate your real-world AI and ML expertise in object detection to prospective employers and collaborators.
Expand Your Network
πŸ”Ή Engage, compete, and collaborate with fellow ML engineers, researchers, and students.

πŸš€ Put your skills to the test and join our Kaggle competition today: https://www.kaggle.com/competitions/synthetic-2-real-object-detection-challenge/overview
reacted to andito's post with πŸ”₯ 3 months ago
view post
Post
2840
Extremely bullish on @CohereForAI 's Aya Vision (8B & 32B) - new SOTA open-weight VLMs

- 8B wins up to 81% of the time in its class, better than Gemini Flash
- 32B beats Llama 3.2 90B!
- Covers 23 languages, excels in image captioning, VQA & more
- Integrated on transformers from Day 0!

Efficient multimodal models are here to stay!!πŸ”₯
Check out their blog! https://huggingface.co/blog/aya-vision
reacted to samihalawa's post with πŸ”₯ 3 months ago
view post
Post
2861
πŸ₯³πŸ₯³Just achieved 25m 59s of research with plain ChatGPT πŸ”₯ Had it doing a complete internet search in just ONE call visiting 443 websites! Hard to beat huh!
PROMPT IN COMMENTS
Check out the Massive Article created by the prompt:
https://huggingface.co/blog/luigi12345/automating-lead-generation-with-ai
Β·
reacted to Locutusque's post with πŸ‘ 3 months ago
view post
Post
2937
πŸŽ‰ Exciting news, everyone! I've just released **Thespis-Llama-3.1-8B**, a new language model designed for enhanced roleplaying! ✨️

It's built on Llama-3.1 and fine-tuned with a focus on Theory of Mind reasoning to create more believable and engaging characters. It even learned a few tricks on its own, like adding in-character thought processes! 🧠

Check it out here: Locutusque/Thespis-Llama-3.1-8B

Give it a try and let me know what you think! I'm especially interested in feedback on how well the characters stay in role and if the responses feel natural. Looking forward to seeing what amazing stories you create! ✍️
reacted to onekq's post with πŸ”₯πŸš€ 3 months ago
view post
Post
2768
Necessity is mother of invention. To understand ⚑FlashMLA⚑ by
πŸ‹DeepSeek πŸ‹, the first question to ask is why.

The keyword here is H800, a lower-end product tailored for export control. The purpose here is to squeeze out as much performance as possible.

But here is the most important takeaway: this invention benefits EVERYONE.
  • 2 replies
Β·
reacted to JingzeShi's post with πŸš€ 3 months ago