
Adam Fetzer

Rexschwert

AI & ML interests

AI, Big Data, Data Science, Machine Learning, Computer Vision, Natural Language Processing

Recent Activity

liked a dataset 6 days ago
OmniSVG/MMSVG-Illustration
liked a dataset 9 days ago
HuggingFaceM4/the_cauldron

Organizations

Rexschwert

Rexschwert's activity

reacted to nyuuzyou's post with 🔥👍 10 days ago
🇷🇺 Russian Forum Messages Dataset - nyuuzyou/ruforum

Collection of approximately 58 million Russian forum messages featuring:

- Complete message content from Russian online forums spanning 2010-2025
- Comprehensive metadata including unique message IDs and timestamps
- Full text content preserving original user discussions and interactions
- Monolingual dataset focused exclusively on Russian language content

This dataset offers a unique textual archive of Russian online conversations suitable for text generation, sentiment analysis, and language modeling research. Released to the public domain under the CC0 1.0 license.
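Because each message carries a unique ID and a timestamp, slicing the corpus by period is straightforward. A minimal sketch of that filtering over hypothetical records; the field names (`id`, `timestamp`, `text`) are assumptions for illustration, not the dataset's documented schema:

```python
# Sketch: filter forum messages to a year range using per-message
# Unix timestamps. Field names are hypothetical, not the real schema.
from datetime import datetime, timezone

def messages_between(messages, start_year, end_year):
    """Keep messages whose Unix timestamp falls within [start_year, end_year]."""
    lo = datetime(start_year, 1, 1, tzinfo=timezone.utc).timestamp()
    hi = datetime(end_year + 1, 1, 1, tzinfo=timezone.utc).timestamp()
    return [m for m in messages if lo <= m["timestamp"] < hi]

# Toy records: one inside the 2010-2025 window, one outside it.
sample = [
    {"id": 1, "timestamp": 1262304000, "text": "..."},  # 2010-01-01 UTC
    {"id": 2, "timestamp": 1893456000, "text": "..."},  # 2030-01-01 UTC
]
print(len(messages_between(sample, 2010, 2025)))  # 1
```

The same predicate works unchanged as a filter function when streaming the dataset, since it only touches the metadata fields.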
reacted to mlabonne's post with 🔥 about 1 month ago
reacted to tomaarsen's post with 🧠 about 1 month ago
🇪🇺 An assembly of 18 European companies, labs, and universities has banded together to launch EuroBERT! It's a state-of-the-art multilingual encoder for 15 European and other widely spoken languages, designed to be finetuned for retrieval, classification, etc.

🇪🇺 15 Languages: English, French, German, Spanish, Chinese, Italian, Russian, Polish, Portuguese, Japanese, Vietnamese, Dutch, Arabic, Turkish, Hindi
3️⃣ 3 model sizes: 210M, 610M, and 2.1B parameters - very useful sizes, in my opinion
➡️ Sequence length of 8192 tokens! Nice to see these longer sequence lengths for encoders becoming more common.
⚙️ Architecture based on Llama, but with bi-directional (non-causal) attention to turn it into an encoder. Flash Attention 2 is supported.
🔥 A new Pareto frontier (stronger *and* smaller) for multilingual encoder models
📊 Evaluated against mDeBERTa, mGTE, and XLM-RoBERTa on Retrieval, Classification, and Regression (after finetuning for each task separately): EuroBERT punches way above its weight.
📝 Detailed paper, incl. data details: FineWeb for English and CulturaX for multilingual data, The Stack v2 and Proof-Pile-2 for code.

Check out the release blogpost here: https://huggingface.co/blog/EuroBERT/release
* EuroBERT/EuroBERT-210m
* EuroBERT/EuroBERT-610m
* EuroBERT/EuroBERT-2.1B

The next step is for researchers to build upon the 3 EuroBERT base models and publish strong retrieval, zero-shot classification, etc. models for all to use. I'm very much looking forward to it!
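For retrieval use, an encoder's per-token outputs are typically collapsed into one sentence embedding via mean pooling over the attention mask. A minimal NumPy sketch of that step; the arrays here are toy stand-ins for what a model such as EuroBERT would return through Hugging Face `transformers`:

```python
# Sketch: mean pooling of encoder token embeddings into one sentence
# vector, ignoring padding positions (a common retrieval recipe).
import numpy as np

def mean_pool(token_embeddings: np.ndarray, attention_mask: np.ndarray) -> np.ndarray:
    """Average token vectors where the mask is 1.

    token_embeddings: (seq_len, hidden_dim)
    attention_mask:   (seq_len,) with 1 for real tokens, 0 for padding
    """
    mask = attention_mask[:, None].astype(token_embeddings.dtype)
    summed = (token_embeddings * mask).sum(axis=0)
    count = np.maximum(mask.sum(), 1e-9)  # guard against all-padding input
    return summed / count

# Toy example: 3 tokens, the last one is padding and must not contribute.
emb = np.array([[1.0, 2.0], [3.0, 4.0], [100.0, 100.0]])
mask = np.array([1, 1, 0])
print(mean_pool(emb, mask))  # [2. 3.]
```

The resulting vectors can then be compared with cosine similarity for retrieval, which is exactly the kind of downstream model the post hopes researchers will publish on top of the base checkpoints.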