German-English models, mostly merged, some sft/dpo
cstr
cstr
AI & ML interests
None yet
Recent Activity
reacted
to
hesamation's
post
with 👀
5 days ago
this paper has been blowing up
they train an open-source multimodal LLM (InternVL3) that can compete with GPT-4o and Claude 3.5 Sonnet by:
> training text and vision on a single stage
> a novel V2PE positional encoding
> SFT & mixed preference optimization
Paper: https://huggingface.co/papers/2504.10479
> test-time scaling
liked
a model
11 days ago
Revai/reverb-diarization-v2
liked
a model
12 days ago
diarizers-community/speaker-segmentation-fine-tuned-callhome-deu
Organizations
Collections
2
spaces
5
models
96
cstr/paraphrase-multilingual-MiniLM-L12-v2-mlx
Sentence Similarity
•
Updated
•
9
cstr/DeepSeek-R1-Distill-Llama-8B-abliterated-Q4_K_M-GGUF
Updated
•
4
cstr/aya-expanse-8b-Q4_K_M-GGUF
Updated
•
3
cstr/Ministral-8B-Instruct-2410-GGUF
Updated
•
3
•
1
cstr/whisper-large-v3-turbo-german-ggml
Automatic Speech Recognition
•
Updated
cstr/whisper-large-v3-turbo-german-int8_float32
Automatic Speech Recognition
•
Updated
•
32
•
1
cstr/salamandra-7b-instruct-GGUF
Text Generation
•
Updated
•
44
•
2
cstr/whisper-large-v3-turbo-int8_float32
Automatic Speech Recognition
•
Updated
•
58
cstr/llama3.1-8b-spaetzle-v119
Updated
•
2
cstr/llama3.1-8b-spaetzle-v90
Updated
•
8
•
2
datasets
9
cstr/mistralorpo_conv
Viewer
•
Updated
•
21.6k
•
21
cstr/phi3orpo
Viewer
•
Updated
•
2.62k
•
24
cstr/capybara_de_sharegpt
Viewer
•
Updated
•
16k
•
23
cstr/hermes_de_sharegpt
Viewer
•
Updated
•
205k
•
30
cstr/Capybara-de-snippets
Updated
•
87
cstr/intel_orca_dpo_pairs_de
Viewer
•
Updated
•
12.9k
•
34
•
2
cstr/ultrafeedback-binarized-preferences-cleaned-de-2
Viewer
•
Updated
•
664
•
25
cstr/ultrafeedback-binarized-preferences-cleaned-de
Viewer
•
Updated
•
8.93k
•
24
cstr/ultrafeedback-binarized-preferences-cleaned-de-3
Viewer
•
Updated
•
3.44k
•
36