Gryphe Padar
Gryphe
Recent Activity
new activity
5 minutes ago
Gryphe/Pantheon-Proto-RP-1.8-30B-A3B:Data/Training requirements?
new activity
about 4 hours ago
Gryphe/Pantheon-Proto-RP-1.8-30B-A3B:How did you do the "rewriting pipeline?"
new activity
about 6 hours ago
Gryphe/Pantheon-Proto-RP-1.8-30B-A3B:Update Metadata
Gryphe's activity
Data/Training requirements?
2
#4 opened 1 day ago by ToastyPigeon

How did you do the "rewriting pipeline?"
2
#5 opened about 16 hours ago by marcuscedricridia
Update Metadata
#6 opened about 13 hours ago by dylanebert

Update generation_config.json
#1 opened about 17 hours ago by lucyknada

Update generation_config.json
#3 opened 5 days ago by lucyknada

This is the first Qwen3 A3B model that doesn't immediately start repeating itself
3
#2 opened 6 days ago by SuperbEmphasis
Feedback after some use
❤️
👍
2
4
#1 opened 7 days ago by AlecFoster
Latest Pantheon release
#264 opened 13 days ago by Gryphe

[Why?]
1
#2 opened about 2 months ago by Darkknight535

Fix max_model_length (set to 128k)
#3 opened 2 months ago by mrfakename

24B version?
8
#2 opened 4 months ago by AuriAetherwiing

Bigger models of this?
👍
👀
2
3
#6 opened 4 months ago by Adzeiros

Sample of training data format?
3
#7 opened 3 months ago by jaypickle
You have my utmost respect.
2
#3 opened 3 months ago by AliceThirty
But Censored!?
5
#2 opened 3 months ago by DarkCesare
Odd punctuation formatting
1
#5 opened 4 months ago by Shifusen

Repetition and token leaking
14
#3 opened 4 months ago by Varkoyote

Dataset
👀
3
1
#1 opened 4 months ago by mrfakename

Great RP model in only 12B! A few notes and sampler settings for llama.cpp server inside.
👍
1
2
#2 opened 4 months ago by ubergarm