Dimitris Roussis
droussis
AI & ML interests
All things data for LLMs, NMT, evaluation, safety, alignment, and more
Recent Activity
updated
a Space
2 days ago
ilsp/README
upvoted
a collection
2 days ago
Krikri 8B
updated
a collection
2 days ago
Krikri 8B
Organizations
droussis's activity
Question about number of samples and questions
👍
1
6
#4 opened about 1 month ago
by
MeiGao
Thinking token generation
👍
1
3
#2 opened 3 months ago
by
thtang
What languages were you trained in?
👍
1
2
#7 opened 2 months ago
by
NickyNicky

Bug on the tokenizer, using the code that you provided for the inference.
6
#2 opened 3 months ago
by
Ptrnk
Seems very promising
❤️
5
6
#1 opened 3 months ago
by
gstrat88
Is this the same as Kurage?
2
#2 opened 6 months ago
by
droussis

About context size and difference in quality
3
#1 opened about 1 year ago
by
droussis

Future plans (Llama 3?)
👍
1
1
#3 opened about 1 year ago
by
velocity

LLama-Factory inference issue
👍
2
14
#2 opened about 1 year ago
by
ianss
Regarding quality assessment
2
#1 opened over 1 year ago
by
droussis

Community request: more languages
❤️
1
8
#1 opened over 1 year ago
by
emre

Which part of HC3?
#1 opened over 1 year ago
by
droussis

The model output is totally corrupted
👍
4
4
#5 opened over 1 year ago
by
fernandofernandes

Fix weights by putting the right value in `lm_head.weight`
👍
1
3
#3 opened almost 2 years ago
by
sgugger
