AI & ML interests

Reinforcement Learning, Large Language Models, Value Alignment

Recent Activity

alignmentforever updated a dataset about 9 hours ago: PKU-Alignment/InterMT
alignmentforever updated a model about 13 hours ago: PKU-Alignment/InterMT-Judge
alignmentforever published a model about 19 hours ago: PKU-Alignment/InterMT-Judge