license: mit

This is the DPO model trained on top of Goedel-Prover-SFT. Goedel-LM/Goedel-Prover-DPO achieves over 60% on miniF2F at Pass@32.
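
For readers unfamiliar with the metric, here is a minimal sketch of how a Pass@32-style score could be tallied. It assumes Pass@k counts a problem as solved when at least one of k sampled proofs passes verification. The function name `pass_at_k`, the toy data, and the problem labels are all hypothetical and are not taken from the Goedel-Prover evaluation code.

```python
from typing import Dict, List


def pass_at_k(results: Dict[str, List[bool]], k: int) -> float:
    """Fraction of problems with at least one verified proof among the first k samples.

    `results` maps a problem name to a list of booleans, one per sampled proof,
    where True means the proof checker accepted that sample (assumed convention).
    """
    if not results:
        raise ValueError("results must contain at least one problem")
    solved = sum(1 for attempts in results.values() if any(attempts[:k]))
    return solved / len(results)


# Hypothetical verification outcomes for four problems, four samples each.
toy_results = {
    "problem_a": [False, True, False, False],
    "problem_b": [False, False, False, False],
    "problem_c": [True, True, False, True],
    "problem_d": [False, False, False, True],
}

print(pass_at_k(toy_results, k=4))
```

With 32 samples per miniF2F problem and `k=32`, the same tally would give a Pass@32 figure under the assumption stated above.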

Citation

@misc{lin2025goedelproverfrontiermodelopensource,
      title={Goedel-Prover: A Frontier Model for Open-Source Automated Theorem Proving}, 
      author={Yong Lin and Shange Tang and Bohan Lyu and Jiayun Wu and Hongzhou Lin and Kaiyu Yang and Jia Li and Mengzhou Xia and Danqi Chen and Sanjeev Arora and Chi Jin},
      year={2025},
      eprint={2502.07640},
      archivePrefix={arXiv},
      primaryClass={cs.LG},
      url={https://arxiv.org/abs/2502.07640}, 
}