Hugging Face's logo Hugging Face
  • Models
  • Datasets
  • Spaces
  • Docs
  • Enterprise
  • Pricing

  • Log In
  • Sign Up
ernanhughes 's Collections
programmer.ie

programmer.ie

updated 6 days ago

Papers I have written about on my blog.

Upvote
-

  • MARS: A Multi-Agent Framework Incorporating Socratic Guidance for Automated Prompt Optimization

    Paper • 2503.16874 • Published Mar 21 • 44

  • System Prompt Optimization with Meta-Learning

    Paper • 2505.09666 • Published 23 days ago • 69

  • UniRL: Self-Improving Unified Multimodal Models via Supervised and Reinforcement Learning

    Paper • 2505.23380 • Published 8 days ago • 23

  • DeepTheorem: Advancing LLM Reasoning for Theorem Proving Through Natural Language and Reinforcement Learning

    Paper • 2505.23754 • Published 8 days ago • 15

  • Guided by Gut: Efficient Test-Time Scaling with Reinforced Intrinsic Confidence

    Paper • 2505.20325 • Published 14 days ago • 44
Upvote
-
  • Collection guide
  • Browse collections
Company
TOS Privacy About Jobs
Website
Models Datasets Spaces Pricing Docs