Yiming Zheng

ZYM666

AI & ML interests

None yet

Recent Activity

upvoted an article about 1 month ago
Transformers v5: Simple model definitions powering the AI ecosystem
liked a dataset 3 months ago
gaia-benchmark/GAIA
liked a dataset 4 months ago
Multilingual-Multimodal-NLP/TableBench

Organizations

T-MARS

ZYM666's collections (1)

Alignment
  • Direct Preference Optimization: Your Language Model is Secretly a Reward Model

    Paper • 2305.18290 • Published May 29, 2023 • 64
  • Towards Efficient and Exact Optimization of Language Model Alignment

    Paper • 2402.00856 • Published Feb 1, 2024 • 1
  • A General Theoretical Paradigm to Understand Learning from Human Preferences

    Paper • 2310.12036 • Published Oct 18, 2023 • 19
  • Statistical Rejection Sampling Improves Preference Optimization

    Paper • 2309.06657 • Published Sep 13, 2023 • 14