Dynamic Noise Preference Optimization for LLM Self-Improvement via Synthetic Data

Under review at ICLR 2025, 2024