PARM: Multi-Objective Test-Time Alignment via Preference-Aware Autoregressive Reward ModelJan 1, 2025·Baijiong Lin,Weisen Jiang,Yuancheng Xu,Hao ChenYing-Cong Chen· 0 min read PDF Cite CodeType1PublicationProceedings of the International Conference on Machine Learning (ICML)Last updated on Mar 19, 2026 AuthorsYing-Cong ChenAssistant Professor ← Orchestrating Audio: Multi-Agent Framework for Long-Video Audio Synthesis Jan 1, 2025POSTA: A Go-to Framework for Customized Artistic Poster Generation Jan 1, 2025 →