2025 MATH-AI@NeurIPS Simultaneous Multi-objective Alignment Across Verifiable and Non-verifiable Rewards Yiran Shen, Yu Xia, Jonathan Chang, and Prithviraj Ammanabrolu 2025 arXiv Code Under Review Preference-Based Learning in Audio Applications: A Systematic Analysis Aaron Broukhim, Yiran Shen, Prithviraj Ammanabrolu, and Nadir Weibel 2025 arXiv EMNLP SAND: Boosting LLM Agents with Self-Taught Action Deliberation Yu Xia, Yiran Shen, Junda Wu, Tong Yu, Sungchul Kim, Ryan A Rossi, Lina Yao, and Julian McAuley 2025 arXiv COLM In-context Ranking Preference Optimization Junda Wu, Rohan Surana, Zhouhang Xie, Yiran Shen, Yu Xia, Tong Yu, Ryan A Rossi, Prithviraj Ammanabrolu, and Julian McAuley 2025 arXiv COLM A Survey on Personalized and Pluralistic Preference Alignment in Large Language Models Zhouhang Xie, Junda Wu, Yiran Shen, Yu Xia, Xintong Li, Aaron Chang, Ryan Rossi, Sachin Kumar, Bodhisattwa Prasad Majumder, Jingbo Shang, Prithviraj Ammanabrolu, and Julian McAuley 2025 arXiv