Learning Reliable Rewards for LLMs via RLHF: Robustness, Adaptivity and Beyond
October 14th, 2025 by Tuo Zhao
Join us and create the career you love
Work in a collaborative, open-ended, publish-friendly environment, and build AI technology on top of the rich visual graph structure inherent to Pinterest, and ship products to 550M+ users.