mR3: Multilingual Rubric-Agnostic Reward Reasoning Models
David Anugraha, Shou-Yi Hung, Zilu Tang, En-Shiun Annie Lee, Derry Tanti Wijaya, Genta Indra Winata
Published in ICLR Poster, 2025
We introduce mR3, a massively multilingual, rubric-agnostic reward reasoning model trained on 72 languages.
