Ploutos, 02 June 2025
R3: Robust Rubric-Agnostic Reward Models
A technical talk to Machine Learning researchers, professionals and entrepreneurs based on previously published paper R3: Robust Rubric-Agnostic Reward Models.
Ploutos, 02 June 2025
A technical talk to Machine Learning researchers, professionals and entrepreneurs based on previously published paper R3: Robust Rubric-Agnostic Reward Models.
Toronto Machine Learning Summit, 15 July 2024
A technical talk to Machine Learning researchers, professionals and entrepreneurs based on previously published paper ProxyLM: Predicting Language Model Performance on Multilingual Tasks via Proxy Models.