We are a researcher community developing scientifically grounded research outputs and robust deployment infrastructure for broader impact evaluations.
evalevalai.com
Joined June 2025
🚨 New blog: The AI Evaluation Chart Crisis 📝
From misleading bar heights to missing error bars, recent model launches have sparked debate on AI evals. In our new blogpost, we dig into what’s broken, why it matters, and how evals should be presented 👇
evalevalai.com/documentation/…
Join us for the Eval Eval Coalition Social at @FAccTConference tomorrow, Tuesday, June 24th, from 4-4:30 pm during the coffee break! We would love to have you join us and we look forward to seeing you there!! #FAccT2025 #EvalEval
85 Followers 365 Following
co-founder at @anthromindinc | ex-google AI engineer | building next-gen scalable oversight systems for AI | https://t.co/2EFb7GMpjC
298 Followers 432 Following
PhD Student in Language Analysis and Processing at @upvehu @Hitz_zentroa @IxaTaldea. Working on Improving Language Models for Low-resource Languages.
1K Followers 2K Following
Science of AI evaluations + U.S. AI policy @RANDCorporation | @Harvard_Law '26, @SchwarzmanOrg '23, @GTOMSCS '22 | Views mine only 🏳️‍🌈 🎉
247 Followers 925 Following
PhD-ing @uniofoxford researching LLM explainability and interpretability + doing some evals work along the way | Applied AI @The_IGC | Prev @Cambridge_Uni
313 Followers 486 Following
Lisa is my name, AI governance is my game @ interface | affiliate @RANDCorporation | Prev. @GovAI_ @BCG @LSEnews | https://t.co/4EfEYQfEzY
she/her
247 Followers 497 Following
three large language models in a trench coat
phd student @ harvard psych
social cognition, models, moral reasoning, culture, methods, scalable oversight
738 Followers 1K Following
Long document understanding, Multilingual Evals and efficient models mainly, but other #NLProc applications in free time | vim enthusiast
2K Followers 2K Following
Assistant Professor of Computer Science at ETH Zurich working in natural language processing (#NLProc), machine learning and education (#edtech).
6K Followers 407 Following
Evals @HuggingFace 🐍✨
"The future is already here, it’s just not very evenly distributed" (Gibson)
Not an AGI believer, LLMs are good at form not substance
57K Followers 858 Following
Figuring out AI @allen_ai, open models, RLHF, fine-tuning, etc
Contact via email.
Writes @interconnectsai
Wrote The RLHF Book
Mountain runner