Chinmaya Andukuri @chinmaya_mohan

applied research @CapitalOne, previously @StanfordAILab / @StanfordHAI

scandukuri.github.io

Joined December 2023

Tweets: 18

Followers: 53

Following: 291

Likes: 34

Chinmaya Andukuri @chinmaya_mohan

5 days ago

have been enjoying dipping my toes into `verifiers` and @PrimeIntellect environments hub - just pushed an eval environment for MultiChallenge (@scale_AI) to the Environments Hub

my env: app.primeintellect.ai/dashboard/envi…

main page: scale.com/leaderboard/mu…

0 0 0 28 0

Chinmaya Andukuri @chinmaya_mohan

12 months ago

if you’re at @COLM_conf, come say hi tomorrow and talk to us about LM self-improvement + clarification!

Philipp Fränken @jphilippfranken

12 months ago

if you’re at @COLM_conf, come say hi tomorrow and talk to us about LM self-improvement + clarification!

0 3 24 4K 2

0 3 9 1K 1

Philipp Fränken @jphilippfranken

a year ago

Constitutional AI showed LMs can learn to follow constitutions by labeling their own outputs. But why can't we just tell a base model the principles of desired behavior and rely on it to act appropriately? Introducing SAMI: Self-Supervised Alignment with Mutual Information!

3 35 153 80K 138

Philipp Fränken @jphilippfranken

a year ago

Excited to share OffTheRails: A moral reasoning benchmark beyond trolley problems! We present a simple prompting pipeline for generating moral reasoning evaluations with language models using causal templates 🔵→🟠

1 9 48 10K 18

Kanishk Gandhi @gandhikanishk

a year ago

Language models struggle to search, not due to an architecture problem, but a data one! They rarely see how to search or backtrack. We show how LLMs can be taught to search by representing the process of search in language as a flattened string, a stream of search (SoS)!

8 108 586 104K 544

Rafael Rafailov @ NeurIPS @rm_rafailov

2 years ago

Multi-turn interactive RL should be a bigger focus. Current methods are not well-suited for this - i.e. PPO can't train with user in the loop generally and offline Q-learning still does not work at scale. It's interesting to see more work in that direction.

noahdgoodman @noahdgoodman

consuelo Pittman @aroliamkay

Heidi @Vwuogal937

Shubhra Mishra @shubhramishra_

Theresa @Ovwipif728

PageChaucer @w32pnDWYtPhBv

Pepe.init @PepeInit

Max Lamparth @MLamparth

Ethan Liu @ethantsliu

Wen-Ding Li @xu3kev

Chloe @amiyukie12552

Jacob X. Li @jacobli99

Raj Palleti @ COLM 20... @rajpalleti314

merve @mervenoyann

Sarah Chieng @SarahChieng

Ankit Gupta @AnkitGuptaAI

Mike Allton | AI for ... @mike_allton

Souradip Chakraborty @SOURADIPCHAKR18

Genta Winata @gentaiscool

Mingyang Zhou @MingyangKevinZh

Omar Shaikh @oshaikh13

Fausto Pedro Garcia M... @faustospain

Zuxin Liu @LiuZuxin

Bang An @bang_an_

arion das @ArionDas

Ruibo Liu @RuiboLiu

Yiran Wu @YiranWu18

Jean Paul @JeanPau70893016

Hongli Zhan ✈️ IC... @HongliZhan

Abdelrahman Zayed @AbdelZayed1

Boyuan Zheng@ICML @boyuan__zheng

Deqing Fu @DeqingFu

Yiming Zhang @yimingz0

Javier Rando @javirandor

Xinyue Liu @irisiris_l

Jiayi Geng @JiayiiGeng

quazo @quazotheduck

Lichang Chen @LichangChen2

AIProductDB @AIProductDB

Joe Mayo @JoeMayo

Toothirmp @Toothirmpohtos

Jinen Setpal @48bitmachine

Michael Y. Li @michaelyli__

xiaodong dong @Andy214_Dong

Ashutosh Mehra @ashutoshmehra

Violet X. @ZiyuX

Philipp Fränken @jphilippfranken

MoonRide @moonride303

Max @maxzpchen

Zeecoder @ZianaOtoyi

Zory Zhang @zory_zhang

Julian Schrittwieser @Mononofu

Stephen McAleer @McaleerStephen

fatih kadir akın @fkadev

Synthetic Users @syntheticusers

Parth Chadha @parth_29

Arnav Singhvi @arnav_thebigman

Constellation Researc... @constellationr

Apollo Research @apolloaievals

Shubhra Mishra @shubhramishra_

Ezgi Korkmaz @EzgiKorkmazAI

Aran Komatsuzaki @arankomatsuzaki

Pluralis Research @PluralisHQ

Yi Tay @YiTayML

Jiaxin Zhang @jxzhangjhu

vincent @vvhuang_

Neil Chowdhury @ChowdhuryNeil

Zory Zhang @zory_zhang