Steven Xia @steven_xia_
PhD Student @illinoisCDS studying SE \\ Undergrad @eceuoft 2T1 steven.cs.illinois.edu Champaign-Urbana, Illinois Joined February 2013-
Tweets57
-
Followers363
-
Following172
-
Likes1K
We're releasing a new iteration of SWE-bench, in collaboration with the original authors, to more reliably evaluate AI models on their ability to solve real-world software issues. openai.com/index/introduc…
I'm on the way to USENIX Security'24. We will present one paper about privacy-preserving app authentication. I'm also happy to discuss TEE-based AI security and other AI-related security topics. If you're interested, please get in touch with me! #usenix #USESEC2024
Introducing OpenAutoCoder-Agentless😺: A simple agentless solution solves 27.3% GitHub issues on SWE-bench Lite with ~$0.34 each, outperforming all open-source AI SW agents! It's fully open-source, try it out: 🧑💻github.com/OpenAutoCoder/… 📝huggingface.co/papers/2407.01…
Magicoder: Source Code Is All You Need paper page: huggingface.co/papers/2312.02… introduce Magicoder, a series of fully open-source (code, weights, and data) Large Language Models (LLMs) for code that significantly closes the gap with top code models while having no more than 7B…
In the past 6-mon release of HumanEval+ we have been improving its toolchain usability and dataset quality from v0.1.0 to v0.1.7 releases. 🔥 Now we release MBPP+, a new benchmark in EvalPlus v0.2.0: tinyurl.com/4pw82wb8 🧵
Introducing the EvalPlus leaderboard! evalplus.github.io/leaderboard.ht… 🔥28 models have been evaluated on coding HumanEval & HumanEval+ 🔥7B CodeLlama outperforms ~16B models e.g. StarCoder&CodeGen 🔥Phind-CodeLlama-34B-v2 and WizardCoder-Python-34B-V1 as open models both beat ChatGPT 🧵
OK now we have @steven_xia_ on his work with @YuxiangWei9 and @LingmingZhang, all of @plfmse. Talking about program repair using large pre-trained language models
We welcome everyone to try out 📚𝐇𝐮𝐦𝐚𝐧𝐄𝐯𝐚𝐥+! A dataset to reflect the "real" correctness of LLM-generated code. Using📚𝐇𝐮𝐦𝐚𝐧𝐄𝐯𝐚𝐥+ is the same as HumanEval. You can easily pip install it and evaluate in our prepared sandbox (optional). github.com/evalplus/evalp…
We welcome everyone to try out 📚𝐇𝐮𝐦𝐚𝐧𝐄𝐯𝐚𝐥+! A dataset to reflect the "real" correctness of LLM-generated code. Using📚𝐇𝐮𝐦𝐚𝐧𝐄𝐯𝐚𝐥+ is the same as HumanEval. You can easily pip install it and evaluate in our prepared sandbox (optional). github.com/evalplus/evalp… https://t.co/yxqWHdBaKZ
🚨 Evaluating LLM-generated code on datasets with just "3 test-cases" is NOT enough! 🚨 We built ✨HumanEval+✨: improving HumanEval with up to thousands of new tests to fully evaluate functional correctness of LLM generated code! @JiaweiLiu_ @YuyaoStarling @LingmingZhang
🚨 Evaluating LLM-generated code on datasets with just "3 test-cases" is NOT enough! 🚨 We built ✨HumanEval+✨: improving HumanEval with up to thousands of new tests to fully evaluate functional correctness of LLM generated code! @JiaweiLiu_ @YuyaoStarling @LingmingZhang https://t.co/qFuzrO9zac
Is Your Code Generated by ChatGPT Really Correct? Rigorous Evaluation of Large Language Models for Code Generation Proposes EvalPlus – a code synthesis benchmarking framework to rigorously evaluate the functional correctness of LLM-synthesized code. arxiv.org/abs/2305.01210
GAME WINNER! SERIES WINNER! LET’S. GO. LEAFS. NATION!!!!!!!!!
Today we will present our new work "NNSmith: Generating Diverse and Valid Test Cases for Deep Learning Compilers" @ASPLOSConf at Session 2A (1pm PT). Welcome to join our talk to learn how we build an automated tool for finding 70+ bugs (and more now!) for emerging DL compilers!
🚨LLMs are Zero-Shot Fuzzers! Excited to share our TitanFuzz🤖 work @issta23: #LLMs can be directly applied for both generative and mutative fuzzing, while being fully automated, generalizable, and applicable to domains challenging for traditional fuzzers (such as DL systems)🧵
2. Automated Program Repair in the Era of Large Pre-trained Language Models. Authors: Chunqiu Steven Xia (@steven_xia_), Yuxiang Wei (@YuxiangWei9), Lingming Zhang (@LingmingZhang). Pre-print: arxiv.org/abs/2210.14179

Talia Ringer 💚 @TaliaRinger
30K Followers 7K Following Professor, @plfmse, @IllinoisCS! Proof Automation. @SigplanM & CCF Founder. Israeli-American for peace, equality, justice. Mom. They/היא, ND, bi
Jiawei Liu @JiaweiLiu_
2K Followers 971 Following phd'ing at uiuc. software engineering x llms. hunting good programs. towards high-quality automation.
Natalie Enright Jerge... @nenrightjerger
1K Followers 454 Following Director, Division of Engineering Science, ECE Professor at University of Toronto. Research in Computer Architecture. Mother. She/her
Tianyin Xu @tianyin_xu
5K Followers 1K Following Watchman in a cornfield @IllinoisCDS @ECEILLINOIS @ACMSIGOPS
Reyhan @Reyhaneh
2K Followers 764 Following Assistant Professor of @plfmse at @IllinoisCS, Director of Intelligent CAT Lab (https://t.co/wO38Gqxs7n), PhD @UCIbrenICS, @Google PhD Fellow
Software Practices La... @ubc_spl
646 Followers 220 Following We study programming languages, verification, and software engineering in the Department of Computer Science at the University of British Columbia @ubc
Jintao Huang @JintaoHuang9
58 Followers 711 Following PhD student working on blockchain security. Views are mine.
Abhinav (・o�... @MajorTimbWlf21
400 Followers 1K Following Researcher in Responsible AI | intern @lossfunk | roots @nexttechlab | @FCBarcelona
Valerie Chen @valeriechen_
2K Followers 509 Following phd student @mldcmu @SCSatCMU + intern @allhands_ai | building @CopilotArena | previously @NYUDataScience @MSFTResearch @yale @CMU_Robotics @IBMResearch
0XYing labs @0xyingX
465 Followers 2K Following 猫宇宙|混乱中立|骑墙派|星舰地球|橙色药丸|市场是我验证哲学的实验室|自由是目标,规律是路径,收益是副产物|Crypto ∞ Consciousness
Fan Zhou @FaZhou_998
1K Followers 837 Following Qwen Coding @Alibaba_Qwen. Prev: Core member @XLangNLP, Intern @MSFTResearch.
Mehil Shah @shahMehil_
92 Followers 416 Following Ph.D. Student in Computer Science @DalhousieU, RA @RaiseDal, Working at the intersection of DL and SE to build reliable, safe and trustworthy DL Systems!
Vijay Bolina @vijaybolina
4K Followers 6K Following I build and lead deeply technical teams solving some of the hardest problems in the world. Prev CISO @GoogleDeepMind, @Mandiant, @BoozAllen, USG. Tweets my own.
Shan Reddy @rshanreddy
1K Followers 2K Following on a mission to make education more personal, more effective, and more human
Forever° @Forever1815584
50 Followers 4K Following
Andrey @andrey_firebear
6 Followers 445 Following
Zexiong Ma @mazexiong
2 Followers 63 Following
Sujith Joseph @sujithjoseph
124 Followers 2K Following
Fahad Shah @sfahad
948 Followers 7K Following @Leadership @DataScience @HP @AzureML @Happily Married 😊
jose Ruiz @joru1000
438 Followers 5K Following C-level Technology lead, strongly focused on Generative AI. Researching on practical production use cases across the Enterprise (yes... as everybody else)
Dung Doan @dungdx34
333 Followers 7K Following
Xiaoze Jin @xiaoze_jin
2K Followers 8K Following Familyman | Angel Investor | Entrepreneur | Connector | Techie/Hacker | Founders and Startups
Rajiv @jeeves
1K Followers 5K Following Public Data Works (PDW) building data tools for the public @_ipno_ LLEAD @city_bureau https://t.co/GWyI4saqda @invinst CPDP
Sergio Soage @Sergio_Soage
876 Followers 6K Following artificial intelligence, math. Random stuff @ https://t.co/tqV9OIPsWE
Frank Shiwei Feng @ShiweiFeng3
209 Followers 725 Following Ph.D. Candidate @PurdueCS | Ex-Intern @Apple Working on Trustworthy Autonomous System & AI Safety
Sophia Xu @thesophiaxu
4K Followers 3K Following applied epistemology | language machines languaging
Andrea Michi @andreamichi
2K Followers 1K Following Co-Founder @depthfirstlabs / Building intelligence to detect and remediate software vulnerabilities / Prev post-training / RL for Gemini @GoogleDeepMind
Marcelo Pérez @mperezjodal
179 Followers 854 Following AI Engineer, Philosophy Dropout, Winner of gold medal in National Mathematical Olympiad
Yiqing Xie @YiqingXieNLP
176 Followers 202 Following ✨ Synthetic data; Auto Eval; Code-Gen; 🎓 PhD student @LTIatCMU; MSCS @dmguiuc. 👩💻 previously Intern @meta; @MSFTResearch * 2; @AlibabaDAMO.
Justin T Chiu @justintchiu
653 Followers 911 Following generating code; phd in ml from Cornell; former Child
Amir @Amir_Mashmool
788 Followers 2K Following Doctoral Researcher in SE | Uni of Bremen | interested in Software Engineering, Program Comprehension, Empirical Software Engineering and related topics.
Hang @2076148h
3 Followers 520 Following
aikedaer999 @aikedaer999
0 Followers 103 Following
Albert Örwall @aorwall
196 Followers 448 Following Building Moatless Tools (https://t.co/TSKAwaVXmT) and https://t.co/DJDebZ3Qog
Rajko Radovanović @rajko_rad
6K Followers 5K Following AI/infra @a16z (partner to amazing teams eg @MistralAI @udiomusic @LumaLabsAI @cursor_ai @theworldlabs @braintrustdata) - alt @rajko_alt
Siqi Zhu @realagi25
229 Followers 2K Following cs phd @siebelschool, focusing on LLMs. prev. @Tsinghua_Uni @bytedancetalk
He Ye @ye_he_ye
218 Followers 195 Following Assistant professor at @ucl, co-founder of @Euni_AI, working on code agents.
oooisbdn @oooisbdn
2 Followers 48 Following
Talia Ringer 💚 @TaliaRinger
30K Followers 7K Following Professor, @plfmse, @IllinoisCS! Proof Automation. @SigplanM & CCF Founder. Israeli-American for peace, equality, justice. Mom. They/היא, ND, bi
Jiawei Liu @JiaweiLiu_
2K Followers 971 Following phd'ing at uiuc. software engineering x llms. hunting good programs. towards high-quality automation.
Caroline Lemieux @cestlemieux
2K Followers 200 Following https://t.co/jwo69lmnOx / https://t.co/Ap8qucFGBD
Natalie Enright Jerge... @nenrightjerger
1K Followers 454 Following Director, Division of Engineering Science, ECE Professor at University of Toronto. Research in Computer Architecture. Mother. She/her
Rohan Padhye @moarbugs
2K Followers 535 Following Assistant Professor at @S3DatCMU @SCSatCMU. PhD from @Berkeley_EECS. Connessiur of hot sauce.
Andreas Zeller @AndreasZeller
9K Followers 219 Following Software researcher at @CISPA. Testing and analyzing software for a better world. Find me at @[email protected] or @[email protected].
University of Toronto... @UofTEngineering
15K Followers 419 Following Official account of the University of Toronto Faculty of Applied Science & Engineering, Canada's #1 Engineering School 🇨🇦
Ningning Xie @xnningxie
4K Followers 308 Following Having fun with types! @UofTCompSci @GoogleDeepMind
Tianyin Xu @tianyin_xu
5K Followers 1K Following Watchman in a cornfield @IllinoisCDS @ECEILLINOIS @ACMSIGOPS
Reyhan @Reyhaneh
2K Followers 764 Following Assistant Professor of @plfmse at @IllinoisCS, Director of Intelligent CAT Lab (https://t.co/wO38Gqxs7n), PhD @UCIbrenICS, @Google PhD Fellow
Software Practices La... @ubc_spl
646 Followers 220 Following We study programming languages, verification, and software engineering in the Department of Computer Science at the University of British Columbia @ubc
Courtney Miller @courtneyelta
384 Followers 221 Following Software engineering phd student @SCSatCMU. Enjoys empirical open source software supply chain sustainability and security research, cycling, and climbing
Valerie Chen @valeriechen_
2K Followers 509 Following phd student @mldcmu @SCSatCMU + intern @allhands_ai | building @CopilotArena | previously @NYUDataScience @MSFTResearch @yale @CMU_Robotics @IBMResearch
Anne Ouyang @anneouyang
8K Followers 938 Following Building @Standard_Kernel, CS PhD student @Stanford | prev: cuDNN @Nvidia, M.Eng, B.S. in CS @MIT | efficient scalable self-improving AI systems | 🌽KernelBench
Andrea Michi @andreamichi
2K Followers 1K Following Co-Founder @depthfirstlabs / Building intelligence to detect and remediate software vulnerabilities / Prev post-training / RL for Gemini @GoogleDeepMind
Courtney Cronin @CourtneyRCronin
119K Followers 2K Following Chicago Bears & much more @ESPN. @ESPNRadio host. Hear me @ESPN1000. Prev: Vikings/ESPN (2017-21), @mercnews, @clarionledger. IU alum. [email protected]
Caleb Williams @CALEBcsw
198K Followers 872 Following Caleb “Superman” Williams “Hakuna Matata” DABEARS🐻 #18
Justin T Chiu @justintchiu
653 Followers 911 Following generating code; phd in ml from Cornell; former Child
Avi Sil @aviaviavi__
2K Followers 535 Following Senior Director, Applied Science @Oracle | Past - Manager + Principal Scientist @IBMResearch AI | Tweets are my own opinion
Albert Örwall @aorwall
196 Followers 448 Following Building Moatless Tools (https://t.co/TSKAwaVXmT) and https://t.co/DJDebZ3Qog
He Ye @ye_he_ye
218 Followers 195 Following Assistant professor at @ucl, co-founder of @Euni_AI, working on code agents.
Xiaoning_Du @xiaoning_du
224 Followers 173 Following Lecturer @ Faculty of Information Technology, Monash University
Yu Yang @YuYang_i
6K Followers 784 Following reasoning research @OpenAI 🍓 | UCLA CS PhD | Ex. Microsoft Research, Meta FAIR, NVIDIA Research
UIUC Free Food @UIUCFreeFood
3K Followers 1 Following Free food around the UIUC Campus. Know about free food? Submit the form below 👇🥳🍴. Student-generated responses in one place.
Ziqi Zhang @ZiqiCharles
102 Followers 275 Following CS Postdoc at @UofIllinois. Graduated from @PKU1898. Software engineering, AI, and computer security.
Maliheh (Mali) Izadi @MalihehIzadi
693 Followers 694 Following Assistant Prof @TUDelft, Research at the intersection of #ML, #SE. Currently addressing #LLMs4Code challenges. Director of @AISE_tudelft research lab
Blaze (Balázs Galamb... @gblazex
1K Followers 1K Following A Smooth Guy; Developer of SmoothScroll for macOS, Windows & Google Chrome.
Shushan Arakelyan ✨... @sharakelyan
1K Followers 539 Following Researcher at Microsoft. Previously: PhD @CSatUSC, MPhil @Cambridge_Uni
Ofir Press @OfirPress
15K Followers 7K Following I build tough benchmarks for LMs and then I get the LMs to solve them. SWE-bench & SWE-agent. Postdoc @Princeton. PhD @nlpnoah @UW.
Yangruibo Robin Ding @RobinDing3
264 Followers 381 Following Incoming Assist. Prof. @UCLA. CS Ph.D. @Columbia. Formerly, @GoogleDeepMind, @AmazonScience, @IBMResearch. LLMs, Reasoning, Agents, and Software Engineering.
Atharv Sonwane @twm_as
218 Followers 907 Following CS PhD @ Cornell Prev. RF @ Microsoft Research India, CS @ BITS Goa AI, PL, Robotics
Naman Jain @StringChaos
2K Followers 1K Following PhD @UCBerkeley ; Research @cursor_ai | Projects - LiveCodeBench, DeepSWE, R2E-Gym, GSO, Syzygy, LMArena Coding | Past: @MetaAI @AWS @MSFTResearch @iitbombay
Kexun Zhang @kexun_zhang
1K Followers 796 Following PhD student at @LTIatCMU. Previously at @ucsbNLP, @ZJU_china. language lover.
John Yang @jyangballin
4K Followers 803 Following 🌲 CS PhD @Stanford 🤖 SWE-bench + agent + smith 🎓 Prev. @princeton_nlp 🐯; @Berkeley_EECS 🐻
Zhou Yang @Zhou_Yang_X
203 Followers 142 Following I’m a PhD Student at Singapore Management University, working on “beyond accuracy of code models” like robustness, security, privacy, etc.
Noah Patton @Noah_T_Patton
14 Followers 42 Following Computer Science PhD Student at the University of Texas at Austin
Yi Li @liyistc
559 Followers 413 Following Associate Professor @NTUsg @ntu_srslab | previous PhD @UofTCompSci
Ruijie Meng @RuijieMeng
344 Followers 467 Following Incoming tenure-track faculty at CISPA | PhD at @NUSComputing | Software Security
kathy @_kathycheng
189 Followers 83 Following PhD candidate @uoftmie | HCI + product design collaboration | prev. HCI intern with @ADSKResearch
Thanh Le-Cong @ThanhLeCong2705
148 Followers 384 Following PhD Student @Unimelb. PhD Fellow @GoogleAI. Research Intern @Amazon. Ex Research Engineer @smusg. Working on Reliable AI4Code.
Soyeon Park @_runiel
380 Followers 308 Following Security Researcher @ Samsung Research America | DARPA AIxCC Winner @TeamAtlanta24
Pinjia He @PinjiaHE
1K Followers 576 Following Assistant Professor at The Chinese University of Hong Kong, Shenzhen (CUHK-Shenzhen) @cuhksz.
Nan Jiang @NanJiang719
92 Followers 96 Following PhD, PurdueCS | Senior Applied Scientist, Microsoft Office AI | Generative AI for Software Engineering
Stefan Nagy @snagycs
1K Followers 1K Following Faculty @uutah. My lab hunts bugs: https://t.co/R74Wl128A9. Mastodon: [email protected] Bluesky: https://t.co/6sKvEYpXMF
Shuai Wang @wangshuai901
1K Followers 1K Following Associate Professor in CSE at HKUST | Software and Systems Security | Reverse Engineering | AI (LLM) Security and Privacy
Vincenzo Riccio @p1ndsvin
1K Followers 1K Following Assistant Professor @uniud 🇮🇹 Previously, postdoc @usisoftware 🇨🇭 Born and raised in Napoli 💙 Software testing for AI 🛠️🤖🛠️
Jenny Liang @jennytliang
535 Followers 294 Following she/her phd: @S3DatCMU ugrad: @uwcse, @UW_iSchool prev: @Apple, @MSFTResearch, @allen_ai
Yiling Lou @yiling__LOU
728 Followers 273 Following Incoming Assistant Professor @UIUC. Researcher on Software Engineering & AI.
Eliscia (she/her) @elisciasinclair
73 Followers 77 Following Clinical Psychology PhD student conducting gambling research 🎰 @ADMHLabTMU | @UBCPsych Alum
Sharven Wong @SharvenW
285 Followers 455 Following Assistant Professor in Software Engineering. Once a visiting scholar at Southern University of Science and Technology (Shenzhen).