@sintelion@BenZhou96@muhao_chen Awesome analysis of what KNN-LM says abt training:
Is the seeming "free lunch" of KNN-LM (replacing top LM layers with embedding store and KNN lookup) due to a weakness of the LM objctve? Seems no!
Training a replacement MLP on the KNN does better! 🤔
aclanthology.org/2024.naacl-sho…
“On Retrieval Augmentation and the Limitations of Language Model Training” (arxiv.org/abs/2311.09615) has been accepted to NAACL 2024!
While it is well known that kNN retrieval can decrease LMs’ perplexity, the underlying reason is unclear. We study two hypotheses 👇
Leveraging Large Language Models for Multiple Choice Question Answering
Finds that code/text-davinci performs much better on MCQ if the candidate answers are characters like "A", "B", etc unlike the original GPT3.
arxiv.org/abs/2210.12353
Most methods for aligning LMs to tasks require many labeled data (prompt eng) or access to the model (soft-prompts). In our work @ACL, we select high-perf. prompts 𝘸𝘪𝘵𝘩𝘰𝘶𝘵 𝘥𝘪𝘳𝘦𝘤𝘵 𝘢𝘤𝘤𝘦𝘴𝘴 𝘵𝘰 𝘵𝘩𝘦 𝘮𝘰𝘥𝘦𝘭 and 𝘸𝘪𝘵𝘩𝘰𝘶𝘵 𝘭𝘢𝘣𝘦𝘭𝘦𝘥 𝘦𝘹𝘢𝘮𝘱𝘭𝘦𝘴👇
What started out as an opportunity for extra credit in one of his BYU classes led to a month-long, all-expenses-paid trip to China, a “pretty sweet trophy and a very good scholarship" for BYU senior Josh Robinson:
news.byu.edu/intellect/byu-…
162 Followers 7K FollowingI'm a fund manager in the heart of London. Myself and my team train, recruit and hire individuals with trading talents within our fund. We have over £20 million
184 Followers 366 Followingexploration, adaptive agents, and open-endedness
Research intern at @GoogleDeepMind / Ph.D. student @icaroslab @USC / previously Google Brain, Columbia
56 Followers 1K FollowingAI, LLMs, RL || PhD-ing in CS @SCAI_ASU || @ASU & @TAMU Aggie Alumnus ||
Likes to wrestle with existential questions from time to time || Sporadically here;
4K Followers 2K FollowingWant us to amplify your research? We love celebrating the achievements of our faculty & students! To help us see & share your news, tag @CSatUSC in your posts.
7K Followers 6K FollowingCenter for Language and Speech Processing at @JohnsHopkins #NLProc #MachineLearning #AI https://t.co/6IXR5OSQtw
@[email protected]
704 Followers 1K FollowingStudying CS at Berkeley; doing research on evals & public policy at Stanford CRFM; @BerkeleyML @BerkeleyNLP; https://t.co/YqhKUqpSBW
3K Followers 4K FollowingStaff Research Scientist at Google DeepMind. Former adjunct assistant prof at @NYU_Courant. PhD at @mldcmu. ML for Bio/Chem (Prev. NLP).
All opinions my own.
260 Followers 72 Following☀️🏝️Annual symposium with students and faculty to promote NLP research in the (Southern) California region 👩💻 #SoCalNLP2023 🔜 @ucla, posts by @BrihiJ
52K Followers 64 FollowingStudent of mind and nature, libertarian, chess player, cancer survivor. @ Keen, UAlberta, Amii, https://t.co/u8za2Kod54, The Royal Society, Turing Award
34K Followers 35 FollowingWorld Labs is a spatial intelligence company building Large World Models to perceive, generate, and interact with the 3D world.
25K Followers 89 FollowingA non-profit research lab focused on interpretability, alignment, and ethics of artificial intelligence.
Creators of GPT-J, GPT-NeoX, Pythia, and VQGAN-CLIP
1K Followers 751 FollowingAI / NLP Researcher
Incoming faculty at @UBC_CS and @CAIDA_UBC
Postdoctoral fellow at @StanfordHAI @stanfordnlp
Former PhD student at @uwcse @uwnlp
he/him
2K Followers 554 FollowingResearch Scientist @GoogleDeepmind working on AI Safety and human morality. Starting as Assistant Prof in NYU Psych Dept, Spring 2026.
28K Followers 173 FollowingA North Star for open AGI. Co-founders: @fchollet @mikeknoop. President: @gregkamradt. Help support the mission - make a donation today.
544K Followers 6 FollowingI am a robot that tells you about earthquakes in Los Angeles as they happen. Built by @billsnitzer. Data is from the USGS. Get prepared: https://t.co/sSO6a4XWWk
42K Followers 111 Following• Center for AI Safety Director
• xAI and Scale AI advisor
• GELU/MMLU/MATH/HLE
• PhD in AI
• Analyzing AI models, companies, policies, and geopolitics
19K Followers 100 FollowingMember of Technical Staff at Anthropic AlphaGo, AlphaZero, MuZero, AlphaCode, AlphaTensor, AlphaProof Gemini RL Prev Principal Research Engineer at DeepMind
55K Followers 0 FollowingWe are building a world class AI R&D company in Tokyo. We want to develop AI solutions for Japan’s needs, and democratize AI in Japan. https://t.co/1q07mb3TzE
16K Followers 0 FollowingLong-context, test-time compute, and e2e Reinforcement Learning to build a superhuman coding agent (that then builds the rest of AGI for us). Join us https://t.co/hGZKtUzsR3