Duolingo AI is hiring new PhDs and PhD interns! Come define the future of AI-driven education!
(New PhDs must graduate by Aug 2025, interns by Aug 2026)
jobs.duolingo.com
Wanna know gpt-3.5-turbo's embed size? We find a way to extract info from LLM APIs and estimate gpt-3.5-turbo’s embed size to be 4096. With the same trick we also develop 25x faster logprob extraction, audits for LLM APIs, and more!
📄 arxiv.org/abs/2403.09539
Here’s how 1/🧵
LMs are increasingly large🐘 and proprietary🔒 — what if we could “tune”🔧 them without accessing their internal weights?
Enter: proxy-tuning, which operates on only the *outputs* of LMs at decoding-time to achieve the effect of direct tuning!
📄: arxiv.org/abs/2401.08565 1/
🚨 New Dataset Alert 🚨 I'm extremely excited to announce Universal NER v1, available now.
It is gold-standard human annotations of 18 datasets covering 12 languages, based on Universal Dependencies texts. This is the first data release of the UNER project.
1/3
at #ACL2023NLP ! I'll be presenting a poster at BEA on Thursday: work with @BenNaismithELT and @JillBurstein on discourse grading with GPT-4. come say hi :)
⚡️New paper!⚡️
It’s tempting to interpret chain-of-thought explanations as the LLM's process for solving a task. In this new work, we show that CoT explanations can systematically misrepresent the true reason for model predictions.
arxiv.org/abs/2305.04388
🧵
🌟Update🌟there's been a lot of debate of whether Theory of Mind (ToM) has emerged in new models (ChatGPT/GPT-3.5/GPT4), as people have reported qualitative/anecdotal evidence of good performance on these types of examples.
TLDR; still no neural ToM... 🧵
wait,
what??
why do Bard AND ChatGPT *both* write an anodyne story about a young woman in idyllic "Willow Creek" at sundown???
(details: it's not deterministic, often you get a different town name, different phrasing, etc. broad strokes are similar though. gpt-4 does it too.)
arxiv.org/abs/2210.15097
We propose contrastive decoding (CD), a more reliable search objective for text generation by contrasting LMs of different sizes. CD takes a large LM (expert LM e.g. OPT-13b) and a small LM (amateur LM e.g. OPT-125m) and maximizes their logprob difference
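A minimal sketch of the objective described above, assuming toy next-token distributions rather than real OPT outputs. The function name `contrastive_pick` and the example probabilities are hypothetical, and the greedy per-step choice is a simplification; the paper's full method may add constraints that are not stated here.

```python
import math

def contrastive_pick(expert_logprobs, amateur_logprobs):
    """Pick the token with the largest expert-minus-amateur logprob gap."""
    # Score each candidate by how much more the expert likes it than the amateur.
    scores = {
        tok: expert_logprobs[tok] - amateur_logprobs[tok]
        for tok in expert_logprobs
    }
    best = max(scores, key=scores.get)
    return best, scores

# Toy next-token distributions (illustrative values, not model outputs).
expert = {"the": math.log(0.5), "Paris": math.log(0.4), "banana": math.log(0.1)}
amateur = {"the": math.log(0.6), "Paris": math.log(0.1), "banana": math.log(0.3)}

token, scores = contrastive_pick(expert, amateur)
print(token, scores)
```

In this toy case the expert's most likely token is "the", but the largest gap belongs to "Paris", which the expert rates far higher than the amateur does.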
In addition to proposing an interesting new truncation sampler, this paper draws a connection between truncation and n-gram smoothing that refined my understanding of the motivations for truncation sampling in general. This is a great read!
📢 Welcome to ar5iv.org
Change the "X" in any arXiv article link to the "5" in ar5iv to get a modern HTML5 document.
Thread: what is included, why now, and how we hope to merge back into arXiv.
#OA #OpenScience #preprints
1/10
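The link rewrite described above can be sketched as a one-line string substitution. The helper name `to_ar5iv` is hypothetical, and the sketch assumes the input is an ordinary arxiv.org article link.

```python
def to_ar5iv(url: str) -> str:
    """Swap the arXiv host for ar5iv, leaving the rest of the link unchanged."""
    # Replace only the first occurrence so the paper ID and path stay intact.
    return url.replace("arxiv.org", "ar5iv.org", 1)

example = to_ar5iv("https://arxiv.org/abs/2210.15097")
print(example)
```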
We introduce 𝐁𝐢𝐝𝐢𝐦𝐞𝐧𝐬𝐢𝐨𝐧𝐚𝐥 𝐋𝐞𝐚𝐝𝐞𝐫𝐛𝐨𝐚𝐫𝐝𝐬 (𝐁𝐢𝐥𝐥𝐛𝐨𝐚𝐫𝐝𝐬), a generalization of leaderboards that simultaneously drives progress in natural language generation and its evaluation.
arxiv.org/abs/2112.04139
nlp.cs.washington.edu/billboard/
1/5
How can we make language models be less data-hungry?
We show that fully compositional word representations grounded on related words and definitions from an external lexicon can do that.
Joint work w/ @PhoebeNLP, @nlpnoah at @emnlp2020 arxiv.org/abs/2009.11523 #NLProc (1/3)
8K Followers 892 Following
Associate professor at NYU (Courant CS + Center for Data Science) | advisor for @bespokelabsai | large language models and NLP | he/him
5K Followers 1K Following
Computational Linguist and Professional Nerd at Georgetown University
he/him pronouns, ALL the prepositions
@[email protected] @complingy.bsky.social
15K Followers 7K Following
I build tough benchmarks for LMs and then I get the LMs to solve them. SWE-bench & SWE-agent. Postdoc @Princeton. PhD @nlpnoah @UW.
378 Followers 561 Following
PhD Student in STAI (https://t.co/MMYVUHhbeR) at @uni_tue and @MPI_IS (IMPRS-IS), part of @KImachtSchule and @VivaconAgua. Also elinguyen in 🦋
290 Followers 961 Following
PhD student in Computational Linguistics @cl_uzh. Interested in language modeling, human language processing, drag race, you name it. he/him 🏳️‍🌈
371 Followers 4K Following
Consulting in technological and strategic monitoring in the field of IT. Implementation of artificial intelligence development projects and associated services.
10K Followers 1K Following
Assistant Professor @UBC_CS & @VectorInst working on Natural Language Processing. Book: https://t.co/aBnNW4HaQ3. 🦋: @veredshwartz.bsky.social
103 Followers 680 Following
Tweeting about prompt engineering and LLMs guidance. Ai Engineer. Texthero | IBM Research | EPFL. Learning to be a 10X software dev with AI. Follow along!
2K Followers 648 Following
Mostly gone to better places :) I like language, birds, cats, trains, buses, long walks, cities, and other things 🌻 "vulnerable road user" opinions mine
1K Followers 750 Following
AI / NLP Researcher
Incoming faculty at @UBC_CS and @CAIDA_UBC
Postdoctoral fellow at @StanfordHAI @stanfordnlp
Former PhD student at @uwcse @uwnlp
he/him
256 Followers 317 Following
PhD, they/them 🏳️‍🌈 postdoc researching usable security and privacy at New York University. Likes the dungeons and the dragons. Photo by @_ericzeng
648K Followers 35 Following
We're an AI safety and research company that builds reliable, interpretable, and steerable AI systems. Talk to our AI assistant @claudeai on https://t.co/FhDI3KQh0n.
733 Followers 267 Following
language is irreducibly contextual and multimodal. phd @uwcse (nlp) & former eng @google, now indie dev. posting: academia ∪ "AI" ∪ building software ∪ travel
3K Followers 2K Following
Research Scientist at Meta. 10-yr test-of-time ACL 22, Best Demo ACL 25, Best Resource Paper ACL 24, Best Theme Paper ACL 24, Best Student Paper NAACL 15 🏳️‍🌈
428 Followers 138 Following
Asst. professor at @UofR Ling and Data Science |
NLP for low-resource, endangered, and Indigenous languages |
formerly @uwlinguistics, @uwnlp
6K Followers 36 Following
We work on natural language processing, machine learning, linguistics, and deep learning. PIs: Dan Klein, @alsuhr, @sewon__min