Nicolay Rusnachenko @nicolayr_
Multimodal NLP (🖼+📝) Research Fellow @BU_Research / ex-RA in IR @UniofNewcastle, IR + LLM software developer https://t.co/bkVpkvVaTz . The opinions are mine.
nicolay-r.github.io
Bournemouth / London, UK
Joined December 2015
Tweets 343
Followers 71
Following 97
Likes 949
llama-3-8b's in-context learning is unbelievable. reddit.com/r/LocalLLaMA/c…
This is a valuable point on the balance between model size and the amount of data used for training. Meta AI did great, incredible work on training Llama 3 👏
Valuable 💎 contribution and a causal perspective on the sentiment analysis domain 👀
The upcoming Llama-3-400B+ will mark the watershed moment that the community gains open-weight access to a GPT-4-class model. It will change the calculus for many research efforts and grassroot startups. I pulled the numbers on Claude 3 Opus, GPT-4-2024-04-09, and Gemini.…
All right, Llama-3 8B and 70B are out! 🚀 8B-Instruct: huggingface.co/meta-llama/Met… 70B-Instruct: huggingface.co/meta-llama/Met…
💯 Came to the same conclusion so far:
I can't believe and expect that MistralAI aimed at it 🚀💎👀 #moe #finetuning
DINOv2 is definitely one of the popular choices for modality encoding in cutting-edge MLLM systems. Excited to learn about DeiT and bring it in too 💎
Reasoning Revision used to be the common approach to strengthening LLM answers to your CoT 🔗 and instruction prompts. Here is another way to do this: Fine-Grained Rewards 👀
After receiving community feedback, we added @GoogleDeepMind Gemini 1.5 Pro's results. 👇 Gemini 1.5 Pro's vision ability was significantly improved compared to 1.0 Pro and matched GPT-4's performance on our VisualWebBench! 🏆 Its action prediction (e.g., predicting what would… https://t.co/kQnZzztfEh
We just released Mixtral-8x22B-v0.1 and Mixtral-8x22B-Instruct-v0.1:
- Free to use under Apache 2.0 license
- Outperforms all open models
- Native function calling
- Masters English, French, Italian, German and Spanish.
- Seq_len = 64K
mistral.ai/news/mixtral-8…
I wonder whether we could see something that mimics an RNN from the LLM and self-attention perspectives 🤔
Hyeonbin Hwang @ronalhwang
145 Followers 198 Following M.S. Student @kaist_ai https://t.co/bQW6mlGzDN
Hossein A. (Saeed) Ra.. @srahmanidashti
755 Followers 2K Following PhD Student at WI (@ucl_wi_group) | @FAICDT1 | @UCL
Regina @Uniie_Qtz
133 Followers 2K Following The secret to success is to work a little harder than others every day.✨🎆
Ekue @ekpodar
984 Followers 4K Following I am interested in Tech/AI, Marketing, and complex systems, I will posts random stuff in those categories
Charlie Clarke @claclarke
874 Followers 1K Following Building and evaluating search and QA systems. Humbly serving as one of the @ass_deans for @WaterlooMath.
Dongwei Jiang @Dongwei__Jiang
128 Followers 227 Following Spent six years working in industry as a speech researcher, currently I'm shifting my focus to LLM and studying at @JohnsHopkins as a master's student
Zixuan Yi @ZixuanYI_
201 Followers 429 Following PhD student @TerrierTeam, University of Glasgow. Previous algorithm engineer @ Bosch GmbH Views my own. He/Him.
Yi Yin @Yi_Yin__
3 Followers 46 Following Postdoctoral Researcher @UniofOxford Co-Inventor of OxNNet, an ultrasound diagnosis tool Lead Organiser, OxfordX-ML (Cross-Disciplinary Machine Learning)
Alpay Ariyak @AlpayAriyak
1K Followers 2K Following 𝗔𝗜 @RunPod_io | 𝗟𝗲𝗮𝗱: @OpenChatDev (𝟲𝟬𝟬𝗸+ 𝗱𝗼𝘄𝗻𝗹𝗼𝗮𝗱𝘀 on HuggingFace🤗)
double blue rose @ubaidjan78
1K Followers 1K Following Meeting is a kind of fate, and the intersection of souls gives us endless romantic feelings.
Jin Yuzn @JYuzn59553
61 Followers 5K Following
Zahra Abbasiantaeb @z_abbasiantaeb
89 Followers 217 Following PhD candidate at @uva_amsterdam, Conversational Search and Information Retrieval
Sole Pera @DrCh0le
1K Followers 644 Following Associate Professor @wisdelft @tudelft, Misses Rosario, 🧉 Enthusiast, #RecSys #InformationRetrieval #IR4U2 #KidRec
Joel Mackenzie @joelmmackenzie
273 Followers 360 Following Lecturer at the University of Queensland. Information Retrieval Research. https://t.co/Aquuj1pPup @UQSchoolITEE @ielabgroup
Kaustubh Dholé @ ECI.. @KaustubhDhole
324 Followers 1K Following NLP Researcher @EmoryUniversity Prev: AI R&D Lead @TheAmeliaAI (2015-2021) Organizer💎@gem_workshop, Mentor @LogmlSchool 22 🎓@BITS_Pilani @TIFRScience
Xi Wang @wangxieric
794 Followers 921 Following Lecturer of NLP @sheffielduni, previous Research Fellow @UCL, PhD @TerrierTeam Glasgow Uni. Research interests in Personalised Conversational AI, NLP and IR.
Lei Shi @lshi_ncl
147 Followers 118 Following Senior Lecturer (Associate Professor) in Human-Computer Interaction, Machine Learning, Education, Learning Analytics, User Modelling at @openlab_ncl
Oliver Lemon ( @olive.. @oliverlemon
3K Followers 3K Following Chief AI Officer and Co-founder, Alana AI Ltd. Academic Co-Lead of UK National Robotarium. Professor, Director of Interaction Lab: conversational AI and LLMs.
Yashar Deldjoo @yashardel
774 Followers 1K Following Tenure Track (Rtd-b), Asst. Professor @polibaofficial; #RecSys #GenerativeAI #FairML #TrustworthyML #Multimedia #Fashion
Yihao Xue @xue_yihao65785
328 Followers 368 Following CS PhD student @UCLA Robustness | Representation Learning
AI Deeply @AiDeeply
398 Followers 5K Following AI is reshaping the world. Who are the people and companies driving the change? Visit our website to search more than 5,000 profiles.
Yihua Zhu @rrenDa6
47 Followers 304 Following Robotics Master at @BristolUni. CS Master and PhD in @univkyoto, NLP and ML. 📷🚵♀️⚽️🎾
Amarendhar Reddy @AmarendharRed17
3 Followers 63 Following
Negar Arabzadeh @NegarEmpr
937 Followers 812 Following PhD candidate at University of Waterloo | Interested in Information Retrieval
Kristina Gligorić @krisgligoric
845 Followers 607 Following CS Postdoc @Stanford @StanfordNLP, @snsf_ch fellow. PhD @EPFL_en, Ex Intern @GoogleAI @mpi_sws_. NLP, Computational Social Science. https://t.co/hclg9MYZ6e
Kanaad Pathak @kanaadpathak
114 Followers 205 Following PhD Student in IIR at the University of Strathclyde | DoSSIER Project on Economic Models of Interactive Search| Opinions and sentiments are my own
Dung Doan @dungdx34
198 Followers 5K Following
Carl Chan @CChan27646
124 Followers 3K Following
Alexander Pugantsov @pugantsov
511 Followers 828 Following Postdoc @UniPadova 🇮🇹 • PhD from @UofGlasgow • NLP/IR - Fairness, Transfer & Quantification Learning
Jack McKechnie @JackMcK1999
99 Followers 99 Following PhD student at the University of Glasgow. @TerrierTeam @IR_Glasgow
Pierce Perkins @PPerkins38293
72 Followers 3K Following
Anna(●'◡'●) @adfhjkpuy
25 Followers 457 Following ✨ | Chasing sunsets, flavors, and profitable investments | Turning passion into a lifestyle | Eat, Travel, Invest, Repeat
Cecilia Grant @grant_ceci49884
64 Followers 3K Following
Noorreaut @noorreaut12617
9 Followers 1K Following
Roman Leventov @leventov
1K Followers 532 Following An independent researcher of AI, hybrid intelligence, AI safety, and AI impacts. [email protected] for contact. https://t.co/pIgoGMT6tb…
ACM CHIIR 2024 @ACM_CHIIR
2K Followers 576 Following CHIIR (“cheer”) is the ACM SIGIR Conference on Human Information Interaction and Retrieval. Next up: 10th-14th March 2024 in Sheffield, UK #CHIIR2024
James Morrison Rubin @import_jmr
6K Followers 6K Following Product Lead | Bringing Gemini to life @Google Tweets are my own. Retweets are not endorsements. Joyful Learning Machines
Mechana electroniks @NBGmechana
50 Followers 457 Following Healthcare IT, 🏥, Wrestling, Astronomy 🔭 , Networking, and Anime,
Mikhail Burtsev @MikhailBurtsev
951 Followers 288 Following Landau AI Fellow @ LIMS, Founder of open-source conversational AI framework - https://t.co/TWm2mcySvt
Oren Sultan @oren_sultan
678 Followers 595 Following AI Researcher & Data Scientist @Lightricks, CS PhD Candidate #AI #NLP @HebrewU, advised by @HyadataLab 🇮🇱 | prev. @TU_Muenchen 🇩🇪 @UniMelb 🇦🇺 8200 Unit
Martin Salo @salomartin
810 Followers 5K Following Building AI to give everyone the ability to eat well every day. Co-Founder & CTO Yummy. Previously co-founded @realeyesit.
1997 @antoxagolubev
9 Followers 16 Following
Jaap Kamps @jkamps
994 Followers 700 Following
Taelin @VictorTaelin
16K Followers 893 Following Founder of @HigherOrderComp Building the massively parallel future of computing Reaching AGI to cure all diseases and suffering is all that matters
elvis @omarsar0
188K Followers 479 Following Building with LLMs @dair_ai • Prev: Meta AI, Galactica LLM, PapersWithCode, Elastic, PhD • Creator of the Prompting Guide (~4M learners)
Zahra Abbasiantaeb @z_abbasiantaeb
89 Followers 217 Following PhD candidate at @uva_amsterdam, Conversational Search and Information Retrieval
Claire Rogers @ClaireRPsych
93 Followers 163 Following PhD student • Strathclyde university • Dementia, Neuro, and IR research • Aspiring Clinical Neuropsychologist • Mam to a beautiful Lab
Weijia Shi @WeijiaShi2
5K Followers 963 Following PhD student @uwcse @uwnlp | Visiting Researcher @MetaAI | Undergrad @CS_UCLA | https://t.co/eLBQmgkvym
Backdrop @withBackdrop
7K Followers 0 Following Powering builder energy ⚡️ Where builders shape the frontier of tech, together. 👉️ Join us https://t.co/UnMVIXoXUF
Aleksandr V. Petrov @asash
547 Followers 234 Following PhD researcher (Recommender Systems) @TerrierTeam, University of Glasgow. Ex. Senior Software Engineer @Amazon. The opinions are mine.
Josiane Mothe @JosianeMotheFr
147 Followers 90 Following Professor in CS at the Université de Toulouse, IRIT, CNRS
Xi Wang @wangxieric
794 Followers 921 Following Lecturer of NLP @sheffielduni, previous Research Fellow @UCL, PhD @TerrierTeam Glasgow Uni. Research interests in Personalised Conversational AI, NLP and IR.
Ana-Maria Bucur @bucuram
1K Followers 733 Following PhD Student at Interdisciplinary School of Doctoral Studies, University of Bucharest. Researcher at @PRHLT. Working on NLP for Mental Health
Kaustubh Dholé @ ECI.. @KaustubhDhole
324 Followers 1K Following NLP Researcher @EmoryUniversity Prev: AI R&D Lead @TheAmeliaAI (2015-2021) Organizer💎@gem_workshop, Mentor @LogmlSchool 22 🎓@BITS_Pilani @TIFRScience
Sole Pera @DrCh0le
1K Followers 644 Following Associate Professor @wisdelft @tudelft, Misses Rosario, 🧉 Enthusiast, #RecSys #InformationRetrieval #IR4U2 #KidRec
Lei Shi @lshi_ncl
147 Followers 118 Following Senior Lecturer (Associate Professor) in Human-Computer Interaction, Machine Learning, Education, Learning Analytics, User Modelling at @openlab_ncl
Kristina Gligorić @krisgligoric
845 Followers 607 Following CS Postdoc @Stanford @StanfordNLP, @snsf_ch fellow. PhD @EPFL_en, Ex Intern @GoogleAI @mpi_sws_. NLP, Computational Social Science. https://t.co/hclg9MYZ6e
Katerina Drakoulaki @KDrakoulaki
986 Followers 2K Following Navigating academia: interested in language, music, rhythm, and the brain. @frictionlessd8a fellow for Research Reproducibility #devlangdis advocate
Debasis Ganguly @debforit
650 Followers 375 Following Lecturer/Asst. Professor at the School of Computing, University of Glasgow (@UofGlasgow/@GlasgowCS/@IDAglasgow/@ir_glasgow)
L. Dietz @deeds@masto.. @lauradietz99
1K Followers 632 Following CS Prof@University of New Hampshire: Text Retrieval Machine Learning and Analytics (TREMA)
Oren Sultan @oren_sultan
678 Followers 595 Following AI Researcher & Data Scientist @Lightricks, CS PhD Candidate #AI #NLP @HebrewU, advised by @HyadataLab 🇮🇱 | prev. @TU_Muenchen 🇩🇪 @UniMelb 🇦🇺 8200 Unit
James Morrison Rubin @import_jmr
6K Followers 6K Following Product Lead | Bringing Gemini to life @Google Tweets are my own. Retweets are not endorsements. Joyful Learning Machines
Jack McKechnie @JackMcK1999
99 Followers 99 Following PhD student at the University of Glasgow. @TerrierTeam @IR_Glasgow
Glasgow IR Group @ir_glasgow
1K Followers 128 Following Glasgow Information Retrieval Group @GlasgowCS
Alexander Pugantsov @pugantsov
511 Followers 828 Following Postdoc @UniPadova 🇮🇹 • PhD from @UofGlasgow • NLP/IR - Fairness, Transfer & Quantification Learning
Sebastian Ruder @seb_ruder
80K Followers 1K Following Multilingual LLMs @cohere • Prev: @GoogleDeepMind • Newsletter: https://t.co/7JGh2qpG98
ECIR2024 @ecir2024
645 Followers 126 Following European Conference on Information Retrieval 2024 in Glasgow, Scotland.
Martin Salo @salomartin
810 Followers 5K Following Building AI to give everyone the ability to eat well every day. Co-Founder & CTO Yummy. Previously co-founded @realeyesit.
Eliezer Yudkowsky ⏹.. @ESYudkowsky
175K Followers 89 Following The original AI alignment person. Missing punctuation at the end of a sentence means it's humor. If you're not sure, it's also very likely humor.
Tatiana Alvares-Sanch.. @T_A_Sanches
207 Followers 518 Following Urban Analytics, Geospatial Analysis, Cities, Architecture
Roman Leventov @leventov
1K Followers 532 Following An independent researcher of AI, hybrid intelligence, AI safety, and AI impacts. [email protected] for contact. https://t.co/pIgoGMT6tb…
ACM CUI 2024 @ACM_CUI
1K Followers 1K Following We are @TheOfficialACM @SIGCHI conference on Conversational User Interfaces. #cui2024 will be in Luxembourg City, Luxembourg on 8–10 July.
Vivs @vivstamou
68 Followers 656 Following Brain (ex|vs) machina Postdoctoral Researcher in NLP @ArchimedesUnit PhD in Computational Psycholinguistics @varlokosta_lab MSc @ims_Stuttgart
Bodhisattwa Majumder @mbodhisattwa
1K Followers 794 Following Research @allen_ai. Scientific Discovery, Language & Interactive Agents. PhD @ucsd_cse, @AdobeResearch Fellow. Prev @googleai @metaai @msftresearch
Shubham Chatterjee | .. @ShubhamC526
1K Followers 2K Following Research Associate | University of Edinburgh , Scotland | Neural IR | Representation Learning | Conversational IR | Tweets are my own opinion
Yuling Gu @gu_yuling
386 Followers 664 Following Predoctoral researcher @allen_ai | @nyuniversity ➡️ @UW ➡️ @allen_ai @[email protected]
Newcastle University @UniofNewcastle
55K Followers 2K Following Official page for Newcastle University UK, a founding member of the Russell Group of Research intensive universities, and a Global Top 110 university. #WeAreNCL
Johanne @JTrippas
856 Followers 486 Following @RMIT Vice-Chancellor's Research Fellow. Prior, Doreen Thomas Research Fellow @unimelb. Interested in making the web accessible. Even the snail reached the ark.
Luke Zettlemoyer @LukeZettlemoyer
8K Followers 2K Following
Todd Austin @ToddMAustin
2K Followers 3K Following Computer Science professor @UMich, security researcher, computer architect, entrepreneur, dad, friend, gamer, one who loves loving things!
ranlp2019 @ranlp2019
89 Followers 24 Following Conference Recent Advances in Natural Language Processing (RANLP2019). September 2019 Varna, Bulgaria.
Suzan Verberne 🤹.. @suzan
3K Followers 2K Following 1980 | Full Professor of Natural Language Processing (@TMLeiden), @LIACS @UniLeiden | Lives in #Nijmegen | Mother of 2 | 👩🏻💻👩🏻🏫🤹♀️🌱 🎼
Ian Soboroff | ian@id.. @ian_soboroff
3K Followers 316 Following I don’t use Twitter anymore since the takeover, sorry. Search me up, I’m pretty much the only “Ian Soboroff” around.
Negar Arabzadeh @NegarEmpr
937 Followers 812 Following PhD candidate at University of Waterloo | Interested in Information Retrieval
Krisztian Balog @krisztianbalog
2K Followers 294 Following Professor of computer science @UniStavanger, leading @iai_group. Staff research scientist @GoogleAI. Current focus: https://t.co/5JiH909fhx
At 15:00 on 22/4/24, Weronika Łajewska @weronika_laj from University of Stavanger will give an #IRTalk talk entitled "Grounded and Transparent Response Generation for Conversational Information-Seeking Systems". Details at: samoa.dcs.gla.ac.uk/events/viewtal… @GlasgowCS @ir_glasgow
PPO, DPO, IPO, KTO, BCO… now my language model is not only secretly a reward model but also a Q function?? I really need use my PTO now ⛱️
had to google this to keep up with llm training discourse (subsequently facepalmed because I probably should have figured the latin pattern out bi now)
Big news: I’ve started a part-time PhD in Clinical Neuroscience at the University of Cambridge! 🇬🇧 I will be studying with complex brain disorders through the analysis of spatially-resolved and single nucleus transcriptomics with Mina Ryten 🧠🧬
Llama 3 8B is trained on 15T tokens! 😱 This is in accordance with our recent scaling law in #minicpm paper(arxiv.org/abs/2404.06395): Compute optimal data size should be 200 times larger than model size 🤩 Chinchilla Optimal is dead! lol #llama3 #scaling #minicpm
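As a rough check of the ratios mentioned in the post above, here is a minimal sketch that computes the tokens-per-parameter figure for Llama 3 8B from the quoted numbers (15T tokens, 8B parameters, a 200x ratio). The Chinchilla value of roughly 20 tokens per parameter is an assumption taken from the commonly cited rule of thumb, not from the post, and all names are illustrative.

```python
# Figures quoted in the post above.
TRAINING_TOKENS = 15 * 10**12   # 15T tokens
MODEL_PARAMS = 8 * 10**9        # 8B parameters
MINICPM_RATIO = 200             # tokens per parameter claimed in the post

# Assumption: ~20 tokens per parameter is the commonly cited Chinchilla
# rule of thumb; it does not appear in the post itself.
CHINCHILLA_RATIO = 20


def tokens_per_parameter(tokens: int, params: int) -> float:
    """Return how many training tokens were used per model parameter."""
    return tokens / params


ratio = tokens_per_parameter(TRAINING_TOKENS, MODEL_PARAMS)
print(f"Llama 3 8B: {ratio:.0f} tokens per parameter")
print(f"Relative to the post's 200x figure: {ratio / MINICPM_RATIO:.1f}x")
print(f"Relative to the assumed Chinchilla figure: {ratio / CHINCHILLA_RATIO:.1f}x")
```

Under these inputs the ratio exceeds both reference points, so the model appears to be trained well past either notion of compute-optimal data size.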
@ZhijingJin @bschoelkopf @radamihalcea @mrinmayasachan @ZhihengLyu This is an amazing outlook on sentiment analysis and classification! My team and I tried to bring in the need for nuances in our paper here: aclanthology.org/2023.emnlp-mai… Glad more work is done to understand sentiment classification.
We've spent a tremendous amount of time reflecting whether the NLP task of Sentiment Classification (x=review, y=rating) is causal or anticausal since 2020. Check out our 2024 latest answer➡️arxiv.org/abs/2404.11055 💡We combined Causality and Psychology insights & improved #LLMs!
These numbers are insane. I can't even imagine what the larger one(s) will be. Looks like Mistral 7B might be dead as of today though, and maybe even sonnet lol My favorite is the huge gains in coding capabilities
Quick announcement: My team and I (and many others) are moving from Google Research to Google DeepMind. My team's research agenda won't be changing much --- we'll keep radiating fields as brightly as we can. Very excited for what's next! blog.google/inside-google/…
LLaMA-3 is a prime example of why training a good LLM is almost entirely about data quality… TL;DR. Meta released LLaMA-3-8B/70B today and 95% of the technical info we have so far is related to data quality:
- 15T tokens of pretraining data
- More code during pretraining…
The last few important links
- Official Meta Llama 3 site: llama.meta.com/llama3
- Llama 3 Model Card: github.com/meta-llama/lla…
🔥 BIG ANNOUNCEMENT! Llama 3 is out with SOTA performance! and YES, we are making it available in IBM watsonx NOW Some details about this new LLM:
- It comes in two sizes, 8B and 70B
- Trained by Meta on custom-built 24k GPU clusters
- Context length is 8k (Llama 2 was 4k)
-…
Are you curious how good is the Llama-3-8B-Instruct model? Join our discussion here: huggingface.co/MaziyarPanahi/…
How much 💸 do you need to train LLaMA3? Here's the breakdown. Assume:
· 15 trillion tokens
· train 1 epoch
· 8B dense model
The computation requirement is around 15 ÷ 1.25 × 3 = 36 times the cost of JetMoE training = 1,080,000 H100 GPU hours = 3 to 4 million USD
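The arithmetic in the estimate above can be reproduced with a short sketch. It uses only the numbers quoted in the post. Reading 1.25 as the JetMoE training set size in trillions of tokens and 3 as the per-token compute ratio between an 8B dense model and JetMoE is an assumption about what the post means, and the baseline GPU hours and hourly price are back-derived from the quoted totals rather than stated in the post.

```python
# Figures quoted in the post above.
LLAMA3_TOKENS_T = 15.0          # trillions of training tokens
TOTAL_GPU_HOURS = 1_080_000     # H100 GPU hours
COST_RANGE_USD = (3_000_000, 4_000_000)

# Assumed reading of the two factors in "15 / 1.25 * 3":
BASELINE_TOKENS_T = 1.25        # JetMoE training tokens, in trillions
PER_TOKEN_COMPUTE_RATIO = 3.0   # 8B dense model vs. JetMoE, per token


def cost_multiplier(tokens_t: float, baseline_tokens_t: float,
                    per_token_ratio: float) -> float:
    """Scale the baseline cost by the data ratio and the per-token compute ratio."""
    return tokens_t / baseline_tokens_t * per_token_ratio


multiplier = cost_multiplier(LLAMA3_TOKENS_T, BASELINE_TOKENS_T,
                             PER_TOKEN_COMPUTE_RATIO)

# Back-derived values: implied by the post's totals, not stated in it.
implied_baseline_hours = TOTAL_GPU_HOURS / multiplier
implied_price_low = COST_RANGE_USD[0] / TOTAL_GPU_HOURS
implied_price_high = COST_RANGE_USD[1] / TOTAL_GPU_HOURS

print(f"Cost multiplier over the baseline: {multiplier:.0f}x")
print(f"Implied baseline: {implied_baseline_hours:,.0f} H100 GPU hours")
print(f"Implied price: ${implied_price_low:.2f} to ${implied_price_high:.2f} per GPU hour")
```

The estimate is only as good as the two ratios: a different per-token compute ratio or GPU price would shift the total proportionally.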
The graph is not even large enough to fit the (in training) 400B.
The purpose of humanity is to expand convex hulls