KP @kexecv
LLMs, Infosec, Programming, Science & Technology kuldippatel.dev Singularity Joined June 2012
Tweets: 989 | Followers: 575 | Following: 957 | Likes: 64K
YALM-130M evaluation results are here! The extra time and cost($70) didn't pay off. Results are barely better than YALM-80M. But I am not disappointed as failure is just fuel for the next attempt! 💪 Model: huggingface.co/kp7742/YALM-13… https://t.co/GSmzHe6Ybz
Finally! 🚀 My long-pending task is complete! YALM-130M training finished in 43 hours. It was a last moment decision to scale up from 80M to 130M. Was it worth the extra time and cost? We'll find out tomorrow after the evaluation results. Stay tuned! (Ignore those spikes 😇) https://t.co/EaLrdmAhEB
Excited to share a new open-source project I made for quickly testing LLM architectures! It's heavily based on nanoGPT and other OSS repos. Currently supports single GPU only. This is separate from the YALM project, which uses the HuggingFace Trainer. github.com/kp7742/gpt-tra…
There are rumours of TikTok coming back to India. Almost everybody is talking about what they will make once it comes. I'm rather interested in the juicy training data it will generate.
I'm so mid.
That’s what we call quality content, a must read.
deepmind and openai cooked hard but somebody got sidelined 😏
Take: Chain of Thought is a misleading name. It's really a "scratchpad". "Thoughts" are internal activations. Imagine you're solving a problem and have a scratchpad. Reading the pad gives me info! You *can* avoid writing down key thoughts. But it's a handicap. Real but fallible
I have been busy understanding GRPO, experimenting with different approaches and tuning parameters to bootstrap reasoning in small models without additional training. Getting some promising results with SmolLM2-135M. Also, YALM-80M pretraining is delayed, but still on the roadmap.
Just tried Instruction tuning today! I finetuned SmolLM-135M base model using a custom chat format on the SmolTalk dataset for one epoch. I even built a Gradio web chat for easy testing of models. It's great for simple instructions, but not so much for math and tool calls!
Tbh, this delay is definitely due to the Kimi K2 release, but not for the reason many people believe. The release of such a SOTA open model would steal the spotlight from their model release. Regardless of its size, everyone would compare it to larger open models. They were…
Attention or linear models alone cannot fix everything; hybrid models are the way forward.
Tokenizer test results are here! RTL digit tokenization didn't do as well as expected, maybe due to inconsistency. Interestingly, individual digits performed best, likely because of better compression. SuperBPE, despite a higher training loss, did great on benchmarks. https://t.co/5CGcWLoCip
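For readers unfamiliar with the digit schemes compared above, here is a minimal sketch of what they could look like as pre-tokenization rules. The tweet does not give the exact setup, so everything here is an assumption for illustration: "individual digits" is taken to mean one piece per digit, "RTL" is taken to mean grouping digits in threes from the right, and the function names and group size are hypothetical.

```python
import re

def split_single_digits(text):
    """Each digit becomes its own piece: '12345' -> ['1', '2', '3', '4', '5']."""
    return re.findall(r"\d|\D+", text)

def split_digits_ltr(text, group=3):
    """Group digits left to right: '12345' -> ['123', '45']."""
    return re.findall(r"\d{1,%d}|\D+" % group, text)

def split_digits_rtl(text, group=3):
    """Group digits right to left so groups align with place value:
    '12345' -> ['12', '345'] (assumed meaning of 'RTL')."""
    pieces = []
    for run in re.findall(r"\d+|\D+", text):
        if run.isdecimal():
            # A short leading group absorbs the remainder, so all later
            # groups are full-width and aligned from the right.
            head = len(run) % group
            if head:
                pieces.append(run[:head])
            pieces.extend(run[i:i + group] for i in range(head, len(run), group))
        else:
            pieces.append(run)
    return pieces
```

With left-to-right grouping, the same trailing digits can land in different pieces depending on the number's length, whereas right-to-left grouping keeps thousands groups stable. This shows only the splitting step, not how the tests in the tweet were actually run.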
Tested Grok-4 on OpenRouter because I couldn't afford SuperGrok🫣. Still the same old Grok3 base model, although I expected more. Good at following instructions and takes quite an open stance, but I wish I could see its "thinking" process.
SuperBPE is quite interesting! It effectively compresses English and code with just a 32k vocab. But it doesn't compress non-English languages like Hindi that well. I probably should have tested a bigger vocab size, but I was resource-constrained.
Added SuperBPE with digit tokenization tests. It had mixed results; some positive and some negative. I'll share more details soon! Paper: arxiv.org/pdf/2503.13423 https://t.co/G9VjNwkHai
Final test run needs 5 more hours, then I need to quickly publish my eval results! Don't want Grok 4 hype to overshadow them. Exciting AI weekend ahead!

Vivek Ramachandran @vivekramac
26K Followers 5K Following Founder, SquareX (@getsquarex) | (exited) Founder, PentesterAcademy (@securitytube) - acquired by INE (@ine) | Defcon - Blackhat Speaker | Book Author
Siddharth Dayalwal... @siddharth_hacks
27K Followers 1K Following Developer Community Specialist @storyblok 🥑 Building @hackthisfall 🧡 Prev. @GitHub Field Expert ☂️ @MLHacks Hackathon APAC Coach 👨🏻💻 He/Him/His 🙋🏻♂️
yashwanth @yashwanth__e
174 Followers 1K Following tech & cats, undergrad researcher, deepl for life, a lil too employed, gpu poor @ hostel room
KC 🪐 @erwangto
184 Followers 686 Following
Durtal 🖤 @arebourss
187 Followers 1K Following "One may affirm it: society has done nothing but decline in the four centuries that separate us from the Middle Ages", Huysmans
TradeEasy @Trade__Easy
4 Followers 130 Following
Siddharth @Pseudo_Sid26
578 Followers 1K Following ML-DLpaglu | SportsPaglu | DHHpaglu | Building too much | Freelance-Paglu | 5x :🏆ML Hacks | Passionate (sometimes professional) Music Producer |
Sharon @SharonGoracke9
325 Followers 854 Following Dutch, Doors, Johnny Cash, FC Twente, MeidasTouch, Auping, vegetarian
Carrie Borer @BorerCarri81166
1 Followers 226 Following
Prathamesh Devadiga @PrathameshD_8
680 Followers 308 Following Google SoC'25 @ UCSC | AI Research @ Dartmouth | AI Resident @ Lossfunk | Member @ The Innovation Lab | AI Engineer | Founder & Lead Research @ Ādhāra AI Labs
Shivam mishra @Shivammish28782
13 Followers 82 Following
Romil Patel @Romilpatel1988
294 Followers 869 Following Cybersecurity Researcher® | @thm_ahmedabad Core team | Microsoft AI 900 certified | Bug hunter @Hacker0x01 & @Bugcrowd | CTF player
SalomeTitus @764WTPG7sLxrJ9
35 Followers 2K Following
LetitiaDunbar @30593qxX8f322
49 Followers 2K Following
panpengf @PPF12138
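23 Followers 734 Following 🏳️🌈LGBTQ+ supporter and activist🏳️🌈 🏳️🌈Pangender transhumanism🏳️🌈 📚College education (junior college graduate)📚 🐶💕Furry fan / animal protectionist💕🐶 ❤️😭Living with depression / anxiety😭❤️ 💜Feminism / supports women's choice💜 👊🏿✊🏿Black Lives Matter / 🇺🇦Supports Ukraine🇺🇦 💔 Supports gun bans💔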
Aman Goyal @goyalaman03
87 Followers 39 Following Founder @maruthlabs - Smaller model, endless possibilities
jay shah @Jay_Shah_C
551 Followers 8K Following Interested in Mobile dev, PLT, distributed sys, AI, devtools and automation
EmilyMark @2wDNm8l1JQxJ97
44 Followers 2K Following
Murillo De Paula Sant... @hvmodder
0 Followers 4 Following
Songlin Yang @SonglinYang4
14K Followers 3K Following research @MIT_CSAIL @thinkymachines. work on scalable and principled algorithms in #LLM and #MLSys. in open-sourcing I trust 🐳. she/her/hers
Krips Javiya @kripsjaviya
27 Followers 88 Following SDE Intern @Odoo || Ex DRDO Intern || Absolute Learner LinkedIn:- https://t.co/kD9unXR2zV
juggernaut @curlysaarthak
2K Followers 2K Following 21| final yr @iiitdelhi | arch | multiplying matrices in a startup
Harry @MathPresence
398 Followers 4K Following Philosophy, math, and CS. Presence beats reactivity. Joy beats ego. Truth sets you free. Curious about consciousness, the cosmos, and silence.
Akshay @Akshay12_03
622 Followers 503 Following Data Scientist | Learning Generative AI & MLOps | Sharing experiments + projects
Sanchayan Banerjee @snoopy_albert
43 Followers 1K Following
Rini.ai @Rina3AP
209 Followers 1K Following AI playground: UI/UX, bots & late-night builds… at the end who cares!
Sachin @sachdh
3K Followers 742 Following cooking reasoning models and agents at @AthenaAgentRL - a narrow intelligence lab
FireHacker @thefirehacker
481 Followers 2K Following Founder-AI Researcher. Building BubblSpace & Timecapsule
Nexus @Nexus737326
65 Followers 620 Following
Einstein on oGPU @EinsteinXoGPU
1K Followers 6K Following EINSTEIN is a devnet Meme created on OpenGPU chain and loves science, fun and AI
anuj @anujsesha
6 Followers 1K Following
Chinmay Kak @ChinmayKak
2K Followers 1K Following gradient ascender. LLMs @lossfunk. love @teamIvLabs
Gauri Tripathi @Gauri_the_great
2K Followers 409 Following ML engineer | Creating for the love of it
Cossale — oss/acc @XCossale
110 Followers 462 Following Working with LLMs and Diffusion models. prev - @revancedapp. Available for contracts. @KeplerSystems
Altoorawc @Altoorawc8329
22 Followers 3K Following
Satyam Dixit (he/him) @imsatyamdixit
390 Followers 2K Following Engineer @smallcaseHQ 🚀 | Backend Automation & CI/CD 🛠️ | Coding Java & JS 👨🏻💻 | Tech Innovator ⚡ | #Opensource Contributor
Sanskar Pandey @sanskxr02
549 Followers 754 Following post-training & alignment @lossfunk | prev intern @SarvamAI
GlobalMacroX🇺🇸 @Oodwerarwer412
44 Followers 2K Following 15-30% Monthly | 2 High-Conviction Stocks.Short-Term Gains: 15-20% in Days/Weeks.DM "JOIN" for WhatsApp Alerts. Live Trade Signals • Market Analysis
# @shahrizooda
37 Followers 740 Following
pransh ꩜ @inmypranshoes
2K Followers 1K Following ꩜ Staff Community Manager @mozilla ⨯ @MozDevNet ≒ building @getmeris ꕥ opinions my own ✨
Pawan Kumar @imthepk
45K Followers 3K Following Tech Lead | Building @minddraftai | Google Developer Expert - Flutter, Firebase | Codepur | 140K+ Youtube | 60K+ Linkedin | 40K+ Twitter | Int’l Speaker
Ben Sadeghipour @NahamSec
235K Followers 1K Following Cofounder @hackinghub_io | Advisor @CaidoIO. I hack companies and make content about it. #NahamCon organizer. ex @hacker0x01🇮🇷
Bhaarath Makwana 💙 @bharatmk2567
2K Followers 614 Following Caught up in @flutterdev & @Golang. Tweeting tech bits - known or unknown. Geek out or browse!
Katie Paxton-Fear @InsiderPhD
93K Followers 2K Following Dr, apparently. Security Advocate @semgrep & Hacker. #BugBounty hunter & #infosec YouTuber. APIs & Interlinked OffSec, PhD in AI+Sec @hacknotcrime. she/her
InfoSec Community @InfoSecComm
52K Followers 635 Following Largest InfoSec publication with 62,000+ followers and 1M+ monthly views.
Aditya Shende @ADITYASHENDE17
60K Followers 419 Following MS Cyber 🇬🇧 | Work @BforeAI | @Bugcrowd Top 100 | Bug Bounty Trainer | Keynote Speaker | Professional Biker | @kong_sec 🇮🇳 | Own Views ≠ Employment
bugcrowd @Bugcrowd
188K Followers 6K Following The leading provider of crowdsourced cybersecurity solutions purpose-built to secure the digitally connected world...Unleash Ingenuity™
Google Developers Gro... @GDGAhmedabad
8K Followers 71 Following Google Developer Group (GDG), Ahmedabad. We hope to see you at the next meetup https://t.co/LPbaRHSFEP. Organizers: @pareshmayani, @dhuma1981
Vivek Ramachandran @vivekramac
26K Followers 5K Following Founder, SquareX (@getsquarex) | (exited) Founder, PentesterAcademy (@securitytube) - acquired by INE (@ine) | Defcon - Blackhat Speaker | Book Author
Dhrumil Shah 🏡... @dhuma1981
6K Followers 2K Following #Father, Lead Architect at Tata Digital, GDE for #Flutter & #Dart, ex Co-Organiser of @GDGAhmedabad, Creator of @Flutter_Flakes.
Pratik Dabhi @impratikdabhi
19K Followers 998 Following 👨🏻💻Ethical Hacker 🐞Bug Hunter | Penetration tester 👨🏻💻Security Consultant at @Deloitte ☢️ Bugcrowd Top 300 | YouTuber (23k+ Subs) | Yeswehack Top 100
Siddharth Dayalwal... @siddharth_hacks
27K Followers 1K Following Developer Community Specialist @storyblok 🥑 Building @hackthisfall 🧡 Prev. @GitHub Field Expert ☂️ @MLHacks Hackathon APAC Coach 👨🏻💻 He/Him/His 🙋🏻♂️
LiveOverflow 🔴 @LiveOverflow
156K Followers 1K Following wannabe hacker... he/him 🌱 grow your hacking skills @hextreeio
Bhavik Makwana 💙... @ibhavikmakwana
6K Followers 1K Following Flutter | Google Developer Expert @flutterdev | ECSE @FlutterFlow | Ex - @MultiplMovement💛
TCM Security @TCMSecurity
208K Followers 358 Following Come learn to hack at TCM Security Academy! Veteran owned. Quality results.
Jaimin J Gohel 👨... @jaimin_gohel
1K Followers 543 Following Information Security Professional 💻 • Speaker 🎙️ • Scribbler ✍️ • CTFs 🚩
Mayuresh Choudhary @mayuresh_empire
118 Followers 298 Following Building @AskYourVideoPro | Love shipping cool stuff and learning about backend, llm, ai agent, rag, mcp, fine-tuning and more
TNG Technology Consul... @tngtech
2K Followers 139 Following TNG, aka "The Nerd Group", is a consulting partnership focused on high end information technology, particularly AI. 906 employees, 99.9% academics, ~53% PhDs.
Hao AI Lab @haoailab
4K Followers 343 Following Hao AI Lab at UCSD. Our mission is to democratize large machine learning models, algorithms, and their underlying systems.
jay shah @Jay_Shah_C
551 Followers 8K Following Interested in Mobile dev, PLT, distributed sys, AI, devtools and automation
Probability and Stati... @probnstat
67K Followers 584 Following Sharing insights on Probability, Statistics, ML, DL and AI research. Subscribe for recent research paper discussions at $2/month. DM to collaborate.
biraj (બિરજ) ... @biraj21_
4K Followers 277 Following founding engineer @OutspeedAI, building realtime voice ai infra. resident @lossfunk. wannabe solo founder. in my cringe era.
Prathamesh Devadiga @PrathameshD_8
680 Followers 308 Following Google SoC'25 @ UCSC | AI Research @ Dartmouth | AI Resident @ Lossfunk | Member @ The Innovation Lab | AI Engineer | Founder & Lead Research @ Ādhāra AI Labs
Tushar Goyal @tushowergoyal
56 Followers 101 Following pursuing phd in mind fuckery. into ai, hardware, and changing the world? ai resident @lossfunk | cs&ai @PlakshaUniv class of ‘25
Tesslate @tesslateai
161 Followers 9 Following builds high-performance reasoning models that outperform across domains like code generation, codebase intelligence, and UI generation.
aashay sachdeva @AashaySachdeva
3K Followers 473 Following I tweet about ML,data, investing and startups | ML @SarvamAI | Ex- Invest @RebrightVC |Ex-Senior Data Scientist at @PlayMPL | Built https://t.co/hWenaRkujG
Romil Patel @Romilpatel1988
294 Followers 869 Following Cybersecurity Researcher® | @thm_ahmedabad Core team | Microsoft AI 900 certified | Bug hunter @Hacker0x01 & @Bugcrowd | CTF player
Adithya kamath @Adi_kmt
371 Followers 843 Following 🇮🇳, ml, unsupervised learning, now looking into nlp
InclusionAI @TheInclusionAI
309 Followers 70 Following Open-source projects conducted by Ant Group,including Ling,AReal,AWorld. Dedicated our efforts towards AGI,guided by fairness, transparency, and collaboration.
Tornike @tornikepa
563 Followers 3K Following #Linux #Malware Researcher #Pent3ster published vulnerabilities #0day #Exploits advisories from various resources by #Cybersecurity #Bug #ReverseEngineering :wq
Rupesh Srivastava @rupspace
2K Followers 691 Following Doer of Technical Stuff. (Co)developed Highway Networks, Upside-Down RL, Bayesian Flow Networks, EvoTorch 📜 Learning is compression.
Yotta Data Services P... @YottaInfra
1K Followers 89 Following Datacenter Colocation | Cloud Services | Managed IT Services | Network & Connectivity | AI |
Fei-Fei Li @drfeifei
526K Followers 1K Following Prof (CS @Stanford), Co-Director @StanfordHAI, Cofounder/CEO @theworldlabs, #AI #SpatialIntelligence #GenAI #computervision #robotics #AI-healthcare
Pramod Goyal @goyal__pramod
10K Followers 332 Following Trying to change the world one line at a time
Lucas Beyer (bl16) @giffmana
110K Followers 523 Following Researcher (now: Meta. ex: OpenAI, DeepMind, Brain, RWTH Aachen), Gamer, Hacker, Belgian. Anon feedback: https://t.co/xe2XUqkKit ✗DMs → email
Barracks @BarracksArmy
287 Followers 39 Following Beyond the lab rut. Barracks forges hyper-realistic WarZones mirroring appsec chaos. Adapt. Report. Thrive. Build skills that cash actual checks.
prge @shguke
2K Followers 3K Following
Nexus @Nexus737326
65 Followers 620 Following
Vijay @__tensorcore__
2K Followers 525 Following MLIR, CUTLASS,Tensor Core arch @NVIDIA. Mechanic @hpcgarage. Exercise of any 1st amendment rights are for none other than myself.
Aakash Kumar Nain @A_K_Nain
12K Followers 908 Following ML Research | Multimodality, Self-improvement, LRMs | Keras3 collaborator, contributes to JAX ecosystem, TF-addons maintainer | @GoogleDevExpert in ML-JAX | OSS
Xenova @xenovacom
14K Followers 394 Following Bringing the power of machine learning to the web. Currently working on Transformers.js (@huggingface 🤗)
JingyuanLiu @JingyuanLiu123
3K Followers 429 Following https://t.co/D7zLeTZRMh is all you need | Opinions are my own
Alexandr Wang @alexandr_wang
333K Followers 838 Following chief ai officer @meta, founder @scale_ai. rational in the fullness of time
Songlin Yang @SonglinYang4
14K Followers 3K Following research @MIT_CSAIL @thinkymachines. work on scalable and principled algorithms in #LLM and #MLSys. in open-sourcing I trust 🐳. she/her/hers
juggernaut @curlysaarthak
2K Followers 2K Following 21| final yr @iiitdelhi | arch | multiplying matrices in a startup
Krips Javiya @kripsjaviya
27 Followers 88 Following SDE Intern @Odoo || Ex DRDO Intern || Absolute Learner LinkedIn:- https://t.co/kD9unXR2zV
Stellon Labs @stellon_labs
248 Followers 1 Following Building tiny frontier AI models that can run on edge devices
IIIT Vadodara @IIITVadodarasm
521 Followers 7 Following
SemiAnalysis @SemiAnalysis_
37K Followers 18 Following
vx-underground @vxunderground
377K Followers 294 Following The largest collection of malware source code, samples, and papers on the internet. Password: infected
Interconnects @interconnectsai
7K Followers 1 Following What you need to know about AI research trends, from @natolambert Wednesday mornings weekly, sometimes extra posts.