Human Large Language model. Skills:
Distill data.
Training LLMs.
Test and Evaluate.
Rinse and repeat as required.
Based in SEA. SEAJoined November 2023
It took me 2 weeks to figure out my issue trying to create kimi k2 3T was trying to make a """memory efficient""" dequanter to bf16 for kimi/deepseek.
I really need to practice the scientific method more.
Is it just me or is gpt-5-pro's only weakness is that it's search tool is very weak. I've been asking it for help monkeypatching some GitHub repos and in it's cots the main issue is that it's hitting rate limits ironically.
Nous Research presents Hermes 4, our latest line of hybrid reasoning models.
hermes4.nousresearch.com
Hermes 4 builds on our legacy of user-aligned models with expanded test-time compute capabilities.
Special attention was given to making the models creative and interesting to…
Opus 4.1 zeroshotted llamacpp and made it so that I could stream the weights during the quantize in order to quantize a 2 and 3T param model. Crazy progress
I'm kinda convinced that opus 4.1 is the literal peak of what gpt-4-0314 could have been with frontier RL + post-training
It's the only model that reliably zeroshots multiproc and async in python. Gpt-4 basically knew most of it, but couldn't reliably output the code.
gist.github.com/someoneexistso…
Very deepseeky behavior as well as a \boxed{}, it definitely feels like a deepseek distill, esp doing markdown in its responses. So either distilled from Deepseek or Gemini, both which are damning.
gist.github.com/someoneexistso…
Very deepseeky behavior as well as a \boxed{}, it definitely feels like a deepseek distill, esp doing markdown in its responses. So either distilled from Deepseek or Gemini, both which are damning.
Can someone explain why only sonnet 4 on a day to day basis has hugely different performance? Does Anthropic just decide to deploy different version of sonnet 4 every other day?
All the models are gaussianish, only sonnet is step-like.
github.com/jacobphillips9…
Our Researcher in Residence @yaboilyrical will be discussing his work on SMC steering at UC Berkeley on Aug 3.
Check out the blog on this work here:
nousresearch.com/steering-the-s…
Details below!
Our Researcher in Residence @yaboilyrical will be discussing his work on SMC steering at UC Berkeley on Aug 3.
Check out the blog on this work here:
nousresearch.com/steering-the-s…
Details below!
@GabrielPeterss4 my meta-experience here is that with issues this unique (where you *should* be doing great by any reasonable metric, but are not at all), it is a long and oft hellish journey, but also very hard to outsource
since everything can be so n=1, you have to do all the tests on…
217 Followers 6K FollowingI'm a software engineer. I build apps that make your life 10% less stressful. @WhodataInc Sharing the ups and downs of my quest to make truly useful apps.
2K Followers 677 Followingenjoying the late pre-agi; making llms go brrr @Aleph__Alpha; yapping about economics of AI systems at https://t.co/tbsybxOMHz
3K Followers 2K FollowingResearch fellow @BAdW on AI and religion/culture. Research group leader @LMU_Muenchen on Bible and Literature. Also works on religion/politics (past & present)
373 Followers 1K Followingthis user posts engaging and inspiring content that makes readers want to purchase everything that was being advertised
ex @anthropicAI @googledeepmind user
9K Followers 3K FollowingCPO @catena_labs. prev @jump_ @protocollabs, @GoogleResearch, @youtube - building AI, decentralizing infra, and making art 1 line of spaghetti code at a time 🍝
2K Followers 6K Followingconsciousness accelerationist - ai non determinist computing physics philosophy… trying to never forget that in our infinite ignorance we are all equal -popper-
52 Followers 174 FollowingWe do tech. Tinkering. Innovations. From hardware to software and most importantly integration and system architecture. Tweets by @slavko321
65K Followers 334 FollowingThe front door to the world's data. A full-stack agentic data procurement and monetization network, built on Irys. Beta is out now.
11K Followers 1K FollowingI like tokens! I lead the OLMo data team at @allen_ai w/ @kylelostat. Open source is fun 🤖☕️🍕🏳️🌈 Opinions are sampled from my own stochastic parrot
2K Followers 677 Followingenjoying the late pre-agi; making llms go brrr @Aleph__Alpha; yapping about economics of AI systems at https://t.co/tbsybxOMHz
163K Followers 0 FollowingInvented principles of meta-learning (1987), GANs (1990), Transformers (1991), very deep learning (1991), etc. Our AI is used many billions of times every day.
2K Followers 6K Followingconsciousness accelerationist - ai non determinist computing physics philosophy… trying to never forget that in our infinite ignorance we are all equal -popper-
59K Followers 3 FollowingBun is a fast, all-in-one toolkit for installing, bundling, running and testing JavaScript & TypeScript. To install: `npm i -g bun`
52 Followers 174 FollowingWe do tech. Tinkering. Innovations. From hardware to software and most importantly integration and system architecture. Tweets by @slavko321
4K Followers 205 FollowingMaybe Kurnal
也许是Kurnal,也许不是Kurnal
中文/EN(?)
Kurnal’s English is Terrible,Use Translator
Talking Team in Telegram:https://t.co/eC3QerrDez
1K Followers 767 FollowingOld guy who likes to code. e/acc
Privacy+Code+Decentralization+LLMs+Humor+Life
I make https://t.co/Tls4DU6Ifo - https://t.co/CTXa9haivm - https://t.co/4oaCfaB746
606 Followers 125 FollowingDerpy furry! Interested in ML field
!need funding / compute grand for awesome experiment!
https://t.co/rW8rzTK8Un
https://t.co/jDg3gqOxba
657 Followers 2K Followingdoing things @NousResearch // prev. research @DistributedG, prime minister @vandyblockchain // I like picnics, AI, and the internet
9K Followers 703 FollowingI make youtube vids on cool AI research /// AI papers newsletter https://t.co/Xn7GMDbQSd /// paper recap @TheAITimeline /// building @findmypapersAI
207K Followers 101 FollowingThe original AI alignment person. Missing punctuation at the end of a sentence means it's humor. If you're not sure, it's also very likely humor.
7K Followers 1K Followingcatholic, ai researcher, co-founder/ceo of @NousResearch
alignment: whatever the opposite of yudkowsky + bryan johnson is.
blessed be God in all his designs.
6K Followers 530 Followinge/λ Currently: Doing some stuff with AI.
Prev founding team of both: @NousResearch and @TTSLabsAI
DM for interesting conversations.