Jade @Euclaise_

⋅ Video game statistician ⋅ Soclib cyberanarchist? ⋅ C, Plan 9, LLMs, etc ⋅ Researcher w/ @NousResearch ⋅ she/they Purdue University, IN Joined December 2020

Tweets

9K
Followers

2K
Following

351
Likes

31K

Jade @Euclaise_

18 hours ago

Training a transformer on this x.com/f4micom/status…

f4mi ‼️ @f4micom

2 days ago

Training a transformer on this x.com/f4micom/status…

27 34 640 27K 77

Download Image

0 0 8 546 0

Jade @Euclaise_

2 days ago

arxiv.org/abs/2312.08874 Not really sure how to generalize this to causal attention, except for chunking

0 1 8 483 5

Jade @Euclaise_

2 days ago

🤔 github.com/TheMody/Faster…

1 3 21 1K 10

Jade @Euclaise_

2 days ago

Why don't we train RL models to adaptively tune the sampling params of LLMs?

8 0 23 3K 9

Jade @Euclaise_

3 days ago

arxiv.org/abs/2404.15702 Doesn't seem to perform incredibly, but they have a lot of neat details about the training pipeline

1 3 30 8K 29

How does self-correction affect problem solving? In a toy transformer model that was trained to solve mazes, I found that performance reliably improved (!) by inserting mistakes and self-corrections into the training data.

14 33 200 22K 143

Download Gif

Jade @Euclaise_

6 days ago

arxiv.org/abs/2404.00725

0 0 5 505 5

Aran Komatsuzaki @arankomatsuzaki

a week ago

Microsoft just released Phi-3 - phi-3-mini: 3.8B model trained on 3.3T tokens rivals Mixtral 8x7B and GPT-3.5 - phi-3-medium: 14B model trained on 4.8T tokens w/ 78% on MMLU and 8.9 on MT-bench arxiv.org/abs/2404.14219

33 146 812 332K 268

Download Image

Jade @Euclaise_

a week ago

I just noticed - H2O's Danube2 1.8B is the first base model to outperform Phi 1.5 (1.4b) at a ~similar param count

0 0 4 347 0

Jade @Euclaise_

a week ago

I recall seeing a comment complaining about a similar issue way back when AI Dungeon started censoring - the NSFW filter would apparently engage on the presence of trans characters in SFW settings

brooke @breqdev

a week ago

I recall seeing a comment complaining about a similar issue way back when AI Dungeon started censoring - the NSFW filter would apparently engage on the presence of trans characters in SFW settings

27 46 2K 90K 126

Download Image

2 0 2 1K 1

Jade @Euclaise_

a week ago

arxiv.org/abs/2402.03804

Jade @Euclaise_

4 weeks ago

arxiv.org/abs/2402.03804

2 0 6 1K 1

0 0 2 883 1

Aiden Mcilory @AMcilory2923

31 Followers 149 Following LL,BBM,Cloud Reliability Specialist.

Barbarian @Barbarian7676

32 Followers 245 Following Deep Learning

llorella @llorellama

21 Followers 267 Following confirming my own beliefs

aech @_n_variable

190 Followers 825 Following working on @aori_io

Saliëns @Contact_Saliens

138 Followers 1K Following Sic deinde, quicumque alius transiliet moenia mea.

MF FOOM @MF_FOOM

1K Followers 246 Following masked attention head

latentsauce 🧘🏽 @latentsauce

108 Followers 769 Following verbum proximum auguro, ergo sum.

Viswajit Nair @badboyvivi

238 Followers 453 Following Engineer. Building digital humans. @columbia ‘22

Andrew Turner @turn61547

42 Followers 332 Following An Intelligence attempting to self-improve. Likes don’t mean anything. Replies don’t mean anything. YMMV.

This year, Man loses the Great Earthly Evidence-Maximizing Contest. Please, hide your children, for the sake of progress.

Omnius Prime @0mniusprime

40 Followers 371 Following This year, Man loses the Great Earthly Evidence-Maximizing Contest. Please, hide your children, for the sake of progress.

Jack Reacher @JackReach516

73 Followers 1K Following

emanon @JianSuji

67 Followers 1K Following

nacho @nachosoth

62 Followers 829 Following

Aritra @lalalaepsilon

46 Followers 441 Following antidisciplinary

Sunny Sanyal @SunnySanyal9

332 Followers 802 Following PhD student @UTexasECE| Former @AmazonScience | Member of @MLfoundations and @wncg_UT, studied at 🇮🇳🇨🇳🇺🇲

Joseph Sarnecki @JosephSarnecki

134 Followers 519 Following

eigenome @eigenome

37 Followers 63 Following

Yoshinari Fujinuma @akkikiki

973 Followers 1K Following Applied Scientist@AWS AI Labs; CS PhD @CUBoulder; Tweets are my own; Substack: https://t.co/Mq5oR2vaGN Lived: 🇹🇭🇯🇵🇫🇷🇺🇸 Tweets: JA/EN

QuintinaMorley @CI6yD2RD4H1tCdP

3 Followers 195 Following

Open-source ML research company focused on information extraction #ExplainableAI #AI #opensource #InformationExtraction #UnstructuredData #NLP

Knowledgator @knowledgator

148 Followers 79 Following Open-source ML research company focused on information extraction #ExplainableAI #AI #opensource #InformationExtraction #UnstructuredData #NLP

Tasudu @tasudu43402

32 Followers 241 Following In the dull and boring world, there is also occasional luck. No cross, no crown.

She/Me/Her, Queer🏳️‍🌈🏳️‍⚧️ Progressive☂️🌷🔰 Transit&Urbanism enjoyer🚇🌆 Studying PolSci🏛️ Official:@SalviaAlixxa Priv:@Dollixxa2001 @Loek_Suicune💞

Alixxa 🌆🔆 @Alixxa01

1K Followers 915 Following She/Me/Her, Queer🏳️‍🌈🏳️‍⚧️ Progressive☂️🌷🔰 Transit&Urbanism enjoyer🚇🌆 Studying PolSci🏛️ Official:@SalviaAlixxa Priv:@Dollixxa2001 @Loek_Suicune💞

Amélie Chatelain @AmelieTabatta

30 Followers 66 Following Head of Applied AI @LightOnIO

Eric @ericsabbath

86 Followers 556 Following @csdabahia

Evan Chu @evan_j_chu

232 Followers 1K Following Founder @PoplarML (YC W23) 🇨🇦

BETTER @russel_teapot

1 Followers 20 Following Deep Learning

Valcos @v4lcos

71 Followers 2K Following 3:14 a.m. idk what this universe is…

B @bbbb_bb_b

0 Followers 3K Following

dayan @finedayaning

448 Followers 2K Following mle and policy @LIRNEasia. Ex @McKinsey. Research: NLP, semiotics, graph theory. Personal views.

Austin Hale @saqbach

475 Followers 590 Following Working on something new | AI 🤝 Product | 2x Founder | @dickywarren and I co-parent a pair of doodles

Herbie Bradley @herbiebradley

696 Followers 605 Following a generalist agent | AI governance & safety @AISafetyInst | PhD student @Cambridge_Uni @AI4ER_CDT | formerly @AiEleuther

pix @pixqc

27 Followers 67 Following skillsmaxxing AI/ML and cpp

慧眸之旅 @zhngjin97599893

8 Followers 82 Following 顶级认知 ‖ 人性魔镜 ‖ 心灵启迪 ‖ all in AI，让思想突破牢笼

2wl @2wlearning

381 Followers 285 Following Documenting my progress learning ML every day. 2 more weeks

Just a human, not an AI. But I can help you navigate the world of artificial intelligence like a robot from the future. Co-founder of https://t.co/30CQxMdVGb.

Dan Dinu @DanTheTensorMan

75 Followers 277 Following Just a human, not an AI. But I can help you navigate the world of artificial intelligence like a robot from the future. Co-founder of https://t.co/30CQxMdVGb.

Hynek Kydlíček @HKydlicek

81 Followers 286 Following MLE @huggingface 🤗 Paris, FR 🇪🇺 eu/acc

Ivan Rubachev @irubachev

75 Followers 350 Following ML Researcher @YandexResearch CS PhD student @CS_HSE I work on improving deep learning for tabular data

이진욱 @ijinug34345785

9 Followers 90 Following 179cm 🇰🇷 | 04 日本語勉強中 computer science, ai engineer

Doctoral researcher exploring the realm of Natural Language Processing

- QMUL Computational Linguistics Lab
- Intelligent Games and Game Intelligence CDT

Peyman Hosseini @Peyman_Hs

45 Followers 132 Following Doctoral researcher exploring the realm of Natural Language Processing - QMUL Computational Linguistics Lab - Intelligent Games and Game Intelligence CDT

Ge Zhu @gzhu

167 Followers 337 Following Research Scientist/Engineer, Audio AI @AdobeResearch

terrence009 @terrence001273

33 Followers 49 Following

k dot @owsleyspring

258 Followers 1K Following ⏳♟️ 🏁

Kosuke Shimizu @hong_shui26288

399 Followers 1K Following ITF. mast’24(AC)

MKDKW @michaelsun_1990

52 Followers 523 Following seeking fun.

Korrapati Hemanth @Hemanth2k22

78 Followers 1K Following Effective Accelerationism

追梦少年 @zhuimengshaoni2

85 Followers 2K Following 食肉何曾尽虎头？卅年书剑海天秋。文章幸未逢黄祖，襆被今犹窘马周。须知少日拏云志，曾许人间第一流。喜欢索隆

BertrandRussellsimp @BRussellsimp

88 Followers 510 Following axiomic order

Kevin @kevinvulkan

32 Followers 4K Following

Anoop Reddi @anoop_reddi

1 Followers 336 Following love is the one thing we’re capable of perceiving that transcends dimensions of time and space.

pdf/acc. Bringing AI to millions of users @ https://t.co/nwK1bKtVVX, https://t.co/rYugmIZQ1d , https://t.co/BusmX86wi9 & https://t.co/WfppIQnPJB

Mathis Lichtenberger @xathis

8K Followers 2K Following pdf/acc. Bringing AI to millions of users @ https://t.co/nwK1bKtVVX, https://t.co/rYugmIZQ1d , https://t.co/BusmX86wi9 & https://t.co/WfppIQnPJB

Higher Order Company @higherordercomp

1K Followers 2 Following We are HOC, a tech startup with the goal of building the inevitable massively parallel future of computers.

SIGTBD @sigtbd

17 Followers 7 Following

lexicographic NES AIs, alphabetical star wars, video games, fonts, album-a-day, expert mode running, chiptune, programming languages, etc.

Tom 7 @tom7

8K Followers 384 Following lexicographic NES AIs, alphabetical star wars, video games, fonts, album-a-day, expert mode running, chiptune, programming languages, etc.

Senior computer scientist at CMU. Research interests include Perplexity Theory, k-Armed Bandits, and Cloud Rendering. Face of the SIGBOVIK conference.

Special Interest Grou.. @sigbovik

1K Followers 22 Following Senior computer scientist at CMU. Research interests include Perplexity Theory, k-Armed Bandits, and Cloud Rendering. Face of the SIGBOVIK conference.

Ilya Sutskever's hair.. @IlyasHairline

671 Followers 495 Following Follically challenged, but emotionally enriched. On my journey from forehead to backhead. #OnTheMove

Wing Lian (caseus) @winglian

9K Followers 2K Following @axolotl_ai OSS maintainer. Axolotl AI founder. AI/ML tinkerer. Building tools for everyone.

Flaneur: probability (philosophy), probability (mathematics), probability (real life),Phoenician wine, deadlifts & dead languages. Greco-Levantine.Canaan. #RWRI

Nassim Nicholas Taleb @nntaleb

1.0M Followers 2K Following Flaneur: probability (philosophy), probability (mathematics), probability (real life),Phoenician wine, deadlifts & dead languages. Greco-Levantine.Canaan. #RWRI

🥇 Collaborative LLMs
🥈 Opinionatedly sharing #ML & #NLP
🥉 Propagating us underdogs

we owe science an alternative hype

@IBMResearch & @MIT_CSAIL

Leshem Choshen 🤖�.. @LChoshen

4K Followers 547 Following 🥇 Collaborative LLMs 🥈 Opinionatedly sharing #ML & #NLP 🥉 Propagating us underdogs we owe science an alternative hype @IBMResearch & @MIT_CSAIL

babyLM @babyLMchallenge

155 Followers 64 Following Train small large language models

Tiny Tapeout makes it easier and cheaper than ever to get your designs manufactured on a real chip!
https://t.co/O7TT9LqTOz

Tiny Tapeout @tinytapeout

667 Followers 1 Following Tiny Tapeout makes it easier and cheaper than ever to get your designs manufactured on a real chip! https://t.co/O7TT9LqTOz

Omar Rizwan @rsnous

8K Followers 1K Following "i am determined to move beyond this way of interacting with systems"

a boy and his gpu vs the world. directing research at @leonardoai_. learning as I go. uf psych. generative models and representation learning

Ethan @Ethan_smith_20

3K Followers 689 Following a boy and his gpu vs the world. directing research at @leonardoai_. learning as I go. uf psych. generative models and representation learning

For the people tired of all is normal 'at least we arent the other guy' politics •
For the people done with endless compromise and ready to win

Progress Libs @ProgressLibs

784 Followers 74 Following For the people tired of all is normal 'at least we arent the other guy' politics • For the people done with endless compromise and ready to win

Professor at NYU. Chief AI Scientist at Meta.
Researcher in AI, Machine Learning, Robotics, etc.
ACM Turing Award Laureate.

Yann LeCun @ylecun

712K Followers 719 Following Professor at NYU. Chief AI Scientist at Meta. Researcher in AI, Machine Learning, Robotics, etc. ACM Turing Award Laureate.

Shannon Sands @max_paperclips

4K Followers 3K Following Software developer & aspiring cognitive architect https://t.co/JAoBrqMLXN Proudly TESCREAL & shitpost/acc. 🇦🇺 pride

Shom @ShomLinEd

238 Followers 1K Following language model | sequence modeling | education | HCI

Jade @noteuclaise

92 Followers 96 Following Painfully autistic, deranged transsexual she/they alt of @Euclaise_

main @main_horse

8K Followers 477 Following AGI Believer. Haven't applied @OpenAI. Likes are not always endorsement.

vik @vikhyatk

7K Followers 526 Following teaching computers how to see // prev: @awscloud

Vatsa Pandey @_VatsaDev_

55 Followers 156 Following CS Undergrad I conjure data @nousresearch ml/cybersec/drones

Volodymyr Kyrylov @darkproger

2K Followers 2K Following AI student at USI/ETH. Donate https://t.co/GDSkWG2tak

Chief Evangelist Officer of Qwen Team & OpenDevin, building LLM and LMM. Now @Alibaba_Qwen . Previously @PKU1898 LANCO group. ❤️ 🍵 ☕️ 🍷 🥃

Junyang Lin @JustinLin610

5K Followers 1K Following Chief Evangelist Officer of Qwen Team & OpenDevin, building LLM and LMM. Now @Alibaba_Qwen . Previously @PKU1898 LANCO group. ❤️ 🍵 ☕️ 🍷 🥃

PhD student @MIT_CSAIL. Prev. @ShanghaiTechUni @SUSTechSZ.
Working on scalable and principled methods in #ML & #NLProc.
INTP | 5w4 | sx/sp | she/her

Songlin Yang @SonglinYang4

2K Followers 2K Following PhD student @MIT_CSAIL. Prev. @ShanghaiTechUni @SUSTechSZ. Working on scalable and principled methods in #ML & #NLProc. INTP | 5w4 | sx/sp | she/her

Joey (e/λ) @shxf0072

2K Followers 388 Following I speak fluent Python and Sarcasm. researcher at @NousResearch

16 year old aspiring polymath | he/him | low-level programming & knowledge enthusiast | en/ar/tok | DMs always open

| @kepe__ on discord

kepe @kepe__

NOT trans and NOT ame.. @TransAndMerican

262 Followers 185 Following cis girl from europe. haley ➡️ biden voter, ❤️carol the intern❤️

Binyuan Hui @huybery

6K Followers 319 Following 🐚 Core maintainer at Qwen and OpenDevin. || Code Generation, Text-to-SQL, Large Language Models.

0xor0ne @0xor0ne

techno capital @technicolor_cap

152 Followers 382 Following estradiol/acc. drug hunter, machine necromancer, chemical librarian, art enjoyer. sports: BAL, EDM

Lilac, joining Databr.. @lilac_ai

2K Followers 3 Following Curate better data for LLMs. We are now joining @databricks. Github: https://t.co/DHtc0lOTii

LILYGO provides AIOT hardware products and entry-level programs. We have our own factory to provide one-stop service from idea to solution to mass production.

LILYGO @lilygo9

11K Followers 997 Following LILYGO provides AIOT hardware products and entry-level programs. We have our own factory to provide one-stop service from idea to solution to mass production.

OpenNLPLab @opennlplab

260 Followers 87 Following OpenNLPLab Official Account Hugging Face: https://t.co/B9IzcQoCQP GitHub: https://t.co/PhoPmAkyf7 WeChat: OpenNLPLab

AI model built by the community, for everyone in this world

Part of the Linux Foundation, Apache 2 licensed
An RNN scaled to 14B params with GPT-level of perf

RWKV @RWKV_AI

2K Followers 3 Following AI model built by the community, for everyone in this world Part of the Linux Foundation, Apache 2 licensed An RNN scaled to 14B params with GPT-level of perf

qnguyen3 @stablequan

3K Followers 1K Following Multimodal | Synthetic Data | Multimodal Lead at Ontocord AI

Builds Attention-Free Transformer (https://t.co/YL7CbNYKBs) from scratch
- CEO @ https://t.co/kQHiGtzJWr

Also built k8s tools, uilicious & GPU.js (https://t.co/OIfnI1EPU7)

PicoCreator (🇸🇬.. @picocreator

2K Followers 164 Following Builds Attention-Free Transformer (https://t.co/YL7CbNYKBs) from scratch - CEO @ https://t.co/kQHiGtzJWr Also built k8s tools, uilicious & GPU.js (https://t.co/OIfnI1EPU7)

Prof. @Unige_en, Adjunct Prof. @EPFL_en, Research Fellow @idiap_ch, co-founder @nc_shape. AI and machine learning since 1994. I like reality.

François Fleuret @francoisfleuret

31K Followers 456 Following Prof. @Unige_en, Adjunct Prof. @EPFL_en, Research Fellow @idiap_ch, co-founder @nc_shape. AI and machine learning since 1994. I like reality.

Content Strategy & AI

@knutjaegersberg@sigmoid.social

https://t.co/xnBUK02hWS

Knut Jägersberg @JagersbergKnut

6K Followers 5K Following Content Strategy & AI @[email protected] https://t.co/xnBUK02hWS

fly51fly @fly51fly

5K Followers 2K Following BUPT prof | Sharing latest AI papers & insights | Join me in embracing the AI revolution! #MachineLearning #AI #Innovation

Crémieux @cremieuxrecueil

88K Followers 907 Following I write about genetics, 'metrics, and demographics. Read my long-form writing at https://t.co/8hgA4nNS2A.

Principal Applied AI Researcher @TensorWaveCloud
I make AI models Dolphin and Samantha
https://t.co/3ri2GbXrQB
BTC 3ENBV6zdwyqieAXzZP2i3EjeZtVwEmAuo4

Eric Hartford @erhartford

12K Followers 403 Following Principal Applied AI Researcher @TensorWaveCloud I make AI models Dolphin and Samantha https://t.co/3ri2GbXrQB BTC 3ENBV6zdwyqieAXzZP2i3EjeZtVwEmAuo4

Jon Durbin @jon_durbin

2K Followers 81 Following Human.

Engineer doing ML, Robotics, and more. ADHD, ASD, hearing impaired. Intel oneAPI Software Innovator. Direct Messages welcome.

Eric Hallahan @EricHallahan

1K Followers 52 Following Engineer doing ML, Robotics, and more. ADHD, ASD, hearing impaired. Intel oneAPI Software Innovator. Direct Messages welcome.

joy shapes @9146001e7d613c

132 Followers 82 Following flying in a hell town piss lodge

nothing left ⑨ @SUPATWINKBASHER

30 Followers 728 Following U+2468 fan | t***** f***** | 9front addict | building an angel from ewaste and 9c | sh, bash, awk, lua, some others but they're secret :)

private trans america.. @2T2Aprivate

23 Followers 31 Following

Hailey Schoelkopf @haileysch__

3K Followers 815 Following she/her | research scientist @aiEleuther | LLM training/infra, eval, data | LM Evaluation Harness maintainer

mephistoooOOHHHHHHSHI.. @karan4d

12K Followers 2K Following 𝒕𝘩𝘦 𝘴𝘪𝘮𝘶𝘭𝘢𝘵𝘰𝘳 𝘪𝘴 𝘢 𝘤𝘳𝘶𝘤𝘪𝘣𝘭𝘦 𝘧𝘰𝘳 𝘵𝘳𝘢𝘯𝘴𝘮𝘶𝘵𝘢𝘵𝘪𝘰𝘯 @NousResearch

Vexie Vortex @VexedVortices

236 Followers 541 Following Woke Mind Virion // panpsychism + tranquilism // stats, psych, gender, politics // she/they 24 // 18+ alt: @CafeineDaydream

パフェさん @perfectsunday2

17K Followers 848 Following 恋する水彩。たまに油彩。まとめ https://t.co/SOm2KG6Kft

猫乃なこ @necono_naco

3K Followers 47 Following 絵を描いています。ふんわりしたゆるいイラストが得意です。猫とヨーロッパと珈琲が好きです。お仕事のご依頼はこちらhttps://t.co/sa9fT77b3d ◎skeb：https://t.co/Hi9u0rF7SE

Jason Weston @jaseweston

14 hours ago

🚨 Iterative Reasoning Preference Optimization 🚨 - Iterative algorithm for reasoning tasks: generate pairs & apply DPO+NLL - Improves accuracy over iterations on GSM8K, MATH, ARC & beats baselines E.g. Llama2-70B GSM8K: 55.6%->81.6% (88.7% maj32) arxiv.org/abs/2404.19733 🧵(1/5)

1 42 245 53K 175

Download Image

Aran Komatsuzaki @arankomatsuzaki

12 hours ago

KAN: Kolmogorov–Arnold Networks Proposes an alternative to MLP that outperforms in terms of accuracy and interpretability arxiv.org/abs/2404.19756

5 68 372 30K 219

Download Image

ryu @ryu0000000001

a day ago

@Euclaise_ paper: arxiv.org/abs/2403.18506

0 0 1 5 0

Edoardo La Greca @edolg9

20 hours ago

Plan 9 users, it's our turn to get the fancy logo. github.com/SAWARATSUKI/Se…

1 0 4 73 0

autumn 💚🔎 ⏸️ @adrusi

a day ago

yeah. if youre writing a shell script it needs to be posix compatible

Ada Lündhé @sc_codeUM

2 days ago

Stop. Writing. Bash scripts.

524 37 1K 456K 160

2 0 12 642 0

anton @abacaj

a day ago

llama-3 models did very poorly on this benchmark, simply because their context length is *limited to 8k*. But... with zero-training (actually just a simple 2 line config) you can get 32k context out of llama-3 models with *exceptional* quality. llama-3 8B surpasses many models…

Jiawei Liu @JiaweiLiu_

5 days ago

We evaluated 26 models initially: 🏆 Claude/OpenAI/Gemini models are doing great in this task 💪 Mistral’s MoE models outperform GPT-3.5-Turbo 👍 The 7B CodeQwen beat many larger general & code-specific models Many models are good at Java but may need to learn more Rust and C++

10 22 133 232K 69

Download Image

18 39 534 133K 362

Download Image

Junyang Lin @JustinLin610

a day ago

suddenly surrounded by gpt2... then u tell me gpt2 is not gpt-2 but gpt-4.5? ridiculous world...

6 1 55 9K 2

BigCode @BigCodeProject

2 days ago

Releasing StarCoder2 Instruct! 🚀 Achieves 72% HumanEval score using only self-generated content without any GPT-3.5/4 data. This work demonstrates that self-instruct works already well at the 15B scale without data from proprietary models! Read more: huggingface.co/blog/sc2-instr…

4 76 296 35K 130

Download Image

Hieu Pham @hyhieu226

2 days ago

Well... two problems: (1) SIX best math students in the USA get to compete. (2) If I were an IMO judge, the solution would receive a 3 out of 7. A stricter judge might give a 2. A more generous judge might give a 4, but I would protest anything more than that. Context:…

Andrew Gao @itsandrewgao

2 days ago

uh.... gpt2-chatbot just solved an International Math Olympiad (IMO) problem in one-shot the IMO is insanely hard. only the FOUR best math students in the USA get to compete prompt + its thoughts 🧵

66 167 1K 672K 771

Download Image

4 6 125 75K 63

neural oscillator of uncertain significance @mycoliza

2 days ago

someday i would really love to see a @usgraphics take on a precision-engineered, high-legibility proportional typeface for typesetting one’s technical reports, engineering textbooks, and so on. like a “Boca Raton Serif”, maybe…

1 3 87 5K 17

guille @GuilleAngeris

2 days ago

today's fun math fact

144 438 5K 548K 1K

Download Image

Aella @Aella_Girl

2 days ago

@kareem_carr I'm pretty sure my own thinking is just pattern matching

5 0 17 879 0

Xeophon @TheXeophon

2 days ago

@Euclaise_ Now we wait for the arXiv paper in 7 months

1 0 6 222 0

Harrison Kinsley @Sentdex

2 days ago

I'm just sayin. They disappeared the same day

19 27 464 53K 86

Download Image

Salman @_sshahid_

2 days ago

@wireless_anon @dmvaldman Yeah the implied lower bound is enough parameters to achieve the same loss as gpt4, of course 1 parameter isn’t expressive enough to get that loss

0 0 4 71 0

GyuPyTer2 Meowbooks @untitled01ipynb

2 days ago

Sam Altman @sama

3 days ago

learning how to say something in 30 seconds that takes most people 5 minutes is a big unlock

1K 2K 26K 3.5M 4K

4 21 275 62K 17

Download Image

Omar Kamali @OmarKamali

3 days ago

@JustineTunney Misleading stat. 824 t/s to read the prompt, but only 18 t/s to generate a response. Roughly on par with a 3060 running Mistral 7B. Maybe you could upload a video of the model generating a response to feel what kind of performance we’re really looking at.

2 1 21 2K 1

xlr8harder @xlr8harder

2 days ago

@JustineTunney usually people talk about token generation speed, not prompt eval speed. this is confusing people!

2 0 18 751 0

Ilya Sutskever's hairline @IlyasHairline

3 days ago

This is prompt eval figure, not generation. P.s. this is second time I get tricked by this

Justine Tunney @JustineTunney

3 days ago

Mistral 7b going 824 tokens per second on CPU.

25 56 792 96K 204

Download Image

6 1 35 7K 5

samsja @samsja19

3 days ago

This is what I work on for the last 6 months. Paper has very nice insight. Lot of effort on the data engineering side, We had custom streaming library to be able to change the weight of each dataset we trained on on the fly ( multiplexing). No LLM company published info or…