Just Loki @LokiJulianus
Prince With a Thousand Enemies · Nod · Joined April 2016
Tweets: 68K · Followers: 66K · Following: 311 · Likes: 51K
The $15 “silver” chain race.
Oh this LLLMM? It’s just predicting the next language model
> "I think it's time to admit defeat" How often do you see LLMs capitulate instead of doubling down or gaslighting you? Sadly 8B Llama is struggling with The Diamond Problem (as do all <10B models that don't cheat egregiously), but its attitude sure is more human-like now. https://t.co/825CTekSv5
In fairness to the lisp machine people: most higher level languages cannot easily implement a full common lisp interpreter even today and not for lack of trying.
He’s changed his Instagram name to Zuck lol.
And just like that.
Yes, both the 8B and 70B are trained way beyond Chinchilla-optimal - but we can eat the training cost to save you inference cost! One of the most interesting things to me was how quickly the 8B was still improving even at 15T tokens.
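For a sense of how far beyond Chinchilla-optimal that is: the commonly cited rule of thumb from the Chinchilla paper is roughly 20 training tokens per parameter. A minimal back-of-the-envelope sketch (the 20:1 ratio is an approximation, and the 15T-token figure comes from the Llama 3 announcement):

```python
# Rough Chinchilla-optimal comparison for Llama 3's reported 15T-token run.
# The ~20 tokens/parameter ratio is the approximate compute-optimal rule
# of thumb from Hoffmann et al. (2022); treat these numbers as illustrative.

CHINCHILLA_TOKENS_PER_PARAM = 20

def chinchilla_optimal_tokens(n_params: float) -> float:
    """Approximate compute-optimal token budget for a model with n_params parameters."""
    return CHINCHILLA_TOKENS_PER_PARAM * n_params

trained_tokens = 15e12  # 15T tokens, per the Llama 3 release notes

for name, params in [("Llama-3-8B", 8e9), ("Llama-3-70B", 70e9)]:
    optimal = chinchilla_optimal_tokens(params)
    ratio = trained_tokens / optimal
    print(f"{name}: ~{optimal / 1e9:.0f}B tokens optimal, trained on ~{ratio:.0f}x that")
```

By this heuristic the 8B was trained on roughly 90x its compute-optimal budget, which is exactly the "eat the training cost to save inference cost" trade the tweet describes.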
Llama 3 70B never stopped learning. He says the only reason they stopped its training was that they eventually had to decide: 'Do we want to spend our GPUs on training the 70B model further, or should we start training what's next?' https://t.co/NohXjF2TaH
Mistral-7B is dead.
Llama 3 has been my focus since joining the Llama team last summer. Together, we've been tackling challenges across pre-training and human data, pre-training scaling, long context, post-training, and evaluations. It's been a rigorous yet thrilling journey: 🔹Our largest models…
Lads, are we finally free of the debt inherited by OAi's decision to not open source gpt-3...
Llama-3-70B is as good as or better than Sonnet but ~10x cheaper, about as cheap as Haiku. Llama has just demolished everything below gpt-4 level
Hearing feedback from the community about the adverse impacts of false refusals, we developed new mitigations to address this. Llama 3 70B exhibits less than a third of the false refusals of Llama 2 70B, making Llama 3 our most helpful model to date.
I wish Twitter had a way to send all the people calling me antisemitic for talking about Palestine to the same place as the people calling me a genocide supporter for saying that blocking traffic is a stupid tactic, so they could argue with one another instead of me.
Congrats to @AIatMeta on Llama 3 release!! 🎉 ai.meta.com/blog/meta-llam… Notes: Releasing 8B and 70B (both base and finetuned) models, strong-performing in their model class (but we'll see when the rankings come in @lmsysorg :)) 400B is still training, but already encroaching…
Llama 3 delivers a major leap over Llama 2 and demonstrates SOTA performance on a wide range of industry benchmarks. The models also achieve substantially reduced false refusal rates, improved alignment and increased diversity in model responses — in addition to improved…