If you truly want to learn RL, ditch the readymade gym environments.
Make a custom environment on your own. You’ll understand how to structure rewards, observations, random initialization states, etc. also, how to debug and render. This is the most practical skill you can get…
Apple did more for AI than anyone else: they proved through peer-reviewed publications that LLMs are just neural networks and, as such, have all the limitations of other neural networks trained in a supervised way, which I and a few other voices tried to convey, but the noise…
My most elitist opinion is that I don’t care about anyone’s takes if they can’t do calculus and linear algebra. Math is the most abstract, purified form of thinking. If you can’t do that, you definitely can’t do the harder and messier thinking needed for politics, philosophy, etc
MUST WATCH: in Paris, JD Vance just delivered one of the most morally clear, pro-American, and courageous speeches you will see all about America’s global leadership on AI. 15 minutes of pure FIRE 🔥
ELON: EUROPE HAS WAY MORE BUREAUCRACY THAN THE U.S.
"You don't just have the provincial and national level, you also have the EU on top of that.
To be totally frank, EU headquarters in Brussels is essentially a cathedral to bureaucracy."
Source: 2025 WELT Economic Summit…
ELON: EUROPE HAS WAY MORE BUREAUCRACY THAN THE U.S.
"You don't just have the provincial and national level, you also have the EU on top of that.
To be totally frank, EU headquarters in Brussels is essentially a cathedral to bureaucracy."
Source: 2025 WELT Economic Summit… https://t.co/O0mhkJXlvk
> Western AI labs deliberately hide technical details of their training runs
> Chinese AI lab publishes detailed technical report of their training run
> Mainstream media doesn’t know how to read a technical report
> Nvidia loses 600 billion dollar
Mini-R1: Reproduce @deepseek_ai R1 „aha moment“ a RL tutorial! Recreate an RL "aha moment" using Group Relative Policy Optimization (GRPO) and train an open model using reinforcement learning to teach it self-verification and search abilities all on its own to solve the Countdown…
"Thinking + Long Context Reasoning" is incredibly powerful.
I gave Gemini 2.0 Flash Thinking Experimental 01-21 Model 2 documents: NeurIPS 2022 "chain of thought" and ACL 2022 "reframing instructions" papers with the prompt: "Combine ideas of these 2 papers and create a new…
8K Followers 9K FollowingGERMANY FIRST - only when everything works for us will we feed the rest of the world!
Loosely based on Matthew 5:3-27: Blessed are the spiritually poor far-left
184 Followers 2K FollowingTeam leader. Senior SW architect and developer. I retweet very selected content about IT, social, zen and fun. #Linux #Java #Kotlin #Solidity #Crypto
6K Followers 1K Followingbuilding the post-IDE IDE at https://t.co/hDpglja33W - coined “context engineering”, prev @replicatedhq @SproutSocial - ai that works pod @ https://t.co/69BhaNtWfd
36K Followers 968 FollowingAuthor of https://t.co/arW0hnVET0 and https://t.co/RN9xXOzhON. @sourcegraph working on @ampcode. Ex-@zeddotdev. Programming where the rubber hits the road.
17K Followers 105 FollowingI build sane open-source RL tools. MIT PhD, creator of Neural MMO and founder of PufferAI. DM for business: non-LLM sim engineering, RL R&D, infra & support.
11K Followers 50 FollowingAn open-source declarative framework for building modular AI software. Programming—not prompting—LLMs via higher-level abstractions & optimizers.
31K Followers 6 FollowingA network of engineers enhanced by and building with AI.
Organizers of the AI Engineer Summit, AI Engineer World's Fair, and AI Engineer Europe.
36K Followers 5K FollowingExperienced Data Science Leader | PhD in Machine Learning | 4x Author | Black Belt 🥋 in Time Series | Chief Conformal Prediction Promoter| Mathematician |
3K Followers 3 Followingthe command-line interface (CLI) tool that brings AI assistance directly into your development workflow.
CA: Fc7tEqyfHPoWQXdiAqx62d7WeuH7Zq1DHwa2ihDpump
17 Followers 110 FollowingI'm a freelance software engineer specializing in Angular. Created applications used by millions, authored the book Effective Angular, and speek at conferences.
28K Followers 1K FollowingResearch at @GoogleDeepMind. Controllable World Simulators (GNNs, Structured World Models, Neural Assets). Veo Team (Ingredients to Video Co-Lead)
42K Followers 1K FollowingSecular Bayesian.
Professor of Machine Learning @Cambridge_CL. Talent aficionado at https://t.co/RbJkoLguey
Alum of @Twitter, Magic Pony and @Balderton
19K Followers 20 FollowingA high-throughput and memory-efficient inference and serving engine for LLMs. Join https://t.co/lxJ0SfX5pJ to discuss together with the community!