I've spent thousands of dollars testing AI models on real-world tasks & inadvertently learned the nuances of the major model APIs.
Here's an unordered list of things I've stumbled upon via TaxCalcBench (testing models' ability to calculate tax returns) about the OpenAI,…
"AI isn't replacing radiologists" good article
Expectation: rapid progress in image recognition AI will delete radiology jobs (e.g. as famously predicted by Geoff Hinton now almost a decade ago). Reality: radiology is doing great and is growing.
There are a lot of imo naive…
Exciting product, and some unexpected data in the video about how reliance on tax professionals has increased in recent years, even as more filing software has hit the market.
DeepMind’s updated frontier safety framework. It includes a greater role for safety cases, a harmful manipulation critical capability level, and an adapted approach to misalignment and ML R&D capabilities.
I feel amusingly self-conscious speaking to AI Voice Mode out in public, imperiously asking it to summarize thousands of years of history and to stop asking follow-up questions. Sort of reminiscent of the 2009 era dads with the early bluetooth/blackberry get-up aggressively…
I’m not sure a single superpersuasive AI system will exist, or can exist. People build antibodies to extreme persuasion from a single source, in a similar way to how one might recognize the talent of an extremely good debater, while still being wary of their agenda.
To the…
Waymo is coming to SFO! The airport has approved a pilot permit to begin autonomous rides. This rollout will happen in phases—and we’ll keep you updated every step of the way until anyone can request a @Waymo ride right from @flySFO.
Frontier models were bad at recognizing and appraising antique Persian rugs for a very long time, but in recent months they have improved markedly at this.
266 Followers 823 Following - International Relations & AI. @GovAI_ Within DPhil @Politics_Oxford. Former @hdx. Not here often. Find me at https://t.co/iUck0PV8tQ
1K Followers 788 Following - Assistant Professor in Psychology at Stony Brook University. I’m interested in how people interact with LLMs and the impact they might have on our psychology.
1K Followers 1K Following - AI policy writer at Google DeepMind. Past: novels (“The Imperfectionists” & others); ghostwrote “We Are Bellingcat”; intl NY Times; the AP.
767 Followers 980 Following - Political theorist. Married to @diatkinson. Art by Hilma Af Klint. Recent paper on sexual media & consent: https://t.co/2vkYpqC6Yi
98 Followers 232 Following - Thinker | Coder | Recursing a lot on AI, humanity & the optimal | Driven by the goal of preventing AI catastrophic outcomes | https://t.co/9t3nCckqtx | https://t.co/3ur2uIZhXo
898 Followers 379 Following - 🇿🇦 || Computational Neuroscience || Fairness in AI || @UniofOxford | Internships: Google DeepMind | Microsoft Research || World Wide Dishes
4K Followers 2K Following - Director of https://t.co/gCEDoKdKBT at Uni of Cambridge | Researching Big Risks, and impacts of AI & emerging tech. Opinions own
52K Followers 64 Following - Student of mind and nature, libertarian, chess player, cancer survivor. @ Keen, UAlberta, Amii, https://t.co/u8za2Kod54, The Royal Society, Turing Award
167 Followers 360 Following - I help run the RAND Meselson Center (AI, cybersecurity, and biosecurity policy). Previously, self-driving cars (Aurora) and research centers in AI and bio.
22K Followers 321 Following - Globally ranked top 20 forecaster 🎯
AI is not a normal technology. I'm working at @IAPSai to shape AI for global prosperity and human freedom.
20K Followers 9K Following - Programme Director @ARIA_research | accelerate mathematical modelling with AI and categorical systems theory » build safe transformative AI » cancel heat death
3K Followers 75 Following - Co-founder of Writely (aka Google Docs) and 7 other startups. Now at the Golden Gate Institute for AI, working to bring AI’s toughest questions into focus.