In agent demos, everything’s smooth.
In prod? You get messy inputs, long chains, weird edge cases — that’s when things snap.
We treat agents like code → write scenario tests first, simulate full workflows, then iterate until green. Think TDD, but for LLMs.
More on how we do it…
“Do I really need evals?”
The real q: how do you know your AI agents will behave in prod?
Prototypes don’t need them. Scaling products do.
That’s why we built Agent Simulations; Unit tests for AI.
The only way to know if you can ship reliably.
OSS: github.com/langwatch/scen…
We’re hosting a Meetup in our office in Amsterdam on Sept 18 all about agentic AI. 👀 👀 👀
Talks from:
• @_rchaves_ (CTO, LangWatch) → Beyond Unit Tests: why agent simulations are redefining AI agent testing.
• Deepak Grewal (Kong) → Agentic AI -> powering the next wave…
In Amsterdam and want to spend an evening networking and learning all about agentic AI? Come to our @Meetup with @LangWatchAI on September 18th!
RSVP to save your spot > bit.ly/3JyBwgY
The gap between model release hype and production reality is always bigger than it looks.
OpenAI’s new GPT-5 headlines focus on the measurable: fewer hallucinations, better reasoning, faster responses. All great gains.
But the real story? How it works in your workflows, with…
First impressions of Grok 4
✅ it passes all the Scenario agent simulation tests on the 13 different agent frameworks in create-agent-app
❌ probably because of the reasoning, but facing quite high latency using it as an agent
🤔 on our vibe coding test, the website it designs…
Now you can ship AI agents faster with developer-first testing. LangWatch Scenario allows you to test your agents like you test your code.
That’s because:
❌Manual testing doesn't scale.
❌"Vibe checking" isn't systematic.
❌Hope isn't a strategy.
That's why we’re building…
99 Followers 371 FollowingEl Real Madrid ayer, hoy y siempre.
Las manos de los periodistas FUERA del club.
Ganar 3 champions seguidas está chupado, como todo el mundo sabe.
2K Followers 3K FollowingInterested in Software Development, AI, ML, Web, Blockchain. Opensource and Decentralization fan. Believer in lifelong learning. Now deep diving into agentic AI
344 Followers 5K FollowingNLP Engineer working on Indic languages | Earlier taught Physics for IIT-JEE | Alumni @iitdelhi | Self-taught programmer | Fascinated by the field of Languages
88 Followers 879 FollowingDirector - Tech Leader | Product Development and Delivery | Hands on | Healthcare | Strategy, Governance and Compliance | AI-First approach and data analytics
496 Followers 1K FollowingIm a curious, a passionate dreamer, a perfectionist human being with PGP key id 29FB71DF. Currently managing director at Barista Ventures
849 Followers 168 FollowingExecuteAutomation helps people to understand software, automation, AI, cloud, testing, & more..
Available on YouTube, Medium & Udemy. Teaching over 350,000+
57K Followers 160 Following1 in 50 developers worldwide is staying up to date with https://t.co/X5nzZaiIQ5. Being part of the other 49 might sound cool, but it’s not. Check it out 👇
3K Followers 425 FollowingSimplifying LLMs, Al Agents, RAGs and ML for you! • Sr. Data Scientist • A Decade of Experience • Top 1% @ Topmate • Creator of AwesomeNeuron
5K Followers 1K FollowingAuthor of Developer Marketing Does Not Exist. I help dev-focused marketers build a content strategy to reach more developers. Previously @zapier, @sendgrid
528K Followers 881 FollowingI run a portfolio of internet companies and host @startupideaspod. CEO: @latecheckoutplz we build companies like @ideabrowser, @meetLCA, @boringmarketer etc
2K Followers 881 FollowingMCP (Makes Context Perfect)
Building personal superintelligence without Mark 🫡🫡🤖
Most popular project: https://t.co/g9uOGSZhhQ
Now building cool stuff with @p0
193K Followers 107 FollowingWe're sharing/showcasing best of @github projects/repos. Follow to stay in loop. Promoting Open-Source Contributions. UNOFFICIAL, but followed by github
4K Followers 3K FollowingCo-Founder and CEO of @weaviate_io. I 😍 all things related to tech, machine learning, digital business, open-source, fashion, and music