Have an AI application you want to add logging to but don't know where to start?
Our engineer Alex walks through how to set up basic logging in Braintrust with an @OpenAI Agents SDK application:
You can now retroactively apply scorers to historical data to find issues you missed.
Manually select from your logs, apply a scorer while in a single log view, or bulk apply to the last 50 logs in your current view.
NEW Greylock Change Agents: Evaluating Agents with Braintrust
There's been a lot of discourse around evals recently, so it's timely to drop the recording of Change Agents with @ankrgyl on the topic of how he thinks about Evaluating Agents.
Timestamps:
(0:00) - Intro
(1:22) -…
“Diamond allows you to treat more code reviews as if they were the most important code review in the world.”
Today, Diamond runs on every Braintrust PR, enforcing quality with zero friction.
Read our case study with @braintrustdata and learn more about how we're helping…
How do you build reliable AI tools at scale?
@withgraphite 's solution: systemic evaluation.
See how Graphite said goodbye to ad-hoc manual testing and leverages Braintrust to ship features like their AI code reviewer, Diamond:
braintrust.dev/blog/graphite?…
11 Followers 49 FollowingThe DLS DApp & $DLS work together to secure legacy media, reward users, and enable private ownership on Web3 through blockchain technology.
2K Followers 903 FollowingFounder of ProSights (YC W24), AI finance automations trusted by over half of the 25 largest PE firms. Former IB/PE and @harvardswimdive
231 Followers 595 FollowingDeveloper, agile aficionado, cyclist and master pancake craftsman. Currently helping Square take advantage of cloud computing.
1K Followers 576 Following🥁 dans le duo https://t.co/De7vi2JjhP 🏍️ Consultant #seo au sein de la meilleure agence : Noiise #Nantes https://t.co/9B41Pfmv0E éditeur de sites minimum 92% webperf
128 Followers 1K FollowingInfoSec dude. Programmer. Problem solver. Runner. Gamer. Coffee addict. Purveyor of good beer. Chaotic good. Thoughts and opinions expressed are my own.
511 Followers 435 FollowingSmall cap investor. Looking for interesting companies with asymmetric upside potential.
You can read my write-ups here: ➡️ https://t.co/NUp9KsyBkq
136 Followers 949 Following28 | Tweets on joys of life, solopreneurship, product management, investing | days roll over into years and time flees, carpe diem
527 Followers 3K FollowingSoftware Professional. Tweets/retweets don't mean I endorse the views or opinions. Tweets/RTs only for sharing the views/opinions.
247K Followers 1K FollowingAt Greylock, we are the first partner to consumer and enterprise software entrepreneurs. Newsletter: https://t.co/4tHdH9xmvk
347K Followers 1K FollowingDeepMind Research Scientist. Opinions my own. Inventor of GANs. Lead author of https://t.co/M6vl8pEQ4I Founding chairman of @pubhealthaction
121K Followers 639 FollowingMila Scientific Director. Ex @Google DeepMind & Twitter Cortex. Father of 4. // Directeur scientifique à Mila. Ex @Google DeepMind & Twitter Cortex. Père de 4.
494K Followers 152 FollowingNobel Laureate. Co-Founder & CEO @GoogleDeepMind - working on AGI. Solving disease @IsomorphicLabs. Trying to understand the fundamental nature of reality.
1.4M Followers 1K FollowingBuilding @EurekaLabsAI. Previously Director of AI @ Tesla, founding team @ OpenAI, CS231n/PhD @ Stanford. I like to train large deep neural nets.
453K Followers 77 FollowingTensors and neural networks in Python with strong hardware acceleration. PyTorch is an open source project at the Linux Foundation. #PyTorchFoundation
1.3M Followers 1K FollowingCo-Founder of Coursera; Stanford CS adjunct faculty. Former head of Baidu AI Group/Google Brain. #ai #machinelearning, #deeplearning #MOOCs