Researcher focusing on LLMs: https://t.co/iVZDFdIQiE
Previously, dev tools and infra for ML. Ex @Github, @Airbnb, @DataRobot. @fastdotai core contributor.hamel.dev Portland, ORJoined September 2012
First, figure out WHAT the product is, WHAT problem it solves, WHY people want to use it. Get some traction & usage data.
Then, bring in ML folks who can help with HOW to measure & improve, based on the data.
Doing it reversed is like building solutions to nonexistent problems.
First, figure out WHAT the product is, WHAT problem it solves, WHY people want to use it. Get some traction & usage data.
Then, bring in ML folks who can help with HOW to measure & improve, based on the data.
Doing it reversed is like building solutions to nonexistent problems.
And ML folks don’t just train models. They’re also bring rigor, data driven analysis, best practices of how to use data and work with non-deterministic output, how to build products based on the above, etc
These are things that typical software engineers may not focus on.
And ML folks don’t just train models. They’re also bring rigor, data driven analysis, best practices of how to use data and work with non-deterministic output, how to build products based on the above, etc
These are things that typical software engineers may not focus on.
After 25 years of working in the ML space I agree that many early stage companies (Series A-ish and earlier) shouldn’t be hiring MLEs
However I think companies need a fractional MLE (.05) to help ensure the foundations get built properly for MLEs later on.
Data Eng,…
After 25 years of working in the ML space I agree that many early stage companies (Series A-ish and earlier) shouldn’t be hiring MLEs
However I think companies need a fractional MLE (.05) to help ensure the foundations get built properly for MLEs later on.
Data Eng,…
Berkeley Function Calling Leaderboard: Introducing Consistent 8 X V100 with pay-as-you-go pricing for measuring costs and latency.
In depth: We fix inconsistency in the cost and latency calculation for open-source models, which are now all calculated when serving the model with…
Berkeley Function Calling Leaderboard: Introducing Consistent 8 X V100 with pay-as-you-go pricing for measuring costs and latency.
In depth: We fix inconsistency in the cost and latency calculation for open-source models, which are now all calculated when serving the model with…
I made pdftext, a small tool that extracts text like pymupdf, but with an Apache license (mupdf is AGPL). It can pull out blocks and lines or plain text.
Find it here - github.com/VikParuchuri/p… .
This is the prompt they are using for Llama-3 function calling (seems to work well even though it's not specifically fine tuned for that): github.com/ShishirPatil/g…
This is the prompt they are using for Llama-3 function calling (seems to work well even though it's not specifically fine tuned for that): github.com/ShishirPatil/g… https://t.co/lNetEc4f8Y
Our first Build with Claude contest was a success! We received tons of great submissions from @AnthropicAI devs.
Here are the 5 winning projects (in no particular order)🧵
Jeremy is THE most talented infra/devops person I know. He’s created a specialized IDE + Copilot for DevOps : K8s, Cloud, etc - based on notebooks!
It’s open source, accompanied by a blog that shows his thinking 👇
Jeremy is THE most talented infra/devops person I know. He’s created a specialized IDE + Copilot for DevOps : K8s, Cloud, etc - based on notebooks!
It’s open source, accompanied by a blog that shows his thinking 👇
I'm up to 96k context for Llama 3 8B. Using PoSE, we did continued pre-training of the base model w 300M tokens to extend the context length to 64k. From there we increased the RoPE theta to further attempt to extend the context length.
🧵
Cool tool by @HamelHusain (and a primer on why declarative job execution, like with @BacalhauProject is so critical, even against black box models like LLMs!) - Debugging AI With Adversarial Validation bit.ly/3xOWy52
Don’t blindly base your decision on which LLM to use on broken benchmarks like MMLU...
If you are serious about choosing the right LLM for your use case, you NEED to create an eval of your own.
Let’s talk about how you can make one 🧵
x.com/nearcyan/statu…
Don’t blindly base your decision on which LLM to use on broken benchmarks like MMLU...
If you are serious about choosing the right LLM for your use case, you NEED to create an eval of your own.
Let’s talk about how you can make one 🧵
x.com/nearcyan/statu…
Does anyone have a reference implementation of function calling on llama 3 + vllm + outlines
Surely someone has an open example - its helpful to see how other people are doing it b/c there are lots of ways of accomplishing this
267K Followers 906 FollowingMachine learning & AI researcher writing at https://t.co/A0tXWzG1p5. LLM research engineer @LightningAI. Previously stats professor at UW-Madison.
25K Followers 554 FollowingResources to take your Machine Learning skills to the next level
🧪 Senior Data Scientist, RecSys @NVIDIAAI
🏫 @fastdotai trained DL Eng
📝 https://t.co/By87iXx5Pu
38K Followers 3K FollowingBuilding @modal_labs when I'm not posting bangers about data and software. Previously built the music rec sys at Spotify and ran the eng team at Better.
54K Followers 1K FollowingPhD at 19 |
Founder and CEO at @MedARC_AI |
Research Director at @StabilityAI |
@kaggle Notebooks GM |
Biomed. engineer @ 14 |
TEDx talk➡https://t.co/xPxwKTq6Qb
10K Followers 391 Following🤗 Technical Lead for the Accelerate Project | Passionate about Open Source | Nerd who enjoys touching the grass | #ADHD | He/Him
59K Followers 2K Following✨Keep it simple, make it scale. AI should be about empowering people, building understanding, & making dreams realities. 👩💻GenAI @GoogleDeepMind ex-@GitHub
47K Followers 1K FollowingCo-founder and CTO at @huggingface 🤗. ML/AI for everyone, building products to propel communities fwd. @Stanford + @Polytechnique
48K Followers 2K FollowingChief AI & Co-founder @AnacondaInc; invented @pyscript_dev, @PyData @Bokeh @Datashader. Former physicist. A student of the human condition. bsky: @wang.social
35K Followers 1K FollowingMachine learning and language models R&D. Builder. Writer. Visualizing AI, ML, and LLMs one concept at a time. @Cohere. https://t.co/TquuQXlLOJ
62K Followers 16K FollowingNewsletter exploring AI & ML
- Weekly trends
- LLM/FM insights
- Unicorn spotlights
- Global dynamics
- History
Led by @kseniase_
Elevate your AI game 👇🏼
86 Followers 138 FollowingBuilding an AI writing product at Instrumentl by day. Building tools to make working with AI easier at night. Starting to post more after a long time 😅
104 Followers 780 FollowingWeb, mobile & infrastructure technologies. Mixing business and tech since Debian 6. I dev cool things & manage (CTO) awesome ideas and people. Oh, and music.
286 Followers 1K FollowingI love science, entrepreneurship, and building things.
This is my playground to run little experiments and share my ideas, projects, and learnings.
125 Followers 606 Followinge/acc - I organise sand that reasons to complete work by manipulating intricately arranged pixels in a fabricated digital world.
267K Followers 906 FollowingMachine learning & AI researcher writing at https://t.co/A0tXWzG1p5. LLM research engineer @LightningAI. Previously stats professor at UW-Madison.
25K Followers 554 FollowingResources to take your Machine Learning skills to the next level
🧪 Senior Data Scientist, RecSys @NVIDIAAI
🏫 @fastdotai trained DL Eng
📝 https://t.co/By87iXx5Pu
38K Followers 3K FollowingBuilding @modal_labs when I'm not posting bangers about data and software. Previously built the music rec sys at Spotify and ran the eng team at Better.
186K Followers 877 FollowingCofounded and lead @PyTorch at Meta.
Also dabble in robotics at NYU.
AI is delicious when it is accessible and open-source.
379K Followers 77 FollowingTensors and neural networks in Python with strong hardware acceleration. PyTorch is an open source project at the Linux Foundation. #PyTorchFoundation
54K Followers 1K FollowingPhD at 19 |
Founder and CEO at @MedARC_AI |
Research Director at @StabilityAI |
@kaggle Notebooks GM |
Biomed. engineer @ 14 |
TEDx talk➡https://t.co/xPxwKTq6Qb
10K Followers 391 Following🤗 Technical Lead for the Accelerate Project | Passionate about Open Source | Nerd who enjoys touching the grass | #ADHD | He/Him
59K Followers 2K Following✨Keep it simple, make it scale. AI should be about empowering people, building understanding, & making dreams realities. 👩💻GenAI @GoogleDeepMind ex-@GitHub
3K Followers 345 FollowingCreator of the OpenWebText and OpenGPT2. @PyTorch Core Reviewer. PhD Student at @Cornell (interning at @MosaicML) Previously at @FacebookAI and @BrownUniversity
664 Followers 527 FollowingStatistician. Creator of Chrome extensions "XCoach" for timed @X sessions and daily stats, and "TextLinks" for mnemonic URL bar quick links. Free in the store.
714 Followers 342 Followingbuilding @circlebackai (yc w24). previously built things @stripe and @twitter, a campervan, https://t.co/IaKhUNj4yx, watchai.
2K Followers 981 FollowingCo-Founder at Phonic. Previously @Stanford CS PhD Dropout, @MosaicML, CS @MIT. I tend to be wrong, but the learning process makes it enjoyable. 🇵🇰🇺🇲
2K Followers 651 FollowingA 501(c)(3) shared community space promoting and encouraging technical, scientific and artistic skills through individual projects, collaboration and education.
458 Followers 1K FollowingPast: Data+ML @lyft and Slew of Things in 🇺🇸🇮🇱🇨🇳. Alum @wharton @HopkinsEngineer @Yale Please consider buying my data kthx @procurefyi @frontier_optic
3K Followers 28 FollowingProduct at Prefect, building Marvin. Former CTO Openrole. Former Head of DS @ Insight Data Science. Math PhD @ UCLA, recovering academic.
11K Followers 347 FollowingAt CodiumAI, we change how developers test and analyze their code, by providing AI-powered interactive code integrity tools.