Co-founder and CTO at Nexa AI, Industrial Veteran from Google & Amazon, and Stanford alumni. Committed to lifelong learning and advancing AI technology.Joined October 2021
Now with CPU, GPU, and Snapdragon NPU support in one unified architecture—packed into a lightweight 60MB installer. No more juggling installers, APIs, or backend-specific builds.
⭐ If Nexa SDK helps you, give us a star:
GitHub: github.com/NexaAI/nexa-sdk
Blog:…
Now with CPU, GPU, and Snapdragon NPU support in one unified architecture—packed into a lightweight 60MB installer. No more juggling installers, APIs, or backend-specific builds.
⭐ If Nexa SDK helps you, give us a star:
GitHub: github.com/NexaAI/nexa-sdk
Blog:…
This week in #AI
🔵 Qualcomm and @nexa_ai bring multimodal on-device AI to phones, cars, PCs, and more powered by Qualcomm Hexagon NPU: bit.ly/3VLfSsq
🔵 @TheRegister spoke with Qualcomm VP Upendra Kulkarni about how #SnapdragonXSeries is driving a shift in personal…
Nexa AI's Hyperlink product turns local AI models into real productivity tools—pick from Hugging Face, point them at your folders, and get insights in each model’s unique voice. Check below video : Qwen3-1.7B for speed + clarity, and GPT-OSS for deep, rigorous reasoning.
Nexa AI's Hyperlink product turns local AI models into real productivity tools—pick from Hugging Face, point them at your folders, and get insights in each model’s unique voice. Check below video : Qwen3-1.7B for speed + clarity, and GPT-OSS for deep, rigorous reasoning.
🚀 Nexa SDK now lets you host a local multimodal AI inference server — right on your device.
🔹 Ecosystem support
• GGUF — compact, quantized for efficient local inference
• MLX — lightweight, optimized for Apple Silicon
🔹 Platform support
• CPU & GPU — run GGUF + MLX models…
🚀 Nexa SDK now lets you host a local multimodal AI inference server — right on your device.
🔹 Ecosystem support
• GGUF — compact, quantized for efficient local inference
• MLX — lightweight, optimized for Apple Silicon
🔹 Platform support
• CPU & GPU — run GGUF + MLX models…
🚀 Excited to share that Nexa AI’s OmniNeural model and NexaML Engine have been officially featured by Qualcomm on their blog and social channels!
1. OmniNeural-4B — the world’s first truly NPU-native multimodal large model, enabling AI agents to run directly on-device without…
🚀 Excited to share that Nexa AI’s OmniNeural model and NexaML Engine have been officially featured by Qualcomm on their blog and social channels!
1. OmniNeural-4B — the world’s first truly NPU-native multimodal large model, enabling AI agents to run directly on-device without…
💻 AIPC just leveled up.
Ever waste hours hunting for info or files you know you saved?
Now imagine asking your PC like ChatGPT — and getting the answer in seconds.
Meet Hyperlink by Nexa AI. A ChatGPT-grade agent for your files — fully on your device. 100% private.
Hyperlink…
You asked, we listened and iterated our product.
One of the most requested features for Nexa SDK is here: native Python bindings.
Now you can:
- Run LLMs, VLMs, ASR & TTS from Python
- Use Hugging Face models (GGUF, MLX)
- Integrate seamlessly into your own workflows
Any model.…
You asked, we listened and iterated our product.
One of the most requested features for Nexa SDK is here: native Python bindings.
Now you can:
- Run LLMs, VLMs, ASR & TTS from Python
- Use Hugging Face models (GGUF, MLX)
- Integrate seamlessly into your own workflows
Any model.…
Until now, runtimes like llama.cpp and Ollama were text-only. Nexa SDK is the first edge engine to deliver full Gemma-3n multimodal inference — with multiple image inputs, fully local on Windows GPU. In just one line of code, you can unlock true multimodal experiences at the…
Until now, runtimes like llama.cpp and Ollama were text-only. Nexa SDK is the first edge engine to deliver full Gemma-3n multimodal inference — with multiple image inputs, fully local on Windows GPU. In just one line of code, you can unlock true multimodal experiences at the…
Run llama3.2-3B on Snapdragon NPU with 2X performance vs others, in one line of code:
sdk.nexa.ai/model/Llama3.2…
nexa infer NexaAI/Llama3.2-3B-NPU-Turbo
Run llama3.2-3B on Snapdragon NPU with 2X performance vs others, in one line of code:
sdk.nexa.ai/model/Llama3.2…
nexa infer NexaAI/Llama3.2-3B-NPU-Turbo
Most teams trying to get their models onto NPUs today face the same roadblocks:
❌ Conversions have to be done case by case, with tons of manual work
❌ Support is usually limited to only a few model families (Llama, Qwen, etc.)
That’s the pain point we set out to solve.
With…
Most teams trying to get their models onto NPUs today face the same roadblocks:
❌ Conversions have to be done case by case, with tons of manual work
❌ Support is usually limited to only a few model families (Llama, Qwen, etc.)
That’s the pain point we set out to solve.
With…
We have successfully enabled multimodal (image) support for Gemma-3n in Nexa SDK — a highly requested capability that is currently not available in llama.cpp or Ollama so far, where Gemma-3n remains text-only. @GoogleDeepMind @Google#AI#Multimodal#Gemma3n#Innovation
We have successfully enabled multimodal (image) support for Gemma-3n in Nexa SDK — a highly requested capability that is currently not available in llama.cpp or Ollama so far, where Gemma-3n remains text-only. @GoogleDeepMind @Google#AI#Multimodal#Gemma3n#Innovation
🚀 We’ve launched OmniNeural — the world’s first multimodal AI model optimized for NPU. With Nexa SDK, devs can run NPU models in just 1 line of code. 🧵
1️⃣ AI Action Assistant
Voice-driven control that actually does things: call, text, email — instantly, offline, and private.…
10 Followers 66 FollowingDigital Coin Bank mints coin NFT's that are an artistic representation of the digital currency focusing on the numizmatic beauty aspect of the coins
3 Followers 298 FollowingThe latest news and more from Rolling Stone magazine and https://t.co/qu7A98O95S. Got a tip? Share it here: https://t.co/Vh2uz40Dv1
9K Followers 6K FollowingProduct Lead @Firebase (Serverless & AI). Working all day, testing AI tools and models all night.
Prev: Microsoft. Opinions are my own
693 Followers 65 FollowingRun AI models like Llama, Gemma, and more on your iPhone and iPad. Offline. No login. No data collection. Powered by Apple MLX.
452K Followers 77 FollowingTensors and neural networks in Python with strong hardware acceleration. PyTorch is an open source project at the Linux Foundation. #PyTorchFoundation
1.2M Followers 279 FollowingWe’re a team of scientists, engineers, ethicists and more, committed to solving intelligence, to advance science and benefit humanity.
18K Followers 1K FollowingVP of DevRel for @GitHub. Previously Executive Director @dotnetfdn and original creator of the @Microsoft org on @GitHub (he/him)
63K Followers 458 FollowingFreelance marketer building TechnoBizzVault, where I help professionals discover modern tools so they can be productive without burning out.
330K Followers 4K FollowingCo-founder of Tiny w/ @_Sparling_. We own @Dribbble, @Serato, @Letterboxd, @AeroPress, and 35+ other wonderful companies. Author of Never Enough.
784 Followers 207 FollowingRobotics Engineer/Founder. Into Multimodal GenAI. Was previously at @microsoft @paypal @jibo @MistyRobotics. Hit me up for a pickleball or squash game anytime!
13K Followers 2K FollowingBuilding Digital PR backlinks for SaaS and other SEO clients
📰 Featured on Forbes, BusinessInsider, Verge, DigitalTrends...
Need backlinks? 💬 DM me!
.
56K Followers 854 FollowingFiguring out AI @allen_ai, open models, RLHF, fine-tuning, etc
Contact via email.
Writes @interconnectsai
Wrote The RLHF Book
Mountain runner
281K Followers 2K FollowingSenior Editor @verge ║ Sign up to Notepad, my weekly newsletter on Microsoft's big bets at https://t.co/KqkAib2CKP ║ Tips? Msg on Signal app: tomwarren.01
9K Followers 6K FollowingProduct Lead @Firebase (Serverless & AI). Working all day, testing AI tools and models all night.
Prev: Microsoft. Opinions are my own
15K Followers 841 FollowingQualcomm Ventures is the venture investment arm of Qualcomm, Inc. Investing in cutting edge, innovative #startups.
San Diego, CA
51K Followers 915 FollowingPresident and CEO of @Qualcomm. Husband and proud father. I share my views about wireless and related technologies. Opinions are mine.
7K Followers 698 FollowingMarketing leader & @Qualcomm CMO telling stories about the next-gen tech that’s changing the world. Sports, pop culture, and @Snapdragon fan. Opinions are mine.
2K Followers 245 FollowingChief Commercial Officer @Qualcomm. Passionate about how connected tech will transform our lives. Music and music tech aficionado. Tweets are my own.
No recent Favorites. New Favorites will appear here.