Zack Li-Nexa AI @zacklearner

Co-founder and CTO at Nexa AI, Industrial Veteran from Google & Amazon, and Stanford alumni. Committed to lifelong learning and advancing AI technology. Joined October 2021

Tweets

154
Followers

334
Following

336
Likes

178

Zack Li-Nexa AI @zacklearner

14 hours ago

Now with CPU, GPU, and Snapdragon NPU support in one unified architecture—packed into a lightweight 60MB installer. No more juggling installers, APIs, or backend-specific builds. ⭐ If Nexa SDK helps you, give us a star: GitHub: github.com/NexaAI/nexa-sdk Blog:…

NEXA AI @nexa_ai

15 hours ago

1 1 6 59 0

Download Video

0 0 0 3 0

Zack Li-Nexa AI @zacklearner

2 days ago

Try Nexa SDK server for Mac CPU & GPU, our NPU support is coming soon!

NEXA AI @nexa_ai

2 days ago

Try Nexa SDK server for Mac CPU & GPU, our NPU support is coming soon!

1 1 13 238 1

Download Video

0 0 0 41 0

Qualcomm @Qualcomm

5 days ago

On-device #AI is accelerating fast. With @nexa_ai, we’re tapping OmniNeural 4B and NexaML Engine directly into our Qualcomm Hexagon NPU, bringing scalable, multimodal intelligence to mobile, IoT & beyond. Learn more: bit.ly/3VLfSsq

5 14 48 8K 5

Qualcomm @Qualcomm

4 days ago

This week in #AI 🔵 Qualcomm and @nexa_ai bring multimodal on-device AI to phones, cars, PCs, and more powered by Qualcomm Hexagon NPU: bit.ly/3VLfSsq 🔵 @TheRegister spoke with Qualcomm VP Upendra Kulkarni about how #SnapdragonXSeries is driving a shift in personal…

6 13 22 3K 4

Download Image

Zack Li-Nexa AI @zacklearner

3 days ago

Nexa AI's Hyperlink product turns local AI models into real productivity tools—pick from Hugging Face, point them at your folders, and get insights in each model’s unique voice. Check below video : Qwen3-1.7B for speed + clarity, and GPT-OSS for deep, rigorous reasoning.

NEXA AI @nexa_ai

3 days ago

2 2 9 493 3

Download Video

0 1 3 191 0

Zack Li-Nexa AI @zacklearner

4 days ago

🚀 Nexa SDK now lets you host a local multimodal AI inference server — right on your device. 🔹 Ecosystem support • GGUF — compact, quantized for efficient local inference • MLX — lightweight, optimized for Apple Silicon 🔹 Platform support • CPU & GPU — run GGUF + MLX models…

NEXA AI @nexa_ai

4 days ago

1 7 42 19K 2

Download Video

0 0 0 65 0

Zack Li-Nexa AI @zacklearner

5 days ago

🚀 Excited to share that Nexa AI’s OmniNeural model and NexaML Engine have been officially featured by Qualcomm on their blog and social channels! 1. OmniNeural-4B — the world’s first truly NPU-native multimodal large model, enabling AI agents to run directly on-device without…

Qualcomm @Qualcomm

5 days ago

5 14 48 8K 5

0 0 2 80 0

NEXA AI @nexa_ai

6 days ago

💻 AIPC just leveled up. Ever waste hours hunting for info or files you know you saved? Now imagine asking your PC like ChatGPT — and getting the answer in seconds. Meet Hyperlink by Nexa AI. A ChatGPT-grade agent for your files — fully on your device. 100% private. Hyperlink…

5 7 37 55K 5

Download Video

Zack Li-Nexa AI @zacklearner

7 days ago

You asked, we listened and iterated our product. One of the most requested features for Nexa SDK is here: native Python bindings. Now you can: - Run LLMs, VLMs, ASR & TTS from Python - Use Hugging Face models (GGUF, MLX) - Integrate seamlessly into your own workflows Any model.…

NEXA AI @nexa_ai

7 days ago

1 1 9 377 1

Download Image

0 0 0 60 0

Zack Li-Nexa AI @zacklearner

a week ago

Until now, runtimes like llama.cpp and Ollama were text-only. Nexa SDK is the first edge engine to deliver full Gemma-3n multimodal inference — with multiple image inputs, fully local on Windows GPU. In just one line of code, you can unlock true multimodal experiences at the…

NEXA AI @nexa_ai

a week ago

1 5 15 2K 4

Download Video

0 0 1 94 0

Zack Li-Nexa AI @zacklearner

a week ago

Run llama3.2-3B on Snapdragon NPU with 2X performance vs others, in one line of code: sdk.nexa.ai/model/Llama3.2… nexa infer NexaAI/Llama3.2-3B-NPU-Turbo

NEXA AI @nexa_ai

a week ago

Run llama3.2-3B on Snapdragon NPU with 2X performance vs others, in one line of code: sdk.nexa.ai/model/Llama3.2… nexa infer NexaAI/Llama3.2-3B-NPU-Turbo

3 4 13 8K 5

Download Video

0 0 1 40 0

Zack Li-Nexa AI @zacklearner

2 weeks ago

Most teams trying to get their models onto NPUs today face the same roadblocks: ❌ Conversions have to be done case by case, with tons of manual work ❌ Support is usually limited to only a few model families (Llama, Qwen, etc.) That’s the pain point we set out to solve. With…

NEXA AI @nexa_ai

2 weeks ago

1 3 7 428 1

0 0 3 48 0

Zack Li-Nexa AI @zacklearner

3 weeks ago

We have successfully enabled multimodal (image) support for Gemma-3n in Nexa SDK — a highly requested capability that is currently not available in llama.cpp or Ollama so far, where Gemma-3n remains text-only. @GoogleDeepMind @Google #AI #Multimodal #Gemma3n #Innovation

NEXA AI @nexa_ai

3 weeks ago

1 2 11 304 0

Download Video

0 0 0 21 0

Zack Li-Nexa AI @zacklearner

3 weeks ago

Run SDXL image generation model in your laptop with one line of code, try Nexa SDK

0 0 0 28 0

Zack Li-Nexa AI @zacklearner

3 weeks ago

Try Nexa SDK with latest Qwen model: github.com/NexaAI/nexa-sdk

Qwen @Alibaba_Qwen

3 weeks ago

Try Nexa SDK with latest Qwen model: github.com/NexaAI/nexa-sdk

25 37 346 36K 58

0 0 0 27 0

Zack Li-Nexa AI @zacklearner

3 weeks ago

🚀 We’ve launched OmniNeural — the world’s first multimodal AI model optimized for NPU. With Nexa SDK, devs can run NPU models in just 1 line of code. 🧵 1️⃣ AI Action Assistant Voice-driven control that actually does things: call, text, email — instantly, offline, and private.…