vespa.ai @vespaengine

https://t.co/abkb8IjPSH - the open source platform for combining data and AI, online. Vectors/tensors, full-text, structured data; ML model inference at scale. vespa.ai Joined September 2017

Tweets

416
Followers

3K
Following

5
Likes

547

vespa.ai @vespaengine

18 hours ago

Having trouble keeping up? Guidebook to the State-of-the-Art Embeddings and Information Retrieval by @aapo_tanskanen at @thoughworks is out today - a great resource to get up to date. linkedin.com/pulse/guideboo…

0 6 19 1K 12

Download Image

Adam Hevenor @aHev

a week ago

This is what customer obsession looks like. Props to @vespaengine team for promoting what is effectively 40x cheaper for the user.

Jo Kristian Bergum @jobergum

a week ago

This is what customer obsession looks like. Props to @vespaengine team for promoting what is effectively 40x cheaper for the user.

1 0 11 3K 3

Download Image

2 4 20 3K 4

vespa.ai @vespaengine

a week ago

If you are using vector embeddings, reading this post might be the most profitable ten minutes you'll ever spend.

Jo Kristian Bergum @jobergum

a week ago

If you are using vector embeddings, reading this post might be the most profitable ten minutes you'll ever spend.

1 17 75 10K 41

0 1 9 1K 7

Jo Kristian Bergum @jobergum

a week ago

Matryoshka 🤝 Binary vectors: Slash vector search costs with Vespa We announce support for combining matryoshka and binary quantization in Vespa’s native hugging-face embedder and discuss how this slashes vector search costs. blog.vespa.ai/combining-matr…

1 17 75 10K 41

vespa.ai @vespaengine

2 weeks ago

The latest Vespa newsletter is here to help you stay up to date on what's happening on the leading edge in RAG, IR and vector search: - A new SPLADE embedder - ONNX models with float16 - @cohere embedding model guides - Support for an array of chunks with ColBERT - And list of…

0 3 18 3K 3

Jo Kristian Bergum @jobergum

3 weeks ago

If you are in Paris tonight, you should check out this meetup with @kraune from @vespaengine aicamp.ai/event/eventdet…

1 2 4 2K 0

vespa.ai @vespaengine

3 weeks ago

Seems everybody is migrating their search and recommendation systems from Elastic to Vespa now. Here's the experience of Stanby, Japan's leading job search site: blog.vespa.ai/migrating-to-t…

0 5 16 1K 3

Jo Kristian Bergum @jobergum

a month ago

Happy binary text embedding week! Created a quick notebook demonstrating: - Using @mixedbreadai embed-large-v1 model with the new sentence-transformer API for - How to index binary embeddings with HNSW indexing in @vespaengine - float-binary re-ranking! pyvespa.readthedocs.io/en/latest/exam…

tomaarsen @tomaarsen

a month ago

5 92 372 43K 248

0 11 57 7K 29

Download Image

Jo Kristian Bergum @jobergum

a month ago

A new Vespa sample app is out, featuring the brand new native Vespa splade embedder. Thank you for open-sourcing the sparse encoder model @prithivida and to @NirantK for uploading to HF! github.com/vespa-engine/s… search.vespa.ai/search?query=W…

1 9 28 2K 11

Download Image

Jo Kristian Bergum @jobergum

a month ago

This is an example of a phased coarse-to-fine ranking pipeline. This you can only do with @vespaengine. Nils also touches on the limitations of default provisioning settings of EBS volumes. An insightful thread on both performance and storage-tier economics for vector search.

Nils Reimers @Nils_Reimers

a month ago

2 6 34 8K 19

Download Image

0 4 20 4K 15

Download Image

Jo Kristian Bergum @jobergum

a month ago

Binary embeddings from @cohere with @vespaengine! - HNSW index with hamming distance over 1024 bits! - Re-ranking with the dot product between full query vector (1024 floats) against an unpacked float version of the binary embedding. Notebook: pyvespa.readthedocs.io/en/latest/exam…

4 15 84 6K 41

Download Image

vespa.ai @vespaengine

2 months ago

When GigaOm named Vespa Leader in their Sonar for Vector Databases, one of the categories where we scored Excellent were Embedding Flexibility - why? Vespa lets you create embeddings in four ways: - On your own, outside Vespa: Just pass tensors directly in documents and…

1 2 11 1K 1

Jo Kristian Bergum @jobergum

2 months ago

Talking about single-vector databases, for the 200K long-document MLDR dataset we store 614M vectors on a single node for late context level interaction or late cross-context interaction. Tensors is the way blog.vespa.ai/announcing-lon…

4 11 66 16K 44

Download Image

Distinguished Engineer @vespaengine. Tweets about Vespa, search, recommendation, ranking, and IR. CET.
#StandWithUkraine 💙💛

Jo Kristian Bergum @jobergum

9K Followers 814 Following Distinguished Engineer @vespaengine. Tweets about Vespa, search, recommendation, ranking, and IR. CET. #StandWithUkraine 💙💛

Nils Reimers @Nils_Reimers

10K Followers 434 Following Director of Machine Learning @Cohere | ex-huggingface | Creator of SBERT (https://t.co/MKKOMfuQ4C)

Jeremy Howard @jeremyphoward

222K Followers 5K Following 🇦🇺 Co-founder: @AnswerDotAI & @FastDotAI ; Hon Professor: @UQSchoolITEE ; Digital Fellow: @Stanford

Machine learning and language models R&D. Builder. Writer. Visualizing AI, ML, and LLMs one concept at a time. @Cohere. https://t.co/TquuQXlLOJ

Jay Alammar @JayAlammar

35K Followers 1K Following Machine learning and language models R&D. Builder. Writer. Visualizing AI, ML, and LLMs one concept at a time. @Cohere. https://t.co/TquuQXlLOJ

I profess CS-ly at the @UWaterloo and gaze into the technological crystal ball at @Primal. I used to write code for @Twitter and slides for @Cloudera.

Jimmy Lin @lintool

13K Followers 842 Following I profess CS-ly at the @UWaterloo and gaze into the technological crystal ball at @Primal. I used to write code for @Twitter and slides for @Cloudera.

Managing Consultant at OpenSource Connections, helping you build amazing AI & search applications. Also hachyderm dot io slash @flaxsearch

Charlie Hull @FlaxSearch

2K Followers 860 Following Managing Consultant at OpenSource Connections, helping you build amazing AI & search applications. Also hachyderm dot io slash @flaxsearch

Doug Turnbull @softwaredoug

3K Followers 754 Following Search @Reddit; ex @Shopify & @o19s; Books: Relevant Search & AI Powered Search

🥑 DevRel @Streamlit @SnowflakeDB
🪶 𝕏 about #AI, #LLMs, #DataScience, #WebApps, #SEO
💕 My heart is open source
🌍 Nature Lover
👀 My views!

Charly Wargnier @DataChaz

112K Followers 31K Following 🥑 DevRel @Streamlit @SnowflakeDB 🪶 𝕏 about #AI, #LLMs, #DataScience, #WebApps, #SEO 💕 My heart is open source 🌍 Nature Lover 👀 My views!

Husband, dad, enjoys working distributed, likes distributed systems, data stores, JVM/Java & Basketball/Streetball, now at https://t.co/YPftyN9nyB

Alexander Reelsen @spinscale

3K Followers 1K Following Husband, dad, enjoys working distributed, likes distributed systems, data stores, JVM/Java & Basketball/Streetball, now at https://t.co/YPftyN9nyB

Creator @datasetteproj, co-creator Django. PSF board. @nichemuseums. Hangs out with @natbat + @cleopaws. He/Him. Mastodon: https://t.co/t0MrmnJW0K

Simon Willison @simonw

71K Followers 5K Following Creator @datasetteproj, co-creator Django. PSF board. @nichemuseums. Hangs out with @natbat + @cleopaws. He/Him. Mastodon: https://t.co/t0MrmnJW0K

he/him · Lecturer (Assistant Professor) at @GlasgowCS @TerrierTeam · working at the intersection of IR&NLP · PhD from @Georgetown IRLab

Website: https://t.co/TvZBNq61Ey

Sean MacAvaney @macavaney

1K Followers 479 Following he/him · Lecturer (Assistant Professor) at @GlasgowCS @TerrierTeam · working at the intersection of IR&NLP · PhD from @Georgetown IRLab Website: https://t.co/TvZBNq61Ey

One day I'd like to open a shop in London specialising in pirate memory games 🧩 I speak 🥨💂‍♂️🤌🥖🍣🐻 .NET dev 💻 PG Data Science 🧮 linguist by heart 🌈🇪🇺

Dan(i(el(e))) −·�.. @LelViLamp

8 Followers 86 Following One day I'd like to open a shop in London specialising in pirate memory games 🧩 I speak 🥨💂‍♂️🤌🥖🍣🐻 .NET dev 💻 PG Data Science 🧮 linguist by heart 🌈🇪🇺

pierrix00 @pierrix00

67 Followers 191 Following

Vox - e/acc @TheVoxxx

586 Followers 175 Following

Policy Director General @RSMErasmus & Honorary Prof. @EBS_Global HWU.
Passionate for Universities, Business Schools, #excellence #impact #RRI #RRBM #PRME

Wilfred (Willem) Mijn.. @wmijnhardt

2K Followers 3K Following Policy Director General @RSMErasmus & Honorary Prof. @EBS_Global HWU. Passionate for Universities, Business Schools, #excellence #impact #RRI #RRBM #PRME

More generalist than specialist. Industrial engineer turned into ML/DL. Currently DE @RecordlyData by day, Applied DS research by night with @aapo_tanskanen

Rasmus Toivanen @RasmusToivanen

655 Followers 2K Following More generalist than specialist. Industrial engineer turned into ML/DL. Currently DE @RecordlyData by day, Applied DS research by night with @aapo_tanskanen

Patrick Clear @PatrickClear8

8 Followers 42 Following

Abdulrahman Tabaza @embed_dim

4 Followers 799 Following enjoyer of various vector spaces, encoders and modalities

Running strategy @ravenpack. Board member @ Poocho. Previously @thirdpointllc and @factset. Currently obessing over domain-spec LLMs & vectorizing everything

Aakarsh Ramchandani @2sidesofacoin

99 Followers 529 Following Running strategy @ravenpack. Board member @ Poocho. Previously @thirdpointllc and @factset. Currently obessing over domain-spec LLMs & vectorizing everything

SenaBeren @findingmerit

287 Followers 3K Following

Karolina @_sandtweets

356 Followers 294 Following aGVsbG8=

Amir Soleimani @ASoleimaniB

233 Followers 253 Following AI Engineer at Sdu. Generative AI for Legal Research

juwee @juweeism

335 Followers 370 Following show me your embeddings

Billy Vythikowski @vythikowski

29 Followers 317 Following

Pratheek Rebala @pratheekrebala

922 Followers 2K Following News developer @publicintegrity. Shop steward @publici_union.

Anthony Bordonaro @bordo_anthony

454 Followers 1K Following Engineering Manager, @carltonfc supporter, green tea enthusiast

Data Engineer @Sony. Ex-UBS, Vanderbilt, Harvard. https://t.co/kOPPM3IA54. Google Scholar: https://t.co/UWXNmktdq0.
Opinions my own.

Leonce Nshuti @LeonceNshuti

278 Followers 2K Following Data Engineer @Sony. Ex-UBS, Vanderbilt, Harvard. https://t.co/kOPPM3IA54. Google Scholar: https://t.co/UWXNmktdq0. Opinions my own.

sportscarfan45 @unsafetensors

50 Followers 651 Following

mrpsycox.eth @mrpsyc0x

61 Followers 148 Following ETHRome organizer 💛❤️ Discord: mrpsycox.eth

TENTANANO @tentanano

7 Followers 125 Following

Cloud & Data Analytics | Product Developer | Tech Enthusiast | Interests - System Design & Architecture | Learning- Minimalism | 5x OCI & 4x Azure Cloud

Karthic Natarajan @karthicn_

I help enterprises understand and use artificial intelligence. Leveraging my 25 years of enterprise software experience in emerging technology to drive results.

Mark R. Hinkle @mrhinkle

7K Followers 5K Following I help enterprises understand and use artificial intelligence. Leveraging my 25 years of enterprise software experience in emerging technology to drive results.

YChu.eth @ychudoteth

365 Followers 2K Following Invent the decentralized future with love! | Prev: Engineer at ByteDance

Romain Damery @RomainDamery

2K Followers 2K Following Leading technical SEO and other nerdy stuff @AmsiveAgency / ((bb) || !(bb)) / 🇺🇸 🇫🇷 🇪🇺 🇧🇷 🏳️‍🌈

Ian Maurer 🧬🤖�.. @imaurer

1K Followers 793 Following CTO @GenomOncology #genomics #precisiononcology #nlp

ikitekeryollarda @ikitekeryolda

18 Followers 302 Following İki Teker Hayat

Sayan Chakraborty @shockrobortyy

150 Followers 905 Following ML @Qualcomm (prev: @BrownUniversity, @paytminsider, @bigbinary, @clarisights)

Elorm Dokosi @ElormDokosi

123 Followers 561 Following Computer science and engineering student. Learning to be an indie hacker on the side.

dzh886 @dengzihao88

22 Followers 586 Following

Gordon Lindsay (Busin.. @GLbusiness_twit

1 Followers 4 Following

Jon Page @jonpage0

71 Followers 403 Following

CryptoLaika @DemonsZzh

126 Followers 1K Following

Consiglieri @ConsiglieriVita

84 Followers 2K Following

Unwiring the future as CTO of @R3Coms | #Wireless #IIoT | Ex academic @TUBerlin | Husband and father of two | Opinions my own | https://t.co/32UvARYu5g

Vlado Handziski @vlahan

fraserxu @fraserxu

624 Followers 398 Following Lead engineer @envato, past organiser of @jsconfchina and Shanghai JavaScript meetup.

yours truly @ulfbert_inc

41 Followers 529 Following Programmer.

Tomasz Kobylinski @TmoaszKo

10 Followers 42 Following

Abey @abeytheo

101 Followers 235 Following AI Engineer・独学で日本語学者

Jack&Penny @Jack870202

34 Followers 246 Following Computer Engineer, Texas Hold'em Lover and Crazy Sports fans

Jimmy Sticks @loss_gobbler

128 Followers 423 Following Chief Symbiosis Sorcerer at @stickshiftAI ////////////////// lair dweller // FAANG quitter // cyborgism enjoyer

Lukas Slezevicius @lukaslezevicius

37 Followers 461 Following Co-founder and CEO of Octocom

Aidan Jones @aidan_jones

138 Followers 1K Following Music, Movies & Microcode

tm @tm57312196

14 Followers 187 Following Interest: physical understanding of consciousness, wisdom etc.

takato @taca10

281 Followers 473 Following Webエンジニア

The AI context platform for everyone. Memories 🧠, Preferences 👍, Semantic Search 🔍 (and more) for AI Models, Agents, and Multi-Agent systems #fluffyvectors

FluffyVectors ☁️�.. @FluffyVectors

7 Followers 88 Following The AI context platform for everyone. Memories 🧠, Preferences 👍, Semantic Search 🔍 (and more) for AI Models, Agents, and Multi-Agent systems #fluffyvectors

WΞNDΞL @0xwendel

2K Followers 482 Following Attention Token Engineer, Medical LLM Inference OP and sometimes Solidity Dev, https://t.co/yRPtosJ9Hi

Software QA. Former Medical device QA.
IT未経験QA立ち上げ → そのままマネージャー3年目。Playwrightが好き。もっとコーディングがしたいお年頃

All tweets are on my own

うりんつ@QA @yurinzflet

12 Followers 83 Following Software QA. Former Medical device QA. IT未経験QA立ち上げ → そのままマネージャー3年目。Playwrightが好き。もっとコーディングがしたいお年頃 All tweets are on my own

Robert @clarity99

500 Followers 970 Following gestalt psychotherapist, mindfulness teacher and an all around geek. ;) Moved to Mastodon: https://t.co/Qzlp34vmWV

takono @takono0807

112 Followers 325 Following Web系ソフトウェアエンジニアです。楽しいチームでいい感じのプロダクトをつくりたい。技術以上に人間力を身につけたい30代。

Georgi D. Gospodinov @ggospodi

258 Followers 872 Following Technologist and entrepreneur PhD in math https://t.co/Wb6iH1UClJ

Black Birkin @iidecat

0 Followers 427 Following

Jo Kristian Bergum @jobergum

9K Followers 814 Following Distinguished Engineer @vespaengine. Tweets about Vespa, search, recommendation, ranking, and IR. CET. #StandWithUkraine 💙💛

LangChain @LangChainAI

137K Followers 24 Following ⚡ Build context-aware, reasoning applications ⚡

“All methods are sacred if they are internally necessary” (GP @amplifypartners, prev @canvasvc; Head of Data @Mattermark; @palantirtech; @c4ads)

Sarah Catanzaro @sarahcat21

12K Followers 1K Following “All methods are sacred if they are internally necessary” (GP @amplifypartners, prev @canvasvc; Head of Data @Mattermark; @palantirtech; @c4ads)

Thiago Guerrera @Thiagogm

5K Followers 139 Following Working on https://t.co/kTksMcNTfG. Statistics is my craft.

Jon Bratseth @jonbratseth

358 Followers 47 Following CEO https://t.co/5qXgcEp1MU Build things and help people.

Jo Kristian Bergum @jobergum

a day ago

Three things you should know about scaling embedding retrieval systems: - Embedding dimensionality cost: linear with dims - Embedding inference cost: quadratic with tokens - ANN (HNSW) cost: sub-linear with documents

1 5 39 2K 18

Jo Kristian Bergum @jobergum

21 hours ago

A 20-page book (+ charts and tables) guidebook to state-of-the-art embeddings and information retrieval. linkedin.com/pulse/guideboo…

2 14 88 8K 105

Jo Kristian Bergum @jobergum

a day ago

Looking at the MTEB leaderboard this AM. Amazingly, mxbai-embed-large-v1 ranks at 12 despite its small size relative to the other B-parameters models. In addition to strong performance for a relatively small size, it comes with MRL and BQL flexibility. blog.vespa.ai/combining-matr…

0 0 24 2K 11

Download Image

Jo Kristian Bergum @jobergum

2 weeks ago

Hamming distance got a nice speedup in Vespa this week, 38% faster. It is approaching 1 billion 64-dimensional int8 hamming distances per second on a single CPU socket, 20x faster than the normalized dot product (1024-dim). Exact nearest neighbor search.

2 2 53 4K 17

Download Image

Aniket Rege @wregss

5 days ago

I'll be discussing MRL, including recent developments from @OpenAI embedding models and excellent work built on MRL from folks at @vespaengine , @nomic_ai, Sentence Transformers (@tomaarsen) @supabase , to name a few. 2/n

1 0 5 143 0

Jo Kristian Bergum @jobergum

6 days ago

Random observation, but we at @vespaengine have been used to running ML workloads on Vespa with deep-learned embeddings in production at scale for more than a decade.

1 1 20 2K 8

Download Image

Adam Hevenor @aHev

a week ago

This is what customer obsession looks like. Props to @vespaengine team for promoting what is effectively 40x cheaper for the user.

Jo Kristian Bergum @jobergum

a week ago

The emphasis of both MRL and BQL is on sacrificing accuracy by a few % in exchange for a much lower cost. By using a compact representation of the text embeddings, the systems can run on less expensive hardware or require less memory, resulting in cost savings.

1 0 11 3K 3

Download Image

2 4 20 3K 4

cody collier @cmcollier

a week ago

I keep having to re-check my calculations. Can I really fit decent quality embeddings for my 40 million docs into a few GB of ram?! Vespa integrating MRL + binarization + bit packing for a huge win 🔥🔥 Looking forward to working through this post and trying it out on my data.…

Jo Kristian Bergum @jobergum

a week ago

1 17 75 10K 41

1 1 20 2K 10

Thomas Thoresen @thomas_thoresen

a week ago

After 3 weeks at @vespaengine, I still get more and more impressed every day by both the engine and the team 🤩 Check out this blog post to explore some features that sets it apart!

Jo Kristian Bergum @jobergum

a week ago

1 17 75 10K 41

0 1 12 558 0

Jo Kristian Bergum @jobergum

a week ago

We are very excited about this direction, as it unlocks many new use cases that are no longer prohibitively expensive to serve in production—making more unstructured data useful. The new Vespa embedder functionality for MRL and BQ is available in Vespa 8.332.5 and above. Enjoy

0 0 8 517 0

Jo Kristian Bergum @jobergum

a week ago

The unique aspect of MRL and BQL is that they introduce minimal computational overhead during embedding model training. Both techniques are post-processing steps performed after model inference.

2 0 6 517 0

Jo Kristian Bergum @jobergum

a week ago

Approaching 10,000 queries per second with 100,000 vectors equals close to 1B hamming distance computations per second! This is on a single CPU.

1 0 4 122 0

Jo Kristian Bergum @jobergum

a week ago

Calculating the hamming distance is approximately 20 times faster (2ms), enabling users to experience faster search and higher query throughput with the same resources. In practical terms, organizations can reduce CPU-related costs by 20x

1 0 4 109 0

Jo Kristian Bergum @jobergum

a week ago

With BQL (binary quantization) to binary vectors, we both get 32x less memory resource footprint but also a significant speedup over the float representations.

1 0 4 104 0

Download Image

Jo Kristian Bergum @jobergum

a week ago

On serving performance, MRL gives us a linear reduction in cost. The graph above demonstrates that reducing the float dimensions via MRL from 1024 to 512 results in a two-fold speedup.

1 0 3 112 0

Download Image

Jo Kristian Bergum @jobergum

a week ago

The representations can be used in a phased retrieval and ranking pipeline in Vespa

1 0 4 120 0

Download Image

Jo Kristian Bergum @jobergum

a week ago

With the announced Vespa hugging-face-embedder support, developers can easily obtain multiple vector representations with a single inference call

1 0 4 137 0

Download Image

Jo Kristian Bergum @jobergum

a week ago

.@mixedbreadai's combination of MRL and BQL retains 90% accuracy (on MTEB Retrieval task) using 64-dimensional int8 (512 bits) binarized from 512 float dimensions (first 512 out of 1024). This binary representation reduces storage-related costs by 64 compared to the baseline

1 0 6 181 0

Download Image

Jo Kristian Bergum @jobergum

a week ago

Since both techniques are simple post-processing steps over the embedding vector representation, we can produce multiple representations with a single model inference call. One inference pass is vital because model inference is a significant cost driver for embedding retrieval