Isabelle Mohr @isabelle_mohr
MLE @JinaAI_ 🤖 Interested in all things Machine Learning! Joined April 2022-
Tweets63
-
Followers236
-
Following228
-
Likes124
2025 could be the year of Deep(Re)Search. Test-time compute and reasoning model are transforming search systems now. With <think>, users have been educated to accept delayed gratification—longer waiting times in exchange for higher-quality, actionable results, much like the…
Great work from MMTEB team! We have 3 contributors from @JinaAI_ ! @michael_g_u @jupyterjazz @isabelle_mohr
Our submission to ECIR 2025 on jina-embeddings-v3 has been accepted! 🎉 At the ECIR Industry Day @jupyterjazz takes the stage to share how we train the latest version of our text embedding model. More details: ecir2025.eu
Stop paying the OpenAI tax. The best AI devtools are actually open-source, free to use, and give you full control over your data and privacy. While proprietary AI dominated early headlines, the true revolution is happening in open source - where a flourishing ecosystem of…
I'm excited to attend @weaviate_io's AI Hack Night tonight!🚀#hacknightberlin
Join our next ML reading group featuring VisRAG on Nov 29th:
We extended our priprint about late chunking, a novel method to make embeddings of chunks context-aware. We added: - Algorithm for long documents - Training method to make late chunking more effective - Comparison to Anthropic's contextual embedding arxiv.org/abs/2409.04701
Want to learn more about Embeddings, Rerankers and ColBERT? Come to my talk on Thursday at @qdrant_engine's Vector Space Event 😎 more info here -> lu.ma/88rdjnhg
With @isabelle_mohr at #bbuzz Thank you for being there for my session. 💙
Got my Weaviate👕 here ;) thanks @femke_plantinga and @philipvollet ! This afternoon my colleague @isabelle_mohr and @saahil will hold two presentations at Berlin Buzzwords, I’ll brief Jina CLIP and upcoming models @JinaAI_ . Come and join our presentation!
Got my Weaviate👕 here ;) thanks @femke_plantinga and @philipvollet ! This afternoon my colleague @isabelle_mohr and @saahil will hold two presentations at Berlin Buzzwords, I’ll brief Jina CLIP and upcoming models @JinaAI_ . Come and join our presentation!
Clip your schedules for next week because Andreas and I will present our latest text&image embedding model with advanced text capabilities 😉 Paper: arxiv.org/abs/2405.20204 🤗: huggingface.co/jinaai/jina-cl… API: jina.ai/embeddings/
Clip your schedules for next week because Andreas and I will present our latest text&image embedding model with advanced text capabilities 😉 Paper: arxiv.org/abs/2405.20204 🤗: huggingface.co/jinaai/jina-cl… API: jina.ai/embeddings/
This week I explored chunking methods: the Semantic Chunker from @llama_index on jinaai/wikisections dataset on Hugging Face. Varying the buffer size had pretty much no effect, while increasing the breakpoint percentile threshold increased chunking precision by a lot! @JinaAI_
Last night I had the pleasure of giving a talk together with @jupyterjazz at the Data Meetup Berlin hosted by Netlight! Love the knowledge sharing, and most importantly, to connect with so many passionate and interested people in the field. See you at the next one! #embeddings
I'll be giving a talk together with @jupyterjazz next week in Berlin about our German-English bilingual embedding model. If you wanna know how we trained this model and how to use it in a RAG pipeline, you better RSVP and attend! See ya there 🚀 meetup.com/data-meetup-be… @JinaAI_
A ColBERT variant, but support a bit longer context :) huggingface.co/jinaai/jina-co… cc @jobergum @lateinteraction @bclavie
Saw @JinaAI_'s excellent long context (8192!) ColBERT earlier today? Eager to give long-document ColBERT a shot? New joint🫅colbert-ai and🪤RAGatouille release now supports any maximum length the underlying model can handle (& dynamically adjusts maxlen when encoding in-memory)
A few days ago, @JinaAI_ released two new bilingual embedding models (German-English & Chinese-English), each supporting a max sequence length of 8K tokens! 🤯 ... and now you can use them with 🤗 Transformers.js, for cross-language retrieval, clustering, and so much more! 👇
A few days ago, @JinaAI_ released two new bilingual embedding models (German-English & Chinese-English), each supporting a max sequence length of 8K tokens! 🤯 ... and now you can use them with 🤗 Transformers.js, for cross-language retrieval, clustering, and so much more! 👇 https://t.co/75vlCg30Vh
We’re finally here with 2 new models, we call it bilingual embedding models, it allows you to perform monolingual and cross-lingual retrieval tasks, the future models are always X+EN, X is the main language and EN as the bridging language. Here are the first two: German-English…
Our German-English and Chinese English embedding models are open-source now 🚀 huggingface.co/jinaai/jina-em… huggingface.co/jinaai/jina-em…
Our German-English and Chinese English embedding models are open-source now 🚀 huggingface.co/jinaai/jina-em… huggingface.co/jinaai/jina-em… https://t.co/nhT5m2kJ6W

Uriel Toy @toy_uriel22415
81 Followers 1K Following
Bhik Singh @BhikSingh50597
6 Followers 267 Following
supercoderhawk @supercoderhawk
71 Followers 2K Following NLP engineer at patsnap. NLP, deep learning researcher.
Yassine El Kheir @YassineElkheir
49 Followers 583 Following PhD Student at DFKI & Technical University of Berlin
NovaTech AI @NovatechAi
141 Followers 556 Following Novatech AI 💡 Innovating with Artificial Intelligence for a smarter future. 🚀 AI-driven solutions for business, technology, and beyond.
Lloyd @llerussell
160 Followers 483 Following Applied Scientist @wayve_ai, former Neuroscientist @NeuralCompLab
DeniseHousman @O92p5GRshlmp4mU
83 Followers 1K Following
ONLYTRADES @0nlyTrad3s
506 Followers 8K Following
shubham शुभम�... @shubh_pawar
152 Followers 969 Following AI Researcher. Machine Learning. NLP. LLMs. Science. Engineering. Food. Books. Travel. Art. He/him. #StopAsianHate. #BlackLivesMatter. All views personal.
Thijs Bergkamp @ThijsBergkamp
82 Followers 7K Following
Joseph Pollack #Ï �... @josephpollack
2K Followers 5K Following 🤖AI❤️Data enjoyer , building robots to helps folks learn things quicker.
Gpbhupinder @gpbhupinder
469 Followers 7K Following 👨💻 Full-Stack Developer & AI Integration Expert 🚀 From concept to launch, we bring your tech vision to life
L X @LX0744820469038
1 Followers 127 Following
Artur Tanona @ArturTanona
943 Followers 2K Following ML engineer. Husband and father. I am exploring how much "AI" can help solve legal issues.
LY @YantoLiem11
205 Followers 4K Following
W. Maximillian de Joh... @WJohnsonbourg
165 Followers 3K Following
Varad D @varad_d33297
2 Followers 328 Following
Origami Duck @OrigamiDuck
37 Followers 2K Following
Ja'Crispy @VaishnavVarma3
96 Followers 2K Following Founding MLE @ ScyAI | professional token counter | NLP, RL, Agentic frameworks | GPU poor 😔 | GitHub https://t.co/EKTIFmO279
Karthik Raja @Kitrak_rev
61 Followers 1K Following Tech enthusiast | Passionate about AI, ML, and NLP | Quest to build Jarvis kind of machine🧠🤖| Luv u 3K |
Louis Dupont @TheLouisDupont
5 Followers 85 Following Deep Learning Engineer & LLM Consultant | Working on local AI solutions
Prompt @engineerrprompt
2K Followers 1K Following Creator of localGPT | Building something cool! Generative AI, Tech, Arts, Life!
David Cannan @cdasmktcda
257 Followers 936 Following DevOps & Software Engineer—Autodidact Problem Solver—Solo Dad of Triplets—Reformed Member of Society—Lurking Class Citizen 🫠
YounesIO @YounesAka
87 Followers 433 Following SWE. Rust. Python. TLA+. Prolog. Julia. Typescript. AI/ML.
Zed @NotXzed
2K Followers 153 Following
Philip Austin @austin_phi62961
279 Followers 2K Following
Greg Jennings @jenningsgreg
1K Followers 5K Following VP of Engineering @anacondainc, enabling the next generation of data science and AI-powered applications. Opinions are my own
Sriniketh J @srini047
397 Followers 2K Following SWE @arrcusinc | Ex-@Zoho | @gdscpsgitar Lead 22' | OS
Joon Kim @jnkm1024
1 Followers 244 Following
Raphael Mansuy 🍵 @raphaelmansuy
2K Followers 5K Following Data Engineering | DataScience | AI & Innovation 🚀 🚀 CTO of ELITIZON https://t.co/qkVpOCgY0m 🤖 Co-Founder of QuantaLogic https://t.co/f9qamWSbjE
JHU CLSP @jhuclsp
7K Followers 6K Following Center for Language and Speech Processing at @JohnsHopkins #NLProc #MachineLearning #AI https://t.co/6IXR5OSQtw @[email protected]
Edwin Puertas @OafToBark23
782 Followers 2K Following AI Software Architect | NLP Researcher | Member of IEEE Artificial Intelligence Standards Committee | PhD
ELOQUENCE AI @eloquenceai
295 Followers 3K Following #Multilingual and Cross-cultural interactions for context-aware, and #bias-controlled dialogue systems for safety-critical applications
Thomas Thoresen @thomas_thoresen
236 Followers 524 Following Father, athlete, coder. Working on @vespaengine
al.wikah @wikah_al
357 Followers 4K Following TICE || EdTech Researcher || Data Scientist || Rubyist || Pythonist || JS-Disciple || App Dev || Startup || Innovation
Tony Wu @tonywu_71
1K Followers 372 Following Multimodal, RAG, Agents | ColPali co-first author | @centralesupelec 🇫🇷 x @Cambridge_Uni 🇬🇧 | Core Researcher at @hcompany_ai 🧑🏻💻
enrica @hyp_enri
47 Followers 69 Following
Wyatt Walls @lefthanddraft
10K Followers 510 Following Tech law and legal tech. Exploring, red-teaming and breaking LLMs.
Krishna Mohan @KMohan2006
3K Followers 337 Following Denoising present to hopefully get brighter future | loves diffusion models
merve @mervenoyann
80K Followers 5K Following open-sourceress at @huggingface 🧙🏻♀️proud Aegean, I work on computer vision, VLMs & agents | gençleri serbest bırakın
Johannes Hagemann @johannes_hage
8K Followers 2K Following co-founder/cto @PrimeIntellect | decentralized AI, longevity, techno-optimism
sigh swoo... @sighswoon
39K Followers 1K Following developing a language with the invisible … author of Notes on Shapeshifting , next book coming spring ,, ig: sighswoon
Ruben Hassid @RubenHssd
38K Followers 509 Following Founder of https://t.co/n6tTy5Q7uX - bootstrapped
Yingjun Wu // Vibe Mo... @YingjunWu
4K Followers 1K Following Founder @RisingWaveLabs. stream processing, lakehouses, random AI stuffs. Previously @awscloud Redshift, @IBMResearch Almaden. PhD @NUSingapore @CMUDB.
Danica Fine @TheDanicaFine
2K Followers 325 Following Opinions my own. Developer Advocate. 🥑 ❄️ https://t.co/bYTfXzjNpI
Edward Bennett @bbuzz
89 Followers 374 Following
Angie Jones @techgirl1908
113K Followers 610 Following VP Eng, AI Tools & Enablement | International Keynote Speaker | Java Champion | GitHub Star | Inventor {27 patents} | Working on AI agents and MCP @blocks
Lizzie Siegle @lizziepika
5K Followers 2K Following devrel🥑@cloudflare👩🏻💻, mixed, twin, yimby🏘️. 💕#gsw, #gsv,🎾, 🚌 ,🚴🏼,🏃♀️,📚. 💌: https://t.co/5UCoHs1uG5. {she,her} 🦋: @lizziesiegle.com
PyData London @pydatalondon
6K Followers 254 Following PyData meetups in London for data-loving pythonistas. Powered by @emlynclay, @ianozsvald, @john_sandall
Budapest ML Forum @budapestmlforum
41 Followers 85 Following International data science, ML and AI conference, held between 10-12 June, 2024.
Merantix Group @Merantix
1K Followers 356 Following The Merantix Group invests, builds, and connects to drive impactful AI made in Europe.
Zain @ZainHasan6
5K Followers 2K Following AI builder & teacher | AI/ML https://t.co/PDkARZxKEc | Eng ℕΨ @UofT | ex-(Vector DBs, Health tech, Lecturer) | decoding AI’s future - follow for insights! 🇨🇦🇵🇰
Aiven @aiven_io
4K Followers 942 Following We manage your open source data infrastructure in the cloud – so you can get back to developing great apps. We’re hiring – come work with us
Femke Plantinga @femke_plantinga
9K Followers 600 Following learn with me about AI. growth @weaviate_io
Aizhamal Nurmamat kyz... @iamaijamal
418 Followers 286 Following All about open source 🤍 Member @TheASF | PMC @ApacheAirflow | Committer @ApacheBeam | ex @Google & @Sysdig | Opinions my own. She/Her. From Kyrgyzstan 🇰🇬
Riona MacNamara @rionam
1K Followers 2K Following unreliable narrator. She/her Views are mine alone.
Alessandro Benedetti @AlexBenedetti
412 Followers 39 Following Apache Lucene/Solr PMC member and committer. Director and R&D Software Engineer at Sease Ltd. Information Retrieval lover, snowboarder and Beach Volley player.
Anshum Gupta @anshumgupta
1K Followers 596 Following Committer on Apache Lucene/Solr. Building search @ , Search engines and more... Barça and Messi supporter. Tweets/opinions are mine!
Daniele Antuzi @dantuzi
29 Followers 45 Following Studente di Informatica all'università di Pisa e grande sostenitore del più grande giocatore della storia bianconera: Alessandro Del Piero
Hans-Peter Grahsl �... @hpgrahsl
2K Followers 473 Following Developer 🥑 Advocate at @Decodableco, Ex-Red Hat, formerly Engineer/Trainer/Consultant - also proud husband, 🦁 hearted dad of 2 and ☕️ aficionado.
Bilge @bilgeycl
850 Followers 652 Following #boykot ✨👩🏻💻💃🏻✨ | DevRel Engineer 🥑 @deepset_ai for @Haystack_AI | 🦋 https://t.co/v96H0Vv6Fi | TR/EN | she/her
@berlinbuzzwords@flos... @berlinbuzzwords
3K Followers 780 Following A conference on storing, processing, streaming, and searching large amounts of digital data | #bbuzz | June 09-11, 2024 | @[email protected]
cloud @cloud11665
11K Followers 2K Following SIMD enjoyer, tensor rotator, LLM inference optimizoor | Technical Staff @ https://t.co/gQXVxhjcOm
Zhuohan Li @zhuohan123
9K Followers 865 Following mts @ openai | cs phd @ 🌁 uc berkeley | building @vllm_project | machine learning system | the real agi is the friends we made along the way
Jo Kristian Bergum @jobergum
19K Followers 1K Following CEO https://t.co/sYmUNgPBq8 - The retrieval engine for agents.
Mahmoud Mabrouk @mmabrouk_
574 Followers 577 Following Building @agenta_ai - open-source LLMOps ⚙️ integrated prompt management, evaluation, and observability (3k⭐) 📤 Follow for AI Engineering and LLMOps content
Leonie @helloiamleonie
15K Followers 654 Following I do Machine Learning at @weaviate_io and write about it on the Internet | Google Developer Expert (Kaggle)
Noé Achache @noe_achache
133 Followers 167 Following Lead Data Scientist @Sicara_fr / Computer Vision, GenAI and Vector Databases / Speaker and blog writer
Vladimir Blagojevic @vladblagoje
697 Followers 419 Following Natural Language Processing; AI SCPD @Stanford, MSc @YorkUniversity, Software Engineer @deepset_ai, ex @RedHat, @BlackBerry
Bob van Luijt @bobvanluijt
4K Followers 3K Following Co-Founder and CEO of @weaviate_io. I 😍 all things related to tech, machine learning, digital business, open-source, fashion, and music
Florian Juengermann @florian_jue
3K Followers 441 Following co-founder @listenlabs. prev cse @harvard, autopilot @tesla
Morgan McGuire @morgymcg
3K Followers 4K Following Applied AI @weights_biases | ex-Facebook Safety | https://t.co/a7i7G5dkLG | 🇮🇪 | Came for the bants, stayed for the rants
Sease @SeaseLtd
635 Followers 344 Following We build Search solutions and #AI integrations with cutting-edge Machine Learning such as #LargeLanguageModels (#RAG, #VectorBased search) and #LearningToRank.
Sofie Van Landeghem @OxyKodit
2K Followers 555 Following NLP engineer, open-source developer, owner of OxyKodit, implementing tailored NLP solutions. ⚠️ INACTIVE: https://t.co/LU2joJCyAq, https://t.co/gdBBtGjV2I
search founder @n0riskn0r3ward
2K Followers 2K Following Solo entrepreneur passionate about AI and search tech. Building a niche search product and sharing what I learn along the way.
Tarun Tater @taruntater3
94 Followers 697 Following Research Scientist @AmazonAds Multimodal NLP PhD student @ Uni. Stuttgart. #NLProc Prev: @IBMResearch | IIIT-Bangalore
Omar Khattab @lateinteraction
24K Followers 3K Following Asst professor @MIT EECS & CSAIL (@nlp_mit). Author of https://t.co/VgyLxl0oa1 and https://t.co/ZZaSzaRaZ7 (@DSPyOSS). Prev: CS PhD @StanfordNLP. Research @Databricks.
Dmitry Kan @DmitryKan
1K Followers 693 Following Host of Vector Podcast: https://t.co/XuA7zOLP97 Senior Product Manager (Search, ML) @tomtom.
Chris Jenkins @chrimbo_jenks
51 Followers 106 Following PhD student IMS/Uni-Stuttgart / computers, linguistics / diachronic semantic change / "everything is pragmatics"
Nan Wang @nanwang_t
352 Followers 515 Following Co-founder & CTO @JinaAI | Ex-Zalando & Tencent | Build AI & IR Systems | Open-source enthusiast | Speaker & contributor (40+ talks) | PhD