Kevin Liu @kevinjqliu
Seattle, WA Joined December 2016
Tweets 225
Followers 221
Following 375
Likes 2K
We want `brew install tpchgen-cli` to work, but that requires the project to be "popular" enough according to Homebrew (30 forks, 30 watchers and 75 stars). We just need 6 more forks and 24 watchers. Can you help us out? github.com/clflushopt/tpc… Deets: github.com/clflushopt/tpc…
🚀 Manage Lance Tables Seamlessly Across Any Catalog with Lance Namespace & Apache Spark! Lance Namespace (t.ly/ACv8L) is an open specification that standardizes how you access and manage collections of Lance tables across metadata services like Apache Hive…
tldr: the cutting edge in open-source file formats is now a Linux Foundation project 🚀🚀🚀
Got upgraded on my flight to NYC this morning and everyone in 1st class had their laptops out, using Iceberg as their table format. I looked back into the rest of the plane - everyone was still using Hive tables. Was just a very interesting observation.
first slowly and then all at once
I know a dev from #OneLake, and he showed me something I can't even hint at, otherwise, @JoshCaplan1984 will send some killer ninjas. It's just beautiful
Analytics Accelerator Library for Amazon S3 and Iceberg docs.google.com/document/d/13s… A very interesting set of optimizations for reading Iceberg/parquet from S3. cc @andrewlamb1111
Just wrote a new blog post: how I spent a year making the world's fastest Parquet loader in JavaScript, and all the optimizations that went into it. TLDR: Hyparquet can load a parquet file from S3 in 155ms, while duckdb-wasm took 3466ms for the same file!
Amazing product from an even more amazing team. Congrats on the new round!! Super excited to see the future of multimodal
The Apache Iceberg™ v3 table spec is officially ratified! A huge thank you to the entire open source community for this collaborative effort. This unlocks powerful new capabilities for developers, including native row lineage for reliable CDC workflows, client-side table…
#DataAISummit features a ton of sessions focused on open source and is the place to explore the latest advancements in Apache Spark, Delta Lake, Apache Iceberg, MLflow, Unity Catalog and more. With a career rooted in open source, @dennylee shares some of his top session picks…
#apacheiceberg that looks like a peace treaty 🤓 youtube.com/watch?v=JgEjf9…
@kevinjqliu you are alright 😎 #apacheiceberg youtube.com/watch?v=3N2KEU…
20x faster TPCH data generator available via pip install: pip install tpchgen-cli Blog from @kevinjqliu: kevinjqliu.github.io/blog/posts/tpc…
This is a better version of the talk I gave for the workshop at Iceberg Summit Thanks @tlberglund youtu.be/TsmhRZElPvM?si…
> Since Google Cloud first brought Iceberg into its environment six months ago, adoption has tripled, Ahmad said. In fact, she added, Google Cloud’s support for Iceberg is market-leading in terms of performance and capabilities. bigdatawire.com/2025/04/11/goo…
🚀 this was not on my bingo card
We're live from #IcebergSummit 2025! Our very own EVP of Product, Christian Kleinerman, kicked things off on the 'Shaping the Future of Apache Iceberg' panel in front of a sold-out crowd. Join us for a day full of live workshops, technical deep-dives, face-to-face networking,…
Data lakes are taking over

EthelSharp @ZG2Om2YYu9Fo20h
0 Followers 267 Following
Evie @a7kM3fw1j1Y5a
11 Followers 861 Following
nandhu kishore @nandhukishore91
54 Followers 1K Following
RAJESH P S @RajPulkunta
71 Followers 2K Following Data Viz Enthusiast | Tableau - Desktop Certified Associate & Desktop Specialist | Power BI Data Analyst Associate
Shane Tohill @ShaneToh
228 Followers 3K Following
Alex Dupler @alexdupler
2K Followers 1K Following Sr PM @ Microsoft Advertising (Bing Ads) focused on BI & Data Infra. Mostly reading & posting about Mariners, Power BI, Tech, and Politics.
BenMakesDataEasy @data_ben
4K Followers 7K Following Follower of Jesus | #SQLServer | Python | Machine Learning | Building an app for #PowerBI | https://t.co/SOfRhybr2Z | Helping make data easier to work with
Miqert @Miqert919
15 Followers 1K Following
Fwuihau @Fwuihau5935
15 Followers 1K Following
Godfrey Veum-Wunsch @VeumWunsch87739
60 Followers 4K Following
Amr Awadallah 🤖 @awadallah
36K Followers 16K Following Founder/CEO of @Vectara (Trusted GenAI for the Enterprise). Founder/ex-CTO @Cloudera, ex-VP at Yahoo & Google. PhD EE Stanford. IG, FB, LI: @awadallah.
Emma @Machoman1992
344 Followers 5K Following Good food, yoga, horseback riding, long walks, and meaningful work make up my tranquility. Life is not perfect, but I have learned to love it
Maria @klotz81maria
302 Followers 3K Following
Eguouawqqu @Eguouawqqu3492
23 Followers 2K Following
Twircar @Twircar9902
10 Followers 533 Following
Phillip LeBlanc @leblancphill
473 Followers 222 Following Co-founder @spice_ai - Building composable, ready-to-use data and AI infra in Spice․ai OSS
Ameen Patel @Ameen_ml
1K Followers 2K Following LLM Inference & Serving @togethercompute, prev @AmazonScience, @uwaterloo
Heber Bednar @BednarHebe91422
22 Followers 2K Following
Derek Moore @derekm00r3
3K Followers 6K Following Scientist, technologist, programmer, entrepreneur, engineer https://t.co/CDu5AxFN1m
Douglas Marks @DouglasMar31112
58 Followers 3K Following
Josh Caplan @JoshCaplan1984
3K Followers 252 Following Product lead for Microsoft OneLake. Formerly worked on SQL Server Analysis Services and Power BI. Opinions are my own.
Tpt @Tpt93
399 Followers 422 Following #RDF, #SPARQL, @Wikisource and @Wikidata enthusiast. @Oxigraph main developer. Mastodon : @[email protected]
Krishna Vishal @EigenVectorizer
606 Followers 743 Following Building @ApacheIggy | Low latency message streaming | OSS | 🎓 @iitmadras '17
Orin Kuhic @OKuhic55651
67 Followers 3K Following
Tim Berglund @tlberglund
12K Followers 1K Following VP DevRel at @Confluent. Father of three, grandfather of four. Believer in Christ. Opinions should be your own.
FrameIsEverything @_frameframe_
104 Followers 2K Following
Satheytur @SatheyturwHqBf
9 Followers 304 Following
Darius @scdarius
131 Followers 389 Following
Lacey0621 @Sung76926020
22 Followers 935 Following I am an entrepreneur, mainly in the clothing foreign trade, I like to travel, fitness, camping, hiking, food, I like to read the news on X and see some truth, a
Nairrtas @NairrtasrDZeL
35 Followers 4K Following
Craig Kerstiens @craigkerstiens
9K Followers 884 Following Product and eng @crunchydata. I blog at https://t.co/K49pnYYXpL Curate https://t.co/0DWATfO0yf. Previously @Microsoft, @citusdata, @Heroku, Truviso
Thyrue @ThyruemIAJXd
137 Followers 3K Following
Charly Wargnier @DataChaz
136K Followers 45K Following Ex @Streamlit @Snowflake Maestro 🪄 • X about AI agents, LLMs, web apps, Python & SEO • My ❤️ is open source • DM for collabs 📩
Yingjun Wu // Vibe Mo... @YingjunWu
4K Followers 1K Following Founder @RisingWaveLabs. stream processing, lakehouses, random AI stuffs. Previously @awscloud Redshift, @IBMResearch Almaden. PhD @NUSingapore @CMUDB.
DaisyPepys @UfCF7x5u2sLmL
59 Followers 7K Following
polars data @DataPolars
7K Followers 7 Following Dataframes powered by a multithreaded, vectorized query engine, written in Rust.
Ritchie Vink @RitchieVink
3K Followers 170 Following Author of Polars | CEO & Founder Polars Inc | Building scalable Polars
Ritvik Kapila @RitvikKapila
211 Followers 202 Following ML Research @Essential_AI, MS CS @UCSanDiego, B. Tech. @iitdelhi
Spiral @SpiralDB
319 Followers 11 Following Multimodal warehousing that works with the tools you love. “Storage Packed In Recursive Arrays & Layers”
Shengjia Zhao @shengjia_zhao
52K Followers 231 Following Chief Scientist @ Meta MSL. Formerly MTS @ OpenAI, PhD @ Stanford. I train models. All opinions my own.
Kenny Daniel @platypii
1K Followers 2K Following Machine Learning 🤖 Parachutes 🪂 and Bunnies 🐰 Formerly Algorithmia. Currently using JavaScript to make better AI.
DSPy @DSPyOSS
10K Followers 48 Following An open-source declarative framework for building modular AI software. Programming—not prompting—LLMs via higher-level abstractions & optimizers.
Nick Frichette @Frichette_n
6K Followers 2K Following Staff Security Researcher @datadoghq | DEF CON/Black Hat main stage speaker | he/him | OSCP OSWE | Tweets are my own | Created https://t.co/QGWMJjv9pc
Jonathan Frankle @jefrankle
20K Followers 725 Following Chief AI Scientist @databricks via MosaicML.
AJ Stuyvenberg @astuyve
8K Followers 2K Following AWS Hero 💫 Staff eng @Datadoghq 👨🏻‍💻 sometimes streamer 🎥 AKA Aaron Stuyvenberg Ask me about your p99
ParadeDB @paradedb
1K Followers 3 Following ParadeDB is a modern Elasticsearch alternative built on Postgres. Built for real-time, update-heavy workloads. ⭐ Star us: https://t.co/UL5Eovbw2O
Pat Patterson ... @metadaddy
7K Followers 2K Following Dad, husband, ultrarunner, Chief Technical Evangelist at @Backblaze. Previously @Citrix, @StreamSets, @SalesforceDevs, @Huawei, @SunMicrosystems.
M12 - Microsoft's Ven... @M12vc
12K Followers 314 Following Our mission is to accelerate the future of technology through investments, insights, and meaningful partnership with Microsoft.
Sandeep Pawar @PawarBI
5K Followers 1K Following Principal PM @ Microsoft Fabric CAT | 🇮🇳🇺🇲 https://t.co/ZDAcUyygbo | Tweets & opinions are my own
LanceDB @lancedb
3K Followers 52 Following Developer-friendly, open source AI-Native Multimodal Lakehouse https://t.co/wXn4tw66HV
Maxime Rivest 🧙... @MaximeRivest
4K Followers 779 Following Easy LLM context for all! ✨pip install attachments Inspired by: ggplot2, DSPy, claudette, dplyr, OpenWebUI! Follow for: API design, AI, and Data 🐍CC📜🛠 maker
Ning Sun @Sunng
2K Followers 849 Following Programmer, @Greptime. #Clojure and #Rustlang are my favorites. (Arch)Linux user. Map and book lover. Board(War)game addict. Birding. Husband.
Phillip LeBlanc @leblancphill
473 Followers 222 Following Co-founder @spice_ai - Building composable, ready-to-use data and AI infra in Spice․ai OSS
#DataAISummit @Data_AI_Summit
20K Followers 759 Following #DataAISummit (formerly #SparkAISummit) is the global event for the data community. The conference is organized by @Databricks.
Ameen Patel @Ameen_ml
1K Followers 2K Following LLM Inference & Serving @togethercompute, prev @AmazonScience, @uwaterloo
Nikhil Benesch @nikhilbenesch
707 Followers 174 Following Systems engineer @turbopuffer. Former CTO @MaterializeInc. Accidental data enthusiast. Find me on Bluesky: https://t.co/72LSo4iKXj
changhiskhan @changhiskhan
2K Followers 1K Following CEO/Cofounder @lancedb, The AI-Native Multimodal Lakehouse. Early pandas co-author. Turning caffeine into code since the last century
Ion Stoica @istoica05
5K Followers 20 Following Professor at UC Berkeley, co-founder of Databricks, Anyscale, LMArena, Conviva.
javi santana @javisantana
15K Followers 758 Following Co-founder of @Tinybirdco - ClickHouse for developers
Gwen (Chen) Shapira @gwenshap
28K Followers 10K Following Co-founder of @niledatabase. Making SaaS global, elastic and chill. Find me at: https://t.co/uyuHg400cp
Quentin Lhoest 🤗 @lhoestq
4K Followers 297 Following Datasets @huggingface | Open Source + HF Dataset Hub
Matt Silverlock 🐀 @elithrar
7K Followers 1K Following “who do we say the rules are for?” “other people.” • VP of product: storage & databases @cloudflare • https://t.co/OLM4gzyGsa
Arun Ulag @arunulag
7K Followers 613 Following Corporate Vice President of Azure Data @Microsoft, runs Microsoft Fabric, Azure SQL, Cosmos, Postgres, MySQL, Data Factory, Service Bus, Synapse, and Power BI
Alex Konrad @alexrkonrad
86K Followers 4K Following Founder and editor of @upstartsmediaco, a new tech publication focused on the startup ecosystem. Previously @Forbes senior editor. Email: [email protected]
Yingjun Wu // Vibe Mo... @YingjunWu
4K Followers 1K Following Founder @RisingWaveLabs. stream processing, lakehouses, random AI stuffs. Previously @awscloud Redshift, @IBMResearch Almaden. PhD @NUSingapore @CMUDB.
Nimtable @nimtable
54 Followers 0 Following The open-source control plane for your Iceberg lakehouse.
Databricks @databricks
81K Followers 1K Following Databricks is the data and AI company, helping data + AI teams solve the world’s toughest problems.
Tim Berglund @tlberglund
12K Followers 1K Following VP DevRel at @Confluent. Father of three, grandfather of four. Believer in Christ. Opinions should be your own.
Zheng Hu @openinx
19 Followers 189 Following Software Engineer at Databricks. Apache Iceberg & HBase PMC member. Open source evangelist. Tech writer.
boris tane @boristane
8K Followers 2K Following building workers observability @cloudflaredev, prev founder @baselimehq (acquired by cloudflare)
Cloudflare Developers @CloudflareDev
45K Followers 123 Following Have questions, or building something cool with Cloudflare's Developer products? We're here to help. For help with your account please try @CloudflareHelp
Micah Wylde @mwylde
594 Followers 328 Following Stream processing @Cloudflare. Prev co-founder @ArroyoSystems, @Splunk, @LyftEng, @GetSift.
Negar Arabzadeh @NegarEmpr
1K Followers 962 Following Postdoc at @UCBerkeley Sky Lab | Interested in Information Retrieval | 👩🏻‍💻 Prev: @google, @MSFTResearch, @SpotifyResearch 📚: @UWaterloo
Nadim Hossain @NadimHossain
4K Followers 1K Following CPO | ex VP Product @Databricks | Founder @BrightFunnel | ex @Uber ATG | Talk about #data #ai #analytics #saas #selfdriving #vc #sf | #tennis #boxing fan