Kevin Liu @kevinjqliu
Seattle, WA Joined December 2016-
Tweets241
-
Followers225
-
Following385
-
Likes2K
you can query #MicrosoftFabric DWH in your laptop using #duckdb thanks to the apache iceberg rest catalog
There's now an Apache Datafusion <> Apache Iceberg integration in Python datafusion.apache.org/python/user-gu… huge thanks to the datafusion community 🥳
Today's Future Data Systems Seminar Speaker: @RussSpitzer will present the internals of @ApacheIceberg's query planner and execution engine. Zoom talk open to public at 4:30pm ET. YouTube video available after: db.cs.cmu.edu/events/futured…
First Look at #onelake #apacheiceberg REST Catalog, please notice it is coming soon and not in production yet #MicrosoftFabric youtu.be/_QRE-2u3DQ4
Zero-copy, bi-directional data access, powered by Apache Iceberg and the Iceberg REST Catalog protocol. We built the OneLake Table API on open standards. And It just works! With this Snowflake integration and all open source clients in the Iceberg ecosystem.
Zero-copy, bi-directional data access, powered by Apache Iceberg and the Iceberg REST Catalog protocol. We built the OneLake Table API on open standards. And It just works! With this Snowflake integration and all open source clients in the Iceberg ecosystem.
2 months ago, I got access to a beta release of #onelake #Apacheiceberg REST Catalog, first thing I run it with #duckdb in my laptop😀
Iceberg Support in DuckDB youtube.com/watch?v=kJkpVX… Thanks for presenting @the_Tmonster
Open Table Format (OTF) の Podcast、 #OTFTalk 第26回を公開しました。AWSの疋田さん(べりんぐさん) @_Bassari をゲストに「PyIcebergの活用」についてお話をうかがいました。 @simosako creators.spotify.com/pod/profile/ot…
We want `brew install tpchgen-cli` to work, but that requires the project to be "popular" enough according to homebrew (30 forks, 30 watchers and 75 stars) We just need 6 more forks and 24 watchers. Can you help us out? github.com/clflushopt/tpc… Deets: github.com/clflushopt/tpc…
🚀 Manage Lance Tables Seamlessly Across Any Catalog with Lance Namespace & Apache Spark! Lance Namespace (t.ly/ACv8L) is an open specification that standardizes how you access and manage collections of Lance tables across metadata services like Apache Hive…
tldr: the cutting edge in open-source file formats is now a Linux Foundation project 🚀🚀🚀
Got upgraded on my flight to NYC this morning and everyone in 1st class had their laptops out, using Iceberg as their table format. I looked back into the rest of the plane - everyone was still using Hive tables. Was just a very interesting observation.
first slowly and then all at once
I know a dev from #OneLake, and he showed me something I can't even hint at, otherwise, @JoshCaplan1984 will send some killer ninjas. It's just beautiful
Analytics Accelerator Library for Amazon S3 and Iceberg docs.google.com/document/d/13s… A very interesting set of optimizations for reading Iceberg/parquet from S3. cc @andrewlamb1111
Just wrote a new blog post: how I spent a year making the world's fastest Parquet loader in JavaScript, and all the optimizations that went into it. TLDR: Hyparquet can load a parquet file from S3 in 155ms. While duckdb-wasm took 3466ms the same file!
Amazing product from an even more amazing team. Congrats on the new round!! Super excited to see the future of multimodal
Amazing product from an even more amazing team. Congrats on the new round!! Super excited to see the future of multimodal
The Apache Iceberg™ v3 table spec is officially ratified! A huge thank you to the entire open source community for this collaborative effort. This unlocks powerful new capabilities for developers, including native row lineage for reliable CDC workflows, client-side table…
#DataAISummit features a ton of sessions focused on open source and is the place to explore the latest advancements in Apache Spark, Delta Lake, Apache Iceberg, MLflow, Unity Catalog and more. With a career rooted in open source, @dennylee shares some of his top session picks…

Miguel @MiguelyLina
6 Followers 400 Following Single mother, Dutch nationality, immigrated to the United States, has lived in the US for 18 years
Rilella Aubly @RAubly14722
0 Followers 277 Following
Johnson Lee @Johnson40235768
6 Followers 304 Following
Marc Cenac @mrcnc84
2 Followers 16 Following
simongaoo @simongaoo90155
4 Followers 48 Following
べりんぐ @_Bassari
999 Followers 773 Following Apache Iceberg探窟家です。最近はOpenSearchにハマっています blog https://t.co/fpqNQIpkQw work @awscloud All views are my own
Marc ⛅️ @MarcSelwan
716 Followers 603 Following Product @Cloudflare working on R2 Data Catalog, Pipelines, and R2 SQL 🧊. Databases, streaming, distributed systems, tech, games, & 🎸
nandhu kishore @nandhukishore91
60 Followers 1K Following
RAJESH P S @RajPulkunta
72 Followers 2K Following Data Viz Enthusiast | Tableau - Desktop Certified Associate & Desktop Specialist | Power BI Data Analyst Associate
Shane Tohill @ShaneToh
233 Followers 3K Following
Alex Dupler - y @alexdupler
2K Followers 1K Following Sr PM @ Microsoft Advertising (Bing Ads) focused on BI & Data Infra. Mostly reading & posting about Mariners, Power BI, Tech, and Politics.
BenMakesDataEasy @data_ben
4K Followers 7K Following Follower of Jesus | #SQLServer | Python | Machine Learning | Building an app for #PowerBI | https://t.co/SOfRhybr2Z | Helping make data easier to work with
Miqert @Miqert919
16 Followers 1K Following
Fwuihau @Fwuihau5935
19 Followers 1K Following
Will Manning @_willmanning
450 Followers 1K Following CEO, Co-founder @SpiralDB; TSC Chair @vortexdotdev
Amr Awadallah 🤖 @awadallah
36K Followers 16K Following Founder/CEO of @Vectara (Trusted GenAI for the Enterprise). Founder/ex-CTO @Cloudera, ex-VP at Yahoo & Google. PhD EE Stanford. IG, FB, LI: @awadallah.
Emma @Machoman1992
343 Followers 5K Following Good food, yoga, horseback riding, long walks, and meaningful work make up my tranquility. Life is not perfect, but I have learned to love it
Maria @klotz81maria
306 Followers 3K Following
Eguouawqqu @Eguouawqqu3492
23 Followers 2K Following
Twircar @Twircar9902
19 Followers 1K Following
Phillip LeBlanc @leblancphill
473 Followers 223 Following Co-founder @spice_ai - Building composable, ready-to-use data and AI infra in Spice․ai OSS
Ameen Patel @Ameen_ml
1K Followers 1K Following Inference @PrimeIntellect, prev @togethercompute, @AmazonScience, @uwaterloo
Derek Moore @derekm00r3
3K Followers 6K Following Scientist, technologist, programmer, entrepreneur, engineer https://t.co/CDu5AxFN1m
Josh Caplan @JoshCaplan1984
3K Followers 253 Following Product lead for Microsoft OneLake. Formally worked on SQL Server Analysis Services and Power BI. Opinions are my own.
Tpt @Tpt93
399 Followers 425 Following #RDF, #SPARQL, @Wikisource and @Wikidata enthusiast. @Oxigraph main developer. Mastodon : @[email protected]
Krishna Vishal @EigenVectorizer
617 Followers 750 Following Building @ApacheIggy | Low latency message streaming | OSS | 🎓 @iitmadras '17
Orin Kuhic @OKuhic55651
84 Followers 4K Following
Tim Berglund @tlberglund
12K Followers 1K Following VP DevRel at @Confluent. Father of three, grandfather of four. Believer in Christ. Opinions should be your own.
FrameIsEverything @_frameframe_
104 Followers 2K Following
Satheytur @SatheyturwHqBf
8 Followers 303 Following
Darius @scdarius
131 Followers 389 Following
Lacey0621 @Sung76926020
22 Followers 928 Following I am an entrepreneur, mainly in the clothing foreign trade, I like to travel, fitness, camping, hiking, food, I like to read the news on X and see some truth, a
Nairrtas @NairrtasrDZeL
33 Followers 4K Following
Craig Kerstiens @craigkerstiens
9K Followers 884 Following Product and eng @crunchydata. I blog at https://t.co/K49pnYYXpL Curate https://t.co/0DWATfO0yf. Previously @Microsoft, @citusdata, @Heroku, Truviso
Thyrue @ThyruemIAJXd
135 Followers 3K Following
Vipul Vaibhaw @vaibhaw_vipul
14K Followers 2K Following Founding Engineer @pre6ai Open source ❤️. Math and Systems. Most posts are notes to myself.
Justyna Lucznik @JustynaLucznik
6K Followers 332 Following Program Manager on the Azure Synapse team focusing on Spark & Data Science. Passionate about statistics, ML and data visualization.
tom ebergen @the_Tmonster
51 Followers 63 Following
OTF Talk @OTFTalk
97 Followers 3 Following OTF Talk は、OTF: Open Table Format の技術的な解説や最新トピック等をゲストをむかえてお話をうかがうPodcastです。Spotify, Amazon Music, Apple Podcast, Youtube等でお聞きいただけます。 host: @simosako #otftalk
べりんぐ @_Bassari
999 Followers 773 Following Apache Iceberg探窟家です。最近はOpenSearchにハマっています blog https://t.co/fpqNQIpkQw work @awscloud All views are my own
Andrew Jefferson @EastlondonDev
4K Followers 2K Following Vibe Engineering Influencer vibe-tools https://t.co/s1EEYGPne1 Yo! MCP https://t.co/L1EgmWqGYm Formerly CTO@Bobsled, Eng @ Neo4j, Tractable & Apple
Marc ⛅️ @MarcSelwan
716 Followers 603 Following Product @Cloudflare working on R2 Data Catalog, Pipelines, and R2 SQL 🧊. Databases, streaming, distributed systems, tech, games, & 🎸
polars data @DataPolars
7K Followers 7 Following Dataframes powered by a multithreaded, vectorized query engine, written in Rust.
Ritchie Vink @RitchieVink
3K Followers 169 Following Author of Polars | CEO & Founder Polars Inc | Building scalable Polars
Ritvik Kapila @RitvikKapila
238 Followers 205 Following ML Research @Essential_AI, MS CS @UCSanDiego, B. Tech. @iitdelhi
Spiral @SpiralDB
623 Followers 12 Following Multimodal warehousing that works with the tools you love. “Storage Packed In Recursive Arrays & Layers”
Will Manning @_willmanning
450 Followers 1K Following CEO, Co-founder @SpiralDB; TSC Chair @vortexdotdev
Shengjia Zhao @shengjia_zhao
52K Followers 231 Following Chief Scientist @ Meta MSL. Formerly MTS @ OpenAI, PhD @ Stanford. I train models. All opinions my own.
Kenny Daniel @platypii
1K Followers 2K Following Machine Learning 🤖 Parachutes 🪂 and Bunnies 🐰 Formerly Algorithmia. Currently using JavaScript to make better AI.
DSPy @DSPyOSS
11K Followers 50 Following An open-source declarative framework for building modular AI software. Programming—not prompting—LLMs via higher-level abstractions & optimizers.
Nick Frichette @Frichette_n
6K Followers 2K Following Staff Security Researcher @datadoghq | DEF CON/Black Hat main stage speaker | he/him | OSCP OSWE | Tweets are my own | Created https://t.co/QGWMJjv9pc
Jonathan Frankle @jefrankle
20K Followers 733 Following Chief AI Scientist @databricks via MosaicML.
AJ Stuyvenberg @astuyve
8K Followers 2K Following AWS Hero 💫 Staff eng @Datadoghq AKA Aaron Stuyvenberg Ask me about your p99
ParadeDB @paradedb
1K Followers 4 Following The transactional Elasticsearch alternative built on Postgres ⭐ Star us: https://t.co/UL5Eovbw2O
Pat Patterson 🇬�... @metadaddy
7K Followers 2K Following Dad, husband, ultrarunner, Chief Technical Evangelist at @Backblaze. Previously @Citrix, @StreamSets, @SalesforceDevs, @Huawei, @SunMicrosystems.
M12 - Microsoft's Ven... @M12vc
12K Followers 314 Following Our mission is to accelerate the future of technology through investments, insights, and meaningful partnership with Microsoft.
Sandeep Pawar @PawarBI
5K Followers 1K Following Principal PM @ Microsoft Fabric CAT | 🇮🇳🇺🇲 https://t.co/ZDAcUyygbo | Tweets & opinions are my own
LanceDB @lancedb
3K Followers 52 Following Developer-friendly, open source AI-Native Multimodal Lakehouse https://t.co/wXn4tw66HV
Maxime Rivest 🧙... @MaximeRivest
4K Followers 786 Following Easy LLM context for all! ✨pip install attachments Inspired by: ggplot2, DSPy, claudette, dplyr, OpenWebUI! Follow for: API design, AI, and Data 🐍CC📜🛠 maker
Ning Sun @Sunng
2K Followers 846 Following Programmer, @Greptime. #Clojure and #Rustlang are my favorites. (Arch)Linux user. Map and book lover. Board(War)game addict. Birding. Husband.
Phillip LeBlanc @leblancphill
473 Followers 223 Following Co-founder @spice_ai - Building composable, ready-to-use data and AI infra in Spice․ai OSS
#DataAISummit @Data_AI_Summit
20K Followers 758 Following #DataAISummit (formerly #SparkAISummit) is the global event for the data community. The conference is organized by @Databricks.
Ameen Patel @Ameen_ml
1K Followers 1K Following Inference @PrimeIntellect, prev @togethercompute, @AmazonScience, @uwaterloo
Nikhil Benesch @nikhilbenesch
711 Followers 175 Following Systems engineer @turbopuffer. Former CTO @MaterializeInc. Accidental data enthusiast. Find me on Bluesky: https://t.co/72LSo4iKXj
changhiskhan @changhiskhan
2K Followers 1K Following CEO/Cofounder @lancedb, The AI-Native Multimodal Lakehouse. Early pandas co-author. Turning caffeine into code since the last century
Ion Stoica @istoica05
5K Followers 20 Following Professor at UC Berkeley, co-founder of Databricks, Anyscale, LMArena, Conviva.
javi santana @javisantana
15K Followers 762 Following Co-founder of @Tinybirdco - I love moving data around
Gwen (Chen) Shapira @gwenshap
28K Followers 10K Following Co-founder of @niledatabase. Making SaaS global, elastic and chill. Find me at: https://t.co/uyuHg400cp
Quentin Lhoest 🤗 @lhoestq
4K Followers 298 Following Datasets @huggingface | Open Source + HF Dataset Hub
Matt Silverlock 🐀 @elithrar
7K Followers 1K Following “who do we say the rules are for?” “other people.” • VP of product: storage & databases @cloudflare • https://t.co/OLM4gzyGsa
Arun Ulag @arunulag
7K Followers 613 Following Corporate Vice President of Azure Data @Microsoft, runs Microsoft Fabric, Azure SQL, Cosmos, Postgres, MySQL, Data Factory, Service Bus, Synapse, and Power BI
Alex Konrad @alexrkonrad
87K Followers 4K Following Founder and editor of @upstartsmediaco, a new tech publication focused on the startup ecosystem. Previously @Forbes senior editor. Email: [email protected]
Yingjun Wu // Vibe Mo... @YingjunWu
4K Followers 1K Following Founder @RisingWaveLabs. stream processing, lakehouses, random AI stuffs. Previously @awscloud Redshift, @IBMResearch Almaden. PhD @NUSingapore @CMUDB.