-
Tweets3K
-
Followers12K
-
Following727
-
Likes532
@DbrxMosaicAI DBRX outperforms @OpenAI GPT-4 on realistic, domain-specific benchmark datasets. For example, on a customer support summarization use-case👇👇👇 Still neck and neck but it shows that open models can be the no-brainer choice for actual enterprise applications.
Okay @databricks what're you cooking behind this space its so fast lmao
Today we released an open source model, DBRX, that beats all previous open source models on the standard benchmarks. The model itself is a Mixture of Experts (MoE), that's roughly twice the brains (132B) but half the cost (36B) of Llama2-70B. Making it both smart and cheap. Since…
🚨 Announcing DBRX-Medium 🧱, a new SoTA open weights 36b active 132T total parameter MoE trained on 12T tokens (~3e24 flops). Dbrx achieves 150 tok/sec while clearing a wide variety of benchmarks. Deep dive below! 1/N
Databricks Ventures is excited to share its latest investment — @glean! 🎉 We’re thrilled to accelerate their momentum as the industry’s leading AI-powered work assistant, & bring the secure deployment of #AI & knowledge building to all enterprises👇bit.ly/3wqsQ5s
🎉We’re excited to announce our $43M Series B funding led by @NEA, with continued support from @a16z and @generalcatalyst. metronome.com/blog/series-b?…
A good listen about the foundational product (spark) of a foundational software company (databricks). well done @rxin and great interview @chakrabartis sudipchakrabarti.substack.com/p/from-spark-t…
DONDA is the new FAANG Deepmind Open AI Nvidia Databricks Anthropic
This is the first time that I talked extensively about Apache Spark's history, design philosophies, and the evolutions. Over the years, there have been a lot of misconceptions (why it came into existence, memory vs disk). This should explain all of them. sudipchakrabarti.substack.com/p/from-spark-t…
Databricks Vector Search is in public preview! Augment your #LLM apps and synchronize source data, eliminating the need to write and maintain separate data pipelines. Find out why our customers already love it and how you can easily get started 👇 dbricks.co/3R9m2A1
Today, more than 80% of the table metadata updates on Databricks are AI-assisted. It took 2 engineers, 1 month, and less than $1000 in compute cost to develop the custom LLM for this task. databricks.com/blog/creating-…
The founders of Databricks put together this strategy blog on where we think data platforms are headed in the future. We're moving Databricks quickly in this direction. This is very exciting and is the outcome of the MosaicML acquisition we did earlier this year!…
“ Solid understanding of Spark and ability to write, debug, and optimize Spark code. “ I was today years old when I discovered that openAI relies heavily on Spark.
At Replit, we extensively use Spark on Databricks for all our training data work. We run highly customized transformations on code like parsing, deduping, PII redaction, code filtering, tokenization, and more, written with low-level Spark primitives. High quality data is key.
At Replit, we extensively use Spark on Databricks for all our training data work. We run highly customized transformations on code like parsing, deduping, PII redaction, code filtering, tokenization, and more, written with low-level Spark primitives. High quality data is key.
The trade-offs of Query Optimization is a *perfect* problem for #AI. Chief Architect @rxin walking us through new prediction optimizations in Databricks! #LakehouseAI #DataAISummit
We’re over the moon to announce we’ve agreed to acquire @MosaicML, a leading generative AI platform. Together, we’ll make generative AI accessible for every organization, so you can build, own & secure models with your data. databricks.com/mosaic-news
🚨SOLD OUT 🚨 We’ve sold out in-person tickets for #DataAISummit! But you can still catch all the keynotes, select technical sessions, and live-streamed community events when you join us virtually! And it’s entirely free. Hurry & register: bit.ly/3ICPryg
Big news: we've agreed to acquire @MosaicML, a leading generative AI platform. I couldn’t be more excited to join forces once the deal closes. databricks.com/mosaic-news
MPT-30B is here! Same MPT architecture, 30B parameters, > 1T tokens, 8k context window, trained on H100s, great perf (esp on coding), single-GPU inference, commercially usable, and massively upgraded instruct and chat datasets. Take it for a spin! huggingface.co/spaces/mosaicm…
Learn about Mosaic, a Databricks Labs project, for geospatial processing. Tuesday, June 28 @ 4:45pm. databricks.com/dataaisummit/s…
Matei Zaharia @matei_zaharia
39K Followers 1K Following CTO at @Databricks and CS prof at @UCBerkeley. Working on data+AI, including @ApacheSpark, @DeltaLakeOSS, @MLflow, https://t.co/94gROE5Xa0. https://t.co/nmRYAKG0LZAndy Pavlo (@andy_pav.. @andy_pavlo
29K Followers 205 Following Associate Prof. of Databases @CarnegieMellon. Co-Founder @OtterTuneAIDatabricks @databricks
70K Followers 1K Following Databricks is the data and AI company, helping data teams solve the world’s toughest problems.Delta Lake @DeltaLakeOSS
8K Followers 66 Following Delta Lake is an open-source storage framework that enables building a Lakehouse architecture for Spark, Flink, Trino, Hive, Scala, Java, Rust, Python, & more!Joe Hellerstein @joe_hellerstein
15K Followers 894 Following Berkeley CS Prof, focused on data and computation.Gwen (Chen) Shapira @gwenshap
26K Followers 9K Following Co-founder of @niledatabase. Making SaaS global, elastic and chill. Find me at: https://t.co/uyuHg400cpPeter Wang 🦋 @pwang
48K Followers 2K Following Chief AI & Co-founder @AnacondaInc; invented @pyscript_dev, @PyData @Bokeh @Datashader. Former physicist. A student of the human condition. bsky: @wang.socialEric Sammer @esammer
13K Followers 715 Following ceo at @decodableco! prev: @splunk, @rocanainc (acq'd), @cloudera. open source / dist systems / data. o'reilly author. [email protected]martin_casado @martin_casado
50K Followers 2K Following GP @ a16z ... questionable heuristics in a grossly underdetermined worldJacek Laskowski @jace.. @jaceklaskowski
7K Followers 874 Following Freelance Data Engineer | #ApacheSpark #DeltaLake #Databricks #ApacheKafka #KafkaStreams | Java Champion | @theASF | #DatabricksBeaconsMim @mim_djo
9K Followers 3K Following #Fabric Enthusiast, Small Data And self service, #Microsoftemployee since Nov 2023 , but my tweets are my ownAmr Awadallah 🤖 @awadallah
36K Followers 14K Following Founder/CEO of @Vectara (Trusted GenAI for the Enterprise). Founder/ex-CTO @Cloudera, ex-VP at Yahoo & Google. PhD EE Stanford. IG, FB, LI: @awadallah.ABC @Ubunta
3K Followers 3K Following Data & ML Infrastructure for Healthcare https://t.co/FwocCiCQAT Opinions are पड़ोसी' In 🇩🇪Berlin from 🇮🇳Kolkata/छत्तीसगढ़Arun Kumar @TweetAtAKK
5K Followers 256 Following Assoc Prof at UC San Diego CSE & HDSI. HDSI Faculty Fellow. Research on data management & ML systems. Wisconsin PhD. Freethinker. Poet. Memester. Gay. He/him.Simon Whiteley @MrSiWhiteley
3K Followers 589 Following Director of Engineering / Owner of @AdvAnalyticsUK, Speaker & Consultant. Spark Nerd. Londoner, foodie & gamer! Microsoft MVP. Databricks Beacon. He/Him.Sebastian @sscdotopen
3K Followers 2K Following Incoming professor of data management for ML at @bifoldberlin. Ex-@UvA_Amsterdam, @NYUDataScience, @Twitter intern; member of @TheASF & @EFF. Views are my own.Sarah Catanzaro @sarahcat21
12K Followers 1K Following “All methods are sacred if they are internally necessary” (GP @amplifypartners, prev @canvasvc; Head of Data @Mattermark; @palantirtech; @c4ads)holden karau @holdenkarau
17K Followers 2K Following she/her, OSS Big Data. ❤️🛵 ☕️ spark. I don't represent my employer. Live @ https://t.co/uOyeZtBXx0 , https://t.co/GB3Ok0vbVAEd Huang @dxhuang
5K Followers 1K Following Go/Rust Hacker/Co-founder & CTO of @PingCAP, Grand architect of #TiDB/#TiKV. Distributed SQL with 💗/ 🎵 nerd. Open for ☕️ @SF Bay Area, they/themHenry Robinson @HenryR
7K Followers 482 Following Infrastructure @SlackHQ. Distributed systems engineer. I have of late - but wherefore I know not - lost all my data.Raju @IitgRranjan
10 Followers 172 Followingdaniela @LunnaHtl
0 Followers 3 FollowingKenneth Apeh @apehken
96 Followers 1K Following Data Storyteller | Numbers whisperer | Data analysis, visualization, strategy | Python & Machine learning | Let's connect!Kevin Nejad @kevin_nejad
291 Followers 2K Following PhD ML & Comp.Neuro @UniofOxford prev:@UofBristol, prev. @nyuniversity, MSc Applied Maths @EdinburghUni ,BSc CompSci @KingsCollegeLonxland2023 @xland202352226
290 Followers 3K FollowingSviatoslav Makhynko @SviatMak
81 Followers 2K Following Husband & father of four (soon five) | Software Engineer | *BSD/Linux/MacOS | OSS | PsychologyRobin Hood @robinwood2015
227 Followers 2K Following If the truth is a cruel mistress, then a lie must be a nice girl....Sai Achalla @sachalla3
0 Followers 33 FollowingWhore to the Corp. @WhoreToTheCorp
96 Followers 745 Following I tweet about Technology & Business. Owning my owning. Pre-bunking bullshit.Shawn Charles🎤🔥 @ShawnBasquiat
32K Followers 3K Following 🧑🏾💻Ex-FAANG Software Engineer 🥑Senior ML Developer Advocate @ Coming Soon 🏗️Building Tech CommunitiesWangErxi @WangErxi
76 Followers 398 Followingz @helifee2
5 Followers 50 FollowingRohan Paul @rohanpaul_ai
13K Followers 1K Following ML Engineer (e/acc) 📌 https://t.co/x0IIWfnOt8 🚀 https://t.co/QEO4CKRl1b Open LLMs is Happiness 💡 Ex Deutsche & HSBC. DM for collaboration.l10u69070hlun @10znooqza49z
5 Followers 738 Following We first transfer USDT to you TRC20, you return 90% to BEP20, you get 10% , 2K per day Our co hv a large amt of USDT need to from TRC20 convert to BEP20 networkYifei Chen @cyifei2023
4 Followers 54 FollowingLeslie K @IamLeslieK
61 Followers 2K FollowingJason Checketts @jasonchecketts
1K Followers 5K Following #SOLIDWORKS #SanDiego #USC #FightOn #CAD #CAM #CAE #VELO3D #ABAQUS #PDM #PLM #bodyboarding #goengineer #ENOVIA #CATIA #DELMIA #disruptive #innovationJeff Tatarchuk @jtatarchuk
1K Followers 2K Following Co-founder @tensorwavecloud - Pioneering the next wave of AI compute. Need GPUs? DM me.venkatasai @venkyprince3
39 Followers 1K FollowingVan Tuan DANG @vtd38
2 Followers 26 FollowingJoseph Manuel Blanco .. @blanco_joseph02
70 Followers 845 Following Full Stack Software Developer Technology Enthusiast SaaS Entrepreneur 3 Years of experiencePolaris AI @PolarisAGI
14 Followers 185 FollowingwisRobert @wisRobert1
56 Followers 404 FollowingMarkus Rauhalahti, Ph.. @MRauhalahti
848 Followers 5K Following Independent researcher: molecular design/nanotech, human-AI collab, informetrics, DIY-scihw. Prev compchem/biophys/info phd&postdoc @helsinkiuniDevon Ferreira @devonf
3K Followers 2K Following Investor | Advisor | Partner @NetworkMedici | prev @avax @Immutable | ex @oakley @disney | NFA DYORfzw @notbadfzw
69 Followers 288 FollowingAI curious, Web3.0 po.. @FairAInow
11 Followers 339 Following Web2.0 _ Web3.0 _ AI_futurist ~ $UBT evangelistshebli mikailli @sheblidat
80 Followers 681 FollowingHogan Happy @HoganisHappy
29 Followers 110 FollowingHarold @haroldmoma
432 Followers 4K Following Solutions Architect at @AWSCloud. I read RFCs for fun. musings about tech,music,politics,books and others. Opinions here my own.Kaiming @ AutoMQ @wan0573
49 Followers 539 Following Architect & Lead Evangelist @AutoMQ_lab. Formerly lead CDC Platform @alibaba_cloud & co-founder @CloudCanal. Interested in data streaming & CDC.Alo @Hal90910
0 Followers 2K Followingraywang @raywang5800
3 Followers 99 FollowingShirin @acodingtraveler
1K Followers 4K Following Software Engineer👩🏻💻 | Former Tech Transfer Specialist + Innovative Partnerships @NASA Tech+Design | ML+DS | @UCLA @Caltech | Writer | MusicianSteffi Li @steffi_li
99 Followers 360 Following Product & GTM @zilliz_universe. @Wharton alum. Cosmopolitan. Adventurer. Antevasin.Aniruth Narayanan @aniruthn
36 Followers 147 Following people are on https://t.co/UMyVNl8DVs apparently so guess i'm here too @ucberkeleymet | apm @databricks, @retool, @tesla, @ey_us, @microsoft, @workivaRyan Carson @ryancarson
143K Followers 11K Following CEO Founder for 20 years 👉 Built and sold 3 startups 🧑💻 Senior AI Dev Community Lead @intel 💬 Opinions my ownMarcel Mao @mzp0514
0 Followers 66 FollowingBlack. @MickaelBaye
288 Followers 3K Following Crazy Developer / Basket-ball Lover / Esport fan / Starcraft forever newbie / That's just me ! =PAbhishek Tripathi @cloudandtechie
29 Followers 244 Following ⚡️ Data • SQL • Data Engineering 👨💻 Big Data • Databricks . BI Architect ☁️ Microsoft Azure Certified Data Engineer #dataengineering #dataengineerSonali Kharwar @SonaliKharwar6
1 Followers 144 Followingnull @from_0_to_null
396 Followers 1K FollowingMatei Zaharia @matei_zaharia
39K Followers 1K Following CTO at @Databricks and CS prof at @UCBerkeley. Working on data+AI, including @ApacheSpark, @DeltaLakeOSS, @MLflow, https://t.co/94gROE5Xa0. https://t.co/nmRYAKG0LZAndy Pavlo (@andy_pav.. @andy_pavlo
29K Followers 205 Following Associate Prof. of Databases @CarnegieMellon. Co-Founder @OtterTuneAIDatabricks @databricks
70K Followers 1K Following Databricks is the data and AI company, helping data teams solve the world’s toughest problems.Delta Lake @DeltaLakeOSS
8K Followers 66 Following Delta Lake is an open-source storage framework that enables building a Lakehouse architecture for Spark, Flink, Trino, Hive, Scala, Java, Rust, Python, & more!Joe Hellerstein @joe_hellerstein
15K Followers 894 Following Berkeley CS Prof, focused on data and computation.Gwen (Chen) Shapira @gwenshap
26K Followers 9K Following Co-founder of @niledatabase. Making SaaS global, elastic and chill. Find me at: https://t.co/uyuHg400cpPeter Wang 🦋 @pwang
48K Followers 2K Following Chief AI & Co-founder @AnacondaInc; invented @pyscript_dev, @PyData @Bokeh @Datashader. Former physicist. A student of the human condition. bsky: @wang.socialmartin_casado @martin_casado
50K Followers 2K Following GP @ a16z ... questionable heuristics in a grossly underdetermined worldJacek Laskowski @jace.. @jaceklaskowski
7K Followers 874 Following Freelance Data Engineer | #ApacheSpark #DeltaLake #Databricks #ApacheKafka #KafkaStreams | Java Champion | @theASF | #DatabricksBeaconsPeter Boncz @peterabcz
1K Followers 71 Following Professor Analytical Data Systems @cwi_da and @VUamsterdam. researcher, systems architect, educator, entrepreneurAmr Awadallah 🤖 @awadallah
36K Followers 14K Following Founder/CEO of @Vectara (Trusted GenAI for the Enterprise). Founder/ex-CTO @Cloudera, ex-VP at Yahoo & Google. PhD EE Stanford. IG, FB, LI: @awadallah.Arun Kumar @TweetAtAKK
5K Followers 256 Following Assoc Prof at UC San Diego CSE & HDSI. HDSI Faculty Fellow. Research on data management & ML systems. Wisconsin PhD. Freethinker. Poet. Memester. Gay. He/him.Martin Kleppmann @martinkl
50K Followers 993 Following Find me at @martin.kleppmann.com on Bluesky, @[email protected] on Mastodon. Author of @intensivedata, Associate Professor @Cambridge_CL. he/himKelly Sommers @kellabyte
47K Followers 325 Following Backend brat, big data, distributed diva. Relentless learner. Voids warranties. BitEarther. World isn’t round or a flat plane, it’s a simulation on a flat fileholden karau @holdenkarau
17K Followers 2K Following she/her, OSS Big Data. ❤️🛵 ☕️ spark. I don't represent my employer. Live @ https://t.co/uOyeZtBXx0 , https://t.co/GB3Ok0vbVAEd Huang @dxhuang
5K Followers 1K Following Go/Rust Hacker/Co-founder & CTO of @PingCAP, Grand architect of #TiDB/#TiKV. Distributed SQL with 💗/ 🎵 nerd. Open for ☕️ @SF Bay Area, they/themHenry Robinson @HenryR
7K Followers 482 Following Infrastructure @SlackHQ. Distributed systems engineer. I have of late - but wherefore I know not - lost all my data.Andreas Kipf @andreaskipf
1K Followers 481 Following Professor @utn_nuremberg. Research on the intersection of data systems and ML. 🏊♂️🚴♂️🏃♂️Databricks Mosaic Res.. @DbrxMosaicAI
30K Followers 115 Following We remove the barriers to state-of-the-art generative AI model development and make data + AI available to all.Tianqi Chen @tqchenml
15K Followers 973 Following AssistProf @mldcmu and @CSDatCMU. Chief Technologist @OctoML. Creator of @XGBoostProject, @ApacheMXNet, @ApacheTVM. Member https://t.co/QYyfjQNp4p, @TheASF.Hunter (Let's hop on .. @huntercoldcalls
32K Followers 337 Following Cold Caller ☎️ Hustler🏃250 dials before you wake up ☀️Sending solid gold leads to my AE 💪 Let's hop on a quick 5 minute call 📅Alex Cohen 🤠 @anothercohen
189K Followers 1K Following Having fun on the internet. Building something new. Side project: https://t.co/XmAstakS3p. Previously led consumer product @carbonhealthMichael Carbin @mcarbin
3K Followers 371 Following Associate Professor in EECS at @MIT | Founding Advisor at @mosaicml | Programming Systems | Neural Networks | Approximate ComputingRam Parameswaran @_ram_
23K Followers 5K Following Founder : Octahedron | Research : https://t.co/kw2dvBKY3PJonathan Frankle @jefrankle
16K Followers 685 Following Chief Scientist, Neural Networks @Databricks via MosaicML. PhD @MIT_CSAIL. BS/MS @PrincetonCS. DC area native. Making AI efficient for everyone at @DbrxMosaicAILauren Balik @laurenbalik
11K Followers 1K Following Data Wrangler. Investor. Bearish on most things. Human capital, infrastructure, SaaS. Writing things here: https://t.co/9Edqal5SsLGergely Orosz @GergelyOrosz
249K Followers 2K Following Writing @Pragmatic_Eng, the #1 technology newsletter on Substack. Author of @EngGuidebook. Formerly Uber & Skype.George Porter @georgemporter
4K Followers 937 Following Computer Science Professor at @UCSD, focusing on networking and systems.Eric Topol @EricTopol
694K Followers 589 Following physician-scientist, author, editor. Ground Truths: https://t.co/YhatcBT0hAShant Hovsepian @superdupershant
433 Followers 472 Following Data, Viz, SQL, Systems, Linux; ex-Co-Founder & ex-CTO;Ihab Ilyas @ihabilyas
1K Followers 194 Following Professor of Computer Science at the University of Waterloo @uwaterloo, @Apple, co-founder: @Tamr_Inc, Inductiv, https://t.co/O9CXNkEZ4HEthan Mollick @emollick
211K Followers 552 Following Professor @Wharton studying AI, innovation & startups. Democratizing education using tech Book: https://t.co/CSmipbJ2jV Substack: https://t.co/UIBhxu4bgqMatthew Rocklin @mrocklin
9K Followers 99 Following Open source maintainer. @dask_dev author. CEO at @CoiledHQ Additionally I try to be a decent human and help the world from melting.Dominik Moritz @domoritz
4K Followers 1K Following HTML decoder. Prof @cmuhcii @cmudig and researcher @apple. Interactive vis tools (e.g. @vega_vis). PhD @uwcse @uwdata. Also at https://t.co/LRqaJK44lXTao Feng @photoft45
601 Followers 2K Following TLM @ Databricks | ex Lyft Data Platform | Apache Airflow PMC | co-creator of Amundsen (LF AI) | Views are my ownNadim Hossain @NadimHossain
4K Followers 1K Following VP Product @Databricks | Founder @BrightFunnel | ex @Uber ATG | Talk about #data #ai #analytics #saas #selfdriving #vc #sf | #tennis #boxing fanBarr Moses @BM_DataDowntime
3K Followers 1K Following Co-Founder and CEO of Monte Carlo. https://t.co/FUUJPkkyDtTiark Rompf @tiarkrompf
2K Followers 541 Following Purdue University (We're hiring! grad students, post-docs, faculty)Zaheera Valani @zaheerav
489 Followers 586 Following Mom/Wife; #Seattle Site Director @databricks leading partner integration #Engineering #Product #Data #AI @tableau & @microsoft alum 👨👩👦👦🇰🇪 ➡️🇨🇦➡️🇺🇸Internal Tech Emails @TechEmails
525K Followers 900 Following Internal tech industry emails that surface in public records. 🔍Trung Phan @TrungTPhan
699K Followers 4K Following Write on business with @workweekinc. Co-host @niapodcast. Building an AI research app: https://t.co/fZ5ObIyBGIKanit Ham Wong @kanitw
2K Followers 875 Following Visualization at @databricks. Co-author of Vega-Lite, Voyager, and TensorFlow Graph Visualizer. Formerly @uwdata and @apple. Views are my own.Hyukjin Kwon @HyukjinKwon4
14 Followers 9 FollowingAnastasia Ailamaki @ailamaki
1K Followers 54 Following Professor of Computer Science at the Swiss Federal Institute of Technology (EPFL)✨ Jean Yang ✨ @jeanqasaur
25K Followers 4K Following New Product @getpostman. Founded @akitasoftware. Programming, APIs, and developer experience. Former programming languages professor @CSDatCMU.Barzan Mozafari @BarzanMozafari
966 Followers 94 Following Associate Professor at the University of Michigan, Ann Arbor. Co-founder at Keebo, Inc.Pat Helland @PatHelland
5K Followers 1K Following Building distributed systems & databases since 1978. Now at Salesforce. Dropped out of UC Irvine in 1976. Write for ACM Queue & blog @ https://t.co/MYYTVzxjyjYiying Zhang @yiying__zhang
1K Followers 99 Following Associate Professor of Computer Science at UCSD. Data-Center Systems, Cloud Systems, Systems and Machine LearningGideon Yu @gideonyu
16K Followers 120 Following Co-Owner & Former President @49ers | Former CFO @Facebook & @YouTube | Lead Initial Investor @Square | Board of Directors @PGA | ChristianKaren Bajza-Terlouw @kbajza13
916 Followers 599 Following Community and DevRel, looking for my next 🏠 | Previously Airbyte, Databricks, Docker | Upcycler @ https://t.co/sUxZAGe0gd Thoughts are my own.Nate Silver @NateSilver538
3.4M Followers 2K Following New Book, On The Edge, August 13: https://t.co/WeCLEOd4BeLennart C. L. Kats @lennartcl
339 Followers 214 Following Software engineer @databricks. Previously principal engineer @awscloud, CTO @cloud9ide, creator @spoofax. CompSci PhD. Views my own.Stefan Zeiger @StefanZeiger
4K Followers 194 Following Scala coder, dev tools @databricks, ex Slick tech lead, ex Scala compiler team memberAdrian Ionescu @adrionescu
7 Followers 20 Followingmmhmm @mmhmmapp
18K Followers 424 Following Turn heads on video. Make every virtual meeting better by being on screen with your presentation. For support: https://t.co/41usttLsKFOH undergrad in hallways at Berkeley CS: "if you look at my code, its *self-documenting*". 🐣
One of the biggest tragedies in academia is how some days there are two events that provide cookies and some days there are none.
Here are some great conversation starters when talking to family over the Christmas Holiday: 1. What are your goals for the New Year? 2. How is your job going? 3. Who is the decision maker at your company? 4. Can you intro me to them during a quick call later this week?
I cannot bring a child into this world and have him believe that rockets need wings to generate lift. It's a travesty. What, he is just going to live his life thinking there is air in space?!
@rxin Now when people ask me about the origins of @ApacheSpark I don’t have to rack my aging memories and can just point at this.
@joe_hellerstein If only the silos were broken down by teaching new grad students a combined DB/Systems course instead of two separate courses... 😁
Amsterdam is increasingly a place for data systems companies, with @weaviate_io, @duckdblabs, @motherduck, @DataPolars, @ClickHouseDB, @citusdata & @MonetDB. By far biggest is @databricks with 100s of engineers creating e.g., Delta Lake bit.ly/3RkBRVR - 🙏 @rxin et al!
A friend who works at Google said he wanted to switch to bi weekly sprints on his team. His manager took him aside and said they don't use the term "sprints" on his teams at Google because it's insensitive language. Because not everyone is able to run fast.
@matei_zaharia @rxin How about adding an AI-generated knowledge graph to the data dictionary? One step closer.....
@rxin I loved this sentence - "The SaaS LLMs are an engineering marvel, but they are very general models that need to address all the use cases from table generation to conversing about the meaning of life. The generality means it needs to have an extremely large number of parameters,…
At Replit, we extensively use Spark on Databricks for all our training data work. We run highly customized transformations on code like parsing, deduping, PII redaction, code filtering, tokenization, and more, written with low-level Spark primitives. High quality data is key.
“ Solid understanding of Spark and ability to write, debug, and optimize Spark code. “ I was today years old when I discovered that openAI relies heavily on Spark.
@marcua @sh_reya @_yujiewang @StringChaos And I've missed you @marcua! It's been a blast to revisit your sorts and joins paper with @sirrice @samrmadden as well as work from others in the old "crowd crowd" @jiannwang @tim_kraska @rxin @alkispolyzotis
.@rxin's @Data_AI_Summit keynote: @ApacheSpark has turned 10 (since it became a @TheASF project)! @derrickharris (@FortuneMagazine ) wrote in 2015 that "Spark is the Taylor Swift of #data" #opensource #BigData #ApacheSpark #dataAISummit
Can’t beat @rxin with @taylorswift13 #DataAISummit
The trade-offs of Query Optimization is a *perfect* problem for #AI. Chief Architect @rxin walking us through new prediction optimizations in Databricks! #LakehouseAI #DataAISummit