Databricks

{{Short description|American software company}}

{{Use mdy dates|date=March 2024}}

{{Infobox company

|logo = Databricks_logo.svg

|name = Databricks, Inc.

|type = Private

|industry = Computer software

|founders = {{ubl|Ali Ghodsi|Andy Konwinski|Ion Stoica|Patrick Wendell|Reynold Xin|Matei Zaharia|Arsalan Tavakoli}}

|key_people = {{ubl|Ali Ghodsi (CEO)|Ion Stoica (Executive chairman)}}

|location_city = San Francisco, California

|location_country = United States

|foundation = {{Start date and age|2013}}{{cite web|url=https://www.reuters.com/technology/databricks-nears-record-95-billion-vc-raise-eyes-extra-45-billion-debt-2024-12-13/|title=Exclusive: Databricks nears record $9.5 billion VC raise, eyes extra $4.5 billion debt|website=Reuters|date=December 13, 2024|author1=Krystal Hu|author2=Kenrick Cai|author3=Echo Wang|access-date=December 13, 2024}}

|area_served =

| revenue = {{Increase}} $1.6 billion (2023){{Cite news |last=Lin |first=Belle |date=2024-03-06 |title=AI is Driving Record Sales at Multibillion-Dollar Databricks. An IPO Can Wait … |url=https://www.wsj.com/articles/ai-is-driving-record-sales-at-multibillion-dollar-databricks-an-ipo-can-wait-f8a55bd4 |work=The Wall Street Journal |url-access=subscription |archive-url=https://archive.today/20240306145258/https://www.wsj.com/articles/ai-is-driving-record-sales-at-multibillion-dollar-databricks-an-ipo-can-wait-f8a55bd4 |archive-date=2024-03-06 |url-status=live}}

| num_employees = {{circa|8,000}} (2025){{cite web |url=https://www.cnbc.com/2025/01/22/meta-backs-databricks-as-the-data-analytics-startup-inches-toward-ipo.html |title=Meta backs Databricks as the data analytics startup inches toward IPO |date=2025-01-22 |publisher=CNBC |first=Jordan |last=Novet |archive-url=https://web.archive.org/web/20250122160822/https://www.cnbc.com/2025/01/22/meta-backs-databricks-as-the-data-analytics-startup-inches-toward-ipo.html |archive-date=2025-01-22 |url-status=live }}

|homepage = {{URL|databricks.com}}

}}

Databricks, Inc. is a global data, analytics, and artificial intelligence (AI) company, founded in 2013 by the original creators of Apache Spark.{{Cite news|url=https://www.forbes.com/sites/dereksaul/2023/09/14/top-ipo-prospect-databricks-scores-43-billion-valuation-thanks-to-500-million-funding-round-including-ai-titan-nvidia/?sh=5750590b6bce/|title=Top IPO Prospect Databricks Scores $43 Billion Valuation Thanks To $500 Million Funding Round Including AI Titan Nvidia|last=Saul|first=Derek|date=2023-09-14|newspaper=Forbes|access-date=2024-03-26|archive-date=2024-09-04|archive-url=https://web.archive.org/web/20240904181924/https://www.forbes.com/sites/dereksaul/2023/09/14/top-ipo-prospect-databricks-scores-43-billion-valuation-thanks-to-500-million-funding-round-including-ai-titan-nvidia/?sh=5750590b6bce/|url-status=live}} The company provides a cloud-based platform to help enterprises build, scale, and govern data and AI, including generative AI and other machine learning models.{{Cite news |url=https://www.fastcompany.com/91033457/databricks-most-innovative-companies-2024/ |title=How Databricks is helping customers develop their own customized AI models |last=Sullivan |first=Mark |date=2024-03-19 |newspaper=Fast Company|access-date=2024-03-19}}

Databricks pioneered the data lakehouse, a data and AI platform that combines the capabilities of a data warehouse with those of a data lake, allowing organizations to manage and use both structured and unstructured data for traditional business analytics and AI workloads.{{Cite news |url=https://www.theregister.com/2023/11/16/databricks_sinks_lakehouse_in_bid/ |title=Databricks' lakehouse becomes foundation under fresh layer of AI dreams |last=Clark |first=Lindsay |date=2023-11-16 |newspaper=The Register |access-date=2023-11-16 |archive-date=2024-09-04 |archive-url=https://web.archive.org/web/20240904181923/https://www.theregister.com/2023/11/16/databricks_sinks_lakehouse_in_bid/ |url-status=live }} The company also develops Delta Lake, an open-source project to bring reliability to data lakes for machine learning and other data science use cases.{{Cite web |date=2019-04-24 |title=Databricks launches Delta Lake, an open source data lake reliability project |url=https://venturebeat.com/2019/04/24/databricks-launches-delta-lake-an-open-source-data-lake-reliability-project/ |access-date=2021-04-06 |website=VentureBeat |language=en-US |archive-date=2022-03-24 |archive-url=https://web.archive.org/web/20220324004540/https://venturebeat.com/2019/04/24/databricks-launches-delta-lake-an-open-source-data-lake-reliability-project/ |url-status=live }}

History

=2013-2021=

{{stack|File:DatabricksBooth.jpg}}

Databricks grew out of the AMPLab project at the University of California, Berkeley, which was involved in creating Apache Spark, an open-source distributed computing framework built atop Scala.{{Cite web |date=2021-09-08 |title=Databricks, SiFive, and Anyscale founders explain how they all built their red-hot startups out of a legendary UC Berkeley lab |url=https://www.businessinsider.com/uc-berkeley-labs-databricks-sifive-anyscale-riselab-amplab-2021-9 |access-date=2025-05-18 |website=Business Insider |language=en-US}} The company was founded by Ali Ghodsi, Andy Konwinski, Arsalan Tavakoli-Shiraji, Ion Stoica, Matei Zaharia, Patrick Wendell, and Reynold Xin.{{Cite web |date=2023-03-03 |title=Founders |url=https://www.databricks.com/company/founders |access-date=2025-05-18 |website=Databricks |language=en-US}}

In November 2017, Databricks was announced as a first-party service on Microsoft Azure through the Azure Databricks integration.{{Cite web |title=Microsoft makes Databricks a first-party service on Azure |url=https://techcrunch.com/2017/11/15/microsoft-makes-databricks-a-first-party-service-on-azure/ |access-date=2021-04-06 |website=TechCrunch |date=15 November 2017 |language=en-US |archive-date=September 4, 2024 |archive-url=https://web.archive.org/web/20240904181924/https://techcrunch.com/2017/11/15/microsoft-makes-databricks-a-first-party-service-on-azure/ |url-status=live }}

In February 2021, together with Google Cloud, Databricks provided integration with the Google Kubernetes Engine and Google's BigQuery platform.{{Cite web |title=Databricks brings its lakehouse to Google Cloud |url=https://techcrunch.com/2021/02/17/databricks-brings-its-lakehouse-to-google-cloud/ |access-date=2021-02-18 |website=TechCrunch |date=17 February 2021 |language=en-US |archive-date=September 4, 2024 |archive-url=https://web.archive.org/web/20240904181925/https://techcrunch.com/2021/02/17/databricks-brings-its-lakehouse-to-google-cloud/ |url-status=live }} At that time, the company said more than 5,000 organizations used its products.{{Cite web |last=Konrad |first=Alex |date=February 2, 2021 |title=Databricks Raises $1 Billion At $28 Billion Valuation, With The Cloud's Elite All Buying In |url=https://www.forbes.com/sites/alexkonrad/2021/02/01/databricks-at-28-billion-valuation-from-aws-google-microsoft-salesforce/ |access-date=July 29, 2021 |website=Forbes |language=en |archive-date=February 1, 2021 |archive-url=https://web.archive.org/web/20210201183055/https://www.forbes.com/sites/alexkonrad/2021/02/01/databricks-at-28-billion-valuation-from-aws-google-microsoft-salesforce/ |url-status=live }}

Fortune ranked Databricks as one of the "Best Large Workplaces for Millennials" in 2021.{{Cite magazine |url=https://fortune.com/best-workplaces-millennials/2021/databricks/ |title=100 Best Large Workplaces for Millennials |date=June 16, 2021 |magazine=Fortune |access-date=2021-07-16 |archive-date=March 24, 2022 |archive-url=https://web.archive.org/web/20220324004537/https://fortune.com/best-workplaces-millennials/2021/databricks/ |url-status=live }}

=2022-present=

In November 2023, Databricks unveiled the Databricks Data Intelligence Platform, a new offering that combines the unification benefits of the lakehouse with MosaicML’s Generative AI technology to enable customers to better understand and use their own proprietary data.{{Cite news|url=https://www.forbes.com/sites/kenrickcai/2023/11/14/databricks-data-intelligence-platform-mosaicml/?sh=2a494f717c1a/|title=Databricks' New AI Product Adds A ChatGPT-Like Interface To Its Software|last=Cai|first=Kenrick|date=2023-11-16|newspaper=Forbes|publisher=|access-date=2023-11-16|archive-date=2024-09-04|archive-url=https://web.archive.org/web/20240904181923/https://www.forbes.com/sites/kenrickcai/2023/11/14/databricks-data-intelligence-platform-mosaicml/?sh=2a494f717c1a/|url-status=live}}

The firm was valued at $62 billion in December 2024, following one of the largest funding rounds in history, which raised an amount equal to the largest single AI investment ever made.{{Cite news |date=2024-12-17 |title=Why AI company Databricks just scored one of the biggest funding rounds in history |url=https://www.fastcompany.com/91248820/why-ai-company-databricks-just-scored-one-of-the-biggest-funding-rounds-in-history |work=Fast Company}}

In early March 2025, Databricks announced it would invest $1 billion in San Francisco's downtown.{{Citation |last=Waxmann|first=Laura |date=March 5, 2025 |title=San Francisco tech company Databricks to invest $1 billion in city |publisher=San Francisco Chronicle |url=https://www.sfchronicle.com/sf/article/databricks-investment-ai-company-20205000.php |access-date=March 30, 2025}}

Databricks partnered with Anthropic in March 2025, with the latter's AI products to be made available on the Databricks Data Intelligence Platform.{{Citation |date=March 27, 2025 |title=Databricks and Anthropic partner to help companies build AI agents |publisher=The Hindu |url=https://www.thehindu.com/sci-tech/technology/databricks-and-anthropic-partner-to-help-companies-build-ai-agents/article69381019.ece |access-date=March 30, 2025}} The deal was for five years and $100 million.{{Citation |last=Lin |first=Belle |date=March 26, 2025 |title=Anthropic, Databricks Team Up in Scramble for AI Revenue |publisher=The Wall Street Journal |url=https://www.wsj.com/articles/anthropic-databricks-team-up-in-scramble-for-ai-revenue-e15fe750 |access-date=March 30, 2025}} Ali Ghodsi remains CEO of Databricks.

Acquisitions

In June 2020, Databricks bought Redash, an open-source tool for data visualization and building of interactive dashboards.{{Cite web |date=24 June 2020 |title=Databricks acquires Redash, a visualizations service for data scientists |url=https://techcrunch.com/2020/06/24/databricks-acquires-redash-a-visualizations-service-for-data-scientists/ |access-date=2021-04-06 |website=TechCrunch |language=en-US}} In 2021, it bought German no-code company 8080 Labs, whose product, bamboolib, allowed data exploration without any coding.{{cite web |url=https://www.cnbc.com/2021/10/06/hot-software-start-up-databricks-makes-no-code-acquisition-.html |title=$38 billion software start-up Databricks makes acquisition to leave code behind |website=CNBC |date=October 6, 2021 |author=Eric Rosenbaum |access-date=February 20, 2022 |archive-date=October 6, 2021 |archive-url=https://web.archive.org/web/20211006142433/https://www.cnbc.com/2021/10/06/hot-software-start-up-databricks-makes-no-code-acquisition-.html |url-status=live }} In May 2023, Databricks bought data security group Okera, extending Databricks' data governance capabilities.{{cite web |last=Palazzolo |first=Stephanie |date=May 3, 2023 |title=Exclusive: $38 billion data and AI darling Databricks acquires security startup Okera |url=https://www.businessinsider.com/data-artificial-intelligence-startup-databricks-acquire-governance-security-okera-2023-5 |url-access=subscription |url-status=live |archive-url=https://web.archive.org/web/20230503195102/https://www.businessinsider.com/data-artificial-intelligence-startup-databricks-acquire-governance-security-okera-2023-5 |archive-date=May 3, 2023 |website=Business Insider}} In June, it bought the open-source generative AI startup MosaicML for $1.3{{nbsp}}billion.{{cite news |last1=Datta |first1=Tiyashi |last2=Hu |first2=Krystal |date=June 26, 2023 |title=Databricks strikes $1.3 billion deal for generative AI startup MosaicML |url=https://www.reuters.com/markets/deals/databricks-strikes-13-bln-deal-generative-ai-startup-mosaicml-2023-06-26/ |publisher=Reuters |access-date=June 27, 2023 |archive-date=June 26, 2023 |archive-url=https://web.archive.org/web/20230626130755/https://www.reuters.com/markets/deals/databricks-strikes-13-bln-deal-generative-ai-startup-mosaicml-2023-06-26/ |url-status=live }}{{cite news |last=Council |first=Stephen |date=June 26, 2023 |title=SF tech firm Databricks to buy 2-year-old startup for $21 million per employee |url=https://www.sfgate.com/tech/article/databricks-mosaicml-ghodsi-sf-tech-18171502.php |work=SFGATE |access-date=June 27, 2023 |archive-date=June 26, 2023 |archive-url=https://web.archive.org/web/20230626221915/https://www.sfgate.com/tech/article/databricks-mosaicml-ghodsi-sf-tech-18171502.php |url-status=live }} In October, Databricks bought data replication startup Arcion for $100 million.{{Cite web |date=2023-10-23 |title=After $43B valuation, Databricks acquires data replication startup Arcion for $100M |url=https://techcrunch.com/2023/10/23/after-43b-valuation-databricks-acquires-data-replication-startup-arcion-for-100m/ |access-date=2023-10-23 |website=TechCrunch |language=en-US}} In June 2024, in what is believed to be its sixth acquisition, Databricks bought Tabular, a data-management system used by open source AI, for over $1 billion.{{Cite news |date=5 June 2024 |editor-last=Galloni |editor-first=Alessandra |title=Databricks to buy data management firm Tabular for over $1 bln |url=https://www.reuters.com/markets/deals/databricks-buy-data-management-firm-tabular-over-1-bln-2024-06-04/ |work=Reuters}}

In March 2023, in response to the popularity of OpenAI's ChatGPT, the company introduced an open-source language model, named Dolly after Dolly the sheep, that allowed developers to create chatbots. Dolly uses fewer parameters to produce results similar to ChatGPT's, but Databricks had not released formal benchmark tests to show whether its bot actually matched the performance of ChatGPT.{{cite news |last1=Hu |first1=Krystal |last2=Nellis |first2=Stephen |date=March 24, 2023 |title=Databricks pushes open-source chatbot as cheaper ChatGPT alternative |url=https://www.reuters.com/technology/databricks-pushes-open-source-chatbot-cheaper-chatgpt-alternative-2023-03-24/ |publisher=Reuters |archive-url=https://web.archive.org/web/20230325141855/https://www.reuters.com/technology/databricks-pushes-open-source-chatbot-cheaper-chatgpt-alternative-2023-03-24/ |archive-date=March 25, 2023 |url-status=live}}{{cite news |last=Loten |first=Angus |date=March 24, 2023 |title=Databricks Launches 'Dolly,' Another ChatGPT Rival |url=https://www.wsj.com/articles/databricks-launches-dolly-another-chatgpt-rival-31fd0f5f |newspaper=The Wall Street Journal |url-access=subscription |archive-url=https://archive.today/20230324125524/https://www.wsj.com/amp/articles/databricks-launches-dolly-another-chatgpt-rival-31fd0f5f |archive-date=March 24, 2023 |url-status=live}}{{cite news |last=Goldman |first=Sharon |date=March 24, 2023 |title=Databricks debuts ChatGPT-like Dolly, a clone any enterprise can own |url=https://venturebeat.com/ai/databricks-debuts-chatgpt-like-dolly-a-clone-any-enterprise-can-own/ |newspaper=VentureBeat |archive-url=https://web.archive.org/web/20230411011910/https://venturebeat.com/ai/databricks-debuts-chatgpt-like-dolly-a-clone-any-enterprise-can-own/ |archive-date=April 11, 2023 |url-status=live}}

Databricks reported $1.6 billion in revenue for the 2023 fiscal year, an increase of more than 50% over the previous year.{{cite web |last1=Miller |first1=Ron |last2=Wilhelm |first2=Alex |title=Databricks keeps marching forward with $1.6B in revenue |url=https://techcrunch.com/2024/03/07/databricks-revenue-numbers-ipo/ |website=TechCrunch |access-date=8 March 2024 |date=7 March 2024 |archive-date=March 12, 2024 |archive-url=https://web.archive.org/web/20240312035412/https://techcrunch.com/2024/03/07/databricks-revenue-numbers-ipo/ |url-status=live }}

In 2025, Databricks acquired a serverless database startup, Neon,{{Cite web |date=2025-05-13 |title=Databricks Agrees to Acquire Neon to Deliver Serverless Postgres for Developers + AI Agents |url=https://www.databricks.com/company/newsroom/press-releases/databricks-agrees-acquire-neon-help-developers-deliver-ai-systems |access-date=2025-05-16 |website=Databricks |language=en-US}} for around $1 billion.{{Cite web |last=Novet |first=Jordan |date=2025-05-14 |title=Databricks is buying database startup Neon for about $1 billion |url=https://www.cnbc.com/2025/05/14/databricks-is-buying-database-startup-neon-for-about-1-billion.html |access-date=2025-05-16 |website=CNBC |language=en}}

Funding

In September 2013, Databricks announced it raised $13.9 million from Andreessen Horowitz and said it aimed to offer an alternative to Google's MapReduce system.{{cite web |url=https://gigaom.com/2013/09/25/databricks-raises-14m-from-andreessen-horowitz-wants-to-take-on-mapreduce-with-spark/ |title=Databricks raises $14M from Andreessen Horowitz, wants to take on MapReduce with Spark |last=Harris |first=Derrick |date=September 25, 2013 |access-date=September 28, 2014 |archive-date=January 15, 2022 |archive-url=https://web.archive.org/web/20220115071749/https://gigaom.com/2013/09/25/databricks-raises-14m-from-andreessen-horowitz-wants-to-take-on-mapreduce-with-spark/ |url-status=dead }}{{cite web |url=http://radar.oreilly.com/2013/09/databricks-aims-to-build-next-generation-analytic-tools-for-big-data.html |title=Databricks aims to build next-generation analytic tools for Big Data |last=Lorica |first=Ben |date=September 25, 2013 |access-date=September 28, 2014 |publisher=O'Reilly Media |archive-date=July 4, 2014 |archive-url=https://web.archive.org/web/20140704090043/http://radar.oreilly.com/2013/09/databricks-aims-to-build-next-generation-analytic-tools-for-big-data.html |url-status=live }} Microsoft was a noted investor of Databricks in 2019, participating in the company's Series E at an unspecified amount.{{Cite web |title=Databricks raises $250M at a $2.75B valuation for its analytics platform |url=https://techcrunch.com/2019/02/05/databricks-raises-250m-series-e-for-its-analytics-platform/ |access-date=2021-04-08 |website=TechCrunch |date=5 February 2019 |language=en-US |archive-date=September 4, 2024 |archive-url=https://web.archive.org/web/20240904181925/https://techcrunch.com/2019/02/05/databricks-raises-250m-series-e-for-its-analytics-platform/ |url-status=live }}{{Cite web |last=Novet |first=Jordan |date=2019-02-05 |title=Microsoft used to scare start-ups but is now an 'outstandingly good partner,' says Silicon Valley investor Ben Horowitz 
|url=https://www.cnbc.com/2019/02/04/microsoft-invests-in-databricks-funding-at-2point7-billion-valuation.html |access-date=2021-04-06 |website=CNBC |language=en |archive-date=February 5, 2019 |archive-url=https://web.archive.org/web/20190205175057/https://www.cnbc.com/2019/02/04/microsoft-invests-in-databricks-funding-at-2point7-billion-valuation.html |url-status=live }} As of February 2021, the company had raised $1.9 billion in funding, including a $1 billion Series G led by Franklin Templeton at a $28 billion post-money valuation. Other investors include Amazon Web Services, CapitalG (a growth equity firm under Alphabet Inc.) and Salesforce Ventures. In August 2021, Databricks finished its eighth round of funding by raising $1.6 billion and valuing the company at $38 billion.{{cite news |last=Mellor |first=Chris |date=2021-09-01 |title=Databricks raises data lake of cash at monstrous $38bn valuation |url=https://blocksandfiles.com/2021/09/01/databricks-raises-data-lake-of-cash-at-monstrous-38bn-valuation/ |accessdate=2021-09-04 |work=Blocks & Files |archive-date=September 1, 2021 |archive-url=https://web.archive.org/web/20210901152223/https://blocksandfiles.com/2021/09/01/databricks-raises-data-lake-of-cash-at-monstrous-38bn-valuation/ |url-status=live }}

In December 2024, Databricks announced a $10 billion financing at a valuation of $62 billion.{{Cite news |date=2024-12-17 |title=Databricks Is Raising $10 Billion, in One of the Largest Venture Capital Deals |work=The New York Times |url=https://www.nytimes.com/2024/12/17/technology/databricks-funding-venture-capital-deals.html |archive-url=http://web.archive.org/web/20241218031025/https://www.nytimes.com/2024/12/17/technology/databricks-funding-venture-capital-deals.html |archive-date=2024-12-18 |access-date=2024-12-19 |language=en |last1=Griffith |first1=Erin }}

{| class="wikitable"
|+Funding rounds
!Series
!Date
!Amount (million $)
!Lead investors
|-
|A
|2013
|13.9
|Andreessen Horowitz
|-
|B
|2014
|33{{cite web |last=Miller |first=Ron |date=June 30, 2014 |title=Databricks Snags $33M In Series B And Debuts Cloud Platform For Processing Big Data |url=https://techcrunch.com/2014/06/30/databricks-snags-33m-in-series-b-and-debuts-cloud-platform-for-processing-big-data/ |access-date=September 28, 2014 |publisher=TechCrunch |archive-date=July 1, 2014 |archive-url=https://web.archive.org/web/20140701061716/https://techcrunch.com/2014/06/30/databricks-snags-33m-in-series-b-and-debuts-cloud-platform-for-processing-big-data/ |url-status=live }}
|New Enterprise Associates
|-
|C
|2016
|60{{Cite web |last=Shieber |first=Jonathan |title=Databricks raises $60 million to be big data's next great leap forward |url=https://techcrunch.com/2016/12/15/databricks-raises-60-million-to-be-big-datas-next-great-leap-forward/ |access-date=2016-12-16 |website=TechCrunch |date=15 December 2016 |archive-date=December 15, 2016 |archive-url=https://web.archive.org/web/20161215181533/https://techcrunch.com/2016/12/15/databricks-raises-60-million-to-be-big-datas-next-great-leap-forward/ |url-status=live }}
|New Enterprise Associates
|-
|D
|2017
|140{{Cite web |title=Databricks Secures $140 Million to Accelerate Analytics and Artificial Intelligence in the Enterprise |url=https://databricks.com/company/newsroom/press-releases/databricks-secures-140-million-accelerate-analytics-artificial-intelligence-enterprise |access-date=2019-05-16 |website=Databricks |date=22 August 2017 |language=en-US |archive-date=January 13, 2022 |archive-url=https://web.archive.org/web/20220113182651/https://databricks.com/company/newsroom/press-releases/databricks-secures-140-million-accelerate-analytics-artificial-intelligence-enterprise |url-status=live }}
|Andreessen Horowitz
|-
|E
|Feb. 2019
|250{{Cite web |title=Databricks' $250 Million Funding Supports Explosive Growth and Global Demand for Unified Analytics; Brings Valuation to $2.75 Billion |url=https://databricks.com/company/newsroom/press-releases/databricks-250-million-funding-supports-explosive-growth-and-global-demand-for-unified-analytics-brings-valuation-to-2-75-billion |access-date=2019-02-05 |website=Databricks |date=5 February 2019 |language=en-US |archive-date=January 15, 2022 |archive-url=https://web.archive.org/web/20220115063219/https://databricks.com/company/newsroom/press-releases/databricks-250-million-funding-supports-explosive-growth-and-global-demand-for-unified-analytics-brings-valuation-to-2-75-billion |url-status=live }}
|Andreessen Horowitz
|-
|F
|Oct. 2019
|400{{Cite web |title=Databricks announces $400M round on $6.2B valuation as analytics platform continues to grow |url=https://techcrunch.com/2019/10/22/databricks-announces-400m-round-on-6-2b-valuation-as-analytics-platform-continues-to-grow/ |access-date=2019-10-24 |website=TechCrunch |date=22 October 2019 |language=en-US |archive-date=September 4, 2024 |archive-url=https://web.archive.org/web/20240904182030/https://techcrunch.com/2019/10/22/databricks-announces-400m-round-on-6-2b-valuation-as-analytics-platform-continues-to-grow/ |url-status=live }}
|Andreessen Horowitz
|-
|G
|Feb. 2021
|1,000{{Cite web |title=Databricks raises $1B at $28B valuation as it reaches $425M ARR |url=https://techcrunch.com/2021/02/01/databricks-raises-1b-at-28b-valuation-as-it-reaches-425m-arr/ |access-date=2021-02-14 |website=TechCrunch |date=February 2021 |language=en-US |archive-date=November 3, 2021 |archive-url=https://web.archive.org/web/20211103061840/https://techcrunch.com/2021/02/01/databricks-raises-1b-at-28b-valuation-as-it-reaches-425m-arr/ |url-status=live }}
|Franklin Templeton Investments
|-
|H
|Aug. 2021
|1,600{{Cite web |title=Databricks raises $1.6B at $38B valuation as it blasts past $600M ARR |url=https://techcrunch.com/2021/08/31/databricks-raises-1-6b-at-38b-valuation-as-it-blasts-past-600m-arr/ |access-date=2021-07-01 |website=TechCrunch |language=en-US |archive-date=December 30, 2021 |archive-url=https://web.archive.org/web/20211230073558/https://techcrunch.com/2021/08/31/databricks-raises-1-6b-at-38b-valuation-as-it-blasts-past-600m-arr/ |url-status=live }}
|Morgan Stanley
|-
|I
|Sep. 2023
|500{{Cite web |last1=Nishant |first1=Niket |last2=Hu |first2=Krystal |date=2023-09-14 |title=Databricks raises over $500 mln at $43 bln valuation |url=https://www.reuters.com/technology/databricks-raises-over-500-mln-43-bln-valuation-2023-09-14/ |access-date=2023-09-20 |publisher=Reuters |language=en-US}}
|Capital One Ventures, Nvidia
|-
|J
|Dec. 2024
|10,000{{Cite web |last=Tan |first=Huileng |date=December 18, 2024 |title=Databricks is raising a gigantic funding round |url=https://www.businessinsider.com/databricks-ai-series-j-tech-funding-round-valuation-ghodsi-thrive-2024-12 |website=Business Insider}}
|Thrive Capital
|}

Products

Databricks develops and sells a cloud data platform using the marketing term "lakehouse", a portmanteau of "data warehouse" and "data lake".{{Cite journal |last1=Armbrust |first1=Michael |last2=Ghodsi |first2=Ali |last3=Xin |first3=Reynold |last4=Zaharia |first4=Matei |date=January 2021 |title=Lakehouse: A New Generation of Open Platforms that Unify Data Warehousing and Advanced Analytics |url=http://cidrdb.org/cidr2021/papers/cidr2021_paper17.pdf |journal=Conference on Innovative Data Systems Research |access-date=July 29, 2021 |archive-date=December 22, 2020 |archive-url=https://web.archive.org/web/20201222143821/http://cidrdb.org/cidr2021/papers/cidr2021_paper17.pdf |url-status=live }} Databricks' Lakehouse is based on the open-source Apache Spark framework that allows analytical queries against semi-structured data without a traditional database schema.{{Cite web |date=2021-02-01 |title=With massive $1B infusion, Databricks takes aim at IPO and rival Snowflake |url=https://siliconangle.com/2021/02/01/massive-1b-infusion-databricks-takes-aim-ipo-rival-snowflake/ |access-date=2021-04-08 |website=SiliconANGLE |language=en-US |archive-date=April 6, 2023 |archive-url=https://web.archive.org/web/20230406192640/https://siliconangle.com/2021/02/01/massive-1b-infusion-databricks-takes-aim-ipo-rival-snowflake/ |url-status=live }} In October 2022, Lakehouse received FedRAMP authorized status for use with the U.S. federal government and contractors.{{cite news |last=Simone |first=Stephanie |url=https://www.kmworld.com/Articles/News/News/Databricks-achieves-FedRAMP-Authorized-status-155465.aspx |title=Databricks achieves FedRAMP Authorized status |work=KMWorld |publisher=Information Today |date=2022-10-17 |accessdate=2022-10-20 |archive-date=October 20, 2022 |archive-url=https://web.archive.org/web/20221020153530/https://www.kmworld.com/Articles/News/News/Databricks-achieves-FedRAMP-Authorized-status-155465.aspx |url-status=live }}
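
The idea of querying semi-structured data without a schema declared in advance can be illustrated with a minimal sketch. It uses only the Python standard library rather than Spark itself; the sample records and the helper names <code>read_records</code> and <code>total_by</code> are hypothetical and are not part of any Databricks or Spark API.

```python
import json
from collections import defaultdict

# Hypothetical event log: one JSON object per line, with fields that
# differ from record to record (no table schema is declared anywhere).
RAW_EVENTS = """
{"user": "ana", "action": "view", "ms": 120}
{"user": "ben", "action": "buy", "ms": 340, "amount": 25.0}
{"user": "ana", "action": "buy", "amount": 10.0, "coupon": "SPRING"}
"""


def read_records(text):
    """Parse one JSON object per non-empty line; structure is discovered at read time."""
    return [json.loads(line) for line in text.splitlines() if line.strip()]


def total_by(records, key, value):
    """Sum `value` grouped by `key`, skipping records that lack either field."""
    totals = defaultdict(float)
    for rec in records:
        if key in rec and value in rec:
            totals[rec[key]] += rec[value]
    return dict(totals)


records = read_records(RAW_EVENTS)
spend = total_by(records, "user", "amount")
print(spend)
```

The query tolerates records with missing or extra fields, which is the property the paragraph above attributes to schema-less analytical queries; a distributed engine applies the same principle across many machines.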

The company has also created Delta Lake, MLflow and Koalas, open source projects that span data engineering, data science and machine learning.{{Cite web |title=The Two Sigma Ventures Open Source Index |url=https://twosigmaventures.com/open-source-index/ |url-status=live |archive-url=https://web.archive.org/web/20221129150944/https://twosigmaventures.com/open-source-index/ |archive-date=November 29, 2022 |access-date=2021-04-08 |website=Two Sigma Ventures |language=en}}{{Cite web |title=MLOps Tools - Ranking. OSS Insight |url=https://ossinsight.io/collections/ml-ops-tools |url-status=live |archive-url=https://web.archive.org/web/20240904182429/https://ossinsight.io/collections/ml-ops-tools/ |archive-date=September 4, 2024 |access-date=2024-04-03 |website=OSS Insight |language=en}}

In June 2020, Databricks launched Delta Engine, a fast query engine for Delta Lake,{{Cite web |date=2020-06-24 |title=Databricks Cranks Delta Lake Performance, Nabs Redash for SQL Viz |url=https://www.datanami.com/2020/06/24/databricks-cranks-delta-lake-performance-nabs-redash-for-sql-viz/ |access-date=2021-04-08 |website=Datanami |archive-date=July 9, 2020 |archive-url=https://web.archive.org/web/20200709021845/https://www.datanami.com/2020/06/24/databricks-cranks-delta-lake-performance-nabs-redash-for-sql-viz/ |url-status=live }} compatible with Apache Spark and MLflow.{{Cite web |date=2019-04-24 |title=Databricks launches Delta Lake, an open source data lake reliability project |url=https://venturebeat.com/2019/04/24/databricks-launches-delta-lake-an-open-source-data-lake-reliability-project/ |access-date=2021-04-08 |website=VentureBeat |language=en-US |archive-date=March 24, 2022 |archive-url=https://web.archive.org/web/20220324004540/https://venturebeat.com/2019/04/24/databricks-launches-delta-lake-an-open-source-data-lake-reliability-project/ |url-status=live }}

In November 2020, Databricks introduced Databricks SQL (previously called SQL Analytics) for running business intelligence and analytics reporting on top of data lakes. Analysts can query data sets with standard SQL or use connectors to integrate with business intelligence tools such as Holistics, Tableau, Qlik, Sigma Computing, Looker, and ThoughtSpot.{{Cite web |title=Databricks launches SQL Analytics |url=https://techcrunch.com/2020/11/12/databricks-launches-sql-analytics-builds-itself-a-lake-house/ |access-date=2021-04-08 |website=TechCrunch |date=2020-11-12 |language=en-US |archive-date=2024-09-04 |archive-url=https://web.archive.org/web/20240904182532/https://techcrunch.com/2020/11/12/databricks-launches-sql-analytics-builds-itself-a-lake-house/ |url-status=live }}

Databricks offers a platform for other workloads, including machine learning, data storage and processing, streaming analytics, and business intelligence.{{Cite web |last=Brust |first=Andrew |title=Databricks, champion of data "lakehouse" model, closes $1B series G funding round |url=https://www.zdnet.com/article/databricks-champion-of-data-lakehouse-model-closes-1b-series-g-funding-round/ |access-date=2021-04-08 |website=ZDNet |language=en |archive-date=2021-02-01 |archive-url=https://web.archive.org/web/20210201193337/https://www.zdnet.com/article/databricks-champion-of-data-lakehouse-model-closes-1b-series-g-funding-round/ |url-status=live }}

In early 2024, Databricks released the Mosaic set of tools for customizing, fine-tuning and building AI systems. It includes AI Vector Search for building retrieval-augmented generation (RAG) models; AI Model Serving, a service for deploying, governing, querying and monitoring models fine-tuned or pre-deployed by Databricks; and AI Pretraining, a platform for enterprises to create their own LLMs.{{Cite news |url=https://siliconangle.com/2024/03/13/data-powered-ai-revolutionizing-enterprise-databricks-supercloud6/ |title=Riding the data-powered AI wave: Inside Databricks' unified stack solution |date=2024-03-14 |newspaper=SiliconANGLE |language=en-US |access-date=2024-04-05 |archive-date=September 4, 2024 |archive-url=https://web.archive.org/web/20240904182429/https://siliconangle.com/2024/03/13/data-powered-ai-revolutionizing-enterprise-databricks-supercloud6/ |url-status=live }}

In March 2024, Databricks released DBRX, an open-source foundation model. It has a mixture-of-experts architecture and is built on the MegaBlocks open-source project.{{Cite news |url=https://siliconangle.com/2024/03/27/databricks-open-sources-large-language-model/ |title=Databricks open-sources its own large language model, DBRX |date=2024-03-27 |newspaper=SiliconANGLE |language=en-US |access-date=2024-04-05 |archive-date=April 5, 2024 |archive-url=https://web.archive.org/web/20240405134948/https://siliconangle.com/2024/03/27/databricks-open-sources-large-language-model/ |url-status=live }} DBRX cost $10 million to create. At the time of launch, it was the fastest open-source LLM, based on commonly used industry benchmarks. It outperformed other models, such as Llama 2, at solving logic puzzles and answering general knowledge questions, among other tasks. Although it has 136 billion parameters, it uses only 36 billion, on average, to generate outputs.{{Cite news |url=https://www.wired.com/story/dbrx-inside-the-creation-of-the-worlds-most-powerful-open-source-ai-model/ |title=Inside the Creation of the World's Most Powerful Open Source AI Model |date=2024-03-27 |newspaper=Wired |language=en-US |access-date=2024-04-05 |archive-date=September 4, 2024 |archive-url=https://web.archive.org/web/20240904182552/https://www.wired.com/story/dbrx-inside-the-creation-of-the-worlds-most-powerful-open-source-ai-model/ |url-status=live }} DBRX also serves as a foundation on which companies can build or customize their own AI models, and companies can use proprietary data to generate higher-quality outputs for specific use cases.{{Cite news |url=https://www.fastcompany.com/91070566/databricks-ai-model-enterprise |title=Databricks' new open-source AI model could offer enterprises a leaner alternative to OpenAI's GPT-3.5 |date=2024-03-27 |newspaper=Fast Company |language=en-US |access-date=2024-04-05 |archive-date=September 4, 2024 |archive-url=https://web.archive.org/web/20240904182431/https://www.fastcompany.com/91070566/databricks-ai-model-enterprise |url-status=live }}
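
The gap between a mixture-of-experts model's total and active parameter counts can be illustrated with a minimal sketch of top-k expert routing. This is a generic illustration of the technique, not DBRX's or MegaBlocks' actual implementation. The function names, the expert count, the value of k and the parameter sizes are all hypothetical toy values chosen for readability.

```python
import math


def softmax(scores):
    """Convert raw router scores into probabilities."""
    m = max(scores)  # subtract the max for numerical stability
    exps = [math.exp(s - m) for s in scores]
    total = sum(exps)
    return [e / total for e in exps]


def route_top_k(router_scores, k):
    """Return (expert_index, weight) pairs for the k highest-scoring experts.

    Weights are renormalized so the selected experts' weights sum to 1.
    """
    probs = softmax(router_scores)
    top = sorted(range(len(probs)), key=lambda i: probs[i], reverse=True)[:k]
    norm = sum(probs[i] for i in top)
    return [(i, probs[i] / norm) for i in top]


def total_parameters(shared, per_expert, n_experts):
    """Parameters stored in the model: shared layers plus every expert."""
    return shared + n_experts * per_expert


def active_parameters(shared, per_expert, k):
    """Parameters used for one token: shared layers plus the k chosen experts."""
    return shared + k * per_expert


# Toy configuration (hypothetical, not DBRX's real sizes):
# 8 experts, 2 chosen per token, sizes in arbitrary units.
picked = route_top_k([0.1, 2.0, -1.0, 1.5, 0.0, 0.3, -0.5, 0.2], k=2)
print("experts used for this token:", [i for i, _ in picked])
print("total parameters:", total_parameters(shared=2, per_expert=1, n_experts=8))
print("active parameters:", active_parameters(shared=2, per_expert=1, k=2))
```

In this toy configuration every expert is stored, but each token passes through only the experts the router selects, so the active count is a fraction of the total.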

In addition to building the Databricks platform, the company has co-organized massive open online courses about Spark{{Cite news |date=2014-12-02 |title=Databricks to run two massive online courses on Apache Spark |url=https://databricks.com/blog/2014/12/02/announcing-two-spark-based-moocs.html |url-status=live |archive-url=https://web.archive.org/web/20220113191350/https://databricks.com/blog/2014/12/02/announcing-two-spark-based-moocs.html |archive-date=January 13, 2022 |access-date=2016-12-16 |newspaper=Databricks |language=en-US}} and a conference for the Spark community called the Data + AI Summit,{{Cite web |title=Data + AI Summit |url=https://databricks.com/dataaisummit |url-status=live |archive-url=https://web.archive.org/web/20220423234458/https://databricks.com/dataaisummit |archive-date=April 23, 2022 |access-date=2021-04-08 |website=Databricks |language=en-US}} formerly known as Spark Summit.{{Cite web |title=Highlights from DATA+AI Summit 2021 |url=https://towardsdatascience.com/highlights-from-data-ai-summit-2021-3abfd9aaccaa/ |date=2021-06-27 |website=Towards Data Science}}

= Collaborations =

In December 2024, Databricks, along with Wiz and Workday, adopted the new "Buy with AWS" button, which lets software vendors sell their products more easily to Amazon Web Services (AWS) customers.{{Cite web |last=Novet |first=Jordan |date=2024-12-04 |title=Amazon rolls out Buy with AWS button to let software vendors more easily sell to its cloud customers |url=https://www.cnbc.com/2024/12/04/amazon-will-sell-software-with-buy-with-aws-button-for-partner-sites.html |access-date=2024-12-08 |website=CNBC |language=en}}

Operations

Databricks is headquartered in San Francisco.{{Cite web |author=CNBC.com staff |date=2020-06-16 |title=36. Databricks |url=https://www.cnbc.com/2020/06/16/databricks-disruptor-50.html |access-date=2021-04-08 |website=CNBC |language=en |archive-date=December 24, 2022 |archive-url=https://web.archive.org/web/20221224144520/https://www.cnbc.com/2020/06/16/databricks-disruptor-50.html |url-status=live }} It also has operations in Canada, the Netherlands, the United Kingdom, and elsewhere.{{cite web |url=https://www.databricks.com/company/contact/office-locations |title=Worldwide locations |access-date=2022-10-20 |archive-date=June 7, 2023 |archive-url=https://web.archive.org/web/20230607093613/https://www.databricks.com/company/contact/office-locations |url-status=live }}

References