DataStax

{{Short description|Data management company}}

{{Use American English|date=December 2024}}

{{Use mdy dates|date=December 2024}}

{{Infobox company

| name = DataStax

| logo = DataStax Logo 2023.png

| logo_caption = Logo used since 2023

| type = Private

| location_city = Santa Clara, CA

| location_country = United States

| foundation = April 2010

| founder = {{ubl

| Jonathan Ellis

| Matt Pfeil

}}

| key_people = {{ubl

| Chet Kapoor{{cite web | url=https://www.datastax.com/blog/2019/10/announcing-our-new-ceo | title=Announcing Our New CEO }} (CEO)

| Davor Bonaci (CTO)

| Ed Anuff (CPO)

| Don Dixon (CFO)

| Brad Gyger (CRO)

| Jason McClelland (CMO)

| Chris Vogel (CPO)

}}

| industry = Database Technologies

| genre = Multi-Model DBMS

| num_employees = 800+ (June 2022)

| homepage = {{Official URL}}

}}

DataStax, Inc. is a company based in Santa Clara, California, that provides real-time data technology for AI applications.{{cite news |last1=Gage |first1=Deborah |title=DataStax Raises {{US$|long=no|106 Million}} in New Pre-IPO Round, Chips Away at Oracle |url=https://blogs.wsj.com/venturecapital/2014/09/04/datastax-raises-106-million-in-new-pre-ipo-round-chips-away-at-oracle/ |publisher=Wall Street Journal |date=4 September 2014}} Its product Astra DB is a cloud database-as-a-service based on Apache Cassandra. DataStax also offers DataStax Enterprise (DSE), an on-premises database built on Apache Cassandra, and Astra Streaming, a messaging and event streaming cloud service based on Apache Pulsar. As of June 2022, the company had roughly 800 customers in over 50 countries.{{cite news |last1=Banks |first1=Martin |title=DataStax adds Oracle to provide practical collaboration |url=https://diginomica.com/2017/10/06/datastax-adds-oracle-provide-practical-collaboration/ |publisher=Diginomica.com |date=6 October 2017}}{{cite news |last1=Clancy |first1=Heather |title=DataStax just scored a big partnership with HP. Here's why. |url=http://fortune.com/2015/04/14/datastax-hp-sales-partnership/ |publisher=Fortune |date=14 April 2015}}{{cite web |title=Cassandra vendor DataStax secures {{US$|long=no|115m}} investment for {{US$|long=no|1.6b}} valuation |url=https://www.theregister.com/2022/06/15/datastax_funding/ |website=theregister.com |access-date=August 8, 2022}}

History

DataStax was built on the open source NoSQL database Apache Cassandra. Cassandra was initially developed internally at Facebook to handle large data sets across multiple servers, and was released as an Apache open source project in 2008.{{cite news |last1=Jackson |first1=Joab |title=Apache Cassandra Ready for the Enterprise |url=https://www.cio.com/article/2403250/data-management/apache-cassandra-ready-for-the-enterprise.html |publisher=CIO |date=18 October 2011 |access-date=5 September 2018 |archive-date=6 September 2018 |archive-url=https://web.archive.org/web/20180906014025/https://www.cio.com/article/2403250/data-management/apache-cassandra-ready-for-the-enterprise.html |url-status=dead }} In 2010, Jonathan Ellis and Matt Pfeil left Rackspace, where they had worked with Cassandra, to launch Riptano in Austin, Texas.{{cite web|url=https://www.wired.com/2014/08/datastax/|title=OUT IN THE OPEN: THE ABANDONED FACEBOOK TECH THAT NOW HELPS POWER APPLE|date=4 August 2014|publisher=Wired|accessdate=18 September 2017}}{{cite news |last1=Clark |first1=Don |title=Start-Up Riptano Predicts Success With Cassandra Database |url=https://blogs.wsj.com/venturecapital/2010/10/26/start-up-predicts-success-with-cassandra-database/ |publisher=Wall Street Journal |date=26 October 2010}} Ellis and Pfeil later renamed the company DataStax, and moved its headquarters to Santa Clara, California.{{cite news |last1=Harris |first1=Derrick |title=NoSQL is growing up, and DataStax just raised $106M to prove it |url=https://gigaom.com/2014/09/04/nosql-startup-datastax-raises-106m-as-it-grows-into-being-an-enterprise-software-company/ |archive-url=https://web.archive.org/web/20140906173005/http://gigaom.com/2014/09/04/nosql-startup-datastax-raises-106m-as-it-grows-into-being-an-enterprise-software-company/ |url-status=dead |archive-date=September 6, 2014 |publisher=gigaom.com |date=4 September 2014}}

The company went on to create its own enterprise version of Cassandra, a NoSQL database called DataStax Enterprise (DSE).

In 2019, Chet Kapoor was named the company's new CEO, taking over from Billy Bosworth.{{cite web |title=Former Google VP Chet Kapoor joins DataStax as CEO |url=https://siliconangle.com/2019/10/22/former-google-vp-chet-kapoor-joins-datastax-new-ceo/ |website=siliconangle.com |date=22 October 2019 |access-date=February 22, 2021}}

In May 2020, DataStax released Astra DB, a DBaaS for Cassandra applications.{{cite web |title=Cassandra Now Officially In the Cloud with Datastax Astra |url=https://www.datanami.com/2020/05/12/cassandra-now-officially-in-the-cloud-with-datastax-astra/ |website=datanami.com |date=12 May 2020 |access-date=February 26, 2021}} In November 2020, DataStax released K8ssandra, an open source distribution of Cassandra on Kubernetes.{{cite web |title=DataStax unveils K8ssandra as cloud-native Cassandra |url=https://www.zdnet.com/article/datastax-unveils-k8ssandra-as-cloud-native-cassandra/ |website=ZDNet |access-date=February 26, 2021}} In December 2020, DataStax released Stargate, an open source data API gateway.{{cite web |title=Meet Stargate, DataStax's GraphQL for databases |url=https://www.zdnet.com/article/meet-stargate-datastaxs-graphql-for-databases-first-stop-cassandra/ |website=ZDNet |access-date=February 26, 2021}}

After acquiring streaming event vendor Kesque in January 2021,{{cite web |title=DataStax enters event streaming market with Apache Pulsar |url=https://www.techtarget.com/searchdatamanagement/news/252495492/DataStax-enters-event-streaming-market-with-Apache-Pulsar |website=techtarget.com |access-date=August 8, 2022}} the company launched Luna Streaming, a data streaming platform for Apache Pulsar.{{cite web |title=DataStax acquires Kesque |url=https://techcrunch.com/2021/01/27/datastax-acquires-kesque-as-it-gets-into-data-streaming/ |website=techcrunch.com |access-date=February 26, 2021}} DataStax then rebuilt the Kesque technology into Astra Streaming.{{cite web |title=DataStax cofounder on evolving Cassandra for modern workloads |url=https://venturebeat.com/2021/07/26/datastax-cofounder-on-evolving-cassandra-for-modern-workloads/ |website=venturebeat.com |access-date=August 8, 2022}} The Astra Streaming cloud service became generally available on June 29, 2022.{{Cite web |date=2022-06-29 |title=DataStax Astra gets support for Kafka, RabbitMQ and JMS in bid to capture the ‘full data story’ |url=https://diginomica.com/datastax-astra-gets-support-kafka-rabbitmq-and-jms-bid-capture-full-data-story |access-date=2023-03-30 |website=diginomica.com |language=en}} With the release, the company added API-level support for messaging tools Apache Kafka, RabbitMQ and Java Message Service, in addition to Apache Pulsar.{{cite web |title=DataStax Astra gets support for Kafka, RabbitMQ and JMS in bid to capture the 'full data story' |url=https://diginomica.com/datastax-astra-gets-support-kafka-rabbitmq-and-jms-bid-capture-full-data-story |website=diginomica.com |access-date=August 8, 2022}} Astra Streaming can connect to a larger data platform by utilizing DataStax's Astra DB cloud service.

In 2023, DataStax began incorporating artificial intelligence and machine learning into its platform.{{cite web |title=DataStax brings vector database search to multicloud with Astra DB |url=https://venturebeat.com/data-infrastructure/datastax-brings-vector-database-search-to-multicloud-with-astra-db/ |website=venturebeat.com |access-date=December 1, 2023}} In January 2023, the company acquired Kaskada, developer of a platform that helps organizations use data for AI applications.{{cite web |title=AI feature engineering is focus as DataStax acquires Kaskada |url=https://venturebeat.com/ai/ai-feature-engineering-is-focus-as-datastax-acquires-kaskada/ |website=venturebeat.com |access-date=December 1, 2023}} DataStax made the formerly proprietary Kaskada technology open source and integrated it into its Luna ML service, which was launched on May 4, 2023.{{cite web |title=DataStax extends AI feature engineering with Luna ML |url=https://venturebeat.com/ai/datastax-extends-ai-feature-engineering-with-luna-ml/ |website=venturebeat.com |access-date=December 1, 2023}} With the acquisition, former Kaskada CEO Davor Bonaci was named DataStax chief technology officer and executive vice president.

On May 24, 2023, DataStax announced a partnership with ThirdAI to bring large language models to DSE and Astra DB, to help developers build generative AI applications.{{cite web |title=DataStax taps ThirdAI to bring generative AI to its database offerings |url=https://www.infoworld.com/article/3697708/datastax-taps-thirdai-to-bring-generative-ai-to-its-database-offerings.html |website=infoworld.com |access-date=December 1, 2023}}

In June 2023, the company announced the development of a GPT-based schema translator in its Astra Streaming cloud service. The Astra Streaming GPT Schema Translator uses generative AI to automatically generate schema mappings, to enable data integration and interoperability between multiple systems and data sources.{{cite web |title=DataStax Plumbs AI Into Smarter Data Pipelines |url=https://www.forbes.com/sites/adrianbridgwater/2023/06/16/datastax-plumbs-ai-into-smarter-data-pipelines/?sh=1ff0203db666 |website=forbes.com |access-date=December 1, 2023}}

On July 18, 2023, the company announced a partnership with Google to make semantic search available in its Astra DB cloud database for developers building generative AI applications.

On September 13, 2023, DataStax launched the LangStream open source project, which works with Astra DB and supports vector databases including Milvus and Pinecone. LangStream enables developers to better work with streaming data sources, using Apache Kafka technology and generative AI to help build event-driven architectures.{{cite web |title=DataStax takes aim at event-driven AI with open source LangStream project |url=https://venturebeat.com/data-infrastructure/datastax-takes-aim-at-event-driven-ai-with-open-source-langstream-project/ |website=venturebeat.com |access-date=December 1, 2023}}

In November 2023, DataStax announced RAGStack, a simplified commercial offering for RAG (retrieval-augmented generation) based on LangChain and Astra DB vector search.{{cite web |title=With RAGStack, DataStax enables generative AI models to gain additional context from third-party data |url=https://siliconangle.com/2023/11/02/ragstack-datastax-makes-easy-generative-ai-models-gain-additional-context-3rd-party-data/ |website=siliconangle.com |access-date=December 1, 2023}}

On February 25, 2025, IBM announced its intention to acquire DataStax.{{Cite web |title=IBM to Acquire DataStax, Deepening watsonx Capabilities and Addressing Generative AI Data Needs for the Enterprise |url=https://newsroom.ibm.com/2025-02-25-ibm-to-acquire-datastax,-deepening-watsonx-capabilities-and-addressing-generative-ai-data-needs-for-the-enterprise |access-date=2025-02-26 |website=IBM Newsroom |language=en-us}}{{Cite web |title=Accelerating Production AI and Bringing NoSQL Data at Scale to All Enterprises |url=https://www.datastax.com/blog/ibm-plans-to-acquire-datastax |access-date=2025-02-26 |website=DataStax |language=en}}

Products

=Astra DB=

Astra DB is available on cloud services such as Microsoft Azure, Amazon Web Services, and Google Cloud Platform.{{cite web |title=DataStax offers serverless, NoSQL Astra DB across multiple regions, clouds |url=https://www.infoworld.com/article/3633648/datastax-offers-serverless-nosql-astra-db-across-multiple-regions-clouds.html |website=infoworld.com |access-date=August 8, 2022}} In February 2021, DataStax announced the serverless version of Astra DB, offering developers pay-as-you-go pricing.{{cite web |title=DataStax Astra serverless DBaaS optimizes deployments |url=https://www.techtarget.com/searchdatamanagement/news/252496978/DataStax-Astra-serverless-DBaaS-optimizes-deployments |website=techtarget.com |access-date=August 8, 2022}}

In March 2022, DataStax introduced new change data capture (CDC) capabilities to its Astra DB cloud service. Astra DB CDC is powered by Apache Pulsar, which allows developers to manage operational and streaming data in one place.{{cite web |title=DataStax CEO: Every use case doesn't need a new database |url=https://www.infoworld.com/article/3656950/datastax-ceo-every-use-case-doesnt-need-a-new-database.html |website=infoworld.com |access-date=August 8, 2022}} DataStax leads the open-source Starlight project, which provides a compatibility layer for different protocols on top of Apache Pulsar.{{cite web |title=DataStax extends Astra Streaming event data platform |url=https://www.techtarget.com/searchdatamanagement/news/252522180/DataStax-extends-Astra-Streaming-event-data-platform |website=techtarget.com |access-date=August 8, 2022}}

On February 8, 2023, DataStax launched Astra Block, a cloud service based on the Ethereum blockchain to support building Web3 applications, available as part of Astra DB. Developers can use Astra Block to stream enhanced data from the Ethereum blockchain to build or scale Web3 experiences on Astra DB.{{cite web |title=DataStax launches Astra Block to support Web3 applications |url=https://www.infoworld.com/article/3687057/datastax-launches-astra-block-to-support-web3-applications.html |website=infoworld.com |access-date=December 1, 2023}}

Astra DB supports open source LangChain technology, making it easier for developers to create generative AI applications.

=DSE=

Version 1.0 of DataStax Enterprise (DSE), released in October 2011, was the first commercial distribution of the Cassandra database, designed to provide real-time application performance and heavy analytics on the same physical infrastructure.{{cite news |last1=Cohan |first1=Peter |title=DataStax Partners With Oracle In $46B Database Market |url=https://www.forbes.com/sites/petercohan/2017/11/24/datastax-partners-with-oracle-in-46-billion-database-market/#4834af397f44 |work=Forbes.com |date=24 Nov 2017 |access-date=5 September 2018 |archive-date=5 September 2018 |archive-url=https://web.archive.org/web/20180905215540/https://www.forbes.com/sites/petercohan/2017/11/24/datastax-partners-with-oracle-in-46-billion-database-market/#4834af397f44 |url-status=dead }}{{cite news |last1=Harris |first1=Derrick |title=DataStax gets $11M, fuses NoSQL and Hadoop |url=https://gigaom.com/2011/09/20/datastax-gets-11m-fuses-nosql-and-hadoop/ |archive-url=https://archive.today/20130124002104/http://gigaom.com/2011/09/20/datastax-gets-11m-fuses-nosql-and-hadoop/ |url-status=dead |archive-date=January 24, 2013 |publisher=gigaom.com |date=20 September 2011}} It grew to include advanced security controls, graph database models, operational analytics and advanced search capabilities.{{cite news |last1=Carey |first1=Scott |title=How DataStax wants its NoSQL platform to drive the 'right now economy' |url=https://www.computerworlduk.com/data/how-datastax-wants-its-nosql-platform-drive-right-now-economy-3664812/ |publisher=Computerworld UK |date=4 October 2017 |access-date=5 September 2018 |archive-date=5 September 2018 |archive-url=https://web.archive.org/web/20180905214937/https://www.computerworlduk.com/data/how-datastax-wants-its-nosql-platform-drive-right-now-economy-3664812/ |url-status=dead }}

In April 2016, the company announced the release of DataStax Enterprise Graph, adding graph data model functionality to DSE.{{cite news |last1=Miller |first1=Ron |title=DataStax adds graph databases to enterprise Cassandra product set |url=https://techcrunch.com/2016/04/12/datastax-adds-graph-databases-to-enterprise-cassandra-product-set/ |publisher=techcrunch.com |date=12 April 2016}}

In March 2017, DataStax announced the release of DSE 5.1, which included improvements to search, security controls, graph data management and operational analytics performance. DataStax also announced a shift in strategy, with an added focus on customer experience applications. Rather than introducing a new set of technologies, the company started to offer advice on best practices to users of its core DSE platform.{{cite web|url=http://diginomica.com/2017/03/15/datastax-ceo-launches-new-cx-strategy-focusing-shifting-tech-business/|title=DataStax CEO launches new CX strategy – focus shifting from tech to business|date=15 March 2017|publisher=diginomica|accessdate=12 September 2017}}

In April 2018, DataStax released DSE 6, a version aimed at businesses using a hybrid cloud computing model. It could run as a distributed cloud database on any public cloud or on-premises, and was announced as offering twice the responsiveness and the ability to handle twice the throughput.{{cite news |last1=Sargent |first1=Jenna |title=DataStax Enterprise 6 released with double the Apache Cassandra performance |url=https://sdtimes.com/data/datastax-enterprise-6-released-with-double-the-apache-cassandra-performance/ |publisher=SD Times |date=19 April 2018}}{{cite news |last1=Whiting |first1=Rick |title=DataStax Pushes The Cloud Database Performance Boundary With New Release |url=https://www.crn.com/news/applications-os/300102226/datastax-pushes-the-cloud-database-performance-boundary-with-new-release.htm?itc=refresh |publisher=crn.com |date=17 April 2018}}

In December 2018, DataStax released DSE 6.7, which offered enterprise customers five feature upgrades: improved analytics, geospatial search, improved data protection in the cloud, enhanced performance insights, and new developer integration tools with the Apache Kafka Connector and certified production Docker images.{{cite web|url=https://www.datastax.com/press-release/datastax-announces-datastax-enterprise-67/|title=DataStax announces the release of DSE 6.7 |publisher=datastax.com}}

In April 2020, DataStax released DSE 6.8, which added capabilities for bare-metal performance and support for more workloads, along with a Kubernetes operator for Cassandra.{{cite web |title=DataStax |url=https://www.crn.com/slide-shows/cloud/the-coolest-database-system-companies-of-the-2020-big-data-100/3 |website=crn.com |date=28 April 2020 |access-date=February 22, 2021}}

DSE 7.0 was introduced in August 2023. It offers enhancements in cloud-native operations and generative AI capabilities, and includes vector search.{{cite web |title=DataStax Announces Vector Search for DataStax Enterprise |url=https://www.datanami.com/this-just-in/datastax-announces-vector-search-for-datastax-enterprise/ |website=datanami.com |access-date=December 1, 2023}}

Funding and IPO

In September 2014, DataStax raised {{US$|long=no|106 million}} in a Series E funding round, raising the total investment in the company to {{US$|long=no|190 million}}. On June 15, 2022, the company announced it had raised an additional {{US$|long=no|115 million}}, at a {{US$|long=no|1.6 billion}} valuation.{{cite web |title=DataStax raises $115M to advance its data stack |url=https://www.techtarget.com/searchdatamanagement/news/252521567/DataStax-raises-115M-to-advance-its-data-stack |website=techtarget.com |access-date=August 8, 2022}}

In 2020, Mergermarket reported that DataStax was preparing for an initial public offering that could launch in 2021.{{cite web |title=Venture Capital-Backed Tech Firm Exits To Watch In 2021 |url=https://www.forbes.com/sites/mergermarket/2021/01/12/venture-capital-backed-tech-firm-exits-to-watch-in-2021/?sh=602d123d57cb |website=forbes.com |access-date=February 22, 2021}} However, in June 2022, DataStax CEO Chet Kapoor said that the company would not rush into an IPO.

References

{{Reflist}}