Michael Stonebraker#Aurora and StreamBase
{{Short description|American computer scientist (born 1943)}}
{{for|the American football player|Mike Stonebreaker}}
{{Infobox scientist
| name = Michael Stonebraker
| image = Michael Stonebraker P1120062.jpg
| image_size =
| alt =
| caption = Michael Stonebraker giving the 2015 Turing lecture
| birth_date = {{Birth date and age|1943|10|11}}
| birth_place = Newburyport, Massachusetts, U.S.{{cite web | title = Michael Stonebraker - A.M. Turing Award Winner| url=https://amturing.acm.org/award_winners/stonebraker_1172121.cfm | access-date=2018-02-06}}
| death_date =
| death_place =
| fields = Computer science
| workplaces = University of California, Berkeley
University of Michigan
Massachusetts Institute of Technology
| education = Princeton University (BS)
University of Michigan (MS, PhD)
| doctoral_advisor = Arch Waugh Naylor
| thesis_title = The Reduction of Large Scale Markov Models for Random Chains
| thesis_url = https://www.proquest.com/docview/302585708/
| doctoral_students =
| notable_students = Joseph M. Hellerstein
Andy Pavlo
Clifford A. Lynch{{cite web |title=Ph.D. Dissertations {{!}} EECS at UC Berkeley |url=https://www2.eecs.berkeley.edu/Pubs/Dissertations/Faculty/stonebraker.html |website=www2.eecs.berkeley.edu}}
Margo Seltzer
Dale Skeen{{MathGenealogy |id=31091}}
Marti Hearst{{cite web |url=https://people.ischool.berkeley.edu/~hearst/papers/neat.pdf|title=Nice: or what it was like to be Mike's student}}
Leilani Battle{{cite thesis | last=Battle | first=Leilani Marie | title=Behavior-driven optimization techniques for scalable data exploration | publisher=Massachusetts Institute of Technology | date=2017 | hdl=1721.1/111853 | url=https://dspace.mit.edu/handle/1721.1/111853 | access-date=2023-12-27}}
| known_for = Ingres, Postgres, Vertica, Streambase, Illustra, VoltDB, SciDB
| influences =
| influenced =
| awards = IEEE John von Neumann Medal (2005)
ACM Turing Award (2014)
| website = {{URL|csail.mit.edu/user/1547}}
| footnotes =
| spouse = Beth
}}
Michael Ralph Stonebraker (born October 11, 1943{{cite journal
|title=Contributors
|journal=IEEE Transactions on Systems, Man, and Cybernetics
|issue=4
|pages=562–564
|date=Sep 1972
| volume= | doi=10.1109/TSMC.1972.4309174
}}) is an American computer scientist specializing in database systems. Through a series of academic prototypes and commercial startups, Stonebraker's research and products are central to many relational databases. He is also the founder of many database companies, including Ingres Corporation, Illustra, Paradigm4, StreamBase Systems, Tamr, Vertica, VoltDB and Hopara, and served as chief technical officer of Informix. For his contributions to database research, Stonebraker received the 2014 Turing Award, often described as "the Nobel Prize for computing."
Stonebraker's career can be broadly divided into two phases: his time at University of California, Berkeley when he focused on relational database management systems such as Ingres and Postgres, and, starting in 2001, at Massachusetts Institute of Technology (MIT) where he developed more novel data management techniques such as C-Store, H-Store, SciDB and DBOS.{{Cite web |title=Postgres pioneer Michael Stonebraker promises to upend the database once more |url=https://www.theregister.com/AMP/2023/12/26/michael_stonebraker_feature/ |access-date=2023-12-27 |website=www.theregister.com}} Stonebraker is currently a professor emeritus at UC Berkeley and an adjunct professor at MIT's Computer Science and Artificial Intelligence Laboratory.{{Cite web|url=https://www2.eecs.berkeley.edu/Faculty/Homepages/stonebraker.html|title=Michael Stonebraker |website=www2.eecs.berkeley.edu|access-date=2018-03-16}}{{Cite web|url=https://www.csail.mit.edu/person/michael-stonebraker|title=Michael Stonebraker {{!}} MIT CSAIL|website=www.csail.mit.edu|access-date=2018-03-16}} He is also known as an editor for the book Readings in Database Systems.
Life
Stonebraker grew up in Milton, New Hampshire.[http://archive.computerhistory.org/resources/access/text/2012/12/102635858-05-01-acc.pdf Oral History of Michael Stonebraker; 2012-08-23] Retrieved 2018-08-26. He earned his B.S.E. in electrical engineering from Princeton University in 1965, and his M.S. and Ph.D. from the University of Michigan in 1967 and 1971{{cite thesis |degree=PhD |first=Michael Ralph |last=Stonebraker |title=The Reduction of Large Scale Markov Models for Random Chains |institution=University of Michigan |date=1971 |url=http://dl.acm.org/citation.cfm?id=905670 |id={{ProQuest|302585708}} |oclc=634008426}} respectively. His awards include the IEEE John von Neumann Medal and the first SIGMOD Edgar F. Codd Innovations Award. In 1994 he was inducted as a Fellow of the Association for Computing Machinery.{{cite web |url=http://portal.acm.org/author_page.cfm?id=81337493529 |title=Michael Ralph Stonebraker - ACM author profile page |access-date=2011-07-27}} In 1997, he was elected a member of the National Academy of Engineering for the development and commercialization of relational and object-relational database systems. In March 2015 it was announced he won the 2014 ACM Turing Award.{{Cite web|url = http://newsoffice.mit.edu/2015/michael-stonebraker-wins-turing-award-0325|title = Michael Stonebraker wins $1 million Turing Award|date = March 25, 2015|access-date = March 25, 2015|website = MIT News|publisher = Massachusetts Institute of Technology|last = Conner-Simons|first = Adam}} In September 2015, he won the 2015 Commonwealth Award, chosen by council members of MassTLC.Geller, Jessica. [http://www.betaboston.com/news/2015/09/16/masstlc-awards-companies-executives-at-annual-gala/ "PTC Chief Heppelman named CEO of the year by Mass. tech council."] betaBoston. The Boston Globe. Sept. 16, 2015 {{Webarchive|url=https://web.archive.org/web/20160107103054/http://www.betaboston.com/news/2015/09/16/masstlc-awards-companies-executives-at-annual-gala/ |date=2016-01-07 }}
= The Berkeley years (1971–2000) =
Stonebraker joined University of California, Berkeley, as an assistant professor in 1971, and taught in the computer science department for twenty-nine years. It was there that he did his early pioneering work on relational databases.
== Ingres ==
In 1973, Stonebraker and his colleague Eugene Wong started researching relational database systems after reading a series of seminal papers published by Edgar F. Codd on the relational data model.{{Cite journal | last1 = Codd | first1 = E. F. | author-link1 = Edgar F. Codd | title = A relational model of data for large shared data banks | doi = 10.1145/362384.362685 | journal = Communications of the ACM | url = http://www.seas.upenn.edu/~zives/03f/cis550/codd.pdf | volume = 13 | issue = 6 | pages = 377–387 | year = 1970 | s2cid = 207549016 }}
Their project, known as Ingres (Interactive Graphics and Retrieval System),{{Cite journal | last1 = Stonebraker | first1 = M. | author-link1 = Michael Stonebraker| last2 = Held | first2 = G. | last3 = Wong | first3 = E. | last4 = Kreps | first4 = P. | title = The design and implementation of INGRES | doi = 10.1145/320473.320476 | journal = ACM Transactions on Database Systems | volume = 1 | issue = 3 | pages = 189 | year = 1976 | citeseerx = 10.1.1.109.957 | s2cid = 1514658 }} was one of the first systems (along with System R from IBM) to
demonstrate that it was possible to build a practical and efficient implementation of the relational model. A number of key ideas from INGRES are still widely used in relational systems, including the use of B-trees, primary-copy replication, the query rewrite approach to views and integrity constraints, and the idea of rules/triggers for integrity checking in an RDBMS. Additionally, much experimental work was done that provided insights into how to build a locking system that could provide satisfactory transaction performance.{{cite web
|url=http://redbook.cs.berkeley.edu/redbook3/lec2.html
|access-date=2009-11-24
|publisher=Joseph Hellerstein
|title=Relational Roots
|year=1998
}}
By the mid-1970s, Stonebraker's team produced, using a rotating team of student programmers, a usable relational database system. At the time Ingres was considered "low end" compared to IBM's System R, as it ran on Unix-based Digital Equipment Corporation machines as opposed to the "big iron" IBM mainframes.{{citation needed|date=December 2011}}
By the early 1980s, however, the performance and capabilities of these low-end machines were seriously threatening IBM's mainframe market, and with the threat came the ability of Ingres to become a viable, "real" product for a large number of applications. Ingres used a variation of the BSD license for a nominal fee, and soon a number of companies took advantage of this to create commercial versions of Ingres.{{citation needed|date=December 2011}}
These included Stonebraker, who with fellow Berkeley professors Larry Rowe and Eugene Wong helped found Relational Technology, Inc., later called Ingres Corporation. Subsequently, sold to Computer Associates, Ingres was re-established as an independent company in 2005, and later renamed Actian. Other startups based on Ingres include Sybase, founded by Robert Epstein, a student on the project, and Britton Lee, Inc. Sybase's code was later used as a basis for Microsoft SQL Server.{{cite web
|url=http://redbook.cs.berkeley.edu/redbook3/lec1.html
|access-date=2009-11-24
|publisher=Joseph Hellerstein
|title=Motivation & DBMS Architecture Overview
|year=1998
}}
== Postgres ==
After founding Relational Technology, Stonebraker and Rowe began a "post-Ingres" effort, to address the limitations of the relational model. The new project was named POSTGRES (POST inGRES),{{Cite journal | last1 = Stonebraker | first1 = M. | author-link1 = Michael Stonebraker| last2 = Rowe | first2 = L. A. | doi = 10.1145/16856.16888 | title = The design of POSTGRES | journal = ACM SIGMOD Record | volume = 15 | issue = 2 | pages = 340 | year = 1986 | doi-access = free }} and was designed to add support for complex data types to database systems and improve end-to-end performance of data-intensive applications. Postgres provided an object relational programming model in which fields could be complex datatypes, and where users could register new types as well as scalar and aggregate functions over those types. Postgres was extensible in a number of other ways, making it easy for programmers to modify or add to the optimizer, query language, runtime, and indexing frameworks. These features improved both database programmability and performance, and made it possible to push large portions of a number of applications inside the database, including geographic information systems and time series processing. This had the effect of substantially broadening the commercial database market.{{citation needed|date=December 2023}}
Postgres was also offered using a BSD-like license, and the code forms the basis of the free software, PostgreSQL. Stonebraker also led an effort to commercialize the code, creating Illustra which was purchased by Informix. PostgreSQL has been used as the basis for a number of other startup companies, including Aster Data Systems, EnterpriseDB, and Greenplum.{{citation needed|date=December 2023}}
Informix acquired Illustra in 1996, and Stonebraker became Informix's CTO, a position he held until September 2000. Informix integrated Illustra's O–R mapping and DataBlades into the 7.x OnLine product, resulting in Informix Universal Server (IUS), or more generally, Version 9.{{citation needed|date=December 2023}}
== Mariposa and Cohera ==
After the Postgres project, Stonebraker initiated the Mariposa{{Cite journal | last1 = Stonebraker | first1 = M. | author-link1 = Michael Stonebraker| last2 = Aoki | first2 = P. M. | last3 = Litwin | first3 = W. | last4 = Pfeffer | first4 = A. | last5 = Sah | first5 = A. | last6 = Sidell | first6 = J. | last7 = Staelin | first7 = C. | last8 = Yu | first8 = A. | doi = 10.1007/s007780050015 | title = Mariposa: A wide-area distributed database system | journal = The VLDB Journal the International Journal on Very Large Data Bases | volume = 5 | pages = 48–63 | year = 1996 | citeseerx = 10.1.1.68.5480 | s2cid = 5062284 }} project which became the basis of Cohera Corporation. Mariposa built a federated database over an economic model of resource trading, in which data distributed across multiple organizations could be integrated and queried from a single relational interface, governed by site-specific policies that would charge for data processing and storage. These economic policies allowed traditional ideas in query optimization to be carried out over competing sites, and also served as the basis for data storage, replication and movement within a federation.
Cohera's initial mission was to commercialize Mariposa, but eventually focused on a business-to-business catalog management application on the core federated data integration engine. Cohera's intellectual property was purchased by PeopleSoft in 2001,
and used as the basis of PeopleSoft's Enterprise Catalog Management. PeopleSoft was in turn purchased by Oracle Corporation in 2004.{{citation needed|date=December 2023}}
= The MIT years (2001–present) =
{{BLP unreferenced section|date=December 2023}}
Stonebraker became an adjunct professor at MIT in 2001, where he began another series of research projects and founded a number of companies.
== Aurora and StreamBase ==
In the Aurora Project, Stonebraker, along with colleagues from Brandeis University, Brown University, and MIT, focused on data management for streaming data, using a new data model and query language. Unlike relational systems, which "pull" data and process it a record at a time, in Aurora, data is "pushed", arriving asynchronously from external data sources (such as stock ticks, news feeds, or sensors.) The output is itself a stream of results (such as windowed averages) that are sent to users.{{Cite journal | last1 = Abadi | first1 = D. J. | last2 = Carney | first2 = D. | last3 = Etintemel| first3 = U. | last4 = Cherniack | first4 = M. | last5 = Convey | first5 = C. | last6 = Lee | first6 = S. | last7 = Stonebraker | first7 = M. | author-link7 =Michael Stonebraker| last8 = Tatbul | first8 = N. | last9 = Zdonik | first9 = S. | doi = 10.1007/s00778-003-0095-z | title = Aurora: A new model and architecture for data stream management | journal = The VLDB Journal the International Journal on Very Large Data Bases | volume = 12 | issue = 2 | pages = 120 | year = 2003 | citeseerx = 10.1.1.6.1187 | s2cid = 8101432 }}
Stonebraker co-founded StreamBase Systems in 2003 to commercialize the technology behind Aurora.
== C-Store and Vertica ==
In the C-Store project, started in 2005, Stonebraker, along with colleagues from Brandeis, Brown, MIT, and University of Massachusetts Boston, developed a parallel, shared-nothing column-oriented DBMS for data warehousing. By dividing and storing data in columns, C-Store is able to perform less I/O and get better compression ratios than conventional database systems that store data in rows.(Print edition title: Database Pioneer Rethinks How Data is Organized.{{cite magazine |magazine=InformationWeek |url=https://www.informationweek.com/database-pioneer-rethinks-the-best-way-to-organize-data/d/d-id/1064893 |title=Database Pioneer Rethinks The Best Way To Organize Data |date=February 21, 2008 |author=Charles Babcock}}
Stonebraker explained that it's because similar data items are side-by-side: Name,Name,Name,Name vs. Name,Address,Zip,Phone#. In 2005, Stonebraker co-founded Vertica to commercialize the technology behind C-Store.{{cite web
|title=The Vertica Analytic Database: C-Store 7 Years Later" (PDF)
|website=VLDB.org |date=August 28, 2012
|url=http://vldb.org/pvldb/vol5/p1790_andrewlamb_vldb2012.pdf}}
== Morpheus and Goby ==
In 2006, Stonebraker started the Morpheus project, along with researchers from the University of Florida. Morpheus is a data integration system which relies on a collection of "transforms" to mediate between data sources. Each transform provides a queryable interface to particular web site or service, and Morpheus makes it possible to search for and compose multiple transforms to provide a new service or a unified view of several services.
In 2009, Stonebraker co-founded Goby,{{Citation | url = http://www.goby.com/ | title = Goby}}. a local search company based on ideas from Morpheus, for people to explore new things to do in free time.
== H-Store and VoltDB ==
{{BLP unreferenced section|date=December 2023}}
In 2007, with researchers from Brown University, MIT, and Yale University, Stonebraker started the H-Store project. H-Store is a distributed main-memory online transaction processing (OLTP) system designed to provide very high throughput on transaction processing workloads.
In 2009, Stonebraker co-founded, and then served as an adviser to, VoltDB a commercial startup based on ideas from the H-Store project.
== SciDB ==
In 2008, along with David DeWitt and researchers from Brown, MIT, Portland State University, SLAC, the University of Washington, and the University of Wisconsin–Madison, Stonebraker started SciDB{{Cite book | last1 = Brown | first1 = P. G. | chapter = Overview of sciDB | doi = 10.1145/1807167.1807271 | title = Proceedings of the 2010 international conference on Management of data - SIGMOD '10 | pages = 963–968 | year = 2010 | isbn = 9781450300322 | s2cid = 14544985 }}{{Cite book | last1 = Stonebraker | first1 = M. | author-link1 = Michael Stonebraker| last2 = Brown | first2 = P. | last3 = Poliakov | first3 = A. | last4 = Raman | first4 = S. | chapter = The Architecture of SciDB | doi = 10.1007/978-3-642-22351-8_1 | title = Scientific and Statistical Database Management | series = Lecture Notes in Computer Science | volume = 6809 | pages = 1–16 | year = 2011 | isbn = 978-3-642-22350-1 }} an open-source DBMS specially designed for scientific research applications.{{cite news
| url=https://www.theregister.co.uk/2010/09/13/michael_stonebraker_interview/
| title=SciDB: Relational daddy answers Google, Hadoop, NoSQL
| work=The Register
| date=2010-09-13
| access-date=2012-01-11 }}
He founded Paradigm4 with Marilyn Matz, who became CEO. Paradigm4 developed SciDB, used mostly by life sciences and financial markets. Novartis, Foundation Medicine, and the National Institutes of Health are some of the company's clients.Alspach, Kyle. [http://bostinno.streetwise.co/2015/11/30/boston-startup-funding-golden-seeds-backs-paradigm4/ "New Money: MassChallenge Alum Gets Dorm Room Fund Investment; Drone Co. Raises Seed Round."] BostInno. Nov. 30, 2015 {{Webarchive|url=https://web.archive.org/web/20160207050543/http://bostinno.streetwise.co/2015/11/30/boston-startup-funding-golden-seeds-backs-paradigm4/ |date=2016-02-07 }}
== NoSQL ==
In 2010 and 2011, Stonebraker criticized the NoSQL movement.{{Cite journal | last1 = Stonebraker | first1 = M. | author-link1 = Michael Stonebraker| title = SQL databases v. NoSQL databases | doi = 10.1145/1721654.1721659 | journal = Communications of the ACM | volume = 53 | issue = 4 | pages = 10–11 | year = 2010 | s2cid = 13959501 }}{{Cite journal | last1 = Stonebraker | first1 = M. | author-link1 = Michael Stonebraker| title = Stonebraker on NoSQL and enterprises | doi = 10.1145/1978542.1978546 | journal = Communications of the ACM | volume = 54 | issue = 8 | pages = 10–11 | year = 2011 | s2cid = 36572502 }}{{Cite journal | last1 = Stonebraker | first1 = M. | author-link1 = Michael Stonebraker| last2 = Abadi | first2 = D. | last3 = Dewitt | first3 = D. J. | last4 = Madden | first4 = S. | last5 = Paulson | first5 = E. | last6 = Pavlo | first6 = A. | last7 = Rasin | first7 = A. | title = MapReduce and parallel DBMSs | doi = 10.1145/1629175.1629197 | journal = Communications of the ACM | volume = 53 | pages = 64–71 | year = 2010 | s2cid = 61484899 }}
Notable students
{{BLP unreferenced section|date=December 2023}}
Stonebraker trained more than 30 students, including:
- Daniel Abadi, co-founder of Hadapt (acquired by Teradata)
- Michael J. Carey, professor at UC Irvine
- Paula Hawthorn, co-founder of Britton Lee
- Marti Hearst, professor at UC Berkeley
- Joseph M. Hellerstein, professor at UC Berkeley
- Clifford A. Lynch, executive director of the Coalition for Networked Information
- Margo Seltzer, professor at the University of British Columbia, founder and former CTO of Sleepycat Software
- Dale Skeen, founder and CEO of Vitria
- Sunita Sarawagi, professor at IIT Bombay
Selected works
- {{cite book |last1=Joseph M. Hellerstein |last2=Michael Stonebraker |date=2015 |edition=5th |title=Readings in Database Systems |url=http://www.redbook.io/ |publisher=MIT Press |author1-link=Joseph M. Hellerstein }}
- {{cite journal |last1=Michael Stonebraker |last2=Randy Katz, David Patterson, John Ousterhout |date=1988 |title=THE DESIGN OF XPRS |url=http://db.cs.berkeley.edu/papers/ERL-M88-19.pdf |journal=VLDB |pages=318–330 |access-date=25 March 2015}}
References
{{Reflist}}
External links
{{Commons category|Michael Stonebraker}}
- {{Cite web
|url=http://s2k-ftp.cs.berkeley.edu/nasa_e2e/mike.html
|date=1995-12-23
|title=Michael Stonebraker | publisher = University of California | location = Berkeley, CA, USA
|access-date=2009-07-14
}}
- {{Cite web
|url=http://www.csail.mit.edu/user/1547
|title=Michael Stonebraker
|work=User Profiles
|publisher=MIT | location = MA, USA
|date=2009-07-09
|access-date=2009-07-14
}}
- {{Citation | chapter-url = http://www.dbms2.com/category/michael-stonebraker/ | chapter = Michael Stonebraker | title = DBMS2 | last = Monash | first = Curt}}, a series of recent interviews and comments about and by Stonebraker.
- {{Cite web
|url=http://publications.csail.mit.edu/abstracts/abstracts07/stonebraker-abstract1/stonebraker-abstract1.html
|title=Morpheus: A Data Integration Toolkit
|work=CSAIL Research Abstracts
|publisher=MIT
|location=MA, USA
|access-date=2009-11-22
|archive-url=https://web.archive.org/web/20100616104149/http://publications.csail.mit.edu/abstracts/abstracts07/stonebraker-abstract1/stonebraker-abstract1.html
|archive-date=2010-06-16
|url-status=dead
}}
- {{Citation | url = http://www.goby.com/ | title = Goby}}, a new search engine to find fun things to do in one's free time (co-founded by Stonebraker)
- {{Citation | url = https://www.networkworld.com/article/937146/database-pioneer-stonebraker-rocks-1m-nobel-prize-in-computing.html | title = Database pioneer Stonebraker rocks $1M "Nobel Prize in Computing"| date = 2015-03-25}}
{{Turing Award laureates}}
{{Authority control}}
{{DEFAULTSORT:Stonebraker, Michael}}
Category:American computer businesspeople
Category:American computer scientists
Category:Foreign members of the Russian Academy of Sciences
Category:People from Milton, New Hampshire
Category:People from Newburyport, Massachusetts
Category:Princeton University alumni
Category:Turing Award laureates
Category:UC Berkeley College of Engineering faculty
Category:University of Michigan alumni
Category:1994 fellows of the Association for Computing Machinery