XLDB

XLDB (eXtremely Large DataBases) was a yearly conference about databases, data management and analytics held from 2007 to 2019. "Extremely large" refers to data sets that are too big in volume (too much data), velocity (arriving too fast), variety (too many places, too many formats), or any combination of these to be handled using conventional solutions. The conference dealt with the high end of very large databases (VLDB). It was conceived and chaired by Jacek Becla.

History

In October 2007, data experts gathered at SLAC National Accelerator Lab for the [https://web.archive.org/web/20080417002612/http://www-conf.slac.stanford.edu/xldb07/ First Workshop on Extremely Large Databases]. As a result, the XLDB research community was formed to meet the rapidly growing demands of the largest data systems. In addition to the original invitational workshop, an open conference, tutorials, and annual satellite events on different continents were added. The main event, held annually at Stanford University, gathered over 300 attendees. XLDB was one of the data systems events that catered to both academic and industry communities. In 2009, the workshop was co-located with VLDB 2009 in France to reach out to non-US research communities.{{Cite web|url=https://www.symmetrymagazine.org/breaking/2009/09/16/building-the-biggest-scientific-databases|title=Building the biggest scientific databases|website=symmetry magazine|language=en|access-date=2019-04-15}} XLDB 2019 followed Stanford's Conference on Systems and Machine Learning (SysML).{{Cite web|url=https://conf.slac.stanford.edu/xldb2019/|title=XLDB Extremely Large Databases 2019|website=XLDB Extremely Large Databases 2019|access-date=2019-04-15}}

Goals

The main goals of this community include:{{ cite web | url=http://www-conf.slac.stanford.edu/xldb09/docs/xldb09_welcomeTalk.ppt | year=2009 | last=Becla| first=Jacek | title=XLDB 3 Welcome | access-date=2009-08-29 }}

  • Identify trends, commonalities and major roadblocks related to building extremely large databases
  • Bridge the gap between users trying to build extremely large databases and database solution providers worldwide
  • Facilitate development and growth of practical technologies for extremely large data stores

XLDB Community

As of 2013, the community consisted of over one thousand members including:

  1. Scientists from laboratories who develop, use, or plan to develop or use extremely large databases for their research.
  2. Commercial users of extremely large databases.
  3. Providers of database products, including commercial vendors and representatives from open source database communities.
  4. Academic database researchers.

XLDB Conferences, Workshops and Tutorials

The community met annually at Stanford University through 2019. Occasional satellite events were held in Asia and Europe.

A detailed report or a set of videos was produced after each workshop.

{| class="wikitable"
! Year !! Place !! Link !! Report !! Comments
|-
| 2019 || Stanford || [https://conf.slac.stanford.edu/xldb2019/] || || 12th XLDB Conference
|-
| 2018 || Stanford || [https://conf.slac.stanford.edu/xldb2018/] || || 11th XLDB Conference
|-
| 2017 || Clermont-Ferrand || [http://xldb2017.uca.fr] || || 10th XLDB Conference
|-
| 2016 || Stanford || [https://web.archive.org/web/20150521105100/http://www-conf.slac.stanford.edu/xldb2015/] || || 9th XLDB Conference
|-
| 2015 || Stanford || [https://web.archive.org/web/20150521105100/http://www-conf.slac.stanford.edu/xldb2015/] || || 8th XLDB Conference
|-
| 2014 || [http://www.on.br/ Observatório Nacional], Rio de Janeiro || [https://web.archive.org/web/20150219081443/http://xldb-rio2014.linea.gov.br/] || || Satellite XLDB Workshop in South America
|-
| 2014 || Stony Brook University || [https://web.archive.org/web/20150521052839/http://www3.cs.stonybrook.edu/~xldb/] || || XLDB-Healthcare Workshop
|-
| 2013 || Stanford || [https://conf-slac.stanford.edu/xldb-2013/] || || 7th XLDB Conference
|-
| 2013 || CERN, Geneva, Switzerland || [https://archive.today/20130410033807/http://xldb-europe-workshop-2013.web.cern.ch/] || || Satellite XLDB Workshop in Europe
|-
| 2012 || Stanford || [http://www-conf.slac.stanford.edu/xldb2012/] || [http://www.jstage.jst.go.jp/article/dsj/12/0/12_12_023/_pdf] || 6th XLDB Conference, Workshop & Tutorials
|-
| 2012 || Beijing, China || [https://web.archive.org/web/20120708164351/http://idke.ruc.edu.cn/xldb/www.xldb-asia.org/home.html] || [http://www.xldb.org/wp-content/uploads/2012/09/XLDBAsia2012Report.pdf] || Satellite XLDB Conference in Asia
|-
| 2011 || SLAC || [https://web.archive.org/web/20110426125951/http://www-conf.slac.stanford.edu/xldb2011/] || [https://www.jstage.jst.go.jp/article/dsj/11/0/11_012-010/_pdf] || 5th XLDB Conference and Workshop
|-
| 2011 || Edinburgh, UK || [https://web.archive.org/web/20160303221547/http://xldb.eu/xldb_europe_2011/] || not available || Satellite XLDB Workshop in Europe
|-
| 2010 || SLAC || [https://web.archive.org/web/20110727234052/http://www-conf.slac.stanford.edu/xldb2010/] || [https://www.jstage.jst.go.jp/article/dsj/9/0/9_xldb10/_pdf] || 4th XLDB Conference and Workshop
|-
| 2009 || Lyon, France || [https://web.archive.org/web/20110727234623/http://www-conf.slac.stanford.edu/xldb2009/] || [https://www.jstage.jst.go.jp/article/dsj/8/0/8_xldb09/_pdf] || 3rd XLDB Workshop
|-
| 2008 || SLAC || [https://web.archive.org/web/20110727234818/http://www-conf.slac.stanford.edu/xldb2008/] || [https://www.jstage.jst.go.jp/article/dsj/7/0/7_7-196/_pdf] || 2nd XLDB Workshop
|-
| 2007 || SLAC || [https://web.archive.org/web/20110727235121/http://www-conf.slac.stanford.edu/xldb2007/] || [https://www.jstage.jst.go.jp/article/dsj/7/0/7_becla0223/_pdf] || 1st XLDB Workshop
|}

Tangible results

XLDB events led to an effort to build a new open-source science database called [https://web.archive.org/web/20090220121225/http://scidb.org/ SciDB].{{ cite web | url=http://www.jstage.jst.go.jp/article/dsj/7/0/88/_pdf | year=2008 | last=Becla| first=Jacek | title=Report from the SciDB Workshop |access-date=2008-09-29}}{{dead link|date=July 2016 |bot=InternetArchiveBot |fix-attempted=yes }}

The XLDB organizers started defining a [http://www.xldb.org/science-benchmark/ science benchmark] for scientific data management systems called SS-DB.

At [https://archive.today/20130416124054/http://xldb.org/2012 XLDB 2012], the organizers announced that two major database projects that support arrays as first-class objects (MonetDB SciQL and SciDB) had formed a working group in conjunction with XLDB. The working group's aim was to propose a common syntax (provisionally named “ArrayQL”) for manipulating arrays, including array creation and query.

References

{{reflist}}

Further reading

  • Pavlo A., Paulson E., Rasin A., Abadi D. J., DeWitt D. J., Madden S., and Stonebraker M., "A Comparison of Approaches to Large-Scale Data Analysis," Proceedings of the 2009 ACM SIGMOD, https://web.archive.org/web/20090611174944/http://database.cs.brown.edu/sigmod09/benchmarks-sigmod09.pdf
  • {{cite book |doi=10.1117/12.671721|chapter=Designing a multi-petabyte database for LSST|title=Observatory Operations: Strategies, Processes, and Systems|year=2006|editor1-last=Silva|editor1-first=David R|last1=Becla|first1=Jacek|last2=Hanushevsky|first2=Andrew|last3=Nikolaev|first3=Sergei|last4=Abdulla|first4=Ghaleb|last5=Szalay|first5=Alex|last6=Nieto-Santisteban|first6=Maria|last7=Thakar|first7=Ani|last8=Gray|first8=Jim|s2cid=3204824|volume=6270|pages=62700R|editor2-first=Rodger E|editor2-last=Doxsey |arxiv=cs/0604112 }}
  • Becla, J., & Wang, D. L. 2005, Lessons Learned from Managing a Petabyte, downloaded from https://web.archive.org/web/20110604223735/http://www.slac.stanford.edu/pubs/slacpubs/10750/slac-pub-10963.pdf on 2007-11-25.
  • {{cite journal |last1=Bell |first1=Gordon |last2=Gray |first2=Jim |last3=Szalay |first3=Alex |title=Petascale Computational Systems |year=2007 |bibcode=2007cs........1165B |arxiv=cs/0701165 }}
  • Duellmann, D. 1999, Petabyte Databases, ACM SIGMOD Record, vol. 28, p. 506, https://web.archive.org/web/20071012015357/http://www.sigmod.org/sigmod/record/issues/9906/index.html#TutorialSessions.
  • Hanushevsky, A., & Nowak, M. 1999, Pursuit of a Scalable High Performance Multi-Petabyte Database, 16th IEEE Symposium on Mass Storage Systems, pp. 169–175, http://citeseer.ist.psu.edu/217883.html.
  • Shiers, J., Building Very Large, Distributed Object Databases, downloaded from https://web.archive.org/web/20070915101842/http://wwwasd.web.cern.ch/wwwasd/cernlib/rd45/papers/dbprog.html on 2007-11-25.