Google Dataset Search
{{short description|Search engine for datasets from Google}}
Google Dataset Search is a search engine from Google that helps researchers locate online data that is freely available for use.{{Cite journal|last=Castelvecchi|first=Davide|date=2018-09-05|title=Google unveils search engine for open data|journal=Nature|volume=561|issue=7722|pages=161–162|language=EN|doi=10.1038/d41586-018-06201-x|pmid=30206390|bibcode=2018Natur.561..161C|s2cid=52190512|issn=0028-0836}} The company launched the service on September 5, 2018, and stated that the product was targeted at scientists and data journalists. The service left beta on January 23, 2020.{{cite web |last1=Noy |first1=Natasha |title=Discovering millions of datasets on the web |url=https://www.blog.google/products/search/discovering-millions-datasets-web/ |website=The Keyword |date=23 January 2020 |access-date=18 June 2020}}
Google Dataset Search complements Google Scholar, the company's search engine for academic studies and reports.{{Cite news|url=https://www.theverge.com/2018/9/5/17822562/google-dataset-search-service-scholar-scientific-journal-open-data-access|title=Google launches new search engine to help scientists find the datasets they need|work=The Verge|access-date=2018-09-07}}
Features
Dataset Search can filter results based on the desired type of data (for example, focusing on images or text). It is also available on mobile devices.{{cite web |last1=Noy |first1=Natasha |title=Discovering millions of datasets on the web |url=https://www.blog.google/products/search/discovering-millions-datasets-web/ |website=The Keyword |date=23 January 2020 |access-date=18 June 2020}}
Technology
Dataset Search relies heavily on dataset providers describing their data with metadata that follows the standards defined by the schema.org consortium.{{cite web |last1=Google |first1=Vincent |title=FAQ - Structured data markup for datasets |url=https://support.google.com/webmasters/thread/1960710 |website=Search Console Help |publisher=Google Inc. |access-date=20 June 2020}} According to the Google AI blog:
{{Blockquote|text=When Google's search engine processes a Web page with schema.org/Dataset mark-up, it understands that there is dataset metadata there and processes that structured metadata to create "records" describing each annotated dataset on a page. The use of schema.org allows developers to embed this structured information into HTML, without affecting the appearance of the page while making the semantics of the information visible to all search engines.{{cite web |last1=Burgess |first1=Matthew |last2=Noy |first2=Natasha |title=Building Google Dataset Search and Fostering an Open Data Ecosystem |url=https://ai.googleblog.com/2018/09/building-google-dataset-search-and.html |website=Google AI blog |access-date=20 June 2020}}}}
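As an illustration of the kind of markup described above, the following Python sketch builds a schema.org/Dataset record and wraps it in a script element using JSON-LD, one of the encodings schema.org supports for embedding metadata in HTML. The helper name <code>dataset_jsonld</code>, the dataset title, description, and URL are all hypothetical and chosen only for the example; the sketch shows a handful of common properties and is not a statement of what Dataset Search itself requires.

```python
import json


def dataset_jsonld(name: str, description: str, url: str) -> str:
    """Return a <script> element carrying schema.org/Dataset metadata as JSON-LD."""
    record = {
        "@context": "https://schema.org/",
        "@type": "Dataset",
        "name": name,
        "description": description,
        "url": url,
    }
    # Escape "</" so that a value containing "</script>" cannot end the element early;
    # "\/" is a valid JSON escape for "/".
    body = json.dumps(record, indent=2).replace("</", "<\\/")
    return f'<script type="application/ld+json">\n{body}\n</script>'


# Hypothetical dataset, used only for illustration.
snippet = dataset_jsonld(
    name="Example rainfall measurements",
    description="Daily rainfall totals from a hypothetical weather station.",
    url="https://example.org/datasets/rainfall",
)
print(snippet)
```

Because the metadata sits inside a script element, placing the returned snippet in a page's HTML leaves the visible content unchanged, which matches the behaviour described in the quotation.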
Versions
Dataset Search was initially released in beta on September 5, 2018.{{cite web |last1=Noy |first1=Natasha |title=Making it easier to discover datasets |url=https://www.blog.google/products/search/making-it-easier-discover-datasets/ |website=The Keyword |date=5 September 2018 |access-date=27 June 2020}} It moved out of beta on January 23, 2020.{{cite web |last1=Noy |first1=Natasha |title=Discovering millions of datasets on the web |url=https://blog.google/products/search/discovering-millions-datasets-web/ |website=The Keyword |date=23 January 2020 |access-date=27 June 2020}}
References
{{Reflist}}
External links
- {{Official website|https://datasetsearch.research.google.com}}
{{Google LLC}}