Sqoop
{{Infobox software
| name = Apache Sqoop
| logo = Apache Sqoop logo.svg
| released = {{Start date and age|2009|06|01|df=yes}}
| screenshot =
| caption =
| developer = Apache Software Foundation
| discontinued = yes
| latest release version = 1.4.7
| latest release date = {{Start date and age|2017|12|06}}
| latest preview version =
| latest preview date =
| operating system = Cross-platform
| repo = {{URL|https://gitbox.apache.org/repos/asf?p{{=}}sqoop.git|Sqoop Repository}}
| programming language = Java
| genre = Data management
| license = Apache License 2.0
| website = {{URL|https://sqoop.apache.org}}
}}
Sqoop is a command-line interface application for transferring data between relational databases and Hadoop.{{cite web |url=https://sqoop.apache.org|title=Hadoop: Apache Sqoop|access-date=Sep 8, 2012}}
The Apache Sqoop project was retired in June 2021 and moved to the Apache Attic.{{Cite web|title=moving Sqoop to the Attic|url=http://mail-archives.apache.org/mod_mbox/sqoop-user/202106.mbox/browser|access-date=2021-06-27|website=mail-archives.apache.org}}
Description
Sqoop supports incremental loads of a single table or a free form SQL query as well as saved jobs which can be run multiple times to import updates made to a database since the last import. Imports can also be used to populate tables in Hive or HBase.{{cite web |url=https://blogs.apache.org/sqoop/entry/apache_sqoop_overview|title=Apache Sqoop - Overview|access-date=Sep 8, 2012}} Exports can be used to put data from Hadoop into a relational database. Sqoop got the name from "SQL-to-Hadoop".{{cite web |url=https://blog.cloudera.com/blog/2009/06/introducing-sqoop/|title=Introducing Sqoop|access-date=Jan 1, 2019}}
Sqoop became a top-level Apache project in March 2012.{{cite web |url=https://blogs.apache.org/sqoop/entry/apache_sqoop_graduates_from_incubator|title=Apache Sqoop Graduates from Incubator|access-date=Sep 8, 2012}}
Informatica provides a Sqoop-based connector from version 10.1.
Pentaho provides open-source Sqoop based connector steps, Sqoop Import and Sqoop Export, in their ETL suite Pentaho Data Integration since version 4.5 of the software. Microsoft uses a Sqoop-based connector to help transfer data from Microsoft SQL Server databases to Hadoop.{{cite web |url=https://www.microsoft.com/en-us/download/details.aspx?id=27584|title=Microsoft SQL Server Connector for Apache Hadoop|website=Microsoft |access-date=Sep 8, 2012}}
Couchbase, Inc. also provides a Couchbase Server-Hadoop connector by means of Sqoop.{{cite web|url=http://www.couchbase.com/develop/connectors/hadoop|title=Couchbase Hadoop Connector|access-date=Sep 8, 2012|url-status=dead|archive-url=https://web.archive.org/web/20120825184036/http://www.couchbase.com/develop/connectors/hadoop|archive-date=2012-08-25}}
See also
References
{{Reflist|refs=
| url = http://www.dbta.com/Editorial/News-Flashes/Big-Data-Analytics-Vendor-Pentaho-Announces-Tighter-Integration-with-Cloudera-Extends-Visual-Interface-to-Include-Hadoop-Sqoop-and-Oozie-84025.aspx
| title = Big Data Analytics Vendor Pentaho Announces Tighter Integration with Cloudera; Extends Visual Interface to Include Hadoop Sqoop and Oozie
| publisher = Database Trends and Applications (dbta.com)
| date = 2012-07-27
| access-date = 2015-12-08
| archive-url = https://web.archive.org/web/20151208144234/http://www.dbta.com/Editorial/News-Flashes/Big-Data-Analytics-Vendor-Pentaho-Announces-Tighter-Integration-with-Cloudera-Extends-Visual-Interface-to-Include-Hadoop-Sqoop-and-Oozie-84025.aspx
| archive-date = 2015-12-08
| quote = Pentaho’s Business Analytics 4.5 is now certified on Cloudera’s latest releases, Cloudera Enterprise 4.0 and CDH4. Pentaho also announced that its visual design studio capabilities have been extended to the Sqoop and Oozie components of Hadoop.
}}
| url = http://wiki.pentaho.com/display/EAI/Sqoop+Export
| title = Sqoop Export
| publisher = Pentaho
| date = 2015-12-10
| access-date = 2015-12-10
| archive-url = https://web.archive.org/web/20151210171525/http://wiki.pentaho.com/display/EAI/Sqoop+Export
| archive-date = 2015-12-10
| quote = The Sqoop Export job allows you to export data from Hadoop into an RDBMS using Apache Sqoop.
}}
| url = http://wiki.pentaho.com/display/EAI/Sqoop+Import
| title = Sqoop Import
| publisher = Pentaho
| date = 2015-12-10
| access-date = 2015-12-10
| archive-url = https://web.archive.org/web/20151210170913/http://wiki.pentaho.com/display/EAI/Sqoop+Import
| archive-date = 2015-12-10
| quote = The Sqoop Import job allows you to import data from a relational database into the Hadoop Distributed File System (HDFS) using Apache Sqoop.
}}
}}
Bibliography
{{Refbegin}}
- {{Cite book |first1 = Tom
|last1 = White
|title = Hadoop: The Definitive Guide
|edition = 2nd
|chapter = Chapter 15: Sqoop
|year = 2010
|publisher = O'Reilly Media
|pages = [https://archive.org/details/hadoopdefinitive0000whit/page/477 477–495]
|isbn = 978-1-449-38973-4
|chapter-url = https://archive.org/details/hadoopdefinitive0000whit/page/477
}}
{{Refend}}
External links
- {{Official website|https://sqoop.apache.org}}
- [https://cwiki.apache.org/confluence/display/SQOOP/Home Sqoop Wiki]
- [https://web.archive.org/web/20140202154003/http://qnalist.com/q/sqoop-user Sqoop Users Mailing List Archives]
{{Apache Software Foundation}}