AWS Glue
{{Short description|Serverless ETL service from Amazon Web Services}}
{{Infobox software
| name = AWS Glue
| logo =
| developer = Amazon.com
| released = {{Start date and age|2017|8}} {{Cite web|url=https://aws.amazon.com/about-aws/whats-new/2017/08/introducing-aws-glue-a-simple-flexible-and-cost-effective-extract-transfer-and-load-etl-service/|title=Introducing AWS Glue: A Simple, Flexible, and Cost-Effective Extract, Transfer, and Load (ETL) Service}}
| operating system = Cross-platform
| language = English
}}
AWS Glue is a serverless data integration service for extract, transform, load (ETL) workloads, provided by Amazon as part of Amazon Web Services. It was introduced in August 2017.{{cite web |url=https://www.parkmycloud.com/aws-services-list/ |title=AWS Services List |website=ParkMyCloud |access-date=October 6, 2020}}
Overview
The primary purpose of Glue is to scan data held in other services{{cite web |url=https://digitalcloud.training/aws-glue/ |title=AWS Glue: crawlers and use cases |date=5 January 2022 |access-date=July 13, 2022}} that are reachable from the same Virtual Private Cloud (or an equivalent accessible network element, even one not provided by AWS), particularly S3.{{cn|date=November 2023}} Jobs are billed according to compute time, with a minimum billing duration of 1 minute.{{cite web |url=https://aws.amazon.com/about-aws/whats-new/2020/08/aws-glue-version-2-featuring-10x-faster-job-start-times-1-minute-minimum-billing-duration/ |title=AWS Glue version 2.0 featuring 10x faster job start times and 1-minute minimum billing duration |website=AWS |date=August 10, 2020 |access-date=October 6, 2020}} Glue discovers the source data and stores the associated metadata (e.g. a table's schema: field names, types, and lengths) in the AWS Glue Data Catalog, which is then accessible via the AWS console or APIs.{{cite web |url=https://docs.aws.amazon.com/glue/latest/webapi/WebAPI_Welcome.html |title=AWS Glue API Documentation |website=AWS |access-date=October 6, 2020}}
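The effect of the 1-minute minimum on a job's bill can be illustrated with a minimal sketch. It assumes that usage is metered per second and charged per data processing unit (DPU) hour. The function names and the rate passed in are hypothetical placeholders, not published AWS prices.

```python
import math


def billed_seconds(runtime_seconds: float, minimum_seconds: int = 60) -> int:
    """Return the billable duration of one job run.

    Assumption: runtime is rounded up to whole seconds, then the
    minimum billing duration (1 minute by default) is applied.
    """
    return max(minimum_seconds, math.ceil(runtime_seconds))


def estimate_cost(runtime_seconds: float, dpus: float, rate_per_dpu_hour: float) -> float:
    """Estimate the cost of one job run.

    `dpus` and `rate_per_dpu_hour` are illustrative inputs; the real
    rate depends on region and job type and is not given here.
    """
    return billed_seconds(runtime_seconds) * dpus * rate_per_dpu_hour / 3600


# A 20-second run is billed as a full minute; a 90.2-second run as 91 seconds.
short_run = billed_seconds(20)
longer_run = billed_seconds(90.2)
```

Under these assumptions, very short jobs all cost the same as a one-minute job, so batching many tiny runs into one longer run may reduce the total bill.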
Languages supported
Scala and Python are officially supported {{as of|2020|lc=1}}.{{cite web |url=https://aws.amazon.com/about-aws/whats-new/2018/01/aws-glue-now-supports-scala-in-addition-to-python/#:~:text=AWS%20Glue%20now%20supports%20the,jobs%20that%20can%20be%20scheduled |title=AWS Glue Now Supports Scala in Addition to Python |website=AWS |date=January 12, 2018 |access-date=October 6, 2020}}
Catalog interrogation via API
The catalog can be read in the AWS console (via a browser) or through the API, which is divided into topics including:{{cite web |url=https://docs.aws.amazon.com/glue/latest/dg/aws-glue-api-catalog.html |title=Catalog API |website=AWS |access-date=October 8, 2020}}
- Database API
- Table API
- Partition API
- Connection API
- User-Defined Function API
- Importing an Athena Catalog to AWS Glue
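As an illustration of the Table API, the following sketch reads table names and column schemas from the Data Catalog. It assumes a client object shaped like the one returned by `boto3.client("glue")`, using its `get_tables` and `get_table` operations. The helper names `list_table_names` and `table_columns` are hypothetical, and no database or table names from a real account are implied.

```python
def list_table_names(glue_client, database):
    """Collect the names of all tables in one catalog database.

    Follows the NextToken field so that results spread over several
    response pages are all included.
    """
    names = []
    kwargs = {"DatabaseName": database}
    while True:
        page = glue_client.get_tables(**kwargs)
        names.extend(table["Name"] for table in page.get("TableList", []))
        token = page.get("NextToken")
        if not token:
            return names
        kwargs["NextToken"] = token


def table_columns(glue_client, database, table):
    """Return (name, type) pairs for a table's columns and partition keys."""
    response = glue_client.get_table(DatabaseName=database, Name=table)
    definition = response["Table"]
    columns = definition.get("StorageDescriptor", {}).get("Columns", [])
    partition_keys = definition.get("PartitionKeys", [])
    return [(col["Name"], col["Type"]) for col in columns + partition_keys]
```

With credentials configured, a call might look like `table_columns(boto3.client("glue"), "my_database", "my_table")`, where both names are placeholders. Passing the client in as a parameter keeps the helpers easy to exercise with a stub object.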
See also
{{div col}}
- Event-driven architecture
- Serverless Framework
- Function as a service
- Google Cloud Functions
- Azure Function
- Oracle Cloud Platform
{{div col end}}
References
{{Reflist}}
External links
- {{Official website|https://aws.amazon.com/glue}}
{{Amazon}}