What is aws glue

Last updated: April 1, 2026

Quick Answer: AWS Glue is an Amazon Web Services managed extract, transform, and load (ETL) service that helps organizations prepare and combine data from various sources for analytics and machine learning applications.

Key Facts

AWS Glue is a fully managed, serverless ETL service eliminating the need to manage infrastructure
Automatically discovers, catalogs, and transforms data from diverse data sources using the AWS Glue Data Catalog
Supports data sources including Amazon S3, Redshift, RDS, and on-premises databases
Provides both visual and code-based interfaces for creating ETL jobs using Spark or Python scripts
Enables data preparation for analytics, data warehousing, and machine learning pipelines at scale

Overview of AWS Glue

AWS Glue is a fully managed extract, transform, and load (ETL) service provided by Amazon Web Services. It simplifies the process of preparing data for analysis by automatically discovering data sources, cataloging their structure, and transforming data for use in analytics, data warehousing, and machine learning applications. Organizations use AWS Glue to make data readily available for business intelligence and analytics.

Key Components

AWS Glue Data Catalog automatically discovers and catalogs metadata from various data sources, creating a unified view of an organization's data assets. AWS Glue ETL provides tools for building, testing, and running ETL jobs that transform data. AWS Glue DataBrew offers a visual data preparation interface for non-technical users. These components work together to streamline data workflows.

Data Sources and Integration

AWS Glue connects to multiple data sources including Amazon S3, Amazon Redshift, RDS, DynamoDB, and on-premises databases. It can also integrate with streaming data sources through AWS Kinesis. This broad compatibility makes AWS Glue suitable for consolidating data from heterogeneous environments. Organizations can bring data from different systems into a single, organized repository.

Job Creation and Execution

Users can create ETL jobs through multiple methods. The visual editor allows drag-and-drop job creation without code. For advanced users, AWS Glue supports Apache Spark and Python scripts for custom transformations. Jobs run on serverless infrastructure, automatically scaling based on workload requirements. This flexibility accommodates both simple data movements and complex transformation logic.

Benefits and Use Cases

AWS Glue reduces the time and effort required for data integration tasks. It eliminates infrastructure management overhead through its serverless architecture, allowing teams to focus on data transformation logic. Common use cases include preparing data for data lakes, feeding data warehouses, enabling machine learning pipelines, and facilitating data migration projects. The pay-as-you-go pricing model means organizations only pay for resources actually used.

More What Is in Daily Life

Also in Daily Life

More "What Is" Questions

What is ugc law What Is ELI5 : At the cellular level, what is different about animals that can regrow body parts and ones that can't What is qbr in football What is energy What is equity in finance What is the main function of the headrest in a car What Is eli5 what is a system in physics What is abitur in english

Trending on WhatAnswers

How Does GPS Work Why do i sleep so much Why does the plush and velvet material cause me so much discomfort to the point it feels painful and makes me nauseous difference between ai and ml How To Start a Business

Browse by Topic

Arts Business Daily Life Education Food Geography Health History Language Law Mathematics Nature Politics Psychology Science Space Sports Technology

Browse by Question Type

Can You Difference Between Does How Does How To Is It What Causes What Does What Is When Was Where Is Who Is Why Do Why Is

Sources

Wikipedia - Data TransformationCC-BY-SA-4.0
Wikipedia - Amazon Web ServicesCC-BY-SA-4.0