  1. Getting started with the AWS Glue Data Catalog

    The AWS Glue Data Catalog is your persistent technical metadata store. It is a managed service that you can use to store, annotate, and share metadata in the AWS Cloud.

  2. Getting started with the Amazon Glue Data Catalog

    The Amazon Glue Data Catalog is your persistent technical metadata store. It is a managed service that you can use to store, annotate, and share metadata in the Amazon Cloud.

  3. A Guide to AWS Glue: Data Catalog, Databases, Crawler

    Oct 31, 2023 · In this guide, we will explore various aspects of AWS Glue, including the AWS Glue Data Catalog, databases, tables, partitions, crawlers, connections, jobs, triggers, and …

  4. Data discovery and cataloging in AWS Glue

    The AWS Glue Data Catalog is a centralized repository that stores metadata about your organization's data sets. It acts as an index to the location, schema, and runtime metrics of …

  5. Query External Data with AWS Glue Data Catalog

    Oct 14, 2025 · Amazon AWS Glue Data Catalog is a centralized metadata management service that helps data professionals discover data and supports data governance in AWS cloud.

  6. AWS Glue Data Catalog - AWS Prescriptive Guidance

    The AWS Glue Data Catalog is a centralized metadata repository for all your data assets across various data sources. It provides a unified interface to store and query information about data …

  7. AWS Glue Data Catalog Views: Empowering Data Integration

    Mar 15, 2025 · This guide will delve deep into the implications, features, and benefits of this new functionality, providing a comprehensive understanding of how AWS Glue Data Catalog views …

  8. Decoding AWS Glue: Managing Data Catalogs and Querying

    Jan 14, 2025 · This article explores how AWS Glue manages and stores metadata in the Data Catalog, providing seamless access to data residing in Amazon S3. It highlights the role of …

  9. AWS Glue Data Catalog Explained: An In-Depth Guide - CastorDoc

    Mar 6, 2025 · It is designed to simplify the process of data discovery, conversion, and job scheduling for big data applications. This guide will provide an in-depth understanding of the …

  10. Managing the Data Catalog - AWS Glue

    The AWS Glue Data Catalog is a central metadata repository that stores structural and operational metadata for your Amazon S3 data sets. Managing the Data Catalog effectively is …

  11. CDC Data Pipeline on AWS: S3, Glue, and Redshift Integration …

    4 days ago · AWS Glue processes these changes, applying transformations and maintaining a catalog of the data structure. Finally, Redshift serves as the analytics layer where business …

  12. Serverless Data Integration – AWS Glue – Amazon Web Services

    You can discover and connect to more than 100 diverse data sources, manage your data in a centralized data catalog, and visually create, run, and monitor data pipelines to load data into …

  13. AWS Glue Workflow Management for Streamlined ETL Processes

    6 days ago · Learn how to optimize your ETL processes with AWS Glue Workflow Management. Enhance your data integration and streamline operations for better performance.

  14. Bringing your data into the AWS Glue Data Catalog - AWS Lake …

    Learn to create federated catalogs and databases in the AWS Glue Data Catalog, and manage metadata for data in Amazon S3 data lakes and Amazon Redshift data warehouses without …

  15. Catalogs API - AWS Glue

    Turns on or off data lake access for Apache Spark applications that access Amazon Redshift databases in the Data Catalog from any non-Redshift engine, such as Amazon Athena, …

  16. Using AWS Glue Data Catalog views with Apache Spark in EMR …

    Jun 5, 2025 · This demonstration showcases the versatility and cross-account capabilities of Data Catalog views and access through various AWS analytics services.

  17. Introducing AWS Glue Data Catalog usage metrics for API usage

    Jun 26, 2025 · AWS Glue Data Catalog is a centralized repository that stores metadata about your organization’s datasets. With its unified interface that acts as an index, you can store and …

  18. Create an AWS Glue Data Catalog with AWS DMS

    Nov 17, 2023 · In this post, we show you how to automatically create an AWS Glue Data Catalog of desired tables, including ones without data, from a relational database using AWS DMS and …

  19. Serverless Data Integration – AWS Glue Features – AWS

    The AWS Glue Data Catalog is your persistent metadata store for all your data assets, regardless of where they are located. The Data Catalog contains table definitions, job definitions, …

  20. Working with AWS Glue Data Catalog views in AWS Glue

    You can create and manage views in the AWS Glue Data Catalog, commonly known as AWS Glue Data Catalog views. These views are useful because they support multiple SQL query …

  21. Serverless Data Integration – AWS Glue FAQs – AWS

    Find answers to frequently asked questions about AWS Glue, a serverless ETL service that crawls your data, builds a data catalog, and performs data cleansing, data transformation, and …

  22. Using files in Amazon S3 for the data source - AWS Glue

    Recursive: Choose this option if you want AWS Glue to read data from files in child folders at the S3 location. If the child folders contain partitioned data, AWS Glue doesn't add any partition …

  23. Simplify data discovery for business users by adding data

    Aug 23, 2021 · In this post, we discuss how to use AWS Glue Data Catalog to simplify the process for adding data descriptions and allow data analysts to access, search, and discover …

  24. Visualize data lineage using Amazon SageMaker Catalog for …

    Oct 13, 2025 · The generation of data lineage in SageMaker Catalog operates through an automated system that captures metadata and relationships between different data artifacts …