Simple ETL Pipeline Using Snowflake, AWS, and PySpark
ETL処理がシンプルになる!AWS Glue 3.0で使えるようになったPySparkの関数紹介 - KAKEHASHI Tech Blog
Harmonize, Query, and Visualize Data from Various Providers using AWS Glue, Amazon Athena, and Amazon QuickSight | AWS Big Data Blog
AWS Cloud Data Engineering End-to-End Project — AWS Glue ETL Job, S3, Apache Spark | by Dogukan Ulu | Medium
AWS Dojo - Free Workshops, Exercises and Tutorials for Amazon Web Services
Crafting serverless streaming ETL jobs with AWS Glue | AWS Big Data Blog
AWS Glue PySpark to SQLite - BladeBridge
AWS Glue Job with PySpark. — How to create a custom glue job and… | by Yuvaraj Ravikumar | Medium
COVID-19 data pipeline on AWS feat. Glue/PySpark, Docker, Great Expectations, Airflow, and Redshift, templated in CF/CDK, deployable via Github Actions : r/dataengineering
Run unit tests for Python ETL jobs in AWS Glue using the pytest framework - AWS Prescriptive Guidance
Building AWS Glue Job using PySpark - YouTube
Data preprocessing for machine learning on Amazon EMR made easy with AWS Glue DataBrew | AWS Big Data Blog
Crafting Serverless ETL Pipeline Using AWS Glue and PySpark -
AWS Glue PySpark: Essential Tools for AWS Pipelines
GitHub - aws-samples/aws-glue-test-data-generator: AWS Glue Configurable Test Data Generator for S3 Data Lakes and DynamoDB
AWS Dojo - Workshop - Building AWS Glue Job using PySpark - Part:1(of 2)
How to extract, transform, and load data for analytic processing using AWS Glue (Part 2) | AWS Database Blog
Process data with varying data ingestion frequencies using AWS Glue job bookmarks | AWS Big Data Blog
Optimizing Spark applications with workload partitioning in AWS Glue | AWS Big Data Blog
amazon web services - PySpark Project Deployment on AWS Glue with Package dependency and External Library - Stack Overflow
Building Python modules from a wheel for Spark ETL workloads using AWS Glue 2.0 | AWS Big Data Blog
Monitoring jobs using the Apache Spark web UI - AWS Glue