Module 1: Introduction
- Welcome to the Course!
- Prerequisites and Course Overview
- Exam Overview
- What is Data Analytics?
Module 2: Amazon Simple Storage Service
- Introduction to S3
- Getting Data Into S3 - Concepts, AWS Management Console, AWS CLI (Part 1)
- Getting Data Into S3 - Boto3 (Part 2)
- S3 Multipart Upload (Part 1)
- S3 Multipart Upload (Part 2)
- S3 Storage Classes
- S3 Lifecycle Policies
- S3 Security and Encryption
- Checkpoint
- Programmatically Utilizing Data From S3
Module 3: Databases In AWS
- Introduction to Databases In AWS
- Database Engine Types
- Relational Database Service (RDS)
- Neptune
- DocumentDB
- Serverless Options
- Checkpoint
- Programmatically Utilizing S3 Select
Module 4: Collecting Streaming Data
- Introduction to Collecting Streaming Data
- The Kinesis Family
- Kinesis Data Streams (Part 1)
- Kinesis Data Streams (Part 2)
- Kinesis Data Firehose (Part 1)
- Kinesis Data Firehose (Part 2)
- Kinesis Video Streams
- Kinesis Data Analytics (Part 1)
- Kinesis Data Analytics (Part 2)
- Amazon Managed Service for Kafka (MSK)
- Checkpoint
Module 5: Data Collection and Getting Data Into AWS
- Introduction to Data Collection and Getting Data Into AWS
- Direct Connect, Snowball, Snowball Edge, Snowmobile
- Database Migration Service
- Data Pipeline
- Lambda, API Gateway, and CloudFront (Part 1)
- Lambda, API Gateway, and CloudFront (Part 2)
- Comparing Our Options
- Checkpoint
Module 6: Amazon Elastic Map Reduce (EMR)
- Introduction to Amazon Elastic Map Reduce (EMR)
- Apache Hadoop and EMR Software Collection
- EMR Architecture
- EMR Operations - Transient vs Long-Running
- EMR Operations - Choosing an Instance Type
- EMR Operations - Choosing the Right Number of Instances
- EMR Operations - On-Demand and Spot Instances
- EMR Operations - Monitoring and Resizing Clusters
- EMR File Storage and Compression
- Checkpoint
- Data Analytics with Spark and EMR
Module 7: Using Redshift
- Introduction to Using Redshift
- Redshift Architecture
- Redshift in the AWS Service Ecosystem
- Redshift Use Cases
- Redshift Table Design
- Redshift Spectrum
- Querying Data from Multiple Redshift Spectrum Tables
Module 8: Redshift Maintenance and Operations
- Launching a Redshift Cluster
- Resizing a Redshift Cluster
- Utilizing Vacuum and Deep Copy
- Backup and Restore
- Monitoring
- Checkpoint
- Manually Migrating Data Between Redshift Clusters
Module 9: AWS Glue, Athena, and QuickSight
- Introduction to AWS Glue, Athena, and QuickSight
- Glue Data Catalog (Part 1)
- Glue Data Catalog (Part 2)
- Glue Jobs (Part 1)
- Glue Job Demo (Part 2)
- Glue Jobs (Part 3)
- Job Bookmarks
- Getting Started with Athena
- Athena Demo
- When To Use Athena
- QuickSight Visualizations and Dashboards
- QuickSight Security and Authentication
- Checkpoint
Module 10: Elasticsearch
- Introduction to Elasticsearch
- Using Elasticsearch
- Visualizing Elasticsearch Data
- Checkpoint
- Implementing an Elasticsearch Backed Search Microservice
Module 11: AWS Security Services
- Introduction to AWS Security Services
- IAM
- KMS
- Secrets Manager
- VPC Network Security Features
- Checkpoint
- Advanced S3 Security