| Day | Task | Start Date | Completion Date | Reference Material |
|---|---|---|---|---|
| 2 | Database Concepts - Lesson: Review fundamental database concepts such as Database (a structured collection of information) and Session (the working context of one client connection). - Learn about architecture: Relational Databases (RDBMS), including Primary Key (uniquely identifies a row) and Foreign Key (links a row to a row in another table). - Understand the technique: Normalization, which divides data into multiple tables (related by keys) to prevent data duplication. - Understand the technique: Performance Optimization: + Index: A data structure that speeds up retrieval (read) operations but increases the cost of writes. + Partition: Dividing a large table into smaller parts so that queries can scan only the relevant parts. + Execution Plan: The set of steps the database chooses for accessing data (e.g., whether or not to use an Index). - Understand the technique: Ensuring integrity and speed: + Database Log: Records all changes; important for recovery and replication. + Buffer: A temporary storage area in RAM that speeds up reads, because reading from RAM is faster than reading from disk. - Lesson: Database Classification: + RDBMS (ACID): Fixed structure (Schema), optimized for storage (Normalization), scales vertically (Vertical Scaling). + NoSQL (BASE): Flexible structure (Dynamic Schema), optimized for performance (Denormalization), scales horizontally (Horizontal Scaling). - Lesson: System Classification: + OLTP (Online Transaction Processing): Transaction processing systems (banking, ordering) that must handle read/write/update operations quickly and ensure integrity (e.g., by rolling back failed transactions). + OLAP (Online Analytical Processing): Data Warehouse systems that store historical data for complex analysis (reporting, finding trends). | 13/10/2025 | 13/10/2025 | Module 06 |
| 3 | Amazon RDS - Learn about the service: Amazon RDS (Relational Database Service), a fully managed relational database service that supports popular engines (MySQL, PostgreSQL, Oracle, etc.). - Lesson: The goal of RDS is to automate administrative tasks (updates, backups) so users can focus on the application. - Understand the technique: Automated Backups of the database and transaction logs, allowing Point-in-Time Recovery to any moment within the backup retention period (configurable up to 35 days). - Learn about architecture: Multi-AZ (High Availability) + Automatically creates a standby replica in another AZ. + Uses Synchronous Replication. + Supports Automatic Failover if the primary database fails. - Learn about architecture: Read Replicas (Read Performance Optimization) + Creates read-only copies to offload the primary database (e.g., for reporting tasks). + Uses Asynchronous Replication, which can cause “replication lag”. - Lesson: RDS is often used for OLTP applications, and network access to it is controlled by Security Groups. | 14/10/2025 | 14/10/2025 | Module 06 |
| 4 | Amazon Aurora - Learn about the service: Amazon Aurora, a database engine developed by AWS, compatible with MySQL/PostgreSQL and offered as part of the RDS service but with higher performance (AWS cites up to 5x the throughput of standard MySQL and up to 3x that of standard PostgreSQL). - Learn about architecture: The biggest difference in Aurora is the redesigned storage layer. - Learn about architecture: An Aurora “Cluster” consists of 1 Writer (write instance) and up to 15 Readers (read instances), all sharing a single storage volume (the Cluster Volume). - Understand the technique: Data on the Cluster Volume is stored as 6 copies across 3 AZs to ensure durability. - Lesson: Aurora’s outstanding advantage is very low replication lag (typically well under 100 ms, though not strictly zero) because the Readers read from the same volume as the Writer. - Understand the technique: Enterprise features such as Backtrack (rewind the database without restoring from a backup; available for Aurora MySQL) and Global Database (create read-only secondary clusters in other Regions). | 15/10/2025 | 15/10/2025 | Module 06 |
| 5 | Amazon Redshift - Learn about the service: Amazon Redshift, a petabyte-scale Data Warehouse service optimized for OLAP. - Learn about architecture: Massively Parallel Processing (MPP). + Leader Node: Receives, parses, and coordinates queries. + Compute Nodes: Store data and execute parts of the work in parallel. - Understand the technique: Columnar Storage. + Unlike OLTP databases (which store data by row), Redshift stores values from the same column together. + This technique is extremely efficient for analytical (OLAP) queries (e.g., calculating the average age only needs to read the Age column). - Understand the technique: Redshift Spectrum, which runs SQL queries directly on data in Amazon S3 without loading it into Redshift first. Amazon ElastiCache - Learn about the service: Amazon ElastiCache, a high-speed in-memory caching service. - Objective: Speed up applications and reduce the load on the primary database (such as RDS). - Learn about supported engines: Redis (supports many data types, often preferred) and Memcached. - Lesson: It is the user’s responsibility to write and manage the Caching Logic (the logic that decides what to cache and when) within their application. | 16/10/2025 | 16/10/2025 | Module 06 |
| 6 | Lab: 000005 - Getting Started with Amazon RDS 1. Create a database on Amazon RDS 2. Connect the application to the DB 3. Backup and Restore Lab: 000043 - Migrating Databases with DMS and SCT 1. Preparation steps 2. Oracle to Amazon Aurora (PostgreSQL) 2.1 Convert Schema 2.2 Migrate database [Supplemental Research] - Database Internals - A book for learning how databases work internally. [Supplemental Research] - The Data Warehouse Toolkit - A book for learning how to design a data warehouse and the techniques used to build one. | 17/10/2025 | 17/10/2025 | Module 06 |
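
The Day 2 concepts (Primary Key, Foreign Key, Index, and Execution Plan) can be tried out locally before touching RDS. The following is a minimal sketch, assuming Python's built-in `sqlite3` module as a stand-in for a managed engine; the `customers` and `orders` tables, the index name `idx_orders_customer`, and the helper `plan_details` are hypothetical names chosen for illustration, and the plan text produced by MySQL, PostgreSQL, or Oracle will look different.

```python
import sqlite3


def plan_details(conn, sql):
    """Return the 'detail' column of SQLite's EXPLAIN QUERY PLAN output."""
    return [row[3] for row in conn.execute("EXPLAIN QUERY PLAN " + sql)]


# In-memory database: an assumption for illustration, not an RDS connection.
conn = sqlite3.connect(":memory:")
conn.execute("PRAGMA foreign_keys = ON")  # SQLite enforces foreign keys only when enabled

# Primary Key: uniquely identifies each row.
conn.execute("CREATE TABLE customers (id INTEGER PRIMARY KEY, name TEXT NOT NULL)")

# Foreign Key: links each order to an existing customer.
conn.execute(
    """CREATE TABLE orders (
           id INTEGER PRIMARY KEY,
           customer_id INTEGER NOT NULL REFERENCES customers(id),
           amount REAL NOT NULL
       )"""
)

conn.executemany(
    "INSERT INTO customers (id, name) VALUES (?, ?)", [(1, "An"), (2, "Binh")]
)
conn.executemany(
    "INSERT INTO orders (customer_id, amount) VALUES (?, ?)",
    [(1, 10.0), (1, 25.5), (2, 7.0)],
)

query = "SELECT amount FROM orders WHERE customer_id = 1"

# Execution Plan without an index: the engine has to scan the whole table.
plan_before = plan_details(conn, query)

# Index: speeds up this lookup at the cost of extra work on every write.
conn.execute("CREATE INDEX idx_orders_customer ON orders (customer_id)")
plan_after = plan_details(conn, query)

# Integrity: an order pointing at a non-existent customer is rejected.
try:
    conn.execute("INSERT INTO orders (customer_id, amount) VALUES (99, 1.0)")
    fk_rejected = False
except sqlite3.IntegrityError:
    fk_rejected = True

print("plan before index:", plan_before)
print("plan after index: ", plan_after)
print("foreign key violation rejected:", fk_rejected)
```

Comparing the two printed plans shows the engine switching from a full table scan to an index search once the index exists, which is the decision the Execution Plan entry describes. The exact wording of the plan lines depends on the SQLite version.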