https://github.com/willie-conway/relational-database-administration-capstone-project
π§± Relational Database Administration Capstone Project focuses on design, secure, optimize, and automate OLTP & Data Warehouse systems using MySQL, PostgreSQL, Apache Airflow, and shell scripting. πΎππβοΈ
https://github.com/willie-conway/relational-database-administration-capstone-project
airflow backup data-pipelines data-warehousing database-admin database-security encryption etl mysql oltp optimization phpmyadmin phppgadmin postgresql restore shell-scripting sql
Last synced: about 2 months ago
JSON representation
π§± Relational Database Administration Capstone Project focuses on design, secure, optimize, and automate OLTP & Data Warehouse systems using MySQL, PostgreSQL, Apache Airflow, and shell scripting. πΎππβοΈ
- Host: GitHub
- URL: https://github.com/willie-conway/relational-database-administration-capstone-project
- Owner: Willie-Conway
- License: mit
- Created: 2025-06-23T16:08:18.000Z (11 months ago)
- Default Branch: main
- Last Pushed: 2025-06-24T01:42:02.000Z (11 months ago)
- Last Synced: 2025-07-01T07:59:56.388Z (11 months ago)
- Topics: airflow, backup, data-pipelines, data-warehousing, database-admin, database-security, encryption, etl, mysql, oltp, optimization, phpmyadmin, phppgadmin, postgresql, restore, shell-scripting, sql
- Language: Python
- Homepage:
- Size: 16.7 MB
- Stars: 1
- Watchers: 0
- Forks: 0
- Open Issues: 0
-
Metadata Files:
- Readme: README.md
- License: LICENSE
Awesome Lists containing this project
README
# π₯οΈ IBM Relational Database Administration Capstone Project
This capstone project highlights the hands-on application of core relational database administration skills by simulating a real-world enterprise environment.
I played the role of a **Database Administrator (DBA)** responsible for designing, managing, and securing both OLTP and OLAP systems using MySQL and PostgreSQL, with automation and performance tuning included.
---
## π Project Overview
As a final project in the [IBM Relational Database Administrator Professional Certificate](https://www.coursera.org/professional-certificates/ibm-database-administrator), this capstone involved:
- Designing transactional and analytical databases
- Implementing ETL workflows using Apache Airflow
- Performing backups and automating restore procedures
- Securing data with encryption and granular access control
- Optimizing SQL queries and database memory usage
---
## π§ What I Learned
β
OLTP and Data Warehouse schema design and deployment
β
Data encryption, column-level access control, and secure user management
β
Backup, restore, and cron job automation
β
Performance tuning with indexing and query optimization
β
End-to-end ETL pipeline development using Apache Airflow
---
## π§° Skills & Tools Used
- π¬ MySQL & π PostgreSQL
- βοΈ Apache Airflow
- π SQL Grants & Data Encryption
- πΎ Backup & Restore Scripting (cron, `.sh`)
- π OLTP & OLAP Systems
- π§± Schema Design & Indexing
- π§ Linux Terminal, Shell Scripting
---
## π¦ Capstone Modules Breakdown
| Module | Description |
|--------|-------------|
| π§ **1. OLTP Database & Platform Architecture** | Created MySQL-based OLTP schema with indexes and constraints |
| ποΈ **2. Data Warehouse Design** | Built dimensional models using PostgreSQL with star schema |
| π **3. ETL & Pipelines** | Developed Apache Airflow DAGs to automate data ingestion and transformation |
| πΎ **4. Backup & Restore Automation** | Scripted full backup and recovery pipelines using cron and shell |
| π **5. Security & Optimization** | Encrypted sensitive data, configured user access roles, and optimized queries |
| π€ **6. Final Submission** | Submitted full database system with automation and documentation |
---
## π Project Assets
```
π OLTP Schema & Sales Database (MySQL)
π PostgreSQL Data Warehouse
π Apache Airflow DAGs & ETL Scripts
π Backup/Restore Scripts (cron, bash)
π Security Scripts (GRANT, ENCRYPTION)
π Optimization SQL (Indexes, EXPLAIN)
```
---
## π§ͺ Lab Highlights
### π Access Management & Data Encryption
- Defined roles: `db_admin`, `db_analyst`, `db_reporter`, `db_external`
- Encrypted sales amount fields using `VARBINARY` and passphrases
- Column- and table-level GRANT statements
### β»οΈ Backup & Restore Automation
- Created cron jobs to back up the sales database every 3 minutes
- Wrote scripts to truncate tables, zip backups, and restore via terminal
- Maintained logs and `.tar.gz` backup archives
### π ETL with Apache Airflow
- Parsed web logs and loaded structured data to databases
- Implemented DAGs for extract β transform β load
- Used custom Python scripts for ingestion and transformation
### π§± Data Warehouse & Reporting
- Created dimensional tables: `DimDate`, `DimCategory`, `DimCountry`, `FactSales`
- Performed OLAP queries using `ROLLUP`, `CUBE`, and `GROUPING SETS`
- Generated Materialized Query Tables (MQTs)
### βοΈ Database Optimization
- Created indexes and tested pre/post-query performance
- Analyzed memory usage using MySQL EXPLAIN
- Improved query execution time by optimizing data types and schema
---
## π§ Summary
This project brought together all major elements of relational database administration, allowing me to:
- Simulate enterprise-grade database systems
- Automate routine tasks using Linux tools
- Secure and manage data access at scale
- Apply performance tuning methods in real-world contexts
---
## π Outcome
β
**Confident in managing RDBMS systems (MySQL, PostgreSQL)**
β
**Experienced with automation, security, and optimization workflows**
β
**Ready for DBA or backend-focused roles involving data infrastructure**
---
## πΌ Ideal For
- Recruiters seeking skilled DBAs or backend engineers
- Employers looking for automated, secure data systems
- Projects requiring optimized RDBMS solutions and ETL integration
---
## π Related Projects
- [ETL Automation using Apache Airflow](#)
- [Data Warehouse with PostgreSQL](#)
- [SQL Query Optimization & Indexing](#)
---
## π€ Letβs Connect
Explore more of my work and collaborations:
π [My Portfolio](#) | πΌ [LinkedIn](#) | π [GitHub Projects](#)