https://github.com/willie-conway/relational-database-administration-capstone-project
π§± Relational Database Administration Capstone Project focuses on design, secure, optimize, and automate OLTP & Data Warehouse systems using MySQL, PostgreSQL, Apache Airflow, and shell scripting. πΎππβοΈ
https://github.com/willie-conway/relational-database-administration-capstone-project
airflow backup data-pipelines data-warehousing database-admin database-security encryption etl mysql oltp optimization phpmyadmin phppgadmin postgresql restore shell-scripting sql
Last synced: 3 months ago
JSON representation
π§± Relational Database Administration Capstone Project focuses on design, secure, optimize, and automate OLTP & Data Warehouse systems using MySQL, PostgreSQL, Apache Airflow, and shell scripting. πΎππβοΈ
- Host: GitHub
- URL: https://github.com/willie-conway/relational-database-administration-capstone-project
- Owner: Willie-Conway
- License: mit
- Created: 2025-06-23T16:08:18.000Z (4 months ago)
- Default Branch: main
- Last Pushed: 2025-06-23T16:47:41.000Z (4 months ago)
- Last Synced: 2025-06-23T17:44:59.773Z (4 months ago)
- Topics: airflow, backup, data-pipelines, data-warehousing, database-admin, database-security, encryption, etl, mysql, oltp, optimization, phpmyadmin, phppgadmin, postgresql, restore, shell-scripting, sql
- Language: Python
- Homepage:
- Size: 16.3 MB
- Stars: 0
- Watchers: 0
- Forks: 0
- Open Issues: 0
-
Metadata Files:
- Readme: README.md
- License: LICENSE
Awesome Lists containing this project
README
# π₯οΈ IBM Relational Database Administration Capstone Project
![]()
![]()
This capstone project highlights the hands-on application of core relational database administration skills by simulating a real-world enterprise environment.
I played the role of a **Database Administrator (DBA)** responsible for designing, managing, and securing both OLTP and OLAP systems using MySQL and PostgreSQL, with automation and performance tuning included.---
## π Project Overview
As a final project in the [IBM Relational Database Administrator Professional Certificate](https://www.coursera.org/professional-certificates/ibm-database-administrator), this capstone involved:
- Designing transactional and analytical databases
- Implementing ETL workflows using Apache Airflow
- Performing backups and automating restore procedures
- Securing data with encryption and granular access control
- Optimizing SQL queries and database memory usage---
## π§ What I Learned
β OLTP and Data Warehouse schema design and deployment
β Data encryption, column-level access control, and secure user management
β Backup, restore, and cron job automation
β Performance tuning with indexing and query optimization
β End-to-end ETL pipeline development using Apache Airflow---
## π§° Skills & Tools Used
- π¬ MySQL & π PostgreSQL
- βοΈ Apache Airflow
- π SQL Grants & Data Encryption
- πΎ Backup & Restore Scripting (cron, `.sh`)
- π OLTP & OLAP Systems
- π§± Schema Design & Indexing
- π§ Linux Terminal, Shell Scripting---
## π¦ Capstone Modules Breakdown
| Module | Description |
|--------|-------------|
| π§ **1. OLTP Database & Platform Architecture** | Created MySQL-based OLTP schema with indexes and constraints |
| ποΈ **2. Data Warehouse Design** | Built dimensional models using PostgreSQL with star schema |
| π **3. ETL & Pipelines** | Developed Apache Airflow DAGs to automate data ingestion and transformation |
| πΎ **4. Backup & Restore Automation** | Scripted full backup and recovery pipelines using cron and shell |
| π **5. Security & Optimization** | Encrypted sensitive data, configured user access roles, and optimized queries |
| π€ **6. Final Submission** | Submitted full database system with automation and documentation |---
## π Project Assets
```
π OLTP Schema & Sales Database (MySQL)
π PostgreSQL Data Warehouse
π Apache Airflow DAGs & ETL Scripts
π Backup/Restore Scripts (cron, bash)
π Security Scripts (GRANT, ENCRYPTION)
π Optimization SQL (Indexes, EXPLAIN)```
---
## π§ͺ Lab Highlights
### π Access Management & Data Encryption
- Defined roles: `db_admin`, `db_analyst`, `db_reporter`, `db_external`
- Encrypted sales amount fields using `VARBINARY` and passphrases
- Column- and table-level GRANT statements### β»οΈ Backup & Restore Automation
- Created cron jobs to back up the sales database every 3 minutes
- Wrote scripts to truncate tables, zip backups, and restore via terminal
- Maintained logs and `.tar.gz` backup archives### π ETL with Apache Airflow
- Parsed web logs and loaded structured data to databases
- Implemented DAGs for extract β transform β load
- Used custom Python scripts for ingestion and transformation### π§± Data Warehouse & Reporting
- Created dimensional tables: `DimDate`, `DimCategory`, `DimCountry`, `FactSales`
- Performed OLAP queries using `ROLLUP`, `CUBE`, and `GROUPING SETS`
- Generated Materialized Query Tables (MQTs)### βοΈ Database Optimization
- Created indexes and tested pre/post-query performance
- Analyzed memory usage using MySQL EXPLAIN
- Improved query execution time by optimizing data types and schema---
## π§ Summary
This project brought together all major elements of relational database administration, allowing me to:
- Simulate enterprise-grade database systems
- Automate routine tasks using Linux tools
- Secure and manage data access at scale
- Apply performance tuning methods in real-world contexts---
## π Outcome
β **Confident in managing RDBMS systems (MySQL, PostgreSQL)**
β **Experienced with automation, security, and optimization workflows**
β **Ready for DBA or backend-focused roles involving data infrastructure**---
## πΌ Ideal For
- Recruiters seeking skilled DBAs or backend engineers
- Employers looking for automated, secure data systems
- Projects requiring optimized RDBMS solutions and ETL integration---
## π Related Projects
- [ETL Automation using Apache Airflow](#)
- [Data Warehouse with PostgreSQL](#)
- [SQL Query Optimization & Indexing](#)---
## π€ Letβs Connect
Explore more of my work and collaborations:
π [My Portfolio](#) | πΌ [LinkedIn](#) | π [GitHub Projects](#)