{"id":31754628,"url":"https://github.com/caogiathinh/caogiathinh","last_synced_at":"2025-10-09T18:22:01.220Z","repository":{"id":310590786,"uuid":"1040455289","full_name":"caogiathinh/caogiathinh","owner":"caogiathinh","description":null,"archived":false,"fork":false,"pushed_at":"2025-10-07T01:26:22.000Z","size":47,"stargazers_count":5,"open_issues_count":0,"forks_count":0,"subscribers_count":0,"default_branch":"main","last_synced_at":"2025-10-07T03:26:04.520Z","etag":null,"topics":["airflow","database","dataengineer","dbt","dsa","linux","python","spark","sql"],"latest_commit_sha":null,"homepage":"","language":null,"has_issues":true,"has_wiki":null,"has_pages":null,"mirror_url":null,"source_name":null,"license":null,"status":null,"scm":"git","pull_requests_enabled":true,"icon_url":"https://github.com/caogiathinh.png","metadata":{"files":{"readme":"README.md","changelog":null,"contributing":null,"funding":null,"license":null,"code_of_conduct":null,"threat_model":null,"audit":null,"citation":null,"codeowners":null,"security":null,"support":null,"governance":null,"roadmap":null,"authors":null,"dei":null,"publiccode":null,"codemeta":null,"zenodo":null,"notice":null,"maintainers":null,"copyright":null,"agents":null,"dco":null,"cla":null}},"created_at":"2025-08-19T02:27:08.000Z","updated_at":"2025-10-01T08:41:55.000Z","dependencies_parsed_at":"2025-08-19T04:25:07.875Z","dependency_job_id":"545e1be2-3f10-42f0-b374-ec4f2ad2190e","html_url":"https://github.com/caogiathinh/caogiathinh","commit_stats":null,"previous_names":["caogiathinh/caogiathinh"],"tags_count":0,"template":false,"template_full_name":null,"purl":"pkg:github/caogiathinh/caogiathinh","repository_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/caogiathinh%2Fcaogiathinh","tags_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/caogiathinh%2Fcaogiathinh/tags","releases_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/caogiathinh%2Fcaogiathinh/releases","manifests_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/caogiathinh%2Fcaogiathinh/manifests","owner_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/owners/caogiathinh","download_url":"https://codeload.github.com/caogiathinh/caogiathinh/tar.gz/refs/heads/main","sbom_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/caogiathinh%2Fcaogiathinh/sbom","scorecard":null,"host":{"name":"GitHub","url":"https://github.com","kind":"github","repositories_count":279001943,"owners_count":26083226,"icon_url":"https://github.com/github.png","version":null,"created_at":"2022-05-30T11:31:42.601Z","updated_at":"2022-07-04T15:15:14.044Z","status":"online","status_checked_at":"2025-10-09T02:00:07.460Z","response_time":59,"last_error":null,"robots_txt_status":"success","robots_txt_updated_at":"2025-07-24T06:49:26.215Z","robots_txt_url":"https://github.com/robots.txt","online":true,"can_crawl_api":true,"host_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub","repositories_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories","repository_names_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repository_names","owners_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/owners"}},"keywords":["airflow","database","dataengineer","dbt","dsa","linux","python","spark","sql"],"created_at":"2025-10-09T18:21:59.106Z","updated_at":"2025-10-09T18:22:01.215Z","avatar_url":"https://github.com/caogiathinh.png","language":null,"funding_links":[],"categories":[],"sub_categories":[],"readme":"# Cao Gia Thịnh\n### Data Engineer\n\n\u003cp align=\"center\"\u003e\n  \u003cimg src=\"https://readme-typing-svg.demolab.com?font=Fira+Code\u0026weight=600\u0026size=25\u0026duration=4000\u0026pause=1000\u0026color=059669\u0026center=true\u0026vCenter=true\u0026width=500\u0026lines=Hi%2C+I'm+Cao+Gia+Thinh;Aspiring+Data+Engineer;Building+Scalable+Data+Solutions\" alt=\"Typing SVG\" /\u003e\n\u003c/p\u003e\n\nWelcome to my GitHub profile!\n\nI'm Cao Gia Thinh, a final-year Computer Science student with a deep focus on Data Engineering. I am passionate about designing and building scalable, high-performance data systems that transform raw data into valuable insights to support business decision-making.\n\n---\n\n## 📊 GitHub Stats\n\n\u003cp align=\"center\"\u003e\n  \u003cimg src=\"https://raw.githubusercontent.com/caogiathinh/caogiathinh/output/github-contribution-grid-snake-dark.svg#gh-dark-mode-only\" /\u003e\n  \u003cimg src=\"https://raw.githubusercontent.com/caogiathinh/caogiathinh/output/github-contribution-grid-snake.svg#gh-light-mode-only\" /\u003e\n\u003c/p\u003e\n\n\u003cp align=\"center\"\u003e\n  \u003cimg src=\"https://github-readme-streak-stats-eight.vercel.app?user=caogiathinh\u0026theme=dark\u0026short_numbers=true\" /\u003e\n\u003c/p\u003e\n\n## 🛠️ Tech Stack \u0026 Core Competencies\n\n\u003cp align=\"left\"\u003e\n  \u003cimg src=\"https://img.shields.io/badge/Python-3776AB?style=for-the-badge\u0026logo=python\u0026logoColor=white\" alt=\"Python\"/\u003e\n  \u003cimg src=\"https://img.shields.io/badge/SQL-4479A1?style=for-the-badge\u0026logo=postgresql\u0026logoColor=white\" alt=\"SQL\"/\u003e\n  \u003cimg src=\"https://img.shields.io/badge/Apache Spark-E25A1C?style=for-the-badge\u0026logo=apachespark\u0026logoColor=white\" alt=\"Apache Spark\"/\u003e\n  \u003cimg src=\"https://img.shields.io/badge/dbt-FF694B?style=for-the-badge\u0026logo=dbt\u0026logoColor=white\" alt=\"dbt\"/\u003e\n  \u003cimg src=\"https://img.shields.io/badge/Google Cloud-4285F4?style=for-the-badge\u0026logo=googlecloud\u0026logoColor=white\" alt=\"Google Cloud\"/\u003e\n  \u003cimg src=\"https://img.shields.io/badge/Docker-2496ED?style=for-the-badge\u0026logo=docker\u0026logoColor=white\" alt=\"Docker\"/\u003e\n  \u003cimg src=\"https://img.shields.io/badge/PostgreSQL-316192?style=for-the-badge\u0026logo=postgresql\u0026logoColor=white\" alt=\"PostgreSQL\"/\u003e\n  \u003cimg src=\"https://img.shields.io/badge/Git-F05032?style=for-the-badge\u0026logo=git\u0026logoColor=white\" alt=\"Git\"/\u003e\n  \u003cimg src=\"https://img.shields.io/badge/Kestra-4B286D?style=for-the-badge\u0026logo=kestra\u0026logoColor=white\" alt=\"Kestra\"/\u003e\n\n\u003c/p\u003e\n\n---\n\n## 🚀 Key Projects\n\nThese are my flagship projects that showcase my skills and experience.\n\n### 1. [urban-mobility-elt-pipeline](https://github.com/caogiathinh/urban_mobility_elt_pipeline)\n*Built a complete data platform on Google Cloud to collect, process, and analyze retail data from various sources.*\n\n- **Orchestration:** Leveraged **Kestra** (deployed on Cloud Composer) to schedule and orchestrate data ingestion pipelines from parquet files.\n- **Data Lake \u0026 Warehouse:** Stored raw data in **Google Cloud Storage (GCS)**. Subsequently, cleaned, transformed, and loaded the data into **Google BigQuery** using **Apache Spark**.\n- **Data Modeling:** Implemented a **Star Schema** within BigQuery to optimize for analytical queries.\n- **Deployment:** Containerized the entire application and its dependencies using **Docker** to ensure consistency across environments.\n\n**Technologies:** `GCP (BigQuery, GCS, Composer)`, `Kestra`, `Apache Spark`, `Docker`, `Python`, `SQL`, `dbt`, `Google Data Studio`.\n\n---\n\n### 2. [modern-data-warehouse](https://github.com/caogiathinh/modern-data-warehouse)\n*Designed and implemented a modern data warehouse to empower Sales and Marketing teams with advanced analytics.*\n\n- **ETL \u0026 Transformation:** Using SQL to extract, transform, and load from source to destination data warehouse.\n- **Data Warehouse Design:** Architected a DWH schema on **Microsoft SQL Server**. \n\n**Technologies:** `T-SQL`, `MS SQL SERVER`.\n\n\n## 📫 Let's Connect!\n\nI'm always open to discussing new opportunities, interesting projects, or anything related to data and technology. Feel free to reach out!\n\n\u003cp align=\"left\"\u003e\n  \u003ca href=\"https://www.linkedin.com/in/cao-gia-thịnh-72634a32a\" target=\"_blank\"\u003e\n    \u003cimg src=\"https://img.shields.io/badge/LinkedIn-0077B5?style=for-the-badge\u0026logo=linkedin\u0026logoColor=white\" alt=\"LinkedIn\"/\u003e\n  \u003c/a\u003e\n  \u003ca href=\"mailto:your.email@example.com\"\u003e\n    \u003cimg src=\"https://img.shields.io/badge/Email-D14836?style=for-the-badge\u0026logo=gmail\u0026logoColor=white\" alt=\"Email\"/\u003e\n  \u003c/a\u003e\n\u003c/p\u003e****\n","project_url":"https://awesome.ecosyste.ms/api/v1/projects/github.com%2Fcaogiathinh%2Fcaogiathinh","html_url":"https://awesome.ecosyste.ms/projects/github.com%2Fcaogiathinh%2Fcaogiathinh","lists_url":"https://awesome.ecosyste.ms/api/v1/projects/github.com%2Fcaogiathinh%2Fcaogiathinh/lists"}