https://github.com/mekwiset/mekwiset
- Host: GitHub
- URL: https://github.com/mekwiset/mekwiset
- Owner: MekWiset
- Created: 2024-07-21T16:28:09.000Z (11 months ago)
- Default Branch: main
- Last Pushed: 2024-10-16T10:18:30.000Z (8 months ago)
- Last Synced: 2024-10-18T00:58:28.453Z (8 months ago)
- Size: 2.2 MB
- Stars: 0
- Watchers: 1
- Forks: 0
- Open Issues: 0
Metadata Files:
- Readme: README.md
README
Hi there 👋, I'm Mek
As a passionate Junior Data Engineer, I specialize in leveraging data to drive impactful insights and solutions. I thrive on solving complex data challenges and enjoy working with diverse technologies to optimize data pipelines and workflows. With a strong foundation in data engineering tools and methodologies, I am particularly interested in exploring advancements in data science, cloud computing, and big data analytics. My goal is to contribute to innovative projects that harness the power of data to create meaningful outcomes and drive progress in various industries.
- 📫 How to reach me: [email protected]
- Learn about my experience: [https://www.linkedin.com/in/siriwat-wisetpakdeewong-78b641256/](https://www.linkedin.com/in/siriwat-wisetpakdeewong-78b641256/)
- View my resume: [Click Here](https://drive.google.com/file/d/1UP6nrpnJiu_JMxbAWhsqxt13RWENKJbf/view?usp=sharing)
Connect with me:
- [LinkedIn](https://www.linkedin.com/in/siriwat-wisetpakdeewong-78b641256/)
- [Email](mailto:[email protected])
- [Medium](https://medium.com/@siriwatwisetpakdeewong)

💻 Languages and Tools:
### Recent Medium Articles
---
## Featured Projects
### 1. [LiquorSales Data Migration Pipeline](https://github.com/MekWiset/LiquorSales_Data_Migration_Pipeline)
- **ℹ️ Description:** Big data migration from GCP to Azure.
- **Achievements:** Migrated over 19 million rows from Google Cloud Storage to Azure Data Lake with zero data loss, preserving data integrity.
- **🎯 Technologies used:**
  - **Processing Tools:** PySpark
  - **GCP Services:** Google Cloud Storage (GCS), BigQuery
  - **Azure Services:** Azure Data Factory, Azure Data Lake Storage Gen2, Databricks, Azure Key Vault
  - **Others:** Docker

### 2. [Medallion Data Lakehouse](https://github.com/MekWiset/Medallion_DataLakehouse_project)
- **ℹ️ Description:** Building a data lakehouse using the Medallion architecture.
- **Achievements:** Developed a scalable data lakehouse on the Medallion architecture that supports data storage, processing, and analysis and integrates with Azure services.
- **🎯 Technologies used:**
  - **Processing Tools:** dbt (data build tool)
  - **Azure Services:** Azure SQL Database, Azure Data Lake Storage Gen2, Databricks, Azure Key Vault

### 3. [Realtime Data Streaming](https://github.com/MekWiset/Realtime_Data_Streaming_project)
- **ℹ️ Description:** Real-time data ingestion into Cassandra using Airflow, Kafka, and Spark.
- **Achievements:** Built a real-time streaming pipeline that ingests data into Cassandra with low latency and keeps data flowing consistently across Airflow, Kafka, and Spark.
- **🎯 Technologies used:**
  - **Processing Tools:** PySpark
  - **Orchestration and Streaming Tools:** Airflow, Kafka, ZooKeeper
  - **Monitoring:** Confluent
  - **Storage:** Cassandra
  - **Others:** Docker
---