An open API service indexing awesome lists of open source software.

https://github.com/daniel-jcvv/daniel-jcvv

👨‍💻 Data Engineer | 3+ years enterprise experience with Telcel & Citi Banamex Develop ETL pipelines, data governance, and cloud solutions. Building scalable data architectures and automated workflows for Fortune 500 clients. Tech Stack: Python, SQL Server, Oracle, Apache Airflow, PySpark
https://github.com/daniel-jcvv/daniel-jcvv

agentic-ai apache-airflow apache-kafka apache-spark automation business-intelligence citi-bank-apis data-analysis data-engineering data-lake data-warehouse etl-pipeline medallion-architecture mlops n8n-workflow python rag sql-server

Last synced: 2 months ago
JSON representation

👨‍💻 Data Engineer | 3+ years enterprise experience with Telcel & Citi Banamex Develop ETL pipelines, data governance, and cloud solutions. Building scalable data architectures and automated workflows for Fortune 500 clients. Tech Stack: Python, SQL Server, Oracle, Apache Airflow, PySpark

Awesome Lists containing this project

README

          

# Juan Daniel García Belman
## 🤖 AI Data Automation Engineer | 📊 Data Infrastructure | ⚡ Agentic Workflows

[![Portfolio](https://img.shields.io/badge/Portfolio-Visit%20Now-blue?style=for-the-badge&logo=vercel)](https://danieljcvv-portfolio.vercel.app)
[![LinkedIn](https://img.shields.io/badge/LinkedIn-Connect-0077B5?style=for-the-badge&logo=linkedin)](https://www.linkedin.com/in/juan-daniel-garcia-belman-99a298aa)
[![Email](https://img.shields.io/badge/Email-Contact-D14836?style=for-the-badge&logo=gmail)](mailto:danielgb331@outlook.com)

![Status](https://img.shields.io/badge/Status-Actively%20working%20on%20AI%20Automation-brightgreen?style=flat-square)
![Experience](https://img.shields.io/badge/Experience-4%2B%20Years-orange?style=flat-square)
![Location](https://img.shields.io/badge/Location-Querétaro,%20México-red?style=flat-square)

---

## 👨‍💻 About Me

**AI Data Automation and Data Engineer** with over 4 years of professional experience supporting Fortune 500 clients (Telcel, Citi Banamex).
Hands-on in pipeline operations, multi-environment deployment, data reconciliation, and metadata governance. Currently building AI automation workflows with n8n and Python and Agentic AI.

---

## 🛠️ Tech Stack

### **AI & Automation**
![n8n](https://img.shields.io/badge/n8n-FF6D5A?style=flat&logo=n8n&logoColor=white)
![Python](https://img.shields.io/badge/Python_Automation-3776AB?style=flat&logo=python&logoColor=white)
![AI Agents](https://img.shields.io/badge/AI_Agents-FF4B4B?style=flat&logo=openai&logoColor=white)
![LangChain](https://img.shields.io/badge/LangChain-1C3C3C?style=flat&logo=chainlink&logoColor=white)
![RAG](https://img.shields.io/badge/RAG_Pipelines-00A67E?style=flat&logo=semantic-web&logoColor=white)

### **Data Engineering & Architecture**
![Databricks](https://img.shields.io/badge/Databricks-FF3621?style=flat&logo=databricks&logoColor=white)
![T-SQL](https://img.shields.io/badge/T--SQL-025E8C?style=flat&logo=microsoft-sql-server&logoColor=white)
![PySpark](https://img.shields.io/badge/PySpark-E25A1C?style=flat&logo=apache-spark&logoColor=white)
![Power BI](https://img.shields.io/badge/Power%20BI-F2C811?style=flat&logo=power-bi&logoColor=black)

### **Cloud & DevOps**
![Azure](https://img.shields.io/badge/Microsoft%20Azure-0078D4?style=flat&logo=microsoft-azure&logoColor=white)
![ADF](https://img.shields.io/badge/Azure%20Data%20Factory-0078D4?style=flat&logo=microsoft-azure&logoColor=white)
![ADLS Gen2](https://img.shields.io/badge/ADLS%20Gen2-0078D4?style=flat&logo=microsoft-azure&logoColor=white)
![Docker](https://img.shields.io/badge/Docker-2496ED?style=flat&logo=docker&logoColor=white)
![Git](https://img.shields.io/badge/Git-F05032?style=flat&logo=git&logoColor=white)

---

## Professional Experience

### Freelance AI Automation Projects | *Independent*
**May 2025 – Present**
- AI voice sales agent for hardware distributor — inventory lookup + order tracking 24/7 (n8n, Docker)
- Automated fuel cost optimizer — scrapes 100+ gas stations, delivers top-3 via dashboard (n8n, Python)

### ETL Pipeline Operations — NTT DATA | *Telcel Client*
**May 2024 – Apr 2025**
- Operated 150+ ADF pipelines processing 5M+ daily records; ingesting 500-600 .unl files via SFTP into ADLS Gen2 with Get Metadata validation against .verf control files, resolving failures on 24/7 on-call rotation
- Executed daily reconciliation through Databricks notebooks (SQL + PySpark) on Job clusters, applying MERGE on Delta tables and validating control figures via Azure SQL stored procedures
- Maintained CI/CD workflow through Azure DevOps Git integration, promoting pipelines across environments via ARM templates with naming conventions.

### Metadata Governance Analyst — NTT DATA | *Citi Banamex Client*
**Feb 2023 – May 2024**
- Maintained naming standards across Raw/Staging/Curated layers in ADLS Gen2; used Unity Catalog for lineage tracking, tagging policies, and naming compliance validation
- Processed metadata change requests via JIRA, running SQL-based validation against DAMA DMBOK conventions; maintained CNBV regulatory documentation in Confluence

### Data Analyst — NTT DATA | *Citi Banamex Client*
**Feb 2022 – Feb 2023**
- Analyzed banking financial datasets using SQL/Excel for data reconciliation; built 10+ Power BI dashboards (DAX) for executive reporting
- Automated monthly validation of 500+ data attributes with Python (OpenPyXL, pyodbc), reducing processing time from 8 hours to 45 minutes

### Industrial Process Analyst — ZF TRW
**May 2018 – May 2021**
- Cost-saving machining process optimization (+150 units/hour, 5% cost savings) and built Excel dashboards

---

## Portfolio Projects

### [End-to-End Inventory ETL Pipeline with AI Dashboard](https://github.com/Daniel-jcVv/inventory-autopilot)
> Automated inventory health pipeline: dead stock detection, overstock alerts, and demand forecasting from raw ERP exports.
- **Stack:** Python, pandas, SQL Server, Streamlit, n8n Automation
- **Live demo:** [inventory-autopilot.streamlit.app](https://inventory-autopilot-3mlc5ryu4wsxppx26jvbqe.streamlit.app/)

### [Fleet Data Pipeline and Cost Reporting](https://github.com/Daniel-jcVv/gps-fleet-analytics)
> ETL pipeline and interactive dashboard for fleet fuel consumption and route analysis.
- **Stack:** Python, pandas, SQLite, Streamlit

---

## Education

**B.Eng. in Manufacturing Technologies**
Universidad Politécnica de Guanajuato, 2015

## Certifications

- **IBM Data Warehousing Engineer** — Coursera, 2024
- **Google Data Analytics** — Coursera, 2022

## Languages

Spanish: Native | English: Intermediate (B1+, technical reading/writing)

---

### Soli Deo Gloria
*Ora et labora*