https://github.com/mylethidiem/zero-to-hero
Project for learning, practicing code: Python, SQL, C/C++, Data science/Data Analysis, AI/Machine learning
https://github.com/mylethidiem/zero-to-hero
ai cpp data-analysis data-science deep-learning machine-learning mlops python sql
Last synced: about 1 month ago
JSON representation
Project for learning, practicing code: Python, SQL, C/C++, Data science/Data Analysis, AI/Machine learning
- Host: GitHub
- URL: https://github.com/mylethidiem/zero-to-hero
- Owner: mylethidiem
- License: mit
- Created: 2017-12-20T17:41:28.000Z (over 8 years ago)
- Default Branch: main
- Last Pushed: 2026-02-21T16:02:37.000Z (about 2 months ago)
- Last Synced: 2026-02-21T22:45:24.371Z (about 2 months ago)
- Topics: ai, cpp, data-analysis, data-science, deep-learning, machine-learning, mlops, python, sql
- Language: Jupyter Notebook
- Homepage:
- Size: 89.1 MB
- Stars: 2
- Watchers: 1
- Forks: 1
- Open Issues: 2
-
Metadata Files:
- Readme: README.md
- Contributing: CONTRIBUTING.md
- License: LICENSE
- Code of conduct: CODE_OF_CONDUCT.md
- Security: SECURITY.md
Awesome Lists containing this project
README
# Zero to Hero: From Beginner to AI Pro
[](https://www.udemy.com/course/python-3-deep-dive-part-1/) [](https://git-scm.com/) [](https://www.kaggle.com/banhmuy) [](https://opensource.org/licenses/MIT)
Welcome to **Zero to Hero**! This repository chronicles my self-taught journey from coding novice to proficient in Data Science and Artificial Intelligence. It's a structured roadmap packed with curated resources, practical tips, code examples, and collaboration guidelines. Whether you're starting from scratch or refining your skills, use this as your blueprint to level up.
> **Mission:** Transform "zero" knowledge into "hero" expertise through consistent learning, hands-on practice, and team-ready habits. Fork it, contribute, or adapt it for your own path!
## Table of Contents
- [Introduction](#introduction)
- [Learning Overview](#learning-overview)
- [C++](#c++)
- [Git](#git)
- [Python](#python)
- [Data Science & AI](#data-science--ai)
- [Practice Projects & Profiles](#practice-projects--profiles)
- [Collaboration Guidelines](#collaboration-guidelines)
- [Commit Message Conventions](#commit-message-conventions)
- [Git Workflow Rules](#git-workflow-rules)
- [How to Contribute](#how-to-contribute)
- [Certificates](#certificates)
- [License](#license)
## Introduction
This repo serves as both a personal learning log and a public resource hub. Key focus areas:
- **Foundations:** Programming basics (C++, Python) and version control (Git).
- **Advanced Skills:** Data manipulation, machine learning, and AI deployment.
- **Real-World Application:** Links to portfolios, challenges, and projects.
- **Best Practices:** Guidelines for clean code, teamwork, and efficient workflows.
Started in 2024, updated as of October 2025. Track progress via commit history—each update marks a milestone!
## Learning Overview
Step-by-step resources with milestones and pro tips. Prioritize free/open-access materials where possible.
### C++
**Focus:** Build core programming intuition—variables, loops, functions, OOP, and performance tuning. Ideal for systems-level understanding.
- **Resources:**
- [DaynhaUHoc C++](https://cpp.daynhauhoc.com/) (Free, beginner-friendly in Vietnamese).
- Udemy: "Beginning C++ Programming - From Beginner to Beyond".
- NVNS Academy: Advanced modules on STL and multithreading.
- **Milestones:**
- Compile/run a "Hello World" program.
- Implement arrays, pointers, and classes.
- Solve 10+ problems on HackerRank C++ track.
- **Pro Tip:** Use Valgrind for memory debugging—catches leaks early.
### Git
**Focus:** Master version control to collaborate without chaos. From solo commits to PRs in large teams.
- **Resources:**
- [Official Git Documentation](https://git-scm.com/book/en/v2) (Free, in-depth guide).
- [Git - The Simple Guide](https://rogerdudler.github.io/git-guide/) (Quick reference).
- [Best Git Practices for Teams](https://www.geeksforgeeks.org/git/best-git-practices-to-follow-in-teams/) (Workflow strategies).
- Video: [Quản lý Phiên Bản Code với Git/GitHub](https://www.youtube.com/watch?v=LbNd2XgWFe0) (Practical Vietnamese tutorial).
- **Milestones:**
- Set up a local repo, add commits, and push to GitHub.
- Create/merge branches and resolve conflicts.
- Automate with GitHub Actions for linting/tests.
- **Pro Tip:** Adopt semantic versioning (SemVer) for releases: MAJOR.MINOR.PATCH.
### Python
**Focus:** Versatile language for automation, data analysis, and ML. Emphasize readable, modular code.
- **Resources:**
- Udemy: [Python 3 Deep Dive (Parts 1-4)](https://www.udemy.com/course/python-3-deep-dive-part-1/) (Comprehensive, from basics to advanced topics like iterators and context managers).
- [PEP 8 Style Guide](https://peps.python.org/pep-0008/) (Enforce clean, professional code).
- **Milestones:**
- Write scripts for file handling and APIs.
- Build a CLI tool with argparse.
- Optimize with list comprehensions and decorators.
- **Pro Tip:** Lint with Black and Flake8—keeps code consistent across teams.
### Data Science & AI
**Focus:** Turn data into insights. Cover stats, ML algorithms, neural networks, and ethics.
- **Resources:**
- [AIVN Academy](https://aivietnam.edu.vn) (Vietnamese-focused courses on ML fundamentals).
- Additional: Udemy (ML A-Z), Coursera (Deep Learning Specialization), Kaggle Learn (Free micro-courses), AWS/NVIDIA (Cloud AI certs).
- **Milestones:**
- EDA on a Kaggle dataset with pandas/Seaborn.
- Train/deploy a scikit-learn model.
- Experiment with TensorFlow/PyTorch for image classification.
- **Pro Tip:** Version experiments with MLflow—tracks params, metrics, and artifacts.
## Practice Projects & Profiles
Apply knowledge through challenges and portfolios. Here's my progress:
- 📊 **[Kaggle Profile](https://www.kaggle.com/banhmuy)**:An online community for data scientists and machine learners.
- 🤖 **[DeepML Profile](https://www.deep-ml.com/profile/mzOHLfAKLVauQjHcZOdJxLdgiTS2)**: Learn Linear Algebra, Machine Learning, Deep Learning, NLP and Computer Vision practice
- 💻 **[LeetCode Profile](https://leetcode.com/lethidiemmy961996)**: 200+ solved problems in arrays, trees, and dynamic programming.
- 🐵 **[DataLemur - Ace the SQL & Data Science Interview](https://t.co/JG4qmAAbho)**(No profile page): Practice the most common SQL, Statistics, ML, and Python questions asked in FAANG Data Science & Data Analyst interviews!
- 📈 **[Hackerrank Profile](https://www.hackerrank.com/profile/lethidiemmy96191)**: A programming practice platform that provides competitive challenges and tests for learning skills and preparing for technology careers.
- 🤗 **[Hugging Profile](https://huggingface.co/elizabethmyn)**: The platform where the machine learning community collaborates on models, datasets, and applications. Explore AI Apps.
**Starter Project Ideas:**
- **C++/Git:** Version a simple calculator app with feature branches.
- **Python/DS:** Web scraper for stock data + visualization dashboard.
- **AI:** Fine-tune a Hugging Face model for sentiment analysis on Vietnamese text.
## Collaboration Guidelines
Open to PRs! These rules ensure smooth, professional teamwork.
### Commit Message Conventions
Imperative style with module prefixes for traceability.
**Format:** `[Module]: [Action verb] [Brief description]`
| ✅ **Examples (Do)** | ❌ **Examples (Don't)** |
|---------------------|------------------------|
| `Scripts: Update batch files for faster processing` | `Scripts: fixed paths` |
| `Powershell: Add folder size calculation function` | `update feature` |
| `Data: Refactor CSV parsing to handle edge cases` | `added file` |
**Best Practices:**
- Verbs: Add, Fix, Update, Refactor, Remove, Optimize.
- Capitalize first letter; no trailing period.
- Body (optional): Blank line + details (e.g., "Resolves #42. Improves runtime by 15%.").
### Git Workflow Rules
GitFlow-inspired: `dev` as integration branch, `main` for releases.
1. **Branching:**
- Default: Work on `dev`.
- No direct commits to `main` (except tags).
- Naming: `feature/` (e.g., `feature/user-auth`), `fix/` (e.g., `fix/api-bug`).
2. **Commit & Push:**
- `git pull` upstream before `git push` to sync.
- Follow conventions; no large files/secrets (.env).
- Use `git add -p` for selective staging.
3. **Pull Requests:**
- From your branch to `dev`.
- Description: Goal, changes, tests passed, screenshots if UI-related.
- @tag reviewers; aim for 1-2 approvers.
4. **Conflicts:**
- Resolve locally, rebase if needed, test fully.
- Unsure? Open an issue for guidance.
5. **Structure & Ignores:**
- Folders: `src/` (code), `tests/` (unit tests), `docs/` (notes).
- Files: Descriptive names (e.g., `data_loader.py`).
- `.gitignore`: Temps, models, large CSVs—use [GitHub's template](https://github.com/github/gitignore).
## How to Contribute
1. Fork/clone the repo.
2. Branch: `git checkout -b feature/your-contribution`.
3. Commit/push following guidelines.
4. Open PR to `dev`—include motivation and impact.
5. Discuss via issues for features or bugs.
Your contributions make this repo stronger—thanks for joining the journey!
## Certificates
Key achievements validating the "Hero" status:
1. [Kaggle: Python](https://www.kaggle.com/learn/certification/banhmuy/python) – Core programming skills.
2. [Kaggle: Pandas](https://www.kaggle.com/learn/certification/banhmuy/pandas) – Data wrangling expertise.
3. [CodeLearn Profile](https://codelearn.io/profile/920858) – Algorithms and web challenges.
## License
MIT License – Free to use, modify, and distribute. See [LICENSE](LICENSE) for details. © 2024-2025 [Your Name]. 🚀