https://github.com/staciax/data-science
https://github.com/staciax/data-science
data-science matplotlib numpy skit-learn
Last synced: about 2 months ago
JSON representation
- Host: GitHub
- URL: https://github.com/staciax/data-science
- Owner: staciax
- Created: 2025-11-15T07:54:43.000Z (about 2 months ago)
- Default Branch: dev
- Last Pushed: 2025-11-15T07:58:01.000Z (about 2 months ago)
- Last Synced: 2025-11-15T09:23:40.746Z (about 2 months ago)
- Topics: data-science, matplotlib, numpy, skit-learn
- Language: Python
- Homepage:
- Size: 6.6 MB
- Stars: 0
- Watchers: 0
- Forks: 0
- Open Issues: 0
-
Metadata Files:
- Readme: README.md
Awesome Lists containing this project
README
# Data Science Projects
My data science journey with machine learning projects using Python.
## Algorithms
### Supervised Learning
- **K-Nearest Neighbors (KNN)** - Classify based on nearest neighbors
- **Decision Trees** - Tree structure for classification and regression
- **Random Forest** - Multiple decision trees working together
### Unsupervised Learning
- **K-Means Clustering** - Group data into K clusters
- **Hierarchical Clustering** - Create cluster hierarchies
- **Principal Component Analysis (PCA)** - Reduce dimensions while keeping important info
## Datasets
I've used these datasets to practice and learn:
1. **Wheat Seeds** - Different types of wheat seeds
2. **Earthquake Alert** - Earthquake prediction
3. **Wine Quality** - Red and white wine ratings
4. **Iris** - Flower species classification
5. **Digits** - Handwritten numbers
Check [data/README.md](data/README.md) for more info about the datasets.
## Getting Started
### Prerequisites
- Python 3.13+
- uv package manager
### Installation
Clone and install:
```bash
git clone https://github.com/staciax/data-science.git
cd data-science
uv sync
```
## License
Open source for educational use.
## Contributing
Feel free to open issues or submit pull requests!