An open API service indexing awesome lists of open source software.

https://github.com/staciax/data-science


https://github.com/staciax/data-science

data-science matplotlib numpy skit-learn

Last synced: about 2 months ago
JSON representation

Awesome Lists containing this project

README

          

# Data Science Projects

My data science journey with machine learning projects using Python.

## Algorithms

### Supervised Learning
- **K-Nearest Neighbors (KNN)** - Classify based on nearest neighbors
- **Decision Trees** - Tree structure for classification and regression
- **Random Forest** - Multiple decision trees working together

### Unsupervised Learning
- **K-Means Clustering** - Group data into K clusters
- **Hierarchical Clustering** - Create cluster hierarchies
- **Principal Component Analysis (PCA)** - Reduce dimensions while keeping important info

## Datasets

I've used these datasets to practice and learn:

1. **Wheat Seeds** - Different types of wheat seeds
2. **Earthquake Alert** - Earthquake prediction
3. **Wine Quality** - Red and white wine ratings
4. **Iris** - Flower species classification
5. **Digits** - Handwritten numbers

Check [data/README.md](data/README.md) for more info about the datasets.

## Getting Started

### Prerequisites

- Python 3.13+
- uv package manager

### Installation

Clone and install:
```bash
git clone https://github.com/staciax/data-science.git
cd data-science
uv sync
```

## License

Open source for educational use.

## Contributing

Feel free to open issues or submit pull requests!