Ecosyste.ms: Awesome

An open API service indexing awesome lists of open source software.

Awesome Lists | Featured Topics | Projects

https://github.com/thaikoz/data-science

This repository contains various data science projects and exercises implemented in Python using Jupyter Notebooks. It is intended for educational purposes and to document my learning journey in data science.
https://github.com/thaikoz/data-science

numpy pandas python

Last synced: about 2 months ago
JSON representation

This repository contains various data science projects and exercises implemented in Python using Jupyter Notebooks. It is intended for educational purposes and to document my learning journey in data science.

Awesome Lists containing this project

README

        

# Data Science Learning Repository

This repository contains various data science projects and exercises implemented in Python using Jupyter Notebooks. It is intended for educational purposes and to document my learning journey in data science.

## Table of Contents

1. [Introduction](#introduction)
2. [Setup and Installation](#setup-and-installation)
3. [Projects and Exercises](#projects-and-exercises)
4. [Technologies Used](#technologies-used)

## Introduction

This repository is a collection of data science projects and exercises that I have worked on to enhance my skills in data analysis, machine learning, and data visualization. Each project is documented in a Jupyter Notebook, showcasing the problem-solving approach, methodology, and results.

## Setup and Installation

To run the notebooks in this repository, you need to have Python and Jupyter Notebook installed on your system. You can follow the steps below to set up the environment:

1. **Clone the repository**:

```bash
git clone https://github.com/ThaikoZ/data-science.git
cd data-science
```

2. **Create a virtual environment** (optional but recommended):

```bash
python -m venv venv
source venv/bin/activate # On Windows, use `venv\Scripts\activate`
```

3. **Install the required packages**:

```bash
pip install -r requirements.txt
```

4. **Launch Jupyter Notebook**:

```bash
jupyter notebook
```

## Projects and Exercises

1. **Data Analysis Projects**:
- Exploratory Data Analysis (EDA)
- Statistical Analysis
- Data Cleaning and Preprocessing

2. **Machine Learning Projects**: (Future)
- Supervised Learning (Regression, Classification)
- Unsupervised Learning (Clustering, Dimensionality Reduction)
- Model Evaluation and Tuning

3. **Data Visualization Projects**: (Future)
- Visualizing Data with Matplotlib and Seaborn
- Interactive Visualizations with Plotly

## Technologies Used

- **Python**: The main programming language used.
- **Jupyter Notebook**: For interactive computing and data exploration.
- **Pandas**: For data manipulation and analysis.
- **NumPy**: For numerical computing.
- **Matplotlib**: For static data visualizations.
- **Seaborn**: For statistical data visualizations.
- **Scikit-Learn**: For machine learning algorithms and tools.
- **Plotly**: For interactive visualizations.

---

Feel free to contribute by submitting issues or pull requests if you find any improvements or additional projects to include. Happy learning!