An open API service indexing awesome lists of open source software.

https://github.com/iamfoysal/data-analysis

This repository contains various examples and exercises to help learn data science using Python.
https://github.com/iamfoysal/data-analysis

data-analysis data-science database jupyter-notebook python3

Last synced: about 2 months ago
JSON representation

This repository contains various examples and exercises to help learn data science using Python.

Awesome Lists containing this project

README

          

Welcome to the Data Analysis repository! This repository contains various examples and exercises to help learn data science using Python.

## Contents

- **Jupyter Notebooks**: Interactive notebooks for data analysis and visualization.
- **Lexical Analysis**: Examples of lexical analysis in different programming languages (C, C++, Java, Python).
- **Scripts**: Python scripts for data manipulation and analysis.

## Getting Started

1. **Clone the repository**:
```sh
git clone https://github.com/iamfoysal/data-analysis.git
cd data-analysis
```

2. **Set up the virtual environment**:
```sh
python -m venv venv
source venv/bin/activate # On Windows use `venv\Scripts\activate`
```

3. **Install the required packages**:
```sh
pip install -r requirements.txt
```

4. **Run Jupyter Notebook**:
```sh
jupyter notebook
```

### Lexical Analysis

- `lexical_analysis/lexical_analysis_c/lexical_analysis.c`: Lexical analysis example in C.
- `lexical_analysis/lexical_analysis_cpp/lexical_analysis.cpp`: Lexical analysis example in C++.
- `lexical_analysis/lexical_analysis_java/lexical_analysis.java`: Lexical analysis example in Java.
- `lexical_analysis/lexical_analysis_py/lexical_analysis.py`: Lexical analysis example in Python.

### Jupyter Notebooks

- `data_frame.ipynb`: Creating and manipulating data frames using pandas.
- `ds_functions.ipynb`: Data cleaning functions using pandas.
- `read_data_from_database.ipynb`: Reading data from a SQLite database into a pandas DataFrame.
- `ds_data_preparation.ipynb`: Data preparation and cleaning using pandas.

### Data Generation

- `generator/data_generate.py`: Script to generate fake data and insert it into a SQLite database.
- `generator/products_csv.py`: Sample CSV file containing product data.

### Database

- `db/users.db`: SQLite database file containing user data for analysis and testing purposes.

### Data Files

- `files/`: Directory containing various files for data analysis, including CSV, JSON, and other data formats.