https://github.com/madhurimarawat/python-projects
This repository contains the projects that I made in the Python programming language.
https://github.com/madhurimarawat/python-projects
automobile-dataset big-data-analytics csv-files data-analysis-python data-visualization data-visualization-project documentation email-spam-classification google-app-data image-background-removal image-processing nba-dataset python python-libraries rock-paper-scissors-game streamlit streamlit-deployment streamlit-webapp
Last synced: 4 months ago
JSON representation
This repository contains the projects that I made in the Python programming language.
- Host: GitHub
- URL: https://github.com/madhurimarawat/python-projects
- Owner: madhurimarawat
- Created: 2023-06-02T13:43:29.000Z (about 2 years ago)
- Default Branch: main
- Last Pushed: 2024-11-26T16:49:19.000Z (7 months ago)
- Last Synced: 2025-01-15T01:16:55.831Z (5 months ago)
- Topics: automobile-dataset, big-data-analytics, csv-files, data-analysis-python, data-visualization, data-visualization-project, documentation, email-spam-classification, google-app-data, image-background-removal, image-processing, nba-dataset, python, python-libraries, rock-paper-scissors-game, streamlit, streamlit-deployment, streamlit-webapp
- Language: Jupyter Notebook
- Homepage: https://python-email-spam-classification.streamlit.app/
- Size: 17.6 MB
- Stars: 1
- Watchers: 1
- Forks: 0
- Open Issues: 0
-
Metadata Files:
- Readme: README.md
Awesome Lists containing this project
README
# Python-Projects
This repository contains the projects that I made in the Python programming language.
## About Python Programming
--> Python is a high-level, general-purpose, and very popular programming language.
--> Python programming language (latest Python 3) is being used in web development, Machine Learning applications, along with all cutting-edge technology in Software Industry.
--> Python is available across widely used platforms like Windows, Linux, and macOS.
--> The biggest strength of Python is huge collection of standard library.---
## Mode of Execution Used--> Colaboratory, or “Colab” for short, is a product from Google Research which allows anybody to write and execute python code in Jupyter notebook through the browser.
--> Visit colab at:
--> Create account using google account.
--> Once account creation is done, we can directly start coding in colab.
--> It supports Python and R.
--> Files are directly saved in Google Drive.
--> To install python library this command is used-
```
pip install library_name
```
---## About Projects
Complete Description about the project and resources used.
---
## **1. Data Visualization Automobile Dataset**
--> Data Visualization is the presentation of data in pictorial format.
--> Target was to see which automobile gives the most features and variations using data visualization.
--> In this project visualization of CSV file containing data of automobiles is done in python.
--> Data visualization is done to analyze which body-style of car gives the most features.
--> Patterns found in the analysis are listed.### Dataset Used
#### **Automobile Dataset**
--> Dataset is taken from: 🔗![]()
--> This contains data about various automobile in Comma Separated Value (CSV) format.
--> CSV file contains the details of automobile-mileage,length,body-style among other attributes.
--> It contains the following dimensions-[60 rows X 6 columns].
--> The csv file is already preprocessed ,thus their is no need for data cleaning.---
## **2. Data Visualization Nba Dataset**
--> Data Visualization is the presentation of data in pictorial format.
--> Target was to see the features and variations of this dataset and the best data visualization technique for this dataset.
--> In this project visualization of CSV file containing data of new york taxi trip is done in python.
--> Data analysis is done to decide suitable data visulization technique.
--> Finally Data Visualization is done to represent the analysis in an understandable way.### Dataset Used
#### **NBA Players Dataset**--> Dataset is taken from:
![]()
--> This contains data about various NBA Players in Comma Separated Value (CSV) format.
--> CSV file contains the details of players-height,weight,team,position among other attributes.
--> It contains the following dimensions-[457 rows X 9 columns].
--> The csv file is already preprocessed ,thus their is no need for data cleaning.---
## **3. Rock, Paper, Scissors Unleashed**
--> This Contains a simple and interactive implementation of rock, paper and scissors game.
--> Color code and emojis are also added.---
## **4. Image Background Remover**
--> This contains a simple scratch implementation of background remover in python.
--> Using this we can detect face from images and remove background.---
## **5. Big Data Analytics New York Taxi Dataset**
--> Data Visualization is the presentation of data in pictorial format.
--> Data Cleaning and Visualization is done for this dataset.### Dataset Used
#### **New York Taxi Trip Dataset Dataset**
Dataset is taken from: 🔗
![]()
---
## **6. Google App Store Exploratory Data Analysis (EDA)**--> Data Visualization is the presentation of data in pictorial format.
--> Data Cleaning and Visualization is done for this dataset.
--> This Project contains code for EDA and Data Visualization of Google App store data.
--> All findings are summarized, code is also explained.
--> At the end of this project, I have explained all findings in brief.
--> After all graphs explanation is also given to understand the results.---
### 7. Email Spam Classification
--> **Objective**: Develop a machine learning system for email spam classification using Logistic Regression, focusing on accuracy and efficiency.
--> **Preprocessing & Feature Engineering**: Text data is preprocessed using TF-IDF vectorization, converting raw email text into meaningful features for the model.
--> **Model**: Logistic Regression is used for binary classification, achieving 94.32% accuracy, with strong precision and recall metrics.
--> **Deployment**: The model is deployed in a Streamlit web app, enabling users to input email text and receive real-time spam classification results.




---
## Libraries Used
Short Description about all libraries used in Project.
- Pandas (Panel Data/ Python Data Analysis) - This library is mostly used for analyzing,
cleaning, exploring, and manipulating data. - Matplotlib - It is a data visualization and graphical plotting library.
- Seaborn - It is an extension of Matplotlib library used to create more attractive and
informative statistical graphics.
---
## Thanks for Visiting 😄
Drop a 🌟 if you find this repository useful.
If you have any doubts or suggestions, feel free to reach me.
📫 How to reach me: [](https://www.linkedin.com/in/madhurima-rawat/)