An open API service indexing awesome lists of open source software.

Projects in Awesome Lists tagged with data-analysis-python

A curated list of projects in awesome lists tagged with data-analysis-python.

https://github.com/gdsbook/book

This book serves as an introduction to a whole new way of thinking systematically about geographic data, using geographical analysis and computation to unlock new insights hidden within data.

data-analysis-python data-science geographic-data geographical-information-system spatial-analysis spatial-data-analysis spatial-statistics statistics

Last synced: 15 Mar 2025

https://github.com/ptmcg/littletable

An in-memory database of Python objects, searchable using a quasi-SQL API

data-analysis-python database python

Last synced: 16 May 2025

https://github.com/hemansnation/data-analyst-roadmap

Data-Analyst-Roadmap for Professionals. This roadmap contains 8 Chapters that can be completed in 8 weeks, whether you are a fresher in the field or an experienced professional who wants to transition into Data Analysis.

analytics data-analysis data-analysis-python data-analytics data-science numpy predictive-analytics project-based-learning python statistics tableau

Last synced: 15 Apr 2025

https://github.com/spratiher9/sparkora

Powerful rapid automatic EDA and feature engineering library with a very easy to use API 🌟

apache apache-spark data data-analysis data-analysis-python data-analytics easy-to-use eda exploratory-data-analysis open-source opensource pyspark python python3 toolkit

Last synced: 17 Mar 2025

https://github.com/kylejgillett/sounderpy

A Python package that helps you access and plot vertical profile data for meteorological analysis

atmospheric-science atmospheric-sciences data-analysis-python meteorology python weather weather-data

Last synced: 06 Apr 2025

https://github.com/anselmoo/spectrafit

📊📈🔬 SpectraFit is a command-line and Jupyter-notebook tool for quick data-fitting based on the regular expression of distribution functions.

console-application curve-fitting data-analysis data-analysis-python data-science data-visualization fitting juypter-notebook python science science-research scientific-plotting spectral-analysis spectroscopy

Last synced: 09 Apr 2025

https://github.com/mrankitgupta/python-roadmap

I am sharing Python lessons, from scratch to intermediate level, with practice sets that I studied during my 66DaysofData journey into Data Analytics.

66daysofdata analytics ankitgupta data-analysis data-analysis-python data-analytics data-mining data-science data-structures data-visualization jupyter matplotlib mrankitgupta numpy pandas programming python python-library python3

Last synced: 22 Apr 2025

https://github.com/jincheng9/python-tutorial

Python tutorial for quantitative trading, covering basic, intermediate, and advanced tutorials

data data-analysis-python data-analyst data-science django flask numpy pandas python quant quant-dev tutorial

Last synced: 07 May 2025

https://github.com/shreeparab1890/fifa-wc-2022-qatar-data-analysis-eda

This is a Jupyter Notebook (IPython Notebook) with exploratory data analysis (EDA) on FIFA WC Qatar 2022 match data.

data-analysis data-analysis-python data-science data-visualization eda fifa matplotlib-pyplot numpy pandas plotly-express python-3

Last synced: 01 Jan 2025

https://github.com/akashkobal/predictive-analytics

Predictive analytics is a conceptual discipline that supports competitive strategy across industries. Learners will benefit from this repository by getting to know modern data analytics concepts and developing skills for analysing and synthesizing data sets for decision making in firms.

akash akash-kobal data-analysis-python data-analytics data-analytics-python datascience machinelearning predictive-analytics predictive-analytics-course predictive-analytics-tutorial predictive-analytics-using-python predictiveanalytics pres

Last synced: 05 Dec 2024

https://github.com/misaghmomenib/data-analysis-projects

A Repository Featuring a Collection of Data Analysis Projects, Showcasing Various Techniques and Tools for Extracting Insights From Data. Explore, Learn, and Utilize These Projects to Enhance Your Data Analysis Skills and Workflows.

data-analysis data-analysis-python data-visualization jupyter-notebook open-source python

Last synced: 13 Apr 2025

https://github.com/prak112/esg-profile

Assessing stock-price fluctuations of companies based on their ESG-profiles

data-analysis-python pdf-scraping sustainability-score

Last synced: 12 Apr 2025

https://github.com/jen-uis/customer-segmentation-analysis

This repository contains materials for the Spring 2024 STAT 208 class, specifically for Team 8. All materials are the property of Team 8, University of California, Riverside, A. Gary Anderson School of Management. Thank you for viewing our repository.

business-analytics customer-segmentation customer-segmentation-analysis data-analysis-python jupyter-notebook marketing-analytics marketingdata project-repository python-3 team-project university-of-california-riverside

Last synced: 20 Nov 2024

https://github.com/ahammadmejbah/ibm-project-03-analyzing-spreadsheet-data-with-python

Spreadsheets are computer programs that allow users to enter, view, and change information in a gridlike format. One of the most useful programs on a PC is a spreadsheet, which allows users to organize data in tabular form. A spreadsheet's primary purpose is to store numerical data and sometimes a few words of text.

data-analysis data-analysis-python data-analyst data-science excel python spreadsheet

Last synced: 27 Apr 2025

https://github.com/ranish-shrestha/sales_data_analysis

Sales data analysis using Python.

data-analysis-python python-project

Last synced: 18 Mar 2025

https://github.com/ikanurfitriani/project-data-analysis-python

This repository contains the results of learning data analysis using Python.

data-analysis data-analysis-project data-analysis-python python

Last synced: 21 Mar 2025

https://github.com/easonlai/scraping_data_from_pdf

Sample code repository demonstrating how to scrape table data from a PDF file.

camelot data-analysis-python data-analytics data-scraping pdf python python3

Last synced: 26 Apr 2025

https://github.com/prak112/coursera-ibm_capstone

Cluster analysis of specific venues within a given geographical zone (district/borough)

capstone-project data-analysis-python geospatial-analysis geospatial-visualization ibm-datascience-certification

Last synced: 04 Mar 2025

https://github.com/ikanurfitriani/top-1000-instagram-influencer-2022

Simple analysis of Top 1000 Instagram Influencer Profiles 2022.

analysis data-analysis-python data-analytics data-cleaning data-visualization

Last synced: 21 Mar 2025

https://github.com/mallickboy/land-slide-prediction

📌 Landslide Detection with Satellite Imagery & Machine Learning. The CNN model was trained on multi-channel satellite images. Achieved 89.2% F1-score by using NDVI, slope, and elevation features. Optimized model accuracy through hyperparameter tuning and threshold adjustments.

cnn data-analysis-python data-visualization deep-learning hdf5-format keras-tensorflow machine-learning tensorflow

Last synced: 05 Apr 2025

https://github.com/iguptashubham/online-retail-sales

This Power BI dashboard, designed for marketing strategists, analyzes sales trends and customer behavior. It provides key insights empowering them to identify sales opportunities and optimize marketing campaigns, ultimately boosting business sales.

dashboard data data-analysis data-analysis-project data-analysis-project-powerbi data-analysis-python data-project data-science powerbi project

Last synced: 03 Mar 2025

https://github.com/sevdanurgenc/data-analytics-lecture-notes

This repo contains the course contents of the Data Analytics training for Innova Technology, delivered in cooperation with Academy Peak Information Technologies Training and Consultancy between 27 and 29 September 2021.

data-analysis-python data-analytics python

Last synced: 23 Mar 2025

https://github.com/whis99/userfunnelanalysis

An ecommerce user funnel conversion data analysis with matplotlib & python.

data-analysis data-analysis-python data-analyst data-visualization google-colab jupyter-notebook matplotlib python

Last synced: 02 Mar 2025

https://github.com/raghul-m/data-analysis-with-jupyternotebook-python

This repository consists of notebooks where I learned the basics of getting started with data analysis using Python libraries

beautifulsoup4 data-analysis-python eda jupyter-notebook matplotlib-python numpy pandas-python regular-expression seaborn-python webscraping

Last synced: 13 Mar 2025

https://github.com/geo-y20/uber-rides-data-analysis

This project aims to analyze Uber ride data to understand various aspects of ride usage, such as the distribution of rides across different categories, purposes, months, days, and times.

dashboard dashboard-templates data data-analysis data-analysis-python data-analytics data-visualization pandas powerbi python recommendation-system rides uber

Last synced: 25 Feb 2025

https://github.com/ituvtu/datamining-ab-testing

This project focuses on conducting A/B testing to evaluate the effectiveness of two marketing campaigns. Using statistical analysis and hypothesis testing, we determine which campaign is more effective in improving conversion rates.

a-b-testing data-analysis data-analysis-python data-mining ipynb jupyter jupyter-notebook python

Last synced: 16 Jan 2025

https://github.com/vara-co/space-missions

Space Missions Over Time (1957-2022): Successes vs Failures, and Rocket Usage

data-analysis data-analysis-python history matplotlib pandas pandas-python space space-race spaceships team-project

Last synced: 28 Mar 2025

https://github.com/sumidcyber/dataviz-master

This Python application provides a user-friendly interface to load and visualize the contents of a CSV file. Users can choose from various types of graphs and perform analyses on the dataset.

data-analysis data-analysis-project data-analysis-python database databases python python3

Last synced: 16 Mar 2025

https://github.com/sirinemaaroufi/music-and-mental-health

This project explores the link between music and mental health by analyzing listeners' demographics, musical preferences, and mental health data. Using machine learning, it predicts the effects of music on mental health.

data-analysis-python data-science mental-health music music-therapy predictive-modeling python

Last synced: 06 May 2025

https://github.com/lorenzopegorari/cremonaroadaccidentsanalysis

Analysis of road accidents that happened in Cremona, Lombardy, Italy, between 2009 and 2022. Made using Jupyter Notebook, pandas, seaborn and matplotlib.

data-analysis-python jupyter-notebook matplotlib pandas python seaborn

Last synced: 23 Apr 2025

https://github.com/hayatiyrtgl/data_analysis_project

Financial data analysis: preprocess, visualize, calculate technical indicators.

data-analysis data-analysis-python data-science dataframe numpy pandas python python3 stock-price-prediction talib trade-analysis

Last synced: 08 Apr 2025

https://github.com/vidhi1290/zomato-data-analysis

Zomato Data Analysis - Explore the world of Zomato restaurant data through Python and data analysis. Uncover trends and insights using Pandas for data manipulation and Matplotlib for visualization. Join us in this journey to reveal the hidden stories within the data!

data-analysis data-analysis-python data-science data-visualization dataprocessing machine-learning machine-learning-algorithms matplotlib numpy pandas python scikit-learn zomato-data-analysis

Last synced: 28 Mar 2025

https://github.com/oguzhansarigol/expected-goals-xg-data-analysis

This code analyzes the expected goals of football players: from which areas of the field, at what times, with which foot, using which parts of their bodies, and from which angles and distances they are most likely to score.

data-analysis-python expected-goals numpy pandas-python python

Last synced: 06 Apr 2025

https://github.com/soumyaco/hotel-price-data-analysis

Data analysis of Indian hotel prices. A beginner's guide to data analysis.

data-analysis-python data-science data-visualization matplotlib seaborn-plots

Last synced: 31 Mar 2025

https://github.com/camara94/acm

This page gathers teaching material for courses on factorial techniques, mainly principal component analysis (PCA; French: ACP), correspondence analysis (CA; French: AFC), multiple correspondence analysis (MCA; French: ACM), factor analysis of mixed data (FAMD; French: AFDM), and multidimensional scaling (MDS).

acm data-acquisition data-analysis-python data-mining python

Last synced: 09 Apr 2025

https://github.com/rajputrockstar/election-results-dashboard

This Streamlit application displays election results for various constituencies, parties, and states in India.

data-analysis-python pandas python python-modules python-script python3 scraping scrapy selenium-python streamlit streamlit-application streamlit-dashboard streamlit-webapp

Last synced: 22 Mar 2025

https://github.com/flyingfathead/neurograph-framework

A versatile tool for visualizing entropy loss in TensorFlow-based neural network training, providing insightful scatter plots with annotations.

data-analysis data-analysis-python data-visualization entropy graph graphs neural-network neural-networks neural-networks-visualization nn python python3 tensorflow tensorflow2 training visualization visualization-tools

Last synced: 28 Feb 2025

https://github.com/mariaorabi/data-mining-disease-tweets-analysis

Analyze tweets related to diseases using data mining techniques to derive insights and patterns.

data-analysis-python data-anlysis data-mining data-processing disease juypter-notebook nlp python tweet-analysis

Last synced: 24 Feb 2025

https://github.com/mubassim-khan/stack-overflow-developer-survey-2023

This repository contains the code for data analysis of the Stack Overflow Developer Survey 2023, including digital representations of the most used languages and much more. See the README for a more detailed overview of the repository.

data-analysis data-analysis-python matplotlib-pyplot numpy pandas-python

Last synced: 05 Mar 2025

https://github.com/agungbudiwirawan/data_science_in_telco-data_cleansing

Data cleansing using python: handling missing data values, outliers, and standardized values.

data-analysis-python data-cleansing data-science pandas python

Last synced: 31 Mar 2025

https://github.com/mastercruelty/gokart-data-hub

It manages data about gokart races and plots graphs of your times!

data-analysis-python data-science data-visualization gokart matplotlib pandas race

Last synced: 15 Apr 2025

https://github.com/soumyaco/spaceship-titanic

Famous Kaggle competition solution notebook with step by step guide.

data-analysis-python data-science kaggle-competition machine-learning python3 spaceship-titanic

Last synced: 31 Mar 2025

https://github.com/mayankyadav23/data-analysis-with-python

This repository showcases data analysis projects using Python and libraries like Numpy, Pandas, Matplotlib and Seaborn. Key projects include visualizing medical data, analyzing page view trends, and predicting sea level changes. Explore to see Python's data analysis capabilities!

data-analysis-python demographic-data-analyzer freecodecamp mean-variance-standard-deviation-calculator medical-data-visualizer page-view-time-series-visualizer sea-level-predictor

Last synced: 27 Feb 2025

https://github.com/enricogoerlitz/ml-models

This project contains my ML models and serves as documentation of them.

data-analysis data-analysis-python data-science keras-tensorflow machine-learning neural-networks python sklearn tensorflow2

Last synced: 28 Apr 2025

https://github.com/enricogoerlitz/data-analysis

This project contains my data analyses and serves as documentation of them.

business-intelligence data-analysis data-analysis-python data-visualization numpy pandas power-bi power-bi-dax python

Last synced: 28 Apr 2025

https://github.com/robertopatino1/oscars2023_data_analysis

A deep data science analysis involving tweets regarding the upcoming Academy Awards

data data-analysis-python data-science data-visualization html jupyter-notebook lda-model machine-learning python trends tweepy twitter

Last synced: 16 Jun 2025

https://github.com/gui-sitton/churn-finalproject

predict its customers' churn. If it is discovered that a user is planning to switch operator, the company will offer them promotional codes and special plan options.

churn-prediction data-analysis-python data-science data-visualization

Last synced: 18 Mar 2025

https://github.com/gustapinto/jupyter-notebooks

Some data analysis experiments expressed as notebooks

data-analysis-python jupyter-notebook

Last synced: 05 Apr 2025

https://github.com/mustika-putri-m/analysis-of-sales-transactions-in-an-online-shop---london

Crucial questions: 1. How was the sales trend over the months? 2. What are the most frequently purchased products? 3. How many products does the customer purchase in each transaction? 4. What are the most profitable customer segments? 5. Based on your findings, what strategy could you recommend to the business to gain more profit?

data data-analysis-python data-analytics data-visualization ecommerce

Last synced: 26 Mar 2025

https://github.com/camara94/afc

This page gathers teaching material for courses on factorial techniques, mainly principal component analysis (PCA; French: ACP), correspondence analysis (CA; French: AFC), multiple correspondence analysis (MCA; French: ACM), factor analysis of mixed data (FAMD; French: AFDM), and multidimensional scaling (MDS).

acp data-analysis-python data-science ia

Last synced: 09 Apr 2025

https://github.com/misaghmomenib/stock-momentum-analysis

A Python-based Data Analysis Tool Designed to Evaluate Stock Momentum. Leverages Historical Market Data to Identify Trends, Predict Price Movements, and Assist in Making Informed Investment Decisions.

data-analysis data-analysis-python data-visualization git open-source python

Last synced: 10 Apr 2025

https://github.com/melihadelalic/python-dielectronanalysis

This repository contains files and documentation from the analysis of dielectron collision events, performed using Python for data interpretation and visualization. It includes Jupyter notebooks, the dataset from CERN open data, and visualizations of the results.

anaconda-environment data-analysis-python jupyter-notebook particle-physics

Last synced: 14 Mar 2025

https://github.com/gauff/belgianelectriccarmarketanalyser

Python tool for analyzing the Belgian second-hand electric car market by scraping and visualizing data from multiple car listing websites. Features parallel web scraping, price tracking, and interactive dashboards.

automotive beautifulsoup4 car-market dash data-analysis-python data-cleaning data-visualization electric-vehicles market-analysis pandas parallel-processing plotly price-comparison price-monitoring selenium web-scraping-python

Last synced: 14 Mar 2025

https://github.com/gajendrasharma-github/app_store

Capstone Project with 2.4 Million Records

classification data-analysis-python regression

Last synced: 06 Mar 2025

https://github.com/satyamtripathi8/tools_for_data_science

Introduction to Data Science Tools (Python)

data-analysis-python matplotlib-pyplot numpy pandas

Last synced: 27 Feb 2025

https://github.com/mr-dhan/eda-sales-customer-transactions

In the competitive world of retail business, a deep understanding of customer behavior is an important foundation for strategic decision making. However, customer transaction data is often large and complex, so it requires an effective analysis process to uncover valuable insights.

dashboard data data-analysis data-analysis-python data-science data-visualization eda python

Last synced: 11 Jun 2025

https://github.com/administroot/ehs_scrutinizer_preview

An EHS data analysis application for reducing report error rates (Preview version)

data-analysis-python ehs excel sqlite3

Last synced: 16 Mar 2025

https://github.com/sajjad425/missingvalue

This repository provides a guide on handling missing values in Python, covering identification methods, imputation techniques (mean, median, mode, fill, interpolation), advanced methods (KNN, multiple imputation), and best practices. It includes practical examples for both numerical and categorical data.

data data-analysis-python data-science missing-value-handling missing-value-imputation

Last synced: 04 Apr 2025

https://github.com/sumit-sinha9/uber-rides-data-analysis

This project aims to analyze Uber ride data to understand various aspects of ride usage, such as the distribution of rides across different categories, purposes, months, days, and times.

data-analysis-python data-analytics data-visualization pandas-python powerbi python rec uber

Last synced: 12 Jun 2025

https://github.com/camara94/data_analyse_series_temporelles

In this tutorial, we will answer the following questions: 1. Read the Microsoft data using the **Pandas Data reader** package 2. Get the **maximum price** of the stock from **2017 to 2022** 3. What is the **date of the highest price** of the stock? 4. What is the **date of the lowest price** of the stock?

data-analysis data-analysis-python data-science data-structures-and-algorithms data-visualization serie series-forecasting

Last synced: 09 Apr 2025

https://github.com/arfazrll/data-analyst-dashboard

Data Analyst Dashboard is an interactive tool, built with Dash and Plotly, designed to help data analysts explore, analyze, and visualize datasets with ease.

csv-files dashboards data-analysis-python python streamlit

Last synced: 20 Feb 2025

https://github.com/toluwalase-taiwo/altschoolafrica

This repository showcases my projects and learnings from my one-year diploma course with AltSchool Africa School of Data, Data Science Track. The projects demonstrate my skills and knowledge in data science, machine learning, and programming.

data-analysis-python data-science jupyter-notebook python-programming

Last synced: 20 Feb 2025

https://github.com/sabdikay/telco-customer-churn-analysis-ibm-dataset

This project explores customer churn trends for a company in California using an IBM dataset. Built in a Jupyter Notebook, it employs pandas, NumPy, matplotlib, seaborn, plotly, and scipy to clean, analyze, and visualize data. Through statistical tests and interactive maps, it uncovers key drivers behind customer cancellations.

business-intelligence customer-churn data-analysis data-analysis-python data-visualization exploratory-data-analysis jupyter-noteboook matplotlib numpy pandas plotly predictive-modeling python scipy seaborn statistical-analysis

Last synced: 20 Feb 2025

https://github.com/giyanellow/time-series-analysis-on-philippine-debt-and-inflation

A Time Series Analysis on the Philippine Inflation Rate with some predictions using RandomForest.

data-analysis data-analysis-python machine-learning python random-forest

Last synced: 20 Feb 2025

https://github.com/oguzhansarigol/xg_analysis_python

This code analyzes the expected goals of football players: from which areas of the field, at what times, with which foot, using which parts of their bodies, and from which angles and distances they are most likely to score.

data-analysis-python expected-goals numpy pandas-python python

Last synced: 19 Dec 2024

https://github.com/fimblo/leanstats

Compute lean metrics on Kanban ticket data. Analyse cycle-time, throughput, and more.

data-analysis-python kanban lean-metrics

Last synced: 10 Mar 2025

https://github.com/controldata23/product-sales-from-amazon

This is an exploratory data analysis of the Amazon Product Sales dataset from Kaggle.

data-analysis-python data-cleaning data-exploration data-visualisation eda matplotlib

Last synced: 14 Jun 2025

https://github.com/edwinrlambert/investigating-netflix-movies

Demonstrates data analysis and visualization techniques for Netflix movies using Python in a Jupyter notebook. This is a DataCamp project.

data-analysis data-analysis-python netflix python

Last synced: 12 Mar 2025

https://github.com/thinzarhninyu/dap

Notes and Projects for Data Analysis with Python course from FreeCodeCamp.org

data-analysis data-analysis-python ipynb jupyter-notebook python

Last synced: 02 Mar 2025