An open API service indexing awesome lists of open source software.

NumPy

NumPy is an open source library for the Python programming language, adding support for large, multidimensional arrays, and matrices, along with a large collection of high-level mathematical functions to operate on these arrays.

https://github.com/rebeccamorolong/mtn-customer-churn-prdiction

This project addresses a real-world business problem: predicting customer churn for MTN, a major telecommunications company. Customer churn significantly impacts profitability, and accurate predictions enable targeted retention efforts.

anova-test matplotlib-pyplot numpy pandas python seaborn

Last synced: 29 Apr 2026

https://github.com/monddavila/online-retail-data-analysis

Online Retail Exploratory Data Analysis with Python

data-analysis jupyter-notebook matplotlib numpy pandas python seaborn

Last synced: 29 Apr 2026

https://github.com/fantaso/data-analysis-and-manipulation-with-pandas

Project consists to analyse a publicly available movie dataset found in https://www.kaggle.com/beyjin/movies-1990-to-2017 and use Python tools like Pandas in order to get some initial insights about the dataset and finally proceeding to clean, transform and save a new version of the dataset in a better structure thinking about storing the data in a database.

matplotlib numpy pandas python3 seaborn

Last synced: 29 Apr 2026

https://github.com/shwetapardhi/assignment-04-simple-linear-regression-1

Q1) Delivery_time -> Predict delivery time using sorting time. Build a simple linear regression model by performing EDA and do necessary transformations and select the best model using R or Python. EDA and Data Visualization, Feature Engineering, Correlation Analysis, Model Building, Model Testing and Model Predictions using simple linear regressi

correlation-analysis data-visualization distplot eda feature-engineering model-building model-prediction model-testing numpy ols-regression p-value pandas python regression-plot rsquare-values seaborn simple-linear-regression smf statsmodel t-score

Last synced: 30 Apr 2026

https://github.com/priyansusahoo/face_recognition

Major Project: Attendance system project using Face Recognition

face-recognition numpy opencv-python python3

Last synced: 30 Apr 2026

https://github.com/nikhilsree5/walmartcasestudy

Analysis of Customer Spending Habits at Walmart Inc

clt eda numpy pandas python3 statistics visualization

Last synced: 30 Apr 2026

https://github.com/alirezasaharkhiz9/python-for-dataanalysis

The educational repository for data manipulation and analysis with the libraries numpy, pandas, and matplotlib.

matplotlib numpy pandas

Last synced: 30 Apr 2026

https://github.com/fikri-rouzan/student-stress-levels-classification

Proyek pemodelan machine learning untuk mengklasifikasikan tingkat stres mahasiswa berdasarkan parameter input akademik dan psikologis.

joblib jupyter-notebook matplotlib numpy pandas python scikit-learn seaborn streamlit

Last synced: 08 Jun 2026

https://github.com/ayushshrivstava/es654_prereqtest

Prerequisite test for the Machine Learning course ES654

jax matplotlib numpy pandas sympy

Last synced: 30 Apr 2026

https://github.com/poltanek/crane-random-search

Skeleton code for a simulation of a problem similar to the Crane problem however, implementing using random search algorithm expending random nodes

numpy python python3 random-search-algorithm

Last synced: 01 May 2026

https://github.com/pedasoft-consult/sentiment_analysis

This project involves analyzing customer reviews to classify them as positive or negative using Logistic Regression. The workflow includes text preprocessing, feature extraction, training a model, making predictions, and evaluating its performance.

nltk numpy pandas sklearn

Last synced: 30 Apr 2026

https://github.com/darinochka/course_python_iad

ДЗ\Семинары майнора "Интеллектуальный анализ данных"

machine-learning numpy pandas python3

Last synced: 30 Apr 2026

https://github.com/beyzabasarir/spotify-youtube-eda

Exploratory Data Analysis of global songs' performance metrics on Spotify and YouTube with Python, using Jupyter Notebook.

eda exploratory-data-analysis jupyter-notebook matplotlib numpy pandas python seaborn

Last synced: 30 Apr 2026

https://github.com/lmizner/codecademy_biodiversity

Using knowledge of data analysis to interpret data about endangered animals for the National Park Service

barplot jupyter-notebook matplotlib-pyplot numpy pandas python seaborn

Last synced: 30 Apr 2026

https://github.com/johnrobertsantos/exploratory-data-analysis-using-python

This project represents the culmination of my Python learning journey.

jupyter matplotlib numpy pandas python seaborn

Last synced: 30 Apr 2026

https://github.com/badranalyst/e-commerce-customer-analysis-data-science-foundations-case-study

This case study explores e-commerce customer data through data exploration, pre-processing, and splitting. It includes model building and training to analyze customer behavior. Python libraries like Pandas, NumPy, Matplotlib, and Seaborn are used for the analysis and model development.

data-analysis data-science dataset eda exploratory-data-analysis machine-learning matplotlib ml model-building model-training numpy pandas pre-processing python seaborn

Last synced: 01 May 2026

https://github.com/sandip2512/play-store-app-review-analysis

EDA on Play Store apps and user sentiment reviews

eda heatmaps matplotlib numpy pandas plotly seaborn sentiment-analysis

Last synced: 01 May 2026

https://github.com/abdoomohamedd/python-data-analysis-projects

A collection of data analysis projects that I have worked on. Each project involves cleaning data, performing exploratory data analysis (EDA), and creating visualizations to extract insights. The projects utilize various Python libraries, including pandas, numpy, matp

data-analysis data-analysis-python data-cleaning data-visualization matplotlib matplotlib-pyplot numpy pandas python

Last synced: 01 May 2026

https://github.com/paulo-santos-ds/analise_dados_musicais

Dashboard para compreender as preferências musicais dos ouvintes, tendências regionais e o impacto de artistas específicos.

numpy pandas powerbi python

Last synced: 01 May 2026

https://github.com/aniketwithpython/fundamentals-of-scientific-computing

Fundamentals of Scientific Computing (MS1980) Notebook

jupyter matplotlib numpy pandas python

Last synced: 01 May 2026

https://github.com/laszlokorte/canny-notebook

Numpy Canny Edge detector implementation to visualize each step

canny-edge-detection image-processing jupyter-notebook numpy

Last synced: 02 May 2026

https://github.com/v-goncharenko/freespace

Implementation of signal propagation in free space

numpy python signal-processing

Last synced: 02 May 2026

https://github.com/fabriciocovalesci/sistema-vendas-graficos

Sistema simples de vendas e estoque, com gráficos usando Matplotlib

matplotlib matplotlib-pyplot numpy python-script python3 venv virtualenv

Last synced: 02 May 2026

https://github.com/vermenea/masters---ad-signaling-pathway-using-hsp90-inhibitor

Influence of 17-AAG a Hsp90 inhibitor on signaling pathway in Atopic Dermatitis

matplotlib numpy pandas python seaborn statmodels

Last synced: 02 May 2026

https://github.com/hadjuse/spotify-recommendation-knn

Here is a personal project where i use KNN project to classify songs

data-science graphics interpretation knn-classification machine-learning matplotlib music numpy plotly spotify

Last synced: 03 May 2026

https://github.com/krumyakimov/house-price-developments-in-bulgaria

I developed this project for my final exam in the Data Science course at SoftUni.

data-science jupyter-notebook matplotlib numpy pandas scipy seaborn

Last synced: 03 May 2026

https://github.com/kisaa-fatima/foreground-segmentation-using-kmeans-face-recognization-using-knn

Implement a basic version of the interactive image cut-out/segmentation approach called Lazy Snapping. The program uses K-Means Clustering to segment images into foreground and background based on user-provided seed pixels.

face-recognition foreground-segmentation kmeans-clustering knn-classification matplotlib numpy opencv2 pil

Last synced: 03 May 2026

https://github.com/k-ilyas/ai-projects-handwritten-digit-recognition-titanic-survivor-analysis

AI Projects: Handwritten Digit Recognition & Titanic Survivor Analysis using scikit-learn

machine-learning numpy pandas sickit-learn

Last synced: 03 May 2026

https://github.com/hansalemaos/a_pandas_ex_fastloc

Up to 25 times faster than df.loc by combining np.select and numexpr.evaluate (works with utf-8)

dfloc fast iloc loc numexpr numpy pandas python

Last synced: 03 May 2026

https://github.com/nak-dt/deep-decision-forest

A custom ensemble tree model inspired by Deep Forest

ai machine-learning machine-learning-algorithms numpy pandas python sklearn

Last synced: 03 May 2026

https://github.com/sadmanca/imdb-scraper

Scrapes IMDb's movie database and outputs the data to CSV files.

beautifulsoup data-scraping imdb numpy pandas python requests

Last synced: 03 May 2026

https://github.com/codeslash21/tmdb_data_analysis

We analysed TMDB dataset which contains around 11000 movies details. We analyzed to find some interesting facts about the dataset.

data-analysis data-visualization matplotlib nanodegree-project numpy pandas python tmdb-movie

Last synced: 03 May 2026

https://github.com/pramodyasahan/binary-classifier

This repository houses the code for a machine learning model designed to predict customer churn. The model is built using Support Vector Machine (SVM) from the scikit-learn library and incorporates preprocessing, pipeline, and grid search techniques for optimal performance.

numpy pandas scikit-learn

Last synced: 03 May 2026

https://github.com/zongxr/dcic2025-riverwaterprediction

闽江,福建省最大独流入海河流,流域降水丰沛。在新型电力系统和新阶段水利高质量发展的环境下,如何缓解流域防汛压力,提升水资源利用率是水电企业高质量发展的重中之重。目前,基于传统水文学原理已实现未来9小时甲级精度的洪水预报,但存在遇见期与预见精度上仍有提升空间,因此,通过结合人工智能技术实现流域入库流量的精准预测,对保障水库安全调度、防洪及清洁能源稳定供应具有重大战略意义。

lightgbm machine-learning numpy pandas sklearn time-series xgboost

Last synced: 03 May 2026

https://github.com/manikantasanjay/youtube_exploratory_data_analytics

This is a Mini Project made using Python Libraries like Numpy, Pandas, Matplotlib etc to perform Data Analytics on US Video and Comments dataset.

jupyter-notebook matplotlib numpy pandas plotly python seaborn wordcloud

Last synced: 03 May 2026

https://github.com/syed-m-nofel/python-data-science-fundamentals

Python notebooks for data manipulation (Pandas/NumPy) and API workflows – from basics to practical examples.

api beginner-friendly data-analysis data-science http-requests jupyter-notebook numpy pandas pandas-dataframe python tutorial

Last synced: 03 May 2026

https://github.com/asayda01/airport_operations_analytics

The project analyzes the performance of Hold Baggage Screening X-ray Machines, identifying trends in system load, bottlenecks, and operator efficiency. Using an interactive Streamlit app, users can explore data-driven insights to optimize airport screening processes and improve overall operational performance.

data-science dataanalysis datainsights datavisualization numpy pandas python streamlit

Last synced: 03 May 2026

https://github.com/coderooz/handwritten_digit_recognition

This project uses a Convolutional Neural Network (CNN) to recognize handwritten digits from the MNIST dataset. The model is built using TensorFlow and Keras.

colab handwritten-digit-recognition image-classification matplotlib neural-network numpy python tensorflow

Last synced: 03 May 2026

https://github.com/naritanara/tf-test

Mostres de ML i IA amb TensorFlow basades en els tutorials oficials per al PR

numpy python research-project tensorflow tensorflow-tutorials

Last synced: 03 May 2026

https://github.com/fnfkkengine/website-performance-data-analysis-project

🌐 Analyze website performance data to gain insights on user engagement, traffic trends, and optimize your digital presence effectively.

analysis analytics dashboard data-analyst data-collection data-preprocessing data-visualization dax funnel-analysis jupyter-notebook matplotlib mysql numpy paid-search power-query project-management python storytelling

Last synced: 04 May 2026

https://github.com/anas436/ibm-cloud-data-analysis-of-cars

Exploratory Data Analysis of Cars in Python

ibmwatson matplotlib numpy pandas python3 seaborn

Last synced: 04 May 2026

https://github.com/aymanmomin/facebook-ego-network-analysis-for-political-outreach-optimization

Analyzing Facebook ego networks to identify influential users and communities for optimizing political advertising. Leverages network science techniques (Louvain communities, centrality metrics, threshold models) to simulate influence spread.

community-detection course-project facebook-network-analysis louvain-algorithm louvain-community-detection matplotlib network-analysis network-graphs networkx-library numpy pandas python3 pyvisgraph research-paper scipy-stats sklearn-metrics tqdm

Last synced: 04 May 2026

https://github.com/timothyjan/intro-machine-learning-polynomial-regression

We use NumPy to implement vectorized linear and polynomial regression models and compare their performance using separate training and test sets.

numpy

Last synced: 04 May 2026

https://github.com/jalijuhola/neurcomputing-course

Some of tasks from neurocomputing course

mypy numpy python

Last synced: 04 May 2026

https://github.com/damisparks/become_data_analyst

Are you new to Data Analysis ? Here you will find simple notebook that will help through your journey. These are personal projects I work on and still working.

data data-analysis data-visualization matplotlib numpy pandas-tutorial

Last synced: 04 May 2026

https://github.com/arsalan692/python-telusko-playlist

A collection of beginner Python scripts and practice exercises from my early learning days, inspired by Telusko and other resources.

basic-programming bs4 matplotlib numpy oop pandas python requests selenium tkinter

Last synced: 04 May 2026

https://github.com/charlenry/machine_learning_estimer_prix_voiture

Projet de Machine Learning pour estimer le prix d'une voiture d'occasion via une application

joblib jupyter-notebook matplotlib-pyplot numpy oop-python pandas plotly-express python sklearn-linear-model sklearn-linear-regression

Last synced: 04 May 2026

https://github.com/florasteve/ml-foundations-day1

Day-1 ML foundations focused on linear algebra: vectors, dot products, norms, angles, projections, and basic matrix operations—implemented in a Jupyter notebook with NumPy/Matplotlib, clear 2D visuals, a self-quiz, and a brief reflection. Emphasizes how these math primitives map to core ML/AI ideas (similarity, least squares, geometric transforms).

ai data-science education jupyter-notebook linear-algebra machine-learning matplotlib numpy python3 terminal-mac tutorial visualization vscode

Last synced: 04 May 2026

https://github.com/sugha22/data_analysis

Actively upskilling in data science with hands-on learning during a Green Internship focused on environmental sustainability. Used Pandas and NumPy for data preprocessing and cleaning, and created visual dashboards in Excel and Tableau. Gained experience in interpreting sustainability metrics and communicating insights through data storytelling and

data-cleaning data-collection data-storytelling data-transformation excel numpy pandas preprocessing sustainability-metrics tableau-dashboards visualization

Last synced: 04 May 2026

https://github.com/secary/maths7027

Mathematical Foundations of Data Science

latex mathematics numpy pandas

Last synced: 04 May 2026

https://github.com/ronitjariwala/-prodigy_ds_01

Prodigy InfoTech Data Science Internship Task-1

matplotlib numpy pandas python seaborn

Last synced: 04 May 2026

https://github.com/jakubfr4czek/apartment-prices-analysis

This repository contains an analysis of apartments prices in Krakow. It was made for "Rachunek prawdopodobieństwa i statystyka" (probability and statistics) course i took at AGH University of Science.

agh agh-wi analysis csv jupyter jupyter-notebook krakow numpy pandas price-analysis price-prediction probability probability-statistics python statistics statistics-learning

Last synced: 04 May 2026

https://github.com/abhinav330/911-emergency-calls-analysis

This Python Notebook analyzes emergency call data from the '911.csv' dataset. It uses various data visualization techniques to explore and gain insights into the emergency call data, including the types of calls, reasons for calls, and call patterns over time.

data-analysis data-science data-visualization eda exploratory-data-analysis exploratory-data-visualizations numpy pandas python

Last synced: 09 Jun 2026

https://github.com/abhishekyadav915/e-commerce-sales-analysis

E-Commerce Sales Analysis is a data analysis project that explores sales data from an e-commerce platform to uncover insights and trends. The analysis includes visualizing sales performance, customer behavior, and product trends to help optimize business strategies and improve customer satisfaction.

matplotlib-pyplot numpy pandas-library plotly python3

Last synced: 04 May 2026

https://github.com/pabs-code/img-cartoonizer-using-opencv

A streamline app using 3 ways to cartoonized an image using OpenCV and Python.

bilateral-filtering color-quantization edge-detection edge-enhancement laplacian-edge-detection numpy opencv python

Last synced: 04 May 2026

https://github.com/siddhantborse/atmosviz

Atmos Viz is a Python-based project designed to analyze, visualize, and predict global temperature trends across various cities and countries using time-series analysis and advanced data science techniques. Leveraging historical climate data, this project integrates machine learning models, geospatial mapping, and interactive visualizations to unco

geopandas geospatial-analysis gis matplotlib numpy pandas plotly python scikit-learn seaborn shapefiles time timeseries-analysis timeseries-data

Last synced: 05 May 2026

https://github.com/jokoum/sentiment-classification

Multi-layer Perceptron classification of product reviews written in Greek language

flask machine-learning multilayer-perceptron-network neural-network numpy python sentiment-classification

Last synced: 05 May 2026

https://github.com/thekartikeyamishra/object-detection-for-retailer

The AI Customer Feedback Summarizer is a Python-based tool that uses extractive summarization techniques to provide concise summaries of customer feedback.

ai computer-vision cv2 numpy pillow python torch torchvision yolov5

Last synced: 05 May 2026

https://github.com/burhanali2211/marksanalyzer

This is a simple CLI-based tool that helps analyze student marks by calculating key statistics such as the highest and lowest scores.

analysis analyzer arrays marks numpy

Last synced: 05 May 2026

https://github.com/iamwatchdogs/retail-insights-from-superstore-data

An simple Data Analysis Project for finding insight within the Super store data set

conda-environment data-analytics jupyter-notebook matplotlib numpy pandas python

Last synced: 05 May 2026

https://github.com/vc-nithesh944/pandas_numpy

Practice repository for NumPy & Pandas with scripts, notebooks, and notes for future revision

numpy pandas

Last synced: 05 May 2026

https://github.com/kbunwong/stock-price-prediction-using-lstm

This project is about predicting stock prices with more accuracy using LSTM algorithm. For this project we have fetched real-time data from yfinance library.

google-colab jupyter-notebook keras-tensorflow lstm-model machine-learning matplotlib numpy pandas python visualization yfinance

Last synced: 05 May 2026

https://github.com/sowmi0304/predictive-modeling-titanic-survival-prediction

Machine learning project to predict passenger survival on the Titanic using classification algorithms, data preprocessing, and model evaluation techniques.

machine-learning matplotlib numpy pandas python seaborn

Last synced: 05 May 2026

https://github.com/elijahondiek/data-analysis-udacity

Udacity Data Analysis Nano Degree course resources

jupyter-notebook matplotlib numpy pandas python

Last synced: 05 May 2026

https://github.com/hitthecodelabs/petalanalyticsstreamlit

Web application developed with Streamlit that predicts the Iris flower type based on its physical features

matplotlib model numpy pickle python scikit-learn sklearn streamlit

Last synced: 05 May 2026

https://github.com/rayan-alam-uoit/space-x-launch-prediction-tool

A full-stack data science project analyzing SpaceX launch data using web scraping, REST APIs, SQL, interactive visualizations, and machine learning in Python to explore trends and predict launch success.

dash data-science folium jupyter-notebook machine-learning matplotlib numpy pandas plotly python seaborn sql sqlite

Last synced: 05 May 2026

https://github.com/ranaessam03/climate-prediction

A machine learning project to predict rainfall using Decision Tree, k-Nearest Neighbors (kNN), and Naïve Bayes, with preprocessing, model evaluation, and custom kNN implementation.

decision-trees knn-classification machine-learning naive-bayes-classifier numpy sklearn

Last synced: 05 May 2026

https://github.com/ahmedabdalkreem/airline_passenger_satisfaction

In this project, we work in machine learning project that show the satisfacion of customer from the service that have in airline this service make it like this experience or not.

decision-trees gradient-boosting knn logistic-regression machine-learning matplotlib naive-bayes numpy pandas preprocessing python random-forest svm visualization

Last synced: 05 May 2026

https://github.com/anushkundu/crime-pattern-analysis

Analyzing Crime Patterns in Montgomery County, USA: An Inclusive Study Based on NIBRS Data (2016-2022)

data-analysis data-visualization descriptive-statistics matplotlib numpy pandas python seaborn

Last synced: 05 May 2026

https://github.com/pyladiesams/python-oss-sprints-mar2024

Sprints to contribute to the most popular data and AI Python OSS libraries

conda dask geopandas jupyter matplotlib numpy pandas pymc sklearn

Last synced: 05 May 2026

https://github.com/abdelrahman-amen/neural_style_transfer

This project applies neural style transfer to create new images by merging the content of one image with the artistic style of another.

neuralstyletransfer numpy os python streamlit tensorflow

Last synced: 05 May 2026

https://github.com/waghraj1699/fraud-detection-project

A fraud detection system with machine learning will be able to detect risk based on your historical data

data-processing decision-tree-classifier machine-learning numpy pandas

Last synced: 05 May 2026

https://github.com/nxhawk/mln

Machine learning is a branch of artificial intelligence (AI) and computer science which focuses on the use of data and algorithms to imitate the way that humans learn, gradually improving its accuracy.

decision-tree-classifier genetic-algorithm jupyter-notebook machine-learning-algorithms numpy pandas sklearn-library

Last synced: 06 May 2026

https://github.com/vikkiezdev/ai-global-index-analysis

This project analyzes the AI readiness of 62 countries using key indicators like government strategy, commercial activity, research, development, and infrastructure. Through data cleaning, EDA, and visualization, it identifies key drivers of AI adoption and competitiveness.

cleaning-data correlation-analysis eda matplotlib numpy pandas python3 seaborn statistical-analysis

Last synced: 06 May 2026

https://github.com/leavers/npblob

Transfer numpy array between Python and JavaScript losslessly

array javascript matrix numpy python streaming typescript

Last synced: 06 May 2026

https://github.com/shahriarha/programming

Python problem solving, HR-30 days of code, DSA, NumPy

algorithms-and-data-structures cpp numpy programming python

Last synced: 06 May 2026