An open API service indexing awesome lists of open source software.

NumPy

NumPy is an open source library for the Python programming language, adding support for large, multidimensional arrays, and matrices, along with a large collection of high-level mathematical functions to operate on these arrays.

https://github.com/hansalemaos/a_pandas_ex_fastloc

Up to 25 times faster than df.loc by combining np.select and numexpr.evaluate (works with utf-8)

dfloc fast iloc loc numexpr numpy pandas python

Last synced: 03 May 2026

https://github.com/nak-dt/deep-decision-forest

A custom ensemble tree model inspired by Deep Forest

ai machine-learning machine-learning-algorithms numpy pandas python sklearn

Last synced: 03 May 2026

https://github.com/sadmanca/imdb-scraper

Scrapes IMDb's movie database and outputs the data to CSV files.

beautifulsoup data-scraping imdb numpy pandas python requests

Last synced: 03 May 2026

https://github.com/mzayles/data-preparation.py

Curso Programação em Python para Data Science | Preparando e tratando dados de uma base de vendas com Pandas e NumPy.

numpy pandas python

Last synced: 03 May 2026

https://github.com/akhil-krishnan-r/super_market_analysis

The growth of supermarkets in most populated cities are increasing and market competitions are also high. This dataset is one of the historical sales of supermarket company which has recorded in 3 different branches for 3 months data. Predictive data analytics methods are easy to apply with this dataset

data-visualization matplotlib numpy pandas seaborn

Last synced: 03 May 2026

https://github.com/kaustavmodak/business-aided-customer-feedback-assessment-system

A Streamlit-based sentiment analysis app that classifies customer reviews into Positive, Neutral, or Negative using a pre-trained ML mode

framework machine-learning matplotlib nlp nltk numpy pandas pickle regex scikit-learn seaborn sentiment-analysis streamlt tfidf-vectorizer

Last synced: 03 May 2026

https://github.com/jonad/finding_donors

Predicting income with UCI Census Income Dataset using supervised machine learning algorithms

numpy pandas scikit-learn scikitlearn-machine-learning

Last synced: 03 May 2026

https://github.com/muskanmi/data_analysis_python

Data analysis on students result dataset using python libraries.

boxplot countplots data-analysis numpy pandas pie-chart python3 seaborn

Last synced: 03 May 2026

https://github.com/lmizner/codecademy_product_defects

Practicing rules of probability, set theory, and distributions

cdf jupyter-notebook numpy pmf ppf python rvs scipy-stats

Last synced: 03 May 2026

https://github.com/alvii147/discretetimelib

Python library for analysis of discrete time signals and systems

numpy pandas scipy signals-and-systems sympy

Last synced: 03 May 2026

https://github.com/schukark/intervalpylib

Interval solver of systems of equations to find the workspace area of a robot

krawczyk matplotlib numpy python3 robotics solving-algorithm sympy systems-of-equations visualization

Last synced: 03 May 2026

https://github.com/feguedi/clasificador-numeros

Proyecto final de la materia de Inteligencia Artificial donde se desarrolla un software que pueda clasificar los números de la base de datos MNIST.

artificial-intelligence numpy python3

Last synced: 03 May 2026

https://github.com/fnfkkengine/website-performance-data-analysis-project

🌐 Analyze website performance data to gain insights on user engagement, traffic trends, and optimize your digital presence effectively.

analysis analytics dashboard data-analyst data-collection data-preprocessing data-visualization dax funnel-analysis jupyter-notebook matplotlib mysql numpy paid-search power-query project-management python storytelling

Last synced: 04 May 2026

https://github.com/aymanmomin/facebook-ego-network-analysis-for-political-outreach-optimization

Analyzing Facebook ego networks to identify influential users and communities for optimizing political advertising. Leverages network science techniques (Louvain communities, centrality metrics, threshold models) to simulate influence spread.

community-detection course-project facebook-network-analysis louvain-algorithm louvain-community-detection matplotlib network-analysis network-graphs networkx-library numpy pandas python3 pyvisgraph research-paper scipy-stats sklearn-metrics tqdm

Last synced: 04 May 2026

https://github.com/bpkaur/whats-in-a-name

Exploring dataset of first names of babies born in the US in order to uncover interesting stories

data-analysis datacamp numpy pandas python3

Last synced: 04 May 2026

https://github.com/ortanav2/parabola-visualizer

Simple parabola-visualizer made with matplot+numpy [Formula: a(x+d)²+e]

mathematics matplotlib numpy parabola visualization

Last synced: 04 May 2026

https://github.com/jalijuhola/neurcomputing-course

Some of tasks from neurocomputing course

mypy numpy python

Last synced: 04 May 2026

https://github.com/damisparks/become_data_analyst

Are you new to Data Analysis ? Here you will find simple notebook that will help through your journey. These are personal projects I work on and still working.

data data-analysis data-visualization matplotlib numpy pandas-tutorial

Last synced: 04 May 2026

https://github.com/abhivur/graduate-income-forecaster

Contributors: Abdussalam Raheem, Chiara Su, and Joseph Botros

matplotlib numpy pandas python scikit-learn seaborn

Last synced: 04 May 2026

https://github.com/arjunravi26/deep_learning

This repo contains codes and notes to learn deep learning

ann autoencoders cnn deep-learning deepseek-r1 lstm numpy rnn-tensorflow tensorflow transformer

Last synced: 04 May 2026

https://github.com/sweta-kaundilya/python_for_data_analysis

Learning Python and all the relevant libraries in python for Data field.

cufflinks data-analysis data-science matplotlib numpy pandas plotly python seaborn

Last synced: 04 May 2026

https://github.com/sumaiyyaf/movie-industry-analysis

This project focuses on the acquisition and in-depth analysis of movie data to generate strategic recommendations, guiding our hypothetical company's successful entry into the movie industry.

exploratory-data-analysis numpy pandas python visualization webscraping

Last synced: 04 May 2026

https://github.com/sugha22/data_analysis

Actively upskilling in data science with hands-on learning during a Green Internship focused on environmental sustainability. Used Pandas and NumPy for data preprocessing and cleaning, and created visual dashboards in Excel and Tableau. Gained experience in interpreting sustainability metrics and communicating insights through data storytelling and

data-cleaning data-collection data-storytelling data-transformation excel numpy pandas preprocessing sustainability-metrics tableau-dashboards visualization

Last synced: 04 May 2026

https://github.com/nordic96/lanedetection

Lane Detection Module using OpenCv and Python for educational purpose.

computer-vision lane-detection matplotlib-pyplot numpy opencv python3

Last synced: 04 May 2026

https://github.com/saifalibaig/crop-yield-prediction

🌾 A machine learning-based crop production prediction system using historical Indian agricultural data with advanced regression models and hyperparameter tuning.

eda feature-encoding feature-selection kaggle-dataset lasso-regression linear-regression matplotlib-pyplot numpy onehot-encoding pandas python3 ridge-regression seaborn simpleimputer xgboost-regression

Last synced: 04 May 2026

https://github.com/ronitjariwala/-prodigy_ds_01

Prodigy InfoTech Data Science Internship Task-1

matplotlib numpy pandas python seaborn

Last synced: 04 May 2026

https://github.com/0nom4d/308reedpipes

Mathematical EPITECH Project - This project computes a representation of a reedpipe.

coverage epitech epitech-project matplotlib-pyplot numpy pytest python scipy

Last synced: 04 May 2026

https://github.com/youssefyaser/scrape-the-imdb-site-for-the-top-250-movies

Web scraping the top 250 movies in IMDB site.

data-analysis numpy pandas python

Last synced: 04 May 2026

https://github.com/hanzopgp/randomkagglesolving

Trying to build models to resolve some interesting Kaggle datasets thanks to deep learning.

artificial-intelligence deep-learning jupyter-notebook machine-learning matplotlib numpy personal tensorflow

Last synced: 04 May 2026

https://github.com/chathumiamarasinghe/nn-training-model

A comprehensive project for training neural networks to solve real-world problems. This repository includes customizable code for building, training, and evaluating neural network architectures using popular deep learning frameworks.

jupyter-notebook matplotlib numpy phyton scikit-learn

Last synced: 04 May 2026

https://github.com/srevenant/data-science-alpine

A docker container for data science, using alpine linux and python3

alpine data numpy pandas python3 science scipy xgboost

Last synced: 05 May 2026

https://github.com/bhaskarkulshrestha/rock-vs-mine-prediction

A machine learning project to predict whether an object detected underwater is a rock or a mine using Logistic Regression.

colab-notebook logistic-regression machine-learning numpy pandas python sckit-learn supervised-learning

Last synced: 05 May 2026

https://github.com/badranalyst/residential-unit-prices-data-analysis-application

Python-based analysis of residential unit prices, focusing on data cleaning, visualization, and exploratory data analysis (EDA). Key features include price distribution, and correlation analysis between factors like size, location, and pricing.

data-analysis data-visualization dataset matplotlib numpy pandas python seaborn

Last synced: 05 May 2026

https://github.com/lruizap/tomography-with-algebraic-topology

This project aims to analyze images using algebraic topology. The goal is to extract structural information from images through mathematical concepts such as connected components and loops.

gudhi jupyter-notebook matplotlib numpy opencv python tomography

Last synced: 05 May 2026

https://github.com/cicku/en.650.672

HW of EN.650.672

analytics data-analysis numpy pandas

Last synced: 05 May 2026

https://github.com/ranaessam03/climate-prediction

A machine learning project to predict rainfall using Decision Tree, k-Nearest Neighbors (kNN), and Naïve Bayes, with preprocessing, model evaluation, and custom kNN implementation.

decision-trees knn-classification machine-learning naive-bayes-classifier numpy sklearn

Last synced: 05 May 2026

https://github.com/tapas-gope/diwali-sales-analysis

Analyze Diwali Sales Data to improve customer experience and sales.

exploratory-data-analysis matplotlib-pyplot numpy pandas python seaborn

Last synced: 05 May 2026

https://github.com/ahmedabdalkreem/airline_passenger_satisfaction

In this project, we work in machine learning project that show the satisfacion of customer from the service that have in airline this service make it like this experience or not.

decision-trees gradient-boosting knn logistic-regression machine-learning matplotlib naive-bayes numpy pandas preprocessing python random-forest svm visualization

Last synced: 05 May 2026

https://github.com/batrick-swaistan/nalaiya_thiran

Nalaiya Thiran IBM Course Training

matplotlib numpy pandas-dataframe python

Last synced: 05 May 2026

https://github.com/jonad/quora_data_analysis

Exploratory data analysis and feature engineering on Quora dataset.

exploratory-data-analysis feature-extraction numpy pandas python3 quora-dataset

Last synced: 05 May 2026

https://github.com/sahaninilesh/future_ds_01_social-media-sentiments-analysis

The Social Media Sentiments Analysis Dataset captures a vibrant tapestry of emotions, trends, and interactions across various social media platforms. This dataset provides a snapshot of user-generated content, encompassing text, timestamps, hashtags, countries, likes, and retweets. Each entry unveils unique stories—moments of surprise, excitement,

jupyter matplotlib numpy pandas python

Last synced: 05 May 2026

https://github.com/amruta33/lead_scoring_case_study

An education company named X Education sells online courses to industry professionals. On any given day, many professionals who are interested in the courses land on their website and browse for courses.

bagging linear-regression numpy pandas python3

Last synced: 05 May 2026

https://github.com/ayushdutta-ai/face-recognition-model

A custom face recognition model built from scratch using Python, OpenCV, Pillow, and Face libraries, utilizing LBPH for efficient facial feature extraction and accurate identification.

face haar-cascade lbph-face-recognizer numpy opencv pillow

Last synced: 05 May 2026

https://github.com/waghraj1699/fraud-detection-project

A fraud detection system with machine learning will be able to detect risk based on your historical data

data-processing decision-tree-classifier machine-learning numpy pandas

Last synced: 05 May 2026

https://github.com/hms75/movie_rating_analysis

A movie rating analysis which identifies trends amongst a dataset of 5000 movies.

data-analysis data-visualization matplotlib-pyplot numpy pandas python

Last synced: 05 May 2026

https://github.com/romanectatyana/fuel_norm

Practical data analytics case: calculating and optimizing fuel consumption norms for vehicles and heavy equipment. (Аналітичний проєкт із визначення норм витрат палива для автотранспорту та спецтехніки.)

matplotlib numpy pandas plotly power-bi

Last synced: 06 May 2026

https://github.com/eshansugeesh/fico-score-loan-default-modeling-project

Credit risk assessment using FICO score segmentation, loan default modeling, discretization techniques, and log-likelihood evaluation for predictive analytics in financial services.

bucketing classification credit-risk customer-segmentation data-science discretization fico-score financial-analytics loan-analysis loan-default log-likelihood machine-learning numpy pandas predictive-modeling risk-modeling scikit-learn segmentation statistical-modelling

Last synced: 06 May 2026

https://github.com/abdelrahman-amen/active_learning_with_different_query_strategies

This project explores the implementation of active learning techniques, focusing on various query strategies to optimize the selection of informative data points for model training. It aims to reduce the amount of labeled data required while improving model performance, especially in scenarios with limited labeled data.

activelearning cuda entropy kldivergence margin numpy python pyto uncertainty

Last synced: 06 May 2026

https://github.com/skorupcia/winequality-white

winequality-white.csv visualization

matplotlib numpy pandas seaborn sklearn

Last synced: 06 May 2026

https://github.com/abhinav330/customer-behavior-analysis-linear-regression

This repository explores customer behavior data for an NYC clothing company with both a mobile app and website. They want to understand which platform drives higher sales.

data-analysis data-science data-visualization eda exploratory-data-analysis jupyter jupyter-notebook linear-regression machine-learning machine-learning-algorithms machinelearning-python numpy pandas python regression-analysis

Last synced: 06 May 2026

https://github.com/kunalpisolkar24/gradientdescent

A Python implementation of gradient descent for linear regression, visualized with cost function history, parameter trajectory, and a 3D cost surface.

cost-function gradient-descent numpy visualization

Last synced: 06 May 2026

https://github.com/sopheakchan/dull-razor-algorithm

DullRazor hair removal for dermoscopic skin images.

numpy opencv python

Last synced: 06 May 2026

https://github.com/fazal-coder/capturing_img

python code for Creating blue color image

loop numpy opencv processing python scheduler threading

Last synced: 06 May 2026

https://github.com/elmezianech/classifyreviews_nlp

Revolutionize customer feedback analysis with our NLP Insights Analyzer. Utilize cutting-edge text preprocessing to transform raw reviews into a machine-friendly format. Explore sentiment models, such as Logistic Regression and Naive Bayes, employing cross-validation for model robustness.

accuracy-score ai countvectorizer cross-validation joblib logistic-regression machine-learning ml multinomialnb naive-bayes-classifier nltk-library numpy pandas randomforestclassifier svc

Last synced: 06 May 2026

https://github.com/christiansassi/signal-image-and-video-project

Project developed by Pietro Bologna (@bolognapietro) and Christian Sassi for the Signal, Image and Video course.

image-processing mri-segmentation numpy opencv python

Last synced: 06 May 2026

https://github.com/zotho/grav_euler

Euler gravity compute

gravity numpy python

Last synced: 06 May 2026

https://github.com/voorhs/ml-practice

ML algorithms implementation

aimasters lomonosov-msu machine-learning numpy python

Last synced: 07 May 2026

https://github.com/malintha-induwara/python-for-ds

This repository contains my personal notes, exercises, and examples for Python

numpy pandas python webscraping

Last synced: 07 May 2026

https://github.com/waghraj1699/ipl-win-probability-predictor

Web app to predict live probability of win percentage of match

data-preprocessing machine-learning ml-algorithms numpy pandas

Last synced: 07 May 2026

https://github.com/jonad/titanic_survival

Basic decision tree model for Titanic dataset. Predict who will survive.

jupyter-notebook matplotlib numpy pandas python3 titanic-survival-prediction

Last synced: 07 May 2026

https://github.com/suhas-005/jovian-data-analysis-course-assignment

These are my assignments for Data Analysis : Zero to Pandas course by Jovian.ai

data-analysis data-analytics numpy pandas python

Last synced: 07 May 2026

https://github.com/miss-mhv/retail_strategy_and_analytics

The purpose of this work is to analyze data from a supermarket to better understand the types of customers who buy chips and their buying behavior in the area.

jupyter-notebook numpy pandas python3

Last synced: 07 May 2026

https://github.com/jonad/chicago_crime_analysis

Exploratory data analysis on Chicago Crime dataset.

jupyter-notebook matplotlib numpy pandas python3

Last synced: 07 May 2026

https://github.com/s-hirata0831/graphicsprocessing

Pythonによる画像処理基礎実験(with OpenCV)

numpy opencv python

Last synced: 07 May 2026

https://github.com/pointer2alvee/machine-learning-algorithms-scratch

Core Machine Learning concepts , algorithms & maths implemented completely from scratch using Python & numpy only.

machine-learning-algorithms numpy pandas probabilistic-machine-learning python statistical-models

Last synced: 07 May 2026

https://github.com/krishbharadwaj5678/gestureninja

Play Fruit Ninja using just your index finger with real time gesture tracking. No mouse, no keyboard — just your webcam and smooth slicing.

automation computer-vision cvzone finger-tracking fruit-ninja game-development gamedev gesture-recognition hand-tracking mediapipe mouse mouse-control numpy opencv pyautogui python realtime virtualmouse webcam

Last synced: 07 May 2026

https://github.com/shefreenkaur/web-scraping-and-word-frequencies

This project analyzes word frequencies in BC Legislative documents using Stanford CoreNLP and Python. The program extracts text from PDF documents, processes it using natural language processing techniques, and generates a comprehensive word frequency analysis.

analytics chromedriver easyocr nlp numpy pandas pymupdf python selenium stanfordcorenlp webscraping wordfrequency

Last synced: 07 May 2026

https://github.com/mnitin-reddy/coffee-shop-sales-analytics

This project analyzes coffee shop sales data to uncover key insights, focusing on customer behavior, product performance, and sales trends. Using Pandas, Matplotlib, and Seaborn, the analysis identifies peak sales hours, popular product categories, and opportunities for upselling or bundling.

datasceince datavisualization exploratory-data-analysis matplotlib numpy pandas python seaborn

Last synced: 07 May 2026

https://github.com/ebadshabbir/bluff-detection-model-polynomial_regression-

This project predicts employee salaries based on position levels using Linear and Polynomial Regression models. It trains models with degrees 2, 3, and 4 on a dataset of job titles, position levels, and salaries, and predicts the salary for a position level of 6.5. The results are visualized to compare model performance.

jupyter-notebook linear-regression machine-learning matplotlib-pyplot multiple-linear-regression numpy polynomial-regression python

Last synced: 07 May 2026

https://github.com/imskully/python-cv-fault-detection

Fault detection algorithm that determines whether or not a fault exists within a ring using raw computer vision.

ccl computer-vision matplotlib morphology numpy opencv python

Last synced: 07 May 2026

https://github.com/hackintoshrao/first-neural-network

Simple Neural Network with a hidden layer written from scratch using Numpy for prediction the bike sharing sales

deep-learning neural-networks numpy python

Last synced: 07 May 2026

https://github.com/pocketfullofdata/electric-vehicles-market-size-analysis

This project analyzes the growth, adoption trends, and future projections of the electric vehicle (EV) market. Using data analysis and visualization techniques, it examines key factors like sales trends, and consumer adoption to understand the evolving landscape of the EV industry.

analysis data jupyter-notebook matplotlib numpy python seaborn vscode

Last synced: 07 May 2026

https://github.com/pereira-elizabeth/aah-2d-hermitian

Minimal 2D AAH model: build, diagonalize, and plot spectrum (NumPy/Matplotlib)

condensed-matter numpy physics scipy tight-binding

Last synced: 07 May 2026

https://github.com/jnayak1/naive_bayes_cars

Using Naive Bayes and SGD ML algorithms to predict if a new car will decrease in price

cars machine-learning naive-bayes numpy sklearn

Last synced: 07 May 2026

https://github.com/prajakta1321/exoplanet-atmospheric-characterization-1

A machine learning project to classify exoplanets using light curve image data. Developed as part of the ML4SCI GSoC 2025 Test Task. Includes data processing, CNN-based model, and full report.

classification colab-notebook dbscan gsoc-2025 machine-learning-algorithms matplotlib-python ml numpy open-source pca-analysis python3 seaborn

Last synced: 07 May 2026

https://github.com/navid-kalaei/numpy-neural-network

Neural network implemented in Numpy to predict bike sharing on real data

deep-learning neural-network numpy python

Last synced: 07 May 2026

https://github.com/tinaland101/carcinoma-regression-project

The goal of this project is to analyze and visualize the results of a clinical study involving the treatment of squamous cell carcinoma (SCC) in mice. As a senior data analyst, you are tasked with preparing a technical report for Pymaceuticals, Inc.

linear-regression mathplotlib numpy pandas pyhton

Last synced: 08 May 2026