An open API service indexing awesome lists of open source software.

NumPy

NumPy is an open source library for the Python programming language that adds support for large, multidimensional arrays and matrices, along with a large collection of high-level mathematical functions to operate on these arrays.

https://github.com/hansalemaos/a_pandas_ex_split_on_common_char

Splits the elements in a Series using a common separator character and returns a DataFrame.

numexpr numpy pandas separator split string

Last synced: 14 Jan 2026

https://github.com/anikdascodes/fantasy-cricket-project

🏏 Python simulation for generating 20K+ unique fantasy cricket teams with probability-based player selection and constraint optimization. Advanced algorithms, data analysis & visualizations included.

algorithm constraint-satisfaction cricket data-science fantasy-sports jupyter machine-learning numpy optimization pandas plotly probability python simulation sports-analytics

Last synced: 28 Apr 2026

https://github.com/bdurga26/fake-news-detection

This project aims to determine whether a given news item is real or fake using machine learning algorithms.

ds ml numpy pandas python sklearn

Last synced: 14 Apr 2026

https://github.com/camsai/jupyterlite

CAMSAI JupyterLite is a lightweight, browser-based environment tailored for AI-driven materials science research. It integrates CAMSAI tools, schemas, and workflows, enabling users to validate data, run simulations, and explore materials science applications without requiring local installation or setup.

chemistry jupyterlite materials materials-design materials-informatics materials-science matplotlib numpy pyodide python scipy

Last synced: 02 May 2026

https://github.com/andremenezesds/machine_learning_test

Testing various machine learning models and parameters to further study their behaviour in classification, regression, and clustering analysis.

classification-models clustering-models machine-learning numpy pandas regression-models sklearn

Last synced: 02 May 2026

https://github.com/hadjuse/spotify-recommendation-knn

A personal project where I use KNN to classify songs.

data-science graphics interpretation knn-classification machine-learning matplotlib music numpy plotly spotify

Last synced: 03 May 2026

https://github.com/bhavna-kale/cars-eda-project

Project analyzing used car market data to identify high-impact price drivers and depreciation curves, presented through an interactive web application.

data-analysis excel matplotlib numpy pandas python3 searborn streamlit

Last synced: 03 May 2026

https://github.com/kisaa-fatima/foreground-segmentation-using-kmeans-face-recognization-using-knn

Implement a basic version of the interactive image cut-out/segmentation approach called Lazy Snapping. The program uses K-Means Clustering to segment images into foreground and background based on user-provided seed pixels.

face-recognition foreground-segmentation kmeans-clustering knn-classification matplotlib numpy opencv2 pil

Last synced: 03 May 2026

https://github.com/hansalemaos/a_pandas_ex_fastloc

Up to 25 times faster than df.loc by combining np.select and numexpr.evaluate (works with utf-8)

dfloc fast iloc loc numexpr numpy pandas python

Last synced: 03 May 2026

https://github.com/ebrahimar/ai-voice-cloner-xtts-v2

A Streamlit web app for AI-powered voice cloning using Coqui XTTS v2. Record or upload reference voices, clone speech in multiple languages, and generate natural audio outputs.

ai-voice coqui-tts deep-learning json multilingual-tts numpy pydub python speech-synthesis streamlit text-to-speech tts-model voice-cloning xtts-v2

Last synced: 03 May 2026

https://github.com/akhil-krishnan-r/super_market_analysis

Supermarkets are growing in most populated cities, and market competition is high. This dataset contains the historical sales of a supermarket company, recorded across 3 branches over 3 months. Predictive data analytics methods are easy to apply to this dataset.

data-visualization matplotlib numpy pandas seaborn

Last synced: 03 May 2026

https://github.com/kaustavmodak/business-aided-customer-feedback-assessment-system

A Streamlit-based sentiment analysis app that classifies customer reviews as Positive, Neutral, or Negative using a pre-trained ML model

framework machine-learning matplotlib nlp nltk numpy pandas pickle regex scikit-learn seaborn sentiment-analysis streamlt tfidf-vectorizer

Last synced: 03 May 2026

https://github.com/jonad/finding_donors

Predicting income with UCI Census Income Dataset using supervised machine learning algorithms

numpy pandas scikit-learn scikitlearn-machine-learning

Last synced: 03 May 2026

https://github.com/lmizner/codecademy_product_defects

Practicing rules of probability, set theory, and distributions

cdf jupyter-notebook numpy pmf ppf python rvs scipy-stats

Last synced: 03 May 2026

https://github.com/schukark/intervalpylib

Interval solver of systems of equations to find the workspace area of a robot

krawczyk matplotlib numpy python3 robotics solving-algorithm sympy systems-of-equations visualization

Last synced: 03 May 2026

https://github.com/pedasoft-consult/product_purchase_prediction

As a newly hired AI Engineer, your task is to predict customer behavior based on various features such as age, income, and gender. This exercise involves cleaning the data, training a decision tree model, and evaluating the model's performance to understand the key factors influencing customer purchasing decisions.

machine-learning matplotlib numpy pandas python seaborn sklearn

Last synced: 04 May 2026

https://github.com/swarup-ranjan-mondal/chat-analyzer

A Streamlit app to analyze WhatsApp personal and group chats.

matplotlib numpy pandas python seaborn streamlit

Last synced: 04 May 2026

https://github.com/youssefyaser/scrape-the-imdb-site-for-the-top-250-movies

Web scraping the top 250 movies on the IMDb site.

data-analysis numpy pandas python

Last synced: 04 May 2026

https://github.com/alphan26/optimal-logistics-locator

A project in which we estimate the biomass availability of places based on their index and determine the optimal preprocessing depot and biorefinery in Gujarat, India.

numpy pandas python sckit-learn

Last synced: 04 May 2026

https://github.com/siddhantborse/atmosviz

Atmos Viz is a Python-based project designed to analyze, visualize, and predict global temperature trends across various cities and countries using time-series analysis and advanced data science techniques. Leveraging historical climate data, this project integrates machine learning models, geospatial mapping, and interactive visualizations to unco

geopandas geospatial-analysis gis matplotlib numpy pandas plotly python scikit-learn seaborn shapefiles time timeseries-analysis timeseries-data

Last synced: 05 May 2026

https://github.com/vladd12/machine-learning-practice

Repository for educational practice and various courses related to AI

json keras-tensorflow machine-learning numpy pandas pickle tensorflow2 torch xml

Last synced: 05 May 2026

https://github.com/burhanali2211/marksanalyzer

This is a simple CLI-based tool that helps analyze student marks by calculating key statistics such as the highest and lowest scores.

analysis analyzer arrays marks numpy

Last synced: 05 May 2026

https://github.com/vc-nithesh944/pandas_numpy

Practice repository for NumPy & Pandas with scripts, notebooks, and notes for future revision

numpy pandas

Last synced: 05 May 2026

https://github.com/kbunwong/stock-price-prediction-using-lstm

This project predicts stock prices using the LSTM algorithm, with the aim of improving accuracy. Real-time data is fetched through the yfinance library.

google-colab jupyter-notebook keras-tensorflow lstm-model machine-learning matplotlib numpy pandas python visualization yfinance

Last synced: 05 May 2026

https://github.com/zafir100100/cancer-stage-prediction

This code predicts cancer data using various regression models, calculates their average R-squared scores, and prints the best model.

cross-validation data-analysis data-preprocessing decision-trees gradient-boosting linear-regression machine-learning-algorithms numpy pandas random-forest regression scikit-learn

Last synced: 05 May 2026

https://github.com/lohiyah/dataset_summariser

A Python tool for generating concise summaries of datasets using Pandas and NumPy for analysis, and Matplotlib and Seaborn for insightful visualizations. Ideal for quick exploratory data analysis (EDA).

matplotlib numpy pandas seaborn

Last synced: 05 May 2026

https://github.com/rayan-alam-uoit/space-x-launch-prediction-tool

A full-stack data science project analyzing SpaceX launch data using web scraping, REST APIs, SQL, interactive visualizations, and machine learning in Python to explore trends and predict launch success.

dash data-science folium jupyter-notebook machine-learning matplotlib numpy pandas plotly python seaborn sql sqlite

Last synced: 05 May 2026

https://github.com/sahaninilesh/future_ds_01_social-media-sentiments-analysis

The Social Media Sentiments Analysis Dataset captures a vibrant tapestry of emotions, trends, and interactions across various social media platforms. This dataset provides a snapshot of user-generated content, encompassing text, timestamps, hashtags, countries, likes, and retweets. Each entry unveils unique stories: moments of surprise, excitement,

jupyter matplotlib numpy pandas python

Last synced: 05 May 2026

https://github.com/nkamilla/titanic-eda

Exploratory Data Analysis of the Titanic dataset using Python (Pandas, NumPy, Matplotlib). Includes data cleaning, visualizations, correlations, and key business insights.

data-analysis eda jupyter-notebook matplotlib numpy pandas python titanic-dataset

Last synced: 05 May 2026

https://github.com/hms75/movie_rating_analysis

A movie rating analysis which identifies trends amongst a dataset of 5000 movies.

data-analysis data-visualization matplotlib-pyplot numpy pandas python

Last synced: 05 May 2026

https://github.com/shwetapardhi/assignment-1-q9_b--basic-statistics-level-1

Q9) Calculate skewness and kurtosis, and draw inferences on the following data: SP and Weight (WT). Use Q9_b.csv

kurtosis matplotlib-pyplot numpy pandas python seaborn skewness

Last synced: 06 May 2026

https://github.com/kashinathbiradar/bangalore-housing-price-prediction

The objective of the project is to create a supervised machine learning model that predicts housing prices.

css deta doc html machine-learning matplotlib numpy pandas preprocessing seaborn sklearn sql

Last synced: 06 May 2026

https://github.com/chopikus/raytracing-bench

Benchmarking ray tracers in Python, Java, NumPy, CuPy, and PyPy

cupy java numpy pypy python raytracing

Last synced: 06 May 2026

https://github.com/naveen88112/genetic_algos_maze_solver

Genetic Algorithm Maze Solver: this project uses a Genetic Algorithm (GA) to solve a maze optimization problem. It evolves a population of candidate solutions toward the best one, finding the shortest path between the start and end points using selection, crossover, and mutation.

deap-library genetic-algorithms matplotlib maze-solver numpy python

Last synced: 06 May 2026

https://github.com/williyam-m/company-registration-trends

Utilized Linear Regression from scikit-learn to predict future company registration trends.

flask matplotlib numpy pandas-python scikit-learn

Last synced: 06 May 2026

https://github.com/kavayk29/text-summarization-fine-tuning-pegasus-model

This project uses the Pegasus model to generate concise, coherent text summaries. Fine-tuned on large datasets, it excels in abstractive summarization, ideal for automating tasks like summarizing articles and research papers.

datasets huggingface-transformers matplotlib nltk numpy pandas teansform tqdm

Last synced: 06 May 2026

https://github.com/hansalemaos/cv2_collage

Create a collage from images with OpenCV

collage numpy opencv python

Last synced: 06 May 2026

https://github.com/piero24/lossfuncition-4sn

This research presents an innovative method for segmenting polyps during the screening phases of colonoscopies. The approach merges the hybrid semantic network (HSNet) architecture with Region-wise (RW) as the loss function for the backpropagation process.

dataset deep-learning deep-neural-networks loss loss-functions matplotlib neural-network numpy opencv polyp-detection polyp-segmentation python python3 pytorch segmentation segmentation-models torchvision transformers unipd unipd-dei

Last synced: 06 May 2026

https://github.com/bartosz-ziolkowski/social_data

Data analysis of San Francisco's crime data set from 2003 to 2018

data-science data-visualization jupyter-notebook numpy pandas python

Last synced: 06 May 2026

https://github.com/jai0212/course-selector-uoft

A program that suggests university courses based on your program and provides a custom GUI to customize your course plan. Created to help UofT students

beautifulsoup csv graphs graphs-algorithms networkx-graph numpy os parser pickle pillow plotly python tkinter-gui tkmacosx

Last synced: 06 May 2026

https://github.com/harmanveer-2546/student-performance-in-exam

Student performance analysis and prediction using datasets has become an essential component of modern education systems. With the increasing availability of data on students, schools and universities are using advanced analytics and machine learning algorithms to gain insights into student performance and predict future outcomes.

classification-report cluster dbscan dimensionality-reduction k-means-clustering labelling linear-regression logistic-regression numpy pandas pre-processing random-forest-classifier test-train-split unsupervised-machine-learning visualization

Last synced: 06 May 2026

https://github.com/abdulrahmanaymann/data-storage-and-information-retrieval

This is a Python project that performs tokenization, stop word removal, positional indexing, phrase query searching, term frequency-inverse document frequency (TF-IDF) calculation, cosine similarity computation, and document ranking.

cosine-similarity information-retrieval nltk numpy pandas postional-index python ranking stop-word-removal tf-idf tokenization

Last synced: 06 May 2026

https://github.com/kianaabrisham/svm-from-scratch

Linear SVM from scratch with hinge loss + decision boundaries

classification from-scratch fundamentals hinge-loss numpy optimization scikit-learn svm

Last synced: 07 May 2026

https://github.com/malintha-induwara/python-for-ds

This repository contains my personal notes, exercises, and examples for Python

numpy pandas python webscraping

Last synced: 07 May 2026

https://github.com/jonad/titanic_survival

Basic decision tree model for Titanic dataset. Predict who will survive.

jupyter-notebook matplotlib numpy pandas python3 titanic-survival-prediction

Last synced: 07 May 2026

https://github.com/suhas-005/jovian-data-analysis-course-assignment

These are my assignments for Data Analysis : Zero to Pandas course by Jovian.ai

data-analysis data-analytics numpy pandas python

Last synced: 07 May 2026

https://github.com/miss-mhv/retail_strategy_and_analytics

The purpose of this work is to analyze data from a supermarket to better understand the types of customers who buy chips and their buying behavior in the area.

jupyter-notebook numpy pandas python3

Last synced: 07 May 2026

https://github.com/ayushmankumar7/siamese-network-with-one-shot-learning

Implementation of this paper - https://www.cs.cmu.edu/~rsalakhu/papers/oneshot1.pdf

numpy tensorflow tf-keras

Last synced: 07 May 2026

https://github.com/ganesh774218/eda-book-store

Exploratory data analysis on a book store dataset to uncover sales trends, popular genres, and top publishers.

data-visualization datacleaning datamanipulation eda matplotlib numpy pandas python pythonp pythonproject seaborn

Last synced: 07 May 2026

https://github.com/krishbharadwaj5678/gestureninja

Play Fruit Ninja using just your index finger with real-time gesture tracking. No mouse, no keyboard, just your webcam and smooth slicing.

automation computer-vision cvzone finger-tracking fruit-ninja game-development gamedev gesture-recognition hand-tracking mediapipe mouse mouse-control numpy opencv pyautogui python realtime virtualmouse webcam

Last synced: 07 May 2026

https://github.com/shefreenkaur/web-scraping-and-word-frequencies

This project analyzes word frequencies in BC Legislative documents using Stanford CoreNLP and Python. The program extracts text from PDF documents, processes it using natural language processing techniques, and generates a comprehensive word frequency analysis.

analytics chromedriver easyocr nlp numpy pandas pymupdf python selenium stanfordcorenlp webscraping wordfrequency

Last synced: 07 May 2026

https://github.com/imskully/python-cv-fault-detection

Fault detection algorithm that determines whether or not a fault exists within a ring using raw computer vision.

ccl computer-vision matplotlib morphology numpy opencv python

Last synced: 07 May 2026

https://github.com/ifimust/room_generator

Room generator service for 2D game levels

flask google-app-engine microservice numpy procedural-generation python

Last synced: 07 May 2026

https://github.com/pesala-x/data-science-ai-assessments

This is an in-class practice application for Python-for-Data-Science-AI

data-types functions jupyter-notebook loops machine-learning numpy pandas-python python python3 seaborn

Last synced: 07 May 2026

https://github.com/jimmymugendi/bulding-a-decision-tree-to-predict-customer-churn

This repo describes building a decision tree to predict customer churn in a given organisation

accuracy-score decision-tree-classifier matplotlib-pyplot numpy pandas-dataframe scikit-learn

Last synced: 07 May 2026

https://github.com/arnie1x/massive-dataset-lab

This project involves basic data manipulation with JSON files, focusing on tasks related to data processing of the MASSive dataset and file management.

data-science numpy pandas pandas-dataframe python3

Last synced: 07 May 2026

https://github.com/bettenoch/yolo_real_time_object_detector

The YOLO Real-Time Object Detector leverages the ultralytics package to provide robust object detection capabilities. This project is designed to identify and classify objects in images, video streams, and live webcam feeds with high accuracy and efficiency.

numpy object-detection object-tracking pandas python3 real-time-processing ultralytics yolov8

Last synced: 07 May 2026

https://github.com/isabelacaldeira/plongee

A program we made using Python and Jupyter Notebook to solve a physics problem about diving

jupyter-notebook matplotlib numpy physics python university university-project

Last synced: 08 May 2026

https://github.com/louisblankemeier/numpytex

LaTeX script generator that produces an aesthetically appealing table from a NumPy array.

latex numpy script-generator table

Last synced: 07 May 2026

https://github.com/reddyprasade/bird-classifications-problem

Bird classification using scikit-learn and scikit-image

matplotlib numpy pandas python-3 sckiit-learn sckit-image

Last synced: 08 May 2026

https://github.com/khushi130404/k_means

This repository showcases 2D, 3D, and custom K-Means clustering models with visualizations. It includes both Jupyter notebooks and Python scripts for ease of reproducibility.

clustering-algorithm k-means-clustering numpy plotly sklearn

Last synced: 08 May 2026

https://github.com/gama1903/digital_image_processing

Implementation of basic digital image processing algorithms, mainly using NumPy.

digital-image-processing numpy opencv python

Last synced: 08 May 2026

https://github.com/dhruvil-26/python-projects

This repository contains Python projects showcasing data analysis and visualization. 1. IMDB Movie Analysis: Analyzing movie trends, genres, and ratings. 2. Loan Default Analysis EDA: Exploring factors contributing to loan defaults.

eda imdb-dataset loan-default-analysis matplotlib numpy pandas python seaborn visualization

Last synced: 08 May 2026

https://github.com/octavioduarte/datascience

A repository with study notes on Data Science concepts, NumPy, pandas, and an introduction to statistics.

data-science jupyter-notebook numpy pandas

Last synced: 08 May 2026

https://github.com/amiegirl/ford_gobike-tripdata

This data set includes information about individual rides made in a bike-sharing system covering the greater San Francisco Bay area.

jupyter-notebook matplotlib numpy pandas python seaborn

Last synced: 08 May 2026

https://github.com/rohansoni45/ipl-win-predictor

A machine learning model that predicts the outcome of IPL matches based on historical data and player performance. Built using Python and libraries like Pandas, NumPy, and scikit-learn. This project demonstrates the use of classification algorithms to predict match outcomes.

classification data-science ipl machine-learning model-evaluation numpy pandas python sklearn sports-analytics

Last synced: 08 May 2026

https://github.com/alz-11/ftoc

Fahrenheit to Celsius conversion analysis, comparing accurate and approximate methods with visualizations and error analysis.

approximation celsius conversion data-visualization error-analysis fahrenheit fahrenheit-to-celsius jupyter jupyter-notebook matplotlib numpy python python-3 python3

Last synced: 08 May 2026

https://github.com/thekartikeyamishra/data-preprocessor

A Google Colab module for interactive data preprocessing. Handles missing values, categorical encoding (One-Hot, Label), and numerical scaling (Standard, MinMax). Outputs a cleaned dataset

ipywidgets numpy pandas python scikit-learn

Last synced: 08 May 2026

https://github.com/filnow/tinygimp

project for image processing class

gimp numpy opencv pysimplegui python3

Last synced: 08 May 2026

https://github.com/omogbolahan94/movies-budget-revenue-analysis

Data exploration, data cleaning, data analysis and predictive analysis

dataframe linear-regression matplotlib numpy pandas seaborn statistical-analysis

Last synced: 08 May 2026

https://github.com/chdl17/credit-eda-casestudy

Credit EDA is a GitHub repository designed to provide educational materials and tools for exploring credit risk data. The repository contains notebooks, datasets, and code samples to help users learn and apply these concepts to credit data.

exploratory-data-analysis numpy pandas

Last synced: 08 May 2026

https://github.com/kkkaan/tensor-library-403-hw1

A NumPy-like library for tensor (gergen) operations.

deep-learning numpy python tensor

Last synced: 08 May 2026

https://github.com/raghavendranhp/predictive-analytics-in-automotive-manufacturing-testing-time-optimization

Explore machine learning for automotive testing optimization. Predictive analytics to reduce testing time and environmental impact.

elasticnet extratreesregressor fastica model-pipeline numpy pandas pca randomforestregressor svr-regression-prediction xgboost

Last synced: 09 May 2026

https://github.com/jsnevt/series_temporais

Forecasting of events that occur seasonally

datetime matplotlib numpy pandas

Last synced: 09 May 2026

https://github.com/omogbolahan94/nobel-prize-winner-analysis

Data exploration, cleaning, visualization and presentation

markdown matplotlib numpy pandas plotly seaborn

Last synced: 09 May 2026

https://github.com/flexthink/matricize

A convenience library to convert between pure Python objects and their vectorized representations

data machine-learning numpy python

Last synced: 09 May 2026

https://github.com/moonshallow5/food_vision_mini

A program that detects whether an image contains pizza, sushi, or steak. Now deployed on Flutter :)

huggingface matplotlib numpy pytorch

Last synced: 09 May 2026

https://github.com/aroojzahra908/machine-learning

This repository contains helpful guides and materials for easily understanding Pandas, NumPy, and Matplotlib libraries, which are essential tools for creating visual representations of data.

algorithms machine-learning-algorithms matplotlib numpy pandas

Last synced: 09 May 2026