An open API service indexing awesome lists of open source software.

NumPy

NumPy is an open source library for the Python programming language, adding support for large, multidimensional arrays, and matrices, along with a large collection of high-level mathematical functions to operate on these arrays.

https://github.com/iamkushvanth/marketing-campaign-analysis

Data analysis comparing the performance of Facebook and AdWords ad campaigns in 2019, Using Statical Methods such as A/B test focusing on clicks, conversions, cost-effectiveness, and seasonal trends.

ab-testing hypothesis-testing linear-regression matplotlib-pyplot numpy pandas python regression-analysis scikit-learn scipy stastistics

Last synced: 07 Apr 2026

https://github.com/jillmpla/pythonxmldatabase

A Python & XML database application for a book collection.

database matplotlib numpy pandas pandas-dataframe python xml

Last synced: 12 Apr 2026

https://github.com/sadegh15khedry/creditcard-transaction-anomaly-detection-using-isolationforest

detecting fraudulent transactions in credit card data using anomaly detection techniques.

anomaly-detection matplotlib numpy pandas python seaborn sklearn

Last synced: 27 Feb 2025

https://github.com/id-andyyy/alfahack

📈💰 Investment propensity prediction model

catboost hackathon-project jupyter lightgbm numpy optuna pandas python scikit-learn

Last synced: 12 Apr 2026

https://github.com/adarshpheonix2810/fake-job-post-detection

This project focuses on detecting fake job posts using machine learning. Fake job advertisements are often created to scam individuals by stealing personal information or money.

data-analysis deep-learning joblib machine-learning nlp-machine-learning numpy pandas python scikit-learn tkinter

Last synced: 12 Apr 2026

https://github.com/hairymax/python-for-science-lecture-notes

Tutorial material on the scientific Python ecosystem in Russian

matplotlib numpy python scipy sympy

Last synced: 12 Apr 2026

https://github.com/khushi130404/echobot

EchoBot is a chatbot built using TensorFlow to classify user inputs into predefined intents and respond accordingly. It uses a neural network trained with the data in intents.json.

keras nlp nltk numpy pickle tensorflow

Last synced: 12 Apr 2026

https://github.com/3rd-son/movie-streaming-service-analysis

Exploratory Data Analysis of the Streaming Services like Neflix, Hulu, Disney+ etc

data-analysis exploratory-data-analysis jupyter-notebook matplotlib numpy pandas plotly python seaborn

Last synced: 18 Apr 2026

https://github.com/allenvox/statistics

Workspace for applied problems of probability theory & mathematical statistics & modelling class

acf autocorrelation distribution k-means k-means-clustering maths matplotlib numpy probability-theory python sklearn statistics

Last synced: 17 Apr 2026

https://github.com/manojkp08/student-performance-analysis

The Student Performance Analyzer is your go-to solution for understanding and improving student performance. By blending the power of machine learning with interactive visualizations, this tool provides educators and learners with personalized insights into learning styles, performance gaps, and actionable improvements.

machine-learning numpy pandas python requests scikit-learn streamlit

Last synced: 12 Apr 2026

https://github.com/shibam120302/indian_sign_language_translator

This repository consists of the code utilized for creation of an Indian Sign Language Translator satisfying the following criteria : Near-Real-Time Application Achieve background independence Attain Illumination independence

imageai keras numpy opencv pillow tensorflow

Last synced: 12 Apr 2026

https://github.com/mzayles/base_vendas_dataprep

💻📊✅ Curso de Programação em Python para Data Science | Preparando e tratando dados com Pandas e NumPy.

numpy pandas python

Last synced: 24 Mar 2025

https://github.com/sultanazhari/prediction-model-of-customers-leaving

Bank Beta Company focus on retain existing customers, our task is to create a model that predicts whether or not a customer will leave the bank soon.

accuracy-score confusion-matrix logistic-regression-algorithm matplotlib-pyplot numpy pandas python3 random-forest roc-auc-score seaborn train-test-using-sklearn

Last synced: 20 Apr 2026

https://github.com/raoulnormand/easygrader

A library to easily calculate course grades.

gradebook gradescope numpy pandas teaching-tool

Last synced: 10 Mar 2025

https://github.com/kunalshelke90/students_marks_prediction_in_exams

Students Performance Evaluation using Feature Engineering, Feature Extraction, Manipulation of Data, Data Analysis, Data Visualization .Later on deployed on AWS with help of docker

dokcerfile flask githubactions machine-learning matplotlib mlops numpy pandas python seaborn

Last synced: 12 Apr 2026

https://github.com/azaz9026/car_price_prediction_model

This repository contains a machine learning model designed to predict car prices based on various features. Using historical data on car attributes such as make, model, year, mileage, and other relevant factors, the model aims to provide accurate and reliable price estimates for used cars.

data-analysis data-engineering liner-regestion machine-learning modeling numpy pandas python3 rendering

Last synced: 09 Apr 2026

https://github.com/tugot17/linear-regression-from-scratch

Linear Regression concepts and intuitions presented using Jupyter Notebooks

from-scratch linear-regression numpy python

Last synced: 11 Jul 2025

https://github.com/khushi130404/catnet

CatNet is a simple machine learning project that classifies images as either a cat or not a cat using logistic regression. The dataset consists of labeled images of cats and non-cats, preprocessed and used to train a binary classification model.

h5py matplotlib numpy pillow scipy

Last synced: 12 Apr 2026

https://github.com/darkusphantom/newton-method

7 ejercicios practicos del método de Newton en la programación computacional

computational-maths numpy python

Last synced: 25 Mar 2025

https://github.com/kareem74x/data-mining-uni-project

A project focused on applying data mining techniques to extract insights from datasets

data-mining matplotlib numpy pandas python

Last synced: 25 Feb 2026

https://github.com/ds-python-programmers-spectrum/flipkart-sentimental_analysis-app

The Flipkart Sentiment Analysis project aims to analyze customer reviews of Flipkart products using Natural Language Processing (NLP) and visualize insights through a Streamlit-based web application. This system will classify customer feedback as positive, negative, or neutral

machine-learning nltk numpy pandas sentiment-analysis streamlit-webapp textblob vader-sentiment-analysis

Last synced: 22 Jun 2025

https://github.com/akashash01/random_forest

It enable learning method for classification, regression and other task. Operates by building multiple decision tree ay training time. It is much more robust than Decision Tree algorithm.

data-science machine-learning numpy pandas python random-forest-classifier sklearn supervised-learning

Last synced: 01 Apr 2025

https://github.com/urvee1810/market_basket_analysis

A data mining project analyzing Instacart's 3 million grocery orders to uncover customer shopping patterns and product associations. Using market basket analysis and the Apriori algorithm, the project reveals key insights about shopping behavior, product combinations, and temporal patterns, providing valuable recommendations for retail strategy

apriori-algorithm data-mining data-visualization machine-learning market-basket-analysis matplotlib mlxtend numpy pandas python seaborn

Last synced: 12 Apr 2026

https://github.com/rohitinu6/image_cartoonify

To Cartoonify an Image using opencv-python

image-processing matplotlib numpy opencv opencv-python python

Last synced: 12 Apr 2026

https://github.com/arrmansa/leetcode-nice-solutions

Some solutions for leetcode problems that I found fun - either the algorithm or using numpy bruteforce where it shouldn't be

leetcode leetcode-python leetcode-python3 leetcode-solutions numpy python python3

Last synced: 16 Mar 2025

https://github.com/techcsispit/algorithmic-trading

This project focuses on algorithmic training using Python in Jupyter Notebooks. It covers various data structures and algorithms, providing a hands-on approach to solving algorithmic challenges. The notebooks are designed to be interactive and educational for learners of all levels.

matplotlib numpy pandas python sklearn

Last synced: 04 Sep 2025

https://github.com/hrolive/fundamentals-of-accelerated-data-science

How to use GPU-accelerated tools to conduct data science faster, leading to more scalable, reliable, and cost-effective results.

accelerated-computing bokeh cudf cugraph cuml cupy dask data-science jupyter numpy pandas python rapids xgboost

Last synced: 12 Apr 2026

https://github.com/parth-jatav/super-store-analysis-project

The Super Store Analysis project leverages Python libraries such as pandas, matplotlib, and numpy to perform a comprehensive analysis of a retail store's data. This project includes data cleaning, visualization, and statistical analysis to identify key trends, optimize inventory, enhance decision-making processes for improved business performance.

data-analysis matplotlib numpy pandas python super-store

Last synced: 12 Apr 2026

https://github.com/apal21/tensorflow-linear-regression-fit-data

Fit a line passing through data with some random noise using Tensorflow linear regression with Gradient Descent Optimizer.

gradient-descent linear-regression numpy python3 tensorflow

Last synced: 12 Apr 2026

https://github.com/lmizner/codecademy_life_expectancy_gdp

Utilize understanding of data visualization to analyze and plot data about GDP and life expectancy

barplot jupyter-notebook lineplot matplotlib-pyplot numpy pandas python scatterplot seaborn

Last synced: 12 Apr 2026

https://github.com/gregoritsch3/project_radioactive_decay

An analytic ( system of coupled ODEs), numeric (Runge-Kutta 4, Newton-Raphson) and stochastic (Monte Carlo) treatment of the chained radioactive decay of Bi[210, 83] into Po[210,84], as well as the emission of alpha-particles. Associated comments and paper are in Croatian.

cplusplus matplotlib montecarlo-simulation numerical-methods numpy python radioactive-decay scipy

Last synced: 12 Apr 2026

https://github.com/jpcano/boston_housing

Predicting Boston Housing Prices using supervised Machine Learning algorithms

cross-validation machine-learning numpy pandas python regression-models scikit-learn

Last synced: 12 Apr 2026

https://github.com/themihirmathur/soiligator

Soiligator is an advanced machine learning project designed to optimize irrigation management by predicting whether irrigation is necessary based on environmental and soil-related data.

auc-score logistic-regression machine-learning matplotlib numpy pandas python random-forest-classifier roc-curve scikit-learn seaborn standardscaler support-vector-machine

Last synced: 12 Apr 2026

https://github.com/emhash/machine-learning-and-ai

Personal tasks or codes of Machine Learning and Artificial Intelligence. Practice codes and projects of Ml/AI will be share here.

ai datasets ipynb jupyter-notebook keras linear-regression logistic-regression machine-learning machine-learning-algorithms numpy opencv pandas python seaborn sklearn tensorflow

Last synced: 05 Jan 2026

https://github.com/filip-kustura/python-covid-19-behaviors-analysis

Using Jupyter Notebook, this university project analyzes attitudes and behaviors related to the COVID-19 pandemic using a two-year survey from Imperial College London and YouGov research company. Utilizing Pandas, NumPy and Matplotlib, the data analysis focuses on three countries, exploring trends and insights throughout the pandemic.

covid-19 data-analysis data-visualization jupyter-notebook matplotlib numpy pandas python university-project

Last synced: 12 Apr 2026

https://github.com/fabricioarendtorres/streamauc

Light-weight package for classification metrics computed on streams or minibatches of data. Mainly for area under the curve (AUC) of precision-recall (PR) or receiver operating characteristic (ROC) curves. Supports multi-class setting with either macro- or micro aggregation..

classification-model machine-learning metrics numpy precision-recall-curve receiver-operating-characteristic

Last synced: 06 Jan 2026

https://github.com/lorenzorottigni/ml-universities

Machine Learning python bootcamp: K mean clustering with public/private universities dataset

k-mean-clustering machine-learning numpy pandas python scikit-learn seaborn

Last synced: 05 Apr 2026

https://github.com/lorenzorottigni/ml-random

Machine Learning course of Piero Savastano 7: matplotlib, accuracy_score, precision_score, recall_score, f1_score, classification_report

machine-learning matplotlib metrics numpy

Last synced: 04 May 2026

https://github.com/alisonmitchell/countries-of-the-world

Exploratory Data Analysis of US Government data from the World Factbook to investigate GDP ($ per capita), Literacy (%) and Phones (per 1000).

data-science machine-learning matplotlib numpy pandas python seaborn

Last synced: 12 Apr 2026

https://github.com/aleksandarbuk/machine-learning

The Machine Learning Library repository provides a collection of scripts and tools leveraging Scikit-Learn, Pandas, and NumPy for various machine learning tasks and data analysis.

matplotlib numpy python scikit-learn tensorflow

Last synced: 16 Apr 2026

https://github.com/akashkg03/facial-expression-image-classification

This notebook involves to build a facial expression image classifier which categorizes facial expressions into one of seven emotions: anger, disgust, fear, happiness, sadness, surprise, and neutral.

jupiter-notebook numpy pandas python

Last synced: 28 Apr 2026

https://github.com/sonatipek/matris_transactions_py

My "Python Programlama-I" lesson's homework. Some matris transactions with numpy library in Python

matrix numpy numpy-arrays python sonatipek

Last synced: 24 Mar 2025

https://github.com/sothulthorn/stock-prediction-portal

The Stock Prediction Portal is a full-stack web application that demonstrates how to use machine learning models to analyze stock market data and forecast future trends.

axios django django-rest-framework javascript keras matplotlib numpy pandas python react tensorflow yfinance

Last synced: 02 Apr 2026

https://github.com/hariprasath-v/av-job-a-thon-november-2022

Build a machine learning/deep learning approach to forecast the total energy demand on an hourly basis for the next 3 years based on past trends.

exploratory-data-analysis kaggle lightgbm-regressor matplotlib numpy pandas python rmse-score seaborn sklearn statsmodels timeseries-forecasting

Last synced: 12 Apr 2026

https://github.com/prarthana-singh/potato-disease-classification-using-deep-learning

🌱 Potato Disease Classification using Deep Learning – A deep learning-based model to detect and classify potato plant diseases using image processing and CNNs. Built with TensorFlow/Keras & Python.

agriculture cnn deep-learning image-classification keras machine-learning neural-network numpy pandas plant-disease-detection python3 streaming tensorflow

Last synced: 12 Apr 2026

https://github.com/yahiazakaria445/image-classification-using-vgg-16

computer vision model for 10 classes classification

matplotlib numpy opencv pandas sklearn tensorflow

Last synced: 12 Apr 2026

https://github.com/manikantasanjay/data_analysis_using_python_libraries_series

This Series contains Data Analysis projects performed on different Kaggle datasets and providing valuable insights into the data by making use of Python libraries.

exploratory-data-analysis kaggle-datasets machine-learning matplotlib numpy opendatasets pandas python visualization

Last synced: 12 Apr 2026

https://github.com/the-developer-306/fake-review-detector

This project is a machine learning-based review classification system that predicts whether a product review is GENUINE or FAKE. It preprocesses review text, analyzes sentiment, and uses numerical features like ratings and helpfulness to make predictions. The model is deployed via a Flask web application for user interaction.

classification flask logistic-regression machine-learning numpy pandas python renderdeploy scikit-learn sentiment-analysis

Last synced: 12 Apr 2026

https://github.com/chandkund/sentiment-analysis-using-nlp

This project focuses on Sentiment Analysis using the textual content from product reviews. The goal is to analyze user sentiments based on their written feedback, particularly focusing on the "reviewText" column in the dataset.

lemmatization matplotlib nlp nltk numpy pandas seaborn tokenization

Last synced: 31 Jan 2026

https://github.com/hariprasath-v/av_job-a-thon-august-2022

Build a Machine Learning model to predict the CTR(click through rate) of an email campaign based on the email campaigning information.

exploratory-data-analysis klib machine-learning matplotlib numpy pandas pycaret-library regression seaborn sklearn

Last synced: 12 Apr 2026

https://github.com/ieCecchetti/Python_ML_DL_examples

A variety of Machine Learning and Deep Learning scripts in Python. Included some theorical info about that in the Readme

bayes-classifier bayesian-statistics deep-learning kernel machine-learning matplotlib neural-network numpy pandas pca pca-analysis python scikitlearn-machine-learning scipy shi

Last synced: 10 Mar 2025

https://github.com/abideen-olawuwo/logistic-regression-house-prediction

Using logistic Regression Model to predict house price

logistic-regression matplotlib numpy pandas python seaborn sklearn

Last synced: 12 Apr 2026

https://github.com/raghavendra317/brain-tumor-detection

A deep learning-based project for classifying brain MRI images into Glioma, Meningioma, No Tumor, and Pituitary Tumor using Convolutional Neural Networks (CNNs). This project aims to assist in early tumor detection, improving diagnostic accuracy and healthcare outcomes.

cnn-classification keras numpy opencv scikit-learn tensorflow

Last synced: 12 Apr 2026

https://github.com/sandipkushwaha20/brain_tumour_detector

This project utilizes deep learning to detect and classify brain tumors from MRI images. The primary objective is to determine whether a tumor is present and, if so, classify its type with high accuracy.

deeplearning keras numpy pandas python tensorflow

Last synced: 12 Apr 2026

https://github.com/konynour/facedectection

This project demonstrates real-time face detection using OpenCV and a pre-trained SSD (Single Shot MultiBox Detector) model. The system captures video from a camera, processes the frames, detects faces, and displays the results with bounding boxes and confidence scores. This application showcases the use of deep learning models for object detection

cv2 jupyter-notebook lambda-functions matplotlib numpy opencv python3

Last synced: 12 Apr 2026

https://github.com/mairagalvao/steak_risk_data

A data analysis based on correlations and comparisons among the samples.

database matplotlib numpy pandas python3 sql

Last synced: 12 Apr 2026

https://github.com/supriya811106/whatsapp-chat-analyzer-app

Analyze WhatsApp chats with Python, Streamlit, and data visualization. Explore messaging patterns, content trends, and emoji usage to uncover insights from your conversations.

analyzer-web-app chat-analytics chat-analyzer data-preprocessing data-visualization emojis machine-learning matplotlib natural-language-processing nltk numpy pandas plotly python3 seaborn sentiment-analysis streamlit-webapp text-analysis user-engagement

Last synced: 30 Dec 2025

https://github.com/AsuquoAA/CDC_2017_Immunization_Data_Analysis

This project analyzes data from the CDC's 2017 National Immunization Survey Public Use File (NISPUF17) to answer research questions about immunization, child health, and vaccine-related behaviors.

numpy

Last synced: 21 Jul 2025

https://github.com/2kabhishek/ml-primer

Basic Machine Learning & Data Science in Python/ 📈🐍

jupyter-notebook machine-learning matplotlib numpy pandas python

Last synced: 12 Apr 2026

https://github.com/sudarshanc00/mnist-digit-classification

This project uses a machine learning model to classify handwritten digits (0-9) from the MNIST dataset, a popular collection of grayscale images for image classification benchmarking. It includes a Jupyter Notebook to train a neural network, enabling accurate recognition and classification of digits.

jupyter-notebook matplotlib numpy python tensorflow

Last synced: 12 Apr 2026

https://github.com/marcelovilla/fire-congress-2019-tdf-talk

This repository contains the code used for the project presented in the "Fire activity on the Colombian Tropical Dry Forest: an environmental and social perspective" talk during the 8th International Fire Ecology and Management Congress.

earth-observation ecology fire-activity gdal latex matplotlib modis numpy pandas python remote-sensing requests seaborn tikz trmm xarray

Last synced: 12 Apr 2026

https://github.com/ben854719/wasde-revenue-in-2024

The project will use AI to analyze the revenue from the dataset of the World Agricultural Supply and Demands Estimates.

gemini langchain large-language-models matplotlib numpy pandas-python

Last synced: 12 Apr 2026

https://github.com/nidomika/digit-recognition

Digit recognition project in Python

digit-recognition numpy opencv python tensorflow

Last synced: 12 Apr 2026

https://github.com/samkazan/structural_discovery_of_macromolecules_data_analysis

This research project uses machine learning techniques and neural network to uncover key factors that contribute to successful protein structure discovery using Python and R

classification clustering ipython-notebook jupyter-notebook keras-neural-networks keras-tensorflow machine-learning neural-network numpy python r rmarkdown scikit-learn scipy tensorflow

Last synced: 02 Feb 2026

https://github.com/dahsie/machine_learning_from_scratch

This project aims to implement some machine learning basic techniques(e.g. MinMaxScaler, StandardScaler, TD-IDF, PCA, Logistic Regression, LDA, KNN, Naive Bayes Classifier) using only pyton, numpy and pandas. This will enable me to have hone my data scientist skills

classification clustering data data-processing datascience machienlearning nlp nltk numpy pandas python regression

Last synced: 04 May 2026

https://github.com/elifftosunn/data-labeling

Bu araç veri kazıma aşamasından sonra verilerin istenilen kategoriye göre etiketlenmesi ve kullanılmayacak olan verilerin database'ten silinmesi ile makine öğrenmesi modellerini kullanmak için etiketleme adımınını otomatikleştirmek amaçlı yapılmıştır.

base64 kaleido numpy os pandas plotly postgresql psycopg2 psycopg2-binary time

Last synced: 12 Apr 2026

https://github.com/chandkund/iris-classification

Iris Classification project using machine learning to classify iris flowers into Setosa, Versicolour, and Virginica species. Includes data exploration, model training with SVM, Logistic Regression, and Decision Tree, and performance evaluation. Contributions welcome! Python, Pandas, Scikit-learn, Seaborn, Matplotlib.

machine-learning matplotlib-pyplot numpy pandas python sklearn

Last synced: 21 Jan 2026

https://github.com/nambukeerthi/youtube_dataproject

Build a Streamlit App – A user-friendly interface. Use Google API – Fetch YouTube channel data. Store in SQL Database – Save channel details for easy access. Search & Join Tables – Allow users to query and view data

api numpy pandas python sql streamlit

Last synced: 09 Apr 2026

https://github.com/daksh777/expressando

Expressando is a rock, paper, scissors game made using Python (TensorFlow, Keras and OpenCV)

keras keras-tensorflow machine-learning numpy opencv opencv-python tensorflow

Last synced: 13 Apr 2026

https://github.com/pedasoft-consult/house-price-prediction

This project is a machine learning-powered web application for predicting house prices based on various features such as area, bedrooms, bathrooms, and more. It uses Flask for the backend and HTML/CSS/JavaScript for the frontend.

flask joblib matplotlib numpy pandas r2 random-forest regression seaborn sklearn

Last synced: 13 Apr 2026

https://github.com/bachboy0/montecarlo

Python-based Monte Carlo simulations for statistical analysis and modeling.

monte-carlo-methods numpy python

Last synced: 21 Apr 2026

https://github.com/ayhem18/tinyad

A simple Python library to better understand automatic Differentiation

automatic-differentiation mathematical-programming numpy python3

Last synced: 29 Apr 2026

https://github.com/hansalemaos/cinpy

C/C++ in Python for Dummies

c cpp ctypes numpy python

Last synced: 13 Apr 2026

https://github.com/jim-by/tweet-sentiment-analysis

Sentiment analysis of tweets using TextBlob for labeling and RandomForest for classification.

nltk nltk-tokenizer numpy pandas python random-forest-classifier sklearn textblob-sentiment-analysis

Last synced: 13 Apr 2026

https://github.com/karanyeole/movie-rating-analysis

analyzing movie ratings from various sources to gain insights into audience preferences, trends, and patterns in movie ratings. The project utilizes data analysis and visualization techniques to explore and interpret movie rating data, providing valuable information for movie enthusiasts, filmmakers, and industry professionals.

matplotlib numpy pandas python seaborn

Last synced: 13 Apr 2026

https://github.com/rkarahul/ok.win-big-small-predictor

Predict the next “Big” or “Small” outcome on the OK.Win lottery-style game using OCR + time-series features + ML.

joblib numpy opencv-python paddleocr paddlepaddle paddlepaddle-gpu pandas python sckit-learn

Last synced: 13 Apr 2026