An open API service indexing awesome lists of open source software.

NumPy

NumPy is an open source library for the Python programming language that adds support for large, multidimensional arrays and matrices, along with a large collection of high-level mathematical functions to operate on these arrays.

https://github.com/dharma-acha/resnet18_imageclassification_cnn

In this part of the project, we implement ResNet-18 from scratch using PyTorch and train it on an image dataset to achieve over 75% accuracy. We apply techniques to prevent overfitting and optimize performance, aiming for an accuracy of 80% or higher.

matplotlib numpy python3 pytorch scikit-learn seaborn

Last synced: 11 Apr 2026

https://github.com/renukadhule/crop_production_python_analysis

This project analyzes crop production data in India from 1997 to 2015, identifying trends, seasonality, and top-performing states and crops. The insights aim to guide improvements in agricultural productivity and policy decisions.

matplotlib numpy pandas python

Last synced: 12 Apr 2026

https://github.com/imdeepmind/knn_from_scratch

KNN classifier in pure NumPy and Python

knn knn-classification machine-learning numpy python

Last synced: 11 Apr 2026
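
For readers unfamiliar with the k-nearest-neighbours technique named in the entry above, here is a minimal sketch. It is not taken from the listed repository: the function name `knn_predict` and the toy data are hypothetical, and it uses only the Python standard library so that it runs without dependencies (a NumPy version would typically vectorize the distance computation).

```python
from collections import Counter
from math import dist


def knn_predict(train_points, train_labels, query, k=3):
    """Label `query` by majority vote among its k nearest training points."""
    # Pair each training point's Euclidean distance to the query with its label,
    # then sort so the closest points come first.
    distances = sorted(
        (dist(point, query), label)
        for point, label in zip(train_points, train_labels)
    )
    # Keep the labels of the k closest points and return the most common one.
    nearest_labels = [label for _, label in distances[:k]]
    return Counter(nearest_labels).most_common(1)[0][0]


# Hypothetical toy data: two well-separated clusters.
points = [(0.0, 0.0), (0.1, 0.2), (0.2, 0.1), (5.0, 5.0), (5.1, 4.9), (4.8, 5.2)]
labels = ["a", "a", "a", "b", "b", "b"]

print(knn_predict(points, labels, (0.3, 0.3)))
print(knn_predict(points, labels, (4.9, 5.0)))
```

The choice of `k` and of the distance metric is left to the caller here; real implementations often add feature scaling before computing distances.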

https://github.com/samkazan/structural_discovery_of_macromolecules_data_analysis

This research project uses machine learning techniques and neural networks, in Python and R, to uncover key factors that contribute to successful protein structure discovery.

classification clustering ipython-notebook jupyter-notebook keras-neural-networks keras-tensorflow machine-learning neural-network numpy python r rmarkdown scikit-learn scipy tensorflow

Last synced: 02 Feb 2026

https://github.com/necraul/perlin-noise-py

Some examples of generating Perlin noise using Python

noise numpy perlin-noise python python3 vector

Last synced: 11 Mar 2025

https://github.com/vyjayanthipolapragada/fraud_detection_creditcard

Detecting fraudulent credit card transactions by training a Decision Tree model using Scikit-learn and SnapML

classification-model data-preprocessing decision-tree-classifier kaggle-dataset machine-learning numpy pandas python scikit-learn snapml time tree-model

Last synced: 11 Apr 2026

https://github.com/frolov-andrey2405/prime-numbers-and-polar-coordinate-system

Prime numbers and polar coordinate system

matplotlib numba numpy python

Last synced: 01 May 2026

https://github.com/sakhileln/multimodal-chatbot-with-memory

A lightweight, CLI-based conversational agent that processes text and image inputs and remembers past interactions.

chatbot machine-learning nlp numpy opencv spacy tensorflow

Last synced: 11 Apr 2026

https://github.com/dan-niles/neural-networks-from-scratch

A neural network implemented using only NumPy and Pandas for CS3631: Deep Neural Networks

neural-networks numpy pandas

Last synced: 05 May 2026

https://github.com/ebadshabbir/logistic_regression-binomial-

Logistic Regression on the Social Network Ads Dataset. This project applies Logistic Regression to predict whether a user will purchase a product based on their age and estimated salary, using the Social Network Ads dataset. The data is split into training and test sets, with feature scaling applied for normalization.

classification jupyter-notebook logistic-regression machine-learning matplotlib-pyplot numpy pandas python sklearn

Last synced: 11 Apr 2026

https://github.com/thaisgarcia/convert-censo-file

This repository contains Python scripts for processing data destined for the MEC census, including reading a TXT file, generating an Excel file, and converting that Excel file back to a text format.

numpy pandas python

Last synced: 11 Apr 2026

https://github.com/guptamudit/whatsapp-chat-analyser

An online WhatsApp chat analysis tool designed to extract valuable insights from your group or one-on-one conversations. By analyzing your chat data, WhatsChat provides detailed statistics and visualizations.

matplotlib numpy pandas python seaborn

Last synced: 11 Apr 2026

https://github.com/dahsie/machine_learning_from_scratch

This project aims to implement some basic machine learning techniques (e.g. MinMaxScaler, StandardScaler, TF-IDF, PCA, Logistic Regression, LDA, KNN, Naive Bayes Classifier) using only Python, NumPy and pandas. This will enable me to hone my data science skills.

classification clustering data data-processing datascience machienlearning nlp nltk numpy pandas python regression

Last synced: 04 May 2026

https://github.com/ashishsingh789/titanic_dataset_eda_and_visualization

This repository contains an exploratory data analysis (EDA) of the Titanic dataset. Key analyses include survival rates by gender, passenger class, age distribution, family size, and correlation heatmaps.

data data-science dataanalysis matplotlib numpy pandas pandas-dataframe python seborn visualisation

Last synced: 11 Apr 2026

https://github.com/devbigboy/php-crud-project

PHP-CRUD-Project

jupyter numpy pandas

Last synced: 20 Apr 2026

https://github.com/hossein-cod/parking-space-detection

Identifying empty spaces in a parking lot with OpenCV

cv2 jupyter-notebook numpy opencv pickle python

Last synced: 14 Apr 2026

https://github.com/francislauriano/chatsoftex

A platform developed in Python that aims to automate and speed up the evaluation of technological innovation projects, using artificial intelligence and standardized criteria based on the Lei do Bem.

cryptography fernet firebase flask flask-jwt-extended hugging-face-transformers numpy openai pdfplumber postgresql pyjwt pymupdf-fitz pypdf2 python pytorch scikit-learn scipy spacy sqlalchemy tensorflow

Last synced: 03 Apr 2026

https://github.com/nilusche/feed_forward_network_py

Implementation of a neural network in Python that can predict handwritten digits from the MNIST dataset.

colab-notebook feed-forward-neural-networks neural-network numpy python

Last synced: 01 May 2026

https://github.com/matkorussovich/student-performance-analysis

This repository contains an analysis of students' academic performance, carried out as part of the "Introducción al Data Science" module in the Master's in Data Science at the Universidad Europea de Madrid.

jupyter-notebook matplotlib-pyplot numpy pandas python

Last synced: 14 Apr 2026

https://github.com/samuele-lolli/data-analytics-techniques

A practical approach to the data analytics pipeline.

numpy pandas pytorch scikit-learn

Last synced: 11 Apr 2026

https://github.com/alejandro945/insurance-risk

This project aims to predict the risk of insurance claims using a dataset from Kaggle. The dataset consists of 26 columns and 205 rows, providing various features related to insurance risk. By analyzing this data, we seek to build predictive models that can help insurers assess the risk of claims.

data-analytics ipython-notebook numpy pandas python

Last synced: 06 Feb 2026

https://github.com/harmanveer-2546/movie-industry

Investigate the film industry to understand what contributes to success, and in turn use this analysis to create actionable recommendations for companies entering the industry.

business business-analytics data-analysis datatime film-industry graphs matplotlib movie-database numpy pandas python scraping-websites seaborn visualization web-scraping-python

Last synced: 10 Apr 2026

https://github.com/asuquoaa/cdc_2017_immunization_data_analysis

This project analyzes data from the CDC's 2017 National Immunization Survey Public Use File (NISPUF17) to answer research questions about immunization, child health, and vaccine-related behaviors.

numpy

Last synced: 12 Apr 2026

https://github.com/amnydv17/landmark-detection

This project aims to leverage the power of deep learning models to automatically detect and pinpoint landmarks such as famous monuments, buildings, natural landmarks, and other recognizable structures within images.

machine-learning matplotlib numpy pandas python3 scikit-learn seaborn tensorflow

Last synced: 11 Apr 2026

https://github.com/mizcausevic-dev/embedding-drift-graph

Track how entity embeddings drift across encoder model versions. SQLite store + Strawberry GraphQL API. Cosine drift events computed automatically on every record. Reference impl for RAG/eval pipelines re-encoding their corpus.

ai-governance drift-detection embeddings graphql llm numpy python rag sqlite strawberry-graphql vector-search

Last synced: 01 Jun 2026
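
The "cosine drift" mentioned in the entry above can be illustrated with a short sketch. This is an assumption about what such a metric might look like, not the listed project's implementation: the function name `cosine_drift` is hypothetical, and drift is taken here to mean one minus the cosine similarity between an entity's old and new embedding.

```python
from math import sqrt


def cosine_drift(old_vec, new_vec):
    """Return 1 - cosine similarity between two equal-length vectors.

    0.0 means the direction is unchanged, 1.0 means the vectors are
    orthogonal, and 2.0 means they point in opposite directions.
    """
    if len(old_vec) != len(new_vec):
        raise ValueError("vectors must have the same length")
    dot = sum(a * b for a, b in zip(old_vec, new_vec))
    norm_old = sqrt(sum(a * a for a in old_vec))
    norm_new = sqrt(sum(b * b for b in new_vec))
    if norm_old == 0.0 or norm_new == 0.0:
        raise ValueError("cosine drift is undefined for a zero vector")
    return 1.0 - dot / (norm_old * norm_new)


# Hypothetical embeddings of the same entity from two encoder versions.
print(cosine_drift([1.0, 0.0, 0.0], [0.9, 0.1, 0.0]))
```

Comparing vectors this way only makes sense when both encoder versions produce embeddings of the same dimensionality; whether their spaces are otherwise comparable depends on the models involved.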

https://github.com/rakibhhridoy/differentprojects

Some of the learning projects I practiced on to get started in data science. Not all of them, only a few that were stored in my local repository. They may be useful for beginner data science enthusiasts. Explore and learn!

data-science deep-learning machine-learning mathematics matplotlib numpy pandas python scikit-learn seaborn statistics

Last synced: 11 Apr 2026

https://github.com/elifftosunn/data-labeling

This tool was built to automate the labeling step needed before using machine learning models: after the data scraping stage, the data is labeled according to the desired category and the data that will not be used is deleted from the database.

base64 kaleido numpy os pandas plotly postgresql psycopg2 psycopg2-binary time

Last synced: 12 Apr 2026

https://github.com/chernyakid/bank-clients-churn-research

Customer churn analysis at Metanprombank

jupyter numpy pandas python scipy seaborn

Last synced: 11 Apr 2026

https://github.com/testetson22/abelian_sandpiles

Simple App that implements, builds and tests an approximation of the Abelian sandpile model

github-actions numpy pytest

Last synced: 12 Apr 2026

https://github.com/ljubogdan/solar-cycle-lstm

This project predicts sunspot activity using an LSTM model for time series data. Built with TensorFlow and Keras, it uses Huber loss for outlier handling and MAE for performance evaluation. The dataset, sourced from Kaggle or SIDC, spans over 270 years of monthly sunspot data.

conv1d huber-loss-regression kaggle keras lstm machine matplotlib numpy pandas scikit-learn seaborn solar sunspots tensorflow time-ser

Last synced: 13 Apr 2026

https://github.com/anshpg/vectorwave

The Jupyter Notebook (ipynb file) contained within this repository serves as the primary codebase for this project. It demonstrates how to preprocess the input data, train the logistic regression classifier, evaluate its performance, and save the trained model for future use.

collab jupyter-notebook matplotlib-pyplot numpy pandas python sklearn

Last synced: 11 Apr 2026

https://github.com/manoj-kumar-c/data_analyst_projects

Live Data Analyst Projects.

excel numpy pandas powerbi

Last synced: 07 Feb 2026

https://github.com/linsanity03/football_prediction

ML project to predict goals from football events such as penalties and free kicks.

jupyter-notebook numpy pandas pyspark python3 tensorflow

Last synced: 11 Apr 2026

https://github.com/chandkund/iris-classification

Iris Classification project using machine learning to classify iris flowers into Setosa, Versicolour, and Virginica species. Includes data exploration, model training with SVM, Logistic Regression, and Decision Tree, and performance evaluation. Contributions welcome! Python, Pandas, Scikit-learn, Seaborn, Matplotlib.

machine-learning matplotlib-pyplot numpy pandas python sklearn

Last synced: 21 Jan 2026

https://github.com/ansh-info/industrial-scale-penicillin-simulation

Optimizing industrial-scale penicillin production using machine learning and data analysis.

jupyter-notebook machine-learning matplotlib numpy pandas python scikit-learn

Last synced: 11 Apr 2026

https://github.com/adamouization/python-machine-learning-data-science-notes

Jupyter notebooks containing useful Python code and notes for general Machine Learning and Data Science projects.

data data-science data-visualization guide jupyter jupyter-notebook machine-learning matplotlib notes numpy pandas pandas-dataframe python seaborn

Last synced: 11 Apr 2026

https://github.com/nambukeerthi/youtube_dataproject

Build a Streamlit App – A user-friendly interface. Use Google API – Fetch YouTube channel data. Store in SQL Database – Save channel details for easy access. Search & Join Tables – Allow users to query and view data

api numpy pandas python sql streamlit

Last synced: 09 Apr 2026

https://github.com/ialam085/super_store_sales_analysis_python

The project performs an exploratory data analysis (EDA) of Super Store Sales data from various perspectives, using comprehensive visualizations.

charts matplotlib numpy pandas python seaborn visualization

Last synced: 10 Apr 2026

https://github.com/erikaax08/numerical_methods

This repository hosts the project developed for the Numerical Methods class at the Benemérita Universidad Autónoma de Puebla (BUAP)

django maths matplotlib numerical-methods numpy plotly

Last synced: 19 Aug 2025

https://github.com/arnauog/ironhack_data_analytics

Weekly labs to practice everything taught in class during the Data Analytics bootcamp of Ironhack in Barcelona of Sept 2024 - March 2025.

machine-learning mongodb mysql numpy pandas python seaborn tableau webscraping

Last synced: 11 Apr 2026

https://github.com/yosrak5/predictive_maintenance

End-to-end predictive analytics to predict hardware maintenance before it occurs, using data preprocessing and machine learning classification models (XGBoost, Random Forest).

data-factory data-perspective machine-learning-workbench matplotlib numpy pandas python random-forest seaborn xgboost

Last synced: 25 Feb 2026

https://github.com/broodhoney/titanic-ml-from-disaster

This repository contains my analysis and solutions for the Titanic: Machine Learning from Disaster competition on Kaggle. The notebook explores the dataset, performs extensive Exploratory Data Analysis (EDA), applies feature engineering techniques, and builds predictive models to determine survival outcomes based on passenger data

machine-learning numpy pandas python scikit-learn scikitlearn-machine-learning

Last synced: 11 Apr 2026

https://github.com/shwetapardhi/assignment-1-q11--basic-statistics-level-1

Q11) Suppose we want to estimate the average weight of an adult male in Mexico. We draw a random sample of 2,000 men from a population of 3,000,000 men and weigh them. We find that the average person in our sample weighs 200 pounds, and the standard deviation of the sample is 30 pounds. Calculate the 94%, 98%, and 96% confidence intervals.

confidence-interval normalization numpy pandas python scipy stats

Last synced: 13 Apr 2026
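
The confidence-interval question quoted in the entry above can be worked through with a short sketch. It is not the listed repository's solution: the function name `mean_confidence_interval` is hypothetical, and it assumes a normal (z) approximation, which seems reasonable for a sample of 2,000, with no finite-population correction because the sample is a tiny fraction of the 3,000,000 population.

```python
from math import sqrt
from statistics import NormalDist


def mean_confidence_interval(mean, sd, n, level):
    """Two-sided normal-approximation interval for a sample mean."""
    # Critical value z such that the central `level` mass lies within +/- z.
    z = NormalDist().inv_cdf(0.5 + level / 2)
    # Margin of error: z times the standard error of the mean.
    margin = z * sd / sqrt(n)
    return mean - margin, mean + margin


# Values from the question: mean 200 lb, standard deviation 30 lb, n = 2,000.
for level in (0.94, 0.96, 0.98):
    low, high = mean_confidence_interval(200, 30, 2000, level)
    print(f"{level:.0%}: ({low:.2f}, {high:.2f})")
```

Higher confidence levels give wider intervals, since the critical value grows while the standard error stays fixed.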

https://github.com/daksh777/expressando

Expressando is a rock, paper, scissors game made using Python (TensorFlow, Keras and OpenCV)

keras keras-tensorflow machine-learning numpy opencv opencv-python tensorflow

Last synced: 13 Apr 2026

https://github.com/murugavl/loan_approval_prediction

This project is a Loan Approval Prediction System that uses Machine Learning to determine whether a loan application should be approved or rejected based on various factors. It is deployed using Streamlit for an interactive user experience.

jupyter-notebook machine-learning numpy pandas python random-forest-classifier scikit-learn stremlit

Last synced: 13 Apr 2026

https://github.com/ayushtiwari134/machine_learning_models

A repo where I upload all the models I train during my journey of learning machine learning from scratch.

linear-regression logistic-regression machinelearning matplotlib numpy pandas python random-forest scikit-learn

Last synced: 11 Apr 2026

https://github.com/dmarks84/coursework_project_text-mining-spam-analysis

Project for University of Michigan Applied Data Science Specialization -- Performed NLP in order to build features of email messages; trained various classification models to help predict if a message was spam.

classification databases eda nlp numpy pandas python scikit seaborn sentiment-analysis statistics supervised-ml text-mining unsupervised-ml visualization

Last synced: 11 Apr 2026

https://github.com/pedasoft-consult/house-price-prediction

This project is a machine learning-powered web application for predicting house prices based on various features such as area, bedrooms, bathrooms, and more. It uses Flask for the backend and HTML/CSS/JavaScript for the frontend.

flask joblib matplotlib numpy pandas r2 random-forest regression seaborn sklearn

Last synced: 13 Apr 2026

https://github.com/gustavo-venceslau/housing-price

A Machine Learning project to create a model to predict housing prices. The project is based on the example in chapter 2 of the book Hands-On Machine Learning with Scikit-Learn, Keras, and TensorFlow by Aurélien Géron.

conda machine-learning matplotlib numpy pandas python sklearn

Last synced: 13 Apr 2026

https://github.com/sohan181204/airline-ticket-price-analysis

The analysis of airline ticket prices explores the dynamic and competitive nature of the aviation industry in India. With rapid economic growth and increasing air travel demand, understanding ticket price trends is crucial for airlines, passengers, and policymakers.

numpy pandas python selenium

Last synced: 11 Apr 2026

https://github.com/naveen88112/final_education

Student Performance Prediction. This project examines the student performance data, pre-processes the features, and implements machine learning methods (Random Forest) for the forecasting of final grades. The data is subjected to exploratory data analysis (EDA) and feature engineering prior to model training and assessment.

data-preprocessing exploratory-data-analysis machine-learning model-evaluation numpy pandas python

Last synced: 11 Apr 2026

https://github.com/bachboy0/montecarlo

Python-based Monte Carlo simulations for statistical analysis and modeling.

monte-carlo-methods numpy python

Last synced: 21 Apr 2026

https://github.com/barkerbg001/framefusion

Python script that creates a video from a set of images and adds a random audio clip to each video

glob moviepy numpy opencv pygt5 python python-3 random video video-editor

Last synced: 16 Apr 2026

https://github.com/architj6/gesture-control-system

This Gesture Control System allows users to control various system functions (like mouse movement, volume adjustment, and screen brightness) using hand gestures through a webcam. It leverages MediaPipe for hand tracking and PyAutoGUI for controlling the system. The project enables intuitive, hands-free control for everyday tasks.

brightness-control cursor cursor-control doubleclick drag-and-drop gesture handgesture handgesture-recognition mediapipe numpy opencv os pyautogui pycaw python python3 scroll volume-control

Last synced: 17 Apr 2026

https://github.com/anish37860/python_party

Python summer party by Interview Master, in partnership with DataCamp

jupyter numpy pandas python

Last synced: 06 May 2026

https://github.com/ayhem18/tinyad

A simple Python library to better understand automatic differentiation

automatic-differentiation mathematical-programming numpy python3

Last synced: 29 Apr 2026

https://github.com/dineshdhamodharan24/industrial-copper-modeling

A project that models industrial copper data using Python and several libraries such as pandas, numpy, and scikit-learn, which are widely used for data manipulation, analysis, and machine learning tasks.

bussiness-solution numpy pandas pickle pickle-file seaborn sklearn-library streamlit

Last synced: 13 Apr 2026

https://github.com/hansalemaos/cinpy

C/C++ in Python for Dummies

c cpp ctypes numpy python

Last synced: 13 Apr 2026

https://github.com/companyakis/numpy

Python Numpy

data-science numpy python

Last synced: 04 Sep 2025

https://github.com/alphacrypto246/stock-price-movement-prediction

A project leveraging Polynomial Regression to predict stock price movements based on historical data. Includes data preprocessing, feature engineering, visualization, and model evaluation to provide insights for informed trading decisions.

machine-learning numpy pandas polynomial-regression scikit-learn yfinance

Last synced: 13 Apr 2026

https://github.com/jim-by/tweet-sentiment-analysis

Sentiment analysis of tweets using TextBlob for labeling and RandomForest for classification.

nltk nltk-tokenizer numpy pandas python random-forest-classifier sklearn textblob-sentiment-analysis

Last synced: 13 Apr 2026

https://github.com/f-ssemwanga/advanced-programming-york

This repository holds the extensive Python programming completed during the advanced programming unit of my master's programme.

comprehension csv-parser json lambda-functions list matplotlib mongodb numpy oops-in-python pandas-dataframe postgresql python3 tkinter-gui

Last synced: 11 Apr 2026

https://github.com/sralter/potential_talents

Using NLP techniques (word and sentence embedding tools like SBERT and Learning-to-Rank systems like RankNet and LambdaRank) to rank candidates.

lambdarank learning-to-rank lightgbm matplotlib nlp numpy pandas python pytorch ranknet

Last synced: 09 Apr 2026

https://github.com/slovnicki/matematicki-softver

Homework and projects for the Mathematical Software course

ipython jupyter latex markdown mathplotlib numpy pandas scikit sympy

Last synced: 15 Apr 2026

https://github.com/aciluca/credit-approval-system

A machine learning application to predict credit card approval using Python, TensorFlow, and Docker.

data-science keras machine-learning numpy pandas python sklearn tensorflow

Last synced: 11 Apr 2026

https://github.com/ellennewerkla/apriori-analise_cesta_de_mercado

Market Basket Analysis using the Apriori algorithm to identify association rules between products in a set of supermarket transactions.

apriori-algorithm jupyternotebook marketbasketanalysis mathplotlib mba numpy pandas python3

Last synced: 13 Apr 2026

https://github.com/rahul-shrivastav/digit-recognition-nn

Web application that recognizes handwritten digits using a deep learning model

deep-learning flask numpy python reactjs tensorflow

Last synced: 11 Apr 2026

https://github.com/williamfridh/umu_5tf078_paper_keyboard_tracker

Project exploring how a CNN alongside computer vision can be used to detect drawn keys and track finger input.

cnn-classification computer-vision keras-tensorflow numpy python tensorflow

Last synced: 07 Jan 2026

https://github.com/karanyeole/movie-rating-analysis

Analyzing movie ratings from various sources to gain insights into audience preferences, trends, and patterns in movie ratings. The project utilizes data analysis and visualization techniques to explore and interpret movie rating data, providing valuable information for movie enthusiasts, filmmakers, and industry professionals.

matplotlib numpy pandas python seaborn

Last synced: 13 Apr 2026

https://github.com/shahiakhilesh1304/fitbitcasestudy

This is a case study based on data retrieved from a Fitbit band, and we are making predictions about human behavior based on their mood.

case-study data-visualization fitbit jupyter-notebook numpy python3

Last synced: 13 Apr 2026

https://github.com/rkarahul/ok.win-big-small-predictor

Predict the next “Big” or “Small” outcome on the OK.Win lottery-style game using OCR + time-series features + ML.

joblib numpy opencv-python paddleocr paddlepaddle paddlepaddle-gpu pandas python sckit-learn

Last synced: 13 Apr 2026

https://github.com/preethiphilip/prosper-loan-data-analysis

This project explores the Prosper loan dataset to uncover trends, borrower behaviors, and factors influencing loan performance. Using Python and data visualization techniques, the analysis examines key variables such as credit scores, loan status, income ranges, and debt-to-income ratios to gain insights into credit risk and lending patterns.

jupyter-notebook matplotlib numpy pandas python3 seaborn

Last synced: 13 Apr 2026

https://github.com/vishal-verma-96/pre-owned-car-price-prediction-using-streamlit-app

Capstone project by Skill Academy: exploratory analysis, visualization, and prediction of used car prices. Deploying the highest-scoring model with a Streamlit web app.

data-analysis data-science jupyter-notebook machine-learning machine-learning-algorithms matplotlib numpy pandas python3 regression-algorithms scikit-learn seaborn streamlit

Last synced: 11 Apr 2026

https://github.com/miss-mhv/udacityproject_dataviz

Data analysis of a bicycle rental system. The goal is to understand how trips are made according to age, time of year, day of the week, and time of day.

jupyter-notebook matplotlib numpy pandas python3 seaborn-plots

Last synced: 13 Apr 2026

https://github.com/sunilsm7/numpy-starter

IPython Notebook(s) demonstrating NumPy functionality.

numpy python

Last synced: 13 Apr 2026

https://github.com/hitthecodelabs/weatherforecast-lstm

TensorFlow and Python for analyzing and forecasting weather data

cnn keras lstm matplotlib numpy pandas python tensorflow weather weather-forecast

Last synced: 11 Apr 2026

https://github.com/harmanveer-2546/nasa-asteroid-classification

Classifying whether an asteroid is hazardous or not.

eda matplotlib numpy pandas python seaborn visualization xgboost

Last synced: 11 Apr 2026