An open API service indexing awesome lists of open source software.

NumPy

NumPy is an open source library for the Python programming language, adding support for large, multidimensional arrays, and matrices, along with a large collection of high-level mathematical functions to operate on these arrays.

https://github.com/nirmaldeepponnada/codeclauseinternshipproject2

Python, NLTK, Scikit-Learn, Pandas, NumPy, Pickle, SciPy, and JSON are used for text preprocessing, feature engineering, multi-label classification, and model persistence.

nltk numpy pandas pickle python scikit-learn scipy

Last synced: 21 Feb 2025

https://github.com/jcm-ai/robust-mlops-elevating-machine-learning-to-production-scale

This project aims to address the challenge faced by the Office of Foreign Labor Certification in managing a high volume of job certification applications for employers seeking to hire foreign workers in the United States. By leveraging ML models, this project seeks to streamline the process of shortlisting visa applicants based on historical data.

aws-ec2 aws-ecr aws-s3 cicd-pipeline data-science docker-image end-to-end exploratory-data-analysis git github-actions html machine-learning-models mlops mlops-project model-deployment mongodb numpy pandas python yaml

Last synced: 30 Dec 2025

https://github.com/ditikrushna/identify-customer-segments

In this project, Bertelsmann partners AZ Direct and Arvato Financial Solutions have provided two datasets one with demographic information about the people of Germany, and one with that same information for customers of a mail-order sales company. I have looked at relationships between demographics features, organized the population into clusters, and saw how prevalent customers are in each of the segments obtained.

clustering clustering-algorithm numpy pca

Last synced: 01 Mar 2025

https://github.com/ahmed-maher77/diabetes-prediction-app-using-machine-learning

Diabetes Prediction: Using machine learning to classify individuals as diabetic or non-diabetic based on health data, enabling early intervention and improved healthcare outcomes.

ai css data-science gradientboostinclassifier javascript logisticregression machine-learning matplotlib numpy pandas python randomforestclassifier seaborn streamlit supportvectormachine webdevelopment

Last synced: 06 Jul 2025

https://github.com/tnleite/real-estate-opportunities-analysis

Este repositório apresenta uma análise de oportunidades no mercado imobiliário, combinando séries temporais, clusterização e previsões para identificar estados com maior potencial de crescimento e orientar estratégias de expansão eficientes.

catboostregressor cluster-analysis data-science kmeans-clustering lightgbm-regressor machine-learning-algorithms numpy regression-models scikit-learn xgboost-regression

Last synced: 10 Apr 2025

https://github.com/sumitkundu102022/ecobuddy

EcoBuddy: Personalized Carbon Footprint Tracker

bcrypt numpy pandas plotly pymongo python-dotenv requests streamlit

Last synced: 29 Aug 2025

https://github.com/takana671/partiallytransparentimage

A simple tool to make an image partially transparent

geomipterrain numpy opencv pillow python3 transparent-image

Last synced: 17 Mar 2025

https://github.com/dmkk01/mlp-python

Implementation of a multilayer perceptron using Pytorch and Numpy libraries

mlp numpy pytorch

Last synced: 28 Feb 2025

https://github.com/jeniljani-4444/end-to-end-car-price-prediction-model

Predict car prices effortlessly using this machine learning model. Built with Python and Scikit-learn it analyzes features like mileage age brand and more to estimate accurate prices. Perfect for buyers sellers and dealerships.

machine-learning matplotlib numpy pandas python scikit-learn seaborn streamlit

Last synced: 26 Mar 2025

https://github.com/samankhamesian/rating-prediction-system

This project is an implementation of simple rating prediction systems for items from user

latent-factor-model machine-learning numpy pattern-recognition python rating-prediction

Last synced: 17 Mar 2025

https://github.com/harshs472/real-time-object-detection

Real time object detection that detects approximately 80 objects!!!!

coco numpy python yolov3

Last synced: 12 Mar 2025

https://gitlab.com/DLR-SY/mcodac

MCODAC is a Fortran library for the numerical evaluation of fiber composite damage. The library contains analysis methods specifically tailored to fiber composites, from micromechanical homogenization approaches to macroscopic fatigue models of orthotropic multilayer composites.

fortran numpy python

Last synced: 02 Apr 2025

https://github.com/prajakta1321/exoplanet-atmospheric-characterization-1

A machine learning project to classify exoplanets using light curve image data. Developed as part of the ML4SCI GSoC 2025 Test Task. Includes data processing, CNN-based model, and full report.

classification colab-notebook dbscan gsoc-2025 machine-learning-algorithms matplotlib-python ml numpy open-source pca-analysis python3 seaborn

Last synced: 10 Apr 2025

https://github.com/0nom4d/306radiator

Mathematical EPITECH Project - This project reproduces the behaviour of a radiator placed at a position in a room of a given size.

epitech epitech-project numpy pytest python3

Last synced: 01 Apr 2025

https://github.com/subratamondal1/heart-attack-prediction

Heart Attack Prediction of patients based on the required data. Data Ingestion - Data Preparation - Exploratory Data Analysis (EDA) - Modelling - Evaluation.

data-analysis data-science data-visualization kaggle-dataset machine-learning matplotlib-pyplot numpy pandas python3 scikit-learn seaborn

Last synced: 01 Mar 2025

https://github.com/aiwithqasim/ab-testing-for-shoefly.com

In this project, I had analyzed the data set of SHOEFLY.COM to find out valuable insights to check which platform is more useful for advertisement.

numpy pandas python3 sqlite3

Last synced: 17 Mar 2025

https://github.com/dmarks84/coursework_capstone_spacex_predictions

Final Project for IBM Data Science Professional Certificate -- Applied all skills and methods utilized in the series of courses for this certification to predict the success of SpaceX landings; issued full report to stakeholders

api classification dash eda folium linear-algebra matplotlib mysql numpy pandas plotly probability python seaborn sql statistics supervised-ml technical-writing web-scraping

Last synced: 09 Apr 2025

https://github.com/shhiivvaam/regression

Linear Regression Model

linear-regression numpy python

Last synced: 26 Mar 2025

https://github.com/anikdascodes/fantasy-cricket-project

🏏 Python simulation for generating 20K+ unique fantasy cricket teams with probability-based player selection and constraint optimization. Advanced algorithms, data analysis & visualizations included.

algorithm constraint-satisfaction cricket data-science fantasy-sports jupyter machine-learning numpy optimization pandas plotly probability python simulation sports-analytics

Last synced: 07 Sep 2025

https://github.com/abdullah2020/hamoye_stageb

This is my Hamoye Stage B project. The project focuses on Predicting Energy Efficiency of Buildings. It implemented different Machine Learning algorithm technique that are not limited to Linear Regression, LASSO, Ridge etc.

eda lasso-regression linear-regression numpy pandas predictive-modeling regression-models ridge-regression rmse rsquare-values

Last synced: 26 Mar 2025

https://github.com/pereira-elizabeth/aah-2d-hermitian

Minimal 2D AAH model: build, diagonalize, and plot spectrum (NumPy/Matplotlib)

condensed-matter numpy physics scipy tight-binding

Last synced: 05 Oct 2025

https://github.com/rkschroeder/portfolio

This repository contains my portfolio of data science projects.

matplotlib numpy pandas sckit-learn seaborn

Last synced: 05 Oct 2025

https://github.com/youngermaster/coulomb-superposition-principle

Principio de Superposición de Coulomb y Ley de Biot-Savart

coulomb-law matplotlib numpy simulation

Last synced: 05 Oct 2025

https://github.com/limix/ndarray-listener

Implementation of the Observer pattern for NumPy arrays.

numpy observer-pattern

Last synced: 12 Dec 2025

https://github.com/erick957/saleprice-prediction-dataset-analysis-and-cleaning-advance-regression

🏠 Predict house prices using advanced regression techniques with this comprehensive analysis and cleaning project, from data loading to model deployment.

data-analysis data-science eda google-colab machine-learning numpy pandas python scikit-learn scikit-learn-python

Last synced: 05 Oct 2025

https://github.com/zuhairzia/customer-segmentation

📖 About Customer Segmentation using KMeans clustering to analyze demographics, income, and spending. Helps businesses with targeted marketing and customer insights.

joblib matplotlib numpy pandas scikit-learn seaborn

Last synced: 05 Oct 2025

https://github.com/slovnicki/matematicki-softver

homeworks and projects for Mathematical Software course

ipython jupyter latex markdown mathplotlib numpy pandas scikit sympy

Last synced: 06 Oct 2025

https://github.com/sweta-kaundilya/finance-data-project

This data project focuses on exploratory data analysis of bank stock prices throughout financial crisis all the way to early 2016.

financial-analysis jupyter-notebook matplotlib numpy pandas-python seaborn

Last synced: 06 Oct 2025

https://github.com/dina-hosny/import-preprocess-and-visualize-a-dataset-project

A simple project to practice importing a dataset, data cleaning and preparation processes, and visualize the results to answer some given questions.

data-cleaning data-engineering data-science data-visualization jupyter-notebook matplotlib numpy pandas python

Last synced: 07 Oct 2025

https://github.com/sudharshan-bathini/chat-bot-using-cornell-movie-data-set

Developed an AI chatbot using the Cornell Movie Dialogs dataset; trained an LSTM-based model to generate context-aware, conversational responses for user inputs.

keras lstm-sentiment-analysis neural-networks nlp numpy python tensorflow tokenizer

Last synced: 30 Dec 2025

https://github.com/alfastrek/textinsight-pro

Webscraper with Natural Language Processing

beautifulsoup jupyter-notebook nltk numpy pandas python webscraping

Last synced: 07 Oct 2025

https://github.com/18mahi/tweet-sentiment-analysis

Classify tweets into happy, sad, angry, excited, and neutral with this interactive Python model. Combining TF-IDF text features with engineered numeric features like emoji sentiment, polarity, subjectivity, and punctuation counts, it demonstrates intermediate-level NLP, feature engineering, and visualization skills.

data-science data-visualization jupyter-notebook machine-learning matplotlib nlp numpy pandas sklearn string textblob

Last synced: 07 Oct 2025

https://github.com/alexkhype/04python-carpricepredictor

Modelo de machine learning para predecir el valor de mercado de autos usados, optimizando precisión y eficiencia.

catboost data-science lightgbm machine-learning numpy pandas python sklearn statistics xgboost

Last synced: 30 Dec 2025

https://github.com/nimaabediforud/heart-disease-prediction

The project highlights the end-to-end workflow of data preprocessing, model building, evaluation, and comparison for a healthcare prediction task.

biomedical-applications biomedical-data-science biomedical-engineering classification deep-learning machine-learning neural-networks numpy pandas prediction python seaborn sklearn tensorflow

Last synced: 07 Oct 2025

https://github.com/ricardorobledo/malicious_server_hack_detection

Predictive model to detect malicious hacking patterns in banking servers. Utilizes advanced Machine Learning techniques such as SMOTE, Gradient Boosting, and probability calibration to predict attacks befor

anaconda cibersecurity imbalanced-data imbalanced-learning imblearn kaggle matplotlib numpy pandas pandas-library python3 sklearn

Last synced: 07 Oct 2025

https://github.com/anuppm9917/super-store-sales-analysis-power-bi-project

My drive to know which products, regions, categories and customer segments a company should target or avoid, I search and selected an appropriate dataset on kaggle which will match a standard superstore requirement.

data data-analysis data-visualization datacleansing excel exploratory-data-analysis jupyter-notebook numpy pandas plotly powerbi python3

Last synced: 30 Dec 2025

https://github.com/nicolashaubricht/analise-dados-pib-per-capita

Este projeto tem como objetivo a análise de dados sobre as variações do PIB per capita em diversos países ao longo dos anos.

numpy pandas plotly python

Last synced: 07 Oct 2025

https://github.com/pedasoft-consult/employee-attrition-prediction

This project implements a machine learning pipeline for predicting employee attrition using HR data. The system analyzes various factors that contribute to employee turnover and provides predictions to help HR departments take preventive measures.

joblib numpy pandas python3 sklearn streamlit

Last synced: 30 Dec 2025

https://github.com/cosmicsid004/air_draw

Digital drawing....

handtracking mediapipe numpy opencv

Last synced: 08 Oct 2025

https://github.com/bharath-tars/streamlit_diabsynth

Predictive Health Analytics for Diabetic Risk Assessment and Personalized Reporting WebApp using Streamlit

deeplearning mahine-learning numpy onrender-deploy pandas seaborn streamlit

Last synced: 08 Oct 2025

https://github.com/khushneetsingh/datasanity

DataSanity is a AI-powered web application for dataset cleaning, synthetic data generation, vectorization, and data enrichment using natural language prompts.

cerebrus exa faiss-vector-database llm nextjs numpy pandas serperdev sqlite tailwindcss

Last synced: 08 Oct 2025

https://github.com/davityak03/object-detection-in-images-using-yolo

This project demonstrates object detection using the YOLO (You Only Look Once) model, providing a solution for detecting and labeling multiple objects within images or video streams. This implementation covers image preprocessing, model inference, and visualization of detected objects.

computer-vision matplotlib-pyplot numpy object-detection opencv-python pandas python yolo

Last synced: 18 Jan 2026

https://github.com/harshavardh8n/machine-learning-projects

Machine Learning Projects: A collection of machine learning projects developed as part of the curriculum, utilizing libraries such as NumPy, Pandas, Seaborn, and Matplotlib. These projects demonstrate various algorithms, including Apriori and Decision Trees, showcasing practical applications and data analysis techniques.

apriori-algorithm decision-trees jupyter-notebook machine-learning numpy pandas python

Last synced: 18 Jan 2026

https://github.com/mkdirer/disaster-tweet-nlp-adaboost-transformers

A machine learning project that classifies tweets as disaster-related or non-disaster-related using transformers and the AdaBoost technique. Achieved an accuracy of approximately 80% on the validation set

geopy keras matplotlib numpy pandas python sklearn tensorflow

Last synced: 16 Oct 2025

https://github.com/amish5ingh/cricket-data-analytics-ipl

Data analysis and visualization of IPL 2022 matches using Python, Pandas, Matplotlib, and Seaborn. Includes insights on match outcomes, player performances, toss trends, and venue stats with 12+ charts.

data-analysis data-visualization ipl-data-analysis ipl-data-visualization jupiter-notebook matplotlib-pyplot numpy pandas python seaborn

Last synced: 09 Oct 2025

https://github.com/varun-khorgade/salesense-e-commerce-analytics

Analyzed e-commerce datasets to discover customer behavior trends and business insights. Performed EDA, data cleaning, and visualizations to help stakeholders make data-driven decisions.

customer-behavior-analysis dashboards datacleaning exploratory-data-analysis feature-engineering jupyter-notebook matplotlib numpy pandas python seaborn sql time-series-trend visulaization

Last synced: 09 Oct 2025

https://github.com/nikhleshshukla123/web-scraping-using-python

scrapes multiple pages of Amazon search results using python.

beautifulsoup4 numpy pandas python

Last synced: 09 Oct 2025

https://github.com/faisal-khann/ipl-analysis

The IPL Analysis project is a comprehensive data-driven exploration of the Indian Premier League (IPL), analyzing historical match data to uncover patterns in team performance, player statistics, and match outcomes.

data-analysis exploratory-data-analysis jupyter-notebook matplotlib numpy pandas seaborn

Last synced: 09 Oct 2025

https://github.com/exclowd/gg

A library for creating terminal based games in python.

numpy python

Last synced: 09 Oct 2025

https://github.com/fuodorov/python

Python Numpy Tutorial

numpy python

Last synced: 09 Oct 2025

https://github.com/pedronjorge/average-face

Generate an average face from a given set of images by detecting facial landmarks, aligning features, and blending the images using Delaunay Triangulation. Libraries used: Dlib, OpenCV, NumPy

dlib image-generation jupyter-notebook numpy opencv python3

Last synced: 10 Oct 2025

https://github.com/0nom4d/308reedpipes

Mathematical EPITECH Project - This project computes a representation of a reedpipe.

coverage epitech epitech-project matplotlib-pyplot numpy pytest python scipy

Last synced: 10 Oct 2025

https://github.com/chandkund/housing-price-prediction

Predict housing prices using the Boston Housing Dataset. Covers data loading, cleaning, preprocessing, EDA, normalization, standardization, and regression models (Linear Regression, Decision Tree, Random Forest, Extra Trees). Evaluated with Mean Squared Error (MSE). Tech: Python, Pandas, NumPy, Scikit-learn, Seaborn, Matplotlib.

data-science data-visualization matplotlib numpy pandas pyhton sklearn sklearn-library sklearn-metrics

Last synced: 21 Jan 2026

https://github.com/psgebeline/b2_plotter

A simple Python package for use by Belle II analysts to make various plots out of pandas dataframes and numpy arrays.

jupyter-notebook matplotlib matplotlib-pyplot numpy pandas pandas-dataframe python uproot visualization-tools

Last synced: 19 Jan 2026

https://github.com/akshay-kamath/anomaly_detection

This project addresses the challenge of identifying anomalies in sales data, aiming to optimize sales strategies by uncovering deviations from expected profit patterns.

anomaly anomaly-detection cblof flask isolation-forest knn matplotlib numpy pandas pyod python sklearn

Last synced: 19 Jan 2026

https://github.com/massimilianovisintainer/rock-vs-mine-prediction

Rock vs. Mine classification using Logistic Regression

machine-learning model numpy pandas sklearn

Last synced: 21 Jan 2026

https://github.com/andrewsy1004/logistic-regression-spam-classifier

This project implements a spam email classifier using Logistic Regression.

numpy pandas scikit-learn

Last synced: 11 Oct 2025

https://github.com/sultanazhari/game-trend

Ice as a online store Video Games want to know a new trend of video game to help them prepare a video games which is more profitable to sell. As a data Analyst we're giving a trend of video game by genre, platform, console and the others clasification.

matplotlib-pyplot numpy pandas python3 scipy-stats video-game

Last synced: 11 Oct 2025

https://github.com/samestrin/image-manipulation-api-digitalocean

A Python DigitalOcean App Platform based REST image manipulation API using Flask, NumPy, and OpenCV that runs in a Docker container.

api docker flask image image-processing numpy python python3 rest rest-api

Last synced: 11 Oct 2025

https://github.com/vyjayanthipolapragada/image_classifier_model_hotdog

Building an Image classifier model to train and test a dataset and classify the given images into hotdog and not-hotdog.

artificial-intelligence dataset image-classification image-processing machine-learning matplotlib neural-networks numpy pandas python pytorch tensor torchvision transfer-learning

Last synced: 12 Oct 2025

https://github.com/umesh-01/python

Python is a programming language that lets you work more quickly and integrate your systems more effectively. Python is used for web development, AI, machine learning, operating systems, mobile application development, and video games.

ai application-development artificial-intelligence deep-learning machine-learning matpotlib ml nlp numpy open-source pandas python python-framework python-libraries python3 web-development

Last synced: 12 Oct 2025

https://github.com/fedesgh/parkinson_volatility_spread_on_cedears

Creating a function that returns a graph with the difference between Parkinson's volatility and regular volatility given a certain bounds

numpy pandas pickle seaborn

Last synced: 12 Oct 2025

https://github.com/rajesh9943/developing-a-bike-buyer-sales-report-dashboard-using-excel

This dataset has details of 1000 users from different backgrounds and whether or not they buy a bike. This data can be used to build the dashboard in Google Sheets. There are some NA (Null / Empty) values injected in the dataset. Use this dataset for Data Cleaning, Exploration, and Visualization.

excel google-sheets numpy pandas-library python3

Last synced: 12 Oct 2025

https://github.com/szymon-budziak/real_estate_house_prices_prediction

Predicting real estate house prices using various machine learning algorithms, including data exploration, preprocessing, model training, and evaluation.

data-analysis data-preprocessing data-science eda jupyter-notebook machine-learning matplotlib numpy optuna pandas predictive-modeling price-prediction python random-forest regression scikit-learn seaborn

Last synced: 21 Jan 2026

https://github.com/vedanty3/supermarket-sales-data-analysis

This project contains data visualization techniques (using pandas and matplotlib) to explore different aspects of supermarket sales data of 3 months.

data-analysis data-science jupyter-notebook matplotlib numpy pandas python

Last synced: 13 Oct 2025

https://github.com/armahdavi/ai_ml_assisted_breast_cancer_tumor_detection

Consolidating tutorial codes for breast cancer tumor detection, covering ML fundamentals like classification, feature engineering, training, evaluation, and key performance metrics.

bias-variance feature-engineering logistic-regression machine-learning machine-learning-algorithms medical-application numpy pandas python random-forest recall-precision sklearn xgboost-classifier

Last synced: 14 Oct 2025

https://github.com/wasifsohail5/amusic-ai_powered_musicrecommendationsystem

AMUSIC is an AI-driven music recommendation system that helps users discover personalized songs. Using Python, Streamlit, and Scikit-learn, it offers smart recommendations, advanced search, and interactive music insights. Users can save favorites, create playlists, and export data for a seamless music discovery experience.

joblib k-nearest-neighbours matplotlib minmaxscaler numpy pandas pickle plotly python scikit-learn seaborn streamlit

Last synced: 14 Oct 2025

https://github.com/jinhx128/data_analysis

基于Matplotlib+Numpy+Pandas搭建的数据分析项目。

matpolotlib numpy pandas python

Last synced: 14 Oct 2025

https://github.com/seymencanates/pythonn

Here is my all personal educations , notes , basic apps I did. Here will be my backside for all supportive information about python I am gonna need.

django flask numpy pandas python3 scikitlearn-machine-learning tensorflow tkinter

Last synced: 15 Oct 2025

https://github.com/aishwaryagm1999/california-housing-prices-data-analysis

Performed Data Cleaning and Data Analysis of the California Housing Prices Dataset to find the relation between the housing prices at a block and the amenities and facilities stated in the dataset such as total number of rooms, ocean proximity etc.

data-analysis data-visualization matplotlib numpy pandas python seaborn

Last synced: 16 Oct 2025

https://github.com/jim-by/scrape_analysis_books

Scraping book data from the website books.toscrape.com and performing analysis on the collected data.

beautifulsoup json matplotlib numpy pandas python scipy scraping seaborn

Last synced: 30 Dec 2025

https://github.com/lmizner/codecademy_life_expectancy

Calculate quartiles, quantiles, and the inter-quartile range (IQR) for a variable

histogram jupyter-notebook matplotlib-pyplot numpy pandas python quantiles quartiles

Last synced: 02 Apr 2025

https://github.com/grandmoff100/pygrids

Dimensional Data Manipulation and Spreadsheet-like pretty grids.

arrays grids hacktoberfest numpy python spreadsheet

Last synced: 17 Mar 2025

https://github.com/hansalemaos/create_empty_image

Creates new/empty cv2 images/numpy arrays

cv2 numpy python

Last synced: 02 Apr 2025

https://github.com/chandkund/image-classification-using-the-mnist-dataset

Image Classification using the MNIST dataset. This project leverages a Convolutional Neural Network (CNN) to recognize and classify handwritten digits with high accuracy. Includes data preprocessing, model architecture, and evaluation. Explore the code and results here!

computer-vision data-science machine-learning matplotlib numpy pandas python

Last synced: 11 Dec 2025

https://github.com/suap-dev/python-essentials

a repository intended for people who want to know some python basics

lambda-functions numba numpy oop pandas teaching

Last synced: 17 Mar 2025

https://github.com/jersonrn/termviewer3d

Interactive terminal environment that allows users to explore 3D models in real-time.

3d ascii cupy gpu linear-algebra numpy python

Last synced: 17 Mar 2025