An open API service indexing awesome lists of open source software.

NumPy

NumPy is an open source library for the Python programming language, adding support for large, multidimensional arrays, and matrices, along with a large collection of high-level mathematical functions to operate on these arrays.

https://github.com/vedanty3/supermarket-sales-data-analysis

This project contains data visualization techniques (using pandas and matplotlib) to explore different aspects of supermarket sales data of 3 months.

data-analysis data-science jupyter-notebook matplotlib numpy pandas python

Last synced: 08 May 2026

https://github.com/harmanveer-2546/predicting-schizophrenia-disorder

The positive symptoms typical of schizophrenia – such as delusions, hallucinations or formal thought disorders – often first appear in an attenuated or transient form during the initial prodromal phase

boxplot decisiontreeregressor disorder linearregression matplotlib mean-squared-error numpy pairplot pandas prediction randomforestregressor schizophrenia seaborn train-test-split visualization

Last synced: 13 Apr 2026

https://github.com/zvdy/movie_recommendation

Movie Recommendation Search Engine using Jupyter Notebooks, Pandas, Nnmpy, SciKit Learn, IPyWidgets

data-science jupyter-notebook machine-learning numpy pandas python scikit-learn

Last synced: 13 Apr 2026

https://github.com/mayankmittal29/algovision-statistical_methods_in_ai

Implementation of various machine learning algorithms from scratch, including Linear Regression, K-Nearest Neighbors, Decision Trees, and K-Means clustering. Also done EDA on data, Implemented LSH, IVF, SLIC algorithms also with evaluation metrics

decision-tree-classifier eda gradient-descent image-segmentation ivf knn-classification linear-regression lsh-implementation matplotlib-pyplot numpy pandas python3 seaborn sgd-optimizer sklearn slic-superpixel-algorithm

Last synced: 11 Apr 2026

https://github.com/vishal815/python-based-flight-data-scraping-automating-data-collection-for-analysis

This project is a Python-based web scraping script designed to extract flight details from the Yatra travel website. The extracted data includes flight pricing, origin, destination, departure and arrival times, and duration. The data is then saved into a CSV file for further analysis.

beautifulsoup flight-data-analysis flight-data-visualization numpy pandas pyhton pyplot scraping scraping-python seaborn selenium vishal-lazrus vishallazrus webscraping webscraping-projects

Last synced: 18 Apr 2026

https://github.com/animesh-chourey/loan-classifier

Trained machine learning algorithms (Logistic Regression, KNN, SVM, Decision Tree) specifically, after performing visualization and pre-preocessing tasks on a loan dataset. Executed the evaluation metrics such as F1-score, Log loss and jaccard-similarity score to assess the algorithms performance.

decision-tree f1-score jaccard-similarity knn logistic-regression logloss matplotlib numpy pandas preprocessing-data svm

Last synced: 06 May 2026

https://github.com/a-parida12/numericalmethods

Teaching Material for Workshop for Engineering Students

numpy ode pde python3

Last synced: 13 Apr 2026

https://github.com/tromesh/attendance-predictor

Student Attendance predictor for Scheduling lecture sessions

matplotlib numpy pandas tensorflow

Last synced: 05 May 2026

https://github.com/dbriane208/python-for-data-science

Machine Learning and Data Science repository. Love crafting Machine Learning models.

data-analysis data-science data-visualization machine-learning numpy pandas python seaborn

Last synced: 13 Apr 2026

https://github.com/udacity-machinelearning-internship/support-vector-machine

Implementing SVM's using pandas and sklearn in python

numpy pandas sklearn support-vector-machines svm

Last synced: 13 Apr 2026

https://github.com/ot-code/coca-cola-stock-prediction

This repo compares four predictive models—Linear Regression, ARIMA, XGBoost, and LSTM—to forecast Coca‑Cola FEMSA stock closing prices using Python and five years of historical data.

arima csv linear-regression lstm-neural-networks mae matplotlib mse numpy pandas python r2 scikit-learn seaborn tensorflow-keras xgboost

Last synced: 13 Apr 2026

https://github.com/5hraddha/eda-instacart-customers-shopping-habits

In this Exploratory Data Analysis (EDA) project we'll clean up the data and prepare a report that gives insight into the shopping habits of Instacart customers.

data-visualization exploratory-data-analysis instacart matlpotlib numpy pandas

Last synced: 13 Apr 2026

https://github.com/arrnikhilchauhan/book-recommendation

This Book Recommendation System is designed to provide personalized book suggestions based on user ratings and interaction. By leveraging the power of Pandas and NumPy, it filters and processes data efficiently, ensuring that recommendations are relevant and tailored to the user’s preferences

numpy pandas-library python

Last synced: 15 Apr 2026

https://github.com/nirmaldeepponnada/codeclauseinternshipproject2

Python, NLTK, Scikit-Learn, Pandas, NumPy, Pickle, SciPy, and JSON are used for text preprocessing, feature engineering, multi-label classification, and model persistence.

nltk numpy pandas pickle python scikit-learn scipy

Last synced: 07 Apr 2026

https://github.com/mastermindromii/car-price-prediction-model

Here is My Regression Project based on Predicting Price of Car using Linear Regression.

linear-regression matplotlib numpy pandas python scikit-learn seaborn

Last synced: 13 Apr 2026

https://github.com/dharma-acha/imageclassification

This project is an interactive Streamlit web application using the VGG-13 model to classify images from the CIFAR-10 dataset. Users can upload images to receive real-time predictions and visual explanations of the model's decisions. The goal is to accurately classify images into one of the ten CIFAR-10 classes: airplanes, automobiles, birds, cats,

colab-notebook matplotlib numpy pandas python3 pytorch scikit-learn seaborn streamlit

Last synced: 13 Apr 2026

https://github.com/shreeparab1890/heart-attack-analysis-prediction-accuracy-85.24

This iPython Notebook implements the Analysis and Predictions for Heart Attack. The Analysis is done on the dataset to identify the key features which has a impact on Heart Attack. After the analysis and data cleaning, the dataset is used to train the model using different ML algorithm.

eda exploratory-data-analysis matplotlib numpy pandas plotly prediction python scikit sklearn

Last synced: 13 Apr 2026

https://github.com/hilalozdemirbuyukasik/deep-learning

A collection of deep learning projects demonstrating RNNs, BiLSTMs, CNNs, and basic neural networks applied to time series forecasting, text sentiment analysis, image classification, and tabular data tasks, with examples of data preprocessing, model training, evaluation, and visualization.

bilstm cnn keras matplotlib nn numpy rnn scikit-learn tensorflow

Last synced: 12 Apr 2026

https://github.com/railab/nxscli-np

Nxscli extention with Numpy operations

numpy nxscli

Last synced: 21 Apr 2026

https://github.com/vedikasnehil/daily-leetcode-

Daily LeetCode Using Python is a project dedicated to solving coding challenges consistently using Python. It focuses on improving problem-solving skills, mastering Python techniques, and building a collection of clean, well-documented solutions. Perfect for interview preparation, learning algorithms, or daily coding practice!

matplotlib numpy pandas python python3 seaborn

Last synced: 13 Apr 2026

https://github.com/bkellmayer6/pybelikov

Efficient high-precision computation of fully normalized associated Legendre functions (fnALFs) using the Belikov recurrence algorithm

fully-normalized-associated-legendre-functions geodesy gravity-field legendre-functions mathematics numba numpy python scientific-computing spherical-harmonics

Last synced: 14 Jan 2026

https://github.com/tkc/node-neural-network

Built with only pure typescript, neural network iris sample.

javascript machine-learning neural-network node numpy typescript

Last synced: 09 Apr 2026

https://github.com/lingesh81051/similar-template-document-matching-and-fraud-detection

An automated system for a health insurance company to streamline document processing, including template matching and fraud detection, resulting in reduction of processing time.

numpy opencv opencv-python pillow pymupdf pytesseract pytesseract-ocr python tkinter

Last synced: 13 Apr 2026

https://github.com/theblueberrycompendium/keras-movie-classification

This is an basic application that uses Keras

ai keras-tensorflow numpy python3 simple-project

Last synced: 13 Apr 2026

https://github.com/fimbres/matplotlib-start

A crash project with the most important matplotlib functions and methods, used in data science, AI and machine learning.

data-science matplotlib matplotlib-pyplot numpy pandas python python-script

Last synced: 13 Apr 2026

https://github.com/swatisinghit/treadmill-customer-profiling-for-aerofit

Create comprehensive customer profiles for each AeroFit treadmill product through descriptive analytics. Develop two-way contingency tables and analyze conditional and marginal probabilities to discern customer characteristics, facilitating improved product recommendations and informed business decisions.

analytics conditional-probability data-analysis data-science data-visualization eda numpy pandas probability statistics

Last synced: 08 May 2026

https://github.com/ejw-data/python-bitwise-operators

Difference between bitwise and logical operators when using pandas

numpy pandas

Last synced: 08 May 2026

https://github.com/raveenaross/bulldozer-price-prediction

Predicting the sale price of bulldozers using machine learning

matplotlib numpy pandas sklearn time

Last synced: 07 Oct 2025

https://github.com/thkox/uni_2023-content-based-image-retrieval

This project is a team assignment for the "Image Analysis" course, offered in the 7th semester of the 2023 academic year at the University of Piraeus, Department of Informatics.

7th-semester cartesian-product content-based-image-retrieval department-of-informatics hyperedge-similarity-calculation hypergraph image-analysis matplotlib networkx numpy python rank-normalization scipy torchvision university-of-piraeus

Last synced: 08 Oct 2025

https://github.com/manjotkaurgill/agritech

Enter details of your soil and weather, and find best suitable crop for farming. With our advanced AI system, you can make informed decisions and optimize your agricultural practices.

flask generative-ai insight-generation machine-learning matplotlib mongodb nextjs numpy pandas python scikit-learn seaborn

Last synced: 12 Apr 2026

https://github.com/lewis-morris/image_augment

Image augmentation for CNN's. Aiming to increase generalization.

augmentation image numpy python

Last synced: 18 Feb 2026

https://github.com/ahmedabdalkreem/lung-cancer

It help to know the patient have lung cancer or not and show what depenent in this disease that lead to this disease.

matplotlib numpy pandas python sklearn-library

Last synced: 14 Apr 2026

https://github.com/armahdavi/analytics_statistics_ml_plotting_dust_extraction_hvac_filters_ph2

PhD Technical Paper 1 - Phase 2 - Mahdavi & Siegel (2020) (Aerosol Science & Technology; AS&T) - Sharing all the data pipelines, processing codes, descriptive statistics, statistical modellings, and plotting/visualizations - Project Miestone: 2017 - 2020 - Full-length article is available

data-pipelines data-science data-visualization machine-learning matplotlib-pyplot numpy pandas-dataframe python scipy-stats sklearn statistics

Last synced: 14 Apr 2026

https://github.com/lorenzorottigni/dl-houses

Deep Learning python bootcamp: exploratory data analysis & deep learning on houses price dataset

deep-learning ipynb machine-learning numpy pandas python scikit-learn seaborn tensorflow

Last synced: 11 Apr 2026

https://github.com/03-bunny-06/japan-heart-attack-analysis

This project analyzes heart attack data in Japan, focusing on gender-based differences in risk factors such as smoking, diabetes, alcohol consumption, and regional variations. The dataset is cleaned and processed using Pandas and NumPy, extracting insights on affected individuals based on lifestyle and health conditions.

data-cleaning dataframes numpy pandas python risk-analysis

Last synced: 20 Apr 2026

https://github.com/naveen88112/vehicle-routing-problem

Vehicle Routing Problem (VRP) Solver This project applies a Genetic Algorithm (GA) to optimize the Vehicle Routing Problem (VRP). This is done for more than one vehicle to ensure delivery routes that save travel distance and increase efficiency.

deap genetic-algorithm matplotlib numpy python vehicle-routing-problem

Last synced: 13 Apr 2026

https://github.com/amritendunath/appointmate_1.0

A LangGraph-powered multi-agent/supervisor system for appointment scheduling. It enables users to interact in natural language to check doctor availability, book, reschedule, or cancel appointments.

fastapi langchain langgraph nltk numpy pandas psycopg2 python sqlalchemy uvicorn websocket

Last synced: 07 Apr 2026

https://github.com/shoaib1522/introduction-to-data-science

Unlock the world of data science with hands-on tutorials, interactive notebooks, and essential resources! Whether you're a beginner or looking to refine your skills, this repo offers a comprehensive guide to mastering Python, Git, NumPy, Pandas, and more—empowering you to analyze, visualize, and transform data like a pro! 🚀📊💻

confusion-matrix github-config kaggle matplotlib numpy pandas python3 script visualization

Last synced: 14 Apr 2026

https://github.com/rakesh3596/apple-phone-sales-analysis

Apple Phone Sales Analysis using Python tools like Pandas, Matplotlib, Numpy, Plotly, Apple csv file

csv-files csv-reader matplotlib numpy pandas plotly python

Last synced: 14 Apr 2026

https://github.com/farhannirzhor/python_eda_project_on_diwali_sales_analysis

In this project, I was given a sales data. I analyzed their sales data and shared a summary

data-cleaning data-manipulation exploratory-data-analysis matplotlib numpy pandas python seaborn

Last synced: 04 May 2026

https://github.com/chandkund/housing-price-prediction

Predict housing prices using the Boston Housing Dataset. Covers data loading, cleaning, preprocessing, EDA, normalization, standardization, and regression models (Linear Regression, Decision Tree, Random Forest, Extra Trees). Evaluated with Mean Squared Error (MSE). Tech: Python, Pandas, NumPy, Scikit-learn, Seaborn, Matplotlib.

data-science data-visualization matplotlib numpy pandas pyhton sklearn sklearn-library sklearn-metrics

Last synced: 21 Jan 2026

https://github.com/salma-mamdoh/the-android-app-market-on-google-play-project

My project aims to practice Data Analysis and Data Visualization on DataCamp

data-analysis data-visualization datacamp jupyter-notebook matplotlib numpy pandas python seaborn

Last synced: 12 Apr 2026

https://github.com/alisonmitchell/titanic

Exploration of a subset of the Titanic passenger manifest to create a predictive classification model to determine which passengers were more likely to survive.

deep-learning keras machine-learning matplotlib numpy pandas python scikit-learn scipy seaborn tensorflow

Last synced: 14 Apr 2026

https://github.com/lorenzorottigni/ml-breast-cancer

Machine Learning python bootcamp: Support Vector Machines using breast cancer dataset

ipynb machine-learning numpy pandas python scikit-learn seaborn support-vector-machines

Last synced: 14 Apr 2026

https://github.com/ricardorobledo/paymentcardfrauddetection2025

Comparative analysis of probabilistic classification models for credit card fraud detection, focusing on model calibration and threshold optimization in highly imbalanced datasets.

imbalanced-learn matplotlib numpy pandas python3 scikit-learn search

Last synced: 14 Apr 2026

https://github.com/sultanazhari/game-trend

Ice as a online store Video Games want to know a new trend of video game to help them prepare a video games which is more profitable to sell. As a data Analyst we're giving a trend of video game by genre, platform, console and the others clasification.

matplotlib-pyplot numpy pandas python3 scipy-stats video-game

Last synced: 11 Apr 2026

https://github.com/gayatri018/ab-testing-marketing-campaign

This project involves conducting A/B testing to evaluate the performance of two marketing campaigns: Facebook Ads and Google Adwords for an online retail business. The primary goal is to determine which campaign is more effective in driving user engagement and conversions, helping the company optimize its marketing strategies.

ab-testing linear-regression matplotlib numpy pandas python scipy seaborn sklearn-

Last synced: 10 Apr 2026

https://github.com/nisch-mhrzn/house_prediction

This project predicts house prices using data exploration, feature engineering, and machine learning models like Linear Regression and Random Forest. It demonstrates how to optimize models and evaluate their performance to accurately forecast house prices.

matplotlib numpy pandas python scikit-learn seaborn

Last synced: 08 Apr 2026

https://github.com/polyee99/kaggle-titanic-data-analytics

Jupiter notebook to predict the outcome of passengers who died or not in the tragical Titanic event.

data eda jupiter-notebook matplotlib numpy pandas python regression-analysis test-train-split visualization

Last synced: 05 Feb 2026

https://github.com/5hraddha/ice-video-games-sales-analysis

Analysis of video game sales data of an online store Ice, which sells video games all over the world & identify patterns that determine whether a game succeeds or not.

data-visualization matplotlib numpy pandas requests-module scipy seaborn

Last synced: 14 Apr 2026

https://github.com/sralter/potential_talents

Using NLP techniques (word and sentence embedding tools like SBERT and Learning-to-Rank systems like RankNet and LambdaRank) to rank candidates.

lambdarank learning-to-rank lightgbm matplotlib nlp numpy pandas python pytorch ranknet

Last synced: 09 Apr 2026

https://github.com/al-ghaly/stock-market-simulation

Simulate and visualize stock market behavior

matplotlib numpy python python-visualization

Last synced: 16 May 2026

https://github.com/alansteinbarth/irysy

🪻 Irysy 🌼 eksploracyjna analiza danych – wizualizacje, ML, transparentny kod i autorskie grafiki. Projekt łączący AI, estetykę i praktyczne umiejętności!

ai classification clean-code data-science exploratory-data-analysis feature-engineering github-pages iris-dataset jupyter-notebook machine-learning matplotlib modern-ui numpy open-source pandas portfolio-project python reproducible-research seaborn visualization

Last synced: 17 Oct 2025

https://github.com/mbarbetti/hello-snakemake

Simple implementation of snakemake to sample and plot data

hello-world matplotlib numpy python snakemake

Last synced: 14 Apr 2026

https://github.com/deepcloudlabs/dcl702-2021-jul-12

DCL-702: Data Analytics using Python

data-analytics numpy pandas python3

Last synced: 05 May 2026

https://github.com/yulia-momotyuk/dla-data-analysis-practice

This repository contains my homework assignments completed during the "Data Analyst in IT" course at Data Loves Academy.

analytics data-analysis data-visualization excel mysql numpy pandas postgres powerbi python seaborn sql tableau

Last synced: 14 Apr 2026

https://github.com/juzershakir/predicting_boston_housing_prices

Builded a model to predict the value of a given house in the Boston real estate market using various statistical analysis tools. Identified the best price that a client can sell their house utilizing machine learning.

bias-variance boston-housing-price-prediction data-exploration decision-tree-regression gridsearchcv k-fold machine-learning matplotlib mlfnd model-evaluation model-validation numpy pandas python3 r2-score sklearn supervised-learning udacity-nanodegree

Last synced: 22 Oct 2025

https://github.com/luliatuccu/weather_analysis

This project highlights a combination of data science techniques and Python programming to explore real-world weather data.

data-preprocessing eda feature-engineering machine-learning matplotlib numpy pandas regex scikit-learn seab seaborn weather weather-patterns

Last synced: 02 Apr 2026

https://github.com/manuel-lang/numpymongo

A python package to export NumPy data to MongoDB

mongodb numpy wrapper

Last synced: 23 Feb 2026

https://github.com/jsd115/tensorflow_bankfrauddetection

This repository aims to provide examples of my ability to produce ML models using tensorflow and sklearn

numpy pandas sklearn tensorflow

Last synced: 06 Feb 2026

https://github.com/hansalemaos/cv2_stack_images

Stacks 2 images (horizontal/vertical)

concat cv2 images numpy python stack

Last synced: 28 Jan 2026

https://github.com/shahaba83/airplane-ticket-cancellation

In this project, we try to predict the possibility of canceling the plane ticket by the buyer

datatime numpy pandas python scikit-learn seaborn

Last synced: 25 Feb 2026

https://github.com/muhkartal/fr-photo

face-recognition system works on images first of all you need to register valid image as recognition

face-recognition numpy opencv pillow python streamlit

Last synced: 15 Apr 2026

https://github.com/stavrosarm/mathematics_algorithms

Fundamental algorithms of Linear Algebra , Numerical Analysis and Calculus in Python and Julia.

calculus julia-language linear-algebra numerical-analysis numpy python

Last synced: 30 Jan 2026

https://github.com/moh-27sl/image-alignment--python-edition

Alignment of an unaligned image with a base/reference image using feature detection, feature matching, and homography in OpenCV.

ai classical-computer-vision classical-image-processing computer-vision computervision image-alignment image-processing imageprocessing neural-network numpy opencv python python3

Last synced: 15 Apr 2026

https://github.com/karishmagupta05/airbnb-data-scraping-and-analysis

This project involves web scraping Airbnb listings using Python and BeautifulSoup, followed by data analysis with Pandas. The goal is to extract meaningful insights from the data, clean and preprocess it, and visualize the results using libraries like Matplotlib and Seaborn.

beautifulsoup4 matplotlib numpy pandas python scraping seaborn visualization

Last synced: 15 Apr 2026

https://github.com/dipeshgoyal013/salary-data-analysis

Salary Analysis according department and agency.

analysis matplotlib numpy pandas salary sklearn-library

Last synced: 15 Apr 2026

https://github.com/samiyaalizaidi/nn-ml-homeworks

Homework solutions for CPE-4903: Neural Networks & Machine Learning at Kennesaw State University.

machine-learning machine-learning-workflow neural-networks numpy scikit-learn

Last synced: 15 Apr 2026

https://github.com/arv-anshul/campusx-graphs

Contains all the graphs provided by CampusX to learn Statistics and Algebra with some added feature.

algebra campusx-dsmp data-science matrix numpy statistics streamlit vizualisation

Last synced: 15 Apr 2026

https://github.com/bineet-ratna-shakya/data-science-salary-analysis

analyzing a dataset containing salaries of data science professionals from 2020 to 2023.

data-analysis data-science data-visualization jupyter numpy pandas python

Last synced: 01 Feb 2026