An open API service indexing awesome lists of open source software.

NumPy

NumPy is an open source library for the Python programming language, adding support for large, multidimensional arrays and matrices, along with a large collection of high-level mathematical functions to operate on these arrays.

https://github.com/vasu7052/recognizing-handwritten-digits

This is a machine learning project created in Python using Neural Networks and Supervised Learning Algorithms.

machine-learning machine-learning-algorithms numpy python scikit-learn

Last synced: 13 Apr 2026

https://github.com/raghavendranhp/airbnb-data-analysis

The Airbnb Data Analysis project focuses on analyzing Airbnb data using MongoDB Atlas, Python scripting, data preprocessing, visualization, and interactive geospatial insights. We delve into the world of property management and tourism to uncover trends, pricing variations, and location-based analysis.

eda jupyter-notebook mongodb numpy pandas powerbi preprocessing

Last synced: 08 Apr 2026

https://github.com/allenvox/neural

Workspace for Neural Networks class

jupyter-notebook neural-networks numpy python pytorch tensorflow

Last synced: 02 Jan 2026

https://github.com/mehwishferoz/data-analysis-with-python-zero-to-pandas

This repository contains the Python code and projects I created while learning the Data Analysis with Python: Zero to Pandas course. The course covers essential topics such as data cleaning, analysis, and visualization using powerful Python libraries like Pandas, NumPy, Matplotlib, and Seaborn.

exploratory-data-analysis jovian matplotlib numpy pandas seaborn

Last synced: 10 May 2026

https://github.com/athul64/tmdb-dataset-analysis

This dataset contains information about 10,000 movies extracted from TMDB, released from 1960 to 2015, including user ratings and revenue. Original data from Kaggle.

data-visualization dataframe eda numpy pandas python

Last synced: 14 Apr 2026

https://github.com/asut00/python-piscine_42ai

Python Bootcamp: A one-week intensive course with 42AI at École 42, covering Python fundamentals, data manipulation, and introductory AI concepts.

matplotlib numpy pandas python

Last synced: 07 May 2026

https://github.com/chdl17/marvelvsdc

This GitHub repository contains Python and Tableau analyses of Marvel vs. DC superheroes.

excel jupyter-notebook numpy pandas python tableau-public

Last synced: 05 May 2026

https://github.com/ahmedabdalkreem/airline_passenger_satisfaction

This machine learning project predicts airline passenger satisfaction: whether the service a customer received made them like the experience or not.

decision-trees gradient-boosting knn logistic-regression machine-learning matplotlib naive-bayes numpy pandas preprocessing python random-forest svm visualization

Last synced: 05 May 2026

https://github.com/cezlul/analyse-ventes-immobilier

ML solution for Paris real-estate analysis: automatic classification of apartments vs. commercial properties (K-means, 91%) and price prediction (regression, R²=0.98) on 26K transactions. Values a €169M portfolio with data-driven strategic recommendations.

data-analysis jupyter-notebook machine-learning matplotlib numpy pandas python sklearn

Last synced: 13 Apr 2026

https://github.com/lexiortiz/ibm-data-engineering-fundamentals

Notes, exercises, and projects from the IBM Data Engineering Fundamentals path via Verizon Skill Forward.

data-engineering numpy pandas postegresql python sql

Last synced: 13 Apr 2026

https://github.com/anushkundu/crime-pattern-analysis

Analyzing Crime Patterns in Montgomery County, USA: An Inclusive Study Based on NIBRS Data (2016-2022)

data-analysis data-visualization descriptive-statistics matplotlib numpy pandas python seaborn

Last synced: 05 May 2026

https://github.com/babagata/praktikum-4-data-analysis

Data analysis for the course "Physics laboratory IV"

matplotlib numpy pandas scipy

Last synced: 28 Apr 2026

https://github.com/chawthinn/car-price-prediction-regression-ml

This project predicts used car prices using regression models including Linear, Ridge, Random Forest, XGBoost, and LightGBM. It covers preprocessing, EDA, model evaluation, hyperparameter tuning, and model persistence using Scikit-learn and related libraries.

car-price-prediction lgbmregressor linearregression numpy pandas python scikit-learn xgbregressor

Last synced: 08 Apr 2026

https://github.com/danhenriquex/data_science_and_machine_learning

AI, Data Science, Machine Learning, and Deep Learning with TensorFlow

deep-learning deep-neural-networks machine-learning matplotlib numpy pandas tensorflow

Last synced: 13 Apr 2026

https://github.com/zuhairzia/customer-segmentation

Customer segmentation using KMeans clustering to analyze demographics, income, and spending. Helps businesses with targeted marketing and customer insights.

joblib matplotlib numpy pandas scikit-learn seaborn

Last synced: 05 May 2026

https://github.com/caefleury/cis-ieee-difusion-model

Test repository for code from the Diffusion Model project of the CIS-IEEE branch

docker docker-compose numpy python3 tensorflow

Last synced: 08 Apr 2026

https://github.com/debjyotisaha/deep-learning-projects-phase-1

Deep Learning Projects which demonstrate the use of Python and its complex ML algorithms

deep-learning machine-learning neural-network numpy opencv pandas python seaborn streamlit

Last synced: 13 Apr 2026

https://github.com/armahdavi/data_analytics_statistics_plotting_pm_airborne_sampling

All code for the data pipeline processing, statistical modelling, descriptive statistics, and plot visualizations from the airborne phase of Mahdavi et al. (2021) (Environmental Pollution). Project milestone: 2018-2021.

data-science data-visualization machine-learning matplotlib-pyplot numpy pandas python scikit-learn scipy-stats statistics

Last synced: 13 Apr 2026

https://github.com/hilarionengarejr/movie-recommender-app

Sentiment analysis on user reviews for movie recommendations using Content Based Filtering.

docker flask nltk numpy pandas python3 scikit-learn selenium

Last synced: 10 Apr 2026

https://github.com/pratyush905/farecast-nyc-taxifare-predictor

Machine learning models to predict NYC taxi fares based on a given dataset

jupiter-notebook kaggle machine-learning matplotlib numpy python regression-models scikit-learn

Last synced: 13 Apr 2026

https://github.com/dwija12903/mentorness-internship

Developed and applied technical skills in areas such as programming languages, data analysis, and machine learning methodologies.

matplotlib numpy pandas python scikit-learn

Last synced: 08 Apr 2026

https://github.com/dmarks84/coursework_project_text-mining-topic-modeling

Project for University of Michigan Applied Data Science Specialization -- Developed functions to score similarity between text passages.

data-modeling data-reporting data-visualization databases eda nlp numpy pandas python statistics text-mining

Last synced: 12 Apr 2026

https://github.com/pinedah/sleep-data-analysis-exercise

Analysis of a medical dataset on sleep, exploring duration, quality, and related factors. Includes data cleaning, EDA, and visualizations with Python (pandas, numpy, matplotlib, seaborn, scipy).

data-analysis data-science escom numpy pandas python school-project scipy

Last synced: 13 Apr 2026

https://github.com/caterinatasinato/python-projects

Projects I worked on as a Trainee in Data Analytics at ProfessionAI

matplotlib numpy pandas python3

Last synced: 13 Apr 2026

https://github.com/joiceo/python

Projects and exercises in Python

eda machine-learning numpy pandas python seaborn sklearn

Last synced: 13 Apr 2026

https://github.com/darshanpakhale250-gif/customer-churn-prediction-ml

A machine learning project to predict customer churn using regression and classification models including logistic regression, decision tree, and random forest. Performed EDA, visualizations, and model evaluation. The dataset is taken from Kaggle and implemented in Google Colab.

colab-notebook customer-churn-analysis data-science decision-trees kaggle logistic-regression machine ml numpy pandas python random-forest

Last synced: 13 Apr 2026

https://github.com/nishthasharma-22/binary-black-hole-merger-gravitational-waves-simulation

Repository of astronomy-related projects, including a gravitational-wave graph from a binary black hole merger

astrophysics blackhole-merger gravity-simulator matplotlib numpy python scipy

Last synced: 05 May 2026

https://github.com/blairnangle/python3-numpy-ta-lib

Docker image with NumPy and TA-Lib installed. Built and uploaded to Docker Hub using GitHub Actions.

docker numpy python3 ta-lib

Last synced: 13 Apr 2026

https://github.com/debjyotisaha/data-analytics-projects-phase-1

A set of projects on Data Analysis using Python and its libraries

matplotlib numpy pandas python seaborn sklearn streamlit

Last synced: 13 Apr 2026

https://github.com/isk-daniar/-contrast-improvements-on-pil

Contrast improvements on PIL

numpy pillow python

Last synced: 10 Jun 2026

https://github.com/mgitrov/coce

A deep learning-based project aiming to classify images out of 10 classes.

computer-vision convolutional-neural-networks deep-learning docker fastapi keras matplotlib numpy pillow regularization

Last synced: 06 Apr 2026

https://github.com/anujdutt9/language-model-using-rnn

Text Generation using basic RNN from scratch using Python and Numpy.

language-model numpy python rnn text-generation

Last synced: 08 May 2026

https://github.com/somyaaaaaa2004/travelling-sales-man-problem

This project is a soft computing-based solution to the Traveling Salesman Problem (TSP) using a Genetic Algorithm (GA), implemented entirely from scratch in Python on Google Colab. The goal of the TSP is to find the shortest possible route that visits each city exactly once and returns to the starting point. It's a classic NP-hard combinatorial

genetic-algorithm jupyter-notebook matplotlib-pyplot numpy pandas python travelling-salesman-problem

Last synced: 06 May 2026

https://github.com/harmanveer-2546/tweets-cleaning-with-python

Twitter is one of the most used data sources for data analysis. The reason is that it’s open and free to collect unless you subscribe to the paid version. Besides, it’s pretty simple to collect data from it.

filtering function-calling nlptk numpy os pandas punctuation python re removing-links tokenization tweet-cleaning twitter-data-analysis twitter-sentiment-analysis

Last synced: 10 Apr 2026

https://github.com/om-kanabar/sciencefair2025

This is my project for Chicago Public Schools' 2025 student science fair.

chicago-public-schools matplotlib-pyplot neural-networks numpy python science-fair tensorflow

Last synced: 03 Nov 2025

https://github.com/praju-1/data_science_projects

It contains the necessary code, datasets, and documentation to understand, replicate, and build upon the project's findings and methodologies.

data-science datasets deep-learning exploratory-data-analysis machine-learning matplotlib numpy pandas predictive-analytics python seaborn sklearn statistics visualization

Last synced: 08 Apr 2026

https://github.com/lingumd/school_district_analysis

Updated test score data and school district analysis using Python.

averages binning jupyterlab numpy pandas pandas-loc python

Last synced: 13 Apr 2026

https://github.com/csengupta1101/rock-paper-scissor-game

Rock Paper Scissors game built with Python 3, using Jupyter Notebook as the IDE. Code file and README included.

game if-else-statements numpy python python3 random

Last synced: 11 May 2026

https://github.com/anjasfedo/data-analysis

Repo to Explore Data Analysis

data-analysis numpy

Last synced: 13 Apr 2026

https://github.com/voidwalker7/mnist

MNIST Digit Detector Model using only Numpy

from-scratch-in-python mnist numpy python

Last synced: 05 May 2026

https://github.com/vyjayanthipolapragada/human_detection_hog

Detect humans in real time, in an image, or in a video, and count the detected people using OpenCV and the HOG descriptor

computer-vision easygui hog-descriptor human-detection image-processing imutils numpy opencv os python support-vector-machine

Last synced: 06 May 2026

https://github.com/abidhasanrafi/pharma-sales-analytics

A Streamlit-powered web application for analyzing pharmaceutical sales performance across teams, products, and territories.

matplotlib numpy pandas plotly sales-analysis scikit-learn seaborn streamlit

Last synced: 08 Apr 2026

https://github.com/pramodyasahan/cifar-10-object-recognition

This repository contains a TensorFlow-based image classification model designed to classify images from the CIFAR-10 dataset. The CIFAR-10 dataset comprises 60,000 32x32 color images in 10 classes, with 6,000 images per class.

convolutional-neural-networks numpy python tensorflow

Last synced: 05 May 2026

https://github.com/prowilliam/maze_generator

A maze generator with configurable dimensions that draws the maze's solution as a red route and exports the result in PNG format

numpy pillow python

Last synced: 28 Apr 2026

https://github.com/akansharajput280799/covid19-impact-analysis-usa

Data Analysis and Predictive Modeling to study COVID-19 impact across age groups, regions, and seasons in the USA.

classification-algorithm clustering-algorithm data-preprocessing data-visualization descriptive-statistics exploratory-data-analysis matplotlib numpy pandas seaborn

Last synced: 13 Apr 2026

https://github.com/alepm1990/nba-analytics

Analyses and projects in Python using sports databases

matplotlib-pyplot numpy pandas pyhon python3 seaborn

Last synced: 05 May 2026

https://github.com/paulo-santos-ds/machinelearninglogisticregression

Credit card fraud is a common form of fraud that affects millions of people around the world. This fraud can have serious consequences, both for the victims and for the financial institutions that issue the credit cards.

math matplot numpy pandas python3 sklearn

Last synced: 13 Apr 2026

https://github.com/khaja-shaik-21/heart-disease-prediction-system

This form allows users to enter key health details like age, blood pressure, cholesterol levels, and exercise results to predict the likelihood of heart disease. The data is submitted to the backend for processing, where a machine learning model provides a prediction. The form is styled for a clean and responsive user experience.

css3 flask-application git html5 logestic-regression numpy pandas python3 scikit-learn

Last synced: 12 Apr 2026

https://github.com/jessicahora/series_temporais

Time series analysis of Germany's energy consumption.

numpy pandas python time-series-analysis

Last synced: 07 Apr 2026

https://github.com/rohra-mehak/sciencesync

System for processing personalized Google Scholar alerts and managing their data, with ML-based clustering analysis

agglomerative-clustering clustering crossref-api customtkinter google-api google-scholar graph-api machine-learning numpy pandas python3 scientific-article-analysis scikit-learn sqlite3

Last synced: 05 May 2026

https://github.com/mayankmittal29/algovision-statistical_methods_in_ai

Implementation of various machine learning algorithms from scratch, including Linear Regression, K-Nearest Neighbors, Decision Trees, and K-Means clustering. Also includes EDA on the data and implementations of the LSH, IVF, and SLIC algorithms, with evaluation metrics.

decision-tree-classifier eda gradient-descent image-segmentation ivf knn-classification linear-regression lsh-implementation matplotlib-pyplot numpy pandas python3 seaborn sgd-optimizer sklearn slic-superpixel-algorithm

Last synced: 11 Apr 2026

https://github.com/bhanuprakashnani/scit-tasks

Tasks for the SCIT workshop.

numpy python3

Last synced: 25 Apr 2026

https://github.com/max134/deepplate

A custom-trained YOLOv11-based license plate detector with PaddleOCR for accurate ANPR.

computer-vision cv2 deep-learning image-recognition licesnse-plate-recognition numpy object-detection ocr paddleocr real-time-processing sqlite3 ultralytics yolov11

Last synced: 13 Jul 2025

https://github.com/tatmorenno/python_para_data_science

Python for Data Science: introduction to the language and NumPy 🌎💻

jupyter-notebook numpy python

Last synced: 13 Apr 2026

https://github.com/tnleite/loan-approval-prediction

This repository presents a predictive loan-approval model focused on minimizing default risk. Using EDA and machine learning algorithms (Random Forest, XGBoost), we tune the threshold to maximize recall on defaulters, contributing to efficient risk management.

classification-algorithm data-science exploratory-data-analysis machine-learning-algorithms machine-learning-models matplotlib numpy scikit-learn scipy seaborn xgboost-classifier

Last synced: 13 Apr 2026

https://github.com/abynxv/automatedexamsheetgrader-ocr

This Python project automates grading of handwritten or printed answer sheets using OCR (Optical Character Recognition). It extracts student answers from scanned images, compares them to an answer key, and generates a detailed report with class statistics.

numpy ocr-recognition opencv pandas python3 tesseract

Last synced: 27 Jun 2025

https://github.com/v41bh4vr4jput/python---numpy

A collection of hands-on examples, exercises, and projects to master NumPy — the fundamental package for numerical computing in Python. This repository is perfect for beginners and advanced learners looking to explore array manipulation, mathematical operations, and high-performance data analysis.

numpy python3

Last synced: 20 Apr 2026

https://github.com/yipjunkai/pyvolr

Modern Black-Scholes-Merton pricing, Greeks, and implied volatility for Python. Rust core. Drop-in replacement for the abandoned py_vollib.

black-scholes black-scholes-merton greeks implied-volatility maturin numpy options options-pricing py-vollib pyo3 python quant quantitative-finance rust

Last synced: 30 May 2026

https://github.com/aliakbar-omidi/ten-most-populous-cities-of-iran

This project shows the densely populated cities of Iran on a map

basemap matplotlib numpy pandas python

Last synced: 09 May 2026

https://github.com/jonad/quora_data_analysis

Exploratory data analysis and feature engineering on Quora dataset.

exploratory-data-analysis feature-extraction numpy pandas python3 quora-dataset

Last synced: 05 May 2026

https://github.com/abdelrahman-amen/neural_style_transfer

This project applies neural style transfer to create new images by merging the content of one image with the artistic style of another.

neuralstyletransfer numpy os python streamlit tensorflow

Last synced: 05 May 2026

https://github.com/hanzdr/2d-3d-object-transformation

For my Linear Algebra Project Defense

matplotlib-python matrix numpy

Last synced: 19 Apr 2026

https://github.com/nandinimarepalli/ai_ml_internship_projects

Projects completed during my AI/ML and Data Expert internship, including EDA, machine learning models, and dashboard development using Python, pandas, scikit-learn, and visualization libraries.

matplotlib numpy pandas python scikit-learn seaborn

Last synced: 05 May 2026

https://github.com/andriikot/smartmatch

Smart Match with python

docker faiss huggingface numpy ptyhon pythorch

Last synced: 04 Apr 2026

https://github.com/shivamkumar818/mbti-personality-prediction-from-text-data

MBTI Personality Prediction from Text Data: this project leverages machine learning to predict Myers-Briggs Type Indicator (MBTI) personality types based on textual data, specifically from social media posts.

confusion-matrix correlation-matrix data-visualization dataset knn-classification linear-regression logistic-regression modeltraining navebayes numpy pandas python

Last synced: 13 Apr 2026

https://github.com/amruta33/lead_scoring_case_study

An education company named X Education sells online courses to industry professionals. On any given day, many professionals who are interested in the courses land on their website and browse for courses.

bagging linear-regression numpy pandas python3

Last synced: 05 May 2026

https://github.com/shreyasdankhade/portfolio_optimatization_project

The Portfolio Optimization Project uses optimization techniques to balance risk and return, helping investors make efficient asset allocation decisions.

flask flask-application matplotlib numpy pandas pandas-python porfolio-optimization portfolio python

Last synced: 06 Apr 2026

https://github.com/hajaarh/malaria_hematie_cnn

Convolutional neural network algorithm

numpy pandas python resnet-50 sklearn-library tensorflow vgg16

Last synced: 13 Apr 2026

https://github.com/dummumounika/ecommerce-sales-categorization

This repository contains Python code for text classification and analysis of e-commerce sales data. The script processes textual descriptions of products and categorizes them into predefined categories using a Naive Bayes classifier. It also includes various analysis and visualization methods to explore the dataset.

machine-learning matplotlib-pyplot ntlk numpy pandas python scikit-learn

Last synced: 13 Apr 2026

https://github.com/pramodyasahan/spaceship-titanic

This repository features a machine learning model designed to predict whether passengers of a space travel company are likely to be transported. The model employs CatBoostClassifier, a machine learning algorithm known for handling categorical data effectively.

machine-learning numpy pandas python scikit-learn

Last synced: 13 Apr 2026

https://github.com/thekartikeyamishra/aipoweredmarketingassistant

AI-Powered Marketing Assistant is an advanced tool designed to enhance your digital marketing campaigns using machine learning (ML) and large language models (LLMs). This project empowers small businesses and MSMEs to create compelling content, analyze campaigns, and strategize effectively.

artificial-intelligence llm matplotlib numpy openai pandas python scikit-learn streamlit

Last synced: 08 Apr 2026

https://github.com/shwetapardhi/assignment-05-multiple-linear-regression-2

Prepare a prediction model for the profit of the 50_startups data. Apply transformations to get better predictions of profit and make a table containing the R^2 value for each prepared model. R&D Spend -- research and development spend in the past few years; Administration -- spend on administration in the past few years; Marketing Spend -- spend on Marketing in t

collinearity-diagnostics cooks-distance correlation-analysis eda heteroscedasticity homoscedasticity leverage-score multi-linear-regression numpy ols-regression p-value pair-plot python r-square-values regress-exog residual-analysis smf statsmodels vif

Last synced: 09 May 2026

https://github.com/vlad1343/tools

This repository demonstrates the acquisition of advanced Python programming skills through structured exercises and projects, with a strong emphasis on NumPy for array manipulation, slicing, masking, and reshaping

jupyter-notebook numpy numpy-library numpy-python pandas pandas-python python python3

Last synced: 08 Apr 2026

https://github.com/saraasgari99/videoclassificationusingdeeplearning

ML pipeline for biological video analysis using cutting-edge CNN architectures and industry-standard data processing and ML techniques

computer-vision data-processing keras-tensorflow machine-learning numpy sklearn tensorflow

Last synced: 13 Apr 2026

https://github.com/bropenguin847/python-learning

Learning Python for Scientific Programming. Intended for beginners with some knowledge of Python.

array-manipulations convolution jupyter matlab matplotlib-pyplot numpy pandas-python pep8 python python3 scientific-programming scipy spyder-ide vscode

Last synced: 13 Apr 2026

https://github.com/rajan-bhateja/data-analysis-with-python

Cleaning, analysis and visualization on various datasets using Python and its various libraries via Jupyter Notebook

data-cleaning data-science jupyter-notebook matplotlib numpy pandas python seaborn

Last synced: 30 Apr 2026

https://github.com/rdmolony/nix-numpy-financial

A nix derivation for numpy-financial - https://pypi.org/project/numpy-financial/#files

nix numpy python

Last synced: 05 Mar 2025