An open API service indexing awesome lists of open source software.

NumPy

NumPy is an open source library for the Python programming language, adding support for large, multidimensional arrays, and matrices, along with a large collection of high-level mathematical functions to operate on these arrays.

https://github.com/terrelbrinkley/python-projects

Python Scripts & Data Analyst Projects

aws numpy pandas python selenium

Last synced: 10 Apr 2026

https://github.com/vedikasnehil/my-data-science-projects

This repository is a comprehensive collection of resources and implementations dedicated to the field of Data Science. It serves as a platform for exploring various aspects of data science, ranging from data preprocessing and exploratory data analysis (EDA) to machine learning and deep learning.

data data-science deep-learning machine-learning matplotlib numpy python sql visualization

Last synced: 10 Apr 2026

https://github.com/yulia-momotyuk/dla-data-analysis-practice

This repository contains my homework assignments completed during the "Data Analyst in IT" course at Data Loves Academy.

analytics data-analysis data-visualization excel mysql numpy pandas postgres powerbi python seaborn sql tableau

Last synced: 14 Apr 2026

https://github.com/muhammadshavaiz/hand-sketch-recognition--inceptionv3

The Hand Drawn Sketch Classification project uses PyTorch to classify hand-drawn sketches. It evaluates model Inception_v3, with Inception_v3 achieving the highest accuracy of 57%. The repository features scripts for dataset management, model training, and evaluation.

inception-v3 matplotlib numpy pandas python pytorch

Last synced: 14 Apr 2026

https://github.com/kylehperez/mushroomnet

This API is for research purposes ONLY and is NOT to be used for food-safety or medical advice. Mushroomnet is a machine learning neural network for classifying mushrooms as poisonous or edible. The model was trained on data from uni.edu: over 5,000 instances of mushrooms, with 22 qualitative features and a determination of poisonous or edible.

artificial-intelligence botany flask-api machine-learning mycology neural-network numpy python3 pytorch scikit-learn

Last synced: 09 Apr 2026

https://github.com/sun-data/ndoptimize

Numba-accelerated minimization of objective functions.

astropy minimization numba numpy optimization scientific-computing

Last synced: 27 Jan 2026

https://github.com/juzershakir/predicting_boston_housing_prices

Builded a model to predict the value of a given house in the Boston real estate market using various statistical analysis tools. Identified the best price that a client can sell their house utilizing machine learning.

bias-variance boston-housing-price-prediction data-exploration decision-tree-regression gridsearchcv k-fold machine-learning matplotlib mlfnd model-evaluation model-validation numpy pandas python3 r2-score sklearn supervised-learning udacity-nanodegree

Last synced: 22 Oct 2025

https://github.com/zotho/grav_euler

Euler gravity compute

gravity numpy python

Last synced: 06 May 2026

https://github.com/chirindaopensource/search_benford_law_compatibility

End-to-End Python scalable forensic accounting toolkit implementing Benford's Law analysis for FTSE financial data. Delivers automated anomaly detection with Chi-Squared/MAD testing, comprehensive validation pipelines, and risk-based prioritization of investigative resources. Replicates Ausloos et al.'s (2025) methodology with full reproducibility.

academic-research anomaly-detection benfords-law chi-squared-test data-validation econometrics financial-analysis financial-data forensic-accounting fraud-detection ftse goodness-of-fit jupyter-notebook numpy pandas python reproducible-research risk-management scipy statistical-testing

Last synced: 12 Apr 2026

https://github.com/bilgenurbekar/turkishcyberbullying

Contains fine-tuned BERT models and results in the text classification category using Turkish social media data

bert-fine-tuning huggingface-transformers matplotlib numpy pandas python pytorch scikit-learn transformers

Last synced: 07 Mar 2026

https://github.com/sukanyadutta52/topic_modeling

What are the most pressing concerns regarding ‘Climate Change’ among tweeters according to Topic Modeling?

climate-change gensim matplotlib nltk numpy pandas pyldavis regular-expression spacy

Last synced: 09 Apr 2026

https://github.com/priyanshulathi/cancer-diagnosis-prediction-model

A Machine Learning project to predict cancer malignancy using K-Nearest Neighbor, Support Vector Machine, and Decision Tree algorithms.

machine-learning numpy pandas python scikit-learn

Last synced: 03 Jan 2026

https://github.com/amruta33/melanoma-detection

To build a CNN based model which can accurately detect melanoma. Melanoma is a type of cancer that can be deadly if not detected early. It accounts for 75% of skin cancer deaths. A solution that can evaluate images and alert dermatologists about the presence of melanoma has the potential to reduce a lot of manual effort needed in diagnosis.

keras machine-learning numpy pandas python3 sckiit-learn tensorflow

Last synced: 09 Apr 2026

https://github.com/apbetioli/neural-network

A simple neural network implementation

machine-learning neural-network numpy python3

Last synced: 29 Apr 2026

https://github.com/abdullahashfaqvirk/SMS-Spam-Detection

A machine learning application designed to classify SMS messages as spam or non-spam, offering real-time analysis to identify potentially harmful content.

css3 docker flask html5 javascript matplotlib nltk numpy pandas python scikit-learn seaborn tailwindcss xgboost

Last synced: 16 Aug 2025

https://github.com/tnleite/real-estate-opportunities-analysis

Este repositório apresenta uma análise de oportunidades no mercado imobiliário, combinando séries temporais, clusterização e previsões para identificar estados com maior potencial de crescimento e orientar estratégias de expansão eficientes.

catboostregressor cluster-analysis data-science kmeans-clustering lightgbm-regressor machine-learning-algorithms numpy regression-models scikit-learn xgboost-regression

Last synced: 10 May 2026

https://github.com/hellymodikalpesh/numpy

This repository is designed as a comprehensive resource for learning the NumPy library. It covers nearly all functions and features of NumPy, providing a thorough exploration of its capabilities for numerical computing and data analysis.

jupyter-notebook numpy

Last synced: 28 Jan 2026

https://github.com/hafaio/homeotopy

A python library for computing homeomorphisms between some common continuous spaces

homeomorphism numpy

Last synced: 24 Feb 2026

https://github.com/baranwaljii/data-science-projects

This Repository contains projects related to the domain of data science based on python and jupyter notebook

data-science jupyter-notebook machine-learning numpy pandas python

Last synced: 09 Apr 2026

https://github.com/hajdu-patrik/nlp_music-mood-classifier_bme_hf

This project is a Python-based NLP system that analyzes song lyrics to determine their emotional profile and recommends similar songs.

nlp nlp-machine-learning numpy pandas python

Last synced: 10 Nov 2025

https://github.com/aleksibovellan/ai-2048-game

AI Heuristics in Python for solving the 2048 Game

2048 2048-game 2048-solver ai expectimax game heuristics numba numpy python python3

Last synced: 06 May 2026

https://github.com/pushtogithub23/projectilemotion-simulation

This Jupyter notebook explores projectile motion through mathematics, visualizations, and interactive simulations.

jupyter-notebook matplotlib-pyplot numpy physics-simulation projectile-motion-simulation

Last synced: 28 Jan 2026

https://github.com/prithvi45/machine-learning-prerequisites

Basics to Advance concepts on Numpy, Pandas, Matplotlib

matplotlib numpy pandas python

Last synced: 09 Apr 2026

https://github.com/sanchariii/health-assist-1

Health Assist web app uses the Streamlit framework to help users identify potential diseases. Simply input your symptoms, and the system will predict whether you have diabetes, heart disease, or Parkinson's disease. The system is designed to be quick and convenient, promoting early detection and timely medical intervention.

numpy pandas pickle streamlit-webapp svm-model

Last synced: 13 Aug 2025

https://github.com/chandkund/pytorch

Foundational introduction to PyTorch, focusing on the basics of tensors, their creation, manipulation, and operations, which are essential for understanding and building deep learning models

classification computer-vision cuda-programming deep-learning loss-functions matplotlib numpy optimization pandas pyhton pytroch workflow

Last synced: 07 May 2026

https://github.com/muhkartal/e-forecast

machine learning-powered energy consumption prediction system that analyzes historical data to forecast future energy usage trends, optimizing efficiency and sustainability.

fastapi joblib matplotlib numpy pandas pydantic pytest sckit-learn seaborn tensorflow tqdm uvicorn xgboost yaml

Last synced: 08 Apr 2026

https://github.com/shwetapardhi/assignment-03-q5--hypothesis-testing

Chi2 contengency independence test Q5. Fantaloons Sales managers commented that % of males versus females walking in to the store differ based on day of the week. Analyze the data and determine whether there is evidence at 5 % significance level to support this hypothesis. Assume Null Hypothesis as Ho: Independence of categorical variables (% of

alternate-hypothesis chi2-contingency contingency-analysis hypothesis-testing null-hypothesis numpy pandas python scipy significance-testing stats

Last synced: 28 Jan 2026

https://github.com/mdarshad1000/ml-libraries

Code for the scientific and visualization libraries in Python for Machine Learning.

matplotlib numpy pandas seaborn

Last synced: 09 Apr 2026

https://github.com/matheusafonseca/c213-trabalho-1

Repository dedicated to storing and managing the first assignment for C213 - embedded systems.

matplotlib numpy pid-controller python scypi streamlit

Last synced: 29 Jan 2026

https://github.com/adityabadgotiya/facial-recognition-attendance-system

This Attendance System that uses facial recognition to automatically mark student attendance. It features admin and student logins, real-time face detection, attendance tracking, email reporting, and secure password management. Built with Python (Tkinter, OpenCV, and Pillow), it replaces manual attendance with automated facial verification.

hashlib numpy opencv pandas pillow python sha-256-hash smtplib tkinter-gui

Last synced: 12 Aug 2025

https://github.com/suhas-005/jovian-data-analysis-course-assignment

These are my assignments for Data Analysis : Zero to Pandas course by Jovian.ai

data-analysis data-analytics numpy pandas python

Last synced: 07 May 2026

https://github.com/ljadhav25/decision-tree-random-forest-algorithm-data-science-

This repository contains an implementation of decision tree and random forest algorithms from scratch in Python. Decision trees and random forests are popular machine learning algorithms used for classification and regression tasks. The goal of this project is to provide a clear and understandable implementation of these algorithms

data-analysis data-science decision-trees machine-learning-algorithms matplotlib numpy pandas python random-forest-classifier

Last synced: 15 Apr 2026

https://github.com/tamk-kol/project_orbital_data_analysis

The goal of this project is to develop an automatic method to detect orbital maneuvers using machine learning.

matplotlib numpy pandas scikit-learn

Last synced: 30 Jan 2026

https://github.com/ayuub34/siesta-smart

This project is designed to be a AIaaS 2-Way-PMS optimized with AI and ML models. The goal is to enhance the efficiency of the system through advanced technology solutions.

ai data-analysis-python data-processing data-science jupyter-python machine-learning matplotlib numpy pandas powerbi-report pyqt5 scipy seaborn sklearn

Last synced: 25 Feb 2026

https://github.com/miss-mhv/retail_strategy_and_analytics

The purpose of this work is to analyze data from a supermarket to better understand the types of customers who buy chips and their buying behavior in the area.

jupyter-notebook numpy pandas python3

Last synced: 07 May 2026

https://github.com/keyurparalkar/breast-cancer-detection

Predict whether the cancer is benign or malignant

gradient-descent logistic-regression machine-learning numpy

Last synced: 26 Apr 2026

https://github.com/thkox/uni_2023-compression-and-object-removal

This project is a team assignment for the "Multimedia Systems" course, offered in the 6th semester of the 2023 academic year at the University of Piraeus, Department of Informatics.

2023 6th-semester colorama compression-algorithm department-of-informatics heapq lossless-compression-algorithm motion-compensation multimedia-systems numpy object-removal opencv-python pickle python scipy-stats university-of-piraeus

Last synced: 07 Feb 2026

https://github.com/jonad/chicago_crime_analysis

Exploratory data analysis on Chicago Crime dataset.

jupyter-notebook matplotlib numpy pandas python3

Last synced: 07 May 2026

https://github.com/manu-karenite/medical-insurance-cost-predictor

Medical Insurance Cost Generator is a Linear Regression based Predictor which is used to estimate and predict the Cost a person has to pay while Buying a Medical Insurance.

kaggle-dataset linear-regression machine-learning matplotlib numpy pandas python3 reactjs scikit-learn

Last synced: 15 Apr 2026

https://github.com/s-hirata0831/graphicsprocessing

Pythonによる画像処理基礎実験(with OpenCV)

numpy opencv python

Last synced: 07 May 2026

https://github.com/sarmad426/ai

AI basic to advanced featuring Machine Learning, Deep Learning and Data Science.

ai data-science deep-learning hugging-face machine-learning numpy pandas python scikit-learn

Last synced: 15 Apr 2026

https://github.com/amruta33/credit_card_analysis

The loan providing companies find it hard to give loans to the people due to their insufficient or non-existent credit history. Because of that, some consumers use it as their advantage by becoming a defaulter.

numpy pandas python3

Last synced: 15 Apr 2026

https://github.com/nmerka/practicum_rep

Проекты курса Аналитик данных (Яндекс.Практикум)

jupyter-notebook matplotlib numpy pandas plotly python scipy seaborn sql tableau

Last synced: 31 Jan 2026

https://github.com/farha-dahman/investigate_dataset

Investigate [No-show appointments] Dataset

data-analyst jupyter-notebook matplotlib numpy pandas python

Last synced: 15 Apr 2026

https://github.com/singhvishal003/sales-data-analysis

Data Analysis according to sale data.

matplotlib numpy pandas seborn

Last synced: 15 Apr 2026

https://github.com/ashithapallath/comparison-of-ssd-sad-in-motion-estimation-of-video-coding

Course assignment for Algorithm and Massive Datasets comparing SAD and SSD for motion estimation. Includes analysis of accuracy, speed (Python/NumPy), and error sensitivity. Highlights trade-offs for speed-critical vs. precision-focused applications.

l1-normalization l2-norm numpy pytho video-cod

Last synced: 04 Apr 2025

https://github.com/samiyaalizaidi/nn-ml-homeworks

Homework solutions for CPE-4903: Neural Networks & Machine Learning at Kennesaw State University.

machine-learning machine-learning-workflow neural-networks numpy scikit-learn

Last synced: 15 Apr 2026

https://github.com/martincastroalvarez/python-video-processing

Movie generator using MoviePy, OpenCV2 and Numpy.

moviepy numpy opencv

Last synced: 15 Apr 2026

https://github.com/arv-anshul/campusx-graphs

Contains all the graphs provided by CampusX to learn Statistics and Algebra with some added feature.

algebra campusx-dsmp data-science matrix numpy statistics streamlit vizualisation

Last synced: 15 Apr 2026

https://github.com/salvatoreamaddio/powerbi

This project combines both Python and PowerBI technologies. It cleans a dataset and imports it into a PowerBI file.

numpy pandas pandas-python powerbi python

Last synced: 07 May 2026

https://github.com/rissh/titanicsurvivalpredictionusingml

Predicting Titanic passenger survival through machine learning. This project includes data preprocessing, exploratory data analysis, feature engineering, and model training using Python. 🚢

data data-analysis data-science data-visualization dataanalysis jupiter-notebook machine-learning machine-learning-algorithms machinelearning matplotlib numpy pandas prediction prediction-model python python3 seaborn tenserflow tflearn titanic

Last synced: 01 Feb 2026

https://github.com/davityak03/english-audio-to-hindi-text-using-pretrained-model

The primary goal of this project is to transcribe English audio and then translate the transcribed text into Hindi.

adam-weight-decay css flask html nlp numpy python tensorflow tokenizer transformers translation

Last synced: 26 Feb 2026

https://github.com/magzybog12/real-time-object-distance-measurement-using-opencv-python

Using an A4 paper as a guide, performing the distance measurements to find the height and width of objects placed in the region

numpy opencv-python pycharm-ide

Last synced: 05 May 2026

https://github.com/seymencanates/pythonn

Here is my all personal educations , notes , basic apps I did. Here will be my backside for all supportive information about python I am gonna need.

django flask numpy pandas python3 scikitlearn-machine-learning tensorflow tkinter

Last synced: 08 Apr 2026

https://github.com/harshavardhanbommalata/knn_tutorial

K-Nearest Neighbors (KNN) is a simple yet powerful machine learning algorithm. Unlike models that learn parameters during training, KNN uses lazy learning—it stores the dataset and predicts by finding the closest neighbors, making decisions through majority voting.

machine-learning-algorithms mathematical-modelling matplotlib numpy pandas

Last synced: 20 Jun 2026

https://github.com/paulomppatricio/projeto_challenge_telecomx-br_parte-2

Projeto Challenge TelecomX-BR_Parte-2 - Formação Data Science do programa ONE - Oracle Next Education em parceria com a Alura.

data-science imblearn joblib machine-learning matplotlib modelos-preditivos numpy pandas python scipy seaborn sklearn statsmodels xgboost

Last synced: 12 Apr 2026

https://github.com/mdaltamashalam/uber-fare-prediction-models

Predicts the fare amount of Uber rides based on various factors such as pickup/drop-off coordinates, passenger count, and trip distance.

catboost data-analysis data-cleaning data-visualization lgbm-regressor machine-learning matplotlib numpy pandas python random-forest regression-models skit-learn xgboost-algorithm

Last synced: 26 Feb 2026

https://github.com/smahala02/materials-science-introduction

Introduction to Materials Science concepts using Python for array manipulation and visualization with NumPy and Matplotlib.

data-visualization materials-science matplotlib numpy python scientific-computing

Last synced: 09 Feb 2026

https://github.com/rampal-punia/data-science-toolkit

Your Go-To Resource for Essential Data Science Related Commands, Concepts, Quick Overviews and Useful Functions.

artificial-intelligence data-science keras machine-learning matplotlib nlp nlp-machine-learning numpy pandas pythorch sql tensorflow

Last synced: 09 Feb 2026

https://github.com/hansalemaos/all_against_all

Each item of a list against all others

list numpy

Last synced: 02 Apr 2026

https://github.com/sahilk12nayak/data-capstone-projects-using-python-libraries

In this data project we will focus on exploratory data analysis. This project utilizes Python libraries such as NumPy, Pandas, Matplotlib, and Seaborn for data manipulation, analysis, and visualization.

matplotlib numpy pandas seaborn

Last synced: 15 Apr 2026

https://github.com/murugavl/forage_tata_genai_powered_data_analytics

This repository showcases my work from the TATA GenAI Powered Data Analytics Virtual Experience on Forage, where I applied data cleaning, analysis, and visualization techniques to derive actionable business insights.

excel genai matplotlib numpy pandas powerbi python

Last synced: 09 Feb 2026

https://github.com/carterbox/libimage

Provides large (2k) test images as NumPy arrays.

images numpy python

Last synced: 15 Apr 2026

https://github.com/harmanveer-2546/statistics-for-machine-learning

Statistical tools help you clean and organize your data. You can identify outliers, manage missing values, and ensure your data is in a format that the ML algorithms can understand.

inline matplotlib matplotlib-styles numpy pandas probability python seaborn statistics

Last synced: 18 Apr 2026

https://github.com/ifimust/room_generator

Room generator service for 2D game levels

flask google-app-engine microservice numpy procedural-generation python

Last synced: 07 May 2026