An open API service indexing awesome lists of open source software.

NumPy

NumPy is an open source library for the Python programming language, adding support for large, multidimensional arrays, and matrices, along with a large collection of high-level mathematical functions to operate on these arrays.

https://github.com/tuokri/rs2simlib

Python library for Rising Storm 2: Vietnam physics objects simulation and automated data collection.

numba numpy physics rising-storm-2-vietnam simulation unreal-engine-3

Last synced: 23 Feb 2026

https://github.com/ianhaggerty/final-capstone

This represents the final capstone project in my HyperionDev Data Science (fundamentals) course. A dataset of Amazon customer reviews is analysed using natural language processing.

amazon data-analytics data-science data-visualization dataset matplotlib nlp nlp-machine-learning numpy pandas plotly reviews seaborn spacy tabulate textblob wordcloud

Last synced: 19 Jan 2026

https://github.com/sa001gar/password-manager-on-the-top-of-streamlit

Secure Password Manager is a Python-based application developed by Sagar Kundu, designed to provide users with a secure and convenient way to manage their passwords. With features like user authentication, password generation, and secure storage using bcrypt and MongoDB, it ensures the safety of sensitive information while offering an intuitive use

bcrypt gui-application mongodb numpy numpy-library pandas-dataframe pandas-library pandas-python password-generator password-manager pymongo python python3 streamlit-application streamlit-webapp urllib

Last synced: 15 Apr 2026

https://github.com/rakesh3596/apple-phone-sales-analysis

Apple Phone Sales Analysis using Python tools like Pandas, Matplotlib, Numpy, Plotly, Apple csv file

csv-files csv-reader matplotlib numpy pandas plotly python

Last synced: 14 Apr 2026

https://github.com/oguzhansarigol/Expected-Goals-xG-Data-Analysis

We will analyze in our code the expected goal locations of football players based on which areas of the field, at what times, with which feet, using which parts of their bodies, and from which angles and distances they are most likely to score.

data-analysis-python expected-goals numpy pandas-python python

Last synced: 20 Aug 2025

https://github.com/thekartikeyamishra/colorextractor

A Python-based project that extracts the dominant colors from an image using the KMeans clustering algorithm. This project is useful for applications like image processing, color palette generation, or analyzing the most prominent colors in images.

kmeans-clustering matplotlib matplotlib-pyplot numpy p pillow python sklearn

Last synced: 05 May 2026

https://github.com/farhannirzhor/python_eda_project_on_diwali_sales_analysis

In this project, I was given a sales data. I analyzed their sales data and shared a summary

data-cleaning data-manipulation exploratory-data-analysis matplotlib numpy pandas python seaborn

Last synced: 04 May 2026

https://github.com/rscr1/summer_practice_mai_2

Решение всех заданий летней практики

matplotlib numpy pandas python scipy sympy

Last synced: 14 Apr 2026

https://github.com/gopikrsmscs/mycinepick-recommendation-system

MyCinePick is a user-specific movie recommendation system built using collaborative filtering techniques. This project aims to provide personalized movie suggestions tailored to each user's unique viewing preferences and habits.

collaborative-filtering movie-recommendation-app numpy python3 recommendation-system sklearn streamlit

Last synced: 14 Apr 2026

https://github.com/tnel7/data-science-yandex-practicum

Проекты из Яндекс Практикума "Специалист по Data Science"

data-science machine-learning nlp-machine-learning numpy pandas seaborn sklearn

Last synced: 19 Jan 2026

https://github.com/paulokarabyna/ml-foundations-day1

🔧 Set up a Python environment and practice linear algebra with NumPy while plotting vectors and reflections in this Day 1 ML foundations starter repository.

ai data-science education jupyter-notebook linear-algebra machine-learning matplotlib numpy terminal-mac tutorial visualization

Last synced: 18 May 2026

https://github.com/adi3042/diamond_price_prediction

🔮💎 Explore the Diamond Price Oracle! Predicting diamond values based on your preferences and market trends. Your ultimate tool for informed decisions in the diamond world. Join us for the future of pricing! 💎✨🚀 #DiamondPredictor

anaconda datetime diamond flask functools html ipykernel jupyter-notebook matplotlib numpy pandas price-prediction readme regressor sckiit-learn setuptools venv

Last synced: 14 Apr 2026

https://github.com/csengupta1101/career-survey---data-scientist

This repository holds an in depth analysis of Scrapped data from naukri.com. which showcases in in demand skills and opportunities in Data science across the subcontinent.

analysis career data-science matplotlib naukri numpy pandas python3

Last synced: 07 May 2026

https://github.com/drsartoriuss/portfolio

Учебные проекты из Яндекс Практикума "Специалист по Data Science+"

catboost computer-vision data-science deeplearning mashinelearning matplotlib nlp-machine-learning numpy pandas scipy seaborn sklearn sql torch tsf

Last synced: 28 Jan 2026

https://github.com/rtmigo/skifts_py

Search for the most relevant documents containing words from a query. Uses Scikit-learn and Numpy

cosine-similarity information-retrieval numpy python scikit-learn text-mining tf-idf

Last synced: 19 Aug 2025

https://github.com/niteshchawla/netflix-case-study

Analyze the data and generate insights that could help Netflix ijn deciding which type of shows/movies to produce and how they can grow the business in different countries

matplotlib numpy pandas python seaborn

Last synced: 09 Apr 2026

https://github.com/pedrolustosa/energyforecasting

This project offers a complete solution for solar energy generation analysis and forecasting, using different machine learning models and a modern, intuitive web interface.

artificial-intelligence axios bootstrap5 chartjs css data-science fastapi fontawesome html5 numpy pandas python r react react-bootstrap rpy2 sklearn spinner toastr typescript

Last synced: 04 Apr 2026

https://github.com/harmanveer-2546/covid-19-chances-detection

The main goal of this project is to build a machine learning model that, given a Covid-19 patient's current symptom, status, and medical history, will predict whether the patient is in high risk or not.

covid-19 covid-data logistic-regression matplotlib mlp-classifier numpy pandas python random-forest seaborn sklearn-library xgboost

Last synced: 10 Apr 2026

https://github.com/psgebeline/b2_plotter

A simple Python package for use by Belle II analysts to make various plots out of pandas dataframes and numpy arrays.

jupyter-notebook matplotlib matplotlib-pyplot numpy pandas pandas-dataframe python uproot visualization-tools

Last synced: 19 Jan 2026

https://github.com/venkat-a/python-eda-exploratory-data-analysis-

This project focuses on conducting exploratory data analysis (EDA) using Python. Utilizing powerful libraries such as Pandas for data manipulation and Plotly for interactive visualization, the project aims to uncover insights from datasets through comprehensive analysis and visualization techniques.

descriptive-statistics matplotlib numpy pandas seaborn statistical-analysis visualization

Last synced: 07 May 2026

https://github.com/quarkgluant/analyse-visualisation-data-with-python

French Udemy Course "Analyse et visualisation de données avec Python" on Numpy, Pandas and MatPlotLib Python's librairies

matplotlib-tutorial numpy numpy-tutorial pandas pandas-tutorial python python3 udemy

Last synced: 09 Apr 2026

https://github.com/akshay-kamath/anomaly_detection

This project addresses the challenge of identifying anomalies in sales data, aiming to optimize sales strategies by uncovering deviations from expected profit patterns.

anomaly anomaly-detection cblof flask isolation-forest knn matplotlib numpy pandas pyod python sklearn

Last synced: 19 Jan 2026

https://github.com/allanreda/automated-k-means-clustering-engine

An interactive K-Means clustering tool built with Flask and Scikit-Learn, supporting Excel file uploads, cluster analysis, and data export, deployed on Google Cloud Run via Docker with CI/CD integration.

cicd css data-visualization deployment docker flask google-cloud-run html javascript k-means-clustering machine-learning matplotlib numpy pandas python scikit-learn

Last synced: 19 Jan 2026

https://github.com/hajaarh/health_state_classifier

Ce projet vise à analyser un dataset contenant divers indicateurs de santé, bien-être et vie quotidienne afin de classifier les individus en trois catégories selon leur état de santé.

numpy pandas python seaborn sklearn

Last synced: 15 Apr 2026

https://github.com/alisonmitchell/titanic

Exploration of a subset of the Titanic passenger manifest to create a predictive classification model to determine which passengers were more likely to survive.

deep-learning keras machine-learning matplotlib numpy pandas python scikit-learn scipy seaborn tensorflow

Last synced: 14 Apr 2026

https://github.com/priyanshulathi/air-quality-index-prediction

Machine learning based air quality index prediction using environmental and pollutant data to classify and forecast pollution levels.

machine-learning matplotlib numpy pandas python scikit-learn seaborn

Last synced: 19 Jan 2026

https://github.com/adi3042/diamond-price-prediction

🔮💎 Explore the Diamond Price Oracle! Predicting diamond values based on your preferences and market trends. Your ultimate tool for informed decisions in the diamond world. Join us for the future of pricing! 💎✨🚀 #DiamondPredictor

anaconda datetime diamond flask functools html ipykernel jupytrnotebooks matplotlib numpy pandas price-prediction readme regression sckiit-learn setuptools venv

Last synced: 09 Apr 2026

https://github.com/massimilianovisintainer/rock-vs-mine-prediction

Rock vs. Mine classification using Logistic Regression

machine-learning model numpy pandas sklearn

Last synced: 21 Jan 2026

https://github.com/bharath-tars/deeptumor2.0

DeepTumor2.0 is a project aimed at developing a flask webapp with robust front end and highly trained model using tensorflow in the backend with several features like with automatic html template reporting. **NOTE: DeepTumor2.0 has been moved from Azure to On-Render to save costs**

css datetime docs flask github html javascript keras numpy onrender os pandas python sheety smtp tensorflow

Last synced: 21 Jan 2026

https://github.com/sweta2501/ml

Consist of ML projects based on Python along with DataSheets

machine-learning matplotlib-pyplot numpy pandas python seaborn sklearn sklearn-model

Last synced: 14 Apr 2026

https://github.com/ramchaik/cinebrain

CineBrain uses ML & NLP to analyze movies and recommend similar ones based on user preferences through cosine similarity.

cosine-similarity flask htmx kaggle machine-learning movie-recomendation-system nlp nltk numpy pandas sklearn tailwind tmdb tmdb-api unsupervised-learning vectorization

Last synced: 19 Jan 2026

https://github.com/omcodewizard/nasa-battery-analysis

This repository contains code for analyzing battery data from NASA's battery testing dataset. The analysis involves processing battery impedance, electrolyte resistance, and charge transfer resistance across charge/discharge cycles to track the aging and performance of various batteries.

battery-impedence nasa-battery-dataset numpy pandas plotly python

Last synced: 09 May 2026

https://github.com/soumyapro/parkinson-disease-prediction

This project predicts Parkinson's disease using machine learning models.

logistic-regression numpy pandas scikit-learn svc xgboost

Last synced: 19 Jan 2026

https://github.com/sudarshanc00/smishing

This project aims to classify text messages to detect potential smishing (SMS phishing) attacks. Using machine learning, the project provides a classifier that can differentiate between legitimate messages and smishing attempts, helping to prevent scams.

nltk numpy pandas python scikit-learn scipy

Last synced: 14 Apr 2026

https://github.com/charancholaveti/imdb-sentimentanalysis

IMDB Movie Review Sentiment Analysis-It will give the sentiment of the review whether it is Positive or Negative

deeplearning keras numpy rnn-tensorflow tensorflow

Last synced: 14 Apr 2026

https://github.com/liblaf/array-typing

🧮 Type-safe array operations for NumPy, JAX, and PyTorch

array code-quality documentation jax numpy python static-analysis torch type-hints typing

Last synced: 19 Aug 2025

https://github.com/avin27/credit-card-fraud-detection

A python script to analyse credit card data to detect fraudulent transactions

data-science numpy pandas sklearn

Last synced: 10 Apr 2026

https://github.com/cyberoctane29/epa-air-quality-aqi-analysis

This project involved analyzing air quality data from the EPA, focusing on the Air Quality Index (AQI). I used Python data structures like dictionaries and sets to manage and process the data, simulating real-world data analysis to assess pollution levels and their health implications.

data-analysis numpy pandas python statistics

Last synced: 10 Apr 2026

https://github.com/lorenzorottigni/ml-kyphosis

Machine Learning python bootcamp: decision trees & random forest on kyphosis dataset

decision-trees ipynb machine-learning numpy pandas python random-forest seaborn

Last synced: 14 Apr 2026

https://github.com/ricardorobledo/paymentcardfrauddetection2025

Comparative analysis of probabilistic classification models for credit card fraud detection, focusing on model calibration and threshold optimization in highly imbalanced datasets.

imbalanced-learn matplotlib numpy pandas python3 scikit-learn search

Last synced: 14 Apr 2026

https://github.com/chaakshay/heartdrive

A Streamlet-based tool that analyzes cardiovascular health data, predicts population risk using ML, and suggests targeted government actions like awareness campaigns, health checkups, and policy changes.

csv mathplotlib numpy pandas pandas-dataframe python scikit-learn seaborn streamlit

Last synced: 05 Apr 2026

https://github.com/adarshpheonix2810/fake-job-post-detection

This project focuses on detecting fake job posts using machine learning. Fake job advertisements are often created to scam individuals by stealing personal information or money.

data-analysis deep-learning joblib machine-learning nlp-machine-learning numpy pandas python scikit-learn tkinter

Last synced: 12 Apr 2026

https://github.com/kr1shnasomani/DeepDetect

Traffic light, vehicle and human detection from aerial images using YOLOv8 model and Computer Vision

computer-vision deep-learning neural-network numpy opencv yolov8

Last synced: 03 Sep 2025

https://github.com/waghraj1699/pokemon-predicton

Predict which type of pokemon it is using CNN

data-science deep-learning keras numpy python tensorflow utils

Last synced: 05 Apr 2026

https://github.com/elhanarinc/deeplearning

Ceng 783 Deep Learning Assignments

ipython-notebook jupyter-notebook numpy python2

Last synced: 06 May 2026

https://github.com/id-andyyy/alfahack

📈💰 Investment propensity prediction model

catboost hackathon-project jupyter lightgbm numpy optuna pandas python scikit-learn

Last synced: 12 Apr 2026

https://github.com/amandeep-gupta19/chatbot

Created a custom chatbot using Langchain. Here's a summary of what I did: Data Extraction: I gathered data about technical courses from the Brainlox website using Langchain’s URL loaders. Embedding Creation & Storage: I converted this data into embeddings and stored it in a vector store for efficient searching. API Development: I built a Flask

data-extraction faiss-vector-database flask-restful langchain numpy scikit-learn vector-database webbaseloader

Last synced: 09 Apr 2026

https://github.com/tebogoyungmercykay/mathematical_modelling_in_python

Python is an interpreted, object-oriented, high-level programming language with dynamic semantics. Its high-level built in data structures, combined with dynamic typing and dynamic binding, make it very attractive for Rapid Application Development, as well as for use as a scripting or glue language to connect existing components together. Python's simple, easy to learn syntax emphasizes readability and therefore reduces the cost of program maintenance. Python supports modules and packages, which encourages program modularity and code reuse. The Python interpreter and the extensive standard library are available in source or binary form without charge for all major platforms, and can be freely distributed.

clock counter differential-equations equations-and-plotting growth-factor is-sorted lamda-expressions linearization logistic-difference-equations mathplotlib numpy pell-recursive polyfit python recursion rieman-sums sympy

Last synced: 14 Mar 2025

https://github.com/ksganni/diabetes-prediction-using-machine-learning

🔬 A machine learning project that predicts diabetes likelihood using patient health metrics and multiple classification algorithms.

jupyter-notebook matplotlib numpy pandas python scikitlearn-machine-learning seaborn

Last synced: 28 Apr 2026

https://github.com/hairymax/python-for-science-lecture-notes

Tutorial material on the scientific Python ecosystem in Russian

matplotlib numpy python scipy sympy

Last synced: 12 Apr 2026

https://github.com/shahsuvarli/election-voters-data-analysis-pandas

Educational project analyzing Azerbaijan voter demographics with pandas, focusing on data cleaning, grouping, and visualization.

cleaning data grouping matplotlib numpy pandas python visualization

Last synced: 12 Apr 2026

https://github.com/amruta33/customer-support-on-twitter

The Customer Support on Twitter dataset is a large, modern corpus of tweets and replies to aid innovation in natural language understanding and conversational models, and for study of modern customer support practices and impact.

nlp numpy pandas python regex sklearn

Last synced: 12 Apr 2026

https://github.com/winterwind/housingpricesproject

A two-part project involving making predictions using various regressors and then implementing linear regression from scratch and predicting that way

csv csv-files data-science decision-tree gradient-descent jupyter jupyter-notebook knearest-neighbors knn linear-regression linear-regression-scratch machine-learning matplotlib matplotlib-pyplot numpy pandas python random-forest regression sklearn

Last synced: 12 Apr 2026

https://github.com/alwaysvivek/next-word-prediction

🔮 Predicts the next word in a text sequence using either an N-gram statistical model or an LSTM-based neural network.

argparse laplace-smoothing machine-learning neural-network ngrams nlp nltk numpy python3 tensorflow

Last synced: 12 Apr 2026

https://github.com/khushi130404/echobot

EchoBot is a chatbot built using TensorFlow to classify user inputs into predefined intents and respond accordingly. It uses a neural network trained with the data in intents.json.

keras nlp nltk numpy pickle tensorflow

Last synced: 12 Apr 2026

https://github.com/jvictor011/analise_de_dados_pokemon

Este projeto se concentra na análise dos dados da Pokedex, que incluem informações sobre vários Pokémon. O código em Python realiza várias etapas de análise exploratória de dados (AED) e gera visualizações para entender melhor as características dos Pokémon.

matplotlib numpy pandas python scipy

Last synced: 07 May 2025

https://github.com/gurpreet0022/unveiling-pcos

Data Driven approach to get insights about PCOS

analysis eda insights matplotlib numpy pandas python3 scipy-stats seaborn visualisation

Last synced: 12 Apr 2026

https://github.com/mark1708/neurointerfaces-of-information-systems

Laboratory work on the discipline "Neurointerfaces of information systems"

numpy pandas python scikit-learn

Last synced: 12 Apr 2026

https://github.com/3rd-son/movie-streaming-service-analysis

Exploratory Data Analysis of the Streaming Services like Neflix, Hulu, Disney+ etc

data-analysis exploratory-data-analysis jupyter-notebook matplotlib numpy pandas plotly python seaborn

Last synced: 18 Apr 2026

https://github.com/offthetab/matcha_code_generation

Fine-tunning MatCha model for better python matplot lib code generation by given chart.

fine-tuning huggingface matcha matplotlib numpy pandas pil sklearn torch

Last synced: 20 Jan 2026

https://github.com/webintellectual/coloured-image-to-b-w-converter-web-app

A Streamlit web application to convert a coloured image to black and white image.

numpy opencv pil pillow python3 streamlit streamlit-webapp

Last synced: 12 Apr 2026

https://github.com/dhruvrajsinhzala24/house-price-predictor-ml-model

A machine learning project to predict house prices using linear regression and random forest models. The project leverages housing data to train and evaluate models, comparing their performance using metrics like MSE and R². Visualizes actual vs. predicted prices to assess model accuracy.

linear-regression machine-learning matplotlib ml numpy pandas python random-forest-regression sklearn

Last synced: 14 Apr 2026

https://github.com/mariam-zaidi/credit_risk_analytics

Performing Exploratory Data analysis for loan application approvals and understanding Risk analytics

exploratory-data-analysis matplotlib numpy pandas risk-analytics seaborn

Last synced: 06 May 2026

https://github.com/nicolas-giacomelli/modelo-polinomial-api-streamlit

Codigo focado em treinamento de modelo de previsão de salario por tempo de empresa e nivel dentro da empresa disponibilizado por api e usando front-end streamlit para calcular basta inserir tempo de empresa e nivel e calcular o resultado de salario compativel

api fastapi joblib machine-learning matplotlib numpy pandas pingouin pydantic scikit-learn seaborn streamlit uvicorn

Last synced: 12 Apr 2026

https://github.com/marvin-vw/3dengine_python

3D engine written in Python using OpenCV and NumPy

3dengine numpy opencv python

Last synced: 04 Feb 2026

https://github.com/nazwright/daria

Real-time fraud detection architecture powered by AWS Kinesis, KaggleHub, and SMOTE-augmented data — the foundation of DARIA, the Detection And Risk-Intelligence Agent.

aws evm fraud fraud-detection-using-machine-learning kaggle kinesis machine-learning math numpy pandas python random web3

Last synced: 12 Apr 2026

https://github.com/ricky9667/taiwanoceantrashpredict

A very simple DNN project using Keras.

dnn keras numpy python

Last synced: 14 Apr 2026

https://github.com/allenvox/statistics

Workspace for applied problems of probability theory & mathematical statistics & modelling class

acf autocorrelation distribution k-means k-means-clustering maths matplotlib numpy probability-theory python sklearn statistics

Last synced: 17 Apr 2026

https://github.com/manuelfay/numpydeeplearning

Deep Learning Framework built entirely using numpy

deep-learning deep-learning-framework minigrad numpy pytorch

Last synced: 06 May 2026

https://github.com/sadegh15khedry/creditcard-transaction-anomaly-detection-using-isolationforest

detecting fraudulent transactions in credit card data using anomaly detection techniques.

anomaly-detection matplotlib numpy pandas python seaborn sklearn

Last synced: 27 Feb 2025

https://github.com/nisch-mhrzn/book-recommender-system

A Flask-based application that suggests books tailored to user preferences, leveraging collaborative filtering and a curated dataset.

flask numpy pandas python

Last synced: 12 Apr 2026

https://github.com/iadtya/hate-text-classification

Training RNN(LSTM) on Hate speech with tensorflow and keras

fastapi keras lstm nltk numpy pandas tenserflow

Last synced: 12 Apr 2026

https://github.com/fimbres/emotion-recognition

This is an AI project developed in Python to recognize emotions in people.

ai kaggle-dataset keras numpy opencv python tensorflow

Last synced: 12 Apr 2026