An open API service indexing awesome lists of open source software.

NumPy

NumPy is an open source library for the Python programming language, adding support for large, multidimensional arrays and matrices, along with a large collection of high-level mathematical functions to operate on these arrays.
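
As a rough illustration of the description above, the sketch below builds a small two-dimensional array and applies a few of NumPy's array-level operations. The variable names and the sample values are made up for this example and do not come from any of the repositories listed here.

```python
import numpy as np

# A 2-D array (matrix) with 2 rows and 3 columns; the values are illustrative.
grid = np.array([[1.0, 2.0, 3.0],
                 [4.0, 5.0, 6.0]])

# Element-wise arithmetic applies to the whole array without an explicit loop.
doubled = grid * 2

# Reductions can collapse one axis: here, the mean of each column.
column_means = grid.mean(axis=0)

# Matrix product of the 2x3 array with its 3x2 transpose gives a 2x2 result.
gram = grid @ grid.T

print(grid.shape)
print(doubled)
print(column_means)
print(gram)
```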

https://github.com/ronitjariwala/-prodigy_ds_01

Prodigy InfoTech Data Science Internship Task-1

matplotlib numpy pandas python seaborn

Last synced: 04 May 2026

https://github.com/swarup-ranjan-mondal/chat-analyzer

A Streamlit app to analyze WhatsApp personal and group chats.

matplotlib numpy pandas python seaborn streamlit

Last synced: 04 May 2026

https://github.com/rtgrt5645/numpy-lab

🧮 Explore, manipulate, and visualize data with NumPy to enhance your Python skills in scientific computing and data analysis.

array-operations data-analysis data-science jupyter-notebook machine-learning numerical-computing numpy numpy-arrays numpy-library numpy-python python python3 scientific-computing

Last synced: 04 May 2026

https://github.com/jakubfr4czek/apartment-prices-analysis

This repository contains an analysis of apartment prices in Krakow. It was made for the "Rachunek prawdopodobieństwa i statystyka" (probability and statistics) course I took at AGH University of Science.

agh agh-wi analysis csv jupyter jupyter-notebook krakow numpy pandas price-analysis price-prediction probability probability-statistics python statistics statistics-learning

Last synced: 04 May 2026

https://github.com/abhinav330/911-emergency-calls-analysis

This Python Notebook analyzes emergency call data from the '911.csv' dataset. It uses various data visualization techniques to explore and gain insights into the emergency call data, including the types of calls, reasons for calls, and call patterns over time.

data-analysis data-science data-visualization eda exploratory-data-analysis exploratory-data-visualizations numpy pandas python

Last synced: 09 Jun 2026

https://github.com/hanzopgp/randomkagglesolving

Building deep learning models to tackle some interesting Kaggle datasets.

artificial-intelligence deep-learning jupyter-notebook machine-learning matplotlib numpy personal tensorflow

Last synced: 04 May 2026

https://github.com/alphan26/optimal-logistics-locator

A project that estimates the biomass availability of places based on their index and determines the optimal preprocessing depot and biorefinery locations in Gujarat, India.

numpy pandas python sckit-learn

Last synced: 04 May 2026

https://github.com/dmkk01/mlp-python

Implementation of a multilayer perceptron using the PyTorch and NumPy libraries.

mlp numpy pytorch

Last synced: 05 May 2026

https://github.com/lorenzorottigni/ml-missing-values

Machine Learning course of Piero Savastano 5: ColumnTransformer, SimpleImputer, numpy

column-transformer machine-learning missing-values numpy simple-imputer

Last synced: 05 May 2026

https://github.com/jokoum/sentiment-classification

Multi-layer perceptron classification of product reviews written in Greek.

flask machine-learning multilayer-perceptron-network neural-network numpy python sentiment-classification

Last synced: 05 May 2026

https://github.com/thekartikeyamishra/object-detection-for-retailer

The AI Customer Feedback Summarizer is a Python-based tool that uses extractive summarization techniques to provide concise summaries of customer feedback.

ai computer-vision cv2 numpy pillow python torch torchvision yolov5

Last synced: 05 May 2026

https://github.com/bhaskarkulshrestha/rock-vs-mine-prediction

A machine learning project to predict whether an object detected underwater is a rock or a mine using Logistic Regression.

colab-notebook logistic-regression machine-learning numpy pandas python sckit-learn supervised-learning

Last synced: 05 May 2026

https://github.com/burhanali2211/marksanalyzer

This is a simple CLI-based tool that helps analyze student marks by calculating key statistics such as the highest and lowest scores.

analysis analyzer arrays marks numpy

Last synced: 05 May 2026

https://github.com/vc-nithesh944/pandas_numpy

Practice repository for NumPy & Pandas with scripts, notebooks, and notes for future revision

numpy pandas

Last synced: 05 May 2026

https://github.com/kbunwong/stock-price-prediction-using-lstm

This project aims to predict stock prices more accurately using an LSTM model. It fetches real-time data through the yfinance library.

google-colab jupyter-notebook keras-tensorflow lstm-model machine-learning matplotlib numpy pandas python visualization yfinance

Last synced: 05 May 2026

https://github.com/sowmi0304/predictive-modeling-titanic-survival-prediction

Machine learning project to predict passenger survival on the Titanic using classification algorithms, data preprocessing, and model evaluation techniques.

machine-learning matplotlib numpy pandas python seaborn

Last synced: 05 May 2026

https://github.com/elijahondiek/data-analysis-udacity

Udacity Data Analysis Nanodegree course resources

jupyter-notebook matplotlib numpy pandas python

Last synced: 05 May 2026

https://github.com/zafir100100/cancer-stage-prediction

This code predicts cancer data using various regression models, calculates their average R-squared scores, and prints the best model.

cross-validation data-analysis data-preprocessing decision-trees gradient-boosting linear-regression machine-learning-algorithms numpy pandas random-forest regression scikit-learn

Last synced: 05 May 2026

https://github.com/hitthecodelabs/petalanalyticsstreamlit

Web application developed with Streamlit that predicts the Iris flower type based on its physical features

matplotlib model numpy pickle python scikit-learn sklearn streamlit

Last synced: 05 May 2026

https://github.com/rayan-alam-uoit/space-x-launch-prediction-tool

A full-stack data science project analyzing SpaceX launch data using web scraping, REST APIs, SQL, interactive visualizations, and machine learning in Python to explore trends and predict launch success.

dash data-science folium jupyter-notebook machine-learning matplotlib numpy pandas plotly python seaborn sql sqlite

Last synced: 05 May 2026

https://github.com/ranaessam03/climate-prediction

A machine learning project to predict rainfall using Decision Tree, k-Nearest Neighbors (kNN), and Naïve Bayes, with preprocessing, model evaluation, and custom kNN implementation.

decision-trees knn-classification machine-learning naive-bayes-classifier numpy sklearn

Last synced: 05 May 2026

https://github.com/ahmedabdalkreem/airline_passenger_satisfaction

A machine learning project that models airline customer satisfaction: whether the service passengers received left them satisfied with the experience or not.

decision-trees gradient-boosting knn logistic-regression machine-learning matplotlib naive-bayes numpy pandas preprocessing python random-forest svm visualization

Last synced: 05 May 2026

https://github.com/batrick-swaistan/nalaiya_thiran

Nalaiya Thiran IBM Course Training

matplotlib numpy pandas-dataframe python

Last synced: 05 May 2026

https://github.com/theadeelahmed/mall-customers-segmention

This project segments mall customers based on income, age, and shopping score. Using clustering techniques, it identifies key customer groups for targeted marketing campaigns. Tools used: Pandas, Matplotlib, Seaborn, and Scikit-learn

matplotlib-pyplot numpy pandas python3 seaborn sklearn-clustering

Last synced: 05 May 2026

https://github.com/pyladiesams/python-oss-sprints-mar2024

Sprints to contribute to the most popular data and AI Python OSS libraries

conda dask geopandas jupyter matplotlib numpy pandas pymc sklearn

Last synced: 05 May 2026

https://github.com/pramodyasahan/cifar-10-object-recognition

This repository contains a TensorFlow-based image classification model designed to classify images from the CIFAR-10 dataset. The CIFAR-10 dataset comprises 60,000 32x32 color images in 10 classes, with 6,000 images per class.

convolutional-neural-networks numpy python tensorflow

Last synced: 05 May 2026

https://github.com/jonad/quora_data_analysis

Exploratory data analysis and feature engineering on Quora dataset.

exploratory-data-analysis feature-extraction numpy pandas python3 quora-dataset

Last synced: 05 May 2026

https://github.com/muthupillai1204/diwali_sales_analysis

The Diwali sales analysis reviews past data to identify trends, peak buying times, popular products, and customer demographics. It assesses sales volume, revenue growth, and promotional effectiveness, helping businesses optimize marketing and inventory for future seasons.

data datacleaning eda excel jupyter-notebook matlplotlib numpy pandas python seaborn visualization

Last synced: 05 May 2026

https://github.com/sahaninilesh/future_ds_01_social-media-sentiments-analysis

The Social Media Sentiments Analysis Dataset captures a vibrant tapestry of emotions, trends, and interactions across various social media platforms. This dataset provides a snapshot of user-generated content, encompassing text, timestamps, hashtags, countries, likes, and retweets. Each entry unveils unique stories—moments of surprise, excitement,

jupyter matplotlib numpy pandas python

Last synced: 05 May 2026

https://github.com/ayushdutta-ai/face-recognition-model

A custom face recognition model built from scratch using Python, OpenCV, Pillow, and Face libraries, utilizing LBPH for efficient facial feature extraction and accurate identification.

face haar-cascade lbph-face-recognizer numpy opencv pillow

Last synced: 05 May 2026

https://github.com/tsungtsetu122/patternrecognition-gnn-node-classification

This project implements a Mixed Graph Neural Network (GNN) for semi-supervised multi-class node classification using the Facebook Large Page-Page Network Dataset. The goal of the project is to predict the class of each node (page) based on 128-dimensional feature vectors and the graph structure connecting the nodes.

evaluation-metrics gnn matplotlib networkx numpy optimization pandas python pytorch-geometric

Last synced: 05 May 2026

https://github.com/waghraj1699/fraud-detection-project

A machine learning fraud detection system that detects risk based on your historical data.

data-processing decision-tree-classifier machine-learning numpy pandas

Last synced: 05 May 2026

https://github.com/hms75/movie_rating_analysis

A movie rating analysis that identifies trends in a dataset of 5000 movies.

data-analysis data-visualization matplotlib-pyplot numpy pandas python

Last synced: 05 May 2026

https://github.com/vikkiezdev/ai-global-index-analysis

This project analyzes the AI readiness of 62 countries using key indicators like government strategy, commercial activity, research, development, and infrastructure. Through data cleaning, EDA, and visualization, it identifies key drivers of AI adoption and competitiveness.

cleaning-data correlation-analysis eda matplotlib numpy pandas python3 seaborn statistical-analysis

Last synced: 06 May 2026

https://github.com/romanectatyana/fuel_norm

Practical data analytics case: calculating and optimizing fuel consumption norms for vehicles and heavy equipment. (An analytical project for determining fuel consumption norms for motor vehicles and special equipment.)

matplotlib numpy pandas plotly power-bi

Last synced: 06 May 2026

https://github.com/leavers/npblob

Transfer NumPy arrays between Python and JavaScript losslessly

array javascript matrix numpy python streaming typescript

Last synced: 06 May 2026

https://github.com/shahriarha/programming

Python problem solving, HR-30 days of code, DSA, NumPy

algorithms-and-data-structures cpp numpy programming python

Last synced: 06 May 2026

https://github.com/ianjure/simple-neural-network

Implementing a simple neural network using only NumPy.

deep-learning neural-network notebook numpy scratch-implementation

Last synced: 06 May 2026

https://github.com/iama02/vet-medicine-analysis

Analysis of 50,000+ veterinary medicine records with Pandas and NumPy, highlighting dosage forms, manufacturers, indications, and shelf life trends.

numpy pandas python

Last synced: 06 May 2026

https://github.com/frolov-andrey2405/mandelbrot-s-fractal

Mandelbrot's Fractal Constructions

numba numpy pygame python

Last synced: 06 May 2026

https://github.com/billgewrgoulas/recommendation-systems

Algorithms for joke rating prediction using the joke data-set from Kaggle.

algorithm clustering collaborative-filtering machine-learning numpy pandas recommender-system scikit-learn scypi

Last synced: 06 May 2026

https://github.com/sankaran-s2001/us-traffic-accidents-analysis-python-eda

Exploratory data analysis of US traffic accidents from 2016-2023, analyzing patterns by time, location, weather, and severity using Python data science libraries.

data-analysis data-science data-visualization eda matplolib numpy pandas python

Last synced: 06 May 2026

https://github.com/abhinav330/customer-behavior-analysis-linear-regression

This repository explores customer behavior data for an NYC clothing company with both a mobile app and website. They want to understand which platform drives higher sales.

data-analysis data-science data-visualization eda exploratory-data-analysis jupyter jupyter-notebook linear-regression machine-learning machine-learning-algorithms machinelearning-python numpy pandas python regression-analysis

Last synced: 06 May 2026

https://github.com/5hraddha/optimize-oil-well-locations

In the quest for harnessing valuable energy resources, the OilyGiant mining company wants to expand its operations by discovering new oil well locations. To achieve this, a data-driven approach is adopted, leveraging geological exploration data from three distinct regions and employing techniques in data analysis and modeling.

linear-regression numpy pandas scikit-learn supervised-learning

Last synced: 06 May 2026

https://github.com/coder5omkar/lendingclubcasestudy

Performed exploratory data analysis (EDA) on the loan dataset from the Lending Club Case Study to identify the key factors influencing loan defaults.

bivariate-analysis data-science exploratory-data-analysis matplotlib numpy pandas seaborn univariate-analysis

Last synced: 06 May 2026

https://github.com/deanlogan/data-analysis-course

Code created when completing the Data Analysis with Python Course on freecodecamp.org

course data-analysis numpy pandas python python3

Last synced: 06 May 2026

https://github.com/paulo-santos-ds/llm

A simple translation web app that converts text from Portuguese to English, built with Streamlit and Google Translator

deeptranslate googletranslate numpy pyngrok python streamlit

Last synced: 06 May 2026

https://github.com/cycle-sync-ai/student-score-analysis

A data-driven student performance analysis project using UCI dataset (396 students, 33 features). Implements machine learning models (K-means, PCA, Decision Tree, Random Forest, Linear Regression) to analyze academic patterns and predict student scores based on lifestyle, health, and study habits.

clustering clustering-algorithm decision-trees feature-engineering learning-management-system linear-regression machine-learning machine-learning-algorithms matplotlib numpy pandas pca pickle prediction prediction-algorithm scikit-learn score seaborn student

Last synced: 06 May 2026

https://github.com/naveen88112/genetic_algos_maze_solver

Genetic Algorithm Maze Solver: this project uses a genetic algorithm (GA) to solve a maze optimization problem. It evolves a population of candidate solutions toward the shortest path between the start and end points using selection, crossover, and mutation.

deap-library genetic-algorithms matplotlib maze-solver numpy python

Last synced: 06 May 2026

https://github.com/albert-2002/robust-penalized-empirical-likelihood-estimation-method-for-linear-regression

Implementation for the paper titled: Robust Penalized Empirical Likelihood Estimation Method for Linear Regression

linear-regression machine-learning numpy pandas python regression regression-analysis statistics

Last synced: 06 May 2026

https://github.com/christiansassi/signal-image-and-video-project

Project developed by Pietro Bologna (@bolognapietro) and Christian Sassi for the Signal, Image and Video course.

image-processing mri-segmentation numpy opencv python

Last synced: 06 May 2026

https://github.com/swapnilvishwakarma/twitter_sentiment_dashboard

Using Streamlit and Python, I created interactive data dashboards and used Pandas to manipulate data in data science workflows. I also used Plotly to create interactive graphs.

dashboard machine-learning matplotlib numpy pandas plotly sentiment-analysis streamlit-webapp webapp wordcloud

Last synced: 06 May 2026

https://github.com/vandanac20/advanced-real-estate-valuation-with-ensemble-regression-models

Advanced Real Estate Valuation with Ensemble Regression Models

machine-learning numpy pandas python sklearn

Last synced: 06 May 2026

https://github.com/bartosz-ziolkowski/social_data

Data analysis of San Francisco's crime data set from 2003 to 2018

data-science data-visualization jupyter-notebook numpy pandas python

Last synced: 06 May 2026

https://github.com/jayanthpotluri5513/heart-disease-prediction-using-ml

This project aims to develop a machine learning model for heart disease prediction, leveraging the potential of advanced algorithms and a comprehensive dataset.

machine-learning matplotlib numpy pandas python

Last synced: 06 May 2026

https://github.com/shimu-i/pythonmastery

Diving into Python essentials, libraries, and tricks to become an AI & ML pro. Coding, learning, and having fun while building the future!

numpy pandas python

Last synced: 07 May 2026

https://github.com/chandkund/pytorch

Foundational introduction to PyTorch, focusing on the basics of tensors, their creation, manipulation, and operations, which are essential for understanding and building deep learning models

classification computer-vision cuda-programming deep-learning loss-functions matplotlib numpy optimization pandas pyhton pytroch workflow

Last synced: 07 May 2026

https://github.com/waghraj1699/ipl-win-probability-predictor

Web app that predicts the live win probability of a match

data-preprocessing machine-learning ml-algorithms numpy pandas

Last synced: 07 May 2026

https://github.com/suhas-005/jovian-data-analysis-course-assignment

These are my assignments for the Data Analysis: Zero to Pandas course by Jovian.ai

data-analysis data-analytics numpy pandas python

Last synced: 07 May 2026

https://github.com/ai-ahmed/bda-and-ds

Dual Course: EMC2 Data Scientist & Big Data Analytics & IBM Professional Data Science Course

analytics big-data data-science jupyter-notebook jupyterlab linear-algebra mathematics matplotlib ml numpy pandas probability seaborn sklearn statistics

Last synced: 07 May 2026

https://github.com/jonad/chicago_crime_analysis

Exploratory data analysis on Chicago Crime dataset.

jupyter-notebook matplotlib numpy pandas python3

Last synced: 07 May 2026

https://github.com/s-hirata0831/graphicsprocessing

Basic image processing experiments in Python (with OpenCV)

numpy opencv python

Last synced: 07 May 2026

https://github.com/ganesh774218/eda-book-store

Exploratory data analysis on a book store dataset to uncover sales trends, popular genres, and top publishers.

data-visualization datacleaning datamanipulation eda matplotlib numpy pandas python pythonp pythonproject seaborn

Last synced: 07 May 2026

https://github.com/krishbharadwaj5678/gestureninja

Play Fruit Ninja using just your index finger with real time gesture tracking. No mouse, no keyboard — just your webcam and smooth slicing.

automation computer-vision cvzone finger-tracking fruit-ninja game-development gamedev gesture-recognition hand-tracking mediapipe mouse mouse-control numpy opencv pyautogui python realtime virtualmouse webcam

Last synced: 07 May 2026

https://github.com/shefreenkaur/web-scraping-and-word-frequencies

This project analyzes word frequencies in BC Legislative documents using Stanford CoreNLP and Python. The program extracts text from PDF documents, processes it using natural language processing techniques, and generates a comprehensive word frequency analysis.

analytics chromedriver easyocr nlp numpy pandas pymupdf python selenium stanfordcorenlp webscraping wordfrequency

Last synced: 07 May 2026

https://github.com/ifimust/room_generator

Room generator service for 2D game levels

flask google-app-engine microservice numpy procedural-generation python

Last synced: 07 May 2026

https://github.com/pesala-x/data-science-ai-assessments

This is an in-class practice application for Python-for-Data-Science-AI

data-types functions jupyter-notebook loops machine-learning numpy pandas-python python python3 seaborn

Last synced: 07 May 2026

https://github.com/kavayk29/audio-classification-using-python-library

An audio classification project that uses Python libraries such as librosa to create visual representations of audio files and NumPy to build arrays of data for manipulation, then extracts features to train and test a CNN model.

librosa matplotlib-pyplot mfcc-features numpy pandas sklearn-library

Last synced: 07 May 2026

https://github.com/jimmymugendi/bulding-a-decision-tree-to-predict-customer-churn

This repo describes building a decision tree to predict customer churn in a given organisation

accuracy-score decision-tree-classifier matplotlib-pyplot numpy pandas-dataframe scikit-learn

Last synced: 07 May 2026

https://github.com/christabelsakyi/sentiment_analysis

This project involves analyzing customer reviews to classify them as positive or negative using Logistic Regression. The workflow includes text preprocessing, feature extraction, training a model, making predictions, and evaluating its performance. Dataset

imblearn machine-learning nltk numpy python sklearn

Last synced: 07 May 2026

https://github.com/arnie1x/massive-dataset-lab

This project involves basic data manipulation with JSON files, focusing on tasks related to data processing of the MASSive dataset and file management.

data-science numpy pandas pandas-dataframe python3

Last synced: 07 May 2026

https://github.com/jpgiant/gujaratrainfallanalysis_2021

Analysis of the rainfall that occurred in the districts of Gujarat state in 2021

data-analysis exploratory-data-analysis exploratory-data-visualizations matplotlib numpy pandas-python python

Last synced: 07 May 2026

https://github.com/bettenoch/yolo_real_time_object_detector

The YOLO Real-Time Object Detector leverages the ultralytics package to provide robust object detection capabilities. This project is designed to identify and classify objects in images, video streams, and live webcam feeds with high accuracy and efficiency.

numpy object-detection object-tracking pandas python3 real-time-processing ultralytics yolov8

Last synced: 07 May 2026

https://github.com/r-mahesh45/reccomendation-system

This project involves building a recommender system using cosine similarity to measure the similarity between items or users. It leverages numerical data to provide personalized recommendations. The model focuses on extracting meaningful insights and improving decision-making processes.

numpy pandas-dataframe python3 scikitlearn-machine-learning

Last synced: 07 May 2026
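
For readers unfamiliar with the technique named in the entry above, here is a minimal sketch of item-to-item cosine similarity in NumPy. It is not taken from that repository: the function names, the rating matrix, and the choice to compare rows as items are all assumptions made for illustration.

```python
import numpy as np


def cosine_similarity_matrix(ratings: np.ndarray) -> np.ndarray:
    """Return the pairwise cosine similarity between the rows of `ratings`."""
    norms = np.linalg.norm(ratings, axis=1, keepdims=True)
    # Avoid dividing by zero when a row contains no ratings at all.
    safe_norms = np.where(norms == 0, 1.0, norms)
    unit_rows = ratings / safe_norms
    # The dot product of unit vectors is the cosine of the angle between them.
    return unit_rows @ unit_rows.T


def most_similar(ratings: np.ndarray, index: int) -> int:
    """Return the index of the row most similar to row `index`, excluding itself."""
    similarities = cosine_similarity_matrix(ratings)[index].copy()
    similarities[index] = -np.inf  # never recommend the item itself
    return int(np.argmax(similarities))


# Hypothetical item-by-user rating matrix: 3 items rated by 4 users (0 = unrated).
item_ratings = np.array([
    [5.0, 4.0, 0.0, 1.0],
    [4.0, 5.0, 0.0, 2.0],
    [0.0, 1.0, 5.0, 4.0],
])

similarity = cosine_similarity_matrix(item_ratings)
closest_to_first = most_similar(item_ratings, 0)
print(similarity.round(3))
print(closest_to_first)
```

In this made-up data the first two items are rated alike by the same users, so the second item comes out as the nearest neighbour of the first. A real recommender would typically also handle missing ratings and normalisation more carefully.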

https://github.com/azaz9026/python_library_numpy

NumPy is a Python library used for working with arrays. It also has functions for working in the domains of linear algebra, Fourier transforms, and matrices...

array jupyter-notebooks numpy python3

Last synced: 08 May 2026