Ecosyste.ms: Awesome

An open API service indexing awesome lists of open source software.

Awesome Lists | Featured Topics | Projects

NumPy

NumPy is an open source library for the Python programming language, adding support for large, multidimensional arrays, and matrices, along with a large collection of high-level mathematical functions to operate on these arrays.

https://github.com/muskanmi/data_analysis_python

Data analysis on students result dataset using python libraries.

boxplot countplots data-analysis numpy pandas pie-chart python3 seaborn

Last synced: 05 Nov 2024

https://github.com/estaheri7/dotawinpredictor

This project aims to predict the outcome of Dota 2 matches based on the heroes picked by both the Radiant and Dire teams

data-science dota2 machine-learning numpy pandas python random-forest sklearn support-vector-machines

Last synced: 10 Oct 2024

https://github.com/allanreda/automated-k-means-clustering-engine

An interactive K-Means clustering tool built with Flask and Scikit-Learn, supporting Excel file uploads, cluster analysis, and data export, deployed on Google Cloud Run via Docker with CI/CD integration.

cicd css data-visualization deployment docker flask google-cloud-run html javascript k-means-clustering machine-learning matplotlib numpy pandas python scikit-learn

Last synced: 31 Oct 2024

https://github.com/secary/compsci7211

Foundations of Computer Science B

algorithms git numpy pandas shell unix

Last synced: 31 Oct 2024

https://github.com/amiriiw/digit_classification

Welcome to the Digit Classification Project! This project focuses on training a model to classify handwritten digits and using the trained model to predict digits from new images.

digit-classification keras mysql mysql-database numpy os sklearn tensorflow

Last synced: 03 Nov 2024

https://github.com/offthetab/matcha_code_generation

Fine-tunning MatCha model for better python matplot lib code generation by given chart.

fine-tuning huggingface matcha matplotlib numpy pandas pil sklearn torch

Last synced: 03 Nov 2024

https://github.com/sudarsann27/basic_machine_learning_algorithms

Basic Machine learning algorithms using scikit-learn and other fundamental libraries

data-science data-visualization ensemble-model kaggle numpy pandas scikit-learn supervised-machine-learning

Last synced: 03 Nov 2024

https://github.com/mchenryspagg/investigate_a_dataset

This is a data analysis project that demonstrates the student's ability to use python data analysis libraries such as pandas, numpy and pyplot in matplotlib to investigate a dataset and answer specific questions from the dataset, thus demonstrating skills in data cleaning, data wrangling, and exploratory data analysis.

data-analysis datetime descriptive-analysis descriptive-statistics exploratory-data-analysis numpy pandas pyplot python visualization

Last synced: 08 Nov 2024

https://github.com/eiichiroito/numpharo

NumPy like tools for Pharo Smalltalk

matplotlib-pyplot numpy pharo pharo-smalltalk smalltalk

Last synced: 31 Oct 2024

https://github.com/lakshaybhushan/pns-coding-assignment

Solving questions given in "Programming Questions.pdf" using python's matplotlib and jupyter notebooks

ipynb juypter-notebook matplotlib-pyplot numpy python

Last synced: 05 Nov 2024

https://github.com/samir-atra/experiments_and-_improvements

A repository showing the knowledge in some of the machine learning technologies there are.

dcgan-tensorflow numpy tensorflow-tutorials tensorflow2

Last synced: 10 Oct 2024

https://github.com/ieCecchetti/Python_ML_DL_examples

A variety of Machine Learning and Deep Learning scripts in Python. Included some theorical info about that in the Readme

bayes-classifier bayesian-statistics deep-learning kernel machine-learning matplotlib neural-network numpy pandas pca pca-analysis python scikitlearn-machine-learning scipy shi

Last synced: 23 Oct 2024

https://github.com/ankitpundir7/group-face-emotion-recognition

This project can identify and categorize human emotions in both static and dynamic context. Duration : Jun 2023 - Jul 2023

convolutional-neural-networks deep-learning keras machine-learning-algorithms numpy opencv python tensorflow

Last synced: 10 Oct 2024

https://github.com/bharath-tars/deeptumor2.0

DeepTumor2.0 is a project aimed at developing a flask webapp with robust front end and highly trained model using tensorflow in the backend with several features like with automatic html template reporting

css datetime docs flask github html javascript keras numpy onrender os pandas python sheety smtp tensorflow

Last synced: 03 Nov 2024

https://github.com/shwetapardhi/assignment-04-simple-linear-regression-1

Q1) Delivery_time -> Predict delivery time using sorting time. Build a simple linear regression model by performing EDA and do necessary transformations and select the best model using R or Python. EDA and Data Visualization, Feature Engineering, Correlation Analysis, Model Building, Model Testing and Model Predictions using simple linear regressi

correlation-analysis data-visualization distplot eda feature-engineering model-building model-prediction model-testing numpy ols-regression p-value pandas python regression-plot rsquare-values seaborn simple-linear-regression smf statsmodel t-score

Last synced: 11 Nov 2024

https://github.com/shwetapardhi/assignment-05-multiple-linear-regression-2

Prepare a prediction model for profit of 50_startups data. Do transformations for getting better predictions of profit and make a table containing R^2 value for each prepared model. R&D Spend -- Research and devolop spend in the past few years Administration -- spend on administration in the past few years Marketing Spend -- spend on Marketing in t

collinearity-diagnostics cooks-distance correlation-analysis eda heteroscedasticity homoscedasticity leverage-score multi-linear-regression numpy ols-regression p-value pair-plot python r-square-values regress-exog residual-analysis smf statsmodels vif

Last synced: 11 Nov 2024

https://github.com/fatimaafzaal/car_price_prediction

Trains Random Forest and Gradient Boosting models to predict car prices based on user inputs for various car attributes, evaluating models and making predictions using the best-performing model.

car-price-prediction ensemble-learning gradient-boosting matplotlib numpy pandas random-forest regression regression-models scikit-learn seaborn

Last synced: 15 Nov 2024

https://github.com/mchenryspagg/prosper-loan-project

A data analysis project that entails using the data from a fictional loan company known as Prosper to perform exploratory data analysis using univariate, bivariate and multivariate visualizations to produce insights that answers questions asked from the data

datastorytelling exploratory-data-analysis exploratory-data-visualizations jupyter-notebook matplotlib-pyplot numpy pandas python seaborn-plots

Last synced: 08 Nov 2024

https://github.com/atul-maurya-30/galaxy

Galaxy Classification is a machine learning project focused on classifying galaxies into two subclasses: 'STARFORMING' and 'STARBURST'. This project demonstrates data preprocessing, model training, and evaluation using advanced machine learning techniques and Python libraries.

flask machine-learning matplotlib numpy pandas python regression-classification seaborn sklearn

Last synced: 10 Oct 2024

https://github.com/shwetapardhi/assignment-03-q1--hypothesis-testing

Q1.A F&B manager wants to determine whether there is any significant difference in the diameter of the cutlet between two units. A randomly selected sample of cutlets was collected from both units and measured? Analyze the data and draw inferences at 5% significance level. Please state the assumptions and tests that you carried out to check validit

hypothesis-testing numpy p-value pandas python scipy significance-testing stats t-test

Last synced: 11 Nov 2024

https://github.com/shwetapardhi/assignment-04-simple-linear-regression-2

Q2) Salary_hike -> Build a prediction model for Salary_hike Build a simple linear regression model by performing EDA and do necessary transformations and select the best model using R or Python. EDA and Data Visualization. Correlation Analysis. Model Building. Model Testing. Model Predictions.

correlation-analysis data-visualization distplot eda feature-engineering model-building model-predictions model-template numpy ols-regression p-value pandas python r-square-values regression-plot seaborn simple-linear-regression smf statsmodels t-score

Last synced: 11 Nov 2024

https://github.com/mustafaozvardar/image-classification-webapp

This project features an AI model for diagnosing pneumonia from chest X-ray images. Users can upload a chest X-ray image through the app, and the model will classify it.

artificial-intelligence computer-vision keras numpy python streamlit streamlit-webapp tensorflow

Last synced: 10 Oct 2024

https://github.com/jsd115/tensorflow_bankfrauddetection

This repository aims to provide examples of my ability to produce ML models using tensorflow and sklearn

numpy pandas sklearn tensorflow

Last synced: 03 Nov 2024

https://github.com/adriantomin/heart-disease-prediction

Predicting Heart Disease with ML 🏥🤖 Machine learning model to predict heart disease using clinical data. Includes data analysis, feature engineering, and model training.

machine-learning matplotlib numpy pandas python sklearn

Last synced: 10 Oct 2024

https://github.com/chandkund/sentiment-analysis-on-movie-reviews

Project on Sentiment Analysis of Movie Reviews using K-means clustering. This project clusters movie reviews based on their sentiment scores to understand audience reactions. Includes data preprocessing, feature selection, model development, and visualization.

data-science eda jupyter-notebook machine-learning matplotlib numpy pandas preprocessing python sklearn

Last synced: 03 Nov 2024

https://gitlab.com/Makman2/scipy-steadystate

SciPy extension modules for general purpose steady-state solvers and optimizations for differential state-equations!

numpy optimization scipy steady-state

Last synced: 30 Sep 2024

https://github.com/adriantomin/bulldozer-price-prediction

Predicting the Sale Price of Bulldozers Using Machine Learning 🚜💰 This project uses machine learning to predict bulldozer sale prices based on historical data from the Kaggle Bluebook for Bulldozers competition. The goal is to minimize the RMSLE between actual and predicted prices.

data-science jupyter-notebook machine-learning matplotlib-pyplot numpy pandas python scikit-learn

Last synced: 10 Oct 2024

https://github.com/davgiles/ut-austin-data-science-program

This repository contains my projects from the Data Science & Business Analytics Post-Graduate Program through UT Austin.

eda matplotlib numpy pandas python scikit-learn scipy seaborn visualization xgboost

Last synced: 03 Nov 2024

https://github.com/szymon-budziak/real_estate_house_prices_prediction

Predicting real estate house prices using various machine learning algorithms, including data exploration, preprocessing, model training, and evaluation.

data-analysis data-preprocessing data-science eda jupyter-notebook machine-learning matplotlib numpy optuna pandas predictive-modeling price-prediction python random-forest regression scikit-learn seaborn

Last synced: 10 Oct 2024

https://github.com/alejoduarte23/si_bayesianmixturemodel

Implementation of a two-stage fast Bayesian system identification for separated Modes. This repository expands the usage of this technique by adding a mixture model fit to obtain modal parameters from the posterior distribution.

matplotlib numpy scikit-learn scipy

Last synced: 10 Oct 2024

https://github.com/ibrahimhabibeg/multimedia-project

Turn images monochrome and create transition between two images

canvas image-processing multimedia numpy pillow python react typescript vite

Last synced: 05 Nov 2024

https://github.com/mkdirer/depression-data-analysis

This project analyzes a Kaggle depression dataset using data preprocessing, clustering, classification, and outlier detection techniques. Python libraries like pandas, numpy, matplotlib, seaborn, and scikit-learn are used to extract insights.

classification clustering matplotlib numpy pandas scikit-learn seaborn vizualization

Last synced: 10 Oct 2024

https://github.com/bilgenurbekar/turkishcyberbullying

Contains fine-tuned BERT models and results in the text classification category using Turkish social media data

bert-fine-tuning huggingface-transformers matplotlib numpy pandas python pytorch scikit-learn transformers

Last synced: 10 Oct 2024

https://github.com/chaitanya1436/student_performance_analysis

A project focused on analyzing college student performance using data on department, assessment scores, and performance labels. Implemented in Google Colab, the analysis includes data preprocessing, feature scaling, and exploratory data analysis to uncover insights and prepare the data for further analysis or modeling.

ata-preprocessing data-preparation exploratory-data-analysis feature-scaling google-colab numpy pandas scikit-learn

Last synced: 03 Nov 2024

https://github.com/stella4444/linear-regression

learning about linear regression (currently a work in progress) ~ working with data

linear-regression machine-learning numpy scikit-learn

Last synced: 03 Nov 2024

https://github.com/alaazameldev/text-based-search-engine

Implementation of a search engine using TF-IDF and Word Embedding-based vectorization techniques for efficient document retrieval

chromadb fastapi gensim-word2vec nltk numpy precision-recall python scikit-learn tf-idf-vectorizer

Last synced: 10 Oct 2024

https://github.com/bilalm04/email-spam-classifier

A machine learning project that classifies emails as spam or not spam using Logistic Regression, with a deployable Flask API for real-time classification.

api flask jupyter-notebook machine-learning matplotlib nlp numpy pandas python scikit-learn

Last synced: 10 Oct 2024

https://github.com/sarowarahmed/predicting-kolkata-house-price

🏠 Predicting Kolkata House Price: A web app powered by a Machine Learning model, built with Numpy, Pandas, Scikit-learn, and Streamlit, to predict house prices in Kolkata. Deployed on Streamlit Cloud for easy access and real-time predictions.

app kolkata linear-regression machine-learning numpy pandas scikit-learn streamlit

Last synced: 03 Nov 2024

https://github.com/haranrk/linear-regression

Implemented Linear Regression and some of it's variants

machine-learning matplotlib numpy python python3

Last synced: 22 Oct 2024

https://github.com/mchenryspagg/wrangle-and-analyze-data

This project which is known as 'wrangle and analyze data' involves the wrangling of WeRateDogs twitter archive data from the period of 2015 to 2017

api data dataanalysis datacollection datawrangling datetime json numpy os pandas pil python requests tweepy-api visualization

Last synced: 08 Nov 2024

https://github.com/ifimust/room_generator

Room generator service for 2D game levels

flask google-app-engine microservice numpy procedural-generation python

Last synced: 10 Nov 2024

https://github.com/manuel-lang/numpymongo

A python package to export NumPy data to MongoDB

mongodb numpy wrapper

Last synced: 02 Nov 2024

https://github.com/hifly81/photox

photo and album organizer for gtk3

gtk3 numpy opencv-python photography pil pillow python

Last synced: 10 Oct 2024

https://github.com/sibendud/csi_2024_dataengineering

Internship on Data Engineering where below topics are applied skills that are used to complete the given tasks through out 8 weeks including the project.

adf adls azure-pipelines databricks docker ipynb json numpy pandas-python parquet-avro pipeline pyspark python sql sql-server

Last synced: 11 Oct 2024

https://github.com/gherrada22/spanish-sign-translator

This project uses Python and the OpenCV and MediaPipe libraries to detect and translate signs in real time.

language matplotlib mediapipe numpy opencv pycharm-ide python python3 spanish-language toml

Last synced: 31 Oct 2024

https://github.com/jay6430/python-for-data-science

This Repository includes all the Python concepts that are necessary for Data science.

csv data-visualization datascience-machinelearning html-css ipynb-jupyter-notebook matplotlib numpy pandas python3 xml

Last synced: 13 Nov 2024

https://github.com/aminnj/numpy.jl

Simple numpy snippets implemented in julia

joke julia maybenot numpy python

Last synced: 12 Oct 2024

https://github.com/apuravdivekar2032/real-estate-price-prediction

A real estate price prediction website developed using Python with Numpy and Pandas for data cleaning Matplotlib for data visualization and scikit-learn for model building, featuring a Flask server to handle HTTP requests and integrates a user-friendly UI built with HTML, CSS, and JavaScript

flask html-css-javascript jupyter-notebook matplotlib numpy pandas pycharm python3 sklearn vscode

Last synced: 31 Oct 2024

https://github.com/lewis-morris/image_augment

Image augmentation for CNN's. Aiming to increase generalization.

augmentation image numpy python

Last synced: 31 Oct 2024

https://github.com/sonu275981/face-mask-detection-web-app

Face Mask Detection web application built with Flask, Keras-TensorFlow, OpenCV. It can be used to detect face masks both in images and in real-time camera video using browser.

flask-application haar-cascade-classifier haarcascade-frontalface javascript keras-tensorflow mask mask-detection numpy opencv webapp

Last synced: 05 Nov 2024

https://github.com/mike014/chatbot_app

Chabot is an application with a graphical user interface that uses various natural language processing (NLP) techniques to tokenize, stem, find stop words, and apply regular expressions to user-input text. The interface is built using Tkinter.

deep-learning dialogpt keras-neural-networks keras-tensorflow knn machine-learning-algorithms ml natural-language-processing nlp nltk nltk-python numpy pca python seq2seq tensorflow textpreprocessing tk tokenizer

Last synced: 31 Oct 2024

https://github.com/sonu275981/uber-rides-data-analysis

Analysis of Uber's Ridership Data for NYC.

data-analysis-python flask machine-learning numpy pandas uber

Last synced: 05 Nov 2024

https://github.com/xkomil/datasciencesummerstudy

I want to document in this repository all my studying data science topics

data-science data-visualization machine-learning meteostat numpy pandas seaborn sklearn streamlit-webapp

Last synced: 05 Nov 2024

https://github.com/jonad/chicago_crime_analysis

Exploratory data analysis on Chicago Crime dataset.

jupyter-notebook matplotlib numpy pandas python3

Last synced: 07 Nov 2024

https://github.com/smahala02/materials-science-introduction

Introduction to Materials Science concepts using Python for array manipulation and visualization with NumPy and Matplotlib.

data-visualization materials-science matplotlib numpy python scientific-computing

Last synced: 31 Oct 2024

https://github.com/jonad/finding_donors

Predicting income with UCI Census Income Dataset using supervised machine learning algorithms

numpy pandas scikit-learn scikitlearn-machine-learning

Last synced: 07 Nov 2024

https://github.com/harmanveer-2546/guide-to-regularization

Regularization is a crucial technique in machine learning that helps to prevent overfitting. Overfitting occurs when a model becomes too complex and learns the training data so well that it fails to generalize to new, unseen data.

generalization inline l1 l2 matplotlib numpy overfitting overfitting-prevention overfitting-reduced pandas regularization regularization-methods regularization-techniques regularization-to-avoid-overfitting seaborn

Last synced: 31 Oct 2024

https://github.com/jonad/quora_w2v

In-domain word embedding using Quora dataset.

gensim-word2vec matplotlib numpy pandas python3

Last synced: 07 Nov 2024

https://github.com/jonad/titanic_exploratory_analysis

Exploratory Data Analysis on Titanic Dataset

jupyter-notebook numpy pandas python3

Last synced: 07 Nov 2024

https://github.com/jonad/quora_data_analysis

Exploratory data analysis and feature engineering on Quora dataset.

exploratory-data-analysis feature-extraction numpy pandas python3 quora-dataset

Last synced: 07 Nov 2024

https://github.com/jonad/titanic_survival

Basic decision tree model for Titanic dataset. Predict who will survive.

jupyter-notebook matplotlib numpy pandas python3 titanic-survival-prediction

Last synced: 07 Nov 2024

https://github.com/aniketdash7/multihead_attention_implementation

Implementation of Multihead attention mechanism using numpy and pyTorch

multihead-attention numpy pytorch torch

Last synced: 09 Oct 2024

https://github.com/niel0503/probabilidade-e-amostragem

Desafio desarrollado para el bootcamp de WoMakersCode

estadistica matplotlib numpy pandas stats

Last synced: 31 Oct 2024

https://github.com/huangcongqing/python-libraries

python库(numpy,pandas...)学习

matplotlib numpy pandas python3 xgboost

Last synced: 28 Oct 2024

https://github.com/chandkund/ev-market-segmentation

This project focuses on segmenting the Electric Vehicle (EV) market in India to identify key customer groups based on their preferences, behaviors, and demographics.

machine-learning matplotlib numpy pandas seaborn sklearn visualization

Last synced: 31 Oct 2024

https://github.com/mariam-zaidi/bike-sharing-prediction-model

Build a multiple linear regression model for the prediction of demand for shared bikes.

linear-regression matplotlib numpy pandas rfe seaborn sklearn

Last synced: 31 Oct 2024

https://github.com/tsar123/neutral-single-nucleotide-polymorphisms-analysis

analysis of data on neutral single nucleotide polymorphisms

dna-polymorphism matplotlib numpy pandas seaborn snv

Last synced: 31 Oct 2024

https://github.com/omarsaad21/shopping-cart-eda

An EDA python project focusing on getting the most out of a movies dataset (- 1- combine this data together - 2- check messing values - 3- show summary statistics - 4- deal with date time and extract features from dates - 5- answer at least three questions from this data)

eda jupyter-notebook numpy pandas plotly python

Last synced: 04 Nov 2024

https://github.com/guybanbo/qr-algorithm

QR alogrithm for finding eigenvalues and eigenvectors of a large symmetric matrix

haskell hmatrix numpy optimization python qr-algorithm symmetric-matrix

Last synced: 31 Oct 2024

https://github.com/omarsaad21/it-salary-eda

A python EDA project implemented on IT department salaries data we made data exploration and made data visulization for some questions on dataset

data explotary-data-analysis juypter-notebook numpy pandas python visualization

Last synced: 04 Nov 2024

https://github.com/samkazan/structural_discovery_of_macromolecules_data_analysis

This research project uses machine learning techniques and neural network to uncover key factors that contribute to successful protein structure discovery using Python and R

classification clustering ipython-notebook jupyter-notebook keras-neural-networks keras-tensorflow machine-learning neural-network numpy python r rmarkdown scikit-learn scipy tensorflow

Last synced: 13 Nov 2024

https://github.com/alejandro945/insurance-risk

This project aims to predict the risk of insurance claims using a dataset from Kaggle. The dataset consists of 26 columns and 205 rows, providing various features related to insurance risk. By analyzing this data, we seek to build predictive models that can help insurers assess the risk of claims.

data-analytics ipython-notebook numpy pandas python

Last synced: 11 Oct 2024

https://github.com/shahaba83/airplane-ticket-cancellation

In this project, we try to predict the possibility of canceling the plane ticket by the buyer

datatime numpy pandas python scikit-learn seaborn

Last synced: 31 Oct 2024

https://github.com/hadson0/dip-candy-classification

Candy classification project using digital image processing. Includes dataset creation, model development, and evaluation tools with TensorFlow, OpenCV, and Albumentations. Complete setup and instructions provided for building and training the model.

albumentations bounding-boxes cnn computer-vision cvat image-classification image-processing keras machine-learning matplotlib numpy opencv python seaborn sklearn tensorflow

Last synced: 26 Sep 2024

https://github.com/ahmedabdalkreem/sentiment-analysis

This project performs sentiment analysis on a Twitter dataset, aiming to classify tweets into positive, negative, or neutral sentiments. Sentiment analysis is crucial for understanding public opinion on various topics, brands, or events based on social media data.

bert-model lematization matplotlib nlp nltk numpy pandas python3 sentiment-analysis stopwords streamlit

Last synced: 31 Oct 2024

https://github.com/jcardonamde/datasets_ml

This project analyzes cab and limousine travel data in New York City. This with the goal of predicting the total duration of trips within the city. Machine learning models were used.

data-science machine-learning machine-learning-algorithms matplotlib numpy pandas pipelines python seaborn sklearn

Last synced: 05 Nov 2024

https://github.com/omarsaad21/rfm-clustering-

A full Data science and deployment project focusing on Data analysis and ML ( create a customer segmentation model to recommend the best merchants for each user as targetted offers)

business-solutions data-science eda numpy pandas plotly python sickit-learn streamlit

Last synced: 04 Nov 2024