Ecosyste.ms: Awesome

An open API service indexing awesome lists of open source software.

Awesome Lists | Featured Topics | Projects

NumPy

NumPy is an open source library for the Python programming language, adding support for large, multidimensional arrays, and matrices, along with a large collection of high-level mathematical functions to operate on these arrays.

https://github.com/thaisgarcia/convert-censo-file

Este repositório contém scripts Python para processar dados destinados ao censo do MEC, incluindo a leitura de um arquivo TXT, a geração de um arquivo Excel e a conversão desse arquivo Excel de volta para um formato de texto.

numpy pandas python

Last synced: 12 Nov 2024

https://github.com/elon-fask/nlp_num1

Natural Language Processing with Disaster Tweets

ai machine-learning nlp nlp-machine-learning numpy pandas python text-processing

Last synced: 07 Nov 2024

https://github.com/jonad/finding_donors

Predicting income with UCI Census Income Dataset using supervised machine learning algorithms

numpy pandas scikit-learn scikitlearn-machine-learning

Last synced: 07 Nov 2024

https://github.com/manikantasanjay/emotion_prediction_kaggle_cnn

Real time Facial Emotion Recognition System with CNN Model Architecture making use of Transfer Learning, Data Augmentation, and Regularization Techniques.

cnn convolutional-neural-networks dataset deep-learning jupyter-notebook matplotlib numpy pandas python3 pytorch transfer-learning

Last synced: 05 Nov 2024

https://github.com/jonad/quora_w2v

In-domain word embedding using Quora dataset.

gensim-word2vec matplotlib numpy pandas python3

Last synced: 07 Nov 2024

https://github.com/hayatiyrtgl/data_analysis_project

Financial data analysis: preprocess, visualize, calculate technical indicators.

data-analysis data-analysis-python data-science dataframe numpy pandas python python3 stock-price-prediction talib trade-analysis

Last synced: 05 Nov 2024

https://github.com/ksharma67/partial-dependent-plots-individual-conditional-expectation-plots-with-shap

The goal of SHAP is to explain the prediction of an instance x by computing the contribution of each feature to the prediction. The SHAP explanation method computes Shapley values from coalitional game theory. The feature values of a data instance act as players in a coalition.

eda individual-conditional-expectation matplotlib numpy pandas partial-dependence-plot python seaborn shap shapley-additive-explanations sklearn xgboost

Last synced: 06 Nov 2024

https://github.com/ksharma67/eda-on-ipl

In this python notebook, analysis of IPL matches from 2008 to 2020 is done using python packages like pandas, matplotlib and seaborn.

data-analysis data-science eda matplotlib numpy pandas python seaborn

Last synced: 06 Nov 2024

https://github.com/ksharma67/anomaly-detection-on-temperature-device-failure

A typical anomaly detection task and performing KMeans, PCA, Gaussian distribution, and Isolation Forest.

eda ellipticenvelope feature-engineering gaussian-distribution isolation-forest kmeans-clustering numpy pca python sklearn

Last synced: 06 Nov 2024

https://github.com/jonad/titanic_exploratory_analysis

Exploratory Data Analysis on Titanic Dataset

jupyter-notebook numpy pandas python3

Last synced: 07 Nov 2024

https://github.com/paulo-santos-ds/llm

Um aplicativo web simples de tradução que converte texto do português para o inglês, construído com Streamlit e Google Translator

deeptranslate googletranslate numpy pyngrok python streamlit

Last synced: 05 Nov 2024

https://github.com/moritzsalla/cci-advanced-visualization

Classwork for Mick Grierson's module at the Creative Computing Institute

matplotlib numpy openframeworks python tensorflow

Last synced: 06 Nov 2024

https://github.com/ksharma67/heart-failure-prediction

This problem is a typical Classification Machine Learning task. Building various classifiers by using the following Machine Learning models: Logistic Regression (LR), Decision Tree (DT), Random Forest (RF), XGBoost (XGB), Light GBM and Support Vector Machines with RBF kernel.

auc-roc-curve auc-roc-score decision-trees eda eli5 gridsearchcv lightgbm lime logistic-regression numpy pandas python random-forest seaborn shap skit-learn sklearn svm xgboost

Last synced: 06 Nov 2024

https://github.com/jonad/quora_data_analysis

Exploratory data analysis and feature engineering on Quora dataset.

exploratory-data-analysis feature-extraction numpy pandas python3 quora-dataset

Last synced: 07 Nov 2024

https://github.com/quocanh189/sentiment_analysis_eventhub

This project implements a Sentiment Analysis model using a neural network architecture built with Keras. The model classifies text into binary sentiment categories (positive or negative) by leveraging word embeddings and Gated Recurrent Units (GRU) layers. Additionally, the project includes a simple Flask web application to provide a real-time sent

docker flask nlp-machine-learning numpy pandas sentiment-analysis tensorflow

Last synced: 05 Nov 2024

https://github.com/thanhloc81/rfm-model-project

✨ Build a flow to deploy Segmentation evaluation through Python programming.

customer-segmentation numpy pandas python rfm-analysis

Last synced: 08 Nov 2024

https://github.com/mkdirer/multivariate-time-series-forecasting-using-transformers

This project develops a Transformers-based neural network for modeling and forecasting multivariate time series data from a COVID-19 dataset in Poland. The implementation is in Python, utilizing the Keras library for neural network design and training, along with numpy, pandas, matplotlib, and sklearn for data analysis and model evaluation.

keras-tensorflow matplotlib neural-network numpy pandas sklearn

Last synced: 07 Nov 2024

https://github.com/jonad/titanic_survival

Basic decision tree model for Titanic dataset. Predict who will survive.

jupyter-notebook matplotlib numpy pandas python3 titanic-survival-prediction

Last synced: 07 Nov 2024

https://github.com/smahala02/materials-science-introduction

Introduction to Materials Science concepts using Python for array manipulation and visualization with NumPy and Matplotlib.

data-visualization materials-science matplotlib numpy python scientific-computing

Last synced: 31 Oct 2024

https://github.com/huangcongqing/python-libraries

python库(numpy,pandas...)学习

matplotlib numpy pandas python3 xgboost

Last synced: 28 Oct 2024

https://github.com/saksham-21/data_analysis-on-hindustan-times-

Performed a proper NLP based analysis from the headlines, date published and the category of the news. Moreover created a interactive Dashboard on Tableau

beautifulsoup matplotlib numpy pandas tableau

Last synced: 12 Nov 2024

https://github.com/danielpancake/mlml

my little machine learning library written in NumPy

machine-learning numpy

Last synced: 30 Oct 2024

https://github.com/darkdk123/handwashing-discovery-analysis

A Guided Project in a Boot camp to Analyse the Original Data used in the Discovery of Viruses & Hand Washing By Dr. Ignaz Semmelweis in Vienna General Hospital in the 1840s.

data-analysis data-science data-visualization matplotlib-pyplot numpy pandas plotly-python python seaborn-plots

Last synced: 07 Nov 2024

https://github.com/harmanveer-2546/guide-to-regularization

Regularization is a crucial technique in machine learning that helps to prevent overfitting. Overfitting occurs when a model becomes too complex and learns the training data so well that it fails to generalize to new, unseen data.

generalization inline l1 l2 matplotlib numpy overfitting overfitting-prevention overfitting-reduced pandas regularization regularization-methods regularization-techniques regularization-to-avoid-overfitting seaborn

Last synced: 31 Oct 2024

https://github.com/jayanthsd2003/dms_sarathi_-driver-monitoring_system-

This is a project which can be seen on TATA Trucks and other premium edition cars based and focussed on the safety and monitoring driver Consciousness state

dlib dlib-face-detection numpy opencv-python pygame python pyttsx3 scipy

Last synced: 30 Oct 2024

https://github.com/adesoji1/food-image-classification

Food Image Classification using TensorFlow: A deep learning model to classify various food items using TensorFlow and CNNs.

colab-notebook hyperparameter-optimization hyperparameter-tuning keras-tensorflow matplotlib-pyplot numpy pandas pillow python3 pytorch regularization tensorflow transfer-learning transformer

Last synced: 30 Oct 2024

https://github.com/aniketdash7/multihead_attention_implementation

Implementation of Multihead attention mechanism using numpy and pyTorch

multihead-attention numpy pytorch torch

Last synced: 09 Oct 2024

https://github.com/niel0503/probabilidade-e-amostragem

Desafio desarrollado para el bootcamp de WoMakersCode

estadistica matplotlib numpy pandas stats

Last synced: 31 Oct 2024

https://github.com/rakibhhridoy/appliedmachinelearninghousing-regression

Let's take the Housing dataset which contains information about different houses in Boston. This data was originally a part of UCI Machine Learning Repository and has been removed now. We can also access this data from the scikit-learn library. The objective is to predict the value of prices of the house using the given features.

deep-learning housing-market housing-prices machine-learning numpy pandas python real-estate regression scikit-learn

Last synced: 06 Nov 2024

https://github.com/ahmedabdalkreem/sentiment-analysis

This project performs sentiment analysis on a Twitter dataset, aiming to classify tweets into positive, negative, or neutral sentiments. Sentiment analysis is crucial for understanding public opinion on various topics, brands, or events based on social media data.

bert-model lematization matplotlib nlp nltk numpy pandas python3 sentiment-analysis stopwords streamlit

Last synced: 31 Oct 2024

https://github.com/jcardonamde/datasets_ml

This project analyzes cab and limousine travel data in New York City. This with the goal of predicting the total duration of trips within the city. Machine learning models were used.

data-science machine-learning machine-learning-algorithms matplotlib numpy pandas pipelines python seaborn sklearn

Last synced: 05 Nov 2024

https://github.com/costopoulos/ntua-dsp

:signal_strength: NTUA ECE Digital Signal Processing Course Source Codes and Reports

dsp filters fourier-transform librosa numpy pywt scipy short-time-signal-analysis stft

Last synced: 07 Nov 2024

https://github.com/smirnovlad/data-science-notebooks

A collection of various data analysis approaches

data-science deep-learning kaggle machine-learning numpy pandas pytorch

Last synced: 30 Oct 2024

https://github.com/cmdecker95/neural_net_from_scratch

A ground-up Python implementation of an artificial neural network, using NumPy and good math.

machine-learning neural-network numpy python scratch-implementation

Last synced: 04 Nov 2024

https://github.com/aishwaryagm1999/aircraft-network-security-using-yara-rules-and-machine-learning-for-threat-detection-and-prevention

This project addresses cybersecurity in aviation by developing a machine learning-enhanced intrusion detection and prevention system (IDPS) for aircraft networks. Combining YARA-based signature detection with behavior-based (ML) anomaly detection, the system mitigates cyber threats in real-time, protecting aircraft from sophisticated attacks.

argus automation cybersecurity feature-hashing idps joblib machine-learning matplotlib networking numpy pandas python random-forest-classifier requests seaborn shell smote smotesmote tshark yara

Last synced: 04 Nov 2024

https://github.com/omarsaad21/rfm-clustering-

A full Data science and deployment project focusing on Data analysis and ML ( create a customer segmentation model to recommend the best merchants for each user as targetted offers)

business-solutions data-science eda numpy pandas plotly python sickit-learn streamlit

Last synced: 04 Nov 2024

https://github.com/omarsaad21/shopping-cart-eda

An EDA python project focusing on getting the most out of a movies dataset (- 1- combine this data together - 2- check messing values - 3- show summary statistics - 4- deal with date time and extract features from dates - 5- answer at least three questions from this data)

eda jupyter-notebook numpy pandas plotly python

Last synced: 04 Nov 2024

https://github.com/chandkund/ev-market-segmentation

This project focuses on segmenting the Electric Vehicle (EV) market in India to identify key customer groups based on their preferences, behaviors, and demographics.

machine-learning matplotlib numpy pandas seaborn sklearn visualization

Last synced: 31 Oct 2024

https://github.com/omarsaad21/it-salary-eda

A python EDA project implemented on IT department salaries data we made data exploration and made data visulization for some questions on dataset

data explotary-data-analysis juypter-notebook numpy pandas python visualization

Last synced: 04 Nov 2024

https://github.com/samkazan/structural_discovery_of_macromolecules_data_analysis

This research project uses machine learning techniques and neural network to uncover key factors that contribute to successful protein structure discovery using Python and R

classification clustering ipython-notebook jupyter-notebook keras-neural-networks keras-tensorflow machine-learning neural-network numpy python r rmarkdown scikit-learn scipy tensorflow

Last synced: 13 Nov 2024

https://github.com/alejandro945/insurance-risk

This project aims to predict the risk of insurance claims using a dataset from Kaggle. The dataset consists of 26 columns and 205 rows, providing various features related to insurance risk. By analyzing this data, we seek to build predictive models that can help insurers assess the risk of claims.

data-analytics ipython-notebook numpy pandas python

Last synced: 11 Oct 2024

https://github.com/mariam-zaidi/bike-sharing-prediction-model

Build a multiple linear regression model for the prediction of demand for shared bikes.

linear-regression matplotlib numpy pandas rfe seaborn sklearn

Last synced: 31 Oct 2024

https://github.com/tsar123/neutral-single-nucleotide-polymorphisms-analysis

analysis of data on neutral single nucleotide polymorphisms

dna-polymorphism matplotlib numpy pandas seaborn snv

Last synced: 31 Oct 2024

https://github.com/thkox/uni_2023-compression-and-object-removal

This project is a team assignment for the "Multimedia Systems" course, offered in the 6th semester of the 2023 academic year at the University of Piraeus, Department of Informatics.

2023 6th-semester colorama compression-algorithm department-of-informatics heapq lossless-compression-algorithm motion-compensation multimedia-systems numpy object-removal opencv-python pickle python scipy-stats university-of-piraeus

Last synced: 03 Nov 2024

https://github.com/bineet-ratna-shakya/data-science-salary-analysis

analyzing a dataset containing salaries of data science professionals from 2020 to 2023.

data-analysis data-science data-visualization jupyter numpy pandas python

Last synced: 11 Oct 2024

https://github.com/guybanbo/qr-algorithm

QR alogrithm for finding eigenvalues and eigenvectors of a large symmetric matrix

haskell hmatrix numpy optimization python qr-algorithm symmetric-matrix

Last synced: 31 Oct 2024

https://github.com/fimbres/emotion-recognition

This is an AI project developed in Python to recognize emotions in people.

ai kaggle-dataset keras numpy opencv python tensorflow

Last synced: 06 Nov 2024

https://github.com/badranalyst/movie-correlation-analysis-in-python

This project analyzes movie data correlations using Python libraries like Pandas, NumPy, Seaborn, and Matplotlib. It examines relationships between attributes such as ratings, genres, and box office performance to uncover trends that inform recommendations and enhance understanding of movie success factors.

data data-analysis dataset jupyter jupyter-notebook matplotlib matplotlib-pyplot numpy pandas python seaborn

Last synced: 03 Nov 2024

https://github.com/hossein-cod/deep-learning-for-training

Analysi hear disease data set with deep learning

accuracy chart jupyter keras matplotlib numpy pandas py python sklearn tensorflow

Last synced: 11 Oct 2024

https://github.com/shahaba83/airplane-ticket-cancellation

In this project, we try to predict the possibility of canceling the plane ticket by the buyer

datatime numpy pandas python scikit-learn seaborn

Last synced: 31 Oct 2024

https://github.com/hadson0/dip-candy-classification

Candy classification project using digital image processing. Includes dataset creation, model development, and evaluation tools with TensorFlow, OpenCV, and Albumentations. Complete setup and instructions provided for building and training the model.

albumentations bounding-boxes cnn computer-vision cvat image-classification image-processing keras machine-learning matplotlib numpy opencv python seaborn sklearn tensorflow

Last synced: 26 Sep 2024

https://github.com/marvin-vw/3dengine

3D engine written in Python using OpenCV and NumPy

3dengine numpy opencv python

Last synced: 26 Sep 2024

https://github.com/ugurcan222/a-different-approach--image-enhancement-with-imputation-and-regression-methods

This experimental work presents a different approach to increase the size and quality of an image by adding a blank pixel around each pixel in an image, enlarging the image, breaking it into parts, and generating these blank pixels by predicting them with models.

ai-image-upscaling computer-vision digital-image-processing gradient-boosting image-analysis image-enhancement image-enlargement image-interpolation image-processing imputation knn machine-learning numpy opencv pixel-prediction python randomforest regression-models super-resolution xgboost

Last synced: 31 Oct 2024

https://github.com/hansalemaos/numpy_choices

Some useful np.random functions

choices numpy python random

Last synced: 30 Oct 2024

https://github.com/avik-pal/kaggle-titanic

Predicting whether a given set of people survive on the Titanic

machine-learning numpy pandas scikit-learn scikitlearn-machine-learning

Last synced: 12 Oct 2024

https://github.com/mobinabatebi/electricity_consumption_project

The project aimed at estimating electricity consumption in administrative offices, along with analyzing the impacts of such consumption on power outages, is critical for addressing the pressing issue of summer electricity shortages.

jupyter-notebook numpy pandas python

Last synced: 26 Sep 2024

https://github.com/jmromer/salaries

A number-crunching API powered by Flask and Pandas

demo flask numpy pandas python python3

Last synced: 13 Nov 2024

https://github.com/samir-atra/cancerdetector

The final project of AI50xIraq course is found in the commit on 21-3-2023

cancer-detection cnn dataset inceptionv3 matplotlib numpy opencv tensorboard tensorflow tensorflow-tutorials transfer-learning

Last synced: 03 Nov 2024

https://github.com/rampal-punia/data-science-toolkit

Your Go-To Resource for Essential Data Science Related Commands, Concepts, Quick Overviews and Useful Functions.

artificial-intelligence data-science keras machine-learning matplotlib nlp nlp-machine-learning numpy pandas pythorch sql tensorflow

Last synced: 03 Nov 2024

https://github.com/pradeep31747/smartsuggest-personalized_product_recommendations

This project implements a personalized product recommendation system using machine learning techniques to enhance user experience and drive engagement.

jupyter-notebook keras numpy pandas pyhton scikit-learn sql tensorflow vscode

Last synced: 03 Nov 2024

https://github.com/hannah-aji/predictive-modeling-to-identify-high-risk-areas

Predictive Modeling to Identify High-Risk Areas for Targeted Interventions in Chicago, Illinois

machine-learning matplotlib numpy supervised-learning visualization

Last synced: 06 Nov 2024

https://github.com/csengupta1101/netflix-rating

The project revolves around Netflix shows and movies around the world. The problem statement that is being tried to address here is that what kind of show to come up with in future times and how well that will fit with the audience

jupyter-notebook matplotlib movies netflix numpy pandas plotly python python3 rating tvseries

Last synced: 07 Nov 2024

https://github.com/arserchung/ml_cnn

Basis CNN intro& project of recognize to cat or dog

glob keras keras-models numpy opencv os sklearn tensorflow

Last synced: 03 Nov 2024

https://github.com/jakeoeding/rippy

A Python seam carving implementation

numpy python3 seam-carving

Last synced: 06 Nov 2024

https://github.com/fanisgl/cnn_machine_learning

A Machine Learning exercise that trains Convolutional Neural Network (CNN) using the tensorflow 2 and Keras libraries to predict images from the CIFAR-10 dataset.

data-science keras machine-learning neural-networks numpy pandas pooling-layers relu softmax tensorflow2

Last synced: 03 Nov 2024

https://github.com/amiriiw/gesture_recognition

Welcome to the Gesture Recognition Project! This project is designed to train a model to recognize hand gestures from images, and then use that model to detect gestures in real-time.

gesture-recognition json keras matplotlib numpy pathlib tensorflow

Last synced: 03 Nov 2024

https://github.com/varuntotakura/face-recognition

My own code for Face Recognition using Python

face-recognition numpy opencv python tensorflow

Last synced: 13 Nov 2024

https://github.com/ttwag/p2_gaussian_elimination

A function that encodes the Gaussian Elimination algorithm

algorithm linear-algebra numpy python

Last synced: 06 Nov 2024

https://github.com/gabboraron/python_nyelvtan

Egy python nyelvi elemekkel fogallkozó kurzus jegyzete.

ai matplotlib-pyplot numpy pandas

Last synced: 01 Nov 2024

https://github.com/jmromer/warbler

A recommendation engine powered by scikit-learn and served by Flask.

flask flask-api numpy pandas python3 scikit-learn

Last synced: 13 Nov 2024

https://github.com/pedrosfaria2/analisetitulosnetflix

Estudo de popularidade dos filmes da Netflix no IMDB.

analise-de-dados data-analysis jupyter-notebook matplotlib numpy pandas python

Last synced: 09 Nov 2024

https://github.com/aliakbar-omidi/ten-most-populous-cities-of-iran

This project is to show the densely populated cities of Iran on the map

basemap matplotlib numpy pandas python

Last synced: 05 Nov 2024

https://github.com/jcardonamde/food-sales-predictions

A project that seeks to predict the sales of a food distributor by applying machine learning algorithms and data analysis.

data-science machine-learning machine-learning-algorithms numpy pandas pipelines python seaborn sklearn

Last synced: 05 Nov 2024

https://github.com/badranalyst/student-tests-data-analysis-application

Python-based analysis of student test scores in math, reading, and writing, examining correlations with parental education, lunch type, and test preparation. Includes data cleaning, visualization, and statistical insights into factors influencing academic performance.

data-analysis data-visualization dataset matplotlib numpy pandas python sklearn

Last synced: 03 Nov 2024

https://github.com/akshay-kamath/anomaly_detection

This project addresses the challenge of identifying anomalies in sales data, aiming to optimize sales strategies by uncovering deviations from expected profit patterns.

anomaly anomaly-detection cblof flask isolation-forest knn matplotlib numpy pandas pyod python sklearn

Last synced: 03 Nov 2024

https://github.com/sukanyadutta52/topic_modeling

What are the most pressing concerns regarding ‘Climate Change’ among tweeters according to Topic Modeling?

climate-change gensim matplotlib nltk numpy pandas pyldavis regular-expression spacy

Last synced: 06 Nov 2024

https://github.com/yashika-malhotra/machine-learning---linear-regression-on-education-institute

In this analysis, I built a model to predict graduate admissions using Linear, Ridge, Lasso, and ElasticNet regressions. CGPA, GRE, and TOEFL scores emerged as key predictors. ElasticNet effectively handled multicollinearity and balanced L1 and L2 regularization.

linear-models linear-regression matplotlib normalization numpy pandas python seaborn sklearn sklearn-library standardization standardscaler statsmodels

Last synced: 03 Nov 2024

https://github.com/ramchaik/cinebrain

CineBrain uses ML & NLP to analyze movies and recommend similar ones based on user preferences through cosine similarity.

cosine-similarity flask htmx kaggle machine-learning movie-recomendation-system nlp nltk numpy pandas sklearn tailwind tmdb tmdb-api unsupervised-learning vectorization

Last synced: 03 Nov 2024

https://github.com/chandkund/stock-price-prediction-

Predict Tesla stock prices using historical data. The project preprocesses data, applies normalization and standardization, trains a linear regression model, and evaluates its performance. Visualizations include box plots and time series plots.

data-science linear-regression numpy pandas plotly python sklearn sklearn-model

Last synced: 03 Nov 2024

https://github.com/offthetab/lung_cancer_prediction

ML for Lung Cancer Prediction dataset from kaggle.

matplotlib numpy pandas python seaborn sklearn

Last synced: 03 Nov 2024

https://github.com/chandkund/iris-classification

Iris Classification project using machine learning to classify iris flowers into Setosa, Versicolour, and Virginica species. Includes data exploration, model training with SVM, Logistic Regression, and Decision Tree, and performance evaluation. Contributions welcome! Python, Pandas, Scikit-learn, Seaborn, Matplotlib.

machine-learning matplotlib-pyplot numpy pandas python sklearn

Last synced: 03 Nov 2024