Ecosyste.ms: Awesome

An open API service indexing awesome lists of open source software.

Awesome Lists | Featured Topics | Projects

NumPy

NumPy is an open source library for the Python programming language, adding support for large, multidimensional arrays, and matrices, along with a large collection of high-level mathematical functions to operate on these arrays.

https://github.com/davgiles/ut-austin-data-science-program

This repository contains my projects from the Data Science & Business Analytics Post-Graduate Program through UT Austin.

eda matplotlib numpy pandas python scikit-learn scipy seaborn visualization xgboost

Last synced: 03 Nov 2024

https://github.com/davityak03/sentence-paraphraser-checker-using-transformers

This Jupyter Notebook implements a tool to check whether two sentences are paraphrases by analyzing their semantic similarity using NLP techniques. It provides a similarity score and a binary decision to indicate if the sentences are paraphrases.

keras nlp nltk numpy python tensorflow tokenizer transformers

Last synced: 13 Oct 2024

https://github.com/navindafernando/heart-disease-prediction

Deploying the heart risk level predicting Regression model in a web application using Flask :white_flower:

flask html joblib machine-learning numpy

Last synced: 08 Nov 2024

https://github.com/prashhhant213/cardioflex-treadmill-analysis-using-descriptive-statistics-probability

Description Analysis and Visualization on CardioFlex Treadmill data to provide insights and recommendations to improve their userbase.

colab-notebook numpy pandas probability python stats

Last synced: 07 Nov 2024

https://github.com/hansalemaos/numpyslicesplit

Splits a numpy array or a list based on the given indices or ranges and returns the split arrays.

numpy split

Last synced: 07 Nov 2024

https://github.com/prashhhant213/exploratory-data-analysis-for-multinational-retail-corporation

Analysis via CLT and Visualization on Multinational Retail Corporation's data to provide insights and recommendations to improve their userbase.

math matplotlib numpy pandas python scipy-stats seaborn stats

Last synced: 07 Nov 2024

https://github.com/prashhhant213/data_analysis_and_visualization-_for_streaming_platform

Data Analysis and Visualization for streaming platform to provide insights and recommendations to improve their userbase.

colab-notebook data datavisualization matplotlib numpy pandas python seaborn

Last synced: 07 Nov 2024

https://github.com/szymon-budziak/real_estate_house_prices_prediction

Predicting real estate house prices using various machine learning algorithms, including data exploration, preprocessing, model training, and evaluation.

data-analysis data-preprocessing data-science eda jupyter-notebook machine-learning matplotlib numpy optuna pandas predictive-modeling price-prediction python random-forest regression scikit-learn seaborn

Last synced: 10 Oct 2024

https://github.com/zenithclown/agine

Library for finding Nearest Neighbor or to find if two points on Earth have a Direct Line of Sight.

bearing bearing-calculations distance-calculation euclidean-distance geometry haversine manhattan-distance nearest-neighbor-search numpy pandas rasterio shapely

Last synced: 07 Nov 2024

https://github.com/alejoduarte23/si_bayesianmixturemodel

Implementation of a two-stage fast Bayesian system identification for separated Modes. This repository expands the usage of this technique by adding a mixture model fit to obtain modal parameters from the posterior distribution.

matplotlib numpy scikit-learn scipy

Last synced: 10 Oct 2024

https://github.com/mkdirer/depression-data-analysis

This project analyzes a Kaggle depression dataset using data preprocessing, clustering, classification, and outlier detection techniques. Python libraries like pandas, numpy, matplotlib, seaborn, and scikit-learn are used to extract insights.

classification clustering matplotlib numpy pandas scikit-learn seaborn vizualization

Last synced: 10 Oct 2024

https://github.com/bilgenurbekar/turkishcyberbullying

Contains fine-tuned BERT models and results in the text classification category using Turkish social media data

bert-fine-tuning huggingface-transformers matplotlib numpy pandas python pytorch scikit-learn transformers

Last synced: 10 Oct 2024

https://github.com/r-papso/pynet

Deep learning library implemented in Python

deep-learning machine-learning neural-network numpy python

Last synced: 13 Oct 2024

https://github.com/sahaavi/color-detection

Colour detection is the process of detecting the name of any color. Well, for humans this is an extremely easy task but for computers, it is not straightforward. Human eyes and brains work together to translate light into color. Light receptors that are present in our eyes transmit the signal to the brain. Our brain then recognizes the color. Since childhood, we have mapped certain lights with their color names. We will be using the somewhat same strategy to detect color names.

color-detection color-detector numpy opencv-python pandas

Last synced: 07 Nov 2024

https://github.com/chaitanya1436/student_performance_analysis

A project focused on analyzing college student performance using data on department, assessment scores, and performance labels. Implemented in Google Colab, the analysis includes data preprocessing, feature scaling, and exploratory data analysis to uncover insights and prepare the data for further analysis or modeling.

ata-preprocessing data-preparation exploratory-data-analysis feature-scaling google-colab numpy pandas scikit-learn

Last synced: 03 Nov 2024

https://github.com/stella4444/linear-regression

learning about linear regression (currently a work in progress) ~ working with data

linear-regression machine-learning numpy scikit-learn

Last synced: 03 Nov 2024

https://github.com/open-risk-academy/academy-course-dat31055

Support material for Academy course DAT31055

linear-algebra matrix-libraries ndarray numpy

Last synced: 13 Oct 2024

https://github.com/alaazameldev/text-based-search-engine

Implementation of a search engine using TF-IDF and Word Embedding-based vectorization techniques for efficient document retrieval

chromadb fastapi gensim-word2vec nltk numpy precision-recall python scikit-learn tf-idf-vectorizer

Last synced: 10 Oct 2024

https://github.com/allenvox/neural

Workspace for Neural Networks class

jupyter-notebook neural-networks numpy python pytorch tensorflow

Last synced: 13 Oct 2024

https://github.com/alvinmurimi/jumia

Data analysis on smartphones listed on jumia.co.ke

beautifulsoup4 jumia matplotlib numpy pandas scraping seaborn

Last synced: 14 Oct 2024

https://github.com/bilalm04/email-spam-classifier

A machine learning project that classifies emails as spam or not spam using Logistic Regression, with a deployable Flask API for real-time classification.

api flask jupyter-notebook machine-learning matplotlib nlp numpy pandas python scikit-learn

Last synced: 10 Oct 2024

https://github.com/sarowarahmed/predicting-kolkata-house-price

🏠 Predicting Kolkata House Price: A web app powered by a Machine Learning model, built with Numpy, Pandas, Scikit-learn, and Streamlit, to predict house prices in Kolkata. Deployed on Streamlit Cloud for easy access and real-time predictions.

app kolkata linear-regression machine-learning numpy pandas scikit-learn streamlit

Last synced: 03 Nov 2024

https://github.com/vjpaij/datascience-udemy

Various basic codes on datascience

machine-learning matplotlib numpy pandas python seaborn

Last synced: 19 Nov 2024

https://github.com/venky-1710/superhero-recruitment

Superhero Recruitment System predicts hero selection using machine learning. Users input hero attributes through a web interface. A Random Forest model analyzes abilities, strengths, weaknesses, success rates, and missions completed. The Flask app displays results, showing if a hero is selected.

css flask html numpy pandas python scikit-learn

Last synced: 19 Nov 2024

https://github.com/venky-1710/stress-level-predection

Stress Level Prediction is a web app using machine learning to estimate user stress levels. It takes inputs like anxiety, sleep quality, and academic performance, then predicts stress using a Decision Tree Classifier. Built with Python, Flask, and scikit-learn, it's useful for students, researchers, and those interested in stress management.

css flask html machine-learning numpy pandas python python-sklearn scikit-learn

Last synced: 19 Nov 2024

https://github.com/yuriidorosh/information-theory-and-coding

Using an analytical description for data, the theory of information objectifies the number of bits required to represent the data which is the source's information entropy. Coding theory is the study related to the nature of codes and their individual capability for particular applications.

coding information information-and-coding-theory matlab matlab-script matplotlib matplotlib-python numpy python python-3 python3

Last synced: 14 Oct 2024

https://github.com/ifimust/room_generator

Room generator service for 2D game levels

flask google-app-engine microservice numpy procedural-generation python

Last synced: 10 Nov 2024

https://github.com/manuel-lang/numpymongo

A python package to export NumPy data to MongoDB

mongodb numpy wrapper

Last synced: 02 Nov 2024

https://github.com/hifly81/photox

photo and album organizer for gtk3

gtk3 numpy opencv-python photography pil pillow python

Last synced: 10 Oct 2024

https://github.com/li-plus/tinynn

An educational PyTorch-like neural network framework based on NumPy

autograd deep-learning machine-learning neural-network numpy python

Last synced: 06 Nov 2024

https://github.com/sibendud/csi_2024_dataengineering

Internship on Data Engineering where below topics are applied skills that are used to complete the given tasks through out 8 weeks including the project.

adf adls azure-pipelines databricks docker ipynb json numpy pandas-python parquet-avro pipeline pyspark python sql sql-server

Last synced: 11 Oct 2024

https://github.com/aminnj/numpy.jl

Simple numpy snippets implemented in julia

joke julia maybenot numpy python

Last synced: 12 Oct 2024

https://github.com/fearless-spider/spideroptic-cli

Spider Optic CLI is a command line application.

cement cli cli-app command-line makefile numpy oanda pypi python

Last synced: 24 Oct 2024

https://github.com/sonaligill/olympics-analysis

The outcome of this project is an interactive streamlit web application that visualizes the analysis of Olympic data while rendering different aspects of Olympic history, compare country performances, and gain insights into athlete demographics.

numpy plotly python scikit-learn scipy streamlit

Last synced: 12 Oct 2024

https://github.com/sonu275981/face-mask-detection-web-app

Face Mask Detection web application built with Flask, Keras-TensorFlow, OpenCV. It can be used to detect face masks both in images and in real-time camera video using browser.

flask-application haar-cascade-classifier haarcascade-frontalface javascript keras-tensorflow mask mask-detection numpy opencv webapp

Last synced: 05 Nov 2024

https://github.com/morpheomax/uberpython

Viajes realizados en Uber con Python

numpy pandas python streamlit uber

Last synced: 11 Nov 2024

https://github.com/sonu275981/uber-rides-data-analysis

Analysis of Uber's Ridership Data for NYC.

data-analysis-python flask machine-learning numpy pandas uber

Last synced: 05 Nov 2024

https://github.com/xkomil/datasciencesummerstudy

I want to document in this repository all my studying data science topics

data-science data-visualization machine-learning meteostat numpy pandas seaborn sklearn streamlit-webapp

Last synced: 05 Nov 2024

https://github.com/jonad/chicago_crime_analysis

Exploratory data analysis on Chicago Crime dataset.

jupyter-notebook matplotlib numpy pandas python3

Last synced: 07 Nov 2024

https://github.com/jonad/finding_donors

Predicting income with UCI Census Income Dataset using supervised machine learning algorithms

numpy pandas scikit-learn scikitlearn-machine-learning

Last synced: 07 Nov 2024

https://github.com/jonad/quora_w2v

In-domain word embedding using Quora dataset.

gensim-word2vec matplotlib numpy pandas python3

Last synced: 07 Nov 2024

https://github.com/jonad/titanic_exploratory_analysis

Exploratory Data Analysis on Titanic Dataset

jupyter-notebook numpy pandas python3

Last synced: 07 Nov 2024

https://github.com/jonad/quora_data_analysis

Exploratory data analysis and feature engineering on Quora dataset.

exploratory-data-analysis feature-extraction numpy pandas python3 quora-dataset

Last synced: 07 Nov 2024

https://github.com/jonad/titanic_survival

Basic decision tree model for Titanic dataset. Predict who will survive.

jupyter-notebook matplotlib numpy pandas python3 titanic-survival-prediction

Last synced: 07 Nov 2024

https://github.com/huangcongqing/python-libraries

python库(numpy,pandas...)学习

matplotlib numpy pandas python3 xgboost

Last synced: 28 Oct 2024

https://github.com/samkazan/structural_discovery_of_macromolecules_data_analysis

This research project uses machine learning techniques and neural network to uncover key factors that contribute to successful protein structure discovery using Python and R

classification clustering ipython-notebook jupyter-notebook keras-neural-networks keras-tensorflow machine-learning neural-network numpy python r rmarkdown scikit-learn scipy tensorflow

Last synced: 13 Nov 2024

https://github.com/alejandro945/insurance-risk

This project aims to predict the risk of insurance claims using a dataset from Kaggle. The dataset consists of 26 columns and 205 rows, providing various features related to insurance risk. By analyzing this data, we seek to build predictive models that can help insurers assess the risk of claims.

data-analytics ipython-notebook numpy pandas python

Last synced: 11 Oct 2024

https://github.com/bineet-ratna-shakya/data-science-salary-analysis

analyzing a dataset containing salaries of data science professionals from 2020 to 2023.

data-analysis data-science data-visualization jupyter numpy pandas python

Last synced: 11 Oct 2024

https://github.com/costrouc/xnd-notebooks

A collections of notebooks that demonstrate xnd, ndtypes, and gumath features

binder gumath ndtypes notebooks numba numpy xnd

Last synced: 07 Nov 2024

https://github.com/strawndri/python-ds-numpy

Projeto de introdução à ciência de dados e Python, da plataforma Alura.

alura data-science numpy python

Last synced: 10 Nov 2024

https://github.com/hossein-cod/deep-learning-for-training

Analysi hear disease data set with deep learning

accuracy chart jupyter keras matplotlib numpy pandas py python sklearn tensorflow

Last synced: 11 Oct 2024

https://github.com/avik-pal/kaggle-titanic

Predicting whether a given set of people survive on the Titanic

machine-learning numpy pandas scikit-learn scikitlearn-machine-learning

Last synced: 12 Oct 2024

https://github.com/varuntotakura/face-recognition

My own code for Face Recognition using Python

face-recognition numpy opencv python tensorflow

Last synced: 13 Nov 2024

https://github.com/gabboraron/python_nyelvtan

Egy python nyelvi elemekkel fogallkozĂł kurzus jegyzete.

ai matplotlib-pyplot numpy pandas

Last synced: 01 Nov 2024

https://github.com/alejoduarte23/reading_data_from_dewesoft

The following repository retrieves sensor data (acceleration and strains) from both local and cloud databases. It processes the data using classes from another repository called Modal Engine for spectral analysis, modal analysis, and signal processing.

dewesoft matplotlib modal-analysis numpy orm scipy signal-processing sql sqlalchemy

Last synced: 12 Oct 2024

https://github.com/e-candeloro/data-analysis-code-snippets-for-pandas-and-sklearn

These notebooks are useful to learn how to load, understand, clean and classify data using Pandas and Sklearn with Python

analysis big-data classification data datascience datavisualization machine-learning notebook numpy pandas python sklearn

Last synced: 07 Nov 2024

https://github.com/luizabash/wmap-tt-eda

EDA of the WMAP 5-Year TT Power Spectrum data, the analysis includes summary statistics, data visualization, polynomial regression for model fitting, and hypothesis testing

astrophysics matplotlib numpy pandas scipy

Last synced: 13 Oct 2024

https://github.com/ismaelvr1999/bioinspiredalgorithms

This repository contains implementations of various bioinspired algorithms for optimization and machine learning.

machine-learning matplotlib numpy python tensorflow

Last synced: 07 Nov 2024

https://github.com/cyb3r-jak3/docker-alpine-pandas

Python alpine image with pandas, and numpy installed

alpine-image numpy pandas-python python-alpine

Last synced: 16 Nov 2024

https://github.com/pushtogithub23/projectilemotion-simulation

This Jupyter notebook explores projectile motion through mathematics, visualizations, and interactive simulations.

jupyter-notebook matplotlib-pyplot numpy physics-simulation projectile-motion-simulation

Last synced: 05 Nov 2024

https://github.com/navindafernando/feature-extraction

Heart Risk Level Predicting Regression Model & Web using Feature Engineering and Data Preprocessing :baby_chick:

categorical-encoding feature-engineering flask handling-outlier html5 joblib label-encoding machine-learning numpy pandas polynomial-features quantile-transformer scaling

Last synced: 08 Nov 2024

https://github.com/shuyib/pimadlkeras

Using keras specify-compile-fit- predict workflow on this binary classification problem to investigate if i'll get better predictions.

diabetes hidden-layers keras-neural-networks matplotlib-figures model-validation numpy pima-indians-dataset prediction predictive-modeling python-3-5 seaborn

Last synced: 18 Nov 2024

https://github.com/quickheaven/scs-3251-statistics-for-data-science

Repository dedicated to Term Project of UofT Statistics for Data Science Course

data-science jupyter-notebook logistic-regression numpy pandas python

Last synced: 18 Nov 2024

https://github.com/jessicahora/series_temporais

Análise de Series Temporais do Consumo de Energia da Alemanha.

numpy pandas python time-series-analysis

Last synced: 11 Nov 2024

https://github.com/lasithaamarasinghe/hand-gesture-math-solver

This project demonstrates a real-time hand gesture recognition system using Python, OpenCV, and Gemini AI by Google.

computer-vision cvzone gemini-ai hand-recognition numpy opencv pillow python visual-studio-code

Last synced: 11 Nov 2024

https://github.com/sivatsk26/university-admit-eligibility-predictor

This project is created using Machine Learning and Regression methods- a statistical technique to predict the outcome of event which is to verify the users’ admission eligibility level, considering the universities they have chosen. This is achieved based on the algorithms implemented, when is user feed the application with the required information

html-css-javascript ibm-cloud ibm-watson linear-regression machine-learning matplotlib numpy pandas python python-flask random-forest scikit-learn

Last synced: 18 Nov 2024

https://github.com/drewstone/katz-centrality

Katz centrality of a wheelgraph

centrality graphs networks numpy

Last synced: 16 Nov 2024

https://github.com/virajbhutada/netflix-data-analysis

Discover the dynamics of Netflix's content library through our data analysis project. Explore trends in movies vs. series distribution, genre preferences, top directors, and more. Inform strategic decisions and enhance user experience with valuable insights.

data-science data-visualization genre-classification jupiter-notebook machinelearning-python matplotlib-pyplot netflix netflix-data-analysis numpy pandas python seaborn

Last synced: 11 Nov 2024

https://github.com/simon2k/stock-price-prediction-evaluation

This project is indented to present a small evaluation of different types of regression models for predicting stock prices for AAPL.

evaluation machine-learning numpy pandas predicting-stock-prices scikit-learn

Last synced: 13 Nov 2024

https://github.com/yashaka/learncv

Toolset used in learning Computer Vision at R_D course

cv numpy opencv utitlities

Last synced: 14 Nov 2024

https://github.com/shashwat9kumar/us-accidents-data-analysis

Analysis of the US accidents using the US-Accidents dataset (4.2 million entries) from Kaggle

accidents accidents-analysis data-analysis data-analytics data-visualisation data-visualization matplotlib numpy pandas python

Last synced: 13 Nov 2024

https://github.com/jose-zothner-meyer/finance-individual-loans

A comprehensive analysis of a financial institution's loan portfolio using Python, focusing on data preprocessing, exploratory data analysis (EDA), and feature engineering.

credit data-science data-visualization eda exploratory-data-analysis finance jupyter-notebook matplotlib numpy pandas plotly python scipy seaborn

Last synced: 11 Nov 2024

https://github.com/ivangael/genderdetection

Gender detection using gender classification model

genderclassification genderdetection numpy pil python resnet18 torch torchvision

Last synced: 11 Nov 2024

https://github.com/vadimkantorov/fasttsv

TSV parser for Python in pure vectorized NumPy code

numpy parsing simd tsv vectorized-code

Last synced: 13 Nov 2024

https://github.com/nilusche/feed_forward_network_py

Implementation of a neural network in python which can predict handwritten numbers from the MNIST dataset.

colab-notebook feed-forward-neural-networks neural-network numpy python

Last synced: 11 Nov 2024