Ecosyste.ms: Awesome

An open API service indexing awesome lists of open source software.

Awesome Lists | Featured Topics | Projects

Data analysis

Data analysis is a process of inspecting, cleansing, transforming, and modeling data with the goal of discovering useful information, informing conclusions, and supporting decision-making.

https://github.com/dmvianna/python-nix

Trivial Nix environment with pandas and postgresql

data-analysis nix

Last synced: 04 Dec 2024

https://github.com/marcogdepinto/olympichistoryanalysis

Python visual analysis of the Olympic Games history. Kaggle gold medal with 15000+ views, 200+ upvotes and 100+ comments.

data-analysis data-science jupyter-notebook olympic-games python seaborn

Last synced: 06 Jan 2025

https://github.com/dhruvil-26/sql-projects

This repository contains SQL projects focusing on data analysis and insights. Currently, it includes: 1. RSVP Movies Analysis - SQL queries to analyze movie trends, ratings, and genres. 2. Pizza Sales Analysis - SQL queries to explore sales patterns, customer behavior, and profitability in a pizza business.

analysis data-analysis database mysql pizza-sales-analysis rdbms rsvp sql

Last synced: 17 Dec 2024

https://github.com/muneeb706/human_activity_recognition

This project performs data cleaning and data exploration steps for Human Activity Recognition Using Smartphones Data Set in R programming language.

data-analysis data-cleaning data-exploration r-programming

Last synced: 04 Dec 2024

https://github.com/muneeb706/r-programming

R-Programming examples for data analysis.

data-analysis r-programming

Last synced: 04 Dec 2024

https://github.com/muneeb706/exploratory-data-analysis

Exploratory Data Analysis of some problems using python (numpy & pandas)

data-analysis exploratory-data-analysis jupyter-n numpy pandas python3

Last synced: 04 Dec 2024

https://github.com/dina-hosny/investigate-a-dataset-project

Investigate a Dataset project - Udacity Data Analysis Professional Track. In this project, I used the Python libraries NumPy, Pandas and Matplotlib to analyze a dataset and communicate my findings about it.

csv data-analysis jupyter-notebook matplotlib numpy pandas python

Last synced: 13 Jan 2025

https://github.com/benami171/ml_knn_decision-trees

A ml implementation comparing Decision Trees and k-Nearest Neighbors (k-NN) algorithms for Iris flower classification. Features comprehensive analysis of different approaches including brute-force and entropy-based decision trees, along with k-NN using multiple distance metrics.

classification cross-validation data-analysis decision-trees iris-dataset k-nearest-neighbours machine-learning nearest-neighbors python

Last synced: 27 Jan 2025

https://github.com/chingu-voyages/v47-tier3-team-30

An easily accessible tool for calculating electricity-related carbon emissions, along with insights for reducing environmental impact. | Voyage-47 | https://chingu.io/ | Twitter: https://twitter.com/ChinguCollabs

carbon-emissions carbon-footprint data-analysis data-engineering data-science

Last synced: 13 Jan 2025

https://github.com/dcs-training/exploratory-data-analysis-and-visualisation-with-observable-plot

This two-hour workshop will teach you how to follow an exploratory data analysis pipeline with Observable Plot, a new JavaScript library based on the Grammar of Graphics, that proposes a simple yet expressive interface to create powerful graphics easily shareable on the web. Go to the Readme file

d3 data-analysis data-visualisation javascript observable-notebook

Last synced: 13 Dec 2024

https://github.com/saitoxu/data-analysis-workspace

Docker image for data analysis

data-analysis docker python

Last synced: 06 Jan 2025

https://github.com/zachlagden/spotify-listening-analyzer

A comprehensive Python tool for analyzing your Spotify listening history data.

analytics data-analysis pandas python spotify-web-api spotipy

Last synced: 14 Dec 2024

https://github.com/badranalyst/student-tests-data-analysis-application

Python-based analysis of student test scores in math, reading, and writing, examining correlations with parental education, lunch type, and test preparation. Includes data cleaning, visualization, and statistical insights into factors influencing academic performance.

data-analysis data-visualization dataset matplotlib numpy pandas python sklearn

Last synced: 21 Dec 2024

https://github.com/harkishen-singh/agriculture-ds

An Agricultural based Mtech project, on Data Science, which predicts the growth of crops based on previous year records.

data-analysis pandas python

Last synced: 29 Dec 2024

https://github.com/shz-code/diwali_sales_data_analysis

Customer Product Purchase Behavior Analysis

behavior-analysis data-analysis matplotlib ml sales seaborn

Last synced: 21 Jan 2025

https://github.com/idaraabasiudoh/credit_card_fraud_detection

This repository contains a machine learning project focused on detecting credit card fraud using Decision Tree and Support Vector Machine (SVM) classifiers.

data-analysis jupyter-notebook machine-learning python3 scikit-learn snapml

Last synced: 21 Dec 2024

https://github.com/engineertolulope/us_states_living_ranking_analysis

Python script for analyzing and ranking U.S. states based on factors like cost of living, tax burden, diversity, crime rates, and climate. Uses weighted criteria to identify the best states to live in according to these metrics. Ideal for decision-making on relocation.

data-analysis data-science linear-regression machine-learning python scikit-learn

Last synced: 21 Dec 2024

https://github.com/valyaevgeorgiy/r_basic

Работа с основами среды R и тем самым изучения нового языка программирования, связанного непосредственно с анализом данных и построением графиков и диаграмм.

coding data data-analysis r rstudio

Last synced: 29 Dec 2024

https://github.com/khanovico/python-stock-analyzer

This is a Webapp implemented by python and several data science frameworks, enabling online stock trend analyzing.

amcharts-js-charts data-analysis data-visualization flask javascript pandas python scikit-learn

Last synced: 21 Dec 2024

https://github.com/tawounfouet/mlops-specialiazation-duke

MLOps | Machine Learning Operations Specialization from Duke University : acquiring critical MLOps skills, including the use of Python and Rust, utilizing GitHub Copilot to enhance productivity, and leveraging platforms like Amazon SageMaker, Azure ML, and MLflow.

aws azure big-data cloud-computing data-analysis data-management devops mlops python rust

Last synced: 06 Jan 2025

https://github.com/jagoda11/elastic-vision

This repository contains a full-stack application designed to explore data from ElasticSearch🧐indices and visualize it using charts and graphs. The backend is built using Node.js and the frontend is powered🚀 by React.

backend chartjs dashboard-development data-analysis data-visualization docker elasticsearch frontend fullstack javascript material-ui monorepo mui-x node pie-chart react restful-api tables

Last synced: 01 Jan 2025

https://github.com/joannescode/regex-with-py

Learning by practicing with Regex (Python)

data-analysis python3 regex

Last synced: 09 Jan 2025

https://github.com/sharoonjoseph321/social_media_eda

Data Analysis on social media apps ,using pandas, python, matplotlib.

data data-analysis data-science data-visualization matplotlib programming-language project python pythonprojects

Last synced: 13 Jan 2025

https://github.com/dsite42/simple_data_visualizer

This is a simple tool to visualize data for a quick Exploratory Data Analysis (EDA). You can create various plot types as seaborn or plotly plot via a GUI in multiple windows (RelPlot, PairPlot, JointPlot, DisPlot, CatPlot, LmPlot, 3DPlot).

data-analysis data-science data-visualisation data-visualization eda exploratory-data-analysis plotly seaborn

Last synced: 14 Jan 2025

https://github.com/badranalyst/startup-expansion-analysis-with-pandas-matplotlib-and-power-bi

Analyzes startup growth and expansion factors using Pandas for data analysis and Matplotlib for visualizations. Complements findings with data visualizations in Power BI, providing actionable insights into funding and market trends.

dashboard data-analysis data-visualization dataset matplotlib matplotlib-pyplot pandas power-bi powerbi

Last synced: 07 Jan 2025

https://github.com/azaz9026/loan_approval_prediction

Welcome to the Loan Approval Prediction repository! This project aims to build a predictive model that can determine whether a loan application should be approved or denied based on various features. Purpose The goal of this repository is to develop a machine learning model that can accurately predict loan approval decisio

data data-analysis data-visualization eda machine-learning numpy pandas python statistics

Last synced: 19 Jan 2025

https://github.com/findmyway/dataframe-in-julia

A quick introduction of DataFrame in Julia for users from Python

data-analysis dataframe julia jupyter-notebook

Last synced: 21 Jan 2025

https://github.com/madhuresh2011/amazon-sales-report-analysis-project-using-python

This project focuses on analyzing Amazon sales data using Python to uncover insights into sales performance, customer behavior, and product trends

charts cleaning-data data-analysis jupyter-notebook matplotlib numpy pandas python seaborn visualization

Last synced: 29 Dec 2024

https://github.com/lulloooo/python-googlemilestoneproject

Google Data Analysis Milestone Project about Chicago Bike Sharing Service

data-analysis data-visualization python tableau

Last synced: 10 Jan 2025

https://github.com/kmbuki/uk_police_data

R programming - Using open data about crime and policing in England, Wales and Northern Ireland.

data-analysis data-visualization r

Last synced: 18 Jan 2025

https://github.com/junpenglao/spafv

SPAFV - Surface Profile Analysis for Free Viewing eye movement experiment in 2AFC task

data-analysis statistics temporal-logic

Last synced: 13 Dec 2024

https://github.com/fortunewalla/birdstrikes

birdstrikes database created for postgresql with simple sample queries

birdstrikes csv data-analysis data-science database dataset pgsql postgresql practice sample sql sql-query workshop

Last synced: 21 Jan 2025

https://github.com/htsandaruvan/attrition-analytics-suite-by-hello-green

I have created a comprehensive data analytics dashboard to identify factors contributing to attrition,

data-analysis data-analytics data-visualization powerbi

Last synced: 21 Dec 2024

https://github.com/madhuresh2011/telco-customer-churn-analysis-project-using-python

The analysis primarily investigates factors influencing customer churn, particularly focusing on payment methods and contract types.

csv data-analysis matplotlib numpy pandas pyhton seaborn vizualisation

Last synced: 07 Jan 2025

https://github.com/jofaval/boston-housing

Regression Analysis into the Boston Housing in-demand pricing in 1978

boston-housing data-analysis data-science data-visualization machine-learning python regression

Last synced: 09 Dec 2024

https://github.com/linguini1/edueval

The BorealisAI Let's Solve It mentorship project: summarizing student feedback submissions on their professor into one cohesive paragraph for faculty consideration during performance reviews.

ai data data-analysis data-science machine-learning machinelearning nlp python pytorch sentiment-analysis

Last synced: 29 Dec 2024

https://github.com/linguini1/coopscraper

Scrapes the co-op job board provided by Carleton for jobs on my shortlist, then saves the jobs to a CSV file so that I can manipulate them with Excel.

csv data-analysis python selenium webscraper webscraping

Last synced: 29 Dec 2024

https://github.com/jofaval/melbourne-temperature-timeseries

Timeseries Data Analysis and Forecasting of the daily min temperature in Melbourne from 1981 to 1990

data-analysis data-science data-visualization deep-learning google-colab melbourne python temperature tensorflow timeseries timeseries-analysis

Last synced: 09 Dec 2024

https://github.com/sayamalt/fake-news-classification-using-fine-tuned-bert

Successfully developed a text classification model to predict whether a given news text is fake or not by fine-tuning a pretrained BERT transformed model imported from Hugging Face.

bert-embeddings bert-model data-analysis data-visualization deep-learning fine-tuning-bert model-evaluation model-training-and-evaluation text-classification text-preprocessing text-tokenization tokenizer-nlp wordcloud-visualization

Last synced: 18 Dec 2024

https://github.com/eea/eea.reveal

Reveal hidden knowledge by visualizing network structure in your data.

data-analysis data-visualization graphviz network-visualization

Last synced: 24 Jan 2025

https://github.com/saksham-jain177/data-analysis

A collection of data analysis and machine learning projects across various datasets. Explore predictive modeling, data visualization, and insights from real-world data. Projects include sales predictions, disease detection, customer segmentation, and more.

api data data-analysis data-cleaning data-science data-visualization datamodeling dataset datasets exploratory-data-analysis python python3 web-scraping youtube-api

Last synced: 14 Jan 2025

https://github.com/saksham-jain177/automated-data-analysis-and-visualization

About Automated Data Analysis and Visualization is a Streamlit web application designed for quick and insightful data analysis. Users can easily upload CSV files, perform automated preprocessing, and generate interactive visualizations such as histograms, scatter plots, and heatmaps.

automated-reporting data-analysis data-preprocessing data-science data-visualization datasets exploratory-data-analysis interactive-visualizations machine-learning python streamlit

Last synced: 14 Jan 2025

https://github.com/sujata-adhikari/data-analysis

Data analysis of Market sales data using PowerBi, created dashboard to show analysis.

data-analysis excel pandas powerbi

Last synced: 14 Jan 2025

https://github.com/solrikk/utrace

PicTraceV2 is a highly efficient image matching platform that leverages computer vision using OpenCV, deep learning with TensorFlow and the ResNet50 model, asynchronous processing with aiohttp, and Selenium for browser automation. PicTraceV2 allows users to upload images directly or provide URLs, quickly scanning a vast database to find image

automation computer-vision data-analysis data-extraction deep-learning image-processing image-search machine-learning natural-language-processing opencv openpyxl pandas python selenium tensorflow web-scraping yandex yandex-api

Last synced: 21 Dec 2024

https://github.com/christos99/scraping-project

This project is a Python-based tool for web scraping with a user-friendly GUI. Built with PyQt5 and Selenium, it allows users to scrape online listings by specifying keywords, price ranges, and exclusions. Results are displayed in a table and can be exported to an Excel file.

automation data-analysis excel gui openpyxl pandas pyqt5 python selenium web-scraping

Last synced: 21 Dec 2024

https://github.com/9dl/usbfalcon

Automatically copies files from plugged USB drives to a specified location, enabling quick data retrieval for analysis.

automation data-analysis data-retrieval ethical-hacking file-copying usb

Last synced: 14 Dec 2024

https://github.com/andystmc/nextflownyc

Developed a machine learning model (Bidirectional LSTM) to forecast NYC traffic volumes using 10 years of automated traffic count data. Achieved strong predictive accuracy, demonstrating the power of deep learning for urban traffic analysis.

data-analysis data-cleaning data-science data-visualization exploratory-data-analysis feature-engineering hyperparameter-tuning jupyter-notebook lstm-neural-networks machine-learning numpy pandas predictive-modeling python3 scikit-learn tensorflow-keras traffic-flow-forecasting

Last synced: 07 Jan 2025

https://github.com/pranjalya/hand-washing-data-visualisation

A small project of Data Visualization, where we analyze the effect of hand washing after introduced by Dr. Semmelweis to the nurses and midwives after giving birth.

data-analysis data-visualization jupyter-notebook pandas python3

Last synced: 21 Jan 2025

https://github.com/shivamsharma32/customer-churn-analysis-power-bi-

This project is about analyzing and visualizing customer churn data using Power BI. Customer churn is the percentage of customers who stop doing business with a company over a given period of time. It is an important metric for businesses to understand why customers leave and how to retain them.

data-analysis dataanalytics datavisualization powerbi

Last synced: 14 Dec 2024

https://github.com/sco1/xbmini-py

Python Toolkit for the GCDC HAM

data-analysis data-visualization python python3

Last synced: 02 Nov 2024

https://github.com/badranalyst/movie-correlation-analysis-in-python

This project analyzes movie data correlations using Python libraries like Pandas, NumPy, Seaborn, and Matplotlib. It examines relationships between attributes such as ratings, genres, and box office performance to uncover trends that inform recommendations and enhance understanding of movie success factors.

data data-analysis dataset jupyter jupyter-notebook matplotlib matplotlib-pyplot numpy pandas python seaborn

Last synced: 21 Dec 2024

https://github.com/priyanshu7639/data_visualization_dashboard

An Interactive data visualization tool that combines traditional plotting capabilities with modern AI assistance. It allows users to create and modify visualizations through natural language commands, making data exploration accessible to users of all skill levels.

business-analytics data-analysis data-engineering data-exploration data-science data-visualization datapreprocessing datascience interactive-visualizations matplotlib plotly plotting python research-tool streamlit

Last synced: 07 Jan 2025

https://github.com/aishwaryahastak/ipl_analysis

Analysis of IPL dataset using PySpark

data-analysis mllib pyspark

Last synced: 02 Jan 2025

https://github.com/sweta-kaundilya/python_for_data_analysis

Learning Python and all the relevant libraries in python for Data field.

cufflinks data-analysis data-science matplotlib numpy pandas plotly python seaborn

Last synced: 07 Jan 2025

https://github.com/subhamghimire/dataanavis

Learning Data analysis and visualization

data-analysis data-science data-visualization dataset

Last synced: 21 Jan 2025

https://github.com/evanwporter/sloth

Faster Pandas Dataframe

cython data-analysis dataframe pandas

Last synced: 21 Jan 2025

https://github.com/jjfiv/csc212spellchecking

Data Structure Analysis for Spell Checking

data-analysis smith-csc212

Last synced: 19 Nov 2024

https://github.com/madhuresh2011/telco-customer-churn-analysis-using-python

The analysis primarily investigates factors influencing customer churn, particularly focusing on payment methods and contract types.

csv data-analysis matplotlib numpy pandas pyhton seaborn vizualisation

Last synced: 14 Jan 2025

https://github.com/swapnil-jain/tailored-tomes

Web application which shows Top 50 books of all time & recommends similar books if a book name is provided.

book bookrecommendsystem books bootstrap3 cosine-similarity data-analysis html machine-learning python

Last synced: 20 Dec 2024

https://github.com/marielachirinosr/nyc-taxi-trip-exploration-2019-2020

Explores passenger behavior & impact of COVID-19 on NYC taxi industry (Q1 2019-2020).

bigquery data data-analysis data-visualization python sql tableau

Last synced: 29 Dec 2024

https://github.com/hi-jin2/data-analysis-basics

데이터분석기초(R) 수업 중에 작성한 소스코드 모음입니다. 『모두를 위한 R 데이터 분석 입문』 교재를 통해 R언어를 학습하였습니다.

data-analysis r r-studio

Last synced: 21 Jan 2025

https://github.com/marielachirinosr/hotel-data-analysis

Pandas & Matplotlib Learning Analysis. Repository featuring data analysis projects using Pandas and Matplotlib libraries

data data-analysis matplotlib pandas python

Last synced: 29 Dec 2024

https://github.com/marielachirinosr/bellabeat-wellness-data-trends

Analyzing smart device data for insights on user activity patterns to optimize interventions for better health outcomes.

data data-analysis data-visualization pandas python python3 tableau tableau-public

Last synced: 29 Dec 2024

https://github.com/mstovarh/analisis-de-bebidas-de-starbucks

En este repositorio se encuentran unas gráficas basadas en diversas características de las bebidas de Starbucks, usé tecnologías como la herramienta de Data Analysis de ChatGPT, Excel y PowerQuery.

chatgpt data-analysis excel powerquery

Last synced: 25 Dec 2024

https://github.com/nagar2nd/zomato-bangalore-analysis-tableau

Analysing restaurant data in Bengaluru to enhance customer satisfaction by optimizing the restaurant experience. The focus is on improving the popularity of different cuisines, enhancing delivery times, and boosting restaurant ratings. An interactive Tableau dashboard has been developed to help Zomato identify key areas for improvements.

data-analysis data-visualization tableau

Last synced: 07 Jan 2025

https://github.com/nagar2nd/airbnb-property-management-optimization

This project aims to analyze Airbnb’s dataset to optimize rental strategies, enhance customer satisfaction, and maximize revenue for property owners. Using Tableau, the insights generated will help improve decision-making for both Airbnb and its hosts.

data-analysis data-visualization tableau

Last synced: 07 Jan 2025

https://github.com/madrury/hot-sauce

Simuation of a Hot Sauce Spicyness Dataset

data-analysis data-science data-visualization dataset machine-learning

Last synced: 07 Jan 2025

https://github.com/thesfinox/fit-the-data

Data analysis using Wolfram Mathematica

analysis data data-analysis lab mathematica wolfram wolfram-mathematica

Last synced: 25 Dec 2024

https://github.com/iguptashubham/ev-market-exploration

So, market size analysis is a crucial aspect of market research that determines the potential sales volume within a given market

data-analysis data-analysis-projects data-science-project forecast projects python

Last synced: 14 Jan 2025

https://github.com/karlyndiary/coffee-shop-sales-analysis

Analyzing coffee shop sales using Pandas for data cleaning and exploratory data analysis (EDA), and Streamlit for data visualization dashboards.

data-analysis data-cleaning data-preprocessing data-visualization eda pandas streamlit streamlit-dashboard

Last synced: 21 Dec 2024

https://github.com/ahmedtaher10/covid-19-cases

The data we are using contains the data on covid-19 cases and their impact on GDP from December 31, 2019, to October 10, 2020.

data-analysis python visualization

Last synced: 21 Dec 2024

https://github.com/namratha2301/python-dashboard-streamlit

Experimenting with Streamlit. Streamlit app provides an interactive visualization of the best-selling books, showcasing trends, top-selling books, top authors, genre distributions, and sales by decade.

css dashboard data-analysis pandas plotly python seaborn streamlit

Last synced: 21 Dec 2024

https://github.com/elmezianech/autoinventory

This project is an end-to-end, fully automated warehouse management solution designed to tackle real-world inventory challenges in the FMCG sector. From real-time data ingestion and predictive analytics to interactive dashboards, this project combines cutting-edge technologies and an event-driven architecture to simulate a business-ready system.

automation dashboard data-analysis data-engineering-pipeline docker etl glue-job inventory-management kafka kpis lambda-functions lstm ml-pipeline mlflow power-bi pytorch redshift s3 streamlit warehouse-management

Last synced: 29 Dec 2024

https://github.com/ronylpatil/whatsapp-group-chat-analysis

This project is totally based on data analysis where our college official Whatsapp group is used to extract useful information from the chat. Some of the useful extracted features are most active members of the group, most active day of the week, top-10 media contributors in the Group, and many more...

data-analysis data-preprocessing data-wrangling feature-engineering

Last synced: 21 Jan 2025

https://github.com/bniladridas/dataanalysis

An open-source Python script for data analysis and exploration using pandas.

data-analysis flask pandas spreadsheet

Last synced: 21 Jan 2025

https://github.com/hamzacham/data_set-projet-8

Analyzing a real world data-set with SQL and Python

data-analysis database dataset jupyter-notebook paython sql

Last synced: 07 Jan 2025

https://github.com/avratanubiswas/fluorpenplugin

A matlab user interface for analysing OJIP curve datasets from FluorPen instrument. That is, serving as an additional plug in for "quick categorical analysis".

data-analysis fluorpen ojip-curve

Last synced: 21 Dec 2024

https://github.com/ashithapallath/r-lab

This repository offers a collection of exercises, assignments, and projects designed for the R Programming course. It focuses on utilizing R for data analysis, statistical modeling, and visualization tasks.

data-analysis exploratory-data-analysis machine-learning r-language visualization

Last synced: 27 Jan 2025

https://github.com/dissorial/prx21_erikz

Analysis of self-tracked data: interactive visualizations & predictive algorithms

analytics data-analysis data-science data-visualization machine-learning matplotlib pandas python python3 visualization

Last synced: 01 Jan 2025

https://github.com/anushkundu/london-housing-market-analysis

London Housing Market Analysis: An Insightful Power BI Dashboard"

data-analysis data-visualization powerbi transformation

Last synced: 07 Jan 2025