Ecosyste.ms: Awesome
An open API service indexing awesome lists of open source software.
Data analysis
Data analysis is a process of inspecting, cleansing, transforming, and modeling data with the goal of discovering useful information, informing conclusions, and supporting decision-making.
- GitHub: https://github.com/topics/data-analysis
- Wikipedia: https://en.wikipedia.org/wiki/Data_analysis
- Last updated: 2025-02-20 00:07:05 UTC
https://github.com/antonio-f/big-data-analysis-with-scala-and-spark
Coding assignments from the course "Big Data Analysis with Scala and Spark" (Coursera).
big-data bigdata coursera data-analysis scala spark
Last synced: 06 Feb 2025
https://github.com/wiseaidev/truth-guard
Analyzing a 79k Dataset of Misinformation and Fake News
data-analysis fastapi lstm machine-learning python supervised-learning
Last synced: 13 Feb 2025
https://github.com/as16082023/coffee-bean-sales-analysis
Analyzing coffee bean sales data to optimize consumer targeting, product offerings, and strategic marketing in the coffee industry.
coffee-bean-sales dashboard data-analysis data-visualization ms-excel
Last synced: 15 Feb 2025
https://github.com/bilal-belli/personalacademicdocuments
This repository contains some personal academic assignments; maybe they will help someone!
compilation computer-architecture data-analysis data-structures-and-algorithms database front-end hpc networking operating-systems signal-processing
Last synced: 17 Jan 2025
https://github.com/as16082023/atliq-hospitality-analysis
This project presents an overview of AtliQ Grands' performance in the hospitality industry using Power BI.
atliqgrand codebasicsresumeprojectchallenge data-analysis data-visualization powerbi revenueinsights
Last synced: 15 Feb 2025
https://github.com/agungbudiwirawan/supermarket_sales_dashboard-sales-analysis-using-pivot-table-and-chart
The objective of this project is to analyze supermarket sales using pivot tables and charts in Microsoft Excel. After doing the analysis, I created an interactive dashboard that aims to make it easier for the audience to explore the data.
dashboard data-analysis data-science data-visualization excel sales-dashboard spreadsheet
Last synced: 06 Feb 2025
https://github.com/jakubkorytko/data-graphs
Transform raw data into captivating visual stories with this app. Effortlessly craft stunning data charts that unveil insights and trends.
charts data-analysis mit-license open-source
Last synced: 11 Jan 2025
https://github.com/kalebers/economic_analysis_data_science
Data analysis Python project using an economic database to predict the percentage of good and bad payers
data-analysis data-science machine-learning pandas python scipy sklearn-library
Last synced: 21 Jan 2025
https://github.com/elkronos/stat_py
Statistics functions for Python
assumption-check data-analysis data-visualization python regression statistical-analysis statistical-inference statistical-models statistical-tests statistics
Last synced: 24 Jan 2025
https://github.com/kshitiz1302/credit_card_financial_weekly_status_dashboard
An interactive, beginner-friendly Power BI dashboard with useful insights
data-analysis data-cleaning data-manipulation data-modeling data-storytelling data-visualization dax dax-expression dax-query financial-analysis mysql-database powerbi powerbi-custom-visuals powerbi-dashboards powerbi-desktop powerbi-embedded powerbi-report powerbi-visuals reporting
Last synced: 20 Jan 2025
https://github.com/blagojeblagojevic/lol_data_analysis
classification data-analysis data-science jupyter-notebook kaggle python3
Last synced: 14 Feb 2025
https://github.com/gursv/autoworth
Used Car Price Prediction (India)
data-analysis data-analysis-python data-analytics data-cleaning data-preprocessing data-science-projects eda fine-tuning gridsearchcv machine-learning matplotlib-pyplot pandas python3 random-forest-regressor scikit-learn seaborn
Last synced: 20 Jan 2025
https://github.com/cjunwon/youtube-data-analysis
End-to-end YouTube data analysis project using the YouTube Data API, MySQL, AWS, and Flask
aws-rds data-analysis datapipeline flask nlp pandas python shell sql vader-sentiment-analysis youtube youtube-api
Last synced: 08 Feb 2025
https://github.com/brunomontezano/benzocovid
💊 Data Analysis Project of Benzodiazepines during COVID-19 Pandemic.
benzodiazepines covid-19 data-analysis
Last synced: 11 Jan 2025
https://github.com/muneeb1030/eda-of-physionets-ecg
EDA of the PhysioNet dataset "A Large Scale 12 Lead Electrocardiogram Database for Arrhythmia Study 1.0.0". This project focuses on the preprocessing of electrocardiogram (ECG) signals and utilizes Principal Component Analysis (PCA) for dimensionality reduction.
12-lead-ecg data-analysis ecg-signal eda pca python3 wfdb
Last synced: 11 Jan 2025
https://github.com/john-science/data_science_by_example
Examples of Data Science Tools & Libraries
data-analysis data-science ipython pandas
Last synced: 18 Nov 2024
https://github.com/t-mohamed-shafeek/data-analysis-on-tamil-nadu-road-accidents
The "Data Analysis on Tamil Nadu Road Accidents" project analyzes data on road accidents in Tamil Nadu (one of the states of India) in 2020 and 2021. The dataset itself was created more recently (on February 15, 2023, sourced from the TN Police).
dashboard data-analysis data-science data-visualization jupyter-notebook tableau
Last synced: 07 Feb 2025
https://github.com/michenriksen/inspectra
A simple web app for data inspection.
data-analysis decoding web-tool
Last synced: 14 Jan 2025
https://github.com/sermonzagoto/data_cleansing_in_telco
Data Cleansing in Python
data-analysis data-science machine-learning matplotlib-figures pandas-python seaborn-plots
Last synced: 28 Jan 2025
https://github.com/nikhilash45/live_ipl_report
This repository hosts the source code for an interactive IPL (Indian Premier League) Dashboard built using PowerBI. The dashboard provides real-time updates on ongoing matches, including live scores, batting and bowling statistics for both teams, and the points table.
analysts cleaning-data cricket-data dashboard data data-analysis data-visualization dax powerbi
Last synced: 11 Jan 2025
https://github.com/ehtisham-sadiq/building-an-ml-based-heart-disease-diagnosis-system-with-flask
It is an end-to-end project that uses machine learning to create a user-friendly Heart Disease Diagnosis System, powered by Flask.
data-analysis exploratory-data-analysis feature-engineering flask machine-learning model-building model-evaluation pipelines python3 rest-api
Last synced: 11 Jan 2025
https://github.com/randomshek/Working-With-Excel
Using Excel Power Query and PowerPivot, reorganise the data into a star schema and showcase reports that data analysts can create using DAX formulae and PowerPivot.
data-analysis excel power-pivot power-query
Last synced: 27 Nov 2024
https://github.com/moindalvs/learn_eda_house_price_dataset
Data Set: House Prices: Advanced Regression Techniques. Exploratory Data Analysis on more than 80 features.
cardinality data-analysis data-science data-structures data-visualization missing-values
Last synced: 18 Jan 2025
https://github.com/quantitext/quantitext
Official repository for QuantiText applications in the .NET ecosystem.
api aspnet-core csharp data-analysis dotnet-core mvc-architecture
Last synced: 06 Feb 2025
https://github.com/carlosvinimsouza/dataanalysiswithpython
Learn data analysis using Python libraries (NumPy, Pandas, Matplotlib and Seaborn)
data-analysis data-science free-code-camp matplotlib numpy pandas python python3 seaborn
Last synced: 11 Jan 2025
https://github.com/namratha2301/best-selling-books
Comprehensive examination of best-selling books, focusing on understanding sales patterns, genre distributions, and the impact of various features on book performance. This project aims to predict book sales and classify genres, providing valuable insights for authors, publishers, and readers.
data-analysis data-visualization matplotlib pandas sckiit-learn seaborn
Last synced: 28 Jan 2025
https://github.com/gf712/abpytools-qt
Qt interface of AbPyTools
antibody-numbering antibody-sequences cpp11 data-analysis python3 qt5
Last synced: 29 Jan 2025
https://github.com/cano1998/eda-survival-of-the-titanic
This project focuses on Exploratory Data Analysis (EDA) to identify the key determinants that influenced survival during the infamous Titanic accident.
data-analysis data-cleaning data-preprocessing data-visualization exploratory-data-analysis jupyter-notebook titanic-survival-exploration
Last synced: 04 Jan 2025
https://github.com/knyghtmare/msba_projects_public
A repo containing links to my projects done
data-analysis data-mining data-science data-science-portfolio data-science-projects data-visualization datascience tahsinjahinkhalid
Last synced: 15 Feb 2025
https://github.com/maskedsyntax/budget-pie
Android app to manage monthly budgets
android dart data-analysis data-visualization finance-management firebase flutter
Last synced: 12 Feb 2025
https://github.com/metalwarrior665/actor-results-checker
apify data-analysis json-schema-checker
Last synced: 04 Jan 2025
https://github.com/noodleslove/house-of-representative-analysis-i
This project uses public data about the stock trades made by members of the US House of Representatives.
data-analysis data-science eda kaggle-dataset matplotlib-pyplot pandas python stocks-trading
Last synced: 28 Jan 2025
https://github.com/mr-vozhyk/karpov.courses-study
Some of the assignments and projects from karpov.courses
airflow data-analysis git python sql statistics
Last synced: 13 Feb 2025
https://github.com/narenkhatwani/arkouda-projects
This repository contains the source code of projects done using Arkouda, a software package that allows a user to interactively issue massive parallel computations on distributed data using functions and syntax that mimic NumPy, the underlying computational library used in most Python data science workflows.
arkouda data-analysis data-analytics data-science high-performance high-performance-computing highperformancecomputing numpy pandas parallel-computing parallel-processing parallelization python
Last synced: 06 Feb 2025
https://github.com/ganesh2409/cricket-player-performance
This repository contains a comprehensive project focused on analyzing cricket player performance using various datasets, including batting, bowling, and match results. The project involves data preprocessing, feature engineering, and model training to predict and evaluate player performance scores. It includes detailed scripts for data analysis
cricket-performance-analysis data-analysis machine-learning sports-analytics
Last synced: 11 Jan 2025
https://github.com/alexandrelamarre/fission
Data analytics & Structured streaming optimized for the Edge
data-analysis data-engineering rust structured-data unstructured-data
Last synced: 11 Jan 2025
https://github.com/zen204/airbnb-availability
A machine learning model that predicts Airbnb listing availability, utilizing feature engineering and supervised learning techniques to improve guest experience and optimize host management.
binary-classification data-analysis data-preprocessing data-visualization feature-engineering machine-learning matplotlib model-evaluation nlp pandas predictive-modeling python scikit-learn seaborn supervised-learning
Last synced: 13 Feb 2025
https://github.com/suhas-005/power-bi-dashboard
Power BI Dashboard Projects
data-analysis data-visualization dataset power-bi-project powerbi
Last synced: 07 Feb 2025
https://github.com/bebofekry/i_care-graduation_project
Graduation Project
ai artificial-intelligence chatbot cnn computer-vision data-analysis deep-learning ecg graduation-project healthcare machine-learning medical medical-imaging natural-language-processing neural-networks nlp pattern-recognition predictive-modeling python
Last synced: 21 Jan 2025
https://github.com/fatihilhan42/web_scraping_football_statistics_per_game_data-main
In this notebook I will describe the process of scraping data from the web portal understat.com, which has a lot of statistical information about all games in the top 5 European football leagues.
data-analysis data-manipulation data-science data-scraping data-visualization jupyter-notebook python
Last synced: 29 Jan 2025
https://github.com/meetup-python-grenoble/datasette-workshop
Data exploration with Datasette
data-analysis data-science data-visualization datasette exploratory-data-analysis python sql workshop
Last synced: 06 Feb 2025
https://github.com/maheshthedev/twitter-analysis
Analysis on Various Topics with Twitter Data
data-analysis twitter-analysis
Last synced: 13 Feb 2025
https://github.com/kaz-yos/distributed
Comparison of Privacy-Protecting Analytic and Data-sharing Methods: a Simulation Study (Pharmacoepidemiol Drug Saf 2018)
data-analysis epidemiology statistics
Last synced: 11 Jan 2025
https://github.com/oguzgn/budget-checker-for-campaign-budget-allocation
This project focuses on modeling campaign performance data for Looker, helping determine which campaigns to scale up or cut back. It aggregates metrics over the last 7 and 30 days, providing actionable insights for budget optimization and performance improvement.
budget-allocation budget-controller budget-management calculated-fields campaign-analytics data-analysis data-modeling looker-studio sql
Last synced: 07 Feb 2025
https://github.com/sijuswamy/data-analytics-using-r
Course repository for Data Analysis using R (add-on course)
Last synced: 31 Jan 2025
https://github.com/dogoncouch/dhcptranslate
Parses ISC DHCP server config, performs DNS resolution as needed, and outputs lease data in CSV format.
configuration csv-format data-analysis isc-dhcp isc-dhcp-server migration-tool
Last synced: 25 Jan 2025
https://github.com/dr-saad-la/r-distilled
R Programming Language distilled
data data-analysis learning programming-language r rlanguage rprogramming statistical-analysis
Last synced: 04 Jan 2025
https://github.com/nelsonkariuki/dataanalysis
This project involves data analysis of video game sales from https://www.kaggle.com/gregorut/videogamesales/download
data-analysis data-visualization python
Last synced: 10 Jan 2025
https://github.com/0xjeremy/me-18-final
Data collection and Analysis tools for IMUs
data-analysis imu raspberry-pi
Last synced: 31 Jan 2025
https://github.com/eshaagarwa/sales_insight_project
Sales insights project using Power BI and SQL
data-analysis data-visualization databse datacleaning datamodeling microsoft-power-bi mysql-database powerbi sales-insights sql
Last synced: 19 Feb 2025
https://github.com/sumidcyber/dataviz-master
This Python application provides a user-friendly interface to load and visualize the contents of a CSV file. Users can choose from various types of graphs and perform analyses on the dataset.
data-analysis data-analysis-project data-analysis-python database databases python python3
Last synced: 22 Jan 2025
https://github.com/maskedsyntax/taskit
A simple web based Task Tracker for better focus
charts data-analysis python3 streamlit task-tracker-app todo-list
Last synced: 05 Feb 2025
https://github.com/bishtrishu/pizza_sales_data_analysis_sql
This project is a comprehensive data analysis of pizza sales, aimed at uncovering key insights and trends to inform business decisions. Using a combination of SQL, Python, and data visualization tools, the project analyzes sales data to understand customer preferences, peak sales periods, and the most popular pizza types.
cloud data data-analysis data-science data-visualization dataanalytics database mysql oracle-database
Last synced: 04 Jan 2025
https://github.com/rupav/fifa17-detailed-analysis
⚽ FIFA 17 data analysis using various Machine Learning Algorithms. ⚽
data-analysis data-visualization fifa17 machine-learning-algorithms radar-chart
Last synced: 09 Feb 2025
https://github.com/windjammer6/8.-star-wars-data-analysis-python
A personal project to analyse data from a Star Wars survey. Python libraries used: Pandas, Matplotlib
Last synced: 29 Jan 2025
https://github.com/pawlo77/kaggle-project
Repository for 'kaggle' project of Data Science Scientific Circle at Faculty of Mathematics and Information Science, Warsaw University of Technology
data-analysis data-science eda maschine-learning
Last synced: 25 Jan 2025
https://github.com/ronaldkanyepi/python-streamlit-covid-19-dashboard
This is a responsive Streamlit COVID-19 dashboard
analytics data data-analysis data-visualization datascience python streamlit
Last synced: 04 Jan 2025
https://github.com/ronaldkanyepi/python-sreamlit-duplicate-records-finder-remover
This is a duplicate remover for CSV, Excel, or TXT files based on single or multiple columns
css data-analysis data-visualization datascience python streamlit
Last synced: 04 Jan 2025
https://github.com/jpcadena/solid-principles-machine-learning
S.O.L.I.D. Principles for Machine Learning project.
clean-code data-analysis data-engineering data-science deep-learning dependency-inversion-principle design-patterns design-principles interface-segregation-principle liskov-substitution-principle machine-learning machine-learning-models mlops models open-closed-principle pylint python single-responsibility-principle software-engineering solid-principles
Last synced: 15 Jan 2025
https://github.com/sanam2405/chatinfo
Analysing the WhatsApp Chat with my crush over a 6M period
data-analysis data-visualization python
Last synced: 15 Feb 2025
https://github.com/ajimaulana123/e-commerce-data-analis
Analysis of an e-commerce dataset to determine which products are bought most by customers
Last synced: 28 Jan 2025
https://github.com/agustinmusanti/delitosencaba-proyectofinal-dataanalytics-coderhouse
In this repository I present my final project for the Coderhouse "Data Analytics" course.
Last synced: 15 Feb 2025
https://github.com/victoriapm/analyze_a-b_test_results
Understand the results of an A/B test run by an e-commerce website.
ab-testing data-analysis ecommerce-website
Last synced: 17 Jan 2025
https://github.com/gracysapra/r-in-data-science
This repository contains essential guides for data analysis using R, covering topics like data preparation, data reshaping, and data visualization. Each file focuses on fundamental techniques to manipulate, clean, and visualize data effectively using R programming.
data-analysis data-preparation data-reshaping data-science data-visualization data-visualizations ggplot r r-for-data-science
Last synced: 08 Feb 2025
https://github.com/jen-uis/la-crime-data-analysis
This repository contains project materials for the Fall 2023 MGT 256 class. This project was completed with assistance from Professor Adem Orsdemir.
business-analytics crime-data crime-data-analysis data-analysis knn la-crimes-from-2020 la-safe r r-markdown r-studio report-generation rmd united-states visualization
Last synced: 21 Jan 2025
https://github.com/mohamedhany99/human-voice-identifier-counter
An application developed in Kivy that identifies the users imported into the dataset using a trained support vector machine model. It has two features: importing a new voice, and detection, which detects human voices and counts them.
android android-app android-application automation automation-framework data data-analysis data-mining data-science data-visualization datascience kivy kivy-framework machine-learning python
Last synced: 18 Jan 2025
https://github.com/elcaiseri/udacity-advanced-data-analysis
UDACITY - Advanced-Data-Analysis Track Project
Last synced: 01 Jan 2025
https://github.com/akash1070/project---applied-statistics-
To dive deep into this data & find some valuable insights.
data-analysis data-science python statistics
Last synced: 29 Jan 2025
https://github.com/ndohvich/ibm-data-science-professional-certificate
Kickstart your career in data science & ML. Build data science skills, learn Python & SQL, analyze & visualize data, build machine learning models. No degree or prior experience required.
coursera dash data-analysis data-science html5 ibm ibm-professional-certificate javascript machine-learnng python sql
Last synced: 19 Feb 2025
https://github.com/dsrodrigovieira/houserocketsales
This repository contains a project developed to practice data analysis skills using Python
data-analysis data-visualization heroku kaggle-dataset python
Last synced: 20 Feb 2025
https://github.com/akash1070/data-science-advanced-analytics-virtual-experience-program
The BCG Open-Access Data Science & Advanced Analytics Virtual Experience Program
data-analysis data-science machine-learning-algorithms
Last synced: 29 Jan 2025
https://github.com/jinkogule/multi-analyst
Multi Analyst is an easy-to-use data analysis tool that uses artificial intelligence to interpret the results of the analyses performed, returning useful insights to users.
apriori-algorithm bootstrap css data-analysis django html numpy open-ai pandas python web-application
Last synced: 03 Jan 2025
https://github.com/roberto-butti/fit_explorer
FIT File Explorer, in Go
data-analysis fitness geospatial golang
Last synced: 16 Feb 2025
https://github.com/pxaris/expenditure-analyzer
Application for analyzing expenditure data over time
data-analysis data-visualization docker python statistics
Last synced: 15 Feb 2025
https://github.com/hariyebk/eplinsights
English Premier League 2018/2019 Data Analysis
class-composition data-analysis filesystem-library
Last synced: 25 Jan 2025
https://github.com/mateibejan1/ai-masters
A repository for all the projects I have done during my AI MSc.
ai-masters bayesian-inference big-data computer-vision data-analysis data-mining data-visualization deep-learning machine-learning-algorithms natural-language-processing
Last synced: 09 Feb 2025
https://github.com/ryanfranklin237/data-cleansing
A group of Python scripts that clean large data sets by removing duplicate data, putting data in correct formats, and removing redundant cells
data-analysis data-cleaning data-science extract-transform-load pandas-dataframe python
Last synced: 10 Jan 2025
https://github.com/sayedgamal99/data-science
This is a repository for Data Science Projects.
data-analysis data-science deep-learning machine-learning python regression supervised-learning
Last synced: 31 Jan 2025
https://github.com/akash1070/data-analytics-virtual-experience-program-by-quantium
Data Analytics Virtual Experience Program by Quantium
data-analysis data-science machine-learning-algorithms python3 tableau
Last synced: 29 Jan 2025
https://github.com/akash1070/data-science-virtual-internship-by-accenture
Data merging and data cleaning in Python, as well as data visualisation with a dashboard in Tableau.
data-analysis data-cleaning data-science python3 tableau visualization
Last synced: 29 Jan 2025
https://github.com/bcko/ud-da-eda-whitewinequality
Udacity Data Analyst Nanodegree Project : Exploratory Data Analysis : White Wine Quality dataset
data-analysis exploratory-data-analysis rmarkdown rstudio udacity udacity-data-analyst-nanodegree
Last synced: 25 Jan 2025
https://github.com/bcko/ud-da-stroopeffect
Udacity Data Analyst Nanodegree Project : Test a Perceptual Phenomenon (Stroop Effect)
data-analysis data-analyst-nanodegree stroop-effect udacity udacity-data-analyst-nanodegree
Last synced: 25 Jan 2025
https://github.com/ryanfranklin237/data-visualization-python
A tool that lets you visualize data from a CSV or Excel file as a graph or chart
data-analysis data-science data-visualization matplotlib pandas-dataframe python
Last synced: 10 Jan 2025
https://github.com/neerajcodes888/whatsapp-chat-analyzer
A Python tool for effortless analysis of WhatsApp conversations. Gain insights with basic statistics, word cloud visualizations, and URL statistics. Powered by pandas, urlextract, wordcloud, seaborn, and Streamlit. 📊📱
analyzer chat data-analysis data-visualization pandas python3 seaborn urlextract whatsapp wordcloud
Last synced: 31 Jan 2025
https://github.com/yonatanadam/film-success-prediction
Analyzing Hollywood movie success based on genre, target audience, and runtime using machine learning
data-analysis ipynb machine-learning
Last synced: 13 Feb 2025
https://github.com/sarmadahmad8/ml-and-deeplearning-projects-for-beginners
Beginner ML/DL projects spanning core libraries and problem sets.
beginner-friendly data-analysis data-science deep-learning fastai machine-learning opencv pytorch scikit-learn transformer
Last synced: 23 Jan 2025
https://github.com/tjpalanca/ph-elections-2016-analysis
Analysis of Philippines Election Results 2016
analysis data-analysis data-science philippines-election voter-turnout
Last synced: 29 Jan 2025
https://github.com/sermonzagoto/data_manipulation_with_pandas
Data Manipulation with Pandas - Part 1
data-analysis data-science jupyter-notebook pandas-python python
Last synced: 28 Jan 2025
https://github.com/bcko/ud-da-datawrangling
Udacity Data Analyst Nanodegree Project : Wrangle and Analyze Data
data-analysis data-analyst-nanodegree data-wrangling python tweepy twitter-api udacity udacity-data-analyst-nanodegree udacity-nanodegree we-rate-dogs
Last synced: 26 Nov 2024
https://github.com/gher-uliege/stareso-data-processing
A set of tools to read, plot and process data from STARESO
coastal corsica data-analysis data-processing ocean-sciences oceanography
Last synced: 05 Feb 2025
https://github.com/lobooooooo14/badwords-pt-br
💬 Wordlist of profanity in pt-BR for data analysis, filters, or text considered "avoidable"
badword-filter badwords brasil data-analysis filter filter-lists filterlist portugues portuguese text-analysis wordlist
Last synced: 30 Jan 2025
https://github.com/ezzz-lui/rsm-evaluationproject
This repository documents our project for RSM, done as the final activity of the Data Analyst bootcamp
Last synced: 30 Jan 2025
https://github.com/gauravcodepro/numpy-builder
A NumPy shell builder showing how to extract and use NumPy across arrays. I am including the entire manual for those who like to search immediately rather than looking here and there.
bash-prompt bash-script bash-scripting data-analysis data-mining data-science numpy numpy-arrays shell-prompt shell-script
Last synced: 02 Jan 2025