Ecosyste.ms: Awesome
An open API service indexing awesome lists of open source software.
Data analysis
Data analysis is a process of inspecting, cleansing, transforming, and modeling data with the goal of discovering useful information, informing conclusions, and supporting decision-making.
- GitHub: https://github.com/topics/data-analysis
- Wikipedia: https://en.wikipedia.org/wiki/Data_analysis
- Last updated: 2024-11-19 00:06:50 UTC
https://github.com/robinmillford/loanalytics-investigating-financial-trends-with-world-bank-data
The project aimed to explore and analyze World Bank Loan Data, leveraging Python for data preprocessing and SQL for in-depth queries.
data-analysis data-visualization jupyter-notebook mysql tableau world-bank
Last synced: 17 Nov 2024
https://github.com/robinmillford/optimizing-treatment-plans-through-data-analysis
The primary focus was on understanding customer health, treatment, and associated charges over multiple years.
data-analysis data-visualization healthcare mysql powerbi sql
Last synced: 17 Nov 2024
https://github.com/frankelavsky/security-dash-challenge
I had two 8-hour days to create a visualization dashboard for three datasets. Tab one: a Voronoi overlay on a line graph. Tab two: a data-partitioning method that keeps in-memory usage low. Tab three: "Failed" vs. "Successful" attempts shown as positive/negative bar charts over time. I used d3.js, require, the MVC pattern, and vanilla JS.
client-side complexity css3 d3 d3js dashboard data-analysis data-structures-algorithms data-visualization frontend-app html5 interactive-visualizations javascript modular network-analysis network-monitoring network-security security single-page-app visualization
Last synced: 18 Nov 2024
https://github.com/chahelgupta/dep-videogames-dataset
The data extraction and processing involved thorough exploration, preprocessing, and visualization of the "Video Game Sales with Ratings" dataset.
data-analysis data-exploration data-extraction data-preparation data-preprocessing data-processing data-science data-visualization
Last synced: 18 Nov 2024
https://github.com/chahelgupta/fitness-data-analysis-r-project
This project focuses on analyzing fitness data collected from various tracking devices to gain insights into users' activity levels, sleep patterns, calorie expenditure, and heart rate. The dataset used in this project consists of multiple CSV files, each containing different aspects of fitness-related data.
data-analysis data-cleaning data-exploration data-science data-visualization r r-language r-programming r-studio
Last synced: 18 Nov 2024
https://github.com/robinmillford/animeinsights-user-feedback-analysis
In this project, I leveraged SQL queries to analyze and extract valuable insights from an "anime" dataset. The dataset includes information such as titles, scores, episode counts, genres, and popularity rankings for various anime series and movies.
anime data-analysis data-cleaning mysql
Last synced: 17 Nov 2024
https://github.com/robinmillford/predicting-diabetes-a-machine-learning-approach-to-early-intervention
The goal of this project was to develop a predictive model for diabetes using a dataset containing various health-related features.
data-analysis data-science diabetes-prediction jupyter-notebook machine-learning smote
Last synced: 17 Nov 2024
https://github.com/robinmillford/environmental-monitoring-and-analysis
In this comprehensive project, I undertook an in-depth exploration of environmental data, seeking to understand the intricate relationship between Liquefied Petroleum Gas (LPG) consumption and Carbon Monoxide (CO) emissions.
data-analysis data-visualization environmental-monitoring iot jupyter-notebook machine-learning prediction python3 sql tableau
Last synced: 17 Nov 2024
https://github.com/myself-aas/quantium_data_analytics_forage
This project analyzes retail customer chip purchasing behavior using Python, focusing on customer segmentation and key spending drivers to provide data-driven insights for strategic category management recommendations.
data-analysis data-engineering data-science data-visualization feature-engineering forage internship-project matplotlib-pyplot numpy-library pandas-dataframe pearson-correlation python quantium-virtual-experience scipy-stats seaborn
Last synced: 18 Nov 2024
https://github.com/robinmillford/precision-marketing-for-personal-loans
In this project, I conducted an in-depth analysis of a dataset containing personal loan information.
data-analysis data-visualization kaggle loan mysql tableau
Last synced: 17 Nov 2024
https://github.com/robinmillford/strategic-insights-unveiling-the-dynamics-of-ipl-2022-auction
This project involves a comprehensive analysis of the IPL 2022 Auction. The goal was to gain insights into the auction dynamics, player characteristics, and spending patterns of different teams.
data-analysis data-visualization ipl powerbi sql
Last synced: 17 Nov 2024
https://github.com/pythondeveloper6/store-sales-eda
Simple EDA with some insights on Store Sales.
data-analysis eda matplotlib numpy pandas seaborn
Last synced: 18 Nov 2024
https://github.com/pythondeveloper6/matplotlib-for-beginners
How to visualize your data using matplotlib.
data-analysis matplotlib numpy pandas python visualization
Last synced: 18 Nov 2024
https://github.com/pythondeveloper6/california-housing-prices-eda-linear-regression
Simple EDA for house prices with a regression model.
data-analysis eda machine-learning matplotlib numpy pandas regression seaborn
Last synced: 18 Nov 2024
https://github.com/pythondeveloper6/supermarket-eda-seaborn-for-beginners
Learn Seaborn basics using a simple EDA.
data-analysis eda numpy pandas seaborn visualization
Last synced: 18 Nov 2024
https://github.com/pythondeveloper6/udemy-courses-full-eda
Simple EDA for Udemy courses.
data-analysis eda matplotlib numpy pandas python seaborn
Last synced: 18 Nov 2024
https://github.com/erseco/ugr_tratamiento_inteligente_datos
Working repository for the Intelligent Data Processing (Tratamiento Inteligente de Datos) course of the Master's in Computer Engineering at the University of Granada (UGR).
Last synced: 18 Nov 2024
https://github.com/mubassim-khan/stack-overflow-developer-survey-2023
This repository contains the code for a data analysis of the Stack Overflow Developer Survey 2023, including a digital representation of the most used languages and much more. See the README for a more detailed overview of the repository.
data-analysis data-analysis-python matplotlib-pyplot numpy pandas-python
Last synced: 15 Nov 2024
https://github.com/mrankitgupta/titanic-survival-prediction-93-xgboost
Titanic Survival Prediction Project (93% Accuracy) 🛳️ In this notebook, the goal is to correctly predict whether someone survived the Titanic shipwreck using different machine learning models and hyperparameter tuning.
classification data-analysis data-science data-visualization gradient-boosting kaggle-competition linear-regression logistic-regression machine-learning machine-learning-algorithms ml ml-models nlp prediction predictive-modeling random-forest titanic titanic-kaggle titanic-survival-prediction xgboost
Last synced: 17 Nov 2024
https://github.com/jatin-mehra119/flight-price-prediction
This study aims to analyze flight booking data from the "Ease My Trip" website, using statistical tests and linear regression to extract insights. Understanding this data can yield information that benefits passengers using the platform.
data-analysis datacleaning datavisualization machine-learning preprocessing-data python sklearn-pipeline sklearn-regression-algorithm streamlit-webapp
Last synced: 16 Nov 2024
https://github.com/adrija-debnath/ideas-isi-data-science-internship
Topic of the Project - Predictive Maintenance Analysis, Data Science Internship at IDEAS - Institute of Data Engineering, Analytics and Science Foundation Technology Innovation Hub at Indian Statistical Institute, Kolkata.
data-analysis data-science predictive-analytics predictive-maintenance streamlit
Last synced: 20 Nov 2024
https://github.com/fhdsl/seattlestatsummer_r
A 4-day introduction to R programming, focused on Fred Hutch Research Interns
beginner beginner-friendly data-analysis data-science introduction-to-programming r-programming tidyverse
Last synced: 15 Nov 2024
https://github.com/djm158/learning-microsoft-r
Working through https://www.gitbook.com/book/smott/introduction-to-microsoft-r-server/details and creating samples
data-analysis data-science microsoft microsoft-sql-server r
Last synced: 15 Nov 2024
https://github.com/aaaastark/world_happiness_report
World_Happiness_Report
csv data-analysis data-science database keras numpy pandas python sklearn tensorflow
Last synced: 15 Nov 2024
https://github.com/codewithmayank-py/box-office-analysis-with-seaborn-and-python
This repository contains Python code and datasets for analyzing box office data. Explore trends, patterns, and factors influencing movie performance.
analysis box-office-data-analysis data-analysis data-visualization dataset jupyter-notebook matplotlib pandas python3 seaborn
Last synced: 17 Nov 2024
https://github.com/alchemine/analysis-tools
Analysis tools for machine learning projects
data-analysis explanatory-data-analysis machine-learning python
Last synced: 15 Nov 2024
https://github.com/hhuseyincosgun/data-analyst-portfolio
Data Analyst Portfolio
ab-testing aws customer-segmentation data-analysis data-visualization powerbi python sql
Last synced: 15 Nov 2024
https://github.com/alanjamlu34/bike-dataset
This is the final project for the Dicoding class on becoming a data analyst (Menjadi Data Analist).
data-analysis streamlit-dashboard
Last synced: 17 Nov 2024
https://github.com/codewithmayank-py/covid19-data-analysis-using-python
COVID-19 and Happiness Analysis
data-analysis data-analysis-python data-visualization dataset jupyter-notebooks numpy pandas python3 seaborn
Last synced: 17 Nov 2024
https://github.com/michalspano/maturitna-skuska-proj
Maturita (school-leaving exam) 2021/2022: objective data processing and analysis
Last synced: 17 Nov 2024
https://github.com/hadson0/chess-live-ratings-data
A study project focused on web scraping the live chess ratings from chess.com, with data analysis and visualization on nearly 5000 players in the classical world ranking.
beautifulsoup chess data-analysis data-visualization numpy pandas python seaborn web-scraping
Last synced: 19 Nov 2024
https://github.com/lenakirara/formacao_data_science_alura
Data Science training track (Formação Data Science) from the Alura learning platform
alura data-analysis data-science pandas python
Last synced: 16 Nov 2024
https://github.com/elrf3lipes/ramon-s_portfolio
I'm passionate about Cloud and DevOps, and for the moment I'm posting some of my work and personal projects here to showcase that. If it's useful for you, feel free to integrate or contribute!
api-integration biopython clinical-trials data-analysis data-extraction data-parsing django docker entrez ipython medline-xml pandas pubmed-parser requests rest-api
Last synced: 18 Nov 2024
https://github.com/mostafa-bashir/investigating_weather_data
data-analysis ipython jupyter-notebook nump pandas python
Last synced: 18 Nov 2024
https://github.com/omr5221/bi-scripts
Example of DI BI tool scripting
automation configuration-files data-analysis data-warehousing dimensional-analysis diver etl-pipeline lookup modeling perl-script python shell-script sql summarization
Last synced: 18 Nov 2024
https://github.com/alkasaliss/nosql_opendata_nyc
NoSQL project - ENSAI
data-analysis mongodb nosql open-data
Last synced: 18 Nov 2024
https://github.com/82luli02/sakila_dvd_rental_database_analysis
Analysis of the Sakila DVD Rental database using SQL
data data-analysis data-science data-visualization sql
Last synced: 16 Nov 2024
https://github.com/kunalpisolkar24/winequalityprediction
Predicting wine quality using machine learning with matplotlib, numpy, pandas, and seaborn for insightful data analysis. 🍇🤖📊
data-analysis data-science data-visualization machine-learning prediction-model
Last synced: 15 Nov 2024
https://github.com/ljadhav25/decision-tree-random-forest-algorithm-data-science-
This repository contains an implementation of decision tree and random forest algorithms from scratch in Python. Decision trees and random forests are popular machine learning algorithms used for classification and regression tasks. The goal of this project is to provide a clear and understandable implementation of these algorithms
data-analysis data-science decision-trees machine-learning-algorithms matplotlib numpy pandas python random-forest-classifier
Last synced: 18 Nov 2024
https://github.com/w-edward/youtube-keyword-popularity-analyzer
An effort to discover the top trending keywords on YouTube.
data-analysis node-js numpy python webscraping youtube-api
Last synced: 15 Nov 2024
https://github.com/shivakumarhl/digital-music-store-analysis
Digital Music Store Data Analysis using SQL
Last synced: 17 Nov 2024
https://github.com/bhavanachitragar/data-analysis-using-pyspark
Working with the PySpark module in Python in a Google Colab environment to run queries against the dataset. The dataset consists of two CSV files, listening.csv and genre.csv. Query results are also visualized using matplotlib.
data-analysis google-colab pyspark-sql
Last synced: 17 Nov 2024
https://github.com/hanzopgp/lolanalysis
League Of Legends game data engineering, analysis, visualization and machine learning. Business intelligence project.
data-analysis data-cleaning data-engineering data-visualization dataiku deep-learning etl machine-learning scraping university
Last synced: 15 Nov 2024
https://github.com/ribin-baby/the-sparks-foundation-data-science-internship
This repository contains the tasks and solutions assigned as part of an internship program, including workbooks on data analysis and model building.
Last synced: 15 Nov 2024
https://github.com/hevalhazalkurt/word_analyser
A web app developed in Python and Django that analyzes a given text both mathematically and for sentiment.
analyzer analyzes content data-analysis django emotion python python3 sentiment sentiment-analyser sentiment-analysis text text-analysis
Last synced: 20 Nov 2024
https://github.com/ljadhav25/knn-algorithm-data-science-
This repository contains a project demonstrating the implementation and application of the K-Nearest Neighbors (K-NN) algorithm in Data Science. The objective is to provide a comprehensive understanding of the K-NN algorithm, including data preprocessing, model training, evaluation, and visualization of results. This project is ideal for beginners
data-analysis data-science knn-classification machine-learning matplotlib-pyplot numpy pandas-library seaborn
Last synced: 18 Nov 2024
https://github.com/ahmad-ali-rafique/weather-prediction-fcnn
This project demonstrates a complete pipeline for weather prediction using a Fully Connected Neural Network (FCNN). The project is implemented in Python using Jupyter Notebook, and it covers data loading, preprocessing, model training, and performance evaluation.
ai artificial-intelligence data-analysis data-science deep-learning deep-neural-networks fully-connected-network machine-learning machine-learning-algorithms weather-information
Last synced: 15 Nov 2024