Ecosyste.ms: Awesome

An open API service indexing awesome lists of open source software.

Awesome Lists | Featured Topics | Projects

Data analysis

Data analysis is a process of inspecting, cleansing, transforming, and modeling data with the goal of discovering useful information, informing conclusions, and supporting decision-making.

https://github.com/phillbertnevinemmanuel/dataprofessionalquestionnaireanalysis_pbix

This project delves into the demographics and job satisfaction of data professionals, presenting insights through a user-friendly dashboard built with Power BI. The dataset has been graciously provided by Alex The Analyst

dashboard data-analysis powerbi visualization

Last synced: 23 Dec 2024

https://github.com/jen-uis/loan-status-prediction

This repository contains project materials for the Winter STAT 206 class, University of California, Riverside, A. Gary Anderson School of Management.

data data-analysis data-analytics data-cleaning data-visualization descriptive-analytics julia julia-language jupyter-notebook predictive-analytics predictive-modeling team-collaboration

Last synced: 21 Nov 2024

https://github.com/arhcoder/base-hackathon-2022

💸 Sistema que analiza las facturas de compra-venta de una empresa de importaciones y exportaciones, y crea una base de conocimiento con la que crea sugerencias de abastecimiento para las empresas clientes de Banco BASE, con el fin de ahorrarles dinero.

algorithms bank companies data-analysis decision-making exportation hackaton importation javascript mysql python suggestions

Last synced: 10 Jan 2025

https://github.com/ssreeramj/youtube_channels_analysis

This web app gives a detailed analysis of the videos uploaded in a particular youtube channel.

data-analysis heroku pandas python streamlit youtube

Last synced: 02 Jan 2025

https://github.com/draym/swmanager

Web-app to help you in your daily life raids in SpacesWars thanks to game statistics and data management

dashboard-application data-analysis data-visualization game-data game-utility

Last synced: 10 Jan 2025

https://github.com/al-ghaly/airline-company-data-warehouse

Data Warehouse modeling, design, implementation, and analysis for an Airline Company.

data-analysis data-warehousing database-modeling sql-server

Last synced: 21 Nov 2024

https://github.com/gauranshgoel123/predictive-demand-analysis

Demand Forecasting Project A web application for predicting future demand for part numbers based on historical data. Built with React for the frontend and FastAPI with Python for the backend, this application visualizes demand trends and allows users to input additional data for improved accuracy. In render analyzer is frontend analysis is backend

chartjs data-analysis data-science data-visualization dataset deployment full-stack machine-learning numpy pandas predictive-analysis prophet-model python reactjs render

Last synced: 12 Jan 2025

https://github.com/sevdanurgenc/data-modeling-techniques-lecture-notes

In this repo, I have the course contents of Data Modelling Techniques training, which will be given to Innova Technology by the cooperation of Academy Peak Information Technologies Training and Consultancy between 25 - 26 January 2022.

data-analysis data-mining data-modeling data-science data-structure data-visualization

Last synced: 30 Nov 2024

https://github.com/ansh420/mcdonald_case-study

It is basically depend on the market Segment Analysis. It is a case study of mcDonald.

algorithms-implemented data-analysis python3 segmentation

Last synced: 18 Nov 2024

https://github.com/titanscouting/tra-analysis

Titan Robotics 2022 Strategy Team Analysis Repository

data-analysis frc frc-scouting hacktoberfest python

Last synced: 17 Dec 2024

https://github.com/okwilkins/retailanalysis

A comprehensive exploratory analysis and implementation of kmeans/hierarchical clustering on online retail data.

data-analysis data-science machine-learning statistics

Last synced: 20 Nov 2024

https://github.com/jasontanx/capstone-project-machine-learning

A final semester project from my MSc Data Science course

data-analysis datascience machinelearningprojects tourism-data

Last synced: 05 Dec 2024

https://github.com/supertetelman/kaggle-public

A collection of Python and Matlab projects aimed at utilizing various machine learning techniques to solve big data problems.

cnn data-analysis deep-learning machine-learning matlab python

Last synced: 30 Nov 2024

https://github.com/vipul2001/cousera-courses

This repo covers the solution to the assignments of various courses on algorithm,deep learning and data Analytics

coursera-courses data-analysis data-analytics-ibm deep-learning divide-and-conquer neural-network

Last synced: 17 Jan 2025

https://github.com/vitia-fritelle/ipynb_converter

Jupyter notebook to Python file conversor

data-analysis data-science jupyter-notebook python

Last synced: 16 Dec 2024

https://github.com/chen0040/pyspark-advanced-algorithms

Samples of Advanced Algorithms and Data Analysis implemented in pyspark

advanced-algorithms data-analysis map-reduce pyspark

Last synced: 16 Dec 2024

https://github.com/khuyentran1401/sample_datapane_script

This repo shows how to use Datapane create a simple script to see the rank of the authors or publications with respect to publishing frequency

data-analysis data-science datapane python

Last synced: 26 Nov 2024

https://github.com/sathyasris27/statistical-analysis-on-rehoming-time-for-different-dog-breeds-in-animal-shelter

The aim of this project is given a collection of records documenting the stray, unwanted, or neglected dogs sent to animal shelters to be rehomed, we analyse their rehoming patterns based on their breeds.

data-analysis r statistical-analysis statistical-inference statistical-models

Last synced: 10 Jan 2025

https://github.com/sathyasris27/time-series-and-spectral-analysis-

The aim of this project involves the analyses the data, removing trends and seasonal effects, identifying the underlying process, understanding the dominant frequencies, and using the residuals to make predictions.

data-analysis data-visualization forecasting r spectral-analysis time-series-analysis

Last synced: 10 Jan 2025

https://github.com/shridhar1504/foreign-exchange-rate-time-series-datascience-project

This project will use time series analysis to forecast the exchange rate between the euro and the US dollar. The project will use a variety of statistical techniques, such as ARIMA to model the data and forecast the exchange rate.

data-analysis data-preprocessing data-science data-transformation data-visualization eda exploratory-data-analysis foreign-exchange-rates machine-learning model-fitting predictive-modeling python3 time-series time-series-analysis

Last synced: 23 Dec 2024

https://github.com/gauravcodepro/numpy-builder

A numpy shell builder to extract and how to use the numpy across the arrays.I am putting the entire manual for those who like to search immediately rather than looking here and there.

bash-prompt bash-script bash-scripting data-analysis data-mining data-science numpy numpy-arrays shell-prompt shell-script

Last synced: 02 Jan 2025

https://github.com/shridhar1504/loan-clustering-datascience-project

This project uses Machine Learning to Cluster loan together based on their similarities. The project uses a dataset of loan application which includes information about the Loan amount and Balance. The project then use the clustering algorithm to group the loan together based on the similarities.

clustering-algorithm data-analysis data-science data-visualization datanalysis eda kmeans-clustering machine-learning python sql sql-server unsupervised-learning

Last synced: 23 Dec 2024

https://github.com/shridhar1504/power-bi-visualization-project

This repository contains Visualization Projects which is visualized through Power BI Software, by using the visualization we can gain multiple insights and strategies which helps to develop the business for gaining high profit margins and by the insights we can reduce the damages by accidents & calamities.

dashboard data-analysis data-cleaning data-science data-visualization exploratory-data-analysis microsoft-excel microsoft-power-bi microsoft-powerpoint powerbi powerbi-report powerbi-visuals powerpoint-slides

Last synced: 23 Dec 2024

https://github.com/tnleite/projeto_king_lift

Este projeto apresenta uma análise detalhada dos dados financeiros da King Lift, uma empresa de locação de empilhadeiras. Utilizando Microsoft Excel, Power Query e Power Pivot, desenvolvi um dashboard interativo, também em Excel, que ajuda a empresa a obter insights valiosos para melhorar a eficiência operacional e aumentar o faturamento.

data-analysis data-science data-visualization excel

Last synced: 10 Dec 2024

https://github.com/olgapavlova/agile-health-hackathon

Визуализируем здоровье спринтов разработки по сырым данным

data-analysis data-visualization figma google-sheets matplotlib pandas python sql

Last synced: 18 Nov 2024

https://github.com/thecoderpinar/samsung_stock_analysis_forecasting_and_volatility_analysis

A comprehensive analysis and forecasting project for Samsung stock data, utilizing historical data to build predictive models and analyze volatility.

data-analysis deep-learning financial-analysis forecasting machine-learning python stock-analysis volatility-forecasting

Last synced: 16 Dec 2024

https://github.com/ryanfranklin237/data-visualization-python

A tool that allows you to visualize data from a csv or excel file in a graph or charts form

data-analysis data-science data-visualization matplotlib pandas-dataframe python

Last synced: 10 Jan 2025

https://github.com/ryanfranklin237/data-cleansing

A group of python scripts that clean large data sets by removing duplicate data, putting data in correct formats, and removing redundant cells

data-analysis data-cleaning data-science extract-transform-load pandas-dataframe python

Last synced: 10 Jan 2025

https://github.com/jinkogule/multi-analyst

O Multi Analyst é uma ferramenta de análise de dados com uma usabilidade simples, que utiliza inteligência artificial para interpretar os resultados das análises realizadas, retornando insights úteis aos usuários.

apriori-algorithm bootstrap css data-analysis django html numpy open-ai pandas python web-application

Last synced: 03 Jan 2025

https://github.com/ronaldkanyepi/python-sreamlit-duplicate-records-finder-remover

This is a duplicate remover on csv,excel or txt files based on single or multi columns

css data-analysis data-visualization datascience python streamlit

Last synced: 04 Jan 2025

https://github.com/bishtrishu/pizza_sales_data_analysis_sql

This project is a comprehensive data analysis of pizza sales, aimed at uncovering key insights and trends to inform business decisions. Using a combination of SQL, Python, and data visualization tools, the project analyzes sales data to understand customer preferences, peak sales periods, and the most popular pizza types.

cloud data data-analysis data-science data-visualization dataanalytics database mysql oracle-database

Last synced: 04 Jan 2025

https://github.com/nelsonkariuki/dataanalysis

This project involves data analysis of vido game sales from https://www.kaggle.com/gregorut/videogamesales/download

data-analysis data-visualization python

Last synced: 10 Jan 2025

https://github.com/kaz-yos/distributed

Comparison of Privacy-Protecting Analytic and Data-sharing Methods: a Simulation Study (Pharmacoepidemiol Drug Saf 2018)

data-analysis epidemiology statistics

Last synced: 11 Jan 2025

https://github.com/alexandrelamarre/fission

Data analytics & Structured streaming optimized for the Edge

data-analysis data-engineering rust structured-data unstructured-data

Last synced: 11 Jan 2025

https://github.com/ganesh2409/cricket-player-performance

This repository contains a comprehensive project focused on analyzing cricket player performance using various datasets, including batting, bowling, and match results. The project involves data preprocessing, feature engineering, and model training to predict and evaluate player performance scores. It includes detailed scripts for data analysis

cricket-performance-analysis data-analysis machine-learning sports-analytics

Last synced: 11 Jan 2025

https://github.com/msthamizh/phonepe-pulse-data-visualization-and-exploration

Developing a Streamlit application that allows users to explore and analyze transaction data from the PhonePe Pulse dataset. The project aims to provide insights into digital payment trends across India.

data-analysis data-visualization dataframe mysql pandas plotly python streamlit

Last synced: 27 Nov 2024

https://github.com/maskedsyntax/taskit

A simple web based Task Tracker for better focus

charts data-analysis python3 streamlit task-tracker-app todo-list

Last synced: 10 Dec 2024

https://github.com/randomshek/Working-With-Excel

Using Excel Power Query and PowerPivot, reorganise the data into a star schema and showcasing reports that can be created by data analysts using DAX formulae and PowerPivot

data-analysis excel power-pivot power-query

Last synced: 27 Nov 2024

https://github.com/cano1998/eda-survival-of-the-titanic

This project focuses on Exploratory Data Analysis (EDA) to identify the key determinants that influenced survival during the infamous Titanic accident.

data-analysis data-cleaning data-preprocessing data-visualization exploratory-data-analysis jupyter-notebook titanic-survival-exploration

Last synced: 04 Jan 2025

https://github.com/carlosvinimsouza/dataanalysiswithpython

Learn Data Analysis using Libs Python (Numpy, Pandas, Matplotlib and Seaborn)

data-analysis data-science free-code-camp matplotlib numpy pandas python python3 seaborn

Last synced: 11 Jan 2025

https://github.com/moindalvs/learn_eda_house_price_dataset

Data Set: House Prices: Advanced Regression Techniques Exploratory Data Analysis on more than 80 features

cardinality data-analysis data-science data-structures data-visualization missing-values

Last synced: 18 Jan 2025

https://github.com/vidhi1290/machine-learning-pipeline

Explore a collection of Jupyter notebooks that guide you through various stages of the machine learning pipeline. From data analysis and feature engineering to model training and deployment, these notebooks provide practical insights for both beginners and experienced data enthusiasts. Let's dive into the world of data-driven decision-making! 📊🚀"

data-analysis feature-engineering feature-selection jupyter jupyter-notebook machine-learning machine-learning-algorithms machine-learning-pipeline model-training new-dataset opensource python

Last synced: 08 Dec 2024

https://github.com/ehtisham-sadiq/building-an-ml-based-heart-disease-diagnosis-system-with-flask

It is an end-to-end project that combines machine learning to create a user-friendly Heart Disease Diagnosis System, powered by Flask.

data-analysis exploratory-data-analysis feature-engineering flask machine-learning model-building model-evaluation pipelines python3 rest-api

Last synced: 11 Jan 2025

https://github.com/nikhilash45/live_ipl_report

This repository hosts the source code for an interactive IPL (Indian Premier League) Dashboard built using PowerBI. The dashboard provides real-time updates on ongoing matches, including live scores, batting and bowling statistics for both teams, and the points table.

analysts cleaning-data cricket-data dashboard data data-analysis data-visualization dax powerbi

Last synced: 11 Jan 2025

https://github.com/vidhi1290/zomato-data-analysis

Zomato Data Analysis - Explore the world of Zomato restaurant data through Python and data analysis. Uncover trends and insights using Pandas for data manipulation and Matplotlib for visualization. Join us in this journey to reveal the hidden stories within the data!

data-analysis data-analysis-python data-science data-visualization dataprocessing machine-learning machine-learning-algorithms matplotlib numpy pandas python scikit-learn zomato-data-analysis

Last synced: 08 Dec 2024

https://github.com/muneeb1030/eda-of-physionets-ecg

EDA of Physionet Data set regarding "A Large Scale 12 Lead Electrocardiogram Database for Arrhythmia Study 1.0.0". This project focuses on the preprocessing of electrocardiogram (ECG) signals and utilizes Principal Component Analysis (PCA) for dimensionality reduction

12-lead-ecg data-analysis ecg-signal eda pca python3 wfdb

Last synced: 11 Jan 2025

https://github.com/sarincr/data-analytics-with-knime

Data Analytics with KNIME (Konstanz Information Miner), a free and open-source data analytics, reporting and integration platform. KNIME integrates various components for machine learning and data mining through its modular data pipelining concept. A graphical user interface and use of JDBC allows assembly of nodes blending different data sources, including preprocessing (ETL: Extraction, Transformation, Loading), for modeling, data analysis and visualization without, or with only minimal, programming.

ai artificial-intelligence artificial-intelligence-algorithms artificial-neural-networks data-analysis data-mining data-science data-structures data-visualization database datascience deep-learning machine-intelligence machine-learning machine-learning-algorithms machinelearning mining mining-software

Last synced: 20 Nov 2024

https://github.com/brunomontezano/benzocovid

💊 Data Analysis Project of Benzodiazepines during COVID-19 Pandemic.

benzodiazepines covid-19 data-analysis

Last synced: 11 Jan 2025

https://github.com/cjunwon/youtube-data-analysis

End-to-end Youtube data analysis project using Youtube Data API, MySQL, AWS, Flask

aws-rds data-analysis datapipeline flask nlp pandas python shell sql vader-sentiment-analysis youtube youtube-api

Last synced: 15 Dec 2024

https://github.com/pheithar/socialdata_madridcentral

Social data and visualization course at DTU - 2022. Effectiveness of Madrid Central

data-analysis data-visualization jupyer-notebook madrid python

Last synced: 30 Nov 2024

https://github.com/jakubkorytko/data-graphs

Transform raw data into captivating visual stories with this app, effortlessly craft stunning data charts that unveil insights and trends

charts data-analysis mit-license open-source

Last synced: 11 Jan 2025

https://github.com/gher-uliege/stareso-data-processing

A set of tools to read, plot and process data from STARESO

coastal corsica data-analysis data-processing ocean-sciences oceanography

Last synced: 11 Dec 2024

https://github.com/gab-182/market-analysis-report-for-national-clothing-chain

Using custom M and DAX codes in Power BI, I conducte a thorough market analysis for a national clothing chain. The insights gathered from customer data and US Census Bureau statistics led to the formulation of a targeted marketing strategy, contributing to enhanced sales and customer satisfaction.

data-analysis power-bi

Last synced: 18 Jan 2025

https://github.com/is-leeroy-jenkins/sherpa

A budget execution & data analysis tool based on Winforms, .NET 6, and written in C# for EPA analysts

budget-management data-analysis data-science data-visualization federal-government

Last synced: 12 Jan 2025

https://github.com/shliakhovai/sales-analysis-project

This project involves analyzing sales data to gain insights into sales trends, performance metrics, and product categories. The analysis includes data cleaning, exploratory data analysis (EDA), sales trend analysis, profit dependency analysis, and ABC analysis.

abc-analysis data-analysis data-science data-visualization eda exploratory-data-analysis jupyter-notebook python

Last synced: 12 Jan 2025

https://github.com/gher-uliege/bluecloud-plankton

Spatial interpolation of plankton data using a neural network

data data-analysis data-visualization neural-network oceanography

Last synced: 11 Dec 2024

https://github.com/mdaffailhami/customer-data-analysis

This repository contains code and analysis for exploring customer data, focusing on profiling and contact preferences. The project includes various stages of data processing, from raw data preparation to final cleaned datasets, and employs Python and popular data analysis libraries to uncover insights and trends.

data-analysis data-cleaning data-science data-visualization jupyter jupyter-notebook pandas plotly python

Last synced: 12 Jan 2025

https://github.com/alejo1630/sport_stats

Data analysis of information from the summer and winter Olympic games over the years. UC Davis SQL Specialization Final Project

data-analysis jupyter-notebook olympics-dataset plotly python seaborn sql

Last synced: 31 Dec 2024

https://github.com/fbraza/python-dataframe-skim

Get an extended statistic summary of your pandas DataFrame

data-analysis data-science dataframe pandas python3

Last synced: 27 Nov 2024

https://github.com/mohamedhany99/collecting-sound-features-frequency-from-an-audio-file-and-save-it-in-excel

This script takes a single audio file and collect all the sound features in it including (Frequency - mean - variance - minimum value - maximum value - Median - Kurtosis - Skewness) and save it in an array and import this array in a row in the datasheet.

automation data-analysis data-science dataset-generation excel-import signal-processing

Last synced: 17 Jan 2025

https://github.com/fbraza/paris_airbnb

Analysis of Paris AirBnB data using R and Shiny

analysis data data-analysis paris-airbnb r shiny

Last synced: 27 Nov 2024

https://github.com/fatihilhan42/web_scraping_football_statistics_per_game_data-main

In this notebook I will describe the process of scraping data from web portal understat.com that has a lot of statistical information about all games in top 5 European football leagues.

data-analysis data-manipulation data-science data-scraping data-visualization jupyter-notebook python

Last synced: 01 Dec 2024

https://github.com/gursv/stocksage

Predict next day's close price for a stock like NSEI, NYA, HSI, IXIC, TWII, etc...!

data-analysis data-preprocessing data-science gridsearchcv machine-learning python3 random-forest-regressor stock-data stock-price-prediction streamlit

Last synced: 12 Jan 2025

https://github.com/rishabhraj43/diwali-sales-analysis

A Data Analysis project made in Python

data-analysis python

Last synced: 12 Jan 2025

https://github.com/codingprivacy/feedback-portal-system

AI based Feedback Portal System which takes periodic feedbacks from users via highly human friendly chat-bot, analyse the responses through NLP and sentiment analysis and visualize the analysis on the portal website.

artificial-intelligence bokeh chatbot data-analysis flask mysql-database nlp portal python sentiment-analysis visualization website

Last synced: 12 Jan 2025

https://github.com/programmer-rd-ai/dimensionality-reduction

DimRed is a comprehensive Python toolkit for advanced dimensionality reduction, integrating with major machine learning libraries and featuring real-time performance monitoring to enhance data analysis and model efficiency.

analytics data-analysis data-science lightgbm machine-learning matplotlib numpy pandas programming python python3 sklearn university xgboost

Last synced: 12 Jan 2025

https://github.com/alejo1630/chicago_crimes

A Jupyter Notebook with the data analysis and data visualization of crimes in Chicago from 2017 to 2023 using libraries such as seaborn and folium

data-analysis data-visualization folium pandas python seaborn

Last synced: 31 Dec 2024

https://github.com/umutsevdi/hr-management

HR Management, Analytics and Salary Determination System

analytics data-analysis java java17 postgresql python spring spring-boot vaadin vaadin-flow

Last synced: 18 Jan 2025

https://github.com/jakobzmrzlikar/fake-news-analysis

An analysis of the FakeNewsNet dataset using NLP techniques.

data-analysis fake-news ipynb-jupyter-notebook nlp-machine-learning

Last synced: 12 Jan 2025

https://github.com/viztruth/google-play-store-data-analysis

This repository contains all the materials of my final project 'Google Play store Data Analysis' for the 'Telling Stories with Data' course at PES University.

data-analysis data-visualization

Last synced: 27 Nov 2024

https://github.com/phillbertnevinemmanuel/automotivesalesdataanalysis

This marks my inaugural venture into personal data analysis, employing SQL and Python for Correlation Analysis. I've sourced the dataset from Kaggle, specifically focusing on automotive sales. You can find the dataset linked on my website below. I'm excited to share that I've independently managed the majority of tasks involved in this project.

data-analysis dataset microsoft-sql-server python python-lambda sql ssms tsql

Last synced: 17 Nov 2024

https://github.com/yard1/linearordering

An R package. Provides various methods of linear ordering of data. Supports weights and positive/negative impacts.

data-analysis data-analysis-in-r data-analysis-r data-science r

Last synced: 18 Nov 2024

https://github.com/gattiharishkumar/blinkit-sales-analysis-dashboard

This project presents a comprehensive sales analysis dashboard for Blinkit, an Indian last-minute delivery app. The dashboard was created using Power BI and provides a detailed overview of the company's sales performance across various outlets and product categories.

dashboard data-analysis data-transformation data-visualization ms-excel-data-analytics power-query powerbi powerbi-visuals

Last synced: 29 Dec 2024

https://github.com/shipyardapp/amazonathena-blueprints

Simplified blueprints for building data pipelines with Amazon Athena.

amazon-athena athena cli data-analysis data-engineering data-science elt etl

Last synced: 04 Dec 2024

https://github.com/adagio/ivoox_episodes

iVoox Episodes: Scraping & Analysis

beautifulsoup4 data-analysis ivoox pandas python python3 scraping

Last synced: 27 Dec 2024

https://github.com/saidsef/ff18

A complete catalog of all the players in Fifa 2018 and their complete statistics

data-analysis data-visualization fifa18 machine-learning machine-learning-algorithms world-cup-2018 world-cup-ranking

Last synced: 15 Jan 2025

https://github.com/giordano-lucas/tesco-extension

Products clustering and interactive visualization

clustering data-analysis data-visualization tesco

Last synced: 02 Jan 2025

https://github.com/vishal-038/real_estate_price_prediction

The Real Estate Price Prediction project aims to develop a machine learning model to predict house prices based on various features

data-analysis data-science data-visualization machine-learning python

Last synced: 22 Nov 2024