Ecosyste.ms: Awesome

An open API service indexing awesome lists of open source software.

Awesome Lists | Featured Topics | Projects

Data analysis

Data analysis is a process of inspecting, cleansing, transforming, and modeling data with the goal of discovering useful information, informing conclusions, and supporting decision-making.

https://github.com/its-kanii/predictive-maintenance-for-healthcare-equipment

Predictive Maintenance for Healthcare Equipment utilizes machine learning to analyze operational metrics and predict equipment failures. This project leverages a dataset of usage hours, temperature, and maintenance history to enhance equipment reliability and reduce downtime.

data-analysis data-science failure-prediction feature-engineering healthcare-equipment jupyter-notebook machine-learning predictive-maintenance python time-series-analysis

Last synced: 12 Feb 2025

https://github.com/quantitext/quantitext

Official repository for QuantiText applications in the .NET ecosystem.

api aspnet-core csharp data-analysis dotnet-core mvc-architecture

Last synced: 06 Feb 2025

https://github.com/ryanfranklin237/data-visualization-python

A tool that allows you to visualize data from a csv or excel file in a graph or charts form

data-analysis data-science data-visualization matplotlib pandas-dataframe python

Last synced: 10 Jan 2025

https://github.com/ryanfranklin237/data-cleansing

A group of python scripts that clean large data sets by removing duplicate data, putting data in correct formats, and removing redundant cells

data-analysis data-cleaning data-science extract-transform-load pandas-dataframe python

Last synced: 10 Jan 2025

https://github.com/prangonghose/analysis_of_bangladesh_economic_complexity

In this project a brief analysis has been done by our team in the export economy of Bangldesh for the past three decades.

data-analysis data-science data-visualization inequalipy matplotlib pandas plotly

Last synced: 19 Jan 2025

https://github.com/adrianycmc/introducaoadatascience

Explorando dados: Utilizando Python, Pandas e o Colaboratory do Google.

data-analysis data-science jupyter pandas python

Last synced: 12 Feb 2025

https://github.com/x1ao4/doc-merger

通过 python 脚本将两个相对不完整的文档合并为一个完整的文档 / merge two relatively incomplete documents into one complete document via python script

data-analysis data-merging document-analysis document-comparison document-processing documents filtering filtering-data merge merge-documents

Last synced: 20 Feb 2025

https://github.com/thomascenni/anfavea-data-analysis

Data analysis with Pandas and Datapane.

data-analysis datapane pandas

Last synced: 30 Jan 2025

https://github.com/jinkogule/multi-analyst

O Multi Analyst é uma ferramenta de análise de dados com uma usabilidade simples, que utiliza inteligência artificial para interpretar os resultados das análises realizadas, retornando insights úteis aos usuários.

apriori-algorithm bootstrap css data-analysis django html numpy open-ai pandas python web-application

Last synced: 03 Jan 2025

https://github.com/al-ghaly/power-bi-dashboard

A dashboard to analyze data specializations job market.

dashboard data-analysis powerbi

Last synced: 22 Jan 2025

https://github.com/arielle0222/data_analysis

📊 Data analysis projects for autonomous driving and smart mobility engineering using Python and SQL.

autonomous-driving composite data-analysis electric-vehicles environmental-data python visualizatoin

Last synced: 06 Feb 2025

https://github.com/jubinjacob03/heartdiseaseclassify-ml

Heart Disease Dataset Analysis & Classification using ML models such as linear, support vector machine, k-means, k-nearest neighbors and logistic regression.

data-analysis data-science data-visualization ipython-notebook kaggle-dataset kmeans knn linear-regression logistic-regression machine-learning matplotlib python seaborn support-vector-machine

Last synced: 12 Feb 2025

https://github.com/akash1070/data-science-advanced-analytics-virtual-experience-program

The BCG Open-Access Data Science & Advanced Analytics Virtual Experience Program

data-analysis data-science machine-learning-algorithms

Last synced: 29 Jan 2025

https://github.com/akash1070/project---applied-statistics-

To dive deep into this data & find some valuable insights.

data-analysis data-science python statistics

Last synced: 29 Jan 2025

https://github.com/randomshek/Working-With-Excel

Using Excel Power Query and PowerPivot, reorganise the data into a star schema and showcasing reports that can be created by data analysts using DAX formulae and PowerPivot

data-analysis excel power-pivot power-query

Last synced: 27 Nov 2024

https://github.com/alphonsg/bimana

Package for performing automated bio-image analysis tasks.

bioimage-analysis bioinformatics data-analysis deep-learning edge-detection image-analysis image-processing

Last synced: 13 Feb 2025

https://github.com/olgapavlova/agile-health-hackathon

Визуализируем здоровье спринтов разработки по сырым данным

data-analysis data-visualization figma google-sheets matplotlib pandas python sql

Last synced: 19 Jan 2025

https://github.com/viper373/163-buff

爬取网易BUFF平台CS:GO武器皮肤交易数据

163 arima crawler-python csgo data-analysis prediction python

Last synced: 05 Feb 2025

https://github.com/shridhar1504/loan-clustering-datascience-project

This project uses Machine Learning to Cluster loan together based on their similarities. The project uses a dataset of loan application which includes information about the Loan amount and Balance. The project then use the clustering algorithm to group the loan together based on the similarities.

clustering-algorithm data-analysis data-science data-visualization datanalysis eda kmeans-clustering machine-learning python sql sql-server unsupervised-learning

Last synced: 15 Feb 2025

https://github.com/alxrm/scent-of-literature

Russian literature sentiment analysis in terms of very small dataset

classification data-analysis sentiment-analysis sklearn tf-idf

Last synced: 01 Feb 2025

https://github.com/jofaval/mlai-portfolio

My portfolio about Data Analysis, Machine Learning and A.I.

computer-science data-analysis data-science machine-learning portfolio python

Last synced: 04 Feb 2025

https://github.com/vitia-fritelle/analise_dieese

Análise realizada com base nos dados extraídos do site https://www.dieese.org.br/analisecestabasica/salarioMinimo.html

data-analysis economic-data

Last synced: 15 Feb 2025

https://github.com/fbraza/paris_airbnb

Analysis of Paris AirBnB data using R and Shiny

analysis data data-analysis paris-airbnb r shiny

Last synced: 26 Jan 2025

https://github.com/sandk21/detection_faux_billets

Algorithme de détection de faux billets selon leurs dimensions géométriques et application web pour générer les prédictions

data-analysis data-science data-visualization machine-learning pandas python scipy sklearn streamlit

Last synced: 02 Feb 2025

https://github.com/jen-uis/loan-status-prediction

This repository contains project materials for the Winter STAT 206 class, University of California, Riverside, A. Gary Anderson School of Management.

data data-analysis data-analytics data-cleaning data-visualization descriptive-analytics julia julia-language jupyter-notebook predictive-analytics predictive-modeling team-collaboration

Last synced: 22 Jan 2025

https://github.com/giordano-lucas/tesco-extension

Products clustering and interactive visualization

clustering data-analysis data-visualization tesco

Last synced: 02 Jan 2025

https://github.com/gesiscss/wikipedia-language-olga-master

Measuring Gender Inequalities of German Professions on Wikipedia

bias crowdflower data-analysis data-science gender images python statistics wikipedia

Last synced: 03 Jan 2025

https://github.com/ikanurfitriani/project-from-dqlab

This repository contains the results of my learning while at DQLab.

data-analysis data-science data-visualization database dqlab python python3 r sql

Last synced: 26 Jan 2025

https://github.com/lebrancconvas/how-much-love-in-thai-song

How much Love song among the Thai Songs?

data-analysis side-project web-scraping

Last synced: 08 Jan 2025

https://github.com/vatshayan/hospital-discharge-analysis

Analysis of Hospitalization Discharge Rates in Lake County, Illinois of various attributes like Anxiety, Alcohol, mood, Diabetes, Asthma, etc

data-analysis data-visualization jupyter-notebook machine machine-learning machine-learning-algorithms scikit-learn

Last synced: 15 Jan 2025

https://github.com/alexandregazagnes/rica-analysis

This repository contains the code to download, analyse, and modelize the RICA dataset from the french ministry of agriculture.

analysis argiculture business data data-analysis data-analytics food python

Last synced: 03 Jan 2025

https://github.com/dannyben/datamix

DSL for manipulating tabular data

csv data data-analysis data-engineering gem ruby tabular-data

Last synced: 02 Feb 2025

https://github.com/titanscouting/tra-analysis

Titan Robotics 2022 Strategy Team Analysis Repository

data-analysis frc frc-scouting hacktoberfest python

Last synced: 09 Feb 2025

https://github.com/turquetti/projeto5-vamoai

Projeto final da Resilia + iFood <3

data-analysis python tableau

Last synced: 15 Jan 2025

https://github.com/pradipece/weather_forecast_data_analysis

Using decision trees and random forest algorithms to solve real-world data analysis. "sklearn_decision_trees_random_forests"

data-analysis data-science data-visualization git github python python3

Last synced: 02 Feb 2025

https://github.com/madhuresh2011/amazon-sales-report-analysis-using-python

This project focuses on analyzing Amazon sales data using Python to uncover insights into sales performance, customer behavior, and product trends

charts cleaning-data data-analysis jupyter-notebook matplotlib numpy pandas python seaborn visualization

Last synced: 02 Feb 2025

https://github.com/rakumar99/jp-morgan-chase-virtual-internship

This repository contains the various tasks assigned by JPMorgan Chase & Co. Virtual Internship on Microsoft Excel

conditional-formatting dashboard data-analysis data-visualization hlookup pivot-tables presentation vba-macros vlookup

Last synced: 08 Jan 2025

https://github.com/rakumar99/power-bi-projects

This repository contains various power bi projects and dashboards of Humaan Resources , Financial Analysis using Power BI Desktop.

dashboards data-analysis data-visualization databases datacleaning datamodeling etl powerbi powerquery reports

Last synced: 08 Jan 2025

https://github.com/gappeah/nike_web_crawler

This project involves web scraping Nike's product pages to extract product names, prices, and links. The project showcases three different implementations of the web crawler using Selenium and BeautifulSoup. It also includes visualisation of the scraped data using Matplotlib and Seaborn.

beautifulsoup data-analysis data-visualization python selenium web-crawler web-scraper webcrawler webscraper webscraping webscraping-beautifulsoup

Last synced: 10 Nov 2024

https://github.com/gappeah/british-airways-analysis

This project focuses on analyzing and visualising travel data from British Airways using Tableau. The goal is to extract insights and present them in an interactive and visually appealing manner.

data data-analysis data-visualization tableau

Last synced: 07 Jan 2025

https://github.com/gappeah/london-housing-price-dashboard

This Excel-based Housing Visual Dashboard provides a comprehensive view of average house prices across various boroughs in London from 1996 to 2013. The dashboard is designed to offer insights into housing market trends and price variations across different areas of London over time.

data data-analysis data-visualization excel visual

Last synced: 07 Jan 2025

https://github.com/evardnk/dataanalyticsportfolio

Собрание моих проектов по аналитике данных

api automation data-analysis etl-pipeline jupyter-notebook jupyterlab kpis numpy pandas pipeline postgresql powerbi python sql visualization

Last synced: 11 Feb 2025

https://github.com/mohammadreza-mohammadi94/data-analysis-and-machine-learning-projects

A comprehensive collection of data analysis and machine learning projects, showcasing techniques and models for various data challenges. Dive in to explore code examples, analyses, and machine learning workflows.

data-analysis data-science dataframes exploratory-data-analysis pandas python scikit-learn visualization

Last synced: 07 Nov 2024

https://github.com/avinesh-masih/data-analytics-assignment

Comprehensive repository of data analytics assignments covering Python, EDA, data cleaning, visualization, machine learning, statistics, SQL, Power BI, and more. Includes practical projects and examples to build skills in tools like NumPy, Pandas, and business intelligence.

ai api data-analysis data-science data-visualization eda flask hypothesis-testing jupyter-notebook machine-learning matplotlib numpy pandas python seaborn sql statistics

Last synced: 11 Feb 2025

https://github.com/enamhasan/analyzing-the-impact-of-recession-on-automobile-sales

Data Analyis and Visualization Dashboard of the Impact of Recession on Automobile Sales

dashboard data-analysis data-science data-visualization pandas plotly plotly-dash python

Last synced: 02 Feb 2025

https://github.com/daniel1kp/openrtb-dashboard

This is a demo project designed to illustrate using Rill to analyze programmatic bid logs using the canonical open RTB framework.

data-analysis openrtb real-time-bidding rill

Last synced: 15 Jan 2025

https://github.com/haloapping/pisangijo

Kumpulan library dan framework untuk analisa data, data science, machine learning, deep learning dan masih banyak lagi berbasis bahasa pemrograman Python 🐍.

belajar data-analysis data-science deep-learning forecasting libraries machine-learning perkakas pustaka python3 recommender-system referensi tools

Last synced: 06 Jan 2025

https://github.com/nirmit27/book-recommender-system

This is a book recommendation system based on item-based Collaborative Filtering memory-based model created using Flask.

data-analysis data-science flask python python3 recommender-system render

Last synced: 08 Jan 2025

https://github.com/sunnybibyan/exploratory-data-analysis-eda

Welcome to the Titanic Dataset - Exploratory Data Analysis (EDA) project repository! This project aims to uncover insights from the Titanic dataset using Python and Jupyter Notebook. By analyzing key variables such as age, gender, and class, we aim to visualize relationships between passenger characteristics and survival rates.

data-analysis data-visualization jupyter-notebook python titanic-dataset

Last synced: 11 Feb 2025

https://github.com/nafisalawalidris/building-a-clustering-model-for-customer-segmentation

Customer Segmentation Using Clustering: This repo applies clustering algorithms to a customer transaction dataset, grouping similar customers together based on their purchasing behavior. Targeted marketing strategies can be developed by analyzing distinct customer segments.

clustering customer-segmentation data-analysis data-visualization k-means machine-learning marketing-analytics unsupervised-learning

Last synced: 23 Jan 2025

https://github.com/smahala02/calorimtery

A calorimetry lab project involving Python and Excel for computing heat transfer from experimental data.

calorimetry chemistry data-analysis excel jupyter-notebook python thermodynamics

Last synced: 11 Feb 2025

https://github.com/nafisalawalidris/investigating-netflix-movies-and-guest-stars-in-the-office

Dive into the world of Netflix and explore the average duration of movies. Netflix, being the largest entertainment company, offers a wide range of movies for its viewers. In this project, we analyse movie durations using pandas and create a DataFrame from a dictionary. By examining average durations from 2011 to 2020.

average-duration csv-files data-analysis data-visualization dataframe filtering movie-durations movie-length-distribution netflix pandas python trends

Last synced: 23 Jan 2025

https://github.com/pinedah/escom_development-of-applications-for-data-analysis

This repository is a personal collection of programs, exercises, and notes from the Development of Applications for Data Analysis course at Instituto Politécnico Nacional (IPN). As part of the Bachelor's in Data Science, the course focuses on developing practical skills in Python for data analysis.

data-analysis data-science data-visualization jupyter-notebook python python-data-analysis

Last synced: 11 Feb 2025

https://github.com/carusel02/sequential-data-processing-and-analysis

Sequential data processing and analysis using linked-list in C

data-analysis data-processing linked-list

Last synced: 09 Feb 2025

https://github.com/greed2411/ndl

Numbers Don't Lie, attempt on Data Analysis using pandas and matplotlib.

cities data-analysis data-science data-visualization india kaggle

Last synced: 18 Jan 2025

https://github.com/patriloto/reinventartec_2021

Material para el taller de Primeros pasos en R para el análisis de datos

data-analysis rstats

Last synced: 28 Jan 2025

https://github.com/mindgamesnl/yanderestats

https://mindgamesnl.github.io/YandereStats/

data-analysis reporting-pipeline yandere yandere-sim

Last synced: 01 Jan 2025

https://github.com/souvik09-tech/walmart_sales_dataanalysis

This end-to-end data analysis project leverages Python for processing and SQL for advanced querying to extract key business insights from Walmart sales data. It's designed for data analysts to enhance skills in data manipulation, querying, and pipeline creation.

data-analysis end-to-end etl-pipeline jupyter-notebook mysql mysql-database pandas python

Last synced: 09 Jan 2025

https://github.com/nafisalawalidris/tools-for-data-science

It covers popular languages (Python, R, SQL) and libraries (NumPy, Pandas) used in the field. The author shares their objectives of teaching data analysis, web development, and critical thinking skills. The repository also includes code examples, explanations of arithmetic expressions, and contact information for the author.

arithmetic-expressions data-analysis data-science data-visualization languages libraries matplotlib numpy pandas programming python r sql tools web-development

Last synced: 23 Jan 2025

https://github.com/gaurav-van/ev-battery-analysis

Evaluating Key Factors Influencing EV Battery Performance and Lifespan for Lizmotors Mobility Assessment.

data-analysis data-visualization electric-vehicles ev-cars-analysis exploratory-data-analysis informed-decisions python

Last synced: 02 Feb 2025

https://github.com/gaurav-van/house_price_predictor_streamlit_web_app

Data Science Project to Predict House Prices in Bangalore using the concept of Regression. This Repository is used for Deployment of the Project

data-analysis data-science exploratory-data-analysis machine-learning prediction python regression streamlit

Last synced: 02 Feb 2025

https://github.com/nafisalawalidris/international-breweries

This GitHub readme provides an overview of data analysis using SQL on the International Breweries dataset, including dataset description, analysis questions, example SQL queries, and key insights derived from the analysis.

data-analysis insights international-breweries-dataset queries sql

Last synced: 23 Jan 2025

https://github.com/saidsef/ff18

A complete catalog of all the players in Fifa 2018 and their complete statistics

data-analysis data-visualization fifa18 machine-learning machine-learning-algorithms world-cup-2018 world-cup-ranking

Last synced: 15 Jan 2025

https://github.com/dcs-training/introcausalinference

This is a repository for the Introduction to Causal Inference course provided by Chris Oldnall for the CDCS. Go to the readme file

data-analysis python r statistics

Last synced: 07 Jan 2025

https://github.com/whis99/userfunnelanalysis

An ecommerce user funnel conversion data analysis with matplotlib & python.

data-analysis data-analysis-python data-analyst data-visualization google-colab jupyter-notebook matplotlib python

Last synced: 13 Jan 2025

https://github.com/dcs-training/good-data-visualisation-with-r

Our guide on how we create data visualisations through R. Go to the readme file

data-analysis data-visualisation r rmarkdown

Last synced: 07 Jan 2025

https://github.com/noeyislearning/sharpe-ratio-amazon-facebook

Explore the Sharpe Ratio and its application to evaluate the performance of two tech giants: Amazon and Facebook.

amazon data-analysis data-science data-visualization facebook python3 sharpe-ratio

Last synced: 01 Feb 2025

https://github.com/airdac/sim-telco_customer_churn

Prediction of customer churn with logistic regression in R. Team project from UPC's Master's Degree in Data Science

classification data-analysis data-science logistic-regression r statistical-models upc

Last synced: 15 Jan 2025

https://github.com/mathieu2301/pbsc-tracker

Expérience de tracking des vélos en libre service fonctionnants avec PBSC

ai data-analysis data-mining data-science data-visualization libelo machine-learning pbsc valence velib-tracker

Last synced: 15 Jan 2025

https://github.com/jamesnw/wtb-data

Explore beer addition and style info from WhatToBrew.com

data-analysis homebrewing jupyter-notebook python3

Last synced: 27 Jan 2025

https://github.com/noeyislearning/intro-to-data-analysis

The repository teaches skills for cleaning, exploring, analyzing, and visualizing data in Python to gain insights and make data-driven decisions.

data-analysis jupyter-notebook lecture-notes python

Last synced: 01 Feb 2025

https://github.com/shadan100/sales-prediction-analysis

The aim is to build a predictive model and find out the sales of each product at a particular store. Using this model, BigMart will try to understand the properties of products and stores which play a key role in increasing sales.

artificial-intelligence data-analysis data-science django django-framework jupyter-notebook machine-learning matplotlib pandas predictive-modeling python sales-prediction

Last synced: 12 Feb 2025