Ecosyste.ms: Awesome

An open API service indexing awesome lists of open source software.

Awesome Lists | Featured Topics | Projects

Data analysis

Data analysis is a process of inspecting, cleansing, transforming, and modeling data with the goal of discovering useful information, informing conclusions, and supporting decision-making.

https://github.com/gaurav-van/ev-battery-analysis

Evaluating Key Factors Influencing EV Battery Performance and Lifespan for Lizmotors Mobility Assessment.

data-analysis data-visualization electric-vehicles ev-cars-analysis exploratory-data-analysis informed-decisions python

Last synced: 02 Feb 2025

https://github.com/souvik09-tech/walmart_sales_dataanalysis

This end-to-end data analysis project leverages Python for processing and SQL for advanced querying to extract key business insights from Walmart sales data. It's designed for data analysts to enhance skills in data manipulation, querying, and pipeline creation.

data-analysis end-to-end etl-pipeline jupyter-notebook mysql mysql-database pandas python

Last synced: 09 Jan 2025

https://github.com/patriloto/reinventartec_2021

Material para el taller de Primeros pasos en R para el análisis de datos

data-analysis rstats

Last synced: 28 Jan 2025

https://github.com/nafisalawalidris/tools-for-data-science

It covers popular languages (Python, R, SQL) and libraries (NumPy, Pandas) used in the field. The author shares their objectives of teaching data analysis, web development, and critical thinking skills. The repository also includes code examples, explanations of arithmetic expressions, and contact information for the author.

arithmetic-expressions data-analysis data-science data-visualization languages libraries matplotlib numpy pandas programming python r sql tools web-development

Last synced: 23 Jan 2025

https://github.com/fbraza/python-dataframe-skim

Get an extended statistic summary of your pandas DataFrame

data-analysis data-science dataframe pandas python3

Last synced: 26 Jan 2025

https://github.com/drcbeatz/aynm-data

Python scripts for data cleaning and processing for AYNM (Pandas/NumPy/Selenium/Azure Cognitive Services)

automation azure-cognitive-services csv data-analysis data-cleaning ipynb numpy ocr pandas python reverb selenium shopify webscraping xml

Last synced: 26 Jan 2025

https://github.com/abeltavares/finstockdash

A streamlit web app for retrieving and analyzing financial data for a stock ticker.

dashboard data-analysis finance financial-analysis financial-reporting financial-statements investments python stocks streamlit web-app

Last synced: 05 Jan 2025

https://github.com/mleidel/sqlcel

Python GUI to run SQL select on Spreadsheets, CSV, and Sqlite database tables

csv-export csv-import data data-analysis data-science datascience excel-export excel-import pandas python3 sql sqlalchemy sqlite3

Last synced: 12 Feb 2025

https://github.com/khuyentran1401/sample_datapane_script

This repo shows how to use Datapane create a simple script to see the rank of the authors or publications with respect to publishing frequency

data-analysis data-science datapane python

Last synced: 26 Jan 2025

https://github.com/adolbyb/data-science-python

An Introduction to Data Science and Data Visualization with the FAU Data Science and Machine Learning Club

data-analysis data-science data-visualization jupyter-notebook matplotlib numpy pandas python seaborn

Last synced: 20 Jan 2025

https://github.com/pinedah/escom_development-of-applications-for-data-analysis

This repository is a personal collection of programs, exercises, and notes from the Development of Applications for Data Analysis course at Instituto Politécnico Nacional (IPN). As part of the Bachelor's in Data Science, the course focuses on developing practical skills in Python for data analysis.

data-analysis data-science data-visualization jupyter-notebook python python-data-analysis

Last synced: 11 Feb 2025

https://github.com/gappeah/global-shipping-analytics-dashboard

This Tableau project provides a comprehensive visual analysis of global sales, shipping costs, and quality metrics across different regions and countries.

data data-analysis data-analyst data-visualization metrics tableau

Last synced: 07 Jan 2025

https://github.com/turquetti/projeto5-vamoai

Projeto final da Resilia + iFood <3

data-analysis python tableau

Last synced: 15 Jan 2025

https://github.com/sathyasris27/data-analysis-on-adult-smoking-patterns-in-the-uk

The aim of this analysis is to understand the smoking patterns among adults in the UK.

data data-analysis data-visualization python3

Last synced: 10 Jan 2025

https://github.com/smahala02/calorimtery

A calorimetry lab project involving Python and Excel for computing heat transfer from experimental data.

calorimetry chemistry data-analysis excel jupyter-notebook python thermodynamics

Last synced: 11 Feb 2025

https://github.com/kedwar83/housing-prices-exploration

Using machine learning algorithms to explore housing prices

data-analysis data-science python school-project

Last synced: 21 Jan 2025

https://github.com/atxtechbro/glassdoorwebscraping

"Scraping Glassdoor: A GraphQL Journey" is an advanced data harvesting tool leveraging GraphQL and an API-first strategy to extract and analyze Glassdoor data for business intelligence and predictive analytics.

api-first-approach business-intelligence data-analysis data-harvesting data-mining data-science glassdoor-scraper graphql html machine-learning performance-optimization predictive-analytics python requests-library-python scaleability scraper system-design web-scraping

Last synced: 25 Jan 2025

https://github.com/allanotieno254/codsoft

This repository showcases a series of data science projects completed during an internship with CODESOFT. Each project utilizes Python and various machine learning techniques to solve specific problems in data analysis, classification, regression, and predictive modeling.

classification data-analysis data-science feature-engineering machine-learning model-evaluation predictive-modeling python-programming regression

Last synced: 17 Feb 2025

https://github.com/nafisalawalidris/investigating-netflix-movies-and-guest-stars-in-the-office

Dive into the world of Netflix and explore the average duration of movies. Netflix, being the largest entertainment company, offers a wide range of movies for its viewers. In this project, we analyse movie durations using pandas and create a DataFrame from a dictionary. By examining average durations from 2011 to 2020.

average-duration csv-files data-analysis data-visualization dataframe filtering movie-durations movie-length-distribution netflix pandas python trends

Last synced: 23 Jan 2025

https://github.com/virajbhutada/global-universities-success-analysis-powerbi-sql-excel

This capstone project conducts in-depth analysis using Power BI, SQL, and Excel to explore complex dynamics shaping global university success. Integrating data from diverse ranking systems and criteria, our aim is to unravel the factors influencing universities worldwide.

capstone capstoneproject data-analysis data-analytics data-insights data-science data-science-projects data-visualization excel exploratory-data-analysis mece mysql powerbi powerpoint sql

Last synced: 10 Jan 2025

https://github.com/sunnybibyan/exploratory-data-analysis-eda

Welcome to the Titanic Dataset - Exploratory Data Analysis (EDA) project repository! This project aims to uncover insights from the Titanic dataset using Python and Jupyter Notebook. By analyzing key variables such as age, gender, and class, we aim to visualize relationships between passenger characteristics and survival rates.

data-analysis data-visualization jupyter-notebook python titanic-dataset

Last synced: 11 Feb 2025

https://github.com/steciuk/ium-recommendation-system

Evaluation and comparison of 3 different recommendations models for web shopping service simulation.

data-analysis model-evaluation recomendation-system

Last synced: 06 Jan 2025

https://github.com/allanotieno254/british-airways-review-dashboard-using-tableau

An interactive dashboard built using Tableau, visualizing British Airways customer review data. This project analyzes customer sentiment, satisfaction levels, and other key metrics to provide insights into passenger experiences and identify improvement areas for British Airways.

customer-satisfaction-analysis data-analysis data-visualisation tableau-dashboards tableau-public

Last synced: 17 Feb 2025

https://github.com/datalopes1/ds_salaries2024_eda

Neste projeto será realizado o processo de EDA (Exploratory Data Analysis) a partir do dataset Data Science Salaries 2024, que pode ser encontrado no Kaggle, com licensa Database: Open Database e enviado por Sazidul Islam.

data-analysis data-visualization eda exploratory-data-analysis jupyter-notebook python

Last synced: 02 Feb 2025

https://github.com/priboy313/pandasflow

A set of custom python modules for friendly workflow on pandas

catboost data-analysis data-science pandas phik python scikit-learn shap

Last synced: 13 Feb 2025

https://github.com/lijesh010/netflix_dataset_exploratory_data_analysis_python_project

This repository contains an Exploratory Data Analysis (EDA) Python project on the Netflix dataset. The purpose of this project is to gain insights and better understand the characteristics of the content available on Netflix, including movies and TV shows.

data-analysis data-exploration data-visualization exploratory-data-analysis jupyter-notebook python

Last synced: 03 Feb 2025

https://github.com/li-pearl/gene-count-normalizer

First step of data wrangling in MERFISH data project

data-analysis merfish merscope python

Last synced: 12 Jan 2025

https://github.com/vitia-fritelle/analise_dieese

Análise realizada com base nos dados extraídos do site https://www.dieese.org.br/analisecestabasica/salarioMinimo.html

data-analysis economic-data

Last synced: 15 Feb 2025

https://github.com/viztruth/google-play-store-data-analysis

This repository contains all the materials of my final project 'Google Play store Data Analysis' for the 'Telling Stories with Data' course at PES University.

data-analysis data-visualization

Last synced: 26 Jan 2025

https://github.com/alxrm/scent-of-literature

Russian literature sentiment analysis in terms of very small dataset

classification data-analysis sentiment-analysis sklearn tf-idf

Last synced: 01 Feb 2025

https://github.com/rohithsaji97/open_gate_dip

An automatic gate opening system with an additional parking system (using Raspberry PI).

automated data-analysis digital-image-processing opencv python3 raspberry-pi-3 trained-models

Last synced: 19 Feb 2025

https://github.com/abhinavsharma07/fraud_analytics-credit_card_fraud_detection

The aim of this project is to predict fraudulent credit card transactions with the help of different machine learning models.

banking data-analysis decision-trees hyperparameter-optimization machine-learning-algorithms pipelines random-forest-classifier svm-classifier xgboost-classifier

Last synced: 10 Jan 2025

https://github.com/nafisalawalidris/building-a-clustering-model-for-customer-segmentation

Customer Segmentation Using Clustering: This repo applies clustering algorithms to a customer transaction dataset, grouping similar customers together based on their purchasing behavior. Targeted marketing strategies can be developed by analyzing distinct customer segments.

clustering customer-segmentation data-analysis data-visualization k-means machine-learning marketing-analytics unsupervised-learning

Last synced: 23 Jan 2025

https://github.com/olekscode/covidanalysis

A setup for COVID-19 data analysis in Pharo

coronavirus covid-19 data-analysis pharo

Last synced: 11 Feb 2025

https://github.com/sandk21/detection_faux_billets

Algorithme de détection de faux billets selon leurs dimensions géométriques et application web pour générer les prédictions

data-analysis data-science data-visualization machine-learning pandas python scipy sklearn streamlit

Last synced: 02 Feb 2025

https://github.com/aadityatamrakar/futures_spread_chart

Cash Market & Futures Daily Spread Chart - NSE Stocks

data data-analysis data-mining expressjs nodejs requests

Last synced: 27 Jan 2025

https://github.com/vi/rendercsv

Tool to convert CSV table to a picture.

animation csv csv2pic csv2png data-analysis picture png table table-renderer visualization

Last synced: 16 Oct 2024

https://github.com/msthamizh/airbnb_analysis

Developing a Streamlit application enabling users to explore and analyze Airbnb listing data. This application allows users to interactively visualize geospatial distributions of listings, analyze pricing trends, and explore availability patterns across different locations. Integrates MongoDB Atlas for data storage and PowerBi for advanced insights

data-analysis data-cleaning data-visualization json mongodb pandas-dataframe plotly powerbi python streamlit

Last synced: 10 Feb 2025

https://github.com/chaganti-reddy/case-study-market-segmentation

Market Segmentation Case Study Analysis using Clustering

case-study data-analysis machine-learning market-segmentation plotting

Last synced: 20 Jan 2025

https://github.com/yash-kavaiya/ai-analytics

This is a Streamlit app that uses Pandas and AI to perform data analytics on uploaded CSV files.

data-analysis generative-ai pandas streamlit

Last synced: 24 Dec 2024

https://github.com/palwisha-18/weather-api

Weather API built in Flask

data-analysis flask html pandas python

Last synced: 10 Feb 2025

https://github.com/nikhilash45/power-bi-vsualisation-of-joins

In This Power Bi Report User Can Visualis Join By Themselves , and it is easy to understand joins now.

business-analytics business-intelligence data data-analysis data-visualization joins powerbi sql visualization

Last synced: 11 Jan 2025

https://github.com/gholamrezadar/favourite-youtube-channels

this program goes through your youtube watch history and sorts channels based how many of their videos you have watched!

data-analysis data-visualization python

Last synced: 20 Feb 2025

https://github.com/asifdotexe/flipkart-electric-scooter-data-analysis

In this project, I have web scraped Electric Scooter data from Flipkart and turn it into a csv file for further analysis

beautifulsoup4 data-analysis data-science flipkart webscraping

Last synced: 15 Jan 2025

https://github.com/asifdotexe/quickvu

Quick VU: No-code, data cleaning analysis and visualization tool built on Streamlit. Quickly clean, visualize, explore, and understand data relationships and correlations with ease. Perfect for analysts, business users, and anyone looking to gain data insights—without writing a single line of code.

automation data-analysis data-cleaning data-visualization python3 streamlit-application toolkit

Last synced: 15 Jan 2025

https://github.com/asifdotexe/air-quality-analysis-aqa

AQA is a data-driven project focused on analyzing air quality data sourced from data.gov.in. The project encompasses data preprocessing, analysis, and visualization to gain insights into air pollution levels across various locations in India. By examining six key pollutants, the project aims to raise awareness about the environmental issues

aqi-analysis data-analysis data-preprocessing data-science data-visualization presentation

Last synced: 15 Jan 2025

https://github.com/gholamrezadar/most-profitable-actors

Finds the list of actors with the most boxoffice profit using TMDB API.

crawling data-analysis tmdb

Last synced: 20 Feb 2025

https://github.com/gonzalo123/pivot.pandas

Data Analysis with Python. Pivot tables with Pandas

data-analysis jupyter-notebook pandas pivot-tables python

Last synced: 27 Jan 2025

https://github.com/ayaanjawaid/google_playstore_data_analysis

This project provides an in-depth analysis of Google Play Store apps and user reviews, focusing on understanding app performance, user sentiment, and key trends in app categories. Using Python, I performed data cleaning, feature engineering, and exploratory data analysis (EDA) on app data and reviews.

data-analysis eda html numpy pandas-dataframe plotly python vizualisation

Last synced: 31 Oct 2024

https://github.com/jofaval/mlai-portfolio

My portfolio about Data Analysis, Machine Learning and A.I.

computer-science data-analysis data-science machine-learning portfolio python

Last synced: 04 Feb 2025

https://github.com/shuklayash02/data_analysis_using_r

Covid19 analysis and cleaning of data where the death age and deaths of specific gender is cleaned and analysed

analysis cleaning-data data-analysis data-visualization rprogramming

Last synced: 15 Feb 2025

https://github.com/manwithacap/by-the-metric-match

🎲🃏 A game data tracker for your board/card/video games!

data-analysis data-visualization games jupyter-notebook python utility

Last synced: 08 Jan 2025

https://github.com/fbraza/paris_airbnb

Analysis of Paris AirBnB data using R and Shiny

analysis data data-analysis paris-airbnb r shiny

Last synced: 26 Jan 2025

https://github.com/vidhi1290/zomato-data-analysis

Zomato Data Analysis - Explore the world of Zomato restaurant data through Python and data analysis. Uncover trends and insights using Pandas for data manipulation and Matplotlib for visualization. Join us in this journey to reveal the hidden stories within the data!

data-analysis data-analysis-python data-science data-visualization dataprocessing machine-learning machine-learning-algorithms matplotlib numpy pandas python scikit-learn zomato-data-analysis

Last synced: 02 Feb 2025

https://github.com/dannyben/datamix

DSL for manipulating tabular data

csv data data-analysis data-engineering gem ruby tabular-data

Last synced: 02 Feb 2025

https://github.com/karatechop/noaa-storm-database-data-analysis

Analysis of population health and economic consequences of events documented in the U.S. National Oceanic and Atmospheric Administration’s (NOAA) storm database.

data-analysis knitr r rmarkdown

Last synced: 21 Jan 2025

https://github.com/flyingfathead/neurograph-framework

A versatile tool for visualizing entropy loss in TensorFlow-based neural network training, providing insightful scatter plots with annotations.

data-analysis data-analysis-python data-visualization entropy graph graphs neural-network neural-networks neural-networks-visualization nn python python3 tensorflow tensorflow2 training visualization visualization-tools

Last synced: 11 Jan 2025

https://github.com/happybono/sonatasmooth

Provides three different noise reduction algorithms for smoothing out data : Rectangular Averaging, Binomial Median Filtering, and Binomial Averaging. It processes data from a list and displays the results in another list.

algorithms average binomial binomial-coefficient binomial-theorem calibration csharp data-analysis data-calibration dynamic-noise-reduction median noise-algorithms noise-reduction noise-reduction-kernel outliers rectangular-averaging windows-desktop windows-desktop-application windows-forms winforms

Last synced: 11 Feb 2025

https://github.com/gmbeddard/ee152-realtime_embedded_systems-finalproject

An STM32-based implementation of the Pan-Tompkins algorithm for real-time QRS detection. Includes robust debugging tools, heart rate monitoring, and live ECG signal support via a python graphing script.

cpp-programming data-analysis ecg embedded-c freertos stm32

Last synced: 20 Jan 2025

https://github.com/pseudomanifold/us-inauguration-speeches

Data & feature extraction for U.S. inauguration speeches

data-analysis data-science inauguration politics speech speeches

Last synced: 17 Feb 2025

https://github.com/antononcube/wl-datareshapers-paclet

Wolfram Language (aka Mathematica) paclet for data reshaping functions, like, long- and wide form, cross tabulation, etc.

contingency-table cross-tabulation data-analysis data-transformation long-form wide-form

Last synced: 08 Feb 2025

https://github.com/adagio/ivoox_episodes

iVoox Episodes: Scraping & Analysis

beautifulsoup4 data-analysis ivoox pandas python python3 scraping

Last synced: 18 Feb 2025

https://github.com/danhenriquex/data-science-project

The main goal of this project was to apply the concepts of data visualization and analysis.

data-analysis data-science numpy pandas python

Last synced: 14 Jan 2025

https://github.com/nirmalvatsyayan/data-analyst-nanodegree

Udacity data analyst nanodegree project submissions and learning

data-analysis numpy pandas python statistics udacity-data-analyst-nanodegree

Last synced: 12 Jan 2025

https://github.com/garciparedes/castile-and-leon-crops

Data Analysis of Castile and Leon Crops Area over the last years

castile-and-leon crops data-analysis data-science jupyter jupyter-notebook notebook spain

Last synced: 16 Jan 2025

https://github.com/vara-co/sql-challenge

EmployeeSQL "Data modeling, data engineering, and data analysis."

data-analysis data-engineering data-modeling databases employee-database erd erdiagram postgres postgresql schema sql tables

Last synced: 02 Feb 2025

https://github.com/danhenriquex/final-project-ia

Artificial Intelligence Project - Analysis of sentiments of news that impact the value of shares.

data-analysis machine-learning supervised-learning

Last synced: 14 Jan 2025

https://github.com/richardwarepam16/rental_analysis_using_python_and_sql

Maximizing Rental Profits: Data-Driven Strategies for a Movie Rental Store

data-analysis data-analytics python3 rental-management sakila-db sqlite3

Last synced: 22 Jan 2025

https://github.com/carmoreno/analisisaccidentalidadbogota

Data Analysis about traffic accidents at Bogotá, Colombia.

data-analysis data-science jupyer-notebook matplotlib numpy pandas scikit-learn

Last synced: 05 Jan 2025

https://github.com/vara-co/space-missions

Space Missions Over Time (1957-2022): Successes vs Failures, and Rocket Usage

data-analysis data-analysis-python history matplotlib pandas pandas-python space space-race spaceships team-project

Last synced: 02 Feb 2025

https://github.com/dina-hosny/explore-us-bike-share-data-project

Explore US Bike Share Data project - FWD Data Analysis Professional Track. In this project, I used Python to explore data related to bike share systems for three major cities in the United States and answer questions about it by computing descriptive statistics.

data-analysis data-science numpy pandas python

Last synced: 13 Jan 2025

https://github.com/iguptashubham/ott-churn-eda-ml

Understanding why customers discontinue their subscriptions will be crucial in optimizing the user experience, reducing churn, and maximizing customer lifetime value. By using Machine learning model to predict the Customer Churn.

data-analysis data-analysis-project data-science data-science-portfolio data-science-projects data-visualization machine-learning python

Last synced: 14 Jan 2025

https://github.com/vara-co/pandas-challenge

PyCitySchools - Analysis between budget and academic performance in schools

budget-analysis data-analysis jupiter-notebook pandas-dataframe python school-performances

Last synced: 02 Feb 2025

https://github.com/jasoncobra3/whatsapp_chat_analyzer

WhatsApp Chat Analyzer is a powerful tool that provides insightful analytics from your WhatsApp conversations. Whether you're curious about your chatting habits, want to analyze group dynamics, or need to extract meaningful data from your conversations, this tool has got you covered!

data-analysis data-science data-visualization machine-learning streamlit streamlit-webapp whatsapp-chat whatsapp-chat-analyzer

Last synced: 10 Feb 2025