Ecosyste.ms: Awesome

An open API service indexing awesome lists of open source software.

Awesome Lists | Featured Topics | Projects

Data analysis

Data analysis is a process of inspecting, cleansing, transforming, and modeling data with the goal of discovering useful information, informing conclusions, and supporting decision-making.

https://github.com/bkataru/physics-ia

Programs and files written for Astrostatistics for IB Physics IA. Topic: Visualizing and analyzing the habitable zones for 150,000 stars from the hipparcos catalogue.

astronomical-algorithms astronomy astrophysics astrostatistics data-analysis data-science data-visualization matplotlib plotting

Last synced: 22 Dec 2024

https://github.com/verbasik/yandex.practicum.datascience

Портфолио проектов Data Science, выполненных в рамках профессиональной переподготовки в Яндекс.Практикум. Включает исследования в области финансов, недвижимости, кинопроката и других, с использованием статистики, машинного обучения и анализа данных.

data-analysis data-science machine-learning yandex-praktikum

Last synced: 10 Jan 2025

https://github.com/andr3w03/bike-sharing-dashboard

Bike Sharing Data Analysis Streamlit Dashboard

dashboard data-analysis data-visualization python streamlit

Last synced: 29 Jan 2025

https://github.com/kevinyang372/san-francisco-crime-data-analysis

An ARIMA prediction model for forecasting potential crimes based on users' time and location

data-analysis machine-learning

Last synced: 30 Jan 2025

https://github.com/milind220/hk-air-quality-analysis

My final project for a statistics and data analysis course. Whew that was a lot of graphs!

data-analysis jupyter-notebook numpy pandas python python3 scipy seaborn statistics

Last synced: 03 Jan 2025

https://github.com/poga/dat-ipynb-demo

use ipython notebook to analyze data in dat archive

dat data-analysis distributed jupyter-notebook

Last synced: 08 Feb 2025

https://github.com/joanacmbarros/ardm-website

Website to support the R in Pharma 2023 workshop on the ARDM

analysis-results automation clinical-data data-analysis data-model r-in-pharma

Last synced: 09 Feb 2025

https://github.com/winter000boy/dsa-practice

This repository holds my solutions for LeetCode’s Pandas playlists. Each section includes code and notes on using Pandas to handle real-world data tasks efficiently. Perfect for anyone looking to deepen their understanding of data manipulation with Pandas.

data-analysis data-science leetcode leetcode-python pandas-python python3

Last synced: 30 Jan 2025

https://github.com/tathithienthanh/datamining-banking-dataset

Implement some learned data mining techniques and predict if the client will subscribe to a term deposit

apriori association-rules classification clustering data-analysis data-mining data-processing google-colab ipynb kmeans naive-bayes py python scikit-learn svm visualization

Last synced: 25 Jan 2025

https://github.com/grburgess/gbm_kitty

Database, reduce, and analyze GBM data without having to know anything. Curiosity killed the catalog.

3ml catalogue data-analysis fermi-science grbs pipelines

Last synced: 23 Jan 2025

https://github.com/ziaeemehr/itng_nest

Nest Simulator quick guides and examples, adding new model using NESTML

computational-neuroscience data-analysis nest-simulator neuroscience

Last synced: 07 Feb 2025

https://github.com/adirthaborgohain/community-data-analysis

Data and Visual Analysis on several different communities generated using Louvain Algorithm in Neo4j on the dblp dataset.

data-analysis lda python

Last synced: 05 Feb 2025

https://github.com/mwoss/mlflow-stock-market-example

Stock market prediction - machine learning pipeline using MLFlow.

anaconda data-analysis databricks example lstm mlflow python stock-market stock-price-prediction tutorial

Last synced: 24 Jan 2025

https://github.com/vishrut-b/end-to-end-data-analytics-with-python-and-sql

This project involves the data cleaning and SQL-based analytics of a retail orders dataset using Python and SQL. It focuses on preprocessing data, followed by detailed analytics to extract insights on sales trends and product performance.

data-analysis python retail sql sql-server sqlalchemy

Last synced: 05 Feb 2025

https://github.com/narius2030/sakila-datawarehouse-ssis

Implement a simple data warehouse to store Saklia data - Create data pipelines for extract, transform and load data from source to warehouse - Retrieve data in warehouse to explore and do several analysis

data-analysis data-integration data-modeling data-visualization excel microsoft-sql-server power-bi ssas ssis

Last synced: 07 Feb 2025

https://github.com/atxtechbro/flightradar24

Advanced Python application leveraging the power of APIs and the pandas library to retrieve and perform in-depth analysis of flight data from Flightradar24. It uncovers insights such as the most common departure and arrival cities, contributing to the field of aviation data science.

api-integration aviation-data data-analysis data-science data-visualization flightradar24-api pandas-library python requests-library web-scraping

Last synced: 25 Jan 2025

https://github.com/thecoderpinar/reta

🍃 Explore the world of renewable energy production, analyze historical data, and predict sustainable energy trends. Join us on the journey to a greener future!

arima clean-energy data-analysis data-science data-visualization energy-future forecasting-models innovation renewable-energy sustainability time-series

Last synced: 09 Feb 2025

https://github.com/ehtisham-sadiq/ai-pioneers-datascience-arena

This repository is dedicated to the AI Amigos team's participation in the Artificial Intelligence (AI) competition with a focus on Data Science.

artificial-intelligence competition data-analysis data-science data-visualization machine-learning model-building model-evaluation numpy pandas python3 supervised-learning unsupervised-learning

Last synced: 11 Jan 2025

https://github.com/tirendazacademy/data-sets

Data sets for Tirendaz Akademi Youtube

data-analysis dataset

Last synced: 01 Jan 2025

https://github.com/rajshrestha86/police-brutality-data-analysis

In this project, we analyze the events after George Floyd’s death. The protests and riots across the United States and sentiments of news articles of three different news sources that have different political leaning. We will see how these media reacted after Floyd’s death and see the effect of media bias on the sentiments of news for #BlackLivesMatter and #AllLivesMatter movement. We will also see if there is a correlation between the police budget and the number of protests. This analysis will help us to see if there is really a need for defunding police to reduce police brutality and casualties. We will also see the correlation of partisan segregation and number of deaths to see if political preference has an effect on the number of deaths by police.

data-analysis matplotlib pandas python sentiment-analysis web-scraping

Last synced: 07 Feb 2025

https://github.com/zachlagden/spotify-listening-analyzer

A comprehensive Python tool for analyzing your Spotify listening history data.

analytics data-analysis pandas python spotify-web-api spotipy

Last synced: 07 Feb 2025

https://github.com/manmolecular/http-response-clustering

:chart_with_downwards_trend: Clustering of HTTP responses using k-means++ and the elbow method

data-analysis elbow-method elbow-plot jupyter k-means-plus-plus python3

Last synced: 16 Jan 2025

https://github.com/kenvilar/data-analysis-using-python

Transforming a description of a location from an analyzed CSV file data using Pandas with Python 3

bs4 data-analysis jupyter pandas python python3 requests xlrd

Last synced: 18 Nov 2024

https://github.com/programmer-rd-ai/moviedatascraper

Explore the cinematic universe with our IMDb web scraping project! Dive into movie data with ease, uncovering insights from cast to critical reviews. With dynamic visualizations and reliable data, let's journey through the world of movies like never before. Lights, camera, analysis!

beautifulsoup beautifulsoup4 data data-analysis jupyter-notebook matplotlib numpy pandas programming python python3 scraping seaborn software web

Last synced: 12 Jan 2025

https://github.com/louislefevre/sstubs-miner

Data mining and analysis for the ManySStuBs4J dataset.

data-analysis data-mining manysstubs4j-dataset msr

Last synced: 05 Feb 2025

https://github.com/vandita2020/merra2_nasa_wind_speed_analysis

In this study, we aim to explore the vulnerability of power grids in the south-east region of the USA with the help of data analysis tools and machine learning algorithms

data-analysis data-science machine-learning-algorithms python

Last synced: 11 Jan 2025

https://github.com/c0deta1ker/matbase

MatBase provides access to an extensive database of material parameters, inelastic mean free paths (IMFP), photoionization binding energies, cross sections, and asymmetry parameters. Additionally, MatBase includes a suite of functions for users to load, process, model and fit their own data, making it an indispensable tool in the field.

cross-sections crystal-structure crystallography data-analysis data-fitting database electron imfp imfp-calculator-matlab material material-database matlab matlab-application matlab-gui matlab-toolbox pes-modelling photoelectron-spectroscopy photoionization simulation xps

Last synced: 30 Nov 2024

https://github.com/farahibrar/kpmg-job-simulation

This repository showcases my work from the KPMG Technology Job Simulation by Forage, focusing on Data Analytics and Cloud Engineering. Explore how I tackled real-world business challenges through sales data analysis, regional growth strategies, and AWS architecture design, highlighting my analytical and technical expertise.

aws-architecture business-intelligence cloud-engineering cloud-strategy-and-design data-analysis data-visualization fintech-solutions forage kpmg kpmg-careers python-for-data-analysis sales-data-insights sustainable-retail-analysis

Last synced: 25 Jan 2025

https://github.com/frikishaan/browsing-history-analysis

This is a data analysis of my browsing history for the last 7 months.

browsing-history data-analysis jupyter-notebook python

Last synced: 09 Jan 2025

https://github.com/pratishtha-abrol/astronomy-dataanalysis

A key technique in Data Driven Astronomy

astronomy astropy crossmatch data-analysis

Last synced: 05 Feb 2025

https://github.com/depressioncenter/data-and-design-core

Code developed by the EFDC Data and Design Core team to support mental health research.

data-analysis data-science efdc inference r statistical-analysis umich

Last synced: 25 Jan 2025

https://github.com/antononcube/wl-outlieridentifiers-paclet

Wolfram Language (aka Mathematica) paclet that provides outlier identifier functions.

data-analysis hampel outlier-detection outliers

Last synced: 08 Feb 2025

https://github.com/goggle/dataisbeautiful

Some data analysis Jupyter notebooks, mainly indented for submissions on the subreddit /r/dataisbeautiful.

data-analysis data-visualisation jupyter-notebook notebook reddit

Last synced: 26 Dec 2024

https://github.com/gjbex/python-dashboards

Repository that contains material for training sessions on creating dashboards using Python.

dash dashboard data-analysis data-exploration data-science data-visualization panel python streamlit training training-materials visualization

Last synced: 22 Nov 2024

https://github.com/leonism/customer-predictive-analysis

Explore this repository, a comprehensive resource offering an in-depth guide to conducting customer predictive analysis using cutting-edge machine learning techniques, all within the intuitive framework of Dataiku.

data-analysis data-model data-science data-visualization dataiku machine-learning predictive-modeling

Last synced: 03 Feb 2025

https://github.com/rapidsurveys/oldr

An Implementation of the Rapid Assessment Method for Older People (RAM-OP)

assessment data-analysis epidata estimate odk older-people r ram-op ranalyticflow rapid-assessment

Last synced: 24 Dec 2024

https://github.com/john-science/data_science_by_example

Examples of Data Science Tools & Libraries

data-analysis data-science ipython pandas

Last synced: 18 Nov 2024

https://github.com/ysayaovong/car-sales

An analysis of car sales data to uncover market trends and insights through data cleaning, analysis, and visualization.

automotive business-analysis data-analysis data-cleaning data-visualization market-trends matplotlib pandas python sales-data scikit-learn seaborn

Last synced: 23 Jan 2025

https://github.com/billgewrgoulas/hypothesis-testing-on-37-seasons-of-nba

First assignment for the course Data Mining @CSE.UOI

data-analysis data-science numpy scipy seaborn statistics

Last synced: 16 Jan 2025

https://github.com/ezzz-lui/rsm-evaluationproject

Este repositorio es donde esta documentado nuestro proyecto para RSM por parte de actividad final para el bootcamp Data Analyst

data-analysis python

Last synced: 30 Jan 2025

https://github.com/nakshjainsonigara/vba-canteenmanagementsystem

The Canteen Management System is a comprehensive software solution designed to modernize and optimize canteen operations. It aims to simplify the complexities of managing a canteen by automating key processes such as order management, payment processing, and report generation.

canteen canteen-mangement-system charts data-analysis email excel microsoft payment-gateway vba vba-excel vba-macros word

Last synced: 23 Dec 2024

https://github.com/shuklayash02/data_analysis_using_r

Covid19 analysis and cleaning of data where the death age and deaths of specific gender is cleaned and analysed

analysis cleaning-data data-analysis data-visualization rprogramming

Last synced: 23 Dec 2024

https://github.com/elcaiseri/udacity-advanced-data-analysis

UDACITY - Advanced-Data-Analysis Track Project

data-analysis python

Last synced: 01 Jan 2025

https://github.com/aekanshd/crazytics-suicidesindia

Basic interpretation of the Suicides in India data-set using R.

data-analysis data-science graph india r suicides

Last synced: 15 Jan 2025

https://github.com/rupav/fifa17-detailed-analysis

⚽ FIFA 17 data analysis using various Machine Learning Algorithms. ⚽

data-analysis data-visualization fifa17 machine-learning-algorithms radar-chart

Last synced: 09 Feb 2025

https://github.com/akash1070/project---applied-statistics-

To dive deep into this data & find some valuable insights.

data-analysis data-science python statistics

Last synced: 29 Jan 2025

https://github.com/akash1070/data-science-advanced-analytics-virtual-experience-program

The BCG Open-Access Data Science & Advanced Analytics Virtual Experience Program

data-analysis data-science machine-learning-algorithms

Last synced: 29 Jan 2025

https://github.com/johnsesana/eda-video-game-sales

Exploratory Data Analysis on Public Datasets

data-analysis data-visualization excel

Last synced: 17 Jan 2025

https://github.com/narenkhatwani/arkouda-projects

This repository contains the source codes of the projects done using Arkouda (a software package that allows a user to interactively issue massive parallel computations on distributed data using functions and syntax that mimic NumPy, the underlying computational library used in most Python data science workflows.)

arkouda data-analysis data-analytics data-science high-performance high-performance-computing highperformancecomputing numpy pandas parallel-computing parallel-processing parallelization python

Last synced: 06 Feb 2025

https://github.com/nysportsfan/Gun-Violence-in-the-US

This repository contains all the relevant files for my first capstone project as part of the Springboard Data Science Career Track.

data-analysis data-science data-visualization machine-learning python3 statistics

Last synced: 16 Nov 2024

https://github.com/agungbudiwirawan/supermarket_sales_dashboard-sales-analysis-using-pivot-table-and-chart

The objective of this project is to analyze supermarket sales using pivot table and chart in Microsoft Excel. After doing the analysis, I created an interactive dashboard that aims to make it easier for the audience to explore data.

dashboard data-analysis data-science data-visualization excel sales-dashboard spreadsheet

Last synced: 06 Feb 2025

https://github.com/akash1070/data-science-virtual-internship-by-accenture

data merging and data cleaning in python as well as data visulaisation with dashboard in Tableau.

data-analysis data-cleaning data-science python3 tableau visualization

Last synced: 29 Jan 2025

https://github.com/gher-uliege/stareso-data-processing

A set of tools to read, plot and process data from STARESO

coastal corsica data-analysis data-processing ocean-sciences oceanography

Last synced: 05 Feb 2025

https://github.com/mxagar/airbnb_data_analysis

An analysis of the AirBnB dataset from Euskadi / the Basque Country.

airbnb data-analysis data-science eda feature-engineering modeling pandas regression

Last synced: 23 Dec 2024

https://github.com/gher-uliege/bluecloud-plankton

Spatial interpolation of plankton data using a neural network

data data-analysis data-visualization neural-network oceanography

Last synced: 05 Feb 2025

https://github.com/agustinmusanti/delitosencaba-proyectofinal-dataanalytics-coderhouse

En este repositorio muestro mi proyecto final en el curso "Data Analytics" de Coderhouse.

data-analysis excel powerbi

Last synced: 23 Dec 2024

https://github.com/vitia-fritelle/analise_dieese

Análise realizada com base nos dados extraídos do site https://www.dieese.org.br/analisecestabasica/salarioMinimo.html

data-analysis economic-data

Last synced: 23 Dec 2024

https://github.com/bretsw/subreddits-over-time

Study of the r/Teachers and r/education subreddits over time

data-analysis dataset reddit

Last synced: 06 Feb 2025

https://github.com/fer-aguirre/taller-cookiecutter

Taller sobre cómo usar Cookiecutter para análisis de datos.

cookiecutter data-analysis project-template workshop

Last synced: 23 Dec 2024

https://github.com/virajbhutada/movie-rental-store-analytics-sql-powerbi-excel

Dive into the DVD rental industry with my Capstone project, Movie Rental Analytics. Analyzing the Sakila DVD Rental Store Database, I extract insights through exploratory data analysis (EDA) and Power BI visualizations. Findings inform strategies for optimizing film inventory, enhancing business operations, and customer experiences.

business-intelligence capstone-project customer-behavior-analysis data-analysis data-science excel exploratory-data-analysis film-ratings mece movie-database movie-rental mysql powerbi powerbi-visuals revenue-analysis sql sql-database

Last synced: 10 Jan 2025

https://github.com/savinrazvan/degrees

A program that computes the "degrees of separation" between two actors by identifying the sequence of movies connecting them, inspired by the Six Degrees of Kevin Bacon game. Uses IMDb-based datasets for actors, movies, and their relationships.

actor-connections ai data-analysis degrees-of-separation educational-project graph-theory imdb movie-database python six-degrees-of-kevin-bacon

Last synced: 10 Jan 2025

https://github.com/savinrazvan/heredity

An AI that assesses the likelihood of genetic traits in individuals using a Bayesian Network to analyze family genetic data, modeling genetic inheritance and mutations to infer probabilities of gene presence and trait expression.

ai bayesian-network biological-data-analysis data-analysis educational-project family-genetics genetic-inheritance genetic-traits heredity mutation-modeling probability-calculation python

Last synced: 10 Jan 2025

https://github.com/sumitkumargiri/machine-learning-project

This repository contain all the best practices for managing Github repository.

data-analysis github machine-learning opensource project python

Last synced: 19 Jan 2025

https://github.com/virajbhutada/coursera-google-data-analytics-capstone

A repository containing the Capstone project for the Google Data Analytics Professional Certificate, focusing on analyzing FitBit fitness tracker usage data to derive insights relevant to Bellabeat, a wellness technology company.

coursera data-analysis data-visualization fitbit fitness-tracker google google-data-analytics-capstone-project python-programming

Last synced: 10 Jan 2025

https://github.com/fatihilhan42/eda-spacex-launches-falcon9-and-falcon-heavy

In this project, we analyze the space flight data of Spacex space research company Falcon 9 rocket.

data-analysis data-science data-visualization eda elonmusk spacex

Last synced: 29 Jan 2025

https://github.com/nafisalawalidris/analyzing-nobel-prize-dataset-demographics-and-trends

This project analyses a Nobel Prize dataset using Python and data analysis libraries. It explores the distribution of winners by category and country, examines the proportion of female winners over time, investigates the age of winners when they received the prize and identifies the oldest and youngest recipients.

age-at-award country-distribution data-analysis data-manipulation dataset demographics filtering gender-balance grouping nobel-prize notable-laureates python trends visualisation winners

Last synced: 23 Jan 2025

https://github.com/nafisalawalidris/northwind-traders-sales-analysis

Northwind Traders Sales Analysis project, which analyses sales data for a fictitious company. It utilises the Northwind Database and includes SQL queries to provide insights on employees, products, suppliers and revenue. The project aims to help the company gain valuable information for business decision-making.

business-insights data-analysis database northwind-traders sales sql

Last synced: 23 Jan 2025

https://github.com/nafisalawalidris/hici-african-foods

HiCi African Foods: Excel dashboard & pivot table analysis of EU food rejection data to identify risks & recommend focus areas for market expansion.

data-analysis data-cleaning data-visualization eu-food-rejection excel-dashboard hici-african-foods market-expansion pivot-tables

Last synced: 23 Jan 2025

https://github.com/as16082023/music-store-analysis

This project involves analyzing music store data using SQL queries in MySQL workbench to enhance decision-making, identify trends, and understand customer behavior

data-analysis music-store-analysis mysql sql

Last synced: 23 Dec 2024

https://github.com/nafisalawalidris/springforth-university-foodbank

Springforth University Food Bank: A collaborative initiative with UNESCO to address student food insecurity. Contains code and resources for the web application, data analysis, and insights into the prevalence and impact of food insecurity on academic performance.

academic-performance collaborative-initiative data-analysis data-visualization excel pivot-tables powerbi springforth-university-food-bank student-food-insecurity unesco

Last synced: 23 Jan 2025

https://github.com/nafisalawalidris/logistic-regression-model-for-breast-cancer-recurrence-prediction

Predicting Breast Cancer Recurrence - A logistic regression model using patient attributes to classify recurrence risk. Dataset analysis and model evaluation. Contributions welcome.

breast-cancer classification-model data-analysis data-science healthcare logistic-regression machine-learning python recurrence-prediction scikit-learn

Last synced: 23 Jan 2025

https://github.com/nafisalawalidris/data-analysis-with-python

This repo features Jupyter Notebook labs for learning data analysis with Python. Explore data acquisition, wrangling, visualization, modeling, and evaluation. Enhance your skills in Python data analysis.

data-acquisition data-analysis data-science data-wrangling exploratory-data-analysis feature-engineering machine-learning model-development model-evaluation-and-refinement pandas

Last synced: 23 Jan 2025

https://github.com/nafisalawalidris/sales-performance-dashboard

Sales Performance Dashboard: Analyze and visualize sales data using Power BI. Gain insights into trends, customer segments, product performance, and geographic distribution. Make data-driven decisions to optimize sales strategies and maximize revenue.

analytics-revenue dashboard-power-bi data data-analysis intelligence-sales optimization performance sales visualization-business

Last synced: 23 Jan 2025

https://github.com/nafisalawalidris/dr.-semmelweis-and-the-discovery-of-handwashing

Uncover the revolutionary impact of handwashing on mortality rates in healthcare. Explore the story of Dr. Semmelweis and his groundbreaking findings.

data data-analysis handwashing healthcare-analysis medical-breakthrough mortality-rates

Last synced: 23 Jan 2025

https://github.com/nafisalawalidris/buybuy-e-commerce-company

The BuyBuy E-commerce Company repository is a comprehensive hub for the company's e-commerce platform. It includes source code, documentation, and data analysis insights, providing a data-driven approach to improve customer experience, drive revenue, and inform decision-making.

buybuy cleaning-data company customer-experience data data-analysis decision-making documentation e-commerce excel insights postgresql repository revenue source-code sql

Last synced: 23 Jan 2025