Ecosyste.ms: Awesome

An open API service indexing awesome lists of open source software.

Awesome Lists | Featured Topics | Projects

Data analysis

Data analysis is a process of inspecting, cleansing, transforming, and modeling data with the goal of discovering useful information, informing conclusions, and supporting decision-making.

https://github.com/raccoon-hero/gender-equality-tracker

A web application visualizing gender equality metrics with a focus on Ukraine. Built with Flask, it's powered by live data from global open sources, with dynamic research insights and analysis.

chartjs css dashboard data-analysis data-visualization flask frontend gender-equality global-metrics html linked-data openalex opendata python representation semantic-web ukraine webapp wikidata world-bank-api

Last synced: 27 Dec 2024

https://github.com/montanaz0r/testing-if-mma-math-deduction-works-using-ufc-fighters-data

The probabilistic reasoning about phenomenon called MMA math using UFC fighters data and Python.

bayesian-inference data-analysis data-science graphviz jupyter-notebook pandas python scipy statistics

Last synced: 14 Dec 2024

https://github.com/ayobami6/tweet-data-analysis

WeRateDogs Tweets Scrape using twitter Api

data-analysis data-science twitter webscraping

Last synced: 13 Nov 2024

https://github.com/ibnaleem/cyberchef-discord

A versatile Discord bot that implements CyberChef's features for encoding, decoding, encrypting, compressing, analysing data directly and more in your Discord server

compression cti cyberchef cybersecurity data-analysis data-manipulation discord-bot discord-js encoding encryption hashing infosec parsing redteam

Last synced: 07 Dec 2024

https://github.com/hfxbse/dhbw-data-analysis

Exploratory data analysis R notebook for the module T3INF4333 "Grundlagen Data Science" held in 2024 by Lothar B. Blum at the DHBW Stuttgart.

data-analysis data-science dhbw dhbw-stuttgart ggplot2 r r-notebook

Last synced: 20 Dec 2024

https://github.com/antononcube/wl-quantileregression-paclet

Wolfram Language (aka Mathematica) paclet that provides various Quantile Regression functions.

data-analysis machine-learning quantile-regression time-series time-series-analysis

Last synced: 15 Dec 2024

https://github.com/akash1070/data-science-virtual-internship-by-accenture

data merging and data cleaning in python as well as data visulaisation with dashboard in Tableau.

data-analysis data-cleaning data-science python3 tableau visualization

Last synced: 01 Dec 2024

https://github.com/valeriopagliarino/electronics-2021-unito-public

Data analysis and simulations for the course "Electronics laboratory" held at Physics Dep. - University of Turin, 2021

data-analysis electronics physics

Last synced: 06 Dec 2024

https://github.com/antononcube/wl-datareshapers-paclet

Wolfram Language (aka Mathematica) paclet for data reshaping functions, like, long- and wide form, cross tabulation, etc.

contingency-table cross-tabulation data-analysis data-transformation long-form wide-form

Last synced: 15 Dec 2024

https://github.com/draym/swmanager

Web-app to help you in your daily life raids in SpacesWars thanks to game statistics and data management

dashboard-application data-analysis data-visualization game-data game-utility

Last synced: 11 Nov 2024

https://github.com/akash1070/project---applied-statistics-

To dive deep into this data & find some valuable insights.

data-analysis data-science python statistics

Last synced: 01 Dec 2024

https://github.com/colburncodes/se_pudding_2023

This project is a React app designed to showcase research conducted by a team of data scientists and data analysts. The app is utilizing React and React-Chartjs-2

chartjs-2 data-analysis data-science data-visualization react-chartjs-2 reactjs

Last synced: 22 Nov 2024

https://github.com/akash1070/data-science-virtual-internship-by-anz

Exploratory data analysis and prediction of annual salary for customers from the dataset provided by ANZ.

data-analysis data-science predictive-analytics presentation-slides

Last synced: 01 Dec 2024

https://github.com/akash1070/data-science-advanced-analytics-virtual-experience-program

The BCG Open-Access Data Science & Advanced Analytics Virtual Experience Program

data-analysis data-science machine-learning-algorithms

Last synced: 01 Dec 2024

https://github.com/al-ghaly/airline-company-data-warehouse

Data Warehouse modeling, design, implementation, and analysis for an Airline Company.

data-analysis data-warehousing database-modeling sql-server

Last synced: 21 Nov 2024

https://github.com/jen-uis/loan-status-prediction

This repository contains project materials for the Winter STAT 206 class, University of California, Riverside, A. Gary Anderson School of Management.

data data-analysis data-analytics data-cleaning data-visualization descriptive-analytics julia julia-language jupyter-notebook predictive-analytics predictive-modeling team-collaboration

Last synced: 21 Nov 2024

https://github.com/mindful-ai-assistants/movierevenueanalysis

🎬💰 Analyze movie companies' revenue, release strategies, and financial performance using statistical techniques for actionable insights. This project explores data on total revenue, number of releases, and lifetime gross to uncover patterns that can drive strategic decisions in the film industry.

data-analysis data-science python statistical-analysis statistics

Last synced: 21 Nov 2024

https://github.com/jhrcook/wagenmaker-data-analysis

Analysis of Registered Replication Report: Strack, Martin, & Stepper (1988) by Wagenmaker et al.

data-analysis r r-project statistics

Last synced: 13 Nov 2024

https://github.com/kalebers/data_streams_parametric_t-sne

Research for Parametric T-SNE in high to low dimensional data stream, published in 2021 by Kalebe Rodrigues Szlachta and Andre de Macedo Wlodkowski, oriented by Jean Paul Barddal, Computer Science graduation from Pontifical Catholic University of Parana (PUCPR)

classifier data-analysis data-science data-visualization machinelearning parametric parametric-tsne python tsne-algorithm tsne-visualization

Last synced: 20 Nov 2024

https://github.com/frankelavsky/political-polarization-challenge

I had 8 hours to build a solution to the research claim that "politics have become more divided in the past 50 years." You can navigate views of congressional voting patterns using arrows. I used d3, require, MVC pattern, and vanilla js. Pre-processed the data in node.js. Data is from DW-NOMINATE: ftp://k7moa.com/junkord/HANDSL01114A20_STAND_ALONE_30.DAT

client-side css d3 d3js data-analysis data-visualization frontend frontend-app html interactive interactive-visualizations javascript modular nodejs political-science politics requirejs research single-page-app visualization

Last synced: 18 Nov 2024

https://github.com/thennen/py-ivtools

This is a package for measurement and analysis of current-voltage characteristics of electronic devices.

current-voltage data-analysis data-visualization electrical-engineering emerging-technology instrumentation measurements

Last synced: 24 Nov 2024

https://github.com/unrndm/dataanalysis

artifacts and sollutions of homework for course "Data Analysis" in Magistrate of HSE during 2023-2024

2023-2024 data-analysis hse

Last synced: 05 Dec 2024

https://github.com/githubuseraccountamazing/the-amari-project

a project in which I attempted to push some of the limits of stable-diffusion while taking some data along the way

ai ai-generated-images bash data-analysis machine-learning stable-diffusion textual-inversion

Last synced: 19 Nov 2024

https://github.com/fatihilhan42/web_scraping_football_statistics_per_game_data-main

In this notebook I will describe the process of scraping data from web portal understat.com that has a lot of statistical information about all games in top 5 European football leagues.

data-analysis data-manipulation data-science data-scraping data-visualization jupyter-notebook python

Last synced: 01 Dec 2024

https://github.com/olgapavlova/agile-health-hackathon

Визуализируем здоровье спринтов разработки по сырым данным

data-analysis data-visualization figma google-sheets matplotlib pandas python sql

Last synced: 18 Nov 2024

https://github.com/carmoreno/analisisaccidentalidadbogota

Data Analysis about traffic accidents at Bogotá, Colombia.

data-analysis data-science jupyer-notebook matplotlib numpy pandas scikit-learn

Last synced: 05 Jan 2025

https://github.com/prangonghose/analysis_of_bangladesh_economic_complexity

In this project a brief analysis has been done by our team in the export economy of Bangldesh for the past three decades.

data-analysis data-science data-visualization inequalipy matplotlib pandas plotly

Last synced: 18 Nov 2024

https://github.com/sumitkumargiri/machine-learning-project

This repository contain all the best practices for managing Github repository.

data-analysis github machine-learning opensource project python

Last synced: 18 Nov 2024

https://github.com/gracysapra/r-in-data-science

This repository contains essential guides for data analysis using R, covering topics like data preparation, data reshaping, and data visualization. Each file focuses on fundamental techniques to manipulate, clean, and visualize data effectively using R programming.

data-analysis data-preparation data-reshaping data-science data-visualization data-visualizations ggplot r r-for-data-science

Last synced: 15 Dec 2024

https://github.com/ondrejhruby/countries-of-the-world

Explore global data with this repository, featuring insights, visualizations, and Python code examples on countries worldwide—perfect for enhancing your data analysis and visualization skills.

data-analysis data-science data-visualization geography jupyter-notebook machine-learning matplotlib pandas python statistics

Last synced: 21 Nov 2024

https://github.com/ayaanjawaid/google_playstore_data_analysis

This project provides an in-depth analysis of Google Play Store apps and user reviews, focusing on understanding app performance, user sentiment, and key trends in app categories. Using Python, I performed data cleaning, feature engineering, and exploratory data analysis (EDA) on app data and reviews.

data-analysis eda html numpy pandas-dataframe plotly python vizualisation

Last synced: 31 Oct 2024

https://github.com/victoriapm/analyze_a-b_test_results

Understand the results of an A/B test run by an e-commerce website.

ab-testing data-analysis ecommerce-website

Last synced: 16 Nov 2024

https://github.com/aisurjyasamantaray/sales-perfomance-analysis-dashboard

A comprehensive sales performance analysis dashboard built using Python, and visualization tools. This project includes data cleaning, descriptive statistics, correlation analysis, and insights into sales trends, profitability, and the impact of discounts. Key features include interactive visualizations using Seaborn, and Matplot

analytics annova data data-analysis data-visualization-project dataproject eda hypothesis-testing pandas-dataframe python sales-performance-analysis statistics

Last synced: 18 Nov 2024

https://github.com/alejo1630/chicago_crimes

A Jupyter Notebook with the data analysis and data visualization of crimes in Chicago from 2017 to 2023 using libraries such as seaborn and folium

data-analysis data-visualization folium pandas python seaborn

Last synced: 31 Dec 2024

https://github.com/yard1/linearordering

An R package. Provides various methods of linear ordering of data. Supports weights and positive/negative impacts.

data-analysis data-analysis-in-r data-analysis-r data-science r

Last synced: 18 Nov 2024

https://github.com/pinedah/escom_development-of-applications-for-data-analysis

This repository is a personal collection of programs, exercises, and notes from the Development of Applications for Data Analysis course at Instituto Politécnico Nacional (IPN). As part of the Bachelor's in Data Science, the course focuses on developing practical skills in Python for data analysis.

data-analysis data-science data-visualization jupyter-notebook python python-data-analysis

Last synced: 19 Dec 2024

https://github.com/pheithar/socialdata_madridcentral

Social data and visualization course at DTU - 2022. Effectiveness of Madrid Central

data-analysis data-visualization jupyer-notebook madrid python

Last synced: 30 Nov 2024

https://github.com/alejo1630/sport_stats

Data analysis of information from the summer and winter Olympic games over the years. UC Davis SQL Specialization Final Project

data-analysis jupyter-notebook olympics-dataset plotly python seaborn sql

Last synced: 31 Dec 2024

https://github.com/john-science/data_science_by_example

Examples of Data Science Tools & Libraries

data-analysis data-science ipython pandas

Last synced: 18 Nov 2024

https://github.com/noeyislearning/e-commerce-sales-analysis

E-Commerce Sales Analysis, repository contains code and analysis for an e-commerce transaction dataset from Kaggle. The goal is to uncover insights from the data that could help drive business strategy and decisions.

data-analysis data-science jupyter-notebook nextjs python typescript

Last synced: 06 Dec 2024

https://github.com/sunnybibyan/exploratory-data-analysis-eda

Welcome to the Titanic Dataset - Exploratory Data Analysis (EDA) project repository! This project aims to uncover insights from the Titanic dataset using Python and Jupyter Notebook. By analyzing key variables such as age, gender, and class, we aim to visualize relationships between passenger characteristics and survival rates.

data-analysis data-visualization jupyter-notebook python titanic-dataset

Last synced: 19 Dec 2024

https://github.com/sevdanurgenc/data-modeling-techniques-lecture-notes

In this repo, I have the course contents of Data Modelling Techniques training, which will be given to Innova Technology by the cooperation of Academy Peak Information Technologies Training and Consultancy between 25 - 26 January 2022.

data-analysis data-mining data-modeling data-science data-structure data-visualization

Last synced: 30 Nov 2024

https://github.com/sevdanurgenc/python-for-data-science-lecture-notes

In this repo, I have the course contents of Python for Data Science training, which will be given to Siemens by the cooperation of Academy Peak Information Technologies Training and Consultancy between 28 June - 1 July 2022.

data-analysis data-mining data-modeling data-science data-structure data-visualization matplotlib-tutorial numpy-tutorial pandas-tutorial

Last synced: 30 Nov 2024

https://github.com/ansh420/mcdonald_case-study

It is basically depend on the market Segment Analysis. It is a case study of mcDonald.

algorithms-implemented data-analysis python3 segmentation

Last synced: 18 Nov 2024

https://github.com/noeyislearning/cancer-linear-regression-model

The correlation between socioeconomic status and lung cancer incidence and mortality rates among low-income populations in the United States.

cancer-research data-analysis data-science data-visualization jupyter-notebook linear-regression-models matplotlib numpy python seaborn statsmodels

Last synced: 06 Dec 2024

https://github.com/noeyislearning/intro-to-data-analysis

The repository teaches skills for cleaning, exploring, analyzing, and visualizing data in Python to gain insights and make data-driven decisions.

data-analysis jupyter-notebook lecture-notes python

Last synced: 06 Dec 2024

https://github.com/noeyislearning/customer-shopping-trends

An invaluable resource for businesses aiming to optimize strategies and enhance customer satisfaction. Analyze customer attributes, purchase history, and preferences to make data-driven decisions.

business-analytics data-analysis data-science data-visualization jupyter-notebook matplotlib pandas python3 seaborn

Last synced: 06 Dec 2024

https://github.com/noeyislearning/netflix-movie-analysis

Explore movie duration trends on Netflix and assess the impact of non-feature film genres in this data-driven analysis.

data-analysis data-science data-visualization datacamp-projects jupyter-notebook netflix-analysis python3

Last synced: 06 Dec 2024

https://github.com/noeyislearning/sharpe-ratio-amazon-facebook

Explore the Sharpe Ratio and its application to evaluate the performance of two tech giants: Amazon and Facebook.

amazon data-analysis data-science data-visualization facebook python3 sharpe-ratio

Last synced: 06 Dec 2024

https://github.com/adagio/ivoox_episodes

iVoox Episodes: Scraping & Analysis

beautifulsoup4 data-analysis ivoox pandas python python3 scraping

Last synced: 27 Dec 2024

https://github.com/ajwad-shaikh/sristi-sanshodh-collect

SRISTI Sanshodh Collect is an Android app for filling out forms. It's been used to collect billions of data points in challenging environments. Contribute and make the world a better place! ✨📋✨ https://docs.opendatakit.org/collect-…

collect data-analysis data-collection javarosa odk opendatakit

Last synced: 17 Dec 2024

https://github.com/mgobeaalcoba/analisis_con_r

Trabajos de análisis realizados con lenguaje R

data-analysis data-science dataset r r-package r-programming r-studio

Last synced: 19 Nov 2024

https://github.com/whis99/userfunnelanalysis

An ecommerce user funnel conversion data analysis with matplotlib & python.

data-analysis data-analysis-python data-analyst data-visualization google-colab jupyter-notebook matplotlib python

Last synced: 13 Nov 2024

https://github.com/mgobeaalcoba/matplotlib_y_seaborn

Aquí dejaré trabajos de visualización realizados con ambas librerías de Python.

data-analysis data-science data-visualization dataset matplotlib numpy pandas python seaborn

Last synced: 19 Nov 2024

https://github.com/mgobeaalcoba/pandas_y_numpy

Trabajos realizados con estas librerías de Python para manejo de datos.

data-analysis data-science dataset numpy pandas python

Last synced: 19 Nov 2024

https://github.com/pranavarora1895/proteintypeprediction

Data Analysis on Protein Type Prediction

bioinformatics data-analysis supervised-learning

Last synced: 18 Nov 2024

https://github.com/okwilkins/retailanalysis

A comprehensive exploratory analysis and implementation of kmeans/hierarchical clustering on online retail data.

data-analysis data-science machine-learning statistics

Last synced: 20 Nov 2024

https://github.com/derrickbaruga7/mapping-median-age-europe

An R project that creates an interactive map of the median age across European regions using Eurostat data and spatial visualization packages.

data-analysis data-science data-visualization datascience european-union mapping r

Last synced: 03 Dec 2024

https://github.com/arhcoder/base-hackathon-2022

💸 Sistema que analiza las facturas de compra-venta de una empresa de importaciones y exportaciones, y crea una base de conocimiento con la que crea sugerencias de abastecimiento para las empresas clientes de Banco BASE, con el fin de ahorrarles dinero.

algorithms bank companies data-analysis decision-making exportation hackaton importation javascript mysql python suggestions

Last synced: 11 Nov 2024

https://github.com/markmusic27/data-statistics-calculator

💣 This method (made in JavaScript / Python) can find the mean, median, mode, range, and standard deviation.

data-analysis standard-deviation statistics statistics-calculator

Last synced: 05 Jan 2025

https://github.com/harshmule1/school-data-analysis-

School Data Analysis Using SQL

data-analysis mssql sql

Last synced: 17 Nov 2024

https://github.com/phillbertnevinemmanuel/automotivesalesdataanalysis

This marks my inaugural venture into personal data analysis, employing SQL and Python for Correlation Analysis. I've sourced the dataset from Kaggle, specifically focusing on automotive sales. You can find the dataset linked on my website below. I'm excited to share that I've independently managed the majority of tasks involved in this project.

data-analysis dataset microsoft-sql-server python python-lambda sql ssms tsql

Last synced: 17 Nov 2024

https://github.com/matthewgrosman/messenger-analytics

Project that ingests Facebook Messenger conversations and generates analytics.

analytics data-analysis excel facebook facebook-messenger java mongodb

Last synced: 08 Nov 2024

https://github.com/mohamedhany99/human-voice-identifier-counter

the application developed in (KIVY) it can identify the users imported into the dataset based on the support vector machine training model it has two features ( Importing new voice - Detection to detect the human voices and count them)

android android-app android-application automation automation-framework data data-analysis data-mining data-science data-visualization datascience kivy kivy-framework machine-learning python

Last synced: 17 Nov 2024

https://github.com/mynenik/xyplot-win32

XYPLOT Plotting and Data Analysis Program for 32-bit Windows

cpp data-analysis data-manipulation data-visualization forth mfc windows-app

Last synced: 24 Nov 2024

https://github.com/patriloto/reinventartec_2021

Material para el taller de Primeros pasos en R para el análisis de datos

data-analysis rstats

Last synced: 29 Nov 2024

https://github.com/supertetelman/kaggle-public

A collection of Python and Matlab projects aimed at utilizing various machine learning techniques to solve big data problems.

cnn data-analysis deep-learning machine-learning matlab python

Last synced: 30 Nov 2024

https://github.com/shibam120302/heart-disease-data-analysis-by-shibam

You can read more on the heart disease statistics and causes for self-understanding. This project covers manual exploratory data analysis

analysis data-analysis scraper

Last synced: 20 Nov 2024

https://github.com/shadan100/stroke-prediction-analysis

A web based application to predict whether a patient is likely to get stroke based on the input parameters like gender, age, various diseases, and smoking status. Each row in the data provides relevant information about the patient.

artificial-intelligence data-analysis data-science django django-framework jupyter-notebook machine-learning matplotlib pandas predictive-modeling python stroke-prediction web-application

Last synced: 11 Oct 2024

https://github.com/hevalhazalkurt/exploring_the_data_of_lego_history

A data exploration project on LEGO history in Python with pandas, matplotlib etc. (WIP)

data data-analysis data-science data-visualization datascience datasets lego lego-history matplotlib pandas python python3

Last synced: 20 Nov 2024

https://github.com/alexandrelamarre/fission

Data analytics & Structured streaming optimized for the Edge

data-analysis data-engineering rust structured-data unstructured-data

Last synced: 12 Nov 2024

https://github.com/hayatiyrtgl/data_analysis_project

Financial data analysis: preprocess, visualize, calculate technical indicators.

data-analysis data-analysis-python data-science dataframe numpy pandas python python3 stock-price-prediction talib trade-analysis

Last synced: 22 Dec 2024

https://github.com/kaz-yos/distributed

Comparison of Privacy-Protecting Analytic and Data-sharing Methods: a Simulation Study (Pharmacoepidemiol Drug Saf 2018)

data-analysis epidemiology statistics

Last synced: 12 Nov 2024