Ecosyste.ms: Awesome

An open API service indexing awesome lists of open source software.


Data analysis

Data analysis is a process of inspecting, cleansing, transforming, and modeling data with the goal of discovering useful information, informing conclusions, and supporting decision-making.

https://github.com/marios-mamalis/mca-visualisation

A script for automatic visualisation of Multiple Correspondence Analysis (MCA) results from FactoMineR in 3 dimensions using Plotly (exported as HTML).

3d-scatterplots correspondence-analysis data-analysis factominer html mca multiple-correspondence-analysis plotly visualisation

Last synced: 13 Jan 2025

https://github.com/jimbrig/eda

Exploratory Data Analysis R Package and Shiny App

data-analysis data-visualization eda r shiny

Last synced: 23 Jan 2025

https://github.com/w-edward/youtube-keyword-popularity-analyzer

An effort to discover the top trending keywords on YouTube.

data-analysis node-js numpy python webscraping youtube-api

Last synced: 16 Jan 2025

https://github.com/muzammil-13/mimlrepo

Data Analysis using Python Machine Learning Libraries

data-analysis data-science machine-learning numpy pandas python python-library

Last synced: 16 Jan 2025

https://github.com/analyticalnahid/numpy-tutorial

A complete notebook on NumPy for data science

data-analysis numpy numpy-tutorial

Last synced: 02 Feb 2025

https://github.com/jethronap/asylumdataku_website

Mini website for reporting analysis of Asylum Data @ DIKU

data-analysis docsify nlp

Last synced: 06 Jan 2025

https://github.com/tirendazacademy/data-sets

Data sets for Tirendaz Akademi Youtube

data-analysis dataset

Last synced: 01 Jan 2025

https://github.com/CAIDA/submarine-cable-impact-analysis-public

This repository contains tools implemented for the PAM 2020 paper "Unintended consequences: Effects of submarine cable deployment on Internet routing" to collect and analyze data depicting the impact of the South-Atlantic Cable System (SACS) launch on Internet routing. This codebase can be extended to other use-cases of cable launches, failures, etc.

africa-americas africa-south-america bgp-data-analysis caida-ark-measurement-platform data-analysis historical-traceroutes impact internet-routing ripe-atlas-measurement-platform sacs-cable sail-cable submarine-cables

Last synced: 06 Nov 2024

https://github.com/walidbosso/r_data_mining

Extract knowledge from data using different techniques, including association rules, hierarchical agglomerative clustering (HAC), k-means clustering, and decision trees.

association-rule-mining association-rules clustering data-analysis data-mining data-science data-visualization decision-tree-classifier decision-trees exportation extract-data hac hierarchical-clustering k-means k-means-clustering k-means-r r-programming r-studio

Last synced: 28 Jan 2025

https://github.com/ashwinpn/visualization

Data Visualization using Matplotlib, Pandas Visualization, Seaborn, ggplot, and Plotly.

analysis data data-analysis data-science data-visualization graphs plots python python3 visualization

Last synced: 16 Jan 2025

https://github.com/zelosleone/finncorr

A .NET Core financial analysis tool/API for calculating correlations between time series data with interactive visualizations powered by ML.NET and Plotly.js.

aspnet-core correlation-analysis csv-parser data-analysis dotnet financial-analysis machine-learning ml-net plotly rest-api statistical-analysis swagger time-series visualization

Last synced: 06 Feb 2025

https://github.com/virajbhutada/tableau-data-vizzes

Engage with a growing collection of Tableau dashboards covering financial trends, HR analytics, streaming service insights, real estate dynamics, and more. Meticulously crafted for valuable insights, this repository continues to expand with new and compelling visualizations.

business-analytics data-analysis data-visualization hr-analytics industry-trends netflix performance-metrics stock-market-analysis strategic-analytics tableau visual-insights

Last synced: 10 Jan 2025

https://github.com/yogeshnile/nifty50-index-time-series-analysis

An analysis of five years of Nifty50 data, from 01-04-2015 to 31-03-2020. Data was downloaded from the official NSE website.

data-analysis matplotlib nifty numpy pandas plotly python3 time-series-analysis

Last synced: 10 Jan 2025

https://github.com/BigBangData/TimesheetAnalysis

R shiny app to help analyze a bookkeeper's business - or anyone with a timesheet and some time.

bookkeeping data-analysis data-viz r-programming shiny-apps shiny-r timesheet-management

Last synced: 04 Dec 2024

https://github.com/fbecerra/fbecerra.github.io

Source code for my website www.fernandobecerra.com

data-analysis data-science data-visualization dataviz interactive-visualizations

Last synced: 27 Oct 2024

https://github.com/rawsashimi1604/jobextract

Scrapes LinkedIn data. Conducts sentiment analysis on what traits and qualifications employers are looking for.

data data-analysis data-analytics data-cleaning linkedin mvc python webscraper

Last synced: 27 Dec 2024

https://github.com/0mppula/element-compare

A single page Next.js 14 app that allows the user to inspect and compare elements from the periodic table.

compare dark-mode data-analysis elements inspect lucide-react nextjs periodic-table reactjs server-side-rendering shadcn-ui single-page-app typescript vercel zustand

Last synced: 11 Nov 2024

https://github.com/hvignolo87/ortex-programming-challenge

Coding challenges required for the Python Developer and Data Engineer job positions.

challenge data-analysis finance pandas python scripting sql sqlalchemy

Last synced: 02 Jan 2025

https://github.com/praju-1/pandas

The library is widely used in data science and machine learning for data cleaning, preparation, and analysis.

data-analysis pandas python

Last synced: 08 Feb 2025

https://github.com/shipyardapp/postgresql-blueprints

Simplified blueprints for building data pipelines with PostgreSQL.

cli data-analysis data-engineering data-pipeline data-science database elt etl postgres postgresql

Last synced: 04 Dec 2024

https://github.com/zrkhadija/data-analysis-for-financial-time-series

In this notebook, we performed data analysis on financial time series data from Yahoo Finance for the US market. We examined seasonality, trends, stationarity, and other aspects such as outliers and correlations.

autocorrelation correlation-analysis data-analysis financial-analysis time-series-analysis timeseries-forecasting visualization

Last synced: 09 Feb 2025

https://github.com/mindlessmuse666/client-data-analysing-tool

Industrial internship project: a data analysis tool built with Python (backend, with a PyQt6 frontend), Pandas, Matplotlib, and SQLite. The application lets users load data in CSV format, filter it, visualize key metrics with charts, and generate reports.

data-analysis desktop-application matplotlib pandas pyqt6 pyqt6-desktop-application python sqlite student-project

Last synced: 23 Dec 2024

https://github.com/mrjxtr/tokyo_airbnb_analysis_project

Full project case study and analysis to show potential opportunities to start an Airbnb business in Tokyo, Japan.

data-analysis data-cleaning data-science data-visualization pandas python3

Last synced: 06 Nov 2024

https://github.com/asifdotexe/timeseriesanalysis

This repository serves as a central hub for all of my projects related to time series analysis. Here, you'll find a collection of projects, code samples, and resources that explore various aspects of time series data and its analysis.

data-analysis feature-engineering jupyter-notebook pandas python time-series-analysis visualization

Last synced: 15 Jan 2025

https://github.com/tathithienthanh/dataanalysis_diagnosis-of-diabetes-based-on-data-set-of-blood-test-result

A complete project that applies data analysis and data mining techniques to diagnose diabetes from a data set of blood test results.

blood-test classification clustering data-analysis data-processing decision-tree diabetes-prediction diagnosis exercise google-colab health hierarchical ipynb kmeans knn knns py python smote-sampling visualization

Last synced: 04 Feb 2025

https://github.com/thecoderpinar/reta

🍃 Explore the world of renewable energy production, analyze historical data, and predict sustainable energy trends. Join us on the journey to a greener future!

arima clean-energy data-analysis data-science data-visualization energy-future forecasting-models innovation renewable-energy sustainability time-series

Last synced: 09 Feb 2025

https://github.com/asifdotexe/covidporfolioproject

A SQL + Tableau project on a real-world COVID-19 dataset, from the first recorded case to 2 March 2022 (i.e. my birthday XD).

dashboard data-analysis data-exploration data-visualization sql sql-server tableau

Last synced: 15 Jan 2025

https://github.com/nafiealhilaly/analyze-coderhub-sa

A simple web app to analyze and explore coderhub.sa API data. This project was my first real React app.

backend data-analysis eda frontend python react reactjs

Last synced: 08 Feb 2025

https://github.com/jimbrig/EDA

Exploratory Data Analysis R Package and Shiny App

data-analysis data-visualization eda r shiny

Last synced: 04 Dec 2024

https://github.com/akshat0427/spotify_history

Code to find insights in Spotify streaming data (work in progress).

data-analysis data-visualization

Last synced: 31 Jan 2025

https://github.com/cbg-ethz/scdna-pipe

Python data analysis pipeline for single cell copy number event history reconstruction

bioinformatics bioinformatics-pipeline data-analysis genomics python snakemake snakemake-workflows workflow

Last synced: 28 Jan 2025

https://github.com/casualcomputer/sql.mechanic

Functions that generate SQL queries that summarize high-dimensional tables stored in various databases (e.g. Microsoft SQL Servers, Netezza, DB2, Postgres, Oracle, MySQL, etc.).

data-analysis data-quality-checks data-science database mysql netezza oracle postgres quality-control r sql sql-server

Last synced: 04 Dec 2024

https://github.com/cworld1/da-learning

Some notes and code about CWorld learning Data Analysis

data-analysis data-science jupyter-book jupyter-notebook python r

Last synced: 23 Jan 2025

https://github.com/pitmonticone/covid-italy

References for COVID-19 situation in Italy.

coronavirus covid-19 covid-19-italy data data-analysis documentation testing

Last synced: 22 Jan 2025

https://github.com/banyc/dfsql

SQL REPL/lib for Data Frames

cli csv data-analysis jsonl ndjson repl sql

Last synced: 19 Nov 2024

https://github.com/gustavohnsv/teamwork_mqa

Repository dedicated to group work based on the study of data analysis methods for the course Métodos Quantitativos para Análise Multivariada (Quantitative Methods for Multivariate Analysis).

data-analysis group-project r team-repo

Last synced: 16 Dec 2024

https://github.com/revogati/ecommerce_consumer_behaviour

A full data analytics project, from data cleaning, preparation, and exploration through interpretation of insights to presentation of findings and recommendations.

data-analysis data-exploration ecommerce jupyter-notebook python sql tableau-public visualization

Last synced: 07 Jan 2025

https://github.com/parisaroozgarian/ibm-data-analyst-professional-certificate

The IBM Data Analyst Professional Certificate, consisting of 9 courses, equips learners with essential skills in Excel, SQL, Python, data visualization, and analysis techniques.

big-data business-analysis business-communication communication data-analysis data-management data-structures data-visualization databases general-statistics human-resources planning python-programming spreedsheet sql

Last synced: 19 Dec 2024

https://github.com/dcs-training/r-qgisintegratingspatialanalysis

This was an intermediate course of three sessions with a focus on developing skills in data visualisation, analysis, and integration using both RStudio and QGIS.

data-analysis data-visualisation data-wrangling gis qgis r spatial-analysis

Last synced: 10 Nov 2024

https://github.com/mindful-ai-assistants/credit-card-prediction

💳 This repository focuses on building a predictive model to assess the likelihood of credit card defaults. The project includes data analysis, feature engineering, and machine learning to provide accurate default predictions.

artificial-intelligence data-analysis data-science jupyter logistic-regression machine-learning predictive-modeling python3 scikit-learn

Last synced: 09 Dec 2024

https://github.com/ronylpatil/whatsapplib

WhatsApp Group Chat Analysis Python Package.

data-analysis open-source pypi-package python-library python-package

Last synced: 21 Jan 2025

https://github.com/dcs-training/from-spss-to-r-how-to-make-your-statistical-analysis-reproducible

Comfortable/aware of how to run your stats in SPSS? Curious to learn how to run them in R? You've come to the right place.

data-analysis data-visualisation data-wrangling good-practices-digital-research r rmarkdown spss statistics

Last synced: 10 Nov 2024

https://github.com/dcs-training/effectivedatavisualisation

This repository hosts the material connected to a training course developed by Dave Elsmore (Edina) for CDCS on good data visualisation.

data-analysis data-visualisation data-wrangling python

Last synced: 10 Nov 2024

https://github.com/openpmd/openpmd-ccd

A Python Module & LabView Bindings for Storing CCD Images with openPMD

ccd data-analysis database hdf5 open-data open-science openpmd

Last synced: 04 Jan 2025

https://github.com/dcs-training/bayesian-statistics

Materials for the CDCS Introduction to Bayesian Statistics course.

bayesian-statistics data-analysis r statistics

Last synced: 10 Nov 2024

https://github.com/bkataru/physics-ia

Programs and files written for Astrostatistics for IB Physics IA. Topic: Visualizing and analyzing the habitable zones for 150,000 stars from the Hipparcos catalogue.

astronomical-algorithms astronomy astrophysics astrostatistics data-analysis data-science data-visualization matplotlib plotting

Last synced: 22 Dec 2024

https://github.com/leonism/customer-predictive-analysis

Explore this repository, a comprehensive resource offering an in-depth guide to conducting customer predictive analysis using cutting-edge machine learning techniques, all within the intuitive framework of Dataiku.

data-analysis data-model data-science data-visualization dataiku machine-learning predictive-modeling

Last synced: 03 Feb 2025

https://github.com/hongbo-wei/global-status-of-cc-security-certification

Data visualization of CC Security Certification using VUE, Django, and MySQL.

big-date common-criteria data-analysis data-visualisation data-visualization

Last synced: 14 Jan 2025

https://github.com/aad99bxp/whatsapp-chat-analyzer

A project intended for business owners and managers to analyze WhatsApp chats between their customer care executives and their customers.

data-analysis heroku-deployment python3

Last synced: 22 Jan 2025

https://github.com/elhaban3ro/thewildtool

TheWildTool is a tool developed with the main objective of saving time when working with audio datasets. Either to prepare them, to get them or to train a model with them. 🤖

ai audio audio-processing data-analysis data-science dataset deeplearning python

Last synced: 30 Jan 2025

https://github.com/nhsdigital/sde_example_analysis

Example of what you can do in Databricks in the Secure Data Environment (SDE) using Python, SQL, and R.

data-analysis data-science databricks-notebooks machine-learning mlflow

Last synced: 23 Dec 2024

https://github.com/iguptashubham/online-retail-sales

This Power BI dashboard, designed for marketing strategists, analyzes sales trends and customer behavior. It provides key insights empowering them to identify sales opportunities and optimize marketing campaigns, ultimately boosting business sales.

dashboard data data-analysis data-analysis-project data-analysis-project-powerbi data-analysis-python data-project data-science powerbi project

Last synced: 14 Jan 2025

https://github.com/leandronasx/agro-data

Final project of the SoulCode Academy data analyst and dashboard training program.

bigquery data-analysis gcp looker pandas powerbi python

Last synced: 12 Oct 2024

https://github.com/nikoshet/exploratory-data-analysis-using-r

Exploratory Data Analysis using R Course Project for M.Sc. 'Data Science and Machine Learning' in NTUA

data data-analysis data-science eda exploratory-data-analysis ggplot2 r

Last synced: 03 Jan 2025

https://github.com/njoyedevs/chatgpt3_riskanalyzer

In this project, ChatGPT3 was fine-tuned on 9 data series spanning 40 years, training it to provide a market risk score. To view, visit: https://www.aimarketrisk.com

chatgpt3 data-analysis flask fred-api full-stack-web-development pandas python

Last synced: 30 Jan 2025

https://github.com/jshinm/web-scrapper

Web scraper used to extract NeuroData GitHub repo stats

data-analysis web-scraping

Last synced: 10 Feb 2025

https://github.com/llnl/hdtopology

High-dimensional topological data analysis library for NDDAV

analysis cpp data-analysis data-viz high-dimensional-data topological-data-analysis visualization

Last synced: 11 Nov 2024

https://github.com/kylejgillett/stevepy

A Space Weather data analysis tool for Python.

astronomy aurora data-analysis physics python space-weather space-weather-research

Last synced: 27 Jan 2025

https://github.com/coumbacoulibaly/adventureworkscycles

Repository for Adventure Works Sample Database Analysis

adventureworks data-analysis data-analytics mssql-database mssqlserver sql ssms

Last synced: 17 Nov 2024

https://github.com/frikishaan/browsing-history-analysis

This is a data analysis of my browsing history for the last 7 months.

browsing-history data-analysis jupyter-notebook python

Last synced: 09 Jan 2025

https://github.com/chaganti-reddy/evmarket-india

Electric Vehicle Market Segmentation Analysis in India

data-analysis data-science machine-learning market-segmentation pandas python

Last synced: 20 Jan 2025

https://github.com/i4ds/ecallisto_ng

Ecallisto NG is a Python package tailored for interacting with Ecallisto data.

data-analysis data-visualization e-callisto ecallisto-international-network numpy pandas python spectrometer

Last synced: 09 Nov 2024

https://github.com/1ayanabil1/healthcare-machine-learning

Explore our open-source repository focused on healthcare machine learning. We've developed predictive models for cardiovascular disease, diabetes, breast cancer, and more. Our projects employ diverse machine learning algorithms and data science techniques, enhancing early detection, diagnosis, and patient outcomes.

data-analysis data-science deep-learning disease disease-detection disease-modeling disease-prediction eda healthcare-application heathcare jupyter-notebook machine-learning machine-learning-algorithms machinelearning-python python

Last synced: 07 Jan 2025

https://github.com/thealphadollar/messiah

Messiah: The Mighty Son Of God Is Here To Help You Through Times Of Calamity

azure backend data data-analysis flask frontend materialize natural-disasters

Last synced: 21 Dec 2024

https://github.com/lacerbi/vbmc

Variational Bayesian Monte Carlo (VBMC) algorithm for posterior and model inference in MATLAB (old location)

bayesian-inference data-analysis gaussian-processes machine-learning matlab variational-inference

Last synced: 05 Feb 2025

https://github.com/alexandregazagnes/unilasalle-public-resources

UniLaSalle-Public-Ressources: this public repository contains the notebooks and the data used for both 2nd Year - Practical Statistical Tests and 4th Year - Data Analysis with Python.

data data-analysis data-analytics data-cleaning data-storytelling education educational exploratory-data-analysis python python3 r r-programming rstudio statistics visualization

Last synced: 11 Feb 2025

https://github.com/shivamswarnkar/tesla-stock-prediction

Predicting closing prices of Tesla stock using different regression methods.

data-analysis data-visualization plotly regression regularization sklearn stock-price-prediction

Last synced: 26 Jan 2025

https://github.com/shashankbansal6/signal-analysis-for-patient-monitoring

A reliable patient monitoring system which analyzes the correlated physiological signals collected from the patient's body, and generates alarms for abnormalities.

data-analysis patient-monitoring

Last synced: 10 Feb 2025

https://github.com/trybnetic/tu7-acceleration-sleep-wake-classification

Supporting material for the paper "Discrimination of sleep and wake periods from a hip-worn raw acceleration sensor using recurrent neural networks"

accelerometer accelerometry actigraphy data-analysis sensors sleep

Last synced: 15 Jan 2025

https://github.com/winter000boy/dsa-practice

This repository holds my solutions for LeetCode’s Pandas playlists. Each section includes code and notes on using Pandas to handle real-world data tasks efficiently. Perfect for anyone looking to deepen their understanding of data manipulation with Pandas.

data-analysis data-science leetcode leetcode-python pandas-python python3

Last synced: 30 Jan 2025

https://github.com/cego669/datathonengopevi

Team: Embrapeiros. Proposed solution for the Datathon of the VI ENGOPE (Encontro Goiano de Probabilidade e Estatística). Note: WE WERE THE CHAMPIONS!!!!!!!!

data-analysis data-science datathon python r streamlit xgboost-classifier

Last synced: 08 Dec 2024

https://github.com/tushar2704/everyday-sql

Welcome to Everyday SQL Sheets – your go-to resource for everyday SQL cheat sheets, pro tips, interview questions, and more. Whether you're a beginner looking to learn SQL or an experienced developer seeking quick reference materials, this application has got you covered.

artificial-intelligence cheatsheet data-analysis data-science database mysql postgresql query-language sql sqlalchemy streamlit streamlit-tushar2704 tushar2704

Last synced: 27 Dec 2024

https://github.com/pratishtha-abrol/astronomy-dataanalysis

A key technique in Data Driven Astronomy

astronomy astropy crossmatch data-analysis

Last synced: 05 Feb 2025

https://github.com/ac-gomes/data-engineering-with-databricks

A simple boilerplate for data engineering and data analysis training in Databricks.

data-analysis data-engineering databricks databricks-notebooks pyspark python unit-testing

Last synced: 09 Nov 2024

https://github.com/louislefevre/sstubs-miner

Data mining and analysis for the ManySStuBs4J dataset.

data-analysis data-mining manysstubs4j-dataset msr

Last synced: 05 Feb 2025

https://github.com/vishnu-t-r/data-analytics-portfolio-projects

This repository contains data analyst portfolio projects developed using various data analytics tools, including SQL, Python, Tableau, and Looker.

data data-analysis data-cleaning data-modeling data-visualization looker looker-studio python sql ssms tableau

Last synced: 10 Nov 2024

https://github.com/skylord0001/python-daily

Python - Basic, Apache - Conf, Black Stack Hub, Data analysis, Data Structure, Google Cloud, SQL system

apache-configuration data-analysis data-structure python-scripts python-sql

Last synced: 23 Nov 2024

https://github.com/kevinyang372/san-francisco-crime-data-analysis

An ARIMA prediction model for forecasting potential crimes based on users' time and location

data-analysis machine-learning

Last synced: 30 Jan 2025