Ecosyste.ms: Awesome

An open API service indexing awesome lists of open source software.

Data analysis

Data analysis is a process of inspecting, cleansing, transforming, and modeling data with the goal of discovering useful information, informing conclusions, and supporting decision-making.

https://github.com/skyzh/meteor

🚆 Fine-grained analysis and visualization of the Hangzhou Metro for efficient travel in the metro system. Project report, slides, and presentation video included.

cmake data-analysis hangzhou metro qt sqlite visualize

Last synced: 28 Oct 2024

https://github.com/alejandrodumas/kodiak

Enhance your feature engineering workflow with Kodiak

data-analysis pandas

Last synced: 27 Nov 2024

https://github.com/PySloth/pysloth

A Python Package for Probabilistic Prediction

data-analysis data-science machine-learning python statistics

Last synced: 17 Nov 2024

https://github.com/saranshbansal/data-science-with-python

Data science with Python: This repository mostly contains DataCamp data-science courses/exercises that I have completed.

data-analysis data-science datacamp-exercises numpy python

Last synced: 09 Nov 2024

https://github.com/cengel/R-data-wrangling

Materials for my R data workshop. https://cengel.github.io/R-data-wrangling/

data-analysis data-workshop datascience material r rstats social-sciences teaching tidyverse workshop

Last synced: 13 Nov 2024

https://github.com/pgomba/mdpi_explorer

A simple package to explore MDPI's articles by journal. A series of functions helps to obtain lists of papers, extract data from them (turnaround times, special issues, and article types), and create summary graphs.

analysis data-analysis data-visualization mdpi metrics scientific-journals visualization web-scraping

Last synced: 18 Nov 2024

https://github.com/csinva/data-viz-utils

Functions for easily making publication-quality figures with matplotlib.

big-data data-analysis data-science data-visualization eda legend matplotlib python python3 scatterplot time-series

Last synced: 09 Nov 2024

https://github.com/ptyadana/data-analysis-for-digital-music-store

Helping a digital music store optimize its business practices using PostgreSQL

chinook chinook-database data-analysis datavisualization pgadmin4 postgresql sql tableau

Last synced: 15 Nov 2024

https://github.com/bcgov/shinyrems

An R package to launch shinyrems; an online application that allows a user to access, download, clean, plot and calculate simple statistics using data from the B.C. government Environmental Monitoring System database.

data-analysis environment environmental-data water-quality

Last synced: 04 Dec 2024

https://github.com/rob-med/everything-shapelets

This repo contains useful links to research papers and implementations of shapelets discovery/learning techniques from different sources.

data-analysis data-mining shapelets time-series-analysis timeseries

Last synced: 14 Nov 2024

https://github.com/hoangsonww/north-carolina-household-analysis

🏠 This repository contains data analysis scripts for the 2022 American Community Survey (ACS) focusing on individuals aged 25 and over in North Carolina, based on 75,340 observations. This repository offers valuable insights into demographic and economic patterns across North Carolina's urban areas.

confidence-interval confidence-score data data-analysis data-analytics data-science data-visualization ggplot2 hypothesis-testing hypothesis-tests north-carolina r r-language r-programming stata

Last synced: 14 Nov 2024

https://github.com/chyikwei/bnp

Bayesian nonparametric models for python

bayesian data-analysis probabilistic-graphical-models python topic-modeling

Last synced: 13 Nov 2024

https://github.com/andrewreynen/lazylyst

Lazylyst is a GUI created for time series review, using a flexible framework for new workflows

data-analysis earthquakes gui python qt seismology

Last synced: 19 Oct 2024

https://github.com/samuroi/samuroi

SamuROI - Structured analysis of multiple user-defined ROIs, an open source Python-based analysis environment for imaging data.

calcium-imaging conda data-analysis data-visualization opencv python scikit-image

Last synced: 16 Nov 2024

https://github.com/aifred-health/vulcanai

A high level deep learning framework for quickly prototyping networks with added tools in data visualisation, model interpretability and performance metrics

data-analysis data-cleaning data-science data-visualization deep-learning deep-neural-networks feature-engineering mental-health python3 pytorch scikit-learn

Last synced: 05 Dec 2024

https://github.com/jpenuchot/ctbench

Compiler-assisted variable size benchmarking for the study of C++ metaprogram compile times.

benchmark clang compilation data-analysis data-visualization gcc metaprogramming

Last synced: 12 Nov 2024

https://github.com/mcwaage1/qs

Quantified Self: A Personal Data Aggregator and Dashboard for Self-Trackers and Quantified Self Enthusiasts

activity-tracking data-analysis data-visualization fitbit goodreads google-sheets lastfm mood personal-data quantified-self quantifiedself self-tracking writing

Last synced: 08 Nov 2024

https://github.com/llnl/topoms

Topological Analysis for Molecular Systems

data-analysis data-viz

Last synced: 11 Nov 2024

https://github.com/cmudig/texture

Visualize your text data with structured attributes

data-analysis llm text visualization

Last synced: 09 Nov 2024

https://github.com/zekeriyyaa/pyspark-structured-streaming-ros-kafka-apachespark-cassandra

Structured streaming applied to robot data from a ROS-Gazebo simulation environment using Apache Spark. Data is collected in Kafka, analyzed by Apache Spark, and stored in Cassandra.

apache-cassandra apache-kafka apache-spark cqlsh data-analysis kafka-consumer kafka-producer pyspark python python3 ros ros-noetic spark-cassandra spark-cassandra-connector spark-kafka-connector spark-kafka-integration spark-sql spark-streaming structured-streaming

Last synced: 12 Oct 2024

https://github.com/kennethleungty/fifa-football-world-rankings

Analyzing FIFA World Football Rankings with Python and R

data-analysis data-analytics data-science football python r soccer sports

Last synced: 22 Nov 2024

https://github.com/nceas/metajam

Bringing data and metadata togetheR

data data-analysis metadata r repositories

Last synced: 10 Oct 2024

https://github.com/kongruksiamza/python-datascience

Teaching materials for Python data science and machine learning

data-analysis data-science numpy pandas python

Last synced: 09 Nov 2024

https://github.com/msyriac/orphics

A library containing analysis and theory tools for cosmological data.

analysis cmb cosmology data-analysis theory

Last synced: 27 Oct 2024

https://github.com/iBridges-for-iRODS/iBridges

A wrapper around the python-irodsclient to allow for easy interaction with iRODS servers.

data-analysis data-engineering data-science datascience irods-client

Last synced: 22 Nov 2024

https://github.com/tanglespace/hotstepper

A Numpy based step function library for analysis and profit. More than just taking you up and down.

data-analysis kernel-methods linear-algebra numpy pandas step-functions time-series

Last synced: 27 Oct 2024

https://github.com/hoangsonww/global-covid19-analysis

🌍 This repository hosts an in-depth analysis of COVID-19's impact across five key countries from Jan 2020 to Dec 2021. Through advanced data analysis and visualization, we aim to provide insights into how the pandemic evolved differently across these nations, shedding light on the effectiveness of various health measures and vaccination campaigns.

covid covid-19 covid19-tracker data data-analysis data-analytics data-science data-visualization ggplot2 julia julia-language python r r-language r-markdown r-programming sas sas-programming stata vaccination

Last synced: 12 Oct 2024

https://github.com/tsffarias/data-analysis-queries

This repository was carefully created to provide an extensive collection of SQL queries that make the work of data analysts easier across many areas of a company, including marketing, logistics, sales, finance, human resources, operations, legal, support, and more.

business-intelligence comercial data-analysis data-insights esg finance-management fraud-prevention human-resources juridico kpis logistics marketing marketing-analytics operacao pricing sql suporte

Last synced: 05 Nov 2024

https://github.com/rugk/crops-parser

🌱🍎🍆 A shell script to parse the data by the Food and Agriculture Organization of the United Nations on crops/fruits.

agriculture agriculture-research crop crops data-analysis data-science food fruit fruits statistics streetcomplete tree vegetables

Last synced: 23 Oct 2024

https://github.com/librariesio/metrics

📈 What to measure, how to measure it.

data-analysis measure metrics open-source

Last synced: 10 Nov 2024

https://github.com/charliezcr/Kpop-Data-Analysis

Data analysis of the K-pop industry, artists, and companies. Visualizes the business performance of public K-pop companies and analyzes artist management and international marketing strategies.

data-analysis data-visualization kpop pandas python

Last synced: 08 Nov 2024

https://github.com/amhsirak/ickle

DataFrame, analysis & manipulation library for tiny labeled datasets

data-analysis dataframe datascience ickle pandas python

Last synced: 17 Nov 2024

https://github.com/awadell1/datamaster

Tool for accessing and extracting data from MoTeC Log Files

data-analysis data-visualization fsae matlab motec

Last synced: 08 Nov 2024

https://github.com/engali94/twitter-account-analyzer

Using various Python libraries such as pandas, Tweepy, JSON, and Matplotlib to take a sneak peek at your Twitter account using Google Colab.

data-analysis data-visualization python3 twitter-api twitter-sentiment-analysis twitter-streaming-api

Last synced: 11 Nov 2024

https://github.com/wildtreetech/explore-open-data

📈🔍 Exploring open data from Zürich

binder data-analysis notebook open-data opendata python tutorial

Last synced: 12 Nov 2024

https://github.com/dawievlill/datascience-871

Data science module for economists written mostly in Julia and R

data-analysis data-science machine-learning

Last synced: 11 Nov 2024

https://github.com/thechymera/repsep

Reproducible Self-Publishing - Demo Publications in the Most Common Formats

article data-analysis foss latex poster presentation publishing python reexecutable-publication reproducible-research

Last synced: 16 Nov 2024

https://github.com/omarsar/data_mining_2017_fall_lab

Contains information and instructions for the first Data Mining lab session for 2017 Fall.

data data-analysis data-mining data-science data-visualization

Last synced: 13 Oct 2024

https://github.com/rameerez/bicimad-data-analysis

🚲 BiciMAD - Data analysis + ML usage predictions of Madrid's public bike system data

artificial-intelligence bicimad bicimad-data bike city data-analysis emt-opendata ipython-notebook machine-learning madrid neural-network python

Last synced: 11 Oct 2024

https://github.com/ndleah/health-analysis

This case study is contained within the Serious SQL course by Danny Ma

data-analysis data-exploration data-with-danny database serious-sql sql

Last synced: 13 Nov 2024

https://github.com/ahammadmejbah/tensorflow-developers-roadmap

TensorFlow is an open-source machine learning framework developed by Google. It provides a versatile platform for creating and deploying machine learning models, particularly neural networks, enabling tasks like image recognition, natural language processing, and more.

computer-vision computervision data-analysis data-engineering data-science data-visualization machine-image machine-learning machine-learning-algorithms machine-vision tensorflow tensorflow-tutorials tensorflow2

Last synced: 10 Oct 2024

https://github.com/toutiaoio/howtos

How-to recipes, bite-sized practical tutorials, and development tips

cookbook data-analysis howto-tutorial howtos pandas recipes tips-and-tricks tutorials

Last synced: 08 Nov 2024

https://github.com/Big-Bytes-Labs/fisher-rs

A fast, powerful, flexible and easy to use open source data analysis and manipulation tool written in Rust

data-analysis dataframe-library machine-learning rust-lang

Last synced: 04 Nov 2024

https://github.com/mathewroy/ynabr

Analyze and visualize your You Need A Budget (YNAB) data. YNAB meets R programming language.

api data-analysis data-science data-visualization r ynab ynab-api

Last synced: 04 Dec 2024

https://github.com/open-risk/correlationmatrix

correlationMatrix is a Python powered library for the statistical analysis and visualization of correlations

correlation-analysis correlation-matrices data-analysis data-science statistics

Last synced: 13 Oct 2024

https://github.com/llnl/pydv

PyDV: Python Data Visualizer

1d data-analysis data-viz python visualization

Last synced: 11 Nov 2024

https://github.com/olow304/sqdata

📊 Simple SQL client for lightweight data analysis using the React.js framework.

chartsjs data-analysis highcharts javascript nodejs react reactjs sql sqljs visualization

Last synced: 09 Nov 2024

https://github.com/glentner/dataphile

Data analytics library for Python and suite of open source, command line based data ops tools.

data-analysis data-ops data-science python scientific-computing

Last synced: 09 Nov 2024

https://github.com/brad-cannell/freqtables

Quickly make tables of descriptive statistics (i.e., counts, percentages, confidence intervals) for categorical variables. This package is designed to work in a tidyverse pipeline, and consideration has been given to get results from R to Microsoft Word ® with minimal pain.

categorical-data data-analysis descriptive-statistics epidemiology r

Last synced: 04 Dec 2024

https://github.com/asnelt/mixedvinetoolbox

Matlab toolbox for canonical vine copula trees with mixed continuous and discrete marginals

c-vines copula copula-models copulae copulas data-analysis dependency-analysis dependency-modeling matlab modeling regular-vines statistics

Last synced: 27 Oct 2024

https://github.com/iam-mhaseeb/Satellite-Imagery-Analysis-of-Vegetation-in-Southern-Pakistan

This repository contains a study of how we can examine the vegetation cover of a region with the help of satellite data. The notebook in this repository aims to familiarise readers with the concept of satellite imagery data and how it can be analyzed to investigate real-world environmental and humanitarian challenges.

data-analysis data-visualization jupyter-notebook notebook pakistan pakistan-analysis python3 satellite-data satellite-imagery satellite-images vegetation vegetation-analysis

Last synced: 15 Nov 2024

https://github.com/30mb1/pandas-boost

This repo contains code comparing pandas speedup libraries such as Modin, Dask, Swifter, pandarallel, and Numba

data-analysis machine-learning multiprocessing pandas pandas-tutorial python

Last synced: 02 Nov 2024

https://github.com/ct83/surway

SurWay is a survey/polling website for cab drivers where they can report their typical work hours and which company they work for. This data is then stored anonymously and used to generate charts and insights.

charts data-analysis data-visualization data-visualization-project docker docker-compose express node nodeexpress nodejs react react-chartjs-2 react-charts react-project react-projects reactjs reactjs-demo

Last synced: 06 Dec 2024

https://github.com/chandraprakash-bathula/apparel-recommendations

This project implements a personalized apparel recommendation engine using content-based search with the Amazon API, NLTK, and Keras libraries.

boxplot cnn-keras data-analysis data-science deep-learning linear-regression machine-learning numpy pandas scatter-plot scikit-learn svm tensorflow xgboost

Last synced: 28 Oct 2024

https://github.com/openbridge/chatlytics

Chatlytics is a data query and visualization platform for chat!

charts chatbot data-analysis data-visualization slack slack-bot

Last synced: 14 Nov 2024

https://github.com/cosmoduende/r-google-search-history-analysis

Explore your activity on Google with R: How to Analyze and Visualize Your Personal Data Search History. Find out how and how much you have used the most popular search engine in the world, using a copy of your personal data.

data-analysis data-visualisation data-visualization data-viz google google-history google-takeout personal-data-analysis r-language r-programming r-script rvest search-data search-history takeout takeout-data

Last synced: 07 Nov 2024

https://github.com/jm199504/python-exercises

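Python exercise book (basic operations / databases / statistical data processing / image and text generation / data conversion / algorithm problems / small applications / program development)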

data-analysis database python

Last synced: 09 Nov 2024

https://github.com/pottekkat/go-corona

Live viz and updates of COVID 19. Let us fight this together!

corona covid-19 covid-19-india data-analysis data-visualization health infographics spread tableau

Last synced: 10 Nov 2024

https://github.com/supercowpowers/scp-labs

SCP Labs (Open Source Team for SuperCowPowers)

data-analysis data-science pandas python scikit-learn security

Last synced: 09 Nov 2024