Ecosyste.ms: Awesome

An open API service indexing awesome lists of open source software.

Awesome Lists | Featured Topics | Projects

Data analysis

Data analysis is a process of inspecting, cleansing, transforming, and modeling data with the goal of discovering useful information, informing conclusions, and supporting decision-making.

https://github.com/AllenInstitute/openscope_databook

OpenScope databook: a collaborative, versioned, data-centric collection of foundational analyses for reproducible systems neuroscience 🐁🧠🔬🖥️📈

dandi-archive data-analysis data-visualization nwb python reproducible-research visualization

Last synced: 02 Aug 2024

https://github.com/DistrictDataLabs/cultivar

Multidimensional data explorer and visualization tool.

data-analysis data-exploration data-management visualization

Last synced: 01 Aug 2024

https://github.com/VUKOZ-OEL/3d-forest

Visualization, processing and analysis of Lidar point clouds, mainly focused on forest environment. New version of 3D Forest. Process files with terabytes of data. Edit new point attributes. Simple addition of new features by plugins.

3d classification cpp cross-platform data-analysis desktop-application editor forest gui interactive-visualization las laser-scanning lidar opengl plugins point-cloud qt scientific-computing segmentation tree

Last synced: 03 Aug 2024

https://github.com/isxcode/spark-yun

Big data computing platform based on Spark <至轻云-打造大数据计算平台/数据中台>

data-analysis docker platform spark

Last synced: 04 Sep 2024

https://github.com/palewire/first-python-notebook

A step-by-step guide to analyzing data with Python and the Jupyter notebook.

altair data-analysis data-journalism education journalism jupyter jupyter-notebook jupyterlab news pandas python sphinx tutorial

Last synced: 30 Jul 2024

https://github.com/cosmoduende/r-youtube-personal-history-analysis

Explore your activity on YouTube with R: How to analyze and visualize your personal pata history. Find out how you consume YouTube using a copy of your personal data from Google Takeout.

data-analysis data-analytics data-visualization data-viz google-takeout r-data r-language r-programming youtube youtube-accounts youtube-analytics youtube-api youtube-data youtube-data-analysis youtube-data-api youtube-data-api-v3 youtube-data-scraping youtube-dataset youtube-scrape youtube-scraper

Last synced: 01 Aug 2024

https://github.com/ContextLab/computational-neuroscience

Short undergraduate course taught at University of Pennsylvania on computational and theoretical neuroscience. Provides an introduction to programming in MATLAB, single-neuron models, ion channel models, basic neural networks, and neural decoding.

computational-neuroscience course-materials data-analysis matlab modeling neuron problem-set simulation

Last synced: 07 Aug 2024

https://github.com/briatte/dsr

Introduction to Data Science with R (Sciences Po, Paris, 2023)

course data-analysis data-science data-visualization r statistics

Last synced: 02 Aug 2024

https://github.com/kb22/GitHub-User-Insights-using-API

The project involves using the GitHub API using user authentication to fetch information such as commits and repositories for that specific user and store them as CSV files for data collection and analysis.

api data-analysis data-science data-scraping github-api python

Last synced: 01 Aug 2024

https://github.com/SOCR/SOCRAT

A Dynamic Web Toolbox for Interactive Data Processing, Analysis, and Visualization

data-analysis data-science data-visualization socr statistics visual-analytics visualization

Last synced: 01 Aug 2024

https://github.com/ZijieZhaoMMHW/m_mhw1.0

A MATLAB toolbox to detect and analyze marine heatwaves (MHWs).

climate-science data-analysis heatwaves marine-heatwaves matlab

Last synced: 08 Aug 2024

https://github.com/rafzamb/sknifedatar

sknifedatar is a package that serves primarily as an extension to the modeltime 📦 ecosystem. In addition to some functionalities of spatial data and visualization.

data data-analysis data-science data-visualization forecasting r statistics time-series

Last synced: 05 Aug 2024

https://github.com/sharmaroshan/Insurance-Claim-Prediction

In this Data set we are Predicting the Insurance Claim by each user, Machine Learning algorithms for Regression analysis are used and Data Visualization are also performed to support Analysis.

beginner classification data-analysis data-visualization eda evaluation-metrics finance machine-learning radar-chart

Last synced: 08 Aug 2024

https://github.com/fjosw/pyerrors

Error propagation and statistical analysis for Markov chain Monte Carlo simulations in lattice QCD and statistical mechanics using autograd

autocorrelation autograd automatic-differentiation condensed-matter correlation data-analysis error-propagation lattice-field-theory lattice-qcd markov-chain monte-carlo particle-physics physics python qcd statistical-analysis statistical-mechanics

Last synced: 31 Jul 2024

https://github.com/dfinke/PSDuckDB

PSDuckDB is a PowerShell module that provides seamless integration with DuckDB, enabling efficient execution of analytical SQL queries directly from the PowerShell environment.

data-analysis data-science duckdb powershell sql

Last synced: 23 Aug 2024

https://github.com/leeper/make-example

An example of using make for a data analysis project

data-analysis make manuscript reproducible-research

Last synced: 05 Aug 2024

https://github.com/davidchall/ipaddress

Data analysis for IP addresses and networks

cyber data-analysis ip-address ipv4 ipv6 r vctrs

Last synced: 13 Aug 2024

https://github.com/petersontylerd/mlmachine

mlmachine accelerates machine learning experimentation

data-analysis data-science data-visualization machine-learning python

Last synced: 02 Aug 2024

https://github.com/braph-software/BRAPH-2

BRAPH 2.0 is a comprehensive software package for the analysis and visualization of brain connectivity data, offering flexible customization, rich visualization capabilities, and a platform for collaboration in neuroscience research.

biomedical-engineering brain-connectivity-analysis brain-research computational-neuroscience connectomics data-analysis data-science data-visualization deep-learning graph-theory machine-learning matlab network-analysis neuroimaging neuroscience open-source reproducible-research research-tools scientific-software toolbox

Last synced: 02 Aug 2024

https://github.com/rshkarin/quanfima

Quanfima (Quantitative Analysis of Fibrous Materials)

data-analysis material-science morphological-analysis volumetric-data

Last synced: 03 Aug 2024

https://github.com/ActivityWatch/aw-research

Tools to analyse and experiment with ActivityWatch data

activitywatch data-analysis python quantified-self

Last synced: 02 Aug 2024

https://github.com/goplus/pandas

Flexible and powerful data analysis / manipulation library for Go+, providing labeled data structures similar to R data.frame objects, statistical functions, and much more

data-analysis data-science data-tech go golang gop goplus pandas scientific-computing

Last synced: 02 Aug 2024

https://github.com/PiotrZakrzewski/merge-chance

Source code of https://merge-chance.info

analysis data data-analysis open-source

Last synced: 31 Jul 2024

https://github.com/gagolews/genie

Genie: A Fast and Robust Hierarchical Clustering Algorithm (this R package has now been superseded by genieclust)

cluster cluster-analysis clustering data-analysis data-mining data-science datascience genie hierarchical-clustering-algorithm machine-learning machine-learning-algorithms outliers r

Last synced: 31 Jul 2024

https://github.com/skyzh/Meteor

🚆 Fine-grained analysis and visualization of Hangzhou Metro for efficient traveling in metro system. Project report, slide and presentation video included.

cmake data-analysis hangzhou metro qt sqlite visualize

Last synced: 01 Aug 2024

https://github.com/alejandrodumas/kodiak

Enhance your feature engineering workflow with Kodiak

data-analysis pandas

Last synced: 07 Aug 2024

https://github.com/PySloth/pysloth

A Python Package for Probabilistic Prediction

data-analysis data-science machine-learning python statistics

Last synced: 03 Aug 2024

https://github.com/cengel/R-data-wrangling

Materials for my my R data workshop. https://cengel.github.io/R-data-wrangling/

data-analysis data-workshop datascience material r rstats social-sciences teaching tidyverse workshop

Last synced: 02 Aug 2024

https://github.com/davidgasquez/filecoin-data-portal

🧮 Open and local-first data hub for Filecoin!

data-analysis data-platform filecoin

Last synced: 03 Aug 2024

https://github.com/bcgov/shinyrems

An R package to launch shinyrems; an online application that allows a user to access, download, clean, plot and calculate simple statistics using data from the B.C. government Environmental Monitoring System database.

data-analysis environment environmental-data water-quality

Last synced: 13 Aug 2024

https://github.com/mcwaage1/qs

Quantified Self: A Personal Data Aggregator and Dashboard for Self-Trackers and Quantified Self Enthusiasts

activity-tracking data-analysis data-visualization fitbit goodreads google-sheets lastfm mood personal-data quantified-self quantifiedself self-tracking writing

Last synced: 01 Aug 2024

https://github.com/chyikwei/bnp

Bayesian nonparametric models for python

bayesian data-analysis probabilistic-graphical-models python topic-modeling

Last synced: 02 Aug 2024

https://github.com/charliezcr/Kpop-Data-Analysis

Data analysis about K-pop industry, artists, and companies. Visualized business performances of public K-pop companies and analyzed artist management and international marketing strategies

data-analysis data-visualization kpop pandas python

Last synced: 01 Aug 2024

https://github.com/Big-Bytes-Labs/fisher-rs

A fast, powerful, flexible and easy to use open source data analysis and manipulation tool written in Rust

data-analysis dataframe-library machine-learning rust-lang

Last synced: 01 Aug 2024

https://github.com/iam-mhaseeb/Satellite-Imagery-Analysis-of-Vegetation-in-Southern-Pakistan

This repository contains a study how we can examine the vegetation cover of a region with the help of satellite data. The notebook in this repository aims to familiarise with the concept of satellite imagery data and how it can be analyzed to investigate real-world environmental and humanitarian challenges.

data-analysis data-visualization jupyter-notebook notebook pakistan pakistan-analysis python3 satellite-data satellite-imagery satellite-images vegetation vegetation-analysis

Last synced: 03 Aug 2024

https://github.com/mathewroy/ynabr

Analyze and visualize your You Need A Budget (YNAB) data. YNAB meets R programming language.

api data-analysis data-science data-visualization r ynab ynab-api

Last synced: 13 Aug 2024

https://github.com/cosmoduende/r-google-search-history-analysis

Explore your activity on Google with R: How to Analyze and Visualize Your Personal Data Search History. Find out how and how much you have used the most popular search engine in the world, using a copy of your personal data.

data-analysis data-visualisation data-visualization data-viz google google-history google-takeout personal-data-analysis r-language r-programming r-script rvest search-data search-history takeout takeout-data

Last synced: 01 Aug 2024

https://github.com/brad-cannell/freqtables

Quickly make tables of descriptive statistics (i.e., counts, percentages, confidence intervals) for categorical variables. This package is designed to work in a tidyverse pipeline, and consideration has been given to get results from R to Microsoft Word ® with minimal pain.

categorical-data data-analysis descriptive-statistics epidemiology r

Last synced: 13 Aug 2024

https://github.com/AurelienAubry/Spotlight

Spotlight is a Spotify dashboard that allows user to visualize his listening habits.

backend bootstrap chartjs data data-analysis data-science data-visualization flask frontend javascript js pandas python python3 react react-bootstrap spotify

Last synced: 01 Aug 2024

https://github.com/coalio/Assistant

A data science library providing flexible dataframes for Lua 5.1+

data-analysis data-science data-structures dataframe lua

Last synced: 01 Aug 2024

https://github.com/stappit/blog

I often post solutions to textbook exercises, including: Bayesian Data Analysis (BDA) by Gelman et al; Causal Inference in Statistics Primer (CISP) by Pearl et al; Purely Functional Data Structures (PFDS) by Okasaki.

bayesian-data-analysis blog data-analysis data-science gelman hakyll haskell pearl purely-functional-data-structures solutions stan static-site statistical-inference statistics

Last synced: 30 Jul 2024

https://github.com/abmenzel/messenger-wrapped

Explore your Facebook Messenger chat history, in a manner inspired by Spotify Wrapped

data-analysis data-visualization facebook meta nextjs reactjs spotify

Last synced: 07 Aug 2024

https://github.com/sanvishal/Exoplanet-Explore

An Interactive data visualization of Exoplanets

animation d3js data-analysis data-science exoplanet python space visualization

Last synced: 01 Aug 2024

https://github.com/i10mm/gpt-arxiv-fetcher

Revolutionize your research with our GitHub repository, where GPT meets arXiv API for seamless access and analysis of the latest academic papers!

artificial data-analysis intelligence llm machine-learning

Last synced: 05 Aug 2024

https://github.com/globeandmail/startr-cli

A command-line scaffolder for the startr R project template

data-analysis data-journalism data-visualization journalism r

Last synced: 08 Aug 2024

https://github.com/gcappon/py_agata

Official Python porting of AGATA (Automated Glucose dATa Analysis) a toolbox to analyse glucose data.

continuous-glucose-monitoring data-analysis hacktoberfest python toolbox

Last synced: 02 Aug 2024

https://github.com/EmirhanServeren/NFT-CollaBot

NFT CollaBot is a data-oriented project designed by the requirements of NFT ecosystem and aims to strengthen community.

data data-analysis data-analytics nft streamlit streamlit-webapp tezos tezos-api tezos-blockchain tezoswallet

Last synced: 01 Aug 2024

https://github.com/DataHerb/dataherb-python

Python Package for DataHerb: create, search, and load datasets.

data data-analysis data-mining database dataset python

Last synced: 03 Aug 2024

https://github.com/bts-cm/airdrop_tool

Fetch & analyse blockchain tickets. View leaderboards and user tickets. Calculate and perform provably fair airdrops.

airdrop bitshares bitsharesjs blockchain crypto data-analysis data-science electron etl javascript nodejs react ticket tusc

Last synced: 03 Aug 2024

https://github.com/andreantonacci/eu2019

Social Network Analysis of Twitter Topic-Network Structures during the 2019 European Elections

data-analysis election-analysis european-elections gephi latex master-thesis social-network-analysis

Last synced: 06 Aug 2024

https://github.com/jimbrig/lossrunAnalyzer

R Package and Shiny App to Analyze Insurance Lossruns

actuarial data-analysis data-mining data-science insurance r record-linkage risk-management shiny

Last synced: 13 Aug 2024

https://github.com/jimbrig/EDA

Exploratory Data Analysis R Package and Shiny App

data-analysis data-visualization eda r shiny

Last synced: 13 Aug 2024

https://github.com/casualcomputer/sql.mechanic

Functions that generate SQL queries that summarize high-dimensional tables stored in various databases (e.g. Microsoft SQL Servers, Netezza, DB2, Postgres, Oracle, MySQL, etc.).

data-analysis data-quality-checks data-science database mysql netezza oracle postgres quality-control r sql sql-server

Last synced: 13 Aug 2024

https://github.com/AnonCatalyst/WebHound

WebHound is your Python-powered command-line assistant for sharp and efficient web searches! It sniffs out data from major search engines and detects social platforms, helping you uncover valuable insights and stay ahead of the game. 🌐🔍✨

awareness data-analysis data-visualization information-retrieval osint osint-python osint-reconnaissance osint-resources osint-tool osint-tools osinttools web-osint webscraping

Last synced: 13 Aug 2024

https://github.com/CAIDA/submarine-cable-impact-analysis-public

This repository contains tools implemented for the PAM 2020 paper "Unintended consequences: Effects of submarine cable deployment on Internet routing" to collect and analyze data depicting the impact of the South-Atlantic Cable System (SACS) launch on Internet routing. This codebase can be extended to other use-cases of cable launches, failures, etc.

africa-americas africa-south-america bgp-data-analysis caida-ark-measurement-platform data-analysis historical-traceroutes impact internet-routing ripe-atlas-measurement-platform sacs-cable sail-cable submarine-cables

Last synced: 01 Aug 2024

https://github.com/1994nikunj/nlp-toolkit-desktop-app

The code is a collection of NLP analyses, including text cleaning, most common words, n-grams generation, co-occurrence matrix generation, wordcloud generation, topic modeling (using Latent Dirichlet Allocation), and general text statistics.

data-analysis n-grams network-visualization nlp python text-cleaning topic-modeling wordcloud-generator

Last synced: 07 Aug 2024

https://github.com/randomshek/Working-With-Excel

Using Excel Power Query and PowerPivot, reorganise the data into a star schema and showcasing reports that can be created by data analysts using DAX formulae and PowerPivot

data-analysis excel power-pivot power-query

Last synced: 08 Aug 2024

https://github.com/matthewgrosman/messenger-analytics

Project that ingests Facebook Messenger conversations and generates analytics.

analytics data-analysis excel facebook facebook-messenger java mongodb

Last synced: 01 Aug 2024

https://github.com/nysportsfan/Gun-Violence-in-the-US

This repository contains all the relevant files for my first capstone project as part of the Springboard Data Science Career Track.

data-analysis data-science data-visualization machine-learning python3 statistics

Last synced: 03 Aug 2024

https://github.com/shipyardapp/amazonathena-blueprints

Simplified blueprints for building data pipelines with Amazon Athena.

amazon-athena athena cli data-analysis data-engineering data-science elt etl

Last synced: 13 Aug 2024

https://github.com/JovaniPink/excel-powerbi

The folder of my work with Excel, VBA, and PowerBI for Data Analysis & Visualization.

data-analysis data-visualization dax excel excel-vba power-pivot power-query powerbi vba-macros

Last synced: 08 Aug 2024

https://github.com/shipyardapp/postgresql-blueprints

Simplified blueprints for building data pipelines with PostgreSQL.

cli data-analysis data-engineering data-pipeline data-science database elt etl postgres postgresql

Last synced: 13 Aug 2024

https://github.com/BigBangData/TimesheetAnalysis

R shiny app to help analyze a bookkeeper's business - or anyone with a timesheet and some time.

bookkeeping data-analysis data-viz r-programming shiny-apps shiny-r timesheet-management

Last synced: 13 Aug 2024

https://github.com/ehopperdietzel/billionaires-analysis

Análisis de la cantidad de billonarios por país. Inspirado en el artículo "Russian Billionaires"

bootstrap data-analysis poisson-distribution prediction

Last synced: 31 Jul 2024

https://github.com/Fisseha-Estifanos/telecom

A showcase repository for a specific telecommunication company. Used to analyze several telecommunication data set features and generate useful insights accordingly. Insights generated could be seen at https://github.com/Fisseha-Estifanos/telecom-visualizer or at https://fisseha-estifanos-telecom-visualizer-home-huxgy0.streamlitapp.com/

data-analysis notebooks-jupyter python visual-studio-code visualization

Last synced: 29 Jul 2024

https://github.com/wadeChriestenson/Main_Application

A Django application to host my personal resume.

data-analysis data-visualization django plotly python ui-design

Last synced: 29 Jul 2024