Ecosyste.ms: Awesome

An open API service indexing awesome lists of open source software.

Data analysis

Data analysis is a process of inspecting, cleansing, transforming, and modeling data with the goal of discovering useful information, informing conclusions, and supporting decision-making.

https://github.com/dfinke/PSDuckDB

PSDuckDB is a PowerShell module that provides seamless integration with DuckDB, enabling efficient execution of analytical SQL queries directly from the PowerShell environment.

data-analysis data-science duckdb powershell sql

Last synced: 16 Dec 2024

https://github.com/jobovy/apogee

Tools for dealing with APOGEE data

astronomy astrophysics data data-analysis python spectroscopy

Last synced: 02 Dec 2024

https://github.com/ndleah/8-week-sql-challenge

#8WeekSQLChallenge by Danny Ma.

data-analysis data-science sql

Last synced: 13 Nov 2024

https://github.com/cdnjs/cf-stats

📈 Monthly usage statistics from Cloudflare for the cdnjs.cloudflare.com domain - The #1 free and open source CDN built to make life easier for developers.

cdnjs cloudflare data data-analysis statistics stats usage usage-data usage-reports

Last synced: 19 Dec 2024

https://github.com/elysian01/data-purifier

A Python library for Automated Exploratory Data Analysis, Automated Data Cleaning, and Automated Data Preprocessing For Machine Learning and Natural Language Processing Applications in Python.

data-analysis data-cleaning data-cleaning-pipeline data-preprocessing data-science data-visualization datapurifier eda exploratory-data-analysis jupyter python-lib python-library python3

Last synced: 07 Nov 2024

https://github.com/briatte/dsr

Introduction to Data Science with R (Sciences Po, Paris, 2023)

course data-analysis data-science data-visualization r statistics

Last synced: 27 Oct 2024

https://github.com/lunarwhite/covid-social-analysis

Apply ML to Weibo sentiment. Sentiment analysis and visualization of Weibo text during the pandemic.

crawling data-analysis machine-learning nlp python vizualization

Last synced: 06 Nov 2024

https://github.com/nicolaskruchten/scipy2021

Data Visualization as the First and Last Mile of Data Science: Plotly Express and Dash

data-analysis data-science data-visualization python visualization

Last synced: 08 Nov 2024

https://github.com/SOCR/SOCRAT

A Dynamic Web Toolbox for Interactive Data Processing, Analysis, and Visualization

data-analysis data-science data-visualization socr statistics visual-analytics visualization

Last synced: 03 Nov 2024

https://github.com/tstreamdoth/instacart-market-basket-analysis

Use Instacart public dataset to report which products are often shopped together. 🍋🍉🥑🥦

data-analysis data-science instacart market-basket-analysis

Last synced: 28 Oct 2024

https://github.com/rafzamb/sknifedatar

sknifedatar is a package that serves primarily as an extension to the modeltime 📦 ecosystem, in addition to providing some functionality for spatial data and visualization.

data data-analysis data-science data-visualization forecasting r statistics time-series

Last synced: 22 Nov 2024

https://github.com/root-11/tablite

Multiprocessing-enabled, out-of-memory data analysis library for tabular data.

data-analysis data-science datatype disk etl excel filereader pandas pivot-tables python table tabular-data

Last synced: 11 Oct 2024

https://github.com/sharmaroshan/Insurance-Claim-Prediction

This project predicts the insurance claim of each user in the dataset. Machine learning algorithms for regression analysis are used, and data visualization is also performed to support the analysis.

beginner classification data-analysis data-visualization eda evaluation-metrics finance machine-learning radar-chart

Last synced: 27 Nov 2024

https://github.com/czyt1988/data-workbench

Data processing software developed with Qt (C++)

data-analysis graphicsview qt qt-workflow qt5 workflow

Last synced: 19 Dec 2024

https://github.com/kennbroorg/poorskeme

OSINT - Data Visualization - Blockchain - Awareness - Scam

data-analysis data-visualization python scam smart-contracts visualization

Last synced: 11 Nov 2024

https://github.com/khanhnamle1994/world-cup-2018

An exploratory data analysis and data visualization project for World Cup 2018

data-analysis data-visualization

Last synced: 10 Nov 2024

https://github.com/braph-software/BRAPH-2

BRAPH 2.0 is a comprehensive software package for the analysis and visualization of brain connectivity data, offering flexible customization, rich visualization capabilities, and a platform for collaboration in neuroscience research.

biomedical-engineering brain-connectivity-analysis brain-research computational-neuroscience connectomics data-analysis data-science data-visualization deep-learning graph-theory machine-learning matlab network-analysis neuroimaging neuroscience open-source reproducible-research research-tools scientific-software toolbox

Last synced: 12 Nov 2024

https://github.com/atapas/covid-19

COVID-19 World is yet another project to build a dashboard-like app that showcases data related to COVID-19 (coronavirus).

analytics countries covid covid-19 covid-19-india covid19 dashboard data-analysis data-visualization jamstack react reactjs recharts saas showcase virus visualization

Last synced: 07 Nov 2024

https://github.com/leeper/make-example

An example of using make for a data analysis project

data-analysis make manuscript reproducible-research

Last synced: 28 Oct 2024

https://github.com/raycad/stream-processing

Stream processing guidelines and examples using Apache Flink and Apache Spark

apache-flink apache-spark batch-processing data-analysis streaming

Last synced: 22 Nov 2024

https://github.com/stellar/stellar-etl

Stellar ETL will enable real-time analytics on the Stellar network

bitcoin blockchain data-analysis ethereum etl-framework etl-pipeline stellar stellar-lumens stellar-network

Last synced: 06 Nov 2024

https://github.com/inphyt/covid19-italy-integrated-surveillance-data

COVID-19 integrated surveillance data provided by the Italian Institute of Health and processed via UnrollingAverages.jl to deconvolve the weekly moving averages.

covid-19 covid19-data data data-analysis data-structures data-visualization data-wrangling database dataset epidemiological-data epidemiology italy italy-data italy-dataset open-data surveillance surveillance-data time-series time-series-analysis

Last synced: 12 Nov 2024

https://github.com/prakhar-ff13/customer-analytics

Machine Learning Case study on customer segmentation and prediction of groups.

analytics case-study data-analysis data-science data-visualization dimensionality-reduction machine-learning python python3

Last synced: 30 Nov 2024

https://github.com/kwokhing/yandexcatboost-python-demo

Demo of the capabilities of the Yandex CatBoost gradient boosting classifier on a fictitious IBM HR dataset obtained from Kaggle. Data exploration, cleaning, preprocessing, and model tuning are performed on the dataset.

catboost data-analysis data-preprocessing data-science feature-selection gradient-boosting gradient-boosting-classifier one-hot-encode pandas pearson-correlation python python27 seaborn variance-analysis visualization yandex-catboost

Last synced: 22 Dec 2024

https://github.com/alexandroskyriakakis/strongappanalytics

Analytics and Charts for your fitness log exported from the Strong App (AppStore/PlayStore).

chartjs data-analysis data-representation fitness-log nodejs pyplot python-analytics react rsuitejs strong strong-app

Last synced: 15 Nov 2024

https://github.com/davidchall/ipaddress

Data analysis for IP addresses and networks

cyber data-analysis ip-address ipv4 ipv6 r vctrs

Last synced: 04 Dec 2024

https://github.com/petersontylerd/mlmachine

mlmachine accelerates machine learning experimentation

data-analysis data-science data-visualization machine-learning python

Last synced: 13 Nov 2024

https://github.com/spidy20/data-scince-ml-project

In this repository I created many data science and machine learning projects (such as Deep Dream, weather prediction, and a movie recommender system), with code and datasets.

data-analysis data-analytics deep-dream machine-learning matplotlib number-recognition python recommender-system songs songs-data-analysis stock-market-prediction

Last synced: 15 Nov 2024

https://github.com/cdhunt/pselect

PowerShell DSL for aggregating data

data-analysis dsl powershell powershell-module

Last synced: 28 Oct 2024

https://github.com/hackersandslackers/pandas-sqlalchemy-tutorial

🐼 💻 Load or insert data into a SQL database using Pandas DataFrames.

data-analysis data-science dataframes pandas pandas-sqlalchemy-tutorial python sql-database sqlalchemy tutorial

Last synced: 16 Nov 2024

https://github.com/vvzen/houdini-geospatial-tools

Tools for geospatial exploration in Houdini (IPython notebooks + GeoJSON Python library)

data-analysis data-visualization geojson geospatial geotiff houdini python27

Last synced: 22 Dec 2024

https://github.com/alexbykoff/datafield

Sort, select, filter, evaluate and perform maths on your arrays of data

arrays collections data-analysis data-structures filtering sorting

Last synced: 09 Nov 2024

https://github.com/rshkarin/quanfima

Quanfima (Quantitative Analysis of Fibrous Materials)

data-analysis material-science morphological-analysis volumetric-data

Last synced: 14 Nov 2024

https://github.com/alessandrocorradini/harvard-data-science-professional

Repository for the Data Science Professional Program from Harvard University on edX

data-analysis data-science datascience edx harvardx machine-learning machinelearning mooc moocs r r-language

Last synced: 22 Nov 2024

https://github.com/anselmoo/spectrafit

📊📈🔬 SpectraFit is a command-line and Jupyter-notebook tool for quick data-fitting based on the regular expression of distribution functions.

console-application curve-fitting data-analysis data-analysis-python data-science data-visualization fitting juypter-notebook python science science-research scientific-plotting spectral-analysis spectroscopy

Last synced: 20 Dec 2024

https://github.com/mainakrepositor/data-analysis

Different types of data analytics projects: EDA, PDA, DDA, TSA, and much more.

data-analysis data-science deeplearning machine-learning-algorithms neural-networks time-series-analysis tsa

Last synced: 12 Nov 2024

https://github.com/davidgasquez/filecoin-data-portal

🧮 Open and local-first data hub for Filecoin!

data-analysis data-platform filecoin

Last synced: 14 Nov 2024

https://github.com/computationalcore/introduction-to-python

A very useful collection of Jupyter Notebooks, which aims to introduce the Python programming language.

data-analysis data-science fundamental google-colab jupyter-notebook jupyter-notebooks numpy pandas python python-language python-programming python3

Last synced: 10 Nov 2024

https://github.com/theengineeringworld/python-data-science

Python Data Science has all the data sets and Jupyter notebook files for the YouTube course at http://youtube.com/theengineeringworld under the name "Python Data Science Course".

data data-analysis data-mining data-science data-visualization jupyter-notebook jupyter-notebooks machine-learning python python27

Last synced: 22 Dec 2024

https://github.com/mkcor/advanced-pandas

Pandas is a powerful tool for data exploration and analysis (including timeseries).

data-analysis data-science labeled-data notebooks python3 teaching-materials

Last synced: 16 Oct 2024

https://github.com/activitywatch/aw-research

Tools to analyse and experiment with ActivityWatch data

activitywatch data-analysis python quantified-self

Last synced: 08 Nov 2024

https://github.com/staircase-dev/piso

Pandas Interval Set Operations: providing methods for set operations, analytics, lookups and joins on pandas' Interval, IntervalArray and IntervalIndex

data-analysis data-science data-structures interval interval-arithmetic interval-set pandas set set-operations set-theory

Last synced: 19 Dec 2024

https://github.com/csbiology/fsharpgephistreamer

F# functions for streaming any kind of graph/network data to the network visualization tool Gephi

data-analysis exploratory-data-analysis fsharp gephi graph-visualization streaming-graph-data visualization

Last synced: 04 Dec 2024

https://github.com/pravj/ospi

Open Source Presence Infographic of Indian Startups

data-analysis data-visualization india open-source startup

Last synced: 14 Oct 2024

https://github.com/PiotrZakrzewski/merge-chance

Source code of https://merge-chance.info

analysis data data-analysis open-source

Last synced: 29 Oct 2024

https://github.com/goplus/pandas

Flexible and powerful data analysis / manipulation library for Go+, providing labeled data structures similar to R data.frame objects, statistical functions, and much more

data-analysis data-science data-tech go golang gop goplus pandas scientific-computing

Last synced: 12 Nov 2024

https://github.com/gagolews/genie

Genie: A Fast and Robust Hierarchical Clustering Algorithm (this R package has now been superseded by genieclust)

cluster cluster-analysis clustering data-analysis data-mining data-science datascience genie hierarchical-clustering-algorithm machine-learning machine-learning-algorithms outliers r

Last synced: 22 Nov 2024

https://github.com/itzmeanjan/chanalyze

A simple WhatsApp Chat Analyzer (for both private and group chats), made with ❤️

chat-analysis data-analysis datascience dataviz matplotlib python3 visualization whatsapp whatsapp-chat whatsapp-chat-analyzer

Last synced: 30 Sep 2024

https://github.com/rfordatascience/r4dswebsite

Public repository for the R4DS community website.

blogdown data-analysis data-analytics data-science data-visualization r r4ds tidyverse

Last synced: 14 Nov 2024

https://github.com/simfg/etcd-analysis

🔦 Etcd Data Analysis Tool

data-analysis etcd go raft

Last synced: 06 Dec 2024

https://github.com/danvk/march-madness-data

NCAA brackets in JSON form

data-analysis ncaa-basketball sports

Last synced: 14 Nov 2024

https://github.com/skyzh/Meteor

🚆 Fine-grained analysis and visualization of Hangzhou Metro for efficient travel in the metro system. Project report, slides, and presentation video included.

cmake data-analysis hangzhou metro qt sqlite visualize

Last synced: 07 Nov 2024