Ecosyste.ms: Awesome

An open API service indexing awesome lists of open source software.

Awesome Lists | Featured Topics | Projects

Data Science

Data science is an inter-disciplinary field that uses scientific methods, processes, algorithms, and systems to extract knowledge from structured and unstructured data. Data scientists perform data analysis and preparation, and their findings inform high-level decisions in many organizations.

https://github.com/earth-artificial-intelligence/earth_ai_book_materials

The repo contains the source code, notebooks, and technical resources that assist students to read the book Artificial Intelligence in Earth Science.

data-science earth-science machine-learning python

Last synced: 03 Aug 2024

https://github.com/mad-lab-fau/tpcp

Pipeline and Dataset helpers for complex algorithm evaluation.

algorithms biosignals data-management data-science machine-learning python

Last synced: 02 Aug 2024

https://github.com/codelibs/fione

Fione is Enterprise AI Platform

ai automl data-science machine-learning

Last synced: 31 Jul 2024

https://github.com/Absolventa/iruby-chartkick

Minimalistic wrapper around chartkick for using it within iruby

chartkick data-science iruby rubydatascience visualization

Last synced: 03 Aug 2024

https://github.com/strazto/mandrake

๐Ÿ“–๐Ÿ‰- Bring reading the manual ๐Ÿ“– closer to your drake ๐Ÿ‰ workflow ๐Ÿ”ฅ

data-science drake high-performance-computing makefile pipeline r r-package reproducibility reproducible-research rstats workflow

Last synced: 05 Aug 2024

https://github.com/numeract/rflow

Flexible R Pipelines with Caching

cache data-science pipeline r rflow

Last synced: 13 Aug 2024

https://github.com/mathewroy/ynabr

Analyze and visualize your You Need A Budget (YNAB) data. YNAB meets R programming language.

api data-analysis data-science data-visualization r ynab ynab-api

Last synced: 13 Aug 2024

https://github.com/qpwedev/blockchain-network-visualizer

Blockchain Network Visualizer for TON.

blockchain data-science network ton toncoin

Last synced: 30 Jul 2024

https://github.com/buccaneerai/rxjs-stats

Moved to @bottlenose/rxstats (https://github.com/buccaneerai/bottlenose)

analytics data data-mining data-science observables reactive rxjs statistics

Last synced: 05 Aug 2024

https://github.com/bcgov/canwqdata

R ๐Ÿ“ฆ to download ๐Ÿ‡จ๐Ÿ‡ฆ open water quality data

data-science env r r-package rlang rstats

Last synced: 08 Aug 2024

https://github.com/giswqs/leafmaptools

A Python package for building a tool widgets infrastructure with ipyleaflet and ipywidgets

data-science data-visualization geopython geospatial ipyleaflet ipywidgets jupyter jupyter-notebook mapping python

Last synced: 05 Aug 2024

https://github.com/jimbrig/lossrx

An R package, plumber API, database, and Shiny App for Actuarial Loss Development and Reserving Workflows.

actuarial-science claims-data claims-reserving data-science insurance modelling property-casualty reserving rpackage rshiny rstats workflow

Last synced: 13 Aug 2024

https://github.com/AurelienAubry/Spotlight

Spotlight is a Spotify dashboard that allows user to visualize his listening habits.

backend bootstrap chartjs data data-analysis data-science data-visualization flask frontend javascript js pandas python python3 react react-bootstrap spotify

Last synced: 01 Aug 2024

https://github.com/tuanle618/AEDA

AEDA - Automated Data Exploratory Analysis in R

data-science eda eda-report exploratory-data-analysis r

Last synced: 13 Aug 2024

https://github.com/katrienantonio/workshop-loss-reserv-fraud

Course material for a workshop on loss modelling, reserving and insurance fraud analytics

actuarial-science data-science insurance-claims

Last synced: 02 Aug 2024

https://github.com/tezansahu/dvc-pycaret-fastapi-demo

Repository for the Demo of using DVC with PyCaret & MLOps (DVC Office Hours - 20th Jan, 2022)

data-science demo deployment dvc fastapi machine-learning mlops-workflow pycaret

Last synced: 03 Aug 2024

https://github.com/bcgov/ghg-emissions-indicator

R scripts for a GHG emissions indicator published on Environmental Reporting BC

data-science env r rstats

Last synced: 03 Aug 2024

https://github.com/SamEdwardes/pydatafaker

A python package to create fake data with relationships between tables.

data data-science fake-data python

Last synced: 04 Aug 2024

https://github.com/MCodrescu/octopus

R Package for Interacting with Databases

data-science database r rshiny

Last synced: 13 Aug 2024

https://github.com/adityakamble49/loss-ratio-prediction

Predicting Loss Ratios for Auto Insurance Portfolios - ITCS 6100 Big Data Analytics for Competitive Advantage

big-data big-data-analytics data-science insurance jupyter-notebook politics python

Last synced: 08 Aug 2024

https://github.com/oceannetworkscanada/api-python-client

Provides easy access to ONC data in Python

api data-science ocean-sciences onc python

Last synced: 08 Aug 2024

https://github.com/wesslen/iviz-rstudio-workshop

Interactive Visualizations with RStudio Workshop for UNCC DSI

data-science htmlwidgets interactive-visualizations rstudio shiny shinyapps tidyverse

Last synced: 02 Aug 2024

https://cufctl.github.io/mlbd/

Repository for the machine learning / big data creative inquiry

data-science high-performance-computing machine-learning python tensorflow

Last synced: 31 Jul 2024

https://github.com/crdietrich/meerkat

Data acquisition for Raspberry Pi and Micropython

data-science drivers micropython raspberrypi

Last synced: 04 Aug 2024

https://github.com/rueedlinger/ml-resources

A curated list of statistics, data visualization and machine learning resources which in find useful, have read or want to read.

curated-list data-science data-visualization deep-learning machine-learning statistics

Last synced: 01 Aug 2024

https://github.com/ishijo/Taylor-Swift-Lyrics

Database (.txt and .csv) of all Taylor Swift Song Lyrics upto April'23

data-science dataset datasets nlp-machine-learning taylor-swift text-mining

Last synced: 12 Aug 2024

https://codeformunich.github.io/radlquartier/

Command-line tool to prepare and extract bike sharing data. Plus example implementations of visualizations and a example website.

data-science data-visualization munich open-data visualization

Last synced: 02 Aug 2024

https://github.com/psyplot/psyplot-gui

Graphical User Interface for the psyplot package

data-science gui interactive ipython psyplot qtconsole sphinx

Last synced: 31 Jul 2024

https://github.com/stappit/blog

I often post solutions to textbook exercises, including: Bayesian Data Analysis (BDA) by Gelman et al; Causal Inference in Statistics Primer (CISP) by Pearl et al; Purely Functional Data Structures (PFDS) by Okasaki.

bayesian-data-analysis blog data-analysis data-science gelman hakyll haskell pearl purely-functional-data-structures solutions stan static-site statistical-inference statistics

Last synced: 30 Jul 2024

https://github.com/coalio/Assistant

A data science library providing flexible dataframes for Lua 5.1+

data-analysis data-science data-structures dataframe lua

Last synced: 01 Aug 2024

https://github.com/sanvishal/Exoplanet-Explore

An Interactive data visualization of Exoplanets

animation d3js data-analysis data-science exoplanet python space visualization

Last synced: 01 Aug 2024

https://github.com/zMoooooritz/stapy

An easy to use SensorThings API Client written in Python

api cli data-science database ogc python sensor sensor-data sensorthings sensorthings-api

Last synced: 04 Aug 2024

https://github.com/ucdavisdatalab/workshop_web_maps

Learn to build an interactive web map to display spatial data

data-science geospatial-visualization teaching-materials ucdavis ucdavis-datalab workshop

Last synced: 05 Aug 2024

https://github.com/WaylonWalker/kedro-auto-catalog

Kedro catalog create with default configuration

data data-science kedro kedro-catalog kedro-hook kedro-plugin

Last synced: 31 Jul 2024

https://github.com/sdcastillo/PA-R-Study-Manual

An online study guide for the SOA's predictive analytics exam.

data-science data-visualization machine-learning predictive-modeling r-programming

Last synced: 02 Aug 2024

https://github.com/tsdataclinic/TREC

Transit Resilience for Essential Commuting (TREC)

climate-change data-science transit-data

Last synced: 08 Aug 2024

https://github.com/Badr-MOUFAD/cookiecutter-simple-DS-project

A simple cookiecutter template to structure your Data Science projects.

cookiecutter data-science project-structure python simple-ds-project

Last synced: 03 Aug 2024

https://github.com/pharo-ai/data-partitioners

Pharo library for partitioning a collection. Given a set of proportions (e.g. 50%, 30%, and 20%), it shuffles the collection and divides it into non-empty subsets in such a way that every element is included in exactly one subset. Can be used in machine learning and statistical analysis for splitting data into training, validation, and test sets.

data-science machine-learning pharo statistical-analysis

Last synced: 03 Aug 2024

https://github.com/carlos-gg/digitalgarden

Personal knowledge garden dedicated to AI, ML, AI for Earth Sciences, AI for good, Machine Learning and Data Science

artificial-intelligence data-science digital-garden knowledge-management machine-learning

Last synced: 07 Aug 2024

https://github.com/fmv1992/data_utilities

Data utilities library focused on machine learning and data analysis.

data-science utility-library

Last synced: 30 Jul 2024

https://github.com/RemiRigal/DatasetExplorer

A web tool for local dataset browsing and processing developped using the Flask + Angular stack.

ai angular data-processing data-science data-visualization dataset dataset-analysis docker docker-compose flask web-application

Last synced: 13 Aug 2024

https://github.com/lockedata/opentrainingcontent

An MIT & CCBY4.0 licensed repository of training materials from Locke Data

data-science open-course r-stats

Last synced: 13 Aug 2024

https://github.com/RDeconomist/RDeconomist.github.io

RapidCharts - a site for teaching and demonstrating Data Science and Visualisation techniques

data data-science data-visualization economics politics sports

Last synced: 01 Aug 2024

https://github.com/ZackAkil/friendlier-data-labelling

Code resources for generating a google form for labelling data.

data-science google google-apps-script google-forms google-sheets machine-learning

Last synced: 01 Aug 2024

https://github.com/techbastic/roadmaps

A curated list of resources to start your developer journey.

blockchain community data-science devops full-stack hacktoberfest open-source roadmaps

Last synced: 31 Jul 2024

https://github.com/spsanderson/steveondata

Repository for R and SQL tips and tricks for @steveondata every Friday

ai blog data data-science machinelearning-r ml r sql time-series tipoftheday

Last synced: 08 Aug 2024

https://github.com/beeva-jorgezaldivar/plumberModel

Create APIs for the deployment of R models with minimal code

api caret data-science deployment machine-learning plumber r

Last synced: 13 Aug 2024

https://github.com/UniversalDataTool/courseware

Create instructions for labeling datasets using the Universal Data Tool

annotators courseware data-science dataset hacktoberfest label

Last synced: 01 Aug 2024

https://github.com/thevasudevgupta/ds-toolkit

Some useful stuff for a software/ML engineer

data-science docker-notes dvc-for-data-science git-notes markdown-notes

Last synced: 13 Aug 2024

https://github.com/kozodoi/dptools

Python package with utilities for data processing, aggregation, feature engineering and data versioning

aggregation data-preparation data-preprocessing data-science feature-engineering python

Last synced: 03 Aug 2024

https://github.com/Rahulkumarr2080/Comcast-Telecom-Consumer-Complaints

Comcast is an American global telecommunication company. The firm has been providing terrible customer service. They continue to fall short despite repeated promises to improve. Only last month (October 2016) the authority fined them $2.3 million, after receiving over 1000 consumer complaints. The existing database will serve as a repository of public customer complaints file.

comcast-telcom-complaints data-science data-scientists data-visualization datascience datascience-with-python jupyter-notebook matplotlib numpy pandas python python-for-data-science rahul-kumar rahul-kumar-thakur

Last synced: 31 Jul 2024

https://github.com/yuval-a/deriveODM

DeriveODM is a reactive ODM - Object Document Mapper - framework, a "wrapper" around MongoDB, that removes all the hassle of data-persistence by handling it transparently in the background, in a DRY manner.

collection data data-mapper data-science database db document dry mapper mongo mongodb mongoose node nodejs object odm persistence persistent react reactive

Last synced: 01 Aug 2024

https://github.com/rstudio/rviews-community

RViews Community Site for Authors and Editors

blog community data-science open-source r r-programming

Last synced: 02 Aug 2024

https://github.com/WeR-stats/workshop-setup_cloud_machine_data_science

Step-by-step instructions on how to set up a virtual machine for Data Science usiing Cloud Infrastructures

cloud data-science dataops digitalocean jupyterlab python r r-shiny r-stats rstudio rstudio-server shiny-server

Last synced: 13 Aug 2024

https://github.com/bts-cm/airdrop_tool

Fetch & analyse blockchain tickets. View leaderboards and user tickets. Calculate and perform provably fair airdrops.

airdrop bitshares bitsharesjs blockchain crypto data-analysis data-science electron etl javascript nodejs react ticket tusc

Last synced: 03 Aug 2024

https://github.com/bcgov/safepaths

An R ๐Ÿ“ฆ to safely set & use a path to a private network

citz data-science r r-package rstats

Last synced: 13 Aug 2024

https://github.com/jimbrig/lossrunAnalyzer

R Package and Shiny App to Analyze Insurance Lossruns

actuarial data-analysis data-mining data-science insurance r record-linkage risk-management shiny

Last synced: 13 Aug 2024

https://github.com/Suji04/Chat_Entropy_Analysis

A simple python script to find and compare WhatsApp chat entropy

data-science entropy python3 shannon-entropy whatsapp

Last synced: 29 Jul 2024

https://github.com/tusharnankani/analysis-2.0

An Exhaustive WhatsApp Chat Data Analysis 2.0

analysis data data-science plots trends visualization

Last synced: 01 Aug 2024

https://github.com/testlnord/rdstemplate

Reproducible Data Science RStudio Project Template

data-science r reproducibility reproducible-research rstudio template

Last synced: 13 Aug 2024

https://github.com/dimgold/Higgs_data_mining

Higgs data mining project (2015)

data-mining data-science higgs kaggle neurolab sklearn

Last synced: 09 Aug 2024

https://github.com/casualcomputer/sql.mechanic

Functions that generate SQL queries that summarize high-dimensional tables stored in various databases (e.g. Microsoft SQL Servers, Netezza, DB2, Postgres, Oracle, MySQL, etc.).

data-analysis data-quality-checks data-science database mysql netezza oracle postgres quality-control r sql sql-server

Last synced: 13 Aug 2024

https://github.com/janskwr/Processing-of-structured-data

Processing of structured data - the third homework assignment/project!

big-data data-processing data-science data-table r stackexchange stringi xml

Last synced: 29 Jul 2024

https://github.com/memair/apps

App Store for Memair

apps appstore data data-science quantified-self

Last synced: 01 Aug 2024