Ecosyste.ms: Awesome
An open API service indexing awesome lists of open source software.
Data Science
Data science is an inter-disciplinary field that uses scientific methods, processes, algorithms, and systems to extract knowledge from structured and unstructured data. Data scientists perform data analysis and preparation, and their findings inform high-level decisions in many organizations.
- GitHub: https://github.com/topics/data-science
- Wikipedia: https://en.wikipedia.org/wiki/Data_science
- Related Topics: data-analysis, data-mining, machine-learning, big-data, data-visualization,
- Aliases: datasciences, data-science-project, data-science-algorithm,
- Last updated: 2024-07-29 13:36:33 UTC
- JSON Representation
https://github.com/jmbhughes/goes_solar_retriever
Tool to retrieve GOES-R Solar Data
data data-retrieval data-science goes-16 goes-satellite goes16 goes17 solar solar-physics
Last synced: 01 Aug 2024
https://github.com/pharo-ai/a-priori
Implementation of A-Priori algorithm in Pharo
apriori-algorithm data-mining data-science frequent-itemsets pharo
Last synced: 03 Aug 2024
https://github.com/DeutscheAktuarvereinigung/Data_Science_Challenge_2020_Betrugserkennung
In this notebook we take a look at a relevant project that is frequently encountered by insurers: Fraud Detection. For this purpose we use a car data set from a public source and will show the necessary steps to establish an automated fraud detection.
actuarial-modeling betrugserkennung challenge data-science datasciencechallenge fraud-detection frauddetection
Last synced: 08 Aug 2024
https://github.com/jimbrig/property_allocation_demo
Dynamic Risk Allocation Model - bringing simplicity to the complex realm of Property Insurance.
actuarial-science allocation dashboard data-science insurance property-management r r-shiny workflow
Last synced: 13 Aug 2024
https://github.com/shipyardapp/postgresql-blueprints
Simplified blueprints for building data pipelines with PostgreSQL.
cli data-analysis data-engineering data-pipeline data-science database elt etl postgres postgresql
Last synced: 13 Aug 2024
https://github.com/navdeep-G/h2o3-gapstat
Estimating the number of clusters in a data set via the gap statistic. Implemented in H2O-3
big-data clustering data-science datascience distributed gap-statistic h2o-3 java kmeans kmeans-clustering machine-learning multithreading open-source
Last synced: 04 Aug 2024
https://github.com/nysportsfan/Gun-Violence-in-the-US
This repository contains all the relevant files for my first capstone project as part of the Springboard Data Science Career Track.
data-analysis data-science data-visualization machine-learning python3 statistics
Last synced: 03 Aug 2024
https://github.com/navdeep-G/h2o3-pam
Implementation of Partitioning Around Medoids (PAM) in H2O-3
big-data clustering clustering-algorithm data-science distributed h2o h2o-3 java machine-learning medoids multithreading pam unsupervised-learning
Last synced: 04 Aug 2024
https://github.com/ekagra-ranjan/Analyze-This-17
Analyze This' 17, American Express Data Science Competition ('Outstanding Performer')
american-express americanexpress analyze-this boosting-algorithm corelib credit-card cross-validation data-science data-science-challenges data-science-competition ensemble feature-engineering feature-extraction feature-selection gradient-boosting-machine random-forest sklearn svm xgb-classifier xgboost
Last synced: 03 Aug 2024
https://github.com/millyleadley/breast-cancer-genes
Predicting survival outcome in breast cancer patients based on their gene expression
breast-cancer breast-cancer-prediction breastcancer-classification data-science logistic-regression python svm
Last synced: 04 Aug 2024
https://github.com/garynth41/John-Hopkins-University-Mastering-Software-Development-in-R
This repository covers R software development for building data science tools. As the field of data science evolves, it has become clear that software development skills are essential for producing useful data science results and products. You will obtain rigorous training in the R language, including the skills for handling complex data, building R packages and developing custom data visualizations. You will learn modern software development practices to build tools that are highly reusable, modular, and suitable for use in a team-based environment or a community of developers.
data-science debugging development-practices development-skills modular namespace package-manager software-development software-testing yml-reference
Last synced: 13 Aug 2024
https://github.com/gbganalyst/API-in-R-and-Python
This API tutorial will teach you how to fetch data from an external source using HTTP requests and parse the data into a usable format
api data-science python quarto rprogramming
Last synced: 02 Aug 2024
https://github.com/Rilfanayasmin/Raven-coding
Practice programming with R and Python on all data science algorithms
data-science machine-learning-algorithms predictive-modeling r-programming statistical-models
Last synced: 13 Aug 2024
https://github.com/firefly-cpp/snail-dataset
computer-vision data-science object-detection snails
Last synced: 01 Aug 2024
https://github.com/shipyardapp/amazonathena-blueprints
Simplified blueprints for building data pipelines with Amazon Athena.
amazon-athena athena cli data-analysis data-engineering data-science elt etl
Last synced: 13 Aug 2024
https://github.com/erinaldi/bmn2-lattice
Data analysis of lattice Monte Carlo simulations of quantum matrix models.
data data-science data-visualisation lattice
Last synced: 31 Jul 2024
https://github.com/ameygawade/streamlit-robots_txt_generator
This Streamlit app allows users to generate and customize a robots.txt file by selecting user-agents, specifying disallowed paths, enabling crawler delay, and providing a sitemap URL.
config data-science front generative generator google robots-txt search-algorithm search-engine seo seo-optimization stream streamlit txt-files web webapp webapplication
Last synced: 13 Aug 2024
https://github.com/jimbrig/lossdevt
An R package and Shiny App for Actuarial Loss Development and Reserving
actuarial-science claims-reserving data-science insurance property-casualty rpackage rshiny rstats workflow
Last synced: 13 Aug 2024
https://github.com/ekagra-ranjan/GS-Quantify-17
GS-Quantify' 17, Goldman Sachs Data Science Competition
boosting-algorithm cross-validation data-science data-science-challenges data-science-competition ensemble feature-engineering feature-extraction feature-selection goldmann-sachs gradient-boosting-machine gs-quantify gsquantify linear-regression sklearn xgb-classifier xgboost
Last synced: 03 Aug 2024
https://github.com/DataHerb/dataherb-flora
DataHerb Flora: The core of DataHerb
data data-mining data-science datascience dataset datasets
Last synced: 03 Aug 2024
https://github.com/vchagas69/vchagas69.github.io
A Portuguese Actuary
actuarial-science data-science
Last synced: 08 Aug 2024
https://github.com/jxtngx/lightning-lab
hackable boilerplate for PyTorch Lightning driven deep learning research
artificial-intelligence data-science deep-learning fastapi machine-learning python pytorch pytorch-lightning reinforcement-learning streamlit wandb
Last synced: 12 Sep 2024
https://github.com/github/CodeSearchNet
Datasets, tools, and benchmarks for representation learning of code.
bert cnn data data-science datasets deep-learning machine-learning machine-learning-on-source-code ml natural-language-processing neural-networks nlp nlp-machine-learning open-data programming-language-theory python representation-learning rnn self-attention tensorflow
Last synced: 30 Jul 2024
https://github.com/TheoLvs/data-science-VR
Data Science experiments in Virtual Reality
data-science machine-learning virtual-reality
Last synced: 29 Jul 2024
https://github.com/iworeushankaonce/rimanalytics
Flask web application for calculation and analysis of the Rorschach Inkblot Method
data-science dataanalysis flask flask-backend psychodiagnostics psychology psychometics psychometrics psychotherapy python python3 rorschach
Last synced: 03 Aug 2024
https://github.com/softwares-com-clevertonrocha/CKR-Joinville
Developer Cleverton Rocha ferramenta simples, rápida e poderosa para criação blog. Você escreve postagens em Markdown (ou outras linguagens) e o Hexo gera arquivos estáticos com um lindo tema em segundos. Instalação Demora apenas alguns minutos para configurar o Hexo. Se você encontrar um problema e não conseguir encontrar a solução aqui, por favor abra uma issue no GitHub e vamos tentar resolvê-lo.
android-application blockchain blog bootstrap bootstrap4 css3 data-science express flutter frontend git github go hexo html-css html5 ios javascript json nodejs
Last synced: 01 Sep 2024