Ecosyste.ms: Awesome
An open API service indexing awesome lists of open source software.
Data Science
Data science is an inter-disciplinary field that uses scientific methods, processes, algorithms, and systems to extract knowledge from structured and unstructured data. Data scientists perform data analysis and preparation, and their findings inform high-level decisions in many organizations.
- GitHub: https://github.com/topics/data-science
- Wikipedia: https://en.wikipedia.org/wiki/Data_science
- Related Topics: data-analysis, data-mining, machine-learning, big-data, data-visualization
- Aliases: datasciences, data-science-project, data-science-algorithm
- Last updated: 2024-11-19 00:06:52 UTC
https://github.com/leerenjie/100-days-of-code-in-python
Angela Yu's Udemy course has 100 projects, one for each day, with about 2 hours of classes per day. This repository stores all the related projects.
100-days-of-code api backend-webdevelopment data-science database flask frontend-web game-development version-control
Last synced: 07 Nov 2024
https://github.com/jonmrowczynski/excel-2-csv-exporter
Command Line Interface script to export one or more Excel Workbooks to CSVs where each CSV contains data from one Worksheet.
command-line command-line-app csv csv-converter csv-export data data-acquisition data-science excel excel-to-csv executable executable-file openpyxl pycharm pycharm-community pycharm-ide pyinstaller python python-3 python-script
Last synced: 09 Oct 2024
https://github.com/bartczernicki/ArtificialIntelligence-Presentations
Public location of delivered Artificial Intelligence & Machine Intelligence Presentations
analytics artificial-intelligence data-science machine-learning
Last synced: 09 Nov 2024
https://github.com/mlr-org/bbotk
Black-box optimization framework for R.
bbotk black-box-optimization data-science hyperparameter-optimization hyperparameter-tuning machine-learning mlr3 optimization r r-package
Last synced: 30 Oct 2024
https://github.com/tjmahr/polypoly
Helper functions for orthogonal polynomials in R
Last synced: 12 Nov 2024
https://github.com/Grasia/WikiChron
Data visualization tool for wiki evolution
analyzer data-analysis data-science data-visualization datascience dump evolution graphs history history-dump mediawiki-wikis plot research-tool time-series visualization web-service wiki wikia wikimedia wikis
Last synced: 04 Nov 2024
https://github.com/deeptiman/go-batch
A Simple Batch Processing library in Go
batch-processing batch-reader concurrency concurrent-programming data-science go-library go-modules golang golang-channel golang-concurrency golang-library golang-tools parallel-computing parallel-processing parallel-programming supply-chain-data-science workers
Last synced: 08 Nov 2024
https://github.com/catdevnull/preciazo
Price analysis for retail supermarkets. Constantly evolving. https://preciazo.nulo.in
data data-science price-tracker scraper supermarket
Last synced: 27 Oct 2024
https://github.com/hfawaz/miccai18
Evaluating surgical skills from kinematic data using convolutional neural networks
class-activation-maps cnn cnn-keras data-science deep-learning research-paper surgery surgical time-series-classification
Last synced: 06 Nov 2024
https://github.com/raamana/missingdata
Missing data handling: visualize and impute
biostatistics data-science dirty-data epidemiology imputation machine-learning missing-data missing-values neuroscience visualization
Last synced: 14 Oct 2024
https://github.com/adamvvu/tsfracdiff
Efficient and easy to use fractional differentiation transformations for stationarizing time series data in Python.
data-science machine-learning python quantitative-finance
Last synced: 12 Nov 2024
https://github.com/brakmic/data-science-for-losers
📈 Articles on Data Science, Jupyter, and Pandas
data-science jupyter machine-learning python
Last synced: 08 Nov 2024
https://github.com/facultyai/faculty
A Python library for interacting with the Faculty platform
data-science faculty-platform python
Last synced: 08 Nov 2024
https://github.com/kklemon/keras-loves-torchtext
Make Torchtext work with Keras.
data-science deep-learning keras natural-language-processing pytorch tensorflow torchtext
Last synced: 10 Nov 2024
https://github.com/amitkaps/multidim
Visualising Multi Dimensional Data
data-science data-visualization grammar python r visualization
Last synced: 06 Nov 2024
https://github.com/jmshea/foundations-of-data-science-with-python
Interactive flashcards and quizzes, as well as additional tutorials, animations, and code, for "Foundations of Data Science with Python" by John M. Shea
data-science data-visualization probability statistics statistics-course
Last synced: 07 Nov 2024
https://github.com/lettier/interactive-simple-linear-regression
A PureScript, browser-based implementation of simple linear regression.
ai artificial-intelligence data-science frontend functional functional-programming gradient-descent halogen linear-regression machine-learning machine-learning-algorithms nueral-networks press-statistic purescript purescript-halogen regression statistics web-development
Last synced: 30 Oct 2024
https://github.com/ritvik19/implemented-data-science
Implementation of various data science techniques and research papers
artificial-neural-networks classification computer-vision convolutional-neural-network data-science deep-learning generative-adversarial-network machine-learning natural-language-processing natural-language-understanding recurrent-neural-networks regression transfer-learning transformer
Last synced: 13 Oct 2024
https://github.com/csinva/data-viz-utils
Functions for easily making publication-quality figures with matplotlib.
big-data data-analysis data-science data-visualization eda legend matplotlib python python3 scatterplot time-series
Last synced: 09 Nov 2024
https://github.com/laurentrdc/javelin
Haskell implementation of series, or labeled one-dimensional arrays.
data-science data-structures-and-algorithms haskell quantitative-finance
Last synced: 02 Nov 2024
https://github.com/mkearney/tfse
🛠 Useful R functions for various things
data-science functions mkearney-r-package r-language rstats utility
Last synced: 15 Nov 2024
https://github.com/ahmetfurkandemir/online-istanbul-applied-data-science-102-bootcamp
Online Istanbul Applied Data Science 102 Bootcamp (Start: 15 August, Finish: 7 November)
bootcamp data-science deep-learning kodluyoruz machine-learning
Last synced: 16 Nov 2024
https://github.com/polyaxon/cli
Polyaxon Core Client & CLI to streamline MLOps
data-science dataops deep-learning hyperparameter-optimization kubernetes machine-learning ml mlops pytorch scikit-learn tensorflow workflows
Last synced: 16 Nov 2024
https://github.com/mainakrepositor/breast-cancer-detector
Detect Breast Cancer using ANN and Random Forest
ann breast-cancer-detection data-science machine-learning project python random-forest streamlit
Last synced: 12 Nov 2024
https://github.com/mauroluzzatto/explainy
explainy is a Python library for generating machine learning model explanations for humans
data-science explanation machine-learning machine-learning-explainability python scikit-learn
Last synced: 11 Nov 2024
https://github.com/gyrdym/ml_dataframe
A way to store and manipulate data
data-science dataframe datascience dataset toy-dataset toy-datasets
Last synced: 28 Oct 2024
https://github.com/yufree/datadown
Fragments of Data Analysis
bookdown chinese-simplified data-science r statistics
Last synced: 27 Oct 2024
https://github.com/smathot/eeg_eyetracking_parser
Python routines for parsing of combined EEG and eye-tracking data
data data-science eeg eye eye-tracking mne pupillometry python
Last synced: 07 Nov 2024
https://github.com/benedekrozemberczki/FSCNMF
An implementation of "Fusing Structure and Content via Non-negative Matrix Factorization for Embedding Information Networks".
analytics artifical-intelligence data-mining data-science deepwalk embedding factorization-machine fscnmf graph-embedding graph2vec information-network machine-learning network-embedding nmf node-embedding node2vec pca regularization word2vec word2vec-model
Last synced: 08 Nov 2024
https://github.com/viveckh/new-ml-data-science-framework-tutorials-by-ej
Internet's Most Popular Tutorials on Fresh-off-the-shelf ML & Data Science Technologies, Authored by Yours Truly.
crash-course data-science directed-acyclic-graph facebookai hiplot hiplot-tutorial ibm-qiskit machine-learning metaflow metaflow-tutorial netflix python python-library qiskit-tutorial qiskit-workshop-materials quantum-computing quantum-programming tutorials
Last synced: 27 Oct 2024
https://github.com/dataship/frame
A DataFrame for Javascript
data-frame data-science javascript statistics
Last synced: 19 Nov 2024
https://github.com/alastairrushworth/tdf
🚴🏅📊 Tour de France winners and stages data
data-science dataframe exploratory-data-analysis rstats tdf tour-de-france
Last synced: 14 Oct 2024
https://github.com/MindSetLib/Insolver
Low-code machine learning library specialized for insurance tasks: prepare data, build models, and deploy to production.
auto-ml automated-machine-learning automl bayesian-optimization data-science elyra elyra-community feature-engineering hyperparameter-optimization insurance insurance-claims insurance-company insurance-scoring insurance-team low-code machine-learning
Last synced: 08 Aug 2024
https://github.com/bcgov/groundwater-levels-indicator
R scripts for an indicator on long-term trends in groundwater levels in B.C. published on Environmental Reporting BC
Last synced: 08 Aug 2024
https://github.com/robinlovelace/opengeohub2023
Content for lecture at OpenGeoHub 2023 on spatial data and the tidyverse
course data-science opengeohub osgeo practical r reproducible summer-school tidy-data
Last synced: 27 Oct 2024
https://github.com/madsjulia/biguq.jl
Bayesian Information Gap Decision Theory
bayesian bayesian-data-analysis data-driven data-science decision-analysis decision-making decision-model decision-support decision-theory experimental-design high-performance-computing information-gap information-theory julia mads model-analysis model-driven predictive-analysis uncertainty-quantification
Last synced: 09 Nov 2024
https://github.com/morgan-sell/caiso-price-forecast
Predicts the CAISO day-ahead market hourly prices using different forecasting methods including ARIMA and LSTM.
arima data-science electricity-prices lstm neural-networks python time-series
Last synced: 23 Oct 2024
https://github.com/neonwatty/control-notes
Notes on topics ranging from Recurrent Networks to Automatic Control and Reinforcement Learning
automatic-control data-science deep-learning dynamic-programming dynamic-systems jupyter-notebook lecture-notes machine-learning python recurrent-neural-networks reinforcement-learning
Last synced: 27 Oct 2024
https://github.com/noahgift/core-stats-datascience
Core Statistics for Datascience
core data-science pragmaticai statistics
Last synced: 11 Oct 2024
https://github.com/nirala96/bangalore-house-prediction-app
Predicts home prices of Bangalore. Used Flutter, Flask and Jupyter Notebook.
data-science datacleaning exploratory-data-analysis flask-api flutter jupyter-notebook linear-regression python
Last synced: 28 Oct 2024
https://github.com/fbruzzesi/sklearn-smithy
Toolkit to forge scikit-learn compatible estimators
cli data-science machine-learning python scikit-learn webui
Last synced: 10 Oct 2024
https://github.com/wibeasley/ranalysisskeleton
Files and settings commonly used in analysis projects with R
Last synced: 27 Oct 2024
https://github.com/balavenkatesh3322/deep-learning-notebook
A collection of deep learning notebooks for learning and practicing.
data-science deep-learning juypter-notebook juypterhub machine-learning neural-network notebook python3 pytorch tensorflow
Last synced: 10 Nov 2024
https://github.com/wlandau/targetsketch
Sketch a pipeline of targets in an interactive web app
data-science high-performance-computing pipeline r reproducibility rstats shiny targets workflow
Last synced: 27 Oct 2024
https://github.com/esoxjem/algorithms
Algos and Data Structures
algorithms data-science dsa java kotlin
Last synced: 05 Nov 2024
https://github.com/vatshayan/b.tech-project-rainfall-predication-in-india
Rainfall Prediction using Machine Learning. India Rainfall Prediction for 115 years. Rainfall Project with Code and Documents
artificial-intelligence btech-project data data-analysis data-mining data-science data-visualization datascience datasets final final-project final-year-project finalproject finalyearproject machine-learning machine-learning-algorithms machinelearning rainfall-prediction semester-project
Last synced: 11 Oct 2024
https://github.com/sominw/kaggle
Data Analysis using datasets from Kaggle
data-analysis data-science data-science-portfolio exploratory-data-analysis ipython-notebook jupyter-notebook kaggle-competition machine-learning machine-learning-algorithms
Last synced: 11 Oct 2024
https://github.com/techforuk/my_eu
Code and data for myeu.uk - find out what the EU has done for your area
brexit data-science google-maps-api ipython-notebook javascript python static-site webpack
Last synced: 11 Oct 2024
https://github.com/nhatsmrt/nn-toolbox
A toolbox of commonly used deep learning components, procedures and applications
data-science deep-learning machine-learning neural-networks python pytorch
Last synced: 13 Oct 2024
https://github.com/scicloj/tablecloth.time
Tools for the processing and manipulation of time-series data in Clojure.
clojure data-processing data-science dataset scicloj tablecloth time-series
Last synced: 15 Nov 2024
https://github.com/njlyon0/dndr
Dungeons & Dragons Functions for Players and Dungeon Masters
data-science dungeons-and-dragons r-package ttrpg
Last synced: 27 Oct 2024
https://github.com/navdeep-g/interpretable-ml
Techniques & resources for training interpretable ML models, explaining ML models, and debugging ML models.
accountability data-mining data-science decision-trees fairness fatml gradient-boosting-machine iml interpretability interpretable interpretable-ai interpretable-machine-learning interpretable-ml lime machine-learning machine-learning-interpretability python transparency xai
Last synced: 06 Nov 2024
https://github.com/giswqs/geog-414-fall2022
Spatial Data Management with Google Earth Engine
data-science data-visualization earthengine geemap geospatial google-earth-engine jupyter open-source streamlit
Last synced: 09 Nov 2024
https://github.com/amitkaps/datascience
Build and Deploy Machine Learning Models on the Cloud
cloud data-science machine-learning python
Last synced: 06 Nov 2024
https://github.com/melling/data-science-from-scratch-swift
Data Science from Scratch Implemented in Swift
Last synced: 09 Nov 2024
https://github.com/psyplot/psy-view
An ncview-like GUI with psyplot
data-science gui netcdf psyplot visualization
Last synced: 08 Nov 2024
https://github.com/simbafl/interview-notes
Python notes
data-science hadoop hive machine-learning python spark
Last synced: 06 Nov 2024
https://github.com/codait/pardata
artificial-intelligence data-science dataset machine-learning python
Last synced: 16 Nov 2024
https://github.com/anilkumarteegala/wqu-ds-unit-2
This repo contains all the files and material related to WorldQuant University's Data Science Summer 2020 Session Unit 2: Machine Learning and Statistical Analysis
data-science machine-learning statistical-analysis wqu
Last synced: 13 Nov 2024
https://github.com/joaocarabetta/osm-road-length
Calculate OpenStreetMap road length for any polygon
data-science osm python urban-analytics urban-data-science
Last synced: 12 Nov 2024
https://github.com/kulbachcedric/evoautoml
automl data-science incremental-learning machine-learning online-learning python
Last synced: 13 Nov 2024
https://github.com/tommyod/generalized-additive-models
Generalized Additive Models in Python.
data-science gam glm statistical-inference statistical-models statistics
Last synced: 11 Nov 2024
https://github.com/sap-samples/btp-data-to-value-workshop
This repo contains a dataset, exercises, and sample code for an end-to-end SAP BTP data-to-value bootcamp covering SAP HANA Cloud, SAP Data Warehouse Cloud, SAP Data Intelligence Cloud, and SAP Analytics Cloud.
advanced-analytics analytics data-management data-orchestration data-science data-to-value machine-learning predictive-planning sample sample-code sap-analytics-cloud sap-btp sap-data-intelligence-cloud sap-data-warehouse-cloud sap-hana-cloud workshop
Last synced: 15 Nov 2024
https://github.com/ndleah/people-data-analysis
👥 Employee analysis #DWD
analytics data-analysis data-science historical-data materialized-view slow-changing-dimension sql
Last synced: 13 Nov 2024
https://github.com/hoangsonww/north-carolina-household-analysis
🏠 This repository contains data analysis scripts for the 2022 American Community Survey (ACS) focusing on individuals aged 25 and over in North Carolina, based on 75,340 observations. This repository offers valuable insights into demographic and economic patterns across North Carolina's urban areas.
confidence-interval confidence-score data data-analysis data-analytics data-science data-visualization ggplot2 hypothesis-testing hypothesis-tests north-carolina r r-language r-programming stata
Last synced: 14 Nov 2024
https://github.com/blazingdb/welcome_to_blazingsql_notebooks
RAPIDS data science. No setup required.
blazingsql blazingsql-notebooks dask data-science data-visualization demos gpu jupyterlab machine-learning notebooks rapids sql
Last synced: 15 Nov 2024
https://github.com/flyteorg/flytekit-python-template
CookieCutter template for getting started with Flyte python projects
cookiecutter cookiecutter-template data-science docker extensible flyte flytekit-python getting-started machine-learning project-structure python3
Last synced: 11 Nov 2024
https://github.com/charlesaverill/satyrn
A Notebook alternative that supports branching code and local collaboration.
data-science full-stack ide jupyter-notebook machine-learning open-source python web-development
Last synced: 14 Nov 2024
https://github.com/mmbazel/classifying-sales-calls
Turning Salesforce lead, opportunity, and sales activity data into sales predictions using pandas, scikit-learn, SQLAlchemy, Redshift, and an XGBoost classifier
capstone classification classifying-sales-calls data-science data-science-notebook data-science-portfolio data-science-projects data-visualization gradient-boosting logistic-regression machine-learning saas sales springboard springboard-career-track springboard-data-science springboard-projects
Last synced: 18 Nov 2024
https://github.com/umuthopeyildirim/flatironopensource
Flatiron School lessons for graduated students.
computer-science data-science javascript python ruby web-development
Last synced: 12 Nov 2024
https://github.com/izam-mohammed/geminsights
🔍 GemInsights: Unleash Gemini AI on your data! 🚀 Analyze dataframes for valuable insights, replacing traditional data analysis. 📊 A tool that changes the way you analyze dataframes, offering a shift away from conventional data analysis methods.
ai autoviz data-science gemini gemini-api gemini-pro gemini-pro-vision google google-api llama-index llms python3 trulens vertex-ai
Last synced: 11 Nov 2024
https://github.com/amzn/rheoceros
Cloud-based AI / ML workflow and data application development framework
ai aws aws-emr aws-glue aws-lambda bring-your-own-account cloud data-science event-based feature-engineering flow low-code-framework machine-learning pyspark sagemaker-notebook sagemaker-notebook-instance scala-spark serverless spark
Last synced: 11 Nov 2024
https://github.com/bagussatoto/aplikasi-pembayaran-spp-berbasis-website
School tuition (SPP) payment application - CodeIgniter
bootstrap code-generation codeigniter css data-science database html pembaran php spp website
Last synced: 28 Oct 2024
https://github.com/azure/azuredsvm
AzureDSVM is an R package that provides a convenient harness for the Azure DSVM, remote execution of scalable and elastic data science work, and monitoring of on-demand resource consumption.
azure data-science data-science-virtual-machine r
Last synced: 30 Sep 2024
https://github.com/Jeniffen/projectr
Set up 📂-structure for data science projects
data-science package r rstats setup
Last synced: 13 Aug 2024
https://github.com/lvalnegri/workshops-setup_cloud_analytics_machine
Tips and tricks to set up a cloud machine for Analytics and Data Science with R, RStudio and Shiny Servers, Python and JupyterLab
analytics cloud dashboard data-science docker dockerfile jupyterlab linux machine-learning python r raspberry-pi rmarkdown rstats rstudio-server scipy shiny shiny-apps shiny-server ubuntu
Last synced: 13 Aug 2024
https://github.com/imetomi/tiny-ann
Neural Network library in C
ann c-language data-science deep-learning deep-neural-networks library machine-learning network neural neural-network
Last synced: 12 Oct 2024
https://github.com/mine-cetinkaya-rundel/errormoji
®️ errors, in emoji
data-science education r rstats
Last synced: 27 Oct 2024
https://github.com/anuraganalog/datacamp
My solutions to DataCamp projects and courses (datacamp-exercises)
analysis business courses data-science datacamp datacamp-exercises datacamp-projects datacamp-python datacamp-slides finance jupyter-notebook learning machine r solutions sql statistics tableau theory
Last synced: 12 Oct 2024
https://github.com/astrojuanlu/workshop-jupyter-kedro
Hands-on workshop "Refactor your Jupyter notebooks into maintainable data science code with Kedro"
data-science jupyter-notebooks kedro python
Last synced: 13 Oct 2024
https://github.com/njanakiev/wikidata-mayors
Exploration of the Mayors in Europe with Wikidata and Python
data-science data-visualization deckgl python sparql wikidata
Last synced: 06 Nov 2024
https://github.com/deysuman/machinelearningstocks
Using python and scikit-learn to make stock predictions
data-science deep-learning deysuman finance historical-stock-fundamentals india machine-learning machine-learning-algorithms made-with-love math-with-python python python3 science scikit-learn sklearn stock-analysis stock-prediction yahoo-finance
Last synced: 12 Nov 2024
https://github.com/mindbeam/mindbase
A database for convergent intersubjectivity
data-science database language ontologies
Last synced: 06 Nov 2024
https://github.com/sayakpaul/floydhub-anomaly-detection-blog
Contains the thorough experiments made for a FloydHub article on Anomaly Detection
anomaly-detection data-science faker jupyter-notebook pyod python
Last synced: 09 Nov 2024
https://github.com/ngohungphuc/data-science-and-analytics
My Data Science and Analytics learning journey
Last synced: 06 Nov 2024
https://github.com/autonomio/astetik
Astetik takes away the pain of telling visual stories with data in Python
data-science descriptive-statistics jupyter matplotlib pandas seaborn visualization
Last synced: 06 Nov 2024
https://github.com/afondiel/cs-books
Computer science books covering algorithms, data structures, programming, data science, AI, and much more.
ai books computer-science computer-science-books computer-vision computer-vision-books data-science data-structures dl image-processing ml programming
Last synced: 06 Nov 2024
https://github.com/r-lum/luminescence
Development of the R package 'Luminescence'
bayesian-statistics data-science geochronology luminescence luminescence-dating open-science osl plotting r r-package radiofluorescence rstats tl xsyg
Last synced: 30 Oct 2024
https://github.com/autonomio/wrangle
A data transformation package for deep learning with Autonomio, Keras and TensorFlow.
data-science deep-learning etl keras resampling transformation wrangling
Last synced: 06 Nov 2024
https://github.com/firefly-cpp/niaarm
A minimalistic framework for Numerical Association Rule Mining
association-rule-mining association-rules data-mining data-science evolutionary-algorithms swarm-intelligence
Last synced: 07 Nov 2024
https://github.com/exasol/data-science-examples
Collection of data science and machine learning examples with Exasol
data-science exasol-integration
Last synced: 14 Nov 2024
https://github.com/kennethleungty/Text-to-Audio-with-Bark
Exploring Bark, the Open-Source Text-to-Audio Generative Model
ai artificial-intelligence bark data-science deep-learning gen-ai generative-ai machine-learning prompt-engineering speech text-prompt text-to-audio text-to-music text-to-sound text-to-speech
Last synced: 27 Oct 2024
https://github.com/lastancientone/data-science
Using Kaggle Data and Real World Data for Data Science and prediction in Python, R, Excel, Power BI, and Tableau.
algorithms data-analysis data-science data-visualization datascience deep-learning dimensionality-reduction excel exploratory-data-analysis exploratory-data-visualizations feature-engineering inferential-statistics kaggle kaggle-competiton machine-learning model-tuning powerbi prediction python3 r
Last synced: 06 Nov 2024
https://github.com/lettier/interactivekmeans
Interactive HTML canvas-based implementation of k-means.
ai cluster cluster-analysis clustering clustering-algorithm clustering-evaluation clustering-methods data-science interactive-kmeans kmeans kmeans-algorithm kmeans-clustering machine-learning machine-learning-algorithms scikit-learn
Last synced: 30 Oct 2024
https://github.com/thequackdaddy/r-openblas
64-bit R for Windows compiled with openblas
data-science mingw-w64 openblas r windows
Last synced: 08 Nov 2024
https://github.com/cscherrer/sossmlj.jl
SossMLJ makes it easy to build MLJ machines from user-defined models from the Soss probabilistic programming language
bayesian-inference data-science julialang mlj probabilistic-programming soss
Last synced: 20 Oct 2024
https://github.com/kongruksiamza/python-datascience
Teaching materials for Python - Data Science and Machine Learning work
data-analysis data-science numpy pandas python
Last synced: 09 Nov 2024