Data Science
Data science is an inter-disciplinary field that uses scientific methods, processes, algorithms, and systems to extract knowledge from structured and unstructured data. Data scientists perform data analysis and preparation, and their findings inform high-level decisions in many organizations.
- GitHub: https://github.com/topics/data-science
- Wikipedia: https://en.wikipedia.org/wiki/Data_science
- Related Topics: data-analysis, data-mining, machine-learning, big-data, data-visualization,
- Aliases: datasciences, data-science-project, data-science-algorithm,
- Last updated: 2026-07-03 00:07:42 UTC
- JSON Representation
https://github.com/csfelix/csfelix.github.io
๐ฑ My Personal Portfolio ๐ฑ
css data-science data-science-competition data-science-portfolio data-science-projects html javascript js portfolio
Last synced: 05 Aug 2025
https://github.com/devopscorner/nifi
Production Grade Nifi & Nifi Registry. Deploy for VM (Virtual Machine) with Terraform + Ansible, Helm & Helmfile for Kubernetes (EKS)
ansible data-science data-structures docker docker-compose dockerhub ecr eks eks-cluster etl kubernetes machine-learning ml mlops nifi nifi-registry terraform vpn vpn-client
Last synced: 08 Sep 2025
https://github.com/hoangsonww/standard-deviation-calculator
๐ This repository contains a Standard Deviation Calculator implemented in C++. It provides an efficient algorithm for calculating the statistical standard deviation of a dataset, making it a valuable tool for students, researchers, and analysts seeking a reliable method for data analysis.
algorithms cplusplus cpp data data-analysis data-analytics data-science standard-deviation standard-deviation-calculator standard-deviations
Last synced: 22 Sep 2025
https://github.com/mathewroy/ynabr
Analyze and visualize your You Need A Budget (YNAB) data. YNAB meets R programming language.
api data-analysis data-science data-visualization r ynab ynab-api
Last synced: 30 Jul 2025
https://github.com/eyadsibai/machine-learning-docker-image
Data Science/Machine Learning Docker Image for CPU
data-science docker docker-image google-cloud machine-learning
Last synced: 30 Apr 2025
https://github.com/openbridge/ob_pysh-db
pysh-db - The Data Science Toolkit (DSK)
bash data-science mysql postgres python redshift sql
Last synced: 10 Apr 2025
https://github.com/flbulgarelli/recursos-python
Spanish resources for learning Python
data-science education http imperative-programming object-oriented-programming python testing
Last synced: 30 Oct 2025
https://github.com/faridrashidi/cnsplots
๐จ Toolkit for generating publication-quality plots for Cell, Nature and Science journals
data-science data-visualization plotting publication-quality python scientific-publications
Last synced: 06 Apr 2026
https://github.com/jimbrig/lossrx
An R package, plumber API, database, and Shiny App for Actuarial Loss Development and Reserving Workflows.
actuarial-science claims-data claims-reserving data-science insurance modelling property-casualty reserving rpackage rshiny rstats workflow
Last synced: 01 Jul 2025
https://github.com/alan-turing-institute/hds-discussiongroup
Repo of the Turing's Humanities & Data Science Discussion Group
data-science digital-humanities discussion-group
Last synced: 03 Mar 2026
https://github.com/jhrcook/tidy-tuesday
#TidyTuesday to practice data analysis in R
data-analysis data-science r regression-models rlang tidytuesday tidytuesday-challenge tidyverse
Last synced: 28 Oct 2025
https://github.com/sjcobb/webxr-threejs-midi-visualizer
WebXR, augmented reality MIDI data visualization, built with Three.js and Tone.js. See video: https://youtu.be/lIecCGtbqSM
3d aframe cannonjs data-science data-visualization depth-estimation game-development hit-detection javascript midi music-theory physics three threejs tone tonejs webvr webxr
Last synced: 12 Jul 2025
https://github.com/srohit0/datasciencegraphalgorithms
Selected Graph Algorithms
astar astar-algorithm astar-pathfinding astar-search cpp data-science datascience depth-first-search dfs-algorithm dijkstra-algorithm dijkstra-shortest-path graph graph-algorithms graph-theory kosaraju kruskal-algorithm prim-algorithm strongly-connected-components topological-sort transpose
Last synced: 15 Apr 2025
https://github.com/canagnos/mcp
Tools for Measuring Classification Performance for R, Python and Spark
artificial-intelligence classification data-mining data-science machine-learning machine-learning-algorithms
Last synced: 28 Apr 2025
https://github.com/anshchoudhary/xgmodel
This repository contains code to predict the Expected Goals (xG) from shots in football using various machine learning models.
data-science football-analytics football-data machine-learning machine-learning-algorithms
Last synced: 10 Apr 2025
https://github.com/bodo-ai/pydough
Analytics DSL for Python
analytics artificial-intelligence big-data data-science defog defog-ai machine-learning pandas python sql text-to-analytics text-to-sql tpch
Last synced: 22 May 2026
https://github.com/opengeos/qgis-leafmap-plugin
A QGIS plugin for leafmap
data-science geospatial leafmap python qgis qgis-plugin
Last synced: 30 Jan 2026
https://github.com/sdpython/mlstatpy
Mathematics, Algorithmic, Data-Science, Teaching Materials
algorithms data-science mathematics python3 teaching-materials
Last synced: 23 Jun 2025
https://github.com/lungben/tableio.jl
A glue package for reading and writing tabular data. It aims to provide a uniform api for reading and writing tabular data from and to multiple sources.
arrow csv data data-science database dataframe dataframes excel jdf json-format parquet postgresql sqlite zip
Last synced: 12 Oct 2025
https://github.com/eliasdabbas/dash-aggrid-scales
Color scales (continuous and categorical) and bar charts for Dash-Ag-Grid
aggrid color-scales color-scheme data-science data-visualization html plotly-dash table
Last synced: 16 Mar 2026
https://github.com/h2oai/article-information-2019
Article for Special Edition of Information: Machine Learning with Python
data-science explainable-ai explainable-ml fairness-ai fairness-ml fairness-testing fatml iml interpretable-ai interpretable-machine-learning interpretable-ml machine-learning machine-learning-interpretability python xai
Last synced: 07 Apr 2025
https://github.com/tristanbilot/airflow-rbac-roles-cli
A tool to create Airflow RBAC roles with dag-level permissions from cli.
airflow cloud-composer data-engineering data-science gcp permissions pipeline rbac-roles
Last synced: 25 Oct 2025
https://github.com/clojurecivitas/clojurecivitas.github.io
An open effort to structure learning resources with meaningful connections.
blog clay clojure data-science literate markdown notebooks
Last synced: 24 Jun 2025
https://github.com/devinterview-io/supervised-learning-interview-questions
๐ฃ Supervised Learning interview questions and answers to help you prepare for your next machine learning and data science interview in 2025.
ai-interview-questions coding-interview-questions coding-interviews data-science data-science-interview data-science-interview-questions data-scientist-interview interview-practice interview-preparation machine-learning machine-learning-and-data-science machine-learning-interview machine-learning-interview-questions software-engineer-interview supervised-learning supervised-learning-interview-questions supervised-learning-questions supervised-learning-tech-interview technical-interview-questions
Last synced: 09 Feb 2026
https://github.com/anshumansinha3301/safetitude-project-finance
My works as an SDE @Safetitude
data-science data-structures dbms
Last synced: 13 Jul 2025
https://github.com/devinterview-io/tensorflow-interview-questions
๐ฃ Tensorflow interview questions and answers to help you prepare for your next machine learning and data science interview in 2025.
ai-interview-questions coding-interview-questions coding-interviews data-science data-science-interview data-science-interview-questions data-scientist-interview interview-practice interview-preparation machine-learning machine-learning-and-data-science machine-learning-interview machine-learning-interview-questions software-engineer-interview technical-interview-questions tensorflow tensorflow-interview-questions tensorflow-questions tensorflow-tech-interview
Last synced: 04 Jul 2025
https://github.com/trafficgcn/osmnx_adjacency_matrix_for_graph_convolutional_networks
Creating an Adjacency Matrix Using the Dijkstra Algorithm for Graph Convolutional Networksย GCNs
adjacency-matrix data-science dijkstra dijkstra-algorithm gcn graph graph-algorithms graph-convolutional-networks matrix metrla open-street-map optimal-route osm osmnx python traffic traffic-analysis traffic-congestion
Last synced: 27 Oct 2025
https://github.com/jose-jaen/airbnb
Airbnb price prediction using Machine Learning and Deep Learning
ai algorithms bayes bayesian-optimization bayesian-statistics data-science deep-learning deployment econometrics machine-learning python streamlit xai
Last synced: 15 Apr 2025
https://github.com/techshot25/healthcare
Insurance cost predictor
bayesian-regression data-analysis data-science linear-regression machine-learning polynomial-regression random-forest-regression
Last synced: 24 Apr 2025
https://github.com/elliotwutingfeng/twitter200m
Simple analysis of the Twitter 200M Data Dump of January 2023.
200m data-science haveibeenpwned leak osint twitter
Last synced: 16 Mar 2026
https://github.com/correia-jpv/fucking-awesome-datascience
๐ An awesome Data Science repository to learn and apply for real world problems. With repository starsโญ and forks๐ด
analytics awesome awesome-list data-mining data-science data-scientists data-visualization deep-learning hacktoberfest machine-learning science
Last synced: 27 Apr 2025
https://github.com/amirhosseinhonardoust/underwriting-decision-safety-lab
A decision-safety lab for loan approval: trains a baseline classifier, calibrates probabilities (ECE/Brier), sweeps confidence thresholds to build a coverage, quality frontier and outputs a defensible abstention policy (auto-decide vs review). Includes a Streamlit dashboard for report cards, triage UI, and data quality checks.
abstention calibration classification credit-risk data-quality data-science decision-policy loan-approval machine-learning mlops model-evaluation monitoring pandas reliability responsible-ai scikit-learn selective-classification streamlit uncertainty underwriting
Last synced: 10 Jun 2026
https://github.com/inphyt/imdb_sentiment_analysis_bert
BERT Sentiment Classification on the IMDb Large Movie Review Dataset.
bert bert-model data-mining data-mining-algorithms data-mining-python data-science machine-learning machine-learning-algorithms natural-language-processing nlp nlp-machine-learning scikit-learn sentiment-analysis sentiment-classification spacy spacy-models spacy-nlp
Last synced: 30 Apr 2025
https://github.com/oceannetworkscanada/api-python-client
Provides easy access to ONC data in Python
api data-science ocean-sciences onc python
Last synced: 20 Jul 2025
https://github.com/takuti/anompy
A Python library for anomaly detection
anomaly-detection data-science forecasting machine-learning python
Last synced: 15 Apr 2025
https://github.com/dovolopor-research/data-science-research-toolbox
๐งฐ ๆฐๆฎ็งๅญฆ็ง็ ๅทฅๅ ท็ฎฑ
data-science data-science-research data-science-resourses research-resources research-tool visualization
Last synced: 05 Jan 2026
https://github.com/tomaztk/list_of_r_packages_for_data_scientist
List of useful R packages for data scientists
data-science r r-language r-markdown r-package r-programming statistics
Last synced: 16 May 2025
https://github.com/chongyasong/youml
YouML: A Machine Learning Toolkit
ai artificial-intelligence big-data data-mining data-science machine-learning matplotlib numpy pandas python scikit-learn scipy
Last synced: 11 Apr 2025
https://github.com/emptymalei/audiorepr
A python package to represent data using musical notes.
audiolization data data-audiolization data-science
Last synced: 12 Oct 2025
https://github.com/mindful-ai-assistants/hackapucsp-2024
๐ HackaPUCSP 2024 - - Data Science and AI Hackathon - Pontifical Catholic University of Sรฃo Paulo
automation data-science design github-actions hackathon-project oneness-consciousness package-manager programming pucsp pytest python3 unittest
Last synced: 11 Jul 2025
https://github.com/khadkarajesh/internship-preparation-kit
Repository consist the technical and behavioural questions asked by french tech companies for internship
algorithm algorithms coding-interviews codinggame data-science data-structures data-structures-and-algorithms french hacktoberfest hacktoberfest-accepted hacktoberfest2022 internship interview interview-practice interview-preparation interview-questions interview-test leetcode python software-engineering
Last synced: 15 Jul 2025
https://github.com/seandavi/machinelearningintro
Machine learning use cases for teaching
data-science machine-learning r rstats teaching-materials tutorial
Last synced: 05 Apr 2025
https://github.com/qpwedev/blockchain-network-visualizer
Blockchain Network Visualizer for TON.
blockchain data-science network ton toncoin
Last synced: 14 Mar 2025
https://github.com/ma7555/kerasgen
A Keras/Tensorflow compatible image data generator for TripletLoss
data-generation data-generator data-generators data-science keras keras-tensorflow tensorflow triplet triplet-loss triplet-neural-network
Last synced: 11 Mar 2025
https://github.com/matteocargnelutti/maguire-lab-seizure-detection-webapp
๐ง Maguire Lab's Deep Learning Seizure Detection WebApp.
data-science eeg-signals-processing neuroscience
Last synced: 21 Apr 2025
https://github.com/aaaastark/top-big-data-scientist-questions-for-interview
Top Big Tech Data Science Questions
ai alibaba amazon apple computer-science computer-vision data-engineer data-science deep-learning facebook google ibm intel interview-questions machine-learning netflix nvidia orcale spacex tesla
Last synced: 04 Feb 2026
https://github.com/subugoe/scholcomm_analytics
Scholarly Communication Analytics with R Blog
bibliometrics data-science distill library rstats scholarly-communication-analytics
Last synced: 24 Feb 2026
https://github.com/bdist/bdist-workspace
This repository provides containerized applications and microservices for the Information Systems and Databases Course @ Instituto Superior Tรฉcnico
data-engineering data-science docker jupyter jupyterlab notebook postgres postgresql python sql sqlite
Last synced: 09 Apr 2026
https://github.com/fabsta/interesting_notebooks
A collection of Data Science Jupyter notebook (reference material)
data-science eda jupyter-notebook kaggle machine-learning python
Last synced: 03 Jul 2025
https://github.com/tomaztk/List_of_R_packages_for_Data_scientist
List of useful R packages for data scientists
data-science r r-language r-markdown r-package r-programming statistics
Last synced: 30 Jul 2025
https://github.com/shwetajoshi601/world-bank-data-analysis
An Exploratory Data Analysis on the World Bank Dataset.
analysis data-science eda python3 world-bank-api worldbank
Last synced: 02 Aug 2025
https://github.com/dmedri/roaster
R - Fetch, build and deploy.
build-tool data-science rstats statistical-analysis statistics virtual-environments
Last synced: 30 Jul 2025
https://github.com/krypty/trefle
Trefle is a scikit-learn compatible estimator implementing the FuzzyCoCo algorithm that uses a cooperative coevolution algorithm to find and build interpretable fuzzy systems.
data-science deap evolutionary-algorithm fuzzy-logic interpretability machine-learning python scikit-learn
Last synced: 29 Oct 2025
https://github.com/devinterview-io/bias-and-variance-interview-questions
๐ฃ Bias And Variance interview questions and answers to help you prepare for your next machine learning and data science interview in 2025.
ai-interview-questions bias-and-variance bias-and-variance-interview-questions bias-and-variance-questions bias-and-variance-tech-interview coding-interview-questions coding-interviews data-science data-science-interview data-science-interview-questions data-scientist-interview interview-practice interview-preparation machine-learning machine-learning-and-data-science machine-learning-interview machine-learning-interview-questions software-engineer-interview technical-interview-questions
Last synced: 08 Feb 2026
https://github.com/devinterview-io/reinforcement-learning-interview-questions
๐ฃ Reinforcement Learning interview questions and answers to help you prepare for your next machine learning and data science interview in 2024.
ai-interview-questions coding-interview-questions coding-interviews data-science data-science-interview data-science-interview-questions data-scientist-interview interview-practice interview-preparation machine-learning machine-learning-and-data-science machine-learning-interview machine-learning-interview-questions reinforcement-learning reinforcement-learning-interview-questions reinforcement-learning-questions reinforcement-learning-tech-interview software-engineer-interview technical-interview-questions
Last synced: 15 Jun 2025
https://github.com/zenml-io/template-starter
A template for a starter project for ZenML
cookiecutter copier-template data-science machine-learning mlops zenml
Last synced: 14 Apr 2025
https://github.com/martincastroalvarez/html2vec
Algorithm that converts an HTML to a vectorized object suitable for neural networks.
data-science html2vec natural-language-processing python web-scraping word2vec
Last synced: 11 Apr 2025
https://github.com/alugowski/matrepr
Format matrices and tensors to HTML, string, and LaTeX, with Jupyter integration.
data-science data-visualization data-viz graphblas jupyter numpy numpy-matrix pytorch scipy sparse sparse-data sparse-matrices sparse-matrix sparse-representations tensor tensorflow torch
Last synced: 12 Apr 2025
https://github.com/xuri/excelize-py
Excelize is a Python port of Go Excelize library that allow you to write to and read from XLAM / XLSM / XLSX / XLTM / XLTX files.
calculation chart data-analysis data-science data-visualization ecma-376 excel excelize golang microsoft office ooxml pipy python spreadsheet visualization xlsm xlsx xlsxreader xlsxwriter
Last synced: 07 May 2025
https://github.com/chaganti-reddy/evmarket-india
Electric Vehicle Market Segmentation Analysis in India
data-analysis data-science machine-learning market-segmentation pandas python
Last synced: 12 Apr 2025
https://github.com/sjcobb/ai-duet-3d
3D music animation + machine learning (in development)
3d-animation 3d-audio 3d-game artificial-intelligence browser-game data-science data-visualization game-development generative-music javascript machine-learning music music-bot music-composition music-theory music-visualizer neural-network web-development youtube-channel
Last synced: 28 Oct 2025
https://github.com/bsomps/OpenGeoPlotter
A PyQt5 app catered to the exploration industry for visualizing geologic drill hole data with features like cross-sections, simple 3D views, strip logs, scatter plots, and downhole line plots. Includes data transformation techniques like factor analysis, desurveying, and alpha-beta conversion.
cross-sections data-science drilling exploration geology geoscience pyqt5 python strip-logs
Last synced: 05 Mar 2025
https://github.com/giswqs/notebook-share
A repo for sharing notebooks
data-science dataviz geospatial jupyter-notebook mapping notebook
Last synced: 07 May 2025
https://github.com/zen-reportz/zen_dash
Simple, Fast, Scalable , production grade dashboard application . Right solution for team
dashboard data-analytics data-science fastapi flask python3 shiny streamlit
Last synced: 13 Apr 2025
https://github.com/ashishbamania/tutorials-on-artificial-intelligence
A collection of AI tutorials from Dr. Ashish Bamania
agentic-ai ai ai-agents artificial-intelligence crewai data-science langchain llama machine-learning ml rag retreival-augmented-generation software-engineering
Last synced: 13 May 2025
https://github.com/doarakko/kagoole
Search kaggle competitions and solutions based on data and predict type, evaluation metric, etc.
artificial-intelligence data-science heroku kaggle kaggle-competition kaggle-solution machine-learning webapp
Last synced: 17 Oct 2025
https://github.com/firaskahlaoui/heart-disease-analysis-r
R for data visualization and analysis of heart disease datasets.
data-science data-visualization ggplot kaggle-dataset r statistics
Last synced: 14 Apr 2025
https://github.com/dhhruv/stock-price-prediction
A deep learning project in which the model was trained using LSTM layers and Tata Stock prices were predicted and compared with thier actual values.
algorithm cli college-project data data-science dataset deep-learning jupyter jupyter-notebook lstm machine-learning prediction science shell stock-price-prediction tata-beverages terminal
Last synced: 03 May 2025
https://github.com/ptyadana/tableau_2020_a-z_hands-on
Tableau Projects for data analysis, data analytics and data visualaization on different data sets
data-analysis data-science data-visualization tableau tableau-dashboards tableau-desktop tableau-public tableau-workbooks
Last synced: 03 Aug 2025
https://github.com/ndxdeveloper/formation-python
Formation Python - Du dรฉbutant ร l'avancรฉ | 13 modules (FastAPI, Type Hints, Data Science, SQLAlchemy, asyncio) | 75+ sujets | 100% franรงais | MIT License
api-rest asyncio data-science developpement fastapi formation francais french learning numpy pandas poetry poo programmation pytest python python3 sqlalchemy type-hints
Last synced: 08 Apr 2026
https://github.com/hassaku/audio-plot
Python library to converts a line graph to sound and return an object that can be played in Jupyter notebook or Google Colab. Values are represented by pitches, and the timeline is represented by left and right pans. It was created to make data science fun for the visually impaired.
audio-plot colab data-science jupyter-notebook python visually-impaired
Last synced: 01 Nov 2025
https://github.com/edaaydinea/op1-prediction-of-the-different-progressive-levels-of-alzheimer-s-disease
This is an optional model development project on a real dataset related to predicting the different progressive levels of Alzheimerโs disease (AD).
alzheimer-disease-prediction anova-test catboost-classifier chi-square-test data-science deep-neural-networks keras-neural-networks lightgbm-classifier logistic-regression machine-learning multi-layer-perceptron-classifier neural-networks random-forest-classifier tensorflow xgboost-classifier
Last synced: 11 Apr 2025
https://github.com/rbhatia46/data-preprocessing-template
This repository includes all the Data Preprocessing required before using a dataset on a Machine Learning Model. Please refer README on how to use.
data-preprocessing data-science machine-learning python
Last synced: 11 Apr 2025
https://github.com/the-akira/datascience
Coleรงรฃo de recursos sobre Ciรชncia de Dados com Python.
data data-analysis data-science data-structures data-visualization machine-learning machine-learning-algorithms mathematics pandas pandas-dataframe portuguese-language python3 scikit-learn statistics sympy
Last synced: 07 May 2025
https://github.com/hsins/mpl-tc-fonts
๐น๐ผ A package to solve the problem of "Tofu" in your matplotlib plots whenever you're trying to use Traditional Chinese characters in labels or texts.
cjk-characters data-science matplotlib
Last synced: 29 Oct 2025
https://github.com/bpw1621/streamlit-topic-modeling
Topic modeling streamlit app.
data-science latent-dirichlet-allocation machine-learning natural-language-processing nlp non-negative-matrix-factorization streamlit streamlit-application streamlit-sharing streamlit-webapp topic-modeling
Last synced: 27 Jul 2025
https://github.com/klarna-incubator/mleko
Simplify and accelerate your machine learning development with mleko. Designed with modularity and customization in mind, it seamlessly integrates into your existing workflows. Its robust caching system optimizes performance, taking you from data ingestion to finalized models with unparalleled efficiency.
artificial-intelligence data-science machine-learning pipeline python vaex
Last synced: 11 Apr 2025
https://github.com/thomasnield/oreilly_kotlin_for_data_science
Notes, slides, and contents for the O'Reilly videos using Kotlin for Data Science
data-engineering data-science etl kotlin oreilly statistics
Last synced: 27 Mar 2025
https://github.com/juniortorresmtj/projeto_deupositivo
Projeto de Anรกlise de Dados Abertos - SUS
alura bootcampds brazil data-science projeto python
Last synced: 29 Jul 2025
https://github.com/dwhitena/ai-classroom
Code examples for the live, online AI Classroom training:
ai artificial-intelligence data-science machine-learning python pytorch tensorflow
Last synced: 07 Mar 2026
https://github.com/kurtispykes/twitter-sentiment-analysis
Creating a Gradio user interface to predict the sentiment of a tweet
data-science deep-learning gradio keras lstm machine-learning natural-language-processing neural-network nlp nlp-machine-learning prediction python sentiment-analysis tweet twitter
Last synced: 03 May 2025
https://github.com/rasmusrynell/predicting-nhl
The project explores the idea of using different machine learning techniques to determine different stats in NHL games.
ai algorithms data-science database machine-learning ml nhl nhl-api python scikit-learn sports sports-analytics sports-stats sportsanalytics
Last synced: 14 Apr 2025
https://github.com/doubleml/doubleml-serverless
DoubleML-Serverless - Distributed Double Machine Learning with a Serverless Architecture
aws-lambda causal-inference data-science double-machine-learning econometrics machine-learning python scikit-learn serverless statistics
Last synced: 07 May 2025
https://github.com/alro10/twitter-sentiment-live
Sentiment analysis for tweets written in Portuguese-Brazil
dash dash-app dash-plotly dashboards data-science plotly portuguese-brazilian python3 sentiment-analysis tweepy tweets vader-sentiment-analysis
Last synced: 17 Jun 2025
https://github.com/yangfa-zhang/lunax
Lunax is a machine learning framework specifically designed for the processing and analysis of tabular data.
data-analysis data-science lunax machine-learning tabular-data
Last synced: 14 Dec 2025
https://github.com/chandraprakash-bathula/apparel-recommendations
This project implements a personalized apparel recommendation engine using content-based search with the Amazon API, NLTK, and Keras libraries.
boxplot cnn-keras data-analysis data-science deep-learning linear-regression machine-learning numpy pandas scatter-plot scikit-learn svm tensorflow xgboost
Last synced: 23 Mar 2025
https://github.com/urbanclimatefr/coursera-learn-sql-basics-for-data-science
This repository contains the materials to "Learn SQL Basics for Data Science", a specialization provided by University of California, Davis through Coursera.
Last synced: 19 Feb 2026
https://github.com/mertguvencli/keyword-extractor
This project aims to find "what are the trending techs on Data Science jobs?" using NER.
data-science machine-learning ner nlp python spacy
Last synced: 10 Sep 2025
https://github.com/aruizeac/alexandria
The Alexandria Project is an open-source platform where people can share their knowledge through books, podcasts, docs and videos.
alexandria data-science donation ebooks go golang grpc http kafka knowledge knowledge-sharing library microservice podcasts python societies streaming videos webservice
Last synced: 11 Mar 2026
https://github.com/koalaverse/analyticssummit19
Material for 2019 Analytics Summit Machine Learning with R Training
data-science educational-materials machine-learning r workshop-materials
Last synced: 15 May 2025
https://github.com/nas5w/imdb-data
A JSON file of 50,000 IMDB movie reviews to be used in machine learning applications.
data data-science imdb javascript machine-learning
Last synced: 19 Apr 2025
https://github.com/lambdaclass/data_etudes
LambdaClass statistics, machine learning and data science etudes
data-science notebook probability statistics
Last synced: 09 Apr 2025
https://github.com/anshumansinha3301/occupational-hazard-analysis
The Occupational Hazard Analysis Using Industry Data project aims to analyze safety metrics across various industries to identify trends in reported incidents, injuries, and fatalities.
consulting-services data-science industrialisation jupyter-notebook python
Last synced: 09 Oct 2025
https://github.com/amey-thakur/depression_detection_using_tweets
Twitter Depression Detection
amey ameythakur computer-engineering data-science depression-detection depression-detector machine-learning megasatish nlp project python supervised-machine-learning
Last synced: 29 Aug 2025
https://github.com/lucadibello/it-salary-analysis
๐ฐ Analysis of Salaries in IT Roles: DevOps, Cyber Security, and AI
ai cybersecurity data-science devops jupyter-notebook salary-analysis
Last synced: 03 Jul 2025
https://github.com/strazto/mandrake
๐๐- Bring reading the manual ๐ closer to your drake ๐ workflow ๐ฅ
data-science drake high-performance-computing makefile pipeline r r-package reproducibility reproducible-research rstats workflow
Last synced: 13 Jul 2025
https://github.com/fozouni/data_science
Source codes of the first "Data Science Course"
artificial-intelligence data-science datascience deep-learning excel machine-learning python
Last synced: 04 Sep 2025