Data Science
Data science is an interdisciplinary field that uses scientific methods, processes, algorithms, and systems to extract knowledge from structured and unstructured data. Data scientists perform data analysis and preparation, and their findings inform high-level decisions in many organizations.
- GitHub: https://github.com/topics/data-science
- Wikipedia: https://en.wikipedia.org/wiki/Data_science
- Related Topics: data-analysis, data-mining, machine-learning, big-data, data-visualization
- Aliases: datasciences, data-science-project, data-science-algorithm
- Last updated: 2026-07-03 00:07:42 UTC
https://github.com/opengeos/qgis-leafmap-plugin
A QGIS plugin for leafmap
data-science geospatial leafmap python qgis qgis-plugin
Last synced: 30 Jan 2026
https://github.com/emptymalei/audiorepr
A Python package to represent data using musical notes.
audiolization data data-audiolization data-science
Last synced: 12 Oct 2025
https://github.com/bdist/bdist-workspace
This repository provides containerized applications and microservices for the Information Systems and Databases Course @ Instituto Superior Técnico
data-engineering data-science docker jupyter jupyterlab notebook postgres postgresql python sql sqlite
Last synced: 09 Apr 2026
https://github.com/flbulgarelli/recursos-python
Spanish resources for learning Python
data-science education http imperative-programming object-oriented-programming python testing
Last synced: 30 Oct 2025
https://github.com/matteocargnelutti/maguire-lab-seizure-detection-webapp
Maguire Lab's Deep Learning Seizure Detection WebApp.
data-science eeg-signals-processing neuroscience
Last synced: 21 Apr 2025
https://github.com/tomaztk/List_of_R_packages_for_Data_scientist
List of useful R packages for data scientists
data-science r r-language r-markdown r-package r-programming statistics
Last synced: 30 Jul 2025
https://github.com/lungben/tableio.jl
A glue package for reading and writing tabular data. It aims to provide a uniform api for reading and writing tabular data from and to multiple sources.
arrow csv data data-science database dataframe dataframes excel jdf json-format parquet postgresql sqlite zip
Last synced: 12 Oct 2025
https://github.com/bodo-ai/pydough
Analytics DSL for Python
analytics artificial-intelligence big-data data-science defog defog-ai machine-learning pandas python sql text-to-analytics text-to-sql tpch
Last synced: 22 May 2026
https://github.com/devopscorner/nifi
Production Grade Nifi & Nifi Registry. Deploy for VM (Virtual Machine) with Terraform + Ansible, Helm & Helmfile for Kubernetes (EKS)
ansible data-science data-structures docker docker-compose dockerhub ecr eks eks-cluster etl kubernetes machine-learning ml mlops nifi nifi-registry terraform vpn vpn-client
Last synced: 08 Sep 2025
https://github.com/csfelix/csfelix.github.io
My Personal Portfolio
css data-science data-science-competition data-science-portfolio data-science-projects html javascript js portfolio
Last synced: 05 Aug 2025
https://github.com/mindful-ai-assistants/hackapucsp-2024
HackaPUCSP 2024 - Data Science and AI Hackathon - Pontifical Catholic University of São Paulo
automation data-science design github-actions hackathon-project oneness-consciousness package-manager programming pucsp pytest python3 unittest
Last synced: 11 Jul 2025
https://github.com/shwetajoshi601/world-bank-data-analysis
An Exploratory Data Analysis on the World Bank Dataset.
analysis data-science eda python3 world-bank-api worldbank
Last synced: 02 Aug 2025
https://github.com/alugowski/matrepr
Format matrices and tensors to HTML, string, and LaTeX, with Jupyter integration.
data-science data-visualization data-viz graphblas jupyter numpy numpy-matrix pytorch scipy sparse sparse-data sparse-matrices sparse-matrix sparse-representations tensor tensorflow torch
Last synced: 12 Apr 2025
https://github.com/openbridge/ob_pysh-db
pysh-db - The Data Science Toolkit (DSK)
bash data-science mysql postgres python redshift sql
Last synced: 10 Apr 2025
https://github.com/giswqs/notebook-share
A repo for sharing notebooks
data-science dataviz geospatial jupyter-notebook mapping notebook
Last synced: 07 May 2025
https://github.com/fabsta/interesting_notebooks
A collection of Data Science Jupyter notebooks (reference material)
data-science eda jupyter-notebook kaggle machine-learning python
Last synced: 03 Jul 2025
https://github.com/devinterview-io/reinforcement-learning-interview-questions
Reinforcement Learning interview questions and answers to help you prepare for your next machine learning and data science interview in 2024.
ai-interview-questions coding-interview-questions coding-interviews data-science data-science-interview data-science-interview-questions data-scientist-interview interview-practice interview-preparation machine-learning machine-learning-and-data-science machine-learning-interview machine-learning-interview-questions reinforcement-learning reinforcement-learning-interview-questions reinforcement-learning-questions reinforcement-learning-tech-interview software-engineer-interview technical-interview-questions
Last synced: 15 Jun 2025
https://github.com/alan-turing-institute/hds-discussiongroup
Repo of the Turing's Humanities & Data Science Discussion Group
data-science digital-humanities discussion-group
Last synced: 03 Mar 2026
https://github.com/clojurecivitas/clojurecivitas.github.io
An open effort to structure learning resources with meaningful connections.
blog clay clojure data-science literate markdown notebooks
Last synced: 24 Jun 2025
https://github.com/eyadsibai/machine-learning-docker-image
Data Science/Machine Learning Docker Image for CPU
data-science docker docker-image google-cloud machine-learning
Last synced: 30 Apr 2025
https://github.com/sjcobb/ai-duet-3d
3D music animation + machine learning (in development)
3d-animation 3d-audio 3d-game artificial-intelligence browser-game data-science data-visualization game-development generative-music javascript machine-learning music music-bot music-composition music-theory music-visualizer neural-network web-development youtube-channel
Last synced: 28 Oct 2025
https://github.com/jhrcook/tidy-tuesday
#TidyTuesday to practice data analysis in R
data-analysis data-science r regression-models rlang tidytuesday tidytuesday-challenge tidyverse
Last synced: 28 Oct 2025
https://github.com/anshumansinha3301/safetitude-project-finance
My works as an SDE @Safetitude
data-science data-structures dbms
Last synced: 13 Jul 2025
https://github.com/chaganti-reddy/evmarket-india
Electric Vehicle Market Segmentation Analysis in India
data-analysis data-science machine-learning market-segmentation pandas python
Last synced: 12 Apr 2025
https://github.com/xuri/excelize-py
Excelize is a Python port of the Go Excelize library that allows you to write to and read from XLAM / XLSM / XLSX / XLTM / XLTX files.
calculation chart data-analysis data-science data-visualization ecma-376 excel excelize golang microsoft office ooxml pipy python spreadsheet visualization xlsm xlsx xlsxreader xlsxwriter
Last synced: 07 May 2025
https://github.com/bsomps/OpenGeoPlotter
A PyQt5 app catered to the exploration industry for visualizing geologic drill hole data with features like cross-sections, simple 3D views, strip logs, scatter plots, and downhole line plots. Includes data transformation techniques like factor analysis, desurveying, and alpha-beta conversion.
cross-sections data-science drilling exploration geology geoscience pyqt5 python strip-logs
Last synced: 05 Mar 2025
https://github.com/jimbrig/lossrx
An R package, plumber API, database, and Shiny App for Actuarial Loss Development and Reserving Workflows.
actuarial-science claims-data claims-reserving data-science insurance modelling property-casualty reserving rpackage rshiny rstats workflow
Last synced: 01 Jul 2025
https://github.com/faridrashidi/cnsplots
Toolkit for generating publication-quality plots for Cell, Nature and Science journals
data-science data-visualization plotting publication-quality python scientific-publications
Last synced: 06 Apr 2026
https://github.com/devinterview-io/bias-and-variance-interview-questions
Bias And Variance interview questions and answers to help you prepare for your next machine learning and data science interview in 2025.
ai-interview-questions bias-and-variance bias-and-variance-interview-questions bias-and-variance-questions bias-and-variance-tech-interview coding-interview-questions coding-interviews data-science data-science-interview data-science-interview-questions data-scientist-interview interview-practice interview-preparation machine-learning machine-learning-and-data-science machine-learning-interview machine-learning-interview-questions software-engineer-interview technical-interview-questions
Last synced: 08 Feb 2026
https://github.com/zen-reportz/zen_dash
Simple, fast, scalable, production-grade dashboard application. The right solution for teams.
dashboard data-analytics data-science fastapi flask python3 shiny streamlit
Last synced: 13 Apr 2025
https://github.com/martincastroalvarez/html2vec
Algorithm that converts an HTML to a vectorized object suitable for neural networks.
data-science html2vec natural-language-processing python web-scraping word2vec
Last synced: 11 Apr 2025
https://github.com/krypty/trefle
Trefle is a scikit-learn compatible estimator implementing the FuzzyCoCo algorithm that uses a cooperative coevolution algorithm to find and build interpretable fuzzy systems.
data-science deap evolutionary-algorithm fuzzy-logic interpretability machine-learning python scikit-learn
Last synced: 29 Oct 2025
https://github.com/sjcobb/webxr-threejs-midi-visualizer
WebXR, augmented reality MIDI data visualization, built with Three.js and Tone.js. See video: https://youtu.be/lIecCGtbqSM
3d aframe cannonjs data-science data-visualization depth-estimation game-development hit-detection javascript midi music-theory physics three threejs tone tonejs webvr webxr
Last synced: 12 Jul 2025
https://github.com/dmedri/roaster
R - Fetch, build and deploy.
build-tool data-science rstats statistical-analysis statistics virtual-environments
Last synced: 30 Jul 2025
https://github.com/khadkarajesh/internship-preparation-kit
Repository of the technical and behavioural questions asked by French tech companies for internships
algorithm algorithms coding-interviews codinggame data-science data-structures data-structures-and-algorithms french hacktoberfest hacktoberfest-accepted hacktoberfest2022 internship interview interview-practice interview-preparation interview-questions interview-test leetcode python software-engineering
Last synced: 15 Jul 2025
https://github.com/aaaastark/top-big-data-scientist-questions-for-interview
Top Big Tech Data Science Questions
ai alibaba amazon apple computer-science computer-vision data-engineer data-science deep-learning facebook google ibm intel interview-questions machine-learning netflix nvidia orcale spacex tesla
Last synced: 04 Feb 2026
https://github.com/chongyasong/youml
YouML: A Machine Learning Toolkit
ai artificial-intelligence big-data data-mining data-science machine-learning matplotlib numpy pandas python scikit-learn scipy
Last synced: 11 Apr 2025
https://github.com/techshot25/healthcare
Insurance cost predictor
bayesian-regression data-analysis data-science linear-regression machine-learning polynomial-regression random-forest-regression
Last synced: 24 Apr 2025
https://github.com/canagnos/mcp
Tools for Measuring Classification Performance for R, Python and Spark
artificial-intelligence classification data-mining data-science machine-learning machine-learning-algorithms
Last synced: 28 Apr 2025
https://github.com/ma7555/kerasgen
A Keras/Tensorflow compatible image data generator for TripletLoss
data-generation data-generator data-generators data-science keras keras-tensorflow tensorflow triplet triplet-loss triplet-neural-network
Last synced: 11 Mar 2025
https://github.com/hoangsonww/standard-deviation-calculator
This repository contains a Standard Deviation Calculator implemented in C++. It provides an efficient algorithm for calculating the statistical standard deviation of a dataset, making it a valuable tool for students, researchers, and analysts seeking a reliable method for data analysis.
algorithms cplusplus cpp data data-analysis data-analytics data-science standard-deviation standard-deviation-calculator standard-deviations
Last synced: 22 Sep 2025
https://github.com/mathewroy/ynabr
Analyze and visualize your You Need A Budget (YNAB) data. YNAB meets R programming language.
api data-analysis data-science data-visualization r ynab ynab-api
Last synced: 30 Jul 2025
https://github.com/elliotwutingfeng/twitter200m
Simple analysis of the Twitter 200M Data Dump of January 2023.
200m data-science haveibeenpwned leak osint twitter
Last synced: 16 Mar 2026
https://github.com/correia-jpv/fucking-awesome-datascience
An awesome Data Science repository to learn and apply for real-world problems. With repository stars and forks.
analytics awesome awesome-list data-mining data-science data-scientists data-visualization deep-learning hacktoberfest machine-learning science
Last synced: 27 Apr 2025
https://github.com/amirhosseinhonardoust/underwriting-decision-safety-lab
A decision-safety lab for loan approval: trains a baseline classifier, calibrates probabilities (ECE/Brier), sweeps confidence thresholds to build a coverage-quality frontier, and outputs a defensible abstention policy (auto-decide vs review). Includes a Streamlit dashboard for report cards, triage UI, and data quality checks.
abstention calibration classification credit-risk data-quality data-science decision-policy loan-approval machine-learning mlops model-evaluation monitoring pandas reliability responsible-ai scikit-learn selective-classification streamlit uncertainty underwriting
Last synced: 10 Jun 2026
https://github.com/eliasdabbas/dash-aggrid-scales
Color scales (continuous and categorical) and bar charts for Dash-Ag-Grid
aggrid color-scales color-scheme data-science data-visualization html plotly-dash table
Last synced: 16 Mar 2026
https://github.com/tristanbilot/airflow-rbac-roles-cli
A tool to create Airflow RBAC roles with dag-level permissions from cli.
airflow cloud-composer data-engineering data-science gcp permissions pipeline rbac-roles
Last synced: 25 Oct 2025
https://github.com/anshchoudhary/xgmodel
This repository contains code to predict the Expected Goals (xG) from shots in football using various machine learning models.
data-science football-analytics football-data machine-learning machine-learning-algorithms
Last synced: 10 Apr 2025
https://github.com/trafficgcn/osmnx_adjacency_matrix_for_graph_convolutional_networks
Creating an Adjacency Matrix Using the Dijkstra Algorithm for Graph Convolutional Networks (GCNs)
adjacency-matrix data-science dijkstra dijkstra-algorithm gcn graph graph-algorithms graph-convolutional-networks matrix metrla open-street-map optimal-route osm osmnx python traffic traffic-analysis traffic-congestion
Last synced: 27 Oct 2025
https://github.com/h2oai/article-information-2019
Article for Special Edition of Information: Machine Learning with Python
data-science explainable-ai explainable-ml fairness-ai fairness-ml fairness-testing fatml iml interpretable-ai interpretable-machine-learning interpretable-ml machine-learning machine-learning-interpretability python xai
Last synced: 07 Apr 2025
https://github.com/takuti/anompy
A Python library for anomaly detection
anomaly-detection data-science forecasting machine-learning python
Last synced: 15 Apr 2025
https://github.com/jose-jaen/airbnb
Airbnb price prediction using Machine Learning and Deep Learning
ai algorithms bayes bayesian-optimization bayesian-statistics data-science deep-learning deployment econometrics machine-learning python streamlit xai
Last synced: 15 Apr 2025
https://github.com/srohit0/datasciencegraphalgorithms
Selected Graph Algorithms
astar astar-algorithm astar-pathfinding astar-search cpp data-science datascience depth-first-search dfs-algorithm dijkstra-algorithm dijkstra-shortest-path graph graph-algorithms graph-theory kosaraju kruskal-algorithm prim-algorithm strongly-connected-components topological-sort transpose
Last synced: 15 Apr 2025
https://github.com/qpwedev/blockchain-network-visualizer
Blockchain Network Visualizer for TON.
blockchain data-science network ton toncoin
Last synced: 14 Mar 2025
https://github.com/dovolopor-research/data-science-research-toolbox
Data science research toolbox
data-science data-science-research data-science-resourses research-resources research-tool visualization
Last synced: 05 Jan 2026
https://github.com/seandavi/machinelearningintro
Machine learning use cases for teaching
data-science machine-learning r rstats teaching-materials tutorial
Last synced: 05 Apr 2025
https://github.com/inphyt/imdb_sentiment_analysis_bert
BERT Sentiment Classification on the IMDb Large Movie Review Dataset.
bert bert-model data-mining data-mining-algorithms data-mining-python data-science machine-learning machine-learning-algorithms natural-language-processing nlp nlp-machine-learning scikit-learn sentiment-analysis sentiment-classification spacy spacy-models spacy-nlp
Last synced: 30 Apr 2025
https://github.com/doarakko/kagoole
Search Kaggle competitions and solutions by data type, prediction type, evaluation metric, etc.
artificial-intelligence data-science heroku kaggle kaggle-competition kaggle-solution machine-learning webapp
Last synced: 17 Oct 2025
https://github.com/oceannetworkscanada/api-python-client
Provides easy access to ONC data in Python
api data-science ocean-sciences onc python
Last synced: 20 Jul 2025
https://github.com/sdpython/mlstatpy
Mathematics, Algorithmic, Data-Science, Teaching Materials
algorithms data-science mathematics python3 teaching-materials
Last synced: 23 Jun 2025
https://github.com/zenml-io/template-starter
A template for a starter project for ZenML
cookiecutter copier-template data-science machine-learning mlops zenml
Last synced: 14 Apr 2025
https://github.com/devinterview-io/tensorflow-interview-questions
Tensorflow interview questions and answers to help you prepare for your next machine learning and data science interview in 2025.
ai-interview-questions coding-interview-questions coding-interviews data-science data-science-interview data-science-interview-questions data-scientist-interview interview-practice interview-preparation machine-learning machine-learning-and-data-science machine-learning-interview machine-learning-interview-questions software-engineer-interview technical-interview-questions tensorflow tensorflow-interview-questions tensorflow-questions tensorflow-tech-interview
Last synced: 04 Jul 2025
https://github.com/ashishbamania/tutorials-on-artificial-intelligence
A collection of AI tutorials from Dr. Ashish Bamania
agentic-ai ai ai-agents artificial-intelligence crewai data-science langchain llama machine-learning ml rag retreival-augmented-generation software-engineering
Last synced: 13 May 2025
https://github.com/subugoe/scholcomm_analytics
Scholarly Communication Analytics with R Blog
bibliometrics data-science distill library rstats scholarly-communication-analytics
Last synced: 24 Feb 2026
https://github.com/hassaku/audio-plot
Python library that converts a line graph to sound and returns an object that can be played in a Jupyter notebook or Google Colab. Values are represented by pitches, and the timeline is represented by left and right pans. It was created to make data science fun for the visually impaired.
audio-plot colab data-science jupyter-notebook python visually-impaired
Last synced: 01 Nov 2025
https://github.com/kennethleungty/wikipedia-scraping-with-llm-agents
Scraping Wikipedia by combining LangChain's agents and tools with OpenAI's LLMs and function calling
artificial-intelligence data-analytics data-mining data-science deep-learning genai generative-ai langchain large-language-models llm machine-learning nlp openai openai-functions web-scraping wikipedia
Last synced: 12 Jul 2025
https://github.com/lucadibello/it-salary-analysis
Analysis of Salaries in IT Roles: DevOps, Cyber Security, and AI
ai cybersecurity data-science devops jupyter-notebook salary-analysis
Last synced: 03 Jul 2025
https://github.com/mertguvencli/keyword-extractor
This project aims to find "what are the trending techs on Data Science jobs?" using NER.
data-science machine-learning ner nlp python spacy
Last synced: 10 Sep 2025
https://github.com/chandraprakash-bathula/apparel-recommendations
This project implements a personalized apparel recommendation engine using content-based search with the Amazon API, NLTK, and Keras libraries.
boxplot cnn-keras data-analysis data-science deep-learning linear-regression machine-learning numpy pandas scatter-plot scikit-learn svm tensorflow xgboost
Last synced: 23 Mar 2025
https://github.com/kennethleungty/english-premier-league-var-analysis
Analyzing Video Assistant Referee (VAR) decisions in the English Premier League (2019 - 2021)
data-analysis data-analytics data-science english-premier-league football soccer var
Last synced: 27 Aug 2025
https://github.com/firaskahlaoui/heart-disease-analysis-r
R for data visualization and analysis of heart disease datasets.
data-science data-visualization ggplot kaggle-dataset r statistics
Last synced: 14 Apr 2025
https://github.com/ndxdeveloper/formation-python
Python training course - from beginner to advanced | 13 modules (FastAPI, Type Hints, Data Science, SQLAlchemy, asyncio) | 75+ topics | 100% in French | MIT License
api-rest asyncio data-science developpement fastapi formation francais french learning numpy pandas poetry poo programmation pytest python python3 sqlalchemy type-hints
Last synced: 08 Apr 2026
https://github.com/fabriziomusacchio/python_neuro_practical
This is the course material for the advanced course into Python for Data Scientists.
data-analysis data-science jupyter jupyter-notebook jupyter-notebooks open-source python teaching teaching-materials
Last synced: 22 Jul 2025
https://github.com/lambdaclass/data_etudes
LambdaClass statistics, machine learning and data science etudes
data-science notebook probability statistics
Last synced: 09 Apr 2025
https://github.com/fozouni/data_science
Source codes of the first "Data Science Course"
artificial-intelligence data-science datascience deep-learning excel machine-learning python
Last synced: 04 Sep 2025
https://github.com/rasmusrynell/predicting-nhl
The project explores the idea of using different machine learning techniques to determine different stats in NHL games.
ai algorithms data-science database machine-learning ml nhl nhl-api python scikit-learn sports sports-analytics sports-stats sportsanalytics
Last synced: 14 Apr 2025
https://github.com/dalageo/ml-titanicshipwreck
Exploring the World's Most Renowned Shipwreck
data-science decision-tree-classifier logistic-regression machine-learning python random-forest-classifier scikit-learn stacking-ensemble titanic-dataset xgboost-classifier
Last synced: 04 Sep 2025
https://github.com/dogukanayd/catch-tweet-with-keyword
Fetch tweets by keyword and perform keyword analysis
data-analysis data-mining data-science datascience keyword-analysis python python27 social-media social-network social-network-analysis tweet tweets twitter twitter-analysis twitter-api twitter-oauth twitter-sentiment-analysis twitterwordcloud wordcloud
Last synced: 30 Aug 2025
https://github.com/koalaverse/analyticssummit19
Material for 2019 Analytics Summit Machine Learning with R Training
data-science educational-materials machine-learning r workshop-materials
Last synced: 15 May 2025
https://github.com/arv-anshul/yt-watch-history
Analyse your YouTube watch history using Data Science, ML and NLP.
data-science docker docker-compose fastapi ml mlflow mlops mongodb nlp pydantic python3 streamlit youtube-api
Last synced: 22 Apr 2025
https://github.com/aruizeac/alexandria
The Alexandria Project is an open-source platform where people can share their knowledge through books, podcasts, docs and videos.
alexandria data-science donation ebooks go golang grpc http kafka knowledge knowledge-sharing library microservice podcasts python societies streaming videos webservice
Last synced: 11 Mar 2026
https://github.com/dina-hosny/chaincare
ChainCare is a health information system that uses smart contracts to handle medical procedures and stores medical history on blockchains.
api-rest bigchain blockchain blockchain-technology data-science data-storage data-visualization ethereum golang health-informatics-systems healthcare insomnia metamask postgresql postman reactjs solidity truffle web3
Last synced: 13 Apr 2026
https://github.com/devinterview-io/svm-interview-questions
SVM interview questions and answers to help you prepare for your next machine learning and data science interview in 2025.
ai-interview-questions coding-interview-questions coding-interviews data-science data-science-interview data-science-interview-questions data-scientist-interview interview-practice interview-preparation machine-learning machine-learning-and-data-science machine-learning-interview machine-learning-interview-questions software-engineer-interview svm svm-interview-questions svm-questions svm-tech-interview technical-interview-questions
Last synced: 28 Jan 2026
https://github.com/juniortorresmtj/projeto_deupositivo
Open Data Analysis Project - SUS
alura bootcampds brazil data-science projeto python
Last synced: 29 Jul 2025
https://github.com/bradflaugher/ai-101
Notes, links, code samples, and resources for teaching yourself PyTorch and TensorFlow.
bootcamp course data-engineering data-science learn-to-code learning-by-doing learning-python machine-learning
Last synced: 10 May 2025
https://github.com/mratsim/meilleur-data-scientist-france-2018
My solution for the competition "Le meilleur data scientist de France 2018" (Best Data Scientist of France 2018)
data-science data-science-competition machine-learning xgboost
Last synced: 15 Sep 2025
https://github.com/dhimmel/openskistats
The study of skiing where we shred open data like pow. Quantifying alpine ski areas with geospatial metrics derived from OpenStreetMap.
data-science data-visualization downhill elevation geospatial gis mapping open-data openskimap openstreetmap orientation python quarto ski-areas skiing slope snowpack solar-irradiance sunlight topography
Last synced: 21 Jul 2025
https://github.com/anaclumos/heart-diagnosis-engine
2019 Korean Minjok Leadership Academy graduation project
data-science machine-learning pandas python scikit-learn
Last synced: 22 Aug 2025
https://github.com/bpw1621/streamlit-topic-modeling
Topic modeling streamlit app.
data-science latent-dirichlet-allocation machine-learning natural-language-processing nlp non-negative-matrix-factorization streamlit streamlit-application streamlit-sharing streamlit-webapp topic-modeling
Last synced: 27 Jul 2025
https://github.com/mrsaeeddev/ai-interview-questions
Real-World AI Interview Questions for You!
ai algorithms artificial-intelligence career data-science hacktoberfest hacktoberfest2020 interview interview-questions machine-learning resume
Last synced: 09 Aug 2025
https://github.com/nas5w/imdb-data
A JSON file of 50,000 IMDB movie reviews to be used in machine learning applications.
data data-science imdb javascript machine-learning
Last synced: 19 Apr 2025
https://github.com/numeract/rflow
Flexible R Pipelines with Caching
cache data-science pipeline r rflow
Last synced: 28 May 2026
https://github.com/rbhatia46/python-for-data-science
This repository contains IPython notebooks that cover enough Python to get you started on your data science journey.
data-science python-basics python3
Last synced: 03 Sep 2025
https://github.com/networks-learning/discussion-complexity
Code for "On the Complexity of Opinions and Online Discussions", WSDM 2019
complexity data-science discussion online-discussions opinion-mining paper wsdm
Last synced: 10 Aug 2025
https://github.com/amey-thakur/depression_detection_using_tweets
Twitter Depression Detection
amey ameythakur computer-engineering data-science depression-detection depression-detector machine-learning megasatish nlp project python supervised-machine-learning
Last synced: 29 Aug 2025
https://github.com/urbanclimatefr/coursera-learn-sql-basics-for-data-science
This repository contains the materials to "Learn SQL Basics for Data Science", a specialization provided by University of California, Davis through Coursera.
Last synced: 19 Feb 2026
https://github.com/raynardj/langhuan
Light weight labeling engine
classification data-science labeling labeling-tool machine-learning named-entity-recognition ner nlp tagging-tool
Last synced: 16 Oct 2025