Data Science
Data science is an inter-disciplinary field that uses scientific methods, processes, algorithms, and systems to extract knowledge from structured and unstructured data. Data scientists perform data analysis and preparation, and their findings inform high-level decisions in many organizations.
- GitHub: https://github.com/topics/data-science
- Wikipedia: https://en.wikipedia.org/wiki/Data_science
- Related Topics: data-analysis, data-mining, machine-learning, big-data, data-visualization,
- Aliases: datasciences, data-science-project, data-science-algorithm,
- Last updated: 2026-07-02 00:07:38 UTC
- JSON Representation
https://github.com/amirhosseinhonardoust/underwriting-decision-safety-lab
A decision-safety lab for loan approval: trains a baseline classifier, calibrates probabilities (ECE/Brier), sweeps confidence thresholds to build a coverage, quality frontier and outputs a defensible abstention policy (auto-decide vs review). Includes a Streamlit dashboard for report cards, triage UI, and data quality checks.
abstention calibration classification credit-risk data-quality data-science decision-policy loan-approval machine-learning mlops model-evaluation monitoring pandas reliability responsible-ai scikit-learn selective-classification streamlit uncertainty underwriting
Last synced: 10 Jun 2026
https://github.com/bdist/bdist-workspace
This repository provides containerized applications and microservices for the Information Systems and Databases Course @ Instituto Superior Tรฉcnico
data-engineering data-science docker jupyter jupyterlab notebook postgres postgresql python sql sqlite
Last synced: 09 Apr 2026
https://github.com/trafficgcn/osmnx_adjacency_matrix_for_graph_convolutional_networks
Creating an Adjacency Matrix Using the Dijkstra Algorithm for Graph Convolutional Networksย GCNs
adjacency-matrix data-science dijkstra dijkstra-algorithm gcn graph graph-algorithms graph-convolutional-networks matrix metrla open-street-map optimal-route osm osmnx python traffic traffic-analysis traffic-congestion
Last synced: 27 Oct 2025
https://github.com/elliotwutingfeng/twitter200m
Simple analysis of the Twitter 200M Data Dump of January 2023.
200m data-science haveibeenpwned leak osint twitter
Last synced: 16 Mar 2026
https://github.com/chaganti-reddy/evmarket-india
Electric Vehicle Market Segmentation Analysis in India
data-analysis data-science machine-learning market-segmentation pandas python
Last synced: 12 Apr 2025
https://github.com/giswqs/notebook-share
A repo for sharing notebooks
data-science dataviz geospatial jupyter-notebook mapping notebook
Last synced: 07 May 2025
https://github.com/sjcobb/webxr-threejs-midi-visualizer
WebXR, augmented reality MIDI data visualization, built with Three.js and Tone.js. See video: https://youtu.be/lIecCGtbqSM
3d aframe cannonjs data-science data-visualization depth-estimation game-development hit-detection javascript midi music-theory physics three threejs tone tonejs webvr webxr
Last synced: 12 Jul 2025
https://github.com/devinterview-io/reinforcement-learning-interview-questions
๐ฃ Reinforcement Learning interview questions and answers to help you prepare for your next machine learning and data science interview in 2024.
ai-interview-questions coding-interview-questions coding-interviews data-science data-science-interview data-science-interview-questions data-scientist-interview interview-practice interview-preparation machine-learning machine-learning-and-data-science machine-learning-interview machine-learning-interview-questions reinforcement-learning reinforcement-learning-interview-questions reinforcement-learning-questions reinforcement-learning-tech-interview software-engineer-interview technical-interview-questions
Last synced: 15 Jun 2025
https://github.com/eyadsibai/machine-learning-docker-image
Data Science/Machine Learning Docker Image for CPU
data-science docker docker-image google-cloud machine-learning
Last synced: 30 Apr 2025
https://github.com/jhrcook/tidy-tuesday
#TidyTuesday to practice data analysis in R
data-analysis data-science r regression-models rlang tidytuesday tidytuesday-challenge tidyverse
Last synced: 28 Oct 2025
https://github.com/sjcobb/ai-duet-3d
3D music animation + machine learning (in development)
3d-animation 3d-audio 3d-game artificial-intelligence browser-game data-science data-visualization game-development generative-music javascript machine-learning music music-bot music-composition music-theory music-visualizer neural-network web-development youtube-channel
Last synced: 28 Oct 2025
https://github.com/alugowski/matrepr
Format matrices and tensors to HTML, string, and LaTeX, with Jupyter integration.
data-science data-visualization data-viz graphblas jupyter numpy numpy-matrix pytorch scipy sparse sparse-data sparse-matrices sparse-matrix sparse-representations tensor tensorflow torch
Last synced: 12 Apr 2025
https://github.com/clojurecivitas/clojurecivitas.github.io
An open effort to structure learning resources with meaningful connections.
blog clay clojure data-science literate markdown notebooks
Last synced: 24 Jun 2025
https://github.com/xuri/excelize-py
Excelize is a Python port of Go Excelize library that allow you to write to and read from XLAM / XLSM / XLSX / XLTM / XLTX files.
calculation chart data-analysis data-science data-visualization ecma-376 excel excelize golang microsoft office ooxml pipy python spreadsheet visualization xlsm xlsx xlsxreader xlsxwriter
Last synced: 07 May 2025
https://github.com/martincastroalvarez/html2vec
Algorithm that converts an HTML to a vectorized object suitable for neural networks.
data-science html2vec natural-language-processing python web-scraping word2vec
Last synced: 11 Apr 2025
https://github.com/alan-turing-institute/hds-discussiongroup
Repo of the Turing's Humanities & Data Science Discussion Group
data-science digital-humanities discussion-group
Last synced: 03 Mar 2026
https://github.com/faridrashidi/cnsplots
๐จ Toolkit for generating publication-quality plots for Cell, Nature and Science journals
data-science data-visualization plotting publication-quality python scientific-publications
Last synced: 06 Apr 2026
https://github.com/jimbrig/lossrx
An R package, plumber API, database, and Shiny App for Actuarial Loss Development and Reserving Workflows.
actuarial-science claims-data claims-reserving data-science insurance modelling property-casualty reserving rpackage rshiny rstats workflow
Last synced: 01 Jul 2025
https://github.com/anshumansinha3301/safetitude-project-finance
My works as an SDE @Safetitude
data-science data-structures dbms
Last synced: 13 Jul 2025
https://github.com/fabsta/interesting_notebooks
A collection of Data Science Jupyter notebook (reference material)
data-science eda jupyter-notebook kaggle machine-learning python
Last synced: 03 Jul 2025
https://github.com/zen-reportz/zen_dash
Simple, Fast, Scalable , production grade dashboard application . Right solution for team
dashboard data-analytics data-science fastapi flask python3 shiny streamlit
Last synced: 13 Apr 2025
https://github.com/krypty/trefle
Trefle is a scikit-learn compatible estimator implementing the FuzzyCoCo algorithm that uses a cooperative coevolution algorithm to find and build interpretable fuzzy systems.
data-science deap evolutionary-algorithm fuzzy-logic interpretability machine-learning python scikit-learn
Last synced: 29 Oct 2025
https://github.com/openbridge/ob_pysh-db
pysh-db - The Data Science Toolkit (DSK)
bash data-science mysql postgres python redshift sql
Last synced: 10 Apr 2025
https://github.com/tomaztk/list_of_r_packages_for_data_scientist
List of useful R packages for data scientists
data-science r r-language r-markdown r-package r-programming statistics
Last synced: 16 May 2025
https://github.com/lungben/tableio.jl
A glue package for reading and writing tabular data. It aims to provide a uniform api for reading and writing tabular data from and to multiple sources.
arrow csv data data-science database dataframe dataframes excel jdf json-format parquet postgresql sqlite zip
Last synced: 12 Oct 2025
https://github.com/shwetajoshi601/world-bank-data-analysis
An Exploratory Data Analysis on the World Bank Dataset.
analysis data-science eda python3 world-bank-api worldbank
Last synced: 02 Aug 2025
https://github.com/jose-jaen/airbnb
Airbnb price prediction using Machine Learning and Deep Learning
ai algorithms bayes bayesian-optimization bayesian-statistics data-science deep-learning deployment econometrics machine-learning python streamlit xai
Last synced: 15 Apr 2025
https://github.com/srohit0/datasciencegraphalgorithms
Selected Graph Algorithms
astar astar-algorithm astar-pathfinding astar-search cpp data-science datascience depth-first-search dfs-algorithm dijkstra-algorithm dijkstra-shortest-path graph graph-algorithms graph-theory kosaraju kruskal-algorithm prim-algorithm strongly-connected-components topological-sort transpose
Last synced: 15 Apr 2025
https://github.com/devinterview-io/tensorflow-interview-questions
๐ฃ Tensorflow interview questions and answers to help you prepare for your next machine learning and data science interview in 2025.
ai-interview-questions coding-interview-questions coding-interviews data-science data-science-interview data-science-interview-questions data-scientist-interview interview-practice interview-preparation machine-learning machine-learning-and-data-science machine-learning-interview machine-learning-interview-questions software-engineer-interview technical-interview-questions tensorflow tensorflow-interview-questions tensorflow-questions tensorflow-tech-interview
Last synced: 04 Jul 2025
https://github.com/subugoe/scholcomm_analytics
Scholarly Communication Analytics with R Blog
bibliometrics data-science distill library rstats scholarly-communication-analytics
Last synced: 24 Feb 2026
https://github.com/techshot25/healthcare
Insurance cost predictor
bayesian-regression data-analysis data-science linear-regression machine-learning polynomial-regression random-forest-regression
Last synced: 24 Apr 2025
https://github.com/dovolopor-research/data-science-research-toolbox
๐งฐ ๆฐๆฎ็งๅญฆ็ง็ ๅทฅๅ ท็ฎฑ
data-science data-science-research data-science-resourses research-resources research-tool visualization
Last synced: 05 Jan 2026
https://github.com/oceannetworkscanada/api-python-client
Provides easy access to ONC data in Python
api data-science ocean-sciences onc python
Last synced: 20 Jul 2025
https://github.com/ma7555/kerasgen
A Keras/Tensorflow compatible image data generator for TripletLoss
data-generation data-generator data-generators data-science keras keras-tensorflow tensorflow triplet triplet-loss triplet-neural-network
Last synced: 11 Mar 2025
https://github.com/anshchoudhary/xgmodel
This repository contains code to predict the Expected Goals (xG) from shots in football using various machine learning models.
data-science football-analytics football-data machine-learning machine-learning-algorithms
Last synced: 10 Apr 2025
https://github.com/emptymalei/audiorepr
A python package to represent data using musical notes.
audiolization data data-audiolization data-science
Last synced: 12 Oct 2025
https://github.com/devinterview-io/supervised-learning-interview-questions
๐ฃ Supervised Learning interview questions and answers to help you prepare for your next machine learning and data science interview in 2025.
ai-interview-questions coding-interview-questions coding-interviews data-science data-science-interview data-science-interview-questions data-scientist-interview interview-practice interview-preparation machine-learning machine-learning-and-data-science machine-learning-interview machine-learning-interview-questions software-engineer-interview supervised-learning supervised-learning-interview-questions supervised-learning-questions supervised-learning-tech-interview technical-interview-questions
Last synced: 09 Feb 2026
https://github.com/chongyasong/youml
YouML: A Machine Learning Toolkit
ai artificial-intelligence big-data data-mining data-science machine-learning matplotlib numpy pandas python scikit-learn scipy
Last synced: 11 Apr 2025
https://github.com/aaaastark/top-big-data-scientist-questions-for-interview
Top Big Tech Data Science Questions
ai alibaba amazon apple computer-science computer-vision data-engineer data-science deep-learning facebook google ibm intel interview-questions machine-learning netflix nvidia orcale spacex tesla
Last synced: 04 Feb 2026
https://github.com/tomaztk/List_of_R_packages_for_Data_scientist
List of useful R packages for data scientists
data-science r r-language r-markdown r-package r-programming statistics
Last synced: 30 Jul 2025
https://github.com/tristanbilot/airflow-rbac-roles-cli
A tool to create Airflow RBAC roles with dag-level permissions from cli.
airflow cloud-composer data-engineering data-science gcp permissions pipeline rbac-roles
Last synced: 25 Oct 2025
https://github.com/canagnos/mcp
Tools for Measuring Classification Performance for R, Python and Spark
artificial-intelligence classification data-mining data-science machine-learning machine-learning-algorithms
Last synced: 28 Apr 2025
https://github.com/eliasdabbas/dash-aggrid-scales
Color scales (continuous and categorical) and bar charts for Dash-Ag-Grid
aggrid color-scales color-scheme data-science data-visualization html plotly-dash table
Last synced: 16 Mar 2026
https://github.com/doarakko/kagoole
Search kaggle competitions and solutions based on data and predict type, evaluation metric, etc.
artificial-intelligence data-science heroku kaggle kaggle-competition kaggle-solution machine-learning webapp
Last synced: 17 Oct 2025
https://github.com/bodo-ai/pydough
Analytics DSL for Python
analytics artificial-intelligence big-data data-science defog defog-ai machine-learning pandas python sql text-to-analytics text-to-sql tpch
Last synced: 22 May 2026
https://github.com/mathewroy/ynabr
Analyze and visualize your You Need A Budget (YNAB) data. YNAB meets R programming language.
api data-analysis data-science data-visualization r ynab ynab-api
Last synced: 30 Jul 2025
https://github.com/matteocargnelutti/maguire-lab-seizure-detection-webapp
๐ง Maguire Lab's Deep Learning Seizure Detection WebApp.
data-science eeg-signals-processing neuroscience
Last synced: 21 Apr 2025
https://github.com/devinterview-io/bias-and-variance-interview-questions
๐ฃ Bias And Variance interview questions and answers to help you prepare for your next machine learning and data science interview in 2025.
ai-interview-questions bias-and-variance bias-and-variance-interview-questions bias-and-variance-questions bias-and-variance-tech-interview coding-interview-questions coding-interviews data-science data-science-interview data-science-interview-questions data-scientist-interview interview-practice interview-preparation machine-learning machine-learning-and-data-science machine-learning-interview machine-learning-interview-questions software-engineer-interview technical-interview-questions
Last synced: 08 Feb 2026
https://github.com/devopscorner/nifi
Production Grade Nifi & Nifi Registry. Deploy for VM (Virtual Machine) with Terraform + Ansible, Helm & Helmfile for Kubernetes (EKS)
ansible data-science data-structures docker docker-compose dockerhub ecr eks eks-cluster etl kubernetes machine-learning ml mlops nifi nifi-registry terraform vpn vpn-client
Last synced: 08 Sep 2025
https://github.com/sdpython/mlstatpy
Mathematics, Algorithmic, Data-Science, Teaching Materials
algorithms data-science mathematics python3 teaching-materials
Last synced: 23 Jun 2025
https://github.com/opengeos/qgis-leafmap-plugin
A QGIS plugin for leafmap
data-science geospatial leafmap python qgis qgis-plugin
Last synced: 30 Jan 2026
https://github.com/flbulgarelli/recursos-python
Spanish resources for learning Python
data-science education http imperative-programming object-oriented-programming python testing
Last synced: 30 Oct 2025
https://github.com/csfelix/csfelix.github.io
๐ฑ My Personal Portfolio ๐ฑ
css data-science data-science-competition data-science-portfolio data-science-projects html javascript js portfolio
Last synced: 05 Aug 2025
https://github.com/zenml-io/template-starter
A template for a starter project for ZenML
cookiecutter copier-template data-science machine-learning mlops zenml
Last synced: 14 Apr 2025
https://github.com/hoangsonww/standard-deviation-calculator
๐ This repository contains a Standard Deviation Calculator implemented in C++. It provides an efficient algorithm for calculating the statistical standard deviation of a dataset, making it a valuable tool for students, researchers, and analysts seeking a reliable method for data analysis.
algorithms cplusplus cpp data data-analysis data-analytics data-science standard-deviation standard-deviation-calculator standard-deviations
Last synced: 22 Sep 2025
https://github.com/dmedri/roaster
R - Fetch, build and deploy.
build-tool data-science rstats statistical-analysis statistics virtual-environments
Last synced: 30 Jul 2025
https://github.com/qpwedev/blockchain-network-visualizer
Blockchain Network Visualizer for TON.
blockchain data-science network ton toncoin
Last synced: 14 Mar 2025
https://github.com/seandavi/machinelearningintro
Machine learning use cases for teaching
data-science machine-learning r rstats teaching-materials tutorial
Last synced: 05 Apr 2025
https://github.com/khadkarajesh/internship-preparation-kit
Repository consist the technical and behavioural questions asked by french tech companies for internship
algorithm algorithms coding-interviews codinggame data-science data-structures data-structures-and-algorithms french hacktoberfest hacktoberfest-accepted hacktoberfest2022 internship interview interview-practice interview-preparation interview-questions interview-test leetcode python software-engineering
Last synced: 15 Jul 2025
https://github.com/h2oai/article-information-2019
Article for Special Edition of Information: Machine Learning with Python
data-science explainable-ai explainable-ml fairness-ai fairness-ml fairness-testing fatml iml interpretable-ai interpretable-machine-learning interpretable-ml machine-learning machine-learning-interpretability python xai
Last synced: 07 Apr 2025
https://github.com/takuti/anompy
A Python library for anomaly detection
anomaly-detection data-science forecasting machine-learning python
Last synced: 15 Apr 2025
https://github.com/bsomps/OpenGeoPlotter
A PyQt5 app catered to the exploration industry for visualizing geologic drill hole data with features like cross-sections, simple 3D views, strip logs, scatter plots, and downhole line plots. Includes data transformation techniques like factor analysis, desurveying, and alpha-beta conversion.
cross-sections data-science drilling exploration geology geoscience pyqt5 python strip-logs
Last synced: 05 Mar 2025
https://github.com/inphyt/imdb_sentiment_analysis_bert
BERT Sentiment Classification on the IMDb Large Movie Review Dataset.
bert bert-model data-mining data-mining-algorithms data-mining-python data-science machine-learning machine-learning-algorithms natural-language-processing nlp nlp-machine-learning scikit-learn sentiment-analysis sentiment-classification spacy spacy-models spacy-nlp
Last synced: 30 Apr 2025
https://github.com/mindful-ai-assistants/hackapucsp-2024
๐ HackaPUCSP 2024 - - Data Science and AI Hackathon - Pontifical Catholic University of Sรฃo Paulo
automation data-science design github-actions hackathon-project oneness-consciousness package-manager programming pucsp pytest python3 unittest
Last synced: 11 Jul 2025
https://github.com/ashishbamania/tutorials-on-artificial-intelligence
A collection of AI tutorials from Dr. Ashish Bamania
agentic-ai ai ai-agents artificial-intelligence crewai data-science langchain llama machine-learning ml rag retreival-augmented-generation software-engineering
Last synced: 13 May 2025
https://github.com/arv-anshul/yt-watch-history
Analyse your YouTube watch history using Data Science, ML and NLP.
data-science docker docker-compose fastapi ml mlflow mlops mongodb nlp pydantic python3 streamlit youtube-api
Last synced: 22 Apr 2025
https://github.com/koalaverse/analyticssummit19
Material for 2019 Analytics Summit Machine Learning with R Training
data-science educational-materials machine-learning r workshop-materials
Last synced: 15 May 2025
https://github.com/nas5w/imdb-data
A JSON file of 50,000 IMDB movie reviews to be used in machine learning applications.
data data-science imdb javascript machine-learning
Last synced: 19 Apr 2025
https://github.com/oracle-samples/oracle-aidp-samples
Oracle AI Data Platform Workbench Samples
ai-agent ai-agents ai-assistant ai-data data-engineering data-ingestion data-integration data-science
Last synced: 18 Jan 2026
https://github.com/mratsim/meilleur-data-scientist-france-2018
My solution for the competition "Le meilleur data scientist de France 2018" (Best Data Scientist of France 2018)
data-science data-science-competition machine-learning xgboost
Last synced: 15 Sep 2025
https://github.com/dhimmel/openskistats
The study of skiing where we shred open data like pow. Quantifying alpine ski areas with geospatial metrics derived from OpenStreetMap.
data-science data-visualization downhill elevation geospatial gis mapping open-data openskimap openstreetmap orientation python quarto ski-areas skiing slope snowpack solar-irradiance sunlight topography
Last synced: 21 Jul 2025
https://github.com/sepandhaghighi/ethereum-fraud-detection-visualization
Ethereum Fraud Detection Visualization
data-analysis data-science data-visualization ethereum exploratory-data-analysis fraud fraud-detection machine-learning matplotlib python visualization
Last synced: 06 Sep 2025
https://github.com/firaskahlaoui/heart-disease-analysis-r
R for data visualization and analysis of heart disease datasets.
data-science data-visualization ggplot kaggle-dataset r statistics
Last synced: 14 Apr 2025
https://github.com/networks-learning/discussion-complexity
Code for "On the Complexity of Opinions and Online Discussions", WSDM 2019
complexity data-science discussion online-discussions opinion-mining paper wsdm
Last synced: 10 Aug 2025
https://github.com/laminetourelab/tutorial
Tutorials on machine learning, artificial intelligence in general and in biomedical research.
artificial-intelligence bioinformatics bioinformatics-tutorials computer-vision data-science data-visualization-dashboard deep-learning graph-machine-learning image-analysis machine-learning natural-language-processing plotly-dash python pytorch scrna-seq shiny-apps tensorflow-tutorials transfer-learning tutorial-code tutorials
Last synced: 24 Oct 2025
https://github.com/amey-thakur/depression_detection_using_tweets
Twitter Depression Detection
amey ameythakur computer-engineering data-science depression-detection depression-detector machine-learning megasatish nlp project python supervised-machine-learning
Last synced: 29 Aug 2025
https://github.com/anaclumos/heart-diagnosis-engine
2019๋ ๋ฏผ์กฑ์ฌ๊ด๊ณ ๋ฑํ๊ต ์กธ์ ํ๋ก์ ํธ
data-science machine-learning pandas python scikit-learn
Last synced: 22 Aug 2025
https://github.com/rbhatia46/python-for-data-science
This repository contains iPython notebooks to get you started with sufficient amount of Python you need to learn to get started with your Data Science Journey.
data-science python-basics python3
Last synced: 03 Sep 2025
https://github.com/strazto/mandrake
๐๐- Bring reading the manual ๐ closer to your drake ๐ workflow ๐ฅ
data-science drake high-performance-computing makefile pipeline r r-package reproducibility reproducible-research rstats workflow
Last synced: 13 Jul 2025
https://github.com/fozouni/data_science
Source codes of the first "Data Science Course"
artificial-intelligence data-science datascience deep-learning excel machine-learning python
Last synced: 04 Sep 2025
https://github.com/mertguvencli/keyword-extractor
This project aims to find "what are the trending techs on Data Science jobs?" using NER.
data-science machine-learning ner nlp python spacy
Last synced: 10 Sep 2025
https://github.com/numeract/rflow
Flexible R Pipelines with Caching
cache data-science pipeline r rflow
Last synced: 28 May 2026
https://github.com/devinterview-io/svm-interview-questions
๐ฃ SVM interview questions and answers to help you prepare for your next machine learning and data science interview in 2025.
ai-interview-questions coding-interview-questions coding-interviews data-science data-science-interview data-science-interview-questions data-scientist-interview interview-practice interview-preparation machine-learning machine-learning-and-data-science machine-learning-interview machine-learning-interview-questions software-engineer-interview svm svm-interview-questions svm-questions svm-tech-interview technical-interview-questions
Last synced: 28 Jan 2026
https://github.com/chandraprakash-bathula/apparel-recommendations
This project implements a personalized apparel recommendation engine using content-based search with the Amazon API, NLTK, and Keras libraries.
boxplot cnn-keras data-analysis data-science deep-learning linear-regression machine-learning numpy pandas scatter-plot scikit-learn svm tensorflow xgboost
Last synced: 23 Mar 2025
https://github.com/dogukanayd/catch-tweet-with-keyword
Get Tweet by giving keyword and do keyword analysis
data-analysis data-mining data-science datascience keyword-analysis python python27 social-media social-network social-network-analysis tweet tweets twitter twitter-analysis twitter-api twitter-oauth twitter-sentiment-analysis twitterwordcloud wordcloud
Last synced: 30 Aug 2025
https://github.com/kennethleungty/english-premier-league-var-analysis
Analyzing Video Assistant Referee (VAR) decisions in the English Premier League (2019 - 2021)
data-analysis data-analytics data-science english-premier-league football soccer var
Last synced: 27 Aug 2025
https://github.com/mrsaeeddev/ai-interview-questions
๐ค Real-World AI Interview Questions for You!
ai algorithms artificial-intelligence career data-science hacktoberfest hacktoberfest2020 interview interview-questions machine-learning resume
Last synced: 09 Aug 2025
https://github.com/lambdaclass/data_etudes
LambdaClass statistics, machine learning and data science etudes
data-science notebook probability statistics
Last synced: 09 Apr 2025
https://github.com/dalageo/ml-titanicshipwreck
Exploring the World's Most Renowned Shipwreck ๐ข
data-science decision-tree-classifier logistic-regression machine-learning python random-forest-classifier scikit-learn stacking-ensemble titanic-dataset xgboost-classifier
Last synced: 04 Sep 2025
https://github.com/blurred-machine/data-science
This repository contains all of my minor projects built by me during the learning plase of Machine Learning and Data Science. Feel free to create a PR for modifications.
algorithms-python data-science jupyter-notebook learning-by-doing machine-learning-algorithms minor-project python
Last synced: 27 Apr 2025
https://github.com/urbanclimatefr/coursera-learn-sql-basics-for-data-science
This repository contains the materials to "Learn SQL Basics for Data Science", a specialization provided by University of California, Davis through Coursera.
Last synced: 19 Feb 2026
https://github.com/devinterview-io/linear-algebra-interview-questions
๐ฃ Linear Algebra interview questions and answers to help you prepare for your next machine learning and data science interview in 2025.
ai-interview-questions coding-interview-questions coding-interviews data-science data-science-interview data-science-interview-questions data-scientist-interview interview-practice interview-preparation linear-algebra linear-algebra-interview-questions linear-algebra-questions linear-algebra-tech-interview machine-learning machine-learning-and-data-science machine-learning-interview machine-learning-interview-questions software-engineer-interview technical-interview-questions
Last synced: 07 Feb 2026
https://github.com/alro10/twitter-sentiment-live
Sentiment analysis for tweets written in Portuguese-Brazil
dash dash-app dash-plotly dashboards data-science plotly portuguese-brazilian python3 sentiment-analysis tweepy tweets vader-sentiment-analysis
Last synced: 17 Jun 2025
https://github.com/hourout/linora
Simple and efficient tools for data science.
data-analysis data-mining data-science hyperparameter-optimization lightgbm machine-learning python xgboost
Last synced: 04 Apr 2025
https://github.com/rbhatia46/data-preprocessing-template
This repository includes all the Data Preprocessing required before using a dataset on a Machine Learning Model. Please refer README on how to use.
data-preprocessing data-science machine-learning python
Last synced: 11 Apr 2025
https://github.com/edaaydinea/op1-prediction-of-the-different-progressive-levels-of-alzheimer-s-disease
This is an optional model development project on a real dataset related to predicting the different progressive levels of Alzheimerโs disease (AD).
alzheimer-disease-prediction anova-test catboost-classifier chi-square-test data-science deep-neural-networks keras-neural-networks lightgbm-classifier logistic-regression machine-learning multi-layer-perceptron-classifier neural-networks random-forest-classifier tensorflow xgboost-classifier
Last synced: 11 Apr 2025
https://github.com/hsins/mpl-tc-fonts
๐น๐ผ A package to solve the problem of "Tofu" in your matplotlib plots whenever you're trying to use Traditional Chinese characters in labels or texts.
cjk-characters data-science matplotlib
Last synced: 29 Oct 2025