Data Science
Data science is an inter-disciplinary field that uses scientific methods, processes, algorithms, and systems to extract knowledge from structured and unstructured data. Data scientists perform data analysis and preparation, and their findings inform high-level decisions in many organizations.
- GitHub: https://github.com/topics/data-science
- Wikipedia: https://en.wikipedia.org/wiki/Data_science
- Related Topics: data-analysis, data-mining, machine-learning, big-data, data-visualization,
- Aliases: datasciences, data-science-project, data-science-algorithm,
- Last updated: 2026-07-03 00:07:42 UTC
- JSON Representation
https://github.com/hassaku/audio-plot
Python library to converts a line graph to sound and return an object that can be played in Jupyter notebook or Google Colab. Values are represented by pitches, and the timeline is represented by left and right pans. It was created to make data science fun for the visually impaired.
audio-plot colab data-science jupyter-notebook python visually-impaired
Last synced: 01 Nov 2025
https://github.com/dhhruv/stock-price-prediction
A deep learning project in which the model was trained using LSTM layers and Tata Stock prices were predicted and compared with thier actual values.
algorithm cli college-project data data-science dataset deep-learning jupyter jupyter-notebook lstm machine-learning prediction science shell stock-price-prediction tata-beverages terminal
Last synced: 03 May 2025
https://github.com/l480/rewe-price-data
๐ช Daily updated prices of all items from the German supermarket chain REWE as CSV (including EAN, grammage, product image etc.)
csv data-science ean inflation prices rewe shrinkflation supermarket
Last synced: 11 Jan 2026
https://github.com/luminousmen/python_for_ds
Python for Data Analysis workshop
data-analysis data-science python tutorial
Last synced: 01 May 2025
https://github.com/kurtispykes/twitter-sentiment-analysis
Creating a Gradio user interface to predict the sentiment of a tweet
data-science deep-learning gradio keras lstm machine-learning natural-language-processing neural-network nlp nlp-machine-learning prediction python sentiment-analysis tweet twitter
Last synced: 03 May 2025
https://github.com/florents-tselai/sqlite-for-data-scientists
Notebooks and supporting files for SQLite for Data Scientists Online Live Training, on OReilly Learning Platform
data-science learning sql sqlite3 training-materials
Last synced: 11 Apr 2025
https://github.com/dwhitena/ai-classroom
Code examples for the live, online AI Classroom training:
ai artificial-intelligence data-science machine-learning python pytorch tensorflow
Last synced: 07 Mar 2026
https://github.com/VaibhavAbhimanyooHiwase/Risk_Calculation_using_Backward_Elimination_Algorithm_in_Life_Insurance
Implementation of backward elimination algorithm used for dimensionality reduction for improving the performance of risk calculation in life insurance industry.
alpha-value backward-elimination data-mining-algorithms data-science insurance kaggle-life-insurance life-insurance multiple-linear-regression p-value random-forest risk-analysis risk-assessment risk-calculations risk-modelling risk-models statistical-analysis statistical-data statistical-learning statistical-models statistics
Last synced: 29 Jul 2025
https://github.com/devinterview-io/llmops-interview-questions
๐ฃ LLMOps interview questions and answers to help you prepare for your next machine learning and data science interview in 2025.
ai-interview-questions coding-interview-questions coding-interviews data-science data-science-interview data-science-interview-questions data-scientist-interview interview-practice interview-preparation llmops llmops-interview-questions llmops-questions llmops-tech-interview machine-learning machine-learning-and-data-science machine-learning-interview machine-learning-interview-questions software-engineer-interview technical-interview-questions
Last synced: 16 Feb 2026
https://github.com/thecoderpinar/spotify_trends_2023_analysis
Exploring Spotify's latest trends, top songs, genres, and artists using Python, Pandas, NumPy, Matplotlib, CNNs for image-based analysis, and advanced algorithms for music recommendation. Dive into the world of music data and discover what's trending on Spotify! ๐ต๐
cnn cnn-keras data-analysis data-science data-visualization machine-learning matplotlib music-trend numpy pandas python spotify
Last synced: 30 Apr 2025
https://github.com/hsins/mpl-tc-fonts
๐น๐ผ A package to solve the problem of "Tofu" in your matplotlib plots whenever you're trying to use Traditional Chinese characters in labels or texts.
cjk-characters data-science matplotlib
Last synced: 29 Oct 2025
https://github.com/jeonghunyoon/machine-learning-lecture-notes
Lecture notes and codes for machine learning
data-science decision-tree deep-learning lecture-notes linear-algebra linear-regression lsa machine-learning naive-bayes-classifier statistics
Last synced: 10 Apr 2025
https://github.com/the-akira/datascience
Coleรงรฃo de recursos sobre Ciรชncia de Dados com Python.
data data-analysis data-science data-structures data-visualization machine-learning machine-learning-algorithms mathematics pandas pandas-dataframe portuguese-language python3 scikit-learn statistics sympy
Last synced: 07 May 2025
https://github.com/alexioannides/notes-and-demos
Study notes and demos.
data-engineering data-science ml-engineering mlops python
Last synced: 29 Oct 2025
https://github.com/alro10/twitter-sentiment-live
Sentiment analysis for tweets written in Portuguese-Brazil
dash dash-app dash-plotly dashboards data-science plotly portuguese-brazilian python3 sentiment-analysis tweepy tweets vader-sentiment-analysis
Last synced: 17 Jun 2025
https://github.com/thomasnield/oreilly_kotlin_for_data_science
Notes, slides, and contents for the O'Reilly videos using Kotlin for Data Science
data-engineering data-science etl kotlin oreilly statistics
Last synced: 27 Mar 2025
https://github.com/alvarobartt/ea-associate-ds
Electronic Arts (EA) NLP Assignment for: Associate Data Scientist
data-science electronic-arts nlp recruitment-task
Last synced: 12 Apr 2025
https://github.com/rbhatia46/data-preprocessing-template
This repository includes all the Data Preprocessing required before using a dataset on a Machine Learning Model. Please refer README on how to use.
data-preprocessing data-science machine-learning python
Last synced: 11 Apr 2025
https://github.com/doubleml/doubleml-serverless
DoubleML-Serverless - Distributed Double Machine Learning with a Serverless Architecture
aws-lambda causal-inference data-science double-machine-learning econometrics machine-learning python scikit-learn serverless statistics
Last synced: 07 May 2025
https://github.com/hourout/linora
Simple and efficient tools for data science.
data-analysis data-mining data-science hyperparameter-optimization lightgbm machine-learning python xgboost
Last synced: 04 Apr 2025
https://github.com/bcgov/canwqdata
R ๐ฆ to download ๐จ๐ฆ open water quality data
data-science env r r-package rlang rstats
Last synced: 20 Jul 2025
https://github.com/klarna-incubator/mleko
Simplify and accelerate your machine learning development with mleko. Designed with modularity and customization in mind, it seamlessly integrates into your existing workflows. Its robust caching system optimizes performance, taking you from data ingestion to finalized models with unparalleled efficiency.
artificial-intelligence data-science machine-learning pipeline python vaex
Last synced: 11 Apr 2025
https://github.com/cimentadaj/dataharvesting
Material for the course 'Data Harvesting' for the masters in computational social science - UC3M
api data-science r web-scraping
Last synced: 30 Apr 2025
https://github.com/ttitcombe/constituencymap
Python code to generate political maps
brexit choropleth choropleth-map data-science election-data map political-science politics united-kingdom visualization
Last synced: 11 Apr 2025
https://github.com/edaaydinea/op1-prediction-of-the-different-progressive-levels-of-alzheimer-s-disease
This is an optional model development project on a real dataset related to predicting the different progressive levels of Alzheimerโs disease (AD).
alzheimer-disease-prediction anova-test catboost-classifier chi-square-test data-science deep-neural-networks keras-neural-networks lightgbm-classifier logistic-regression machine-learning multi-layer-perceptron-classifier neural-networks random-forest-classifier tensorflow xgboost-classifier
Last synced: 11 Apr 2025
https://github.com/ptyadana/tableau_2020_a-z_hands-on
Tableau Projects for data analysis, data analytics and data visualaization on different data sets
data-analysis data-science data-visualization tableau tableau-dashboards tableau-desktop tableau-public tableau-workbooks
Last synced: 03 Aug 2025
https://github.com/yangfa-zhang/lunax
Lunax is a machine learning framework specifically designed for the processing and analysis of tabular data.
data-analysis data-science lunax machine-learning tabular-data
Last synced: 14 Dec 2025
https://github.com/strazto/mandrake
๐๐- Bring reading the manual ๐ closer to your drake ๐ workflow ๐ฅ
data-science drake high-performance-computing makefile pipeline r r-package reproducibility reproducible-research rstats workflow
Last synced: 13 Jul 2025
https://github.com/lucadibello/it-salary-analysis
๐ฐ Analysis of Salaries in IT Roles: DevOps, Cyber Security, and AI
ai cybersecurity data-science devops jupyter-notebook salary-analysis
Last synced: 03 Jul 2025
https://github.com/sepandhaghighi/ethereum-fraud-detection-visualization
Ethereum Fraud Detection Visualization
data-analysis data-science data-visualization ethereum exploratory-data-analysis fraud fraud-detection machine-learning matplotlib python visualization
Last synced: 06 Sep 2025
https://github.com/fozouni/data_science
Source codes of the first "Data Science Course"
artificial-intelligence data-science datascience deep-learning excel machine-learning python
Last synced: 04 Sep 2025
https://github.com/dina-hosny/chaincare
ChainCare is a health information system that uses smart contracts to handle medical procedures and stores the medical history in Block Chains.
api-rest bigchain blockchain blockchain-technology data-science data-storage data-visualization ethereum golang health-informatics-systems healthcare insomnia metamask postgresql postman reactjs solidity truffle web3
Last synced: 13 Apr 2026
https://github.com/anshumansinha3301/occupational-hazard-analysis
The Occupational Hazard Analysis Using Industry Data project aims to analyze safety metrics across various industries to identify trends in reported incidents, injuries, and fatalities.
consulting-services data-science industrialisation jupyter-notebook python
Last synced: 09 Oct 2025
https://github.com/mmore500/teeplot
organize data visualization output, automatically picking meaningful names based on semantic plotting variables
data-science data-visualization python python-package workflow
Last synced: 25 Feb 2026
https://github.com/gabrieltempass/abtester
A web application to design and evaluate the results of A/B tests.
ab-testing data-science hypothesis-testing python sample-size statistical-significance statistics streamlit web-app
Last synced: 06 Oct 2025
https://github.com/aruizeac/alexandria
The Alexandria Project is an open-source platform where people can share their knowledge through books, podcasts, docs and videos.
alexandria data-science donation ebooks go golang grpc http kafka knowledge knowledge-sharing library microservice podcasts python societies streaming videos webservice
Last synced: 11 Mar 2026
https://github.com/nicodupont/resources
Resources on SAS, Python, SQL, VBA-Excel, etc ...
airflow data-science data-visualization excel python r sas sql vba
Last synced: 24 Jun 2025
https://github.com/mindful-ai-assistants/.github
โฏ Empowering businesses with AI-driven technologies like Copilots, Agents, Bots, and Predictions, alongside intelligent Decision-Making Suppor
agents artificial-intelligence automation copilots data-science descion-making-systems design geolocation jupyter-notebook machine-learning mathematical-modelling mathpix mongodb oneness-consciousness predictive-analytics predictive-modeling python3 sql tsql
Last synced: 11 Jul 2025
https://github.com/aiguofer/sql_connectors
A simple wrapper for SQL connections using SQLAlchemy and Pandas read_sql to standardize SQL workflow with multiple data sources.
data-analysis data-analytics data-exploration data-science pandas relational-databases sql sqlalchemy standardized-api
Last synced: 13 Oct 2025
https://github.com/aicorsair/dataquest-data-science-analysis-projects
A repository dedicated to storing guided projects completed while learning data science concepts with Dataquest.
classification-models cluster-analysis data-analysis data-analytics data-cleaning data-preparation data-preprocessing data-science data-visualization deep-learning excel feature-engineering machine-learning pandas-dataframe power-bi python-3 regression-models scikit-learn sql web-scraping
Last synced: 27 Oct 2025
https://github.com/tkonopka/rcssplot
R plots styled with css
css data-science r visualization
Last synced: 22 Oct 2025
https://github.com/xability/py-maidr
Python binder for maidr library
accessibility binder braille data-science data-visualization python
Last synced: 03 Apr 2026
https://github.com/jdiaz97/iucnredlist.jl
API Wrapper for the IUCN Red List.
biodiversity data-science ecology
Last synced: 21 Oct 2025
https://github.com/zgornel/datalinter
Linting tools for ML workflows, data, code
code-analysis-tool coding-agent data-science linting
Last synced: 21 Apr 2026
https://github.com/kennethleungty/langextract-gemma-structured-extraction
Using LangExtract and Gemma 3 for structured information extraction from unstructured text in insurance polices
artificial-intelligence data-science deep-learning gemini gemma gemma3-4b google langextract large-language-models llm llms machine-learning openai structured-data unstructured-data
Last synced: 03 Sep 2025
https://github.com/nikhilaravi/neuralnetflix
Movie Genre Prediction from movie posters using Deep Learning
Last synced: 18 Oct 2025
https://github.com/fchamroukhi/samurais
StAtistical Models for the UnsupeRvised segmentAion of tIme-Series
artificial-intelligence change-point-detection data-science dynamic-programming em-algorithm hidden-markov-models hidden-process-regression human-activity-recognition latent-variable-models model-selection multivariate-timeseries newton-raphson piecewise-regression statistical-inference statistical-learning time-series-analysis time-series-clustering
Last synced: 22 Oct 2025
https://github.com/opt-nc/setup-duckdb-action
๐ฆ Blazing Fast and highly customizable Github Action to setup a DuckDb runtime
action actions analytics csv data-science database databases dataquality dataqualitycheck duckdb embedded-database github-actions olap sql
Last synced: 16 Mar 2026
https://github.com/fffaraz/datasets
My collection of random datasets
data-mining data-science dataset
Last synced: 04 Sep 2025
https://github.com/teddyoweh/sentiment-analysis-api
The Sentiment Analysis Api was created using python flask module,it allows users to parse a text or sentence throught the (?text) arguement, then view the sentiment analysis of that sentence. It can be implementable into a web application.
api data-science flask machine-learning nlp-machine-learning php python sentiment-analysis
Last synced: 09 Apr 2025
https://github.com/quantifyearth/yirgacheffe
A declarative geospatial library for Python to make data-science with maps easier
data-science geospatial python3
Last synced: 01 Apr 2026
https://github.com/rpoteau/pyphyschem
Python in the physical chemistry lab
chemistry data-science jupyter machine-learning physical-chemistry python sympy
Last synced: 05 Apr 2026
https://github.com/maayanlab/playbook-workflow-builder
A repository for the Playbook Workflow Builder project.
bioinformatics biology cwl data-science gene-expression gene-ontology gene-sets proteomics rna-seq-analysis systems-biology workflow
Last synced: 11 Jul 2025
https://github.com/brunocampos01/porto-seguro-safe-driver-prediction
Predict if a driver will file an insurance claim next year. (Kaggle Competition)
challenge data-cleansing data-engineering data-science dataset insurance-claims kaggle kaggle-competition machine-learning porto-seguro python random-forest xgboost
Last synced: 05 Sep 2025
https://github.com/apear9/riskmapr
Code for riskmapr apps for invasive weed risk mapping
bayesian bayesian-network data-science ecology ecology-of-invasion invasive-species risk-map shiny shiny-apps weeds
Last synced: 30 Jul 2025
https://github.com/rishisankineni/capital-one-data-challenge
NYC Taxi Data Challenge - Data Scientist
capital-one data-science eda machine-learning python-3-6 xgboost
Last synced: 09 Apr 2025
https://github.com/nikhilba/aerial-imagery
Data Science Research Project: Map poverty using satellite images.
carnegie-mellon-university data-science deep-learning ipynb neural-network satellite-images vgg16
Last synced: 28 Oct 2025
https://github.com/devinterview-io/optimization-interview-questions
๐ฃ Optimization interview questions and answers to help you prepare for your next machine learning and data science interview in 2025.
ai-interview-questions coding-interview-questions coding-interviews data-science data-science-interview data-science-interview-questions data-scientist-interview interview-practice interview-preparation machine-learning machine-learning-and-data-science machine-learning-interview machine-learning-interview-questions optimization optimization-interview-questions optimization-questions optimization-tech-interview software-engineer-interview technical-interview-questions
Last synced: 30 Jan 2026
https://github.com/adilshamim8/100-ai-machine-learning-deep-learnin-projects
100 AI Machine Learning Deep Learning Projects is a curated repository showcasing innovative, production-ready solutions across computer vision, NLP, and more.
ai artificial-intelligence computer-vision computer-vision-projects data-science deep-learning deep-learning-projects machine-learning machine-learning-projects nlp nlp-projects python
Last synced: 20 Apr 2026
https://github.com/kennethleungty/Finance-LLMs
Comprehensive Compilation of Customized LLMs for Specific Domains and Industries
artificial-intelligence data-science deep-learning domain-specific-models fine-tuning fine-tuning-llm finetuning gen-ai genai generative-ai industry-specific large-language-models llm machine-learning nlp
Last synced: 03 Oct 2025
https://github.com/synthesized-io/insight
๐งฟ Metrics & Monitoring of Datasets
data data-analysis data-science framework insights metrics monitoring python
Last synced: 24 Jun 2025
https://github.com/kensk8er/langdist
Multilingual Language Modeling Toolkit
character-embeddings data-science deep-learning language-model lstm machine-learning multilingual natural-language-generation natural-language-processing neural-network nlp python recurrent-neural-networks tensorflow
Last synced: 08 Apr 2026
https://github.com/quant-aq/aeromancy
โ๏ธ Aeromancy: A framework for performing reproducible AI and ML
aeromancy data-science machine-learning reproducibility reproducible reproducible-experiments reproducible-research reproducible-science
Last synced: 24 Dec 2025
https://github.com/tuanle618/AEDA
AEDA - Automated Data Exploratory Analysis in R
data-science eda eda-report exploratory-data-analysis r
Last synced: 29 Jul 2025
https://github.com/toxpi/toxpir
toxpiR R package for the Toxicological Priority Index (ToxPi) algorithm.
data-science modeling r r-package toxicology
Last synced: 19 Aug 2025
https://github.com/bluegreen-labs/appeears
Interface to the NASA AppEEARS API
api data-science r-package remote-sensing rstats
Last synced: 23 Aug 2025
https://github.com/dionhaefner/fowd
Processing framework for FOWD, a free ocean wave dataset, ready for your ML application :ocean:
data-science machine-learning ocean open-data waves
Last synced: 21 Aug 2025
https://github.com/aniketpatilanalyst/Disease-Prediction-Model
Prediction Model on Cell Images for Detecting Malaria
artificial-intelligence cnn-classification data-science deep-neural-networks disease-prediction image-processing
Last synced: 10 Mar 2025
https://github.com/ammarlodhi255/student_performance_indicator_end-to-end_implementation
An end-to-end machine learning project, student performance indicator. The goal of this project is to understand the influence of the parents background, test preparation, and various other variables on the students performance.
aws cd-pipeline data-analysis data-science data-science-projects eda end-to-end-machine-learning machine-learning machine-learning-projects regression regression-analysis
Last synced: 27 Sep 2025
https://github.com/eshikashah/skillship-internship-project-1-prediction-of-a-patient-s-no_show-appointments
Skillship Foundation internship project.
classification data-processing data-science machine-learning python
Last synced: 21 Jul 2025
https://github.com/sithu-khant/math-for-ml-ds
Mathematics learning path for Machine Learning and Data Science.
awesome-list data-science deep-learning machine-learning mathematics
Last synced: 13 Apr 2025
https://github.com/gabrieldim/calculation-cholesterol-data-science
Cholesterol is calculated from the given set of data.
convolutional-layers data-science dense layer
Last synced: 07 Jul 2025
https://github.com/codewithmuh/insatgram-ai-model
Create high-quality images effortlessly for your brand using Fooocus, an advanced image generation software.
ai ai-models artificial-intelligence chatgpt data-science generative-ai-model generative-ai-tools generative-model instagram machine-learning models text-to-image
Last synced: 10 Apr 2025
https://github.com/zohaib58/gdsc-dsx2022
Google Developers Student Club - Data Science Bootcamp 2022
Last synced: 05 May 2025
https://github.com/garciparedes/python-examples
Set of awesome Python Examples
data-science examples exercises math numpy pandas python python-3 tensorflow
Last synced: 13 Apr 2025
https://github.com/simranjeet97/top-machine-learning-algorithms-python
This Repository contains the Machine Learning Algorithms with Mathematical Explanation behind them along with Implementation in Python.
data data-analysis data-science data-structures database machine machine-learning machine-learning-algorithms machine-learning-library machine-learning-playlist machinelearning machinelearning-python python python-programming python-script python3 youtube youtube-tutorial youtube-tutorial-series
Last synced: 11 Apr 2025
https://github.com/buccaneerai/rxjs-stats
Moved to @bottlenose/rxstats (https://github.com/buccaneerai/bottlenose)
analytics data data-mining data-science observables reactive rxjs statistics
Last synced: 15 Jul 2025
https://github.com/torkamanilab/zoish
Zoish is a Python package that streamlines machine learning by leveraging SHAP values for feature selection and interpretability, making model development more efficient and user-friendly
automl data-science feature-engineering feature-selection machine-learning python scikit-learn
Last synced: 10 Apr 2025
https://github.com/fabiosmuu/rna
Este repositรณrio tem como intuito, demonstrar um modulo de redes neurais que venho desenvolvendo.
algorithms data-science ia inteligencia-artificial redes-neurais-artificiais rna
Last synced: 10 Apr 2025
https://github.com/Himscipy/bnn_hvd
Distributed Training of Bayesian Neural Networks at Scale
bayesian-networks computer-vision data-science distributed-computing horovod machine-learning mnist tensorflow tensorflow-probability uncertainty-quantification variational-inference
Last synced: 15 Jul 2025
https://github.com/millengustavo/demo-datasus-streamlit
Demo Application with DataSUS death records and Streamlit
data-science datasus health healthcare streamlit
Last synced: 10 Apr 2025
https://github.com/ashwinpn/applied-data-science-with-python-specialization-university-of-michigan
Applied Data Science with Python Specialization: University of Michigan
coursera coursera-assignment coursera-data-science coursera-machine-learning coursera-python coursera-specialization data-science machine-learning university-of-michigan
Last synced: 13 Apr 2025
https://github.com/vatshayan/image-recognition-project
Beautiful Image recognition and Classification Project for final year college students.
btech-project college-project collegeprojects cse-project data-science final final-project final-year-project finalyearproject image image-classification image-processing image-recognition image-recognition-algorithms keras keras-neural keras-neural-networks mtech-project
Last synced: 28 Oct 2025
https://github.com/virajbhutada/capstones
This repository contains all the necessary files and documentation for a detailed analysis of bank loan data using a combination of SQL, Power BI, Excel, and Tableau. The project aims to uncover insights related to loan applications, funding, repayments, and borrower demographics, facilitating data-driven decision-making in the banking sector.
bank-loan-analysis dashboard data-science dax-query eda excel excel-dashboard excel-functions mssql-server powerbi powerbi-reports powerbi-visuals sql sql-database tableau tableau-public tableau-server
Last synced: 30 Oct 2025
https://github.com/yevh/anonymizer
Anonymize sensitive data in your datasets.
anonymize anonymized anonymizer crypto cryptography data-anonymization data-anonymized data-science data-security dataset datasets datasets-csv datasets-preparation python python3 security sensitive sensitive-data
Last synced: 07 Jul 2025
https://github.com/tslu1s/atlantic
Atlantic: Automated Data Preprocessing Framework for Supervised Machine Learning
automation automl automl-pipeline data-preprocessing data-science feature-selection label-encoder machine-learning onehot-encoder predictive-maintenance predictive-modeling preprocessing-pipeline python scikit-learn
Last synced: 10 Apr 2025
https://github.com/mikeroyal/apache-ignite-guide
Apache Ignite Guide
data-science database hadoop hadoop-cluster ignite nosql nosql-data-storage nosql-databases stream-processing streaming
Last synced: 06 May 2025
https://github.com/juliaai/mljflow.jl
Connecting MLJ and MLFlow
data-science julia machine-learning machine-learning-operations machine-learning-ops mlflow mlj mlops statistics
Last synced: 25 Oct 2025
https://github.com/LukasHedegaard/datasetops
Fluent dataset operations, compatible with your favorite libraries
data-cleaning data-munging data-processing data-science data-wrangling dataset dataset-combinations deep-learning multiple-datasets pytorch tensorflow
Last synced: 08 May 2025
https://github.com/sahahn/bpt
The Brain Predictability toolbox (BPt), is a python based Machine Learning library designed primarily for tabular and neuroimaging specific neuroimaging data but can easily be generalized further.
bp bpt brain-predictability-toolbox data-analysis data-science machine-learning ml neuroimaging-data neuroscience neuroscience-methods pandas python sklearn
Last synced: 13 Apr 2025
https://github.com/vianneymi/baker
Project demonstrating a TDS article about structuring unstructured data using LLMs
data-engineering data-mining data-science langchain llm mistralai pydantic
Last synced: 11 Jul 2025
https://github.com/cdcgov/cdh-lava-react
CDC Data Hub Lifecycle, Analysis & Visualization Accelerator (LAVA) REACT Components based on machine readable requirements.
agile-development azure data-analysis data-catalog data-governance data-quality data-science data-visualization databricks datavisualization devops excel-export metadata operations powerautomate powerbi pyspark security sql test-automation
Last synced: 22 Apr 2025
https://github.com/fwd/reddit
Graph Visualization UI for Reddit.
data data-science datasets worldnews
Last synced: 24 Apr 2025
https://github.com/a-poor/flask-celery-ml
Handling long-running processes (like ML model predictions) inside a Flask app using Celery.
api celery data-science flask machine-learning python
Last synced: 03 Aug 2025
https://github.com/joshuaulrich/stl-rug
Content presented at the Saint Louis R User Group
Last synced: 26 Aug 2025
https://github.com/cadcad-org/snippets
Repo containing notebooks showcasing features and applications of cadCAD.
cadcad data-science education python simulation snippets
Last synced: 23 Apr 2025
https://github.com/mohd-faizy/career-track-data-scientist-with-python
This Repo contains tools that allow us to import, clean, manipulate, and visualize data โIncludes Python libraries, like pandas, NumPy, Matplotlib, and many more to work with real-world datasets to learn the statistical and machine learning techniques.
data-science data-visualization datascience-machinelearning datasciencecoursera datascientist datascientisttraining decision-trees hypothesis hypothesis-testing machine-learning machine-learning-algorithms nlp-machine-learning numpy pandas python scikit-learn seaborn statistical
Last synced: 12 Jul 2025
https://github.com/JRaviLab/compbio-gists
Computational Biology & Bioinformatics Resources
bioinformatics comparative-genomics computational-biology data-science gists molecular-evolution phylogeny r shell transcriptomics
Last synced: 07 Oct 2025