Data Science
Data science is an inter-disciplinary field that uses scientific methods, processes, algorithms, and systems to extract knowledge from structured and unstructured data. Data scientists perform data analysis and preparation, and their findings inform high-level decisions in many organizations.
- GitHub: https://github.com/topics/data-science
- Wikipedia: https://en.wikipedia.org/wiki/Data_science
- Related Topics: data-analysis, data-mining, machine-learning, big-data, data-visualization
- Aliases: datasciences, data-science-project, data-science-algorithm
- Last updated: 2026-07-03 00:07:42 UTC
https://github.com/dakimura/learn-data-science-for-free-jp
Free materials for beginners to learn the whole of data science in a compact form.
artificial-intelligence computer-vision data-science datascienceproject deeplearning machine-learning machine-learning-algorithms natural-language-processing neural-networks
Last synced: 09 Apr 2025
https://github.com/correia-jpv/fucking-awesome-machine-learning
A curated list of awesome Machine Learning frameworks, libraries and software. With repository stars⭐ and forks🍴
awesome awesome-list data-science data-visualization library machine-learning machine-learning-algorithms machine-learning-library machine-learning-tutorials machinelearning machinelearning-python ml
Last synced: 27 Apr 2025
https://github.com/lintangwisesa/python_fundamental_datascience
Python 🐍 for Jr Data Scientist 📈📊📉
data-science machine-learning python
Last synced: 12 Sep 2025
https://github.com/tosunthex/py_ds_ml_bootcamp
Udemy Jose Portilla Python for Data Science and Machine Learning Bootcamp
data-science datascience jose-portilla machine-learning matplotlib-pyplot python scikit-learn seaborn tensorflow udemy-course-project udemy-machine-learning
Last synced: 03 Oct 2025
https://github.com/touppercase78/intel-processors
Datasets for All Processors Manufactured By Intel
cpu data-science datasets intel jupyter-notebook microprocessors processors python
Last synced: 01 Apr 2025
https://github.com/kennethleungty/text-to-audio-with-bark
Exploring Bark, the Open-Source Text-to-Audio Generative Model
ai artificial-intelligence bark data-science deep-learning gen-ai generative-ai machine-learning prompt-engineering speech text-prompt text-to-audio text-to-music text-to-sound text-to-speech
Last synced: 12 Jul 2025
https://github.com/tushar2704/streamlit-magic-cheat-sheets
Streamlit Magic Cheat Sheets - All of Streamlit in one Streamlit App! (Available in English, French & German.)
data-science machine-learning python snowflake streamlit streamlit-tushar2704 tushar2704 webapp
Last synced: 27 Apr 2026
https://github.com/martin-sicho/genui-gui
GenUI frontend application. It provides a GUI to the GenUI REST API web services.
cheminformatics data-science gui molecular-generation qsar react visualization webapp
Last synced: 19 Jan 2026
https://github.com/nicbet/infozilla
The infoZilla unstructured software engineering data mining tool. It can find and extract source code regions, patches, stack traces, enumerations and itemizations from discussion threads.
bugreport bugzilla data-mining data-science tools unstructured-data
Last synced: 13 Oct 2025
https://github.com/mostafa-wael/exploring-the-landscape-of-the-egyptian-software-market
A Data-driven Approach. Our story begins with a quest for knowledge. The information we obtained from LinkedIn offers us unprecedented insights into the landscape of the Egyptian Software Market.
carrers data-science hypothesis-testing linkedin market scraping software story-telling
Last synced: 13 Sep 2025
https://github.com/allen-li1231/zeppelin-vscode
Apache Zeppelin Extension for VS Code
apache-zeppelin data-science visual-studio-code vscode-extension zeppelin-notebook
Last synced: 09 Mar 2026
https://github.com/fernandojunior/financial-fraud-detection
A Python lib for fraud detection in a financial context
anomaly-detection credit-card data-science finance fraud-detection machine-learning outlier-detection python
Last synced: 02 Apr 2026
https://github.com/rubixml/dota2
Build a classifier to predict the outcome of Dota 2 games with the Naive Bayes algorithm and results from 102,944 sample games.
classifier cross-validation data-science dota dota2 machine-learning machine-learning-tutorial naive-bayes naive-bayes-algorithm naive-bayes-classifier php php-machine-learning php-ml prediction rubix-ml
Last synced: 30 Jul 2025
https://github.com/jessecambon/data-science-sandbox
Code and resources to serve as a starting point for data science projects.
data-science data-visualization geospatial-analysis machine-learning nlp python r statistical-modeling statistics
Last synced: 21 Jul 2025
https://github.com/exasol/data-science-examples
Collection of data science and machine learning examples with Exasol
data-science exasol-integration
Last synced: 07 May 2025
https://github.com/kennethleungty/Text-to-Audio-with-Bark
Exploring Bark, the Open-Source Text-to-Audio Generative Model
ai artificial-intelligence bark data-science deep-learning gen-ai generative-ai machine-learning prompt-engineering speech text-prompt text-to-audio text-to-music text-to-sound text-to-speech
Last synced: 17 Mar 2025
https://github.com/devinterview-io/numpy-interview-questions
🟣 NumPy interview questions and answers to help you prepare for your next machine learning and data science interview in 2025.
ai-interview-questions coding-interview-questions coding-interviews data-science data-science-interview data-science-interview-questions data-scientist-interview interview-practice interview-preparation machine-learning machine-learning-and-data-science machine-learning-interview machine-learning-interview-questions numpy numpy-interview-questions numpy-questions numpy-tech-interview software-engineer-interview technical-interview-questions
Last synced: 30 Oct 2025
https://github.com/silvanmelchior/cme_parser
A tiny parser for more flexible conda environment files
cme-parser conda conda-environment data-science meta-environment parser python
Last synced: 13 Jul 2025
https://github.com/vinibrsl/internet-affordability
🌍 Did you know that internet costs >20% of the average income in some countries?
data-science dataset human-rights insights jupyter-notebook scraping
Last synced: 12 Sep 2025
https://github.com/larribas/dagger
Define sophisticated data pipelines with Python and run them on different distributed systems (such as Argo Workflows).
argo-workflows data-engineering data-pipelines data-science distributed-systems pipelines-as-code workflows
Last synced: 28 Jul 2025
https://github.com/som-research/hfcommunity
HFCommunity offers an offline up-to-date relational database built from the data available at the Hugging Face Hub, providing queriable data about the repositories hosted in the Hub
data-science database dataset huggingface
Last synced: 05 Apr 2026
https://github.com/flipkart/foxtrot
A store abstraction and analytics system for real-time event data.
alerting analytics data-engineering data-science data-visualization elasticsearch hbase java monitoring
Last synced: 12 Dec 2025
https://github.com/wyfunique/DBSim
The codebase for DBSim
data-science database in-database in-database-analytics query-optimizer sql-parser sql-query
Last synced: 27 Apr 2025
https://github.com/yashksaini-coder/floraloracle--iris-inference-hub
The objective is to combine the prediction and classification scenarios of machine learning algorithms on the morphological flower dataset.
classification data-science jupyter-notebook machine-learning machine-learning-algorithms machinelearning prediction-model python3 scikit-learn scikitlearn-machine-learning
Last synced: 11 Apr 2025
https://github.com/rugk/crops-parser
🌱🍎🍆 A shell script to parse the data by the Food and Agriculture Organization of the United Nations on crops/fruits.
agriculture agriculture-research crop crops data-analysis data-science food fruit fruits statistics streetcomplete tree vegetables
Last synced: 11 Mar 2026
https://github.com/milos-agathon/map-rivers-with-sf-and-ggplot2-in-r
Let's make a pretty map of European rivers using the Global River Classification dataset 🧑🏼‍💻 Check the full tutorial at https://milospopovic.net/map-rivers-with-sf-and-ggplot2-in-r/
data-science data-visualization gis r rivers
Last synced: 04 Apr 2026
https://github.com/cscherrer/sossmlj.jl
SossMLJ makes it easy to build MLJ machines from user-defined models from the Soss probabilistic programming language
bayesian-inference data-science julialang mlj probabilistic-programming soss
Last synced: 10 Apr 2025
https://github.com/giswqs/streamlit-mapbox
A Streamlit Component for rendering Mapbox GL JS
data-science geospatial mapping streamlit streamlit-component streamlit-webapp
Last synced: 19 Jul 2025
https://github.com/harrystaley/open-source-data-science-degree-python
A fully curated, open-source Data Science curriculum focused on Python. Includes top-tier university courses (MIT, Stanford, Princeton) covering essential topics in computer science, data analysis, machine learning, and statistics — everything you need to build a solid foundation in Data Science, 100% free.
data data-science dataanalysis datasci ds open open-source py python python3 science source statistics
Last synced: 13 Apr 2025
https://github.com/alexioannides/lime-interpretable-ml
An example of how the LIME algorithm can be used to provide real-world insight into the decision processes of a 'black-box' machine learning algorithm - in this case a Random Forest regressor.
data-science interpretability lime machine-learning numpy pandas pydata python scikit-learn
Last synced: 29 Oct 2025
https://github.com/codait/flight-delay-notebooks
Analyzing flight delay and weather data using Elyra, IBM Data Asset Exchange, Kubeflow Pipelines and KFServing
codait data-science elyra jupyter jupyter-notebook jupyterlab kfserving kubeflow-pipelines machine-learning
Last synced: 06 May 2025
https://github.com/chendaniely/ds4biomed
Data Science for the Biomedical Sciences
biomedical-sciences data-science
Last synced: 11 Apr 2025
https://github.com/facultyai/ipydataclean
Interactive cleaning for Pandas DataFrames
data-cleaning data-science dataframe jupyter-notebook pandas
Last synced: 26 Aug 2025
https://github.com/lettier/interactivekmeans
Interactive HTML canvas based implementation of k-means.
ai cluster cluster-analysis clustering clustering-algorithm clustering-evaluation clustering-methods data-science interactive-kmeans kmeans kmeans-algorithm kmeans-clustering machine-learning machine-learning-algorithms scikit-learn
Last synced: 26 Mar 2025
https://github.com/privacy-tech-lab/cross-device-tracking
Data and software for cross-device tracking data collection
cross-device-tracking data-science internet-tracking privacy privacy-tech
Last synced: 30 Apr 2025
https://github.com/ekote/build-your-first-end-to-end-lakehouse-solution
Build Your First End-to-End Lakehouse Solution (aka.ms/fabconlake)
apache-spark data-engineering data-factory data-pipeline data-science dataflows delta-lake lakehouse machine-learning microsoft-azure microsoft-fabric parquet powerbi tutorial warehouse workshop
Last synced: 29 Oct 2025
https://github.com/morningman/mcp-doris
An MCP server for Apache Doris & VeloDB
Last synced: 14 Dec 2025
https://github.com/Absolventa/iruby-chartkick
Minimalistic wrapper around chartkick for using it within iruby
chartkick data-science iruby rubydatascience visualization
Last synced: 07 May 2025
https://github.com/mohammadreza-mohammadi94/data-analysis-and-machine-learning-projects
A comprehensive collection of data analysis and machine learning projects, showcasing techniques and models for various data challenges. Dive in to explore code examples, analyses, and machine learning workflows.
data-analysis data-science dataframes deep-learning exploratory-data-analysis hyperparameter-tuning machine-learning machine-learning-algorithms pandas python scikit-learn visualization
Last synced: 06 Oct 2025
https://github.com/yashuv/python-for-data-science-ai-and-development
Python for Data Science, AI & Development - offered by IBM on Coursera
coursera-course data-analysis data-science ibm numpy pandas python
Last synced: 12 Apr 2025
https://github.com/millengustavo/time-series-forecasting
Notes on time series forecasting
data-science forecast statistical time-series time-series-analysis
Last synced: 04 Sep 2025
https://github.com/feedzai/feedzai-openml
API for Feedzai's Open Machine Learning that allows integrating ML algorithms into Feedzai's platform.
api data-science feedzai machine-learning openml
Last synced: 30 Apr 2025
https://github.com/rsalmei/tsp-essay
A fun study of some heuristics for the Travelling Salesman Problem.
algorithms clustering data-science data-visualization greedy-algorithm greedy-nn-algorithm heuristic kmeans-algorithm kmeans-clustering logistics matplotlib numpy pandas traveling-salesman traveling-salesman-problem travelling-salesman travelling-salesman-problem tsp tsp-problem two-opt
Last synced: 11 Apr 2025
https://github.com/pmbrull/azure-ds-examdp100-notes
Personal notes for the Azure Data Science exam DP-100
azure cloud data-science exam machine-learning notes python
Last synced: 12 Apr 2025
https://github.com/yashksaini-coder/bharat-intern
Exploring data's depths with Bharat Intern! 🌐🚀 Unveiling insights from the dataset, my project is a fusion of creativity and analytics. Stay tuned for more! 📊
classification data data-science data-visualization internship-project
Last synced: 19 Jul 2025
https://github.com/mratsim/humpback-whale-identification
Kaggle Humpback whale identification: 2xGPU Data augmentation + FP16 mixed precision training
computer-vision data-science identification kaggle pytorch
Last synced: 07 May 2025
https://github.com/greenelab/gbm_immune_validation
Validating glioblastoma immune cell immunohistochemistry using computational deconvolution of TCGA tumors
analysis cancer data-science gene-expression glioblastoma machine-learning survival-analysis tool
Last synced: 07 Jul 2025
https://github.com/sravb/nba-predictive-analytics
Being able to perform gameplay analysis of NBA players, NBA Predictive Analytics is a basketball coach's new best friend.
basketball data-mining data-science data-visualization decision-tree k-nearest-neighbors kaggle-dataset machine-learning matplotlib nba-analytics pandas predictive-analytics python scikit-learn scipy
Last synced: 07 May 2025
https://github.com/firefly-cpp/uarmsolver
universal Association Rule Mining Solver
association-rule-mining data-mining data-science evolutionary-algorithms swarm-intelligence
Last synced: 13 Apr 2025
https://github.com/khuyentran1401/prefect-alert
A decorator that sends an alert when a Prefect flow fails
data data-engineering data-science prefect python
Last synced: 13 Apr 2025
https://github.com/vunb/node-crfsuite
A nodejs binding for crfsuite
crf crfsuite data-science node-crfsuite vntk
Last synced: 17 Jul 2025
https://github.com/glentner/dataphile
Data analytics library for Python and suite of open source, command line based data ops tools.
data-analysis data-ops data-science python scientific-computing
Last synced: 07 May 2025
https://github.com/omarsar/data_mining_2017_fall_lab
Contains information and instructions for the first Data Mining lab session for 2017 Fall.
data data-analysis data-mining data-science data-visualization
Last synced: 08 Sep 2025
https://github.com/gramian/hapod
HAPOD - Hierarchical Approximate Proper Orthogonal Decomposition
data-driven data-reduction data-science datascience dimension-reduction distributed-memory high-performance-computing hpc limited-memory mapreduce mapreduce-algorithm model-order-reduction model-reduction pca pod proper-orthogonal-decomposition svd unsupervised-learning
Last synced: 06 May 2025
https://github.com/open-risk/correlationmatrix
correlationMatrix is a Python powered library for the statistical analysis and visualization of correlations
correlation-analysis correlation-matrices data-analysis data-science statistics
Last synced: 04 Jul 2025
https://github.com/nceas/nceas-training
Training materials and modules from R-based data science short courses at NCEAS
Last synced: 30 Apr 2025
https://github.com/chalmerlowe/jupyter_tutorial
An introduction to Jupyter and JupyterLab for data analysis, data science, and Python development
data-analysis data-science jupyter jupyter-notebook jupyterlab notebook python tutorial
Last synced: 10 Apr 2025
https://github.com/compilerla/data-donuts
Public sector breakfast lecture series meant to inspire.
data-donuts data-science events government los-angeles
Last synced: 12 Feb 2026
https://github.com/darenasc/data-science-for-good
Data Science for Good links.
Last synced: 12 Jan 2026
https://github.com/mdh266/randomforests
Random Forest Library In Python Compatible with Scikit-Learn
classification data-science decision-tree ensemble-learning machine-learning machine-learning-algorithms pandas python random-forest regression scikit-learn
Last synced: 10 Mar 2026
https://github.com/cyrilou242/jnotebook
Notebook for Java.
data-science java jnotebook jupyter notebook
Last synced: 10 Mar 2026
https://github.com/alex-snd/malwareclassifier
👾 Malware Classification using Deep Learning and Cuckoo Sandbox
cuckoo-sandbox cvae data-science deep-learning malware malware-classification malware-detection python pytorch vae
Last synced: 25 Apr 2025
https://github.com/shivangraikar/datasciencevalue
Web application created using Streamlit to host an intelligent salary predictor. The project returns the position of the user in this particular field of Data Science.
data-science heroku-deployment logistic-regression machine-learning streamlit-webapp
Last synced: 15 Apr 2025
https://github.com/lydialucchesi/smallsets
Visual documentation for data preprocessing in R and Python
data-science data-visualization documentation-tool machine-learning preprocessing python r r-package visualization-tools
Last synced: 24 Jun 2025
https://github.com/gagolews/lmlcr
Lightweight Machine Learning Classics with R (Book Draft)
classification clustering data-science machine-learning machine-learning-algorithms mathematical-modelling optimisation-algorithms r regression
Last synced: 14 Jul 2025
https://github.com/philipyip1988/python-notebooks
Data-science tutorials covering Python, object-oriented programming, and Python standard libraries such as collections, itertools, math, statistics, random and datetime. The tutorials also cover data-science libraries such as numpy, pandas, matplotlib and seaborn, as well as the conda ecosystem.
anaconda anaconda-environment conda data-science jupyterlab math matplotlib numpy pandas python python3 seaborn statistics
Last synced: 31 Jul 2025
https://github.com/pitmonticone/plantdiseaseclassification
Dataset Analysis & CNN Models Optimization for Plant Disease Classification.
classification-algorithims computer-science computer-vision convolutional-neural-networks data-science deep-learning deep-neural-networks neural-network neural-networks plant-disease-classification plant-disease-detection plant-pathology-fgvc7
Last synced: 02 Jan 2026
https://github.com/boniolp/dsymb-playground
[ICDE 2024] Python and Streamlit implementation of "d_{symb} playground: an interactive tool to explore large multivariate time series datasets"
clustering data-science data-visualization streamlit symbolization time-series time-series-analysis webapp
Last synced: 30 Apr 2025
https://github.com/alexioannides/pymc-advi-hmc-demo
Demonstrating HMC and ADVI algorithms for Bayesian data analysis using PyMC3.
bayesian-data-analysis bayesian-inference data-science example-project jupyter-notebook machine-learning markov-chain-monte-carlo probabilistic-programming pymc3 python variational-inference
Last synced: 28 Jul 2025
https://github.com/clojure-finance/datajure
Clojure data manipulation DSL — composable query syntax built on tech.ml.dataset
clojure data-manipulation data-science dataframe dsl empirical-research query-dsl tech-ml-dataset
Last synced: 20 Apr 2026
https://github.com/wlandau/targets-keras
An example Keras pipeline with the targets R package
data-science keras pipeline r reproducibility reproducible-research rstats statistics targets workflow
Last synced: 11 Mar 2026
https://github.com/latentcat/network-vis
WIP. Visualization of social networks.
complex-networks data data-science graph graph-visualization social-media social-network-analysis visualization wechat
Last synced: 16 Mar 2026
https://github.com/vanessaaleung/ds-case-studies
Data Science Case Studies
ab-testing case-study data-science ds-case-studies financial-analysis gameofthrones marketing-analytics modeling network-analysis prediction pricing-analytics python r sharpe-ratio text-mining
Last synced: 26 Jun 2025
https://github.com/reddyprasade/python-basic-for-all-3.x
We are going to learn Python, a powerful multi-purpose programming language created by Guido van Rossum. It has a simple, easy-to-use syntax, making it the perfect language for someone trying to learn computer programming for the first time. This is a comprehensive guide on how to get started in Python, why you should learn it, and how you can learn it. However, if you have knowledge of other programming languages and want to quickly get started with Python.
comprehensive-guide data-science knowledge perfect-language programming-languages python python-3 python-3-6 python3
Last synced: 25 Feb 2026
https://github.com/dongjunlee/beawesometoday
Be Awesome Today - My Awesome List & Today I Learned & Blogging Articles
awesome-list blog chatbot data-science deep-learning machine-learning python til today-i-learned
Last synced: 17 Mar 2026
https://github.com/ditikrushna/predict-sales-revenue-using-multiple-regression-model
In this project you will build and evaluate multiple linear regression models using Python. You will use scikit-learn to calculate the regression, while using pandas for data management and seaborn for data visualization. The data for this project consists of the very popular Advertising dataset to predict sales revenue based on advertising spending through media such as TV, radio, and newspaper.
data-science multiple-regression multiple-regression-analysis regression-models seaborn
Last synced: 01 Mar 2025
https://github.com/srowen/cdsw-simple-serving
Modeling Lifecycle with ACME Occupancy Detection and Cloudera
cloudera cloudera-data-science data-science openscoring pmml workbench
Last synced: 10 Oct 2025
https://github.com/tushar2704/taipy-cheat-sheets
Taipy Cheat Sheets - All of Taipy with Taipy Application
application artificial-intelligence data-science gui taipy taipy-core taipy-gui tushar-taipy tushar2704 ui ux
Last synced: 29 Apr 2026
https://github.com/dalageo/ml-gassensordrift
Drift Detection in Gas Sensor Array at Different Concentration Levels ☢️
data-science decision-tree-regression gas-sensor-calibration gas-sensor-datasets machine-learning optuna predictive-maintenance python random-forest-regression scikit-learn xgboost-regression
Last synced: 25 Oct 2025
https://github.com/mehmetkahya0/web-resource-downloader
This is a Python script that downloads all resources (images, scripts, stylesheets, etc.) from a given website.
algorithms beautifulsoup4 bs4 bs4-requests data-analysis data-science datascience python python3 requests scraper scraping
Last synced: 12 Oct 2025
https://github.com/axil/pandas-illustrated
A collection of Pandas helper functions.
data-analysis data-science pandas pandas-dataframe pandas-library python
Last synced: 01 Aug 2025
https://github.com/predicthq/phq-data-science-docs
PredictHQ’s Data Science documentation
Last synced: 11 Jun 2025
https://github.com/laderast/cvdriskdata
R package for Cardiovascular Risk Dataset and Data generation script
cardiovascular data-science synthetic-data synthetic-dataset-generation
Last synced: 25 Sep 2025
https://github.com/rubixml/divorce
Use the K Nearest Neighbors algorithm to predict the probability of a divorce with high accuracy.
classification cross-validation data-science divorce divorce-prediction example-project k-nearest-neighbors knn machine-learning machine-learning-tutorial nearest-neighbors php php-machine-learning php-ml prediction rubix-ml
Last synced: 30 Jul 2025
https://github.com/tamasgal/thepipe
A simplistic, general purpose pipeline framework.
data-processing data-processing-pipelines data-science hacktoberfest pipelines provenance python
Last synced: 21 Mar 2025
https://github.com/jxareas/linear-regression
Data Wrangling, Linear Models & other misc. Inferential Statistics.
collaborate data-analysis data-science dplyr ggplot2 github jetbrains learn linear-regression poisson-distribution polynomial-regression r regression regression-analysis regression-models
Last synced: 10 Aug 2025
https://github.com/uriamorp/mprod_package
Software implementation for tensor-tensor m-product
data-science linear-algebra multiway numerical-analysis pca python scientific-computing spatio-temporal tca tcam tensor tensor-algebra tensor-decomposition tensor-factorization tensor-product time-series-analysis
Last synced: 21 Oct 2025
https://github.com/amey-thakur/python-crash-course
IIT ROPAR - Diginique Techlabs --> Data Science Machine Learning and AI using Python
ai amey ameythakur data-science data-science-projects house-price-prediction machine-learning python python-crash-course
Last synced: 07 Oct 2025
https://github.com/laderast/burro
Exploring data together using shiny (burro(w) into the data)
data-science data-visualization eda exploratory-data-analysis shiny
Last synced: 25 Sep 2025
https://github.com/udst/bayarea_urbansim
UrbanSim implementation for the San Francisco Bay Area
bay-area data-science modeling simulation urbansim
Last synced: 07 May 2025
https://github.com/devinterview-io/scikit-learn-interview-questions
🟣 Scikit-Learn interview questions and answers to help you prepare for your next machine learning and data science interview in 2024.
ai-interview-questions coding-interview-questions coding-interviews data-science data-science-interview data-science-interview-questions data-scientist-interview interview-practice interview-preparation machine-learning machine-learning-and-data-science machine-learning-interview machine-learning-interview-questions scikit-learn scikit-learn-interview-questions scikit-learn-questions scikit-learn-tech-interview software-engineer-interview technical-interview-questions
Last synced: 13 Apr 2025
https://github.com/adamrossnelson/stataipedsall
Scripts to download and build panel data files for IPEDS.
bigdata data-science ipeds test-scores
Last synced: 13 Feb 2026
https://github.com/eshikashah/ibm-data-science-ml-with-python-project
Capstone project for IBM data science course - ML with python.
algorithms data-analysis data-science ibm machine-learning python
Last synced: 07 May 2025
https://github.com/adilzouitine/pyfeel
Python package for emotion analysis in French
data-analysis data-mining data-science emotion emotion-analysis nlp nlp-library opinion-mining python
Last synced: 06 Nov 2025
https://github.com/open-metadata/openmetadata-site
Open Standard for Metadata. A Single place to Discover, Collaborate and Get your data right.
automation bigdata bigdataanalytics data-catalog data-discovery data-observability data-profiling data-quality-monitoring data-science datadiscovery dataengineering dataquality datascience dbt governance hacktoberfest hacktoberfest2022 metadata metadata-api metadata-management
Last synced: 14 Apr 2025
https://github.com/rotationalio/pyensign
Ensign driver, SDK, and helpers for Python
data-science event-driven event-driven-architecture eventing hacktoberfest microservices
Last synced: 19 Jun 2025
https://github.com/vidhi1290/deep-learning-for-eeg-emotion-classification
This repository contains a Python code script for performing emotion classification using EEG (Electroencephalogram) data. Emotion classification from EEG signals is an important application in neuroscience and human-computer interaction. The code leverages deep learning techniques to analyze EEG data and predict emotional states.
coorelation data-exploration data-preprocessing data-science data-visualization deep-learning deep-learning-algorithms eeg-emotion-recognition egg-signals emotion-distribution emotion-prediction feature-analysis heatmap human-emotions machine-learning machine-learning-algorithms pie-chart spectral-analysis time-series-visualization
Last synced: 10 Apr 2025
https://github.com/aaaastark/top-big-data-scientist-questions-for-interview
Top Big Tech Data Science Questions
ai alibaba amazon apple computer-science computer-vision data-engineer data-science deep-learning facebook google ibm intel interview-questions machine-learning netflix nvidia orcale spacex tesla
Last synced: 04 Feb 2026