Data Science
Data science is an inter-disciplinary field that uses scientific methods, processes, algorithms, and systems to extract knowledge from structured and unstructured data. Data scientists perform data analysis and preparation, and their findings inform high-level decisions in many organizations.
- GitHub: https://github.com/topics/data-science
- Wikipedia: https://en.wikipedia.org/wiki/Data_science
- Related Topics: data-analysis, data-mining, machine-learning, big-data, data-visualization,
- Aliases: datasciences, data-science-project, data-science-algorithm,
- Last updated: 2026-07-01 00:07:28 UTC
- JSON Representation
https://github.com/mdeff/ntds_2016
Material for the EPFL master course "A Network Tour of Data Science", edition 2016.
data-science education epfl graphs machine-learning neural-networks
Last synced: 12 Jul 2025
https://github.com/metriculous-ml/metriculous
Measure and visualize machine learning model performance without the usual boilerplate.
classification confusion-matrix data-science deep-learning machine-learning model-comparsion model-evaluation model-selection precision-recall-curve python regression residual-plot roc-curve statistics visual-analysis
Last synced: 17 Oct 2025
https://github.com/wlandau/targets-tutorial
Short course on the targets R package
data-science make pipeline r r-package reproducibility reproducible-research rstats targets workflow
Last synced: 16 Mar 2025
https://github.com/mratsim/Arch-Data-Science
Archlinux PKGBUILDs for Data Science, Machine Learning, Deep Learning, NLP and Computer Vision
archlinux cuda cudnn data-science deep-learning lightgbm machine-learning mkl mxnet natural-language-processing natural-language-understanding nervana opencv package pandas pytorch scikit-learn spacy tensorflow xgboost
Last synced: 20 Jul 2025
https://github.com/trademaster-ntu/fintech-literature
Fintech literature, including journal, conference, book and useful links
artificial-intelligence data-science machine-learning natural-language-processing quantitative-finance reinforcement-learning
Last synced: 08 Oct 2025
https://github.com/responsiblyai/responsibly
Toolkit for Auditing and Mitigating Bias and Fairness of Machine Learning Systems ๐๐ค๐งฐ
artificial-intelligence audit bias bias-correction bias-finder bias-reduction data-science ethics fairness fairness-ai fairness-awareness-model fairness-ml fairness-testing machine-bias machine-learning natural-language-processing python
Last synced: 30 Oct 2025
https://github.com/talkpython/excel-to-python-course
Student materials and handouts for Excel to Python course
course data-science excel office pandas python video
Last synced: 21 Sep 2025
https://github.com/eclipse-zenoh-flow/zenoh-flow
zenoh-flow aims at providing a zenoh-based data-flow programming framework for computations that span from the cloud to the device.
autonomous-vehicles data-science dataflow-programming machine-learning robotics ros2 rust-lang
Last synced: 24 Dec 2025
https://github.com/PetoLau/TSrepr
TSrepr: R package for time series representations
data-analysis data-mining data-mining-algorithms data-science r r-package representation time-series time-series-analysis time-series-classification time-series-clustering time-series-data-mining time-series-representations
Last synced: 04 Apr 2025
https://github.com/morph-data/morph
Python + Markdown framework for building internal apps.
data-analysis data-science data-visualization deep-learning developer-tools generative-ai machine-learning mdx morph python react sql
Last synced: 04 Apr 2025
https://github.com/ResponsiblyAI/responsibly
Toolkit for Auditing and Mitigating Bias and Fairness of Machine Learning Systems ๐๐ค๐งฐ
artificial-intelligence audit bias bias-correction bias-finder bias-reduction data-science ethics fairness fairness-ai fairness-awareness-model fairness-ml fairness-testing machine-bias machine-learning natural-language-processing python
Last synced: 19 Apr 2025
https://github.com/juanitorduz/btsa
Berlin Time Series Analysis Repository
data-science meetup python r statistics time-series-analysis
Last synced: 23 Jun 2025
https://github.com/darribas/gds_course
Geographic Data Science, the course
course data-science educational gds-course geographic-data-science gis
Last synced: 17 Jun 2025
https://github.com/elemento24/journey-with-artificial-intelligence
This repo consists of all the resources that can be referred during one's Journey with Artificial Intelligence.
artificial-intelligence data-science deep-learning machine-learning python
Last synced: 08 Sep 2025
https://github.com/AnonCatalyst/Coeus-OSINT-ToolBox
Coeus ๐ is an OSINT ToolBox empowering users with tools for effective intelligence gathering from open sources. From social media monitoring ๐ฑ to data analysis ๐, it offers a centralized platform for seamless OSINT investigations.
data-science data-visualization database forensic-analysis forensics forensics-tools framework information-retrieval infosec osint osint-framework osint-python osint-resources osint-tool osint-toolkit people-search reconnaissance
Last synced: 06 May 2025
https://github.com/IlyaGusev/tgcontest
Telegram Data Clustering contest solution by Mindful Squirrel
classification clustering cpp data-science document-similarity fasttext machine-learning nlp
Last synced: 03 Apr 2025
https://github.com/jkoutsikakis/pytorch-wrapper
Provides a systematic and extensible way to build, train, evaluate, and tune deep learning models using PyTorch.
data-science deep-learning machine-learning neural-network python pytorch pytorch-wrapper tensor
Last synced: 04 Feb 2026
https://github.com/mld3/fiddle
FlexIble Data-Driven pipeLinE โ a preprocessing pipeline that transforms structured EHR data into feature vectors to be used with ML algorithms. https://doi.org/10.1093/jamia/ocaa139
data-science electronic-health-records jamia machine-learning preprocessing
Last synced: 14 Jan 2026
https://github.com/hazemabdelkawy/QuranGPT
Quran GPT is a project that leverages the power of the GPT-4 language model to generate meaningful embeddings for Quran verses. This project not only generates embeddings for the verses but also visualizes the distribution of these embeddings using t-SNE in a 3D scatter plot.
artificial-intelligence data-science data-visualization machine-learning nlp quran sunnah
Last synced: 01 Feb 2026
https://github.com/igerber/diff-diff
A Python library for Difference-in-Differences (DiD) causal inference analysis with an sklearn-like API and statsmodels-style outputs.
analytics causal-inference data-science difference-in-differences econometrics economics
Last synced: 20 Apr 2026
https://github.com/parths007/loan-approval-prediction
Loan Application Data Analysis
accuracy-analysis classification data-analysis data-mining data-science data-visualization juypter logistic-regression machine-learning notebook-jupyter python python3
Last synced: 03 Jul 2025
https://github.com/GlobalMaksimum/sadedegel
A General Purpose NLP library for Turkish
acikhack2 ai artificial-intelligence bert binder corpus data-science deep-learning embeddings heroku machine-learning natural-language-processing neural-network neural-networks news-summarizer nlp python
Last synced: 03 May 2025
https://github.com/giswqs/manjaro-linux
Shell scripts for setting up Manjaro Linux for Python programming and deep learning
data-science deep-learning gis kde manjaro manjaro-linux notebook-jupyter python r remote-sensing shell-scripts tensorflow
Last synced: 12 May 2025
https://github.com/tiledb-inc/tiledb-vcf
Efficient variant-call data storage and retrieval library using the TileDB storage library.
bioinformatics data-science genomics gwas python spark tiledb variant-calling vcf
Last synced: 05 Apr 2025
https://github.com/mratsim/arch-data-science
Archlinux PKGBUILDs for Data Science, Machine Learning, Deep Learning, NLP and Computer Vision
archlinux cuda cudnn data-science deep-learning lightgbm machine-learning mkl mxnet natural-language-processing natural-language-understanding nervana opencv package pandas pytorch scikit-learn spacy tensorflow xgboost
Last synced: 18 Apr 2025
https://github.com/dagshub/client
DagsHub client libraries
ai data data-science data-streaming dvc hacktoberfest hacktoberfest2023 keras machine-learning machinelearning mlops python pytorch tensorflow
Last synced: 16 May 2025
https://github.com/soda-inria/hazardous
Competing Risks and Survival Analysis
competing-risks data-science gradient-boosting machine-learning survival-analysis
Last synced: 06 Apr 2025
https://github.com/dkirkby/machinelearningstatistics
Machine learning and statistics for physicists
data-science machine-learning physics python statistics
Last synced: 06 Mar 2025
https://github.com/lyltj2010/DataMining
ๆฐๆฎๆๆๅผๆบไนฆ
data-science datamining deeplearning machine-learning
Last synced: 23 Aug 2025
https://github.com/cedrickchee/data-science-notebooks
Data science Python notebooksโa collection of Jupyter notebooks on machine learning, deep learning, statistical inference, data analysis and visualization.
data-science deep-learning fastai kaggle keras machine-learning notebooks numpy pandas python pytorch tensorflow
Last synced: 07 May 2025
https://github.com/geekplux/timeline-sankey
A project to visualize time range series data using the Sankey diagram.
data-analysis data-science data-visualization sankey sankey-chart sankey-diagram time-series time-series-analysis timeline visualization
Last synced: 25 Jul 2025
https://github.com/synthesized-io/fairlens
Identify bias and measure fairness of your data
bias data data-analysis data-science fairness ml pandas python statistics
Last synced: 24 Jun 2025
https://github.com/sylvaticus/betaml.jl
Beta Machine Learning Toolkit
ai artificial-intelligence autoencoder classification clustering data-science decision-trees deep-learning feature-importance imputation julia machine-learning ml neural-networks pca random-forest regression
Last synced: 17 Mar 2025
https://github.com/janishar/data-analytics-project-template
A python project starter template for data-analytics and data-science.
ai anaconda conda data-analysis data-analytics data-science jupyter-notebook keras matplotlib notebook numpy pandas project-starter-kit python python3 tensorflow
Last synced: 17 Jun 2025
https://github.com/microsoft/coml
Interactive coding assistant for data scientists and machine learning developers, empowered by large language models.
automated-machine automl copilot data-science hyperparameter-optimization jupyter jupyter-lab large-language-models llm machine-learning
Last synced: 04 Apr 2025
https://github.com/markvanderloo/simputation
Making imputation easy
data-science imputation officialstatistics r rstats
Last synced: 22 Oct 2025
https://github.com/devinterview-io/pytorch-interview-questions
๐ฃ PyTorch interview questions and answers to help you prepare for your next machine learning and data science interview in 2024.
ai-interview-questions coding-interview-questions coding-interviews data-science data-science-interview data-science-interview-questions data-scientist-interview interview-practice interview-preparation machine-learning machine-learning-and-data-science machine-learning-interview machine-learning-interview-questions pytorch pytorch-interview-questions pytorch-questions pytorch-tech-interview software-engineer-interview technical-interview-questions
Last synced: 13 Apr 2025
https://github.com/mkearney/tweetbotornot2
๐๐ฆ๐ค Detect Twitter Bots!
bot-detection bot-detector classification data-science machine-learning r r-package rstats rtweet twitter twitter-api twitter-bot-detection twitter-bots xgboost
Last synced: 12 Apr 2025
https://github.com/empower-ai/sql-agent
Ai Agent that helps you do data analytics with natural language.
analytics bigquery chatgpt chatgpt-bot data data-analytics data-science mysql postgresql slack slack-bot slackbot
Last synced: 11 Apr 2025
https://github.com/asavinov/prosto
Prosto is a data processing toolkit radically changing how data is processed by heavily relying on functions and operations with functions - an alternative to map-reduce and join-groupby
business-intelligence data-preparation data-preprocessing data-processing data-science data-wrangling feature-engineering map-reduce olap pandas python spark workflow
Last synced: 11 Apr 2025
https://github.com/polakowo/datadocs
Documentation for data enthusiasts
best-practices big-data cloud collaboration data-engineering data-science deep-learning documentation ebook knowledge machine-learning moocs wiki
Last synced: 07 May 2025
https://github.com/mnr/r-for-data-science-lunchbreak-lessons
Source files for the LinkedIn Learning Course
data-science linkedin-learning mark-niemann-ross r rlang rstats tutorials
Last synced: 07 Feb 2026
https://github.com/questdb/time-series-streaming-analytics-template
Template to quickstart streaming analytics using Apache Kafka for ingestion, QuestDB for time-series storage and analytics, Grafana for near real-time dashboards, and Jupyter Notebook for data science
data-science grafana jupyter-notebook kafka kafka-connect monitoring pandas polars questdb telegraf timeseries timeseries-analysis timeseries-database timeseries-forecasting
Last synced: 27 Jun 2025
https://github.com/lsys/lexicalrichness
:smile_cat: :speech_balloon: A module to compute textual lexical richness (aka lexical diversity).
data-mining data-science information-retrieval lexical-analysis lexical-analyzer linguistic-analysis natural-language natural-language-processing nlp python
Last synced: 09 Apr 2025
https://github.com/nuclio/nuclio-jupyter
Nuclio Function Automation for Python and Jupyter
data-science jupyter kubernetes nuclio python
Last synced: 06 Jan 2026
https://github.com/caioricciuti/duck-ui
Duck-UI is a web-based interface for interacting with DuckDB, a high-performance analytical database system. It features a SQL editor, data import/export, data explorer, query history, theme toggle, and keyboard shortcuts, all running seamlessly in the browser using DuckDB's WebAssembly (WASM) capabilities.
data-science data-visualization dataanalysis datanalytics duckdb local
Last synced: 04 Apr 2025
https://github.com/faizanzaheergit/studentperformanceprediction-ml
This is a simple machine learning project using classifiers for predicting factors which affect student grades, using data from CSV file
ai-ml artificial-intelligence artificial-intelligence-projects csv-files data-science machine-learning machine-learning-projects ml-models python python3
Last synced: 10 Sep 2025
https://github.com/bioconductor/genomicdatacommons
Provide R access to the NCI Genomic Data Commons portal.
api-client bioconductor bioinformatics cancer core-services data-science genomics nci r tcga vignette
Last synced: 16 May 2025
https://github.com/slai-labs/get-beam
Run GPU inference and training jobs on serverless infrastructure that scales with you.
artificial-intelligence cloud-computing cost-optimization data-science deep-learning distributed-computing gpu-acceleration gpu-computing hpc llm-serving llm-training machine-learning ml-infrastructure mlops python serverless serverless-architectures
Last synced: 18 Apr 2025
https://github.com/josephrp/datatonic
๐DataTonic : A Data-Capable AGI-style Agent Builder of Agents , that creates swarms , runs commands and securely processes and creates datasets, databases, visualisations, and analyses.
agent-builder agi autogen azure chroma data data-science data-visualization database memgpt semantic-kernel semantic-memory taskweaver
Last synced: 11 Oct 2025
https://github.com/layerai-archive/sdk
Metadata store for Production ML
collaboration data-science data-versioning deep-learning experiment-tracking hyperparameter-optimization hyperparameter-tuning keras machine-learning mlops model-versioning python pytorch reinforcement-learning sklearn tensorflow
Last synced: 30 Sep 2025
https://github.com/aws-samples/cloud-experiments
Open innovation with 60 minute cloud experiments on AWS
amazon-athena amazon-comprehend amazon-rekognition amazon-s3 amazon-sagemaker aws-cloud aws-glue data-science machine-learning notebooks
Last synced: 20 Jul 2025
https://github.com/khuyentran1401/machine-learning-pipeline
Example machine learning pipeline with MLflow and Hydra
data-science hydra machine-learning machine-learning-pipeline mlflow
Last synced: 13 Apr 2025
https://github.com/galliaproject/gallia-core
A schema-aware Scala library for data transformation
data-engineering data-manipulation data-science data-transformation etl feature-engineering json nesting scala spark
Last synced: 12 Feb 2026
https://github.com/TradeMaster-NTU/fintech-literature
Fintech literature, including journal, conference, book and useful links
artificial-intelligence data-science machine-learning natural-language-processing quantitative-finance reinforcement-learning
Last synced: 16 Aug 2025
https://github.com/datacarpentry/semester-biology
Forkable teaching materials for course on working with data in R
biology data-carpentry data-science r spatial-data sql teaching-materials
Last synced: 11 Mar 2026
https://github.com/stanfordnlp/edu-convokit
Edu-ConvoKit: An Open-Source Framework for Education Conversation Data
data data-analysis data-science education language natural-language-processing
Last synced: 15 Apr 2025
https://github.com/ropensci/gittargets
Data version control for reproducible analysis pipelines in R with {targets}.
data-science data-version-control data-versioning r r-package reproducibility reproducible-research rstats targets workflow
Last synced: 21 Aug 2025
https://github.com/kensk8er/chicksexer
A Python package for gender classification.
data-science deep-learning gender-classification lstm machine-learning natural-language-processing neural-network nlp python recurrent-neural-networks tensorflow
Last synced: 10 Apr 2026
https://github.com/delsner/flask-angular-data-science
Repository for a data science starter app using Flask, Angular and Docker. https://medium.com/@dvelsner/deploying-a-simple-machine-learning-model-in-a-modern-web-application-flask-angular-docker-a657db075280
angular data-science docker flask machine-learning python sklearn typescript
Last synced: 24 Jul 2025
https://github.com/rogerfitz/tutorials
Git Repo for Articles on Ergo Sum blog and the youtube channel https://www.youtube.com/channel/UCiie9CN--dazA7iT2sry5FA
algorithmia data-science draft-kings fan-duel fivethirtyeight google-maps-api ocr python sports tech text-to-speech visualizations
Last synced: 05 Apr 2025
https://github.com/maxim5/hyper-engine
Python library for Bayesian hyper-parameters optimization
bayesian-optimization big-data convolutional-neural-networks data-science deep-learning gaussian-processes hyperparameter-optimization machine-learning model-selection neural-network optimization-algorithms python random-search tensorflow
Last synced: 06 Apr 2025
https://github.com/akgold/do4ds
A book on DevOps for Data Scientists with CRC Press.
data-science devops it python r
Last synced: 25 Apr 2025
https://github.com/mmkim1210/geneticsmakie.jl
๐งฌHigh-performance genetics- and genomics-related data visualization using Makie.jl
bioinformatics cairomakie colocalization data-science fine-mapping genetics genomics gwas julia julia-language linkage locuszoom makie multi-ethnic multivariate openmendel phewas qtl v2f visualization
Last synced: 24 Oct 2025
https://github.com/quantscious/finmlkit
An open-source, lightweight, and blazing-fast financial machine learning library built with Numba. Process raw trades, generate advanced bars, features, and labels for quantitative research.
data-engineering data-science data-structures feature-engineering feature-extraction financial-analysis financial-data financial-machine-learning numba python quant quantitative-finance quantitative-research
Last synced: 17 Mar 2026
https://github.com/zjuearthdata/geochemistrypi
an open-sourced highly automated machine learning Python framework for data-driven geochemistry discovery
dash data-science fastapi flaml geochemistry mlflow nodejs ray reactjs scikit-learn typer
Last synced: 13 Dec 2025
https://github.com/nimbleboxai/nbox
The official python package for NimbleBox. Exposes all APIs as CLIs and contains modules to make ML ๐ธ
data-science machine-learning ml-infrastructure ml-platform ml-service mlops mlops-automation mlops-pipeline mlops-tool mlops-workflow model-deployment model-management model-monitoring model-serving practical-mlops
Last synced: 14 Dec 2025
https://github.com/Nelson-numerical-software/nelson
The Nelson Programming Language
cpp17 data-science data-structures interpreter mathematical-functions matlab matrix-functions nelson octave programming-language scientific-computing scilab
Last synced: 11 May 2025
https://github.com/runprism/prism
Prism is the easiest way to develop, orchestrate, and execute data pipelines in Python.
bigquery data data-analysis data-engineering data-integration data-orc data-science dbt etl etl-pipeline machine-learning orchestration pipeline postgres python redshift snowflake trino
Last synced: 19 May 2026
https://github.com/sangaline/reverse-engineering-the-hacker-news-ranking-algorithm
An analysis of historical Hacker News data to determine the ranking algorithm
analysis data-science hacker-news
Last synced: 13 Apr 2025
https://github.com/fastai/fastgpu
A queue service for quickly developing scripts that use all your GPUs efficiently
data-science deep-learning gpus machine-learning python resource-management
Last synced: 18 Jun 2025
https://github.com/uc-r/uc-r.github.io
Main repository for R programming courses @ University of Cincinnati, courses and tutorials that focus on data wrangling, exploration, visualization, and analysis with R.
classroom data-science data-wrangling machine-learning r tutorial tutorial-code visualization
Last synced: 26 Mar 2025
https://github.com/devinterview-io/computer-vision-interview-questions
๐ฃ Computer Vision interview questions and answers to help you prepare for your next machine learning and data science interview in 2025.
ai-interview-questions coding-interview-questions coding-interviews computer-vision computer-vision-interview-questions computer-vision-questions computer-vision-tech-interview data-science data-science-interview data-science-interview-questions data-scientist-interview interview-practice interview-preparation machine-learning machine-learning-and-data-science machine-learning-interview machine-learning-interview-questions software-engineer-interview technical-interview-questions
Last synced: 14 Feb 2026
https://github.com/svenkreiss/databench
Data analysis tool.
data-science data-visualization python
Last synced: 06 Mar 2026
https://github.com/dlt-hub/verified-sources
Contribute to dlt verified sources ๐ฅ
api contribute data-analysis data-engineering data-science data-source pipeline python
Last synced: 13 Jun 2025
https://github.com/n3mo/data-science
Data science tooling for Racket
data-science racket sentiment-analysis statistics text-processing
Last synced: 22 Feb 2026
https://github.com/beneath-hq/beneath
Beneath is a serverless real-time data platform โก๏ธ
analytics beneath data-engineering data-pipelines data-science data-warehouse dataops developer-tools etl go kubernetes mlops python sql streaming
Last synced: 03 Apr 2025
https://github.com/habedi/practicalmachinelearning
A collection of open-source and free machine learning resources
anomaly-detection data-analysis data-mining data-science data-science-resourses datasets deep-learning deep-neural-networks graph-algorithms graph-mining jupyter-notebook kaggle machine-learning pandas python python-machine-learning scikit-learn self-learning zeppelin-notebook
Last synced: 07 Nov 2025
https://github.com/mahmoudparsian/pyspark-algorithms
PySpark Algorithms Book: https://www.amazon.com/dp/B07X4B2218/ref=sr_1_2
algorithms big-data data data-abstractions data-science dataframe distributed-computing graphframes mapreduce monoid nosql partitioning pyspark pyspark-algorithms python rdd spark transformations
Last synced: 07 Apr 2025
https://github.com/bcgov/bcdata
An R package for searching & retrieving data from the B.C. Data Catalogue
bcdc citz data-science env r r-package rstats
Last synced: 04 Apr 2025
https://github.com/frlender/jandas
A very much Pandas-like JavaScript library for data science
data-science dataframe indexing pandas series
Last synced: 17 Jan 2026
https://github.com/palashio/nylon
An intelligent, flexible grammar of machine learning.
auto-ml data-science grammar machine-learning
Last synced: 13 Apr 2025
https://github.com/TomasBeuzen/python-programming-for-data-science
Content from the University of British Columbia's Master of Data Science course DSCI 511.
data-manipulation data-science numpy pandas programming python teaching
Last synced: 18 Jul 2025
https://github.com/FlyRanch/figurefirst
A layout-first approach to figure making
data-science inkscape inkscape-extensions matplotlib plotting python svg
Last synced: 08 May 2025
https://github.com/gitonthescene/csv-reconcile
A reconciliation service for OpenRefine serving data from a given CSV file.
Last synced: 03 Jan 2026
https://github.com/hongping-zh/circular-bias-detection
a comprehensive statistical framework for detecting circular reasoning bias in AI algorithm evaluation
ai-ethics bias-detection data-science llm machine-learning model-evaluation
Last synced: 07 Mar 2026
https://github.com/ogustavo-pereira/aprenda-python
:books: Recursos para aprender Python
bioin data-science data-visualization deep-learning desing-patterns django flask machine-learning python python2 python3
Last synced: 22 Jul 2025
https://github.com/Dumbris/trunklucator
Python module for data scientists for quick creating annotation projects.
active-learning annotation annotation-tool data-science machine-learning nlp
Last synced: 03 Apr 2025
https://github.com/radicalbit/radicalbit-ai-monitoring
A comprehensive solution for monitoring your AI models in production
ai ai-monitoring ai-observability artificial-intelligence data-drift data-science llm-observability machine-learning machine-learning-engineering ml-observability monitoring observability
Last synced: 26 Jan 2026
https://github.com/GDSL-UL/san
Spatial Modelling for Data Scientists
book cross-validation data-science geographically-weighted-regression maps moran-i multilevel-models r r-spatial spatial-analysis spatial-econometrics
Last synced: 30 Jul 2025
https://github.com/krishkumar/createml-playgrounds
Create ML playgrounds for building machine learning models. For developers and data scientists.
apple classifier coreml createml data-science ios12 machine-learning model playground xcode
Last synced: 06 Oct 2025
https://github.com/sigvt/vtuber-livechat-dataset
๐ VTuber 1B: Billion-scale Live Chat and Moderation Event Dataset
data-science dataset holodata hololive machine-learning nijisanji nlp sigvt statistics superchat vtuber youtube-livestream
Last synced: 23 Jun 2025
https://github.com/jialuechen/tfq-finance
Quantum Finance Library
cirq data-science derivatives-pricing high-frequency-trading machine-learning model-calibration physics portfolio-optimization quantitative-finance quantum-classical quantum-computing quantum-finance risk-management tensorflow-quantum
Last synced: 02 Mar 2025
https://github.com/seandavi/geoquery
The bridge between the NCBI Gene Expression Omnibus and Bioconductor
bioconductor bioinformatics data-science genomics ncbi-geo r rstats
Last synced: 04 Apr 2025
https://github.com/benedekrozemberczki/asne
A sparsity aware and memory efficient implementation of "Attributed Social Network Embedding" (TKDE 2018).
2vec aane asne attributed attributed-embedding data-science deepwalk diff2vec dimensionality-reduction embedding factorization feature-extraction gemsec graph-embedding network-embedding node-embedding node2vec representation-learning tensorflow word2vec
Last synced: 11 Apr 2025
https://github.com/lettier/interactiveknn
Interactive K-Nearest Neighbors machine learning algorithm in JavaScript.
ai classification data-analysis data-science gui html5 interactive-knearest-neighbors javascript k-nearest-neighbor k-nearest-neighbors k-nearest-neighbours knn machine-learning machine-learning-algorithms nearest-neighbor-search scikit-learn statistics visualization
Last synced: 26 Mar 2025
https://github.com/MLMI2-CSSI/foundry
Simplifying the discovery and usage of machine-learning ready datasets in materials science and chemistry
chemistry data-science datasets machine-learning materials-science
Last synced: 15 Jul 2025
https://github.com/dspinellis/alexandria3k
Local relational access to openly-available publication data sets
bibliometric-analysis crossref data-science orcid scientometrics
Last synced: 04 Apr 2025
https://github.com/Erfaniaa/crypto-trading-strategy-backtester
Easy-to-use cryptocurrency trading strategy simulator and backtester
backtesting backtesting-trading-strategies binance bitcoin crypto cryptocurrency data-science dataset dataset-generation machine-learning python quantitative-finance quantitative-trading simulation time-series trading trading-strategies
Last synced: 18 Apr 2025