Ecosyste.ms: Awesome
An open API service indexing awesome lists of open source software.
Data Science
Data science is an inter-disciplinary field that uses scientific methods, processes, algorithms, and systems to extract knowledge from structured and unstructured data. Data scientists perform data analysis and preparation, and their findings inform high-level decisions in many organizations.
- GitHub: https://github.com/topics/data-science
- Wikipedia: https://en.wikipedia.org/wiki/Data_science
- Related Topics: data-analysis, data-mining, machine-learning, big-data, data-visualization,
- Aliases: datasciences, data-science-project, data-science-algorithm,
- Last updated: 2025-02-06 00:07:07 UTC
- JSON Representation
https://github.com/bioconductor/genomicdatacommons
Provide R access to the NCI Genomic Data Commons portal.
api-client bioconductor bioinformatics cancer core-services data-science genomics nci r tcga vignette
Last synced: 02 Feb 2025
https://github.com/nuclio/nuclio-jupyter
Nuclio Function Automation for Python and Jupyter
data-science jupyter kubernetes nuclio python
Last synced: 02 Feb 2025
https://github.com/uc-r/uc-r.github.io
Main repository for R programming courses @ University of Cincinnati, courses and tutorials that focus on data wrangling, exploration, visualization, and analysis with R.
classroom data-science data-wrangling machine-learning r tutorial tutorial-code visualization
Last synced: 30 Oct 2024
https://github.com/svilupp/PromptingTools.jl
Streamline your life using PromptingTools.jl, the Julia package that simplifies interacting with large language models.
data-science generative-ai julia
Last synced: 28 Oct 2024
https://github.com/jialuechen/tfq-finance
Quantum Finance Library
cirq data-science derivatives-pricing high-frequency-trading machine-learning model-calibration physics portfolio-optimization quantitative-finance quantum-classical quantum-computing quantum-finance risk-management tensorflow-quantum
Last synced: 02 Feb 2025
https://github.com/habedi/practicalmachinelearning
A collection of open-source and free machine learning resources
anomaly-detection data-analysis data-mining data-science data-science-resourses datasets deep-learning deep-neural-networks graph-algorithms graph-mining jupyter-notebook kaggle machine-learning pandas python python-machine-learning scikit-learn self-learning zeppelin-notebook
Last synced: 06 Feb 2025
https://github.com/zjuearthdata/geochemistrypi
an open-sourced highly automated machine learning Python framework for data-driven geochemistry discovery
dash data-science fastapi flaml geochemistry mlflow nodejs ray reactjs scikit-learn typer
Last synced: 05 Feb 2025
https://github.com/svenkreiss/databench
Data analysis tool.
data-science data-visualization python
Last synced: 26 Dec 2024
https://github.com/n3mo/data-science
Data science tooling for Racket
data-science racket sentiment-analysis statistics text-processing
Last synced: 18 Nov 2024
https://github.com/mmkim1210/geneticsmakie.jl
🧬High-performance genetics- and genomics-related data visualization using Makie.jl
bioinformatics cairomakie colocalization data-science fine-mapping genetics genomics gwas julia julia-language linkage locuszoom makie multi-ethnic multivariate openmendel phewas qtl v2f visualization
Last synced: 05 Feb 2025
https://github.com/khuyentran1401/machine-learning-pipeline
Example machine learning pipeline with MLflow and Hydra
data-science hydra machine-learning machine-learning-pipeline mlflow
Last synced: 26 Nov 2024
https://github.com/palashio/nylon
An intelligent, flexible grammar of machine learning.
auto-ml data-science grammar machine-learning
Last synced: 07 Nov 2024
https://github.com/GDSL-UL/san
Spatial Modelling for Data Scientists
book cross-validation data-science geographically-weighted-regression maps moran-i multilevel-models r r-spatial spatial-analysis spatial-econometrics
Last synced: 04 Dec 2024
https://github.com/sportsdataverse/sportsdataverse-py
sportsdataverse python package
cfb-data college-basketball college-football data-science espn hockey nba nba-stats nfl nflfastr nhl nhl-api python sports sports-analytics sports-data sports-stats sportsdataverse wnba womens-basketball
Last synced: 01 Feb 2025
https://github.com/seandavi/geoquery
The bridge between the NCBI Gene Expression Omnibus and Bioconductor
bioconductor bioinformatics data-science genomics ncbi-geo r rstats
Last synced: 01 Feb 2025
https://github.com/bcgov/bcdata
An R package for searching & retrieving data from the B.C. Data Catalogue
bcdc citz data-science env r r-package rstats
Last synced: 01 Feb 2025
https://github.com/Dumbris/trunklucator
Python module for data scientists for quick creating annotation projects.
active-learning annotation annotation-tool data-science machine-learning nlp
Last synced: 04 Nov 2024
https://github.com/stanfordnlp/edu-convokit
Edu-ConvoKit: An Open-Source Framework for Education Conversation Data
data data-analysis data-science education language natural-language-processing
Last synced: 30 Jan 2025
https://github.com/dspinellis/alexandria3k
Local relational access to openly-available publication data sets
bibliometric-analysis crossref data-science orcid scientometrics
Last synced: 01 Feb 2025
https://github.com/beneath-hq/beneath
Beneath is a serverless real-time data platform ⚡️
analytics beneath data-engineering data-pipelines data-science data-warehouse dataops developer-tools etl go kubernetes mlops python sql streaming
Last synced: 04 Nov 2024
https://github.com/lettier/interactiveknn
Interactive K-Nearest Neighbors machine learning algorithm in JavaScript.
ai classification data-analysis data-science gui html5 interactive-knearest-neighbors javascript k-nearest-neighbor k-nearest-neighbors k-nearest-neighbours knn machine-learning machine-learning-algorithms nearest-neighbor-search scikit-learn statistics visualization
Last synced: 30 Oct 2024
https://github.com/XpressAI/xircuits
Simple visual programming environment for jupyterlab
data-science jupyterlab python
Last synced: 10 Oct 2024
https://github.com/ropensci/gittargets
Data version control for reproducible analysis pipelines in R with {targets}.
data-science data-version-control data-versioning r r-package reproducibility reproducible-research rstats targets workflow
Last synced: 19 Dec 2024
https://github.com/great-expectations/great_expectations_action
A GitHub Action that makes it easy to use Great Expectations to validate your data pipelines in your CI workflows.
actions continuous-integration data-integrity data-quality data-science mlops
Last synced: 06 Nov 2024
https://github.com/gagolews/datawranglingpy
Minimalist Data Wrangling with Python (Open-Access Textbook)
data-analysis data-science data-visualisation data-wrangling jupyter machine-learning matplotlib modelling numpy pandas python python3 scikit-learn scipy scipy-stats seaborn statistics
Last synced: 30 Jan 2025
https://github.com/hemansnation/data-analyst-roadmap
Data-Analyst-Roadmap for Professionals. This roadmap contains 8 Chapters that can be completed in 8 weeks, whether you are a fresher in the field or an experienced professional who wants to transition into Data Analysis.
analytics data-analysis data-analysis-python data-analytics data-science numpy predictive-analytics project-based-learning python statistics tableau
Last synced: 08 Nov 2024
https://github.com/andrea-ballatore/open-geo-data-education
Open Geospatial Datasets for GIS Education: This is a repository of open geospatial datasets to be used in an educational context. I created these files over years of teaching Geographic Data Science and GIS. All original datasets are freely available online with open data licenses (see the dataset attribution for details). All the datasets in this repository have been selected, cleaned, harmonised, and repackaged for GIS exercises in a higher-education context. This is a pretty time-intensive process that other educators can hopefully avoid by using these versions.
data-science geojson geospatial-data geospatial-datasets gis gis-data gis-education tsv
Last synced: 27 Oct 2024
https://github.com/ekramasif/basic-machine-learning
This is a repo of basic Machine Learning what I learn. More to go...
ann artficial-neural-network artificial-intelligence bert-embeddings bert-model blstm collaborate data-science deep-learning embeddings keras lstm machine-learning natural-language-processing neural-network nlp pandas python seaborn tensorflow
Last synced: 26 Oct 2024
https://github.com/mahmoudparsian/pyspark-algorithms
PySpark Algorithms Book: https://www.amazon.com/dp/B07X4B2218/ref=sr_1_2
algorithms big-data data data-abstractions data-science dataframe distributed-computing graphframes mapreduce monoid nosql partitioning pyspark pyspark-algorithms python rdd spark transformations
Last synced: 06 Nov 2024
https://github.com/bramvanroy/spacy_conll
Pipeline component for spaCy (and other spaCy-wrapped parsers such as spacy-stanza and spacy-udpipe) that adds CoNLL-U properties to a Doc and its sentences and tokens. Can also be used as a command-line tool.
conll conll-u data-science machine-learning natural-language-processing nlp pandas parser python spacy spacy-extension spacy-pipeline stanford-machine-learning stanford-nlp stanza udpipe
Last synced: 30 Jan 2025
https://github.com/produvia/ai-platform
An open-source platform for automating tasks using machine learning models
artificial-intelligence automation data-science deep-learning java keras-models machine-learning model-zoo neural-networks python pytorch-models r task tasks tensorflow-models
Last synced: 20 Jan 2025
https://github.com/benedekrozemberczki/asne
A sparsity aware and memory efficient implementation of "Attributed Social Network Embedding" (TKDE 2018).
2vec aane asne attributed attributed-embedding data-science deepwalk diff2vec dimensionality-reduction embedding factorization feature-extraction gemsec graph-embedding network-embedding node-embedding node2vec representation-learning tensorflow word2vec
Last synced: 14 Nov 2024
https://github.com/FlyRanch/figurefirst
A layout-first approach to figure making
data-science inkscape inkscape-extensions matplotlib plotting python svg
Last synced: 15 Nov 2024
https://github.com/imdeepmind/neuralpy
NeuralPy: A Keras like deep learning library works on top of PyTorch
data-science deep-learning keras library machine-learning neural-network neuralpy neuralpy-torch python pytorch
Last synced: 15 Dec 2024
https://github.com/TomasBeuzen/python-programming-for-data-science
Content from the University of British Columbia's Master of Data Science course DSCI 511.
data-manipulation data-science numpy pandas programming python teaching
Last synced: 26 Nov 2024
https://github.com/woz-u/DS-Student-Resources
Data Science Student Companion Notebooks and Data Lake
data-analysis data-science data-visualization machine-learning nosql python r sql statistics
Last synced: 27 Nov 2024
https://github.com/weiji14/zen3geo
The 🌏 data science library you've been waiting for~
analysis-ready-data cloud-native cloud-optimized-geotiff composition data-science datapipe earth-observation foss4g geospatial machine-learning-ready-data stac torch torchdata zarr zen
Last synced: 07 Jan 2025
https://github.com/kennethleungty/generative-ai-pharmacist
Generative AI Pharmacist (For Demo Purposes Only)
ai ai-pharmacist artificial-intelligence chatgpt data-science deep-learning generative-ai generative-ai-pharmacist generative-art healthcare machine-learning pharmacist pharmacy
Last synced: 14 Jan 2025
https://github.com/Invictify/Jupter-Notebook-REST-API
Run your jupyter notebooks as a REST API endpoint. This isn't a jupyter server but rather just a way to run your notebooks as a REST API Endpoint.
data-science data-science-pipelines docker dockerfile fastapi jupyter python rest-api
Last synced: 26 Oct 2024
https://github.com/polymathorg/dataframe
DataFrame in Pharo - tabular data structures for data analysis
data-analysis data-frame data-science data-visualization gsoc hacktoberfest pharo pharo-smalltalk smalltalk statistics tabular-data
Last synced: 03 Feb 2025
https://github.com/visgl/deck.gl-data
Data for the data visualization library deck.gl examples (https://uber.github.io/deck.gl/#/)
data data-science data-visualization uber
Last synced: 27 Nov 2024
https://github.com/5agado/conversation-analyzer
Analyzer and statistics generator for text-based conversations. Includes Facebook scraper and parser
data-science facebook quantified-self scraper
Last synced: 08 Nov 2024
https://github.com/psyplot/psyplot
Python package for interactive data visualization
cartopy climate data-science earth-science earth-system-model interactive matplotlib models netcdf python regression visualization
Last synced: 26 Jan 2025
https://github.com/ndleah/8-week-sql-challenge
#8WeekSQLChallenge by Danny Ma.
data-analysis data-science sql
Last synced: 12 Jan 2025
https://github.com/Erfaniaa/crypto-trading-strategy-backtester
Easy-to-use cryptocurrency trading strategy simulator and backtester
backtesting backtesting-trading-strategies binance bitcoin crypto cryptocurrency data-science dataset dataset-generation machine-learning python quantitative-finance quantitative-trading simulation time-series trading trading-strategies
Last synced: 09 Nov 2024
https://github.com/mainakrepositor/datasets
A bunch of some 200 datasets. You can call it mini-kaggle :)
csv data data-science database datasets image-files mini-kaggle ml nlp-machine-learning tsv
Last synced: 11 Jan 2025
https://github.com/dominodatalab/domino-research
Projects developed by Domino's R&D team
data-science mlflow mlops python sagemaker
Last synced: 24 Nov 2024
https://github.com/rodrigo-arenas/pyworkforce
Standard tools for workforce management, queuing, scheduling, rostering and optimization problems.
begginer-friendly data-science erlangc investigation-of-operation investigations-search looking-for-contributors operations-research optimization ortools python schedule scheduling-algorithms up-for-grabs workforce workforce-management
Last synced: 31 Jan 2025
https://github.com/ibm/kafka-streaming-click-analysis
Use Kafka and Apache Spark streaming to perform click stream analytics
apache-spark clickstream data-science ibm-data-science-experience ibmcode jupyter-notebook kafka spark structured-streaming
Last synced: 22 Jan 2025
https://github.com/ecsim/pem-dataset1
Proton Exchange Membrane (PEM) Fuel Cell Dataset
activation-procedure chemistry data data-science dataset electrochemistry energy fuel-cell impedance mea nafion open-science open-source pem physics polarization power proton-exchange-membrane science science-research
Last synced: 19 Nov 2024
https://github.com/ogustavo-pereira/aprenda-python
:books: Recursos para aprender Python
bioin data-science data-visualization deep-learning desing-patterns django flask machine-learning python python2 python3
Last synced: 18 Nov 2024
https://github.com/ECSIM/pem-dataset1
Proton Exchange Membrane (PEM) Fuel Cell Dataset
activation-procedure chemistry data data-science dataset electrochemistry energy fuel-cell impedance mea nafion open-science open-source pem physics polarization power proton-exchange-membrane science science-research
Last synced: 14 Nov 2024
https://github.com/jonrau1/SyntheticSun
SyntheticSun is a defense-in-depth security automation and monitoring framework which utilizes threat intelligence, machine learning, managed AWS security services and, serverless technologies to continuously prevent, detect and respond to threats.
anomaly-detection automation aws aws-security aws-serverless data-science data-visualization elasticsearch geolocation guardduty incident-response kibana machine-learning misp sagemaker security-automation security-tools serverless threat-detection threat-intelligence
Last synced: 21 Nov 2024
https://github.com/nbarrowman/vtree
An R package for calculating and drawing variable trees
data-science data-visualization exploratory-data-analysis r statistics
Last synced: 31 Oct 2024
https://github.com/siddhujetty/Product-analytics-insights-collection
My Solutions to "A Collection of Data Science Take-Home Challenges" by Giulio Palombo.
data-science machine-learning r-programming solutions take-home-test
Last synced: 04 Dec 2024
https://github.com/kianweelee/Edator
A python package that performs exploratory data analysis for users. Additionally, it generates 3 types of output files (cleaned CSV, plots and a text report).
data-analysis data-science exploratory-data-analysis
Last synced: 15 Nov 2024
https://github.com/PetoLau/petolau.github.io
Blog about time series data mining in R.
artificial-intelligence blog data-analysis data-mining data-science data-visualization forecasting machine-learning r time-series time-series-analysis time-series-clustering time-series-data-mining time-series-forecasting time-series-prediction
Last synced: 11 Nov 2024
https://github.com/tirthajyoti/synthetic-data-gen
Various methods for generating synthetic data for data science and ML
classification data data-science machine-learning python regression symbolic-computation time-series
Last synced: 22 Oct 2024
https://github.com/janishar/data-analytics-project-template
A python project starter template for data-analytics and data-science.
ai anaconda conda data-analysis data-analytics data-science jupyter-notebook keras matplotlib notebook numpy pandas project-starter-kit python python3 tensorflow
Last synced: 02 Nov 2024
https://github.com/cloud-cv/evalai-starters
How to create a challenge on EvalAI?
agent ai cv data-science data-science-competition environments evalai get-started getting-started ml reinforcement-learning rl
Last synced: 06 Feb 2025
https://github.com/aws-samples/aws-fargate-with-rstudio-open-source
This project delivers AWS CDK Python code to provision serverless infrastructure in AWS Cloud to run Open Source RStudio Server and Shiny.
amazon-athena amazon-ecr amazon-ecs amazon-efs amazon-route53 amazon-s3 amazon-ses amazon-vpc aws-cdk aws-codepipeline aws-datasync aws-fargate-application aws-kms aws-lambda aws-secrets-manager aws-wafv2 data-science rstudio-server shiny-apps
Last synced: 04 Dec 2024
https://github.com/capitalone/dataCompareR
dataCompareR is an R package that allows users to compare two datasets and view a report on the similarities and differences.
compare-data data data-analysis data-science r
Last synced: 04 Dec 2024
https://github.com/dataprofessor/python
Python codes from tutorials on the Data Professor YouTube channel
data-professor data-science dataprofessor datascience machine-learning machinelearning machinelearning-python python python-tutorial
Last synced: 11 Nov 2024
https://github.com/PolyMathOrg/DataFrame
DataFrame in Pharo - tabular data structures for data analysis
data-analysis data-frame data-science data-visualization gsoc hacktoberfest pharo pharo-smalltalk smalltalk statistics tabular-data
Last synced: 17 Nov 2024
https://github.com/MLMI2-CSSI/foundry
Simplifying the discovery and usage of machine-learning ready datasets in materials science and chemistry
chemistry data-science datasets machine-learning materials-science
Last synced: 23 Nov 2024
https://github.com/trainingbypackt/applied-deep-learning-with-python
Applied Deep Learning with Python, published by Packt
data-science deep-learning machine-learning python
Last synced: 14 Nov 2024
https://github.com/Thomas-George-T/Thomas-George-T
Readme for my :octocat: Profile
data-engineer data-science github github-profile icons machine-learning profile-readme readme svg svg-icons
Last synced: 26 Oct 2024
https://github.com/manumerous/vpselector
Visual Pandas Selector: Visualize and interactively select time-series data
data-science data-visualization pandas python selector
Last synced: 29 Oct 2024
https://github.com/glemaitre/pyparis-2018-sklearn
PyParis tutorial on machine learning using scikit-learn
data-science machine-learn pandas scikit-learn
Last synced: 01 Nov 2024
https://github.com/felipenoris/math-server-docker
The ideal multi-user Data Science server with Jupyterhub and RStudio, ready for Python, R and Julia languages.
data-science docker julia julia-language jupyter jupyter-kernels jupyterhub jupyterlab latex python rstudio-servers shiny-server
Last synced: 28 Oct 2024
https://github.com/erfaniaa/crypto-trading-strategy-backtester
Easy-to-use cryptocurrency trading strategy simulator and backtester
backtesting backtesting-trading-strategies binance bitcoin crypto cryptocurrency data-science dataset dataset-generation machine-learning python quantitative-finance quantitative-trading simulation time-series trading trading-strategies
Last synced: 27 Oct 2024
https://github.com/thoughtspile/hippotable
👩🏻🔬📊 Lightweight data analysis in your browser
csv dashboard data-analysis data-science javascript table visualization
Last synced: 18 Dec 2024
https://github.com/elemento24/journey-with-artificial-intelligence
This repo consists of all the resources that can be referred during one's Journey with Artificial Intelligence.
artificial-intelligence data-science deep-learning machine-learning python
Last synced: 23 Jan 2025
https://github.com/ploomber/soorgeon
Convert monolithic Jupyter notebooks 📙 into maintainable Ploomber pipelines. 📊
data-engineering data-science jupyter jupyter-notebooks machine-learning mlops workflow
Last synced: 23 Jan 2025
https://github.com/frjnn/bhtsne
Parallel Barnes-Hut t-SNE implementation written in Rust.
barnes-hut bhtsne data-science data-visualization dimensionality-reduction machine-learning rust similarity-measures
Last synced: 31 Jan 2025
https://github.com/exasol/pyexasol
Exasol Python driver with low overhead, fast HTTP transport and compression
data-science database driver exasol exasol-integration python websocket-client
Last synced: 05 Feb 2025
https://github.com/bcgov/bcmaps
An R package of map layers for British Columbia
data-science env r r-package rstats
Last synced: 05 Feb 2025
https://github.com/grailbio/bio
Bioinformatic infrastructure libraries
bioinformatics data-science golang
Last synced: 09 Nov 2024
https://github.com/uc-r/Advanced-R
Advanced Analytics with R training material delivered in a 2 day format
data-science educational-materials r training-materials workshop-materials
Last synced: 13 Nov 2024
https://github.com/piquette/qtrn
A cli tool to streamline financial markets data analysis :wrench:
cli data data-science finance go golang options quotes scraper stock stock-analysis stock-market
Last synced: 04 Nov 2024
https://github.com/balavenkatesh3322/model_deployment
A collection of model deployment library and technique.
aws azure caffe data-science deep-learning keras machine-learning model model-deployment model-server model-serving mxnet neural-network pytorch serving serving-pytorch-models serving-recommendation serving-tensors tensorflow
Last synced: 10 Nov 2024
https://github.com/xiaodaigh/jlboost.jl
A 100%-Julia implementation of Gradient-Boosting Regression Tree algorithms
catboost data-science gbdt gbrt lightgbm machine-learning tree tree-boosting-algorithms xgboost
Last synced: 30 Dec 2024
https://github.com/fneum/data-science-for-esm
data-science energy energy-data energy-system-modelling
Last synced: 05 Feb 2025
https://github.com/holgerbrandl/kalasim
Discrete Event Simulator
agent-based-modeling data-science discrete-event-simulation optimization process-modeling simulation visulization
Last synced: 05 Feb 2025
https://github.com/verynifty/RolodETH
A Rolodex for popular Ethereum chain address.
data-science ethereum ethereum-blockchain
Last synced: 18 Nov 2024
https://github.com/andrewtavis/kwx
BERT, LDA, and TFIDF based keyword extraction in Python
bert data-analysis data-science data-visualization keyword-extraction latent-dirichlet-allocation lda machine-learning multilingual natural-language-processing nlp open-source python python3 text-analysis text-classification text-mining tfidf topic-modeling unsupervised-learning
Last synced: 22 Jan 2025
https://github.com/robertmartin8/udemyml
Templates, code and notes for Kirill Eremenko's Machine Learning course
data-science machine-learning python r tutorial udemy udemy-machine-learning
Last synced: 22 Oct 2024
https://github.com/data-centric-ai/dcbench
A benchmark of data-centric tasks from across the machine learning lifecycle.
Last synced: 30 Oct 2024
https://github.com/maartengr/projects
Data Science Portfolio
data-science jupyter-notebook machine-learning nlp portfolio python pytorch reinforcement-learning
Last synced: 28 Oct 2024
https://github.com/paris-saclay-cds/ramp-workflow
Toolkit for building predictive workflows on top of pydata (pandas, scikit-learn, pytorch, keras, etc.).
data-challenge data-science python ramp
Last synced: 06 Jan 2025
https://github.com/devinterview-io/pytorch-interview-questions
🟣 PyTorch interview questions and answers to help you prepare for your next machine learning and data science interview in 2024.
ai-interview-questions coding-interview-questions coding-interviews data-science data-science-interview data-science-interview-questions data-scientist-interview interview-practice interview-preparation machine-learning machine-learning-and-data-science machine-learning-interview machine-learning-interview-questions pytorch pytorch-interview-questions pytorch-questions pytorch-tech-interview software-engineer-interview technical-interview-questions
Last synced: 06 Feb 2025
https://github.com/hsbc/tslumen
A library for Time Series EDA (exploratory data analysis)
analysis data-analysis data-science data-visualization eda exploratory-data-analysis exploratory-data-visualizations pandas profiling python time-series time-series-analysis time-series-eda time-series-profiling timeseries timeseries-analysis timeseries-eda
Last synced: 05 Feb 2025
https://github.com/shenxiangzhuang/pythondataanalysis
The data and code that used in my book.
data-science python3 webcrawler
Last synced: 24 Nov 2024
https://github.com/shenxiangzhuang/PythonDataAnalysis
The data and code that used in my book.
data-science python3 webcrawler
Last synced: 30 Oct 2024
https://github.com/nishkarshraj/automation-using-shell-scripts
Development Automation using Shell Scripting.
anacron at automation automation-framework backup bash-script cron crontab data-science data-structures development linux scenarios scheduler shell shell-scripts sorting-algorithms
Last synced: 16 Nov 2024
https://github.com/devsgnr/breadroll
breadroll 🥟 is a simple lightweight library for data processing operations written in Typescript and powered by Bun.
bun csv csv-parser data-engineering data-science data-transformation eda exploratory-data-analysis tsv tsv-parser
Last synced: 09 Dec 2024
https://github.com/jbramburger/DataDrivenDynSyst
Scripts and notebooks to accompany the book Data-Driven Methods for Dynamic Systems
autoencoder autoencoder-neural-network autoencoders conservation-laws data-science dynamic-mode-decomposition dynamical-systems extended-dynamic-mode-decomposition forecasting kernel-methods machine-learning neural-network physics-informed-learning physics-informed-neural-networks poincare-map sindy sindy-algorithm stroboscopic-map time-delay universal-approximation-theorem
Last synced: 12 Nov 2024
https://github.com/empower-ai/sql-agent
Ai Agent that helps you do data analytics with natural language.
analytics bigquery chatgpt chatgpt-bot data data-analytics data-science mysql postgresql slack slack-bot slackbot
Last synced: 14 Nov 2024
https://github.com/argilla-io/biome-text
Custom Natural Language Processing with big and small models 🌲🌱
allennlp data-science natural-language-processing nlp pytorch
Last synced: 25 Jan 2025
https://github.com/ahmedfgad/arithmeticencodingpython
Data Compression using Arithmetic Encoding in Python
arithmetic-coding data-compression data-science entropy-coding lossless-compression-algorithm python
Last synced: 17 Nov 2024