Ecosyste.ms: Awesome
An open API service indexing awesome lists of open source software.
Data Science
Data science is an inter-disciplinary field that uses scientific methods, processes, algorithms, and systems to extract knowledge from structured and unstructured data. Data scientists perform data analysis and preparation, and their findings inform high-level decisions in many organizations.
- GitHub: https://github.com/topics/data-science
- Wikipedia: https://en.wikipedia.org/wiki/Data_science
- Related Topics: data-analysis, data-mining, machine-learning, big-data, data-visualization
- Aliases: datasciences, data-science-project, data-science-algorithm
- Last updated: 2024-12-23 00:06:28 UTC
https://github.com/greenelab/scihub
Source code and data analyses for the Sci-Hub Coverage Study
crossref data-science doi journals libgen open-data sci-hub scimag scopus
Last synced: 18 Dec 2024
https://github.com/petrobras/3w
Promotes development of ML algorithms for early detection and classification of undesirable events in offshore oil wells.
anomaly-detection data-science machine-learning multivariate-time-series-analysis oil-well-monitoring
Last synced: 23 Dec 2024
https://github.com/neuhausi/canvasXpress
CanvasXpress: A JavaScript Library for Data Analytics with Full Audit Trail Capabilities.
analytics bioinformatics chart charting cran dash dashboard data-analytics data-science data-visualization genomics graphs javascript network network-visualization python r reproducible-research shiny visualization
Last synced: 04 Dec 2024
https://github.com/drivendataorg/deon
A command line tool to easily add an ethics checklist to your data science projects.
data-ethics data-science ethics machine-learning
Last synced: 21 Dec 2024
https://github.com/Dyakonov/PZAD
Course "Applied Problems in Data Analysis" (Faculty of Computational Mathematics and Cybernetics, Lomonosov Moscow State University)
data-mining data-science data-visualization education lectures machine-learning ml russian slides
Last synced: 27 Nov 2024
https://github.com/iterative/terraform-provider-iterative
☁️ Terraform plugin for machine learning workloads: spot instance recovery & auto-termination | AWS, GCP, Azure, Kubernetes
aws azure cloud cloud-computing cloud-infrastructure cloud-orchestration cloud-storage cml data-science developer-tools gcp gpu hacktoberfest k8s machine-learning mlops terraform terraform-provider terraform-provider-iterative tpi
Last synced: 01 Nov 2024
https://github.com/yamafaktory/hypergraph
Hypergraph is a data structure library for creating directed hypergraphs in which a hyperedge can join any number of vertices.
data data-science data-structure data-structures hypergraph hypergraphs rust rust-lang rustlang
Last synced: 20 Dec 2024
https://github.com/zero-one-group/geni
A Clojure dataframe library that runs on Spark
big-data clojure clojure-library clojure-repl data-engineering data-science dataframe distributed-computing high-performance-computing machine-learning parallel-computing spark
Last synced: 18 Dec 2024
https://github.com/ibotta/sk-dist
Distributed scikit-learn meta-estimators in PySpark
data-science machine-learning ml scikit-learn spark
Last synced: 21 Dec 2024
https://github.com/microsoft/NimbusML
Python machine learning package providing simple interoperability between ML.NET and scikit-learn components.
data-science machine-learning ml mlnet nimbusml python scikit-learn
Last synced: 09 Nov 2024
https://github.com/deepgraph/deepgraph
Analyze Data with Pandas-based Networks.
data-analysis data-mining data-science data-structures data-visualization graph-database graph-theory graphs graphviz interfacing iterative-methods multilayer-networks network network-analysis network-visualization networkx pandas parallel partitioning
Last synced: 07 Nov 2024
https://github.com/rballester/tntorch
Tensor Network Learning with PyTorch
cp-decomposition data-science learning pytorch tensor-decomposition tensor-networks tensor-train tensors tucker-decomposition
Last synced: 15 Nov 2024
https://github.com/merantix-momentum/squirrel-core
A Python library that enables ML teams to share, load, and transform data in a collaborative, flexible, and efficient way :chestnut:
ai cloud-computing collaboration computer-vision cv data-ingestion data-mesh data-science dataops datasets deep-learning distributed internal machine-learning ml natural-language-processing nlp python pytorch tensorflow
Last synced: 02 Nov 2024
https://github.com/glm-tools/pyglmnet
Python implementation of elastic-net regularized generalized linear models
data-science elastic-net glm lasso machine-learning python
Last synced: 12 Nov 2024
https://github.com/modal-labs/modal-client
Python client library for Modal
ai cloud data-science distributed genai machine-learning modal python serverless
Last synced: 30 Oct 2024
https://github.com/Giorgi/DuckDB.NET
Bindings and ADO.NET Provider for DuckDB
ado-net data-science duckdb duckdb-database hacktoberfest hacktoberfest2023
Last synced: 28 Oct 2024
https://github.com/Mybridge/python-articles
Monthly Series - Top 10 Python Articles
data-science data-visualization django flask python python3
Last synced: 28 Oct 2024
https://github.com/john-science/scipy_con_2019
Tutorial Sessions for SciPy Con 2019
data-science machine-learning python scipy scipy2019 time-series tutorial tutorial-sessions
Last synced: 21 Dec 2024
https://github.com/jupyter-naas/naas
Low-code Python library to safely use notebooks in production: schedule workflows, generate assets, trigger webhooks, send notifications, build pipelines, manage secrets (Cloud-only)
ai binder data data-science data-transformation engine etl integration jupyter jupyterlab notebooks open-source pipeline
Last synced: 04 Nov 2024
https://github.com/senseyeio/roger
Golang RServe client. Use R from Go
data-science go r rserve scientific-computing
Last synced: 13 Nov 2024
https://github.com/tirthajyoti/web-database-analytics
Web scraping and related analytics using Python tools
analytics beautifulsoup4 data-science data-wrangling database json json-parser natural-language-processing nlp python regular-expression sql sqlite3 web-scraping xml-parser
Last synced: 17 Dec 2024
https://github.com/rasgointelligence/RasgoQL
Write python locally, execute SQL in your data warehouse
data-analysis data-science pandas python sql
Last synced: 27 Nov 2024
https://github.com/quintoandar/butterfree
A tool for building feature stores.
data-engineering data-science etl etl-framework feature-store package pyspark python
Last synced: 15 Nov 2024
https://github.com/vopani/datatableton
100 exercises to learn Python Datatable
data-science datatable pydatatable python tutorial-exercises
Last synced: 18 Nov 2024
https://github.com/svenkreiss/pysparkling
A pure Python implementation of Apache Spark's RDD and DStream interfaces.
apache-spark data-processing data-science python
Last synced: 20 Dec 2024
https://github.com/PYFTS/pyFTS
An open source library for Fuzzy Time Series in Python
data-science econometrics forecasting forecasting-models fts fuzzy-rules fuzzy-sets fuzzy-sets-syst fuzzy-systems fuzzy-time-series interval probabilistic-forecasting python series series-models time-series time-series-analysis time-series-forecasting
Last synced: 05 Nov 2024
https://pyfts.github.io/pyFTS/
An open source library for Fuzzy Time Series in Python
data-science econometrics forecasting forecasting-models fts fuzzy-rules fuzzy-sets fuzzy-sets-syst fuzzy-systems fuzzy-time-series interval probabilistic-forecasting python series series-models time-series time-series-analysis time-series-forecasting
Last synced: 02 Nov 2024
https://github.com/enlite-ai/maze
Maze Applied Reinforcement Learning Framework
applied-machine-learning automation data-science decision-making deep-learning distributed documentation framework machine-learning monitoring optimization python reinforcement-learning simulation
Last synced: 02 Nov 2024
https://github.com/wizardforcel/data-science-notebook
:book: Every great thought and action has a humble beginning
data-analysis data-science machine-learning notebook numpy pandas sklearn tensorflow
Last synced: 18 Dec 2024
https://github.com/griperis/blenderdatavis
Data visualisation addon for Blender
blender blender-addon chart data-science data-visualisation
Last synced: 17 Dec 2024
https://github.com/packtworkshops/the-python-workshop
A New, Interactive Approach to Learning Python
algorithms data-science gridsearchcv linear-regression logistic-regression machine-learning python pytorch random-forests randomizedsearchcv structure types
Last synced: 21 Dec 2024
https://github.com/dataqa/nlp-labelling
Labelling platform for text using weak supervision.
annotation-tool data-labeling data-science learning-with-limited-labeled-data learning-with-noisy-labels natural-language-processing ner nlp nlp-machine-learning pseudo-labeling search-engine text-annotation-tool text-classification text-mining weak-supervision
Last synced: 29 Oct 2024
https://github.com/carloocchiena/the_statistics_handbook
The Statistics Handbook open source repository
data-science latex mathematics statistics
Last synced: 17 Dec 2024
https://github.com/empower-ai/dsensei
AI-powered key driver analysis tool that pinpoints the root cause behind metric fluctuations in one minute.
analytics business-analytics business-intelligence data data-analytics data-insights data-science
Last synced: 18 Dec 2024
https://github.com/amitkaps/full-stack-data-science
Full Stack Data Science in Python
data-product data-science machine-learning python stack-data-science workshop
Last synced: 18 Dec 2024
https://github.com/kde/labplot
LabPlot is a FREE, open source and cross-platform Data Visualization and Analysis software accessible to everyone.
data-analysis data-science data-visualization fitting graph graph2d plotting scientific-plotting scientific-visualization
Last synced: 18 Dec 2024
https://github.com/aurimas13/machine-learning-goodness
The Machine Learning project including ML/DL projects, notebooks, cheat codes of ML/DL, useful information on AI/AGI and codes or snippets/scripts/tasks with tips.
algorithms artifcial-intelligence artificial-intelligence chatgpt cheatsheets computer-science data-science deep-neural-networks deep-reinforcement-learning gpt4 machine-learning machine-learning-algorithms mlops python python3 reinforcement-learning reinforcement-learning-algorithms tips tips-and-tricks
Last synced: 09 Nov 2024
https://github.com/maxpumperla/learning_ray
Notebooks for the O'Reilly book "Learning Ray"
data-science deep-learning distributed-computing machine-learning notebook python ray
Last synced: 24 Dec 2024
https://github.com/flyteorg/flytekit
Extensible Python SDK for developing Flyte tasks and workflows. Simple to get started and learn and highly extensible.
automation data data-science extensible flyte flyte-tasks hacktoberfest mlops pypi python sdk spark workflows
Last synced: 18 Dec 2024
https://github.com/gandersen101/spaczz
Fuzzy matching and more functionality for spaCy.
ai artificial-intelligence data-science fuzzy-matching natural-language-processing nlp nlp-library python rapidfuzz regex spacy spacy-extension spacy-extensions
Last synced: 20 Dec 2024
https://github.com/uclatommy/tweetfeels
Real-time sentiment analysis in Python using Twitter's streaming API
data-mining data-science python-3-6 sentiment-analysis twitter
Last synced: 22 Dec 2024
https://github.com/dwhitena/gophernet
A simple from-scratch neural net written in Go
artificial-intelligence data-science go golang machine-learning neural-network
Last synced: 11 Nov 2024
https://github.com/msuzen/looper
A resource list for causality in statistics, data science and physics
bayesian-inference causal causal-discovery causal-impact causal-inference causal-machine-learning causal-models causal-networks causality causality-algorithms causality-analysis causation data-science machine-learning meta-learning physics statistical-inference statistical-mechanics statistical-physics statistics
Last synced: 12 Nov 2024
https://github.com/asad70/reddit-sentiment-analysis
This program goes through Reddit, finds the most mentioned tickers, and uses the VADER SentimentIntensityAnalyzer to calculate each ticker's compound value.
algotrading data-science data-science-projects data-visualization mentioned-tickers reddit reddit-sentiment-analysis sentiment sentiment-analysis stocks ticker-compound trading vader vader-sentiment-analysis vader-sentimentintensityanalyzer wallstreetbets
Last synced: 11 Nov 2024
https://github.com/cartodb/cartoframes
CARTO Python package for data scientists
carto data-science jupyter-notebook maps python spatial-data-analysis
Last synced: 19 Dec 2024
https://github.com/khanhnamle1994/statistical-learning
Lecture Slides and R Sessions for Trevor Hastie and Rob Tibshirani's "Statistical Learning" Stanford course
data-mining data-science r regression statistical-learning
Last synced: 17 Nov 2024
https://github.com/red-data-tools/unicode_plot.rb
Plot your data by Unicode characters
data-science data-visualization ruby
Last synced: 24 Dec 2024
https://github.com/tirthajyoti/uci-ml-api
Simple API for UCI Machine Learning Dataset Repository (search, download, analyze)
api classification clustering data-science learning machine-learning python regression statistics uci-machine-learning
Last synced: 18 Dec 2024
https://github.com/sktime/skpro
A unified framework for tabular probabilistic regression, time-to-event prediction, and probability distributions in Python
ai data-science distributional-regression distributions failure-prediction framework machine-learning prediction probability-distributions python regression sklearn sktime survival-analysis survival-models survival-prediction time-to-event
Last synced: 20 Dec 2024
https://github.com/analysiscenter/cardio
CardIO is a library for data science research of heart signals
data-science deep-learning deep-neural-networks healthcare machine-learning python
Last synced: 13 Nov 2024
https://github.com/dgerlanc/programming-with-data
🐍 Learn Python and Pandas from the ground up
dangerlanc data-science pandas pandas-tutorial python workshop
Last synced: 23 Dec 2024
https://github.com/WenjieZ/TSCV
Time Series Cross-Validation -- an extension for scikit-learn
backtesting cross-validation data-science hyperparameter-optimization machine-learning model-selection time-series tuning-parameters
Last synced: 05 Nov 2024
https://github.com/ropensci/elastic
R client for the Elasticsearch HTTP API
data-science database database-wrapper elasticsearch etl http json r r-package rstats
Last synced: 17 Dec 2024
https://github.com/Bears-R-Us/arkouda
Arkouda (αρκούδα): Interactive Data Analytics at Supercomputing Scale :bear:
chapel data data-analysis data-science distributed-computing eda hpc python
Last synced: 20 Nov 2024
https://github.com/justmarkham/trump-lies
Tutorial: Web scraping in Python with Beautiful Soup
beautiful-soup data-science dataset pandas python requests tutorial web-scraping
Last synced: 17 Dec 2024
https://github.com/PKU-DAIR/Hetu
A high-performance distributed deep learning system targeting large-scale and automated distributed training.
artificial-intelligence autograd data-science deep-learning deep-neural-networks distributed-systems distributed-training embeddings gpu high-dimensional machine-learning python state-of-the-art
Last synced: 28 Oct 2024
https://github.com/adriangb/scikeras
Scikit-Learn API wrapper for Keras.
data-science deep-learning deep-neural-networks keras machine-learning python scikit-learn tensorflow wrappers
Last synced: 22 Dec 2024
https://github.com/Esri/awesome-arcgis-developer
A curated list of resources to help you with ArcGIS development, APIs, SDKs, tools, and location services
arcgis arcgis-apis awesome awesome-list data-science developer developer-experience developer-tools developers gis location-intelligence location-services mapping productivity samples spatial-analysis web-development web-mapping
Last synced: 06 Dec 2024
https://github.com/jldbc/coffee-quality-database
Building the Coffee Quality Institute Database
agriculture coffee data data-science dataset
Last synced: 18 Dec 2024
https://github.com/samueldobbie/markup
A web-based document annotation tool, powered by GPT-4 :rocket:
active-learning annotation-tool data-labeling data-science gpt-4 machine-learning named-entity-recognition natural-language-processing ner nlp sequence-to-sequence text-annotation text-annotation-tool
Last synced: 27 Oct 2024
https://github.com/durgeshsamariya/data-science-roadmap
Roadmap to learn Data Science and related areas.
data-science data-science-resources learn-data-science roadmap
Last synced: 08 Nov 2024
https://github.com/jphall663/GWU_data_mining
Materials for GWU DNSC 6279 and DNSC 6290.
data-mining data-science data-visualization h2o image-processing image-recognition machine-learning python r sas text-mining
Last synced: 18 Nov 2024
https://github.com/laresbernardo/lares
Analytics & Machine Learning R Sidekick
analytics api automation automl data-science descriptive-statistics h2o machine-learning marketing mmm predictive-modeling puzzle r r-package rlanguage robyn rstats visualization
Last synced: 20 Dec 2024
https://github.com/recodehive/stackoverflow-analysis
Stack Overflow is a professional community for developers. This repo analyzes three years of the Stack Overflow Developer Survey, visualizes the results, and predicts future data scientist salaries.
canva collaborate data-analysis data-science data-visualization ghdesktop github github-pages machine-learning stack-overflow student-vscode survey-analysis vscode
Last synced: 21 Dec 2024
https://github.com/graphia-app/graphia
A visualisation tool for the creation and analysis of graphs
analysis data data-analysis data-science data-visualization graphs interpretation networks visualisation visualization
Last synced: 03 Nov 2024
https://github.com/data-dot-all/dataall
A modern data marketplace that makes collaboration among diverse users (like business, analysts and engineers) easier, increasing efficiency and agility in data projects on AWS.
aws aws-glue aws-lake-formation aws-s3 data data-science etl-framework lakeformation lakehouse redshift
Last synced: 04 Dec 2024
https://github.com/shreyashankar/datasets-for-good
List of datasets to apply stats/machine learning/technology to the world of social good.
data-science dataset education environment government health machine-learning social-good
Last synced: 13 Nov 2024
https://github.com/dialnd/imbalanced-algorithms
Python-based implementations of algorithms for learning on imbalanced data.
data-science imbalanced-data machine-learning notre-dame python
Last synced: 07 Nov 2024
https://github.com/alex-lekov/automl_alex
State-of-the-art Automated Machine Learning Python library for Tabular Data
auto-ml automatic-machine-learning automl cross-validation data-science data-science-projects hyperparameter-optimization hyperparameter-tuning machine-learning machine-learning-library machine-learning-models ml model-selection optimisation python sklearn stacking stacking-ensemble xgboost
Last synced: 21 Dec 2024
https://github.com/voxel51/voxelgpt
AI assistant that can query visual datasets, search the FiftyOne docs, and answer general computer vision questions
artificial-intelligence chatgpt computer-vision data-science deep-learning fiftyone langchain llm machine-learning openai python
Last synced: 09 Nov 2024
https://github.com/paddymul/buckaroo
Buckaroo - the data wrangling assistant for pandas. Quickly explore dataframes, and run pandas commands via a GUI. Works inside the jupyter notebook.
buckaroo data-science jupyter paddy pandas
Last synced: 21 Dec 2024
https://github.com/syamkakarla98/hyperspectral_image_analysis_simplified
The repository contains the implementation of different machine learning techniques such as classification and clustering on Hyperspectral and Satellite Imagery.
classification data-analysis data-science dimensionality-reduction hacktoberfest hyperspectral hyperspectral-image-classification hyperspectral-images indian-pines-dataset machine-learning matplotlib-pyplot pandas plotly python python3 remote-sensing satellite-imagery satellite-images tensorflow turorial
Last synced: 17 Dec 2024
https://github.com/Yu-Group/covid19-severity-prediction
Extensive and accessible COVID-19 data + forecasting for counties and hospitals. 📈
coronavirus coronavirus-tracking county-health-data county-level covid-19 covid-19-data covid-19-data-analysis data-analysis data-science epidemic-model forecasting outbreak outbreak-severity python3 response4life risk-assessment risk-modelling statistics ventilator visualization
Last synced: 27 Nov 2024
https://github.com/koalaverse/homlr
Supplementary material for Hands-On Machine Learning with R, an applied book covering the fundamentals of machine learning with R.
data-science machine-learning r supervised-learning unsupervised-learning
Last synced: 19 Nov 2024
https://github.com/lgalke/vec4ir
Word Embeddings for Information Retrieval
data-science embedding-models embeddings evaluation information-retrieval natural-language-processing nlp retrieval-model similarity-scoring word-embeddings
Last synced: 11 Nov 2024
https://github.com/bgruening/docker-galaxy-stable
:whale::bar_chart::books: Docker Images tracking the stable Galaxy releases.
data-science docker-image galaxy galaxyproject science
Last synced: 09 Nov 2024
https://github.com/Benjamin-Lee/deep-rules
Ten Quick Tips for Deep Learning in Biology
bioinformatics biology computational-biology data-science deep-learning genomics machine-learning manubot manuscript
Last synced: 12 Nov 2024
https://github.com/bgruening/docker-galaxy
:whale::bar_chart::books: Docker Images tracking the stable Galaxy releases.
data-science docker-image galaxy galaxyproject science
Last synced: 23 Dec 2024
https://github.com/project-codeflare/codeflare
Simplifying the definition and execution, scaling and deployment of pipelines on the cloud.
automl data-science hyperparameter-optimization machine-learning pipelines ray sklearn workflows
Last synced: 21 Dec 2024
https://github.com/mukeshmithrakumar/book_list
Python, Machine Learning, Deep Learning and Data Science Books
algorithms books data-science deep-learning free machine-learning python
Last synced: 18 Dec 2024
https://github.com/xlang-ai/ds-1000
[ICML 2023] Data and code release for the paper "DS-1000: A Natural and Reliable Benchmark for Data Science Code Generation".
benchmark code-generation data-science large-language-models semantic-parsing
Last synced: 18 Dec 2024
https://github.com/Minyus/pipelinex
PipelineX: Python package to build ML pipelines for experimentation with Kedro, MLflow, and more
data-engineering data-science deep-learning experimentation machine-learning pipeline
Last synced: 29 Oct 2024
https://github.com/blockchain-etl/public-datasets
The list of public blockchain datasets in BigQuery
bitcoin blockchain blockchain-analytics crypto cryptocurrency data-analytics data-engineering data-science dogecoin ethereum gcp google-bigquery google-cloud google-cloud-platform on-chain-analysis polygon solana web3
Last synced: 18 Dec 2024
https://github.com/neurodata/hyppo
Python package for multivariate hypothesis testing
data-science hacktoberfest hypothesis-testing independence ksample-testing python
Last synced: 22 Dec 2024
https://github.com/nickslevine/zebras
Data analysis library for JavaScript built with Ramda
data-analysis data-science functional-programming javascript pandas ramda
Last synced: 07 Nov 2024
https://github.com/analysiscenter/radio
RadIO is a library for data science research of computed tomography imaging
computed-tomography data-science deep-learning machine-learning medical-imaging neural-networks tensorflow
Last synced: 27 Nov 2024
https://github.com/vertica/verticapy
VerticaPy is a Python library that exposes scikit-like functionality for conducting data science projects on data stored in Vertica, taking advantage of Vertica's speed and built-in analytics and machine learning capabilities.
big-data data-science data-visualization machine-learning preparation python python-library vertica
Last synced: 20 Dec 2024
https://github.com/chasedehan/boostaroota
A fast xgboost feature selection algorithm
algorithm boruta data-science datascience datascientist dimension-reduction feature-selection machine-learning machine-learning-algorithms machinelearning xgboost xgboost-algorithm
Last synced: 23 Dec 2024
https://github.com/scicloj/scicloj.ml
A Clojure machine learning library
classification clojure clustering data-pipeline data-science experiment-tracking hyperparameter-optimization machine-learning nlp regression scicloj
Last synced: 21 Dec 2024