Projects in Awesome Lists tagged with high-dimensional-data
A curated list of projects in awesome lists tagged with high-dimensional-data.
https://github.com/nvidia/minkowskiengine
Minkowski Engine is an auto-diff neural network library for high-dimensional sparse tensors
3d-convolutional-network 3d-vision 4d-convolutional-neural-network auto-differentiation computer-vision convolutional-neural-networks cuda deep-learning high-dimensional-data high-dimensional-inference minkowski-engine neural-network pytorch semantic-segmentation space-time sparse-convolution sparse-tensor-network sparse-tensors spatio-temporal-analysis trilateral-filter
Last synced: 14 May 2025
https://nvidia.github.io/MinkowskiEngine/
Minkowski Engine is an auto-diff neural network library for high-dimensional sparse tensors
3d-convolutional-network 3d-vision 4d-convolutional-neural-network auto-differentiation computer-vision convolutional-neural-networks cuda deep-learning high-dimensional-data high-dimensional-inference minkowski-engine neural-network pytorch semantic-segmentation space-time sparse-convolution sparse-tensor-network sparse-tensors spatio-temporal-analysis trilateral-filter
Last synced: 08 May 2025
https://github.com/contextlab/hypertools
A Python toolbox for gaining geometric insights into high-dimensional data
data-visualization data-wrangling high-dimensional-data python text-vectorization time-series topic-modeling visualization
Last synced: 29 Jan 2026
https://github.com/vdaas/vald
Vald. A Highly Scalable Distributed Vector Search Engine
anng approximate-nearest-neighbor-search cloud cloud-native distributed-systems golang high-dimensional-data high-performance image-search image-search-engine kubernetes microservices nearest-neighbor-search ngt similarity-search vald vector vector-search-engine
Last synced: 13 May 2025
https://github.com/abess-team/abess
Fast Best-Subset Selection Library
best-subset-selection classification-algorithm cox-regression feature-selection high-dimensional-data linear-regression logistic-regression machine-learning multitask-learning ordinal-regression poisson-regression polynomial-algorithm principal-component-analysis python r robust-principal-component-analysis scikit-learn sparse-principal-component-analysis sure-independence-screening
Last synced: 15 May 2025
https://github.com/ramhiser/datamicroarray
A collection of small-sample, high-dimensional microarray data sets to assess machine-learning algorithms and models.
cancer colon-cancer high-dimensional-data machine-learning r
Last synced: 22 Jun 2025
https://github.com/gdkrmr/dimred
A Framework for Dimensionality Reduction in R
dimensionality-reduction framework high-dimensional-data manifold-learning quality-control r visualization
Last synced: 06 Apr 2025
https://github.com/sergiocorreia/ppmlhdfe
Poisson pseudo-likelihood regression with multiple levels of fixed effects
fixed-effects high-dimensional-data poisson-regression separation stata
Last synced: 24 Jan 2026
https://github.com/daleroberts/hdmedians
High-dimensional medians (medoid, geometric median, etc.). Fast implementations in Python.
high-dimensional-data machine-learning median python statistics
Last synced: 10 Apr 2025
https://github.com/great-northern-diver/loon
A Toolkit for Interactive Statistical Data Visualization
data-analysis data-science data-visualization exploratory-analysis exploratory-data-analysis high-dimensional-data interactive-graphics interactive-visualizations loon python statistical-analysis statistical-graphics statistics tcl-extension tk
Last synced: 19 Feb 2026
https://github.com/varir/scikit-hubness
A Python package for hubness analysis and high-dimensional data mining
approximate-nearest-neighbor-search data-mining data-science high-dimensional-data hubness machine-learning nearest-neighbor-search
Last synced: 17 Aug 2025
https://github.com/nanxstats/hdnom
🔮 Benchmarking and visualization toolkit for penalized Cox models
benchmark high-dimensional-data linear-regression nomogram-visualization penalized-cox-models survival-analysis
Last synced: 06 May 2025
https://github.com/lightonai/newma
Implementation of NEWMA: a new method for scalable model-free online change-point detection
change-point-detection hardware-acceleration high-dimensional-data machine-learning paper python timeseries
Last synced: 26 Aug 2025
https://github.com/epigen/unsupervised_analysis
A general purpose Snakemake workflow and MrBiomics module to perform unsupervised analyses (dimensionality reduction & cluster analysis) and visualizations of high-dimensional data.
cluster-analysis cluster-validation clustering clustering-algorithm clustree data-science data-visualization densmap dimensionality-reduction heatmap high-dimensional-data leiden-algorithm pca principal-component-analysis snakemake umap unsupervised-learning visualization workflow
Last synced: 15 Apr 2025
https://github.com/nlesc/dive
An interactive 3D web viewer that displays up to a million data points on one screen. Provides interaction for viewing high-dimensional data that has previously been embedded in 3D or 2D. Based on graphosaurus.js and three.js. For a Linux release of a complete embedding and visualization pipeline, see https://github.com/sonjageorgievska/Embed-Dive.
3d-data embedded-data high-dimensional-data interactive-visualizations manifold-learning non-linear-dimensionality-reduction web-application
Last synced: 17 Jun 2025
https://github.com/joshengels/flinng
A fast high dimensional near neighbor search algorithm based on group testing and locality sensitive hashing
group-testing high-dimensional-data locality-sensitive-hashing nearest-neighbor-search
Last synced: 04 Nov 2025
https://github.com/ofai/hub-toolbox-python3
Hubness analysis and removal functions
data-mining high-dimensional-data hubness machine-learning
Last synced: 11 Oct 2025
https://github.com/KChen-lab/SCMarker
Marker gene selection from scRNA-seq data
feature-selection high-dimensional-data single-cell-rna-seq statistical-methods
Last synced: 09 Apr 2025
https://github.com/ivan-pi/fortran-flann
Fortran bindings to the FLANN library for performing fast approximate nearest neighbor searches in high dimensional spaces.
approximate-nearest-neighbor-search hierarchical-clustering high-dimensional-data kdtree kmeans-clustering nearest-neighbor-search spatial-search
Last synced: 07 Jan 2026
https://github.com/ramhiser/sparsediscrim
Sparse and Regularized Discriminant Analysis in R
classifier high-dimensional-data machine-learning r
Last synced: 23 Jun 2025
https://github.com/nanxstats/msaenet
🧲 Multi-step adaptive estimation for reducing false positive selection in sparse regressions
false-positive-control high-dimensional-data linear-regression machine-learning variable-selection
Last synced: 22 Apr 2025
https://github.com/acidjazz/json-browse
jQuery plugin to easily browse and highlight your JSON
high-dimensional-data jquery-plugin json json-api json-browse
Last synced: 05 Mar 2026
https://github.com/shu-hai/D-CCA
A Decomposition-based Canonical Correlation Analysis for High-dimensional Datasets (JASA-20 paper)
data-fusion data-integration high-dimensional-data integrative-analysis multiblock-structures multiview
Last synced: 13 Apr 2025
https://github.com/longhaisk/htlr
Bayesian Logistic Regression with Hyper-LASSO priors
bayesian classification high-dimensional-data machine-learning mcmc
Last synced: 01 Mar 2026
https://github.com/astro-informatics/quantifai
PyTorch-based radio-interferometric imaging reconstruction package with scalable Bayesian uncertainty quantification relying on data-driven (learned) priors
high-dimensional-data machine-learning pytorch radio-interferometry uncertainty-quantification
Last synced: 10 Oct 2025
https://github.com/lirongwu/dcv
Code for TNNLS paper "Deep Clustering and Visualization for End-to-End High Dimensional Data analysis"
clustering geometric-deep-learning high-dimensional-data manifold-learning visualization
Last synced: 13 Apr 2025
https://github.com/llnl/fpp
Function preserving projection (FPP), a linear projection technique for capturing interpretable patterns of high-dimensional functions
data-viz dimensionality-reduction discriminant-analysis high-dimensional-data projection supervised-dimensionality-reduction visualization
Last synced: 29 Apr 2025
https://github.com/statphysandml/pystatplottools
Easy evaluation and plotting of statistical data and high-dimensional distributions in python - Fast generation, loading and storing of custom datasets.
contour-plots custom-datasets distributions expectation-values high-dimensional-data
Last synced: 15 Jul 2025
https://github.com/nanxstats/ohpl
📈 Ordered Homogeneity Pursuit Lasso for Group Variable Selection
chemometrics high-dimensional-data homogeneity-pursuit lasso partial-least-squares-regression spectroscopy variable-selection
Last synced: 22 Apr 2025
https://github.com/inseefrlab/grandedim
Code accompanying the working paper "L'économétrie en grande dimension" (High-Dimensional Econometrics)
data-science econometrics high-dimensional-data publication r statistics
Last synced: 13 Jun 2025
https://github.com/mlindsk/molic
Multivariate Outlier Detection in Contingency Tables
categorical-data contingency-tables decomposable-graphical-models high-dimensional-data outlier-detection
Last synced: 22 Oct 2025
https://github.com/otryakhin-dmitry/global-minimum-variance-portfolio
High dimensional shrinkage optimal portfolios in R
financial-mathematics high-dimensional-data portfolio-management shrinkage-estimators
Last synced: 22 Oct 2025
https://github.com/nanxstats/bcpm-msaenet
Solution for the precisionFDA Brain Cancer Predictive Modeling Challenge using msaenet
brain-cancer high-dimensional-data machine-learning precisionfda variable-selection
Last synced: 22 Apr 2025
https://github.com/numbats/cassowaryr
Compute scagnostics on your scatterplots
data-science data-visualization eda high-dimensional-data multivariate
Last synced: 19 Feb 2026
https://github.com/bellet/hdsl
High-Dimensional Similarity Learning
high-dimensional-data machine-learning metric-learning similarity-learning sparse-data
Last synced: 03 Apr 2025
https://github.com/insightsengineering/unicate
Univariate conditional average treatment effect estimation for predictive biomarker discovery
biomarkers clinical-trials high-dimensional-data nonparametrics r treatment-effects
Last synced: 17 Jul 2025
https://github.com/varir/copac
COPAC clustering
cluster-analysis clustering clustering-algorithm data-mining high-dimensional-data machine-learning
Last synced: 15 Jul 2025
https://github.com/insightsengineering/unihtee
Tools for uncovering treatment effect modifiers in high-dimensional data.
heterogeneous-treatment-effects high-dimensional-data nonparametrics targeted-learning variable-importance
Last synced: 05 Jul 2025
https://github.com/erik-roberts/GIMBL-Vis
a GUI-based Interactive Multi-dimensional extensiBLe Visualization toolbox for Matlab
graphics high-dimensional-data interactive matlab matlab-toolbox multi-dimensional simulations visualization
Last synced: 05 Apr 2026
https://github.com/llnl/nddav
N-Dimensional Data Analysis and Visualization
data-analysis data-viz high-dimensional-data topological-data-analysis visual-analytics visualization
Last synced: 29 Apr 2025
https://github.com/krystynagrzesiak/gslope
Sparse Gaussian graphical models with Sorted L-One Penalized Estimation
graphical-models high-dimensional-data regularization-methods sparse-modeling
Last synced: 12 May 2026
https://github.com/cfmtech/optimal_cleaning_for_singular_values_of_cross-covariance_matrices
Python scripts from the paper "Optimal cleaning for singular values of cross-covariance matrices" by Florent Benaych-Georges, Jean-Philippe Bouchaud, and Marc Potters (see https://arxiv.org/abs/1901.05543)
cross-correlation cross-correlation-processing denoising high-dimensional-data high-dimensional-probability high-dimensional-statistics probability probability-statistics probability-theory random-matrices random-matrix random-matrix-theory rotationally-invariant-estimator statistics
Last synced: 12 Apr 2025
https://github.com/llnl/hdtopology
High-dimensional topological data analysis library for NDDAV
analysis cpp data-analysis data-viz high-dimensional-data topological-data-analysis visualization
Last synced: 29 Apr 2025
https://github.com/rwoldford/loon
A Toolkit for Interactive Statistical Data Visualization
data-visualization exploratory-data-analysis high-dimension-visualization high-dimensional-data interactive-graphics interactive-visualizations r r-package r-programming r-stats statistical-graphics statistical-learning statistics tcl-applications tcl-extension tcl-tk tcltk
Last synced: 30 Jul 2025
https://github.com/carpentries-incubator/high-dimensional-analysis-in-python
Exploring and Modeling High-Dimensional Data
clustering high-dimensional-data interpretable-machine-learning lesson pca pre-alpha regression statistics visualization
Last synced: 02 Sep 2025
https://github.com/jrenstat/spinbayes
Semi-Parametric Gene-Environment Interaction via Bayesian Variable Selection
bayesian-variable-selection gene-environment-interactions high-dimensional-data r-package semi-parametric-modeling
Last synced: 10 Mar 2026
https://github.com/cbhihe/smartcity_high-dim-statistical-learning
Multivariate statistical learning applied to high dimensional data
high-dimensional-data multivariate-analysis mva r smartcity statistical-modeling
Last synced: 23 Jun 2026
https://github.com/mkomod/mcmc_ss_surv
MCMC for spike and slab survival models
high-dimensional-data mcmc spike-and-slab-prior survival-analysis
Last synced: 23 May 2026
https://github.com/nanxstats/hdnom-app
Shiny app for benchmarking and visualization of penalized Cox models
high-dimensional-data penalized-cox-models survival-analysis
Last synced: 13 Apr 2025
https://github.com/happma/hrm
R package providing statistical tests for high-dimensional repeated measures or split-plot designs.
high-dimensional-data longitudinal-data r rstats
Last synced: 31 May 2026
https://github.com/florianwoelki/index-simulation-tool
An index simulation tool for sparse high-dimensional vector data.
high-dimensional-data indexing-algorithms indexing-querying sparse-data
Last synced: 27 Apr 2026