Projects in Awesome Lists tagged with data-summary
A curated list of projects in awesome lists tagged with data-summary .
https://github.com/ekzhu/datasketch
MinHash, LSH, LSH Forest, Weighted MinHash, HyperLogLog, HyperLogLog++, LSH Ensemble and HNSW
data-sketches data-summary hnsw hyperloglog jaccard-similarity locality-sensitive-hashing lsh lsh-ensemble lsh-forest minhash python search top-k weighted-quantiles
Last synced: 13 May 2025
https://github.com/bigmlcom/histogram
Streaming Histograms for Clojure/Java
clojure data-summary histogram streaming
Last synced: 04 Apr 2025
https://github.com/nhsdigital/sde_summary_notebooks
Notebooks provided by the Wranglers for users to quickly gain insights on datasets inside the Secure Data Environment (SDE)
data-analysis data-linkage data-quality data-summary metrics statistics
Last synced: 09 Apr 2025
https://github.com/ashenfad/space-saving
The "SpaceSaving" stream counting algorithm for Clojure
clojure data-summary stream-counting-algorithm
Last synced: 28 Mar 2025
https://github.com/teragrep/dpf_02
Teragrep Result Aggregation for Apache Spark
aggregation data-aggregation data-science data-summarization data-summary data-visualisation data-visualization teragrep
Last synced: 22 Apr 2025