Projects in Awesome Lists tagged with large-data
A curated list of projects in awesome lists tagged with large-data .
https://github.com/hosseinmoein/dataframe
C++ DataFrame for statistical, financial, and ML analysis in modern C++
ai cpp data-analysis data-science dataframe financial-data-analysis financial-engineering heterogeneous-data large-data machine-learning multidimensional-data numerical-analysis pandas polars statistical statistical-analysis tensor tensorboard trading-algorithms trading-strategies
Last synced: 04 Sep 2025
https://github.com/hosseinmoein/DataFrame
C++ DataFrame for statistical, Financial, and ML analysis -- in modern C++ using native types and contiguous memory storage
ai cpp data-analysis data-science dataframe financial-data-analysis financial-engineering heterogeneous-data large-data machine-learning multidimensional-data numerical-analysis pandas polars statistical statistical-analysis tensor tensorboard trading-algorithms trading-strategies
Last synced: 15 Mar 2025
https://github.com/bakdata/kafka-large-message-serde
A Kafka Serde that reads and writes records from and to Blob storage (S3, Azure, Google) transparently.
azure-blob-storage deserialization google-cloud-storage kafka kafka-streams large-data s3 serde serialization simple-storage-service
Last synced: 10 Apr 2025
https://github.com/randomfractals/tabular-data-viewer
Tabular Data Viewer 🀄 VSCode extension for viewing very large local and remote CSV and TSV data files with Tabulator Table, Perspective Datagrid and D3FC Chart Views 📊📈
charts csv d3fc data data-packages datapackage dsv flat-data large-data perspective remote-data tabular tabulator tsv view viewer vscode
Last synced: 11 Apr 2025
https://github.com/andrpavlou/files-dbms
File Search-Sorting Algorithms for DBMS
c-programming-language database dbms files hash-table heap heap-file large-data merge-sort search-algorithm sorting-algorithms
Last synced: 25 Jan 2026
https://github.com/inoueakimitsu/milwrap
Wrapping single instance learning algorithms for fitting them to data for multiple instance learning
large-data machine-learning multi-class-classification multiple-instance-learning python sklearn
Last synced: 07 May 2025
https://github.com/pat8901/hpc-data-parser
Analyzes Grid Engine log files converting them into a format suitable for time series analysis using Grafana.
Last synced: 16 Jul 2025