Projects in Awesome Lists tagged with large-scale
A curated list of projects in awesome lists tagged with large-scale.
https://github.com/hpcaitech/colossalai
Making large AI models cheaper, faster and more accessible
ai big-model data-parallelism deep-learning distributed-computing foundation-models heterogeneous-training hpc inference large-scale model-parallelism pipeline-parallelism
Last synced: 09 Sep 2025
https://github.com/hpcaitech/ColossalAI
Making large AI models cheaper, faster and more accessible
ai big-model data-parallelism deep-learning distributed-computing foundation-models heterogeneous-training hpc inference large-scale model-parallelism pipeline-parallelism
Last synced: 19 Mar 2025
https://github.com/paddlepaddle/parl
A high-performance distributed training framework for Reinforcement Learning
large-scale parallelization reinforcement-learning
Last synced: 14 May 2025
https://github.com/PaddlePaddle/PARL
A high-performance distributed training framework for Reinforcement Learning
large-scale parallelization reinforcement-learning
Last synced: 28 Mar 2025
https://github.com/detectrecog/ccpd
[ECCV 2018] CCPD: a diverse and well-annotated dataset for license plate detection and recognition
ccpd dataset detection large-scale plate-detection recognition
Last synced: 15 May 2025
https://github.com/camel-ai/oasis
🏝️ OASIS: Open Agent Social Interaction Simulations with One Million Agents. https://oasis.camel-ai.org
agent-based-framework agent-based-simulation ai-societies deep-learning large-language-models large-scale llm-agents multi-agent-systems natural-language-processing
Last synced: 14 May 2025
https://github.com/loicland/superpoint_graph
Large-scale Point Cloud Semantic Segmentation with Superpoint Graphs
clustering large-scale lidar partition ply-files point-cloud pytorch segmentation semantic semantic-segmentation superpoint-graphs
Last synced: 07 May 2025
https://github.com/afair/postgresql_cursor
ActiveRecord PostgreSQL Adapter extension for using a cursor to return a large result set
activerecord batch cursor for-update large-scale postgresql postgresql-cursor ruby ruby-gem
Last synced: 25 Mar 2025
https://github.com/qingyonghu/sensaturban
🔥Urban-scale point cloud dataset (CVPR 2021 & IJCV 2022)
benchmark city-modeling dataset large-scale photogrammetry pointcloud urban-scale
Last synced: 05 Apr 2025
https://github.com/QingyongHu/SensatUrban
🔥Urban-scale point cloud dataset (CVPR 2021 & IJCV 2022)
benchmark city-modeling dataset large-scale photogrammetry pointcloud urban-scale
Last synced: 20 Mar 2025
https://github.com/paddlepaddle/paddlefleetx
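PaddlePaddle large-model development suite, providing end-to-end development toolchains for large language models, cross-modal large models, biocomputing large models, and other domains.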
benchmark cloud data-parallelism distributed-algorithm elastic fleet-api large-scale lightning model-parallelism paddlecloud paddlepaddle pipeline-parallelism pretraining self-supervised-learning unsupervised-learning
Last synced: 13 Apr 2025
https://github.com/Oneflow-Inc/libai
LiBai(李白): A Toolbox for Large-Scale Distributed Parallel Training
data-parallelism deep-learning distributed-training large-scale model-parallelism nlp oneflow pipeline-parallelism self-supervised-learning transformer vision-transformer
Last synced: 09 May 2025
https://github.com/oneflow-inc/libai
LiBai(李白): A Toolbox for Large-Scale Distributed Parallel Training
data-parallelism deep-learning distributed-training large-scale model-parallelism nlp oneflow pipeline-parallelism self-supervised-learning transformer vision-transformer
Last synced: 08 Apr 2025
https://github.com/qingyonghu/spinnet
[CVPR 2021] SpinNet: Learning a General Surface Descriptor for 3D Point Cloud Registration
3dmatch descriptor generalization kitti large-scale pointcloud pytorch-implementation registration
Last synced: 18 Jul 2025
https://github.com/BlueBrain/Brayns
Visualizer for large-scale and interactive ray-tracing of neurons
brain embree interactive ispc-compiler json-rpc large-scale neurons neuroscience ospray pathtracing photorealistic-based-rendering python raytracing realtime-rendering visualisation volume-rendering websockets
Last synced: 29 Mar 2025
https://github.com/bluebrain/brayns
Visualizer for large-scale and interactive ray-tracing of neurons
brain embree interactive ispc-compiler json-rpc large-scale neurons neuroscience ospray pathtracing photorealistic-based-rendering python raytracing realtime-rendering visualisation volume-rendering websockets
Last synced: 13 Apr 2025
https://github.com/QingyongHu/SpinNet
[CVPR 2021] SpinNet: Learning a General Surface Descriptor for 3D Point Cloud Registration
3dmatch descriptor generalization kitti large-scale pointcloud pytorch-implementation registration
Last synced: 20 Mar 2025
https://github.com/ps-wiki/best-of-ps
🏆 A weekly updated ranked list of popular open-source libraries and tools for Power System Analysis.
best-of best-of-list co-simulation dae differential-algebraic-equations large-scale ode optimization optimizer ordinary-differential-equations power power-grids power-system power-system-analysis power-system-simulation powerflow simulation
Last synced: 01 Apr 2026
https://github.com/igeligel/vuex-feature-scoped-structure
📈 Feature scoped Vuex modules to have a better organization of business logic code inside Vuex modules based on Large-scale Vuex application structures @3yourmind
blogpost container containers large-scale modules vue vue2 vuejs2 vuex
Last synced: 15 May 2025
https://github.com/simeonradivoev/gpu-planetary-rendering
GPU atmospheric scattering and planet generation in Unity 3D
atmosphere charp compute-shader graphics hlsl large-scale planet postprocessing shader unity unity3d
Last synced: 26 Apr 2025
https://github.com/llnl/librom
Model reduction library with an emphasis on large scale parallelism and linear subspace methods
large-scale math-physics model-reduction modeling parallel-computing reduced-order-models scientific simulation subspace-learning
Last synced: 07 Apr 2025
https://github.com/simeonradivoev/GPU-Planetary-Rendering
GPU atmospheric scattering and planet generation in Unity 3D
atmosphere charp compute-shader graphics hlsl large-scale planet postprocessing shader unity unity3d
Last synced: 25 Apr 2025
https://github.com/med-air/Endo-FM
[MICCAI'23] Foundation Model for Endoscopy Video Analysis via Large-scale Self-supervised Pre-train
endoscopy foundation-model large-scale miccai2023 pre-train self-supervised video
Last synced: 16 Mar 2025
https://github.com/igeligel/vuex-namespaced-module-structure
📈 A Vue.js project powered by Vuex namespaced modules in a simple structure based on Large-scale Vuex application structures
blogpost large-scale modules vue vue2 vuejs2 vuex
Last synced: 15 May 2025
https://github.com/paddlepaddle/plsc
Paddle Large Scale Classification Tools, supporting ArcFace, CosFace, PartialFC, and Data Parallel + Model Parallel. Models include ResNet, ViT, Swin, DeiT, CaiT, FaceViT, MoCo, MAE, ConvMAE, CAE.
arcface cait convmae cosface data-parallel deit distributed-training face-recognition facevit hight-speed large-scale mae moco-v3 model-parallel paddle paddlepaddle partial-fc resnet swin-transformer vit
Last synced: 05 Mar 2026
https://github.com/BioDynaMo/biodynamo
BioDynaMo is a high-performance and modular, agent-based simulation platform.
agent-based agent-based-framework agent-based-modelling biology cancer epidemiology high-performance large-scale modular-design neuroscience parallel simulation
Last synced: 21 Jul 2025
https://github.com/skylab-tech/ffhqr-dataset
FFHQR -- the first large-scale retouching dataset for computer vision research.
computer-vision dataset deep-learning high-resolution large-scale retouching
Last synced: 25 Jan 2026
https://github.com/igeligel/vuex-simple-structure
📈 A repository showcasing a simple Vuex store inside a Vue.js application based on Large-scale Vuex application structures @3yourmind
blogpost large-scale simple vue vue2 vuejs2 vuex
Last synced: 17 Mar 2026
https://github.com/cgtuebingen/pointcloud-viewer
Efficient Large-Scale Point-Cloud Viewer based on OpenGL
gles glut large-scale opengl point-cloud qt scientific-visualization viewer visualization
Last synced: 19 Mar 2025
https://github.com/xingchensong/touchnet
A native-PyTorch library for large scale M-LLM (text/audio) training with tp/cp/dp/pp.
audio large-scale mllm pytorch text
Last synced: 27 Oct 2025
https://github.com/juliafirstorder/structuredoptimization.jl
Structured optimization in Julia
convex-optimization large-scale non-convex non-smooth optimization-algorithms
Last synced: 24 Feb 2025
https://github.com/aims-umich/neorl
NeuroEvolution Optimization with Reinforcement Learning
evolutionary-algorithms large-scale neuroevolution optimization-algorithms reinforcement-learning
Last synced: 17 Jan 2026
https://github.com/shhossain/facedb
A package designed for efficient face recognition across extensive photo collections, optimized for large-scale processing.
face-detection face-recognition large-scale
Last synced: 07 May 2025
https://github.com/astorfi/large-scale-ai-blueprint
A comprehensive guide designed to empower readers with advanced strategies and practical insights for developing, optimizing, and deploying scalable AI models in real-world applications.
deep-learning large-language-models large-scale large-scale-ai large-scale-machine-learning llms machinel-learning production-ml
Last synced: 09 Feb 2026
https://github.com/astorfi/Large-Scale-AI-Blueprint
A comprehensive guide designed to empower readers with advanced strategies and practical insights for developing, optimizing, and deploying scalable AI models in real-world applications.
deep-learning large-language-models large-scale large-scale-ai large-scale-machine-learning llms machinel-learning production-ml
Last synced: 13 Jul 2025
https://github.com/openearth/glofrim
Globally Applicable Framework for Integrated Hydrological-Hydrodynamic Modelling (GLOFRIM)
coupling hydrodynamics hydrology large-scale python
Last synced: 01 Feb 2026
https://github.com/tokee/juxta
Generates large collages of images using OpenSeadragon
collage image-processing large-scale mosaic-images openseadragon tile-generator twitter-image
Last synced: 10 Oct 2025
https://github.com/MORLab/sssMOR
sssMOR - Sparse State-Space and Model Order Reduction Toolbox
dynamical-systems large-scale matlab-toolbox model-order-reduction model-reduction state-space
Last synced: 21 Nov 2025
https://github.com/fuyb1992/es_pandas
Read, write and update large scale pandas DataFrame with Elasticsearch
elasticsearch large-scale pandas
Last synced: 14 Jan 2026
https://github.com/lcsb-biocore/gigasom.jl
Huge-scale, high-performance flow cytometry clustering in Julia
artifical-neural-network artificial-intelligence clustering clustering-methods cytof cytometry flow-cytometry huge-scale immunology large-scale mass-cytometry neural-networks self-organizing-map som
Last synced: 11 Apr 2025
https://github.com/8Ginette8/gbif.range
An R package to generate species range maps based on ecoregions and a user-friendly GBIF wrapper
accepted-names ecoregions environmental-classification filtering flags gbif-api hard-limit iucn-red-list large-scale macroecology occurrence-records r-package range-maps species-distribution species-observations synonyms taxonomy
Last synced: 25 Nov 2025
https://github.com/kul-optec/AbstractOperators.jl
Abstract operators for large scale optimization in Julia
automatic-differentiation back-propagation derivatives julia-language large-scale optimization
Last synced: 04 May 2025
https://github.com/kul-optec/abstractoperators.jl
Abstract operators for large scale optimization in Julia
automatic-differentiation back-propagation derivatives julia-language large-scale optimization
Last synced: 02 Jan 2026
https://github.com/ad-freiburg/completesearch
Search engine for semi-structured data (text and structured data) that provides all kinds of intelligent search features (keyword search, autocompletion, faceted search, error-tolerant search, synonym search, semantic search) very efficiently also on very large data.
autocompletion large-scale search-engine
Last synced: 24 Jun 2025
https://github.com/shujiahuang/basevar
This is the official development repository for BaseVar, which calls variants for large-scale ultra low-pass (<1.0x) WGS data, especially NIPT data
basevar bioinformatics cython genomics large-scale ngs nipt python
Last synced: 06 Apr 2025
https://github.com/doublechaintech/daas-with-github-actions
A low-code learning project run with GitHub Actions
ant-design knowledge-graph large-scale low-code-development-platform mysql redis
Last synced: 23 Feb 2026
https://github.com/asigalov61/Euterpe
[DEPRECATED] Multi-Instrumental Music Transformer trained on 12GB/400k MIDIs
euterpe euterpea large-scale large-scale-machine-learning midi multi-instrumental muse music music-ai music-ai-architectures music-composition music-generation music-transformer sota
Last synced: 11 Jan 2026
https://github.com/open-edge-platform/annflux
A research tool for exploring and annotating large datasets with Active Learning
active-learning annotation classification clustering data-exploration large-scale machine-learning multilabel-clustering
Last synced: 08 Feb 2026
https://github.com/bimk/large-scale-multi-objective-optimization
Large-scale multi-objective optimization related papers
large-scale multi-objective-optimization
Last synced: 29 Jan 2026
https://github.com/henrikbengtsson/aroma.affymetrix
🔬 R package: Analysis of Large Affymetrix Microarray Data Sets
affymetrix analysis copy-number dna expression hpc large-scale microarray notebook package r reproducibility rna
Last synced: 10 Apr 2025
https://github.com/haddocking/haddock-runner
Run large scale HADDOCK simulations using multiple input molecules in different scenarios
benchmark bioinformatics haddock high-performance-computing large-scale structural-biology utrecht-university
Last synced: 21 Jan 2026
https://github.com/yc-cui/extend-gan
[GRSL 2024] Reconstruction of Large-Scale Missing Data in Remote Sensing Images Using Extend-GAN
deep-learning generative-adversarial-network large-scale pytorch reconstruction remote-sensing
Last synced: 13 Apr 2025
https://github.com/bogdandm/redis_bulk_cleaner
Deletes keys from Redis database in bulk.
cleaner large-scale redis utils
Last synced: 14 Jan 2026
https://github.com/hiejulia/jhipster-distributed-system-computing
Jhipster in distributed computing
design design-patterns distributed-computing distributed-systems event-sourcing hpc-applications kubernetes kubernetes-cluster large-scale load-balancer metrics queue queues scalable scaling worker
Last synced: 28 Apr 2025
https://github.com/anders617/bigchat
BigChat is an internet wide chat supported by a chrome extension. Brought to you by Big Think
Last synced: 24 Oct 2025
https://github.com/softwaiter/largeimageview
Android component for displaying very long images - LargeImageView
android image large large-scale largeimageview
Last synced: 06 May 2025
https://github.com/aresio/lassie
LASSIE is a black-box deterministic simulator of large-scale mass-action biochemical systems
biochemical cuda gpu-computing large-scale mass-action simulation stiff
Last synced: 21 Feb 2026
https://github.com/nadundesilva/mesh-manager
Kubernetes Operator for managing microservices at scale
controllers kubernetes large-scale microservices
Last synced: 28 Apr 2026
https://github.com/tokee/nrtmosaic
Fast mosaic creation from pre-processed source images
iipimage image-processing large-scale mosaic realtime-visualization
Last synced: 10 Oct 2025
https://github.com/tetiewastaken/hello-world
A repository that features "Hello, World!" in over 80 programming languages
example-code hello-world large-scale programming-languages
Last synced: 30 Mar 2025
https://github.com/mbuzdalov/orthant-search
Orthant search is "one code to rule them all" for many operations in multiobjective evolutionary algorithms.
evolutionary-computation large-scale multiobjective-optimization
Last synced: 29 Jan 2026
https://github.com/garciparedes/ringer
Large-scale data structures hosted on the file-system
buffer circular-buffer filesystem in-memory large-scale python python-package python3
Last synced: 16 Feb 2026
https://github.com/bl33h/computersimulation
Measure the time for large-scale operations and contribute to the exploration of computational efficiency.
computational-efficiency computer-simulation large-scale python
Last synced: 14 Mar 2025
https://github.com/mobileguruvn/android-clean-architecture-multi-repo
A modular Android project demonstrating Clean Architecture with feature-based isolation, multi-repo structure, and independent versioning — built for scalable enterprise apps like Uber, Grab, or Shopify.
android-application clean-architecture jetpack-compose large-scale maven versioning
Last synced: 05 May 2026
https://github.com/ghazaleze/investigate_classifiers
The point is to investigate three types of classifiers (linear classifier with feature selection, linear classifier without feature selection, and a non-linear classifier) in a setting where precision and interpretability may matter.
feature-selection l2-regularization large-scale lasso-regression machine-learning random-forest-classifier support-vector-machine svm-classifier
Last synced: 06 Aug 2025
https://github.com/edisedis777/pyspark-ml-features
A PySpark implementation of 6 lesser-known Scikit-Learn features optimized for Azure Databricks. This project translates powerful machine learning techniques from Scikit-Learn into PySpark's distributed computing framework.
azure databricks databricks-notebooks large-scale machine-learning pyspark python scikit-learn scikitlearn-machine-learning
Last synced: 13 Apr 2026
https://github.com/uqatkit/ls-mcmc
A light-weight library for large-scale Markov Chain Monte Carlo sampling
bayesian-inference large-scale markov-chain-monte-carlo
Last synced: 14 Jan 2026
https://github.com/imsheridan/xdeeprank
An eXtensible Package of Deep Learning based Ranking Models for Large-scale Industrial Recommender System with Tensorflow
click-through-rate ctr deep-learning industrial large-scale ranking recommendation recommender-system tensorflow
Last synced: 30 Apr 2026
https://github.com/tokee/panoscripts
Scripts and notes for making panoramas from multiple images
Last synced: 10 Oct 2025
https://github.com/william1nguyen/dblab
Lab for database sharding concepts and large-scale data techniques
data-sharding database large-scale
Last synced: 28 Sep 2025
https://github.com/ajaikumarvs/certiflyte
Large scale certificate management. WinForms Implementation.
certificate-generation large-scale
Last synced: 31 Oct 2025
https://github.com/rambod-rahmani/stocksim
Project repository for the Large Scale and Multi-Structured Databases course.
aggregation-pipleline cassandra cassandra-cluster cassandra-cql cassandra-database database java-8 jfreechart large-scale large-scale-clustering mongodb mongodb-java-driver mongodb-replica-set stock-portfolios stock-price-prediction stocks yahoo-finance yahoo-finance-api
Last synced: 13 Apr 2026
https://github.com/ajaikumarvs/certiflytewf
Large scale certificate management. WinForms Implementation.
certificate-generation large-scale
Last synced: 24 Oct 2025
https://github.com/henrikbengtsson/aroma.seq
🔬 R package: aroma.seq: High-Throughput Sequence Analysis using the Aroma Framework
bioinformatics distributed-computing framework genomics hpc ht-seq large-scale package parallel r
Last synced: 28 Mar 2025
https://github.com/claireyurev/large-scale-media-converter
Command-line large-scale media converter for offline use.
large-scale media media-converter
Last synced: 16 Feb 2026
https://github.com/giangzuzana/liga
in-memory LInear large-scale GAzetteers - Standalone contain extraction tools for Natural Language Processing
gazetteer gn in-memory large-scale natural-language-processing sd
Last synced: 05 Jun 2026
https://github.com/ncouture/cast-magic
Find the world's publicly exposed Chromecasts.
asynchronous asynchronous-programming chromecast fast large-scale scanner web-scanner zmap
Last synced: 21 Feb 2026
https://github.com/neonwatty/quick_batch
ultra simple command line tool for docker-scaling batch processing
containerization data-science deep-learning docker large-scale machine-learning python
Last synced: 02 May 2026