An open API service indexing awesome lists of open source software.

Projects in Awesome Lists tagged with large-scale

A curated list of projects in awesome lists tagged with large-scale .

https://github.com/paddlepaddle/parl

A high-performance distributed training framework for Reinforcement Learning

large-scale parallelization reinforcement-learning

Last synced: 14 May 2025

https://github.com/PaddlePaddle/PARL

A high-performance distributed training framework for Reinforcement Learning

large-scale parallelization reinforcement-learning

Last synced: 28 Mar 2025

https://github.com/detectrecog/ccpd

[ECCV 2018] CCPD: a diverse and well-annotated dataset for license plate detection and recognition

ccpd dataset detection large-scale plate-detection recognition

Last synced: 15 May 2025

https://github.com/camel-ai/oasis

🏝️ OASIS: Open Agent Social Interaction Simulations with One Million Agents. https://oasis.camel-ai.org

agent-based-framework agent-based-simulation ai-societies deep-learning large-language-models large-scale llm-agents multi-agent-systems natural-language-processing

Last synced: 14 May 2025

https://github.com/afair/postgresql_cursor

ActiveRecord PostgreSQL Adapter extension for using a cursor to return a large result set

activerecord batch cursor for-update large-scale postgresql postgresql-cursor ruby ruby-gem

Last synced: 25 Mar 2025

https://github.com/qingyonghu/sensaturban

🔥Urban-scale point cloud dataset (CVPR 2021 & IJCV 2022)

benchmark city-modeling dataset large-scale photogrammetry pointcloud urban-scale

Last synced: 05 Apr 2025

https://github.com/QingyongHu/SensatUrban

🔥Urban-scale point cloud dataset (CVPR 2021 & IJCV 2022)

benchmark city-modeling dataset large-scale photogrammetry pointcloud urban-scale

Last synced: 20 Mar 2025

https://github.com/paddlepaddle/paddlefleetx

飞桨大模型开发套件,提供大语言模型、跨模态大模型、生物计算大模型等领域的全流程开发工具链。

benchmark cloud data-parallelism distributed-algorithm elastic fleet-api large-scale lightning model-parallelism paddlecloud paddlepaddle pipeline-parallelism pretraining self-supervised-learning unsupervised-learning

Last synced: 13 Apr 2025

https://github.com/qingyonghu/spinnet

[CVPR 2021] SpinNet: Learning a General Surface Descriptor for 3D Point Cloud Registration

3dmatch descriptor generalization kitti large-scale pointcloud pytorch-implementation registration

Last synced: 18 Jul 2025

https://github.com/QingyongHu/SpinNet

[CVPR 2021] SpinNet: Learning a General Surface Descriptor for 3D Point Cloud Registration

3dmatch descriptor generalization kitti large-scale pointcloud pytorch-implementation registration

Last synced: 20 Mar 2025

https://github.com/igeligel/vuex-feature-scoped-structure

:chart_with_upwards_trend: Feature scoped Vuex modules to have a better organization of business logic code inside Vuex modules based on Large-scale Vuex application structures @3yourmind

blogpost container containers large-scale modules vue vue2 vuejs2 vuex

Last synced: 15 May 2025

https://github.com/simeonradivoev/gpu-planetary-rendering

GPU atmosphertic scattering and planet generation in Unity 3D

atmosphere charp compute-shader graphics hlsl large-scale planet postprocessing shader unity unity3d

Last synced: 26 Apr 2025

https://github.com/llnl/librom

Model reduction library with an emphasis on large scale parallelism and linear subspace methods

large-scale math-physics model-reduction modeling parallel-computing reduced-order-models scientific simulation subspace-learning

Last synced: 07 Apr 2025

https://github.com/simeonradivoev/GPU-Planetary-Rendering

GPU atmosphertic scattering and planet generation in Unity 3D

atmosphere charp compute-shader graphics hlsl large-scale planet postprocessing shader unity unity3d

Last synced: 25 Apr 2025

https://github.com/med-air/Endo-FM

[MICCAI'23] Foundation Model for Endoscopy Video Analysis via Large-scale Self-supervised Pre-train

endoscopy foundation-model large-scale miccai2023 pre-train self-supervised video

Last synced: 16 Mar 2025

https://github.com/igeligel/vuex-namespaced-module-structure

:chart_with_upwards_trend: A Vue.js project powered by Vuex namespaced modules in a simple structure based on Large-scale Vuex application structures

blogpost large-scale modules vue vue2 vuejs2 vuex

Last synced: 15 May 2025

https://github.com/paddlepaddle/plsc

Paddle Large Scale Classification Tools,supports ArcFace, CosFace, PartialFC, Data Parallel + Model Parallel. Model includes ResNet, ViT, Swin, DeiT, CaiT, FaceViT, MoCo, MAE, ConvMAE, CAE.

arcface cait convmae cosface data-parallel deit distributed-training face-recognition facevit hight-speed large-scale mae moco-v3 model-parallel paddle paddlepaddle partial-fc resnet swin-transformer vit

Last synced: 05 Mar 2026

https://github.com/skylab-tech/ffhqr-dataset

FFHQR -- the first large-scale retouching dataset for computer vision research.

computer-vision dataset deep-learning high-resolution large-scale retouching

Last synced: 25 Jan 2026

https://github.com/igeligel/vuex-simple-structure

:chart_with_upwards_trend: A repository showcasing a simple Vuex store inside a Vue.js application based on Large-scale Vuex application structures @3yourmind

blogpost large-scale simple vue vue2 vuejs2 vuex

Last synced: 17 Mar 2026

https://github.com/cgtuebingen/pointcloud-viewer

Efficient Large-Scale Point-Cloud Viewer based on OpenGL

gles glut large-scale opengl point-cloud qt scientific-visualization viewer visualization

Last synced: 19 Mar 2025

https://github.com/xingchensong/touchnet

A native-PyTorch library for large scale M-LLM (text/audio) training with tp/cp/dp/pp.

audio large-scale mllm pytorch text

Last synced: 27 Oct 2025

https://github.com/aims-umich/neorl

NeuroEvolution Optimization with Reinforcement Learning

evolutionary-algorithms large-scale neuroevolution optimization-algorithms reinforcement-learning

Last synced: 17 Jan 2026

https://github.com/shhossain/facedb

A package designed for efficient face recognition across extensive photo collections, optimized for large-scale processing.

face-detection face-recognition large-scale

Last synced: 07 May 2025

https://github.com/astorfi/large-scale-ai-blueprint

A comprehensive guide designed to empower readers with advanced strategies and practical insights for developing, optimizing, and deploying scalable AI models in real-world applications.

deep-learning large-language-models large-scale large-scale-ai large-scale-machine-learning llms machinel-learning production-ml

Last synced: 09 Feb 2026

https://github.com/astorfi/Large-Scale-AI-Blueprint

A comprehensive guide designed to empower readers with advanced strategies and practical insights for developing, optimizing, and deploying scalable AI models in real-world applications.

deep-learning large-language-models large-scale large-scale-ai large-scale-machine-learning llms machinel-learning production-ml

Last synced: 13 Jul 2025

https://github.com/openearth/glofrim

Globally Applicable Framework for Integrated Hydrological-Hydrodynamic Modelling (GLOFRIM)

coupling hydrodynamics hydrology large-scale python

Last synced: 01 Feb 2026

https://github.com/tokee/juxta

Generates large collages of images using OpenSeadragon

collage image-processing large-scale mosaic-images openseadragon tile-generator twitter-image

Last synced: 10 Oct 2025

https://github.com/MORLab/sssMOR

sssMOR - Sparse State-Space and Model Order Reduction Toolbox

dynamical-systems large-scale matlab-toolbox model-order-reduction model-reduction state-space

Last synced: 21 Nov 2025

https://github.com/fuyb1992/es_pandas

Read, write and update large scale pandas DataFrame with Elasticsearch

elasticsearch large-scale pandas

Last synced: 14 Jan 2026

https://github.com/ad-freiburg/completesearch

Search engine for semi-structured data (text and structured data) that provides all kinds of intelligent search features (keyword search, autocompletion, faceted search, error-tolerant search, synonym search, semantic search) very efficiently also on very large data.

autocompletion large-scale search-engine

Last synced: 24 Jun 2025

https://github.com/shujiahuang/basevar

This is the official development repository for BaseVar, which call variants for large-scale ultra low-pass (<1.0x) WGS data, especially for NIPT data

basevar bioinformatics cython genomics large-scale ngs nipt python

Last synced: 06 Apr 2025

https://github.com/open-edge-platform/annflux

A research tool for exploring and annotating large datasets with Active Learning

active-learning annotation classification clustering data-exploration large-scale machine-learning multilabel-clustering

Last synced: 08 Feb 2026

https://github.com/bimk/large-scale-multi-objective-optimization

Large-scale multi-objective optimization related papers

large-scale multi-objective-optimization

Last synced: 29 Jan 2026

https://github.com/henrikbengtsson/aroma.affymetrix

🔬 R package: Analysis of Large Affymetrix Microarray Data Sets

affymetrix analysis copy-number dna expression hpc large-scale microarray notebook package r reproducibility rna

Last synced: 10 Apr 2025

https://github.com/haddocking/haddock-runner

Run large scale HADDOCK simulations using multiple input molecules in different scenarios

benchmark bioinformatics haddock high-performance-computing large-scale structural-biology utrecht-university

Last synced: 21 Jan 2026

https://github.com/yc-cui/extend-gan

[GRSL 2024] Reconstruction of Large-Scale Missing Data in Remote Sensing Images Using Extend-GAN

deep-learning generative-adversarial-network large-scale pytorch reconstruction remote-sensing

Last synced: 13 Apr 2025

https://github.com/bogdandm/redis_bulk_cleaner

Deletes keys from Redis database in bulk.

cleaner large-scale redis utils

Last synced: 14 Jan 2026

https://github.com/anders617/bigchat

BigChat is an internet wide chat supported by a chrome extension. Brought to you by Big Think

chat internet large-scale

Last synced: 24 Oct 2025

https://github.com/softwaiter/largeimageview

Android超长图片显示组件 - LargeImageView

android image large large-scale largeimageview

Last synced: 06 May 2025

https://github.com/aresio/lassie

LASSIE is a black-box deterministic simulator of large-scale mass-action biochemical systems

biochemical cuda gpu-computing large-scale mass-action simulation stiff

Last synced: 21 Feb 2026

https://github.com/nadundesilva/mesh-manager

Kubernetes Operator for managing microservices at scale

controllers kubernetes large-scale microservices

Last synced: 28 Apr 2026

https://github.com/tokee/nrtmosaic

Fast mosaic creation from pre-processed source images

iipimage image-processing large-scale mosaic realtime-visualization

Last synced: 10 Oct 2025

https://github.com/tetiewastaken/hello-world

A repository that features "Hello, World!" in over 80+ programming languages

example-code hello-world large-scale programming-languages

Last synced: 30 Mar 2025

https://github.com/mbuzdalov/orthant-search

Orthant search is "one code to rule them all" for many operations in multiobjective evolutionary algorithms.

evolutionary-computation large-scale multiobjective-optimization

Last synced: 29 Jan 2026

https://github.com/garciparedes/ringer

Large-scale data structures hosted on the file-system

buffer circular-buffer filesystem in-memory large-scale python python-package python3

Last synced: 16 Feb 2026

https://github.com/bl33h/computersimulation

Measure the time for large-scale operations and contribute to the exploration of computational efficiency.

computational-efficiency computer-simulation large-scale python

Last synced: 14 Mar 2025

https://github.com/mobileguruvn/android-clean-architecture-multi-repo

A modular Android project demonstrating Clean Architecture with feature-based isolation, multi-repo structure, and independent versioning — built for scalable enterprise apps like Uber, Grab, or Shopify.

android-application clean-architecture jetpack-compose large-scale maven versioning

Last synced: 05 May 2026

https://github.com/ghazaleze/investigate_classifiers

The point is to investigate three types of classifiers (linear classifier with feature selection, linear classifier without feature selection, and a non-linear classifier) in a setting where precision and interpretability may matter.

feature-selection l2-regularization large-scale lasso-regression machine-learning random-forest-classifier support-vector-machine svm-classifier

Last synced: 06 Aug 2025

https://github.com/edisedis777/pyspark-ml-features

A PySpark implementation of 6 lesser-known Scikit-Learn features optimized for Azure Databricks. This project translates powerful machine learning techniques from Scikit-Learn into PySpark's distributed computing framework.

azure databricks databricks-notebooks large-scale machine-learning pyspark python scikit-learn scikitlearn-machine-learning

Last synced: 13 Apr 2026

https://github.com/uqatkit/ls-mcmc

A light-weight library for large-scale Markov Chain Monte Carlo sampling

bayesian-inference large-scale markov-chain-monte-carlo

Last synced: 14 Jan 2026

https://github.com/imsheridan/xdeeprank

An eXtensible Package of Deep Learning based Ranking Models for Large-scale Industrial Recommender System with Tensorflow

click-through-rate ctr deep-learning industrial large-scale ranking recommendation recommender-system tensorflow

Last synced: 30 Apr 2026

https://github.com/tokee/panoscripts

Scripts and notes for making panoramas from multiple images

large-scale panorama

Last synced: 10 Oct 2025

https://github.com/william1nguyen/dblab

Lab for database sharding concepts and large-scale data techniques

data-sharding database large-scale

Last synced: 28 Sep 2025

https://github.com/ajaikumarvs/certiflyte

Large scale certificate management. WinForms Implementation.

certificate-generation large-scale

Last synced: 31 Oct 2025

https://github.com/ajaikumarvs/certiflytewf

Large scale certificate management. WinForms Implementation.

certificate-generation large-scale

Last synced: 24 Oct 2025

https://github.com/henrikbengtsson/aroma.seq

🔬 R package: aroma.seq: High-Throughput Sequence Analysis using the Aroma Framework

bioinformatics distributed-computing framework genomics hpc ht-seq large-scale package parallel r

Last synced: 28 Mar 2025

https://github.com/claireyurev/large-scale-media-converter

Command-line large-scale media converter for offline use.

large-scale media media-converter

Last synced: 16 Feb 2026

https://github.com/giangzuzana/liga

in-memory LInear large-scale GAzetteers - Standalone contain extraction tools for Natural Language Proces

gazetteer gn in-memory large-scale natural-language-processing sd

Last synced: 05 Jun 2026

https://github.com/neonwatty/quick_batch

ultra simple command line tool for docker-scaling batch processing

containerization data-science deep-learning docker large-scale machine-learning python

Last synced: 02 May 2026