Ecosyste.ms: Awesome
An open API service indexing awesome lists of open source software.
https://github.com/arian-askari/ChatGPT-RetrievalQA
A dataset for training/evaluating Question Answering Retrieval models on ChatGPT responses with the possibility to training/evaluating on real human responses.
ai chatgpt chatgpt-information-retrieval chatgpt-ir data-augmentation dataset deep-learning gpt-3 gpt2 gpt3 information-retrieval information-retrieval-chatgpt ir ir-chatgpt machine-learning nlp openai python sequence-to-sequence text-retrieval
Last synced: 03 Jul 2024
![](https://github.com/arian-askari.png)
https://github.com/THUDM/GRAND
Source code and dataset of the NeurIPS 2020 paper "Graph Random Neural Network for Semi-Supervised Learning on Graphs"
data-augmentation gnn graph-neural-networks graphs neurips-2020 semi-supervised-learning
Last synced: 28 Jun 2024
![](https://github.com/THUDM.png)
https://github.com/szacho/augmix-tf
Implementation of AugMix (2020) in TensorFlow
Last synced: 24 Jun 2024
![](https://github.com/szacho.png)
https://github.com/zhanlaoban/EDA_NLP_for_Chinese
An implement of the paper of EDA for Chinese corpus.中文语料的EDA数据增强工具。NLP数据增强。论文阅读笔记。
chinese chinese-data-augmentation data-augmentation easy-data-augmentation eda text-classification
Last synced: 23 Jun 2024
![](https://github.com/zhanlaoban.png)
https://github.com/beyondguo/genius
💡GENIUS – generating text using sketches! A strong text generation & data augmentation tool.
conditional-text-generation data-augmentation keywords-to-text named-entities-recognition sketch-to-text text-augmentation text-classificaiton text-generation
Last synced: 23 Jun 2024
![](https://github.com/beyondguo.png)
https://github.com/textflint/textflint
Unified Multilingual Robustness Evaluation Toolkit for Natural Language Processing
adversarial-samples attack data-augmentation model-robustness robustness-analysis subpopulation text-augmentation text-transformations transformation
Last synced: 22 Jun 2024
![](https://github.com/textflint.png)
https://github.com/doubleZ0108/Data-Augmentation
General Data Augmentation Algorithms for Object Detection(esp. Yolo)
data-augmentation dataset object-detection yolo
Last synced: 15 Jun 2024
![](https://github.com/doubleZ0108.png)
https://github.com/NVIDIA/DALI
A GPU-accelerated library containing highly optimized building blocks and an execution engine for data processing to accelerate deep learning training and inference applications.
audio-processing data-augmentation data-processing deep-learning fast-data-pipeline gpu gpu-tensorflow image-augmentation image-processing machine-learning mxnet neural-network paddle python pytorch
Last synced: 12 Jun 2024
![](https://github.com/NVIDIA.png)
https://github.com/arundo/tsaug
A Python package for time series augmentation
audio data-augmentation deep-learning time-series
Last synced: 07 Jun 2024
![](https://github.com/arundo.png)
https://github.com/LirongWu/awesome-graph-self-supervised-learning
Code for TKDE paper "Self-supervised learning on graphs: Contrastive, generative, or predictive"
data-augmentation deep-learning graph-neural-networks machine-learning pre-training pretext-task representation-learning self-supervised-learning transfer-learning unsupervised-learning
Last synced: 07 Jun 2024
![](https://github.com/LirongWu.png)
https://github.com/firmai/deltapy
DeltaPy - Tabular Data Augmentation (by @firmai)
augmentation data-augmentation data-science feature-engineering feature-extraction finance machine-learning tabular-data time-series
Last synced: 07 Jun 2024
![](https://github.com/firmai.png)
https://github.com/Garfield-kh/PoseTriplet
[CVPR 2022] PoseTriplet: Co-evolving 3D Human Pose Estimation, Imitation, and Hallucination under Self-supervision (Oral)
cvpr2022 data-augmentation motion-generation motion-imitation pose-estimation
Last synced: 07 Jun 2024
![](https://github.com/Garfield-kh.png)
https://github.com/Paperspace/DataAugmentationForObjectDetection
Data Augmentation For Object Detection
bounding-box data-augmentation deep-learning imagine-augmentation object-detection opencv
Last synced: 07 Jun 2024
![](https://github.com/Paperspace.png)
https://github.com/Lallapallooza/fast-audiomentations
⚡ Blazing fast audio augmentation in Python, powered by GPU for high-efficiency processing in machine learning and audio analysis tasks.
audio audio-augmentation audio-data-augmentation audio-effects augmentations data-augmentation dsp gpu machine-learning python pytorch triton
Last synced: 02 Jun 2024
![](https://github.com/Lallapallooza.png)
https://github.com/cuge1995/PointCutMix
our code for paper 'PointCutMix: Regularization Strategy for Point Cloud Classification', Neurocomputing, 2022
3d-deep-learning data-augmentation deep-learning point-cloud
Last synced: 29 May 2024
![](https://github.com/cuge1995.png)
https://github.com/akiomik/pilgram
A python library for instagram filters
blend-modes css-filters data-augmentation image-augmentation image-blending image-processing instagram instagram-filters pillow
Last synced: 28 May 2024
![](https://github.com/akiomik.png)
https://github.com/codebox/image_augmentor
Data augmentation tool for images
data-augmentation image-augmentor machine-learning
Last synced: 20 May 2024
![](https://github.com/codebox.png)
https://github.com/fepegar/torchio
Medical imaging toolkit for deep learning
augmentation data-augmentation deep-learning machine-learning medical-image-analysis medical-image-computing medical-image-processing medical-images medical-imaging-datasets medical-imaging-with-deep-learning python pytorch
Last synced: 17 May 2024
![](https://github.com/fepegar.png)
https://github.com/sshuair/torchsat
🔥TorchSat 🌏 is an open-source deep learning framework for satellite imagery analysis based on PyTorch.
classification data-augmentation deep-learning pytorch remote-sensing satellite satellite-imagery semantic-segmentation torchvision
Last synced: 16 May 2024
![](https://github.com/sshuair.png)
https://github.com/AgaMiko/data-augmentation-review
List of useful data augmentation resources. You will find here some not common techniques, libraries, links to GitHub repos, papers, and others.
audio-augmentation augmentation-policies autoaugment data-augmentation data-augmentations data-generation data-synthesis generative-adversarial-network graph-data-augmentation image-augmentation machine-learning nlp-augmentation review style-transfer survey
Last synced: 15 May 2024
![](https://github.com/AgaMiko.png)
https://github.com/JasonZhang156/awesome-mixed-sample-data-augmentation
A collection of awesome things about mixed sample data augmentation
data-augmentation mixed-sample
Last synced: 14 May 2024
![](https://github.com/JasonZhang156.png)
https://github.com/moskomule/dda
Differentiable Data Augmentation Library
data-augmentation differentiable-data-augmentation faster-auto-augment pytorch
Last synced: 14 May 2024
![](https://github.com/moskomule.png)
https://github.com/Oulu-IMEDS/solt
Streaming over lightweight data transformations
data-augmentation deep-learning image-recognition image-segmentation landmark-detection
Last synced: 14 May 2024
![](https://github.com/Oulu-IMEDS.png)
https://github.com/Westlake-AI/Awesome-Mixup
Awesome List of Mixup Augmentation Papers for Visual Representation Learning
awesome-list awesome-mixup computer-vision data-augmentation deep-learning image-classification mixup self-supervised-learning
Last synced: 14 May 2024
![](https://github.com/Westlake-AI.png)
https://github.com/webdataset/webdataset
A high-performance Python-based I/O system for large (and small) deep learning problems, with strong support for PyTorch.
data-augmentation deep-learning pytorch webdataset webdataset-format
Last synced: 13 May 2024
![](https://github.com/webdataset.png)
https://github.com/styfeng/DataAug4NLP
Collection of papers and resources for data augmentation for NLP.
acl2021 artificial-intelligence data-augmentation deep-learning machine-learning natural-language-processing survey survey-paper text-classification transformers
Last synced: 13 May 2024
![](https://github.com/styfeng.png)
https://github.com/bcmi/Awesome-Few-Shot-Image-Generation
A curated list of papers, code and resources pertaining to few-shot image generation.
data-augmentation few-shot-generation few-shot-image-generation few-shot-learning
Last synced: 13 May 2024
![](https://github.com/bcmi.png)
https://github.com/amazon-science/mix-generation
MixGen: A New Multi-Modal Data Augmentation
data-augmentation data-efficiency multimodal pretraining vision-language
Last synced: 12 May 2024
![](https://github.com/amazon-science.png)
https://github.com/JunlinHan/CropMix
Code of CropMix: Sampling a Rich Input Distribution via Multi-Scale Cropping
computer-vision contrastive-learning data-augmentation deep-learning image-classification machine-learning masked-image-modeling representation-learning self-supervised-learning
Last synced: 12 May 2024
![](https://github.com/JunlinHan.png)
https://github.com/implus/RecursiveMix-pytorch
Official Codes and Pretrained Models for RecursiveMix
cutmix cutmix-mixup data-augmentation generalization history image-classification mixup object-detection recursive semantic-segmentation
Last synced: 12 May 2024
![](https://github.com/implus.png)
https://github.com/snu-mllab/Co-Mixup
Official PyTorch implementation of "Co-Mixup: Saliency Guided Joint Mixup with Supermodular Diversity" (ICLR'21 Oral)
Last synced: 12 May 2024
![](https://github.com/snu-mllab.png)
https://github.com/hammoudiproject/SuperpixelGridMasks
SuperpixelGridMasks is an approach for sensor-based data augmentation towards image classification tasks and so on.
artificial-intelligence cnn data-augmentation data-augmentations data-generation datascience deep-learning image-augmentation image-classification image-recognition imagery imaging machine-learning sensor-data training-dataset
Last synced: 12 May 2024
![](https://github.com/hammoudiproject.png)
https://github.com/dmey/synthia
📈 🐍 Multidimensional synthetic data generation with Copula and fPCA models in Python
augmentation climate copula data-augmentation data-generation data-generator data-modelling data-science dependency-analysis dependency-modeling finance fpca functional-data machine-learning oversampling principal-component-analysis statistics synthetic-data weather xarray
Last synced: 12 May 2024
![](https://github.com/dmey.png)
https://github.com/snorkel-team/snorkel
A system for quickly generating training data with weak supervision
ai data-augmentation data-science data-slicing labeling machine-learning python snorkel training-data weak-supervision
Last synced: 28 Apr 2024
![](https://github.com/snorkel-team.png)
https://github.com/bmcfee/muda
A library for augmenting annotated audio data
data-augmentation machine-learning music nyucds python
Last synced: 28 Apr 2024
![](https://github.com/bmcfee.png)
https://github.com/iver56/audiomentations
A Python library for audio data augmentation. Inspired by albumentations. Useful for machine learning.
audio audio-data-augmentation audio-effects augmentation data-augmentation deep-learning dsp machine-learning music python sound sound-processing
Last synced: 28 Apr 2024
![](https://github.com/iver56.png)
https://github.com/DemisEom/SpecAugment
A Implementation of SpecAugment with Tensorflow & Pytorch, introduced by Google Brain
data-augmentation python pytorch specaugment speech speech-recognition tensorflow
Last synced: 27 Apr 2024
![](https://github.com/DemisEom.png)
https://github.com/isarandi/synthetic-occlusion
Synthetic Occlusion Augmentation
computer-vision data-augmentation occlusion python synthetic-dataset-generation
Last synced: 26 Apr 2024
![](https://github.com/isarandi.png)
https://github.com/mahmoudnafifi/WB_color_augmenter
WB color augmenter improves the accuracy of image classification and image semantic segmentation methods by emulating different WB effects (ICCV 2019) [Python & Matlab].
cnn color-augmentation color-constancy color-correction computer-vision data-augmentation deep-learning deep-neural-network deeplearning iccv19 iccv2019 image-augmentation image-classification semantic-segmentation white-balance whitebalance
Last synced: 23 Apr 2024
![](https://github.com/mahmoudnafifi.png)
https://github.com/mratsim/Amazon-Forest-Computer-Vision
Amazon Forest Computer Vision: Satellite Image tagging code using PyTorch / Keras with lots of PyTorch tricks
computer-vision data-augmentation deep-learning kaggle kaggle-competition keras neural-network-example neural-networks pytorch transfer-learning
Last synced: 23 Apr 2024
![](https://github.com/mratsim.png)
https://github.com/snu-mllab/PuzzleMix
Official PyTorch implementation of "Puzzle Mix: Exploiting Saliency and Local Statistics for Optimal Mixup" (ICML'20)
Last synced: 23 Apr 2024
![](https://github.com/snu-mllab.png)
https://github.com/ZhaoJ9014/face.evoLVe
🔥🔥High-Performance Face Recognition Library on PaddlePaddle & PyTorch🔥🔥
artificial-intelligence computer-vision convolutional-neural-network data-augmentation deep-learning face-alignment face-detection face-landmark-detection face-recognition feature-extraction fine-tuning hard-negative-mining imbalanced-learning machine-learning model-training nus pytorch supervised-learning tencent transfer-learning
Last synced: 20 Apr 2024
![](https://github.com/ZhaoJ9014.png)
https://github.com/zhunzhong07/Random-Erasing
Random Erasing Data Augmentation. Experiments on CIFAR10, CIFAR100 and Fashion-MNIST
aaai2020 data-augmentation image-classification object-detection person-re-identification pytorch
Last synced: 19 Apr 2024
![](https://github.com/zhunzhong07.png)
https://github.com/attilanagy234/TreeSwap
Complimentary code for our paper TreeSwap: Data Augmentation for Machine Translation via Dependency Subtree Swapping (RANLP 2023)
data-augmentation neural-machine-translation
Last synced: 19 Apr 2024
![](https://github.com/attilanagy234.png)
https://github.com/YuliangXiu/MobilePose
Light-weight Single Person Pose Estimator
data-augmentation dataloader deep-learning deeppose dsntnn heatmap lightweight machine-learning mobile-device mobilenetv2 pose-estimation pytorch real-time realtime resnet-18 shufflenet shufflenet-v2 shufflenetv2 squeezenet
Last synced: 17 Apr 2024
![](https://github.com/YuliangXiu.png)
https://github.com/mlcommons/GaNDLF
A generalizable application framework for segmentation, regression, and classification using PyTorch
biomedical-image-processing classification clinical-workflow data-augmentation deep-learning framework machine-learning medical-image-analysis medical-imaging medical-informatics pytorch regression segmentation
Last synced: 16 Apr 2024
![](https://github.com/mlcommons.png)
https://github.com/ExplainableML/ACVC
Official PyTorch implementation of CVPRW 2022 paper "Attention Consistency on Visual Corruptions for Single-Source Domain Generalization"
computer-vision cvpr cvpr2022 cvprw cvprw2022 data-augmentation deep-learning domain-generalization l3d-ivu machine-learning pytorch single-source-domain-generalization
Last synced: 16 Apr 2024
![](https://github.com/ExplainableML.png)
https://github.com/sutd-visual-computing-group/dag-gans
Data Augmentation optimized for GAN
biggan cyclegan dag data-augmentation gan pytorch tensorflow
Last synced: 16 Apr 2024
![](https://github.com/sutd-visual-computing-group.png)
https://github.com/KentoNishi/Augmentation-for-LNL
[CVPR 2021] Code for "Augmentation Strategies for Learning with Noisy Labels".
augmentation-policies cifar10 cifar100 clothing1m cvpr cvpr2021 data-augmentation data-augmentation-strategies label-noise label-noise-robustness semi-supervised-learning
Last synced: 16 Apr 2024
![](https://github.com/KentoNishi.png)
https://github.com/vkit-x/vkit
Boosting Document Intelligence
chineseocr computer-vision data-augmentation data-synthesis deep-learning document-inteligence image-augmentation machine-learning ocr python text-detection text-detection-recognition text-recognition vkit vkit-x
Last synced: 16 Apr 2024
![](https://github.com/vkit-x.png)
https://github.com/denisyarats/drq
DrQ: Data regularized Q
actor-critic control data-augmentation deep-learning deep-reinforcement-learning dm-control drq gym model-free mujoco off-policy pixel python pytorch reinforcement-learning rl sac soft-actor-crit
Last synced: 13 Apr 2024
![](https://github.com/denisyarats.png)
https://github.com/senwu/dauphin
An uncertainty-based random sampling algorithm for data augmentation
Last synced: 11 Apr 2024
![](https://github.com/senwu.png)
https://github.com/AIoT-MLSys-Lab/DeepAA
[ICLR 2022] "Deep AutoAugment" by Yu Zheng, Zhi Zhang, Shen Yan, Mi Zhang
automl data-augmentation deep-learning
Last synced: 11 Apr 2024
![](https://github.com/AIoT-MLSys-Lab.png)
https://github.com/alexandervnikitin/tsgm
Generative modeling of synthetic time series data and time series augmentations
augmentations data-augmentation data-science datasets deep-learning generative-model keras machine-learning python synthetic-data tensorflow2 time-series vae
Last synced: 08 Apr 2024
![](https://github.com/AlexanderVNikitin.png)
https://github.com/Westlake-AI/openmixup
CAIRI Supervised, Semi- and Self-Supervised Visual Representation Learning Toolbox and Benchmark
automix awesome-list awesome-mim awesome-mixup benchmark contrastive-learning data-augmentation data-generation deep-learning image-classifcation imagenet machine-learning masked-image-modeling mixup pytorch self-supervised-learning semi-supervised-learning vision-transformer
Last synced: 05 Apr 2024
![](https://github.com/Westlake-AI.png)
https://github.com/victorca25/augmennt
Augmentations for Neural Networks. Implementation of Torchvision's transforms using OpenCV and additional augmentations for super-resolution, restoration and image to image translation.
anisotropic bsrgan computer-vision data-augmentation deblur denoise image-processing opencv pytorch real-esrgan super-resolution superpixels unprocessing-images
Last synced: 05 Apr 2024
![](https://github.com/victorca25.png)
https://github.com/asteroid-team/torch-audiomentations
Fast audio data augmentation in PyTorch. Inspired by audiomentations. Useful for deep learning.
audio audio-data-augmentation audio-effects augmentation data-augmentation deep-learning differentiable-data-augmentation dsp machine-learning music python pytorch sound sound-processing waveform
Last synced: 05 Apr 2024
![](https://github.com/asteroid-team.png)
https://github.com/goru001/inltk
Natural Language Toolkit for Indic Languages aims to provide out of the box support for various NLP tasks that an application developer might need
data-augmentation deep-learning indic-languages nlp pytorch sentence-embeddings sentence-encoding sentence-similarity word-embeddings
Last synced: 01 Apr 2024
![](https://github.com/goru001.png)
https://github.com/425776024/nlpcda
一键中文数据增强包 ; NLP数据增强、bert数据增强、EDA:pip install nlpcda
chinese-data-augmentation chinese-eda data-augmentation nlp nlpcda
Last synced: 29 Mar 2024
![](https://github.com/425776024.png)
https://github.com/arcelien/pba
Efficient Learning of Augmentation Policy Schedules
artificial-intelligence augmentation automated-machine-learning automl convolutional-neural-networks data-augmentation data-science deep-learning image-classification machine-learning python tensorflow
Last synced: 28 Mar 2024
![](https://github.com/arcelien.png)
https://github.com/shunk031/chainer-RICAP
Chainer implementation of the paper "RICAP: Random Image Cropping and Patching Data Augmentation for Deep CNNs" (http://proceedings.mlr.press/v95/takahashi18a.html)
acml-2018 chainer chainerv5 data-augmentation deep-learning
Last synced: 28 Mar 2024
![](https://github.com/shunk031.png)
https://github.com/artitw/text2text
Text2Text: Crosslingual NLP/G toolkit
backtranslation chatgpt cross-lingual data-augmentation embeddings information-retrieval levenshtein-distance llm multi-lingual natural-language-generation natural-language-processing nlp question-answering question-generation search summarization tf-idf tokenizer transformers translator
Last synced: 27 Mar 2024
![](https://github.com/artitw.png)
https://github.com/dogyoonlee/RSMix
[CVPR 2021 - Official] Rigid Subset Mix (RSMix): Regularization Strategy for Point Cloud via Rigidly Mixed Samples
3d 3dpointcloud data-augmentation deep-learning modelnet40 pointcloud regularization shape-classification
Last synced: 26 Mar 2024
![](https://github.com/dogyoonlee.png)
https://github.com/jiachens/ModelNet40-C
Repo for "Benchmarking Robustness of 3D Point Cloud Recognition against Common Corruptions" https://arxiv.org/abs/2201.12296
benchmark computer-vision corruption-robustness data-augmentation deep-learning ml-safety point-cloud-processing pytorch regularization robustness
Last synced: 26 Mar 2024
![](https://github.com/jiachens.png)
https://github.com/wjun0830/Localizable-Rotation
Official PyTorch Repository of "Tailoring Self-Supervision for Supervised Learning" (ECCV 2022 Paper)
data-augmentation deep-learning long-tailed-recognition model-robustness out-of-distribution-detection self-supervised-learning
Last synced: 25 Mar 2024
![](https://github.com/wjun0830.png)
https://github.com/hfawaz/aaltd18
Data augmentation using synthetic data for time series classification with deep residual networks
convolutional-neural-networks data-augmentation deep-learning dtw dynamic-time-warping time-series-classification
Last synced: 23 Mar 2024
![](https://github.com/hfawaz.png)
https://github.com/QData/TextAttack
TextAttack 🐙 is a Python framework for adversarial attacks, data augmentation, and model training in NLP https://textattack.readthedocs.io/en/master/
adversarial-attacks adversarial-examples adversarial-machine-learning data-augmentation machine-learning natural-language-processing nlp security
Last synced: 23 Mar 2024
![](https://github.com/QData.png)
https://hazyresearch.github.io/snorkel
A system for quickly generating training data with weak supervision
ai data-augmentation data-science data-slicing labeling machine-learning python snorkel training-data weak-supervision
Last synced: 22 Mar 2024
![](https://github.com/snorkel-team.png)
https://github.com/Shaoli-Huang/SnapMix
SnapMix: Semantically Proportional Mixing for Augmenting Fine-grained Data (AAAI 2021)
aaai2021 cutmix data-augmentation fine-grained-recognition mixup
Last synced: 17 Mar 2024
![](https://github.com/Shaoli-Huang.png)
https://github.com/visual-layer/fastdup
fastdup is a powerful free tool designed to rapidly extract valuable insights from your image & video datasets. Assisting you to increase your dataset images & labels quality and reduce your data operations costs at an unparalleled scale.
data-augmentation data-curation dataset deep-learning image image-analysis image-classfication image-classification image-duplicate-detection image-processing image-similarity machine-learning novelty-detection object-detection outlier-detection python visual-search visualization visualization-tools
Last synced: 17 Mar 2024
![](https://github.com/visual-layer.png)