Ecosyste.ms: Awesome
An open API service indexing awesome lists of open source software.
awesome-huggingface
đ¤ A list of wonderful open-source projects & applications integrated with Hugging Face libraries.
https://github.com/huggingface/awesome-huggingface
Last synced: 2 days ago
JSON representation
-
â Cloud
- Amazon SageMaker - Making it easier than ever to train Hugging Face Transformer models in Amazon SageMaker.
-
đą Hardware
-
đ¤ Official Libraries
- transformers - State-of-the-art natural language processing for Jax, PyTorch and TensorFlow.
- datasets - The largest hub of ready-to-use NLP datasets for ML models with fast, easy-to-use and efficient data manipulation tools.
- tokenizers - Fast state-of-the-Art tokenizers optimized for research and production.
- knockknock - Get notified when your training ends with only two additional lines of code.
- nn_pruning - Prune a model while finetuning or training.
- huggingface_hub - Client library to download and publish models and other files on the huggingface.co hub.
- autonlp - Train state-of-the-art natural language processing models and deploy them in a scalable environment automatically.
- accelerate - A simple way to train and use PyTorch models with multi-GPU, TPU, mixed-precision.
-
đŠâđĢ Tutorials
- transformers-tutorials - Tutorials for applying multiple models on real-world datasets.
-
𧰠NLP Toolkits
- AllenNLP - An open-source NLP research library.
- Graph4NLP - Enabling easy use of Graph Neural Networks for NLP.
- Obsei - A low-code AI workflow automation tool and performs various NLP tasks in the workflow pipeline.
- Lightning Transformers - Transformers with PyTorch Lightning interface.
- Adapter Transformers - Extension to the Transformers library, integrating adapters into state-of-the-art language models.
- Trapper - State-of-the-art NLP through transformer models in a modular design and consistent APIs.
- Flair - A very simple framework for state-of-the-art NLP.
-
đĨĄ Text Representation
- DensePhrases - Learning dense representations of phrases at scale.
- Sentence Transformers - Widely used encoders computing dense vector representations for sentences, paragraphs, and images.
- WhiteningBERT - An easy unsupervised sentence embedding approach with whitening.
- SimCSE - State-of-the-art sentence embedding with contrastive learning.
-
âī¸ Inference Engines
- TurboTransformers - An inference engine for transformers with fast C++ API.
- FasterTransformer - A script and recipe to run the highly optimized transformer-based encoder and decoder component on NVIDIA GPUs.
- lightseq - A high performance inference library for sequence processing and generation implemented in CUDA.
- FastSeq - Efficient implementation of popular sequence models (e.g., Bart, ProphetNet) for text generation, summarization, translation tasks etc.
-
đ Model Scalability
- fairscale - Implements ZeRO protocol as well. [Integrated with HF Trainer](https://huggingface.co/docs/transformers/master/main_classes/trainer#fairscale).
- ColossalAI - A Unified Deep Learning System for Large-Scale Parallel Training (1D, 2D, 2.5D, 3D and sequence parallelism, and ZeRO protocol).
- Parallelformers - A library for model parallel deployment.
- OSLO - A library that supports various features to help you train large-scale models.
- Deepspeed - Deepspeed-ZeRO - scales any model size with zero to no changes to the model. [Integrated with HF Trainer](https://huggingface.co/docs/transformers/master/main_classes/deepspeed).
-
đī¸ Model Compression/Acceleration
- BERT-of-Theseus - Compressing BERT by progressively replacing the components of the original BERT.
- torchdistill - PyTorch-based modular, configuration-driven framework for knowledge distillation.
- TextBrewer - State-of-the-art distillation methods to compress language models.
-
đšī¸ Adversarial Attack
- TextAttack - A Python framework for adversarial attacks, data augmentation, and model training in NLP.
- TextFlint - A unified multilingual robustness evaluation toolkit for NLP.
- OpenAttack - An open-source textual adversarial attack toolkit.
-
đ Style Transfer
- Styleformer - A neural language style transfer framework to transfer text smoothly between styles.
- ConSERT - A contrastive framework for self-supervised sentence representation transfer.
-
đĸ Sentiment Analysis
- conv-emotion - Implementation of different architectures for emotion recognition in conversations.
-
đē Translation
- EasyNMT - Easy-to-use, state-of-the-art translation library and Docker images based on HF Transformers.
- dl-translate - A deep learning-based translation library based on HF Transformers.
-
đ Knowledge and Entity
- PURE - Entity and relation extraction from text.
-
đ Speech
- s3prl - A self-supervised speech pre-training and representation learning toolkit.
- speechbrain - A PyTorch-based speech toolkit.
-
đ¤¯ Multi-modality
- ViLT - A vision-and-language transformer Without convolution or region supervision.
-
đ Recommender Systems
- Transformers4Rec - A flexible and efficient library powered by Transformers for sequential and session-based recommendations.
-
âī¸ Evaluation
- Jury - Easy to use tool for evaluating NLP model outputs, spesifically for NLG (Natural Language Generation), offering various automated text-to-text metrics.
- Spotlight - Interactively explore your HF dataset with one line of code. Use model results (e.g. embeddings, predictions) to understand critical data segments and model failure modes.
-
đ Neural Search
- ColBERT - A fast and accurate retrieval model, enabling scalable BERT-based search over large text collections in tens of milliseconds.
- Jina Integration - Jina integration of Hugging Face Accelerated API.
- (text2vec) - transformers.html) - Weaviate integration of Hugging Face Transformers.
-
đ¤ Reinforcement Learning
- trl - Fine-tune transformers using Proximal Policy Optimization (PPO) to align with human preferences.
-
â Question Answering
- Haystack - End-to-end framework for developing and deploying question-answering systems in the wild.
-
đ Grammatical Error Correction
- Gramformer - A framework for detecting, highlighting and correcting grammatical errors on natural language text.
Programming Languages
Categories
đ¤ Official Libraries
8
𧰠NLP Toolkits
7
đ Model Scalability
5
đĨĄ Text Representation
4
âī¸ Inference Engines
4
đšī¸ Adversarial Attack
3
đ Neural Search
3
đī¸ Model Compression/Acceleration
3
đą Hardware
2
âī¸ Evaluation
2
đ Style Transfer
2
đ Speech
2
đē Translation
2
đĸ Sentiment Analysis
1
â Cloud
1
đ Recommender Systems
1
đ
Grammatical Error Correction
1
đ¤¯ Multi-modality
1
â Question Answering
1
đŠâđĢ Tutorials
1
đ Knowledge and Entity
1
đ¤ Reinforcement Learning
1
Sub Categories
Keywords
nlp
22
pytorch
19
natural-language-processing
17
deep-learning
11
machine-learning
11
bert
10
transformers
7
python
7
transformer
7
huggingface
4
language-model
4
inference
4
gpt
3
natural-language-understanding
3
computer-vision
3
pretrained-models
3
model-parallelism
2
pipeline-parallelism
2
glue
2
data-parallelism
2
adversarial-attacks
2
data-augmentation
2
sentiment-analysis
2
audio
2
gpu
2
pytorch-transformers
2
model-hub
2
datasets
2
seq2seq
2
tensorflow
2
speech-recognition
2
ai
1
text-transformations
1
training
1
sampling
1
multilingual-nmt
1
diverse-decoding
1
cuda
1
beam-search
1
bart
1
accelerate
1
roberta
1
machine-translation
1
transformation
1
huggingface-transformers
1
adversarial-example
1
robustness-analysis
1
subpopulation
1
model-robustness
1
attack
1