Ecosyste.ms: Awesome

An open API service indexing awesome lists of open source software.

Awesome Lists | Featured Topics | Projects

awesome-huggingface

🤗 A list of wonderful open-source projects & applications integrated with Hugging Face libraries.
https://github.com/bobvanluijt/awesome-huggingface

Last synced: 4 days ago
JSON representation

  • 🧰 NLP Toolkits

    • Lightning Transformers - Transformers with PyTorch Lightning interface.
    • Adapter Transformers - Extension to the Transformers library, integrating adapters into state-of-the-art language models.
    • AllenNLP - An open-source NLP research library.
    • Graph4NLP - Enabling easy use of Graph Neural Networks for NLP.
    • Lightning Transformers - Transformers with PyTorch Lightning interface.
    • Adapter Transformers - Extension to the Transformers library, integrating adapters into state-of-the-art language models.
    • Obsei - A low-code AI workflow automation tool and performs various NLP tasks in the workflow pipeline.
  • đŸ—ē Translation

    • dl-translate - A deep learning-based translation library based on HF Transformers.
    • EasyNMT - Easy-to-use, state-of-the-art translation library and Docker images based on HF Transformers.
    • dl-translate - A deep learning-based translation library based on HF Transformers.
  • 🤗 Official Libraries

    • transformers - State-of-the-art natural language processing for Jax, PyTorch and TensorFlow.
    • tokenizers - Fast state-of-the-Art tokenizers optimized for research and production.
    • knockknock - Get notified when your training ends with only two additional lines of code.
    • accelerate - A simple way to train and use PyTorch models with multi-GPU, TPU, mixed-precision.
    • nn_pruning - Prune a model while finetuning or training.
    • huggingface_hub - Client library to download and publish models and other files on the huggingface.co hub.
    • autonlp - Train state-of-the-art natural language processing models and deploy them in a scalable environment automatically.
    • datasets - The largest hub of ready-to-use NLP datasets for ML models with fast, easy-to-use and efficient data manipulation tools.
  • 👩‍đŸĢ Tutorials

  • 🔁 Style Transfer

    • ConSERT - A contrastive framework for self-supervised sentence representation transfer.
    • Styleformer - A neural language style transfer framework to transfer text smoothly between styles.
  • đŸ’ĸ Sentiment Analysis

    • conv-emotion - Implementation of different architectures for emotion recognition in conversations.
    • Jina Integration - Jina integration of Hugging Face Accelerated API.
    • Weaviate Integration - Weaviate integration of Hugging Face Transformers.
    • ColBERT - A fast and accurate retrieval model, enabling scalable BERT-based search over large text collections in tens of milliseconds.
  • ❓ Question Answering

    • Haystack - End-to-end framework for developing and deploying question-answering systems in the wild.
  • 📱 Hardware

    • Intel - Collaboration with Intel for configuration options.
  • ⚙ī¸ Inference Engines

    • FasterTransformer - A script and recipe to run the highly optimized transformer-based encoder and decoder component on NVIDIA GPUs.
    • lightseq - A high performance inference library for sequence processing and generation implemented in CUDA.
    • FastSeq - Efficient implementation of popular sequence models (e.g., Bart, ProphetNet) for text generation, summarization, translation tasks etc.
    • TurboTransformers - An inference engine for transformers with fast C++ API.
  • 🏎ī¸ Model Compression/Acceleration

    • torchdistill - PyTorch-based modular, configuration-driven framework for knowledge distillation.
    • TextBrewer - State-of-the-art distillation methods to compress language models.
    • BERT-of-Theseus - Compressing BERT by progressively replacing the components of the original BERT.
  • 📖 Knowledge and Entity

    • PURE - Entity and relation extraction from text.
  • 🎙 Speech

    • s3prl - A self-supervised speech pre-training and representation learning toolkit.
    • speechbrain - A PyTorch-based speech toolkit.
  • 🙅 Grammatical Error Correction

    • Gramformer - A framework for detecting, highlighting and correcting grammatical errors on natural language text.
  • đŸĨĄ Text Representation

    • Sentence Transformers - Widely used encoders computing dense vector representations for sentences, paragraphs, and images.
    • WhiteningBERT - An easy unsupervised sentence embedding approach with whitening.
    • SimCSE - State-of-the-art sentence embedding with contrastive learning.
    • DensePhrases - Learning dense representations of phrases at scale.
  • 🏹ī¸ Adversarial Attack

    • TextAttack - A Python framework for adversarial attacks, data augmentation, and model training in NLP.
    • TextFlint - A unified multilingual robustness evaluation toolkit for NLP.
    • OpenAttack - An open-source textual adversarial attack toolkit.
  • đŸ¤¯ Multi-modality

    • ViLT - A vision-and-language transformer Without convolution or region supervision.