Projects in Awesome Lists tagged with transformer-encoder
A curated list of projects in awesome lists tagged with transformer-encoder .
https://github.com/microsoft/deberta
The implementation of DeBERTa
bert deeplearning language-model natural-language-understanding representation-learning roberta self-attention transformer-encoder
Last synced: 14 May 2025
https://github.com/microsoft/DeBERTa
The implementation of DeBERTa
bert deeplearning language-model natural-language-understanding representation-learning roberta self-attention transformer-encoder
Last synced: 18 Apr 2025
https://github.com/VITA-Group/TransGAN
[NeurIPS‘2021] "TransGAN: Two Pure Transformers Can Make One Strong GAN, and That Can Scale Up", Yifan Jiang, Shiyu Chang, Zhangyang Wang
gan pytorch transformer transformer-encoder transformer-models
Last synced: 08 May 2025
https://github.com/vita-group/transgan
[NeurIPS‘2021] "TransGAN: Two Pure Transformers Can Make One Strong GAN, and That Can Scale Up", Yifan Jiang, Shiyu Chang, Zhangyang Wang
gan pytorch transformer transformer-encoder transformer-models
Last synced: 08 Apr 2025
https://github.com/brightmart/bert_language_understanding
Pre-training of Deep Bidirectional Transformers for Language Understanding: pre-train TextCNN
attention-is-all-you-need bert-model document-classification fasttext language-model language-understanding nlp pre-training question-answering self-attention text-classification textcnn transfer-learning transformer-encoder
Last synced: 13 Apr 2025
https://github.com/lilianweng/transformer-tensorflow
Implementation of Transformer Model in Tensorflow
attention-is tensorflow-models transformer transformer-encoder
Last synced: 05 Apr 2025
https://github.com/wgcban/ChangeFormer
[IGARSS'22]: A Transformer-Based Siamese Network for Change Detection
attention-mechanism change-detection climate-change deep-learning multi-temporal pytorch remote-sensing satellite-imagery siamese-network transformer-architecture transformer-encoder
Last synced: 11 May 2025
https://github.com/wgcban/changeformer
[IGARSS'22]: A Transformer-Based Siamese Network for Change Detection
attention-mechanism change-detection climate-change deep-learning multi-temporal pytorch remote-sensing satellite-imagery siamese-network transformer-architecture transformer-encoder
Last synced: 05 Apr 2025
https://github.com/zhongkaifu/seq2seqsharp
Seq2SeqSharp is a tensor based fast & flexible deep neural network framework written by .NET (C#). It has many highlighted features, such as automatic differentiation, different network types (Transformer, LSTM, BiLSTM and so on), multi-GPUs supported, cross-platforms (Windows, Linux, x86, x64, ARM), multimodal model for text and images and so on.
attention-model cuda deep-learning encoder-decoder gpu image lstm machine-translation neural-network seq2seq sequence-to-sequence tensor text transformer transformer-architecture transformer-encoder translation vision-transformer
Last synced: 04 Apr 2025
https://github.com/jackaduma/secbert
pretrained BERT model for cyber security text, learned CyberSecurity Knowledge
apt attention bert bert-embeddings cyber-security cyber-threat-intelligence cybersecurity deep-learning-security deeplearning machine-learning-security nlp nlp-machine-learning security security-automation threat-analysis threat-detection threat-hunting threat-intelligence transformer-encoder transformers
Last synced: 27 Apr 2025
https://github.com/declare-lab/mime
This repository contains PyTorch implementations of the models from the paper An Empirical Study MIME: MIMicking Emotions for Empathetic Response Generation.
conversational-bots dialogue-systems empathetic-dialogues empathetic-responses natural-language-processing transformer-encoder
Last synced: 14 Apr 2025
https://github.com/microsoft/CASPR
CASPR is a deep learning framework applying transformer architecture to learn and predict from tabular data at scale.
attention-mechanism business deep-learning tabular-data transformer transformer-architecture transformer-encoder
Last synced: 05 Apr 2025
https://github.com/helpmefindaname/transformer-smaller-training-vocab
Temporary remove unused tokens during training to save ram and speed.
python transformer-encoder transformers
Last synced: 23 Jun 2025
https://github.com/jiangnanboy/knowledge-automatic-tagging
题目知识点预测标注。Question knowledge point prediction.
knowledge-auto-tagging pytorch textcnn transformer-encoder
Last synced: 16 Jan 2026
https://github.com/heartcored98/transformer_anatomy
Official Pytorch implementation of (Roles and Utilization of Attention Heads in Transformer-based Neural Language Models), ACL 2020
attention-head interpretability interpretable-deep-learning transformer-encoder
Last synced: 13 Apr 2025
https://github.com/nikhilroxtomar/vision-transformer-vit-in-tensorflow
Vision Transformer Implementation in TensorFlow
transformer transformer-architecture transformer-encoder vision-transformer vit
Last synced: 14 Apr 2025
https://github.com/utahnlp/therapist-observer
Code for the ACL 2019 paper "Observing Dialogue in Therapy: Categorizing and Forecasting Behavioral Codes"
acl2019 attention behavior-coding dialog elmo focal-loss hierarchical-attention-networks psychotherapy transformer-encoder
Last synced: 07 May 2025
https://github.com/jaketae/realformer
PyTorch implementation of RealFormer: Transformer Likes Residual Attention
natural-language-processing pytorch transformer transformer-encoder
Last synced: 28 Oct 2025
https://github.com/kyegomez/shallowff
Zeta implemantion of "Rethinking Attention: Exploring Shallow Feed-Forward Neural Networks as an Alternative to Attention Layers in Transformers"
artificial-intelligence attention attention-is-all-you-need attention-mechanism attention-mechanisms feedforward transformer transformer-encoder transformer-models transformers-models
Last synced: 28 Jun 2025
https://github.com/amitkumarj441/bcapsule
When BERT meets Capsule
bert capsule-network transformer-encoder
Last synced: 09 May 2026
https://github.com/pavansomisetty21/attention-is-all-you-need-the-transformer-architecture
In this we explore detailed architecture of Transformer
attention-is-all-you-need attention-mechanism gpt transformer transformer-architecture transformer-decoder transformer-decoder-model transformer-encoder transformer-encoders
Last synced: 12 May 2026
https://github.com/stefanheng/zeroshot-text-classification
Exploring fast & accurate zero-shot text classification
bi-encoder natural-language-processing sentence-transformers text-classification transformer transformer-encoder zero-shot-classification zero-shot-learning
Last synced: 16 Jul 2025