An open API service indexing awesome lists of open source software.

Projects in Awesome Lists tagged with transformer-encoder

A curated list of projects in awesome lists tagged with transformer-encoder.

https://github.com/VITA-Group/TransGAN

[NeurIPS 2021] "TransGAN: Two Pure Transformers Can Make One Strong GAN, and That Can Scale Up", Yifan Jiang, Shiyu Chang, Zhangyang Wang

gan pytorch transformer transformer-encoder transformer-models

Last synced: 08 May 2025

https://github.com/vita-group/transgan

[NeurIPS 2021] "TransGAN: Two Pure Transformers Can Make One Strong GAN, and That Can Scale Up", Yifan Jiang, Shiyu Chang, Zhangyang Wang

gan pytorch transformer transformer-encoder transformer-models

Last synced: 08 Apr 2025

https://github.com/lilianweng/transformer-tensorflow

Implementation of the Transformer model in TensorFlow

attention-is tensorflow-models transformer transformer-encoder

Last synced: 05 Apr 2025

https://github.com/zhongkaifu/seq2seqsharp

Seq2SeqSharp is a fast, flexible tensor-based deep neural network framework written in .NET (C#). Its features include automatic differentiation, several network types (Transformer, LSTM, BiLSTM, and others), multi-GPU support, cross-platform support (Windows, Linux, x86, x64, ARM), and multimodal models for text and images.

attention-model cuda deep-learning encoder-decoder gpu image lstm machine-translation neural-network seq2seq sequence-to-sequence tensor text transformer transformer-architecture transformer-encoder translation vision-transformer

Last synced: 04 Apr 2025

https://github.com/declare-lab/mime

This repository contains PyTorch implementations of the models from the paper "MIME: MIMicking Emotions for Empathetic Response Generation".

conversational-bots dialogue-systems empathetic-dialogues empathetic-responses natural-language-processing transformer-encoder

Last synced: 14 Apr 2025

https://github.com/microsoft/CASPR

CASPR is a deep learning framework applying transformer architecture to learn and predict from tabular data at scale.

attention-mechanism business deep-learning tabular-data transformer transformer-architecture transformer-encoder

Last synced: 05 Apr 2025

https://github.com/helpmefindaname/transformer-smaller-training-vocab

Temporarily remove unused tokens during training to save RAM and speed up training.

python transformer-encoder transformers

Last synced: 23 Jun 2025

https://github.com/jiangnanboy/knowledge-automatic-tagging

Predicts and tags the knowledge points covered by a question.

knowledge-auto-tagging pytorch textcnn transformer-encoder

Last synced: 16 Jan 2026

https://github.com/heartcored98/transformer_anatomy

Official PyTorch implementation of "Roles and Utilization of Attention Heads in Transformer-based Neural Language Models" (ACL 2020)

attention-head interpretability interpretable-deep-learning transformer-encoder

Last synced: 13 Apr 2025

https://github.com/utahnlp/therapist-observer

Code for the ACL 2019 paper "Observing Dialogue in Therapy: Categorizing and Forecasting Behavioral Codes"

acl2019 attention behavior-coding dialog elmo focal-loss hierarchical-attention-networks psychotherapy transformer-encoder

Last synced: 07 May 2025

https://github.com/jaketae/realformer

PyTorch implementation of RealFormer: Transformer Likes Residual Attention

natural-language-processing pytorch transformer transformer-encoder

Last synced: 28 Oct 2025

https://github.com/kyegomez/shallowff

Zeta implementation of "Rethinking Attention: Exploring Shallow Feed-Forward Neural Networks as an Alternative to Attention Layers in Transformers"

artificial-intelligence attention attention-is-all-you-need attention-mechanism attention-mechanisms feedforward transformer transformer-encoder transformer-models transformers-models

Last synced: 28 Jun 2025