Projects in Awesome Lists by asahi417
A curated list of projects in awesome lists by asahi417 .
https://github.com/asahi417/tner
Language model fine-tuning on NER with an easy interface and cross-domain evaluation. "T-NER: An All-Round Python Library for Transformer-based Named Entity Recognition, EACL 2021"
language-model named-entity-recognition nlp transformers
Last synced: 13 Apr 2025
https://github.com/asahi417/lm-question-generation
Multilingual/multidomain question generation datasets, models, and python library for question generation.
bart nlp pytorch question-answering question-generation t5
Last synced: 06 Apr 2025
https://github.com/asahi417/lmppl
Calculate perplexity on a text with pre-trained language models. Support MLM (eg. DeBERTa), recurrent LM (eg. GPT3), and encoder-decoder LM (eg. Flan-T5).
Last synced: 06 Apr 2025
https://github.com/asahi417/kex
Kex is a python library for unsupervised keyword extraction from a document, providing an easy interface and benchmarks on 15 public datasets.
information-retrieval keyword-extraction nlp-machine-learning
Last synced: 23 Mar 2025
https://github.com/asahi417/wikiart-image-dataset
We release WikiART Crawler, a python-library to download/process images from WikiART via WikiART API, and two image datasets: `WikiART Face` and `WikiART General`.
art computer-vision dataset generative-model
Last synced: 23 Mar 2025
https://github.com/asahi417/relbert
The official implementation of "Distilling Relation Embeddings from Pre-trained Language Models, EMNLP 2021 main conference", a high-quality relation embedding based on language models.
Last synced: 09 Apr 2025
https://github.com/asahi417/lm-vocab-trimmer
Vocabulary Trimming (VT) is a model compression technique, which reduces a multilingual LM vocabulary to a target language by deleting irrelevant tokens from its vocabulary. This repository contains a python-library vocabtrimmer, that remove irrelevant tokens from a multilingual LM vocabulary for the target language.
bert gpt language-model model-compression nlp t5
Last synced: 23 Mar 2025
https://github.com/asahi417/analogy-language-model
The official implementation of "BERT is to NLP what AlexNet is to CV: Can Pre-Trained Language Models Identify Analogies?, ACL 2021 main conference"
Last synced: 23 Mar 2025
https://github.com/asahi417/firstcut
Audio/video editor: detecting silent interval and eliminated them from the original audio/video.
Last synced: 23 Mar 2025
https://github.com/asahi417/pytorch-language-model
Pytorch language modeling.
Last synced: 23 Mar 2025
https://github.com/asahi417/semanticsegmentation
Tensorflow implementation of DeepLab v3+. Show results trained over PASCAL data.
deep-learning semantic-segmentation tensorflow
Last synced: 23 Mar 2025
https://github.com/asahi417/mlm-manifold-mapping
BERT based conditional text generation/revision with pseudo perplexity objectives.
Last synced: 23 Mar 2025