Ecosyste.ms: Awesome
An open API service indexing awesome lists of open source software.
https://github.com/JayYip/bert-multiple-gpu
BERT for Multitask Learning
https://github.com/JayYip/bert-multiple-gpu
bert cws encoder-decoder multi-task-learning multitask-learning named-entity-recognition ner nlp part-of-speech pretrained-models text-classification transformer word-segmentation
Last synced: 6 days ago
JSON representation
BERT for Multitask Learning
- Host: GitHub
- URL: https://github.com/JayYip/bert-multiple-gpu
- Owner: JayYip
- License: apache-2.0
- Created: 2018-11-29T06:43:15.000Z (about 6 years ago)
- Default Branch: master
- Last Pushed: 2023-04-12T06:03:03.000Z (over 1 year ago)
- Last Synced: 2024-12-02T01:33:13.522Z (11 days ago)
- Topics: bert, cws, encoder-decoder, multi-task-learning, multitask-learning, named-entity-recognition, ner, nlp, part-of-speech, pretrained-models, text-classification, transformer, word-segmentation
- Language: Jupyter Notebook
- Homepage: https://jayyip.github.io/m3tl/
- Size: 29.1 MB
- Stars: 545
- Watchers: 19
- Forks: 125
- Open Issues: 27
-
Metadata Files:
- Readme: README.md
- Contributing: CONTRIBUTING.md
- License: LICENSE
Awesome Lists containing this project
- awesome-bert - JayYip/bert-multiple-gpu
README
# M3TL
**M**ulti-**M**odal **M**ulti-**T**ask **L**earning
## Install
```
pip install m3tl
```## What is it
This is a project that uses transformers(based on huggingface transformers) as base model to do **multi-modal multi-task learning**.
## Why do I need this
Multi-task learning(MTL) is gaining more and more attention, especially in deep learning era. It is widely used in NLP, CV, recommendation, etc. However, MTL usually involves complicated data preprocessing, task managing and task interaction. Other open-source projects, like TencentNLP and PyText, supports MTL but in a naive way and it's not straightforward to implement complicated MTL algorithm. In this project, we try to make writing MTL model as easy as single task learning model and further extend MTL to multi-modal multi-task learning. To do so, we expose following MTL related programable module to user:
- problem sampling strategy
- loss combination strategy
- gradient surgery
- model after base model(transformers)Apart from programable modules, we also provide various built-in SOTA MTL algorithms.
In a word, you can use this project to:
- implement complicated MTL algorithm
- do SOTA MTL without diving into details
- do multi-modal learningAnd since we use transformers as base model, you get all the benefits that you can get from transformers!
## What type of problems are supported?
```
params = Params()
for problem_type in params.list_available_problem_types():
print('`{problem_type}`: {desc}'.format(
desc=params.problem_type_desc[problem_type], problem_type=problem_type))```
`cls`: Classification
`multi_cls`: Multi-Label Classification
`seq_tag`: Sequence Labeling
`masklm`: Masked Language Model
`pretrain`: NSP+MLM(Deprecated)
`regression`: Regression
`vector_fit`: Vector Fitting
`premask_mlm`: Pre-masked Masked Language Model
`contrastive_learning`: Contrastive Learning## Get Started
Please see tutorials.