https://github.com/JayYip/bert-multiple-gpu

BERT for Multitask Learning
https://github.com/JayYip/bert-multiple-gpu

bert cws encoder-decoder multi-task-learning multitask-learning named-entity-recognition ner nlp part-of-speech pretrained-models text-classification transformer word-segmentation

Last synced: 7 months ago
JSON representation

BERT for Multitask Learning

Host: GitHub
URL: https://github.com/JayYip/bert-multiple-gpu
Owner: JayYip
License: apache-2.0
Created: 2018-11-29T06:43:15.000Z (over 6 years ago)
Default Branch: master
Last Pushed: 2023-04-12T06:03:03.000Z (about 2 years ago)
Last Synced: 2024-12-02T01:33:13.522Z (7 months ago)
Topics: bert, cws, encoder-decoder, multi-task-learning, multitask-learning, named-entity-recognition, ner, nlp, part-of-speech, pretrained-models, text-classification, transformer, word-segmentation
Language: Jupyter Notebook
Homepage: https://jayyip.github.io/m3tl/
Size: 29.1 MB
Stars: 545
Watchers: 19
Forks: 125
Open Issues: 27
Metadata Files:
- Readme: README.md
- Contributing: CONTRIBUTING.md
- License: LICENSE

Awesome Lists containing this project

awesome-bert - JayYip/bert-multiple-gpu

README

# M3TL

**M**ulti-**M**odal **M**ulti-**T**ask **L**earning

## Install

```
pip install m3tl
```

## What is it

This is a project that uses transformers(based on huggingface transformers) as base model to do **multi-modal multi-task learning**.

## Why do I need this

Multi-task learning(MTL) is gaining more and more attention, especially in deep learning era. It is widely used in NLP, CV, recommendation, etc. However, MTL usually involves complicated data preprocessing, task managing and task interaction. Other open-source projects, like TencentNLP and PyText, supports MTL but in a naive way and it's not straightforward to implement complicated MTL algorithm. In this project, we try to make writing MTL model as easy as single task learning model and further extend MTL to multi-modal multi-task learning. To do so, we expose following MTL related programable module to user:

- problem sampling strategy
- loss combination strategy
- gradient surgery
- model after base model(transformers)

Apart from programable modules, we also provide various built-in SOTA MTL algorithms.

In a word, you can use this project to:

- implement complicated MTL algorithm
- do SOTA MTL without diving into details
- do multi-modal learning

And since we use transformers as base model, you get all the benefits that you can get from transformers!

## What type of problems are supported?

```
params = Params()
for problem_type in params.list_available_problem_types():
print('`{problem_type}`: {desc}'.format(
desc=params.problem_type_desc[problem_type], problem_type=problem_type))

```

`cls`: Classification
`multi_cls`: Multi-Label Classification
`seq_tag`: Sequence Labeling
`masklm`: Masked Language Model
`pretrain`: NSP+MLM(Deprecated)
`regression`: Regression
`vector_fit`: Vector Fitting
`premask_mlm`: Pre-masked Masked Language Model
`contrastive_learning`: Contrastive Learning

## Get Started

Please see tutorials.

ecosyste.ms

Data

Tools

Indexes

Applications

Experiments

Awesome

https://github.com/JayYip/bert-multiple-gpu

Awesome Lists containing this project

README