https://github.com/jianguoz/few-shot-intent-detection

Few-Shot-Intent-Detection includes popular challenging intent detection datasets with/without OOS queries and state-of-the-art baselines and results.
https://github.com/jianguoz/few-shot-intent-detection
datasets few-shot intent-classification intent-detection libary
Last synced: 14 days ago
JSON representation
Few-Shot-Intent-Detection includes popular challenging intent detection datasets with/without OOS queries and state-of-the-art baselines and results.
Host: GitHub
URL: https://github.com/jianguoz/few-shot-intent-detection
Owner: jianguoz
Created: 2021-06-03T22:00:23.000Z (almost 4 years ago)
Default Branch: main
Last Pushed: 2023-07-19T05:22:52.000Z (almost 2 years ago)
Last Synced: 2025-03-27T02:39:22.415Z (about 1 month ago)
Topics: datasets, few-shot, intent-classification, intent-detection, libary
Language: Python
Homepage:
Size: 1.35 MB
Stars: 138
Watchers: 4
Forks: 26
Open Issues: 0
Metadata Files:
- Readme: README.md
Awesome Lists containing this project

README

        # Few-Shot-Intent-Detection

## :bangbang: ❤️ ‼️ **07/18/2023: Check our latest updates on [DialogStudio](https://github.com/salesforce/DialogStudio)

[DialogStudio](https://github.com/salesforce/DialogStudio) is a meticulously curated collection of dialogue datasets. These datasets are unified under a consistent format while retaining their original information. We incorporate domain-aware prompts and identify dataset licenses, making DialogStudio an exceptionally rich and diverse resource for dialogue research and model training.**

Few-Shot-Intent-Detection is a repository designed for few-shot intent detection with/without Out-of-Scope (OOS) intents. It includes popular challenging intent detection datasets and baselines. For more details of the new released OOS datasets, please check our [paper](https://arxiv.org/abs/2106.04564).

## Intent detection datasets

We process data based on previous published resources, all the data are in the same format as [DNNC](https://github.com/salesforce/DNNC-few-shot-intent). 

| Dataset      	| Description  | #Train | #Valid | #Test 	|  Processed Data Link| 

|--------------	|------	|------	|------	|---------------	|------	|

| [BANKING77](https://arxiv.org/abs/2003.04807)      	| one banking domain with 77 intents  |8622|1540| 3080  	|  [Link](https://github.com/jianguoz/Few-Shot-Intent-Detection/tree/main/Datasets/BANKING77)                  	|

| [CLINC150](https://www.aclweb.org/anthology/D19-1131/)        | 10 domains and 150 intents |15000| 3000	| 4500 	| [Link](https://github.com/jianguoz/Few-Shot-Intent-Detection/tree/main/Datasets/CLINC150)|                                              	| Link	|

| [HWU64](https://arxiv.org/abs/1903.05566)        | personal assistant with 64 intents and several domains                                                 |8954| 1076	| 1076 	|  [Link](https://github.com/jianguoz/Few-Shot-Intent-Detection/tree/main/Datasets/HWU64)	|

| [SNIPS](https://arxiv.org/pdf/1805.10190.pdf)        |snips voice platform with 7 intents   |13084| 700	| 700 	|  [Link](https://github.com/jianguoz/Few-Shot-Intent-Detection/tree/main/Datasets/SNIPS)	|

| [ATIS](https://ieeexplore.ieee.org/document/5700816)        |airline travel information system   |4478| 500	| 893 	|  [Link](https://github.com/jianguoz/Few-Shot-Intent-Detection/tree/main/Datasets/SNIPS)	|

## Intent detection datasets with OOS queries

What is OOS queires:

`OOD-OOS`: i.e., out-of-domain OOS. General out-of-scope queries which are not supported by the dialog systems, also called out-of-domain OOS. For instance, requesting an online NBA/TV show service in a banking system.

`ID-OOS`: i.e., in-domain OOS. Out-of-scope queries which are more related to the in-scope intents, which makes the intent detection task more challenging. For instance, requesting a banking service that is not supported by the banking system.

| Dataset      	| Description  | #Train | #Valid | #Test 	|#OOD-OOS-Train |#OOD-OOS-Valid|#OOD-OOS-Test| #ID-OOS-Train |#ID-OOS-Valid|#ID-OOS-Test| Processed Data Link| 

|--------------	|------	|------	|------	|---------------	|------	|------	|------	|------	|------	|------|------	|

| [CLINC150](https://www.aclweb.org/anthology/D19-1131/)        | A dataset with general OOS-OOS queries |15000| 3000	| 4500  |	100| 100|1000| -|-|-|[Link](https://github.com/jianguoz/Few-Shot-Intent-Detection/tree/main/Datasets/CLINC150)|

| [CLINC-Single-Domain-OOS](https://arxiv.org/abs/2106.04564)        | Two domains with both general OOS-OOS queries and ID-OOS queries |500| 500	| 500  |-	| 200|1000| -|400|350|[Link](https://github.com/jianguoz/Few-Shot-Intent-Detection/tree/main/Datasets/CLINC-Single-Domain-OOS)|                                             

| [BANKING77-OOS](https://arxiv.org/abs/2106.04564)        | One banking domain with both general OOS-OOS queries and ID-OOS queries |5905| 1506	| 2000  |-	| 200|1000| 2062|530|1080|[Link](https://github.com/jianguoz/Few-Shot-Intent-Detection/tree/main/Datasets/BANKING77-OOS)|      

Data structure:

```

Datasets/

├── BANKING77

│   ├── train

│   ├── train_10

│   ├── train_5

│   ├── valid

│   └── test

├── CLINC150

│   ├── train

│   ├── train_10

│   ├── train_5

│   ├── valid

│   ├── test

│   ├── oos

│       ├──train

│       ├──valid

│       └──test

├── HWU64

│   ├── train

│   ├── train_10

│   ├── train_5

│   ├── valid

│   └── test

├── SNIPS

│   ├── train

│   ├── valid

│   └── test

├── ATIS

│   ├── train

│   ├── valid

│   └── test

├── BANKING77-OOS

│   ├── train

│   ├── valid

│   ├── test

│   ├── id-oos

│   │   ├──train

│   │   ├──valid

│   │   └──test

│   ├── ood-oos

│       ├──valid

│       └──test

├── CLINC-Single-Domain-OOS

│   ├── banking

│   │   ├── train

│   │   ├── valid

│   │   ├── test

│   │   ├── id-oos

│   │   │   ├──valid

│   │   │   └──test

│   │   ├── ood-oos

│   │       ├──valid

│   │       └──test

│   ├── credit_cards

│   │   ├── train

│   │   ├── valid

│   │   ├── test

│   │   ├── id-oos

│   │   │   ├──valid

│   │   │   └──test

│   │   ├── ood-oos

│   │       ├──valid

└── └──     └──test

```

Briefly describe the [BANKING77-OOS](https://arxiv.org/abs/2106.04564) dataset. 

*  A dataset with a single banking domain, includes both general Out-of-Scope (OOD-OOS) queries and In-Domain but Out-of-Scope (ID-OOS) queries, where ID-OOS queries are semantically similar intents/queries with in-scope intents.  BANKING77 originally includes 77 intents. BANKING77-OOS includes 50 in-scope intents in this dataset, and the ID-OOS queries are built up based on 27 held-out semantically similar in-scope intents.

Briefly describe the [CLINC-Single-Domain-OOS](https://arxiv.org/abs/2106.04564) dataset. 

*  A dataset with two separate domains, i.e., the  "Banking''  domain and the "Credit cards''  domain with both general Out-of-Scope (OOD-OOS) queries and In-Domain but Out-of-Scope (ID-OOS) queries, where ID-OOS queries are semantically similar intents/queries with in-scope intents. Each domain in CLINC150 originally includes 15 intents. Each domain in the new dataset includes ten in-scope intents in this dataset, and the ID-OOS queries are built up based on five held-out semantically similar in-scope intents.

Both datasets can be used to conduct intent detection with and without OOD-OOS and ID-OOS queries

You can easily load the processed data:

```python

class IntentExample:

    def __init__(self, text, label, do_lower_case):

        self.original_text = text

        self.text = text

        self.label = label

        if do_lower_case:

            self.text = self.text.lower()

        

def load_intent_examples(file_path, do_lower_case=True):

    examples = []

    with open('{}/seq.in'.format(file_path), 'r', encoding="utf-8") as f_text, open('{}/label'.format(file_path), 'r', encoding="utf-8") as f_label:

        for text, label in zip(f_text, f_label):

            e = IntentExample(text.strip(), label.strip(), do_lower_case)

            examples.append(e)

    return examples

```

More details can check [code for load data and do random sampling for few-shot learning](https://github.com/salesforce/DNNC-few-shot-intent/blob/master/train_classifier.py#L127).

## State-of-the art models and baselines

**[DNNC](https://www.aclweb.org/anthology/2020.emnlp-main.411/)**

Download pre-trained RoBERTa NLI checkpoint: 

```bash

wget https://storage.googleapis.com/sfr-dnnc-few-shot-intent/roberta_nli.zip

```

Access to public code: [Link](https://github.com/salesforce/DNNC-few-shot-intent)

**[CONVERT](https://www.aclweb.org/anthology/2020.nlp4convai-1.5/)**

Download pre-trained checkpoint: 

```bash

wget https://github.com/connorbrinton/polyai-models/releases/download/v1.0/model.tar.gz

```

Access to public code:

```bash

wget https://github.com/connorbrinton/polyai-models/archive/refs/tags/v1.0.zip

```

**[CONVBERT](https://arxiv.org/abs/2009.13570)** 

Download pre-trained checkpoints: 

Step-1: install [AWS CL2](https://aws.amazon.com/cli/): e.g., install [MacOS PKG](https://awscli.amazonaws.com/AWSCLIV2.pkg)

Step-2: 

```bash

aws s3 cp s3://dialoglue/ --no-sign-request `Your_folder_name` --recursive

```

Then the checkpoints are downloaded into  `Your_folder_name`

## Few-shot intent detection baselines/leaderboard:

**5-shot learning**

| Model      	| BANKING77  | CLICN150 | HWU64 | 

|--------------	|------	|------	|------	|

|[RoBERTa+Classifier](https://www.aclweb.org/anthology/2020.emnlp-main.411/) (EMNLP 2020) | 74.04 | 87.99 | 75.56 |

|[USE](https://www.aclweb.org/anthology/2020.nlp4convai-1.5/) (ACL 2020 NLP4ConvAI)| 76.29 | 87.82 | 77.79 |

|[CONVERT](https://www.aclweb.org/anthology/2020.nlp4convai-1.5/) (ACL 2020 NLP4ConvAI)| 75.32 | 89.22 | 76.95|

|[USE+CONVERT](https://www.aclweb.org/anthology/2020.nlp4convai-1.5/) (ACL 2020 NLP4ConvAI)      | 77.75 | 90.49 | 80.01 | 

|[CONVBERT+MLM+Example+Observers](https://arxiv.org/abs/2010.08684)  (NAACL 2021)     | - | - | - |

|[DNNC](https://www.aclweb.org/anthology/2020.emnlp-main.411/) (EMNLP 2020)              | 80.40 | 91.02 | 80.46 | 

|[CPFT](https://arxiv.org/pdf/2109.06349.pdf) (EMNLP 2021) |80.86| 92.34 | 82.03|

|[ICDA](https://arxiv.org/abs/2302.05096) (EACL 2023) |84.01| 92.62 | 82.45|

**10-shot learning**

| Model      	| BANKING77  | CLICN150 | HWU64 | 

|--------------	|------	|------	|------	|

|[RoBERTa+Classifier](https://www.aclweb.org/anthology/2020.emnlp-main.411/) (EMNLP 2020) | 84.27 | 91.55 | 82.90 |

|[USE](https://www.aclweb.org/anthology/2020.nlp4convai-1.5/) (ACL 2020 NLP4ConvAI)| 84.23 | 90.85 | 83.75 |

|[CONVERT](https://www.aclweb.org/anthology/2020.nlp4convai-1.5/)(ACL 2020 NLP4ConvAI) | 83.32 | 92.62 | 82.65|

|[USE+CONVERT](https://www.aclweb.org/anthology/2020.nlp4convai-1.5/) (ACL 2020 NLP4ConvAI)       | 85.19 | 93.26 | 85.83 | 

|[CONVBERT](https://arxiv.org/abs/2009.13570) (ArXiv 2020)| 83.63 | 92.10 | 83.77 |

|[CONVBERT+MLM](https://arxiv.org/abs/2009.13570)  (ArXiv 2020)     | 83.99 | 92.75 | 84.52 |

|[CONVBERT+MLM+Example+Observers](https://arxiv.org/abs/2010.08684) (NAACL 2021) | 85.95 | 93.97 | 86.28 |

|[DNNC](https://www.aclweb.org/anthology/2020.emnlp-main.411/) (EMNLP 2020)              | 86.71 | 93.76 | 84.72 |

|[CPFT](https://arxiv.org/pdf/2109.06349.pdf) (EMNLP 2021) |87.20| 94.18 | 87.13|

|[ICDA](https://arxiv.org/abs/2302.05096) (EACL 2023) |89.79| 94.84 | 87.41|

`Note:` the 5-shot learning results of RoBERTa+Classifier, DNNC and CPFT, and the 10-shot learning results of all the models are reported by the paper authors. 

## Citation

Please cite our paper if you use above resources in your work:

```bibtex

@article{zhang2020discriminative,

  title={Discriminative nearest neighbor few-shot intent detection by transferring natural language inference},

  author={Zhang, Jian-Guo and Hashimoto, Kazuma and Liu, Wenhao and Wu, Chien-Sheng and Wan, Yao and Yu, Philip S and Socher, Richard and Xiong, Caiming},

  journal={EMNLP},

  pages={5064--5082},

  year={2020}

}

@article{zhang2021few,

  title={Few-Shot Intent Detection via Contrastive Pre-Training and Fine-Tuning},

  author={Zhang, Jianguo and Bui, Trung and Yoon, Seunghyun and Chen, Xiang and Liu, Zhiwei and Xia, Congying and Tran, Quan Hung and Chang, Walter and Yu, Philip},

  journal={EMNLP},

  year={2021}

}

@article{zhang2022pretrained,

  title={Are Pretrained Transformers Robust in Intent Classification? A Missing Ingredient in Evaluation of Out-of-Scope Intent Detection},

  author={Zhang, Jian-Guo and Hashimoto, Kazuma and Wan, Yao and Liu, Zhiwei and Liu, Ye and Xiong, Caiming and Yu, Philip S},

  journal={The 4th Workshop on NLP for Conversational AI, ACL 2022},

  year={2022}

}

```
ecosyste.ms

Data

Tools

Indexes

Applications

Experiments

Awesome

https://github.com/jianguoz/few-shot-intent-detection

Awesome Lists containing this project

README