https://github.com/salesforce/fewxc

Official code and data release for Efficiently Aligned Cross-Lingual Transfer Learning for Conversational Tasks using Prompt-Tuning, accepted by findings of EACL 2024.
https://github.com/salesforce/fewxc

cross-lingual nlp task-oriented-dialogue

Last synced: 11 months ago
JSON representation

Official code and data release for Efficiently Aligned Cross-Lingual Transfer Learning for Conversational Tasks using Prompt-Tuning, accepted by findings of EACL 2024.

Host: GitHub
URL: https://github.com/salesforce/fewxc
Owner: salesforce
License: apache-2.0
Created: 2023-03-22T23:07:45.000Z (over 3 years ago)
Default Branch: main
Last Pushed: 2025-05-29T13:03:32.000Z (about 1 year ago)
Last Synced: 2025-06-01T23:24:22.020Z (about 1 year ago)
Topics: cross-lingual, nlp, task-oriented-dialogue
Language: Python
Homepage: https://arxiv.org/abs/2304.01295
Size: 44.9 KB
Stars: 3
Watchers: 3
Forks: 0
Open Issues: 0
Metadata Files:
- Readme: README.md
- Contributing: CONTRIBUTING.md
- License: LICENSE.txt
- Code of conduct: CODE_OF_CONDUCT.md
- Codeowners: CODEOWNERS
- Security: SECURITY.md

Awesome Lists containing this project

README

# FewXC (Few-shot Cross-lingual Conversational Research)
Official code and data release for [Efficiently Aligned Cross-Lingual Transfer Learning for Conversational Tasks using Prompt-Tuning](https://arxiv.org/abs/2304.01295), accepted by findings of EACL 2024.

## XSGD Data
https://console.cloud.google.com/storage/browser/multilingual-sgd-data-research

## MASSIVE Data
Please refer to https://github.com/alexa/massive for data and pre-processing pipeline.

## Backbone Model
XLM-RoBERTa series, https://huggingface.co/docs/transformers/model_doc/xlm-roberta

## Examples
```python
# NLI style intent classificaiton with MASSIVE dataset
CUDA_VISIBLE_DEVICES=0 python xlmr_massive_finetuning_nli.py \
--training_strategy 5 \
--learning_rate 1e-5 \
--training_data /export/home/xclm/1.1/parsed_data.train \
--intent_schema /export/home/xclm/1.1/parsed_data.intents \
--slot_schema /export/home/xclm/1.1/parsed_data.slots \
--dev_data /export/home/xclm/1.1/parsed_data.dev \
--prefix # turn on prefix tuning

# Regular intent classfication with MASSIVE dataset
CUDA_VISIBLE_DEVICES=0 python xlmr_massive_finetuning_vanilla.py \
--training_strategy 5 \
--learning_rate 1e-5 \
--training_data /export/home/xclm/1.1/parsed_data.train \
--intent_schema /export/home/xclm/1.1/parsed_data.intents \
--slot_schema /export/home/xclm/1.1/parsed_data.slots \
--dev_data /export/home/xclm/1.1/parsed_data.dev \
--prefix \
--task intent,slot
```

## Citation
```
@inproceedings{
anonymous2024efficiently,
title={Efficiently Aligned Cross-Lingual Transfer Learning for Conversational Tasks using Prompt-Tuning},
author={Lifu Tu, Jin Qu, Semih Yavuz, Shafiq Joty, Wenhao Liu, Caiming Xiong, Yingbo Zhou},
booktitle={18th Conference of the European Chapter of the Association for Computational Linguistics},
year={2024},
url={https://openreview.net/forum?id=Lb4qW0NiIb}
}
```

ecosyste.ms

Data

Tools

Indexes

Applications

Experiments

Awesome

https://github.com/salesforce/fewxc

Awesome Lists containing this project

README