Skip to content

Latest commit

 

History

History
139 lines (110 loc) · 6.25 KB

README.md

File metadata and controls

139 lines (110 loc) · 6.25 KB

MEffi-Prompt

The repository for our paper Multilingual Relation Classification via Efficient and Effective Prompting, to appear at EMNLP-2022 (main conference).

In this paper, we extend the power of prompting to the underexplored task of multilingual relation classification and aim to find out best ways to prompt for different languages, data regimes, etc. with minimal handcraft (i.e. translation). We are especially interested in its in-language and cross-lingual performance, as well as different behaviour between code-switch and in-language prompts.

Effectiveness is validated over 14 languaged covered by the SMiLER dataset.

drawing

Table of Contents

🔭  Overview

Path Description
config/ This directory contains the Hydra config files that specify pre-defined settings.
data/ This directory where the user should put their data files, as well as some pre-processing scripts.
docs/ This directory contains the auxiliary files for documentation, such as the figure(s)presented in README.
src/meffi_prompt/ This directory is the package to be installed, which contains the source code of our implementation.

🚀  Installation

git clone git@github.com:DFKI-NLP/meffi-prompt.git
cd meffi-prompt
pip install -e .

💡  Usage

To evaluate the default setting (i.e. fully supervised scenario with model="google/mt5-base", max_length=256, batch_size=16, num_epochs=10, lr=3e-5, soft_token_length=0), run:

python main.py

To run your own setting:

python main.py model="google/mt5-small" batch_size=4 num_epochs=5

Hydra provides a simple way to sweep the arguments for hyperparameter-finetuning. The following command will excute 3 * 2 * 1= 6 runs in a row:

python main.py -m batch_size=4,8,16 model="google/mt5-base","google/mt5-small" max_length=512

To show the available options and the default config, do:

python main.py --help

which results in something like this:

== Config ==
Override anything in the config (foo.bar=value)

seed: 1234
cuda_device: 0
train_file: ./data/smiler/de_corpora_train.json
eval_file: ./data/smiler/de_corpora_test.json
model: google/mt5-base
soft_token_length: 0
max_length: 256
batch_size: 16
lr: 3.0e-05
num_epochs: 5

Note that the different run-scripts correspond to different evaluation scenarios:

script name scenario
main.py fully supervised
main_fs.py few-shot
main_iczs.py in-context zero-shot
main_zslt.py zero-shot lingual transfer

🔎  Prompt Construction

drawing

The templates we employ (see the table above) are already in the code, so it involves no work from your side to reproduce our results.

You can also define your own template. For example, if you want the template to be "$x$. The relation between $e_h$ and $e_t$ is _____., just modify the prompt as

template = {
    "input": ["x", "The relation between", "eh", "and", "et", "is", "<extra_id_0>"],
    "target": ["<extra_id_0>", "r", "<extra_id_1>"],
}

where $x$, $e_h$, $e_t$, $r$ are variants, and <extra_id_?> are special tokens preserved by T5 to denote either (1) start of a blank or (2) end of decoded sequence. The rest elements are hard tokens. To insert soft tokens, use [vN] (v means virtual token; N is the length of inserted soft tokens and can be specified in config.)

📝  Dataset

We evaluate the SMiLER dataset which covers 14 languages.

The dataset can be downloaded from https://github.com/samsungnlp/smiler. The pre-processing script is at ./data/smiler/reformatter.py. Main statistics per language are listed as follows:

Language #Class #Train #Test % no-rel (train)
ar 9 9303 190 3.46
de 22 51490 1051 0.89
en 36 267579 5461 4.91
es 21 11061 226 4.83
fa 8 2624 54 7.93
fr 22 60884 1243 0.90
it 22 73974 1510 0.70
ko 28 18711 382 1.67
nl 22 38850 793 0.86
pl 21 16831 344 0.00
pt 22 43335 885 0.84
ru 8 6395 131 1.86
sv 22 4482 92 0.60
uk 7 968 20 7.02

📚  Citation

@inproceedings{chen-etal-2022-multilingual,
    title = "Multilingual Relation Classification via Efficient and Effective Prompting",
    author = "Chen, Yuxuan and Harbecke, David and Hennig, Leonhard",
    booktitle = "Proceedings of the 2022 Conference on Empirical Methods in Natural Language Processing",
    month = december,
    year = "2022",
    address = "Online and Abu Dhabi, the United Arab Emirates",
    publisher = "Association for Computational Linguistics",
    abstract = "Prompting pre-trained language models has achieved impressive performance on various NLP tasks, especially in low data regimes. Despite the success of prompting in monolingual settings, applying prompt-based methods in multilingual scenarios has been limited to a narrow set of tasks, due to the high cost of handcrafting multilingual prompts. In this paper, we present the first work on prompt-based multilingual relation classification (RC), by introducing an efficient and effective method that constructs prompts from relation triples and involves only minimal translation for the class labels. We evaluate its performance in fully supervised, few-shot and zero-shot scenarios, and analyze its effectiveness across 14 languages, prompt variants, and English-task training in cross-lingual settings. We find that in both fully supervised and few-shot scenarios, our prompt method beats competitive baselines: fine-tuning XLM-R_EM and null prompts. It also outperforms the random baseline by a large margin in zero-shot experiments. Our method requires little in-language knowledge and can be used as a strong baseline for similar multilingual classification tasks.",
}

📘  License

This repository is released under the terms of the Apache 2.0 license.