This repo implements the paper "QualityAdapt: an Automatic Dialogue Quality Estimation Framework".
QualityAdapt composes dialogue subqualities using AdapterFusion for the task of overall dialogue quality estimation. Unlike similar approaches, QualityAdapt requires only a single forward pass through a language model to produce predictions of overall quality, thus reducing computational complexity.
| Section |
| --- |
| Abstract |
| Apply QualityAdapt to your own dialogues |
| Training from scratch |
| Implementation Details |
| Citation |
## Abstract

Despite considerable advances in open-domain neural dialogue systems, their evaluation remains a bottleneck. Several automated metrics have been proposed to evaluate these systems; however, they mostly focus on a single notion of quality or, when they do combine several sub-metrics, they are computationally expensive. This paper addresses the latter problem: QualityAdapt leverages the Adapter framework for the task of Dialogue Quality Estimation. Using well-defined semi-supervised tasks, we train adapters for different subqualities and score generated responses with AdapterFusion. This compositionality provides an easy-to-adapt metric for the task at hand that incorporates multiple subqualities. It also reduces computational costs, as individual predictions of all subqualities are obtained in a single forward pass. This approach achieves comparable results to state-of-the-art metrics on several datasets, whilst keeping the previously mentioned advantages.
## Apply QualityAdapt to your own dialogues

- Obtain the trained adapter and adapter fusion from AdapterHub or directly from Drive.
- Adjust the example script `predict.py` to your requirements. You can choose between parallel prediction of all subqualities or prediction of Overall Quality alone (see the sketch below).
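A minimal sketch of what such a prediction script might look like, assuming the adapter-transformers package and a released Overall Quality checkpoint; the adapter path and input ordering are placeholders/assumptions for illustration, not the repo's actual `predict.py`:

```python
# Hedged sketch (not the repo's predict.py). Assumes the adapter-transformers
# package; the adapter path below is a placeholder for the AdapterHub/Drive checkpoint.
import torch
from transformers import AutoAdapterModel, AutoTokenizer

tokenizer = AutoTokenizer.from_pretrained("roberta-base")
model = AutoAdapterModel.from_pretrained("roberta-base")

# Load and activate the Overall Quality adapter (and its prediction head).
quality = model.load_adapter("path/to/overall_quality_adapter")  # placeholder path
model.set_active_adapters(quality)
model.eval()

# Speaker tokens are assumed to already be in the tokenizer's vocabulary.
ctx = "<speaker1>Let's go out and get crazy tonight .</s><s><speaker2>Let's go to the new club on West Street ."
res = "<speaker1>I ' m afraid I can ' t ."

# res first, ctx as the optional second segment, following the format description below;
# max_length matches the 124-token limit noted under Implementation Details.
inputs = tokenizer(res, ctx, truncation=True, max_length=124, return_tensors="pt")
with torch.no_grad():
    score = model(**inputs).logits
print(score)
```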
## Training from scratch

Here is an example illustrating how to reproduce the main results of the paper.
- Download the preprocessed data you wish to train and/or evaluate on and place it somewhere in your workspace.
- Adjust and run `train_u` and `train_s` to train the Understandability and Sensibleness adapters, respectively.
- Adjust and run `train_h` to train the AdapterFusion module on Overall Quality annotations (a sketch of this fusion step follows the list).
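As a rough illustration of what the fusion step involves (not the actual `train_h` script), the sketch below composes two previously trained subquality adapters with AdapterFusion using the adapter-transformers API; the adapter names and paths are illustrative assumptions:

```python
# Hedged sketch of the AdapterFusion step (not the repo's train_h).
# Paths and adapter names are illustrative placeholders.
from transformers import AutoAdapterModel
from transformers.adapters.composition import Fuse

model = AutoAdapterModel.from_pretrained("roberta-base")

# Subquality adapters produced by train_u / train_s (each trained separately
# with model.add_adapter(...) + model.train_adapter(...) on its own task).
model.load_adapter("path/to/understandability", load_as="understandability", with_head=False)
model.load_adapter("path/to/sensibleness", load_as="sensibleness", with_head=False)

# Fuse the two adapters and add a head for Overall Quality annotations.
fusion = Fuse("understandability", "sensibleness")
model.add_adapter_fusion(fusion)
model.add_classification_head("overall_quality", num_labels=1)

# Freeze the base model and the adapters; only the fusion weights (and head) train.
model.train_adapter_fusion(fusion)
model.set_active_adapters(fusion)
# ...then fine-tune on Overall Quality annotations, e.g. with AdapterTrainer.
```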
## Implementation Details

Raw dialogue corpora may have very different data structures, so we leave it to the user to convert their own data to the QualityAdapt format.
The format is straightforward:
- The tokenizer receives `res` as input and, optionally, `ctx` (context is needed to evaluate context-dependent metrics such as Sensibleness and Overall Quality). `ctx` can be multi-turn; the only limitation is `max_length=124`.
- Who said what is determined by prepending the speaker token to the start of each sentence.
A: Gosh, you took all the word right out of my mouth. Let's go out and get crazy tonight.
B: Let's go to the new club on West Street .
A: I'm afraid I can't.
ctx = "<speaker1>Gosh , you took all the word right out of my mouth . Let's go out and get crazy tonight .</s><s><speaker2>Let's go to the new club on West Street ."
res = "<speaker1>I ' m afraid I can ' t ."
## Citation

If you use this work, please consider citing:

John Mendonca, Alon Lavie, and Isabel Trancoso. 2022. QualityAdapt: an Automatic Dialogue Quality Estimation Framework. In Proceedings of the 23rd Annual Meeting of the Special Interest Group on Discourse and Dialogue, to appear, Edinburgh and Online. Association for Computational Linguistics.