Diversity and Quality: Comparing Decoding Methods with PEGASUS for Text Summarization

唐科南; Thompson, Keenan Nathaniel

Diversity and Quality: Comparing Decoding Methods with PEGASUS for Text Summarization

Files

60847093S-40679.pdf (377.71 KB)

Date

2021

Authors

唐科南

Thompson, Keenan Nathaniel

Abstract

none
This thesis offers three major contributions: (1) It considers a number of diverse decoding methods to address degenerate repetition in model output text and investigates what can be done to mitigate the loss in summary quality associated with the use of such methods. (2) It provides evidence that measure of textual lexical diversity (MTLD) is as viable tool as perplexity is for comparing text diversity in this context. (3) It presents a detailed analysis of the strengths and shortcomings of ROUGE, particularly in regard to abstractive summarization. To explore these issues the work analyzes the results of experiments run on the CNN/DailyMail dataset with the PEGASUS model.

Keywords

none, summarization, diverse decoding, PEGASUS, ROUGE, lexical diversity

URI

https://etds.lib.ntnu.edu.tw/thesis/detail/8a988c2eaa76dfbc4f7d610b1e56fba9/
http://rportal.lib.ntnu.edu.tw/handle/20.500.12235/117349

Collections

學位論文

Full item page

Diversity and Quality: Comparing Decoding Methods with PEGASUS for Text Summarization

Files

Date

Authors

Journal Title

Journal ISSN

Volume Title

Publisher

Abstract

Description

Keywords

Citation

URI

Collections

Endorsement

Review

Supplemented By

Referenced By