Sawicki, Piotr, Grzes, Marek, Jordanous, Anna, Brown, Dan, Peeperkorn, Max (2022) Training GPT-2 to represent two Romantic-era authors: challenges, evaluations and pitfalls. In: 13th International Conference on Computational Creativity. . pp. 34-43. Association for Computational Creativity (ACC) ISBN 978-989-54-1604-2. (KAR id:94992)
PDF
Author's Accepted Manuscript
Language: English |
|
Download this file (PDF/866kB) |
Preview |
Request a format suitable for use with assistive technology e.g. a screenreader | |
Official URL: http://computationalcreativity.net/iccc22/papers/I... |
Abstract
Poetry generation within style constraints has many creative challenges, despite the recent advances in Transformer models for text generation. We study 1) how overfitting of various versions of GPT-2 models affects the quality of the generated text, and 2) which model is better at generating text in a specific style. For that purpose, we propose a novel setup for text evaluation with neural networks. Our GPT-2 models are trained on datasets of collected works of the two Romantic-era poets: Byron and Shelley. With some models, overfitting manifests by producing malformed samples, with others, the samples are always well-formed, but contain increasingly higher levels of n-grams duplicated from the original corpus. This behaviour can lead to incorrect evaluations of generated text because the plagiarised output can deceive neural network classifiers and even human judges.
To determine which model is better at preserving style before it becomes overfitted, we conduct two series of experiments with BERT-based classifiers.
Overall, our results provide a novel way of selecting the right models for fine-tuning on a specific dataset, while highlighting the pitfalls that come with overfitting, like reordering and replicating text, towards more credible creative text generation.
Item Type: | Conference or workshop item (Paper) |
---|---|
Subjects: | Q Science > QA Mathematics (inc Computing science) > QA 76 Software, computer programming, > QA76.87 Neural computers, neural networks |
Divisions: | Divisions > Division of Computing, Engineering and Mathematical Sciences > School of Computing |
Funders: | University of Kent (https://ror.org/00xkeyj56) |
Depositing User: | Marek Grzes |
Date Deposited: | 11 May 2022 17:53 UTC |
Last Modified: | 06 Dec 2022 10:28 UTC |
Resource URI: | https://kar.kent.ac.uk/id/eprint/94992 (The current URI for this page, for reference purposes) |
- Link to SensusAccess
- Export to:
- RefWorks
- EPrints3 XML
- BibTeX
- CSV
- Depositors only (login required):