Skip to main content
Kent Academic Repository

A Comprehensive Survey of Natural Language Generation Advances from the Perspective of Digital Deception

Jones, Keenan and Altuncu, Enes and Franqueira, Virginia N. L. and Wang, Yichao and Li, Shujun (2022) A Comprehensive Survey of Natural Language Generation Advances from the Perspective of Digital Deception. [Preprint] (doi:10.48550/arXiv.2208.05757) (The full text of this publication is not currently available from this repository. You may be able to access a copy if URLs are provided) (KAR id:97944)

The full text of this publication is not currently available from this repository. You may be able to access a copy if URLs are provided.
Official URL:
https://doi.org/10.48550/arXiv.2208.05757

Abstract

In recent years there has been substantial growth in the capabilities of systems designed to generate text that mimics the fluency and coherence of human language. From this, there has been considerable research aimed at examining the potential uses of these natural language generators (NLG) towards a wide number of tasks. The increasing capabilities of powerful text generators to mimic human writing convincingly raises the potential for deception and other forms of dangerous misuse. As these systems improve, and it becomes ever harder to distinguish between human-written and machine-generated text, malicious actors could leverage these powerful NLG systems to a wide variety of ends, including the creation of fake news and misinformation, the generation of fake online product reviews, or via chatbots as means of convincing users to divulge private information. In this paper, we provide an overview of the NLG field via the identification and examination of 119 survey-like papers focused on NLG research. From these identified papers, we outline a proposed high-level taxonomy of the central concepts that constitute NLG, including the methods used to develop generalised NLG systems, the means by which these systems are evaluated, and the popular NLG tasks and subtasks that exist. In turn, we provide an overview and discussion of each of these items with respect to current research and offer an examination of the potential roles of NLG in deception and detection systems to counteract these threats. Moreover, we discuss the broader challenges of NLG, including the risks of bias that are often exhibited by existing text generation systems. This work offers a broad overview of the field of NLG with respect to its potential for misuse, aiming to provide a high-level understanding of this rapidly developing area of research.

Item Type: Preprint
DOI/Identification number: 10.48550/arXiv.2208.05757
Refereed: No
Other identifier: https://arxiv.org/abs/2208.05757
Name of pre-print platform: arXiv
Uncontrolled keywords: Natural Language Generation, NLG, Digital Deception, Survey, Taxonomy
Subjects: Q Science > QA Mathematics (inc Computing science) > QA 76 Software, computer programming, > QA76.87 Neural computers, neural networks
Divisions: Divisions > Division of Computing, Engineering and Mathematical Sciences > School of Computing
University-wide institutes > Institute of Cyber Security for Society
Funders: University of Kent (https://ror.org/00xkeyj56)
Depositing User: Virginia Franqueira
Date Deposited: 13 Nov 2022 09:44 UTC
Last Modified: 10 Oct 2023 11:10 UTC
Resource URI: https://kar.kent.ac.uk/id/eprint/97944 (The current URI for this page, for reference purposes)

University of Kent Author Information

  • Depositors only (login required):

Total unique views for this document in KAR since July 2020. For more details click on the image.