Skip to main content
Kent Academic Repository

Improving Language Modelling with Noise-contrastive estimation

Liza, Farhana Ferdousi and Grzes, Marek (2017) Improving Language Modelling with Noise-contrastive estimation. [Preprint] (doi:10.48550/arXiv.1709.07758) (The full text of this publication is not currently available from this repository. You may be able to access a copy if URLs are provided) (KAR id:98779)

The full text of this publication is not currently available from this repository. You may be able to access a copy if URLs are provided.
Official URL:
https://doi.org/10.48550/arXiv.1709.07758

Abstract

Neural language models do not scale well when the vocabulary is large. Noise-contrastive estimation (NCE) is a sampling-based method that allows for fast learning with large vocabularies. Although NCE has shown promising performance in neural machine translation, it was considered to be an unsuccessful approach for language modelling. A sufficient investigation of the hyperparameters in the NCE-based neural language models was also missing. In this paper, we showed that NCE can be a successful approach in neural language modelling when the hyperparameters of a neural network are tuned appropriately. We introduced the 'search-then-converge' learning rate schedule for NCE and designed a heuristic that specifies how to use this schedule. The impact of the other important hyperparameters, such as the dropout rate and the weight initialisation range, was also demonstrated. We showed that appropriate tuning of NCE-based neural language models outperforms the state-of-the-art single-model methods on a popular benchmark.

Item Type: Preprint
DOI/Identification number: 10.48550/arXiv.1709.07758
Refereed: No
Other identifier: https://arxiv.org/abs/1709.07758
Name of pre-print platform: arXiv
Subjects: Q Science
Divisions: Divisions > Division of Computing, Engineering and Mathematical Sciences > School of Computing
Funders: University of Kent (https://ror.org/00xkeyj56)
Depositing User: Marek Grzes
Date Deposited: 06 Dec 2022 10:51 UTC
Last Modified: 05 Nov 2024 13:04 UTC
Resource URI: https://kar.kent.ac.uk/id/eprint/98779 (The current URI for this page, for reference purposes)

University of Kent Author Information

Liza, Farhana Ferdousi.

Creator's ORCID:
CReDIT Contributor Roles:

Grzes, Marek.

Creator's ORCID: https://orcid.org/0000-0003-4901-1539
CReDIT Contributor Roles:
  • Depositors only (login required):

Total unique views for this document in KAR since July 2020. For more details click on the image.