Skip to main content
Kent Academic Repository

Screffva: A Lexicographer's Workbench

Mills, Jon (2000) Screffva: A Lexicographer's Workbench. In: Second International Conference on Language Resources and Evaluation, 31 May - 2 June, Athens, Greece. (Access to this publication is currently restricted. You may be able to access a copy if URLs are provided) (KAR id:8381)

PDF
Language: English

Restricted to Repository staff only
[thumbnail of Mills,_Jon_(2000)_Screffva_-_A_Lexicographer's_Workbench.pdf]
Official URL:
http://www.lrec-conf.org/lrec2000/www.xanthi.ilsp....

Abstract

This paper describes the implementation of Screffva, a computer system written in Prolog that employs a parallel corpus for the automatic generation of bilingual dictionary entries. Screffva provides a lemmatised interface between a parallel corpus and its bilingual dictionary. The system has been trialled with a parallel corpus of Cornish-English bitext. Screffva is able to retrieve any given segment of text, and uniquely identifies lexemes and the equivalences that exist between the lexical items in a bitext. Furthermore the system is able to cope with discontinuous multiword lexemes. The system is thus able to find glosses for individual lexical items or to produce longer lexical entries which include part-of-speech, glosses and example sentences from the corpus. The corpus is converted to a Prolog text database and lemmatised. Equivalents are then aligned. Finally Prolog predicates are defined for the retrieval of glosses, part-of-speech and example sentences to illustrate usage. Lexemes, including discontinuous multiword lexemes, are uniquely identified by the system and indexed to their respective segments of the corpus. Insofar as the system is able to identify specific translation equivalents in the bitext, the system provides a much more powerful research tool than existing concordancers such as ParaConc, WordSmith, XCorpus and Multiconcord. The system is able to automatically generate a bilingual dictionary which can be exported and used as the basis for a paper dictionary. Alternatively the system can be used directly as an electronic bilingual dictionary.

Item Type: Conference or workshop item (Paper)
Subjects: T Technology
P Language and Literature
Divisions: Divisions > Division of Arts and Humanities > School of Culture and Languages
Depositing User: Francis Mills
Date Deposited: 01 Jun 2009 06:07 UTC
Last Modified: 16 Nov 2021 09:46 UTC
Resource URI: https://kar.kent.ac.uk/id/eprint/8381 (The current URI for this page, for reference purposes)

University of Kent Author Information

Mills, Jon.

Creator's ORCID:
CReDIT Contributor Roles:
  • Depositors only (login required):

Total unique views for this document in KAR since July 2020. For more details click on the image.