SILEX - a Lexico-Morphological Software for Romanian

Teodor Vuºcan, Emma Tamâianu, Sanda Cherata


1. Introduction

The SILEX software system was developed at the Centre of Text Analysis (CTA) of the Faculty of Letters of "Babeº-Bolyai" University. CTA was founded in 1990; its main research projects belong to the fields of lexical analysis for the Romanian language and text processing - more specifically, the production and publication of concordances for both old and contemporary Romanian poetical texts.

This paper presents SILEX, a multi-functional software system which makes it possible to solve a rather wide range of problems of computational linguistics concerning the Romanian language and the processing of Romanian texts.

The main functions of SILEX are:

Recently, the SILEX system was enriched with several other functions: the automatic generation of lexical homographs, of homonyms correlated with morphological classes etc.

In performing these functions, SILEX uses an internal machine dictionary, which contains, in a structured form, a Romanian vocabulary of approximately 40,000 basic entries with all the morphological information needed for defining the systemic and (grammatical-) textual status of any lexical unit. These entries, as will be shown later, cover a Romanian vocabulary of approximately 60,000 words.

This dictionary is used exclusively for performing the two main functions of SILEX. For building this internal dictionary and for other tasks related to the Romanian vocabulary, there are a number of auxiliary dictionaries, one for each main lexico-morphologic class. These dictionaries contain specific information for the morphological class they refer to and will be presented in section 2.3. The internal dictionary used by SILEX is created through a fusion process, which consists of selectively picking up information from all these auxiliary dictionaries and in putting it together in a structured, uniform manner. Henceforward, the internal dictionary of SILEX will be referred to as the SILEX dictionary, or simply dictionary.


63

Previous Index Next