Teodor Vuºcan, Emma Tamâianu, Sanda Cherata *
SILEX - a Lexico-Morphological Software for Romanian
5. Applications of SILEX
The automatic processing of Romanian
texts cannot be performed without a tool like SILEX. Its
functions, as well as the applications that SILEX make
possible, are of theoretic-descriptive relevance, as well as of
practical importance.
Here are some of these applications,
in the decreasing order of their functional complexity.
- orthographic and morphological
machine dictionary for Romanian, easy to maintain and to enrich,
available in both printable and electronic form;
- machine dictionary of homograph
forms for the Romaian language, available in both printable and
electronic form;
- spelling-checker for Romanian
texts;
- system for statistical studies
on the Romanian vocabulary, on various criteria and criteria combinations;
- support to any kind of synchronic
(and, in the future, for diachronic) research on the Romanian
language; (some examples of such research objectives are: productivity
of some derivation procedures, the relative occurrence of various
types of paradigms);
- system for lexical and grammatical
statistic studies on Romanian literary texts;
- system for concordances of
Romanian literary texts, lemmatization being automatically performed
(in most of the cases);
- educational support for computer-aided
study of Romanian grammar, in secondary education and in teaching
Romanian as a foreign language (orthography, morphology and vocabulary).
6. Considerations on SILEX implementation and efficiency
SILEX
is implemented under the FoxPro 2.7 database system, on a IBM
PC-compatible computer. It performs about 1100 anayses per minute
on a computer with 4MB RAM and about 2200 analyses per minute
on a computer with 8MB RAM. A Romanian spelling-checker is in
an advanced stage of development. SILEX was used to
spelling-check
several issues of Echinox and it proved to be useful
and effective.
The use of the lemmatization
function of SILEX in the CONCORD system,
for generating
concordances of Romanian poetic texts, proved its functionality
and efficiency.
New directions of research are
meant to extend the morphological analysis function by taking
into account the left-context of the analyzed word. In this way,
the problem of analytical word inflected forms will also be successfully
solved.
References
- *** Gramatica limbii române, Edit.Academiei,
Bucureºti, 1966.
- *** Dictionar ortografic, ortoepic ºi morfologic al
limbii române, Edit. Academiei, 1982.
- *** COLING BUDAPEST, Proceedings of the 12th International
Conference on Computational Linguistics,1988.
- *** Dicþionar ortografic cu elemente de ortoepie
ºi morfologie, Redacþia principalã a Enciclopediei
Sovietice Moldoveneºti, Chiºinãu, 1991.
- *** Indreptar ortografic, ortoepic ºi de punctuaþie,
Edit. Academiei, 1996.
- BIEN, J.S., Toward computerized dictionaries for inflexional
languages internal design philosophy, in Machine Translation,
Cambridge University Press, 1987.
- BREBAN, V., Dicþionar general al limbii române,
Edit. Enciclopedicã, 1992.
- CHERATA, SANDA, Sistem de realizare a concordanþelor
limbii poetice româneºti, în D. Tufiº
(ed.), Limbaj ºi Tehnologie, Edit. Academiei Române,
1996, pp. 215-220.
- CHERATA, SANDA, VUªCAN, T., TAMÂIANU, E., SILEX
-Un sistem lexico-morfologic computerizat pentru analiza textelor
româneºti, Dacoromania, Nr. 1, 1996 (forthcoming).
- CHERATA, SANDA, VUªCAN, T., TAMÂIANU E., SILEX
- Funcþiile de lematizare ºi de generare a paradigmelor,
Dacoromania, Nr. 2, 1996 (forthcoming).
- COTEANU, I., Gramatica de bazã a limbii române,
Edit. Albatros, 1982.
- DIMITRIU, C., Gramatica limbii române explicatã
- Morfologia, Edit. Junimea, 1979.
- GUÞU-ROMALO, VALERIA, Morfologia structuralã
a limbii române (substantiv, adjectiv, verb), Edit.
Academiei, Bucureºti, 1968.
- IRIMIA , D., Structura gramaticalã a limbii române.
VERBUL, Edit. Junimea, 1976.
- IORDAN, I., ROBU, V, Limba românã contemporanã,
Edit. Didacticã si Pedagogicã, 1978.
- KAY, M., The Dictionary of the Future and the Future of
the Dictionary, in Machine Translation, Cambridge University
Press, 1987.
- LYONS, J., Introducere în lingvistica teoreticã,
Edit. ªiinþificã, Bucureºti, 1995.
- MANOLIU, MARIA, Morfemul în lingvistica modernã,
Limba românã, XII, nr. 1, Bucureºti, 1963.
- MARCU, F., MANECA, C., Dicþionar de neologisme,
Edit. Academiei, 1978.
- MOISIL, Gr., Problèmes posés par la traduction
automatique. Le declinaison en roumain écrit, Cahiers
de linguistique thèorique et appliqué, II, Bucarest,
1962.
- ªUTEU, F., Dicþionar ortografic al limbii române,
Edit. Floarea Darurilor, Bucureºti, 1995, Edit. I.O.
- ULLMAN, J.D., Principles of Databases System, Computer
Science Press, 1982.
- ZAMPOLLI, A., Lexicological and Lexicographical Activities
at the Istituto di Lingvistica, Lingvistica Computazionale,
1983/3.
67