Alexandru Timotin, Florin Teodor Tãnãsescu * Structures for a Thesaurus of Technical Terminology
The IEC proposed the preparation of an English edition, containing the same concepts. This led to a full revision of the information structures in the form of a multilingual data bank of concepts. In 1989 a bilingual version, French and English, with 26,000 concepts was ready [7]. The logical and lexical structures were developed by Prof. Dr. Eng. Al. Timotin of PUB. The information structures and the software were created by Dr. Eng. S. Gheþaru of ICPE. Besides the concepts coming from IEV, there were included concepts defined in other IEC publications, as well as additional concepts, not defined but identified through the context in all these publications and requested to fill in the gaps of the obtained hierarchical structures.
Ever since, the moral and physical degradation of the informatics equipment that had been used imposed the shifting on DOS-operating computers and the thorough revision of the software, with many difficulties referring to the acquirement of the necessary equipment (PC network). The work of collecting the terms and concepts has been continued, in order to include all the concepts defined until 1993, with some temporary technical means. A third language, Romanian, has been added to the data bank, and the introduction of a file of definitions in Romanian is in course of completion. New informatics equipment became operational at the beginning of 1995. The requirement for a transportable version led to a complete revision of the data bank and of the software by Eng. I. Lopãtan of PUB and Mathematician P. Onica of ICPE. Developed due to the editing of new IEV chapters, in 1997 the thesaurus acquired funds have about 30,000 concepts and about 33,000 terms in each of the three languages.
2.1. Features of a multilingual thesaurus of concepts
A thesaurus of technical concepts is an identification tool of a concept
within a limited conceptual fund. This aspect facilitates the
identification of the relations which link the concept to its closest
concepts, i.e. establishing its semantic vicinity. The semantic
vicinity can identify the concept in a more suggestive way - in terms of
its current usage - than a definition, which is often restrictive.
A thesaurus emphasizes sense correlation in a way that is invariant to
the change of presentation language.
A multilingual thesaurus of concepts can facilitate the monitoring of
the coherence of dictionaries, and allows their subsequent development.
It can be useful to authors and translators, especially when it is associated
with an internationally standardized terminology for texts of
scientific, commercial, and legal importance. A multilingual thesaurus
of concepts can efficiently support the computer-aided translation.
2.2. The data bank of a thesaurus of concepts
The multilingual thesaurus of concepts is a relational data bank
composed of files (Annex 1), each file having entries with the
same structure, containing particular information attached to the
data - concept, relation, term, document-source, etc.
The first file is the file of concepts, each concept
being represented in an abstract way in an entry by an alphanumeric code,
and certain taxonomic and logical characteristics. This file contains the
conceptual base of the thesaurus, a language-independent one.
The significance of the concepts is suggested by the semantic relations
between the considered concept and others from the same fund. These are
binary relations, hierarchical and nonhierarchical (associative), grouped
in a file of relations between concepts, language-independent too,
which describes the semantic structure of the thesaurus. The data bank of
a thesaurus also contains a file of sources, i.e. of the
language-independent characteristics, of the original documents
(type, edition, year, etc.).
106