Poul Andersen * Cooperation with Central and Eastern Europe in Language Engineering
Speech input/output is going to be a major growth area in information technology in the coming years. Scientific development depends on the quality and quantity of recorded speech material available for training and testing systems, so speech databases should be produced according to carefully defined standards, with high priority given to re-usability.
As speech scientists in Central and Eastern Europe become more closely associated with their colleagues in Western Europe, it is vital that speech database standards are not allowed to diverge. The BABEL project trains and equips a number of non-EU speech research laboratories to use existing EU speech database standards, with researchers from the C&EE partner institutions spending some time working in Western laboratories.
The project will produce a database of high scientific quality that will be used for many years in the development of European speech technology.
East European languages covered: Bulgarian, Estonian, Hungarian, Polish and Romanian.
The Romanian partner is M. Boldea, Computer Science Department, Faculty of Computer and Control Systems, Timisoara.
The six projects without Romanian participation are:
CEGLEX aims at extending the generic electronic dictionary model developed in the EUREKA GENELEX project to three central European languages and to:
The work consists in:
The aim of GRAMLEX is to facilitate the initiation, coordination and standardisation of the construction of morphological dictionary packages for French, Hungarian, Italian and Polish, including a detailed formal description of the morphology of each language. The major challenges in such an enterprise are to give the description the largest possible coverage, so that unrestricted texts can be processed; to share as much as possible of the formats, methods and algorithms; and to improve the time and space efficiency of the programs.
Keywords: lexical tagging, morphological dictionaries, lexical resources.
The goals of BILEDITA are: