Romanian Language Technology

Maria-Mirela Petrea, Dan Cristea * Dealingwith Prosody. A Computer-Assisted Language Learning Approach

Fig. 12.- Anannotation example.

Tagging is done according to the following grammar:

<utterance>::= <list_of_adnotated_segms> |
<list_of_adnotated_segms> <silence_segm>
<list_of_adnotated_segms>::= <adnotated_segm> |
<adnotated_segm> <list_of_adnotated_segms>
<adnotated_segm>::= <text> (<list_of_adjacent_pitch_segms>) |
<silence_segm> <text> (<list_of_adjacent_pitch_segms>)
<list_of_adjacent_pitch_segms>::= <adjacent_pitch_segm> |
<adjacent_pitch_segm> <pitch_interuption> <list_of_adjacent_pitch_segms>
<adjacent_pitch_segms>::= <list_of_pitch_segms> <pitch_level>
<list_of_pitch_segms>::= <pitch_segm> |
<pitch_segm> <list_of_pitch_segms>
<pitch_segm>::= <pitch_level> <pitch_move> <loudness_move>
<pitch_level> ::=W| L| M| H| X| Z
<pitch_move> ::= S| R| /| F| \
<loudness_move> ::= *| #| !| ?| +| -
<pitch_interruption>::= <silence> | <unvoiced>
<silence> ::= =
<unvoiced> ::= _
<text> ::= any string of letters, but also ' and blank

Following is the annotation result of the phrase "I'd like cream" uttered by a male speaker:

=[23.23]I'd(_[9.62]MS*[93.78]M_[40.93]LS#[14.20]LR+[62.32])
like(LR+[80.25]MS-[104.43]L_[82.41])
=[155.61]cream(_[56.44]LR+[48.08]MF-[302.13]L_[14.06])

4. Conclusions and further work

The paper presented PROSODICS, an application dealing with the problem of computer-assisted language pronunciation learning. Its realization starts from the premise of the importance of prosody (intonation, stress and rhythm) in CALL. So far, PROSODICS was used for English teaching, but it could be easily extended for other languages. In fact its only link with a certain language is the phonemes features table, and perhaps also the font used for the graphical representation of the texts. Techniques that deal with prosody in artificial systems are yet scarcely described in the literature (see [1, 2, 11]). The paper shows how a bundle of elementary speech processing techniques can be used with spectacular results in a CALL application.

PROSODICS has two main functionalities:

it helps in the creation of annotated speech data bases allowing a developer to either record or import utterances produced by native speakers of the target language, to attach the corresponding text to them in the orthographic or the phonemic transcription form, and to tag speech records on pitch/loudness criteria for laboratory studies;
it assists a student to improve her pronunciation in the target language by displaying in an easily understandable format a comparison between the master's and her own utterance, where duration and rhythm, prosodic melody and stress are put in evidence. Exercises that concentrate on word level stress or sentence level stress can be imagined. At the word level, rules of words' accentuation can be devised (in a language as Italian, for instance, where most parts of the words receive the accent on the last but one syllable, the exceptions could be an easy trap for beginners; for a language rich in tonalities as Chinese the graphical facilities that PROSODICS offers for observing tones' movements could be of valuable help). At the sentence level, final intonation, as well as the melodic line of the utterance could be subjects of special exercises. Using PROSODICS, special training applications for actors or singers should be easily projected.

Acknowledgments. PROSODICS was implemented by a team formed by the authors together with Ciprian Bacalu, at the Department of Computational Linguistics of the University of Venice. We thank Rodolfo Delmonte, the head of the Laboratory of Computational Linguistics at the University of Venice for good critics, comments and suggestions. We are also indebted to Francesco Stiffoni for his permanent disposability to answer the developers questions regarding tricks of the Macintosh environment.

161