Romanian Language Technology

Peter Roach * Speech Technology: a Look into the Future

References

W.A. LEA, (1980a), The value of speech recognition systems, in Lea 1980b, pp. 3-18.
W.A. LEA, (ed.) (1980b), Trends in Speech Recognition, Prentice-Hall.
J. LAVER, (1994), Speech technology overview, in Asher, R. (ed.), Encyclopaedia of Linguistics.
E. KELLER, (ed.) (1994), Fundamentals of Speech Synthesis and Speech Recognition, London, John Wiley.
J. BERNSTEIN and H. FRANCO, (1996), Speech recognition by computer, in Lass, N.J. (ed.) Principles of Experimental Phonetics, Mosby, pp. 408-434.
H. JAVKIN, (1996), Speech analysis and synthesis, in Lass, N.J. (ed.), Principles of Experimental Phonetics, Mosby, pp. 245-273.
M.A. JACK and J. LAVER, (1988), Aspects of Speech Technology, Edinburgh University Press.
K-F. LEE et al. (1990), Speech recognition using Hidden Markov Models: a CMU perspective, Speech Comm. 9.
P. ROACH, (ed.) (1992), Computing in Linguistics & Phonetics, Academic Press.
F. FALLSIDE and W.A. WOODS, (eds.) (1985), Computer Speech Processing, Prentice-Hall.
W.A. AINSWORTH, (1988), Speech Recognition by Machine, IEE, Peter Peregrinus.
J.K. CHEN, L.S. LEE and F.K. SOONG, (1995), Large vocabulary, word-based Mandarin dictation system, Proceedings of Eurospeech, Madrid, vol.1, pp. 285-291.
A. ASADI, D. LUBENSKY et al. (1995), Combining speech algorithms into a natural application of speech technology for telephone network services, Proceedings of Eurospeech, Madrid, vol.1, pp. 273-276.
P. DALSGAARD and A. BAEKGAARD, (1990), Recognition of continuous speech using neural nets and expert system processing, Speech Comm. 9.
D.E. RUMELHART and J. MCCLELLAND, (1986), Parallel Distributed Processing, M.I.T.
K. SILVERMAN, (1984), F0 perturbations as a function of voicing of prevocalic and postvocalic stops and fricatives, and of syllable stress, Proceedings of the Institute of Acoustics.
M. KOHONEN and K. TORKKOLA, (1990), Using self-organizing maps and multi-layered feed-forward nets to obtain phonemic transcriptions of spoken utterances, Speech Comm. 9.
A. MISHEVA, S. DIMITROVA et al. (1995), Bulgarian Speech Database - a Pilot Study, Proceedings of Eurospeech, Madrid, vol.1, pp. 859-863.
G. NIEDERMAIR, M. STREIT and H. TROPF, (1990), Linguistic processing related to speech understanding in SPICOS II, Speech Comm. 9.
P.J. ROACH, G.O. KNOWLES, T. VARADI and S.C. ARNFIELD, (1994), MARSEC: a MAchine-Readable Speech Database, Journal of the International Phonetic Association, vol.23:2, pp. 47-54.
P.J. ROACH, G.O. KNOWLES, T. VARADI, N. GHALI and S.C. ARNFIELD, (1992), MARSEC Speech Database: CD-ROM disk.
P.J. ROACH, S. ARNFIELD, W. BARRY, J. BALTOVA, M. BOLDEA, A. FOURCIN, W. GONET, R. GUBRYNOWICZ, E. HALLUM, L. LAMEL, K. MARASEK, A. MARCHAL, E. MEISTER and K. VICSI, (1996), BABEL: an Eastern European Multi-Language Database, Proceedings of 4th International Congress of Spoken Language Processing, Philadelphia, SaP2P1.1.
D. CHAN, A.J. FOURCIN et al. (1995), EUROM - a Spoken Language Resource for the EU, Proceedings of Eurospeech, Madrid, vol.1, pp. 867-871.
S. YOUNG, (1990), Use of dialogue, pragmatics and semantics to enhance speech recognition, Speech Comm. 9.
R. LINGGARD, (1985), Electronic Synthesis of Speech, Cambridge University Press.
P. ROACH and J. HARTMAN, (1997), The Daniel Jones English Pronouncing Dictionary, 15th edition, Cambridge University Press.
J.C. WELLS, (1991), Longman Pronunciation Dictionary, Longman.
K. SILVERMAN, (1990), The separation of prosodies: comments on Kohler's paper, in LabPhon 1, eds. J. Kingston and M. Beckman, pp. 139-151.
H.D. WANG, D. DEGRYSE and F. CARRARO, (1993), A prosody modification approach for auditory user feedback in the SPELL pronunciation teaching system, Proceedings of Eurospeech 93, Berlin.
P. ROACH and S. ARNFIELD, (1995), Aligning prosodic transcription with the time dimension, in G.N. Leech, G. Myers and J. Thomas (eds.) Spoken English on Computer, Longman.
I. MURRAY and J. ARNOTT, (1993), Toward the simulation of emotion in synthetic speech: a review of the literature on human vocal emotion, JASA, 93(2), pp. 1097-1108.
G. BLOOTHOOFT, V. HAZAN, D. HUBER and J. LLISTERRI, (1995), European Studies in Phonetics and Speech Communication, Utrecht: OTS.
P. BAGSHAW, S.M. HILLER and M.A. JACK, (1993), Enhanced pitch tracking and the processing of F0 contours for computer aided intonation teaching, Proceedings of Eurospeech 93, Berlin.
S. HILLER, E. ROONEY, J.P. LEFEVRE and M. JACK, (1993a), SPELL: A pronunciation training device based on speech technology, Proceedings of ESCA/NATO Workshop on Applications of Speech Technology, Lautrach, Germany.
S. HILLER, E. ROONEY, J.P. LEFEVRE and M. JACK, (1993b), SPELL: an automated system for computer-aided pronunciation teaching, Proceedings of Eurospeech 93, Berlin.
S.M. HILLER, E. ROONEY, R. VAUGHAN, M. ECKERT, J. LAVER and M. JACK, (1993c), An automated system for computer-aided pronunciation learning, paper presented at CALL 93: "Reactive and Creative CALL", University of Exeter.
E. ROONEY, R. VAUGHAN, S. HILLER, F. CARRARO and J. LAVER, (1993), Training vowel pronunciation using a computer-aided teaching system, Proceedings of Eurospeech 93, Berlin.
C. HIESTER and J. ABERCROMBIE, (1994), Penn's virtual language lab on the Internet, (available on the Internet at http://philae.sas.upenn.edu).
