Peter Roach *
Speech Technology: a Look into the Future
References
- W.A. LEA, (1980a), The value of speech recognition
systems, in Lea 1980b, pp. 3-18.
- W.A. LEA, (ed.) (1980b), Trends in Speech Recognition,
Prentice-Hall.
- J. LAVER, (1994), Speech technology overview, in Asher,
R. (ed.), Encyclopaedia of Linguistics.
- E. KELLER, (ed.) (1994),
Fundamentals of Speech Synthesis
and Speech Recognition, London, John Wiley.
- J. BERNSTEIN and H. FRANCO, (1996), Speech recognition
by computer, in Lass, N.J. (ed.) Principles of Experimental
Phonetics, Mosby, pp. 408-434.
- H. JAVKIN, (1996), Speech analysis and synthesis, in
Lass, N.J. (ed.), Principles of Experimental Phonetics, Mosby,
pp. 245-273.
- M.A. JACK and J. LAVER, (1988), Aspects of Speech Technology,
Edinburgh University Press.
- K-F. LEE et al. (1990), Speech recognition using Hidden
Markov Models: a CMU perspective, Speech Comm. 9.
- P. ROACH, (ed.) (1992), Computing in Linguistics &
Phonetics, Academic Press.
- F. FALLSIDE and W.A. WOODS, (eds.) (1985),
Computer Speech Processing, Prentice-Hall.
- W.A. AINSWORTH, (1988),
Speech Recognition by Machine, IEE,
Peter Peregrinus.
- J.K. CHEN, L.S. LEE and F.K. SOONG, (1995),
Large vocabulary,
word-based Mandarin dictation system, Proceedings of Eurospeech,
Madrid, vol.1, pp. 285-291.
- A. ASADI, D. LUBENSKY et al. (1995),
Combining speech algorithms
into a natural application of speech technology for telephone
network services, Proceedings of Eurospeech, Madrid,
vol.1, pp. 273-276.
- P. DALSGAARD and A. BAEKGAARD, (1990),
Recognition of continuous
speech using neural nets and expert system processing, Speech
Comm. 9.
- D.E. RUMELHART and J. MCCLELLAND, (1986),
Parallel Distributed
Processing, M.I.T. Press.
- K. SILVERMAN, (1984),
F0 perturbations
as a function of voicing of prevocalic and postvocalic stops and
fricatives, and of syllable stress, Proceedings of the Institute
of Acoustics.
- M. KOHONEN and K. TORKKOLA, (1990),
Using self-organizing
maps and multi-layered feed-forward nets to obtain phonemic transcriptions
of spoken utterances, Speech Comm. 9.
- A. MISHEVA, S. DIMITROVA et al. (1995),
Bulgarian Speech
Database - a Pilot Study, Proceedings of Eurospeech, Madrid,
vol.1, pp. 859-863.
- G. NIEDERMAIR, M. STREIT and H. TROPF, (1990),
Linguistic
processing related to speech understanding in SPICOS II, Speech
Comm. 9.
- P.J. ROACH, G.O. KNOWLES, T. VARADI and S.C. ARNFIELD, (1994),
MARSEC: a MAchine-Readable Speech Database, Journal of
the International Phonetic Association, vol.23:2, pp.47-54.
- P.J. ROACH, G.O. KNOWLES, T. VARADI, N. GHALI and S.C. ARNFIELD,
(1992), MARSEC Speech Database: CD-ROM disk.
- P.J. ROACH, S. ARNFIELD, W. BARRY, J. BALTOVA, M. BOLDEA,
A. FOURCIN, W. GONET, R. GUBRYNOWICZ, E. HALLUM, L. LAMEL, K.
MARASEK, A. MARCHAL, E. MEISTER and K. VICSI, (1996), BABEL:
an Eastern European Multi-Language Database, Proceedings of
4th International Conference on Spoken Language Processing, Philadelphia,
SaP2P1.1.
- D. CHAN, A.J. FOURCIN et al. (1995),
EUROM - a Spoken Language
Resource for the EU, Proceedings of Eurospeech, Madrid,
vol.1, pp. 867-871.
- S. YOUNG, (1990),
Use of dialogue, pragmatics and semantics
to enhance speech recognition, Speech Comm. 9.
- R. LINGGARD, (1985),
Electronic Synthesis of Speech, Cambridge
University Press.
- P. ROACH and J. HARTMAN, (1997),
The Daniel Jones English
Pronouncing Dictionary, 15th edition, Cambridge University
Press.
- J.C. WELLS, (1991),
Longman Pronunciation Dictionary, Longman.
- K. SILVERMAN, (1990),
The separation of prosodies: comments
on Kohler's paper, in LabPhon 1, eds. J. Kingston and
M. Beckman, pp. 139-151.
- H.D. WANG, D. DEGRYSE and F. CARRARO, (1993),
A prosody
modification approach for auditory user feedback in the SPELL
pronunciation teaching system, Proceedings of Eurospeech 93,
Berlin.
- P. ROACH and S. ARNFIELD, (1995),
Aligning prosodic transcription
with the time dimension, in G.N. Leech, G. Myers and J. Thomas
(eds.) Spoken English on Computer, Longman.
- I. MURRAY and J. ARNOTT, (1993),
Toward the simulation
of emotion in synthetic speech: a review of the literature on
human vocal emotion, JASA, 93(2), pp.1097-1108.
- G. BLOOTHOOFT, V. HAZAN, D. HUBER and J. LLISTERRI,
(1995),
European Studies in Phonetics and Speech Communication, Utrecht:
OTS.
- P. BAGSHAW, S.M. HILLER and M.A. JACK, (1993),
Enhanced
pitch tracking and the processing of F0 contours for computer
aided intonation teaching, Proceedings of Eurospeech 93,
Berlin.
- S. HILLER, E. ROONEY, J.P. LEFEVRE and M. JACK, (1993a),
SPELL:
A pronunciation training device based on speech technology,
Proceedings of ESCA/NATO Workshop on Applications of Speech Technology,
Lautrach, Germany.
- S. HILLER, E. ROONEY, J.P. LEFEVRE and M. JACK, (1993b),
SPELL:
an automated system for computer-aided pronunciation teaching,
Proceedings of Eurospeech 93, Berlin.
- S.M. HILLER, E. ROONEY, R. VAUGHAN, M. ECKERT, J. LAVER and
M. JACK, (1993c), An automated system for computer-aided pronunciation
learning, paper presented at CALL 93: "Reactive and Creative
CALL", University of Exeter.
- E. ROONEY, R. VAUGHAN, S. HILLER, F. CARRARO and J. LAVER,
(1993), Training vowel pronunciation using a computer-aided
teaching system, Proceedings of Eurospeech 93, Berlin.
- C. HIESTER and J. ABERCROMBIE, (1994),
Penn's virtual language lab on the Internet,
(Internet under address
http://philae.sas.upenn.edu).