Peter Roach * Speech Technology: a Look into the Future
Another big area within the applications field is the "hands and eyes busy" situation - situations where someone needs to interact with a computer but is not in a position to use a keyboard. The example most often quoted is that of aircraft pilots, but I think there are many less exotic applications. One is certainly the car phone: it is well known that drivers dialling telephone numbers while at the wheel are unsafe, and the technology exists already to allow drivers to request a telephone number by voice. Manual dialling while driving should be made illegal, and this would dramatically boost the sale of "voice-dialling" systems. There are many other applications: I have been involved in several projects with the Institute for Transport Studies in Leeds University. Research in transport engineering requires a lot of observation of traffic in motion, and researchers often have to stand on motorway bridges or railway platforms manually recording what they see. Studies of urban car parking may require regular recording of information on all the cars in a car park, while surveys of street fittings such as warning notices and road markings also have to be surveyed regularly. We found these tasks could be made much easier if the researchers were equipped with hand-held portable computers with voice recognition capability - the spoken data was entered directly into a database. In our research, we found that recogniser performance could be unsatisfactory, and could deteriorate over time as the speaker became tired, unless they received immediate feedback from the computer confirming what had been said. We also looked at the work of geologists: in inspecting core samples, the geologists were often working in difficult and dirty conditions which would have resulted in damage to most computers and would have resulted in keyboards covered in mud; however, using a radio microphone connected to a speech recogniser allowed observations of the samples to be entered instantly into the computer being used for the survey work.
Another application area for speech technology, and one with a value that everyone can see, is in helping the disabled. There are many people who are physically unable to operate a keyboard but have the power of speech. To be able to control their environment by spoken commands (open the door, switch on the heating, operate an alarm) would be a big help to such people, and voice-operated devices can provide this.
2.2. Techniques in speech recognition
Many books and papers on speech
technology devote considerable space to reviewing the heroic days
of the 1970's and 80's when many of the computational techniques
in use today were laboriously worked out. There is not space in
this paper to go through such a historical review. I would simply
like to make two basic points. Firstly, the most lasting developments
of speech technology have been the result of partnership between
specialists in computer science and electronic engineering on
the one hand and specialists in speech science and linguistics
on the other. Attempts to solve the many problems of speech recognition
simply by advanced engineering have resulted in systems that work
satisfactorily within the laboratory for an ideal speaker, but
have been unable to survive exposure to the enormous variability
of speech in the real world. The input of speech science has been
of different types in different applications, but I believe phonetic
expertise is always an essential component of a successful system.
134