Speech Synthesis And Recognition Holmes Pdf Download
This extensively reworked and updated new edition of Speech Synthesis and Recognition is an easy-to-read introduction to current speech technology. Aimed at advanced undergraduates and graduates in electronic engineering, computer science and information technology, the emphasis is on explaining und.
You are here:Speech Synthesis
The PDF links in the Readings column will take you to PDF versions of all. The paper is not required reading). Key for sources of readings: Holmes: Speech Synthesis and Recognition, J. Holmes R+J: Fundmentals of Speech Recognition, Rabiner, Juang J+M: Speech and Language Processing, Jurafsky, Martin, 2nd ed. Speech Applications — coding, synthesis, recognition, understanding, verification, language translation, speed-up/slow-down 5 Speech Applications. We look first at the top of the speech processing stack—namely applications –speech coding –speech synthesis –speech recognition and understanding –other speech applications 6 Decom.
JCMSTVolume 28, Number 2, ISSN 0731-9258Publisher: Association for the Advancement of Computing in Education (AACE), Waynesville, NC USAWondershare dr.fone keygen iphone 5.
Abstract
Speech technology – especially automatic speech recognition – has now advanced to a level where it can be of great benefit both to able-bodied people and those with various disabilities. In this paper we describe an application “TalkMaths” which, using the output from a commonly-used conventional automatic speech recognition system, enables the user to dictate mathematical expressions in a relatively straightforward way. These then get converted into electronic formats, so that they can be embedded in a document and/or displayed in an editor or web browser. This process can be used for preparing teaching material, assignments, or entering mathematical content for online tests. Our system does not require the user to have extensive knowledge of the syntax of any markup language or mathematical document specification language, so that learning to use it should be relatively straightforward for non-specialists. The way in which our system analyses, converts and encodes the spoken mathematical expressions is a novel approach.
Citation
Wigmore, A., Hunter, G., Pflügel, E., Denholm-Price, J. & Binelli, V. (2009). Using Automatic Speech Recognition to Dictate Mathematical Expressions: The Development of the “TalkMaths” Application at Kingston University. Journal of Computers in Mathematics and Science Teaching, 28(2), 177-189. Waynesville, NC USA: Association for the Advancement of Computing in Education (AACE). Retrieved September 14, 2019 from https://www.learntechlib.org/primary/p/30301/.
© 2009Association for the Advancement of Computing in Education (AACE)
Keywords
References
View References & Citations Map- Holmes, J. & Holmes, W. (2001). Speech Synthesis and Recognition (2nd ed.). London: taylor& Francis. Hunter, G.J.A., Pfluegel, E. & Jalan, S. (2006). The Development of Speech Interfaces to Enhance I.T. Access for Physically Disabled Students. Uk: kingston university, faculty of cism technical report. Hunter, G.J.A., Pfluegel, E. & Jalan, S. (2007). “ku-talk” – A Speech User-Interface for an Intelligent Working, Learning and Teaching Environment. 3rd IET International Conference on Intelligent Environments (IE 07), 124-130.
These references have been extracted automatically and may have some errors.Signed in users can suggest corrections to these mistakes.
Suggest Corrections to ReferencesAlso Read
Related Collections
- L. Baum, An inequality and associated maximization technique in statistical estimation for probabilistic functions of Markov processes, Inequalities, III (1972), pp. 1–8.Google Scholar
- A. Dempster, N. Laird, AND D. Rubin, Maximum likelihood from incomplete data via the em algorithm, Journal of the Royal Statistical Society, Series B, 39 (1977), pp. 1–38.MathSciNetzbMATHGoogle Scholar
- L. Deng, M. Aksmanovic, D. Sun, AND J. Wu, Speech recognition using hidden Markov models with mixtures of trend functions, IEEE Trans Speech and Audio Processing, 2 (1994), pp. 507–520.CrossRefGoogle Scholar
- L. Deng AND J. Ma, Spontaneous speech recognition using a statistical coarticulatory model for the vocal-tract-resonance dynamics, Journal of the Acoustical Society of America, 108 (2000), pp. 1–13.CrossRefGoogle Scholar
- V. Digalakis, Segment-based stochastic models of spectral dynamics for continuous speech recognition, PhD thesis, Boston University, 1992.Google Scholar
- B.C. Dupree, Formant coding of speech using dynamic programming, Electronics Letters, 20 (1980), pp. 279–280.CrossRefGoogle Scholar
- J. Fiscus, W. Fisher, A. Martin, M. Przybocki, AND D. Pallett, 2000 nist evaluation of conversational speech recognition over the telephone: English and mandarin performance results, in Proceedings of the 2000 Speech Transcription Workshop, University of Maryland, 2000.Google Scholar
- M.J. Gales AND S.J. Young, Segmental hidden Markov models, in EUROSPEECH, Berlin, 1993, pp. 1611–1614.Google Scholar
- P. Garner AND W. Holmes, On the robust incorporation of formant features into hidden Markov models for automatic speech recognition, in ICASSP, Seattle, 1998, pp. 1–4.Google Scholar
- J. Garofolo, L.F. Lamel, W. Fisher, J. Fiscus, D. Pallett, AND N. Dahlgren, The darpa tirait acoustic-phonetic continuous speech corpus cdrom, ntis order number pbOl-100354, available from 1dc, 1993.Google Scholar
- H. Glsh AND K. Ng, A segmental speech model with applications to word spotting, in ICASSP, Minneapolis, 1993, pp. 447–450.Google Scholar
- J.N. Holmes, A parallel-formant synthesizer for machine voice output, in Computer Speech Processing, 1985.Google Scholar
- J.N. Holmes, W.J. Holmes, AND P.N. Garner, Using formant frequencies in speech recognition, in EUROSPEECH, Rhodes, 1997.Google Scholar
- W.J. Holmes, Modelling segmental variability for automatic speech recognition, PhD thesis, University of London, 1997.Google Scholar
- W.J. Holmes, Towards a unified model for low bit-rate speech coding using a recognitionsynthesis approach, in ICSLP, Sydney, 1998.Google Scholar
- W.J. Holmes AND M.J. Russell, Probabilistic-trajectory segmental HMMs, Computer Speech and Language, 13 (1999), pp. 3–37.CrossRefGoogle Scholar
- M. Ismail AND K.M. Ponting, Between recognition and synthesis-300 bits/second speech coding, in EUROSPEECH, Rhodes, 1997, pp. 441–444.Google Scholar
- K.F. Lee AND H.W. Hon, Speaker-independent phone recognition using hidden Markov models, ASSP, 37 (1989), pp. 1641–1648.CrossRefGoogle Scholar
- L.A. Liporace, Maximum likelihood estimation for multivariate observations of Markov sources, IT, 28 (1982), pp. 729–734.MathSciNetzbMATHGoogle Scholar
- M. Ostendorf, V.V. Digalakis, AND O.A. Kimball, From HMM’s to segment models: A unified view of stochastic modeling for speech recognition, IEEE Trans Speech and Audio Processing, 4 (1996), pp. 360–378.CrossRefGoogle Scholar
- D. Pallett, J. Fiscus, J. Garofolo, A. Martin, AND M. Przybocki, 1998 broadcast news benchmark test results: English and non-english word error rate performance measures, in Proceedings of the DARPA Broadcast News Workshop, Virginia, 1999.Google Scholar
- J. Picone AND G. Doddington, A phonetic vocoder, in ICASSP, Glasgow, 1989, pp. 580–583.Google Scholar
- C. Ribeiro AND I. Trancoso, Improving speaker recognisability in phonetic vocoders, in ICSLP, Sydney, 1998, pp. 2611–2614.Google Scholar
- H.B. Richards AND J.S. Bridle, The hdm: a segmental hidden dynamic model of coarticulation, in ICASSP, Phoenix, 1999.Google Scholar
- M.J. Russell AND W.J. Holmes, Progress towards a unified model for speech pattern processing, Proc. IOA, 20 (1998), pp. 21–28.Google Scholar
- M.J. Russell AND R.K. Moore, Explicit modelling of state occupancy in hidden Markov models for automatic speech recognition, in ICASSP, 1985, pp. 5–8.Google Scholar
- K. Tokuda, T. Masuko, J. Hiroi, T. Kobayahi, AND T. Kitamura, A very low bit rate speech coder using HMM-based speech recognition/synthesis techniques, in ICASSP, Seattle, 1998, pp. 609–612.Google Scholar
- P. Zolfaghari AND T. Robinson, A segmentai formant vocoder based on linearly varying mixture of Gaussians, in EUROSPEECH, Rhodes, 1997, pp. 425–428.Google Scholar
Comments are closed.