Quality of Synthetic Speech : Perceptual Dimensions, Influencing Factors, and Instrumental Assessment

Label
Quality of Synthetic Speech : Perceptual Dimensions, Influencing Factors, and Instrumental Assessment
Title
Quality of Synthetic Speech
Title remainder
Perceptual Dimensions, Influencing Factors, and Instrumental Assessment
Language
eng
Cataloging source
MiAaPQ
Literary form
non fiction
Nature of contents
dictionaries
Series statement
T-Labs Series in Telecommunication Services Ser
Link
http://libproxy.rpi.edu/login?url=https://ebookcentral.proquest.com/lib/rpi/detail.action?docID=4838376
Carrier category
online resource
Carrier category code
cr
Carrier MARC source
rdacarrier
Color
multicolored
Content category
text
Content type code
txt
Content type MARC source
rdacontent
Contents
  • Acknowledgements -- Contents -- Acronyms -- Abstract -- 1 Introduction -- 1.1 Motivation -- 1.2 Outline -- References -- 2 Speech Synthesis -- 2.1 Setup of a Speech Synthesizer -- 2.1.1 Natural Language Processing (NLP) -- 2.1.2 Prosody Generation -- 2.1.3 Concatenation and Generation of Speech-Signal Parameters -- 2.1.4 Speech Signal Generation -- 2.2 The Mary Text-to-Speech System (MaryTTS) -- References -- 3 Auditory and Instrumental Quality Evaluation Metrics -- 3.1 What Is Perceptual Quality? -- 3.2 Taxonomy for the Quality Assessment of Synthetic Speech -- 3.2.1 Glass Box Versus Black Box -- 3.2.2 Laboratory Versus Field Studies -- 3.2.3 Linguistic Versus Acoustic -- 3.2.4 Auditory Versus Instrumental -- 3.3 Auditory Quality Evaluation Metrics -- 3.3.1 Functional Tests -- 3.3.2 Judgment Tests -- 3.4 Instrumental Quality Evaluation Metrics -- 3.4.1 Reference-Based Measures -- 3.4.2 Reference-Free Measures -- References -- 4 Perceptual Quality Dimensions -- 4.1 State-of-the-Art Perceptual Quality Dimensions -- 4.1.1 Study: Kraft and Portele (Kraft1995) -- 4.1.2 Study: Mayo et al. I (Mayo2005) -- 4.1.3 Study: Viswanathan and Viswanathan (Vis2005) -- 4.1.4 Study: Seget (Seget2007) -- 4.1.5 Study: Hinterleitner (Hint2010) -- 4.1.6 Study: Mayo et al. II (Mayo2011) -- 4.1.7 Restrictions of Discussed Studies
  • 4.2 Semantic Differential and Factor Analysis -- 4.2.1 Experimental Setup -- 4.2.2 Statistical Analysis -- 4.3 Sorting Task and Multidimensional Scaling -- 4.3.1 Experimental Setup -- 4.3.2 Statistical Analysis -- 4.4 Summary of the SD/FA and ST/MDS Studies -- 4.5 Universal Perceptual Quality Dimensions -- 4.5.1 Naturalness of Voice -- 4.5.2 Prosodic Quality -- 4.5.3 Fluency and Intelligibility -- 4.5.4 Absence of Disturbances -- 4.5.5 Calmness -- 4.5.6 Instructions for TTS Quality Assessment -- 4.6 Summary -- References -- 5 Influencing Factors on Perceptual Quality -- 5.1 Influence of the Application -- 5.1.1 Pretest -- 5.1.2 Main Test -- 5.1.3 Conclusions -- 5.2 Influence of a Speaker's Voice -- 5.2.1 Experimental Setup -- 5.2.2 Statistical Analysis -- 5.2.3 Conclusions -- 5.3 Influence of Corpus Size and Utterance -- 5.3.1 Experimental Setup -- 5.3.2 Statistical Analysis -- 5.3.3 Conclusions -- 5.4 Summary -- References -- 6 Instrumental Quality Assessment -- 6.1 Reference-Based Measures -- 6.1.1 State of the Art
  • 6.1.2 Quality Prediction -- 6.1.3 Conclusions -- 6.2 Reference-Free Measures -- 6.2.1 State of the Art -- 6.2.2 Linear Regression Models -- 6.2.3 Conclusions -- 6.3 Summary -- References -- 7 Requirements for the Integration of an Instrumental Quality Measure into a Concatenative TTS System -- 7.1 Regular Perception Model (RPM) -- 7.1.1 Model Training -- 7.1.2 Results -- 7.2 Unit Selection Voice Creation in MaryTTS -- 7.2.1 Test Database -- 7.2.2 Generation of Alternative Versions -- 7.3 Experimental Setup -- 7.3.1 Pairwise Comparison (PC) -- 7.3.2 Semantic Differential (SD) -- 7.3.3 Test Procedure -- 7.4 Statistical Analysis -- 7.4.1 PC Data -- 7.4.2 SD Data -- 7.4.3 Discussion -- 7.5 Quality Prediction -- 7.5.1 RPM1 Performance -- 7.5.2 RPM2 Training -- 7.5.3 RPM2 Performance -- 7.6 Automatic Selection of Alternative Versions -- 7.7 Potential Improvements -- 7.8 Summary -- References -- 8 Conclusions and Future Work -- 8.1 Summary -- 8.2 Conclusions -- 8.3 Future Work -- 8.3.1 Perceptual Quality Dimensions -- 8.3.2 Influencing Factors -- 8.3.3 Instrumental Quality Measurement -- Appendix Statistical Analysis of Chap. 7 -- A.1 Results of the PC Test -- A.2 Results of the SD
Dimensions
unknown
Discovery link
http://opac.lib.rpi.edu/record=b4387906
Extent
1 online resource (170 pages)
Form of item
online
Isbn
9789811037344
Media category
computer
Media MARC source
rdamedia
Media type code
c
Sound
unknown sound
Specific material designation
remote

Library Locations

    • Folsom Library
      110 8th St, Troy, NY, 12180, US
      42.729766 -73.682577