外文翻译----一个实验文语转换系统在分析韵律短语的贡献(编辑修改稿)内容摘要:

ite well on word pronunciation, they fall short when it es to providing good prosody for plete sentences. Current texttospeech systems have no access to the syntactic and semantic properties of a sentence that influence phraselevel prosody. Hence rules for sentence prosody, when they are provided at all typically depend on superficial aspects of text (. punctuation) and on heuristics that vary widely in sophistication. Although such techniques often add a more natural quality to the resulting synthetic speech, they can fail in important ways, for example, by ignoring the prosodic event between a lengthy subject and a predicate, so that there is no clear prosodic boundary between right and mark in The characters on the right mark the salient features. Several authors (. Allen 1976。 Elovitz et al. 1976。 Luce et al. 1983) have suggested that prosodic differences between synthetic and natural speech are the primary, unaddressed factor leading to difficulties in the prehension of fluent synthetic speech. The relation between phraselevel prosody and its sources, however, is so poorly understood that we have no good sense of the degree to which different levels of explanationsyntactic, semantic, or pragmaticare applicable. We currently have reasonable tools for automatic syntactic analysis of a text. but there is nothing equivalently welldeveloped for semantic or pragmatic textual analysis. Thus an obvious goal is to explore the extent to which phraselevel prosody can be explained by the syntax tree and develop a detailed description of that relation. A further goal is to convert the resulting insights about this relation into a system that can work with a speech synthesizer. This allows us to test our description more adequately and perhaps also produce something that will further text tospeech technology. SYNTACTIC STRUCTURE AND PROSODIC PHRASING Beyond the word level, however, there has been little investigation of systematic connections between syntactic structure and prosodic phrasing. The psycholinguistic and acoustic investigations of Cooper and PacciaCooper (1980), Umeda (1982) and Gee and Grosjean (1983)and the prosodic theory of Selkirk (1984) are among the more notable studies and represent the two main approaches to syntax/prosody relations. In Cooper and PacciaCooper (1980) and Umeda (1982), the connection from syntax to prosodic phrasing is unmediated by any filtering process, .. they propose that the details of prosodic phrasing can be determined directly from syntactic structure by associating particular syntactic nodes (or constituent boundaries) with a phoic value, either pausing, segmental lengthening, or the blocking of the cross word conditioning of phonological rules. By contrast, Gee and Grosjean (1983) and Selkirk (1984) believe that the syntaxprosody relation is indirect: prosodic phrasing is derived by rules that refer to lefttoright ordering, length (or branching patterns), and, in the case of Selkirk grammatical function, as well as constituent membership in order to infer a hierarchical prosodic structure. But while their respective positions are quite clear, none of these studies is conclusive. All lack a syntactic framework sufficiently detailed and formalized to allow extensive testing, and most consider only a small number of sentences and sentence types. To develop our analysis, we first examined prosodic phrasing in the speech of one of us reading prose from various texts, including four instruction manuals. These texts were later augmented by a professional reading of a prose story. The boundaries between prosodic phrases were identified and then classed according to their syntactic context and semantic function. Texttospeech Synthesis The programs that make up the speech ponent are described in Liberman and Buchsbaum (personal munication). These programs take character text as input and produce digitized speech output. By annotating the input text to this system, many aspects of its operation can be overridden or modified: . the location of major and minor phrase boundaries, the stress given to words, the transcription of words and the boundaries between them, the timing of segments, and details of the pitch contour. As we will show, with our prosody system we are able to produce strings in which four boundary levels are identified and perceptually distinguished, using the current text tospeech system annotations. Prosodic Phrasing The prosody rules use information about constituent structure, grammatical role, and length to map a surface structure. The prosody tree identifies the location of phrase boundaries (signified by the nodes) and the relative strength of each boundary (signified by a number in the node). It is this information that is used to annotate the input text with escape sequences that provide the textto speech system with instructions about prosodic phrasing. In formulating our rules for building the prosodic structure, we began with the idea of simply implem。
阅读剩余 0%
本站所有文章资讯、展示的图片素材等内容均为注册用户上传(部分报媒/平媒内容转载自网络合作媒体),仅供学习参考。 用户通过本站上传、发布的任何内容的知识产权归属用户或原始著作权人所有。如有侵犯您的版权,请联系我们反馈本站将在三个工作日内改正。