Syntactical Analysis of Hungarian Sentences to Produce Prosodic Information for Speech Synthesis

Academic year: 2022

Péter Olaszi

Linguistics consultant:

Ilona Koutny, Uniwersytet Adama Mickiewicza, Poznań, Poland

Speech synthesis advisors:

Gábor Olaszy, Hungarian Academy of Sciences, Linguistics Institute
Géza Németh, TUB DTT, Speech Research Laboratory

Telecommunication-related speech research has a long tradition at the Department of Telecommunication and Telematics of the Technical University of Budapest. Good results were achieved by the speech recognition groups led by Klára Vicsi and Péter Tatai. The speech synthesis group, coordinated by Géza Németh and Gábor Olaszy, has been developing the Multivox multilingual speech synthesizer, available for Hungarian, German, Italian, Portuguese and other languages. Our aim now is to produce more natural-sounding synthesized speech.

One way to improve speech quality is to add prosodic information to the text. Prosody, i.e. the changes in fundamental frequency, amplitude and rhythm, is generally not carried by the written text. A human reader, recognizing the syntactic structures and the actual meaning of the text, indicates the focus and adds emphasis and pauses to the sentence. If prosodic control sequences are provided, the synthesizer can parse this additional information and use it when forming the intonation. This results in more natural-sounding speech. The control sequences can be added to the text either manually or automatically.
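As a rough illustration of what adding control sequences to the text could mean, the following sketch renders a word list with inline markers. The marker strings `[S]` (stress) and `[P]` (pause), the `MarkedWord` type and the function name are assumptions made for this example only; they are not the control sequence format of the Multivox synthesizer.

```c
#include <assert.h>
#include <stdio.h>
#include <string.h>

/* Hypothetical annotation for one word; not the synthesizer's real format. */
typedef struct {
    const char *text;   /* the word as written */
    int stressed;       /* nonzero: emit a "[S]" marker before the word */
    int pause_before;   /* nonzero: emit a "[P]" marker before the word */
} MarkedWord;

/* Writes the words, separated by spaces and prefixed with the assumed
   markers, into buf. Returns 0 on success, -1 if buf is too small. */
int render_marked_text(const MarkedWord *words, size_t n,
                       char *buf, size_t size)
{
    size_t used = 0;
    size_t i;

    assert(buf != NULL && size > 0);
    buf[0] = '\0';

    for (i = 0; i < n; i++) {
        int written = snprintf(buf + used, size - used, "%s%s%s%s",
                               i > 0 ? " " : "",
                               words[i].pause_before ? "[P] " : "",
                               words[i].stressed ? "[S]" : "",
                               words[i].text);
        /* snprintf reports the length it wanted; detect truncation. */
        if (written < 0 || (size_t)written >= size - used)
            return -1;
        used += (size_t)written;
    }
    return 0;
}
```

Whether such markers are typed by hand or produced by a program, the synthesizer sees the same annotated text, which is why manual and automatic annotation are interchangeable from its point of view.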

This paper discusses the automatic generation of prosodic information for Hungarian by means of syntactic analysis of the sentences. The process is divided into two separate phases. In the first phase, the syntactic analyzer analyzes the sentence. The output of this module is a representation of the sentence structure: the predicate, object, subject and other parts of the sentence are identified. This information forms the input of the second phase, in which the prosody module performs two tasks: it assigns stress to each word and, using phrase boundary information, places pauses in the sentence.

The operation of the syntactic analyzer is described in detail. Phrase detection is based on phrase pattern matching using unification techniques. Phrase patterns are stored in a separate file, so the rule matching engine itself is language independent. The structure of the sentence is determined using structure-describing rules.
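A minimal sketch of unification-based phrase pattern matching, under stated assumptions: the feature set (category, case, number), the `UNSPEC` wildcard value and all names below are hypothetical and chosen for illustration. They do not reproduce the analyzer's actual data structures or its pattern file format.

```c
#include <assert.h>
#include <stddef.h>

/* Hypothetical feature values; UNSPEC unifies with any value. */
enum { UNSPEC = 0 };
enum { CAT_DET = 1, CAT_ADJ, CAT_NOUN, CAT_VERB };
enum { CASE_NOM = 1, CASE_ACC };
enum { NUM_SG = 1, NUM_PL };

typedef struct {
    int cat;   /* part of speech */
    int kase;  /* grammatical case ("case" is a C keyword) */
    int num;   /* grammatical number */
} Features;

/* Unifies two atomic values. Returns 1 and stores the result on
   success, 0 if the values conflict. */
static int unify_value(int a, int b, int *out)
{
    if (a == UNSPEC) { *out = b; return 1; }
    if (b == UNSPEC || a == b) { *out = a; return 1; }
    return 0;
}

/* Unifies two feature structures field by field.
   On failure the contents of *out are unspecified. */
int unify(const Features *a, const Features *b, Features *out)
{
    assert(a != NULL && b != NULL && out != NULL);
    return unify_value(a->cat, b->cat, &out->cat)
        && unify_value(a->kase, b->kase, &out->kase)
        && unify_value(a->num, b->num, &out->num);
}

/* Tries to match a phrase pattern against the words beginning at
   index start. Returns the number of words covered (the pattern
   length) on success, 0 if the pattern does not match there. */
size_t match_pattern(const Features *pattern, size_t pattern_len,
                     const Features *words, size_t word_count,
                     size_t start)
{
    Features unified;
    size_t i;

    if (pattern_len == 0 || start > word_count
        || pattern_len > word_count - start)
        return 0;

    for (i = 0; i < pattern_len; i++) {
        if (!unify(&pattern[i], &words[start + i], &unified))
            return 0;
    }
    return pattern_len;
}
```

Because the patterns are plain data passed to `match_pattern`, they could be loaded from a file at run time, which is one way an engine of this kind can remain independent of the language being analyzed.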

Stress and pause assignment is performed by the prosody module. The input of this module is the result of the syntactic analysis. Both syntactic and prosodic rules are programmed according to the directions of Ilona Koutny.
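The two tasks of the prosody module can be sketched as a single pass over the analyzed sentence. The three rules used here are placeholders invented for this example (medium stress on phrase-initial words, strong stress on the predicate, a pause at every phrase boundary except the sentence start); they are not the prosodic rules specified by Ilona Koutny, and the `Word`, `Role` and `Stress` types are likewise assumptions.

```c
#include <assert.h>
#include <stddef.h>

/* Hypothetical output of the syntactic analysis for one word. */
typedef enum { ROLE_OTHER, ROLE_SUBJECT, ROLE_OBJECT, ROLE_PREDICATE } Role;
typedef enum { STRESS_NONE, STRESS_MEDIUM, STRESS_STRONG } Stress;

typedef struct {
    const char *text;   /* the word as written */
    Role role;          /* input: syntactic role of the word's phrase */
    int phrase_start;   /* input: nonzero if the word opens a phrase */
    Stress stress;      /* output: assigned stress level */
    int pause_before;   /* output: nonzero if a pause precedes the word */
} Word;

/* Fills in stress and pause_before for every word, using the
   placeholder rules described above (illustrative only). */
void assign_prosody(Word *words, size_t n)
{
    size_t i;

    assert(words != NULL || n == 0);

    for (i = 0; i < n; i++) {
        /* Placeholder rule 1: phrase-initial words get medium stress. */
        words[i].stress = words[i].phrase_start ? STRESS_MEDIUM
                                                : STRESS_NONE;

        /* Placeholder rule 2: the predicate gets strong stress. */
        if (words[i].role == ROLE_PREDICATE)
            words[i].stress = STRESS_STRONG;

        /* Placeholder rule 3: pause at each phrase boundary,
           but not before the first word of the sentence. */
        words[i].pause_before = (i > 0 && words[i].phrase_start) ? 1 : 0;
    }
}
```

The point of the sketch is the data flow: the function reads only fields that the syntactic analysis would supply (role and phrase boundaries) and writes only prosodic fields, so the rules can be changed without touching the analyzer.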

New contributions of this work:

A new representation of the syntactic structure is introduced. This representation can provide information for the prosody generator. The result of the syntactic analysis can also be used by any other natural language processing program.

Syntactic analysis of Hungarian poses special problems. The major difficulties derive from the agglutinative character of the language, which results in complex word morphology and unconstrained word order in the sentence.

We developed a prosody generator that uses the output of the syntactic analyzer. This prosody generator can be used as a development tool for the speech synthesizer.

The prototype of the system was developed for single-clause sentences on a constrained topic. The sentences were taken from weather forecasts. The program is implemented in standard C.
