• Shuffle
    Toggle On
    Toggle Off
  • Alphabetize
    Toggle On
    Toggle Off
  • Front First
    Toggle On
    Toggle Off
  • Both Sides
    Toggle On
    Toggle Off
  • Read
    Toggle On
    Toggle Off
Reading...
Front

Card Range To Study

through

image

Play button

image

Play button

image

Progress

1/10

Click to flip

Use LEFT and RIGHT arrow keys to navigate between flashcards;

Use UP and DOWN arrow keys to flip the card;

H to show hint;

A reads text to speech;

10 Cards in this Set

  • Front
  • Back

What are 3 techniques for speech signal analyse?

1. Spectral analysis
2. Cepstrum
3. Linear prediction

What is the basic principle of Tunable bandpass and Search-tone method methods?

The idea behind Tunable bandpass and Search-tone method methods is to use a filter (band pass, low pass) to get the parts of the spectrum you are currently interested and to suppress the rest of the spectrum.

Explain the flow of Tunable bandpass method.

For spectral analysis you can use a tunable bandpass which lets pass only frequencies of a small range (ΔΩ) close to the center frequency Ωm.
This results in a (complex) vibration with amplitude of the midfrequency Ωm which is proportional to the value of the spectrum.
To separate the amplitude a demodulator is used.

What is different in Search-tone method compare to Tunable bandpass method.

Search-tone method (windowing in the temporal domain)
Instead of changing the frequency of a bandpass filter the search-tone method uses a modulator to shift the input signal to the frequency range ∆Ω of the filter.

What is the main problem of Tunable bandpass and Search-tone method methods and what's the solution?

For calculating the entire spectrum at all frequencies we need to store the signal and calculate the spectrum sequentiall.
Solution:
Use bandpass filterbank with several parallel filters

What is bandpass filter bank method about?

Analysis all of the frequencies at the same time (with parallel filters) with absolute or relative ∆Ω
Needs windowing.

Analysis all of the frequencies at the same time (with parallel filters) with absolute or relative ∆Ω
Needs windowing.

What is the difference between different window functions?

When selecting the shape of the window you need to consider the following criteria:
- a high trap attenuation
- a low bandpass width
The square window has a very small bandpass width but a poor trap attenuation.
The Hamming and Hann windows feature a good trap attenuation but a relatively wide bandpass.
The characteristics of the square window would lead to an imprecise and blurred spectrogram, thus Hann and Hamming windows are preferred.

What Radix-2/Decimation-in-Time algorithm is about?

Reduces the number of FFT needed to be counted

Reduces the number of FFT needed to be counted

What is cepstrum and what is used for?

A cepstrum is the result of taking the Inverse Fourier transform (IFT) of the logarithm of the estimated spectrum of a signal.
Helps to differentiate noise-like and quasi-periodic sounds, helps to separate information about transfer function and f...

A cepstrum is the result of taking the Inverse Fourier transform (IFT) of the logarithm of the estimated spectrum of a signal.
Helps to differentiate noise-like and quasi-periodic sounds, helps to separate information about transfer function and fundamental frequency.
The amplitude close to zero by x-axel (quefrency) relates to transfer function of Vocal tract.
Higher amplitude by quasi-periodic signals. There are also some picks by quasi-periodic sounds which represent fundamental frequency.

What is idea of linear prediction?

Makes inverse filtering
Goal: separate the source signal s(t) from the filter function H(f)
Assumption: the source signal s(t) is an impulse or white noise that gets filtered by a synthesis filter
H(f) and creates the speech signal g(t) --> Source...

Makes inverse filtering
Goal: separate the source signal s(t) from the filter function H(f)
Assumption: the source signal s(t) is an impulse or white noise that gets filtered by a synthesis filter
H(f) and creates the speech signal g(t) --> Source-Filter-Model

Tune the inverse analysis filter 1/H(f) until the difference between s’(t) and s(t) gets close to zero.

Use cases:
- speech compression in mobile phones
- digital coded transmission of voice through narrowband channels
- formant analysis