Use LEFT and RIGHT arrow keys to navigate between flashcards;
Use UP and DOWN arrow keys to flip the card;
H to show hint;
A reads text to speech;
81 Cards in this Set
- Front
- Back
What is the Fourier Transform?
|
when we use the techniques that Fourier developed
|
|
What does a Fourier Transform do?
|
creates a spectrum from a time domain waveform - shows what the individual components are.
Example - prism/light cake/ingredients |
|
Time Domain Waveform?
x and y axis? |
is time domain data, you see it when you have a microphone recording on a screen in front of you.
x axis-time y axis-amplitude |
|
What do you get when you subject this time domain waveform to a Fourier Transform?
|
a Frequency Domain Display - spectrum, slice of time/snapshot
|
|
The Fourier Transform is a way from going to one view of a signal to another.
From the ____ Domain to the _____ Domain |
Time, Frequency
|
|
Time domain data (2)
|
a waveform that represents sound directly
air pressure changes over time |
|
Frequency domain data (spectrum display)
|
line spectrum show the frequency components of a periodic sound
-we get it from the time domain and doing a Fourier transform on it to give us the spectrum display) |
|
Great benefit of Frequency Domain Display
|
show us what individual components are and also the relative proportions of each
|
|
What is a harmonic?
|
the harmonics are exact multiples of that fundamental frequency
-there is no energy between the individual harmonic components that make up the entire spectrum |
|
that the human voice is nearly _____
|
periodic
|
|
It has a fundamental and then harmonics which are multiples of that fundamental. And just as you see the upper harmonic components getting shorter in height as the frequency gets higher, the same is true of the _____ ______. The ____ harmonics get progressively _____as you go up in frequency.
|
Human Voice
upper weaker |
|
PERIODIC SIGNAL SPECTRUM (4)
|
-frequency domain description of the signal
-has harmonics that are multiples of the Fundamental -has nothing between the lines -the lines represent the harmonic frequencies |
|
If you make your voice softer or louder, you’ll see some changes in the relative strength of the ______. If you raise and lower your voice pitch, you’ll see the _____ spreading a part from one another or coming back together as the pitch comes back down again.
|
harmonics, harmonics
|
|
NOISE is when ___ frequencies are present with various ______ phase relationships to each other with approximately equal amplitude
|
all, different
|
|
NOISE (3) on a spectrum
|
equal amplitude
all frequencies random phase |
|
Complex periodic signals would have ______ lines.
|
multiple
|
|
simple sine wave, there would only be ___ _____ in your display
|
one line
|
|
the spectral envelope encloses an area that would have been completely filled in by those vertical lines that represent the _____ sine waves
|
individual
|
|
So the ____ ______ here is a representation of where the tops of all those lines would have been. That shows us the relative strength of the different frequency components in the noise.
|
spectral envelope
|
|
REAL VOICE SIGNALS (5)
|
- the voice source is not truly periodic
- it is ‘nearly periodic’ - spectrum does not have pure lines - spectrum has peaks - some spread of energy around the fundamental and harmonics |
|
by definition if it’s not ____ in the voice, then it’s noise
|
harmonic
|
|
Harmonics to Noise Ratio
|
ratio of the height of these harmonic components to the level of noise between them
|
|
FFT SPECTRUM (4)
|
-clearly shows harmonic energy (range)
- each peak is a harmonic - less clear at showing formants - more revealing of source |
|
LPC SPECTRUM
|
-shows spectral envelope
- good at revealing formants - does not show harmonics - more revealing of filter (what its resonance frequencies are relative to one another) |
|
FFT stands for?
|
Fast Fourier Transform
-more up and downs lines on graph (individual harmonics) |
|
LPC stands for?
|
Linear Predictive Coding
-long horizontal line on graphs (formants) |
|
Formants
|
resonant energy peaks in the vocal tract transfer function
|
|
Formants don't care if you have phonated or not. (t/f)
|
TRUE
This vocal tract transfer function applies whether the sound coming into it is periodic (or almost) and phonated, or whether it’s merely being whispered. The formants you see in the vocal tract will be present because of the way you have shaped the vocal tract. |
|
spectrogram shows us?
|
individual spectral slices that are arranged side by side over time
-how the frequency components of the signal do change over time. |
|
A _______ shows the ingredients of a sound at a single point in time.
|
spectrum
|
|
Spectrogram is hybrid because?
|
hybrid domain display since it shows both time and frequency.
shows us how the strength in each range of frequencies is changing or evolving over time. |
|
SPEECH SPECTROGRAM (3)
|
- x-axis is time
- y-axis is frequency - darkness indicates intensity |
|
SPECTROGRAM–it combines _______ and _____
|
frequency, time
|
|
The sample rate needs to be at _____ the frequency of the highest component frequency in order to accurately record and reproduce it.
|
twice
|
|
Spectrograms are made from digital recordings, and so the frequency display on the Y axis is limited to the highest frequency component you were able to save and reproduce, or the ______ _______.
|
Nyquist frequency
|
|
Examples: If you were interested in fricative energy that might go all the way up to 12-14 kHz, you’d have to make sure that the sample rate of your digital recording was at least double this amount --maybe ____-____ kHz to make sure that you include all of that interesting fricative energy. If you only care about the info from 0-4000 Hz, then you could sample at __-____ Hz. (The ____ ______ would be half your sample rate: 4-5000 Hz.) Then you would be able to display all the information that you needed.
|
25-30, 8-10000, Nyquist frequency
|
|
When you’ve made a high frequency recording, you can always _______ it later.
|
downsample
|
|
SOUND SPECTROGRAMS (3)
|
• display reflects the contribution of
many structures and movements • much detail is present for even simple utterances • need to be selective, specific in interpreting the display |
|
SPECTROGRAM PARAMETERS(4)
|
• y-axis (frequency) limited to Nyquist
–actually, slightly below Nyquist • sample at a high enough rate –what details do you want to see? • display can be adjusted downward • no way to adjust upward beyond Nyquist frequency |
|
A ____ band spectrographic analysis gives you good TIME, or temporal detail. It can show you individual events that are very _____, and resolve those with great _______.
|
wide, brief, accuracy
|
|
A _______ band spectrogram is a little different in that it tends to blur things together over time, but it gives you extremely good FREQUENCY resolution.
|
narrow
|
|
the laws of physics prevent us from having good ____ AND ______ detail on a 3D spectrogram display
|
time, frequency
|
|
ANALYSIS BANDWIDTH
wide vs. narrow? |
ANALYSIS BANDWIDTH
• ‘wide band’ spectrogram gives clear temporal detail –frequency resolution is poor • ‘narrow band’ spectrogram gives clear frequency detail –time resolution is poor |
|
On a wide band spectrogram the vertical striations, or the vertical thin lines, represent each ____ _____ as it occurs in voicing.
|
glottal pulse
|
|
A___ Hz bandwidth is what we refer to as a Wide Band spectrogram.
|
300
|
|
Wide Band Spectrogram :
x and y axis |
X axis going from left to right represents TIME. It is time-aligned with the microphone signal displayed in the waveform above. Up the Y axis is the FREQUENCY.
You can see that there are faint grey lines. Each one of those represents 1 kHz (kilohertz). This one goes up to about 4 kHz on its Y axis. The lightness or darkness of the trace represents the strength of energy, or amplitude. Darker = more energy. |
|
This was made using a ___ Hz bandwidth –it’s a NARROW BAND display
|
45
|
|
Narrow Band Spectrogram characteristics:
|
you can see horizontal, almost parallel, lines. These are the HARMONICS,which the wide-band display could not show because it couldn’t give us good frequency detail. In this narrow band display, we have extremely GOOD FREQUENCY DETAIL.We can tell the harmonics apart from one another.
|
|
Main Idea of Wide Band Spectrogram:
|
WIDE BAND: shows individual vertical glottal pulses (vertical striations) good TIME resolution
|
|
Main Idea of Narrow Band Spectrogram:
|
NARROW BAND: shows horizontal lines that represents HARMONICS of the voice, separates them out. –good FREQUENCY resolution
|
|
A wide and narrow band spectrogram display of a normal and hoarse person. You can see by comparison that the normal voice has harmonics that go _____ up the screen. The hoarse voice’s harmonics ___ out sooner and are replace by faint noise higher up.
|
higher, drop
|
|
Function, or the way we use our ______, etc., can also contribute to our dialect or other features of our voice which are not anatomically based.
|
articulators
|
|
The larynx and vocal tract differ ________ from one person to the next. This of course influences the acoustics of the sounds the person will produce. There are structural differences that cause the _______of your own voice.
|
anatomically, uniqueness
|
|
SLPs work on altering the way the client uses his ______to improve sound output.
|
structures
|
|
_____ & ______ tend to carry the greatest amount of energy in speech
|
Vowels & diphthongs
|
|
why won’t a man’s high pitched voice
sound like a woman’s? |
–vocal fold closure patterns
–harmonic spectral slope –noise between formants |
|
Women are about an octave higher pitch than men because
|
–men’s vocal cords are larger
|
|
When you produce a vowel?
On the other hand, consonants are usually produced very _____. |
The vocal tract shape can be held constant, and a vowel can be prolonged as long as you have breath –the vowel is TIME INVARIANT
briefly |
|
It’s the _____ __ ___ _____ _____ or the configuration of the _______that determines the particular vowel that we will be producing at that moment.
|
shape of the vocal tract, articulators
|
|
What makes one vowel ______ from another is that the frequencies of the formants are different for each one.
|
different
|
|
The output from the larynx is basically the ______ for every sound we produce.
|
same
|
|
On the ____ (what kind of display) exact locations of these formants allow us to distinguish between the different _____ that are being produced.
|
LPC, vowels
|
|
Usually you can identify the vowel from the first __ formants, even with synthetic speech.
|
2
|
|
The higher formants –F3, F4, F5 and so on –tend to make the vowels sound more ____.
|
natural
|
|
______ are peaks in the vocal tract transfer function.
|
FORMANTS
In other words, as energy passes through the vocal tract, there are certain frequencies that are favored, and other frequencies that are weakened or attenuated |
|
The ____of a resonator will dictate what frequencies are favored (or echoed or resonated) and which are not.
|
size
Ex pipes on an organ |
|
_____ transfer is more efficient at the formants.
|
Energy
|
|
Children have _____formant frequencies than men or women, because they have _____ vocal tract structures, including _____ resonating cavities.
|
higher, smaller, smaller
|
|
F1 and F2 plots on a graph:
the vowel quadrilateral for children is the _____ and it spreads the _____ up and to the right. This is because their formant frequencies are higher. The women’s quadrilateral is in the middle, and the men’s is the furthest down and to the left, because their formant frequencies are lower. (The men have larger resonating cavities in their vocal tracts.) |
largest, furtherst
|
|
/i/ is a ___ ____ vowel; its first formant is rather ___.
|
high front, low
because the constriction formed by the tongue in the vocal tract is forward in the mouth and high. The constriction is relatively small because the tongue nearly touches the palate. |
|
/ae/ (which is a ___ _____ vowel,
because the tongue is still far forward in the mouth but the constriction is quite wide) the first formant _____in frequency. |
low front, increases
|
|
/u/ is a ___ ____vowel –so its first formant is rather ___.
|
high back, low
|
|
________THE JAW AND TONGUE INCREASES THE FREQUENCY OF THE FIRST FORMANT (F1).
|
LOWERING
|
|
AS YOU MOVE THE TONGUE _______, THE SECOND FORMANT’S (F2) FREQUENCY INCREASES.
|
FORWARD
|
|
HIGHER VOWELS = _____ F1
|
LOWER
|
|
LIP ROUNDING _______ ALL FORMANTS.
|
LOWERS
lengthening the vocal tract |
|
LOWER VOWELS = _______ F1
|
HIGHER
|
|
FRONT VOWELS = _______ F2
|
HIGHER
|
|
BACK VOWELS = ________ F2
|
LOWER
|
|
______________ interference, where the waves will add up and give you more strength.
|
Constructive
|
|
___________ interference, where they may cancel one another out.
|
Destructive
|