Use LEFT and RIGHT arrow keys to navigate between flashcards;
Use UP and DOWN arrow keys to flip the card;
H to show hint;
A reads text to speech;
827 Cards in this Set
- Front
- Back
What are the 4 goals for this semester?
|
1. Understand speech sound production 2. Learn about speech measurement 3. See its clinical application 4. Build confidence in using technology
|
|
What is the ultimate test of speech measurement?
|
Does it sound right?
|
|
No instrument can replace the:
|
experienced SLP
|
|
Technology merely has tools to:
|
help us
|
|
Technology complements:
|
our own experience and skills
|
|
What are the 4 characteristics of the scientific method?
|
Empirical, Deterministic, Predictive, and Parsimonious
|
|
What does empirical mean?
|
based on data and direct experience
|
|
What does deterministic mean?
|
It obeys physical laws of cause and effect -- it's not random
|
|
What does predictive mean?
|
If you do this… than that will happen.
|
|
What does parsimonious mean?
|
Use the simplest explanation possible -- boil it down to its essence.
|
|
Why do we collect data and interpret it?
|
to make sense of the behavior we are trying to understand
|
|
We make decisions based on:
|
our own direct experience
|
|
The empirical approach is opposed to the ___________ approach.
|
rational
|
|
What is the approach used in philosophy where you make decisions based on reasoning and careful thought?
|
rational approach
|
|
How does the predictive nature of science help us?
|
we can extrapolate that information to apply to other situations that we may not have experienced yet.
|
|
What are 5 reasons why we use technology to analyze speech?
|
1. Overcome listener bias 2. Describe the severity of a disorder objectively 3. track progress over teime 4. ASHA has focus on EBP (Evidence Based Practice) 5. Provide biofeedback to the client
|
|
Analysis with technology provides measurement that is _____________ and ____________.
|
consistent and reliable
|
|
Technology provides objective criteria to measure our client against:
|
standardized measures
|
|
By tracking our progress over time, we can demonstrate:
|
treatment efficacy (or not)
|
|
What does EBP stand for?
|
Evidence Based Practice
|
|
In order to objectively prove that the treatment was effective, we must measure our client's performance when?
|
before and after treatment
|
|
Using hard data, we don't have to rely on :
|
our opinions
|
|
One limitation that occurs with non-technical observation over time is:
|
becoming habituated to the client's speech
|
|
What does biofeedback do?
|
Shows the person what they are doing on a computer display in real time, and can help them change behavior.
|
|
_________________ can show voice patterns in real time.
|
Computer software displays such as Busy Pitch
|
|
Name an example of something the client could see about his voice on the computer screen.
|
Patterns in prosody, etc.
|
|
The SLP needs to know normal physiology in order to :
|
understand & recognize disordered physiology.
|
|
Why does the computer software require you as an educated SLP?
|
You must understand what the numbers from the instrument represent. You provide the expert interpretation.
|
|
What are the 3 Key Arenas of speech analysis?
|
1. Acoustic phonetics 2. Physiologic phonetics 3. speech perception
|
|
The measuring of speech production through microphone recordings and studying what it can tell us about the way the larynx is operating or the articulators are moving is called:
|
Acoustic Phonetics
|
|
What are the 3 subcategories of Physiologic Phonetics?
|
1. Kinematics 2. aerodymanics 3. electromyography
|
|
What do you call the study of speech movements?
|
kinematics
|
|
What do you call the study of the pressure, flow & resistance we use to regulate our breathstream?
|
aerodynamics
|
|
What do you call the study of the electrical activity of muscles as they are activated?
|
Electromyography
|
|
What do you call the aspects of signal processing of the brain?
|
Speech perception
|
|
We can draw parallels between the ______________ aspects of speech understanding and the ____________ techniques that we use to evaluate a speech signal in a quantitative way.
|
perceptual, acoustic
|
|
What serves as the sound source of the voice?
|
the larynx (with air pressure)
|
|
After the sound leaves the larynx, it goes into the:
|
vocal tract
|
|
What does the vocal tract do to the sound?
|
it filters it
|
|
The larynx needs the _____________ to provide the air pressure it needs to function.
|
lungs
|
|
The vocal tract doesn’t just remove things, it actually _____________________, ___________________ & _____________ some of the frequency components.
|
resonates, enhances & strengthens
|
|
In theory these 2 systems (________________ & _________________) operate independently of each other, but in reality they sometimes interact under certain circumstances.
|
larynx & vocal tract
|
|
What are 4 ways the sound source can vary?
|
loudness, pitch, voice quality, phonation vs. whispering
|
|
Low air pressure equals ____________ sounds, high air pressure equals ______________ sounds.
|
softer, louder
|
|
With ____________ air pressure, the larynx doesn’t have to adduct very tightly.
|
low
|
|
With _______________ air pressure, the vocal folds will have bigger movements -- muscle movement increases.
|
high
|
|
Stretching the vocal folds increases their _____________, which increases their ____________ frequency.
|
tension, vibration
|
|
To stretch the vocal folds, first you contract the __________ muscle, which then rocks the ______________ ____________ forward and stretches the vocal folds.
|
cricothyroid, thyroid cartilage
|
|
If you subtly adjust the space between the vocal folds, between the arytenoids, you can change your:
|
voice quality
|
|
Increasing the space between the vocal folds makes the voice more ____________.
|
breathy
|
|
If you press the vocal folds together more firmly you will get a __________ sounding voice.
|
tighter
|
|
What muscles contract in order to press the vocal folds together?
|
the muscles of adduction (primarily the lateral cricoarytenoid and the thyroarytenoid).
|
|
How do you create a whisper?
|
By pushing air between the vocal cords to create turbulence without causing them to vibrate.
|
|
A whisper requires the__________ to provide the source of air pressure and the __________ to provide the constriction in order to make the turbulence happen.
|
lungs, vocal cords
|
|
How do phonations differ from whispering?
|
Phonation requires the vocal cords to vibrate during this process.
|
|
What is the fundamental frequency at which the vocal cords vibrate?
|
For men, it is about 100-120 times per second, for women it is double that.
|
|
We perceive the fundamental frequency as the :
|
pitch
|
|
The voice is composed of a fundamental frequency or sine wave, and also a:
|
series of harmonics
|
|
What do you call integer multiples of the fundamental frequency?
|
harmonics
|
|
If the fundamental frequency is 100 Hz, the harmonics will be at:
|
100, 200, 300, 400, 500, etc.
|
|
If the fundamental is 200 Hz, the harmonics will be at:
|
400, 600, 800, 1000, 1200, etc.
|
|
The higher the voice, the greater the _________________ between the harmonics.
|
spacing
|
|
The larynx emits a whole:
|
spectrum of sound
|
|
As the fundamental frequency rises and falls, the spread of the harmonics will:
|
increase and decrease.
|
|
The higher harmonics tend to be _____________ in amplitude than the fundamental itself.
|
weaker
|
|
The gradual weakening of harmonics as they get higher results in a:
|
harmonic spectral slope
|
|
What is the strongest element in a spectral slope?
|
The fundamental frequency
|
|
Harmonics get weaker as they get:
|
higher
|
|
On a LARYNGEAL SOURCE SPECTRUM GRAPH, what is represented on the X axis?
|
X = frequency in kHz
|
|
On a LARYNGEAL SOURCE SPECTRUM GRAPH, What is represented on the Y axis?
|
Y = amplitude
|
|
The strength or energy of the signal that increases as you get higher up is called the:
|
amplitude
|
|
In a laryngeal source spectrum graph, each vertical line represents:
|
one harmonic component.
|
|
In a laryngeal source spectrum graph, the tallest line on the far left represents the _____________ ___________. The _______________ are equally spaced to the right.
|
fundamental frequency, harmonics
|
|
The _______________ of the harmonics decreases as you get into the higher frequencies.
|
amplitude
|
|
In a laryngeal source spectrum graph, the slope value represents:
|
how sharply the harmonics drop off.
|
|
If your graph shows 3 spectral slopes of 6 db, 12 db, and 18 db, which is closest to a normal voice?
|
12 db
|
|
If your graph shows 3 spectral slopes of 6 db, 12 db, and 18 db, which would represent a bright, brassy and resonant voice?
|
6 db
|
|
If your graph shows 3 spectral slopes of 6 db, 12 db, and 18 db, which would represent a weak and thin voice?
|
18 db
|
|
At the 6 db level of our graph, the harmonics are:
|
relatively strong compared to the fundamental, almost as high in amplitude
|
|
At the 18 db level of our graph, the harmonics do what?
|
drop off sharply in the upper frequency range
|
|
What does attenuation or anti-resonance of the vocal tract filter mean?
|
That it can weaken or remove what comes out of the larynx.
|
|
The vocal tract filter doesn't just remove sounds, it can also:
|
resonate or echo sound.
|
|
The __________ _____________ ____________ can increase the amplitude of some frequencies even as it weakens the amplitude of others.
|
vocal tract filter
|
|
The sound that leaves the ____________ ________________ has been substantially changed from what it was when it entered.
|
vocal tract
|
|
Tubes do not generate sounds, but they can __________ & ___________ sounds that are put into them.
|
shape and alter
|
|
What features of the tube allow resonance?
|
its dimensions
|
|
What type of tubes are the simplest to model?
|
straight, uniform tubes
|
|
The best resonance frequency of the tube (the one that causes it to resonate or echo the best) depends on its:
|
length
|
|
The best resonance you will get will be for sounds that have a wavelength that is:
|
4 times that of the tube.
|
|
A tube that is small will resonate _______ frequencies because they have a shorter wavelength, and a tube that is long will resonate ___________ frequencies because they have a longer wavelength.
|
higher, lower
|
|
The vocal tract is not a uniform tube; what are 2 things that vary?
|
the diameter will vary along its length, and thus the cross-sectional area varies. And the length?
|
|
Resonance frequencies in the vocal tract differ because:
|
the vocal tract is not a uniform tube
|
|
In the vocal tract, the specific sound features that change depend upon:
|
where the constrictions are, and also whether they are relatively mild or severe constrictions.
|
|
When the diameter of the tube suddenly _____________, some of the sound will actually be bounced back to where it came from, and another part will go on through.
|
increases (because the nature of the resonance characteristics has changed)
|
|
When the shape and diameter of the vocal tract changes, it changes the blend of ____________ and the strength of the different ______________ as they make their way through the tube.
|
frequencies, components
|
|
You change the size and shape of the vocal tract at various points along its length by moving your:
|
articulators.
|
|
When you move the tongue upward or downward or forward or backward, this changes the size of the _______________ in particular places.
|
pharynx or oral cavity
|
|
You can use the tongue to form constrictions, by:
|
moving it towards the palate or the post pharyngeal wall.
|
|
Why does moving your tongue cause the vocal tract's resonance to change?
|
Because it alters the cross-sectional area.
|
|
This is how different sounds are created, because their ______________ _______________ differ.
|
acoustic features
|
|
When you change the shape of your vocal tract, some frequencies are ______________ and others are ___________.
|
boosted, attenuated (weakened)
|
|
The way the sound has changed from larynx to lips is called the:
|
Vocal Tract Transfer Function
|
|
The constrictions of the ____________ ___________ will change according to the various sounds you are articulating.
|
vocal tract
|
|
When you reshape the vocal tract filter, you cause sounds to be ______________.
|
altered
|
|
Sounds are altered according to the size and shape of the _________________ that resonate the sounds that pass through the vocal tract.
|
cavities
|
|
The Vocal Tract Transfer Function refers to the change in sound between:
|
entering the vocal tract and leaving the vocal tract, or the output minus the input.
|
|
The sound that leaves the larynx is basically a _______________ wave form.
|
sawtooth
|
|
The resonating cavities between the ___________ and the ____________ turn the sawtooth waveform into different sounds, vowels, etc.
|
larynx, lips
|
|
The Vocal tract transfer function is a ___________ filter.
|
variable
|
|
The vocal tract filter is changing all the time as we move our:
|
articulators.
|
|
The _________________ _________________ determine the transfer function.
|
resonating cavities
|
|
A resonant peak in the vocal tract transfer function is called a:
|
formant
|
|
A formant occurs when some frequencies in a given range are:
|
amplified to boost their prominence in the spectrum.
|
|
For a given vowel, there can be _________________ formants.
|
many
|
|
Which formants are the most important for determining which vowel you are listening to?
|
the first 2 or 3
|
|
The formant is NOT a sound _____________.
|
source
|
|
A formant reflects the way the vocal tract is:
|
shaping the sound coming from the larynx.
|
|
When you look at the spectrum of the sound that leaves the person’s mouth, you can identify formants as:
|
peaks in the spectrum.
|
|
Formants result from the _____________ of the vocal tract filter.
|
activity
|
|
Formants are NOT a feature of the ____________ _____________of the larynx.
|
sound source
|
|
If you whisper, will there be formants?
|
Yes
|
|
When you whisper, you will have _____________ created due to the shaping of the cavities by moving the articulators.
|
formants
|
|
In THEORY, the glottal source and the vocal tract filter are ___________________ from each other.
|
independent
|
|
Without moving your articulators, you can change the fundamental frequency by:
|
raising or lowering your pitch, causing the harmonics to change.
|
|
You can pronounce a given vowel at high or low pitch, or adjust loudness, all while the _____________ ____________ configuration remains constant.
|
vocal tract
|
|
Even if you hold the same pitch, you can create different sounds by:
|
moving your tongue or other articulators
|
|
If you change the filter characteristics (moving your tongue, lips or jaw) while not changing what your larynx is doing – the fundamental frequency:
|
remains the same.
|
|
Can the source and vocal tract act independently?
|
yes
|
|
Can the source and vocal tract interact?
|
yes
|
|
There are some links between the source and vocal tract that cause them to behave:
|
in a very coordinated way.
|
|
If you target your treatment toward one part of the speech mechanism, will you see effects in other parts of the system that you did not directly treat?
|
Yes, there are spillover effects.
|
|
In Parkinson’s Disease, there is a condition called hypokinetic dysarthria, which is characterized by:
|
a weak, monotone voice and imprecise consonant articulation.
|
|
The Lee Silverman Voice Treatment focuses intensively on the larynx to improve:
|
vocal loudness.
|
|
If you increase the loudness of your voice, there is also an effect on the _____________ system and the ________________.
|
respiratory, articulators
|
|
Treating one aspect of speech, such as loudness, in order to improve other features as well, is called:
|
1 stop shopping
|
|
By treating the loudness of the speech only, you can avoid working with:
|
the other disordered aspects
|
|
____________________ can be considered the “global” treatment variable. Why?
|
Loudness because it affects the whole treatment production mechanism even though the treatment focuses only on the loudness of the voice.
|
|
What are some reasons that loudness can improve other features of speech?
|
There are natural changes that accompany increasing the loudness of the voice, and most people have a lifetime of practice in changing the loudness of their voice.
|
|
STI stands for:
|
Spatial Temporal Index.
|
|
STI can represent:
|
variability in the speech samples.
|
|
Hyperarticulate means to:
|
exaggerate your speech movements
|
|
In the Parkinson's Patients study, which patients showed improved speech: hyperarticulaters or loud speakers?
|
loud speakers
|
|
We can conclude that speech production is more natural, consistent, and predictable when a person is ____________ ______________ rather than if they are deliberately concentrating on ____________ __________ _____________.
|
speaking loudly, moving their articulators.
|
|
What are some other drawbacks to having patients concentrate on moving their articulators deliberately?
|
requires more cognitive attention and is more difficult for elderly patients.
|
|
In the Parkinson's study, the loud samples have less __________________ and the speech is more ________________ & ______________.
|
variability, natural and straightforward.
|
|
In hypokinetic dysarthria, the movements tend to be rather shrunken or small, and so increasing their amplitude can:
|
make the patient's intelligibility much better.
|
|
In the Parkinson's study, were the movements in the loud speech smaller than those in the hyperarticulate sample?
|
no, they were just as large.
|
|
With loud speech, you not only get a louder stronger voice, you also get:
|
more precise articulation.
|
|
Hyperarticulation requires a lot of ___________ and you don’t get any particular ____________.
|
effort, benefit
|
|
The Parkinson's study shows us that by changing vocal function (increasing loudness) you can reap benefits in _____________ _______________, without having to directly treat _____________.
|
oral articulation, articulation
|
|
By treating loudness, articulation improvement is a positive ____________ _____________. The ________ has influenced the filter in this case.
|
side effect. Source
|
|
Spasmodic dysphonia is a neurological voice disorder (neurolaryngological disorder) where the brain sends signals through the laryngeal nerve to the larynx and these signals lead to ___________ during voice production.
|
spasms
|
|
In spasmodic dysphonia, the voice has a very _____________ & ___________ sound intermittently during speech.
|
strained and strangled
|
|
The current treatment for spasmodic dysphonia is for a physician to inject Botox into the :
|
thyroarytenoid muscle.
|
|
Botox treatment for spasmodic dysphonia partially paralyzes the muscle, so that it cannot respond to the _________ ____________ that travel down the nerve to that muscle.
|
incorrect signals
|
|
After _______________ treatment for spasmodic dysphonia, the muscle can no longer tighten up and crampen and strangulate the speech.
|
botox
|
|
After botox treatment for spasmodic dysphonia, the correlation of ______________ ___________ improved quite a bit.
|
lip movement
|
|
In the Botox/spasmodic dysphonia experiment, they treated the voice by injection to the larynx and got improved outcomes in :
|
articulatory behavior.
|
|
The botox/spasmodic dysphonia experiment results are an example of the ____________ influencing the ___________.
|
source, filter
|
|
In Muscle Tension Dysphonia, patients also have a ____________ ______________ sounding voice.
|
strained strangled
|
|
Muscle Tension Dysphonia is caused by:
|
muscle tension in the neck, including the muscles of the larynx.
|
|
In this disorder, there is excessive neck muscle activity:
|
Muscle Tension Dysphonia
|
|
Muscle Tension Dysphonia can be treated by:
|
circumlaryngeal massage therapy
|
|
This treatment involves the clinician actually moving the larynx further down in the neck as the patient phonates.
|
circumlaryngeal massage therapy
|
|
After circumlaryngeal massage therapy, a post treatment spectrogram shows greater flexibility in _____________ ______________ during speech production.
|
lingual movement
|
|
The Muscle Tension Dysphonia experiment results are an example of how treatment of the ____________ influences the movement of the ____________.
|
larynx, tongue
|
|
Why does working on the sound source yield benefits in the vocal tract filter behavior?
|
Because there can be neural and biomechanical linkage between them. They’re controlled by the brain in such a way that the activity of one is closely coordinated with the activity of the other.
|
|
The sound source and the vocal tract filter typically behave as a:
|
unit
|
|
Sound itself is not digital; it's the way we __________ the sound.
|
store
|
|
_____________ ______________ tends to make a clearer, higher quality copy of the sound.
|
Digital storage
|
|
Can your ears hear digits?
|
NO
|
|
Music is stored in __________ form but it is played back in ________ form.
|
numeric, analog
|
|
‘Analog’ means ‘________________’ – one phenomenon is represented by another.
|
analogy
|
|
The height of the mercury in a thermometer represents, or is _____________ to, the temperature .
|
analogous
|
|
The microphone’s sound signal that travels through a wire is analogous to, or represents, the :
|
sound pressure coming through the air that reaches the microphone.
|
|
The microphone ___________ represents sound.
|
voltage
|
|
large positive voltage = air _____________
|
compression
|
|
large negative voltage = air ______________
|
rarefaction
|
|
A microphone is a ________________ – it is converting _____________ from one form to another.
|
transducer, energy
|
|
The sound pressure vibrations that reach the microphone are converted by the microphone into a rapidly changing ___________ _______________
|
electric signal.
|
|
Any change in __________ _____________ that causes the internal mechanism of the microphone to move will result in a commensurate change in ____________ .
|
air pressure, voltage
|
|
The microphone's voltage will change at the same_____________ and in an ____________ proportional to the size of the sound that came in.
|
frequency, amplitude
|
|
The __________ _____________ going through the microphone’s wire is an analog representation of the sound pressure that the microphone picked up.
|
electrical signal
|
|
The microphone's electrical signal is a precise representation of the:
|
signal coming in
|
|
If you recorded a sine wave, you would have an electrical sine wave in the wire.
|
sine wave
|
|
If you were recording phonation, you would have a ____________ ______ of electrical signal in the wire.
|
periodic type
|
|
If you were recording noise, you would have a ______________ signal in the wire.
|
“noisy”
|
|
A signal that is rapidly & randomly changing would be called:
|
noise
|
|
What are the 2 most important characteristics of analog signals?
|
continuous in time and continuous in amplitude
|
|
If you were to examine or zoom in on continuous analog signals at any level of detail, you would not find any ___________ in the recording.
|
gaps
|
|
a line drawn on a page or a microphone signal in a wire are examples of what type of signal?
|
analog
|
|
An analog recording can represent what 2 things?
|
an infinite number of points in time and an infinite number of amplitude values between the minimum and the maximum
|
|
An ____________ signal CANNOT be fully represented by a table of numbers.
|
analog
|
|
To represent a signal, each row in a table of numbers would represent the amplitude at a :
|
given point time
|
|
You cannot have an infinite number of rows to reflect the infinite number of time intervals that there are in a truly ______________ signal.
|
continuous (analog)
|
|
What could represent an analog signal "truly"?
|
a graph
|
|
What are some COMMON ANALOG DEVICES ?
|
Clocks with hands, watches with hands, mercury thermometer, tape measures (markings only limited by your eyesight), cassette recorder (magnetic impressions on the tape), dimmer switch on a light (infinitely variable, continuous)
|
|
In what 2 areas are digital recordings discrete?
|
in time and in amplitude
|
|
Digital signals are made up of a series of _____________ _____________ recorded in one long string.
|
separate numbers
|
|
The numbers of a digital recording represent ______________ over time.
|
snapshots
|
|
in a digital signal, the ____________ themselves have finite precision – there are a limited number of ___________ __________, which could be many or few.
|
numbers, decimal places
|
|
If you were to write down all the numbers of a digital recording, you could fully represent it by a:
|
table of numbers.
|
|
The limitations of digital recordings are that there are _____________ between the points.
|
‘unknowns’
|
|
But actually, if the individual sample points are close enough together in time, it is almost as if the sample were:
|
continuous
|
|
Name 3 common digital devices:
|
digital clock, computer, compact disc
|
|
Computers and compact discs store data as:
|
ones and zeros
|
|
The analog signal from the wire has to be converted to ____________ in order to be used by a computer, etc. , and vice versa.
|
numbers
|
|
Going from a microphone to a computer is an example of ____________________ conversion.
|
analog to digital (ADC)
|
|
Going from a CD to the speakers is an example of ______________ conversion.
|
digital to analog (DAC)
|
|
PC sound cards belong to which type of conversion ?
|
both ADC and DAC
|
|
Computers only deal with __________ data.
|
binary
|
|
Binary data contains only:
|
ones and zeros
|
|
Microphones and speakers are ____________ devices.
|
analog
|
|
Sound is an _______________ function.
|
analog
|
|
The frequency with which the numbers are recorded to represent amplitude values of the sound you are recording is called:
|
the sampling rate
|
|
To better represent the original signal, you should take _____________ samples.
|
more
|
|
The frequency of snapshots in a digital recording is called the:
|
Sampling Rate.
|
|
The sample rate is specified in:
|
Hz
|
|
If you take 10 samples/snapshots in a second, that would be a ________ recording.
|
10 Hz
|
|
A very low sample rate produces a ____________ recording.
|
poor
|
|
If the original sample contains many rapid changes, your sample rate must be ________ in order to capture those changes.
|
higher
|
|
A higher sampling rate gives better:
|
fidelity
|
|
A higher sampling rate requires ___________ files.
|
bigger
|
|
What are 3 disadvantages of a higher sample rate requiring bigger files?
|
1. more memory usage 2. more disk space for storage 3. more processing time for computation
|
|
The ‘Nyquist’ frequency is :
|
half the sample rate.
|
|
The Nyquist frequency represents the highest frequency you can :
|
record and then accurately play back.
|
|
You will sample at twice the rate of the ____________ _______________ in the signal.
|
highest frequency
|
|
For example, if you want to record a signal accurately up to 100 hz, you must sample at :
|
200 Hz.
|
|
For example, if you want to record a signal accurately up to 5 hz, you must sample at :
|
10 khz
|
|
The sample rate for compact discs is ____________ samples per second, or __________ kHz.
|
44,100, 44.1
|
|
Since the sample rate for a compact disc is 44,100 samples per second, it can store signals up to ______________Hz, and play back sounds up to _________________ Hz.
|
22,050 , 22,050
|
|
The upper limits of most people’s hearing capacity is about:
|
22,050 Hz
|
|
On a digital recording device, the highest quality setting gives you the ____________ amount of time to record.
|
shortest
|
|
Why does a high quality setting give you the shortest amount of time to record?
|
Because it uses more data and takes up more space on the device.
|
|
A higher sampling rate = higher _____________.
|
quality
|
|
Before we can do decent ___________, we must make sure the recordings themselves are good.
|
analysis
|
|
What are 4 things you must have to obtain a good recording:
|
1. a quiet recording environmnet 2. use a quality microphone 3. good signal strength 4. proper sample rate
|
|
How can make your recording environment more quiet?
|
Reduce reverberation & echoing from hard surfaces in the room.
|
|
To have good signal strength, you must make sure that the info you want is not :
|
lost in noise
|
|
Saturation = the signal as it:
|
comes into your recorder
|
|
If the signal is too tiny, it won't be audible over:
|
background noise
|
|
To make sure you signal is a good strength, you can do what 2 things:
|
1. adjust the recording input level 2. adjust the distance between the microphone and the subject
|
|
Be sure not to clip the signal by:
|
bringing it in at too high a level
|
|
If the strength of the signal coming in is bigger than your recording device can accommodate, the signal will be:
|
badly distorted
|
|
If the signal is ____________ _____________, you won’t be able to do any decent acoustic analysis .
|
badly distorted
|
|
Driving the signal too hard can cause ______________ which causes _____________.
|
clipping, noise
|
|
If you have enough memory space, it’s better to use a ___________ sample rate .
|
higher
|
|
You can always downsample later, but you can never :
|
upsample and replace info that wasn’t in your recording.
|
|
All ____________ has to do with the vibration of air molecules around you.
|
sound
|
|
Give 3 examples of sound originating with movement:
|
1. vibrating string of an instrument 2. oscillation of the vocal folds 3. turbulance of air molecules leaving a tire
|
|
Why does a vibrating string create sound?
|
it causes the air molcules around it to oscillate.
|
|
Why does the oscillation of the vocal folds create sound?
|
it chops the air into small pulses; they oscillate the air in the vocal tract as well as the air between the speaker and listener
|
|
Why does air leaving a tire create sound?
|
the air molecules are trying to leave through a very small hole and bounce against each other, creating turbulence
|
|
In speech production, there are many structures that contribute to the way the sound is generated. We talk about this as having:
|
Many degrees of freedom
|
|
What are three principles of sound:
|
1. all sounds originate with movement 2. the movement characteristics determine the nature of the sound 3. speech production has many degrees of freedom
|
|
In speech production, there are many different elements that can be individually adjusted to create the exact sound that you hear. Name two:
|
voice/larynx, articulators
|
|
The speech waveform is complex because:
|
there are so many structures that can contribute to the way it’s generated.
|
|
Name 2 benfits of acoustic analysis:
|
1. it's noninvasive 2. from the changes in the signal we can draw inferences about the movements that generated the sound
|
|
One of the greates benefits of acoustic analysis is:
|
that it's noninvasive; we don't have to attach any transducers to the person's vocal tracts, etc.
|
|
A disordered voice indicates:
|
disordered vocal fold movement
|
|
changes in vocal fold movement causes a change in the sound's:
|
source
|
|
Distorted or imprecise articulation indicates:
|
abnormal movement of the articulators
|
|
A change in movement of the articulators causes a change in the sound's:
|
filter
|
|
We can get a very rich level of detail from something as simple as a microphone recording, if we apply the appropriate:
|
analysis techniques.
|
|
The acoustic patterns reflect the vocal tract movements with some ambiguity, because of :
|
Motor Equivalence.
|
|
Motor Equivalence means that:
|
the same sound can be produced several physical ways.
|
|
You could move the articulators in slightly different ways and still have:
|
the same end result.
|
|
Acoustic analysis cannot always distinguish with any level of precision between:
|
different types of articulatory movements
|
|
Acoustics cannot reveal all details of:
|
Movement
|
|
You can’t tell how many mm a tongue moved upward or forward when a person:
|
articulated a given sound.
|
|
Acoustic analysis doesn’t reflect the movements that caused the sound in a ____________________ way.
|
perfectly linear, one to one
|
|
There are some limitations in getting calibrated, physical measures of articulatory behavior from a microphone signal because there are:
|
several different structures working at once to contribute to the way that sound was made.
|
|
What are the 2 main limitations of acoustic analysis?
|
1. It reflects the vocal tract movments with some ambiguity because of Motor Equivalence. 2. It cannot reveal all the details of movement.
|
|
Speech Sounds Contain Many :
|
Frequencies
|
|
A _________________ _______________ allows us to be selective in which frequencies we keep for analysis.
|
speech filter
|
|
If you want to understand the impact of having hearing loss in a certain range, you could filter out _______________ _______________ in a speech sample of normal speech.
|
certain frequencies
|
|
What are 4 filter types?
|
High pass, low pass, band pass, band reject
|
|
Which filter allows high frequencies through and holds back lower frequencies?
|
high pass filter
|
|
Which filter allows low frequencies through and holds back higher frequencies?
|
low pass
|
|
Which filter allows a middle band of frequencies through and holds back both higher and lower frequencies?
|
band pass
|
|
Which filter holds back a middle band of frequencies and allows both higher and lower frequencies through?
|
band reject
|
|
What is a main reason to use a band reject filter?
|
to remove unwanted noise
|
|
If you contaminated an original signal with a 300 hz tone, what type of filter would you use to remove the unwanted noise?
|
a band reject filter to filter out the 300 hz tone
|
|
Which type of band reject filter, wide or narrow, would least impact the quality of the speech sample?
|
narrow band
|
|
In a ____________________ filter , you would be deleting a number of frequencies that would be important for speech perception.
|
wide band
|
|
What is a main reason to use a band pass filter?
|
To show each frequency and it's energy.
|
|
This type of filter was used in the early days of speech analysis because they helped us understand which frequencies were present in the speech sample:
|
band pass filter
|
|
A band pass filter yields a crude idea of the ____________________ of a certain sound.
|
ingredients list / frequencies
|
|
Remember that it’s difficult to tell where the regions of high or low energy lie, just by:
|
listening to the sound
|
|
Some of the early pioneers in acoustic analysis employed techniques like these different _____________________ to understand whether the energy was in the lower part of the frequency range, the middle range, or the higher range for a certain sound.
|
band-pass filters
|
|
A _______________ filter passes a signal through sets of filters like these: 0 - 300 Hz, 300 - 600 Hz, 600 - 900 Hz, etc…
|
band pass
|
|
This type of filter allows you to move from low to high frequencies as the frequency is being analyzed:
|
variable band pass filter
|
|
The meter at the right hand side of a variable band pass filter represents :
|
how strong the energy is at any given frequency level that you analyze
|
|
Band Pass Filters can be these 2 types:
|
Fixed and Variable
|
|
A Fixed Band Pass filter looks at:
|
a series of band pass filters, one at a time.
|
|
A variable band pass filter can:
|
move from low to high frequencies as the frequency is being analyzed.
|
|
In order to do either type of band pass filter analysis, the researcher would need to play the sound again and again, looking at :
|
a different frequency each time in order to compare the energy for each.
|
|
A band pass filter can give you information about the __________ of the different frequencies in a given sound.
|
strength
|
|
You could say that _____________ ______________ is a little like looking for the ingredients in a complex sound.
|
acoustic analysis
|
|
What does acoustic analysis do?
|
splits up a complex sound into its component sounds
|
|
A complex sound has ______________ _____________ – we don’t speak in sine waves!
|
many ingredients
|
|
The _____________________ _________________ of the components make one sound different from another – they alter its quality perceptually.
|
relative proportions
|
|
Altered proportions = altered ___________.
|
quality.
|
|
You could have the same frequency ingredients – some higher and some lower in amplitude – for one type of sound, and if you adjusted the _________________ of those frequencies you would get a different type of sound.
|
amplitude or proportion
|
|
Joseph Fourier was a ____________________________ who lived from 1768-1830.
|
French mathematician & physicist
|
|
Fourier came up with a way of analyzing complex signals and :
|
decomposing them or splitting them into a series of component frequencies.
|
|
This person's great contribution was to show that even complex sounds can be broken down into their individual sinusoid components.
|
Joseph Fourier
|
|
All _____________ sounds are made of a combination of sine waves. (component frequencies)
|
periodic
|
|
What are 3 ways that individual sine waves may vary?
|
1. amplitude 2. phase angle 3. frequency
|
|
What does phase angle of a frequency mean?
|
where they are in the cycle relative to one another
|
|
What is a Fourier Transform?
|
when we use the techniques that Fourier developed
|
|
A Fourier Transform creates a ___________ from a _______________ waveform (such as a microphone signal).
|
spectrum, time domain
|
|
A spectrum of a signal shows us:
|
what the individual components are.
|
|
What you see when you have a microphone recording on a screen in front of you is ________________ __________ data.
|
time domain
|
|
In a time domain data waveform, you will have ___________ on the X axis and __________ on the y axis.
|
time, amplitude
|
|
If you subject a time domain waveform to a Fourier Transform, what you get is a ______________ _____________ Display that shows you a ____________.
|
Frequency Domain, spectrum
|
|
A _______________ represents a slice in time, or a snapshot in time.
|
spectrum
|
|
A spectrum has ________ on its X axis and _______ on its Y axis.
|
frequency, amplitude
|
|
The Fourier Transform is a way from going from one view of a signal to another view – from the ___________ domain to the __________ domain.
|
time, frequency
|
|
In an audio signal, the voltage of the signal increases and decreases in proportion to the :
|
sound pressure of the wave.
|
|
A waveform that repeats regularly and has multiple sine wave components is a :
|
periodic wave
|
|
2 characteristics of time domain data are that:
|
a waveform represents sound directly and air pressure changes over time
|
|
In a spectrum display, each column represents:
|
one of the frequency components
|
|
A ________________ ________________ splits out the individual sounds from the combined total.
|
specrum display
|
|
In a spectrum display, when you play all of the frequencies together, it will sound the same as a:
|
time domain waveform
|
|
In a spectrum display, if you play each frequency separately, they will sound:
|
different from each other
|
|
Two great benefits of a Frequency Domain display are that it can:
|
show us what the individual components are and show us the relative proportions of each component
|
|
In a spectrum display, the height of each bar represents the :
|
strength of each of the components.
|
|
A spectrum shows ___________ ___________ data.
|
frequency domain
|
|
A line spectrum shows the frequency components of a:
|
periodic sound
|
|
In a _____________ ___________, there is no energy between the individual harmonic components that make up the entire spectrum.
|
periodic signal
|
|
The sound source that comes out of the larynx is very similar to a ___________ signal, but there is also some __________ mixed in it.
|
periodic, noise
|
|
The _____________ ____________ is nearly periodic.
|
human voice
|
|
The human voice has a fundamental and then ________ which are multiples of that fundamental.
|
harmonics
|
|
In the human voice, the upper harmonics get progressively ___________ as you go up in frequency.
|
weaker
|
|
A PERIODIC SIGNAL SPECTRUM is a _____________ ______________ description of the signal.
|
frequency domain
|
|
In a time domain display, __________ is represented from left to right.
|
time
|
|
In a line spectrum, the peaks represent:
|
the individual harmonic components of this signal.
|
|
If you make your voice softer or louder, you’ll see some changes in the relative strength of the :
|
harmonics.
|
|
If you raise and lower your voice pitch, you’ll see the _________ spreading apart from one another or coming back together as the pitch comes back down again.
|
harmonics
|
|
If you had a ____________ __________ __________, there would only be one line in your display. _________ __________ ______________ would have multiple lines in the display.
|
simple sine wave, complex periodic signals
|
|
Noise is when: (3 things)
|
1. all frequencies are present 2. with various different phase relationships to each other (random phase) 3. with approximately equal amplitude.
|
|
Name 4 types of noise:
|
white noise, pink noise, brown noise, multi-talker noise
|
|
A spectral envelope shows us:
|
the relative strength of the different frequency components in the noise.
|
|
A ____________ _____________ encloses an area that would have been completely filled in by those vertical lines that represent the individual sine waves.
|
spectral envelope
|
|
The voice source is not truly :
|
periodic. (it is NEARLY periodic)
|
|
A spectrum does not have pure lines; instead it has:
|
peaks
|
|
In a voice signal, there will be some ___________________ around the fundamental and harmonics
|
spread of energy
|
|
The spread around the fundamental and harmonics represents the _________________________ that is present in every human voice.
|
imperfections or noise
|
|
In a voice signal, if it’s not a harmonic, then it’s _____________-.
|
noise.
|
|
A ratio of the height of these harmonic components to the level of noise between them is called a:
|
Harmonics to Noise ratio
|
|
What are the two major kinds of spectra?
|
FFT and LPC
|
|
What does FFT stand for?
|
Fast Fourier Transform
|
|
What does LPC stand for?
|
Linear Predictive Coding
|
|
The FFT shows us the:
|
range of harmonics that are present in each sound
|
|
In an FFT spectrum, there is a peak for each of the:
|
harmonics
|
|
An FFT does not show us ___________ very well.
|
formants
|
|
This type of display is useful for displaying features of the sound source, but not so much about the vocal tract filter:
|
FFT spectrum
|
|
This type of display shows a spectral envelope, but not individual harmonics:
|
LPC spectrum
|
|
With an LPC spectrum, you don’t get details of the :
|
sound source
|
|
This type of display can be quite revealing of what the vocal tract filter is doing – how the vocal tract shapes the sound given to it by the larynx:
|
LPC spectrum
|
|
Different vowel __________ show different energy regions.
|
formants
|
|
What are formants?
|
resonant energy peaks in the vocal tract transfer function
|
|
This vocal tract _____________ _____________ applies whether the sound coming into it is periodic (or almost) and phonated, or whether it’s merely being whispered.
|
transfer function
|
|
The formants you see in the vocal tract will be present because of the way you have :
|
shaped the vocal tract.
|
|
Formants will be the same whether the sound is being ___________ or ____________.
|
phonated, whispered
|
|
This type of display can show you what the vocal tract's resonance frequencies are relative to one another.
|
LPC spectrum
|
|
An FFT spectrum shows _______________ ________ that are harmonic components.
|
individual peaks
|
|
The peaks in an FFT spectrum are spaced equally because they are:
|
multiples of the fundamental.
|
|
An LPC spectrum shows peaks that represent:
|
the formants
|
|
The first 3 formants of an LPC spectrum can help us:
|
identify a vowel
|
|
The LPC spectrum reveals details of the:
|
sound FILTER
|
|
The FFT spectrum reveals details of the:
|
sound SOURCE
|
|
A ___________ spectrum will not tell us if a sound is pure or disordered.
|
LPC
|
|
A _________________ shows the ingredients of a sound at a single point in time.
|
spectrum
|
|
A _____________________________ shows the progression of a signal over time.
|
time domain display
|
|
A ______________ shows us how the frequency components of a sound changed over time.
|
spectrogram
|
|
A spectrogram which can show us individual ___________ __________ that are arranged side by side over time.
|
spectral slices
|
|
As we speak, we are producing, in rapid succession, a whole series of sounds that differ from one another in their :
|
spectral characteristics.
|
|
A spectrogram is sometimes called a:
|
3D spectrogram
|
|
Why did the 3D spectrogram get that name?
|
because it can display 3 different parameters at a time:
|
|
What 3 parameters can a spectrogam display?
|
Time, Frequency, and intensity
|
|
A spectrogram shows ________ on the X axis and ___________ on the Y axis.
|
time, frequency
|
|
On a spectrogram, Whether the trace is lighter or darker represents the __________ of the signal.
|
INTENSITY
|
|
On a spectrogram a darker trace = stronger:
|
intensity or amplitude
|
|
A _____________ is kind of a hybrid domain display since it shows both time and frequency.
|
spectrogram
|
|
A ____________ shows us how the strength in each range of frequencies is changing or evolving over time.
|
spectrogram
|
|
A _________________ display is rather rich in information because it shows frequency changes over time and the changes in amplitude of those components – whether they are strong or weak.
|
3D spectrographic
|
|
Spectrograms are made from digital recordings, and so the frequency display on the Y axis is limited to the highest frequency component you were able to save and reproduce, the:
|
Nyquist frequency.
|
|
If you are interested in looking at _____________ _______________ speech on a spectrogram, you must at least double the sample rate, and maybe even give it a little bit more.
|
high frequency
|
|
If you were interested in fricative energy that might go all the way up to 12-14 kHz, you’d have to make sure that the sample rate of your digital recording was at least _________________________ to make sure that you include all of that interesting fricative energy.
|
double this amount --maybe 25-30 kHz
|
|
If you only care about the info from ___________________, then you could sample at 8-10000 Hz.
|
0-4000 Hz
|
|
If you sample at too low of a rate, there’s no way to :
|
recover info that you never had in your recording.
|
|
What is one analysis parameter you can adjust?
|
bandwidth
|
|
A ___________ ______ spectrographic analysis gives you good TIME, or temporal detail.
|
wide band
|
|
A wide band spectrographic analysis can show you individual events that are very ____________, and resolve those with great accuracy.
|
brief
|
|
A ___________ ________ spectrogram tends to blur things together over time, but it gives you extremely good FREQUENCY resolution.
|
narrow band
|
|
Which kind of spectrogram display would show individual frequency components, separate from one another?
|
narrow band
|
|
Unfortunately, the laws of physics prevent us from having good_______ AND _____________ detail on a 3D spectrogram display.
|
time, frequency
|
|
The software takes in ____________________ in order to provide the display, and you need more and more ____________ to get a good idea of what frequency components are present and to separate them.
|
strings of numbers, numbers
|
|
If you want to look at only brief events in time, these will occur over very few samples in a _____________________________.
|
string of numbers.
|
|
You can have good TIME resolution OR good FREQUENCY resolution, but you can’t have :
|
both at the same time.
|
|
A 300 Hz bandwidth is what we refer to as a ______________ ____________ spectrogram.
|
Wide Band
|
|
In a spectrogram, the vertical striations or thin vertical lines, represent:
|
each glottal pulse as it occurs in voicing.
|
|
Because you have a high level of detail and you can see events as brief as individual glottal pulses, you can make very good _________________ from a wide band display.
|
time measures
|
|
On the wide band spectrogram, the X axis represents ___________ and the Y axis represents ____________.
|
time, frequency
|
|
A 45 Hz bandwidth is what we refer to as a ________________ spectrogram.
|
narrow band
|
|
In a narrow band display, we can no longer see the individual :
|
vertical striations that represent glottal pulses
|
|
On a narrow band display, you can see horizontal, almost parallel, lines called:
|
HARMONICS
|
|
The wide-band display could not show harmonics because:
|
it couldn’t give us good frequency detail
|
|
In a ____________________ display, we have extremely GOOD FREQUENCY DETAIL.
|
narrow band
|
|
In a narrow band display, we can tell the _______________ apart from one another.
|
harmonics
|
|
Which type of spectrogram shows individual vertical glottal pulses (vertical striations) and has good TIME resolution?
|
wide band
|
|
Which type of spectrogram shows horizontal lines that represent HARMONICS of the voice, and separates them out… It has –good FREQUENCY resolution?
|
narrow band
|
|
From looking at a spectrogram, you can get a _____________ idea of what’s going on in the voice –kind of like looking at an ______________.
|
qualitative, X-ray
|
|
The larynx and vocal tract differ _________________ from one person to the next, which influences the ____________ of the sounds the person will produce.
|
anatomically, acoustics
|
|
There are structural _______________ that cause the uniqueness of your own voice.
|
differences
|
|
Function, or the way we use our _____________________, can also contribute to our dialect or other features of our voice which are not anatomically based.
|
articulators, etc.
|
|
In speech therapy, a lot of the work we do involves helping people change the function of their _____________. We don’t really get involved with ________________ changes.
|
articulators, structural
|
|
In speech therapy, we work on altering the way the client uses his structures to improve ____________ __________.
|
sound output.
|
|
You can judge a person you hear by their ______________; whether they are older, younger, healthy, sick, how they feel.
|
voice
|
|
A person's voice ______________ come through the acoustic signal.
|
characteristics
|
|
We can study voice features such as __________________________ to understand more about how people speak differently.
|
prosodic variables or vowel dialect variables
|
|
Women’s voices are about ____________________ in their fundamental frequency than men’s are.
|
an octave higher
|
|
What is the main reason that women's voices are typically higher than men's?
|
Anatomy -- the men’s vocal cords are larger.
|
|
However, even when a man raises his pitch to sound like a woman, it is still not very convincing. There are other features that contribute to the difference: (name 4)
|
1. vocal fold closure patterns 2. harmonic spectral slope 3. differences in noise between the formants 4. the frequency values of those formants
|
|
What is spectral slope?
|
The amplitude of the harmonics resulting from vocal fold vibration falls off by 12 dB per octave. This means that each time the frequency doubles, the amplitude of the harmonics decreases by 12 dB. This is called the spectral slope (or tilt or roll-off) in the source spectrum.
|
|
The frequency values of the formants are based on the _______ of the vocal tract.
|
size
|
|
Why are vowels and diphthongs interesting to speech scientists?
|
Because they tend to carry the greatest amount of energy in speech.
|
|
In theory, when you produce a vowel: The vocal tract shape can be held constant, and a vowel can be prolonged as long as you have breath – the vowel is _______________ _______________.
|
TIME INVARIANT.
|
|
During the time you are sustaining a ____________ production, one point in time looks like any other, acoustically.
|
vowel
|
|
___________________ are usually produced very briefly.
|
consonants
|
|
What determines the particular vowel that we will be producing at a certain moment?
|
the shape of the vocal tract or the configuration of the articulators .
|
|
The output from the _____________ is basically the same for every sound we produce (plays an electrical buzzing type sound).
|
larynx
|
|
What makes one vowel different from another?
|
The frequencies of the formants are different for each one.
|
|
The ____________ spectrum reveals features of the vocal tract transfer function.
|
LPC
|
|
The LPC spectrum shows what the ____________ ____________ has done to the sounds that were given to it.
|
vocal tract
|
|
In the LPC display, formants are represented by:
|
peaks
|
|
The exact locations of these _________________ allow us to distinguish between the different vowels that are being produced
|
formants
|
|
F1, F2 & F3 differ across _____________.
|
vowels.
|
|
In a spectral display, the numbering order of these formants goes from:
|
left to right
|
|
Usually you can identify the vowel from the _________________________, even with synthetic speech.
|
first 2 formants
|
|
You could generate speech with a computer, by allocating certain _________________ ______________ that correspond to the formants.
|
frequency prominences
|
|
A computer can generate an F1 and F2 by:
|
giving more energy to those frequency ranges
|
|
The locations of the _______________ – where you put them on the spectrum -- will allow the listener to perceive the vowel as being an /u/ or an /i/ or an /a/.
|
the formants
|
|
_______________ formants tend to make the vowels sound more natural.
|
The higher formants – F3, F4, F5 and so on
|
|
If you don’t include the higher formants in generated speech, it will sound rather ___________________.
|
robotic.
|
|
Formants are:
|
peaks in the vocal tract transfer function
|
|
As energy passes through the vocal tract, there are certain ___________ that are favored, and other frequencies that are weakened or attenuated.
|
frequencies
|
|
The __________________________ will dictate what frequencies are favored (or echoed or resonated) and which are not.
|
size of the resonator
|
|
Think of the pipes of an organ – the small pipes will play __________ notes and the large pipes play ________ notes.
|
higher, lower
|
|
The vocal tract is made up of different _______________ cavities.
|
resonating
|
|
The resonating cavities of the vocal tract differ in their size and shape depending on how you move your ____________.
|
articulators
|
|
_____________ frequencies will differ depending on the articulatory configuration.
|
Formant
|
|
When you reach a formant peak in the vocal tract, energy in that vicinity – in that frequency range – will be _____________________________ so that it comes out of the mouth at a much higher amplitude.
|
amplified or boosted
|
|
Frequencies that occur in a trough of the vocal tract transfer function tend to be:
|
filtered out or weakened.
|
|
______________ ______________is more efficient at the formants.
|
Energy transfer
|
|
Remember that we can identify a vowel just by the frequencies of:
|
F1 and F2.
|
|
The absolute values of vowel ______________ can be influenced by the person’s dialect and gender (men’s vocal tracts are larger than women’s and children’s).
|
frequencies
|
|
Why do children have higher formant frequencies than men or women?
|
Because they have smaller vocal tract structures, including smaller resonating cavities.
|
|
Men, women and children have a similar _____________ _____________ shape.
|
vowel quadrilateral
|
|
A larger vocal tract yields ___________ formant frequencies.
|
lower
|
|
Children have higher_______________ ________________________ than men or women, because they have smaller vocal tract structures, including smaller resonating cavities.
|
formant frequencies
|
|
We can plot the _______________ and ______________ formant frequencies to show VOWEL SPACE.
|
first and second
|
|
The vowel quadrilateral for children is the __________________ and it spreads the furthest up and to the right. This is because their ________________ ____________________ are higher.
|
largest, formant frequencies
|
|
The women’s quadrilateral is in the middle, and the men’s is the furthest down and to the left, because their formant frequencies are ________________. (The men have larger resonating cavities in their vocal tracts.)
|
lower
|
|
As you lower the jaw and tongue, you _______________ the frequency of the first formant.
|
increase
|
|
When you lower the tongue and jaw, you are ___________________________ and causing different cavities to resonate different frequencies.
|
reshaping the vocal tract
|
|
These interact in very complex ways – you can’t anatomically associate F1 with just one ____________ in the vocal tract. They do interact with one another in that the sounds bounce backwards and forwards and ________________ in different ways.
|
structure, resonate
|
|
LOWERING THE JAW AND TONGUE INCREASES THE FREQUENCY OF THE ___________________ ____________________.
|
FIRST FORMANT.
|
|
F2 is influenced by the _____________________________ position of the mouth.
|
forward or backward
|
|
If the tongue is further back in the mouth, the second formant has a relatively _______________ frequency.
|
lower
|
|
AS YOU MOVE THE TONGUE FORWARD, THE SECOND FORMANT’S FREQUENCY ________________.
|
INCREASES.
|
|
Higher vowels =
|
lower F1
|
|
Lower vowels =
|
higher F1
|
|
Front vowels =
|
higher F2
|
|
Back vowels =
|
lower F2
|
|
LIP ROUNDING ______________ ALL FORMANTS, REGARDLESS OF VOWEL TYPE.
|
LOWERS
|
|
The reason lip rounding lowers the formants is because as you round your lips, you are actually ___________________ the vocal tract. The tube that you are resonating, from the glottis to the outside world, is actually becoming ________________.
|
lengthening, longer
|
|
You can lower the larynx to _______________ the tube length, and just like the longer organ pipe, it would resonate lower frequencies. All of your formants would decrease in _________________.
|
increase, frequency
|
|
It is difficult to anatomically pinpoint where a given formant is resonated – there are ________________ _____________ between these chambers or cavities.
|
complex interactions
|
|
The sounds bounce back and forth between the resonating cavities, and influence each other by what 2 ways?
|
constructive interference and destructive interference
|
|
When the waves in the resonating cavities add up to gain more strength, it's called _______________ interference.
|
constructive
|
|
When the waves in the resonating cavities cancel each other out, it's called _______________ interference.
|
destructive
|
|
You can see the vowel /i/ has a high second formant and a lower first formant. So what type of vowel is this?
|
high front vowel
|
|
You can see that /u/, which is a high back vowel, will have what type of formats?
|
low f1, low f2
|
|
The different formants are caused by the associations between the tongue height and its _____________ position in the mouth.
|
front/back
|
|
The absolute formant frequencies of F1 and F2 differ substantially between :
|
children, women and men.
|
|
When we speak and produce these vowels in the context of consonants in the words that we utter, what happens?
|
we may not produce them in exactly the same ways.
|
|
What is one of the main challenges of studying formants?
|
vocal tract size affects the formants, and so our formants are not uniform.
|
|
What are some reasons why our formants can vary for the same vowel?
|
gender, age, vocal tract size, individual anatomical differences
|
|
How do we learn to recognize speech in spite of the variations in formants for the same vowel?
|
listeners may apply ‘templates’ and timbre and pitch may serve as cues
|
|
The differences in formant frequencies are related to the different:
|
sizes in vocal tracts.
|
|
Individuals will differ in their vocal tract anatomy so that their vocal tracts will resonate different _______________.
|
frequencies.
|
|
So formants are going to ________________ from speaker to speaker.
|
vary widely
|
|
The absolute value in ________ of the 1st and 2nd formants for the different vowels that a person produces will be different for each person.
|
Hz
|
|
When we hear speech, we are perceiving completely different sets of _____________________ from each person.
|
frequencies
|
|
It’s possible that listeners use _______________ in their auditory perceptual system.
|
‘templates’
|
|
We try to match features of vowel ________________ to other features of the sound that the person is making.
|
articulation
|
|
The _______________ and ____________ of the voice may be valuable cues that the brain uses to decode vowel signals.
|
timbre and pitch
|
|
If you hear a higher pitch, you would usually associate that with a person who has a _________________ larynx & vocal tract, and therefore you would expect ____________- formant frequencies.
|
smaller, higher
|
|
This variability in _______________________________________ also has a huge impact for people who are working on computer recognition of speech – it’s very difficult for software to recognize what you are saying.
|
absolute formant values
|
|
Speech recognition software gets used to the way you produce your “a” or “i” or “u”, by tuning into your specific :
|
frequencies
|
|
Automatic speech recognition systems, where you are asked to say a word rather than pressing a number, invariably tend to :
|
make mistakes.
|
|
When human listeners encounter new people, they adapt to their speech ___________.
|
very quickly.
|
|
Our brains might also look at the __________ between these frequencies.
|
ratios
|
|
Even if a new person’s absolute values in Hz differ from another person’s we’ve heard, the ratio of the ____________ may be broadly similar.
|
peaks
|
|
There may be a large gap between the first 2 formants for one sound, or they may be close together for another, so the __________ of the peaks is similar, regardless of frequency.
|
ratio
|
|
The _______________ is very good at processing speech sounds.
|
brain
|
|
A ____________ is a sound you can prolong as long as you have breath and hold the vocal tract steady.
|
vowel
|
|
In real speech, vowels are seldom produced in ________________.
|
isolation
|
|
Usually there are _______________ on either side of a vowel.
|
consonants
|
|
The production of the vowel is influenced by the sounds that are:
|
adjacent to it.
|
|
What is coarticulation?
|
When the produciton of a sound is influenced by the sounds adjacent to it.
|
|
It’s quite common for the articulation of one of the corner vowels - /a/ , /i/, /u/, /ae/ to be ________________________________, when the person is producing normal speech.
|
undershot, or not quite achieved
|
|
What are the corner vowels?
|
/a/ , /i/, /u/, /ae/
|
|
When the vowel is undershot, or not quite achieved, it is called:
|
VOWEL NEUTRALIZATION
|
|
During vowel neurtralization, the position of the tongue is a little closer to the neutral vowel /ə/ (schwa) in the ___________ of the vocal tract.
|
middle
|
|
What is the "neutral" vowel?
|
/ə/ (schwa)
|
|
In ________________ , where you have motor speech disorder, the vowels will be under-articulated too much, and the speech is ___________________.
|
dysarthria, distorted.
|
|
The _______________ has to make a judgment on the distinction between a normal and a disordered case of underarticulation.
|
The clinician
|
|
In clinical research, we sometimes use a ___________________ ______________ to quantify how well a person is articulating.
|
VOWEL QUADRILATERAL
|
|
The area within the vowel quadrilateral is called the:
|
VOWEL SPACE AREA
|
|
The vowel space area within the quadrilateral can be reflective of a person’s:
|
overall intelligibility.
|
|
if you were going to treat a person with dysarthria, you could measure the person’s vowel quadrilateral area before it started, and again after, and you could document :
|
changes/ improvements to the vowel space
|
|
Increases in the overall space of the vowel quadrilateral are reflective of larger:
|
articulator movements
|
|
Larger articulator movements are typically associated with improved :
|
speech intelligibility.
|
|
What do vowel ellipses show?
|
the differences in the way individuals produce vowels
|
|
You may produce a vowel rather differently depending on the vowel’s _____________-- which consonants surround it.
|
context
|
|
The _______________ ____________ is a reflection of how much variability there can be in the first and second formants, as you produce that vowel in different contexts.
|
vowel ellipse
|
|
Vowel ellipses show that the frequencies can vary around the ________________________, but it’s still an acceptable version of that vowel.
|
central or ideal target
|
|
What are the most common vowels across all languages?
|
/i/, /u/ and /a/
|
|
Generally speaking, languages tend to have between _____ and ________ vowels.
|
3 and 9
|
|
The acoustic spacing of these vowels tends to put them _____________________ of the vowel quadrilateral.
|
at the edge
|
|
The fact that vowels tend to be around the edges of the vowel quadrilateral means that they are more :
|
distinctive from one another.
|
|
If vowels were all clustered in the middle of the quadrilateral, close to the neutral vowel, it would be difficult to make them:
|
distinct from one another.
|
|
If vowels are ___________________ in the acoustic space, it gives the brain a better chance to separate one vowel from another.
|
spread out more
|
|
When learning a foreign language, one problem is encountering vowels that :
|
you’ve never spoken before.
|
|
It can be very hard as a new speaker of a language to make:
|
those subtle differences between the vowels that are very meaningful to the native.
|
|
We are used to hearing certain proportions of formants, but we’re not used to _____________________ of formants.
|
new combinations
|
|
When learning a new language, it can require a great deal of mental effort to get your ears and brain around these new combinations of __________ that you have not been exposed to before.
|
formants
|
|
When you produce a vowel for a few seconds, its first and second formants :
|
won’t change.
|
|
As you go from a consonant to a vowel, or vice versa, we have things called ______________ _____________, where F1 and F2 will change.
|
FORMANT TRANSITIONS
|
|
You can trace the history or evolution of F1 and F2 as they change over time, going in and out of :
|
consonants and vowels.
|
|
Formants are not always the ______________ ______________ that must be produced as they are in the isolated state.
|
steady states
|
|
The _______________ of a talking person really don’t stay put for very long.
|
the articulators
|
|
The tongue, lips and jaw are in almost constant _____________.
|
motion.
|
|
We don't really produce these successive phonemes in isolation, one at a time, but instead we:
|
blend them into each other.
|
|
Each sound influences, and is influenced by, what?
|
all of its neighbors
|
|
The __________________________________ that we can measure in a vowel really don’t last very long.
|
F1 and F2 frequencies
|
|
The constant movement of the articulators means that the ______________ _____________ are changing shape and size, and thus the ___________ _________________are shifting all over the place very rapidly.
|
resonating cavities, formant frequencies
|
|
An ordinary vowel is called a ____________________ – it has a relatively clear steady state that is maintained for a period of time.
|
MONOPHTHONG
|
|
A DIPHTHONG is defined by its ______________. The formant frequencies will _____________ during its production.
|
change, change
|
|
In a diphthong, the _______________ is the starting frequency, before the movement begins.
|
ONGLIDE
|
|
The ____________ is the ending frequency after you’ve finished making the diphthong.
|
OFFGLIDE
|
|
In between the onglide and the offglide is the ______________, when the formants are changing.
|
TRANSITION
|
|
The ______________ phase is what we perceive as the dipthong.
|
transition
|
|
/oi/ or /au/ or /ai/ are examples of:
|
dipthongs
|
|
It is the movement of the ____________ that really defines the diphthong that we perceive.
|
formants
|
|
What 3 things do all vowels have in common?
|
They are all voiced, the vocal tract is relatively open, and they can be identified by their formant patterns.
|
|
If you have plotted ____________ and _____________, you can identify the vowel.
|
F1 and F2
|
|
What are 4 ways that consonants differ from each other?
|
1. degree of vocal tract constriction 2. degree of noise 3. degree of nasality 4.articulation features
|
|
What makes an important distinction between oral consonants and nasal consonants?
|
Whether or not the velopharyngeal port is open
|
|
The only nasal consonants in English are:
|
/m/, /n/, and /ng/.
|
|
We produce the nasals by lowering the ________________, allowing air and sound to pass through the _______________________ and resonate in the nasal cavity.
|
velum, velopharyngeal port
|
|
While producing nasals, there’s a blockage of the __________ _____________.
|
oral cavity
|
|
Where is the oral cavity blockage for /m/ ?
|
at the lips for /m/
|
|
Where is the oral cavity blockage for /n/ ?
|
at the alveolar ridge for /n/
|
|
Where is the oral cavity blockage for /ng/ ?
|
at the posterior part of the palate for /ng/.
|
|
Compared to vowels, there is far more potential for ________________________ with consonants.
|
things to go wrong
|
|
How do consonants compare to vowels in terms of speech motor control?
|
There is greater complexity and precision involved in the timing and in the physical articulation of consonants. Vowels, by comparison, are relatively simple to articulate.
|
|
SLPs will spend more of their time treating errors in ________________ production.
|
consonant
|
|
What are the two main categories of consonants?
|
obstruents vs. non-obstruents
|
|
The obstruents category of consonants includes what 3 sub-categories?
|
stops/ stop-plosives, fricatives, and affricates
|
|
With _______________ – you get a complete closure of the vocal tract, just momentarily.
|
STOPS
|
|
With _______________ there is a narrowing of the vocal tract, and this forms a partial obstruction.
|
FRICATIVES
|
|
_____________________ are combined stops and fricatives .
|
AFFRICATES
|
|
An affricate has a fairly brief _____________ and a fairly brief _____________ interval.
|
stop, frication
|
|
An ______________ consonant is sort of a combination sound.
|
affricate
|
|
What are the 3 sub-categories of Non-obstruents?
|
nasals, glides/semi-vowels/approximants and liquids
|
|
The ______________ are sounds that do not completely block off the vocal tract, but have some form of constriction.
|
non-obstruents
|
|
Often a non-obstruent sound is produced with a change in the _________________________________________ throughout the production of that phoneme.
|
pattern of constriction
|
|
Just as there is articulatory movement during the production of a diphthong like “oi”, there’s also movement of the articulators during the production of:
|
glides and semivowels.
|
|
When you pronounce the phonemes, __________, __________, and __________, you have articulators moving during their production -- these are not static sounds.
|
/w/, /l/, and /j/
|
|
For English Stops, what are the 3 places of production?
|
bilabial, alveolar, and velar
|
|
What is the Manner for Stops?
|
a brief closure of the vocal tract
|
|
Stops can be either voiced or _______________.
|
voiceless
|
|
STOPS are consonants that involve the stoppage of air through the vocal tract, or complete closure of the vocal tract by :
|
an articulatory constriction.
|
|
In English, the bilabial stops are:
|
/p/’s and /b/’s
|
|
In English, the alveolar stops are:
|
/t/’s and /d/’s
|
|
In English, the velar stops are:
|
/k/ and /g/
|
|
The bilabial stops have a constriction formed by:
|
putting both lips together
|
|
The alveolar stops have a constriction formed by:
|
raising the tongue against the alveolar ridge.
|
|
The velar stops have a constriction formed when:
|
the back of the tongue reaches up to touch the velum.
|
|
If a stop is accurately articulated, there will be a brief instance during which there will be :
|
no sound in the vocal tract.
|
|
Part of a ________________ is reflected in the microphone signal as a brief silent event.
|
stop
|
|
Sometimes a tiny bit of ________________ may sneak through for voiced stops.
|
energy
|
|
During the closure, air pressure builds up behind the __________________________________, and then as the closure is released, this air bursts out.
|
point of constriction
|
|
The __________________________ creates the characteristic sound of a stop being released in the vocal tract.
|
frication energy
|
|
There is also a period of transition when the articulators move from having formed the stop to the upcoming ____________.
|
vowel.
|
|
The stop closure is also sometimes called the _______ _______.
|
STOP GAP.
|
|
A stop gap is characterized by a________________________ during the time that the articulatory constriction is being formed.
|
lack of energy
|
|
A stop gap is most easily seen if the stop occurs ____________ ____________.
|
between vowels.
|
|
If the stop is produced at the beginning of a word: “ta” or “ka”, then it would very difficult in many cases to see the stop closure duration, because of the :
|
lack of energy prior to the formation of that stop.
|
|
The duration of the closure is usually from ______________ milliseconds.
|
50 -150
|
|
In a voiced stop you may have _________________________ that continues throughout the closure of the stop.
|
vocal fold oscillation
|
|
During a stop gap, there is an absence of energy just a brief period of time -- it may last about :
|
100 milliseconds or so – about 1/10 of a second.
|
|
In the stop gap of a voiced stop, there is still some _____________________.
|
acoustic energy
|
|
There is some grey trace in the stop gap of a voiced stop spectrogram that indicates a very low frequency. It is called a:
|
VOICING BAR
|
|
The Voicing Bar corresponds to the oscillation of the ____________ ___________, which is continuing, even after the stop has been formed.
|
vocal folds
|
|
The trace in the voicing bar gradually ____________ during the stop gap.
|
decreases
|
|
The VOICELESS STOPS have a _____________ closure duration.
|
LONGER
|
|
The VOICED STOPS have a _____________ closure duration.
|
shorter
|
|
Voiced Stops have a shorter closure duration no matter what the place of ____________ is.
|
articulation
|
|
For stops in the initial position, It’s very difficult to tell how long the stop closure was because:
|
there’s no vowel energy prior to the production of the stop.
|
|
For stops in the initial position, it's impossible to measure a meaningful stop closure duration because:
|
you could be holding your articulators in place for a long time before you actually utter one of these syllables
|
|
When we want to measure stop closure duration, we have to look at contexts where there is some:
|
some vowel energy happening before the stop.
|
|
If you were to look at stop closure durations in certain speech disorders, you might find that they are increased or decreased, depending on:
|
what the disorder is
|
|
If a person was severely dysfluent and had stuttering blocks, you might find that the stop closure duration is drastically__________ as the person blocks on that sound and is unable to get the mechanism moving ahead into the next vowel.
|
increased
|
|
In some ______________, where you have neuromuscular speech deficits, the speaker may not be able to make a very good stop closure.
|
dysarthrias
|
|
With dysarthria, the tongue or the lips may not approximate their targets, and so they don’t make a particularly long ____________ and it isn’t a very effective one, and that’s why the speech may sound ____________________.
|
closure, slurred and imprecise.
|
|
We can document changes in things like stop closure duration as an index of _______________ adequacy.
|
articulatory
|
|
During the time that the constriction is formed for a stop,_________________ builds up behind the constriction.
|
air pressure
|
|
Once the ______________ of a stop is released, the pressure that has built up behind it forces air out through the space that is now gradually opening.
|
constriction
|
|
When the stop is released, first we get a small _____________, which is a rather sharp burst of air.
|
TRANSIENT
|
|
In acoustics we use the term transients to refer to a ________ ___________ acoustic event .
|
very brief
|
|
Once this burst of a stop has taken place, the noise continues, but in a slightly different form – it’s of __________ amplitude, but of much ____________ duration.
|
lower, longer
|
|
A stop burst is a very__________ event.
|
brief
|
|
After a stop burst, the ___________ continues while the opening is becoming wider and wider – the air is still rushing through it.
|
frication
|
|
After the stop burst, the air coming through is forming turbulence because the airflow is still ____________________ – the pressure has not yet decreased to match that of the ___________ around the speaker.
|
relatively high, atmosphere
|
|
It can be difficult to distinguish between the transient and the ______________ _____________ on a spectrogram.
|
frication interval
|
|
Regardless of the place of articulation, there will be some _____________________ produced when a stop constriction is released.
|
frication noise
|
|
The ______________________ of the stop's frication noise, however, are not constant across all places of articulation.
|
characteristics
|
|
What things will be different if you articulate a bilabial stop versus an alveolar stop?
|
the spectrum varies; the frequency components are not going to be identical
|
|
The spectra for _______________ noise looks very steady and level all the way across. All frequencies are present and are equal in amplitude. Sometimes the envelopes that enclose the spectrum can be more variable.
|
white
|
|
The spectra for _______________ and ______________ noise do not include all of the frequencies from the lowest to the highest. As you get into the higher frequencies, the noise has been shaped to taper off as you get to the upper end of the spectrum, so there is less high frequency energy.
|
pink and brown
|
|
Noise characteristics vary, depending on the :
|
place of constriction
|
|
Why does the place of constriction make the noise characteristics vary?
|
This is because there are different resonating cavities on either side of the stop closure which will influence the sounds that are being resonated or attenuated.
|
|
Different resonating cavities on either side of the ______________________ will influence the sounds that are being resonated or attenuated.
|
stop closure
|
|
This place of constriction has a lower frequency:
|
bilabial
|
|
This place of constriction has a higher frequency:
|
alveolar
|
|
This place of constriction has a mid-range frequency:
|
velar
|
|
With vowel formants, F1 and F2 are created by :
|
resonating cavities
|
|
As the sizes and shapes of the resonating cavities are adjusted by articulating, the _______________________ change correspondingly.
|
formant frequencies
|
|
Large organ pipe = _____________ frequency
|
low
|
|
Small organ pipe = _____________ frequency.
|
high
|
|
When you have a ______________ STOP, there is no narrow passageway in front of it, instead there is a wide open space in front of the speaker’s face. This large resonating space results in a ___________ frequency.
|
BILABIAL, low
|
|
Directly in front of an ______________ STOP is a small space between the lips where the fricative is formed. Since the space is smaller, the frequency is ______________.
|
ALVEOLAR, higher
|
|
There is some space in front of the velum with _____________ STOPS, but it is inside the oral cavity. This space, because of its size and shape, resonates _____________ frequencies.
|
VELAR, mid-range
|
|
The noise characteristic of each of the places of articulation can help the listener identify what?
|
where the stop was formed.
|
|
Frication is common among all of these places of articulation but the sound is going to be shaped by the ____________________________ associated with that place of articulation.
|
resonating cavities
|
|
Even though stops can be produced in the same place of articulation, there can be differences, depending on the _______________________ in which they are produced.
|
phonetic context
|
|
When the stop occurs in the _______________________ POSITION, or at the end of a word, the listener will hear the frication as the stop is released.
|
WORD-FINAL
|
|
(Ex: pad, pat) When you clearly release the stop closure in a word-final stop, it will definitely help with :
|
intelligibility and speech clarity.
|
|
With more casual speech, it is harder to tell where the closure ends because the stop is not released as clearly, thus preventing the :
|
production of a burst of noise.
|
|
In casual speech, the place of articulation, since it is associated with certain shaping of the noise (or adjustment of the noise spectrum), can’t be identified because:
|
the information is missing that indicates this to the listener.
|
|
If you are learning a foreign language and listening to a native speaker of that language, you will be better able to identify________________________ if they clearly release their stops.
|
places of articulation
|
|
If a person ____________________________________ , it will give you more cues as to the place of articulation and the particular phoneme they are producing.
|
clearly releases their stops
|
|
A native listener might understand even if the stops are not released, because they would be able to tell from the ________________________________ what that sound ought to be.
|
context and their familiarity with the language in general
|
|
For unfamiliar listeners of a language, it’s helpful to have the extra information of having the stop released, and to hear the _________________________ that is included in that signal.
|
fricative energy
|
|
The darkness of the trace in a spectrogram indicates the _____________________ of the signal.
|
amplitude / strength
|
|
The frication is typically weaker in a _______________ stop.
|
voiced
|
|
The phenomenon of ____________________ is only seen in a voiceless stop context , and only when the stop is followed by a vowel (CV context).
|
ASPIRATION
|
|
ASPIRATION is only seen in a ______________________ context , and only when the stop is followed by a ___________.
|
voiceless stop, vowel
|
|
During aspiration, the vocal folds are beginning to _____________ for the vowel that is about to begin.
|
adduct
|
|
As the vocal folds come closer together, the air that’s moving between them creates turbulence at the ___________.
|
glottis.
|
|
When a person has just formed a voiceless stop, the vocal folds have been pulled apart from one another by activating the ______________________ muscles.
|
posterior cricoarytenoid
|
|
There is a complete separation of the vocal folds during the time that the voiceless stop has been formed. This is called a:
|
laryngeal devoicing gesture.
|
|
Once the stop is released, the air starts to flow, giving us the ________________________ interval.
|
transient and frication
|
|
Before the vowel actually starts, the vocal folds have to ________________________ for phonation to occur.
|
come into position
|
|
As the vocal folds are gradually drawing together, ______________________ is generated.
|
aspiration noise
|
|
During aspiration, air is moving through the glottis -- the vocal folds are not close enough together to produce ______________, but they are close enough to produce some _____________________ because the air is rushing between them in a narrow space.
|
phonation, turbulent noise
|
|
Aspiration noise is visible on a ___________________.
|
spectrogram.
|
|
On a spectrogram, the aspiration shows a weaker frication energy than the :
|
post-stop burst
|
|
The aspiration is coming from the _______________, not the point of constriction.
|
glottis, from the vocal folds themselves
|
|
The frication noise occurs at :
|
the place of constriction
|
|
________________ is a relatively soft acoustic event, rather weak in its energy.
|
Aspiration
|
|
_____________________ is a weaker noise to begin with, and it also has a further distance to travel before it reaches the person’s mouth.
|
Aspiration
|
|
Aspiration is only seen in the context of a _______________________.
|
voiceless stop.
|
|
In a ___________________, there is vocal fold oscillation very shortly after the burst release has occurred.
|
voiced stop
|
|
In a voiced stop, the vocal folds are ____________________ so there is no opportunity for the creation of aspiration.
|
already together
|
|
Stops can be categorized into voiced and voiceless, but this does not mean that there is a _________________ of voicing on either side of a voiceless stop, and it doesn’t necessarily mean that there is voicing ______________ a voiced stop.
|
total absence, all the way through
|
|
The difference between ‘p’ and ‘b’ or between ‘t’ and ‘d’ depends on :
|
VOICE ONSET TIME
|
|
The time between the release of the stop closure and the onset of vocal fold vibration is called:
|
Voice onset time (VOT)
|
|
A voiced stop occurs when the voice onset time is :
|
close to zero
|
|
Sometimes there are even negative values in voice onset time. This is called:
|
PREVOICING or VOICING LEAD.
|
|
Voiced stops typically have ____________ VOT values .
|
low
|
|
VOT's can range between ________________ for b, d, & g.
|
-20 to +20 milliseconds
|
|
Voice onset time is the time between:
|
the release of the closure and the start of vocal fold oscillation.
|
|
The notion of voice onset time refers to coordination between the activity of the ___________ and the ____________ as it forms constrictions.
|
vocal folds, vocal tract
|
|
The timing between releasing a stop and ________________________________ makes all the difference between voiced and voiceless stops.
|
initiating vocal fold vibration
|
|
When you say ‘pa’ with aspiration, _____________ does not immediately begin when the lips open.
|
voicing
|
|
When you say ‘ba’, there is _________________. There is vocal fold oscillation during the closure and it continues once the __________ have opened.
|
prevoicing, lips
|
|
When we look at a waveform with a spectrogram, how do we make the decision that voicing has actually begun? We rely on these two things:
|
(1) The microphone signal at the top shows regular up and down oscillations where the second line is. (2) The time aligned spectrogram below shows that the vertical striations (which represent individual glottal pulses) also begin at that second line.
|
|
In a Voiced Stop, the VOT includes:
|
the duration of the frication interval (from the opening of the stop to the onset of vocal fold vibration)
|
|
In an Aspirated Voiceless Stop, the VOT includes:
|
the duration of the frication interval PLUS the aspiration
|
|
Voice onset times for voiceless stops tends to be above ____________________, and can in some cases can be as long as ____________.
|
25 milliseconds, 100 ms.
|
|
After the stop constriction has been released and the burst occurs, it can take up to about _____________________ before vocal fold oscillation commences for the following vowel.
|
a tenth of a second
|
|
Name 3 factors that influence how long voice onset time is in a given context:
|
Age, Rate of Speech, and Phonetic Context
|
|
If we rush through our sound productions, then VOTs will be somewhat :
|
shortened.
|
|
If we rush through our speech, we would still make the same voiced and voiceless distinctions, but all of the values would be somewhat ____________________-- because the overall production was sped up.
|
abbreviated
|
|
When we speak, we experience the phenomenon of ________________, in that sounds influence one another.
|
coarticulation
|
|
The VOT for ‘ata’ is ______________ since it is a VOICELESS STOP.
|
longer
|
|
‘Ada’ will have a much _________________ VOT than 'ata'.
|
shorter
|
|
Since ‘ata’ is a voiceless stop occurring before a vowel you’ll see what 2 things happening on the spectrogram in the VOT space?
|
1. There is a darker portion at the beginning of the VOT area that is the burst release and frication. 2. After that there is a lighter portion of noise energy, which is the aspiration created as the vocal folds are coming together. You have turbulence from the glottis there.
|
|
The total VOT for a voiceless stop is the duration of the _______________ plus the duration of the ____________.
|
burst and frication, aspiration.
|
|
VOT's can be separated into what two groups?
|
Voiced and voiceless
|
|
In English, is there overlap between the voiced VOT group and the voiceless VOT group?
|
No, unless there is a speech disorder that is causing voicing errors.
|
|
When the voicing has commenced prior to the release of the stop closure, it is called:
|
voicing lead
|
|
VOT's from ____________ up to about __________ milliseconds have a short lag, which we associate with voiced stops.
|
0-20
|
|
Voiceless stops have VOT's with a ______________ lag because the time between the release of the constriction and the onset of vocal fold vibration is much ________________.
|
long, longer
|
|
Many of the world’s languages (including English & Spanish) have just __________ VOT categories.
|
two
|
|
The _____________ for English and Spanish, however, are not identical.
|
VOTs
|
|
If you were to compare ‘pa’, and ‘ba’ in both English and Spanish you would find that for English the VOT values are somewhat _______________. They would be shifted more to the _____________ on a distribution diagram.
|
higher, right
|
|
VOTs for Spanish are shorter, there is often a ________________________ for the voiced stops. The distribution would be pushed somewhat to the __________ on a diagram.
|
voicing lead/ prevoicing, left
|
|
The absolute number of _______________________ for the boundary between a voiced and a voiceless stop is not going to be the same for English and for Spanish.
|
milliseconds
|
|
English and Spanish each have _____________________ distributions so voiced/voiceless distinctions can be clearly made.
|
non-overlapping
|
|
Some other languages (such as Thai) have 3 _____________ categories.
|
VOT
|
|
It can be challenging for a native speaker of English to learn Thai because you have to distinguish among 3 different categories of voicing rather than the __________ that we are used to.
|
two
|
|
We don’t generally do voicing lead, or _____________, for 'ba' in English , but would still recognize it as ‘ba’.
|
prevoicing
|
|
The English 'pa' would have a _________ VOT.
|
long
|
|
In Thai, there is a sound that has the features of a voiceless bilabial in terms of the force of producing this plosive, but its VOT is closer to where we would expect it to be for a voiced plosive, /b/. To our ears, it seems to have elements of ________________.
|
both/p/ and /b/.
|
|
The human _________________________ is very keen on assigning sounds to categories.
|
auditory perceptual system
|
|
So it is rather confusing to a native English listener to be confronted with 3 categories when we are only used to _____.
|
2
|
|
When we hear an unfamiliar voicing sound, we try try to :
|
make it fit into a category with which we are already familiar.
|
|
A -62 VOT is an example of _________________, which we typically don’t do in English.
|
prevoicing
|
|
A VOT of 10 ms is ________________ in English.
|
uncommon
|
|
Place of articulation has a __________ impact on VOT .
|
slight
|
|
For VOT's, there is a progression from the _____________ of the mouth to the ____________ of the mouth.
|
front, back
|
|
BILABIAL VOTs are generally ______________. (p’s and b’s)
|
shortest.
|
|
ALVEOLAR VOTs are _______________________ (t’s & d’s)
|
in an intermediate range
|
|
VELAR VOTs are _______________ (k’s and g’s)
|
longest
|
|
For each place of articulation, we still have non-overlapping ____________________ categories. T’s and d’s don’t overlap with each other, neither do p’s and b’s or k’s and g’s.
|
voiced/voiceless
|
|
There’s a general rule that the place of _________________ does influence the overall VOT values.
|
articulation
|
|
If you looked at all of the VOICELESS VOTs you would find the shortest ones for the ______, the intermediate ones for the ______ , and the longest ones for the________.
|
p,t,k
|
|
If you looked at the VOICED ONES, the VOTs would be shortest for the ___________, intermediate for the ________, and longest for the ________.
|
b,d,g
|
|
Sometimes there won’t even be a VOT. This will occur if the stop is ______________________.
|
at the end of a word
|
|
If a stop is at the end of a word, there will be no vowel following it with which to:
|
measure starting phonation after the release of a stop.
|
|
Even when there's no VOT, our perceptual system has to decide whether a stop is :
|
voiced or voiceless
|
|
What are 2 cues to help us decide if the stop is voiceless?
|
a longer stop gap and a stronger release burst
|
|
What are 2 cues to help us decide if the stop is voiced?
|
a voicing bar is visible on the spectrogram and There is a longer vowel before a voiced stop
|
|
A voicing bar is visible on a ____________, as well as audible to our ears.
|
spectrogram
|
|
During the stop closure, there is some __________________________ available to the auditory system which we use in our overall processing of the sound and determine that it was likely voiced. We don’t process this at a ___________ level.
|
low frequency energy, conscious
|
|
There tends to be a LONGER ___________________ for the voiceless stops.
|
STOP GAP
|
|
Subconsciously, the brain processes the differences between _____________for stops that are voiced or voiceless.
|
stop gap durations
|
|
When VOICELESS STOPS are released, the burst is quite a bit stronger because:
|
there is more oral pressure built up behind the constriction.
|
|
VOICED STOPS have less pressure available because :
|
the pressure coming from the lungs is being used to drive the voice to contribute to phonation.
|
|
The ___________________ STOPS ARE MORE VIGOROUS in their production and the burst is much clearer.
|
VOICELESS
|
|
Voiced stops have ____________ pressure available to build up directly behind the constriction.
|
less
|
|
There are differences in the way we produce our _____________ prior to a voiced or voiceless stop, i.e. Had vs. Hat.
|
vowels
|
|
The VOWEL IS QUITE A BIT ____________BEFORE THE VOICED d, and______________ before the voiceless t. ( i.e. Had vs. Hat.)
|
LONGER, shorter
|
|
We can hear how long the vowel is that leads up to a final _____________. It helps us to identify ___________________.
|
stop, the final consonant
|
|
The redundancy between the stop cues helps the ________________ determine whether a stop was voiced or voiceless.
|
auditory cortex
|
|
The height of the jaw, or the size of the mouth opening will influence the _____________ formant.
|
first
|
|
As the _____________________ opens wider, the 1st formant increases in frequency.
|
mouth
|
|
If the mouth is closed for a stop production, F1 will be ____________, but it will ___________ in frequency as you move into the following vowel.
|
low, increase
|
|
F2 and F3 are influenced by the :
|
place of constriction.
|
|
Our ears and auditory cortices pick up these changes in formants, helping us identify where the place of ____________ was, while moving from consonant to vowel.
|
articulation
|
|
The burst release following a stop closure will help the listener identify the place of articulation because the spectral characteristics of the burst will differ depending on the:
|
place of articulation.
|
|
VOT also changes with place of_______________ , giving us a second cue to help us.
|
articulation
|
|
Combining the information we can get from the burst release and from VOT with the information we get from the ____________________ from consonants to vowels gives us even more information to help determine the place of articulation.
|
formant transitions
|
|
The stop burst __________________________ help to identify whether it was a bilabial, alveolar, or velar stop.
|
spectral characteristics
|
|
When we produce FRICATIVES, the sound is generated by:
|
air turbulence
|
|
For fricatives, turbulence arises because air is pressurized on one side of a constriction and it flows through a :
|
very small opening
|
|
For fricatives, when the moving air collides with the still air beyond the constriction it causes turbulence ; thus, _____________ is generated.
|
noise
|
|
Fricatives sound different from each other because they have different :
|
places of articulation
|
|
One reason different fricatives make different sounds is because the __________________________ on either side of the constriction will vary in shape and size.
|
resonating cavities
|
|
A larger cavity will naturally resonate ______________ frequencies.
|
lower
|
|
The __________________________ will determine the spectral characteristics of the fricative.
|
place of articulation
|
|
It doesn’t matter to the vocal tract whether the sound given to it is a _______________ source or a ____________ source --it will still shape, resonate, and adjust the sound that is given to it.
|
noise, periodic
|
|
The burst of a stop, when it is released, creates ___________________ that is shaped depending on the place of articulation.
|
frication noise
|
|
A _______________ is just like an extended version of one of those stop burst releases.
|
fricative
|
|
STOPS have a frication at the end of the __________________________.
|
stop release.
|
|
When we produce a FRICATIVE it may be a similar form of noise to a ____________but it lasts longer.
|
stop
|
|
When you produce a voiced fricative, you have what two sound sources?
|
the vibration of the vocal folds and the turbulence that is created at the constriction.
|
|
When you produce a voiceless fricative, you have what single sound source?
|
the turbulence that is created at the constriction.
|
|
When you produce a voiceless fricative, there is no sound being produced by the ______________.
|
larynx.
|
|
As listeners we don’t usually separate out /z/ and /s/ as having two sound sources; we just perceive them as being completely different sounds. In reality, they are almost identical. The only difference is that :
|
one has voicing and the other doesn’t.
|
|
When the larynx is not contributing to phonation (such as in a voiceless fricative like /s/), all of the air pressure from the lungs is directed to ________________. This produces quite a strong fricative, because all of the pressure is directed to that one sound source.
|
that constriction.
|
|
When you are producing /z/, the actual frication that takes place at the alveolar ridge is not quite as vigorous as it was for the /s/, because :
|
the larynx has used up a certain amount of pressure that the lungs generate
|
|
For a voiced fricative, some of the pressure was used to create___________________ before it even reached the oral cavity. This leaves less pressure available to create turbulence at the fricative constriction.
|
vocal fold oscillation
|
|
Since you have voicing going on WITH frication for /z/, those two sounds add together and you get a qualitatively different type of sound that contains both harmonic energy from the __________ and turbulent energy from the ___________________________.
|
voice, fricative constriction
|
|
In a spectrum, the ______________ is the more jagged fuzzy line, and the more solid line is the __________, which outlines the envelope of this spectrum more clearly.
|
FFT, LPC
|
|
The lips are _____________ somewhat if you produce an ‘s’ on its own. The lips are _____________ to produce ‘sh’, which creates a cavity anterior to (in front of) the constriction. These differences will cause the sound to _________________________.
|
retracted, rounded , resonate differently
|
|
‘f’ is a low___________________ sound.
|
intensity
|
|
'theta' means the _____________'th' sound.
|
unvoiced
|
|
In some types of ________________, the production of the two fricatives ‘s’ and ‘sh’ becomes rather similar . This makes it harder to _____________________________________ one from the other.
|
dysarthria, perceptually distinguish
|
|
In dysarthria, the 's' and 'sh' are both pronounced closer to the middle than they should be. The spectra for these two sounds then becomes __________________________, diminishing intelligibility.
|
more similar to each other
|
|
Why are nonstridents (f, v, theta & thorn) more difficult to distinguish from one another ?
|
Their spectral shapes are broadly similar and they are acoustically weak
|
|
The ____________________________ are so soft that we have a harder time telling them apart.
|
non-stridents (f, v, theta & thorn)
|
|
The phone acts like a _________________________________. It eliminates the highest and the lowest frequencies. In the case of these fricatives, the ___________________ can be very valuable in helping you tell the fricatives apart. The lower frequency frication energy you are left with on the phone is common to more than one fricative. Thus, they sound similar to one another over the phone.
|
band pass filter, high frequencies
|
|
An AFFRICATE is a combination of what two different phoneme types?
|
a stop and a fricative
|
|
In an Affricate, first there is a stop, with a more noisy release than a typical stop production because the frication lasts_____________________________ than a stop. This friction interval_____________________ as an isolated fricative.
|
somewhat longer, is not as long
|
|
An _________________ is sort of a compromised sound, somewhere between a stop and a fricative in its production.
|
Affricate
|
|
When we say the affricates ch(urch) and j(udge), we don’t think of them as having a separate stop and fricative component because:
|
they are produced in such rapid succession we hear them as a single sound.
|
|
NASALS are ________________ different from other consonants.
|
qualitatively
|
|
The English language has what three nasal consonants?
|
/m/, /n/& /ng/.
|
|
If you produce a vowel between two nasal consonants, the vowel will probably become nasalized because :
|
the open velopharyngeal port contributes to damping of the energy that enters the nasal cavity.
|
|
We produce NASAL CONSONANTS by opening the velopharyngeal port and :
|
creating an obstruction in the oral cavity.
|
|
All nasal sounds are ______________, so the larynx is contributing phonation to their production.
|
voiced,
|
|
The place in the oral cavity where the _________________________________ will determine the acoustic characteristics of the nasal consonant.
|
obstruction is made
|
|
If you make an 'm' (bilabial) you have a __________________________ resonating cavity in the mouth, and leading up into the nasal passageway.
|
relatively large
|
|
If you make an 'n' (alveolar constriction) you change the _________________________________________, which makes the sound different from an 'm'.
|
shape and size of the resonating cavity
|
|
If you make an 'ng', the constriction is ________________________ in the oral cavity, so it has its own acoustic characteristics which are determined by its place of articulation.
|
most posterior
|
|
Even though the listener can tell the difference between the nasals fairly well, they are not very distinctive _____________________. Looking at them on a ________________, it is not easy to tell one from another.
|
acoustically, spectrogram
|
|
Nasals are very LOW IN ___________________. Because the sound is being radiated through the nasal cavity, the soft tissues there _________________ the sound. Therefore, the ________________ of the sound that reaches the listener is rather low.
|
INTENSITY, attenuate/dampen, intensity
|
|
/w/ and /j/ are called GLIDES because:
|
the tongue is moving (gliding) during their production.
|
|
Glides can be compared to ___________________, in which you both phonate and move the tongue, creating a change in the formant frequencies from one part of the sound to the next.
|
diphthongs
|
|
______________________ are similar to diphthongs in that the tongue is gliding or moving from one articulatory configuration to another.
|
Glides
|
|
Glides are also called:
|
approximants or semi-vowels
|
|
Why are glides sometimes called approximants?
|
because they approximate certain positions or constrictions
|
|
Why are glides sometimes called semi-vowels?
|
because they are half vowel like -- they have a relatively open vocal tract and there is phonation going on during their production
|
|
Glides have a _________________________ vocal tract and there is _________________ going on during their production.
|
relatively open, phonation
|
|
Besides the glides, the term approximant is also used for:
|
/r/ and /l/.
|
|
/w/ involves lip rounding, and the elevation of the ______________________. This is similar to how we produce an /u/ vowel. To illustrate how similar they are, put a w between two /u/ vowels. The articulators don’t move much at all. They are fairly close in terms of their _____________.
|
back of the tongue, configuration
|
|
The rate of change during ________________________ can be a cue to the listener as to which of these sounds is being produced.
|
formant transition
|
|
With b, you have formed a plosive; so the plosive burst, along with _____________will cue the listener as to the manner of production. (plosive)
|
VOT
|
|
The two liquids in English are :
|
/r/ and /l/.
|
|
The LIQUIDS are voiced ________________________.
|
all the way through.
|
|
The F3 for /r/ is ______________; There is a characteristic ___________ in F3 on a spectrogram representation of an /r/.
|
low, dip
|
|
/r/ and /l/ are completely separate phonemes in English, but this is not the case in all languages. They are __________________________ in Korean.
|
allophonic variants
|
|
An ______________is a version of a phoneme that may be produced slightly differently under different conditions.
|
ALLOPHONE
|
|
An allophone is subject to the influence of ________________________. The allophone varies depending on the ______________________________.
|
COARTICULATION, sounds that surround it.
|
|
In English SLP, there is a lot of opportunity to help children articulate these ____________________ sounds.
|
liquid (/r/ and /l/)
|
|
Liquids (/r/ and /l/) are difficult to master because of the _________________ involved in their production.
|
muscular patterns
|
|
Either of the liquid sounds (____________________) can potentially be sustained, but this is not what we really do in speech. It is the movement of the ______________________ that is characteristic of the production of these sounds in normal speech production. There isn’t much of a ______________ state, or prolonged element in their production.
|
(/r/ and /l/, formants, steady
|
|
It is the __________________ that is particularly characteristic of /r/ as it is produced in running speech.
|
F3 change
|