• Shuffle
    Toggle On
    Toggle Off
  • Alphabetize
    Toggle On
    Toggle Off
  • Front First
    Toggle On
    Toggle Off
  • Both Sides
    Toggle On
    Toggle Off
  • Read
    Toggle On
    Toggle Off
Reading...
Front

Card Range To Study

through

image

Play button

image

Play button

image

Progress

1/827

Click to flip

Use LEFT and RIGHT arrow keys to navigate between flashcards;

Use UP and DOWN arrow keys to flip the card;

H to show hint;

A reads text to speech;

827 Cards in this Set

  • Front
  • Back
What are the 4 goals for this semester?
1. Understand speech sound production 2. Learn about speech measurement 3. See its clinical application 4. Build confidence in using technology
What is the ultimate test of speech measurement?
Does it sound right?
No instrument can replace the:
experienced SLP
Technology merely has tools to:
help us
Technology complements:
our own experience and skills
What are the 4 characteristics of the scientific method?
Empirical, Deterministic, Predictive, and Parsimonious
What does empirical mean?
based on data and direct experience
What does deterministic mean?
It obeys physical laws of cause and effect -- it's not random
What does predictive mean?
If you do this… than that will happen.
What does parsimonious mean?
Use the simplest explanation possible -- boil it down to its essence.
Why do we collect data and interpret it?
to make sense of the behavior we are trying to understand
We make decisions based on:
our own direct experience
The empirical approach is opposed to the ___________ approach.
rational
What is the approach used in philosophy where you make decisions based on reasoning and careful thought?
rational approach
How does the predictive nature of science help us?
we can extrapolate that information to apply to other situations that we may not have experienced yet.
What are 5 reasons why we use technology to analyze speech?
1. Overcome listener bias 2. Describe the severity of a disorder objectively 3. track progress over teime 4. ASHA has focus on EBP (Evidence Based Practice) 5. Provide biofeedback to the client
Analysis with technology provides measurement that is _____________ and ____________.
consistent and reliable
Technology provides objective criteria to measure our client against:
standardized measures
By tracking our progress over time, we can demonstrate:
treatment efficacy (or not)
What does EBP stand for?
Evidence Based Practice
In order to objectively prove that the treatment was effective, we must measure our client's performance when?
before and after treatment
Using hard data, we don't have to rely on :
our opinions
One limitation that occurs with non-technical observation over time is:
becoming habituated to the client's speech
What does biofeedback do?
Shows the person what they are doing on a computer display in real time, and can help them change behavior.
_________________ can show voice patterns in real time.
Computer software displays such as Busy Pitch
Name an example of something the client could see about his voice on the computer screen.
Patterns in prosody, etc.
The SLP needs to know normal physiology in order to :
understand & recognize disordered physiology.
Why does the computer software require you as an educated SLP?
You must understand what the numbers from the instrument represent. You provide the expert interpretation.
What are the 3 Key Arenas of speech analysis?
1. Acoustic phonetics 2. Physiologic phonetics 3. speech perception
The measuring of speech production through microphone recordings and studying what it can tell us about the way the larynx is operating or the articulators are moving is called:
Acoustic Phonetics
What are the 3 subcategories of Physiologic Phonetics?
1. Kinematics 2. aerodymanics 3. electromyography
What do you call the study of speech movements?
kinematics
What do you call the study of the pressure, flow & resistance we use to regulate our breathstream?
aerodynamics
What do you call the study of the electrical activity of muscles as they are activated?
Electromyography
What do you call the aspects of signal processing of the brain?
Speech perception
We can draw parallels between the ______________ aspects of speech understanding and the ____________ techniques that we use to evaluate a speech signal in a quantitative way.
perceptual, acoustic
What serves as the sound source of the voice?
the larynx (with air pressure)
After the sound leaves the larynx, it goes into the:
vocal tract
What does the vocal tract do to the sound?
it filters it
The larynx needs the _____________ to provide the air pressure it needs to function.
lungs
The vocal tract doesn’t just remove things, it actually _____________________, ___________________ & _____________ some of the frequency components.
resonates, enhances & strengthens
In theory these 2 systems (________________ & _________________) operate independently of each other, but in reality they sometimes interact under certain circumstances.
larynx & vocal tract
What are 4 ways the sound source can vary?
loudness, pitch, voice quality, phonation vs. whispering
Low air pressure equals ____________ sounds, high air pressure equals ______________ sounds.
softer, louder
With ____________ air pressure, the larynx doesn’t have to adduct very tightly.
low
With _______________ air pressure, the vocal folds will have bigger movements -- muscle movement increases.
high
Stretching the vocal folds increases their _____________, which increases their ____________ frequency.
tension, vibration
To stretch the vocal folds, first you contract the __________ muscle, which then rocks the ______________ ____________ forward and stretches the vocal folds.
cricothyroid, thyroid cartilage
If you subtly adjust the space between the vocal folds, between the arytenoids, you can change your:
voice quality
Increasing the space between the vocal folds makes the voice more ____________.
breathy
If you press the vocal folds together more firmly you will get a __________ sounding voice.
tighter
What muscles contract in order to press the vocal folds together?
the muscles of adduction (primarily the lateral cricoarytenoid and the thyroarytenoid).
How do you create a whisper?
By pushing air between the vocal cords to create turbulence without causing them to vibrate.
A whisper requires the__________ to provide the source of air pressure and the __________ to provide the constriction in order to make the turbulence happen.
lungs, vocal cords
How do phonations differ from whispering?
Phonation requires the vocal cords to vibrate during this process.
What is the fundamental frequency at which the vocal cords vibrate?
For men, it is about 100-120 times per second, for women it is double that.
We perceive the fundamental frequency as the :
pitch
The voice is composed of a fundamental frequency or sine wave, and also a:
series of harmonics
What do you call integer multiples of the fundamental frequency?
harmonics
If the fundamental frequency is 100 Hz, the harmonics will be at:
100, 200, 300, 400, 500, etc.
If the fundamental is 200 Hz, the harmonics will be at:
400, 600, 800, 1000, 1200, etc.
The higher the voice, the greater the _________________ between the harmonics.
spacing
The larynx emits a whole:
spectrum of sound
As the fundamental frequency rises and falls, the spread of the harmonics will:
increase and decrease.
The higher harmonics tend to be _____________ in amplitude than the fundamental itself.
weaker
The gradual weakening of harmonics as they get higher results in a:
harmonic spectral slope
What is the strongest element in a spectral slope?
The fundamental frequency
Harmonics get weaker as they get:
higher
On a LARYNGEAL SOURCE SPECTRUM GRAPH, what is represented on the X axis?
X = frequency in kHz
On a LARYNGEAL SOURCE SPECTRUM GRAPH, What is represented on the Y axis?
Y = amplitude
The strength or energy of the signal that increases as you get higher up is called the:
amplitude
In a laryngeal source spectrum graph, each vertical line represents:
one harmonic component.
In a laryngeal source spectrum graph, the tallest line on the far left represents the _____________ ___________. The _______________ are equally spaced to the right.
fundamental frequency, harmonics
The _______________ of the harmonics decreases as you get into the higher frequencies.
amplitude
In a laryngeal source spectrum graph, the slope value represents:
how sharply the harmonics drop off.
If your graph shows 3 spectral slopes of 6 db, 12 db, and 18 db, which is closest to a normal voice?
12 db
If your graph shows 3 spectral slopes of 6 db, 12 db, and 18 db, which would represent a bright, brassy and resonant voice?
6 db
If your graph shows 3 spectral slopes of 6 db, 12 db, and 18 db, which would represent a weak and thin voice?
18 db
At the 6 db level of our graph, the harmonics are:
relatively strong compared to the fundamental, almost as high in amplitude
At the 18 db level of our graph, the harmonics do what?
drop off sharply in the upper frequency range
What does attenuation or anti-resonance of the vocal tract filter mean?
That it can weaken or remove what comes out of the larynx.
The vocal tract filter doesn't just remove sounds, it can also:
resonate or echo sound.
The __________ _____________ ____________ can increase the amplitude of some frequencies even as it weakens the amplitude of others.
vocal tract filter
The sound that leaves the ____________ ________________ has been substantially changed from what it was when it entered.
vocal tract
Tubes do not generate sounds, but they can __________ & ___________ sounds that are put into them.
shape and alter
What features of the tube allow resonance?
its dimensions
What type of tubes are the simplest to model?
straight, uniform tubes
The best resonance frequency of the tube (the one that causes it to resonate or echo the best) depends on its:
length
The best resonance you will get will be for sounds that have a wavelength that is:
4 times that of the tube.
A tube that is small will resonate _______ frequencies because they have a shorter wavelength, and a tube that is long will resonate ___________ frequencies because they have a longer wavelength.
higher, lower
The vocal tract is not a uniform tube; what are 2 things that vary?
the diameter will vary along its length, and thus the cross-sectional area varies. And the length?
Resonance frequencies in the vocal tract differ because:
the vocal tract is not a uniform tube
In the vocal tract, the specific sound features that change depend upon:
where the constrictions are, and also whether they are relatively mild or severe constrictions.
When the diameter of the tube suddenly _____________, some of the sound will actually be bounced back to where it came from, and another part will go on through.
increases (because the nature of the resonance characteristics has changed)
When the shape and diameter of the vocal tract changes, it changes the blend of ____________ and the strength of the different ______________ as they make their way through the tube.
frequencies, components
You change the size and shape of the vocal tract at various points along its length by moving your:
articulators.
When you move the tongue upward or downward or forward or backward, this changes the size of the _______________ in particular places.
pharynx or oral cavity
You can use the tongue to form constrictions, by:
moving it towards the palate or the post pharyngeal wall.
Why does moving your tongue cause the vocal tract's resonance to change?
Because it alters the cross-sectional area.
This is how different sounds are created, because their ______________ _______________ differ.
acoustic features
When you change the shape of your vocal tract, some frequencies are ______________ and others are ___________.
boosted, attenuated (weakened)
The way the sound has changed from larynx to lips is called the:
Vocal Tract Transfer Function
The constrictions of the ____________ ___________ will change according to the various sounds you are articulating.
vocal tract
When you reshape the vocal tract filter, you cause sounds to be ______________.
altered
Sounds are altered according to the size and shape of the _________________ that resonate the sounds that pass through the vocal tract.
cavities
The Vocal Tract Transfer Function refers to the change in sound between:
entering the vocal tract and leaving the vocal tract, or the output minus the input.
The sound that leaves the larynx is basically a _______________ wave form.
sawtooth
The resonating cavities between the ___________ and the ____________ turn the sawtooth waveform into different sounds, vowels, etc.
larynx, lips
The Vocal tract transfer function is a ___________ filter.
variable
The vocal tract filter is changing all the time as we move our:
articulators.
The _________________ _________________ determine the transfer function.
resonating cavities
A resonant peak in the vocal tract transfer function is called a:
formant
A formant occurs when some frequencies in a given range are:
amplified to boost their prominence in the spectrum.
For a given vowel, there can be _________________ formants.
many
Which formants are the most important for determining which vowel you are listening to?
the first 2 or 3
The formant is NOT a sound _____________.
source
A formant reflects the way the vocal tract is:
shaping the sound coming from the larynx.
When you look at the spectrum of the sound that leaves the person’s mouth, you can identify formants as:
peaks in the spectrum.
Formants result from the _____________ of the vocal tract filter.
activity
Formants are NOT a feature of the ____________ _____________of the larynx.
sound source
If you whisper, will there be formants?
Yes
When you whisper, you will have _____________ created due to the shaping of the cavities by moving the articulators.
formants
In THEORY, the glottal source and the vocal tract filter are ___________________ from each other.
independent
Without moving your articulators, you can change the fundamental frequency by:
raising or lowering your pitch, causing the harmonics to change.
You can pronounce a given vowel at high or low pitch, or adjust loudness, all while the _____________ ____________ configuration remains constant.
vocal tract
Even if you hold the same pitch, you can create different sounds by:
moving your tongue or other articulators
If you change the filter characteristics (moving your tongue, lips or jaw) while not changing what your larynx is doing – the fundamental frequency:
remains the same.
Can the source and vocal tract act independently?
yes
Can the source and vocal tract interact?
yes
There are some links between the source and vocal tract that cause them to behave:
in a very coordinated way.
If you target your treatment toward one part of the speech mechanism, will you see effects in other parts of the system that you did not directly treat?
Yes, there are spillover effects.
In Parkinson’s Disease, there is a condition called hypokinetic dysarthria, which is characterized by:
a weak, monotone voice and imprecise consonant articulation.
The Lee Silverman Voice Treatment focuses intensively on the larynx to improve:
vocal loudness.
If you increase the loudness of your voice, there is also an effect on the _____________ system and the ________________.
respiratory, articulators
Treating one aspect of speech, such as loudness, in order to improve other features as well, is called:
1 stop shopping
By treating the loudness of the speech only, you can avoid working with:
the other disordered aspects
____________________ can be considered the “global” treatment variable. Why?
Loudness because it affects the whole treatment production mechanism even though the treatment focuses only on the loudness of the voice.
What are some reasons that loudness can improve other features of speech?
There are natural changes that accompany increasing the loudness of the voice, and most people have a lifetime of practice in changing the loudness of their voice.
STI stands for:
Spatial Temporal Index.
STI can represent:
variability in the speech samples.
Hyperarticulate means to:
exaggerate your speech movements
In the Parkinson's Patients study, which patients showed improved speech: hyperarticulaters or loud speakers?
loud speakers
We can conclude that speech production is more natural, consistent, and predictable when a person is ____________ ______________ rather than if they are deliberately concentrating on ____________ __________ _____________.
speaking loudly, moving their articulators.
What are some other drawbacks to having patients concentrate on moving their articulators deliberately?
requires more cognitive attention and is more difficult for elderly patients.
In the Parkinson's study, the loud samples have less __________________ and the speech is more ________________ & ______________.
variability, natural and straightforward.
In hypokinetic dysarthria, the movements tend to be rather shrunken or small, and so increasing their amplitude can:
make the patient's intelligibility much better.
In the Parkinson's study, were the movements in the loud speech smaller than those in the hyperarticulate sample?
no, they were just as large.
With loud speech, you not only get a louder stronger voice, you also get:
more precise articulation.
Hyperarticulation requires a lot of ___________ and you don’t get any particular ____________.
effort, benefit
The Parkinson's study shows us that by changing vocal function (increasing loudness) you can reap benefits in _____________ _______________, without having to directly treat _____________.
oral articulation, articulation
By treating loudness, articulation improvement is a positive ____________ _____________. The ________ has influenced the filter in this case.
side effect. Source
Spasmodic dysphonia is a neurological voice disorder (neurolaryngological disorder) where the brain sends signals through the laryngeal nerve to the larynx and these signals lead to ___________ during voice production.
spasms
In spasmodic dysphonia, the voice has a very _____________ & ___________ sound intermittently during speech.
strained and strangled
The current treatment for spasmodic dysphonia is for a physician to inject Botox into the :
thyroarytenoid muscle.
Botox treatment for spasmodic dysphonia partially paralyzes the muscle, so that it cannot respond to the _________ ____________ that travel down the nerve to that muscle.
incorrect signals
After _______________ treatment for spasmodic dysphonia, the muscle can no longer tighten up and crampen and strangulate the speech.
botox
After botox treatment for spasmodic dysphonia, the correlation of ______________ ___________ improved quite a bit.
lip movement
In the Botox/spasmodic dysphonia experiment, they treated the voice by injection to the larynx and got improved outcomes in :
articulatory behavior.
The botox/spasmodic dysphonia experiment results are an example of the ____________ influencing the ___________.
source, filter
In Muscle Tension Dysphonia, patients also have a ____________ ______________ sounding voice.
strained strangled
Muscle Tension Dysphonia is caused by:
muscle tension in the neck, including the muscles of the larynx.
In this disorder, there is excessive neck muscle activity:
Muscle Tension Dysphonia
Muscle Tension Dysphonia can be treated by:
circumlaryngeal massage therapy
This treatment involves the clinician actually moving the larynx further down in the neck as the patient phonates.
circumlaryngeal massage therapy
After circumlaryngeal massage therapy, a post treatment spectrogram shows greater flexibility in _____________ ______________ during speech production.
lingual movement
The Muscle Tension Dysphonia experiment results are an example of how treatment of the ____________ influences the movement of the ____________.
larynx, tongue
Why does working on the sound source yield benefits in the vocal tract filter behavior?
Because there can be neural and biomechanical linkage between them. They’re controlled by the brain in such a way that the activity of one is closely coordinated with the activity of the other.
The sound source and the vocal tract filter typically behave as a:
unit
Sound itself is not digital; it's the way we __________ the sound.
store
_____________ ______________ tends to make a clearer, higher quality copy of the sound.
Digital storage
Can your ears hear digits?
NO
Music is stored in __________ form but it is played back in ________ form.
numeric, analog
‘Analog’ means ‘________________’ – one phenomenon is represented by another.
analogy
The height of the mercury in a thermometer represents, or is _____________ to, the temperature .
analogous
The microphone’s sound signal that travels through a wire is analogous to, or represents, the :
sound pressure coming through the air that reaches the microphone.
The microphone ___________ represents sound.
voltage
large positive voltage = air _____________
compression
large negative voltage = air ______________
rarefaction
A microphone is a ________________ – it is converting _____________ from one form to another.
transducer, energy
The sound pressure vibrations that reach the microphone are converted by the microphone into a rapidly changing ___________ _______________
electric signal.
Any change in __________ _____________ that causes the internal mechanism of the microphone to move will result in a commensurate change in ____________ .
air pressure, voltage
The microphone's voltage will change at the same_____________ and in an ____________ proportional to the size of the sound that came in.
frequency, amplitude
The __________ _____________ going through the microphone’s wire is an analog representation of the sound pressure that the microphone picked up.
electrical signal
The microphone's electrical signal is a precise representation of the:
signal coming in
If you recorded a sine wave, you would have an electrical sine wave in the wire.
sine wave
If you were recording phonation, you would have a ____________ ______ of electrical signal in the wire.
periodic type
If you were recording noise, you would have a ______________ signal in the wire.
“noisy”
A signal that is rapidly & randomly changing would be called:
noise
What are the 2 most important characteristics of analog signals?
continuous in time and continuous in amplitude
If you were to examine or zoom in on continuous analog signals at any level of detail, you would not find any ___________ in the recording.
gaps
a line drawn on a page or a microphone signal in a wire are examples of what type of signal?
analog
An analog recording can represent what 2 things?
an infinite number of points in time and an infinite number of amplitude values between the minimum and the maximum
An ____________ signal CANNOT be fully represented by a table of numbers.
analog
To represent a signal, each row in a table of numbers would represent the amplitude at a :
given point time
You cannot have an infinite number of rows to reflect the infinite number of time intervals that there are in a truly ______________ signal.
continuous (analog)
What could represent an analog signal "truly"?
a graph
What are some COMMON ANALOG DEVICES ?
Clocks with hands, watches with hands, mercury thermometer, tape measures (markings only limited by your eyesight), cassette recorder (magnetic impressions on the tape), dimmer switch on a light (infinitely variable, continuous)
In what 2 areas are digital recordings discrete?
in time and in amplitude
Digital signals are made up of a series of _____________ _____________ recorded in one long string.
separate numbers
The numbers of a digital recording represent ______________ over time.
snapshots
in a digital signal, the ____________ themselves have finite precision – there are a limited number of ___________ __________, which could be many or few.
numbers, decimal places
If you were to write down all the numbers of a digital recording, you could fully represent it by a:
table of numbers.
The limitations of digital recordings are that there are _____________ between the points.
‘unknowns’
But actually, if the individual sample points are close enough together in time, it is almost as if the sample were:
continuous
Name 3 common digital devices:
digital clock, computer, compact disc
Computers and compact discs store data as:
ones and zeros
The analog signal from the wire has to be converted to ____________ in order to be used by a computer, etc. , and vice versa.
numbers
Going from a microphone to a computer is an example of ____________________ conversion.
analog to digital (ADC)
Going from a CD to the speakers is an example of ______________ conversion.
digital to analog (DAC)
PC sound cards belong to which type of conversion ?
both ADC and DAC
Computers only deal with __________ data.
binary
Binary data contains only:
ones and zeros
Microphones and speakers are ____________ devices.
analog
Sound is an _______________ function.
analog
The frequency with which the numbers are recorded to represent amplitude values of the sound you are recording is called:
the sampling rate
To better represent the original signal, you should take _____________ samples.
more
The frequency of snapshots in a digital recording is called the:
Sampling Rate.
The sample rate is specified in:
Hz
If you take 10 samples/snapshots in a second, that would be a ________ recording.
10 Hz
A very low sample rate produces a ____________ recording.
poor
If the original sample contains many rapid changes, your sample rate must be ________ in order to capture those changes.
higher
A higher sampling rate gives better:
fidelity
A higher sampling rate requires ___________ files.
bigger
What are 3 disadvantages of a higher sample rate requiring bigger files?
1. more memory usage 2. more disk space for storage 3. more processing time for computation
The ‘Nyquist’ frequency is :
half the sample rate.
The Nyquist frequency represents the highest frequency you can :
record and then accurately play back.
You will sample at twice the rate of the ____________ _______________ in the signal.
highest frequency
For example, if you want to record a signal accurately up to 100 hz, you must sample at :
200 Hz.
For example, if you want to record a signal accurately up to 5 hz, you must sample at :
10 khz
The sample rate for compact discs is ____________ samples per second, or __________ kHz.
44,100, 44.1
Since the sample rate for a compact disc is 44,100 samples per second, it can store signals up to ______________Hz, and play back sounds up to _________________ Hz.
22,050 , 22,050
The upper limits of most people’s hearing capacity is about:
22,050 Hz
On a digital recording device, the highest quality setting gives you the ____________ amount of time to record.
shortest
Why does a high quality setting give you the shortest amount of time to record?
Because it uses more data and takes up more space on the device.
A higher sampling rate = higher _____________.
quality
Before we can do decent ___________, we must make sure the recordings themselves are good.
analysis
What are 4 things you must have to obtain a good recording:
1. a quiet recording environmnet 2. use a quality microphone 3. good signal strength 4. proper sample rate
How can make your recording environment more quiet?
Reduce reverberation & echoing from hard surfaces in the room.
To have good signal strength, you must make sure that the info you want is not :
lost in noise
Saturation = the signal as it:
comes into your recorder
If the signal is too tiny, it won't be audible over:
background noise
To make sure you signal is a good strength, you can do what 2 things:
1. adjust the recording input level 2. adjust the distance between the microphone and the subject
Be sure not to clip the signal by:
bringing it in at too high a level
If the strength of the signal coming in is bigger than your recording device can accommodate, the signal will be:
badly distorted
If the signal is ____________ _____________, you won’t be able to do any decent acoustic analysis .
badly distorted
Driving the signal too hard can cause ______________ which causes _____________.
clipping, noise
If you have enough memory space, it’s better to use a ___________ sample rate .
higher
You can always downsample later, but you can never :
upsample and replace info that wasn’t in your recording.
All ____________ has to do with the vibration of air molecules around you.
sound
Give 3 examples of sound originating with movement:
1. vibrating string of an instrument 2. oscillation of the vocal folds 3. turbulance of air molecules leaving a tire
Why does a vibrating string create sound?
it causes the air molcules around it to oscillate.
Why does the oscillation of the vocal folds create sound?
it chops the air into small pulses; they oscillate the air in the vocal tract as well as the air between the speaker and listener
Why does air leaving a tire create sound?
the air molecules are trying to leave through a very small hole and bounce against each other, creating turbulence
In speech production, there are many structures that contribute to the way the sound is generated. We talk about this as having:
Many degrees of freedom
What are three principles of sound:
1. all sounds originate with movement 2. the movement characteristics determine the nature of the sound 3. speech production has many degrees of freedom
In speech production, there are many different elements that can be individually adjusted to create the exact sound that you hear. Name two:
voice/larynx, articulators
The speech waveform is complex because:
there are so many structures that can contribute to the way it’s generated.
Name 2 benfits of acoustic analysis:
1. it's noninvasive 2. from the changes in the signal we can draw inferences about the movements that generated the sound
One of the greates benefits of acoustic analysis is:
that it's noninvasive; we don't have to attach any transducers to the person's vocal tracts, etc.
A disordered voice indicates:
disordered vocal fold movement
changes in vocal fold movement causes a change in the sound's:
source
Distorted or imprecise articulation indicates:
abnormal movement of the articulators
A change in movement of the articulators causes a change in the sound's:
filter
We can get a very rich level of detail from something as simple as a microphone recording, if we apply the appropriate:
analysis techniques.
The acoustic patterns reflect the vocal tract movements with some ambiguity, because of :
Motor Equivalence.
Motor Equivalence means that:
the same sound can be produced several physical ways.
You could move the articulators in slightly different ways and still have:
the same end result.
Acoustic analysis cannot always distinguish with any level of precision between:
different types of articulatory movements
Acoustics cannot reveal all details of:
Movement
You can’t tell how many mm a tongue moved upward or forward when a person:
articulated a given sound.
Acoustic analysis doesn’t reflect the movements that caused the sound in a ____________________ way.
perfectly linear, one to one
There are some limitations in getting calibrated, physical measures of articulatory behavior from a microphone signal because there are:
several different structures working at once to contribute to the way that sound was made.
What are the 2 main limitations of acoustic analysis?
1. It reflects the vocal tract movments with some ambiguity because of Motor Equivalence. 2. It cannot reveal all the details of movement.
Speech Sounds Contain Many :
Frequencies
A _________________ _______________ allows us to be selective in which frequencies we keep for analysis.
speech filter
If you want to understand the impact of having hearing loss in a certain range, you could filter out _______________ _______________ in a speech sample of normal speech.
certain frequencies
What are 4 filter types?
High pass, low pass, band pass, band reject
Which filter allows high frequencies through and holds back lower frequencies?
high pass filter
Which filter allows low frequencies through and holds back higher frequencies?
low pass
Which filter allows a middle band of frequencies through and holds back both higher and lower frequencies?
band pass
Which filter holds back a middle band of frequencies and allows both higher and lower frequencies through?
band reject
What is a main reason to use a band reject filter?
to remove unwanted noise
If you contaminated an original signal with a 300 hz tone, what type of filter would you use to remove the unwanted noise?
a band reject filter to filter out the 300 hz tone
Which type of band reject filter, wide or narrow, would least impact the quality of the speech sample?
narrow band
In a ____________________ filter , you would be deleting a number of frequencies that would be important for speech perception.
wide band
What is a main reason to use a band pass filter?
To show each frequency and it's energy.
This type of filter was used in the early days of speech analysis because they helped us understand which frequencies were present in the speech sample:
band pass filter
A band pass filter yields a crude idea of the ____________________ of a certain sound.
ingredients list / frequencies
Remember that it’s difficult to tell where the regions of high or low energy lie, just by:
listening to the sound
Some of the early pioneers in acoustic analysis employed techniques like these different _____________________ to understand whether the energy was in the lower part of the frequency range, the middle range, or the higher range for a certain sound.
band-pass filters
A _______________ filter passes a signal through sets of filters like these: 0 - 300 Hz, 300 - 600 Hz, 600 - 900 Hz, etc…
band pass
This type of filter allows you to move from low to high frequencies as the frequency is being analyzed:
variable band pass filter
The meter at the right hand side of a variable band pass filter represents :
how strong the energy is at any given frequency level that you analyze
Band Pass Filters can be these 2 types:
Fixed and Variable
A Fixed Band Pass filter looks at:
a series of band pass filters, one at a time.
A variable band pass filter can:
move from low to high frequencies as the frequency is being analyzed.
In order to do either type of band pass filter analysis, the researcher would need to play the sound again and again, looking at :
a different frequency each time in order to compare the energy for each.
A band pass filter can give you information about the __________ of the different frequencies in a given sound.
strength
You could say that _____________ ______________ is a little like looking for the ingredients in a complex sound.
acoustic analysis
What does acoustic analysis do?
splits up a complex sound into its component sounds
A complex sound has ______________ _____________ – we don’t speak in sine waves!
many ingredients
The _____________________ _________________ of the components make one sound different from another – they alter its quality perceptually.
relative proportions
Altered proportions = altered ___________.
quality.
You could have the same frequency ingredients – some higher and some lower in amplitude – for one type of sound, and if you adjusted the _________________ of those frequencies you would get a different type of sound.
amplitude or proportion
Joseph Fourier was a ____________________________ who lived from 1768-1830.
French mathematician & physicist
Fourier came up with a way of analyzing complex signals and :
decomposing them or splitting them into a series of component frequencies.
This person's great contribution was to show that even complex sounds can be broken down into their individual sinusoid components.
Joseph Fourier
All _____________ sounds are made of a combination of sine waves. (component frequencies)
periodic
What are 3 ways that individual sine waves may vary?
1. amplitude 2. phase angle 3. frequency
What does phase angle of a frequency mean?
where they are in the cycle relative to one another
What is a Fourier Transform?
when we use the techniques that Fourier developed
A Fourier Transform creates a ___________ from a _______________ waveform (such as a microphone signal).
spectrum, time domain
A spectrum of a signal shows us:
what the individual components are.
What you see when you have a microphone recording on a screen in front of you is ________________ __________ data.
time domain
In a time domain data waveform, you will have ___________ on the X axis and __________ on the y axis.
time, amplitude
If you subject a time domain waveform to a Fourier Transform, what you get is a ______________ _____________ Display that shows you a ____________.
Frequency Domain, spectrum
A _______________ represents a slice in time, or a snapshot in time.
spectrum
A spectrum has ________ on its X axis and _______ on its Y axis.
frequency, amplitude
The Fourier Transform is a way from going from one view of a signal to another view – from the ___________ domain to the __________ domain.
time, frequency
In an audio signal, the voltage of the signal increases and decreases in proportion to the :
sound pressure of the wave.
A waveform that repeats regularly and has multiple sine wave components is a :
periodic wave
2 characteristics of time domain data are that:
a waveform represents sound directly and air pressure changes over time
In a spectrum display, each column represents:
one of the frequency components
A ________________ ________________ splits out the individual sounds from the combined total.
specrum display
In a spectrum display, when you play all of the frequencies together, it will sound the same as a:
time domain waveform
In a spectrum display, if you play each frequency separately, they will sound:
different from each other
Two great benefits of a Frequency Domain display are that it can:
show us what the individual components are and show us the relative proportions of each component
In a spectrum display, the height of each bar represents the :
strength of each of the components.
A spectrum shows ___________ ___________ data.
frequency domain
A line spectrum shows the frequency components of a:
periodic sound
In a _____________ ___________, there is no energy between the individual harmonic components that make up the entire spectrum.
periodic signal
The sound source that comes out of the larynx is very similar to a ___________ signal, but there is also some __________ mixed in it.
periodic, noise
The _____________ ____________ is nearly periodic.
human voice
The human voice has a fundamental and then ________ which are multiples of that fundamental.
harmonics
In the human voice, the upper harmonics get progressively ___________ as you go up in frequency.
weaker
A PERIODIC SIGNAL SPECTRUM is a _____________ ______________ description of the signal.
frequency domain
In a time domain display, __________ is represented from left to right.
time
In a line spectrum, the peaks represent:
the individual harmonic components of this signal.
If you make your voice softer or louder, you’ll see some changes in the relative strength of the :
harmonics.
If you raise and lower your voice pitch, you’ll see the _________ spreading apart from one another or coming back together as the pitch comes back down again.
harmonics
If you had a ____________ __________ __________, there would only be one line in your display. _________ __________ ______________ would have multiple lines in the display.
simple sine wave, complex periodic signals
Noise is when: (3 things)
1. all frequencies are present 2. with various different phase relationships to each other (random phase) 3. with approximately equal amplitude.
Name 4 types of noise:
white noise, pink noise, brown noise, multi-talker noise
A spectral envelope shows us:
the relative strength of the different frequency components in the noise.
A ____________ _____________ encloses an area that would have been completely filled in by those vertical lines that represent the individual sine waves.
spectral envelope
The voice source is not truly :
periodic. (it is NEARLY periodic)
A spectrum does not have pure lines; instead it has:
peaks
In a voice signal, there will be some ___________________ around the fundamental and harmonics
spread of energy
The spread around the fundamental and harmonics represents the _________________________ that is present in every human voice.
imperfections or noise
In a voice signal, if it’s not a harmonic, then it’s _____________-.
noise.
A ratio of the height of these harmonic components to the level of noise between them is called a:
Harmonics to Noise ratio
What are the two major kinds of spectra?
FFT and LPC
What does FFT stand for?
Fast Fourier Transform
What does LPC stand for?
Linear Predictive Coding
The FFT shows us the:
range of harmonics that are present in each sound
In an FFT spectrum, there is a peak for each of the:
harmonics
An FFT does not show us ___________ very well.
formants
This type of display is useful for displaying features of the sound source, but not so much about the vocal tract filter:
FFT spectrum
This type of display shows a spectral envelope, but not individual harmonics:
LPC spectrum
With an LPC spectrum, you don’t get details of the :
sound source
This type of display can be quite revealing of what the vocal tract filter is doing – how the vocal tract shapes the sound given to it by the larynx:
LPC spectrum
Different vowel __________ show different energy regions.
formants
What are formants?
resonant energy peaks in the vocal tract transfer function
This vocal tract _____________ _____________ applies whether the sound coming into it is periodic (or almost) and phonated, or whether it’s merely being whispered.
transfer function
The formants you see in the vocal tract will be present because of the way you have :
shaped the vocal tract.
Formants will be the same whether the sound is being ___________ or ____________.
phonated, whispered
This type of display can show you what the vocal tract's resonance frequencies are relative to one another.
LPC spectrum
An FFT spectrum shows _______________ ________ that are harmonic components.
individual peaks
The peaks in an FFT spectrum are spaced equally because they are:
multiples of the fundamental.
An LPC spectrum shows peaks that represent:
the formants
The first 3 formants of an LPC spectrum can help us:
identify a vowel
The LPC spectrum reveals details of the:
sound FILTER
The FFT spectrum reveals details of the:
sound SOURCE
A ___________ spectrum will not tell us if a sound is pure or disordered.
LPC
A _________________ shows the ingredients of a sound at a single point in time.
spectrum
A _____________________________ shows the progression of a signal over time.
time domain display
A ______________ shows us how the frequency components of a sound changed over time.
spectrogram
A spectrogram which can show us individual ___________ __________ that are arranged side by side over time.
spectral slices
As we speak, we are producing, in rapid succession, a whole series of sounds that differ from one another in their :
spectral characteristics.
A spectrogram is sometimes called a:
3D spectrogram
Why did the 3D spectrogram get that name?
because it can display 3 different parameters at a time:
What 3 parameters can a spectrogam display?
Time, Frequency, and intensity
A spectrogram shows ________ on the X axis and ___________ on the Y axis.
time, frequency
On a spectrogram, Whether the trace is lighter or darker represents the __________ of the signal.
INTENSITY
On a spectrogram a darker trace = stronger:
intensity or amplitude
A _____________ is kind of a hybrid domain display since it shows both time and frequency.
spectrogram
A ____________ shows us how the strength in each range of frequencies is changing or evolving over time.
spectrogram
A _________________ display is rather rich in information because it shows frequency changes over time and the changes in amplitude of those components – whether they are strong or weak.
3D spectrographic
Spectrograms are made from digital recordings, and so the frequency display on the Y axis is limited to the highest frequency component you were able to save and reproduce, the:
Nyquist frequency.
If you are interested in looking at _____________ _______________ speech on a spectrogram, you must at least double the sample rate, and maybe even give it a little bit more.
high frequency
If you were interested in fricative energy that might go all the way up to 12-14 kHz, you’d have to make sure that the sample rate of your digital recording was at least _________________________ to make sure that you include all of that interesting fricative energy.
double this amount --maybe 25-30 kHz
If you only care about the info from ___________________, then you could sample at 8-10000 Hz.
0-4000 Hz
If you sample at too low of a rate, there’s no way to :
recover info that you never had in your recording.
What is one analysis parameter you can adjust?
bandwidth
A ___________ ______ spectrographic analysis gives you good TIME, or temporal detail.
wide band
A wide band spectrographic analysis can show you individual events that are very ____________, and resolve those with great accuracy.
brief
A ___________ ________ spectrogram tends to blur things together over time, but it gives you extremely good FREQUENCY resolution.
narrow band
Which kind of spectrogram display would show individual frequency components, separate from one another?
narrow band
Unfortunately, the laws of physics prevent us from having good_______ AND _____________ detail on a 3D spectrogram display.
time, frequency
The software takes in ____________________ in order to provide the display, and you need more and more ____________ to get a good idea of what frequency components are present and to separate them.
strings of numbers, numbers
If you want to look at only brief events in time, these will occur over very few samples in a _____________________________.
string of numbers.
You can have good TIME resolution OR good FREQUENCY resolution, but you can’t have :
both at the same time.
A 300 Hz bandwidth is what we refer to as a ______________ ____________ spectrogram.
Wide Band
In a spectrogram, the vertical striations or thin vertical lines, represent:
each glottal pulse as it occurs in voicing.
Because you have a high level of detail and you can see events as brief as individual glottal pulses, you can make very good _________________ from a wide band display.
time measures
On the wide band spectrogram, the X axis represents ___________ and the Y axis represents ____________.
time, frequency
A 45 Hz bandwidth is what we refer to as a ________________ spectrogram.
narrow band
In a narrow band display, we can no longer see the individual :
vertical striations that represent glottal pulses
On a narrow band display, you can see horizontal, almost parallel, lines called:
HARMONICS
The wide-band display could not show harmonics because:
it couldn’t give us good frequency detail
In a ____________________ display, we have extremely GOOD FREQUENCY DETAIL.
narrow band
In a narrow band display, we can tell the _______________ apart from one another.
harmonics
Which type of spectrogram shows individual vertical glottal pulses (vertical striations) and has good TIME resolution?
wide band
Which type of spectrogram shows horizontal lines that represent HARMONICS of the voice, and separates them out… It has –good FREQUENCY resolution?
narrow band
From looking at a spectrogram, you can get a _____________ idea of what’s going on in the voice –kind of like looking at an ______________.
qualitative, X-ray
The larynx and vocal tract differ _________________ from one person to the next, which influences the ____________ of the sounds the person will produce.
anatomically, acoustics
There are structural _______________ that cause the uniqueness of your own voice.
differences
Function, or the way we use our _____________________, can also contribute to our dialect or other features of our voice which are not anatomically based.
articulators, etc.
In speech therapy, a lot of the work we do involves helping people change the function of their _____________. We don’t really get involved with ________________ changes.
articulators, structural
In speech therapy, we work on altering the way the client uses his structures to improve ____________ __________.
sound output.
You can judge a person you hear by their ______________; whether they are older, younger, healthy, sick, how they feel.
voice
A person's voice ______________ come through the acoustic signal.
characteristics
We can study voice features such as __________________________ to understand more about how people speak differently.
prosodic variables or vowel dialect variables
Women’s voices are about ____________________ in their fundamental frequency than men’s are.
an octave higher
What is the main reason that women's voices are typically higher than men's?
Anatomy -- the men’s vocal cords are larger.
However, even when a man raises his pitch to sound like a woman, it is still not very convincing. There are other features that contribute to the difference: (name 4)
1. vocal fold closure patterns 2. harmonic spectral slope 3. differences in noise between the formants 4. the frequency values of those formants
What is spectral slope?
The amplitude of the harmonics resulting from vocal fold vibration falls off by 12 dB per octave. This means that each time the frequency doubles, the amplitude of the harmonics decreases by 12 dB. This is called the spectral slope (or tilt or roll-off) in the source spectrum.
The frequency values of the formants are based on the _______ of the vocal tract.
size
Why are vowels and diphthongs interesting to speech scientists?
Because they tend to carry the greatest amount of energy in speech.
In theory, when you produce a vowel: The vocal tract shape can be held constant, and a vowel can be prolonged as long as you have breath – the vowel is _______________ _______________.
TIME INVARIANT.
During the time you are sustaining a ____________ production, one point in time looks like any other, acoustically.
vowel
___________________ are usually produced very briefly.
consonants
What determines the particular vowel that we will be producing at a certain moment?
the shape of the vocal tract or the configuration of the articulators .
The output from the _____________ is basically the same for every sound we produce (plays an electrical buzzing type sound).
larynx
What makes one vowel different from another?
The frequencies of the formants are different for each one.
The ____________ spectrum reveals features of the vocal tract transfer function.
LPC
The LPC spectrum shows what the ____________ ____________ has done to the sounds that were given to it.
vocal tract
In the LPC display, formants are represented by:
peaks
The exact locations of these _________________ allow us to distinguish between the different vowels that are being produced
formants
F1, F2 & F3 differ across _____________.
vowels.
In a spectral display, the numbering order of these formants goes from:
left to right
Usually you can identify the vowel from the _________________________, even with synthetic speech.
first 2 formants
You could generate speech with a computer, by allocating certain _________________ ______________ that correspond to the formants.
frequency prominences
A computer can generate an F1 and F2 by:
giving more energy to those frequency ranges
The locations of the _______________ – where you put them on the spectrum -- will allow the listener to perceive the vowel as being an /u/ or an /i/ or an /a/.
the formants
_______________ formants tend to make the vowels sound more natural.
The higher formants – F3, F4, F5 and so on
If you don’t include the higher formants in generated speech, it will sound rather ___________________.
robotic.
Formants are:
peaks in the vocal tract transfer function
As energy passes through the vocal tract, there are certain ___________ that are favored, and other frequencies that are weakened or attenuated.
frequencies
The __________________________ will dictate what frequencies are favored (or echoed or resonated) and which are not.
size of the resonator
Think of the pipes of an organ – the small pipes will play __________ notes and the large pipes play ________ notes.
higher, lower
The vocal tract is made up of different _______________ cavities.
resonating
The resonating cavities of the vocal tract differ in their size and shape depending on how you move your ____________.
articulators
_____________ frequencies will differ depending on the articulatory configuration.
Formant
When you reach a formant peak in the vocal tract, energy in that vicinity – in that frequency range – will be _____________________________ so that it comes out of the mouth at a much higher amplitude.
amplified or boosted
Frequencies that occur in a trough of the vocal tract transfer function tend to be:
filtered out or weakened.
______________ ______________is more efficient at the formants.
Energy transfer
Remember that we can identify a vowel just by the frequencies of:
F1 and F2.
The absolute values of vowel ______________ can be influenced by the person’s dialect and gender (men’s vocal tracts are larger than women’s and children’s).
frequencies
Why do children have higher formant frequencies than men or women?
Because they have smaller vocal tract structures, including smaller resonating cavities.
Men, women and children have a similar _____________ _____________ shape.
vowel quadrilateral
A larger vocal tract yields ___________ formant frequencies.
lower
Children have higher_______________ ________________________ than men or women, because they have smaller vocal tract structures, including smaller resonating cavities.
formant frequencies
We can plot the _______________ and ______________ formant frequencies to show VOWEL SPACE.
first and second
The vowel quadrilateral for children is the __________________ and it spreads the furthest up and to the right. This is because their ________________ ____________________ are higher.
largest, formant frequencies
The women’s quadrilateral is in the middle, and the men’s is the furthest down and to the left, because their formant frequencies are ________________. (The men have larger resonating cavities in their vocal tracts.)
lower
As you lower the jaw and tongue, you _______________ the frequency of the first formant.
increase
When you lower the tongue and jaw, you are ___________________________ and causing different cavities to resonate different frequencies.
reshaping the vocal tract
These interact in very complex ways – you can’t anatomically associate F1 with just one ____________ in the vocal tract. They do interact with one another in that the sounds bounce backwards and forwards and ________________ in different ways.
structure, resonate
LOWERING THE JAW AND TONGUE INCREASES THE FREQUENCY OF THE ___________________ ____________________.
FIRST FORMANT.
F2 is influenced by the _____________________________ position of the mouth.
forward or backward
If the tongue is further back in the mouth, the second formant has a relatively _______________ frequency.
lower
AS YOU MOVE THE TONGUE FORWARD, THE SECOND FORMANT’S FREQUENCY ________________.
INCREASES.
Higher vowels =
lower F1
Lower vowels =
higher F1
Front vowels =
higher F2
Back vowels =
lower F2
LIP ROUNDING ______________ ALL FORMANTS, REGARDLESS OF VOWEL TYPE.
LOWERS
The reason lip rounding lowers the formants is because as you round your lips, you are actually ___________________ the vocal tract. The tube that you are resonating, from the glottis to the outside world, is actually becoming ________________.
lengthening, longer
You can lower the larynx to _______________ the tube length, and just like the longer organ pipe, it would resonate lower frequencies. All of your formants would decrease in _________________.
increase, frequency
It is difficult to anatomically pinpoint where a given formant is resonated – there are ________________ _____________ between these chambers or cavities.
complex interactions
The sounds bounce back and forth between the resonating cavities, and influence each other by what 2 ways?
constructive interference and destructive interference
When the waves in the resonating cavities add up to gain more strength, it's called _______________ interference.
constructive
When the waves in the resonating cavities cancel each other out, it's called _______________ interference.
destructive
You can see the vowel /i/ has a high second formant and a lower first formant. So what type of vowel is this?
high front vowel
You can see that /u/, which is a high back vowel, will have what type of formats?
low f1, low f2
The different formants are caused by the associations between the tongue height and its _____________ position in the mouth.
front/back
The absolute formant frequencies of F1 and F2 differ substantially between :
children, women and men.
When we speak and produce these vowels in the context of consonants in the words that we utter, what happens?
we may not produce them in exactly the same ways.
What is one of the main challenges of studying formants?
vocal tract size affects the formants, and so our formants are not uniform.
What are some reasons why our formants can vary for the same vowel?
gender, age, vocal tract size, individual anatomical differences
How do we learn to recognize speech in spite of the variations in formants for the same vowel?
listeners may apply ‘templates’ and timbre and pitch may serve as cues
The differences in formant frequencies are related to the different:
sizes in vocal tracts.
Individuals will differ in their vocal tract anatomy so that their vocal tracts will resonate different _______________.
frequencies.
So formants are going to ________________ from speaker to speaker.
vary widely
The absolute value in ________ of the 1st and 2nd formants for the different vowels that a person produces will be different for each person.
Hz
When we hear speech, we are perceiving completely different sets of _____________________ from each person.
frequencies
It’s possible that listeners use _______________ in their auditory perceptual system.
‘templates’
We try to match features of vowel ________________ to other features of the sound that the person is making.
articulation
The _______________ and ____________ of the voice may be valuable cues that the brain uses to decode vowel signals.
timbre and pitch
If you hear a higher pitch, you would usually associate that with a person who has a _________________ larynx & vocal tract, and therefore you would expect ____________- formant frequencies.
smaller, higher
This variability in _______________________________________ also has a huge impact for people who are working on computer recognition of speech – it’s very difficult for software to recognize what you are saying.
absolute formant values
Speech recognition software gets used to the way you produce your “a” or “i” or “u”, by tuning into your specific :
frequencies
Automatic speech recognition systems, where you are asked to say a word rather than pressing a number, invariably tend to :
make mistakes.
When human listeners encounter new people, they adapt to their speech ___________.
very quickly.
Our brains might also look at the __________ between these frequencies.
ratios
Even if a new person’s absolute values in Hz differ from another person’s we’ve heard, the ratio of the ____________ may be broadly similar.
peaks
There may be a large gap between the first 2 formants for one sound, or they may be close together for another, so the __________ of the peaks is similar, regardless of frequency.
ratio
The _______________ is very good at processing speech sounds.
brain
A ____________ is a sound you can prolong as long as you have breath and hold the vocal tract steady.
vowel
In real speech, vowels are seldom produced in ________________.
isolation
Usually there are _______________ on either side of a vowel.
consonants
The production of the vowel is influenced by the sounds that are:
adjacent to it.
What is coarticulation?
When the produciton of a sound is influenced by the sounds adjacent to it.
It’s quite common for the articulation of one of the corner vowels - /a/ , /i/, /u/, /ae/ to be ________________________________, when the person is producing normal speech.
undershot, or not quite achieved
What are the corner vowels?
/a/ , /i/, /u/, /ae/
When the vowel is undershot, or not quite achieved, it is called:
VOWEL NEUTRALIZATION
During vowel neurtralization, the position of the tongue is a little closer to the neutral vowel /ə/ (schwa) in the ___________ of the vocal tract.
middle
What is the "neutral" vowel?
/ə/ (schwa)
In ________________ , where you have motor speech disorder, the vowels will be under-articulated too much, and the speech is ___________________.
dysarthria, distorted.
The _______________ has to make a judgment on the distinction between a normal and a disordered case of underarticulation.
The clinician
In clinical research, we sometimes use a ___________________ ______________ to quantify how well a person is articulating.
VOWEL QUADRILATERAL
The area within the vowel quadrilateral is called the:
VOWEL SPACE AREA
The vowel space area within the quadrilateral can be reflective of a person’s:
overall intelligibility.
if you were going to treat a person with dysarthria, you could measure the person’s vowel quadrilateral area before it started, and again after, and you could document :
changes/ improvements to the vowel space
Increases in the overall space of the vowel quadrilateral are reflective of larger:
articulator movements
Larger articulator movements are typically associated with improved :
speech intelligibility.
What do vowel ellipses show?
the differences in the way individuals produce vowels
You may produce a vowel rather differently depending on the vowel’s _____________-- which consonants surround it.
context
The _______________ ____________ is a reflection of how much variability there can be in the first and second formants, as you produce that vowel in different contexts.
vowel ellipse
Vowel ellipses show that the frequencies can vary around the ________________________, but it’s still an acceptable version of that vowel.
central or ideal target
What are the most common vowels across all languages?
/i/, /u/ and /a/
Generally speaking, languages tend to have between _____ and ________ vowels.
3 and 9
The acoustic spacing of these vowels tends to put them _____________________ of the vowel quadrilateral.
at the edge
The fact that vowels tend to be around the edges of the vowel quadrilateral means that they are more :
distinctive from one another.
If vowels were all clustered in the middle of the quadrilateral, close to the neutral vowel, it would be difficult to make them:
distinct from one another.
If vowels are ___________________ in the acoustic space, it gives the brain a better chance to separate one vowel from another.
spread out more
When learning a foreign language, one problem is encountering vowels that :
you’ve never spoken before.
It can be very hard as a new speaker of a language to make:
those subtle differences between the vowels that are very meaningful to the native.
We are used to hearing certain proportions of formants, but we’re not used to _____________________ of formants.
new combinations
When learning a new language, it can require a great deal of mental effort to get your ears and brain around these new combinations of __________ that you have not been exposed to before.
formants
When you produce a vowel for a few seconds, its first and second formants :
won’t change.
As you go from a consonant to a vowel, or vice versa, we have things called ______________ _____________, where F1 and F2 will change.
FORMANT TRANSITIONS
You can trace the history or evolution of F1 and F2 as they change over time, going in and out of :
consonants and vowels.
Formants are not always the ______________ ______________ that must be produced as they are in the isolated state.
steady states
The _______________ of a talking person really don’t stay put for very long.
the articulators
The tongue, lips and jaw are in almost constant _____________.
motion.
We don't really produce these successive phonemes in isolation, one at a time, but instead we:
blend them into each other.
Each sound influences, and is influenced by, what?
all of its neighbors
The __________________________________ that we can measure in a vowel really don’t last very long.
F1 and F2 frequencies
The constant movement of the articulators means that the ______________ _____________ are changing shape and size, and thus the ___________ _________________are shifting all over the place very rapidly.
resonating cavities, formant frequencies
An ordinary vowel is called a ____________________ – it has a relatively clear steady state that is maintained for a period of time.
MONOPHTHONG
A DIPHTHONG is defined by its ______________. The formant frequencies will _____________ during its production.
change, change
In a diphthong, the _______________ is the starting frequency, before the movement begins.
ONGLIDE
The ____________ is the ending frequency after you’ve finished making the diphthong.
OFFGLIDE
In between the onglide and the offglide is the ______________, when the formants are changing.
TRANSITION
The ______________ phase is what we perceive as the dipthong.
transition
/oi/ or /au/ or /ai/ are examples of:
dipthongs
It is the movement of the ____________ that really defines the diphthong that we perceive.
formants
What 3 things do all vowels have in common?
They are all voiced, the vocal tract is relatively open, and they can be identified by their formant patterns.
If you have plotted ____________ and _____________, you can identify the vowel.
F1 and F2
What are 4 ways that consonants differ from each other?
1. degree of vocal tract constriction 2. degree of noise 3. degree of nasality 4.articulation features
What makes an important distinction between oral consonants and nasal consonants?
Whether or not the velopharyngeal port is open
The only nasal consonants in English are:
/m/, /n/, and /ng/.
We produce the nasals by lowering the ________________, allowing air and sound to pass through the _______________________ and resonate in the nasal cavity.
velum, velopharyngeal port
While producing nasals, there’s a blockage of the __________ _____________.
oral cavity
Where is the oral cavity blockage for /m/ ?
at the lips for /m/
Where is the oral cavity blockage for /n/ ?
at the alveolar ridge for /n/
Where is the oral cavity blockage for /ng/ ?
at the posterior part of the palate for /ng/.
Compared to vowels, there is far more potential for ________________________ with consonants.
things to go wrong
How do consonants compare to vowels in terms of speech motor control?
There is greater complexity and precision involved in the timing and in the physical articulation of consonants. Vowels, by comparison, are relatively simple to articulate.
SLPs will spend more of their time treating errors in ________________ production.
consonant
What are the two main categories of consonants?
obstruents vs. non-obstruents
The obstruents category of consonants includes what 3 sub-categories?
stops/ stop-plosives, fricatives, and affricates
With _______________ – you get a complete closure of the vocal tract, just momentarily.
STOPS
With _______________ there is a narrowing of the vocal tract, and this forms a partial obstruction.
FRICATIVES
_____________________ are combined stops and fricatives .
AFFRICATES
An affricate has a fairly brief _____________ and a fairly brief _____________ interval.
stop, frication
An ______________ consonant is sort of a combination sound.
affricate
What are the 3 sub-categories of Non-obstruents?
nasals, glides/semi-vowels/approximants and liquids
The ______________ are sounds that do not completely block off the vocal tract, but have some form of constriction.
non-obstruents
Often a non-obstruent sound is produced with a change in the _________________________________________ throughout the production of that phoneme.
pattern of constriction
Just as there is articulatory movement during the production of a diphthong like “oi”, there’s also movement of the articulators during the production of:
glides and semivowels.
When you pronounce the phonemes, __________, __________, and __________, you have articulators moving during their production -- these are not static sounds.
/w/, /l/, and /j/
For English Stops, what are the 3 places of production?
bilabial, alveolar, and velar
What is the Manner for Stops?
a brief closure of the vocal tract
Stops can be either voiced or _______________.
voiceless
STOPS are consonants that involve the stoppage of air through the vocal tract, or complete closure of the vocal tract by :
an articulatory constriction.
In English, the bilabial stops are:
/p/’s and /b/’s
In English, the alveolar stops are:
/t/’s and /d/’s
In English, the velar stops are:
/k/ and /g/
The bilabial stops have a constriction formed by:
putting both lips together
The alveolar stops have a constriction formed by:
raising the tongue against the alveolar ridge.
The velar stops have a constriction formed when:
the back of the tongue reaches up to touch the velum.
If a stop is accurately articulated, there will be a brief instance during which there will be :
no sound in the vocal tract.
Part of a ________________ is reflected in the microphone signal as a brief silent event.
stop
Sometimes a tiny bit of ________________ may sneak through for voiced stops.
energy
During the closure, air pressure builds up behind the __________________________________, and then as the closure is released, this air bursts out.
point of constriction
The __________________________ creates the characteristic sound of a stop being released in the vocal tract.
frication energy
There is also a period of transition when the articulators move from having formed the stop to the upcoming ____________.
vowel.
The stop closure is also sometimes called the _______ _______.
STOP GAP.
A stop gap is characterized by a________________________ during the time that the articulatory constriction is being formed.
lack of energy
A stop gap is most easily seen if the stop occurs ____________ ____________.
between vowels.
If the stop is produced at the beginning of a word: “ta” or “ka”, then it would very difficult in many cases to see the stop closure duration, because of the :
lack of energy prior to the formation of that stop.
The duration of the closure is usually from ______________ milliseconds.
50 -150
In a voiced stop you may have _________________________ that continues throughout the closure of the stop.
vocal fold oscillation
During a stop gap, there is an absence of energy just a brief period of time -- it may last about :
100 milliseconds or so – about 1/10 of a second.
In the stop gap of a voiced stop, there is still some _____________________.
acoustic energy
There is some grey trace in the stop gap of a voiced stop spectrogram that indicates a very low frequency. It is called a:
VOICING BAR
The Voicing Bar corresponds to the oscillation of the ____________ ___________, which is continuing, even after the stop has been formed.
vocal folds
The trace in the voicing bar gradually ____________ during the stop gap.
decreases
The VOICELESS STOPS have a _____________ closure duration.
LONGER
The VOICED STOPS have a _____________ closure duration.
shorter
Voiced Stops have a shorter closure duration no matter what the place of ____________ is.
articulation
For stops in the initial position, It’s very difficult to tell how long the stop closure was because:
there’s no vowel energy prior to the production of the stop.
For stops in the initial position, it's impossible to measure a meaningful stop closure duration because:
you could be holding your articulators in place for a long time before you actually utter one of these syllables
When we want to measure stop closure duration, we have to look at contexts where there is some:
some vowel energy happening before the stop.
If you were to look at stop closure durations in certain speech disorders, you might find that they are increased or decreased, depending on:
what the disorder is
If a person was severely dysfluent and had stuttering blocks, you might find that the stop closure duration is drastically__________ as the person blocks on that sound and is unable to get the mechanism moving ahead into the next vowel.
increased
In some ______________, where you have neuromuscular speech deficits, the speaker may not be able to make a very good stop closure.
dysarthrias
With dysarthria, the tongue or the lips may not approximate their targets, and so they don’t make a particularly long ____________ and it isn’t a very effective one, and that’s why the speech may sound ____________________.
closure, slurred and imprecise.
We can document changes in things like stop closure duration as an index of _______________ adequacy.
articulatory
During the time that the constriction is formed for a stop,_________________ builds up behind the constriction.
air pressure
Once the ______________ of a stop is released, the pressure that has built up behind it forces air out through the space that is now gradually opening.
constriction
When the stop is released, first we get a small _____________, which is a rather sharp burst of air.
TRANSIENT
In acoustics we use the term transients to refer to a ________ ___________ acoustic event .
very brief
Once this burst of a stop has taken place, the noise continues, but in a slightly different form – it’s of __________ amplitude, but of much ____________ duration.
lower, longer
A stop burst is a very__________ event.
brief
After a stop burst, the ___________ continues while the opening is becoming wider and wider – the air is still rushing through it.
frication
After the stop burst, the air coming through is forming turbulence because the airflow is still ____________________ – the pressure has not yet decreased to match that of the ___________ around the speaker.
relatively high, atmosphere
It can be difficult to distinguish between the transient and the ______________ _____________ on a spectrogram.
frication interval
Regardless of the place of articulation, there will be some _____________________ produced when a stop constriction is released.
frication noise
The ______________________ of the stop's frication noise, however, are not constant across all places of articulation.
characteristics
What things will be different if you articulate a bilabial stop versus an alveolar stop?
the spectrum varies; the frequency components are not going to be identical
The spectra for _______________ noise looks very steady and level all the way across. All frequencies are present and are equal in amplitude. Sometimes the envelopes that enclose the spectrum can be more variable.
white
The spectra for _______________ and ______________ noise do not include all of the frequencies from the lowest to the highest. As you get into the higher frequencies, the noise has been shaped to taper off as you get to the upper end of the spectrum, so there is less high frequency energy.
pink and brown
Noise characteristics vary, depending on the :
place of constriction
Why does the place of constriction make the noise characteristics vary?
This is because there are different resonating cavities on either side of the stop closure which will influence the sounds that are being resonated or attenuated.
Different resonating cavities on either side of the ______________________ will influence the sounds that are being resonated or attenuated.
stop closure
This place of constriction has a lower frequency:
bilabial
This place of constriction has a higher frequency:
alveolar
This place of constriction has a mid-range frequency:
velar
With vowel formants, F1 and F2 are created by :
resonating cavities
As the sizes and shapes of the resonating cavities are adjusted by articulating, the _______________________ change correspondingly.
formant frequencies
Large organ pipe = _____________ frequency
low
Small organ pipe = _____________ frequency.
high
When you have a ______________ STOP, there is no narrow passageway in front of it, instead there is a wide open space in front of the speaker’s face. This large resonating space results in a ___________ frequency.
BILABIAL, low
Directly in front of an ______________ STOP is a small space between the lips where the fricative is formed. Since the space is smaller, the frequency is ______________.
ALVEOLAR, higher
There is some space in front of the velum with _____________ STOPS, but it is inside the oral cavity. This space, because of its size and shape, resonates _____________ frequencies.
VELAR, mid-range
The noise characteristic of each of the places of articulation can help the listener identify what?
where the stop was formed.
Frication is common among all of these places of articulation but the sound is going to be shaped by the ____________________________ associated with that place of articulation.
resonating cavities
Even though stops can be produced in the same place of articulation, there can be differences, depending on the _______________________ in which they are produced.
phonetic context
When the stop occurs in the _______________________ POSITION, or at the end of a word, the listener will hear the frication as the stop is released.
WORD-FINAL
(Ex: pad, pat) When you clearly release the stop closure in a word-final stop, it will definitely help with :
intelligibility and speech clarity.
With more casual speech, it is harder to tell where the closure ends because the stop is not released as clearly, thus preventing the :
production of a burst of noise.
In casual speech, the place of articulation, since it is associated with certain shaping of the noise (or adjustment of the noise spectrum), can’t be identified because:
the information is missing that indicates this to the listener.
If you are learning a foreign language and listening to a native speaker of that language, you will be better able to identify________________________ if they clearly release their stops.
places of articulation
If a person ____________________________________ , it will give you more cues as to the place of articulation and the particular phoneme they are producing.
clearly releases their stops
A native listener might understand even if the stops are not released, because they would be able to tell from the ________________________________ what that sound ought to be.
context and their familiarity with the language in general
For unfamiliar listeners of a language, it’s helpful to have the extra information of having the stop released, and to hear the _________________________ that is included in that signal.
fricative energy
The darkness of the trace in a spectrogram indicates the _____________________ of the signal.
amplitude / strength
The frication is typically weaker in a _______________ stop.
voiced
The phenomenon of ____________________ is only seen in a voiceless stop context , and only when the stop is followed by a vowel (CV context).
ASPIRATION
ASPIRATION is only seen in a ______________________ context , and only when the stop is followed by a ___________.
voiceless stop, vowel
During aspiration, the vocal folds are beginning to _____________ for the vowel that is about to begin.
adduct
As the vocal folds come closer together, the air that’s moving between them creates turbulence at the ___________.
glottis.
When a person has just formed a voiceless stop, the vocal folds have been pulled apart from one another by activating the ______________________ muscles.
posterior cricoarytenoid
There is a complete separation of the vocal folds during the time that the voiceless stop has been formed. This is called a:
laryngeal devoicing gesture.
Once the stop is released, the air starts to flow, giving us the ________________________ interval.
transient and frication
Before the vowel actually starts, the vocal folds have to ________________________ for phonation to occur.
come into position
As the vocal folds are gradually drawing together, ______________________ is generated.
aspiration noise
During aspiration, air is moving through the glottis -- the vocal folds are not close enough together to produce ______________, but they are close enough to produce some _____________________ because the air is rushing between them in a narrow space.
phonation, turbulent noise
Aspiration noise is visible on a ___________________.
spectrogram.
On a spectrogram, the aspiration shows a weaker frication energy than the :
post-stop burst
The aspiration is coming from the _______________, not the point of constriction.
glottis, from the vocal folds themselves
The frication noise occurs at :
the place of constriction
________________ is a relatively soft acoustic event, rather weak in its energy.
Aspiration
_____________________ is a weaker noise to begin with, and it also has a further distance to travel before it reaches the person’s mouth.
Aspiration
Aspiration is only seen in the context of a _______________________.
voiceless stop.
In a ___________________, there is vocal fold oscillation very shortly after the burst release has occurred.
voiced stop
In a voiced stop, the vocal folds are ____________________ so there is no opportunity for the creation of aspiration.
already together
Stops can be categorized into voiced and voiceless, but this does not mean that there is a _________________ of voicing on either side of a voiceless stop, and it doesn’t necessarily mean that there is voicing ______________ a voiced stop.
total absence, all the way through
The difference between ‘p’ and ‘b’ or between ‘t’ and ‘d’ depends on :
VOICE ONSET TIME
The time between the release of the stop closure and the onset of vocal fold vibration is called:
Voice onset time (VOT)
A voiced stop occurs when the voice onset time is :
close to zero
Sometimes there are even negative values in voice onset time. This is called:
PREVOICING or VOICING LEAD.
Voiced stops typically have ____________ VOT values .
low
VOT's can range between ________________ for b, d, & g.
-20 to +20 milliseconds
Voice onset time is the time between:
the release of the closure and the start of vocal fold oscillation.
The notion of voice onset time refers to coordination between the activity of the ___________ and the ____________ as it forms constrictions.
vocal folds, vocal tract
The timing between releasing a stop and ________________________________ makes all the difference between voiced and voiceless stops.
initiating vocal fold vibration
When you say ‘pa’ with aspiration, _____________ does not immediately begin when the lips open.
voicing
When you say ‘ba’, there is _________________. There is vocal fold oscillation during the closure and it continues once the __________ have opened.
prevoicing, lips
When we look at a waveform with a spectrogram, how do we make the decision that voicing has actually begun? We rely on these two things:
(1) The microphone signal at the top shows regular up and down oscillations where the second line is. (2) The time aligned spectrogram below shows that the vertical striations (which represent individual glottal pulses) also begin at that second line.
In a Voiced Stop, the VOT includes:
the duration of the frication interval (from the opening of the stop to the onset of vocal fold vibration)
In an Aspirated Voiceless Stop, the VOT includes:
the duration of the frication interval PLUS the aspiration
Voice onset times for voiceless stops tends to be above ____________________, and can in some cases can be as long as ____________.
25 milliseconds, 100 ms.
After the stop constriction has been released and the burst occurs, it can take up to about _____________________ before vocal fold oscillation commences for the following vowel.
a tenth of a second
Name 3 factors that influence how long voice onset time is in a given context:
Age, Rate of Speech, and Phonetic Context
If we rush through our sound productions, then VOTs will be somewhat :
shortened.
If we rush through our speech, we would still make the same voiced and voiceless distinctions, but all of the values would be somewhat ____________________-- because the overall production was sped up.
abbreviated
When we speak, we experience the phenomenon of ________________, in that sounds influence one another.
coarticulation
The VOT for ‘ata’ is ______________ since it is a VOICELESS STOP.
longer
‘Ada’ will have a much _________________ VOT than 'ata'.
shorter
Since ‘ata’ is a voiceless stop occurring before a vowel you’ll see what 2 things happening on the spectrogram in the VOT space?
1. There is a darker portion at the beginning of the VOT area that is the burst release and frication. 2. After that there is a lighter portion of noise energy, which is the aspiration created as the vocal folds are coming together. You have turbulence from the glottis there.
The total VOT for a voiceless stop is the duration of the _______________ plus the duration of the ____________.
burst and frication, aspiration.
VOT's can be separated into what two groups?
Voiced and voiceless
In English, is there overlap between the voiced VOT group and the voiceless VOT group?
No, unless there is a speech disorder that is causing voicing errors.
When the voicing has commenced prior to the release of the stop closure, it is called:
voicing lead
VOT's from ____________ up to about __________ milliseconds have a short lag, which we associate with voiced stops.
0-20
Voiceless stops have VOT's with a ______________ lag because the time between the release of the constriction and the onset of vocal fold vibration is much ________________.
long, longer
Many of the world’s languages (including English & Spanish) have just __________ VOT categories.
two
The _____________ for English and Spanish, however, are not identical.
VOTs
If you were to compare ‘pa’, and ‘ba’ in both English and Spanish you would find that for English the VOT values are somewhat _______________. They would be shifted more to the _____________ on a distribution diagram.
higher, right
VOTs for Spanish are shorter, there is often a ________________________ for the voiced stops. The distribution would be pushed somewhat to the __________ on a diagram.
voicing lead/ prevoicing, left
The absolute number of _______________________ for the boundary between a voiced and a voiceless stop is not going to be the same for English and for Spanish.
milliseconds
English and Spanish each have _____________________ distributions so voiced/voiceless distinctions can be clearly made.
non-overlapping
Some other languages (such as Thai) have 3 _____________ categories.
VOT
It can be challenging for a native speaker of English to learn Thai because you have to distinguish among 3 different categories of voicing rather than the __________ that we are used to.
two
We don’t generally do voicing lead, or _____________, for 'ba' in English , but would still recognize it as ‘ba’.
prevoicing
The English 'pa' would have a _________ VOT.
long
In Thai, there is a sound that has the features of a voiceless bilabial in terms of the force of producing this plosive, but its VOT is closer to where we would expect it to be for a voiced plosive, /b/. To our ears, it seems to have elements of ________________.
both/p/ and /b/.
The human _________________________ is very keen on assigning sounds to categories.
auditory perceptual system
So it is rather confusing to a native English listener to be confronted with 3 categories when we are only used to _____.
2
When we hear an unfamiliar voicing sound, we try try to :
make it fit into a category with which we are already familiar.
A -62 VOT is an example of _________________, which we typically don’t do in English.
prevoicing
A VOT of 10 ms is ________________ in English.
uncommon
Place of articulation has a __________ impact on VOT .
slight
For VOT's, there is a progression from the _____________ of the mouth to the ____________ of the mouth.
front, back
BILABIAL VOTs are generally ______________. (p’s and b’s)
shortest.
ALVEOLAR VOTs are _______________________ (t’s & d’s)
in an intermediate range
VELAR VOTs are _______________ (k’s and g’s)
longest
For each place of articulation, we still have non-overlapping ____________________ categories. T’s and d’s don’t overlap with each other, neither do p’s and b’s or k’s and g’s.
voiced/voiceless
There’s a general rule that the place of _________________ does influence the overall VOT values.
articulation
If you looked at all of the VOICELESS VOTs you would find the shortest ones for the ______, the intermediate ones for the ______ , and the longest ones for the________.
p,t,k
If you looked at the VOICED ONES, the VOTs would be shortest for the ___________, intermediate for the ________, and longest for the ________.
b,d,g
Sometimes there won’t even be a VOT. This will occur if the stop is ______________________.
at the end of a word
If a stop is at the end of a word, there will be no vowel following it with which to:
measure starting phonation after the release of a stop.
Even when there's no VOT, our perceptual system has to decide whether a stop is :
voiced or voiceless
What are 2 cues to help us decide if the stop is voiceless?
a longer stop gap and a stronger release burst
What are 2 cues to help us decide if the stop is voiced?
a voicing bar is visible on the spectrogram and There is a longer vowel before a voiced stop
A voicing bar is visible on a ____________, as well as audible to our ears.
spectrogram
During the stop closure, there is some __________________________ available to the auditory system which we use in our overall processing of the sound and determine that it was likely voiced. We don’t process this at a ___________ level.
low frequency energy, conscious
There tends to be a LONGER ___________________ for the voiceless stops.
STOP GAP
Subconsciously, the brain processes the differences between _____________for stops that are voiced or voiceless.
stop gap durations
When VOICELESS STOPS are released, the burst is quite a bit stronger because:
there is more oral pressure built up behind the constriction.
VOICED STOPS have less pressure available because :
the pressure coming from the lungs is being used to drive the voice to contribute to phonation.
The ___________________ STOPS ARE MORE VIGOROUS in their production and the burst is much clearer.
VOICELESS
Voiced stops have ____________ pressure available to build up directly behind the constriction.
less
There are differences in the way we produce our _____________ prior to a voiced or voiceless stop, i.e. Had vs. Hat.
vowels
The VOWEL IS QUITE A BIT ____________BEFORE THE VOICED d, and______________ before the voiceless t. ( i.e. Had vs. Hat.)
LONGER, shorter
We can hear how long the vowel is that leads up to a final _____________. It helps us to identify ___________________.
stop, the final consonant
The redundancy between the stop cues helps the ________________ determine whether a stop was voiced or voiceless.
auditory cortex
The height of the jaw, or the size of the mouth opening will influence the _____________ formant.
first
As the _____________________ opens wider, the 1st formant increases in frequency.
mouth
If the mouth is closed for a stop production, F1 will be ____________, but it will ___________ in frequency as you move into the following vowel.
low, increase
F2 and F3 are influenced by the :
place of constriction.
Our ears and auditory cortices pick up these changes in formants, helping us identify where the place of ____________ was, while moving from consonant to vowel.
articulation
The burst release following a stop closure will help the listener identify the place of articulation because the spectral characteristics of the burst will differ depending on the:
place of articulation.
VOT also changes with place of_______________ , giving us a second cue to help us.
articulation
Combining the information we can get from the burst release and from VOT with the information we get from the ____________________ from consonants to vowels gives us even more information to help determine the place of articulation.
formant transitions
The stop burst __________________________ help to identify whether it was a bilabial, alveolar, or velar stop.
spectral characteristics
When we produce FRICATIVES, the sound is generated by:
air turbulence
For fricatives, turbulence arises because air is pressurized on one side of a constriction and it flows through a :
very small opening
For fricatives, when the moving air collides with the still air beyond the constriction it causes turbulence ; thus, _____________ is generated.
noise
Fricatives sound different from each other because they have different :
places of articulation
One reason different fricatives make different sounds is because the __________________________ on either side of the constriction will vary in shape and size.
resonating cavities
A larger cavity will naturally resonate ______________ frequencies.
lower
The __________________________ will determine the spectral characteristics of the fricative.
place of articulation
It doesn’t matter to the vocal tract whether the sound given to it is a _______________ source or a ____________ source --it will still shape, resonate, and adjust the sound that is given to it.
noise, periodic
The burst of a stop, when it is released, creates ___________________ that is shaped depending on the place of articulation.
frication noise
A _______________ is just like an extended version of one of those stop burst releases.
fricative
STOPS have a frication at the end of the __________________________.
stop release.
When we produce a FRICATIVE it may be a similar form of noise to a ____________but it lasts longer.
stop
When you produce a voiced fricative, you have what two sound sources?
the vibration of the vocal folds and the turbulence that is created at the constriction.
When you produce a voiceless fricative, you have what single sound source?
the turbulence that is created at the constriction.
When you produce a voiceless fricative, there is no sound being produced by the ______________.
larynx.
As listeners we don’t usually separate out /z/ and /s/ as having two sound sources; we just perceive them as being completely different sounds. In reality, they are almost identical. The only difference is that :
one has voicing and the other doesn’t.
When the larynx is not contributing to phonation (such as in a voiceless fricative like /s/), all of the air pressure from the lungs is directed to ________________. This produces quite a strong fricative, because all of the pressure is directed to that one sound source.
that constriction.
When you are producing /z/, the actual frication that takes place at the alveolar ridge is not quite as vigorous as it was for the /s/, because :
the larynx has used up a certain amount of pressure that the lungs generate
For a voiced fricative, some of the pressure was used to create___________________ before it even reached the oral cavity. This leaves less pressure available to create turbulence at the fricative constriction.
vocal fold oscillation
Since you have voicing going on WITH frication for /z/, those two sounds add together and you get a qualitatively different type of sound that contains both harmonic energy from the __________ and turbulent energy from the ___________________________.
voice, fricative constriction
In a spectrum, the ______________ is the more jagged fuzzy line, and the more solid line is the __________, which outlines the envelope of this spectrum more clearly.
FFT, LPC
The lips are _____________ somewhat if you produce an ‘s’ on its own. The lips are _____________ to produce ‘sh’, which creates a cavity anterior to (in front of) the constriction. These differences will cause the sound to _________________________.
retracted, rounded , resonate differently
‘f’ is a low___________________ sound.
intensity
'theta' means the _____________'th' sound.
unvoiced
In some types of ________________, the production of the two fricatives ‘s’ and ‘sh’ becomes rather similar . This makes it harder to _____________________________________ one from the other.
dysarthria, perceptually distinguish
In dysarthria, the 's' and 'sh' are both pronounced closer to the middle than they should be. The spectra for these two sounds then becomes __________________________, diminishing intelligibility.
more similar to each other
Why are nonstridents (f, v, theta & thorn) more difficult to distinguish from one another ?
Their spectral shapes are broadly similar and they are acoustically weak
The ____________________________ are so soft that we have a harder time telling them apart.
non-stridents (f, v, theta & thorn)
The phone acts like a _________________________________. It eliminates the highest and the lowest frequencies. In the case of these fricatives, the ___________________ can be very valuable in helping you tell the fricatives apart. The lower frequency frication energy you are left with on the phone is common to more than one fricative. Thus, they sound similar to one another over the phone.
band pass filter, high frequencies
An AFFRICATE is a combination of what two different phoneme types?
a stop and a fricative
In an Affricate, first there is a stop, with a more noisy release than a typical stop production because the frication lasts_____________________________ than a stop. This friction interval_____________________ as an isolated fricative.
somewhat longer, is not as long
An _________________ is sort of a compromised sound, somewhere between a stop and a fricative in its production.
Affricate
When we say the affricates ch(urch) and j(udge), we don’t think of them as having a separate stop and fricative component because:
they are produced in such rapid succession we hear them as a single sound.
NASALS are ________________ different from other consonants.
qualitatively
The English language has what three nasal consonants?
/m/, /n/& /ng/.
If you produce a vowel between two nasal consonants, the vowel will probably become nasalized because :
the open velopharyngeal port contributes to damping of the energy that enters the nasal cavity.
We produce NASAL CONSONANTS by opening the velopharyngeal port and :
creating an obstruction in the oral cavity.
All nasal sounds are ______________, so the larynx is contributing phonation to their production.
voiced,
The place in the oral cavity where the _________________________________ will determine the acoustic characteristics of the nasal consonant.
obstruction is made
If you make an 'm' (bilabial) you have a __________________________ resonating cavity in the mouth, and leading up into the nasal passageway.
relatively large
If you make an 'n' (alveolar constriction) you change the _________________________________________, which makes the sound different from an 'm'.
shape and size of the resonating cavity
If you make an 'ng', the constriction is ________________________ in the oral cavity, so it has its own acoustic characteristics which are determined by its place of articulation.
most posterior
Even though the listener can tell the difference between the nasals fairly well, they are not very distinctive _____________________. Looking at them on a ________________, it is not easy to tell one from another.
acoustically, spectrogram
Nasals are very LOW IN ___________________. Because the sound is being radiated through the nasal cavity, the soft tissues there _________________ the sound. Therefore, the ________________ of the sound that reaches the listener is rather low.
INTENSITY, attenuate/dampen, intensity
/w/ and /j/ are called GLIDES because:
the tongue is moving (gliding) during their production.
Glides can be compared to ___________________, in which you both phonate and move the tongue, creating a change in the formant frequencies from one part of the sound to the next.
diphthongs
______________________ are similar to diphthongs in that the tongue is gliding or moving from one articulatory configuration to another.
Glides
Glides are also called:
approximants or semi-vowels
Why are glides sometimes called approximants?
because they approximate certain positions or constrictions
Why are glides sometimes called semi-vowels?
because they are half vowel like -- they have a relatively open vocal tract and there is phonation going on during their production
Glides have a _________________________ vocal tract and there is _________________ going on during their production.
relatively open, phonation
Besides the glides, the term approximant is also used for:
/r/ and /l/.
/w/ involves lip rounding, and the elevation of the ______________________. This is similar to how we produce an /u/ vowel. To illustrate how similar they are, put a w between two /u/ vowels. The articulators don’t move much at all. They are fairly close in terms of their _____________.
back of the tongue, configuration
The rate of change during ________________________ can be a cue to the listener as to which of these sounds is being produced.
formant transition
With b, you have formed a plosive; so the plosive burst, along with _____________will cue the listener as to the manner of production. (plosive)
VOT
The two liquids in English are :
/r/ and /l/.
The LIQUIDS are voiced ________________________.
all the way through.
The F3 for /r/ is ______________; There is a characteristic ___________ in F3 on a spectrogram representation of an /r/.
low, dip
/r/ and /l/ are completely separate phonemes in English, but this is not the case in all languages. They are __________________________ in Korean.
allophonic variants
An ______________is a version of a phoneme that may be produced slightly differently under different conditions.
ALLOPHONE
An allophone is subject to the influence of ________________________. The allophone varies depending on the ______________________________.
COARTICULATION, sounds that surround it.
In English SLP, there is a lot of opportunity to help children articulate these ____________________ sounds.
liquid (/r/ and /l/)
Liquids (/r/ and /l/) are difficult to master because of the _________________ involved in their production.
muscular patterns
Either of the liquid sounds (____________________) can potentially be sustained, but this is not what we really do in speech. It is the movement of the ______________________ that is characteristic of the production of these sounds in normal speech production. There isn’t much of a ______________ state, or prolonged element in their production.
(/r/ and /l/, formants, steady
It is the __________________ that is particularly characteristic of /r/ as it is produced in running speech.
F3 change