Use LEFT and RIGHT arrow keys to navigate between flashcards;
Use UP and DOWN arrow keys to flip the card;
H to show hint;
A reads text to speech;
186 Cards in this Set
- Front
- Back
Resonant consonants
|
Nasals /m/ /n/ /ng/
glides /w/ /j/ liquids /r/ /l/ |
|
Nonresonant consonants
|
fricatives /f/ /v/ /s/ /z/ /th/ /TH/ /sh/ /3/
stops /p/ /b/ /t/ /d/ /k/ /g/ affricates /ch/ /dj/ |
|
resonant consonants are characterized by ______ _______ which are similar to vowels
|
acoustic resonances
|
|
2 articulatory gestures of nasals
|
lowering the velum
oral occlusion |
|
3 resonant cavities of nasals
|
pharyngeal
nasal oral |
|
source of sound of nasals
|
periodic vibration of vocal folds
|
|
POA of nasals
|
place of occlusion
bilabial = lips, low freq alveolar = alveolar ridge, higher freq velar = velar region, highest freq |
|
parts of nasal coupling that reduces intensity
|
1 anti-resonances (0's instead of peaks)
2 elongation of vocal tract due to velopharyngeal opening 3 damping of the nasal cavities |
|
more anterior the place of articulation the _____ the vocal tract and the _____ the resonant frequency
|
longer
lower |
|
POA of fricatives
|
point of articulation
turbulent (aperiodic noise) behind constriction filter: cavity in front of the constriction |
|
4 categories of fricatives
|
labiodental (voiceless /f/ voiced /v/)
linguadental (voiceless /th/ voiced /TH/ alveolar (voiceless /s/ voiced /z/) postalveolar (voiceless /sh/ voiced /3/) |
|
are there formant structures of fricatives?
|
no
|
|
the more _____ the POA (fricatives) the _____ the front resonating cavity the _____ the resonant freq
|
anterior
shorter higher |
|
3 phases of stops
|
1 closing
2 hold 3 release |
|
source of stops
|
aperiodic or aperiodic+periodic
|
|
3 types of stops
|
bilabial (voiceless /p/ voiced /b/)
alveolar (voiceless /t/ voiced /d/) velar (voiceless /k/ voiced /g/) |
|
acoustics of stops there is a _____ during the "hold" phase
|
gap
|
|
definition of voiced onset time in stops
|
time between stop release and initiation of phonation
|
|
VOT is ______ in voiced stops and ______ in voiceless stops
|
shorter, longer
|
|
Process of syllable initial consonants
|
stop gap before burst of sound
period of silence no sound escaping |
|
three characteristics of stop production
|
a) occlusion/ closure (50-100 ms)
b) release (5-40 ms) c) transition to vowel |
|
due to whether it is voiced or not, voicing occurs during ______
|
closure
|
|
VOT (voice onset time) is _______ ms for voicelss
|
40-80 ms
|
|
there is _______ in voiced consonants, and it can last from ______ ms
|
prevoicing
-20-20 |
|
word final stops do not always show a _______
|
release
|
|
_____ _______ occur due to the changing shape of the vocal tract. It is resonating at different frequencies
|
formant transitions
|
|
voiced stops = _____ (high/low) frequency energy
|
low
|
|
stop gap
|
acoustic interval corresponding to the articulatory occlusion phase
|
|
release burst
|
transient produced at the release of the articulatory occlusion (<40 ms)
|
|
formant transition
|
reflect changes in vocal tract shape
|
|
spectrum
x axis ______ y axis ______ |
time
freq |
|
pressure difference below and above glottis creates ______
|
voicing
|
|
/s/ is _____ (high/low) freq energy because the cavity in front is _____ (small/large)
|
high
small |
|
tricks to increase voicing time
|
increase volume oral cavity by blowing up cheeks
increase volume oral-pharyngeal cavity by lowering larynx release air between the lips (or nose) |
|
________ (voiced/voiceless) are most intense
|
voiceless
|
|
bilabial
burst spectrum- voice onset spectrum- VOT- |
burst spectrum- falling
voice onset spectrum- low freq dominance VOT- short |
|
alveolar
burst spectrum- voice onset spectrum- VOT- |
burst spectrum- rising
voice onset spectrum- high freq dominance VOT- intermediate |
|
velar
burst spectrum- voice onset spectrum- VOT- |
burst spectrum- compact
voice onset spectrum- low freq dominance VOT- long |
|
F2 indicates _______
|
POA and mannor
|
|
f1 is always ______ (rising/falling) because of characteristics of vocal tract
|
rising
|
|
when going from a consonant to a vowel f1 ______ (rises or falls)
|
rises
|
|
when going from a vowel to a consonant f1 ______ (rises or falls)
|
falls
|
|
if f2 does not have sharp transitions it may be a problem with ____
|
muscle tone
|
|
dyarthric
|
reduced vowel space, reduce tongue mobility
|
|
F2 order for bilabial, alveolar, and velar
|
bilabail = lowest
alveolar = highest velar = mid |
|
cavity behind /b/ is ______ than /g/
|
longer
|
|
plosives produced in the front part of the oral cavity are more ____ than in the back
|
voiced
|
|
if there is a small cavity behind obstruction, _____ _____ difference cannot be maintained easily
|
transglottal pressure
|
|
whistle is a _____
|
fricative
|
|
fricatives
|
narrow constriction
turbulent airflow |
|
duration of noise segment
stops affricates fricatives |
stops = 75 ms
affricates = 75-130 ms fricatives = 130 ms |
|
reynolds number
|
laminar flow becomes turbulent
low values > laminar flow as it increases, it transitions to turbulent flow critical point is when it is no longer laminar but turbulent |
|
definiton of VOT
|
time between stop release and initiation of phonation
|
|
VOT is shorter in ______ stops (voiced/voiceless)
|
voiced
the closing movement of vocal folds starts earlier in voiced stops to prepare for phonation |
|
spectrum
x axis _____ y axis _____ |
freq
intensity |
|
spectogram
x axis _____ y axis _____ |
time
freq |
|
3 parts to spectogram
|
time
freq intensity darkness |
|
darkness characterizes intensity (light vs dark)
|
light = valleys
dark = peaks |
|
wideband
|
small chucnk of signal
more detailed temporal infor less detailed freq info only formants, no harmonics |
|
narrowband
|
large chunk of signal
less detailed temporal info more detailed freq info formant structures presented by small peaks |
|
good temporal resolution means _____ (fewer/more) samples per spectrum
|
fewer
|
|
narrowband window length is ______ than 2 pitch period
|
longer (25 ms)
|
|
wideband spectrogram window is usually ____ than one pitch period
|
shorter
3-6 sec |
|
What are the 2 categories of nasal speech?
|
Nasal consonants (m, n, ng) and nasal/nasalized vowels (phonemic nasal vowels and phonetically nasalized vowels)
|
|
nasal consonants
|
the velum needs lowering
connects to nasal tract so no air flows out nostrils or mouth |
|
nasalized vowels
|
coarticulation
nasalization = production of sound while the velum is lowered so some air escapes through the nose during the production of the sound by the mouth |
|
nasalization
|
form of coarticulation
related to velar movement |
|
what are the muscles o the velum
|
1. tensor veli palatini
2. levator veli palatini 3. palatoglossus 4. musculus uvulae 5. palatopharyngeus |
|
functions of levator veli palatini
|
from: temporal bone and cartilage of the eustachian tube
to: midline of the velum function: elevate the velum |
|
function of the tensor veli palatini
|
anterior-lateral to LVP
function: to assist the LVP to elevate the velum |
|
function of the palatoglossus
|
from: sides of the velum
to: sides and dorsum of the tongue function: lower the velum and elevate the tongued dorsum |
|
function of the palatopharyngeus
|
from: velum
to: thyroid cartilage and wall of pharynx function: pull the pharynx upward and the uvula downward and backward |
|
musculus uvulae
|
mucle of the uvula
function: move and shape the uvula |
|
nasal murmor
|
the resonance of the nasopharyngeal tract with the oral cavity coupled as a closed side-branch
|
|
lowering of the velum ______ the velo-pharyngeal port
|
opens
|
|
larger volume means _______ (lower/ higher) resonating frequency
|
lower
|
|
in a spectra, peaks = ______ and valleys = _______
|
resonance, anti-resonance
|
|
nasal vowels have ________ intensity
|
reduced
|
|
there are extra peaks and valleys in the spectrum for nasal vowels because...
|
there are more resonances in the nasal cavities
|
|
the intensity of nasal vowels is lower than that of oral vowels because ______
|
the soft mucosa in the nasal tract absorb sound energy
|
|
resonance of nasal tract _____ acoustic energy which _______ energy in the oral cavity. This results in a spectral _____ for the nasal tract and a spectral ____ related to shunting of acoustic energy.
|
increases
reduces peak zero |
|
a spectral zero close to an oral formant can ______ the amplitude and shift the frequency formant
|
attenuate
|
|
frequency of nasal murmur is related to the volume of the vocal tract ______
|
indirectly
|
|
all the cavities in the nose represent ______ peaks and zeros in the spectra
|
multiple
|
|
how does nasalization change formant structures?
|
reduces f1
shifts the frequency, lower f2 |
|
nasality
|
perceptual term that quantifies the degree of nasalization
|
|
the physiological variable underlying the perception of nasality is ______
|
velopharyngeal opening
|
|
the acoustic variables underlying nasality could be ______
|
nasal peaks and zeros
shifted/attenuated oral formants reduction of low-freq nasal energy |
|
hypernasal spech
|
a disorder related to abnormal resonance of the vocal tract due to incomplete closure of the velopharyngeal port.
|
|
cleft palate is a _______ disorder
|
craniofacial
|
|
nasal speech measurement I: Nasal airflow
|
indirect measure of velopharyngeal aperture
nasal mask |
|
nasal speech measurement II: velopharyngeal aperture
|
structural imaging (3D MRI)
Aerodynamic measurement (look up equation) |
|
nasalance
|
ratio of the nasal acoustic energy to the total acoustic energy; an idorect measure of velopharyngeal aperture
measured by nasometer |
|
vocal tract model for oral vowel production
|
modeled as a tube closed at the glottal end and open at the lip end
TWO CAVITIES (oral, pharyngeal) |
|
shape of the tube in a vocal tract model for oral vowel production is determined by ______
|
area function of the vocal tract
|
|
vocal tract model for nasal vowel production
|
produced with an open velopharynx which connects nasal tract with oropharyngeal cavity
THREE BRANCHES (oral, nasal, pharyngeal) |
|
three branch tube model
|
for nasal
the sound pressure recorded by the microphone is a combination of sound pressure radiated from nostrils and lips |
|
direct magnitude estimation
|
psychophysical method to scale a perceptual continuum
|
|
advantages of DME over other scaling methods
|
DME does not assume a linear partition of the continuum
DME is not bounded by fixed minimum/ maximum values continuity of DME rating score is consistent with nasality as a continuous variable |
|
the driving pressure behind vocal fold vibration and vocal intensity is ______
|
subglottal pressure
|
|
loud speech = _______ (higher/lower) intraoral pressure
|
higher
|
|
In a nasal spectogram there is no pressure build up because....
|
air is going out the nose
|
|
Children have higher pressure because:
|
they talk louder
oral cavity is smaller (which means larger pressure) |
|
Consonant with the highest pressure is ______
|
affricates
|
|
pressure and intensity are ______ related
|
proportionately
|
|
what is maximum pressure
|
measure of how much pressure can be developed within oral cavity
|
|
variations in flow are related to variations in __________
|
manner
|
|
Flow = ______
|
movement of a volume of air through a given area over a unit of time
|
|
3 types of airflow
|
intraoral (associated with manner of articulation)
glottal flow nasal flow |
|
pressure and flow are incompatible in _____
|
stops
|
|
pressure and flow may be measured simultaneously in _____
|
fricatives
|
|
flow = ____ over _____
|
volume over time
|
|
pressure, flow and resistence are the aerodynamic equivalent of ______
|
Ohm's Law
|
|
flow is highest in _____ (initial/end) and______ (voiced/voiceless)
|
initial and voiceless
|
|
highest to lowest flow:
|
consonants
fricatives glides |
|
average airflow increases with ______ and _______
|
intensity and f0
|
|
nasometry
|
comparison of acoustic and nasal intensity signal
|
|
nasalance = (hint:ratios)
|
nasal intensity: oral intensity
|
|
high vowels = ____ (more/less) nasal airflow and intensity
|
more
|
|
prosody
|
reflects features of the utterance of the speaker
emotional state form of the utterance irony/sarcasm |
|
Suprasegmental
|
units larger than a phoneme
stress: syllable intonation: phrases duration: varies over many units in speech juncture: whether adjacent phonemes are closely associated or not |
|
Lexical stress
|
stress patterns in words
REcord vs reCORD |
|
Sentential Stress
|
emphasizes words in sentences
is that your RED book |
|
contrastive stress
|
emphasis on a normally weak syllable
REceive vs DEceive |
|
prosody
|
reflects features of the utterance of the speaker
emotional state form of the utterance irony/sarcasm |
|
Suprasegmental
|
units larger than a phoneme
stress: syllable intonation: phrases duration: varies over many units in speech juncture: whether adjacent phonemes are closely associated or not |
|
Lexical stress
|
stress patterns in words
REcord vs reCORD |
|
Sentential Stress
|
emphasizes words in sentences
is that your RED book |
|
contrastive stress
|
emphasis on a normally weak syllable
REceive vs DEceive |
|
Intonation
|
changes in f0 over utterance
emotion final pitch/questions |
|
declination
|
fall in f0 at the end of an unmarked breath group (end with narrow range of f0)
|
|
factors that decrease f0
|
relaxing CT muscle
decreasing subglottal pressure |
|
intrinsic duration (diphthongs)
|
longer
|
|
juncture
|
that way sounds are joined or separated from one another
|
|
assimilation
|
a sound becomes like its neighbor, one articulator is involved
|
|
partial assimilation
|
no change in phonemic category
dentalization of /t/ before /th/ in eat the cake |
|
complete assimilation
|
phonemic class changes
velarization of /n/ before /k/ in ten cards |
|
coarticulation
|
two articulators active at the same time for two sounds
lip rounding for /t/ in too |
|
coarticulation is seen as a _______ process
|
smearing
|
|
coarticulation (heterogenic)
|
different articulators - overlapping movement sequence
|
|
coarticulation (homorganic)
|
same articulators - adjustments in movement sequences influenced by phonetic neighbor
|
|
coarticulation is determined by _______
|
by distinctive features of other segments
|
|
vowels reduced to ______ when unstressed
|
schwa
|
|
auditory system order
|
1) PRESSURE WAVES ARRIVE AT OUTER EAR
2) MIDDLE EAR CONVERTS PRESSURE TO MECHANICAL VIBRATION 3)inner ear converts vibration to vibrations in fluid 4) nerve endings in the cochlea convert the sounds to neural impulses |
|
impedence mismatch
|
difference between air section and fluid section
|
|
total amplification by middle ear is _____ dB
|
30
|
|
why do we need middle ear
|
impedance matching
stapedial or acoustic reflex equalizes pressure |
|
vestibular system
|
motion and position
- semicircular canals and vestibule |
|
cochlea
|
sense of hearing
-basilar membrane, organ of corti, tectorial membrane |
|
cochlea is in ______ portion of temporal bone
|
petrous
|
|
______ is toxic to hair cells
|
perilymph
|
|
fluid change causes vibration in _____ and _____ membrane
|
basilar and reissners
|
|
in cochlear, high frequencies more _____
|
basal, oval window
|
|
in cochlea, low frequencies more ______
|
apical, helicotrema
|
|
most basilar membrane vibration closer to ______
|
base
|
|
primary mode of speech feedback is _____
|
auditory
|
|
technique to track articulators during speech segment:
|
electromagnetic articulography
|
|
technique to measure tongue contact with hard palate during speech:
|
electropalatography
|
|
Electropalatography is useful for which sounds:
|
alveolar and palatal
|
|
symptoms of dysarthria
|
slow movements, discoordination, inaccurate contact patters, undershoot
|
|
stuttering is characterized by
|
higher movement variability
|
|
if one identifies the _____ then the control of a movement is more readily understood
|
goal
|
|
The goal of speech is ______
|
communication
|
|
_________ is the medium to pass, evaluation, and currency of speech
|
acoustics
|
|
strengths of a dynamic MRI for speech production is:
|
view of the entire throat
|
|
Delayed Auditory Feedback (DAF) can create _____ in normal speakers, but ____ some who stutter
|
dysfluency, help
|
|
Delayed Auditory Feedback
|
delay the signal
|
|
Lombard Effect
|
a lot of background noise makes the person speak louder, reducing the noise makes the speak louder
|
|
4 sensory info
|
auditory, touch, motion, position
|
|
primary sense for speech is _____ secondary is______
|
auditory, somatosensory
|
|
auditory and somatosensory feedback are called ______ feedback
|
RE-afferent
|
|
three types of feedback are:
|
intrinsic, tactile and proprioceptive
|
|
intrinsic feedback:
|
auditory and somatosensory
|
|
tactile feedback:
|
touch from articulator contact, air pressure change in subglottal cavities,
|
|
Proprioceptive feedback:
|
sense of position, posture, velocity of movement by tiny sensors in joints tendons and muscle spindles
|
|
muscle spindles
|
when stretched activate and carry information to the brain
lots in jaw and tongue, few in orofacial |
|
closed loop feedback with negative feedback:
|
system perceives an error and then makes an adjustment to minimize the error
|
|
closed loop system of feedback is useful in (short/long) duration
|
long
|
|
open loop control theory
|
system executes a learned movement without dependence on feedback
|
|
vowels
|
most intense
minimal constriction identification relies on patterns of formant freqs, formant transitions to and from neighboring sounds, first and second formant |
|
diphthongs
|
phonated, high intensity
2 vowels in single syllable identify by rapid changing formant (higher f1 lower f2), direction and extent of formant freq change, rate of formant freq change |
|
Semi-vowels
|
manner: rapidly changing formants
place: changes in f2 and f3 |
|
stops: manner cues
|
silent closure interval
transient release burts rapid rise for initial and fall for final of f1 |
|
sops: place cues
|
f3 transition
patterns of f2 transitions frequency of most intense portion of release burst** |
|
acoustic cues of nasal consonant manner
|
low intensity formants caused by anti-resonances
nasal murmer at about 250 Hz preceding vowel for syllable final nasals |
|
acoustic cues to nasal consonant place of articulation
|
f2 transition:
from /m/ lowest and shortest from /n/ high and average duration from /ng/ highest variable and longest |