Use LEFT and RIGHT arrow keys to navigate between flashcards;
Use UP and DOWN arrow keys to flip the card;
H to show hint;
A reads text to speech;
129 Cards in this Set
- Front
- Back
What are the three portions of the vocal apparatus and what are their main functions? |
1. Lungs: respiration 2. Larynx: phonation 3. Vocal Tract: manipulation |
|
What does the sub-glottal system consist of? |
The lungs, trachea and bronchi |
|
What are the mechanisms of inhalation/expiration in the lungs? |
Inspiration: no musles in lungs, external intercostal muscles contract to increase superior lung volume, diaphragram contracts to increase inferior lung volume Expiration: elastic recoil force, passive, inspiration muscles still active to control speed |
|
What is the significance of pressure changes with regards to breathing? |
The pressure in the lungs must be lower than exterior pressure in order for inspiration to occur, increasing lung volume subsequently decreases lung pressure |
|
What are the internal intercostal muscles used for? |
Forced expiration |
|
Definition: Tidal Volume |
Regular volume during inspiration, ~3L at rest |
|
Definition: Vital Capacity |
The difference between volume at max inspiration and max expiration |
|
Definition: Cricoid Cartilage |
Sits on top of the trachea, circle, does not move, provides base for other cartilage structures |
|
Definition: Thyroid Cartilage |
Shield that sits on top of the cricoid cartilage, tilts to lengthen/shorten the vocal folds |
|
Definition: Arytenoid Cartilage |
Right and left, attaches to cricoid, open and close the glottis |
|
What is the significance of the hyoid bone? |
It is the base for the tongue and suspends the larynx |
|
What happens when the thyroid tilts forwards? |
The VF lengthen and therefore frequency of vibration increases |
|
M. Cricothyroideus |
Attachment: cricoid and thyroid Action: tilts thyroid |
|
M. Thyroarytaenoideus |
Attachment: thyroid and arytenoids Action: phonation, changes shape of VF |
|
M. Cricoarytaenoideus Lateralis |
Attachment: cricoid and arytenoids Action: rotate arytenoids inwards |
|
M. Cricoarytaenoideus Posterior |
Attachment: cricoid and arytenoids Action: externally rotate the arytenoids |
|
M. Interarytaenoideus |
Attachment: left and right arytenoids Action: open/close glottis |
|
Vocalis Muscle |
Fine pitch adjustment |
|
What causes the VF to open/close? |
The greater pressure below the VF cause an explosive opening, recoil force causes closing of VF |
|
What is the Bernoulli Force? |
Air blown between two folds creates a decrease in pressure therefore the two folds are sucked together (VF close from bottom to top) |
|
What is the main mechanism of pitch control? |
The tilting of the thyroid cartilage |
|
Is it easier to vibrate the VF in someone with a breathy or creaky voice? |
Breathy |
|
What are the three portions of the VT? |
The pharynx (laryngopharynx, oropharynx and nasopharynx), nasal tract and oral tract (lips, teeth, jaw, tongue, face) |
|
What is the epiglottis used for in speech production? |
It isn't. It is used to close the VT while swallowing. |
|
What is the significance of the mucous membrane that covers the pharynx and nasal cavities? |
The mucous membrane swallows and damps sound energy, making it quieter and less clear |
|
What are the 5 parts of the tongue? |
1. Tip 2. Blade 3. Body 4. Back 5. Root |
|
What is the difference between an intrinsic and an extrinsic tongue muscle? |
Intrinsic muscles originate and insert within the tongue (fast muscles, shape tongue), extrinsic muscles originate from bone and extend to tongue (slow muscles, move tongue) |
|
What are the 4 extrinsic tongue muscles and what are their origins and actions? |
1. Genioglossus: mandible, forward movement of the tongue 2. Hyoglossus: hyoid, retracts/depresses the tongue 3. Styloglossus: styloid process, lifts sides of tongue 4. Palatoglossus: palatine aponeurosis, elevates back of tongue |
|
What are the 3 intrinsic tongue muscles and what are their actions? |
1. Longitudinal (superior and inferior - most important): elevates, retratcs and deviate the tongue tip 2. Transverse: change tongue stiffness 3. Vertical: change tongue stiffness |
|
Give a brief explanation of a microphone's function. |
1. converts air pressure changes to continuous analog electrical signals 2. data represented by the computer is not continuous (many discrete signals) 3. discretization done by quantization |
|
Definition: Quantization Error |
Too little levels when quantizing causing errors in the reconstructed waveform |
|
Definition: Sampling Rate |
Regular interval where the analog sample is sampled (in Hz) |
|
How do you calculate sampling rate? |
Sampling Rate = Samples/Time (Hz) |
|
How do you calculate sampling interval? |
Sampling Interval = Samples / (Sampling Rate^-1) |
|
How do you calculate samples per period? |
Samples Per Period = Period / Sampling Interval |
|
How do you calculate period? |
Period = 1 / Frequency |
|
Definition: Tl:Dr |
The minimum sampling rate needed to capture all the info in a given sample |
|
What is the Nyquist Criterion? |
We can represent a signal as long as we sample at a rate at least twice the maximum F |
|
If the maximum F in a sample is 60Hz, what is the Nyquist F? |
120Hz |
|
What is signal aliasing? |
The reconstructed signal contains signals that were not part of the original signal |
|
What is bit-depth/quantization resolution? |
How many levels used to represent the amplitude |
|
What is the path of original signal to reconstructed signal? |
Original signal --> anti-aliasing filter --> analog-digital conversion (A/D) --> computer processing --> digital-analog conversion (D/A) --> reconstruction filter --> reconstructed signal |
|
What are the 5 types of acoustic signals? |
1. Pure tones (sine/cosine waves) 2. Periodic Signals 3. Quasi-Periodic Signals 4. Non-Periodic Signals (noise) 5. Non-Periodic Signals (impulse) |
|
What are the three characteristics used to describe pure tones? |
1. Frequency 2. Amplitude 3. Phase |
|
What additional characteristic is used to describe a complex signal? |
Timbre |
|
What is the Fourier Transformation? |
The representation of the signal in terms of the frequencies that make it up. |
|
When does constructive interference occur and what happens when it does occur? |
It occurs when two waves are in phase, amplitude is either increased or decreased |
|
What is the discrete Fourier transformation? |
The use of fixed signal lengths of input |
|
What is the fast Fourier transformation? |
The use of input lengths that is 2^n points long, take signal from time domain to frequency domain |
|
What is the frequency domain? |
Y-axis = power X-axis = frequency |
|
What is the time domain? |
Y-axis = amplitude X-axis = time |
|
How is are Fourier transformations represented graphically? |
On a line spectrum |
|
What is the fundamental frequency (F0)? |
Complex signals are made up of multiples of the F0, the F0 is the greatest common multiple of all F. It is the interval between lines on a line spectrum. |
|
What is the objective when windowing? |
To choose a window period as close to the actual signal's period as possible |
|
What effect does a narrow window have on temporal and spectral resolution? |
Increased temporal resolution, decreased spectral resolution |
|
What effect does a wide window have on temporal and spectral resolution? |
Decreased temporal resolution, increased spectral resolution |
|
What is the benefit of using a narrowband spectrogram? |
It has good F resolution and can distinguish between individual harmonics |
|
What is the benefit of using a wideband spectrogram? |
It has good temporal resolution, less visible harmonics but can see transitions between segments and individual glottal pulses |
|
What is linear predictive coding? |
Limits the amount of data needed because it the signal can be reconstructed with parameters of signal (predict future speech sounds), shows info from VT, not VF |
|
How many poles do you need to calculate a peak? |
2 |
|
What is the pressure of air? |
1000hPa |
|
What are the pressure variations between sound waves? |
600uPa - 2Pa |
|
What is the speed of sound? |
c = 340m/s (depends on the medium at which it travels through) |
|
Definition: Longitudinal Wave |
Particles travel in the same direction as the motion of the wave (sound) |
|
Definition: Transverse Wave |
Particles move in the opposite direction as the movement of the wave |
|
How does a microphone work? |
1. membrane stretched over compartment with constant pressure 2. variations of pressure outside the compartment cause the membrane to move in or out 3. the movement of the membrane generates small electrical signals that are recorded by the condenser coil |
|
What are the two main type of microphones? |
Electrodynamic and condenser (better) |
|
Define big and small capsule microphones and which is better? |
Big Capsule (>1/2 inch) - better Small Capsule (<1/2 inch) |
|
What is the difference between an omni-directional and a directional microphone? |
An omni-directional microphone records sound in all directions and a directional microphone records only in a certain direction |
|
What are on the X and Y axes of an oscillogram? |
Y-axis: amplitude X-axis: time |
|
Definition: Period |
The time it takes to complete one entire cycle |
|
Definition: Frequency |
The number of cycles per second (Hz) F = 1 / Period |
|
How is wavelength computed? |
WL = c x period |
|
What are 3 examples of a periodic signal? |
Glottal signal, sine/cosine waves, complex harmonic signals |
|
What are 2 examples of non-periodic signals? |
Noise and impulse |
|
What does P0 represent? |
The reference value for the decibel scale, considered the hearing threshold (0dB) |
|
What are the 2 types of omnidirectional microphones? |
1. Omnidirectional 2. Figure-8 |
|
What are the 2 types of directional microphones? |
1. Cardioid (180 degrees) 2. Supercardioid (180 degrees + 360 degrees in close proximity to mic) |
|
What are the 3 types of plosives? |
1. Voiced 2. Voiceless Unaspirated (plain) 3. Voiceless Aspirated |
|
What are the characteristic of a voiced plosive? |
1. Voicing present throughout the entire VCV sequence 2. No delay between constriction and following vowel |
|
What are the characteristics of a plain plosive? |
1. Absence of voicing during closure 2. No delay between release of constriction and onset of vowel 3. Holding phase where no energy is present |
|
What are the characteristics of a voiceless aspirated plosive? |
1. Absence of voicing during closure 2. Delay between release of constriction and onset of following vowel |
|
How does VOT relate two articulatory gestures? |
It relates the release of a plosive (oral) the with the onset of VF vibration (larynx) |
|
Definition: Voice-Onset Time |
The time between the release of a consonantal constriction (plosive) and the onset of VF vibration (expressed in ms) |
|
What are the values of VOT for the 3 types of plosives? |
1. Voiced (negative < plain and aspirated) 2. Plain (~20ms) 3. Voiceless Aspirated (>40ms) |
|
What does "dark" represent on a spectrogram? |
Closing of the glottis and increased amplitude |
|
What are the three mechanisms of airflow in the production of speech sounds? |
1. Pulmonic Airflow 2. Glottalic Airflow 3. Velaric Airflow |
|
What is the difference between ingressive and egressive airflow? |
Part of pulmonic airflow. Ingressive: inward airflow Egressive: outward airflow, used for speech production |
|
What is the difference between ejectives and implosives? |
Part of glottalic airflow. Ejective: outward airflow, raising of larynx ['] Implosives: inward airflow, lowering of larynx [hook] |
|
What is velaric airflow? |
A complete anterior or posterior constriction in the oral tract that is released suddenly, creating a "click" sound |
|
What are the 5 types of phonation? |
1. Voicelessness 2. Whisper 3. Breathy Voice 4. Creaky Voice 5. Modal Voice (normal) |
|
How are the VF positioned with a. whispering, b. breathy voice and c. creaky voice |
a. posterior opening near arytenoids [.] b. VF slightly separated and never fully adducted [..] c. VF adducted along posterior region, anterior region is slack [~ (on bottom)] |
|
Are voiced or voiceless plosives longer? |
Voiceless because voiced plosives are more difficult to make. |
|
What are the 3 source sounds? |
1. glottis 2. constriction in VT 3. vibration of articulators |
|
What is the filter? |
The VT |
|
What portion of the source-filter theory determines the voicing and presence of aspiration? |
The source |
|
Definition: Resonance Frequencies |
Preferred F at which a object oscillates when it is set into vibration |
|
What does resonance F depend on? |
The material and size of the object |
|
If a tube has one open end and one closed end, what are the resonances? |
1/4 wavelength: 1st wavelength = c / 4L |
|
If a tube has both ends closed/open, what are the resonances? |
1/2 wavelength: 1st wavelength = c / 2L |
|
What is the difference between a resonance and a formant? |
They are the same thing |
|
Definition: Damping |
How quickly an object will cease to oscillate (energy absorbed quicker in thicker objects) |
|
Definition: bandwidth/steepness |
How strongly other components (frequencies) are compressed in dB/octave |
|
Definition: Center F |
Where the signal begins to be attenuated |
|
What is a bandpass filter? |
Frequencies on the extremes are attenuated |
|
At what rate does the larynx spectrum decrease? |
12dB/octave |
|
What are the 1st, 2nd and 3rd resonances of the VT when a schwa is produced? |
1 - 500Hz 2 - 1500 Hz 3 - 2500 Hz |
|
What are formant F, F0 and harmonics characteristics of? |
Formant - VT F0 - source Harmonics - source |
|
Why is the formant chart organized the way it is? |
To visually represent the tongue placement when the vowel is produced |
|
What is on the X and Y axis of a formant chart? |
X-axis: second formant Y-axis: first formant |
|
What does the first formant represent? |
Tongue height |
|
What does the second formant represent? |
Tongue backness |
|
What does the third formant represent? |
Nothing in English, in languages that distinguish between rounding of vowels F2 and F3 become closer to each other |
|
What do formants 4 and 5 represent? |
Nothing specific, they are speaker specific |
|
How can you recognize a diphthong on a spectrogram? |
There is change in formant pattern |
|
What type of consonant has the greatest VT disturbance? |
Plosives |
|
What is the frequency range for bilabial bursts? |
500-1500Hz |
|
What is the F range for alveolar bursts? |
2500-4000Hz |
|
What is the F range for velar bursts? |
1500-2500Hz |
|
What are the parts and functions of the external ear? |
1. Pinna: localizes sound 2. Meatus: amplifies and directs sound 3. Tympanic Membrane: converts air signal to mechanical signal, amplifies sound |
|
What are the parts and functions of the middle ear? |
The ossicles (malleus, incus and stapes): transmits mechanical sound to the oval window where it is converted to fluid movement |
|
How much do the ossicles increase pressure? |
>20x |
|
How is sound attenuated in the middle ear? |
Tensing of muscles attached to ossicles to make it harder for them to vibrate |
|
What are the parts and functions of the inner ear? |
The cochlea (upper - scala vestibuli and lower - scala tympani separated by basilar membrane): change fluid movement into neural impulses |
|
How is F distributed on the basilar membrane? |
Base: high F Apex: low F |
|
Are there more inner or outer hair cells? |
Outer |
|
How many rows are the inner and outer hair cells in? |
1 and 3 respectively |
|
What F does the inner ear code at? |
20 - 20000Hz |
|
Do the inner or outer hair cells touch the tectorial membrane constantly? |
Outer hair cells |