Disadvantages Of Speech Reverberation

Improved Essays
Speech reverberation is the distortion of the sound by its delayed and attenuated copies. These speech copies originate from the reflections of surrounding walls or objects. This phenomenon of reverberation reduces speech intelligibility and degrades the performance of hearing aids and Automatic Speech recognition (ASR) systems [1]- [4]. The resulted speech distortion can be contributed by two distinct components of reverberation - the early reflections which cause the coloration of speech, and the late reflections that contribute to echo and other significant distortions. While both types of reverberation components cause speech deformation, it is the latter one that is found to be more detrimental in practice [5], [6]. In real environments, …show more content…
However, the learning-based algorithms are generally resource intensive and require a long time context, which makes them hard to implement in real time processing. In [16], Nakatani et al. proposed harmonicity-based dereverberation (HERB) methods, which modeled RIR inverse filters as a ratio of the direct path component to the received signal. The design of the inverse filters exploited the harmonicity characteristics of the speech signal and estimated the filter coefficients in two distinct methods - one method estimated the average filter that transformed reverberant signals into harmonic signals, while the other method used a minimum mean squared error criterion that evaluated the quasi-periodicity of target signals. HERB algorithms take relatively longer time to converge, which also makes them difficult to use in real time processing. Linear predictive multi-input equalization (LIME) algorithm was used in [17] to achieve muti-channel dereverberation. The whitened speech residuals from the LIME output was mixed with the estimation of source auto regressive polynomials to obtain clean …show more content…
A long-term multi-step linear predictionbased late reverberation signal estimation was used in SS by Kinoshita et al. in [1]. Wisdom et al. proposed speech coherence-based minimum mean square error (MMSE) log spectral amplitude estimator in [25]. Another variation of SS-based method was proposed by Cauchi et al. who incorporated temporal cepstrum smoothing [26]. Wu et al. estimated the late reverberation power spectrum using an asymmetrical smoothing window based on Rayleigh distribution [27]. Veras et al. extended Wu’s method in their formulation of speech derverberation in [28]. Kokkinakis et al. used variable subtraction factor as a function of the a posteriori signal to noise ratio (SNR) and evaluated the performance in cochlear implant devices [29]. However, most of the spectral enhancement techniques assume that the speech signal is orthogonal to the undesired signal, be it a random background noise or reverberation, and ignore any cross-term between the signal components. However, Yang et al. argued that the cross-term was not necessarily zero in all the scenarios and depended on the a priori SNR in practical cases involving white background noise

Related Documents

  • Improved Essays

    Essay On Stereophonics

    • 744 Words
    • 3 Pages

    Imagine storming the beach of Normandy, France on June 6, 1944. Shell fire from behind almost knocking you flat. Shrapnel flying from your left to your right in a second. Bursts of gunfire directly overheard. The ground shaking as you charge forward.…

    • 744 Words
    • 3 Pages
    Improved Essays
  • Improved Essays

    Tone Threshold Paper

    • 1025 Words
    • 5 Pages

    20.83 19.70 - 150 0.075 16.76 17.15 250 0.125 13.52 15.14 350 0.175 10.06 13.00 550 0.275 2.74 8.42 750 0.375 -4.89 3.60 Table 1 . Results The purpose of this research was to illustrate and compare the differences in shapes of auditory filters of two individuals. The signal to noise ratio (P/N0) represented in Table 1, is a ratio of the power in the signal (P) over the spectrum level of noise (N0) which remained at 35 decibels except in the notch. The P/N0 is given in decibels and is derived by multiplying the log of this ratio by 20.…

    • 1025 Words
    • 5 Pages
    Improved Essays
  • Improved Essays

    In the middle to late 1900’s Phil Spector was an American producer, musician, and songwriter whose popularity spiked to the top after developing the Wall Of Sound also called Spector’s Sound. Withstanding establishment of the common stereo and preferring to have various instruments combined in a single monaural track, Spector composed this technique at the Gold Star Studios in Los Angeles. Conforming relatively facile equalizations aided in his popularity because his approach was crucial to avoid sounding like the “nonsense” that many teenagers at that time listened to on transistor radio. He included two major things in his production of music; reverb, an electronically produced echo effect in recorded music, and layering, different instruments…

    • 477 Words
    • 2 Pages
    Improved Essays
  • Improved Essays

    Lrad Research Paper

    • 774 Words
    • 4 Pages

    First let start by saying what is the (LRAD) The Long Range Acoustic Device? Is a loud microphone or speaker to convey important message to loud noise making indidvuals and the environment. It is not a less lethal weapon. The LRAD get the attention of a crowd or a criminal is about to break the law by disturbing the peach, the LRAD will get the attention of the trouble maker, prevent injury, and save lives.…

    • 774 Words
    • 4 Pages
    Improved Essays
  • Improved Essays

    Room 101 Lab Report

    • 1050 Words
    • 5 Pages

    Aim - The aim of this experiment is to find out at what age the ability to hear high frequencies begin to degrade. Hypothesis - As people get older the ability to hear high frequencies is much lower than the ability to hear high frequencies as a child. Background - As people get older, their capability to hear degrades whereas younger people like children, have a much better hearing since adults have experienced loud noises throughout their lives which degrades their hearing a little. The U.S department of health and Human services further explain that when a noise/sound/frequency is heard, the parts of the ear (inner, middle and outer ear)…

    • 1050 Words
    • 5 Pages
    Improved Essays
  • Superior Essays

    Visual feedback is used to show the patients the level of loudness that they need to achieve. LVST focuses mostly on the respiratory system, resonance and phonation. Efficacy of the Study, Level and Study Design of Lee Silverman Voice…

    • 1555 Words
    • 7 Pages
    Superior Essays
  • Decent Essays

    Aspects of speech perception that were examined include the age of implantation effect…

    • 237 Words
    • 1 Pages
    Decent Essays
  • Improved Essays

    The most salient feature of silence and voice is that Kingston uses both in order to create an…

    • 733 Words
    • 3 Pages
    Improved Essays
  • Improved Essays

    Advantages Of Single Voice

    • 1328 Words
    • 6 Pages

    Coherent identity and single voice Through the human history, human beings have been establishing their own cultures in various ways. Since the ancient times, human developed the way how they can survive themselves and it made people to be together. By the time goes, people established community and it formed as the ‘country’. When the country formed in formal way, people started sharing their opinions, rules, instructions, and even their life styles. Among this processes, it has been settled as a certain way and people called it ‘culture’.…

    • 1328 Words
    • 6 Pages
    Improved Essays
  • Improved Essays

    As regards alpha power analysis in our study, there was no significant difference between both groups in the alpha power values during the baseline resting eyes-closed state denoting that the resting alpha power values could not be used in evaluation of cognitive dysfunction in children with BCECTS. By presenting the target tones, the alpha power values were significantly reduced over different brain regions in the healthy control group, unlike the epileptic children who showed less reduction in the alpha power values. Different participants of the two study groups reacted differently to the presentation of target tones, some showed alpha power reduction (ERD) while others showed alpha power enhancement (ERS) in the different brain regions, however those showing ERD were more than those showing ERS and this difference…

    • 484 Words
    • 2 Pages
    Improved Essays
  • Improved Essays

    Sonnenschein On Sound

    • 1896 Words
    • 8 Pages

    Sonnenschein, (2001), suggests that hearing is the first sense that we develop in our mother’s womb and the last one we lose before death. Our ears together offer a stereophonic reception, whilst providing distance and spatial perception and therefore our place in the world. However, we tend to downgrade the ear’s function to almost a reflex and only become aware of its significant role when the eyes cannot perceive the information provided. Still, this gives the opportunity to the sound designer to work with the audience’s subconscious. Sonnenschein (2001) gives an example to understand the function and structure of the ear.…

    • 1896 Words
    • 8 Pages
    Improved Essays
  • Great Essays

    I. Summary (1-2 paragraphs) The documentary Sound and Fury addresses the use of cochlear implants for individuals who are considered by a medical professional or speech and language pathologist as either deaf or hard-of-hearing. In this specific film, Heather, age 6, and Peter, who is almost 2 years of age, are individuals who, after the consultation of numerous respective occupations, believes could benefit from a cochlear implant. This documentary focuses on the fact that the implementation of a cochlear implant isn’t a simple process in terms of the decision to do so by the family to the actual procedure, as it needs to be surgically implanted. Throughout the documentary, numerous concerns are brought to light on the effects a cochlear…

    • 1821 Words
    • 8 Pages
    Great Essays
  • Improved Essays

    Effective Patient Advocate

    • 1705 Words
    • 7 Pages

    1.1 The General Practitioner as a therapeutic agent: Good communication between health providers and patients is the cornerstone of high quality, patient centered care. A caring attitude to the patient’s psychosocial/emotional needs is an important aspect of the patient experience and one that receives the greatest emphasis in the literature. Patient centered care is associated with higher rates of patient satisfaction, adherence to treatment and psychological and physical functioning.…

    • 1705 Words
    • 7 Pages
    Improved Essays
  • Great Essays

    Just as cell phones today have the capability of sending text messages to one another, so do standard household phones. With this text messaging available, the hearing impaired can communicate just as any other. Technology has made it capable to transmit not just the spoken word, but also the written word through telephone lines. Now that television shows and movies are equipped with the technology to include closed captioning, the hearing-impaired can view them. Listening devices can now be used with the telephone, TV, radio, or theaters.…

    • 1723 Words
    • 7 Pages
    Great Essays
  • Improved Essays

    from the work of Claude and Warren Weaver. Shannon in 1949; this three-part model was intended to capture radio and television transmission process. The three parts are: source, channel, and receiver. Shannon and Weaver also identify another component that can interfere while listening to a telephone call that is called noise. However, this model was adapted to human communication, and it has some useful parallels to public speaking.…

    • 956 Words
    • 4 Pages
    Improved Essays