Disadvantages Of Speech Reverberation

Improved Essays
Speech reverberation is the distortion of the sound by its delayed and attenuated copies. These speech copies originate from the reflections of surrounding walls or objects. This phenomenon of reverberation reduces speech intelligibility and degrades the performance of hearing aids and Automatic Speech recognition (ASR) systems [1]- [4]. The resulted speech distortion can be contributed by two distinct components of reverberation - the early reflections which cause the coloration of speech, and the late reflections that contribute to echo and other significant distortions. While both types of reverberation components cause speech deformation, it is the latter one that is found to be more detrimental in practice [5], [6]. In real environments, …show more content…
However, the learning-based algorithms are generally resource intensive and require a long time context, which makes them hard to implement in real time processing. In [16], Nakatani et al. proposed harmonicity-based dereverberation (HERB) methods, which modeled RIR inverse filters as a ratio of the direct path component to the received signal. The design of the inverse filters exploited the harmonicity characteristics of the speech signal and estimated the filter coefficients in two distinct methods - one method estimated the average filter that transformed reverberant signals into harmonic signals, while the other method used a minimum mean squared error criterion that evaluated the quasi-periodicity of target signals. HERB algorithms take relatively longer time to converge, which also makes them difficult to use in real time processing. Linear predictive multi-input equalization (LIME) algorithm was used in [17] to achieve muti-channel dereverberation. The whitened speech residuals from the LIME output was mixed with the estimation of source auto regressive polynomials to obtain clean …show more content…
A long-term multi-step linear predictionbased late reverberation signal estimation was used in SS by Kinoshita et al. in [1]. Wisdom et al. proposed speech coherence-based minimum mean square error (MMSE) log spectral amplitude estimator in [25]. Another variation of SS-based method was proposed by Cauchi et al. who incorporated temporal cepstrum smoothing [26]. Wu et al. estimated the late reverberation power spectrum using an asymmetrical smoothing window based on Rayleigh distribution [27]. Veras et al. extended Wu’s method in their formulation of speech derverberation in [28]. Kokkinakis et al. used variable subtraction factor as a function of the a posteriori signal to noise ratio (SNR) and evaluated the performance in cochlear implant devices [29]. However, most of the spectral enhancement techniques assume that the speech signal is orthogonal to the undesired signal, be it a random background noise or reverberation, and ignore any cross-term between the signal components. However, Yang et al. argued that the cross-term was not necessarily zero in all the scenarios and depended on the a priori SNR in practical cases involving white background noise

Related Documents

  • Improved Essays

    Modal Analysis Essay

    • 1479 Words
    • 6 Pages

    However, it may be difficult and expensive to excite the large civil structures such as bridges by artificial force. The levels of vibration under operational conditions may exceed the artificially induced vibration in this case. Therefore, another area of modal analysis known as OMA, starting in the last decade of twentieth century, has focused attention of civil engineers. OMA provides the identification of modal parameters of the structure being exited by unknown ambient force. The input is assumed as a stochastic process, also known as white noise, in this case.…

    • 1479 Words
    • 6 Pages
    Improved Essays
  • Great Essays

    Although, compromizing on time resolution in our case, STSF identified rotating stall frequency on waterfall plot and spectrogram but left a question mark on temporal resolution and hence in detecting the rotating deisturbance. AWT features both Fourier transform and wavelet transform. Suitable for high sampled data withiout pre-requisition of filters, AWT spectrogram provides an excelent temporal and spectral resolution of pre-stall and stall process however, it costs longer computational time than all other techniques. A comparison…

    • 1521 Words
    • 7 Pages
    Great Essays
  • Improved Essays

    Depending on the number of path vectors, that many transport equations are solved. One advantage of the DO model is that it takes into account the directional dependence of radiation and spans the complete range of optical thickness. So it can include the effects of anisotropy, semi-transparent walls, particulate effects, etc. A possible disadvantage of this model is that it becomes even more computationally intensive for finer angular discretization. The DO model is not used in the current case due to its inclusion of directional dependence of radiation intensity, which is not that in consideration here.…

    • 1613 Words
    • 7 Pages
    Improved Essays
  • Improved Essays

    For area the expected was 1.52, the volume 4.23, and for density 4.29. Finally I found the expected uncertainty buy using the formula (original value)*(expected percentage uncertainty)/100. The values for area, volume, and density…

    • 976 Words
    • 4 Pages
    Improved Essays
  • Improved Essays

    A unknown smoothing function is represented by the g(A_i ), and it is assumed to be constant across the pre- and posttest time periods (for further discussion of a smoothing parameter see Peng, 1999). The relationship between the assignment variable and the outcome variable during the pretest period are the foundation of this design, it allows for extrapolation beyond the assignment cut-off criterion in the posttest period (Wing and Cook,…

    • 1016 Words
    • 4 Pages
    Improved Essays
  • Superior Essays

    Verbal Aspect Analysis

    • 2252 Words
    • 10 Pages

    Dynamicity can be contrasted with stativity, the latter of which refers to the unlikelihood of a temporal situation to change (Yap et al., 2009). Telicity includes a natural endpoint, and indicates whether a temporal situation is complete or not. Durativity indicates duration of action, meaning how long or how briefly a temporal situation…

    • 2252 Words
    • 10 Pages
    Superior Essays
  • Superior Essays

    ROMAC Stability Test Fig

    • 830 Words
    • 4 Pages

    Using the idenitification method [Li], the first mode parameters are estimated (1B: f=85.3Hz, LogDec=0.726; 1F: 86.3Hz, LogDec=0.442). By comparing the prdicted and measured first forward mode parameters, we can drow conclusion that the force coeffients of bearings are reasonably…

    • 830 Words
    • 4 Pages
    Superior Essays
  • Decent Essays

    We are now interested to vary in order to observe the qualitative changes of solution trajectories near . Corollary 1. Assume that Theorems [3-7] are hold. Then there is…

    • 716 Words
    • 3 Pages
    Decent Essays
  • Great Essays

    It is suggested that direct labelling theory would cause errors in trials of cross category stimuli to be less than within category as there are already memorised verbal labels. He noted that the act of labelling a colour through perception of its being the most different would invalidate the results of the task. To summarise, he concluded that there is a weak Whorfian effect on lower level cognitive functioning, however if there is an effect, it would also be apparent in higher level cognitive functions. He suggests tests on memory storage as future…

    • 1355 Words
    • 6 Pages
    Great Essays
  • Improved Essays

    The goal of filter tuning chases every variant of the Kalman filter which can at best be minimized but not completely ignored if one desires to get near optimal solutions. Further it becomes difficult for one to infer if the performance of the variants of Kalman filter are due to their formulation or filter tuning! It should be remarked that in the best spirit of the estimation theory in particular the recursive Kalman filter approach even if X0, P0, Q, R and Q namely the initial states, their covariance, parameters in the state and measurement equations, the measurement and state process noise covariance are not available or inaccurately known the filter should still have the ability to estimate all the above from the ‘observables’ that are measured and commencing not too far from the proper estimates for the algorithm to converge. One would like to have the initial choice of all the unknowns should not be very critical. The filter should be self-consistent in estimating all the…

    • 785 Words
    • 4 Pages
    Improved Essays