1609.03213

Model: gemini-2.0-flash

# Relaxed Binaural LCMV Beamforming **Authors**: Andreas I. Koutrouvelis, Richard C. Hendriks, Richard Heusdens, Jesper Jensen ## Relaxed Binaural LCMV Beamforming Andreas I. Koutrouvelis, Richard C. Hendriks, Richard Heusdens and Jesper Jensen Abstract -In this paper we propose a new binaural beamforming technique which can be seen as a relaxation of the linearly constrained minimum variance (LCMV) framework. The proposed method can achieve simultaneous noise reduction and exact binaural cue preservation of the target source, similar to the binaural minimum variance distortionless response (BMVDR) method. However, unlike BMVDR, the proposed method is also able to preserve the binaural cues of multiple interferers to a certain predefined accuracy. Specifically, it is able to control the trade-off between noise reduction and binaural cue preservation of the interferers by using a separate trade-off parameter perinterferer. Moreover, we provide a robust way of selecting these trade-off parameters in such a way that the preservation accuracy for the binaural cues of the interferers is always better than the corresponding ones of the BMVDR. The relaxation of the constraints in the proposed method achieves approximate binaural cue preservation of more interferers than other previously presented LCMV-based binaural beamforming methods that use strict equality constraints. Index Terms -Binaural cue preservation, beamforming, hearing aids, LCMV, multi-microphone noise reduction, MVDR. ## I. INTRODUCTION Traditionally, hearing aids have been fitted bilaterally , i.e., the user wears a hearing aid on each ear, and the hearing aids are operating essentially independently of each other. As such, the noise reduction algorithm in each hearing aid estimates the signal of interest using only the recordings of the microphones from that specific hearing aid [9]. Such a setup with an independent multi-microphone algorithm per ear may severely distort the binaural cues since phase and magnitude relations of the sources reaching the two ears are modified [10]. This is harmful for the naturalness of the total sound field as received by the hearing-aid user. Ideally, all sound sources (including the undesired ones) that are present after processing should still sound as if originating from the original direction. This does not only lead to a more natural C OMPARED to normal-hearing people, hearing-impaired people generally have more difficulties in understanding a target talker in complex acoustic environments with multiple interfering sources. To improve speech quality and intelligibility, single-microphone (see e.g. [1] for an overview) or multimicrophone noise reduction algorithms (see e.g., [2] for an overview) can be used. While the former are mostly effective in improving speech quality and reducing listening effort, the latter are also effective in improving speech intelligibility [3]. Examples of multi-microphone noise reduction algorithms include the multi-channel Wiener filter (MWF) [4], [5], the minimum variance distrortionless response (MVDR) beamformer [6], [7], or, its generalization, the linearly constrained minimum variance (LCMV) beamformer [7], [8]. This work was supported by the Oticon Foundation and the Dutch Technology Foundation STW. perception of the acoustic environment, but can also lead to an improved intelligibility of a target speaker in certain cases; more specifically, in spatial unmasking experiments [11] it has been shown that a target speaker in a noisy background is significantly easier to understand when the noise sources are separated in space from the speaker, as compared to the situation where speaker and noise sources are co-located. The LCMV algorithm [7], [8] minimizes the output power of the noise under multiple linear equality constraints. One of these equality constraints is typically used to guarantee that the target source remains undistorted with respect to a certain reference location or microphone. The remaining constraints can be used as additional degrees of freedom in designing the final filter. For example, it can be used to steer nulls in the directions of the interferers [7], [12], or to broaden the beam towards the target source in order to avoid pointing error problems, also known as steering vector mismatches [13], [14]. A special case of the LCMV method is the minimum variance distortionless response (MVDR) beamformer, which only uses the distortionless constraint of the target source [6], [7]. Binaural hearing aids are able to wirelessly exchange microphone signals between hearing aids. This facilitates the use of multi-microphone noise reduction algorithms which combine all microphone recordings from both hearing aids, hence allowing the usage of more microphone recordings than with the bilateral noise reduction. As such, the increased number of microphone recordings can potentially lead to better noise suppression and, thus, to a higher speech intelligibility. Moreover, by introducing proper constraints on the beamformer coefficients, binaural cue preservation of the sources can be achieved. An alternative multi-microphone noise reduction method is the MWF which leads to the minimum mean square error (MMSE) estimate of the target source if the estimator is constrained to be linear, or, the target source and the noise are assumed to be jointly Gaussian distributed [15]. However, in [16]-[18], it was demonstrated that speech signals in time and frequency domains tend to be super-Gaussian distributed rather than Gaussian distributed. Thus, the MWF is generally not MMSE optimal. The MWF does not include a distortionless constraint for the target source and, thus, it generally introduces speech distortion in the output [4]. Several generalizations of the MWF have been proposed, among which the speech distortion weighted MWF (SDWMWF) [5], which introduces a parameter in the minimization procedure to control the trade-off between speech distortion and noise reduction. A well-known property of the MWF is the fact that it can be decomposed into an MVDR beamformer and a single-channel Wiener filter as a post-processor [19]. Notice that this holds in general, also if the filter is not constrained to be linear and the target source is not assumed to be Gaussian distributed [15]. The binaural version of the SDW-MWF (BSDWMWF) [21], [22] preserves the binaural cues of the target. However, it was theoretically proven that the binaural cues of the interferers collapse on the binaural cues of the target source [23] (i.e., after processing the binaural cues of the interferers become identical to the binaural cues of the target source). In [22], a variation of the BSDW-MWF (called BSDW-MWF-N) was proposed which tries to partially preserve the binaural cues of the interferers. This method inserts a portion of the unprocessed noisy signal at the reference microphones to the coresponding BSDW-MWF enhanced signals. The larger the portion of the unprocessed noisy signals, the lower the noise reduction, but the better the preservation of binaural cues of the interferers and vice versa. As such, this solution exhibits a trade-off between the preservation of binaural cues and the amount of noise reduction. In [24], a subjective evaluation of BSDW-MWF and BSDW-MWF-N shows that for a moderate input SNR indeed the subjects localized the processed interferer correctly with BSDW-MWF-N and incorrectly with BSDW-MWF. However, for a small input SNR the processed interferer was also localized correctly for BSDW-MWF. This is mainly due to the inaccurate estimates of the cross power spectral density (CPSD) matrix of the target, and due to masking effects when the processed target and processed interferer are represented to the subjects simultaneously [24]. In [25], two other variations of the BSDW-MWF were proposed. The first one is capable of preserving the binaural cues of the target and completely cancel one interferer. The second one is capable of accurately preserving the binaural cues of only one interferer, while distorting the binaural cues of the target. There are several binaural multi-microphone noise reduction methods known from the literature. These can be devided into two main categories [20]: a) methods based on the linearly constrained minimum variance (LCMV) framework and b) methods based on the multi-channel Wiener filter (MWF). Similarly to SDW-MWF, the BSDW-MWF can be decomposed into the binaural MVDR (BMVDR) beamformer and a single-channel Wiener filter [25]. The BMVDR can preserve the binaural cues of the target source, but the binaural cues of the interferers collapse to the binaural cues of the target source. In [26], [27], the binaural linearly constrained minimum variance (BLCMV) algorithm was proposed, which achieves simultaneous noise reduction and binaural cue preservation of the target source and multiple interferers. Unlike the BMVDR, the BLCMV uses two additional linear constraints per interferer to preserve its binaural cues. A fixed interference rejection parameter is used in combination with these constraints to control the amount of noise reduction. The BLCMV is thus capable of controlling the amount of noise reduction using two constraints per interferer. However, in hearing-aid systems with a rather limited number of microphones, the degrees of freedom for noise reduction are exhausted quickly when increasing the number of interferers. This makes the BLCMV less suitable for this application. In [28], a similar method to BLCMV, called optimal BLCMV (OBLCMV), was proposed which is able to achieve simultaneous noise reduction and binaural cue preservation of the target source and only one interferer. Unlike the BLCMV, the OBLCMV uses an optimal interference rejection parameter with respect to the binaural output SNR. In [29], [30] two independent works proposed the same LCMV-based method (we call it joint BLCMV (JBLCMV)) as an alternative to the BLCMV, which preserves the binaural cues of the target source and more than twice the number of interferers compared to the BLCMV [29]. Unlike the BLCMV, the JBLCMV requires only one linear constraint per interferer and, as a result, it has more degrees of freedom left for noise reduction. The linear constraints for the preservation of the binaural cues of the interferers have the same form as the linear constraint used in [25]. However, unlike the method in [25], the JBLCMV can preserve the binaural cues of a limited number of interferers and does not distort the binaural cues of the target source. The remainder of this paper is organized as follows. In Section II, the signal model and the notation are presented. In Section III the key idea of the binaural beamforming is explained and several existing binaural LCMV-based algorithms are summarized. In Sections IV and V, a novel nonconvex binaural beamforming problem and its iterative convex approximation are presented, respectively. In Section VI, the evaluation of the proposed algorithm is provided. Finally, in Section VII, we draw some conclusions. In this paper, we present an iterative, relaxed binaural LCMV beamforming method. Similar to the other binaural LCMV-based approaches, the proposed method strictly preserves the binaural cues of the target source. However, the proposed method is flexible to control the accuracy of binaural cue preservation of the interferers and, therefore, trade-off against additional noise reduction. This is achieved by using inequality constraints instead of the commonly used equality constraints. The task of each inequality constraint is the (approximate) preservation of the binaural cues of a single interferer in a controlled way. The proposed method is flexible to select a different value for the trade-off parameter of each interferer according to importance. The BMVDR and the JBLCMV can be seen as two extreme cases of the proposed method. On one hand, the BMVDR can achieve the best possible overall noise suppression compared to all the other aforementioned binaural LCMV-based methods, but causes full collapse of the binaural cues of the interferers towards the binaural cues of the target source. On the other hand, the JBLCMV can achieve the preservation of the maximum possible number of interferers compared to the other aforementioned binaural LCMV-based methods, but at the expense of less noise suppression. Unlike the JBLCMV and the BMVDR, the proposed method, is flexible to control the amount of noise suppression and binaural cue preservation according to the needs of the user. The relaxations used in the proposed method allow the usage of a substantially larger number of constraints for the approximate preservation of more interferers compared to all the other binaural LCMVbased methods including JBLCMV. ## II. SIGNAL MODEL AND NOTATION Assume for convenience that each of the two hearing aids consists of M/ 2 microphones, where M is an even number. Therefore, the microphone array consists of M microphones in total. The multi-microphone noise reduction methods considered in this paper operate in the frequency domain on a frame-by-frame basis. Let l denote the frame index and k the frequency-bin index. Assume that there is only one target source and there are r interferers. The k -th frequency coefficient of the l -th frame of the j -th microphone noisy signal, y j ( k, l ) , j = 1 , · · · , M , is given by  where - s ( k, l ) denotes the target signal at the source location. - a j ( k, l ) is the acoustic transfer function (ATF) of the target signal with respect to the j -th microphone. - u i ( k, l ) , is the i -th interfering signal at the source location. - b ij ( k, l ) is the ATF of the i -th interfering signal with respect to the j -th microphone. - n ij ( k, l ) is the i -th received interfering signal at the j -th microphone. - x j ( k, l ) is the received target signal at the j -th microphone. - v j ( k, l ) is additive noise at the j -th microphone. Here we use in the signal model the ATFs for notational convinience. However, note that the ATFs can be replaced with relative acoustic transfer functions (RATF)s which can often be identified easier than the ATFs [12], [20]. In the remainder of the paper, the frequency and frame indices are neglected to simplify the notation. Using vector notation, Eq. (1) can be written as  where y ∈ C M × 1 , x ∈ C M × 1 , n i ∈ C M × 1 and v ∈ C M × 1 are the stacked vectors of the y j , x j , n ij , v j (for j = 1 , · · · , M ) components, respectively. Moreover, x = a s and n i = b i u i , where a ∈ C M × 1 and b i ∈ C M × 1 are the stacked vectors of the a j and b ij (for j = 1 , · · · , M ) components, respectively. Assuming that all sources and the additive noise are mutually uncorrelated, the CPSD matrix P y = E [ yy H ] of y is given by where  - P n i = E [ n i n H i ] = p u i b i b H i ∈ C M × M is the CPSD matrix of n i , with p u i = E [ | u i | 2 ] the PSD of u i . - P x = E [ xx H ] = p s aa H ∈ C M × M is the CPSD matrix of x , with p s = E [ | s | 2 ] the power spectral density (PSD) of s . - P v = E [ vv H ] ∈ C M × M is the CPSD matrix of v . · P = r ∑ i =1 P n i + P v is the total CPSD matrix of all disturbances. ## III. BINAURAL BEAMFORMING Binaural multi-microphone noise reduction methods aim at the simultaneous noise reduction and binaural cue preservation of the sources. In order to preserve the binaural cues, two different spatial filters ˆ w L ∈ C M × 1 and ˆ w R ∈ C M × 1 , are applied to the left and right hearing aid, respectively, where constraints can be used to guarantee that certain phase and magnitude relations between the left and right hearing aid outputs are preserved. Note that both spatial filters use all microphone recordings from both hearing aids. Assume for convenience and without loss of generality that the reference microphone for the left and right hearing aid is indexed as j = 1 and j = M , respectively. In the sequel of the paper, for ease of notation, the reference terms of Eq. (1) use the subscripts L and R instead of j = 1 and j = M , respectively. The two enhanced output signals at the left and right hearing aids are then given by  In Section III-A, objective measures for the preservation of binaural cues are presented. In Sections III-C-III-F, the binaural MVDR (BMVDR), the binaural LCMV (BLCMV), the optimal BLCMV (OBLCMV), and the JBLCMV are reviewed, respectively. All reviewed methods are special cases of the general binaural LCMV (GBLCMV) framework, presented in Section III-B. Finally, the basic properties of all reviewed methods are summarized in Section III-G. ## A. Binaural Cues The extent to which the binaural cues of a specific source are preserved can be expressed using the input and output interaural tranfer function (ITF) [31]. Often the ITF is decomposed into its magnitude, describing the interaural level differences (ILDs) and its phase, describing the interaural phase differences (IPDs). The input and output ITFs of the i -th interferer are defined as [31]  The input and output ILDs are defined as [31]  The input and output IPDs are given by [31]  Note that frequently, the IPDs are converted and measured as time delays [32], i.e., interaural time differences (ITDs). The IPDs and ILDs are the dominant cues for binaural localization for low and high frequencies, respectively [33]. Typically, the IPDs become more important for frequencies below 1 kHz, while ILDs become more important for frequencies above 3 kHz [33]. In [34] it was experimentally shown that for broadband signals, the IPDs are perceptually much more important than the ILDs for localizing a source. More specifically, it was shown that the low frequency IPDs play the most important role perceptually for correct localization. Based on this observation several proposed multi-microphone noise reduction techniques [32], [35] leave the low frequency content of the noisy measurements unprocessed, and process only the higher frequency content. Unfortunately, if a large portion of the power of the noise is concentrated at low frequencies, the noise reduction capabilities are reduced significantly. Therefore, in this paper we aim at the simultaneous preservation of binaural cues of all sources and noise reduction at all frequencies. A binaural spatial filter, w = [ w T L w T R ] T , exactly preserves the binaural cues of the i -th interferer if ITF in n i = ITF out n i [31]. Exact preservation of ITFs also implies preservation of ILDs and IPDs [31], i.e., ILD in n i = ILD out n i and IPD in n i = IPD out n i . Non-exact preservation of binaural cues implies that there is some positive ITF error given by  Moreover, non-exact presevation of binaural cues implies that there is some ILD and/or IPD errors, given by  where 0 ≤ T n i ≤ 1 [31]. Eqs. (5), (6), (7), (8) and (9) apply also for the target source x . As it will become obvious in the sequel, for all methods that will be discussed in this paper, the errors in Eqs. (8), (9) with respect to the target source are always zero. As explained before, the IPD error is perceptually more important measure for binaural localization than the ILD error for broadband signals (such as speech signals contaminated by broad-band noise signals), because the IPDs are perceptually more important than the ILDs for this category of signals. Moreover, the IPD error is perceptually more informative at low frequencies, while the ILD error is perceptually more informative at high frequencies. ## B. General Binaural LCMV Framework All binaural LCMV-based methods discussed in this section are based on a general binaural LCMV (GBLCMV) 1 framework which is the binaural version of the classical LCMV framework [7], [8]. The GBLCMV minimizes the sum of the left and right output noise powers under multiple linear equality constraints. That is,  where ˆ w GBLCMV = [ ˆ w T GBLCMV ,L ˆ w T GBLCMV ,R ] T ∈ C 2 M × 1 , Λ ∈ C 2 M × d is assumed to be a full column rank matrix (i.e., rank ( Λ ) = d ), f ∈ C d × 1 , d is the number of linear equality constraints, and  1 We used the word general in order to distinguish it from the BLCMV method [26], [27]. Similarly to the classical LCMV framework [7], [8], if d ≤ 2 M , and Λ is full column rank, the GBLCMV has a closedform solution given by In GBLCMV, the total number of degrees of freedom devoted to noise reduction is DOFGBLCMV = 2 M -d . Note that in the special case where d = 2 M , there are no degrees of freedom left for controlled noise reduction, i.e., ˆ w GBLCMV cannot reduce the objective function of the GBLCMV problem in a controlled way. Finally, if d > 2 M , the feasible set { w : w H Λ = f H } is empty and the GBLCMV problem has no solution. In conclusion, the matrix Λ has to be 'tall' (i.e., d < 2 M ), to be able to simultaneously achieve controlled noise reduction and satisfy the constraints of the GBLCMV problem. The maximum number of constraints that the GBLCMV framework can handle, while achieving controlled noise reduction, is d max = 2 M -1 , i.e., there should be always left at least one degree of freedom for noise reduction. Generally, the more degrees of freedom (i.e., the larger DOFGBLCMV), the more controlled noise reduction can be achieved.  The set of linear constraints of the GBLCMV framework in Eq. (10) can be devided into two parts,  The first part consists of two distortionless constraints w H L a = a L and w H R a = a R which preserve the target source at the two reference microphones. This can be written compactly as   where All binaural methods discussed in this section are special cases of the GBLCMV framework and they share the constraints in Eq. (14), while the constraints w H Λ 2 = f H 2 are different. In the sequel of the paper we use the term m ( m max ) to indicate the number (maximum number) of interferers that a special case of the GBLCMV framework can preserve, while at the same time achieving controlled noise reduction. Recall that controlled noise reduction means that there is at least one degree of freedom left for noise reduction. Moreover, m max ≤ r which means that some methods may be unable to preserve all simultaneously present interferers of the acoustic scene, because there are not enough available degrees of freedom. ## C. BMVDR The BMVDR beamformer [30] can be formulated using the combination of the following two beamformers   with closed-form solutions  The BMVDR is the simplest special case of the GBLCMV framework in the sense that it has the minimum number of constraints ( d = 2 ) given by Eq. (14). Specifically, the two optimization problems in Eqs. (15) and (16) can be reformulated as the following joint optimization problem,  where ˆ w BMVDR = [ ˆ w T BMVDR ,L ˆ w T BMVDR ,R ] T ∈ C 2 M × 1 . Since, the BMVDR algorithm has the minimum possible number of constraints, the total number of degrees of freedom which can be devoted to noise reduction is DOFBMVDR = 2 M -2 . The BMVDR beamformer preserves the binaural cues of the target source, but distorts the binaural cues of all the interferers [30], i.e., m max = 0 . More specifically, after beamforming, the binaural cues of the interferers collapse on the binaural cues of the target source. It can be easily shown [30] that the binaural cues of the target source are preserved due to the satisfaction of the two distortionless constraints of the problems in Eqs. (15) and (16). That is,  Therefore, the ITF error is E x , BMVDR = 0 . Furthermore, it can be easily shown that the binaural cues of the interferers collapse to the binaural cues of the target source [30]. More specifically, the ITF in n i is given by  while ITF out n i is given by  Thus, after beamforming, the interferers will have the same ITF as the target source and their ITF error is given by ## D. BLCMV  Another special case of the GBLCMV framework is the binaural linearly constrained minimum variance (BLCMV) beamformer [26], [27] which, unlike the BMVDR, uses additional constraints for the preservation of the binaural cues of m interferers. The left and right spatial filters of the BLCMV are given by [26], [27]  and  where the constraints w H L a = a L and w H R a = a R are the two common distortionless constraints used in all special cases in the GBLCMV framework, while the constraints w H L b i = η L b iL and w H R b i = η R b iR , for i = 1 , . . . , m , aim at a) preserving the binaural cues and b) supressing the m interferers. The amount of supression is controlled via the interference rejection parameters η L and η R which are predefined ( 0 ≤ η L , η R < 1 ) real-valued scalars. Binaural cue preservation is achieved only if η = η L = η R [26], [28]. The two optimization problems in Eqs. (23) and (24) can be compactly formulated as a joint optimization problem. That is,  where and   The available degrees of freedom for noise reduction are DOFBLCMV = 2 M -d = 2 M -2 m -2 . Since d max = 2 M -1 (see Section III-B), BLCMV can simultaneously achieve controlled noise suppression and binaural cue preservation of at most m max = M -2 interferers. The ITF errors of the target source and of the m interferers that are included in the constraints are zero, i.e., E x , BLCMV = 0 and E n i , BLCMV = 0 , for i = 1 , · · · , m ≤ r . However, if some interferers are not included in the constraints, their ITF error will be non-zero, i.e., E n i , BLCMV > 0 , for i = m +1 , · · · , r . ## E. OBLCMV The OBLCMV [28] can be seen as a special case of the BLCMV (and, hence, the GBLCMV) since it solves the same optimization problem. However, it preserves the binaural cues of only one interferer (e.g., the k -th interferer) using an optimal complex-valued interference rejection parameter ˆ η = ˆ η L = ˆ η R with respect to the binaural output SNR (defined in Sec. VI-B2). More specifically, OBLCMV solves the optimization problem in Eq. (25) where Λ and f T , are given by [28]  where 1 ≤ k ≤ r . The available degrees of freedom for noise reduction are DOFOBLCMV = 2 M -4 . The ITF errors of the target source and of the k -th interferer that are included in the constraints are zero, i.e., E x , OBLCMV = 0 and E n k , OBLCMV = 0 . However, the binaural cues of all the other r -1 interferers will be distorted, i.e., E n i , BLCMV > 0 , for i ∈ { 1 , · · · , r } - { k } . ## F. JBLCMV Recall from Section III-A that preserving binaural cues of the i -th interferer implies that the following constraint has to be satisfied  which can be reformulated as:  Compared to (O)BLCMV this unified constraint reduces the number of constraints, used for binaural cue preservation, by a factor 2. As a result, for a given number of interferers, more degrees of freedom can be devoted to noise reduction. The JBLCMV [29], [30] uses this type of equality constraints for the preservation of the binaural cues of m interferers. More specifically, the JBLCMV problem is given by  where  and w JBLCMV = [ w T JBLCMV ,L w T JBLCMV ,R ] T . Moreover, Similarly to all other special cases of the GBLCMV framework, w H Λ 1 = f H 1 is used for the exact binaural cue preservation of the target source, while w H Λ 2 = f H 2 is used for the preservation of the binaural cues of m interferers.  The JBLCMV can simultaneously achieve controlled noise reduction and binaural cue preservation of up to m max = 2 M -3 interferers [29]. Moreover, the degrees of freedom devoted to noise reduction is DOFJBLCMV = 2 M -m -2 . ## G. Summary of GBLCMV methods We summarize some of the properties of the methods discussed in Section III. Table I gives an overview of two important factors: a) the maximum number of interferers' binaural cues that can be preserved while achieving controlled noise reduction m max, and b) the degrees of freedom (DOF) available for noise reduction. The following conclusions can be drawn from this table: - The BMVDR has the maximum DOF, which means that it can achieve the best possible noise reduction. It ## TABLE I SUMMARY OF A) MAXIMUM NUMBER OF INTERFERERS' BINAURAL CUES THAT CAN BE PRESERVED WHILE ACHIEVING CONTROLLED NOISE REDUCTION ( m MAX ), AND B) NUMBER OF AVAILABLE DEGREES OF FREEDOM FOR NOISE REDUCTION (DOF). ALL METHODS ARE SPECIAL CASES OF THE GBLCMV FRAMEWORK. M IS THE TOTAL NUMBER OF MICROPHONES, AND m IS THE NUMBER OF THE CONSTRAINED INTERFERERS. | Method | m max | DOF | |-------------------|---------|---------------| | BMVDR [30] | 0 | 2 M - 2 | | BLCMV [27] | M - 2 | 2 M - 2 m - 2 | | OBLCMV [28] | 1 | 2 M - 4 | | JBLCMV [29], [30] | 2 M - 3 | 2 M - m - 2 | preserves the binaural cues of the target source, but not the binaural cues of the interferers. - Unlike (O)BLCMV which uses two constraints per interferer, JBLCMV uses only one constraint per interferer. Therefore, JBLCMV can preserve the binaural cues of more interferers, or equivalently, given the same number of interferers it has more available degrees of freedom devoted to noise reduction. In this paper, if the number of simultaneously present interferers is r > m max , the extra interferers r -m max are not included in the constraints in the GBLCMV methods, in order to always have one degree of freedom left for controlled noise reduction. ## IV. PROPOSED NON-CONVEX PROBLEM In this section, we present a general optimization problem of which BMVDR and JBLCMV are special cases. More specifically, we relax the constraints on the binaural cues of the interferers, while keeping the strict equality constraints on the target source (i.e., w H Λ 1 = f H 1 ). The relaxation allows to trade-off the amount of noise reduction and binaural cue preservation per interferer in a controlled way. The proposed optimization problem is defined as  The inequality constraints bound the ITF error (see Eq. (8)), for the interferers i = 1 , · · · , m to be less than a positive tradeoff parameter e i , i = 1 , · · · , m . These inequality constraints will be transformed, in the sequel of this section (see Eqs. (34), (35)), in such a way that they can be viewed as relaxations of the strict equality constraints in Eq. (28) used in the JBLCMV method. Note that the proposed method is flexible to choose a different e i for every interferer according to its importance. For instance, maybe certain locations are more important to be preserved than others and, therefore, a smaller e i must be used. The trade-off parameter, e i , is selected as  where 0 ≤ c i ≤ 1 controls the amount of binaural cue collapse towards the target source, and the amount of noise reduction of the i -th interferer. If c i = 1 , ∀ i is used in the optimization problem in Eq. (32), then ˆ w = ˆ w BMVDR which is seen as a worst case, with respect to binaural cue preservation, because there is total collapse of binaural cues of the interferers towards the binaural cues of the target source. If c i = 0 , ∀ i we have perfect preservation of binaural cues of the m interferers, and ˆ w = ˆ w JBLCMV. Without any loss of generality, for notational convenience, we assume that the binaural cues of all interferers are of equal importance and, therefore, c i = c, ∀ i . Moreover, we keep c fixed over all frequency bins. It is worth noting that other strategies for choosing c may exist, which might lead to a better tradeoff between maximum possible noise reduction and perceptual binaural cue preservation. As explained in Section III-A, low frequency content is perceptually more important for binaural cue preservation than high frequency content. Thus, smaller c values for low frequencies and larger c values for higher frequencies may give a better perceptual trade-off. The problem in Eq. (32) is not a convex problem and it is hard to solve. In Section V we propose a method that approximately solves the non-convex problem in an iterative way by solving at each iteration a convex problem. ## V. PROPOSED ITERATIVE CONVEX PROBLEM By doing some simple algebraic manipulations, the optimization problem in Eq. (32) can equivalently be written as  Furthermore, the optimization problem in Eq. (34) can be rewritten as  We approximately solve the non-convex problem in Eq. (35) in an iterative way using w H R of the previous iteration in f 2 ,i , i = 1 , · · · , m . The new iterative problem is convex at each iteration and is given by where Λ 2 ,i is the i -th column of Λ 2 in Eq. (30).  where ˆ w ( k ) = [ ˆ w T L, ( k ) ˆ w T R, ( k ) ] T is the estimated binaural spatial filter of the k -th iteration, which is initialized as ˆ w (0) = ˆ w BMVDR. Similarly to other existing minimum variance beamformers with inequality constraints [36], [37], the convex optimization problem in Eq. (36) can be equivalently written as a second order cone programming (SOCP) problem with equality and inequality constraints (see Appendix) and it can be solved efficiently with interior point methods [38].  The ITF error of the i -th interferer at the k -th iteration is given by ∣ ∣ This iterative method is stopped when all the constraints of the original problem in Eq. (32) are satisfied. The stopping criterion that we use for the proposed iterative method is given by  The termination of the proposed iterative method may need a large amount of iterations because of the fixed c in Eq. (36). The reason for this is explained in detail in Section V-A. To control the speed of termination we replace in Section V-B the fixed c in Eq. (36) with a decreasing parameter τ ( k ) (initialized with τ (0) = c ) which controls the speed of termination. In Section V-C we show under which conditions the proposed algorithm guarantees that it will find a feasible solution satisfying the stopping criterion in Eq. (38) in a finite number of iterations. An overview of the proposed method using the adaptive τ ( k ) is given in Algorithm 1. where e i ( c ) is given in Eq. (33). Recall that f 2 = 0 (i.e., f 2 ,i = 0 , ∀ i ) is used in JBLCMV. Unlike JBLCMV, the proposed method uses f 2 ,i, ( k ) ≥ 0 , ∀ i and, therefore, the constraints dedicated for the preservation of binaural cues of the interferers are seen as relaxations of the strict equality constraints of the JBLCMV method. These relaxations enlarge the feasible set of the problem, allowing more constraints to be used compared to JBLCMV. The JBLCMV can be seen as a special case of the proposed method for c = 0 , f 2 ,i, (1) = 0 , i = 1 , · · · m . In this case, the relaxed constraints in the proposed method become identical to the strict constraints of the JBLCMV. Hence, the JBLCMV needs to run only one iteration of the problem in Eq. (36). If c = 0 , the proposed method follows the same strategy for handling r > m max simultaneously present interferers as in Section III-G. However, if c > 0 , then there is a typically large, difficult to predict m max 2 , due to the inequality constraints and, therefore, the proposed method uses m = r, ∀ r constraints for the preservation of the binaural cues of all simultaneously present interferers. Finally, if c = 1 , the proposed method does not iterate and stops immediately giving as output the initialization ˆ w (0) = ˆ w BMVDR. ## A. Speed of Termination The proposed iterative method may have slow termination due to the fixed choice of c . In this section we explain the reason and in Section V-B we explain how to control the speed of termination. 2 The feasible set of the proposed method typically reduces by adding more inequality constraints. However it is difficult to predict after how many constraints, m , it becomes empty, i.e., what is the value of m max. Let Φ ( k ) denote the convex feasible set in the k -th iteration of the iterative optimization problem in Eq. (36) given by   and Ψ( c ) the non-convex feasible set of the original nonconvex problem of Eqs. (32), (33) given by where ˆ w JBLCMV ∈ Ψ(0) , and Ψ(0) ⊆ Ψ( c ) , 0 ≤ c ≤ 1 and, therefore, ˆ w JBLCMV ∈ Ψ( c ) , 0 ≤ c ≤ 1 . In words, ˆ w JBLCMV is the feasible solution that belongs to Ψ(0) and gives the minimum output noise power. Note that the proposed iterative method will typically try to find a solution on the boundary of Φ ( k ) . Some parts of the boundary of Φ ( k ) will be inside or on the boundary of Ψ( c ) , while other parts can be outside the set Ψ( c ) . Therefore, it is possible that the estimated ˆ w ( k ) will be outside of Ψ( c ) (see Fig. 1(a) for instance). In this case, obviously, the stopping criterion is not satisfied and, therefore, the problem goes to the next iteration. In the next iteration, Φ ( k +1) changes and a new ˆ w ( k +1) is estimated which can be again outside of Ψ( c ) (see Fig. 1(a) for instance). This repetition can happen many times leading to a very slow termination because the new estimate ˆ w ( k +1) is not selected according to a binauralcue error descent direction. To avoid this undesirable situation, we propose in Section V-B to replace the fixed c in Eq. (36) with an adaptive reduction parameter τ ( k ) , in order to make sure that solutions that are on the boundary of Φ ( k ) and that are outside Ψ( c ) will progressively provide a reduced binauralcue error, i.e., to move towards the direction of the interior of Ψ( c ) (see Fig. 1(b) for instance). Note that the Φ ( k ) changes for every next iteration, while Ψ( c ) is constant over time. We can think of Φ ( k ) as a convex approximation set of Ψ( c ) at iteration k (see a simplistic example of the two sets in Fig. 1(a)). ## B. Avoiding Slow Termination The termination of the proposed iterative method may need a large amount of iterations because of the fixed c in Eq. (36), as explained in Section V-A. Therefore, the replacement of c with an adaptive reduction parameter τ ( k ) only in Eq. (36) is useful for guaranteed termination within a pre-selected finite maximum number of iterations, k max . More specifically, the new adaptive reduction parameter that we use in Eq. (36) instead of c is given by  where τ (0) = c is selected according to the initial desired amount of collapse of binaural cues in the original non-convex problem in Eqs. (32), (33). The step α ( k max ) controls the speed of termination (i.e., how fast the stopping criterion will be satisfied), and is a function of the maximum allowed number of iterations for termination given from the user. That is,  ``` ``` Note that we replace c with τ ( k ) only in Eq. (36) and not in the stopping criterion in Eq. (38). This is because, the stopping criterion is based on the fixed feasible set Ψ( c ) of the nonconvex problem in Eq. (32) which should remain constant over iterations (see an example of two consecutive iterations in Fig. 1). Moreover, the τ ( k ) is always non-negative, because τ ( k max ) = 0 . Small k max , speeds up the reduction of τ ( k ) and, thus, it also speeds up the termination of the proposed method. Of course a very small k max can lead to a feasible solution, ˆ w ( k ) , for which ∑ i E n i , ( k ) /lessmuch ∑ i e i ( c ) , i.e., to be far away from the boundary of Ψ( c ) . This means that ˆ w ( k ) provides better binaural cue preservation than the desired amount of binaural cue preservation, e i ( c ) . As a result, there will be less noise suppression. Ideally, we would like to arrive as close as possible to the controlled trade-off between noise reduction and binaural cue preservation given by our initial specifications (i.e., amount of collapse). Therefore, a careful choice of k max is needed in order to find a feasible solution ˆ w ( k ) that: - to terminate as fast as possible. - achieves a binaural-cue error ∑ i E n i , ( k ) ≈ ∑ i e i ( c ) , i.e., to be as close as possible to the boundary of Ψ( c ) . Of course there is a trade-off between the two goals. ## C. Guaranteed Termination In this section, we prove that the proposed iterative method using the adaptive reduction parameter in Eq. (41) guarantees termination, simultaneous controlled approximate binaural cue preservation, and controlled noise reduction, in at most k max iterations, for a limited number of interferers m ≤ 2 M -3 . Fig. 1. Simplistic visualization of two successive iterations ( k and k +1 ) of the proposed method with (a) a fixed c , (b) a reducing τ ( k ) . In k +1 iteration the stopping criterion is satisfied in (b). On the contrary, in (a) the stopping criterion is not satisfied, because ˆ w ( k +1) / ∈ Ψ( c ) . <details> <summary>Image 1 Details</summary> ![fd816ddf](/v1/image/fd816ddf9783a713a278bbd08e55d35b7d16ef0efe82813788b68f6ae2ab58f7) ### Visual Description ## Diagram: Shape Approximation ### Overview The image shows two diagrams, labeled (a) and (b), illustrating the approximation of a shape using iterative methods. Each diagram contains three shapes: a blue dashed line representing Ψ(c), a red solid line representing Φ(k), and a black dashed line representing Φ(k+1). Additionally, each diagram includes a red circle marker representing w^(k) and a black star marker representing w^(k+1). The diagrams show how the shape Φ is updated from iteration k to k+1, converging towards the shape Ψ. ### Components/Axes * **Shapes:** * Ψ(c): Represented by a blue dashed line. * Φ(k): Represented by a red solid line. * Φ(k+1): Represented by a black dashed line. * **Markers:** * w^(k): Represented by a red circle. * w^(k+1): Represented by a black star. * **Labels:** * (a): Label for the left diagram. * (b): Label for the right diagram. * **Legend (Top-Left of each diagram):** * Blue dashed line: Ψ(c) * Red solid line: Φ(k) * Black dashed line: Φ(k+1) * Red circle: w^(k) * Black star: w^(k+1) ### Detailed Analysis **Diagram (a):** * The blue dashed line Ψ(c) forms an irregular polygon. * The red solid line Φ(k) is a polygon that approximates Ψ(c). * The black dashed line Φ(k+1) is a polygon that is closer to Ψ(c) than Φ(k). * The red circle w^(k) is located on the lower-left side of the shape. * The black star w^(k+1) is located very close to the red circle w^(k), slightly below and to the right. **Diagram (b):** * The blue dashed line Ψ(c) forms an irregular polygon. * The red solid line Φ(k) is a polygon that approximates Ψ(c). * The black dashed line Φ(k+1) is a polygon that is closer to Ψ(c) than Φ(k). * The red circle w^(k) is located on the lower-left side of the shape. * The black star w^(k+1) is located very close to the red circle w^(k), slightly to the right. ### Key Observations * In both diagrams, Φ(k+1) is a better approximation of Ψ(c) than Φ(k). * The markers w^(k) and w^(k+1) are located close to each other, indicating a small adjustment in the iterative process. * The shapes in diagram (b) appear to be slightly rotated compared to diagram (a). ### Interpretation The diagrams illustrate an iterative process for approximating a shape (Ψ(c)) using successive approximations (Φ(k) and Φ(k+1)). The markers w^(k) and w^(k+1) likely represent control points or parameters that are adjusted in each iteration to improve the approximation. The fact that Φ(k+1) is closer to Ψ(c) than Φ(k) suggests that the iterative process is converging towards the target shape. The slight rotation of the shapes in diagram (b) compared to diagram (a) could indicate a different starting point or a different set of parameters for the iterative process. </details> Nevertheless, our simulation experiments (see Section VI-C) show that our algorithm a) is capable of simultaneously achieving controlled approximate binaural cue preservation and, in most cases, controlled noise reduction of more interferers than 2 M -3 for c > 0 , and b) finds a feasible solution in much fewer iterations, on average, than k max , for k max = 10 , 50 . Theorem 1: If m ≤ 2 M -3 , the proposed method a) will always find a solution in a finite number of iterations k ≤ k max satisfying the stopping criterion of Eq. (38), and b) will always have a bounded ITF error, i.e., The adaptive decreasing of τ ( k ) (see Eq. (41)) results in an adaptive shrinking of Φ ( k ) . Therefore, in the case where the estimated ˆ w ( k ) will be outside of Ψ( c ) , the stopping criterion is not satisfied and, therefore, the algorithm continues with the next iteration. In the next iteration, Φ ( k ) typically shrinks due to the decreased value of τ ( k ) according to Eq. (41). The algorithm continues until there is a solution ˆ w ( k ) ∈ Ψ( c ) . Note that this does not necessarily mean that the algorithm will stop if and only if Φ ( k ) ⊆ Ψ( c ) (see e.g., Fig. 1(b) where the algorithm stops before Φ ( k ) ⊆ Ψ( c ) ). Only in the worst case scenario a solution is found when Φ ( k ) ⊆ Ψ( c ) . We show below that, for m ≤ 2 M -3 , the proposed method guarantees termination within a pre-defined finite maximum number of iterations, k max , while achieving controlled binaural cue preservation accuracy and controlled noise reduction. This is written more formally in the following theorem.  and a bounded noise output power  Proof: Note that for m ≤ 2 M -3 , after k max iterations τ ( k ) = 0 (see Eq. 41) and, therefore, ˆ w ( k max ) = ˆ w JBLCMV because the relaxations of the proposed method in Eq. (36) become w H ( k ) Λ 2 = 0 , which is the same as in JBLCMV as explained in Section V. Therefore, for m ≤ 2 M -3 , the algorithm, in the worst case scenario, will terminate after k max iterations (specified by the user), giving the solution ˆ w JBLCMV which always satisfies the stopping criterion, i.e., ˆ w JBLCMV ∈ Ψ( c ) , for 0 ≤ c ≤ 1 (see Section V-A). This means that the algorithm in the worst case scenario (after k max ) will have the noise output power ˆ w H JBLCMV ˜ P ˆ w JBLCMV and an ITF error equal 0 . Moreover, the noise output power cannot be less than ˆ w H BMVDR ˜ P ˆ w BMVDR (because ˆ w BMVDR achieves the best noise reduction over all the aforementioned methods, because it has the largest feasible set) and the ITF error will be E n i , ( k ) ≤ e i ( c ) ∀ i , because the stopping criterion is satisfied. Therefore, Eqs. (43) and (44) are proved. Note that, for k = k max and m > 2 M -2 , Φ ( k max ) = ∅ 3 . However, for k < k max and m > 2 M -2 , Φ ( k ) may not be empty. As we will show in our experiments, indeed, usually it is not empty and, therefore, we may achieve simultaneous controlled approximate binaural cue preservation and, in most cases, controlled noise reduction of m > 2 M -2 interferers. This can be observed experimentally from our results in Sections VI-C2 and VI-C3. ## VI. EXPERIMENTAL RESULTS In this section, the proposed algorithm is experimentally evaluated. In Section VI-A, the setup of our experiments is demonstrated. In Section VI-B, the performance measures are presented. In Section VI-C, the proposed method is compared to other LCMV-based methods with regard to binaural cue preservation and noise reduction. Moreover, we provide results with regard to the speed of the proposed method in terms of number of iterations. ## A. Experiment Setup Fig. 2 shows the experimental setup that we used for our experiments. Two behind-the-ear (BTE) hearing aids, with two microphones each, are used for the experiments. Therefore, the total number of microphones is M = 4 . The publicly available database with the BTE impulse responses (IRs) in [39] is used to simulate the head IRs (we used the front and middle microphone for each hearing aid). The front microphones are selected as reference microphones. We placed all sources on a h = 80 cm radius circle centered at the origin (0 , 0) (center of head) with an elavation of 0 o degrees. The index of each interferer (denoted by 'x' marker) is indicated in Fig. 2. The interferers 1 , 2 , 3 , 4 , 5 , 6 and 7 are speech shaped noise realizations with the same power and are placed at 15 o , 45 o , 75 o , 105 o , 165 o , 240 o and 300 o degrees, respectively. The target source (denoted by 'o' marker) is a speech signal in the look direction, i.e., 90 o degrees. The duration of all sources is 60 sec. The microphone self noise at each microphone is simulated as white Gaussian noise (WGN) with P V = σ 2 I , where σ = 3 . 8 ∗ 10 -5 which corresponds to an SNR of 50 dB with respect to the target signal at the left reference microphone. The noise CPSD matrices, P , are calculated (as in Eq. (3)) using the ATFs of the truncated true BTE IRs, from the 3 Recall that for m = 2 M -2 (i.e., d = 2 M ), there is a feasible solution which does not provide controlled noise reduction (see Section III-B). Fig. 2. Experimental setup: /square hearing aids, 'o' target source, 'x' speech shaped interferers. Each source has the same distance, h , from the center of the head. <details> <summary>Image 2 Details</summary> ![6dfc73c0](/v1/image/6dfc73c0e2fca5f9f3090d743ae91a149601f984323635a08224f0b61fbb6e6b) ### Visual Description ## Diagram: Target Localization Setup ### Overview The image depicts a diagram illustrating a target localization setup. It shows a central figure (presumably a person) with a target and several points marked with 'x' symbols, numbered 1 through 7. The diagram includes coordinate axes and annotations indicating distances and angles. ### Components/Axes * **Axes:** * x-axis: Labeled "x-axis" and runs horizontally across the center of the diagram. * y-axis: Labeled "y-axis" and runs vertically along the left edge of the diagram. * **Central Figure:** A circular representation of a person's head, with two square markers on either side. * **Target:** A small circle labeled "target" located above the central figure. * **Points:** Seven points marked with blue 'x' symbols, numbered 1 through 7. * **Distance:** A dashed line labeled "h" connects the central figure to the target. * **Angle:** A dashed red line extending from the central figure to point 1, labeled "15°". * **Zero Lines:** Red lines indicate the zero positions on both the x and y axes. ### Detailed Analysis * **Point Locations:** * Point 1: Located to the right of the central figure, at approximately y=0. * Point 2: Located in the upper-right quadrant. * Point 3: Located above and slightly to the right of the central figure. * Point 4: Located above and to the left of the central figure. * Point 5: Located to the left of the central figure, at approximately y=0. * Point 6: Located in the lower-left quadrant. * Point 7: Located in the lower-right quadrant. * **Distance "h":** The dashed line "h" represents the distance between the central figure and the target. * **Angle "15°":** The dashed red line indicates an angle of 15 degrees relative to the horizontal axis. ### Key Observations * The diagram is symmetrical about the y-axis. * Points 1 and 5 are located on the x-axis. * Points 6 and 7 are located below the x-axis, while points 2, 3, and 4 are located above the x-axis. ### Interpretation The diagram likely represents a setup for studying or simulating target localization. The central figure represents a listener or observer, the target is the sound source to be localized, and the points represent potential locations of other sound sources or measurement points. The distance "h" and angle "15°" are parameters used to define the spatial relationships between the listener, the target, and other elements in the environment. The diagram could be used to analyze the accuracy and precision of different localization algorithms or to investigate the effects of noise and reverberation on localization performance. </details> database, and the estimated PSDs of the sources using all available data without voice activity detection (VAD) errors. Also, the constraints of all the aforementioned methods use the ATFs of the truncated true BTE IRs. The truncated BTE IRs length is 12 . 5 ms. The sampling frequency is f s = 16 kHz. We use a simple overlap-and-add analysis/synthesis method [40] with frame length 10 ms, overlap 50% and an FFT size of 256 . The analysis/synthesis window is a square-root-Hann window. The ATFs are also computed with an FFT size of 256 . Finally, the microphone signals are computed by convolving the truncated BTE IRs with the source signals at the original locations. ## B. Performance Evaluation In this section we define the performance evaluation measures that we use to evaluate the results.  1) ITFs, IPDs & ILDs: In this section three average performance measures for binaural cue preservation are defined: the average ILD error, the average IPD error, and the average ITF error. Note, as explained in Section III-A, that the IPD errors are perceptually important only for frequencies below 1 kHz, and the ILD errors are perceptually important only for frequencies above 3 kHz. Therefore, the evaluation of IPDs and ILDs will be done only for these frequency regions. We evaluate the average ILD and IPD error for all interferers as follows. Let L n i ( k, l ) and T n i ( k, l ) denote the ILD and IPD errors (for the k -th frequency bin and l -th frame), respectively, defined in Eq. (9). Then the average ILD and ITD errors are defined as and where N and T are the number of frequency bins and the number of frames, respectively, k ILD and k IPD are the first and last frequency-bin indices in the frequency regions 3 -  8 kHz and 0 -1 kHz, respectively.. Note that since the max possible value of T n i ( k, l ) is 1 , the max value of TotER IPD is r . Moreover, we evaluate the average ITF error given by where E n i is the ITF error defined in Eq. (8). Finally, we evaluate the average ITF error ratio given by   Please note that in these experiments we use the true ATFs in the constraints of the optimization problems of all competing methods. Therefore, we do not measure the corresponding error measures for the binaural cues of target source since they are always zero, because in all compared methods the constraints perfectly preserve the binaural cues of the target source. which measures the average amount of binaural cue collapse by comparing the ITF error of the proposed method with the ITF error of the BMVDR. Since the proposed method will always satisfy the condition E n i , ( k ) ( k, l ) ≤ c E n i , BMVDR ( k, l ) for r ≤ 2 M -3 (see Theorem 1 ), obviously AvER ITF ( c ) ≤ c for r ≤ 2 M -3 . Note that ideally the proposed method will provide a solution as close as possible to the boundary of Ψ( c ) , i.e., AvER ITF ( c ) ≈ c (see Section V-B). Moreover, for the proposed method AvER ITF (0) = 0 and AvER ITF (1) = 1 because for c = 0 , E n i ( k, l ) = 0 (for r ≤ 2 M -3 ), and for c = 1 , E n i ( k, l ) = E n i , BMVDR ( k, l ) . 2) SNR measures: We define the binaural global segmental signal-to-noise-ratio (gsSNR) gain as  where the gsSNR input and output are defined as  respectively, where for the l -th frame, the binaural input signalto-noise-ratio (SNR) is defined as    where e T = [ e T L e T R ] , e T L = [1 , 0 , · · · , 0] and e T R = [0 , · · · , 0 , 1] , ˜ P is defined in Eq. (11) and ˜ P x is similarly defined but it uses as diagonal block matrices the P x matrix. The binaural output SNR for the l -th frame, is defined as where w = [ w T L ( k, l ) w T R ( k, l )] T . Note that gsSNR out and gsSNR in can be seen as average measures of the binaural SNR measures defined in [30]. Fig. 3. Anechoic environment: Performance curves for the competing methods in terms of (a) noise reduction, (b) average ITF error, (c) average ILD error, (d) average IPD error. <details> <summary>Image 3 Details</summary> ![faff259f](/v1/image/faff259f7f04320039aa87079b8849f9d00302a2be1cd900325ce54347406e9e) ### Visual Description ## Chart: Performance Metrics vs. Number of Interferers ### Overview The image presents four line charts arranged in a 2x2 grid, each displaying a different performance metric as a function of the number of interferers. The metrics are gsSNR gain (dB), TotER^{ITF}, TotER^{ILD}, and TotER^{ITD}. The x-axis represents the number of interferers (r), ranging from 1 to 7 in all four charts. Different lines represent different algorithms or parameter settings. ### Components/Axes **General Components:** * **X-axis (all charts):** Number of interferers (r), ranging from 1 to 7. * **Legends:** Two legends are present. The top-right legend applies to the top two charts, and the bottom-left legend applies to the bottom two charts. **Chart (a): gsSNR^{gain} (dB)** * **Y-axis:** gsSNR^{gain} (dB), ranging from 0 to 30. * **Legend (Top-Right):** * Red solid line: JBLCMV * Blue dotted line: Prop.-c = 0.3, k_{max} = 10 * Purple line with x markers: Prop.-c = 0.3, k_{max} = 50 * Green dashed line with diamond markers: Prop.-c = 0.6, k_{max} = 10 * Yellow line with circle markers: Prop.-c = 0.6, k_{max} = 50 * Black dashed line: BMVDR **Chart (b): TotER^{ITF}** * **Y-axis:** TotER^{ITF}, ranging from 0 to 30. * **Legend (Top-Right):** Same as Chart (a). **Chart (c): TotER^{ILD}** * **Y-axis:** TotER^{ILD}, ranging from 0 to 1000. * **Legend (Bottom-Left):** * Brown line with square markers: BLCMV-η = 0.1 * Dark-Brown line with plus markers: BLCMV-η = 0.001 * Green line with star markers: OBLCMV **Chart (d): TotER^{ITD}** * **Y-axis:** TotER^{ITD}, ranging from 0 to 2. * **Legend (Bottom-Left):** Same as Chart (c). ### Detailed Analysis **Chart (a): gsSNR^{gain} (dB)** * **JBLCMV (Red solid line):** Starts at approximately 31 dB and decreases sharply to around 3 dB by r=4, then remains relatively constant. * **Prop.-c = 0.3, k_{max} = 10 (Blue dotted line):** Starts at approximately 31 dB, decreases to around 4 dB by r=4, then remains relatively constant. * **Prop.-c = 0.3, k_{max} = 50 (Purple line with x markers):** Starts at approximately 31 dB, decreases to around 4 dB by r=4, then remains relatively constant. * **Prop.-c = 0.6, k_{max} = 10 (Green dashed line with diamond markers):** Starts at approximately 31 dB, decreases to around 5 dB by r=4, then remains relatively constant. * **Prop.-c = 0.6, k_{max} = 50 (Yellow line with circle markers):** Starts at approximately 31 dB, decreases to around 5 dB by r=4, then remains relatively constant. * **BMVDR (Black dashed line):** Starts at approximately 31 dB, decreases to around 5 dB by r=4, then remains relatively constant. * **BLCMV-η = 0.1 (Brown line with square markers):** Starts at approximately 14 dB, remains relatively constant until r=3, then decreases to around 3 dB by r=4, then remains relatively constant. * **BLCMV-η = 0.001 (Dark-Brown line with plus markers):** Starts at approximately 31 dB, decreases to around 3 dB by r=4, then remains relatively constant. **Chart (b): TotER^{ITF}** * **JBLCMV (Red solid line):** Remains close to 0 for r=1 to r=4, then increases to approximately 2 by r=7. * **Prop.-c = 0.3, k_{max} = 10 (Blue dotted line):** Remains close to 0 for r=1 to r=4, then increases to approximately 3 by r=7. * **Prop.-c = 0.3, k_{max} = 50 (Purple line with x markers):** Remains close to 0 for r=1 to r=4, then increases to approximately 7 by r=7. * **Prop.-c = 0.6, k_{max} = 10 (Green dashed line with diamond markers):** Remains close to 0 for r=1 to r=4, then increases to approximately 10 by r=7. * **Prop.-c = 0.6, k_{max} = 50 (Yellow line with circle markers):** Remains close to 0 for r=1 to r=4, then increases to approximately 25 by r=7. * **BMVDR (Black dashed line):** Starts at approximately 1, increases to approximately 26 by r=7. * **BLCMV-η = 0.1 (Brown line with square markers):** Remains close to 0 for r=1 to r=4, then increases to approximately 26 by r=7. * **BLCMV-η = 0.001 (Dark-Brown line with plus markers):** Remains close to 0 for r=1 to r=4, then increases to approximately 26 by r=7. **Chart (c): TotER^{ILD}** * **BLCMV-η = 0.1 (Brown line with square markers):** Remains close to 0 until r=5, then increases sharply to approximately 1500 by r=7. * **BLCMV-η = 0.001 (Dark-Brown line with plus markers):** Remains close to 0. * **OBLCMV (Green line with star markers):** Remains close to 0 until r=5, then increases sharply to approximately 700 by r=7. * **JBLCMV (Red solid line):** Remains close to 0. * **Prop.-c = 0.3, k_{max} = 10 (Blue dotted line):** Remains close to 0 until r=5, then increases to approximately 400 by r=7. * **Prop.-c = 0.3, k_{max} = 50 (Purple line with x markers):** Remains close to 0 until r=5, then increases to approximately 500 by r=7. * **Prop.-c = 0.6, k_{max} = 10 (Green dashed line with diamond markers):** Remains close to 0 until r=5, then increases to approximately 600 by r=7. * **Prop.-c = 0.6, k_{max} = 50 (Yellow line with circle markers):** Remains close to 0 until r=5, then increases to approximately 1000 by r=7. * **BMVDR (Black dashed line):** Remains close to 0 until r=5, then increases sharply to approximately 1500 by r=7. **Chart (d): TotER^{ITD}** * **BLCMV-η = 0.1 (Brown line with square markers):** Remains close to 0. * **BLCMV-η = 0.001 (Dark-Brown line with plus markers):** Starts at approximately 0, increases to approximately 0.6 by r=7. * **OBLCMV (Green line with star markers):** Starts at approximately 0.2, increases to approximately 1.2 by r=7. * **JBLCMV (Red solid line):** Starts at approximately 0, increases to approximately 0.6 by r=7. * **Prop.-c = 0.3, k_{max} = 10 (Blue dotted line):** Starts at approximately 0.2, increases to approximately 0.4 by r=7. * **Prop.-c = 0.3, k_{max} = 50 (Purple line with x markers):** Starts at approximately 0.2, increases to approximately 0.5 by r=7. * **Prop.-c = 0.6, k_{max} = 10 (Green dashed line with diamond markers):** Starts at approximately 0.2, increases to approximately 0.7 by r=7. * **Prop.-c = 0.6, k_{max} = 50 (Yellow line with circle markers):** Starts at approximately 0.2, increases to approximately 0.9 by r=7. * **BMVDR (Black dashed line):** Starts at approximately 0.2, increases to approximately 1 by r=7. ### Key Observations * In Chart (a), gsSNR^{gain} generally decreases as the number of interferers increases. * In Charts (b), (c), and (d), TotER metrics generally increase as the number of interferers increases. * The BLCMV-η = 0.1 algorithm shows a sharp increase in TotER^{ILD} after r=5. * The BMVDR algorithm shows a sharp increase in TotER^{ILD} after r=5. ### Interpretation The charts illustrate the performance of different beamforming algorithms under varying interference conditions. The gsSNR^{gain} metric reflects the signal quality, which degrades as more interferers are present. The TotER metrics (ITF, ILD, ITD) represent different types of error, which generally increase with the number of interferers. The sharp increase in TotER^{ILD} for BLCMV-η = 0.1 and BMVDR after r=5 suggests that these algorithms become unstable or less effective when dealing with a high number of interferers. The other algorithms exhibit more gradual increases in error, indicating better robustness to interference. The different parameter settings for the "Prop.-c" algorithm (k_{max} = 10 vs. k_{max} = 50, and c = 0.3 vs. c = 0.6) also influence performance, with some combinations showing better error rates than others. </details> ## C. Results In the following experiments we evaluate the performance of the proposed and reference methods (i.e., BLCMV [27] with two different values of η , OBLCMV [28], BMVDR [30] and JBLCMV [29], [30]) as a function of the number of simultaneously present interferers, 1 ≤ r ≤ 7 . For instance, for r = 1 , only the interferer with index 1 is enabled while all the others are silent. For r = 2 , only the interferers with indices 1 , 2 are enabled, while the others are silent, and so on. The binaural gsSNR in values for r = 1 , 2 , 3 , 4 , 5 , 6 and 7 are 0 . 46 , -1 . 45 , -2 . 29 , -2 . 92 , -3 . 76 , -4 . 15 , and -4 . 53 dB, respectively. Recall that each method has a different m max , except for the proposed method for c > 0 where m max is difficult to be estimated, as explained in Section V, and, therefore, m is always set to m = r . For each of the reference methods and the proposed method in the case of c = 0 and if r > m max , we will use in the constraints only the first m max interferers and the last r -m max will not be preserved. In Sections VI-C1, VI-C2 the simulations are carried out without taking into account room acoustics. In Section VI-C3 the simulations are carried out by taking into account room acoustics. Figs. 3 and 4 show the comparison of the proposed method (denoted by Prop. -c = value , k max = value ) with the aforementioned reference methods in terms of binaural cue preservation and noise reduction. Note that BMVDR and the JBLCMV are the two extreme special cases of our method which can be denoted as Prop. -c = 1 and Prop. -c = 0 , 1) SNR & Binaural Cue Preservation: For simplicity, we used the same c = c j , for j = 1 , · · · , m for all interferers in the proposed method. In other words, we assumed that the binaural cues of all interferers are equally important. Moreover, we selected for the adaptive change of τ ( k ) the step parameter α ( k max ) with k max ∈ { 10 , 50 } . Fig. 4. Anechoic environment: Combination of performance curves from Fig. 3 for the competing methods in terms of (a) noise reduction, (b) average ITF error for different number of simultaneously present interferers r . The counting of r starts at the top left part of each curve. <details> <summary>Image 4 Details</summary> ![dbcca6f6](/v1/image/dbcca6f6b53831059dbf3c9d7398d3c397618c00c99dc1c525b7caf1d0eac78e) ### Visual Description ## Chart: gSSNR gain vs. TotER^{ITF} ### Overview The image is a 2D line chart comparing the performance of different algorithms (JBLCMV, Prop, BMVDR, BLCMV, OBLCMV) based on their gSSNR gain (in dB) versus TotER^{ITF}. Each algorithm is represented by a different colored line with a unique marker. The chart shows how the gSSNR gain changes as TotER^{ITF} increases. ### Components/Axes * **X-axis:** TotER^{ITF}. Scale ranges from 0 to 25, with tick marks at intervals of 5. * **Y-axis:** gSSNR gain (dB). Scale ranges from 0 to 35, with tick marks at intervals of 5. * **Legend:** Located in the top-right of the chart, it identifies each algorithm by color and marker. * Red triangle pointing right: JBLCMV * Blue triangle pointing up: Prop.-c = 0.3, k_{max} = 10 * Magenta cross: Prop.-c = 0.3, k_{max} = 50 * Lime green star: Prop.-c = 0.6, k_{max} = 10 * Yellow circle: Prop.-c = 0.6, k_{max} = 50 * Black diamond: BMVDR * Brown square: BLCMV-η = 0.1 * Dark red plus sign: BLCMV-η = 0.001 * Green asterisk: OBLCMV ### Detailed Analysis * **JBLCMV (Red Triangles):** The gSSNR gain starts around 3 dB at TotER^{ITF} = 0, then decreases to approximately 1 dB as TotER^{ITF} approaches 1. * **Prop.-c = 0.3, k_{max} = 10 (Blue Triangles):** The gSSNR gain starts around 30 dB at TotER^{ITF} = 0, then decreases to approximately 4 dB as TotER^{ITF} approaches 5. * **Prop.-c = 0.3, k_{max} = 50 (Magenta Crosses):** The gSSNR gain starts around 30 dB at TotER^{ITF} = 0, then decreases to approximately 4 dB as TotER^{ITF} approaches 5. * **Prop.-c = 0.6, k_{max} = 10 (Lime Green Stars):** The gSSNR gain starts around 32 dB at TotER^{ITF} = 0, then decreases to approximately 5 dB as TotER^{ITF} approaches 10. * **Prop.-c = 0.6, k_{max} = 50 (Yellow Circles):** The gSSNR gain starts around 30 dB at TotER^{ITF} = 0, then decreases to approximately 5 dB as TotER^{ITF} approaches 10. * **BMVDR (Black Diamonds):** The gSSNR gain starts around 33 dB at TotER^{ITF} = 0, then decreases to approximately 6 dB as TotER^{ITF} approaches 5, and then decreases slowly to approximately 5 dB as TotER^{ITF} approaches 25. * **BLCMV-η = 0.1 (Brown Squares):** The gSSNR gain starts around 14 dB at TotER^{ITF} = 0, then decreases to approximately 1 dB as TotER^{ITF} approaches 25. * **BLCMV-η = 0.001 (Dark Red Plus Signs):** The gSSNR gain starts around 14 dB at TotER^{ITF} = 0, then decreases to approximately 1 dB as TotER^{ITF} approaches 25. * **OBLCMV (Green Asterisks):** The gSSNR gain starts around 30 dB at TotER^{ITF} = 0, then decreases to approximately 5 dB as TotER^{ITF} approaches 10, and then remains relatively constant as TotER^{ITF} approaches 25. ### Key Observations * The algorithms Prop.-c = 0.3, k_{max} = 10, Prop.-c = 0.3, k_{max} = 50, Prop.-c = 0.6, k_{max} = 10, Prop.-c = 0.6, k_{max} = 50, BMVDR, and OBLCMV all start with a high gSSNR gain and decrease rapidly as TotER^{ITF} increases. * The algorithms BLCMV-η = 0.1 and BLCMV-η = 0.001 start with a lower gSSNR gain and decrease slowly as TotER^{ITF} increases. * The algorithm JBLCMV has the lowest gSSNR gain and decreases rapidly as TotER^{ITF} increases. ### Interpretation The chart illustrates the trade-off between gSSNR gain and TotER^{ITF} for different algorithms. Algorithms like BMVDR and OBLCMV initially provide high gSSNR gain but experience a decrease as TotER^{ITF} increases, eventually stabilizing. JBLCMV consistently performs poorly in terms of gSSNR gain. The "Prop" algorithms show a similar trend to BMVDR and OBLCMV, starting high and decreasing rapidly. The BLCMV algorithms start with a lower gain and decrease more gradually. This suggests that the choice of algorithm depends on the desired balance between gSSNR gain and TotER^{ITF}. </details> respectively. However, in these figures we used the original names for clarity. The performance curves are for different number of simultaneously present interferers r . As expected, the performance curves of the proposed method always lie between the BMVDR and the JBLCMV. Fig. 4 is the combination of the curves of Figs. 3(a,b) into a single figure. Notice that the number of interferers r in this combined figure increase from r = 1 up to r = 7 along the curves from top-left, to bottom-right. As expected, the proposed method for k max = 50 achieves slightly better noise reduction and worse binaural cue preservation than for k max = 10 . This is because for a larger k max , the proposed algorithm will provide a feasible solution closer to the boundary of Ψ( c ) , as explained Fig. 5. Anechoic environment: Average ITF error ratio as a function of c for 1 ≤ r ≤ 7 for (a) k max = 10 and (b) k max = 50 . The solid line is the c values. <details> <summary>Image 5 Details</summary> ![1b8c07d7](/v1/image/1b8c07d7338c13f67142a71a9ee840b85588eaf264f8d198e26ee1ec41eba333) ### Visual Description ## Line Charts: AVER^{ITF}(c) vs. Amount of Collapse (c) ### Overview The image contains two line charts, labeled (a) and (b), displaying the relationship between AVER^{ITF}(c) and the amount of collapse (c). Both charts show multiple data series, each representing a different value of 'r' (ranging from 1 to 7), along with a series for 'c = 0:0.1:1'. The x-axis represents the amount of collapse (c), ranging from 0 to 1. The y-axis represents AVER^{ITF}(c), also ranging from 0 to 1. ### Components/Axes * **X-axis (Horizontal):** * Label: "amount of collapse (c)" * Scale: 0 to 1, with tick marks at 0, 0.2, 0.4, 0.6, 0.8, and 1. * **Y-axis (Vertical):** * Label: "AVER^{ITF}(c)" * Scale: 0 to 1, with tick marks at 0, 0.5, and 1. * **Legends (Top):** * `c = 0:0.1:1`: Red line with diamond markers. * `r = 1`: Blue line with diamond markers. * `r = 2`: Brown line with plus markers. * `r = 3`: Yellow line with plus markers. * `r = 4`: Purple dotted line with square markers. * `r = 5`: Green line with square markers. * `r = 6`: Cyan line with asterisk markers. * `r = 7`: Dark Brown line with star markers. * **Chart Labels:** * (a): Located below the left chart. * (b): Located below the right chart. ### Detailed Analysis **Chart (a):** * **`c = 0:0.1:1` (Red line with diamond markers):** The line starts at approximately 0 at c=0, and increases linearly to 1 at c=1. * **`r = 1` (Blue line with diamond markers):** The line starts at approximately 0 at c=0, and increases linearly to 1 at c=1. * **`r = 2` (Brown line with plus markers):** The line starts at approximately 0.15 at c=0, and increases linearly to approximately 0.9 at c=1. * **`r = 3` (Yellow line with plus markers):** The line starts at approximately 0.1 at c=0, and increases linearly to approximately 0.9 at c=1. * **`r = 4` (Purple dotted line with square markers):** The line starts at approximately 0.05 at c=0, and increases linearly to approximately 0.8 at c=1. * **`r = 5` (Green line with square markers):** The line starts at approximately 0 at c=0, and increases linearly to approximately 0.5 at c=0.8, then increases sharply to 1 at c=1. * **`r = 6` (Cyan line with asterisk markers):** The line starts at approximately 0.2 at c=0, decreases to approximately 0.0 at c=0.1, then increases linearly to approximately 0.5 at c=0.8, then increases sharply to 1 at c=1. * **`r = 7` (Dark Brown line with star markers):** The line starts at approximately 0.25 at c=0, decreases to approximately 0.1 at c=0.1, then increases linearly to approximately 0.7 at c=0.8, then increases sharply to 1 at c=1. **Chart (b):** * **`c = 0:0.1:1` (Red line with diamond markers):** The line starts at approximately 0 at c=0, and increases linearly to 1 at c=1. * **`r = 1` (Blue line with diamond markers):** The line starts at approximately 0 at c=0, and increases linearly to 1 at c=1. * **`r = 2` (Brown line with plus markers):** The line starts at approximately 0.15 at c=0, and increases linearly to approximately 0.9 at c=1. * **`r = 3` (Yellow line with plus markers):** The line starts at approximately 0.1 at c=0, and increases linearly to approximately 0.9 at c=1. * **`r = 4` (Purple dotted line with square markers):** The line starts at approximately 0.05 at c=0, and increases linearly to approximately 0.8 at c=1. * **`r = 5` (Green line with square markers):** The line starts at approximately 0 at c=0, and increases linearly to approximately 0.5 at c=0.8, then increases sharply to 1 at c=1. * **`r = 6` (Cyan line with asterisk markers):** The line starts at approximately 0.2 at c=0, decreases to approximately 0.0 at c=0.1, then increases linearly to approximately 0.5 at c=0.8, then increases sharply to 1 at c=1. * **`r = 7` (Dark Brown line with star markers):** The line starts at approximately 0.25 at c=0, decreases to approximately 0.1 at c=0.1, then increases linearly to approximately 0.7 at c=0.8, then increases sharply to 1 at c=1. ### Key Observations * The charts (a) and (b) appear to be identical. * The lines for `c = 0:0.1:1` and `r = 1` are identical and linear, increasing directly from (0,0) to (1,1). * The lines for `r = 2`, `r = 3`, and `r = 4` are mostly linear, but do not start at (0,0). * The lines for `r = 5`, `r = 6`, and `r = 7` show a dip at c=0.1, then increase linearly until c=0.8, after which they sharply increase to 1 at c=1. ### Interpretation The charts illustrate how the AVER^{ITF}(c) changes with the amount of collapse (c) for different values of 'r'. The linear relationship observed for `c = 0:0.1:1` and `r = 1` suggests a direct proportionality between the amount of collapse and AVER^{ITF}(c) in these cases. The other 'r' values show varying degrees of non-linearity, especially for `r = 5`, `r = 6`, and `r = 7`, indicating a more complex relationship where AVER^{ITF}(c) is not directly proportional to the amount of collapse. The dip at c=0.1 for `r = 6` and `r = 7` suggests a possible threshold effect or an initial resistance to collapse at low values of 'c'. The sharp increase to 1 at c=1 for `r = 5`, `r = 6`, and `r = 7` indicates that regardless of the initial behavior, AVER^{ITF}(c) eventually reaches its maximum value when the amount of collapse is complete. </details> Fig. 6. Anechoic environment: Average number of iterations as a function of simultaneously present interferers, r . <details> <summary>Image 6 Details</summary> ![9056fb24](/v1/image/9056fb24e33a1dba9b42e146a119ceb9efeb0066328ed948a6de7346bd03cd38) ### Visual Description ## Line Chart: Average k vs. Number of Interferers ### Overview The image is a line chart that plots the average value of 'k' against the number of interferers 'r'. There are four data series, each representing a different configuration of parameters 'c' and 'kmax'. The chart aims to show how the average 'k' changes as the number of interferers increases for different parameter settings. ### Components/Axes * **X-axis:** Number of interferers (r), with values ranging from 1 to 7. * **Y-axis:** Average k, with values ranging from 1 to 7. * **Legend (Top-Right):** * Dotted Blue Line: Prop.-c = 0.3, kmax = 10 * Purple Line with X markers: Prop.-c = 0.3, kmax = 50 * Green Line with Diamond markers: Prop.-c = 0.6, kmax = 10 * Yellow Line with Circle markers: Prop.-c = 0.6, kmax = 50 ### Detailed Analysis * **Prop.-c = 0.3, kmax = 10 (Dotted Blue Line):** * Trend: Initially increases, then plateaus. * Data Points: * r = 1, average k ≈ 1.6 * r = 2, average k ≈ 1.9 * r = 3, average k ≈ 2.0 * r = 4, average k ≈ 2.3 * r = 5, average k ≈ 3.0 * r = 6, average k ≈ 3.2 * r = 7, average k ≈ 3.3 * **Prop.-c = 0.3, kmax = 50 (Purple Line with X markers):** * Trend: Increases and then plateaus. * Data Points: * r = 1, average k ≈ 1.6 * r = 2, average k ≈ 1.9 * r = 3, average k ≈ 2.0 * r = 4, average k ≈ 2.5 * r = 5, average k ≈ 3.8 * r = 6, average k ≈ 3.9 * r = 7, average k ≈ 3.9 * **Prop.-c = 0.6, kmax = 10 (Green Line with Diamond markers):** * Trend: Increases and then plateaus. * Data Points: * r = 1, average k ≈ 1.6 * r = 2, average k ≈ 1.9 * r = 3, average k ≈ 2.0 * r = 4, average k ≈ 2.6 * r = 5, average k ≈ 3.7 * r = 6, average k ≈ 4.1 * r = 7, average k ≈ 4.1 * **Prop.-c = 0.6, kmax = 50 (Yellow Line with Circle markers):** * Trend: Increases significantly and then plateaus at a higher level. * Data Points: * r = 1, average k ≈ 1.6 * r = 2, average k ≈ 1.9 * r = 3, average k ≈ 2.4 * r = 4, average k ≈ 3.4 * r = 5, average k ≈ 5.1 * r = 6, average k ≈ 5.7 * r = 7, average k ≈ 5.8 ### Key Observations * For all configurations, the average 'k' generally increases as the number of interferers increases. * The configuration with Prop.-c = 0.6 and kmax = 50 (Yellow Line) consistently yields the highest average 'k' values. * The configuration with Prop.-c = 0.3 and kmax = 10 (Dotted Blue Line) consistently yields the lowest average 'k' values. * The lines for Prop.-c = 0.3, kmax = 50 and Prop.-c = 0.6, kmax = 10 are very close to each other. ### Interpretation The data suggests that increasing the parameter 'c' from 0.3 to 0.6 and increasing 'kmax' from 10 to 50 generally leads to a higher average 'k' value, especially as the number of interferers increases. The parameter 'kmax' seems to have a more significant impact on the average 'k' when 'c' is higher (0.6). The plateauing of the lines indicates that beyond a certain number of interferers, the average 'k' value stabilizes for each configuration. The closeness of the purple and green lines suggests that the impact of 'c' is less significant when 'kmax' is not at its maximum value. </details> in Section V-B. Fig. 5 serves to visualize better the trade-off between fast termination and closeness to the boundary of Ψ( c ) (see Section V-B for details). More specifically, Fig. 5 shows the average ITF ratio of the proposed method, for k max = 10 , 50 , as a function of c for different number of simultaneously present interferers r . As expected (see Section VI-B1), AvER ITF ( c ) ≤ c for 1 ≤ r ≤ 5 . This is also the case for the curves for r = 6 , 7 except for c = 0 , as expected, because the proposed method becomes identical to the JBLCMV which can preserve the binaural cues of up to m max = 2 M -3 = 5 interferers while achieving controlled noise reduction. As expected, for k max = 50 all performance curves are closer to the boundary. In general, the larger the r , the less close the AvER ITF ( c ) of From Figs. 3(a,b), and Fig. 4 it is clear that, indeed the proposed method achieves a controlled noise reduction and controlled approximate binaural cue preservation. The BMVDR achieves the best noise reduction performance, but it does not preserve the binaural cues of the interferers. The JBLCMV accurately preserves the highest number of simultaneously present interferers and it has worse noise reduction performance than all parametrizations of the proposed method. Note that m max = 5 for JBLCMV and, therefore, the last two interferers cannot be included in the constraints and that is why the binaural cue preservation is not perfect. The OBLCMV comes second in terms of SNR performance, but it preserves the binaural cues of only one interferer. the proposed method is to c . In words, the more constraints we use in the proposed method, the harder it is to reach the boundary of Ψ( c ) . Note that for the two extreme values c = 0 and c = 1 , the proposed method becomes identical to the JBLCMV and the BMVDR, respectively. As was expected, for c = 0 and r ≤ 5 , AvER ITF (0) = 0 . The JBLCMV has m max = 2 M -3 = 5 and, therefore, for c = 0 and r = 6 , 7 , AvER ITF (0) > 0 . Finally, for c = 1 , for all values of r , AvER ITF (1) = 1 as expected. 2) Speed of Termination: In Fig. 6 we evaluate the number of iterations required for the proposed method to satisfy the stopping criterion (i.e., when it terminates). Fig. 6 shows the average number of iterations as a function of the simultaneously present interferers, r , of the four configurations of the proposed method that are tested in Fig. 3 and 4. It is clear that the proposed method terminates after 3-4 iterations on average, even for r = 6 , 7 > 2 M -3 . Note that for both tested values of k max , for all frames and frequency bins the proposed method terminated before reaching k max . Fig. 7 shows a 3D histogram which depicts the statistical termination behaviour of the proposed method. Specifically, the proposed method is evaluated with different c values from 0 . 1 to 0 . 9 with a step-size 0 . 1 . For each c value it is evaluated for all numbers of simultaneously present interferers, i.e., for r = 1 , · · · , 7 as in Fig 6. Hence, this histogram represents all gathered pair-values ( c, k ) of all frequency bins for all r = 1 , · · · , 7 . The pairs ( c, k ) express the number of iterations (per frequency bin), k , that the proposed method need in order to terminate for a certain initial c . The z -axis, which is depicted with different colors, is the number of frequency bins that are associated with a certain pair ( c, k ) in the x-y axes. Again we see that, on average, after 3-4 iterations the algorithm terminates for c = 0 . 1 : 0 . 1 : 0 . 9 . 3) Reverberation: Figs. 8, 9 and 10 show the same experiments as in Figs. 4, 5, and 6, respectively, but this time in a reverberant office environment. The same signals for the interferers and the target are used here. The BTE IRs containing reverberation are also taken from the database in [39]. Note that, the aforementioned database does not have the reverberant (for the office environment) head IRs corresponding to 240 o and 300 o degrees [39]. Therefore, we Fig. 7. Anechoic environment: Top view of 3D histogram of number of frequency bins that have pairs ( k, c ) for the proposed method for (a) k max = 10 and (b) k max = 50 . <details> <summary>Image 7 Details</summary> ![3b35dff1](/v1/image/3b35dff1257bd214b1d3808afac6c8bdfe3f5ccf9fcc7adf2a18687da5b9cd9a) ### Visual Description ## Heatmap: Number of Iterations vs. Amount of Collapse ### Overview The image presents two heatmaps, labeled (a) and (b), visualizing the relationship between the number of iterations (k) and the amount of collapse (c). The color intensity represents a third, unspecified variable, ranging from 0 (dark blue) to 1000 (yellow). Both heatmaps share the same axes and color scale. ### Components/Axes * **X-axis (Horizontal):** "amount of collapse (c)". Values range from 0.1 to 0.9 in increments of 0.1. * **Y-axis (Vertical):** "number of iterations (k)". Values range from 0 to 50 in increments of 10. * **Color Scale (Legend):** Located on the right side of the image. The scale ranges from 0 (dark blue) to 1000 (yellow). Intermediate colors include shades of blue, green, and yellow. * **Heatmap Labels:** The heatmaps are labeled as "(a)" and "(b)" below the x-axis. ### Detailed Analysis **Heatmap (a):** * The heatmap is predominantly dark blue, indicating low values. * A thin band of yellow/green is visible for the first few iterations (k < 5) across all values of 'c'. * The intensity of the yellow/green band appears relatively consistent across the range of 'c' values. **Heatmap (b):** * Similar to heatmap (a), the heatmap is predominantly dark blue. * A thin band of yellow/green is visible for the first few iterations (k < 5) across all values of 'c'. * The intensity of the yellow/green band appears to increase slightly as 'c' increases from 0.1 to 0.9. * There is a slight increase in the number of iterations that are not dark blue as 'c' increases. ### Key Observations * Both heatmaps show a similar pattern: low values for most combinations of 'k' and 'c', with higher values concentrated at low iteration numbers. * Heatmap (b) shows a slightly higher intensity and spread of higher values compared to heatmap (a), especially at higher 'c' values. ### Interpretation The heatmaps visualize the effect of the "amount of collapse (c)" on some unspecified variable as the "number of iterations (k)" increases. The data suggests that a higher amount of collapse may lead to slightly higher values of the unspecified variable, especially at lower iteration numbers. The difference between heatmaps (a) and (b) suggests that there is some difference in the underlying process or parameters between the two scenarios. The nature of the unspecified variable and the specific differences between the scenarios would need further context to fully understand. </details> used the avalaible angles, 125 o , 145 o for the 6 -th and 7 -th interferer, respectively. Moreover, the sources are now placed on a h = 100 cm radius circle centered at the origin (0 , 0) (center of head) with an elavation of 0 o degrees (because only in this distance is available for the office environment in [39]). The binaural gsSNR in values for r = 1 , 2 , 3 , 4 , 5 , 6 and 7 are now -0 . 03 , -2 . 1 , -3 . 03 , -3 . 75 , -4 . 44 , -4 . 95 , and -5 . 42 dB, respectively. As it is shown in Fig. 8, again the performance of the proposed method is bounded between the performance of the BMVDR and the JBLCMV, except for the case with parameters c = 0 . 3 and k max = 10 for r = 7 . The reason for that is that for r > 2 M -3 there are no guarantees for bounded performance (see Theorem 1). However, in all other cases of the proposed method for r > 2 M -3 the performance is bounded. In Fig. 9 it is clear that the proposed method has very similar behaviour as in Fig. 5, i.e., by increasing k max the the proposed method approaches closer to the boundary. Finally, in Fig. 10 it is shown that the speed of termination is not effected significantly due to reverberation. ## VII. CONCLUSION In this paper we proposed a new multi-microphone iterative binaural noise reduction method. The proposed method is capable of controlling the amount of noise reduction and the accuracy of binaural cue preservation per interferer using a robust methodology. Specifically, the inequality constraints introduced for the binaural cue preservation of the interferers, are selected in such a way that a) the binaural-cue error is always less or equal than a fraction of the corresponding binaural-cue error of the BMVDR method, and b) the achieved amount of noise suppression is larger or equal to the one achieved via JBLCMV. Therefore, the proposed method provides the flexibility to the users to parametrize the proposed method according to their needs. Moreover, the proposed method always preserves strictly the binaural cues of the target source. Although the proposed method guarantees controlled approximate binaural cue preservation and controlled noise Fig. 8. Reverberant environment (office): Combination of performance curves from Fig. 3 for the competing methods in terms of (a) noise reduction, (b) average ITF error for different number of simultaneously present interferers r . The counting of r starts at the top left part of each curve. <details> <summary>Image 8 Details</summary> ![e2ea367f](/v1/image/e2ea367f74b423c9d5648afa12286516d240c295ff2207220d0e3b14d909884c) ### Visual Description ## Chart: gSSNR gain vs TotERITF ### Overview The image is a line chart comparing the performance of different algorithms (JBLCMV, Prop, BMVDR, BLCMV, OBLCMV) in terms of gSSNR gain (dB) versus TotERITF. The chart displays multiple data series, each representing a different algorithm or configuration. The x-axis represents TotERITF, and the y-axis represents gSSNR gain in dB. ### Components/Axes * **X-axis:** TotERITF, ranging from 0 to approximately 90. * **Y-axis:** gSSNR gain (dB), ranging from 0 to 35. * **Title:** gSSNR gain (dB) vs TotERITF (inferred from axis labels and context). * **Legend (Top-Right):** * Red triangle: JBLCMV * Blue triangle: Prop.-c = 0.3, kmax = 10 * Magenta x: Prop.-c = 0.3, kmax = 50 * Light green star: Prop.-c = 0.6, kmax = 10 * Yellow circle: Prop.-c = 0.6, kmax = 50 * Black diamond: BMVDR * Brown square: BLCMV-η = 0.1 * Red plus: BLCMV-η = 0.001 * Dark green asterisk: OBLCMV ### Detailed Analysis * **JBLCMV (Red Triangles):** Starts at approximately (0, 4) and increases slightly before decreasing and remaining relatively flat around 2-4 dB. * **Prop.-c = 0.3, kmax = 10 (Blue Triangles):** Starts at approximately (0, 5) and increases to approximately (15, 6) before decreasing and remaining relatively flat around 4-6 dB. * **Prop.-c = 0.3, kmax = 50 (Magenta x):** Starts at approximately (0, 6) and increases to approximately (15, 5) before decreasing and remaining relatively flat around 2-5 dB. * **Prop.-c = 0.6, kmax = 10 (Light green star):** Starts at approximately (0, 7) and increases to approximately (40, 6) before decreasing and remaining relatively flat around 4-7 dB. * **Prop.-c = 0.6, kmax = 50 (Yellow circle):** Starts at approximately (0, 7) and increases to approximately (40, 5) before decreasing and remaining relatively flat around 4-6 dB. * **BMVDR (Black Diamonds):** Starts at approximately (0, 20) and decreases sharply to approximately (5, 8) before increasing and remaining relatively flat around 6-8 dB. * **BLCMV-η = 0.1 (Brown Squares):** Starts at approximately (0, 15) and decreases sharply to approximately (5, 7) before increasing and remaining relatively flat around 6-7 dB. * **BLCMV-η = 0.001 (Red Plus):** Starts at approximately (0, 30) and decreases sharply to approximately (5, 7) before increasing and remaining relatively flat around 6-7 dB. * **OBLCMV (Dark green Asterisk):** Starts at approximately (0, 33) and decreases sharply to approximately (5, 8) before increasing and remaining relatively flat around 6-8 dB. ### Key Observations * The algorithms OBLCMV, BLCMV-η = 0.001, and BMVDR show the highest initial gSSNR gain but experience a sharp decrease as TotERITF increases. * The "Prop" algorithms generally have lower initial gSSNR gain but maintain a more stable performance as TotERITF increases. * All algorithms tend to converge to a relatively flat gSSNR gain as TotERITF increases beyond 20. ### Interpretation The chart illustrates a trade-off between initial gSSNR gain and stability with respect to TotERITF. Algorithms like OBLCMV and BMVDR offer high initial gain but are more sensitive to changes in TotERITF. In contrast, the "Prop" algorithms provide a more consistent performance across different TotERITF values, albeit at a lower initial gain. The choice of algorithm would depend on the specific application and the importance of initial gain versus stability. The BLCMV algorithms perform similarly to BMVDR and OBLCMV. </details> reduction only for m ≤ 2 M -3 interferers, it is experimentally demonstrated that is also capable of doing the same (in most cases) for more interferers and terminate in just a few iterations. ## APPENDIX In this section, we show how the optimization problem in Eq. (36) can be equivalently written as a second order cone programming (SOCP) problem. For convenience, we reformulate the optimization problem in Eq. (36) using RATFs instead of ATFs. The left and right RATFs of the i -th interferer are ¯ b i,L = (1 /b iL ) b i and ¯ b i,R = (1 /b iR ) b i , respectively, while the left and right RATFs of the target are ¯ a L = (1 /a L ) a and ¯ a R = (1 /a R ) a , respectively. It is easy to show that the constraints of the optimization problem in Eq. (36) can be Fig. 9. Reverberant environment (office): Average ITF error ratio as a function of c for 1 ≤ r ≤ 7 for (a) k max = 10 and (b) k max = 50 . The solid line is the c values. <details> <summary>Image 9 Details</summary> ![9bca74c6](/v1/image/9bca74c65a30d840bcb226f5d00021d0b1e0940953f54fe170ec8bfff20d88a8) ### Visual Description ## Line Chart: AverITF(c) vs. Amount of Collapse (c) ### Overview The image contains two line charts, labeled (a) and (b), that depict the relationship between AverITF(c) and the amount of collapse (c) for different values of 'r'. The charts share the same axes and legend, allowing for a direct comparison of the data. ### Components/Axes * **Title (Y-axis):** AverITF(c) * Scale: 0 to 1, with tick marks at 0, 0.5, and 1. * **Title (X-axis):** amount of collapse (c) * Scale: 0 to 1, with tick marks at 0, 0.2, 0.4, 0.6, 0.8, and 1. * **Legend (Top):** Located above the charts, it identifies the lines by color and 'r' value. * Red: c = 0 : 0.1 : 1 * Blue: r = 1 * Yellow: r = 2 * Dotted Black: r = 3 * Green: r = 4 * Cyan: r = 5 * Brown: r = 7 * **Chart Labels:** (a) and (b) are located below each chart. ### Detailed Analysis **Chart (a):** * **Red (c = 0 : 0.1 : 1):** Starts at approximately 0, increases linearly to 1 as 'c' goes to 1. * **Blue (r = 1):** Starts at approximately 0, increases linearly to 1 as 'c' goes to 1. * **Yellow (r = 2):** Starts at approximately 0, increases linearly to approximately 0.9 as 'c' goes to 0.8, then increases sharply to 1 as 'c' goes to 1. * **Dotted Black (r = 3):** Starts at approximately 0, increases linearly to approximately 0.7 as 'c' goes to 0.8, then increases sharply to 1 as 'c' goes to 1. * **Green (r = 4):** Starts at approximately 0, increases linearly to approximately 0.6 as 'c' goes to 0.8, then increases sharply to 1 as 'c' goes to 1. * **Cyan (r = 5):** Starts at approximately 0.05, decreases to approximately 0.02 at c=0.1, then increases linearly to approximately 0.6 as 'c' goes to 0.8, then increases sharply to 1 as 'c' goes to 1. * **Brown (r = 7):** Starts at approximately 0.2, decreases to approximately 0.05 at c=0.1, then increases linearly to approximately 0.55 as 'c' goes to 0.8, then increases sharply to 1 as 'c' goes to 1. **Chart (b):** * **Red (c = 0 : 0.1 : 1):** Starts at approximately 0, increases linearly to 1 as 'c' goes to 1. * **Blue (r = 1):** Starts at approximately 0, increases linearly to 1 as 'c' goes to 1. * **Yellow (r = 2):** Starts at approximately 0, increases linearly to approximately 0.9 as 'c' goes to 0.8, then increases sharply to 1 as 'c' goes to 1. * **Dotted Black (r = 3):** Starts at approximately 0, increases linearly to approximately 0.7 as 'c' goes to 0.8, then increases sharply to 1 as 'c' goes to 1. * **Green (r = 4):** Starts at approximately 0, increases linearly to approximately 0.6 as 'c' goes to 0.8, then increases sharply to 1 as 'c' goes to 1. * **Cyan (r = 5):** Starts at approximately 0.15, decreases to approximately 0.05 at c=0.1, then increases linearly to approximately 0.6 as 'c' goes to 0.8, then increases sharply to 1 as 'c' goes to 1. * **Brown (r = 7):** Starts at approximately 0.3, decreases to approximately 0.1 at c=0.1, then increases linearly to approximately 0.55 as 'c' goes to 0.8, then increases sharply to 1 as 'c' goes to 1. ### Key Observations * For both charts, the red and blue lines (c = 0 : 0.1 : 1 and r = 1) show a consistent linear increase from 0 to 1 as the amount of collapse increases. * The yellow, dotted black, and green lines (r = 2, r = 3, and r = 4) also increase with the amount of collapse, but their rate of increase changes as 'c' approaches 1. * The cyan and brown lines (r = 5 and r = 7) initially decrease slightly before increasing with the amount of collapse. * The primary difference between chart (a) and chart (b) lies in the initial values of the cyan and brown lines. ### Interpretation The charts illustrate how the AverITF(c) changes with the amount of collapse (c) for different values of 'r'. The linear increase observed in the red and blue lines suggests a direct proportional relationship between AverITF(c) and the amount of collapse. The other lines show a more complex relationship, where the rate of increase changes as 'c' approaches 1. The initial decrease in the cyan and brown lines suggests that for certain values of 'r', a small amount of collapse initially reduces the AverITF(c) before it starts to increase. The difference in initial values between chart (a) and chart (b) for the cyan and brown lines suggests that the initial impact of collapse on AverITF(c) may vary depending on other factors not explicitly represented in the chart. </details> Fig. 10. Reverberant environment (office): Average number of iterations as a function of simultaneously present interferers, r . <details> <summary>Image 10 Details</summary> ![27895c00](/v1/image/27895c00d3b44d5db5a733979bf59f5586f55a3d3655761e82d8e947c706fafe) ### Visual Description ## Line Chart: Average k vs. Number of Interferers ### Overview The image is a line chart comparing the average value of 'k' against the number of interferers 'r'. There are four data series, each representing a different configuration of parameters 'c' and 'k_max'. The chart illustrates how the average 'k' changes as the number of interferers increases for each configuration. ### Components/Axes * **X-axis:** "number of interferers (r)". The axis ranges from 1 to 7 in integer increments. * **Y-axis:** "average k". The axis ranges from 1 to 7 in integer increments. * **Legend:** Located in the top-right corner of the chart. It defines the four data series: * Dotted Blue Line: "Prop.-c = 0.3, k_max = 10" * Purple Line with X markers: "Prop.-c = 0.3, k_max = 50" * Green Dashed Line with Diamond markers: "Prop.-c = 0.6, k_max = 10" * Yellow Line with Circle markers: "Prop.-c = 0.6, k_max = 50" ### Detailed Analysis * **Prop.-c = 0.3, k_max = 10 (Dotted Blue Line):** * Trend: Generally increasing, but with a slower rate of increase compared to other series. * Data Points: * r = 1, average k ≈ 1.7 * r = 2, average k ≈ 1.9 * r = 3, average k ≈ 2.1 * r = 4, average k ≈ 2.5 * r = 5, average k ≈ 3.1 * r = 6, average k ≈ 3.2 * r = 7, average k ≈ 3.3 * **Prop.-c = 0.3, k_max = 50 (Purple Line with X markers):** * Trend: Increasing, with a steeper initial slope that flattens out. * Data Points: * r = 1, average k ≈ 1.7 * r = 2, average k ≈ 2.1 * r = 3, average k ≈ 2.2 * r = 4, average k ≈ 2.6 * r = 5, average k ≈ 3.9 * r = 6, average k ≈ 4.0 * r = 7, average k ≈ 4.3 * **Prop.-c = 0.6, k_max = 10 (Green Dashed Line with Diamond markers):** * Trend: Increasing, similar to the purple line but slightly higher. * Data Points: * r = 1, average k ≈ 1.7 * r = 2, average k ≈ 2.0 * r = 3, average k ≈ 2.2 * r = 4, average k ≈ 2.7 * r = 5, average k ≈ 4.0 * r = 6, average k ≈ 4.0 * r = 7, average k ≈ 4.3 * **Prop.-c = 0.6, k_max = 50 (Yellow Line with Circle markers):** * Trend: Increasing, with a significant jump between r = 4 and r = 5. * Data Points: * r = 1, average k ≈ 1.7 * r = 2, average k ≈ 2.1 * r = 3, average k ≈ 2.3 * r = 4, average k ≈ 2.6 * r = 5, average k ≈ 5.5 * r = 6, average k ≈ 5.6 * r = 7, average k ≈ 6.2 ### Key Observations * For a given 'c' value, increasing 'k_max' generally results in a higher average 'k'. * The yellow line (Prop.-c = 0.6, k_max = 50) shows a much more significant increase in average 'k' as the number of interferers increases, especially after r = 4. * The dotted blue line (Prop.-c = 0.3, k_max = 10) consistently has the lowest average 'k' values. ### Interpretation The chart demonstrates the relationship between the number of interferers and the average value of 'k' under different parameter configurations. The parameter 'c' and 'k_max' significantly influence the average 'k'. Specifically, a higher 'k_max' value allows for a greater average 'k', especially when combined with a higher 'c' value. The jump in the yellow line suggests a threshold effect, where increasing the number of interferers beyond a certain point (r=4) has a disproportionately large impact on the average 'k' when both 'c' and 'k_max' are high. The data suggests that the system's behavior is highly sensitive to the interplay between these parameters and the number of interferers. </details> equivalently written as   where ζ = | ¯ a ∗ R, 1 ¯ b ∗ i,L,M -1 | (with ¯ a ∗ R, 1 the first element of ¯ a H R and ¯ b ∗ i,L,M is the last element of ¯ b i,L ) and Φ 2 ,i is the i -th column of the matrix Φ 2 given by Similar to [36], [37], we convert the complex vectors and matrices to real-valued ones, i.e.,          Note that w T ˜ ˘ Pw = || ˜ ˘ P 1 / 2 w || 2 2 , where ˜ ˘ P 1 / 2 is the principal square root of ˜ ˘ P . The convex optimization problem in Eq. (36) can be equivalently written as  where ˘ q T 1 = [ 1 1 0 0 ] , ˘ Φ 2 ,i is the i -th column of ˘ Φ 2 , and ˇ Φ 2 ,i is the i -th column of ˇ Φ 2 . Note that the problem in Eq. (65) is a standard-form SOCP problem [38]. ## ACKNOWLEDGMENT The authors would like to thank Dr. Meng Guo for his helpful comments and suggestions. ## REFERENCES - [1] R. C. Hendriks, T. Gerkmann, and J. Jensen, DFT-Domain Based SingleMicrophone Noise Reduction for Speech Enhancement: A Survey of the State of the Art . Morgan & Claypool, 2013. - [3] K. Eneman et al., 'Evaluation of signal enhancement algorithms for hearing instruments,' in EURASIP Europ. Signal Process. Conf. (EUSIPCO) , Aug. 2008. - [2] M. Brandstein and D. Ward (Eds.), Microphone arrays: signal processing techniques and applications . Springer, 2001. - [4] S. Doclo and M. Moonen, 'GSVD-based optimal filtering for single and multimicrophone speech enhancement,' IEEE Trans. Signal Process. , vol. 50, no. 9, pp. 2230-2244, Sept. 2002. - [5] A. Spriet, M. Moonen, and J. Wouters, 'Spatially pre-processed speech distortion weighted multi-channel Wiener filtering for noise reduction,' Signal Process. , vol. 84, no. 12, pp. 2367-2387, Dec. 2004. - [7] B. D. Van Veen and K. M. Buckley, 'Beamforming: A versatile approach to spatial filtering,' IEEE ASSP Mag. , vol. 5, no. 5, pp. 4-24, Apr. 1988. - [6] J. Capon, 'High-resolution frequency-wavenumber spectrum analysis,' Proc. IEEE , vol. 57, no. 8, pp. 1408-1418, Aug. 1969. - [8] O. L. Frost III, 'An algorithm for linearly constrained adaptive array processing,' Proceedings of the IEEE , vol. 60, no. 8, pp. 926-935, Aug. 1972. - [10] T. Van den Bogaert, T. J. Klasen, L. Van Deun, J. Wouters, and M. Moonen, 'Horizontal localization with bilateral hearing aids: without is better than with,' J. Acoust. Soc. Amer. , vol. 119, no. 1, pp. 515-526, Jan. 2006. - [9] J. M. Kates, Digital hearing aids . Plural publishing, 2008. - [11] A. W. Bronkhorst, 'The cocktail party phenomenon: A review of research on speech intelligibility in multiple-talker conditions,' Acta Acoustica , vol. 86, no. 1, pp. 117-128, 2000. - [13] H. Schmidt, A. B. Baggeroer, W. A. Kuperman, and E. K. Scheer, 'Environmentally tolerant beamforming for high-resolution matched field processing: deterministic mismatch,' J. Acoust. Soc. Amer. , vol. 88, no. 4, Oct. 1990. - [12] S. Markovich, S. Gannot, and I. Cohen, 'Multichannel eigenspace beamforming in a reverberant noisy environment with multiple interfering speech signals,' IEEE Trans. Audio, Speech, Language Process. , vol. 17, no. 6, pp. 1071-1086, Aug. 2009. - [14] S. A. Vorobyov, 'Principles of minimum variance robust adaptive beamforming design,' ELSEVIER Signal Process. , vol. 93, no. 12, pp. 3264-3277, Dec. 2013. - [16] S. Gazor and W. Zhang, 'Speech probability distribution,' IEEE Signal Process. Lett. , vol. 10, no. 7, pp. 204-207, Jul. 2003. - [15] R. C. Hendriks, R. Heusdens, U. Kjems, and J. Jensen, 'On optimal multichannel mean-squared error estimators for speech enhancement,' IEEE Signal Process. Lett. , vol. 16, no. 10, pp. 885-888, Oct. 2009. - [17] R. Martin, 'Speech enhancement based on minimum mean-square error estimation and supergaussian priors,' IEEE Trans. Speech Audio Process. , vol. 13, no. 5, pp. 845-856, Sep. 2005. - [19] P. Vary and R. Martin, Digital speech transmission: Enhancement, coding and error concealment . John Wiley & Sons, 2006. - [18] J. S. Erkelens, R. C. Hendriks, R. Heusdens, and J. Jensen, 'Minimum mean-square error estimation of discrete Fourier coefficients with generalized gamma priors,' IEEE Trans. Audio, Speech, Language Process. , vol. 15, no. 6, pp. 1741-1752, Aug. 2007. - [20] S. Doclo, W. Kellermann, S. Makino, and S. Nordholm, 'Multichannel signal enhancement algorithms for assisted listening devices,' IEEE Signal Process. Mag. , vol. 32, no. 2, pp. 18-30, Mar. 2015. - [22] T. Klasen, T. Van den Bogaert, M. Moonen, and J. Wouters, 'Binaural noise reduction algorithms for hearing aids that preserve interaural time delay cues,' IEEE Trans. Signal Process. , vol. 55, no. 4, pp. 1579-1585, Apr. 2007. - [21] T. J. Klasen, T. Van den Bogaert, M. Moonen, and J. Wouters, 'Preservation of interaural time delay for binaural hearing aids through multichannel Wiener filtering based noise reduction,' in IEEE Int. Conf. Acoust., Speech, Signal Process. (ICASSP) , Mar. 2005, pp. 29-32. - [23] S. Doclo, T. J. Klasen, T. Van den Bogaert, J. Wouters, and M. Moonen, 'Theoretical analysis of binaural cue preservation using multi-channel Wiener filtering and interaural transfer functions,' in Int. Workshop Acoustic Echo, Noise Control (IWAENC) , Sep. 2006. - [25] D. Marquardt, E. Hadad, S. Gannot, and S. Doclo, 'Theoretical analysis of linearly constrained multi-channel Wiener filtering algorithms for combined noise reduction and binaural cue preservation in binaural hearing aids,' IEEE Trans. Audio, Speech, Language Process. , vol. 23, no. 12, Sept. 2015. - [24] T. Van den Bogaert, S. Doclo, J. Wouters, and M. Moonen, 'The effect of multimicrophone noise reduction systems on sound source localization by users of binaural hearing aids,' J. Acoust. Soc. Amer. , vol. 124, no. 1, pp. 484-497, July 2008. - [26] E. Hadad, S. Gannot, and S. Doclo, 'Binaural linearly constrained minimum variance beamformer for hearing aid applications,' in Int. Workshop Acoustic Signal Enhancement (IWAENC) , Sep. 2012, pp. 1-4. - [28] D. Marquardt, E. Hadad, S. Gannot, and S. Doclo, 'Optimal binaural lcmv beamformers for combined noise reduction and binaural cue preservation,' in Int. Workshop Acoustic Signal Enhancement (IWAENC) , Sep. 2014, pp. 288-292. - [27] E. Hadad, S. Doclo, and S. Gannot, 'The binaural LCMV beamformer and its performance analysis,' IEEE Trans. Audio, Speech, Language Process. , vol. 24, no. 3, pp. 543-558, Mar. 2016. - [29] A. I. Koutrouvelis, R. C. Hendriks, J. Jensen, and R. Heusdens, 'Improved multi-microphone noise reduction preserving binaural cues,' in IEEE Int. Conf. Acoust., Speech, Signal Process. (ICASSP) , Mar. 2016. - [31] B. Cornelis, S. Doclo, T. Van den Bogaert, M. Moonen, and J. Wouters, 'Theoretical analysis of binaural multimicrophone noise reduction techniques,' IEEE Trans. Audio, Speech, Language Process. , vol. 18, no. 2, pp. 342-355, Feb. 2010. - [30] E. Hadad, D. Marquardt, S. Doclo, and S. Gannot, 'Theoretical analysis of binaural transfer function MVDR beamformers with interference cue preservation constraints,' IEEE Trans. Audio, Speech, Language Process. , vol. 23, no. 12, pp. 2449-2464, Dec. 2015. - [32] J. G. Desloge, W. M. Rabinowitz, and P. M. Zurek, 'Microphone-array hearing aids with binaural output .I. Fixed-processing systems,' IEEE Trans. Speech Audio Process. , vol. 5, no. 6, pp. 529-542, Nov. 1997. - [34] F. L. Wightman and D. J. Kistler, 'The dominant role of low-frequency interaural time differences in sound localization,' J. Acoust. Soc. Amer. , vol. 91, no. 3, pp. 1648-1661, Mar. 1992. - [33] W. M. Hartmann, 'How we localize sound,' Physics Today , vol. 52, no. 11, pp. 24-29, Nov. 1999. - [35] D. P. Welker, J. E. Greenberg, J. G. Desloge, and P. M. Zurek, 'Microphone-array hearing aids with binaural output .II. A twomicrophone adaptive system,' IEEE Trans. Speech Audio Process. , vol. 5, no. 6, pp. 543-551, Nov. 1997. - [37] R. G. Lorenz and S. P. Boyd, 'Robust minimum variance beamforming,' IEEE Trans. Signal Process. , vol. 53, no. 5, pp. 1684-1696, May 2005. - [36] S. A. Vorobyov, A. B. Gershman, and Z. Q. Luo, 'Robust adaptive beamforming using worst-case performance optimization: A solution to the signal mismatch problem,' IEEE Trans. Signal Process. , vol. 51, no. 2, pp. 313-323, Feb. 2003. - [38] S. Boyd and L. Vandenberghe, Convex Optimization . Cambridge University Press, 2004. - [40] J. Allen, 'Short-term spectral analysis, and modification by discrete Fourier transform,' IEEE Trans. Acoust., Speech, Signal Process. , vol. 25, no. 3, pp. 235-238, June 1977. - [39] H. Kayser, S. Ewert, J. Annemuller, T. Rohdenburg, V. Hohmann, and B. Kollmeier, 'Database of multichannel in-ear and behind-the-ear headrelated and binaural room impulse responses,' EURASIP J. Advances Signal Process. , vol. 2009, pp. 1-10, Dec. 2009.

Rendering Paper...