IWAENC 2016: Program

Sept. 14, 2016 (Wed.)

09:30 - 10:30 [Plenary talk 1] Statistical parametric speech processing - Golden hall
Mads Græsbøll Christensen - Chair: Walter Kellermann

11:00 - 12:00 [Plenary talk 2] Estimating power spectral density for acoustic signal enhancement - an effective approach for practical applications - Golden hall
Yusuke Hioka - Chair: Andy W. H. Khong

14:30 - 15:30 [Plenary talk 3] Acoustic signal enhancement under variable speed of sound conditions - Golden hall
Rudolf Rabenstein - Chair: Thushara Abhayapala

15:30 - 17:00 Session SPS - Special session on echo cancellation dedicated to S. Grant - Golden hall
Chair: Eric Diethorn, mh acoustics LLC
15:30 - 16:00 Session SPS - 01 (special talk) - Make it work, make it fast, make a difference - A tribute to Dr. Steven L. Grant
Pratik Shah
16:00 - 16:20 Session SPS - 02 Inter-channel coherence reduction method for stereophonic acoustic echo cancellation
Tomas Gaensler and Eric Diethorn
mh acoustics LLC
16:20 - 16:40 Session SPS - 03 The block-sparse proportionate second-order Volterra filtering algorithms for nonlinear echo cancellation
Jianming Liu1, Quintin Liu1, Steven Grant2, Yahong Rosa Zheng2
1ForteMedia Inc, 2Missouri University of Science and Technology
16:40 - 17:00 Session SPS - 04 Insight into linear periodically time-varying coherence reduction methods for stereophonic acoustic echo cancellation
Maria Luis Valero and Emanuel Habets
International Audio Laboratories Erlangen

19:30 - 21:30 Session PS-I - Poster session 1 - Golden hall
Chair: Changchun Bao
Session PS-I - 01 Subspace superdirective beamforming with uniform circular microphone arrays
Gongping Huang1, Jacob Benesty2, Jingdong Chen3
1Center of Intelligent Acoustics and Immersive Communications, Northwestern Polytechnical University, 2INRS-EMT, University of Quebec, 3Center of Immersive and Intelligent Acoustics, Northwestern Polytechnical University
Session PS-I - 02 Dual-microphone phase-difference-based SNR estimation
with applications to speech enhancement
Fred Mustiere, Renato Nakagawa, Kamil Wojcicki, Ivo Merks, Tao Zhang
Starkey Hearing Technologies
Session PS-I - 03 Cue-preserving MMSE filter for binaural speech enhancement
Gerald Enzner, Masoumeh Azarpour, Jan Siska
Ruhr-University Bochum
Session PS-I - 04 Robust superdirective beamformer with optimal regularization
Aviva Atkins1, Yuval Ben-Hur1, Israel Cohen1, Jacob Benesty2
1Department of Electrical Engineering, Technion - Israel Institute of Technology, 2INRS-EMT, University of Quebec
Session PS-I - 05 Affine projection algorithm for acoustic feedback cancellation using prediction error method in hearing aids
Linh Tran, Hai Dam, Sven Nordholm
Curtin University
Session PS-I - 06 First-order differential microphone arrays from a time-domain broadband perspective
Yaakov Buchris1, Israel Cohen2, Jacob Benesty3
1Technion, Israel Institute of Technology, 2Technion - Israel Institute of Technology, 3INRS-EMT
Session PS-I - 07 A directionally constrained distortionless multistage LCMV beamformer
Daniel Wolff1, Yaakov Buchris1, Israel Cohen2
1Technion, Israel Institute of Technology, 2Technion - Israel Institute of Technology
Session PS-I - 08 Oracle performance investigation of the ideal masks
Ziteng Wang, Xiaofei Wang, Xu Li, Qiang Fu, Yonghong Yan
Institute of Acoustics, Chinese Academy of Sciences
Session PS-I - 09 Student's t multichannel nonnegative matrix factorization for blind source separation
Koichi Kitamura, Yoshiaki Bando, Katsutoshi Itoyama, Kazuyoshi Yoshii
Kyoto University
Session PS-I - 10 Solving permutation problem with a cascade combination of phase difference entropy and power spectral correlation
Masahito Togami, Ryoichi Takashima, Yusuke Fujita
Hitachi, Ltd.
Session PS-I - 11 Deep sparse rectifier neural networks for speech denoising
Lie Xu, Chiu Sing Choy, Yiwen Li
Department of Electronic Engineering, the Chinese University of Hong Kong, HK
Session PS-I - 12 Recursive implementations of informed spatial filters
Maja Taseska, Reza Varzandeh, Emanuël Habets
International Audio Laboratories Erlangen
Session PS-I - 13 Artificial bandwidth extension using deep neural networks for spectral envelope estimation
Johannes Abel, Maximilian Strake, Tim Fingscheidt
Technische Universität Braunschweig
Session PS-I - 14 Multi-speaker DOA estimation in reverberant conditions using expectation-maximization
Ofer Schwartz1, Yuval Dorfan1, Emanuel Habets2, Sharon Gannot3
1Bar-Ilan University, Israel, 2International Audio Laboratories Erlangen, Germany, 3Bar-Ilan University
Session PS-I - 15 On the evaluation of multichannel blind system identification from the viewpoint of system equalization
Wancheng Zhang1, Patrick Naylor2, Zunwen He1, Yan Zhang1
1Beijing Institute of Technology, 2Imperial College London
Session PS-I - 16 An iterative method for equalization of multichannel acoustic systems robust to system identification errors
Wancheng Zhang1 and Patrick Naylor2
1Beijing Institute of Technology, 2Imperial College London
Session PS-I - 17 Spherical microphone array post-filtering for reverberation suppression using isotropic beamformings
Yuhei Yamamoto and Yoichi Haneda
The University of Electro-Communications
Session PS-I - 18 Modeling audio directional statistics using a probabilistic spatial dictionary for speaker diarization in real meetings
Mahmoud Fakhry, Nobutaka Ito, Shoko Araki, Tomohiro Nakatani
NTT Corporation
Session PS-I - 19 Direction of arrival estimation based on joint diagonalization of matrices in different direct-to-reverberation ratios
Ryusuke Tanaka and Yoichi Haneda
Graduate School of Informatics and Engineering, The University of Electro-Communications
Session PS-I - 20 Dynamic group sparsity for non-negative
matrix factorization with application to unsupervised source separation
Xu Li, Xiaofei Wang, Qiang Fu, Yonghong Yan
Institute of Acoustics, Chinese Academy of Sciences
Session PS-I - 21 Variable step-size diffusion proportionate affine projection algorithm
Juan Shi, Jingen Ni, Xiaoping Chen
Soochow University
Session PS-I - 22 Hammerstein model-based nonlinear echo cancellation using a cascade of neural network and adaptive linear filter
Jiri Malek and Zbynek Koldovsky
Technical University of Liberec
Session PS-I - 23 Performance comparison of intrusive and non-intrusive instrumental quality measures for enhanced speech
Anderson R. Avila1, Benjamin Cauchi2, Simon Doclo3, Tiago Falk4, Stefan Goetze5
1Universidade Federal do ABC, 2Fraunhofer IDMT, Project Group Hearing, Speech and Audio Technology, 3University of Oldenburg, Dept. of Medical Physics and Acoustics, 4INRS-EMT, University of Quebec, 5Fraunhofer IDMT, Project Group Hearing Speech and Audio Technology
Session PS-I - 24 HRTF-based robust least-squares frequency-invariant polynomial beamforming
Hendrik Barfuss, Marcel Mueglich, Walter Kellermann
Friedrich-Alexander University Erlangen-Nürnberg
Session PS-I - 25 A contingency multi-microphone noise reduction strategy based on linearly constrained multi-channel Wiener filtering
Randall Ali and Marc Moonen
KU Leuven
Session PS-I - 26 Application of neural network to source PSD estimation for Wiener filter based array sound source enhancement
Tomoko Kawase1, Kenta Niwa1, Kazunori Kobayashi1, Yusuke Hioka2
1NTT Media Intelligence Laboratories, 2University of Auckland
Session PS-I - 27 PSD estimation in beamspace using property of M-matrix
Kenta Niwa1, Tomoko Kawase1, Kazunori Kobayashi1, Yusuke Hioka2
1NTT Media Intelligence Laboratories, 2University of Auckland
Session PS-I - 28 Speech enhancement using a microphone array mounted on an unmanned aerial vehicle
Yusuke Hioka, Michael Kingan, Gian Schmid, Karl A.
Stol
University of Auckland
Session PS-I - 29 Bi-magnitude processing framework for nonlinear acoustic echo cancellation on Android devices
Yiteng Huang, Jan Skoglund, Alejandro Luebs
Google Inc.
Session PS-I - 30 Voice activity detection based on statistical likelihood ratio with adaptive thresholding
Xiaofei Li1, Radu Horaud1, Laurent Girin2, Sharon Gannot3
1INRIA, Grenoble, 2INRIA Grenoble, GIPSA-Lab & University Grenoble Alpes, 3Bar-Ilan University

Sept. 15, 2016 (Thu.)

08:00 - 09:00 [Plenary talk 4] Audio challenges in virtual and augmented reality devices - Golden hall
Ivan Tashev - Chair: Rainer Martin

09:40 - 10:40 [Plenary talk 5] Microphone array power ratio for speech quality assessment in noisy reverberant environments - Golden hall
Israel Cohen - Chair: Sharon Gannot

10:40 - 12:00 Session SPC-I - Student paper competition 1 - Golden hall
Chair: Tomas Gaensler
10:40 - 11:00 Session SPC-I - 01 A tunable beamformer for robust superdirective beamforming
Reuven Berkun1, Israel Cohen1, Jacob Benesty2
1Technion - Israel Institute of Technology, 2INRS-EMT, University of Quebec
11:00 - 11:20 Session SPC-I - 02 Convolutive blind source separation with low latency
Jiawen Chua, Ganlong Wang, Bastiaan Kleijn
Victoria University of Wellington
11:20 - 11:40 Session SPC-I - 03 Anechoic phase estimation from reverberant signals
Arthur Belhomme1, Yves Grenier2, Roland Badeau2, Eric Humbert3
1Telecom ParisTech, 2LTCI CNRS Telecom ParisTech, 3Invoxia
11:40 - 12:00 Session SPC-I - 04 Structured total least squares based internal delay estimation for distributed microphone auto-localization
Jie Zhang, Richard Hendriks, Richard Heusdens
Delft University of Technology

Sept. 16, 2016 (Fri.)

08:30 - 10:30 Session SPC-II - Student paper competition 2 - Splendid hall (Jinxiuhuating)
Chair: Shoji Makino
08:30 - 08:50 Session SPC-II - 01 Head-orientation compensation with video-informed single channel speech enhancement
Soumitro Chakrabarty, Deepth Pilakeezhu, Emanuël A. P.
Habets
International Audio Laboratories Erlangen
08:50 - 09:10 Session SPC-II - 02 An intelligibility metric based on a simple model of speech communication
Steven Van Kuyk1, Bastiaan Kleijn1, Richard Hendriks2
1Victoria University of Wellington, 2Delft University of Technology
09:10 - 09:30 Session SPC-II - 03 A phoneme-based pre-training approach for deep neural network with application to speech enhancement
Shlomo E. Chazan, Sharon Gannot, Jacob Goldberger
Bar-Ilan University
09:30 - 09:50 Session SPC-II - 04 Efficient initialization for nonnegative matrix factorization based on nonnegative independent component analysis
Daichi Kitamura1 and Nobutaka Ono2
1SOKENDAI (The Graduate University for Advanced Studies), 2National Institute of Informatics
09:50 - 10:10 Session SPC-II - 05 Speech enhancement using ARCH model
Israel Cohen
Department of Electrical Engineering, Technion - Israel Institute of Technology
10:10 - 10:30 Session SPC-II - 06 Extraction of exterior field from a mixed sound field for 2D height-invariant sound propagation
Abdullah Fahim, Prasanga Samarasinghe, Thushara Abhayapala
The Australian National University

11:00 - 12:00 [Tutorial] DNN and speech enhancement - Splendid hall (Jinxiuhuating)
Deliang Wang - Chair: Jie Chen

14:00 - 16:00 Session PS-II - Poster Session 2 - Auspicious hall (Jixiangruyi)
Chair: Simon Doclo
Session PS-II - 01 New method for synthesizing personalized head-related transfer function
Lei Wang and Xiangyang Zeng
Northwestern Polytechnical University
Session PS-II - 02 Ego-noise reduction for a hose-shaped rescue robot using determined rank-1 multichannel nonnegative matrix factorization
Moe Takakusaki1, Daichi Kitamura2, Nobutaka Ono3, Takeshi Yamada1, Shoji Makino1, Hiroshi Saruwatari4
1University of Tsukuba, 2SOKENDAI (The Graduate University for Advanced Studies), 3National Institute of Informatics, 4The University of Tokyo
Session PS-II - 03 An improved soft decision based noise power estimation employing adaptive
prior and conditional smoothing
Pei Chee Yong and Sven Nordholm
Curtin University
Session PS-II - 04 Sparseness-based multichannel nonnegative matrix factorization for blind source separation
Takuya Higuchi1, Takuya Yoshioka1, Tomohiro Nakatani2
1NTT, 2NTT Corporation
Session PS-II - 05 Assessing the segmental contribution to the non-intrusive intelligibility prediction of noise-suppressed speech
Lei Wang and Fei Chen
Southern University of Science and Technology
Session PS-II - 06 Acoustic feedback cancellation for a multi-microphone earpiece based on a null-steering beamformer
Henning Schepker1, Linh T. T. Tran2, Sven Nordholm2, Simon Doclo1
1University of Oldenburg, Department of Medical Physics and Acoustics, Signal Processing Group, Oldenburg, Germany, 2Department of Electrical and Computer Engineering, Curtin University, Bentley, WA, Australia
Session PS-II - 07 A computationally cheaper method for blind speech separation based on AuxIVA and incomplete demixing transform
Jakub Janský1, Zbyněk Koldovský1, Nobutaka Ono2
1Technical University of Liberec, 2National Institute of Informatics
Session PS-II - 08 Constrained multi-channel linear prediction for adaptive speech dereverberation
Ante Jukić1, Zichao Wang2, Toon van Waterschoot3, Timo Gerkmann4, Simon Doclo1
1University of Oldenburg, 2Rice University, 3KU Leuven, 4Technicolor Research and Innovation
Session PS-II - 09 Improved nonnegative adaptive filtering algorithms
Kai Zhao, Jingen Ni, Xiaoping Chen
Soochow University
Session PS-II - 10 Under-modelled blind system identification for time delay estimation in reverberant environments
Wei Xue1, Mike Brookes2, Patrick A. Naylor2
1Imperial College London, 2Dept.
of Electrical and Electronic Engineering, Imperial College London, UK
Session PS-II - 11 On directivity of a circular array with directional microphones
Hongsen He1, Xiaojun Qiu2, Tao Yang1
1Southwest University of Science and Technology, 2Nanjing University
Session PS-II - 12 A real-time noise energy estimation method
Yaodu Wei, Li Liu, Lizhong Wang
Beijing Samsung Telecom R&D Center
Session PS-II - 13 Performance analysis of a dual microphone superdirective beamformer and approximate expressions for the near-field propagation regime
Shmulik Markovich-Golan1, Dovid Levin2, Sharon Gannot2
1Faculty of Engineering, Bar-Ilan University, 2Faculty of Engineering, Bar-Ilan University
Session PS-II - 14 Multichannel time delay estimation for acoustic source localization via robust adaptive blind system identification
Hongsen He1, Jingdong Chen2, Jacob Benesty3, Tao Yang1
1Southwest University of Science and Technology, 2Northwestern Polytechnical University, 3University of Quebec
Session PS-II - 15 Approximate MVDR and MMSE beamformers exploiting scale-invariant reconstruction of signals on microphones
Zbyněk Koldovský1 and Francesco Nesta2
1Technical University of Liberec, 2Conexant System
Session PS-II - 16 Partitioned block frequency domain Kalman filter for multi-channel linear prediction based blind speech dereverberation
Thomas Dietzen1, Ann Spriet2, Wouter Tirry2, Simon Doclo3, Marc Moonen4, Toon van Waterschoot4
1NXP Software, 2NXP Software, Leuven, Belgium, 3University of Oldenburg, Dept. of Medical Physics and Acoustics and the Cluster of Excellence Hearing4All, Oldenburg, Germany, 4KU Leuven, Dept.
of Electrical Engineering (ESAT), STADIUS Center for Dynamical Systems, Signal Processing and Data Analytics, Leuven, Belgium
Session PS-II - 17 Late reverberation PSD estimation for single-channel dereverberation using relative convolutive transfer functions
Sebastian Braun1, Boaz Schwartz2, Sharon Gannot2, Emanuel Habets1
1International Audio Laboratories Erlangen, 2Bar-Ilan University
Session PS-II - 18 Increasing the environment-awareness of rake beamforming for directive acoustic sources
Pasi Pertilä1 and Alessio Brutti2
1Tampere University of Technology, 2FBK
Session PS-II - 19 An informed separation algorithm based on sound field mapping for speech recognition systems
Dejan Markovic1, Jigyasa Popat2, Fabio Antonacci1, Augusto Sarti1, T. Kishore Kumar2
1Politecnico di Milano, 2National Institute of Technology Warangal
Session PS-II - 20 A robust data-independent near-field beamformer for linear microphone arrays
Federico Borra, Lucio Bianchi, Fabio Antonacci, Augusto Sarti, Stefano Tubaro
Politecnico di Milano
Session PS-II - 21 Room transfer function measurement from a directional loudspeaker
Prasanga Samarasinghe and Thushara Abhayapala
Australian National University

16:30 - 18:30 Session PS-III - Poster Session 3 - Auspicious hall (Jixiangruyi)
Chair: Emanuel Habets
Session PS-III - 01 Fast simulation method for room impulse responses based on the mirror image source assumption
Jia Yan1 and Bastiaan Kleijn2
1Technology University of Delft, 2Victoria University of Wellington
Session PS-III - 02 Discriminative and reconstructive basis training for audio source separation with semi-supervised nonnegative matrix factorization
Daichi Kitamura1, Nobutaka Ono2, Hiroshi Saruwatari3, Yu Takahashi4, Kazunobu Kondo4
1SOKENDAI (The Graduate University for Advanced Studies), 2National Institute of Informatics, 3The University of Tokyo, 4Yamaha Corporation
Session PS-III - 03 Binaural speech enhancement using a codebook based approach
Mathew Shaji Kavalekalam1, Mads
Græsbøll Christensen1, Jesper Boldt2
1Aalborg University, 2GN Resound
Session PS-III - 04 Noise reduction using independent vector analysis and noise cancellation for a hose-shaped rescue robot
Masaru Ishimura1, Shoji Makino1, Takeshi Yamada1, Nobutaka Ono2, Hiroshi Saruwatari3
1University of Tsukuba, 2National Institute of Informatics, 3The University of Tokyo
Session PS-III - 05 Statistical analysis and improvement of coherent-to-diffuse power ratio estimators for dereverberation
Chengshi Zheng1, Xiaodong Li1, Andreas Schwarz2, Walter Kellermann2
1Institute of Acoustics, Chinese Academy of Sciences, 2Chair of Multimedia Communications and Signal Processing, Friedrich-Alexander-Universität Erlangen-Nürnberg
Session PS-III - 06 ALE for robots! A single-channel approach to robot self-noise cancellation
Jalal Taghia, Dorothea Kolossa, Rainer Martin
Ruhr-Universität Bochum
Session PS-III - 07 The open-set problem in acoustic scene classification
Daniele Battaglino1, Ludovick Lepauloux2, Nicholas Evans1
1Institut Eurecom at Sophia Antipolis, 2NXP Software
Session PS-III - 08 Robust TDOA-based joint source and microphone localization in a reverberant environment using medians of acceptable recovered TOAs
Trung-Kien Le and Nobutaka Ono
National Institute of Informatics
Session PS-III - 09 Spherical harmonic Rake receivers for dereverberation
Hamza Javed, Alastair Moore, Patrick Naylor
Imperial College London
Session PS-III - 10 Linear prediction based dereverberation for spherical microphone arrays
Alastair Moore and Patrick Naylor
Imperial College London
Session PS-III - 11 A multiframe parametric Wiener filter for acoustic echo suppression
Hai Huang1, Christian Hofmann2, Walter Kellermann2, Jingdong Chen1, Jacob Benesty3
1Northwestern Polytechnical University, 2University of Erlangen-Nuremberg, 3INRS-EMT
Session PS-III - 12 A modified a priori SER for acoustic echo suppression using Wiener filter
Ying Tong1 and Yaping Gu2
1Institute of Acoustics, Chinese Academy of
Sciences, 2Shanghai Acoustics Laboratory
Session PS-III - 13 Mask estimate through Itakura-Saito nonnegative RPCA for speech enhancement
Gang Min, Xiongwei Zhang, Xia Zou, Meng Sun
PLA University of Science and Technology
Session PS-III - 14 Relative impulse response estimation during doubletalk with an artificial neural network-based step size control
Stefan Meier and Walter Kellermann
Friedrich-Alexander-Universität Erlangen-Nürnberg (FAU)
Session PS-III - 15 Perceptual improvement of deep neural networks for monaural speech enhancement
Wei Han, Xiongwei Zhang, Meng Sun, Wenhua Shi, Xushan Chen
PLA University of Science and Technology
Session PS-III - 16 Synthesis of device-independent noise corpora for speech quality assessment
Hannes Gamper1, Lyle Corbin2, David Johnston1, Ivan Tashev1
1Microsoft Research, 2Microsoft
Session PS-III - 17 An experimental study of noise on the performance of a low bit rate parametric speech coder
Wenhua Shi
PLAUST
Session PS-III - 18 Approximation of a nonlinear distortion function for combined linear and nonlinear residual echo suppression
Ingo Schalk-Schupp1, Friedrich Faubel1, Markus Buck1, Andreas Wendemuth2
1Nuance Communications Deutschland GmbH, 2Otto-von-Guericke-Universität Magdeburg
Session PS-III - 19 Immersive audio for human-machine interface of unmanned ground vehicles
Vinay Kothapally, Steven Grant, Rosa Y. Zheng
Missouri University of Science & Technology
Session PS-III - 20 On pre-filtering strategies for the GCC-PHAT algorithm
Hong-Goo Kang1, Michael Graczyk2, Jan Skoglund2
1Yonsei University, 2Google
Session PS-III - 21 Evaluation of spatial active noise cancellation performance using spherical harmonic analysis
Hanchi Chen, Jihui Zhang, Prasanga Samarasinghe, Thushara D. Abhayapala
Australian National University