Contact
Collaboratory
Website
Zheng-Hua's P1 Publications
Masked Autoencoders with Multi-Window Local-Global Attention Are Better Audio Learners
Sarthak Yadav, Sergios Theodoridis, Lars K. Hansen, Zheng-Hua Tan
The Effect of Training Dataset Size on Discriminative and Diffusion-Based Speech Enhancement Systems
Philippe Gonzalez, Zheng‐Hua Tan, Jan Østergaard, Jesper Jensen, Tommy S. Alstrøm, Tobias May
Speaker and Style Disentanglement of Speech Based on Contrastive Predictive Coding Supported Factorized Variational Autoencoder
Yuying Xie, Michael Kuhlmann, Frederik Rautenberg, Zheng-Hua Tan, Reinhold Haeb-Umbach
Self-supervised Pretraining for Robust Personalized Voice Activity Detection in Adverse Conditions
Holger S. Bovbjerg, Jesper Jensen, Jan Østergaard, Zheng-Hua Tan
PAC-Bayesian Error Bound, via Rényi Divergence, for a Class of Linear Time-Invariant State-Space Models
Deividas Eringis, John Leth, Zheng-Hua Tan, Rafal Wisniewski, Mihály Petreczky
PAC-Bayes Generalisation Bounds for Dynamical Systems Including Stable RNNs
Deividas Eringis, John Leth, Zheng-Hua Tan, Rafal Wisniewski, Mihály Petreczky
Near-End Listening Enhancement Using a Noise-Robust Linear Time-Invariant Filter
Filippo Villani, Wai-Yip Chan, Zheng-Hua Tan, Jan Østergaard, Jesper Jensen
Joint Minimum Processing Beamforming and Near-end Listening Enhancement
Andreas J. Fuglsig, Jesper Jensen, Zheng-Hua Tan, Lars S. Bertelsen, Jens C. Lindof, Jan Østergaard
Joint Far- and Near-end Speech and Listening Enhancement with Minimum Processing
Andreas J. Fuglsig, Zheng-Hua Tan, Lars S. Bertelsen, Jesper Jensen, Jens C. Lindof, Jan Østergaard
How to train your ears: Auditory-model emulation for large-dynamic-range inputs and mild-to-severe hearing losses
Peter A. L. Bysted, Jesper Jensen, Zheng-Hua Tan, Jan Østergaard, Lars Bramsløw
Generating Accurate and Diverse Audio Captions Through Variational Autoencoder Framework
Yiming Zhang, Ruoyi Du, Zheng-Hua Tan, Wenwu Wang, Zhanyu Ma
Extending battery life in CubeSats by charging current control utilizing a long short-term memory network for solar power predictions
Vaclav Knap, Gustav A.P. Bonvang, Frederik R. Fagerlund, Sune Krøyer, Kim Nguyen, Mathias Thorsager, Zheng-Hua Tan
Diffusion-Based Speech Enhancement in Matched and Mismatched Conditions Using a Heun-Based Sampler
Philippe Gonzalez, Zheng-Hua Tan, Jan Østergaard, Jesper Jensen, Tommy S. Alstrøm, Tobias May
Complex Recurrent Variational Autoencoder for Speech Resynthesis and Enhancement
Yuying Xie, Thomas Arildsen, Zheng-Hua Tan
Audio Mamba: Selective State Spaces for Self-Supervised Audio Representations
Sarthak Yadav, Zheng-Hua Tan
Minimum Processing Near-End Listening Enhancement
Andreas J. Fuglsig, Jesper Jensen, Zheng-Hua Tan, Lars S. Bertelsen, Jens C. Lindof, Jan Østergaard
Leveraging Domain Features for Detecting Adversarial Attacks Against Deep Speech Recognition in Noise
Christian H. Nielsen, Zheng-Hua Tan
Improving Label-Deficient Keyword Spotting Through Self-Supervised Pretraining
Holger S. Bovbjerg, Zheng-Hua Tan
Data-Driven Non-Intrusive Speech Intelligibility Prediction using Speech Presence Probability
Mathias B. Pedersen, Søren H. Jensen, Zheng-Hua Tan, Jesper Jensen
Filterbank Learning for Noise-Robust Small-Footprint Keyword Spotting
Iván López-Espejo, Ram C. M. C. Shekar, Zheng-Hua Tan, Jesper Jensen, John H. L. Hansen
Utilization of acoustic signals with generative Gaussian and autoencoder modeling for condition-based maintenance of injection moulds
Georg Ø. Rønsch, Iván López-Espejo, Daniel Michelsanti, Yuying Xie, Petar Popovski, Zheng-Hua Tan
Advanced Dropout: A Model-free Methodology for Bayesian Dropout Optimization
Jiyang Xie, Zhanyu Ma, Jianjun Lei, Guoqiang Zhang, Jing-Hao Xue, Zheng-Hua Tan, Jun Guo
Training Data-Driven Speech Intelligibility Predictors on Heterogeneous Listening Test Data
Mathias B. Pedersen, Asger H. Andersen, Søren H. Jensen, Zheng-Hua Tan, Jesper Jensen
The Minimum Overlap-Gap Algorithm for Speech Enhancement
Poul Hoang, Zheng-Hua Tan, Jan M. de Haan, Jesper Jensen
Joint Far- and Near-End Speech Intelligibility Enhancement Based on the Approximated Speech Intelligibility Index
Andreas J. Fuglsig, Jan Østergaard, Jesper Jensen, Lars S. Bertelsen, Peter Mariager, Zheng-Hua Tan
AoI and Throughput Optimization for Hybrid Traffic in Cellular Uplink Using Reinforcement Learning
Chien-Cheng Wu, Zheng-Hua Tan, Čedomir Stefanović
User Localization using RF Sensing: A Performance comparison between LIS and mmWave Radars
Cristian J. Vaca-Rubio, Dariush Salami, Petar Popovski, Elisabeth De Carvalho, Zheng-Hua Tan, Stephan Sigg
Floor Map Reconstruction Through Radio Sensing and Learning by a Large Intelligent Surface
Cristian J. Vaca-Rubio, Roberto Pereira, Xavier Mestre, David Gregoratti, Zheng-Hua Tan, Elisabeth De Carvalho, Petar Popovski
Summary on the ICASSP 2022 Multi-Channel Multi-Party Meeting Transcription Grand Challenge
Fan Yu, Shiliang Zhang, Pengcheng Guo, Yihui Fu, Zhihao Du, Siqi Zheng, Weilong Huang, Lei Xie, Zheng-Hua Tan, DeLiang Wang, Yanmin Qian, Kong A. Lee, Zhijie Yan, Bin Ma, Xin Xu, Hui Bu
An Experimental Study on Light Speech Features for Small-Footprint Keyword Spotting
Iván López-Espejo, Zheng-Hua Tan, Jesper Jensen
A parameter-conditional neural network framework for modelling parameterized auditory models
Peter A. L. Bysted, Jesper Jensen, Zheng-Hua Tan, Jan Østergaard, Lars Bramsløw
Multichannel Speech Enhancement With Own Voice-Based Interfering Speech Suppression for Hearing Assistive Devices
Poul Hoang, Jan M. de Haan, Zheng-Hua Tan, Jesper Jensen
iVAE-GAN: Identifiable VAE-GAN Models for Latent Representation Learning
Bjørn U. Dideriksen, Kristoffer Derosche, Zheng-Hua Tan
Deep Spoken Keyword Spotting: An Overview
Ivan Lopez-Espejo, Zheng-Hua Tan, John H. L. Hansen, Jesper Jensen