Zheng-Hua Tan

Contact

zt@es.aau.dk

Website

vbn.aau.dk

Zheng-Hua's P1 Publications

Masked Autoencoders with Multi-Window Local-Global Attention Are Better Audio Learners

Sarthak Yadav, Sergios Theodoridis, Lars K. Hansen, Zheng-Hua Tan

The Effect of Training Dataset Size on Discriminative and Diffusion-Based Speech Enhancement Systems
Philippe Gonzalez, Zheng‐Hua Tan, Jan Østergaard, Jesper Jensen, Tommy S. Alstrøm, Tobias May
Speaker and Style Disentanglement of Speech Based on Contrastive Predictive Coding Supported Factorized Variational Autoencoder
Yuying Xie, Michael Kuhlmann, Frederik Rautenberg, Zheng-Hua Tan, Reinhold Haeb-Umbach
Self-supervised Pretraining for Robust Personalized Voice Activity Detection in Adverse Conditions
Holger S. Bovbjerg, Jesper Jensen, Jan Østergaard, Zheng-Hua Tan
PAC-Bayesian Error Bound, via Rényi Divergence, for a Class of Linear Time-Invariant State-Space Models
Deividas Eringis, John Leth, Zheng-Hua Tan, Rafal Wisniewski, Mihály Petreczky
PAC-Bayes Generalisation Bounds for Dynamical Systems Including Stable RNNs
Deividas Eringis, John Leth, Zheng-Hua Tan, Rafal Wisniewski, Mihály Petreczky
Near-End Listening Enhancement Using a Noise-Robust Linear Time-Invariant Filter
Filippo Villani, Wai-Yip Chan, Zheng-Hua Tan, Jan Østergaard, Jesper Jensen
Joint Minimum Processing Beamforming and Near-end Listening Enhancement
Andreas J. Fuglsig, Jesper Jensen, Zheng-Hua Tan, Lars S. Bertelsen, Jens C. Lindof, Jan Østergaard
Joint Far- and Near-end Speech and Listening Enhancement with Minimum Processing
Andreas J. Fuglsig, Zheng-Hua Tan, Lars S. Bertelsen, Jesper Jensen, Jens C. Lindof, Jan Østergaard
How to train your ears: Auditory-model emulation for large-dynamic-range inputs and mild-to-severe hearing losses
Peter A. L. Bysted, Jesper Jensen, Zheng-Hua Tan, Jan Østergaard, Lars Bramsløw
Generating Accurate and Diverse Audio Captions Through Variational Autoencoder Framework
Yiming Zhang, Ruoyi Du, Zheng-Hua Tan, Wenwu Wang, Zhanyu Ma
Extending battery life in CubeSats by charging current control utilizing a long short-term memory network for solar power predictions
Vaclav Knap, Gustav A.P. Bonvang, Frederik R. Fagerlund, Sune Krøyer, Kim Nguyen, Mathias Thorsager, Zheng-Hua Tan
Diffusion-Based Speech Enhancement in Matched and Mismatched Conditions Using a Heun-Based Sampler
Philippe Gonzalez, Zheng-Hua Tan, Jan Østergaard, Jesper Jensen, Tommy S. Alstrøm, Tobias May
Complex Recurrent Variational Autoencoder for Speech Resynthesis and Enhancement
Yuying Xie, Thomas Arildsen, Zheng-Hua Tan
Audio Mamba: Selective State Spaces for Self-Supervised Audio Representations
Sarthak Yadav, Zheng-Hua Tan
Minimum Processing Near-End Listening Enhancement
Andreas J. Fuglsig, Jesper Jensen, Zheng-Hua Tan, Lars S. Bertelsen, Jens C. Lindof, Jan Østergaard
Leveraging Domain Features for Detecting Adversarial Attacks Against Deep Speech Recognition in Noise
Christian H. Nielsen, Zheng-Hua Tan
Improving Label-Deficient Keyword Spotting Through Self-Supervised Pretraining
Holger S. Bovbjerg, Zheng-Hua Tan
Data-Driven Non-Intrusive Speech Intelligibility Prediction using Speech Presence Probability
Mathias B. Pedersen, Søren H. Jensen, Zheng-Hua Tan, Jesper Jensen
Filterbank Learning for Noise-Robust Small-Footprint Keyword Spotting
Iván López-Espejo, Ram C. M. C. Shekar, Zheng-Hua Tan, Jesper Jensen, John H. L. Hansen
Utilization of acoustic signals with generative Gaussian and autoencoder modeling for condition-based maintenance of injection moulds
Georg Ø. Rønsch, Iván López-Espejo, Daniel Michelsanti, Yuying Xie, Petar Popovski, Zheng-Hua Tan
Advanced Dropout: A Model-free Methodology for Bayesian Dropout Optimization
Jiyang Xie, Zhanyu Ma, Jianjun Lei, Guoqiang Zhang, Jing-Hao Xue, Zheng-Hua Tan, Jun Guo
Training Data-Driven Speech Intelligibility Predictors on Heterogeneous Listening Test Data
Mathias B. Pedersen, Asger H. Andersen, Søren H. Jensen, Zheng-Hua Tan, Jesper Jensen
The Minimum Overlap-Gap Algorithm for Speech Enhancement
Poul Hoang, Zheng-Hua Tan, Jan M. de Haan, Jesper Jensen
Joint Far- and Near-End Speech Intelligibility Enhancement Based on the Approximated Speech Intelligibility Index
Andreas J. Fuglsig, Jan Østergaard, Jesper Jensen, Lars S. Bertelsen, Peter Mariager, Zheng-Hua Tan
AoI and Throughput Optimization for Hybrid Traffic in Cellular Uplink Using Reinforcement Learning
Chien-Cheng Wu, Zheng-Hua Tan, Čedomir Stefanović
User Localization using RF Sensing: A Performance comparison between LIS and mmWave Radars
Cristian J. Vaca-Rubio, Dariush Salami, Petar Popovski, Elisabeth De Carvalho, Zheng-Hua Tan, Stephan Sigg
Floor Map Reconstruction Through Radio Sensing and Learning by a Large Intelligent Surface
Cristian J. Vaca-Rubio, Roberto Pereira, Xavier Mestre, David Gregoratti, Zheng-Hua Tan, Elisabeth De Carvalho, Petar Popovski
Summary on the ICASSP 2022 Multi-Channel Multi-Party Meeting Transcription Grand Challenge
Fan Yu, Shiliang Zhang, Pengcheng Guo, Yihui Fu, Zhihao Du, Siqi Zheng, Weilong Huang, Lei Xie, Zheng-Hua Tan, DeLiang Wang, Yanmin Qian, Kong A. Lee, Zhijie Yan, Bin Ma, Xin Xu, Hui Bu
An Experimental Study on Light Speech Features for Small-Footprint Keyword Spotting
Iván López-Espejo, Zheng-Hua Tan, Jesper Jensen
A parameter-conditional neural network framework for modelling parameterized auditory models
Peter A. L. Bysted, Jesper Jensen, Zheng-Hua Tan, Jan Østergaard, Lars Bramsløw
Multichannel Speech Enhancement With Own Voice-Based Interfering Speech Suppression for Hearing Assistive Devices
Poul Hoang, Jan M. de Haan, Zheng-Hua Tan, Jesper Jensen
iVAE-GAN: Identifiable VAE-GAN Models for Latent Representation Learning
Bjørn U. Dideriksen, Kristoffer Derosche, Zheng-Hua Tan
Deep Spoken Keyword Spotting: An Overview
Ivan Lopez-Espejo, Zheng-Hua Tan, John H. L. Hansen, Jesper Jensen

Zheng-Hua's Programs

View all People

People

P1 Collaboratory Co-Lead and Professor

Aalborg University

Contact

Website

Zheng-Hua's P1 Publications

Masked Autoencoders with Multi-Window Local-Global Attention Are Better Audio Learners

The Effect of Training Dataset Size on Discriminative and Diffusion-Based Speech Enhancement Systems

Speaker and Style Disentanglement of Speech Based on Contrastive Predictive Coding Supported Factorized Variational Autoencoder

Self-supervised Pretraining for Robust Personalized Voice Activity Detection in Adverse Conditions

PAC-Bayesian Error Bound, via Rényi Divergence, for a Class of Linear Time-Invariant State-Space Models

PAC-Bayes Generalisation Bounds for Dynamical Systems Including Stable RNNs

Near-End Listening Enhancement Using a Noise-Robust Linear Time-Invariant Filter

Joint Minimum Processing Beamforming and Near-end Listening Enhancement

Joint Far- and Near-end Speech and Listening Enhancement with Minimum Processing

How to train your ears: Auditory-model emulation for large-dynamic-range inputs and mild-to-severe hearing losses

Generating Accurate and Diverse Audio Captions Through Variational Autoencoder Framework

Extending battery life in CubeSats by charging current control utilizing a long short-term memory network for solar power predictions

Diffusion-Based Speech Enhancement in Matched and Mismatched Conditions Using a Heun-Based Sampler

Complex Recurrent Variational Autoencoder for Speech Resynthesis and Enhancement

Audio Mamba: Selective State Spaces for Self-Supervised Audio Representations

Minimum Processing Near-End Listening Enhancement

Leveraging Domain Features for Detecting Adversarial Attacks Against Deep Speech Recognition in Noise

Improving Label-Deficient Keyword Spotting Through Self-Supervised Pretraining

Data-Driven Non-Intrusive Speech Intelligibility Prediction using Speech Presence Probability

Filterbank Learning for Noise-Robust Small-Footprint Keyword Spotting

Utilization of acoustic signals with generative Gaussian and autoencoder modeling for condition-based maintenance of injection moulds

Advanced Dropout: A Model-free Methodology for Bayesian Dropout Optimization

Training Data-Driven Speech Intelligibility Predictors on Heterogeneous Listening Test Data

The Minimum Overlap-Gap Algorithm for Speech Enhancement

Joint Far- and Near-End Speech Intelligibility Enhancement Based on the Approximated Speech Intelligibility Index

AoI and Throughput Optimization for Hybrid Traffic in Cellular Uplink Using Reinforcement Learning

User Localization using RF Sensing: A Performance comparison between LIS and mmWave Radars

Floor Map Reconstruction Through Radio Sensing and Learning by a Large Intelligent Surface

Summary on the ICASSP 2022 Multi-Channel Multi-Party Meeting Transcription Grand Challenge

An Experimental Study on Light Speech Features for Small-Footprint Keyword Spotting

A parameter-conditional neural network framework for modelling parameterized auditory models

Multichannel Speech Enhancement With Own Voice-Based Interfering Speech Suppression for Hearing Assistive Devices

iVAE-GAN: Identifiable VAE-GAN Models for Latent Representation Learning

Deep Spoken Keyword Spotting: An Overview

Zheng-Hua's Programs

P1 Programs

Learning and Optimal Control in Dynamical Systems

P1 Programs

Arctic AI

P1 Programs

Bridging Minds and Machines: AI, HCI & Psychology

P1 Programs

Multimodal Anomaly Detection