Activities

Works (4)

AudioLDM 2: Learning Holistic Audio Generation With Self-Supervised Pretraining

IEEE/ACM Transactions on Audio, Speech, and Language Processing
2024 | Journal article
Contributors: Haohe Liu; Yi Yuan; Xubo Liu; Xinhao Mei; Qiuqiang Kong; Qiao Tian; Yuping Wang; Wenwu Wang; Yuxuan Wang; Mark D. Plumbley
Source: Crossref

Towards Generating Diverse Audio Captions via Adversarial Training

IEEE/ACM Transactions on Audio, Speech, and Language Processing
2024 | Journal article
Contributors: Xinhao Mei; Xubo Liu; Jianyuan Sun; Mark D. Plumbley; Wenwu Wang
Source: Crossref

WavCaps: A ChatGPT-Assisted Weakly-Labelled Audio Captioning Dataset for Audio-Language Multimodal Research

IEEE/ACM Transactions on Audio, Speech, and Language Processing
2024 | Journal article
Contributors: Xinhao Mei; Chutong Meng; Haohe Liu; Qiuqiang Kong; Tom Ko; Chengqi Zhao; Mark D. Plumbley; Yuexian Zou; Wenwu Wang
Source: Crossref

Automated audio captioning: an overview of recent progress and new challenges

EURASIP Journal on Audio, Speech, and Music Processing
2022-10-09 | Journal article
Contributors: Xinhao Mei; Xubo Liu; Mark D. Plumbley; Wenwu Wang
Source: Crossref