Gigaspeech: An evolving, multi-domain asr corpus with 10,000 hours of transcribed audio G Chen, S Chai, G Wang, J Du, WQ Zhang, C Weng, D Su, D Povey, ... arXiv preprint arXiv:2106.06909, 2021 | 165 | 2021 |
speechocean762: An open-source non-native english speech corpus for pronunciation assessment J Zhang, Z Zhang, Y Wang, Z Yan, Q Song, Y Huang, K Li, D Povey, ... arXiv preprint arXiv:2104.01378, 2021 | 64 | 2021 |
Data Augmentation For Children's Speech Recognition--The" Ethiopian" System For The SLT 2021 Children Speech Recognition Challenge G Chen, X Na, Y Wang, Z Yan, J Zhang, S Ma, Y Wang arXiv preprint arXiv:2011.04547, 2020 | 21 | 2020 |
Av-sepformer: Cross-attention sepformer for audio-visual target speaker extraction J Lin, X Cai, H Dinkel, J Chen, Z Yan, Y Wang, J Zhang, Z Wu, Y Wang, ... ICASSP 2023-2023 IEEE International Conference on Acoustics, Speech and …, 2023 | 12 | 2023 |
The smallrice submission to the dcase2021 task 4 challenge: A lightweight approach for semi-supervised sound event detection with unsupervised data augmentation H Dinkel, X Cai, Z Yan, Y Wang, J Zhang, Y Wang Detection and Classification of Acoustic Scenes and Events (DCASE) Challenge, 2021 | 11 | 2021 |
Pseudo strong labels for large scale weakly supervised audio tagging H Dinkel, Z Yan, Y Wang, J Zhang, Y Wang ICASSP 2022-2022 IEEE International Conference on Acoustics, Speech and …, 2022 | 8 | 2022 |
A large multi-modal ensemble for sound event detection H Dinkel, Z Yan, Y Wang, M Song, J Zhang, W Wang Detection and classification of acoustic scenes and events (DCASE) challenge, 2022 | 6 | 2022 |
A lightweight approach for semi-supervised sound event detection with unsupervised data augmentation H Dinkel, X Cai, Z Yan, Y Wang, J Zhang, Y Wang Proceedings of the 6th Workshop on Detection and Classification of Acoustic …, 2021 | 5 | 2021 |
Focus on the sound around you: Monaural target speaker extraction via distance and speaker information J Lin, P Wang, H Dinkel, J Chen, Z Wu, Z Yan, Y Wang, J Zhang, Y Wang arXiv preprint arXiv:2306.16241, 2023 | 4 | 2023 |
Unified keyword spotting and audio tagging on mobile devices with transformers H Dinkel, Y Wang, Z Yan, J Zhang, Y Wang ICASSP 2023-2023 IEEE International Conference on Acoustics, Speech and …, 2023 | 3 | 2023 |
Pepe: Plain efficient pretrained embeddings for sound event detection Y Wang, H Dinkel, Z Yan, J Zhang, Y Wang DCASE2023 Challenge, Tech. Rep, 2023 | 3 | 2023 |
UniKW-AT: Unified Keyword Spotting and Audio Tagging H Dinkel, Y Wang, Z Yan, J Zhang, Y Wang arXiv preprint arXiv:2209.11377, 2022 | 3 | 2022 |
Speech enhancement with an acoustic vector sensor: an effective adaptive beamforming and post-filtering approach YX Zou, P Wang, YQ Wang, CH Ritz, J Xi EURASIP Journal on Audio, Speech, and Music Processing 2014, 1-12, 2014 | 3 | 2014 |
Leveraging multi-task training and image retrieval with clap for audio captioning H Sun, Z Yan, Y Wang, H Dinkel, J Zhang, Y Wang Proc. Conf. Detection Classification Acoust. Scenes Events Challenge, 1-4, 2023 | 2 | 2023 |
The XIAOMITALKFREELY system for audio-visual speech recognition in MISP challenge 2021 Q Wang, X Cai, W Zhuang, Y Kong, Y Wang, J Wu, D Li, Z Yan, M Luo, ... Xiaomi_tr_task2. pdf, 2022 | 2 | 2022 |
An effective target speech enhancement with single acoustic vector sensor based on the speech time-frequency sparsity YX Zou, YQ Wang, P Wang, CH Ritz, J Xi 2014 19th International Conference on Digital Signal Processing, 547-551, 2014 | 2 | 2014 |
CED: Consistent ensemble distillation for audio tagging H Dinkel, Y Wang, Z Yan, J Zhang, Y Wang ICASSP 2024-2024 IEEE International Conference on Acoustics, Speech and …, 2024 | 1 | 2024 |
Streaming Audio Transformers for Online Audio Tagging H Dinkel, Z Yan, Y Wang, J Zhang, Y Wang arXiv preprint arXiv:2305.17834, 2023 | 1 | 2023 |
A Contrastive Semi-Supervised Learning Framework For Anomaly Sound Detection. X Cai, H Dinkel, Z Yan, Y Wang, J Zhang, Z Wu, Y Wang DCASE, 31-34, 2021 | 1 | 2021 |
Understanding temporally weakly supervised training: A case study for keyword spotting H Dinkel, W Zhuang, Z Yan, Y Wang, J Zhang, Y Wang arXiv preprint arXiv:2305.18794, 2023 | | 2023 |