SUPERB: Speech processing Universal PERformance Benchmark S Yang, PH Chi, YS Chuang, CIJ Lai, K Lakhotia, YY Lin, AT Liu, J Shi, ... arXiv preprint arXiv:2105.01051, 2021 | 105 | 2021 |
Angular Softmax for Short-Duration Text-independent Speaker Verification. Z Huang, S Wang, K Yu Interspeech, 3623-3627, 2018 | 86 | 2018 |
Speaker diarization with region proposal network Z Huang, S Watanabe, Y Fujita, P García, Y Shao, D Povey, S Khudanpur ICASSP 2020-2020 IEEE International Conference on Acoustics, Speech and …, 2020 | 40 | 2020 |
Integration of speech separation, diarization, and recognition for multi-speaker meetings: System description, comparison, and analysis D Raj, P Denisov, Z Chen, H Erdogan, Z Huang, M He, S Watanabe, J Du, ... 2021 IEEE Spoken Language Technology Workshop (SLT), 897-904, 2021 | 30 | 2021 |
DOVER-Lap: A method for combining overlap-aware diarization outputs D Raj, LP Garcia-Perera, Z Huang, S Watanabe, D Povey, A Stolcke, ... 2021 IEEE Spoken Language Technology Workshop (SLT), 881-888, 2021 | 27 | 2021 |
Recover missing sensor data with iterative imputing network J Zhou, Z Huang Workshops at the Thirty-Second AAAI Conference on Artificial Intelligence, 2018 | 27 | 2018 |
Discriminative neural embedding learning for short-duration text-independent speaker verification S Wang, Z Huang, Y Qian, K Yu IEEE/ACM Transactions on Audio, Speech, and Language Processing 27 (11 …, 2019 | 19 | 2019 |
Joint i-vector with end-to-end system for short duration text-independent speaker verification Z Huang, S Wang, Y Qian 2018 IEEE International Conference on Acoustics, Speech and Signal …, 2018 | 16 | 2018 |
The Hitachi-JHU DIHARD III system: Competitive end-to-end neural diarization and x-vector clustering systems combined by DOVER-Lap S Horiguchi, N Yalta, P Garcia, Y Takashima, Y Xue, D Raj, Z Huang, ... arXiv preprint arXiv:2102.01363, 2021 | 15 | 2021 |
Target-speaker Voice Activity Detection with Improved I-Vector Estimation for Unknown Number of Speaker M He, D Raj, Z Huang, J Du, Z Chen, S Watanabe arXiv preprint arXiv:2108.03342, 2021 | 9 | 2021 |
Multi-class spectral clustering with overlaps for speaker diarization D Raj, Z Huang, S Khudanpur 2021 IEEE Spoken Language Technology Workshop (SLT), 582-589, 2021 | 9 | 2021 |
JHU Diarization System Description. Z Huang, LP García-Perera, J Villalba, D Povey, N Dehak IberSPEECH, 236-239, 2018 | 5 | 2018 |
SUPERB-SG: Enhanced Speech processing Universal PERformance Benchmark for Semantic and Generative Capabilities HS Tsai, HJ Chang, WC Huang, Z Huang, K Lakhotia, S Yang, S Dong, ... arXiv preprint arXiv:2203.06849, 2022 | 3 | 2022 |
Investigating self-supervised learning for speech enhancement and separation Z Huang, S Watanabe, S Yang, P García, S Khudanpur ICASSP 2022-2022 IEEE International Conference on Acoustics, Speech and …, 2022 | 2 | 2022 |
Joint speaker diarization and speech recognition based on region proposal networks Z Huang, M Delcroix, LP Garcia, S Watanabe, D Raj, S Khudanpur Computer Speech & Language 72, 101316, 2022 | | 2022 |