Densely Connected Time Delay Neural Network for Speaker Verification YQ Yu, WJ Li INTERSPEECH, 921-925, 2020 | 81 | 2020 |
Ensemble Additive Margin Softmax for Speaker Verification YQ Yu, L Fan, WJ Li ICASSP 2019 - 2019 IEEE International Conference on Acoustics, Speech and …, 2019 | 75 | 2019 |
CAM: Context-Aware Masking for Robust Speaker Verification YQ Yu, S Zheng, H Suo, Y Lei, WJ Li ICASSP 2021 - 2021 IEEE International Conference on Acoustics, Speech and …, 2021 | 25 | 2021 |
Deep Hashing for Speaker Identification and Retrieval L Fan, QY Jiang, YQ Yu, WJ Li INTERSPEECH, 2908-2912, 2019 | 24 | 2019 |
TextHawk: Exploring Efficient Fine-Grained Perception of Multimodal Large Language Models YQ Yu, M Liao, J Wu, Y Liao, X Zheng, W Zeng arXiv preprint arXiv:2404.09204, 2024 | 23 | 2024 |
TextHawk2: A Large Vision-Language Model Excels in Bilingual OCR and Grounding with 16x Fewer Tokens YQ Yu, M Liao, J Zhang, J Wu arXiv preprint arXiv:2410.05261, 2024 | 11 | 2024 |
UI-Hawk: Unleashing the Screen Stream Understanding for GUI Agents J Zhang, Y Yu, M Liao, W Li, J Wu, Z Wei Preprints, manuscript/202408.2137, 2024 | 6 | 2024 |
Cross-Lingual Text-Rich Visual Comprehension: An Information Theory Perspective X Yu, X Feng, Y Li, M Liao, YQ Yu, X Feng, W Zhong, R Chen, M Hu, J Wu, ... arXiv preprint arXiv:2412.17787, 2024 | | 2024 |