AdaptFormer: Adapting vision transformers for scalable visual recognition S Chen*, C Ge*, Z Tong, J Wang, Y Song, J Wang, P Luo Advances in Neural Information Processing Systems 35, 16664-16678, 2022 | 284 | 2022 |
CycleMLP: a MLP-like architecture for dense visual predictions S Chen, E Xie, C Ge, R Chen, D Liang, P Luo IEEE Transactions on Pattern Analysis and Machine Intelligence, 2023 | 242 | 2023 |
Not all patches are what you need: Expediting vision transformers via token reorganizations Y Liang, C Ge, Z Tong, Y Song, J Wang, P Xie arXiv preprint arXiv:2202.07800, 2022 | 211 | 2022 |
AMOS: A large-scale abdominal multi-organ benchmark for versatile medical image segmentation Y Ji, H Bai, C Ge, J Yang, Y Zhu, R Zhang, Z Li, L Zhang, W Ma, X Wan, ... Advances in Neural Information Processing Systems 35, 36722-36732, 2022 | 163 | 2022 |
Parser-free virtual try-on via distilling appearance flows Y Ge, Y Song, R Zhang, C Ge, W Liu, P Luo Proceedings of the IEEE/CVF conference on computer vision and pattern …, 2021 | 156 | 2021 |
Disentangled cycle consistency for highly-realistic virtual try-on C Ge, Y Song, Y Ge, H Yang, W Liu, P Luo Proceedings of the IEEE/CVF conference on computer vision and pattern …, 2021 | 93 | 2021 |
Watch only once: An end-to-end video action detection framework S Chen, P Sun, E Xie, C Ge, J Wu, L Ma, J Shen, P Luo Proceedings of the IEEE/CVF International Conference on Computer Vision …, 2021 | 51 | 2021 |
PixArt-α: Fast Training of Diffusion Transformer for Photorealistic Text-to-Image Synthesis J Chen*, J Yu*, C Ge*, L Yao*, E Xie, Y Wu, Z Wang, J Kwok, P Luo, H Lu, ... arXiv preprint arXiv:2310.00426, 2023 | 50 | 2023 |
Revitalizing CNN attention via transformers in self-supervised visual representation learning C Ge, Y Liang, Y Song, J Jiao, J Wang, P Luo Advances in Neural Information Processing Systems 34, 4193-4206, 2021 | 32 | 2021 |
MetaBEV: Solving sensor failures for BEV detection and map segmentation C Ge*, J Chen*, E Xie, Z Wang, L Hong, H Lu, Z Li, P Luo arXiv preprint arXiv:2304.09801, 2023 | 24* | 2023 |
DeepAccident: A motion and accident prediction benchmark for V2X autonomous driving T Wang, S Kim, J Wenxuan, E Xie, C Ge, J Chen, Z Li, P Luo Proceedings of the AAAI Conference on Artificial Intelligence 38 (6), 5599-5606, 2024 | 21 | 2024 |
Soft neighbors are positive supporters in contrastive visual representation learning C Ge, J Wang, Z Tong, S Chen, Y Song, P Luo arXiv preprint arXiv:2303.17142, 2023 | 20* | 2023 |
A torsional thrust balance with asymmetrical configuration for microthruster performance evaluation Y Wang, C Ge, L Cheng, W Ding, J Geng Review of Scientific Instruments 90 (7), 2019 | 5 | 2019 |
PixArt-Σ: Weak-to-Strong Training of Diffusion Transformer for 4K Text-to-Image Generation J Chen*, C Ge*, E Xie*, Y Wu*, L Yao, X Ren, Z Wang, P Luo, H Lu, Z Li arXiv preprint arXiv:2403.04692, 2024 | 4 | 2024 |
Advancing vision transformers with group-mix attention C Ge, X Ding, Z Tong, L Yuan, J Wang, Y Song, P Luo arXiv preprint arXiv:2311.15157, 2023 | 4 | 2023 |
Large Language Models as Automated Aligners for benchmarking Vision-Language Models Y Ji*, C Ge*, W Kong, E Xie, Z Liu, Z Li, P Luo arXiv preprint arXiv:2311.14580, 2023 | 3 | 2023 |
Rethinking attentive object detection via neural attention learning C Ge, Y Song, C Ma, Y Qi, P Luo IEEE Transactions on Image Processing, 2023 | 3 | 2023 |
InstructDET: Diversifying referring object detection with generalized instructions R Dang, J Feng, H Zhang, C Ge, L Song, L Gong, C Liu, Q Chen, F Zhu, ... arXiv preprint arXiv:2310.05136, 2023 | 2 | 2023 |
RoboCodeX: Multimodal Code Generation for Robotic Behavior Synthesis Y Mu, J Chen, Q Zhang, S Chen, Q Yu, C Ge, R Chen, Z Liang, M Hu, ... arXiv preprint arXiv:2402.16117, 2024 | | 2024 |
Speed Co-Augmentation for Unsupervised Audio-Visual Pre-training J Wang, J Jiao, Y Song, S James, Z Tong, C Ge, P Abbeel, YH Liu arXiv preprint arXiv:2309.13942, 2023 | | 2023 |