Follow
Hao Feng
Hao Feng
Verified email at mail.ustc.edu.cn - Homepage
Title
Cited by
Cited by
Year
DocTr: Document Image Transformer for Geometric Unwarping and Illumination Correction
H Feng, Y Wang, W Zhou, J Deng, H Li
ACM International Conference on Multimedia (ACM MM), 2021, 2021
462021
Geometric Representation Learning for Document Image Rectification
H Feng, W Zhou, J Deng, Y Wang, H Li
European Conference on Computer Vision (ECCV), 2022, 475-492, 2022
212022
DocScanner: Robust Document Image Rectification with Progressive Learning
H Feng, W Zhou, J Deng, Q Tian, H Li
arXiv preprint arXiv:2110.14968, 2021
162021
UniDoc: A Universal Large Multimodal Model for Simultaneous Text Detection, Recognition, Spotting and Understanding
H Feng, Z Wang, J Tang, J Lu, W Zhou, H Li, C Huang
arXiv preprint arXiv:2308.11592, 2023
142023
Deep Unrestricted Document Image Rectification
H Feng, S Liu, J Deng, W Zhou, H Li
IEEE Transactions on Multimedia (TMM), 2023, 2023
102023
DocPedia: Unleashing the Power of Large Multimodal Model in the Frequency Domain for Versatile Document Understanding
H Feng, Q Liu, H Liu, W Zhou, H Li, C Huang
arXiv preprint arXiv:2311.11810, 2023
82023
Recurrent Generic Contour-based Instance Segmentation with Progressive Learning
H Feng, K Zhou, W Zhou, Y Yin, J Deng, Q Sun, H Li
IEEE Transactions on Circuits and Systems for Video Technology (TCSVT), 2024, 2023
7*2023
Towards Improving Document Understanding: An Exploration on Text-Grounding via MLLMs
Y Wang, W Zhou, H Feng, K Zhou, H Li
arXiv preprint arXiv:2311.13194, 2023
62023
Sign Language Translation with Iterative Prototype
H Yao, W Zhou, H Feng, H Hu, H Zhou, H Li
International Conference on Computer Vision (ICCV), 2023, 2023
32023
SimFIR: A Simple Framework for Fisheye Image Rectification with Self-supervised Representation Learning
H Feng, W Wang, J Deng, W Zhou, L Li, H Li
International Conference on Computer Vision (ICCV), 2023, 2023
32023
DocMAE: Document Image Rectification via Self-supervised Representation Learning
S Liu, H Feng, W Zhou, H Li, C Liu, F Wu
International Conference on Multimedia and Expo (ICME), 2023, 2023
22023
Progressive Recurrent Network for Shadow Removal
Y Wang, W Zhou, H Feng, L Li, H Li
Computer Vision and Image Understanding (CVIU), 103861, 2023
12023
Model-aware Pre-training for Radial Distortion Rectification
W Wang, H Feng, W Zhou, Z Liao, H Li
IEEE Transactions on Image Processing (TIP), 2023
12023
TextSquare: Scaling up Text-Centric Visual Instruction Tuning
J Tang, C Lin, Z Zhao, S Wei, B Wu, Q Liu, H Feng, Y Li, S Wang, L Liao, ...
arXiv preprint arXiv:2404.12803, 2024
2024
Progressive Multi-modal Conditional Prompt Tuning
X Qiu, H Feng, Y Wang, W Zhou, H Li
arXiv preprint arXiv:2404.11864, 2024
2024
TextCoT: Zoom In for Enhanced Multimodal Text-Rich Image Understanding
B Luan, H Feng, H Chen, Y Wang, W Zhou, H Li
arXiv preprint arXiv:2404.09797, 2024
2024
DeepEraser: Deep Iterative Context Mining for Generic Text Eraser
H Feng, W Wang, S Liu, J Deng, W Zhou, H Li
arXiv preprint arXiv:2402.19108, 2024
2024
Rethinking Supervision in Document Unwarping: A Self-consistent Flow-free Approach
S Liu, H Feng, W Zhou
IEEE Transactions on Circuits and Systems for Video Technology (TCSVT), 2023, 2023
2023
PolyTracker: Progressive Contour Regression for Multiple Object Tracking and Segmentation
S Shen, H Feng, W Zhou, H Li
Chinese Conference on Pattern Recognition and Computer Vision (PRCV), 2022 …, 2022
2022
The system can't perform the operation now. Try again later.
Articles 1–19