Publications
Vision & Intelligence Group 论文发表
2025
-
Prompting Vision-Language Model for Nuclei Instance Segmentation and Classification 📄 PDF
IEEE TMI -
DGTR: Distributed Gaussian Turbo-Reconstruction for Sparse-View Vast Scenes 📄 PDF
ICRA -
VDG: Vision-Only Dynamic Gaussian for Driving Simulation 📄 PDF
RA-L -
Unsupervised Pre-training with Language-Vision Prompts for Low-Data Instance Segmentation 📄 PDF
TPAMI -
GGRt: Towards Pose-free Generalizable 3D Gaussian Splatting in Real-time 📄 PDF
ECCV -
CityGS-X: A Scalable Architecture for Efficient and Geometrically Accurate Large-Scale Scene Reconstruction 📄 PDF
ICCV -
LangScene-X: Reconstruct Generalizable 3D Language-Embedded Scenes with TriMap Video Diffusion 📄 PDF
ICCV -
STRIDER: Navigation via Instruction-Aligned Structural Decision Space Optimization 📄 PDF
NeurIPS -
Mamba Capsule Routing Towards Part-Whole Relational Camouflaged Object Detection 📄 PDF
IJCV -
Semantic-Aligned Learning with Collaborative Refinement for Unsupervised VI-ReID 📄 PDF
IJCV -
Discriminative Co-Saliency and Background Mining Transformer for Co-Salient Object Detection 📄 PDF
TPAMI -
Frequency-Aware B-Line and Pleural Line Analysis in Lung Ultrasound Videos 📄 PDF
J-BHI
2024
-
A Memory-based Robust region feature synthesizer for zero-shot object detection 📄 PDF
IJCV -
Position-based anchor optimization for point supervised dense nuclei detection 📄 PDF
NN -
Continual All-in-One Adverse Weather Removal with Knowledge Replay on a Unified Network Structure 📄 PDF
arXiv -
Contextual Dependency Vision Transformer for spectrogram-based multivariate time series analysis 📄 PDF
Neurocomputing -
LTGC: Long-Tail Recognition via leveraging Generated Content 📄 PDF
CVPR -
Uncertainty Modeling for Gaze Estimation 📄 PDF
TIP -
Task-aware Orthogonal Sparse Network for Exploring Shared Knowledge in Continual Learning 📄 PDF
ICML -
PolSAM: Polarimetric Scattering Mechanism Informed Segment Anything Model 📄 PDF
arXiv -
CoSurfGS: Collaborative 3D Surface Gaussian Splatting with Distributed Learning for Large Scene Reconstruction 📄 PDF
IJCV
2023
2022
-
Integrating part-object relationship and contrast for camouflaged object detection 📄 PDF
TIFS -
Cross-Modality High-Frequency Transformer for MR Image Super-Resolution 📄 PDF
ACM MM -
Structured Attention Composition for Temporal Action Localization 📄 PDF
T-IP -
Colar: Effective and efficient online action detection by consulting exemplars 📄 PDF
CVPR -
Onfocus detection: identifying individual-camera eye contact from unconstrained images 📄 PDF
SCIS -
Learning Self-supervised Low-Rank Network for Single-Stage Weakly and Semi-supervised Semantic Segmentation 📄 PDF
IJCV -
Incremental Cross-View Mutual Distillation for Self-Supervised Medical CT Synthesis 📄 PDF
CVPR -
Exploring rich intermediate representations for reconstructing 3D shapes from 2D images 📄 PDF
PR -
Robust Region Feature Synthesizer for Zero-Shot Object Detection 📄 PDF
CVPR
2021
-
Htd: Heterogeneous task decoupling for two-stage object detection 📄 PDF
TIP -
HUMA21: 2nd International Workshop on Human-centric Multimedia Analysis 📄 PDF
MM -
CLRNet: Component-Level Refinement Network for Deep Face Parsing 📄 PDF
TNNLS -
Scribble-Supervised Video Object Segmentation 📄 PDF
JAS -
Strengthen Learning Tolerance for Weakly Supervised Object Localization 📄 PDF
CVPR -
Equivalent classification mapping for weakly supervised temporal action localization 📄 PDF
T-PAMI
2020
2018
-
Segmentation in weakly labeled videos via a semantic ranking and optical warping network 📄 PDF
T-IP -
Salient Object Detection via Integrity Learning 📄 PDF
T-PAMI -
Revisiting anchor mechanisms for temporal action localization 📄 PDF
T-IP -
Reinforcement cutting-agent learning for video object segmentation 📄 PDF
CVPR -
Background-click supervision for temporal action localization 📄 PDF
T-PAMI -
PoseFlow: A Deep Motion Representation for Understanding Human Behaviors in Videos 📄 PDF
CVPR