Most Cited ICCV "temporal predicate prediction" Papers

2,701 papers found • Page 14 of 14

#2601

Stereo Any Video: Temporally Consistent Stereo Matching

Junpeng Jing, Weixun Luo, Ye Mao et al.

ICCV 2025 • highlight • arXiv:2503.05549
#2602

χ: Symmetry Understanding of 3D Shapes via Chirality Disentanglement

Weikang Wang, Tobias Weißberg, Nafie El Amrani et al.

ICCV 2025 • poster
#2603

Dirichlet-Constrained Variational Codebook Learning for Temporally Coherent Video Face Restoration

Baoyou Chen, Ce Liu, Weihao Yuan et al.

ICCV 2025 • highlight • arXiv:2506.13355
#2604

VideoAuteur: Towards Long Narrative Video Generation

Junfei Xiao, Feng Cheng, Lu Qi et al.

ICCV 2025 • poster • arXiv:2501.06173
#2605

ViT-Split: Unleashing the Power of Vision Foundation Models via Efficient Splitting Heads

Yifan Li, Xin Li, Tianqin Li et al.

ICCV 2025 • poster • arXiv:2506.03433
#2606

Robust and Efficient 3D Gaussian Splatting for Urban Scene Reconstruction

Zhensheng Yuan, Haozhi Huang, Zhen Xiong et al.

ICCV 2025 • poster • arXiv:2507.23006
#2607

Neural Architecture Search Driven by Locally Guided Diffusion for Personalized Federated Learning

Peng Liao, Xilu Wang, Yaochu Jin et al.

ICCV 2025 • poster
#2608

Hierarchical 3D Scene Graphs Construction Outdoors

Jon Nyffeler, Federico Tombari, Daniel Barath

ICCV 2025 • poster
#2609

Cycle-Consistent Learning for Joint Layout-to-Image Generation and Object Detection

Xinhao Cai, Qiuxia Lai, Gensheng Pei et al.

ICCV 2025 • poster
#2610

Bridging Local Inductive Bias and Long-Range Dependencies with Pixel-Mamba for End-to-end Whole Slide Image Analysis

Zhongwei Qiu, Hanqing Chao, Tiancheng Lin et al.

ICCV 2025 • poster
#2611

Neuroverse3D: Developing In-Context Learning Universal Model for Neuroimaging in 3D

Jiesi Hu, Hanyang Peng, Yanwu Yang et al.

ICCV 2025 • poster • arXiv:2503.02410
#2612

Taming Flow Matching with Unbalanced Optimal Transport into Fast Pansharpening

Zihan Cao, Yu Zhong, Liang-Jian Deng

ICCV 2025 • poster • arXiv:2503.14975
#2613

CarGait: Cross-Attention based Re-ranking for Gait recognition

Gavriel Habib, Noa Barzilay, Or Shimshi et al.

ICCV 2025 • poster • arXiv:2503.03501
#2614

Incremental Few-Shot Semantic Segmentation via Multi-Level Switchable Visual Prompts

Maoxian Wan, Kaige Li, Qichuan Geng et al.

ICCV 2025 • poster
#2615

ZeroKey: Point-Level Reasoning and Zero-Shot 3D Keypoint Detection from Large Language Models

Bingchen Gong, Diego Gomez, Abdullah Hamdi et al.

ICCV 2025 • poster • arXiv:2412.06292
#2616

StyleSRN: Scene Text Image Super-Resolution with Text Style Embedding

Shengrong Yuan, Runmin Wang, Ke Hao et al.

ICCV 2025 • poster
#2617

Frequency-Guided Diffusion for Training-Free Text-Driven Image Translation

Zheng Gao, Jifei Song, Zhensong Zhang et al.

ICCV 2025 • poster
#2618

Frequency-Semantic Enhanced Variational Autoencoder for Zero-Shot Skeleton-based Action Recognition

Wenhan Wu, Zhishuai Guo, Chen Chen et al.

ICCV 2025 • poster • arXiv:2506.22179
#2619

How Do Multimodal Large Language Models Handle Complex Multimodal Reasoning? Placing Them in An Extensible Escape Game

Ziyue Wang, Yurui Dong, Fuwen Luo et al.

ICCV 2025 • poster
#2620

Towards Human-like Virtual Beings: Simulating Human Behavior in 3D Scenes

Chen Liang, Wenguan Wang, Yi Yang

ICCV 2025 • poster
#2621

Cross-Category Subjectivity Generalization for Style-Adaptive Sketch Re-ID

Zechao Hu, Zhengwei Yang, Hao Li et al.

ICCV 2025 • poster
#2622

S3R-GS: Streamlining the Pipeline for Large-Scale Street Scene Reconstruction

Guangting Zheng, Jiajun Deng, Xiaomeng Chu et al.

ICCV 2025 • poster • arXiv:2503.08217
#2623

The Source Image is the Best Attention for Infrared and Visible Image Fusion

Song Wang, Xie Han, Liqun Kuang et al.

ICCV 2025 • poster
#2624

Learnable Fractional Reaction-Diffusion Dynamics for Under-Display ToF Imaging and Beyond

Xin Qiao, Matteo Poggi, Xing Wei et al.

ICCV 2025 • poster • arXiv:2511.01704
#2625

Video2BEV: Transforming Drone Videos to BEVs for Video-based Geo-localization

Hao Ju, Shaofei Huang, Si Liu et al.

ICCV 2025 • poster • arXiv:2411.13610
#2626

CLIPer: Hierarchically Improving Spatial Representation of CLIP for Open-Vocabulary Semantic Segmentation

Lin Sun, Jiale Cao, Jin Xie et al.

ICCV 2025 • poster • arXiv:2411.13836
#2627

Discretized Gaussian Representation for Tomographic Reconstruction

Shaokai Wu, Yuxiang Lu, Yapan Guo et al.

ICCV 2025 • poster • arXiv:2411.04844
#2628

Wave-MambaAD: Wavelet-driven State Space Model for Multi-class Unsupervised Anomaly Detection

Qiao Zhang, Mingwen Shao, Xinyuan Chen et al.

ICCV 2025 • poster
#2629

3D Test-time Adaptation via Graph Spectral Driven Point Shift

Xin Wei, Qin Yang, Yijie Fang et al.

ICCV 2025 • poster • arXiv:2507.18225
#2630

Task-Decoupled Bézier Surface Constraint for Uneven Low-Light Image Enhancement

Xingxiang Zhou, Xiangdong Su, Haoran Zhang et al.

ICCV 2025 • poster
#2631

EMoTive: Event-guided Trajectory Modeling for 3D Motion Estimation

Zengyu Wan, Wei Zhai, Yang Cao et al.

ICCV 2025 • poster • arXiv:2503.11371
#2632

Scendi Score: Prompt-Aware Diversity Evaluation via Schur Complement of CLIP Embeddings

Azim Ospanov, Mohammad Jalali, Farzan Farnia

ICCV 2025 • highlight • arXiv:2412.18645
#2633

Scaling Laws for Native Multimodal Models

Mustafa Shukor, Enrico Fini, Victor Guilherme Turrisi da Costa et al.

ICCV 2025 • poster • arXiv:2504.07951
#2634

Unlearning the Noisy Correspondence Makes CLIP More Robust

Haochen Han, Alex Jinpeng Wang, Peijun Ye et al.

ICCV 2025 • poster • arXiv:2507.03434
#2635

KDA: Knowledge Diffusion Alignment with Enhanced Context for Video Temporal Grounding

Ran Ran, Jiwei Wei, Shiyuan He et al.

ICCV 2025 • poster
#2636

VisNumBench: Evaluating Number Sense of Multimodal Large Language Models

Tengjin Weng, Jingyi Wang, Wenhao Jiang et al.

ICCV 2025 • poster • arXiv:2503.14939
#2637

STEP-DETR: Advancing DETR-based Semi-Supervised Object Detection with Super Teacher and Pseudo-Label Guided Text Queries

Tahira Shehzadi, Khurram Azeem Hashmi, Shalini Sarode et al.

ICCV 2025 • poster
#2638

Text-to-Any-Skeleton Motion Generation Without Retargeting

Qingyuan Liu, Ke Lv, Kun Dong et al.

ICCV 2025 • poster
#2639

Completing 3D Partial Assemblies with View-Consistent 2D-3D Correspondence

Weihao Wang, Yu Lan, Mingyu You et al.

ICCV 2025 • poster
#2640

Aligning Global Semantics and Local Textures in Generative Video Enhancement

Zhikai Chen, Fuchen Long, Zhaofan Qiu et al.

ICCV 2025 • poster
#2641

Simulating Dual-Pixel Images From Ray Tracing For Depth Estimation

Fengchen He, Dayang Zhao, Hao Xu et al.

ICCV 2025 • poster • arXiv:2503.11213
#2642

Robust 3D-Masked Part-level Editing in 3D Gaussian Splatting with Regularized Score Distillation Sampling

Hayeon Kim, Ji Ha Jang, Se Young Chun

ICCV 2025 • poster • arXiv:2507.11061
#2643

VoxelKP: A Voxel-based Network Architecture for Human Keypoint Estimation in LiDAR Data

Jian Shi, Peter Wonka

ICCV 2025 • poster • arXiv:2312.08871
#2644

Structure Matters: Revisiting Boundary Refinement in Video Object Segmentation

Guanyi Qin, Ziyue Wang, Daiyun Shen et al.

ICCV 2025 • highlight • arXiv:2507.18944
#2645

A View-consistent Sampling Method for Regularized Training of Neural Radiance Fields

Aoxiang Fan, Corentin Dumery, Nicolas Talabot et al.

ICCV 2025 • poster • arXiv:2507.04408
#2646

Autoregressive Denoising Score Matching is a Good Video Anomaly Detector

Hanwen Zhang, Congqi Cao, Qinyi Lv et al.

ICCV 2025 • poster • arXiv:2506.23282
#2647

AIM: Amending Inherent Interpretability via Self-Supervised Masking

Eyad Alshami, Shashank Agnihotri, Bernt Schiele et al.

ICCV 2025 • highlight • arXiv:2508.11502
#2648

One Last Attention for Your Vision-Language Model

Liang Chen, Ghazi Shazan Ahmad, Tianjun Yao et al.

ICCV 2025 • poster • arXiv:2507.15480
#2649

IFAdapter: Instance Feature Control for Grounded Text-to-Image Generation

Yinwei Wu, Xianpan Zhou, Bing Ma et al.

ICCV 2025 • poster • arXiv:2409.08240
#2650

A Constrained Optimization Approach for Gaussian Splatting from Coarsely-posed Images and Noisy Lidar Point Clouds

Jizong Peng, Tze Ho Elden Tse, Kai Xu et al.

ICCV 2025 • highlight • arXiv:2504.09129
#2651

Conditional Visual Autoregressive Modeling for Pathological Image Restoration

Ziyi Liu, Zhe Xu, Jiabo Ma et al.

ICCV 2025 • poster
#2652

Towards Scalable Spatial Intelligence via 2D-to-3D Data Lifting

Xingyu Miao, Haoran Duan, Quanhao Qian et al.

ICCV 2025 • highlight • arXiv:2507.18678
#2653

EYE3: Turn Anything into Naked-eye 3D

Yingde Song, Zongyuan Yang, Baolin Liu et al.

ICCV 2025 • poster
#2654

C2MIL: Synchronizing Semantic and Topological Causalities in Multiple Instance Learning for Robust and Interpretable Survival Analysis

Min Cen, Zhenfeng Zhuang, Yuzhe Zhang et al.

ICCV 2025 • poster
#2655

SEGA: A Stepwise Evolution Paradigm for Content-Aware Layout Generation with Design Prior

Bo Zhao, Haoran Wang, Jinghui Wang et al.

ICCV 2025 • highlight • arXiv:2510.15749
#2656

RobustSplat: Decoupling Densification and Dynamics for Transient-Free 3DGS

Chuanyu Fu, Yuqi Zhang, Kunbin Yao et al.

ICCV 2025 • poster • arXiv:2506.02751
#2657

High-Resolution Spatiotemporal Modeling with Global-Local State Space Models for Video-Based Human Pose Estimation

Runyang Feng, Hyung Jin Chang, Tze Ho Elden Tse et al.

ICCV 2025 • poster • arXiv:2510.11017
#2658

Pi-GPS: Enhancing Geometry Problem Solving by Unleashing the Power of Diagrammatic Information

Junbo Zhao, Ting Zhang, Jiayu Sun et al.

ICCV 2025 • poster • arXiv:2503.05543
#2659

TryOn-Refiner: Conditional Rectified-flow-based TryOn Refiner for More Accurate Detail Reconstruction

Wen Qian

ICCV 2025 • poster
#2660

Mitigating Catastrophic Overfitting in Fast Adversarial Training via Label Information Elimination

Chao Pan, Ke Tang, Li Qing et al.

ICCV 2025 • poster
#2661

Consistency Trajectory Matching for One-Step Generative Super-Resolution

Weiyi You, Mingyang Zhang, Leheng Zhang et al.

ICCV 2025 • poster • arXiv:2503.20349
#2662

Scoring, Remember, and Reference: Catching Camouflaged Objects in Videos

Yuang Feng, Shuyong Gao, Fuzhen Yan et al.

ICCV 2025 • poster • arXiv:2503.17050
#2663

Amodal Depth Anything: Amodal Depth Estimation in the Wild

Zhenyu Li, Mykola Lavreniuk, Jian Shi et al.

ICCV 2025 • poster • arXiv:2412.02336
#2664

One Perturbation is Enough: On Generating Universal Adversarial Perturbations against Vision-Language Pre-training Models

Hao Fang, Jiawei Kong, Wenbo Yu et al.

ICCV 2025 • poster • arXiv:2406.05491
#2665

CVPT: Cross Visual Prompt Tuning

Lingyun Huang, Jianxu Mao, Junfei Yi et al.

ICCV 2025 • poster • arXiv:2408.14961
#2666

Recognizing Actions from Robotic View for Natural Human-Robot Interaction

Ziyi Wang, Peiming Li, Hong Liu et al.

ICCV 2025 • poster • arXiv:2507.22522
#2667

Addressing Text Embedding Leakage in Diffusion-based Image Editing

Sunung Mun, Jinhwan Nam, Sunghyun Cho et al.

ICCV 2025 • poster • arXiv:2412.04715
#2668

Robin3D: Improving 3D Large Language Model via Robust Instruction Tuning

Weitai Kang, Haifeng Huang, Yuzhang Shang et al.

ICCV 2025 • poster • arXiv:2410.00255
#2669

DDB: Diffusion Driven Balancing to Address Spurious Correlations

Aryan Yazdan Parast, Basim Azam, Naveed Akhtar

ICCV 2025 • poster • arXiv:2503.17226
#2670

Geometric Alignment and Prior Modulation for View-Guided Point Cloud Completion on Unseen Categories

Jingqiao Xiu, Yicong Li, Na Zhao et al.

ICCV 2025 • poster
#2671

FRET: Feature Redundancy Elimination for Test Time Adaptation

Linjing You, Jiabao Lu, Xiayuan Huang et al.

ICCV 2025 • poster • arXiv:2505.10641
#2672

Motion-2-to-3: Leveraging 2D Motion Data for 3D Motion Generations

Ruoxi Guo, Huaijin Pi, Zehong Shen et al.

ICCV 2025 • poster
#2673

AnyBimanual: Transferring Unimanual Policy for General Bimanual Manipulation

Guanxing Lu, Tengbo Yu, Haoyuan Deng et al.

ICCV 2025 • poster • arXiv:2412.06779
#2674

A₀: An Affordance-Aware Hierarchical Model for General Robotic Manipulation

Rongtao Xu, Jian Zhang, Minghao Guo et al.

ICCV 2025 • poster • arXiv:2504.12636
#2675

FVGen: Accelerating Novel-View Synthesis with Adversarial Video Diffusion Distillation

Wenbin Teng, Gonglin Chen, Haiwei Chen et al.

ICCV 2025 • poster • arXiv:2508.06392
#2676

PVMamba: Parallelizing Vision Mamba via Dynamic State Aggregation

Fei Xie, Zhongdao Wang, Weijia Zhang et al.

ICCV 2025 • poster
#2677

Controllable and Expressive One-Shot Video Head Swapping

Chaonan Ji, Jinwei Qi, Peng Zhang et al.

ICCV 2025 • poster • arXiv:2506.16852
#2678

CoralSRT: Revisiting Coral Reef Semantic Segmentation by Feature Rectifying via Self-supervised Guidance

Zheng Ziqiang, Wong Kwan, Binh-Son Hua et al.

ICCV 2025 • poster
#2679

Learning Dense Feature Matching via Lifting Single 2D Image to 3D Space

Yingping Liang, Yutao Hu, Wenqi Shao et al.

ICCV 2025 • poster • arXiv:2507.00392
#2680

Diagnosing Pretrained Models for Out-of-distribution Detection

Haipeng Xiong, Kai Xu, Angela Yao

ICCV 2025 • poster
#2681

CHORDS: Diffusion Sampling Accelerator with Multi-core Hierarchical ODE Solvers

Jiaqi Han, Haotian Ye, Puheng Li et al.

ICCV 2025 • poster • arXiv:2507.15260
#2682

RayGaussX: Accelerating Gaussian-Based Ray Marching for Real-Time and High-Quality Novel View Synthesis

Hugo Blanc, Jean-Emmanuel Deschaud, Alexis Paljic

ICCV 2025 • poster • arXiv:2509.07782
#2683

Adversarial Training for Probabilistic Robustness

Yi Zhang, Yuhang Chen, Zhen Chen et al.

ICCV 2025 • poster
#2684

Learning to See Inside Opaque Liquid Containers using Speckle Vibrometry

Matan Kichler, Shai Bagon, Mark Sheinin

ICCV 2025 • poster • arXiv:2507.20757
#2685

Scaling Omni-modal Pretraining with Multimodal Context: Advancing Universal Representation Learning Across Modalities

Yiyuan Zhang, Handong Li, Jing Liu et al.

ICCV 2025 • poster
#2686

LightBSR: Towards Lightweight Blind Super-Resolution via Discriminative Implicit Degradation Representation Learning

Jiang Yuan, Ji Ma, Bo Wang et al.

ICCV 2025 • poster • arXiv:2506.22710
#2687

When Pixel Difference Patterns Meet ViT: PiDiViT for Few-Shot Object Detection

Hongliang Zhou, Yongxiang Liu, Canyu Mo et al.

ICCV 2025 • poster
#2688

What Makes for Text to 360-degree Panorama Generation with Stable Diffusion?

Jinhong Ni, Chang-Bin Zhang, Qiang Zhang et al.

ICCV 2025 • poster • arXiv:2505.22129
#2689

Learning Normals of Noisy Points by Local Gradient-Aware Surface Filtering

Qing Li, Huifang Feng, Xun Gong et al.

ICCV 2025 • poster • arXiv:2507.03394
#2690

Bayesian-Inspired Space-Time Superpixels

Kent Gauen, Stanley Chan

ICCV 2025 • poster
#2691

Kaleidoscopic Background Attack: Disrupting Pose Estimation with Multi-Fold Radial Symmetry Textures

Xinlong Ding, Hongwei Yu, Jiawei Li et al.

ICCV 2025 • highlight • arXiv:2507.10265
#2692

INSTINCT: Instance-Level Interaction Architecture for Query-Based Collaborative Perception

Yunjiang Xu, Yupeng Ouyang, Lingzhi Li et al.

ICCV 2025 • poster • arXiv:2509.23700
#2693

SPD: Shallow Backdoor Protecting Deep Backdoor Against Backdoor Detection

Shunjie Yuan, Xinghua Li, Xuelin Cao et al.

ICCV 2025 • poster
#2694

Rethinking DPO-style Diffusion Aligning Frameworks

Xun Wu, Shaohan Huang, Lingjie Jiang et al.

ICCV 2025 • highlight
#2695

Debiased Curriculum Adaptation for Safe Transfer Learning in Chest X-ray Classification

Mingyang Liu, Xinyang Chen, Yang Shu et al.

ICCV 2025 • poster
#2696

PHATNet: A Physics-guided Haze Transfer Network for Domain-adaptive Real-world Image Dehazing

Fu-Jen Tsai, Yan-Tsung Peng, Yen-Yu Lin et al.

ICCV 2025 • poster • arXiv:2507.14826
#2697

End-to-End Entity-Predicate Association Reasoning for Dynamic Scene Graph Generation

LiWei Wang, YanDuo Zhang, Tao Lu et al.

ICCV 2025 • poster
#2698

Ensemble Foreground Management for Unsupervised Object Discovery

Ziling Wu, Armaghan Moemeni, Praminda Caleb-Solly

ICCV 2025 • highlight • arXiv:2507.20860
#2699

Forensic-MoE: Exploring Comprehensive Synthetic Image Detection Traces with Mixture of Experts

Mingqi Fang, Ziguang Li, Lingyun Yu et al.

ICCV 2025 • poster
#2700

Information-Bottleneck Driven Binary Neural Network for Change Detection

Kaijie Yin, Zhiyuan Zhang, Shu Kong et al.

ICCV 2025 • poster • arXiv:2507.03504
#2701

Entropy-Adaptive Diffusion Policy Optimization with Dynamic Step Alignment

Renye Yan, Jikang Cheng, Yaozhong Gan et al.

ICCV 2025 • poster