Most Cited 2025 "numerical reconstruction" Papers

22,274 papers found • Page 96 of 112

Filters:Most Cited 2025 numerical reconstruction Clear all

Conference

AAAI 2025 (3,028)COLM 2025 (418)CVPR 2025 (2,873)ICCV 2025 (2,701)ICLR 2025 (3,827)ICML 2025 (3,340)ISMAR 2025 (229)NEURIPS 2025 (5,858)AAAI 2024 (2,289)CVPR 2024 (2,716)ECCV 2024 (2,387)ICLR 2024 (2,297)ICML 2024 (2,635)

Paper Type

poster (24,624)paper (8,558)oral (1,594)spotlight (1,421)highlight (975)

#19001

ConText: Driving In-context Learning for Text Removal and Segmentation

Fei Zhang, Pei Zhang, Baosong Yang et al.

ICML 2025posterarXiv:2506.03799

#19002

Reaction Graph: Towards Reaction-Level Modeling for Chemical Reactions with 3D Structures

Yingzhao Jian, Yue Zhang, Ying Wei et al.

ICML 2025poster

#19003

Advancing Personalized Learning with Neural Collapse for Long-Tail Challenge

Hanglei Hu, Yingying Guo, Zhikang Chen et al.

ICML 2025poster

#19004

Learning the Electronic Hamiltonian of Large Atomic Structures

Chen Hao Xia, Manasa Kaniselvan, Alexandros Nikolaos Ziogas et al.

ICML 2025posterarXiv:2501.19110

#19005

Diffusion Counterfactual Generation with Semantic Abduction

Rajat Rasal, Avinash Kori, Fabio De Sousa Ribeiro et al.

ICML 2025posterarXiv:2506.07883

#19006

When Dynamic Data Selection Meets Data Augmentation: Achieving Enhanced Training Acceleration

Suorong Yang, Peng Ye, Furao Shen et al.

ICML 2025poster

#19007

Non-Stationary Predictions May Be More Informative: Exploring Pseudo-Labels with a Two-Phase Pattern of Training Dynamics

Hongbin Pei, Jingxin Hai, Yu Li et al.

ICML 2025oral

#19008

Beyond Induction Heads: In-Context Meta Learning Induces Multi-Phase Circuit Emergence

Gouki Minegishi, Hiroki Furuta, Shohei Taniguchi et al.

ICML 2025posterarXiv:2505.16694

#19009

Weakly-Supervised Contrastive Learning for Imprecise Class Labels

Zi-Hao Zhou, Jun-Jie Wang, Tong Wei et al.

ICML 2025spotlightarXiv:2505.22028

#19010

Maintaining Proportional Committees with Dynamic Candidate Sets

Chris Dong, Jannik Peters

ICML 2025poster

#19011

Solving Satisfiability Modulo Counting Exactly with Probabilistic Circuits

Jinzhao Li, Nan Jiang, Yexiang Xue

ICML 2025posterarXiv:2503.01009

#19012

Exact Upper and Lower Bounds for the Output Distribution of Neural Networks with Random Inputs

Andrey Kofnov, Daniel Kapla, Ezio Bartocci et al.

ICML 2025posterarXiv:2502.11672

#19013

Reward Translation via Reward Machine in Semi-Alignable MDPs

Yun Hua, Haosheng Chen, Wenhao Li et al.

ICML 2025poster

#19014

TUMTraf VideoQA: Dataset and Benchmark for Unified Spatio-Temporal Video Understanding in Traffic Scenes

Xingcheng Zhou, Konstantinos Larintzakis, Hao Guo et al.

ICML 2025oral

#19015

Lexico: Extreme KV Cache Compression via Sparse Coding over Universal Dictionaries

Junhyuck Kim, Jongho Park, Jaewoong Cho et al.

ICML 2025posterarXiv:2412.08890

#19016

Catching Two Birds with One Stone: Reward Shaping with Dual Random Networks for Balancing Exploration and Exploitation

Haozhe Ma, Fangling Li, Jing Lim et al.

ICML 2025poster

#19017

Refined generalization analysis of the Deep Ritz Method and Physics-Informed Neural Networks

Xianliang Xu, Ye Li, Zhongyi Huang

ICML 2025posterarXiv:2401.12526

#19018

On the Out-of-Distribution Generalization of Self-Supervised Learning

Wenwen Qiang, Jingyao Wang, Zeen Song et al.

ICML 2025posterarXiv:2505.16675

#19019

Leveraging Diffusion Model as Pseudo-Anomalous Graph Generator for Graph-Level Anomaly Detection

Jinyu Cai, Yunhe Zhang, Fusheng Liu et al.

ICML 2025spotlight

#19020

Neural Collapse Beyond the Unconstrained Features Model: Landscape, Dynamics, and Generalization in the Mean-Field Regime

Diyuan Wu, Marco Mondelli

ICML 2025spotlightarXiv:2501.19104

#19021

Stealing That Free Lunch: Exposing the Limits of Dyna-Style Reinforcement Learning

Brett Barkley, David Fridovich-Keil

ICML 2025posterarXiv:2412.14312

#19022

AtlasD: Automatic Local Symmetry Discovery

Manu Bhat, Jonghyun Park, Jianke Yang et al.

ICML 2025posterarXiv:2504.10777

#19023

Programming Every Example: Lifting Pre-training Data Quality Like Experts at Scale

Fan Zhou, Zengzhi Wang, Qian Liu et al.

ICML 2025posterarXiv:2409.17115

#19024

The Devil Is in the Details: Tackling Unimodal Spurious Correlations for Generalizable Multimodal Reward Models

Zichao Li, Xueru Wen, Jie Lou et al.

ICML 2025posterarXiv:2503.03122

#19025

Automated Red Teaming with GOAT: the Generative Offensive Agent Tester

Maya Pavlova, Erik Brinkman, Krithika Iyer et al.

ICML 2025posterarXiv:2410.01606

#19026

Language Models May Verbatim Complete Text They Were Not Explicitly Trained On

Ken Ziyu Liu, Christopher A. Choquette Choo, Matthew Jagielski et al.

ICML 2025spotlightarXiv:2503.17514

#19027

InfoSEM: A Deep Generative Model with Informative Priors for Gene Regulatory Network Inference

Tianyu Cui, Song-Jun Xu, Artem Moskalev et al.

ICML 2025posterarXiv:2503.04483

#19028

Polynomial-Time Approximability of Constrained Reinforcement Learning

Jeremy McMahan

ICML 2025posterarXiv:2502.07764

#19029

Benchmarking Abstract and Reasoning Abilities Through A Theoretical Perspective

Qingchuan Ma, Yuhang Wu, Xiawu Zheng et al.

ICML 2025posterarXiv:2505.23833

#19030

No Soundness in the Real World: On the Challenges of the Verification of Deployed Neural Networks

Attila Szász, Balázs Bánhelyi, Mark Jelasity

ICML 2025spotlightarXiv:2506.01054

#19031

ReVISE: Learning to Refine at Test-Time via Intrinsic Self-Verification

Hyunseok Lee, Seunghyuk Oh, Jaehyung Kim et al.

ICML 2025posterarXiv:2502.14565

#19032

Large Language Models are Demonstration Pre-Selectors for Themselves

Jiarui Jin, Yuwei Wu, Haoxuan Li et al.

ICML 2025posterarXiv:2506.06033

#19033

WAVE: Weighted Autoregressive Varying Gate for Time Series Forecasting

Jiecheng Lu, Xu Han, Yan Sun et al.

ICML 2025oralarXiv:2410.03159

#19034

Large Language-Geometry Model: When LLM meets Equivariance

Zongzhao Li, Jiacheng Cen, Bing Su et al.

ICML 2025posterarXiv:2502.11149

#19035

Closed-Loop Long-Horizon Robotic Planning via Equilibrium Sequence Modeling

Jinghan Li, Zhicheng Sun, Yadong Mu

ICML 2025posterarXiv:2410.01440

#19036

Open Materials Generation with Stochastic Interpolants

Philipp Höllmer, Thomas Egg, Maya Martirossyan et al.

ICML 2025poster

#19037

AGAV-Rater: Adapting Large Multimodal Model for AI-Generated Audio-Visual Quality Assessment

Yuqin Cao, Xiongkuo Min, Yixuan Gao et al.

ICML 2025posterarXiv:2501.18314

#19038

GuidedQuant: Large Language Model Quantization via Exploiting End Loss Guidance

Jinuk Kim, Marwa El Halabi, Wonpyo Park et al.

ICML 2025posterarXiv:2505.07004

#19039

Attributes Shape the Embedding Space of Face Recognition Models

Pierrick Leroy, Antonio Mastropietro, Marco Nurisso et al.

ICML 2025posterarXiv:2507.11372

#19040

Collapse or Thrive: Perils and Promises of Synthetic Data in a Self-Generating World

Joshua Kazdan, Rylan Schaeffer, Apratim Dey et al.

ICML 2025posterarXiv:2410.16713

#19041

Linear Transformers as VAR Models: Aligning Autoregressive Attention Mechanisms with Autoregressive Forecasting

Jiecheng Lu, Shihao Yang

ICML 2025posterarXiv:2502.07244

#19042

One Example Shown, Many Concepts Known! Counterexample-Driven Conceptual Reasoning in Mathematical LLMs

Yinghui Li, Jiayi Kuang, Haojing Huang et al.

ICML 2025posterarXiv:2502.10454

#19043

Probing Visual Language Priors in VLMs

Tiange Luo, Ang Cao, Gunhee Lee et al.

ICML 2025posterarXiv:2501.00569

#19044

Control and Realism: Best of Both Worlds in Layout-to-Image without Training

Bonan Li, Yinhan Hu, Songhua Liu et al.

ICML 2025posterarXiv:2506.15563

#19045

Sketch to Adapt: Fine-Tunable Sketches for Efficient LLM Adaptation

Tianyi Zhang, Junda Su, Aditya Desai et al.

ICML 2025posterarXiv:2410.06364

#19046

Tracking Most Significant Shifts in Infinite-Armed Bandits

Joe Suk, Jung-hun Kim

ICML 2025posterarXiv:2502.00108

#19047

When to Forget? Complexity Trade-offs in Machine Unlearning

Martin Van Waerebeke, Marco Lorenzi, Giovanni Neglia et al.

ICML 2025posterarXiv:2502.17323

#19048

Direct Density Ratio Optimization: A Statistically Consistent Approach to Aligning Large Language Models

Rei Higuchi, Taiji Suzuki

ICML 2025posterarXiv:2505.07558

#19049

M2PDE: Compositional Generative Multiphysics and Multi-component PDE Simulation

Tao Zhang, Zhenhai Liu, Feipeng Qi et al.

ICML 2025posterarXiv:2412.04134

#19050

Spherical Rotation Dimension Reduction with Geometric Loss Functions

Hengrui Luo, Jeremy E. Purvis, Didong Li

ICML 2025posterarXiv:2204.10975

#19051

Improving the Scaling Laws of Synthetic Data with Deliberate Practice

Reyhane Askari Hemmat, Mohammad Pezeshki, Elvis Dohmatob et al.

ICML 2025oralarXiv:2502.15588

#19052

Widening the Network Mitigates the Impact of Data Heterogeneity on FedAvg

Like Jian, Dong Liu

ICML 2025posterarXiv:2508.12576

#19053

Federated Disentangled Tuning with Textual Prior Decoupling and Visual Dynamic Adaptation

Yihao Yang, Wenke Huang, Guancheng Wan et al.

ICML 2025poster

#19054

Understanding High-Dimensional Bayesian Optimization

Leonard Papenmeier, Matthias Poloczek, Luigi Nardi

ICML 2025posterarXiv:2502.09198

#19055

Learning Configurations for Data-Driven Multi-Objective Optimization

Zhiyang Chen, Hailong Yao, Xia Yin

ICML 2025poster

#19056

End-to-End Learning Framework for Solving Non-Markovian Optimal Control

Xiaole Zhang, Peiyu Zhang, Xiongye Xiao et al.

ICML 2025posterarXiv:2502.04649

#19057

SANA 1.5: Efficient Scaling of Training-Time and Inference-Time Compute in Linear Diffusion Transformer

Enze Xie, Junsong Chen, Yuyang Zhao et al.

ICML 2025posterarXiv:2501.18427

#19058

Going Deeper into Locally Differentially Private Graph Neural Networks

Longzhu He, Chaozhuo Li, Peng Tang et al.

ICML 2025oral

#19059

Federated Node-Level Clustering Network with Cross-Subgraph Link Mending

Jingxin Liu, Renda Han, Wenxuan Tu et al.

ICML 2025poster

#19060

Causal Effect Identification in lvLiNGAM from Higher-Order Cumulants

Daniele Tramontano, Yaroslav Kivva, Saber Salehkaleybar et al.

ICML 2025posterarXiv:2506.05202

#19061

HuMoCon: Concept Discovery for Human Motion Understanding

Qihang Fang, Chengcheng Tang, Bugra Tekin et al.

CVPR 2025posterarXiv:2505.20920

#19062

Reloc3r: Large-Scale Training of Relative Camera Pose Regression for Generalizable, Fast, and Accurate Visual Localization

Siyan Dong, Shuzhe Wang, Shaohui Liu et al.

CVPR 2025posterarXiv:2412.08376

#19063

Bridge Frame and Event: Common Spatiotemporal Fusion for High-Dynamic Scene Optical Flow

Hanyu Zhou, Haonan Wang, Haoyue Liu et al.

CVPR 2025posterarXiv:2503.06992

#19064

StoryGPT-V: Large Language Models as Consistent Story Visualizers

Xiaoqian Shen, Mohamed Elhoseiny

CVPR 2025posterarXiv:2312.02252

#19065

Invisible Backdoor Attack against Self-supervised Learning

Hanrong Zhang, Zhenting Wang, Boheng Li et al.

CVPR 2025posterarXiv:2405.14672

#19066

S^3-Face: SSS-Compliant Facial Reflectance Estimation via Diffusion Priors

Xingyu Ren, Jiankang Deng, Yuhao Cheng et al.

CVPR 2025poster

#19067

SeedVR: Seeding Infinity in Diffusion Transformer Towards Generic Video Restoration

Jianyi Wang, Zhijie Lin, Meng Wei et al.

CVPR 2025highlightarXiv:2501.01320

#19068

RSAR: Restricted State Angle Resolver and Rotated SAR Benchmark

Xin Zhang, Xue Yang, Yuxuan Li et al.

CVPR 2025posterarXiv:2501.04440

#19069

Diffusion Model is Effectively Its Own Teacher

Xinyin Ma, Runpeng Yu, Songhua Liu et al.

CVPR 2025poster

#19070

Towards Visual Discrimination and Reasoning of Real-World Physical Dynamics: Physics-Grounded Anomaly Detection

wenqiao Li, Yao Gu, Xintao Chen et al.

CVPR 2025posterarXiv:2503.03562

#19071

Flow-NeRF: Joint Learning of Geometry, Poses, and Dense Flow within Unified Neural Representations

Xunzhi Zheng, Dan Xu

CVPR 2025posterarXiv:2503.10464

#19072

LiVOS: Light Video Object Segmentation with Gated Linear Matching

Qin Liu, Jianfeng Wang, Zhengyuan Yang et al.

CVPR 2025posterarXiv:2411.02818

#19073

Dynamic Content Prediction with Motion-aware Priors for Blind Face Video Restoration

Lianxin Xie, csbingbing zheng, Si Wu et al.

CVPR 2025poster

#19074

BADGR: Bundle Adjustment Diffusion Conditioned by Gradients for Wide-Baseline Floor Plan Reconstruction

Yuguang Li, Ivaylo Boyadzhiev, Zixuan Liu et al.

CVPR 2025highlightarXiv:2503.19340

#19075

Towards More General Video-based Deepfake Detection through Facial Component Guided Adaptation for Foundation Model

Yue-Hua Han, Tai-Ming Huang, Kailung Hua et al.

CVPR 2025posterarXiv:2404.05583

#19076

WF-VAE: Enhancing Video VAE by Wavelet-Driven Energy Flow for Latent Video Diffusion Model

Zongjian Li, Bin Lin, Yang Ye et al.

CVPR 2025posterarXiv:2411.17459

#19077

MIMO: Controllable Character Video Synthesis with Spatial Decomposed Modeling

Yifang Men, Yuan Yao, Miaomiao Cui et al.

CVPR 2025posterarXiv:2409.16160

#19078

Leveraging Perturbation Robustness to Enhance Out-of-Distribution Detection

Wenxi Chen, Raymond A. Yeh, Shaoshuai Mou et al.

CVPR 2025posterarXiv:2503.18784

#19079

Noise-Consistent Siamese-Diffusion for Medical Image Synthesis and Segmentation

Kunpeng Qiu, Zhiqiang Gao, Zhiying Zhou et al.

CVPR 2025posterarXiv:2505.06068

#19080

Parametric Point Cloud Completion for Polygonal Surface Reconstruction

Zhaiyu Chen, Yuqing Wang, Liangliang Nan et al.

CVPR 2025posterarXiv:2503.08363

#19081

RoadSocial: A Diverse VideoQA Dataset and Benchmark for Road Event Understanding from Social Video Narratives

Chirag Parikh, Deepti Rawat, Rakshitha R. T. et al.

CVPR 2025posterarXiv:2503.21459

#19082

AIM-Fair: Advancing Algorithmic Fairness via Selectively Fine-Tuning Biased Models with Contextual Synthetic Data

Zengqun Zhao, Ziquan Liu, Yu Cao et al.

CVPR 2025posterarXiv:2503.05665

#19083

TAET: Two-Stage Adversarial Equalization Training on Long-Tailed Distributions

Wang Yu-Hang, Junkang Guo, Aolei Liu et al.

CVPR 2025poster

#19084

LiMoE: Mixture of LiDAR Representation Learners from Automotive Scenes

Xiang Xu, Lingdong Kong, hui shuai et al.

CVPR 2025posterarXiv:2501.04004

#19085

Interpreting Object-level Foundation Models via Visual Precision Search

Ruoyu Chen, Siyuan Liang, Jingzhi Li et al.

CVPR 2025highlightarXiv:2411.16198

#19086

Descriptor-In-Pixel : Point-Feature Tracking For Pixel Processor Arrays

Laurie Bose, Piotr Dudek, Jianing Chen

CVPR 2025poster

#19087

Consistent Normal Orientation for 3D Point Clouds via Least Squares on Delaunay Graph

Rao Fu, Jianmin Zheng, Liang Yu

CVPR 2025poster

#19088

AdaCM^2: On Understanding Extremely Long-Term Video with Adaptive Cross-Modality Memory Reduction

Yuanbin Man, Ying Huang, Chengming Zhang et al.

CVPR 2025highlightarXiv:2411.12593

#19089

Movie Weaver: Tuning-Free Multi-Concept Video Personalization with Anchored Prompts

Feng Liang, Haoyu Ma, Zecheng He et al.

CVPR 2025posterarXiv:2502.07802

#19090

Exploring Timeline Control for Facial Motion Generation

Yifeng Ma, Jinwei Qi, Chaonan Ji et al.

CVPR 2025posterarXiv:2505.20861

#19091

IRGS: Inter-Reflective Gaussian Splatting with 2D Gaussian Ray Tracing

Chun Gu, Xiaofei Wei, Zixuan Zeng et al.

CVPR 2025posterarXiv:2412.15867

#19092

OSLoPrompt: Bridging Low-Supervision Challenges and Open-Set Domain Generalization in CLIP

Mohamad Hassan N C, Divyam Gupta, Mainak Singha et al.

CVPR 2025posterarXiv:2503.16106

#19093

EchoTraffic: Enhancing Traffic Anomaly Understanding with Audio-Visual Insights

Zhenghao Xing, Hao Chen, Binzhu Xie et al.

CVPR 2025poster

#19094

Learning Temporally Consistent Video Depth from Video Diffusion Priors

Jiahao Shao, Yuanbo Yang, Hongyu Zhou et al.

CVPR 2025posterarXiv:2406.01493

#19095

Yo’Chameleon: Personalized Vision and Language Generation

Thao Nguyen, Krishna Kumar Singh, Jing Shi et al.

CVPR 2025poster

#19096

PersonaBooth: Personalized Text-to-Motion Generation

Boeun Kim, Hea In Jeong, JungHoon Sung et al.

CVPR 2025posterarXiv:2503.07390

#19097

Benchmarking Object Detectors under Real-World Distribution Shifts in Satellite Imagery

Sara Al-Emadi, Yin Yang, Ferda Ofli

CVPR 2025posterarXiv:2503.19202

#19098

Electromyography-Informed Facial Expression Reconstruction for Physiological-Based Synthesis and Analysis

Tim Büchner, Christoph Anders, Orlando Guntinas-Lichius et al.

CVPR 2025highlightarXiv:2503.09556

#19099

InsTaG: Learning Personalized 3D Talking Head from Few-Second Video

Jiahe Li, Jiawei Zhang, Xiao Bai et al.

CVPR 2025posterarXiv:2502.20387

#19100

Unseen Visual Anomaly Generation

HAN SUN, Yunkang Cao, Hao Dong et al.

CVPR 2025posterarXiv:2406.01078

#19101

Towards Zero-Shot Anomaly Detection and Reasoning with Multimodal Large Language Models

Jiacong Xu, Shao-Yuan Lo, Bardia Safaei et al.

CVPR 2025highlightarXiv:2502.07601

#19102

SLAM3R: Real-Time Dense Scene Reconstruction from Monocular RGB Videos

Yuzheng Liu, Siyan Dong, Shuzhe Wang et al.

CVPR 2025highlightarXiv:2412.09401

#19103

EchoMatch: Partial-to-Partial Shape Matching via Correspondence Reflection

Yizheng Xie, Viktoria Ehm, Paul Roetzer et al.

CVPR 2025poster

#19104

JanusFlow: Harmonizing Autoregression and Rectified Flow for Unified Multimodal Understanding and Generation

Yiyang Ma, Xingchao Liu, Xiaokang Chen et al.

CVPR 2025posterarXiv:2411.07975

#19105

PatchDEMUX: A Certifiably Robust Framework for Multi-label Classifiers Against Adversarial Patches

Dennis Jacob, Chong Xiang, Prateek Mittal

CVPR 2025posterarXiv:2505.24703

#19106

CALICO: Part-Focused Semantic Co-Segmentation with Large Vision-Language Models

Kiet A. Nguyen, Adheesh Juvekar, Tianjiao Yu et al.

CVPR 2025posterarXiv:2412.19331

#19107

Repurposing Pre-trained Video Diffusion Models for Event-based Video Interpolation

Jingxi Chen, Brandon Y. Feng, Haoming Cai et al.

CVPR 2025posterarXiv:2412.07761

#19108

VideoTree: Adaptive Tree-based Video Representation for LLM Reasoning on Long Videos

Ziyang Wang, Shoubin Yu, Elias Stengel-Eskin et al.

CVPR 2025posterarXiv:2405.19209

#19109

MM-OR: A Large Multimodal Operating Room Dataset for Semantic Understanding of High-Intensity Surgical Environments

Ege Özsoy, Chantal Pellegrini, Tobias Czempiel et al.

CVPR 2025posterarXiv:2503.02579

#19110

VideoSPatS: Video SPatiotemporal Splines for Disentangled Occlusion, Appearance and Motion Modeling and Editing

Juan Luis Gonzalez Bello, Xu Yao, Alex Whelan et al.

CVPR 2025posterarXiv:2504.07146

#19111

Masked Scene Modeling: Narrowing the Gap Between Supervised and Self-Supervised Learning in 3D Scene Understanding

Pedro Hermosilla, Christian Stippel, Leon Sick

CVPR 2025posterarXiv:2504.06719

#19112

Prior Does Matter: Visual Navigation via Denoising Diffusion Bridge Models

Hao Ren, Yiming Zeng, Zetong Bi et al.

CVPR 2025posterarXiv:2504.10041

#19113

LoRA Recycle: Unlocking Tuning-Free Few-Shot Adaptability in Visual Foundation Models by Recycling Pre-Tuned LoRAs

Zixuan Hu, Yongxian Wei, Li Shen et al.

CVPR 2025poster

#19114

TensoFlow: Tensorial Flow-based Sampler for Inverse Rendering

Chun Gu, Xiaofei Wei, Li Zhang et al.

CVPR 2025posterarXiv:2503.18328

#19115

STAR-Edge: Structure-aware Local Spherical Curve Representation for Thin-walled Edge Extraction from Unstructured Point Clouds

Zikuan Li, Honghua Chen, Yuecheng Wang et al.

CVPR 2025posterarXiv:2503.00801

#19116

ZoomLDM: Latent Diffusion Model for Multi-scale Image Generation

Srikar Yellapragada, Alexandros Graikos, Kostas Triaridis et al.

CVPR 2025posterarXiv:2411.16969

#19117

RainyGS: Efficient Rain Synthesis with Physically-Based Gaussian Splatting

Qiyu Dai, Xingyu Ni, Qianfan Shen et al.

CVPR 2025posterarXiv:2503.21442

#19118

Design2GarmentCode: Turning Design Concepts to Tangible Garments Through Program Synthesis

Feng Zhou, Ruiyang Liu, chen liu et al.

CVPR 2025posterarXiv:2412.08603

#19119

Efficient Dynamic Scene Editing via 4D Gaussian-based Static-Dynamic Separation

Joohyun Kwon, Hanbyel Cho, Junmo Kim

CVPR 2025posterarXiv:2502.02091

#19120

EventFly: Event Camera Perception from Ground to the Sky

Lingdong Kong, Dongyue Lu, Xiang Xu et al.

CVPR 2025posterarXiv:2503.19916

#19121

Exploiting Deblurring Networks for Radiance Fields

Haeyun Choi, Heemin Yang, Janghyeok Han et al.

CVPR 2025posterarXiv:2502.14454

#19122

ICE: Intrinsic Concept Extraction from a Single Image via Diffusion Models

Fernando Julio Cendra, Kai Han

CVPR 2025highlightarXiv:2503.19902

#19123

Can Generative Video Models Help Pose Estimation?

Ruojin Cai, Jason Y. Zhang, Philipp Henzler et al.

CVPR 2025highlightarXiv:2412.16155

#19124

MMRL: Multi-Modal Representation Learning for Vision-Language Models

Yuncheng Guo, Xiaodong Gu

CVPR 2025posterarXiv:2503.08497

#19125

VidTwin: Video VAE with Decoupled Structure and Dynamics

Yuchi Wang, Junliang Guo, Xinyi Xie et al.

CVPR 2025posterarXiv:2412.17726

#19126

Continuous, Subject-Specific Attribute Control in T2I Models by Identifying Semantic Directions

Stefan Andreas Baumann, Felix Krause, Michael Neumayr et al.

CVPR 2025posterarXiv:2403.17064

#19127

3DTopia-XL: Scaling High-quality 3D Asset Generation via Primitive Diffusion

Zhaoxi Chen, Jiaxiang Tang, Yuhao Dong et al.

CVPR 2025highlightarXiv:2409.12957

#19128

Unraveling Normal Anatomy via Fluid-Driven Anomaly Randomization

Peirong Liu, Ana Lawry Aguila, Juan Iglesias

CVPR 2025posterarXiv:2501.13370

#19129

VoCo-LLaMA: Towards Vision Compression with Large Language Models

Xubing Ye, Yukang Gan, Xiaoke Huang et al.

CVPR 2025posterarXiv:2406.12275

#19130

Period-LLM: Extending the Periodic Capability of Multimodal Large Language Model

Yuting Zhang, Hao Lu, Qingyong Hu et al.

CVPR 2025posterarXiv:2505.24476

#19131

Continuous Locomotive Crowd Behavior Generation

Inhwan Bae, Junoh Lee, Hae-Gon Jeon

CVPR 2025posterarXiv:2504.04756

#19132

A Unified Latent Schrödinger Bridge Diffusion Model for Unsupervised Anomaly Detection and Localization

Shilhora Akshay, Niveditha Lakshmi Narasimhan, Jacob George et al.

CVPR 2025poster

#19133

Token Cropr: Faster ViTs for Quite a Few Tasks

Benjamin Bergner, Christoph Lippert, Aravindh Mahendran

CVPR 2025posterarXiv:2412.00965

#19134

CacheQuant: Comprehensively Accelerated Diffusion Models

Xuewen Liu, Zhikai Li, Qingyi Gu

CVPR 2025posterarXiv:2503.01323

#19135

Text-guided Sparse Voxel Pruning for Efficient 3D Visual Grounding

Wenxuan Guo, Xiuwei Xu, Ziwei Wang et al.

CVPR 2025highlightarXiv:2502.10392

#19136

SimLTD: Simple Supervised and Semi-Supervised Long-Tailed Object Detection

Phi Vu Tran

CVPR 2025posterarXiv:2412.20047

#19137

What’s in the Image? A Deep-Dive into the Vision of Vision Language Models

Omri Kaduri, Shai Bagon, Tali Dekel

CVPR 2025posterarXiv:2411.17491

#19138

MV-MATH: Evaluating Multimodal Math Reasoning in Multi-Visual Contexts

Peijie Wang, Zhong-Zhi Li, Fei Yin et al.

CVPR 2025posterarXiv:2502.20808

#19139

MonoTAKD: Teaching Assistant Knowledge Distillation for Monocular 3D Object Detection

Hou-I Liu, Christine Wu, Jen-Hao Cheng et al.

CVPR 2025posterarXiv:2404.04910

#19140

APHQ-ViT: Post-Training Quantization with Average Perturbation Hessian Based Reconstruction for Vision Transformers

Zhuguanyu Wu, Jiayi Zhang, Jiaxin Chen et al.

CVPR 2025posterarXiv:2504.02508

#19141

RestorGS: Depth-aware Gaussian Splatting for Efficient 3D Scene Restoration

Yuanjian Qiao, Mingwen Shao, Lingzhuang Meng et al.

CVPR 2025poster

#19142

Identity-Preserving Text-to-Video Generation by Frequency Decomposition

Shenghai Yuan, Jinfa Huang, Xianyi He et al.

CVPR 2025highlightarXiv:2411.17440

#19143

Associative Transformer

Yuwei Sun, Hideya Ochiai, Zhirong Wu et al.

CVPR 2025posterarXiv:2309.12862

#19144

Blood Flow Speed Estimation with Optical Coherence Tomography Angiography Images

Wensheng Cheng, Zhenghong Li, Jiaxiang Ren et al.

CVPR 2025poster

#19145

World-consistent Video Diffusion with Explicit 3D Modeling

Qihang Zhang, Shuangfei Zhai, Miguel Ángel Bautista et al.

CVPR 2025highlightarXiv:2412.01821

#19146

DPFlow: Adaptive Optical Flow Estimation with a Dual-Pyramid Framework

Henrique Morimitsu, Xiaobin Zhu, Roberto M. Cesar Jr et al.

CVPR 2025posterarXiv:2503.14880

#19147

OSDFace: One-Step Diffusion Model for Face Restoration

Jingkai Wang, Jue Gong, Lin Zhang et al.

CVPR 2025posterarXiv:2411.17163

#19148

Free-viewpoint Human Animation with Pose-correlated Reference Selection

Fa-Ting Hong, Zhan Xu, Haiyang Liu et al.

CVPR 2025highlightarXiv:2412.17290

#19149

3D Gaussian Inpainting with Depth-Guided Cross-View Consistency

Sheng-Yu Huang, Zi-Ting Chou, Yu-Chiang Frank Wang

CVPR 2025posterarXiv:2502.11801

#19150

Nonisotropic Gaussian Diffusion for Realistic 3D Human Motion Prediction

Cecilia Curreli, Dominik Muhle, Abhishek Saroha et al.

CVPR 2025posterarXiv:2501.06035

#19151

Temporal Score Analysis for Understanding and Correcting Diffusion Artifacts

Yu Cao, Zengqun Zhao, Ioannis Patras et al.

CVPR 2025posterarXiv:2503.16218

#19152

Visual Representation Learning through Causal Intervention for Controllable Image Editing

Shanshan Huang, Haoxuan Li, Chunyuan Zheng et al.

CVPR 2025highlight

#19153

Three-view Focal Length Recovery From Homographies

Yaqing Ding, Viktor Kocur, Zuzana Berger Haladova et al.

CVPR 2025posterarXiv:2501.07499

#19154

ProAPO: Progressively Automatic Prompt Optimization for Visual Classification

Xiangyan Qu, Gaopeng Gou, Jiamin Zhuang et al.

CVPR 2025posterarXiv:2502.19844

#19155

ShapeWords: Guiding Text-to-Image Synthesis with 3D Shape-Aware Prompts

Dmitrii M Petrov, Pradyumn Goyal, Divyansh Shivashok et al.

CVPR 2025posterarXiv:2412.02912

#19156

EgoPressure: A Dataset for Hand Pressure and Pose Estimation in Egocentric Vision

Yiming Zhao, Taein Kwon, Paul Streli et al.

CVPR 2025highlightarXiv:2409.02224

#19157

SyncVP: Joint Diffusion for Synchronous Multi-Modal Video Prediction

Enrico Pallotta, Sina Mokhtarzadeh Azar, Shuai Li et al.

CVPR 2025posterarXiv:2503.18933

#19158

AuraFusion360: Augmented Unseen Region Alignment for Reference-based 360° Unbounded Scene Inpainting

Chung-Ho Wu, Yang-Jung Chen, Ying-Huan Chen et al.

CVPR 2025posterarXiv:2502.05176

#19159

Real-time Free-view Human Rendering from Sparse-view RGB Videos using Double Unprojected Textures

Guoxing Sun, Rishabh Dabral, Heming Zhu et al.

CVPR 2025highlightarXiv:2412.13183

#19160

Scene-agnostic Pose Regression for Visual Localization

Junwei Zheng, Ruiping Liu, Yufan Chen et al.

CVPR 2025posterarXiv:2503.19543

#19161

Zero-Shot Image Restoration Using Few-Step Guidance of Consistency Models (and Beyond)

Tomer Garber, Tom Tirer

CVPR 2025posterarXiv:2412.20596

#19162

Localizing Events in Videos with Multimodal Queries

Gengyuan Zhang, Mang Ling Ada Fok, Jialu Ma et al.

CVPR 2025posterarXiv:2406.10079

#19163

HuPerFlow: A Comprehensive Benchmark for Human vs. Machine Motion Estimation Comparison

Yung-Hao Yang, Zitang Sun, Taiki Fukiage et al.

CVPR 2025highlight

#19164

Realistic Test-Time Adaptation of Vision-Language Models

Maxime Zanella, Clément Fuchs, Christophe De Vleeschouwer et al.

CVPR 2025highlightarXiv:2501.03729

#19165

GOAL: Global-local Object Alignment Learning

Hyungyu Choi, Young Kyun Jang, Chanho Eom

CVPR 2025posterarXiv:2503.17782

#19166

Magma: A Foundation Model for Multimodal AI Agents

Jianwei Yang, Reuben Tan, Qianhui Wu et al.

CVPR 2025posterarXiv:2502.13130

#19167

HOTFormerLoc: Hierarchical Octree Transformer for Versatile Lidar Place Recognition Across Ground and Aerial Views

Ethan Griffiths, Maryam Haghighat, Simon Denman et al.

CVPR 2025posterarXiv:2503.08140

#19168

Time of the Flight of the Gaussians: Optimizing Depth Indirectly in Dynamic Radiance Fields

Runfeng Li, Mikhail Okunev, Zixuan Guo et al.

CVPR 2025posterarXiv:2505.05356

#19169

Generative Photomontage

Sean J. Liu, Nupur Kumari, Ariel Shamir et al.

CVPR 2025posterarXiv:2408.07116

#19170

MambaVision: A Hybrid Mamba-Transformer Vision Backbone

Ali Hatamizadeh, Jan Kautz

CVPR 2025posterarXiv:2407.08083

#19171

MotiF: Making Text Count in Image Animation with Motion Focal Loss

Shijie Wang, Samaneh Azadi, Rohit Girdhar et al.

CVPR 2025posterarXiv:2412.16153

#19172

Learning Physics-Based Full-Body Human Reaching and Grasping from Brief Walking References

Yitang Li, Mingxian Lin, Zhuo Lin et al.

CVPR 2025posterarXiv:2503.07481

#19173

Prof. Robot: Differentiable Robot Rendering Without Static and Self-Collisions

Quanyuan Ruan, Jiabao Lei, Wenhao Yuan et al.

CVPR 2025posterarXiv:2503.11269

#19174

Attention IoU: Examining Biases in CelebA using Attention Maps

Aaron Serianni, Tyler Zhu, Olga Russakovsky et al.

CVPR 2025posterarXiv:2503.19846

#19175

Stochastic Human Motion Prediction with Memory of Action Transition and Action Characteristic

Jianwei Tang, Hong Yang, Tengyue Chen et al.

CVPR 2025posterarXiv:2507.04062

#19176

Feature Selection for Latent Factor Models

Rittwika Kansabanik, Adrian Barbu

CVPR 2025posterarXiv:2412.10128

#19177

Generative Multiview Relighting for 3D Reconstruction under Extreme Illumination Variation

Hadi Alzayer, Philipp Henzler, Jonathan T. Barron et al.

CVPR 2025highlightarXiv:2412.15211

#19178

LamRA: Large Multimodal Model as Your Advanced Retrieval Assistant

Yikun Liu, Yajie Zhang, jiayin cai et al.

CVPR 2025posterarXiv:2412.01720

#19179

DeepLA-Net: Very Deep Local Aggregation Networks for Point Cloud Analysis

Ziyin Zeng, Mingyue Dong, Jian Zhou et al.

CVPR 2025poster

#19180

ClimbingCap: Multi-Modal Dataset and Method for Rock Climbing in World Coordinate

Ming Yan, Xincheng Lin, Yuhua Luo et al.

CVPR 2025highlightarXiv:2503.21268

#19181

MVDoppler-Pose: Multi-Modal Multi-View mmWave Sensing for Long-Distance Self-Occluded Human Walking Pose Estimation

Jae-Ho Choi, Soheil Hor, Shubo Yang et al.

CVPR 2025poster

#19182

SoundVista: Novel-View Ambient Sound Synthesis via Visual-Acoustic Binding

Mingfei Chen, Israel D. Gebru, Ishwarya Ananthabhotla et al.

CVPR 2025highlightarXiv:2504.05576

#19183

Omni-ID: Holistic Identity Representation Designed for Generative Tasks

Guocheng Qian, Kuan-Chieh Wang, Or Patashnik et al.

CVPR 2025posterarXiv:2412.09694

#19184

Feature4X: Bridging Any Monocular Video to 4D Agentic AI with Versatile Gaussian Feature Fields

Shijie Zhou, Hui Ren, Yijia Weng et al.

CVPR 2025posterarXiv:2503.20776

#19185

Generative Inbetweening through Frame-wise Conditions-Driven Video Generation

Tianyi Zhu, Dongwei Ren, Qilong Wang et al.

CVPR 2025posterarXiv:2412.11755

#19186

Exploring Temporally-Aware Features for Point Tracking

Inès Hyeonsu Kim, Seokju Cho, Gabriel Huang et al.

CVPR 2025posterarXiv:2501.12218

#19187

Style-Editor: Text-driven Object-centric Style Editing

Jihun Park, Jongmin Gim, Kyoungmin Lee et al.

CVPR 2025highlightarXiv:2408.08461

#19188

Locally Orderless Images for Optimization in Differentiable Rendering

Ishit Mehta, Manmohan Chandraker, Ravi Ramamoorthi

CVPR 2025highlightarXiv:2503.21931

#19189

Efficient Event-Based Object Detection: A Hybrid Neural Network with Spatial and Temporal Attention

Soikat Hasan Ahmed, Jan Finkbeiner, Emre Neftci

CVPR 2025posterarXiv:2403.10173

#19190

A Dataset for Semantic Segmentation in the Presence of Unknowns

Zakaria Laskar, Tomas Vojir, Matej Grcic et al.

CVPR 2025posterarXiv:2503.22309

#19191

Light Transport-aware Diffusion Posterior Sampling for Single-View Reconstruction of 3D Volumes

Ludwic Leonard, Nils Thuerey, rüdiger westermann

CVPR 2025highlightarXiv:2501.05226

#19192

DFormerv2: Geometry Self-Attention for RGBD Semantic Segmentation

Bo-Wen Yin, Jiao-Long Cao, Ming-Ming Cheng et al.

CVPR 2025posterarXiv:2504.04701

#19193

Recover and Match: Open-Vocabulary Multi-Label Recognition through Knowledge-Constrained Optimal Transport

Hao Tan, Zichang Tan, Jun Li et al.

CVPR 2025posterarXiv:2503.15337

#19194

MotionBench: Benchmarking and Improving Fine-grained Video Motion Understanding for Vision Language Models

Wenyi Hong, Yean Cheng, Zhuoyi Yang et al.

CVPR 2025posterarXiv:2501.02955

#19195

Adaptive Parameter Selection for Tuning Vision-Language Models

Yi Zhang, Yi-Xuan Deng, Meng-Hao Guo et al.

CVPR 2025poster

#19196

TokenHSI: Unified Synthesis of Physical Human-Scene Interactions through Task Tokenization

Liang Pan, Zeshi Yang, Zhiyang Dou et al.

CVPR 2025posterarXiv:2503.19901

#19197

ImagineFSL: Self-Supervised Pretraining Matters on Imagined Base Set for VLM-based Few-shot Learning

Haoyuan Yang, Xiaoou Li, Jiaming Lv et al.

CVPR 2025highlight

#19198

DarkIR: Robust Low-Light Image Restoration

Daniel Feijoo, Juan C. Benito, Alvaro Garcia et al.

CVPR 2025posterarXiv:2412.13443

#19199

PVC: Progressive Visual Token Compression for Unified Image and Video Processing in Large Vision-Language Models

Chenyu Yang, Xuan Dong, Xizhou Zhu et al.

CVPR 2025posterarXiv:2412.09613

#19200

PUP 3D-GS: Principled Uncertainty Pruning for 3D Gaussian Splatting

Alex Hanson, Allen Tu, Vasu Singla et al.

CVPR 2025posterarXiv:2406.10219

← Previous

1...94 95 96 97 98...112