Most Cited ECCV "rf-vision sensing" Papers
2,387 papers found • Page 8 of 12
Conference
SpatialFormer: Towards Generalizable Vision Transformers with Explicit Spatial Understanding
Han Xiao, Wenzhao Zheng, Sicheng Zuo et al.
Online Video Quality Enhancement with Spatial-Temporal Look-up Tables
Zefan Qu, Xinyang Jiang, Yifan Yang et al.
Interactive 3D Object Detection with Prompts
Ruifei Zhang, Xiangru Lin, Wei Zhang et al.
ComFusion: Enhancing Personalized Generation by Instance-Scene Compositing and Fusion
Yan Hong, Yuxuan Duan, Bo Zhang et al.
MetaAT: Active Testing for Label-Efficient Evaluation of Dense Recognition Tasks
Sanbao Su, Xin Li, Thang Doan et al.
Deep Online Probability Aggregation Clustering
Yuxuan Yan, Na Lu, Ruofan Yan
Efficient Unsupervised Visual Representation Learning with Explicit Cluster Balancing
Ioannis Maniadis Metaxas, Georgios Tzimiropoulos, ioannis Patras
Effective Lymph Nodes Detection in CT Scans Using Location Debiased Query Selection and Contrastive Query Representation in Transformer
Qinji Yu, Yirui Wang, Ke Yan et al.
Learning Trimodal Relation for Audio-Visual Question Answering with Missing Modality
Kyu Ri Park, Hong Joo Lee, Jung Uk Kim
MaxMI: A Maximal Mutual Information Criterion for Manipulation Concept Discovery
Pei Zhou, Yanchao Yang
SIMBA: Split Inference - Mechanisms, Benchmarks and Attacks
Abhishek Singh, Vivek Sharma, Rohan Sukumaran et al.
LineFit: A Geometric Approach for Fitting Line Segments in Images
Marion BOYER, David Youssefi, Florent Lafarge
Multi-scale Cross Distillation for Object Detection in Aerial Images
Kun Wang, Zi Wang, Zhang Li et al.
PAV: Personalized Head Avatar from Unstructured Video Collection
Akin Caliskan, Berkay Kicanaoglu, H K
Local Occupancy-Enhanced Object Grasping with Multiple Triplanar Projection
Kangqi Ma, Hao Dong, Yadong Mu
FedVAD: Enhancing Federated Video Anomaly Detection with GPT-Driven Semantic Distillation
Fan Qi, Ruijie Pan, Huaiwen Zhang et al.
LEROjD: Lidar Extended Radar-Only Object Detection
Patrick Palmer, Martin Krüger, Stefan Schütte et al.
OP-Align: Object-level and Part-level Alignment for Self-supervised Category-level Articulated Object Pose Estimation
Yuchen Che, Ryo Furukawa, Asako Kanezaki
Decomposition of Neural Discrete Representations for Large-Scale 3D Mapping
Minseong Park, Suhan Woo, Euntai Kim
Learning-based Axial Video Motion Magnification
Kwon Byung-Ki, HYUNBIN OH, Kim Jun-Seong et al.
MTaDCS: Moving Trace and Feature Density-based Confidence Sample Selection under Label Noise
Qingzheng Huang, Xilin He, Xiaole Xian et al.
Synchronization of Projective Transformations
Rakshith Madhavan, Andrea Fusiello, Federica Arrigoni
Sync from the Sea: Retrieving Alignable Videos from Large-Scale Datasets
Ishan Rajendrakumar Dave, Fabian Caba, Shah Mubarak et al.
Synthesizing Time-varying BRDFs via Latent Space
Takuto Narumoto, Hiroaki Santo, Fumio Okura
VF-NeRF: Viewshed Fields for Rigid NeRF Registration
Leo Segre, Shai Avidan
Adapting to Shifting Correlations with Unlabeled Data Calibration
Minh Nguyen, Alan Q Wang, Heejong Kim et al.
Fine-grained Dynamic Network for Generic Event Boundary Detection
Ziwei Zheng, Lijun He, Le Yang et al.
Divide and Fuse: Body Part Mesh Recovery from Partially Visible Human Images
Tianyu Luan, Zhongpai Gao, Luyuan Xie et al.
Stable Preference: Redefining training paradigm of human preference model for Text-to-Image Synthesis
Hanting Li, Hongjing Niu, Feng Zhao
MagicMirror: Fast and High-Quality Avatar Generation with Constrained Search Space
Armand Comas Massague, Di Qiu, Menglei Chai et al.
Single-Photon 3D Imaging with Equi-Depth Photon Histograms
Kaustubh Sadekar, David Maier, Atul Ingle
Rethinking Video-Text Understanding: Retrieval from Counterfactually Augmented Data
Wufei Ma, Kai Li, Zhongshi Jiang et al.
Do Generalised Classifiers really work on Human Drawn Sketches?
Hmrishav Bandyopadhyay, Pinaki Nath Chowdhury, Aneeshan Sain et al.
Towards Model-Agnostic Dataset Condensation by Heterogeneous Models
Jun-Yeong Moon, Jung Uk Kim, Gyeong-Moon Park
Alignist: CAD-Informed Orientation Distribution Estimation by Fusing Shape and Correspondences
Shishir Reddy Vutukur, Junwen Huang, Rasmus Laurvig Haugaard et al.
ArtVLM: Attribute Recognition Through Vision-Based Prefix Language Modeling
William Zhu, Keren Ye, Junjie Ke et al.
LNL+K: Enhancing Learning with Noisy Labels Through Noise Source Knowledge Integration
Siqi Wang, Bryan Plummer
MaRINeR: Enhancing Novel Views by Matching Rendered Images with Nearby References
Lukas Bösiger, Mihai Dusmanu, Marc Pollefeys et al.
Multi-Task Domain Adaptation for Language Grounding with 3D Objects
Penglei SUN, Yaoxian Song, Xinglin Pan et al.
RoofDiffusion: Constructing Roofs from Severely Corrupted Point Data via Diffusion
Kyle Lo, Jorg Peters, Eric Spellman
Efficient Cascaded Multiscale Adaptive Network for Image Restoration
Yichen Zhou, Pan Zhou, Teck Khim Ng
Rotated Orthographic Projection for Self-Supervised 3D Human Pose Estimation
YAO YAO, Yixuan Pan, Wenjun Shi et al.
Learned Rate Control for Frame-Level Adaptive Neural Video Compression via Dynamic Neural Network
Chenhao Zhang, WEI GAO
Flowed Time of Flight Radiance Fields
Mikhail Okunev, Marc Mapeke, Benjamin Attal et al.
SNP: Structured Neuron-level Pruning to Preserve Attention Scores
Kyunghwan Shim, Jaewoong Yun, Shinkook Choi
Toward INT4 Fixed-Point Training via Exploring Quantization Error for Gradients
Dohyung Kim, Junghyup Lee, Jeimin Jeon et al.
FreeAugment: Data Augmentation Search Across All Degrees of Freedom
Tom Bekor, Niv Nayman, Lihi Zelnik-Manor
Characterizing Model Robustness via Natural Input Gradients
Adrian Rodriguez-Munoz, Tongzhou Wang, Antonio Torralba
Unleashing Text-to-Image Diffusion Prior for Zero-Shot Image Captioning
Jianjie Luo, Jingwen Chen, Yehao Li et al.
NGP-RT: Fusing Multi-Level Hash Features with Lightweight Attention for Real-Time Novel View Synthesis
Yubin Hu, Xiaoyang Guo, Yang Xiao et al.
Topology-Preserving Downsampling of Binary Images
Chia-Chia Chen, Chi-Han Peng
FroSSL: Frobenius Norm Minimization for Efficient Multiview Self-Supervised Learning
Oscar Skean, Aayush Dhakal, Nathan Jacobs et al.
CARFF: Conditional Auto-encoded Radiance Field for 3D Scene Forecasting
Jiezhi Yang, Khushi P Desai, Charles Packer et al.
A Rotation-invariant Texture ViT for Fine-Grained Recognition of Esophageal Cancer Endoscopic Ultrasound Images
Tianyi Liu, Shuaishuai S Zhuang, Jiacheng Nie et al.
POA: Pre-training Once for Models of All Sizes
Yingying Zhang, Xin Guo, Jiangwei Lao et al.
Confidence-Based Iterative Generation for Real-World Image Super-Resolution
Jialun Peng, Xin Luo, Jingjing Fu et al.
VP-SAM: Taming Segment Anything Model for Video Polyp Segmentation via Disentanglement and Spatio-temporal Side Network
Zhixue Fang, Yuzhi Liu, Huisi Wu et al.
Quantization-Friendly Winograd Transformations for Convolutional Neural Networks
Vladimir Protsenko, Vladimir Kryzhanovskiy, Alexander Filippov
Platypus: A Generalized Specialist Model for Reading Text in Various Forms
Peng Wang, Zhaohai Li, Jun Tang et al.
ViG-Bias: Visually Grounded Bias Discovery and Mitigation
Badr-Eddine Marani, Mohamed HANINI, Nihitha Malayarukil et al.
Frugal 3D Point Cloud Model Training via Progressive Near Point Filtering and Fused Aggregation
Donghyun Lee, Yejin Lee, Jae W. Lee et al.
Global-to-Pixel Regression for Human Mesh Recovery
Yabo Xiao, MINGSHU HE, Dongdong Yu
Out-of-Bounding-Box Triggers: A Stealthy Approach to Cheat Object Detectors
Tao Lin, lijia Yu, Gaojie Jin et al.
Formula-Supervised Visual-Geometric Pre-training
Ryosuke Yamada, Kensho Hara, Hirokatsu Kataoka et al.
Distributed Semantic Segmentation with Efficient Joint Source and Task Decoding
Danish Nazir, Timo Bartels, Jan Piewek et al.
Learning Where to Look: Self-supervised Viewpoint Selection for Active Localization using Geometrical Information
Luca Di Giammarino, Boyang Sun, Giorgio Grisetti et al.
Unveiling Advanced Frequency Disentanglement Paradigm for Low-Light Image Enhancement
Kun Zhou, Xinyu Lin, Wenbo Li et al.
SDPT: Synchronous Dual Prompt Tuning for Fusion-based Visual-Language Pre-trained Models
Yang Zhou, Yongjian Wu, Jiya Saiyin et al.
HandDAGT: A Denoising Adaptive Graph Transformer for 3D Hand Pose Estimation
WENCAN CHENG, Eun-Ji Kim, Jong Hwan Ko
Region-Native Visual Tokenization
Mengyu Wang, Yuyao Huang, Henghui Ding et al.
Adversarial Robustification via Text-to-Image Diffusion Models
Daewon Choi, Jongheon Jeong, Huiwon Jang et al.
SeiT++: Masked Token Modeling Improves Storage-efficient Training
Minhyun Lee, Song Park, Byeongho Heo et al.
SparseRadNet: Sparse Perception Neural Network on Subsampled Radar Data
Jialong Wu, Mirko Meuter, Markus Schoeler et al.
Adaptive Annealing for Robust Averaging
Sidhartha Chitturi, Venu Madhav Govindu
FAMOUS: High-Fidelity Monocular 3D Human Digitization Using View Synthesis
Vishnu Mani Hema, Shubhra Aich, Christian Haene et al.
Multi-RoI Human Mesh Recovery with Camera Consistency and Contrastive Losses
Yongwei Nie, Changzhen Liu, Chengjiang Long et al.
Beyond Viewpoint: Robust 3D Object Recognition under Arbitrary Views through Joint Multi-Part Representation
Linlong Fan, Ye Huang, Yanqi Ge et al.
Unified Local-Cloud Decision-Making via Reinforcement Learning
Kathakoli Sengupta, Zhongkai Shangguan, Sandesh Bharadwaj et al.
MC-PanDA: Mask Confidence for Panoptic Domain Adaptation
Ivan Martinovic, Josip Šarić, Siniša Šegvić
UNIKD: UNcertainty-Filtered Incremental Knowledge Distillation for Neural Implicit Representation
Mengqi GUO, Chen Li, Hanlin Chen et al.
DetailSemNet: Elevating Signature Verification through Detail-Semantic Integration
Meng-Cheng Shih, Tsai-Ling Huang, Yu-Heng Shih et al.
Spatio-Temporal Proximity-Aware Dual-Path Model for Panoramic Activity Recognition
Sumin Lee, Yooseung Wang, Sangmin Woo et al.
GTMS: A Gradient-driven Tree-guided Mask-free Referring Image Segmentation Method
Haoxin Lyu, Tianxiong Zhong, Sanyuan Zhao
Weighting Pseudo-Labels via High-Activation Feature Index Similarity and Object Detection for Semi-Supervised Segmentation
Prantik Howlader, Hieu Le, Dimitris Samaras
OAT: Object-Level Attention Transformer for Gaze Scanpath Prediction
Yini Fang, Jingling Yu, Haozheng Zhang et al.
Remove Projective LiDAR Depthmap Artifacts via Exploiting Epipolar Geometry
Shengjie Zhu, Girish Chandar Ganesan, Abhinav Kumar et al.
Exploring Active Learning in Meta-Learning: Enhancing Context Set Labeling
Wonho Bae, Jing Wang, Danica J. Sutherland
Towards Robust Full Low-bit Quantization of Super Resolution Networks
Denis Makhov, Irina Zhelavskaya, Ruslan Ostapets et al.
MetaAug: Meta-Data Augmentation for Post-Training Quantization
Cuong Pham, Hoang Anh Dung, Cuong Cao Nguyen et al.
Face Reconstruction Transfer Attack as Out-of-Distribution Generalization
Yoon Gyo Jung, Jaewoo Park, Xingbo Dong et al.
Beyond the Data Imbalance: Employing the Heterogeneous Datasets for Vehicle Maneuver Prediction
Hyeongseok Jeon, Sanmin Kim, Abi Rahman Syamil et al.
Inter-Class Topology Alignment for Efficient Black-Box Substitute Attacks
lingzhuang meng, Mingwen Shao, Yuanjian Qiao et al.
Transferable 3D Adversarial Shape Completion using Diffusion Models
Xuelong Dai, Bin Xiao
Resolving Scale Ambiguity in Multi-view 3D Reconstruction using Dual-Pixel Sensors
Kohei Ashida, Hiroaki Santo, Fumio Okura et al.
Group Testing for Accurate and Efficient Range-Based Near Neighbor Search for Plagiarism Detection
Harsh Shah, Kashish Mittal, Ajit Rajwade
BaSIC: BayesNet Structure Learning for Computational Scalable Neural Image Compression
Yufeng Zhang, Hang Yu, Shizhan Liu et al.
GRAPE: Generalizable and Robust Multi-view Facial Capture
Jing Li, Di Kang, Zhenyu He
Multimodal Label Relevance Ranking via Reinforcement Learning
Taian Guo, Taolin Zhang, Haoqian Wu et al.
An Information Theoretical View for Out-Of-Distribution Detection
Jinjing Hu, Wenrui Liu, Hong Chang et al.
REFRAME: Reflective Surface Real-Time Rendering for Mobile Devices
Chaojie Ji, Yufeng Li, Yiyi Liao
Efficient Pre-training for Localized Instruction Generation of Procedural Videos
Anil Batra, Davide Moltisanti, Laura Sevilla-Lara et al.
Multiscale Graph Texture Network
Ravishankar Evani, Deepu Rajan, Shangbo Mao
Depth-guided NeRF Training via Earth Mover’s Distance
Anita Rau, Josiah Aklilu, Floyd C Holsinger et al.
Efficient Neural Video Representation with Temporally Coherent Modulation
Seungjun Shin, Suji Kim, Dokwan Oh
Wavelength-Embedding-guided Filter-Array Transformer for Spectral Demosaicing
haijin zeng, Hiep Luong, Wilfried Philips
CriSp: Leveraging Tread Depth Maps for Enhanced Crime-Scene Shoeprint Matching
Samia Shafique, Shu Kong, Charless Fowlkes
Commonly Interesting Images
Fitim Abdullahu, Helmut Grabner
Learning to Build by Building Your Own Instructions
Aaron Walsman, Muru Zhang, Adam Fishman et al.
BugNIST - a Large Volumetric Dataset for Detection under Domain Shift
Patrick Jensen, Vedrana Dahl, Rebecca Engberg et al.
Efficient Snapshot Spectral Imaging: Calibration-Free Parallel Structure with Aperture Diffraction Fusion
Tao Lv, Lihao Hu, Shiqiao Li et al.
Classification Matters: Improving Video Action Detection with Class-Specific Attention
Jinsung Lee, Taeoh Kim, Inwoong Lee et al.
Refine, Discriminate and Align: Stealing Encoders via Sample-Wise Prototypes and Multi-Relational Extraction
Shuchi Wu, Chuan Ma, Kang Wei et al.
Rebalancing Using Estimated Class Distribution for Imbalanced Semi-Supervised Learning under Class Distribution Mismatch
Taemin Park, Hyuck Lee, Heeyoung Kim
Möbius Transform for Mitigating Perspective Distortions in Representation Learning
Prakash Chandra Chhipa, Meenakshi Subhash Chippa, Kanjar De et al.
Neural Poisson Solver: A Universal and Continuous Framework for Natural Signal Blending
Delong Wu, Hao Zhu, Qi Zhang et al.
Training A Secure Model against Data-Free Model Extraction
Zhenyi Wang, Li Shen, junfeng guo et al.
Optimizing Factorized Encoder Models: Time and Memory Reduction for Scalable and Efficient Action Recognition
Shreyank Narayana Gowda, Anurag Arnab, Jonathan Huang
Semantic-guided Robustness Tuning for Few-Shot Transfer Across Extreme Domain Shift
kangyu xiao, Zilei Wang, junjie li
Adaptive Selection of Sampling-Reconstruction in Fourier Compressed Sensing
Seongmin Hong, Jaehyeok Bae, Jongho Lee et al.
UniVoxel: Fast Inverse Rendering by Unified Voxelization of Scene Representation
Shuang Wu, Songlin Tang, Guangming Lu et al.
AID-AppEAL: Automatic Image Dataset and Algorithm for Content Appeal Enhancement and Assessment Labeling
Sherry Chen, Yaron Vaxman, Elad Ben Baruch et al.
An Optimal Control View of LoRA and Binary Controller Design for Vision Transformers
CHI Zhang, Jingpu Cheng, Qianxiao Li
Distractor-Free Novel View Synthesis via Exploiting Memorization Effect in Optimization
Yukun Wang, Kunhong Li, Minglin Chen et al.
Image-Feature Weak-to-Strong Consistency: An Enhanced Paradigm for Semi-Supervised Learning
Zhiyu Wu, Jin shi Cui
On Spectral Properties of Gradient-based Explanation Methods
Amir Mehrpanah, Erik Englesson, Hossein Azizpour
Human Motion Forecasting in Dynamic Domain Shifts: A Homeostatic Continual Test-time Adaptation Framework
Qiongjie Cui, Huaijiang Sun, Bin Li et al.
POCA: Post-training Quantization with Temporal Alignment for Codec Avatars
Jian Meng, Yuecheng Li, CHENGHUI Li et al.
Occlusion Handling in 3D Human Pose Estimation with Perturbed Positional Encoding
niloofar azizi, Mohsen Fayyaz, Horst Bischof
Learning Non-Linear Invariants for Unsupervised Out-of-Distribution Detection
Lars Doorenbos, Raphael Sznitman, Pablo Márquez Neila
Semicalibrated Relative Pose from an Affine Correspondence and Monodepth
Petr Hrubý, Marc Pollefeys, Daniel Barath
Diverse Text-to-3D Synthesis with Augmented Text Embedding
Uy Tran, Minh N. Hoang Luu, Phong Nguyen et al.
GroundUp: Rapid Sketch-Based 3D City Massing
Gizem Esra Unlu, Mohamed Sayed, Yulia Gryaditskaya et al.
Mutual Learning for Acoustic Matching and Dereverberation via Visual Scene-driven Diffusion
Jian Ma, Wenguan Wang, Yi Yang et al.
Forget More to Learn More: Domain-specific Feature Unlearning for Semi-supervised and Unsupervised Domain Adaptation
Hritam Basak, Zhaozheng Yin
Flatness-aware Sequential Learning Generates Resilient Backdoors
Hoang Pham, The-Anh Ta, Anh Tran et al.
Not Just Change the Labels, Learn the Features: Watermarking Deep Neural Networks with Multi-View Data
Yuxuan Li, Sarthak Kumar Maharana, Yunhui Guo
ActionSwitch: Class-agnostic Detection of Simultaneous Actions in Streaming Videos
Kang Hyolim, Jeongseok Hyun, Joungbin An et al.
FairViT: Fair Vision Transformer via Adaptive Masking
Bowei Tian, Ruijie Du, Yanning Shen
GOEmbed: Gradient Origin Embeddings for Representation Agnostic 3D Feature Learning
Animesh Karnewar, Roman Shapovalov, Tom Monnier et al.
Multi-Granularity Sparse Relationship Matrix Prediction Network for End-to-End Scene Graph Generation
lei wang, Zejian Yuan, Badong Chen
PACE: Pose Annotations in Cluttered Environments
Yang You, kai xiong, Zhening Yang et al.
Learning a Dynamic Privacy-preserving Camera Robust to Inversion Attacks
Jiacheng Cheng, Xiang Dai, Jia Wan et al.
TreeSBA: Tree-Transformer for Self-Supervised Sequential Brick Assembly
Mengqi GUO, Chen Li, Yuyang Zhao et al.
MO-EMT-NAS: Multi-Objective Continuous Transfer of Architectural Knowledge Between Tasks from Different Datasets
Peng Liao, Xilu Wang, Yaochu Jin et al.
Harmonizing knowledge Transfer in Neural Network with Unified Distillation
yaomin huang, faming Fang, Zaoming Yan et al.
Self-Supervised Video Copy Localization with Regional Token Representation
Minlong Lu, Yichen Lu, Siwei Nie et al.
Catastrophic Overfitting: A Potential Blessing in Disguise
MN Zhao, Lihe Zhang, Yuqiu Kong et al.
Aligning Neuronal Coding of Dynamic Visual Scenes with Foundation Vision Models
Rining Wu, Feixiang Zhou, Ziwei Yin et al.
Deep Nets with Subsampling Layers Unwittingly Discard Useful Activations at Test-Time
Chiao-An Yang, Ziwei Liu, Raymond Yeh
Forecasting Future Videos from Novel Views via Disentangled 3D Scene Representation
Sudhir Kumar Reddy Yarram, Junsong Yuan
Diffusion-Refined VQA Annotations for Semi-Supervised Gaze Following
Qiaomu Miao, Alexandros Graikos, Jingwei Zhang et al.
SUP-NeRF: A Streamlined Unification of Pose Estimation and NeRF for Monocular 3D Object Reconstruction
Yuliang Guo, Abhinav Kumar, Cheng Zhao et al.
LASS3D: Language-Assisted Semi-Supervised 3D Semantic Segmentation with Progressive Unreliable Data Exploitation
Jianan Li, Qiulei Dong
Stripe Observation Guided Inference Cost-free Attention Mechanism
Zhongzhan Huang, Shanshan Zhong, Wushao Wen et al.
A Probability-guided Sampler for Neural Implicit Surface Rendering
Gonçalo José Dias Pais, Valter André Piedade, Moitreya Chatterjee et al.
Learning Anomalies with Normality Prior for Unsupervised Video Anomaly Detection
Haoyue Shi, Le Wang, Sanping Zhou et al.
Towards Certifiably Robust Face Recognition
Seunghun Paik, Dongsoo Kim, Chanwoo Hwang et al.
Leveraging Imperfect Restoration for Data Availability Attack
YI HUANG, Jeremy Styborski, Mingzhi Lyu et al.
Data Collection-free Masked Video Modeling
Yuchi Ishikawa, Masayoshi Kondo, Yoshimitsu Aoki
Bones Can't Be Triangles: Accurate and Efficient Vertebrae Keypoint Estimation through Collaborative Error Revision
Jinhee Kim, Taesung Kim, Choo Jaegul
A Secure Image Watermarking Framework with Statistical Guarantees via Adversarial Attacks on Secret Key Networks
Feiyu CHEN, Wei Lin, Ziquan Liu et al.
CARB-Net: Camera-Assisted Radar-Based Network for Vulnerable Road User Detection
Wei-Yu Lee, Martin Dimitrievski, David Van Hamme et al.
EpipolarGAN: Omnidirectional Image Synthesis with Explicit Camera Control
Christopher May, Daniel Aliaga
On-the-fly Category Discovery for LiDAR Semantic Segmentation
HYEONSEONG KIM, Sung-Hoon Yoon, Minseok Kim et al.
An Optimization Framework to Enforce Multi-View Consistency for Texturing 3D Meshes
Zhengyi Zhao, Chen Song, Xiaodong Gu et al.
Text Motion Translator: A Bi-Directional Model for Enhanced 3D Human Motion Generation from Open-Vocabulary Descriptions
Yijun Qian, Jack Urbanek, Alexander Hauptmann et al.
Time-Efficient and Identity-Consistent Virtual Try-On Using A Variant of Altered Diffusion Models
Phuong Dam, Jihoon Jeong, Anh Tran et al.
Random Walk on Pixel Manifolds for Anomaly Segmentation of Complex Driving Scenes
Zelong Zeng, Kaname Tomite
Bayesian Detector Combination for Object Detection with Crowdsourced Annotations
Zhi Qin Tan, Olga Isupova, Gustavo Carneiro et al.
Source-Free Domain-Invariant Performance Prediction
Ekaterina Khramtsova, Mahsa Baktashmotlagh, Guido Zuccon et al.
Freeview Sketching: View-Aware Fine-Grained Sketch-Based Image Retrieval
Aneeshan Sain, Pinaki Nath Chowdhury, Subhadeep Koley et al.
Integration of Global and Local Representations for Fine-grained Cross-modal Alignment
Seungwan Jin, Hoyoung Choi, Taehyung Noh et al.
Exploiting Supervised Poison Vulnerability to Strengthen Self-Supervised Defense
Jeremy Styborski, Mingzhi Lyu, YI HUANG et al.
ADen: Adaptive Density Representations for Sparse-view Camera Pose Estimation
Hao Tang, Weiyao Wang, Pierre Gleize et al.
ELSE: Efficient Deep Neural Network Inference through Line-based Sparsity Exploration
Zeqi Zhu, Alberto Garcia-Ortiz, Luc Waeijen et al.
Differentiable Convex Polyhedra Optimization from Multi-view Images
Daxuan Ren, Haiyi Mei, Hezi Shi et al.
Rethinking Unsupervised Outlier Detection via Multiple Thresholding
Zhonghang Liu, Panzhong Lu, Guoyang Xie et al.
Compositional Substitutivity of Visual Reasoning for Visual Question Answering
Chuanhao Li, Zhen Li, Chenchen Jing et al.
Dependency-aware Differentiable Neural Architecture Search
Buang Zhang, Xinle Wu, Hao Miao et al.
DεpS: Delayed ε-Shrinking for Faster Once-For-All Training
Aditya Annavajjala, Alind Khare, Animesh Agrawal et al.
Object-Oriented Anchoring and Modal Alignment in Multimodal Learning
Shibin Mei, Bingbing Ni, Hang Wang et al.
GTPT: Group-based Token Pruning Transformer for Efficient Human Pose Estimation
Haonan Wang, Jie Liu, Jie Tang et al.
SUMix: Mixup with Semantic and Uncertain Information
Huafeng Qin, Xin Jin, Hongyu Zhu et al.
Learning Exhaustive Correlation for Spectral Super-Resolution: Where Spatial-Spectral Attention Meets Linear Dependence
Hongyuan Wang, Lizhi Wang, Jiang Xu et al.
See and Think: Embodied Agent in Virtual Environment
Zhonghan Zhao, Xuan Wang, Wenhao Chai et al.
Scalar Function Topology Divergence: Comparing Topology of 3D Objects
Ilya Trofimov, Daria Voronkova, Eduard Tulchinskii et al.
Forest2Seq: Revitalizing Order Prior for Sequential Indoor Scene Synthesis
Qi Sun, Hang Zhou, Wengang Zhou et al.
Convex Relaxations for Manifold-Valued Markov Random Fields with Approximation Guarantees
Robin Kenis, Emanuel Laude, Panagiotis Patrinos
SceneVerse: Scaling 3D Vision-Language Learning for Grounded Scene Understanding
Baoxiong Jia, Yixin Chen, Huangyue Yu et al.
Omni6D: Large-Vocabulary 3D Object Dataset for Category-Level 6D Object Pose Estimation
Mengchen Zhang, Tong Wu, Tai Wang et al.
SENC: Handling Self-collision in Neural Cloth Simulation
Zhouyingcheng Liao, Sinan Wang, Taku Komura
m&m’s: A Benchmark to Evaluate Tool-Use for multi-step multi-modal Tasks
Zixian Ma, Weikai Huang, Jieyu Zhang et al.
3DFG-PIFu: 3D Feature Grids for Human Digitization from Sparse Views
Kennard Yanting Chan, Fayao Liu, Guosheng Lin et al.
High-Fidelity 3D Textured Shapes Generation by Sparse Encoding and Adversarial Decoding
Qi Zuo, Xiaodong Gu, Yuan Dong et al.
Text to Layer-wise 3D Clothed Human Generation
Junting Dong, Qi Fang, Zehuan Huang et al.
Local All-Pair Correspondence for Point Tracking
Seokju Cho, Jiahui Huang, Jisu Nam et al.
Distribution Alignment for Fully Test-Time Adaptation with Dynamic Online Data Streams
Ziqiang Wang, Zhixiang Chi, Yanan Wu et al.
Generative Camera Dolly: Extreme Monocular Dynamic Novel View Synthesis
Basile Van Hoorick, Rundi Wu, Ege Ozguroglu et al.
MyVLM: Personalizing VLMs for User-Specific Queries
Yuval Alaluf, Elad Richardson, Sergey Tulyakov et al.
SIGMA: Sinkhorn-Guided Masked Video Modeling
Mohammadreza Salehi, Michael Dorkenwald, Fida Mohammad Thoker et al.