Most Cited ECCV "generalist agents" Papers
2,387 papers found • Page 11 of 12
Conference
POA: Pre-training Once for Models of All Sizes
Yingying Zhang, Xin Guo, Jiangwei Lao et al.
Responsible Visual Editing
Minheng Ni, Yeli Shen, Yabin Zhang et al.
Do Generalised Classifiers really work on Human Drawn Sketches?
Hmrishav Bandyopadhyay, Pinaki Nath Chowdhury, Aneeshan Sain et al.
ViG-Bias: Visually Grounded Bias Discovery and Mitigation
Badr-Eddine Marani, Mohamed HANINI, Nihitha Malayarukil et al.
MARs: Multi-view Attention Regularizations for Patch-based Feature Recognition of Space Terrain
Timothy Chase, Karthik Dantu
Synchronization of Projective Transformations
Rakshith Madhavan, Andrea Fusiello, Federica Arrigoni
Neural graphics texture compression supporting random access
Farzad Farhadzadeh, Qiqi Hou, Hoang Le et al.
COSMU: Complete 3D human shape from monocular unconstrained images
Marco Pesavento, Marco Volino, Adrian Hilton
MC-PanDA: Mask Confidence for Panoptic Domain Adaptation
Ivan Martinovic, Josip Šarić, Siniša Šegvić
A Semantic Space is Worth 256 Language Descriptions: Make Stronger Segmentation Models with Descriptive Properties
Junfei Xiao, Ziqi Zhou, Wenxuan Li et al.
RoofDiffusion: Constructing Roofs from Severely Corrupted Point Data via Diffusion
Kyle Lo, Jorg Peters, Eric Spellman
LineFit: A Geometric Approach for Fitting Line Segments in Images
Marion BOYER, David Youssefi, Florent Lafarge
Distributed Semantic Segmentation with Efficient Joint Source and Task Decoding
Danish Nazir, Timo Bartels, Jan Piewek et al.
Single-Mask Inpainting for Voxel-based Neural Radiance Fields
Jiafu Chen, Tianyi Chu, Jiakai Sun et al.
PAV: Personalized Head Avatar from Unstructured Video Collection
Akin Caliskan, Berkay Kicanaoglu, H K
DISCO: Embodied Navigation and Interaction via Differentiable Scene Semantics and Dual-level Control
Xinyu Xu, Shengcheng Luo, Yanchao Yang et al.
Stable Preference: Redefining training paradigm of human preference model for Text-to-Image Synthesis
Hanting Li, Hongjing Niu, Feng Zhao
Effective Lymph Nodes Detection in CT Scans Using Location Debiased Query Selection and Contrastive Query Representation in Transformer
Qinji Yu, Yirui Wang, Ke Yan et al.
REDIR: Refocus-free Event-based De-occlusion Image Reconstruction
Qi Guo, Hailong Shi, Huan Li et al.
UniVoxel: Fast Inverse Rendering by Unified Voxelization of Scene Representation
Shuang Wu, Songlin Tang, Guangming Lu et al.
Integration of Global and Local Representations for Fine-grained Cross-modal Alignment
Seungwan Jin, Hoyoung Choi, Taehyung Noh et al.
Exploiting Supervised Poison Vulnerability to Strengthen Self-Supervised Defense
Jeremy Styborski, Mingzhi Lyu, YI HUANG et al.
Remove Projective LiDAR Depthmap Artifacts via Exploiting Epipolar Geometry
Shengjie Zhu, Girish Chandar Ganesan, Abhinav Kumar et al.
Neural Poisson Solver: A Universal and Continuous Framework for Natural Signal Blending
Delong Wu, Hao Zhu, Qi Zhang et al.
Source-Free Domain-Invariant Performance Prediction
Ekaterina Khramtsova, Mahsa Baktashmotlagh, Guido Zuccon et al.
EpipolarGAN: Omnidirectional Image Synthesis with Explicit Camera Control
Christopher May, Daniel Aliaga
Evaluating the Adversarial Robustness of Semantic Segmentation: Trying Harder Pays Off
Levente Ferenc Halmosi, Bálint Mohos, Márk Jelasity
Data Overfitting for On-Device Super-Resolution with Dynamic Algorithm and Compiler Co-Design
Li, zhihao shu, Jie Ji et al.
Forecasting Future Videos from Novel Views via Disentangled 3D Scene Representation
Sudhir Kumar Reddy Yarram, Junsong Yuan
RANRAC: Robust Neural Scene Representations via Random Ray Consensus
Benno Buschmann, Andreea Dogaru, Elmar Eisemann et al.
Diverse Text-to-3D Synthesis with Augmented Text Embedding
Uy Tran, Minh N. Hoang Luu, Phong Nguyen et al.
GroundUp: Rapid Sketch-Based 3D City Massing
Gizem Esra Unlu, Mohamed Sayed, Yulia Gryaditskaya et al.
Möbius Transform for Mitigating Perspective Distortions in Representation Learning
Prakash Chandra Chhipa, Meenakshi Subhash Chippa, Kanjar De et al.
Leveraging Imperfect Restoration for Data Availability Attack
YI HUANG, Jeremy Styborski, Mingzhi Lyu et al.
Learning a Dynamic Privacy-preserving Camera Robust to Inversion Attacks
Jiacheng Cheng, Xiang Dai, Jia Wan et al.
On Spectral Properties of Gradient-based Explanation Methods
Amir Mehrpanah, Erik Englesson, Hossein Azizpour
Multimodal Label Relevance Ranking via Reinforcement Learning
Taian Guo, Taolin Zhang, Haoqian Wu et al.
Leveraging Text Localization for Scene Text Removal via Text-aware Masked Image Modeling
Zixiao Wang, Hongtao Xie, YuXin Wang et al.
Face Reconstruction Transfer Attack as Out-of-Distribution Generalization
Yoon Gyo Jung, Jaewoo Park, Xingbo Dong et al.
Adaptive Selection of Sampling-Reconstruction in Fourier Compressed Sensing
Seongmin Hong, Jaehyeok Bae, Jongho Lee et al.
POET: Prompt Offset Tuning for Continual Human Action Adaptation
Prachi Garg, Joseph K J, Vineeth N Balasubramanian et al.
CARB-Net: Camera-Assisted Radar-Based Network for Vulnerable Road User Detection
Wei-Yu Lee, Martin Dimitrievski, David Van Hamme et al.
Text Motion Translator: A Bi-Directional Model for Enhanced 3D Human Motion Generation from Open-Vocabulary Descriptions
Yijun Qian, Jack Urbanek, Alexander Hauptmann et al.
Bayesian Detector Combination for Object Detection with Crowdsourced Annotations
Zhi Qin Tan, Olga Isupova, Gustavo Carneiro et al.
CriSp: Leveraging Tread Depth Maps for Enhanced Crime-Scene Shoeprint Matching
Samia Shafique, Shu Kong, Charless Fowlkes
Commonly Interesting Images
Fitim Abdullahu, Helmut Grabner
Fast Registration of Photorealistic Avatars for VR Facial Animation
Chaitanya Patel, Shaojie Bai, Te-Li Wang et al.
DatasetNeRF: Efficient 3D-aware Data Factory with Generative Radiance Fields
Yu Chi, Fangneng Zhan, Sibo Wu et al.
ADen: Adaptive Density Representations for Sparse-view Camera Pose Estimation
Hao Tang, Weiyao Wang, Pierre Gleize et al.
An Optimization Framework to Enforce Multi-View Consistency for Texturing 3D Meshes
Zhengyi Zhao, Chen Song, Xiaodong Gu et al.
RING-NeRF : Rethinking Inductive Biases for Versatile and Efficient Neural Fields
Doriand Petit, Steve Bourgeois, Dumitru Pavel et al.
Distractor-Free Novel View Synthesis via Exploiting Memorization Effect in Optimization
Yukun Wang, Kunhong Li, Minglin Chen et al.
Consistent 3D Line Mapping
Xulong Bai, Hainan Cui, Shuhan Shen
ActionSwitch: Class-agnostic Detection of Simultaneous Actions in Streaming Videos
Kang Hyolim, Jeongseok Hyun, Joungbin An et al.
Stripe Observation Guided Inference Cost-free Attention Mechanism
Zhongzhan Huang, Shanshan Zhong, Wushao Wen et al.
Random Walk on Pixel Manifolds for Anomaly Segmentation of Complex Driving Scenes
Zelong Zeng, Kaname Tomite
Multi-Granularity Sparse Relationship Matrix Prediction Network for End-to-End Scene Graph Generation
lei wang, Zejian Yuan, Badong Chen
Freeview Sketching: View-Aware Fine-Grained Sketch-Based Image Retrieval
Aneeshan Sain, Pinaki Nath Chowdhury, Subhadeep Koley et al.
Beyond the Data Imbalance: Employing the Heterogeneous Datasets for Vehicle Maneuver Prediction
Hyeongseok Jeon, Sanmin Kim, Abi Rahman Syamil et al.
Transferable 3D Adversarial Shape Completion using Diffusion Models
Xuelong Dai, Bin Xiao
An Information Theoretical View for Out-Of-Distribution Detection
Jinjing Hu, Wenrui Liu, Hong Chang et al.
Dependency-aware Differentiable Neural Architecture Search
Buang Zhang, Xinle Wu, Hao Miao et al.
Refine, Discriminate and Align: Stealing Encoders via Sample-Wise Prototypes and Multi-Relational Extraction
Shuchi Wu, Chuan Ma, Kang Wei et al.
Understanding and Mitigating Human-Labelling Errors in Supervised Contrastive Learning
Zijun Long, Lipeng Zhuang, George W Killick et al.
Leveraging scale- and orientation-covariant features for planar motion estimation
Marcus Valtonen Örnhag, Alberto Jaenal
MeshFeat: Multi-Resolution Features for Neural Fields on Meshes
Mihir Mahajan, Florian Hofherr, Daniel Cremers
An Optimal Control View of LoRA and Binary Controller Design for Vision Transformers
CHI Zhang, Jingpu Cheng, Qianxiao Li
Learning Non-Linear Invariants for Unsupervised Out-of-Distribution Detection
Lars Doorenbos, Raphael Sznitman, Pablo Márquez Neila
Flatness-aware Sequential Learning Generates Resilient Backdoors
Hoang Pham, The-Anh Ta, Anh Tran et al.
On-the-fly Category Discovery for LiDAR Semantic Segmentation
HYEONSEONG KIM, Sung-Hoon Yoon, Minseok Kim et al.
Object-Oriented Anchoring and Modal Alignment in Multimodal Learning
Shibin Mei, Bingbing Ni, Hang Wang et al.
Time-Efficient and Identity-Consistent Virtual Try-On Using A Variant of Altered Diffusion Models
Phuong Dam, Jihoon Jeong, Anh Tran et al.
Compositional Substitutivity of Visual Reasoning for Visual Question Answering
Chuanhao Li, Zhen Li, Chenchen Jing et al.
Learning Anomalies with Normality Prior for Unsupervised Video Anomaly Detection
Haoyue Shi, Le Wang, Sanping Zhou et al.
MOD-UV: Learning Mobile Object Detectors from Unlabeled Videos
Yihong Sun, Bharath Hariharan
OLAF: A Plug-and-Play Framework for Enhanced Multi-object Multi-part Scene Parsing
Pranav Gupta, Rishubh Singh, Pradeep Shenoy et al.
Semantic-guided Robustness Tuning for Few-Shot Transfer Across Extreme Domain Shift
kangyu xiao, Zilei Wang, junjie li
ST-LDM: A Universal Framework for Text-Grounded Object Generation in Real Images
Xiangtian Xue, Jiasong Wu, Youyong Kong et al.
REFRAME: Reflective Surface Real-Time Rendering for Mobile Devices
Chaojie Ji, Yufeng Li, Yiyi Liao
MetaAug: Meta-Data Augmentation for Post-Training Quantization
Cuong Pham, Hoang Anh Dung, Cuong Cao Nguyen et al.
UNIT: Backdoor Mitigation via Automated Neural Distribution Tightening
Siyuan Cheng, Guangyu Shen, Kaiyuan Zhang et al.
Rebalancing Using Estimated Class Distribution for Imbalanced Semi-Supervised Learning under Class Distribution Mismatch
Taemin Park, Hyuck Lee, Heeyoung Kim
Inter-Class Topology Alignment for Efficient Black-Box Substitute Attacks
lingzhuang meng, Mingwen Shao, Yuanjian Qiao et al.
Harmonizing knowledge Transfer in Neural Network with Unified Distillation
yaomin huang, faming Fang, Zaoming Yan et al.
A Secure Image Watermarking Framework with Statistical Guarantees via Adversarial Attacks on Secret Key Networks
Feiyu CHEN, Wei Lin, Ziquan Liu et al.
POCA: Post-training Quantization with Temporal Alignment for Codec Avatars
Jian Meng, Yuecheng Li, CHENGHUI Li et al.
ELSE: Efficient Deep Neural Network Inference through Line-based Sparsity Exploration
Zeqi Zhu, Alberto Garcia-Ortiz, Luc Waeijen et al.
A Probability-guided Sampler for Neural Implicit Surface Rendering
Gonçalo José Dias Pais, Valter André Piedade, Moitreya Chatterjee et al.
Semicalibrated Relative Pose from an Affine Correspondence and Monodepth
Petr Hrubý, Marc Pollefeys, Daniel Barath
Diffusion-Refined VQA Annotations for Semi-Supervised Gaze Following
Qiaomu Miao, Alexandros Graikos, Jingwei Zhang et al.
Efficient Pre-training for Localized Instruction Generation of Procedural Videos
Anil Batra, Davide Moltisanti, Laura Sevilla-Lara et al.
Mutual Learning for Acoustic Matching and Dereverberation via Visual Scene-driven Diffusion
Jian Ma, Wenguan Wang, Yi Yang et al.
Forget More to Learn More: Domain-specific Feature Unlearning for Semi-supervised and Unsupervised Domain Adaptation
Hritam Basak, Zhaozheng Yin
LASS3D: Language-Assisted Semi-Supervised 3D Semantic Segmentation with Progressive Unreliable Data Exploitation
Jianan Li, Qiulei Dong
PACE: Pose Annotations in Cluttered Environments
Yang You, kai xiong, Zhening Yang et al.
Group Testing for Accurate and Efficient Range-Based Near Neighbor Search for Plagiarism Detection
Harsh Shah, Kashish Mittal, Ajit Rajwade
BaSIC: BayesNet Structure Learning for Computational Scalable Neural Image Compression
Yufeng Zhang, Hang Yu, Shizhan Liu et al.
Data Collection-free Masked Video Modeling
Yuchi Ishikawa, Masayoshi Kondo, Yoshimitsu Aoki
Deep Nets with Subsampling Layers Unwittingly Discard Useful Activations at Test-Time
Chiao-An Yang, Ziwei Liu, Raymond Yeh
Classification Matters: Improving Video Action Detection with Class-Specific Attention
Jinsung Lee, Taeoh Kim, Inwoong Lee et al.
Optimizing Factorized Encoder Models: Time and Memory Reduction for Scalable and Efficient Action Recognition
Shreyank Narayana Gowda, Anurag Arnab, Jonathan Huang
Multiscale Graph Texture Network
Ravishankar Evani, Deepu Rajan, Shangbo Mao
Human Motion Forecasting in Dynamic Domain Shifts: A Homeostatic Continual Test-time Adaptation Framework
Qiongjie Cui, Huaijiang Sun, Bin Li et al.
Rethinking Unsupervised Outlier Detection via Multiple Thresholding
Zhonghang Liu, Panzhong Lu, Guoyang Xie et al.
Deep Cost Ray Fusion for Sparse Depth Video Completion
Jungeon Kim, Soongjin Kim, Jaesik Park et al.
AID-AppEAL: Automatic Image Dataset and Algorithm for Content Appeal Enhancement and Assessment Labeling
Sherry Chen, Yaron Vaxman, Elad Ben Baruch et al.
Aligning Neuronal Coding of Dynamic Visual Scenes with Foundation Vision Models
Rining Wu, Feixiang Zhou, Ziwei Yin et al.
An accurate detection is not all you need to combat label noise in web-noisy datasets
Paul Albert, Kevin McGuinness, Eric Arazo et al.
Self-Supervised Video Copy Localization with Regional Token Representation
Minlong Lu, Yichen Lu, Siwei Nie et al.
DεpS: Delayed ε-Shrinking for Faster Once-For-All Training
Aditya Annavajjala, Alind Khare, Animesh Agrawal et al.
Image-Feature Weak-to-Strong Consistency: An Enhanced Paradigm for Semi-Supervised Learning
Zhiyu Wu, Jin shi Cui
Wavelength-Embedding-guided Filter-Array Transformer for Spectral Demosaicing
haijin zeng, Hiep Luong, Wilfried Philips
Not Just Change the Labels, Learn the Features: Watermarking Deep Neural Networks with Multi-View Data
Yuxuan Li, Sarthak Kumar Maharana, Yunhui Guo
DreamDrone: Text-to-Image Diffusion Models are Zero-shot Perpetual View Generators
Hanyang Kong, Dongze Lian, Michael Bi Mi et al.
Open-set Domain Adaptation via Joint Error based Multi-class Positive and Unlabeled Learning
Dexuan Zhang, Thomas Westfechtel, Tatsuya Harada
Efficient Snapshot Spectral Imaging: Calibration-Free Parallel Structure with Aperture Diffraction Fusion
Tao Lv, Lihao Hu, Shiqiao Li et al.
Training A Secure Model against Data-Free Model Extraction
Zhenyi Wang, Li Shen, junfeng guo et al.
Differentiable Convex Polyhedra Optimization from Multi-view Images
Daxuan Ren, Haiyi Mei, Hezi Shi et al.
Towards Robust Full Low-bit Quantization of Super Resolution Networks
Denis Makhov, Irina Zhelavskaya, Ruslan Ostapets et al.
BugNIST - a Large Volumetric Dataset for Detection under Domain Shift
Patrick Jensen, Vedrana Dahl, Rebecca Engberg et al.
Resolving Scale Ambiguity in Multi-view 3D Reconstruction using Dual-Pixel Sensors
Kohei Ashida, Hiroaki Santo, Fumio Okura et al.
Self-supervised Shape Completion via Involution and Implicit Correspondences
Mengya Liu, Ajad Chhatkuli, Janis Postels et al.
Exploring Active Learning in Meta-Learning: Enhancing Context Set Labeling
Wonho Bae, Jing Wang, Danica J. Sutherland
Learning to Build by Building Your Own Instructions
Aaron Walsman, Muru Zhang, Adam Fishman et al.
GOEmbed: Gradient Origin Embeddings for Representation Agnostic 3D Feature Learning
Animesh Karnewar, Roman Shapovalov, Tom Monnier et al.
Towards compact reversible image representations for neural style transfer
Xiyao Liu, Siyu Yang, Jian Zhang et al.
Bones Can't Be Triangles: Accurate and Efficient Vertebrae Keypoint Estimation through Collaborative Error Revision
Jinhee Kim, Taesung Kim, Choo Jaegul
Catastrophic Overfitting: A Potential Blessing in Disguise
MN Zhao, Lihe Zhang, Yuqiu Kong et al.
TreeSBA: Tree-Transformer for Self-Supervised Sequential Brick Assembly
Mengqi GUO, Chen Li, Yuyang Zhao et al.
Debiasing surgeon: fantastic weights and how to find them
Remi Nahon, Ivan Luiz De Moura Matos, Van-Tam Nguyen et al.
Towards Certifiably Robust Face Recognition
Seunghun Paik, Dongsoo Kim, Chanwoo Hwang et al.
Unveiling Privacy Risks in Stochastic Neural Networks Training: Effective Image Reconstruction from Gradients
Yiming Chen, Xiangyu Yang, Nikos Deligiannis
Post-training Quantization with Progressive Calibration and Activation Relaxing for Text-to-Image Diffusion Models
Siao Tang, Xin Wang, Hong Chen et al.
A Direct Approach to Viewing Graph Solvability
Federica Arrigoni, Andrea Fusiello, Tomas Pajdla
Three Things We Need to Know About Transferring Stable Diffusion to Visual Dense Prediciton Tasks
Manyuan Zhang, Guanglu Song, Xiaoyu Shi et al.
Unsqueeze [CLS] Bottleneck to Learn Rich Representations
Qing Su, Shihao Ji
DomainFusion: Generalizing To Unseen Domains with Latent Diffusion Models
Yuyang Huang, Yabo Chen, Yuchen Liu et al.
Efficient Vision Transformers with Partial Attention
Xuan-Thuy Vo, Duy-Linh Nguyen, Adri Priadana et al.
TurboEdit: Real-time text-based disentangled real image editing
Zongze Wu, Nicholas I Kolkin, Jonathan Brandt et al.
Rethinking Weakly-supervised Video Temporal Grounding From a Game Perspective
Xiang Fang, Zeyu Xiong, Wanlong Fang et al.
Improving Vision and Language Concepts Understanding with Multimodal Counterfactual Samples
Chengen Lai, Shengli Song, Sitong Yan et al.
Functional Transform-Based Low-Rank Tensor Factorization for Multi-Dimensional Data Recovery
Jian-Li Wang, Xi-Le Zhao
Veil Privacy on Visual Data: Concealing Privacy for Humans, Unveiling for DNNs
Shuchao Pang, Ruhao Ma, Bing Li et al.
TrajPrompt: Aligning Color Trajectory with Vision-Language Representations
Li-Wu Tsao, Hao-Tang Tsui, Yu-Rou Tuan et al.
Clean & Compact: Efficient Data-Free Backdoor Defense with Model Compactness
Huy Phan, Jinqi Xiao, Yang Sui et al.
Align before Collaborate: Mitigating Feature Misalignment for Robust Multi-Agent Perception
Dingkang Yang, Ke Li, Dongling Xiao et al.
Energy-induced Explicit quantification for Multi-modality MRI fusion
Xiaoming Qi, Yuan Zhang, Tong Wang et al.
Language-Assisted Skeleton Action Understanding for Skeleton-Based Temporal Action Segmentation
Haoyu Ji, Bowen Chen, Xinglong Xu et al.
Bridging the Gap: Studio-like Avatar Creation from a Monocular Phone Capture
ShahRukh Athar, Shunsuke Saito, Stanislav Pidhorskyi et al.
LoA-Trans: Enhancing Visual Grounding by Location-Aware Transformers
Ziling Huang, Shin’ichi Satoh
Gradient-based Out-of-Distribution Detection
Taha Entesari, Sina Sharifi, Bardia Safaei et al.
Identity-Consistent Diffusion Network for Grading Knee Osteoarthritis Progression in Radiographic Imaging
Wenhua Wu, Kun Hu, Wenxi Yue et al.
MLPHand: Real Time Multi-View 3D Hand Reconstruction via MLP Modeling
Jian Yang, Jiakun Li, Guoming Li et al.
Spectral Subsurface Scattering for Material Classification
Haejoon Lee, Aswin C. Sankaranarayanan
COIN-Matting: Confounder Intervention for Image Matting
Zhaohe Liao, Jiangtong Li, Jun Lan et al.
When and How do negative prompts take effect?
Yuanhao Ban, Ruochen Wang, Tianyi Zhou et al.
Pseudo-Embedding for Generalized Few-Shot Point Cloud Segmentation
Chih-Jung Tsai, Hwann-Tzong Chen, Tyng-Luh Liu
A Geometric Distortion Immunized Deep Watermarking Framework with Robustness Generalizability
Linfeng Ma, Han Fang, Tianyi Wei et al.
StereoGlue: Joint Feature Matching and Robust Estimation
Daniel Barath, Dmytro Mishkin, Luca Cavalli et al.
Rethinking and Improving Visual Prompt Selection for In-Context Learning Segmentation Framework
Wei Suo, Lanqing Lai, Mengyang Sun et al.
Object-Aware NIR-to-Visible Translation
Yunyi Gao, Lin Gu, Qiankun Liu et al.
iMatching: Imperative Correspondence Learning
Chen Wang, Dasong Gao, Yun-Jou Lin et al.
AnyHome: Open-Vocabulary Large-Scale Indoor Scene Generation with First-Person View Exploration
Rao Fu, Zehao Wen, Zichen Liu et al.
Correspondences of the Third Kind: Camera Pose Estimation from Object Reflection
Kohei Yamashita, Vincent Lepetit, Ko Nishino
Idea2Img: Iterative Self-Refinement with GPT-4V for Automatic Image Design and Generation
Zhengyuan Yang, Jianfeng Wang, Linjie Li et al.
Syn-to-Real Domain Adaptation for Point Cloud Completion via Part-based Approach
Yunseo Yang, Jihun Kim, Kuk-Jin Yoon
Phase Concentration and Shortcut Suppression for Weakly Supervised Semantic Segmentation
Hoyong Kwon, Jaeseok Jeong, Sung-Hoon Yoon et al.
AdaIFL: Adaptive Image Forgery Localization via a Dynamic and Importance-aware Transformer Network
Yuxi Li, Fuyuan Cheng, Wangbo Yu et al.
Lost in Translation: Modern Neural Networks Still Struggle With Small Realistic Image Transformations
Ofir Shifman, Yair Weiss
Event-based Head Pose Estimation: Benchmark and Method
jiahui yuan, Hebei Li, Yansong Peng et al.
How Far Can a 1-Pixel Camera Go? Solving Vision Tasks using Photoreceptors and Computationally Designed Visual Morphology
Andrei Atanov, Rishubh Singh, Jiawei Fu et al.
Robustness Tokens: Towards Adversarial Robustness of Transformers
Brian Pulfer, Yury Belousov, Slava Voloshynovskiy
EINet: Point Cloud Completion via Extrapolation and Interpolation
Pingping Cai, Canyu Zhang, LINGJIA SHI et al.
U-COPE: Taking a Further Step to Universal 9D Category-level Object Pose Estimation
li zhang, Weiqing Meng, Yan Zhong et al.
ReCON: Training-Free Acceleration for Text-to-Image Synthesis with Retrieval of Concept Prompt Trajectories
Chen-yi Lu, Shubham Agarwal, Mehrab Tanjim et al.
Free-ATM: Harnessing Free Attention Masks for Representation Learning on Diffusion-Generated Images
Junhao Zhang, Mutian Xu, Jay Zhangjie Wu et al.
Learning Unsigned Distance Functions from Multi-view Images with Volume Rendering Priors
Wen Yuan Zhang, Kanle Shi, Yushen Liu et al.
MONTRAGE: Monitoring Training for Attribution of Generative Diffusion Models
Jonathan Brokman, Omer Hofman, Roman Vainshtein et al.
Grid-Attention: Enhancing Computational Efficiency of Large Vision Models without Fine-Tuning
Pengyu Li, Biao Wang, Tianchu Guo et al.
E3V-K5: An Authentic Benchmark for Redefining Video-Based Energy Expenditure Estimation
Shengxuming Zhang, Lei Jin, Yifan Wang et al.
Distributed Active Client Selection With Noisy Clients Using Model Association Scores
Kwang In Kim
Enhanced Motion Forecasting with Visual Relation Reasoning
Sungjune Kim, Hadam Baek, Seunggwan Lee et al.
Nymeria: A Massive Collection of Egocentric Multi-modal Human Motion in the Wild
Lingni Ma, Yuting Ye, Rowan Postyeni et al.
Pathformer3D: A 3D Scanpath Transformer for 360° Images
Rong Quan, yantao Lai, Mengyu Qiu et al.
DSA: Discriminative Scatter Analysis for Early Smoke Segmentation
Lujian Yao, Haitao Zhao, Jingchao Peng et al.
Visual Prompting via Partial Optimal Transport
MENGYU ZHENG, Zhiwei Hao, Yehui Tang et al.
LiteSAM is Actually what you Need for segment Everything
Jianhai Fu, Yuanjie Yu, Ningchuan Li et al.
Continuous SO(3) Equivariant Convolution for 3D Point Cloud Analysis
Jaein Kim, HEE BIN YOO, Dong-Sig Han et al.
Clearer Frames, Anytime: Resolving Velocity Ambiguity in Video Frame Interpolation
Zhihang Zhong, Gurunandan Krishnan, Xiao Sun et al.
Efficient Training of Spiking Neural Networks with Multi-Parallel Implicit Stream Architecture
Zhigao Cao, Meng Li, Xiashuang Wang et al.
Towards Unified Representation of Invariant-Specific Features in Missing Modality Face Anti-Spoofing
Guanghao Zheng, Yuchen Liu, Wenrui Dai et al.
NeRF-XL: NeRF at Any Scale with Multi-GPU
Ruilong Li, Sanja Fidler, Angjoo Kanazawa et al.
Probabilistic Image-Driven Traffic Modeling via Remote Sensing
Scott Workman, Armin Hadzic
Diff3DETR: Agent-based Diffusion Model for Semi-supervised 3D Object Detection
Jiacheng Deng, Jiahao Lu, Tianzhu Zhang
6DoF Head Pose Estimation through Explicit Bidirectional Interaction with Face Geometry
Sungho Chun, Ju Yong Chang
S-JEPA: A Joint Embedding Predictive Architecture for Skeletal Action Recognition
Mohamed Abdelfattah, Alexandre ALahi
Self-Supervised Underwater Caustics Removal and Descattering via Deep Monocular SLAM
Jonathan Sauder, Devis TUIA
UAV First-Person Viewers Are Radiance Field Learners
Liqi Yan, Qifan Wang, Junhan Zhao et al.
AdversariaLeak: External Information Leakage Attack Using Adversarial Samples on Face Recognition Systems
Roye Katzav, Amit Giloni, Edita Grolman et al.
CrossScore: A Multi-View Approach to Image Evaluation and Scoring
Zirui Wang, Wenjing Bian, Victor Adrian Prisacariu