Most Cited ECCV "privacy-preserving machine learning" Papers
2,387 papers found • Page 12 of 12
Conference
Textual Knowledge Matters: Cross-Modality Co-Teaching for Generalized Visual Class Discovery
Haiyang Zheng, Pu Nan, Wenjing Li et al.
HeadStudio: Text to Animatable Head Avatars with 3D Gaussian Splatting
Zhenglin Zhou, Fan Ma, Hehe Fan et al.
Explicitly Guided Information Interaction Network for Cross-modal Point Cloud Completion
Hang Xu, Chen Long, Wenxiao Zhang et al.
Textual Grounding for Open-vocabulary Visual Information Extraction in Layout-diversified Documents
MENGJUN CHENG, Chengquan Zhang, Chang Liu et al.
TAPTR: Tracking Any Point with Transformers as Detection
Hongyang Li, Hao Zhang, Shilong Liu et al.
Relightable 3D Gaussians: Realistic Point Cloud Relighting with BRDF Decomposition and Ray Tracing
Jian Gao, chun gu, Youtian Lin et al.
COMPOSE: Comprehensive Portrait Shadow Editing
Andrew Hou, Zhixin Shu, Xuaner Zhang et al.
Learning Representations from Foundation Models for Domain Generalized Stereo Matching
Yongjian Zhang, Longguang Wang, Kunhong Li et al.
PreciseControl: Enhancing Text-To-Image Diffusion Models with Fine-Grained Attribute Control
Rishubh Parihar, Sachidanand VS, Sabariswaran Mani et al.
Controllable Human-Object Interaction Synthesis
Jiaman Li, Alexander Clegg, Roozbeh Mottaghi et al.
Nymeria: A Massive Collection of Egocentric Multi-modal Human Motion in the Wild
Lingni Ma, Yuting Ye, Rowan Postyeni et al.
MAD-DR: Map Compression for Visual Localization with Matchness Aware Descriptor Dimension Reduction
Qiang Wang
SpeedUpNet: A Plug-and-Play Adapter Network for Accelerating Text-to-Image Diffusion Models
Weilong Chai, Dandan Zheng, Jiajiong Cao et al.
LLM as Copilot for Coarse-grained Vision-and-Language Navigation
Yanyuan Qiao, Qianyi Liu, Jiajun Liu et al.
Physically Plausible Color Correction for Neural Radiance Fields
Qi Zhang, Ying Feng, HONGDONG LI
Tuning-Free Image Customization with Image and Text Guidance
Pengzhi Li, Qiang Nie, Ying Chen et al.
MegaScenes: Scene-Level View Synthesis at Scale
Joseph Tung, Gene Chou, Ruojin Cai et al.
Be-Your-Outpainter: Mastering Video Outpainting through Input-Specific Adaptation
Fu-Yun Wang, Xiaoshi Wu, Zhaoyang Huang et al.
Mono-ViFI: A Unified Learning Framework for Self-supervised Single- and Multi-frame Monocular Depth Estimation
Jinfeng Liu, Lingtong Kong, Bo Li et al.
Idea2Img: Iterative Self-Refinement with GPT-4V for Automatic Image Design and Generation
Zhengyuan Yang, Jianfeng Wang, Linjie Li et al.
Preventing Catastrophic Forgetting through Memory Networks in Continuous Detection
Gaurav Bhatt, Leonid Sigal, James Ross
Tiny Models are the Computational Saver for Large Models
Qingyuan Wang, Barry Cardiff, Antoine Frappé et al.
Distractors-Immune Representation Learning with Cross-modal Contrastive Regularization for Change Captioning
Yunbin Tu, Liang Li, Li Su et al.
Score Distillation Sampling with Learned Manifold Corrective
Thiemo Alldieck, Nikos Kolotouros, Cristian Sminchisescu
Rethinking Weakly-supervised Video Temporal Grounding From a Game Perspective
Xiang Fang, Zeyu Xiong, Wanlong Fang et al.
AdaDiffSR: Adaptive Region-aware Dynamic acceleration Diffusion Model for Real-World Image Super-Resolution
Yuanting Fan, Chengxu Liu, Nengzhong Yin et al.
Prompt-Driven Contrastive Learning for Transferable Adversarial Attacks
Hunmin Yang, Jongoh Jeong, Kuk-Jin Yoon
A Framework for Efficient Model Evaluation through Stratification, Sampling, and Estimation
Riccardo Fogliato, Pratik Patil, Mathew Monfort et al.
Domesticating SAM for Breast Ultrasound Image Segmentation via Spatial-frequency Fusion and Uncertainty Correction
Wanting Zhang, Huisi Wu, Jing Qin
Versatile Incremental Learning: Towards Class and Domain-Agnostic Incremental Learning
Minyeong Park, Jae-Ho Lee, Gyeong-Moon Park
Interaction-centric Spatio-Temporal Context Reasoning for Multi-Person Video HOI Recognition
Yisong Wang, Nan Xi, Jingjing Meng et al.
Parrot Captions Teach CLIP to Spot Text
Yiqi Lin, Conghui He, Alex Jinpeng Wang et al.
DECIDER: Leveraging Foundation Model Priors for Improved Model Failure Detection and Explanation
Rakshith Subramanyam, Kowshik Thopalli, Vivek Sivaraman Narayanaswamy et al.
TTD: Text-Tag Self-Distillation Enhancing Image-Text Alignment in CLIP to Alleviate Single Tag Bias
Sanghyun Jo, Soohyun Ryu, Sungyub Kim et al.
Diff-Tracker: Text-to-Image Diffusion Models are Unsupervised Trackers
Zhengbo Zhang, Li Xu, Duo Peng et al.
Echoes of the Past: Boosting Long-tail Recognition via Reflective Learning
Qihao Zhao, YALUN DAI, Shen Lin et al.
Lego: Learning to Disentangle and Invert Personalized Concepts Beyond Object Appearance in Text-to-Image Diffusion Models
Saman Motamed, Danda Pani Paudel, Luc Van Gool
A Direct Approach to Viewing Graph Solvability
Federica Arrigoni, Andrea Fusiello, Tomas Pajdla
Three Things We Need to Know About Transferring Stable Diffusion to Visual Dense Prediciton Tasks
Manyuan Zhang, Guanglu Song, Xiaoyu Shi et al.
Strike a Balance in Continual Panoptic Segmentation
Jinpeng Chen, Runmin Cong, Yuxuan Luo et al.
Expressive Whole-Body 3D Gaussian Avatar
Gyeongsik Moon, Takaaki Shiratori, Shunsuke Saito
Discovering Unwritten Visual Classifiers with Large Language Models
Mia Chiquier, Utkarsh Mall, Carl Vondrick
Parameter-Efficient and Memory-Efficient Tuning for Vision Transformer: A Disentangled Approach
Taolin Zhang, Jiawang Bai, Zhihe Lu et al.
Bridging the Gap: Studio-like Avatar Creation from a Monocular Phone Capture
ShahRukh Athar, Shunsuke Saito, Stanislav Pidhorskyi et al.
Receler: Reliable Concept Erasing of Text-to-Image Diffusion Models via Lightweight Erasers
Chi-Pin Huang, Kai-Po Chang, Chung-Ting Tsai et al.
ExMatch: Self-guided Exploitation for Semi-Supervised Learning with Scarce Labeled Samples
Noo-ri Kim, Jin-Seop Lee, Jee-Hyong LEE
URS-NeRF: Unordered Rolling Shutter Bundle Adjustment for Neural Radiance Fields
Bo Xu, Liu Ziao, Mengqi GUO et al.
Hierarchically Structured Neural Bones for Reconstructing Animatable Objects from Casual Videos
Subin Jeon, In Cho, Minsu Kim et al.
HYPE: Hyperbolic Entailment Filtering for Underspecified Images and Texts
Wonjae Kim, Sanghyuk Chun, Taekyung Kim et al.
MoMA: Multimodal LLM Adapter for Fast Personalized Image Generation
KUNPENG SONG, Yizhe Zhu, Bingchen Liu et al.
Protecting NeRFs' Copyright via Plug-And-Play Watermarking Base Model
Qi Song, Ziyuan Luo, Ka Chun Cheung et al.
V-Trans4Style: Visual Transition Recommendation for Video Production Style Adaptation
Pooja Guhan, Tsung-Wei Huang, Guan-Ming Su et al.
Uncertainty-Driven Spectral Compressive Imaging with Spatial-Frequency Transformer
Lintao Peng, Siyu Xie, Liheng Bian
Weakly-Supervised Spatio-Temporal Video Grounding with Variational Cross-Modal Alignment
Yang Jin, Yadong Mu
Global Structure-from-Motion Revisited
Linfei Pan, Daniel Barath, Marc Pollefeys et al.
DEAL: Disentangle and Localize Concept-level Explanations for VLMs
Tang Li, Mengmeng Ma, Xi Peng
Domain Reduction Strategy for Non-Line-of-Sight Imaging
Hyunbo Shim, In Cho, Daekyu Kwon et al.
Learning to Enhance Aperture Phasor Field for Non-Line-of-Sight Imaging
In Cho, Hyunbo Shim, Seon Joo Kim
AlignDiff: Aligning Diffusion Models for General Few-Shot Segmentation
Ri-Zhao Qiu, Yu-Xiong Wang, Kris Hauser
Appearance-based Refinement for Object-Centric Motion Segmentation
Junyu Xie, Weidi Xie, Andrew ZISSERMAN
CaesarNeRF: Calibrated Semantic Representation for Few-Shot Generalizable Neural Rendering
Haidong Zhu, Tianyu Ding, Tianyi Chen et al.
AnyHome: Open-Vocabulary Large-Scale Indoor Scene Generation with First-Person View Exploration
Rao Fu, Zehao Wen, Zichen Liu et al.
UNIT: Backdoor Mitigation via Automated Neural Distribution Tightening
Siyuan Cheng, Guangyu Shen, Kaiyuan Zhang et al.
Open-Vocabulary RGB-Thermal Semantic Segmentation
Guoqiang Zhao, JunJie Huang, Xiaoyun Yan et al.
SlotLifter: Slot-guided Feature Lifting for Learning Object-Centric Radiance Fields
Yu Liu, Baoxiong Jia, Yixin Chen et al.
The Role of Masking for Efficient Supervised Knowledge Distillation of Vision Transformers
Seungwoo Son, Jegwang Ryu, Namhoon Lee et al.
Improving Vision and Language Concepts Understanding with Multimodal Counterfactual Samples
Chengen Lai, Shengli Song, Sitong Yan et al.
Functional Transform-Based Low-Rank Tensor Factorization for Multi-Dimensional Data Recovery
Jian-Li Wang, Xi-Le Zhao
Confidence Self-Calibration for Multi-Label Class-Incremental Learning
Kaile Du, Yifan Zhou, Fan Lyu et al.
Fast View Synthesis of Casual Videos with Soup-of-Planes
Yao-Chih Lee, Zhoutong Zhang, Kevin Blackburn-Matzen et al.
Dense Hand-Object(HO) GraspNet with Full Grasping Taxonomy and Dynamics
Woojin Cho, Jihyun Lee, Minjae Yi et al.
Fisher Calibration for Backdoor-Robust Heterogeneous Federated Learning
Wenke Huang, Mang Ye, zekun shi et al.
CoMusion: Towards Consistent Stochastic Human Motion Prediction via Motion Diffusion
Jiarui Sun, Girish Chowdhary
Watch Your Steps: Local Image and Scene Editing by Text Instructions
Ashkan Mirzaei, Tristan T Aumentado-Armstrong, Marcus A Brubaker et al.
VideoAgent: Long-form Video Understanding with Large Language Model as Agent
Xiaohan Wang, Yuhui Zhang, Orr Zohar et al.
ControlCap: Controllable Region-level Captioning
Yuzhong Zhao, Liu Yue, Zonghao Guo et al.
MoE-DiffIR: Task-customized Diffusion Priors for Universal Compressed Image Restoration
Yulin Ren, Xin Li, Bingchen Li et al.
Neural graphics texture compression supporting random access
Farzad Farhadzadeh, Qiqi Hou, Hoang Le et al.
Early Anticipation of Driving Maneuvers
Abdul Wasi Lone, Shankar Gangisetty, Shyam Nandan et al.
Causality-inspired Discriminative Feature Learning in Triple Domains for Gait Recognition
Haijun Xiong, Bin Feng, Xinggang Wang et al.
Trajectory-aligned Space-time Tokens for Few-shot Action Recognition
Pulkit Kumar, Namitha Padmanabhan, Luke Luo et al.
DualBEV: Unifying Dual View Transformation with Probabilistic Correspondences
Peidong Li, Wancheng Shen, Qihao Huang et al.
Towards Unified Representation of Invariant-Specific Features in Missing Modality Face Anti-Spoofing
Guanghao Zheng, Yuchen Liu, Wenrui Dai et al.
Robust Incremental Structure-from-Motion with Hybrid Features
Shaohui Liu, Yidan Gao, Tianyi Zhang et al.
E3V-K5: An Authentic Benchmark for Redefining Video-Based Energy Expenditure Estimation
Shengxuming Zhang, Lei Jin, Yifan Wang et al.
Unsupervised Moving Object Segmentation with Atmospheric Turbulence
Dehao Qin, Ripon Saha, Woojeh Chung et al.
IGNORE: Information Gap-based False Negative Loss Rejection for Single Positive Multi-Label Learning
Gyeong Ryeol Song, Noo-ri Kim, Jin-Seop Lee et al.
Asymmetric Mask Scheme for Self-Supervised Real Image Denoising
Xiangyu Liao, Tianheng Zheng, Jiayu Zhong et al.
Pathformer3D: A 3D Scanpath Transformer for 360° Images
Rong Quan, yantao Lai, Mengyu Qiu et al.
Elysium: Exploring Object-level Perception in Videos through Semantic Integration Using MLLMs
Han Wang, Yanjie Wang, Ye Yongjie et al.
AdaCLIP: Adapting CLIP with Hybrid Learnable Prompts for Zero-Shot Anomaly Detection
Yunkang Cao, Jiangning Zhang, Luca Frittoli et al.
Federated Learning with Local Openset Noisy Labels
Zonglin Di, Zhaowei Zhu, Xiaoxiao Li et al.
Match-Stereo-Videos: Bidirectional Alignment for Consistent Dynamic Stereo Matching
Junpeng Jing, Ye Mao, Krystian Mikolajczyk
Visual Prompting via Partial Optimal Transport
MENGYU ZHENG, Zhiwei Hao, Yehui Tang et al.
LiteSAM is Actually what you Need for segment Everything
Jianhai Fu, Yuanjie Yu, Ningchuan Li et al.
Rethinking Tree-Ring Watermarking for Enhanced Multi-Key Identification
Hai Ci, Pei Yang, Yiren Song et al.
Deep Patch Visual SLAM
Lahav Lipson, Zachary Teed, Jia Deng
Dual-stage Hyperspectral Image Classification Model with Spectral Supertoken
Peifu Liu, Tingfa Xu, Jie Wang et al.
Optimal Transport of Diverse Unsupervised Tasks for Robust Learning from Noisy Few-Shot Data
Xiaofan Que, Qi Yu
LITA: Language Instructed Temporal-Localization Assistant
De-An Huang, Shijia Liao, Subhashree Radhakrishnan et al.
BurstM: Deep Burst Multi-scale SR using Fourier Space with Optical Flow
EungGu Kang, Byeonghun Lee, Sunghoon Im et al.
MEVG : Multi-event Video Generation with Text-to-Video Models
Gyeongrok Oh, Jaehwan Jeong, Sieun Kim et al.
Unsupervised Dense Prediction using Differentiable Normalized Cuts
Yanbin Liu, Stephen Gould
Flexible Distribution Alignment: Towards Long-tailed Semi-supervised Learning with Proper Calibration
Emanuel Sanchez Aimar, Nathaniel D Helgesen, Yonghao Xu et al.
Diff3DETR: Agent-based Diffusion Model for Semi-supervised 3D Object Detection
Jiacheng Deng, Jiahao Lu, Tianzhu Zhang
Deep Cost Ray Fusion for Sparse Depth Video Completion
Jungeon Kim, Soongjin Kim, Jaesik Park et al.
SSL-Cleanse: Trojan Detection and Mitigation in Self-Supervised Learning
Mengxin Zheng, Jiaqi Xue, Zihao Wang et al.
Adaptive High-Frequency Transformer for Diverse Wildlife Re-Identification
Chenyue Li, Shuoyi Chen, Mang Ye
Masked Angle-Aware Autoencoder for Remote Sensing Images
Zhihao Li, Biao Hou, Siteng Ma et al.
Enhancing Tampered Text Detection through Frequency Feature Fusion and Decomposition
Zhongxi Chen, Shen Chen, Taiping Yao et al.
An accurate detection is not all you need to combat label noise in web-noisy datasets
Paul Albert, Kevin McGuinness, Eric Arazo et al.
6DoF Head Pose Estimation through Explicit Bidirectional Interaction with Face Geometry
Sungho Chun, Ju Yong Chang
3D Human Pose Estimation via Non-Causal Retentive Networks
Kaili Zheng, Feixiang Lu, Yihao Lv et al.
Gated Temporal Diffusion for Stochastic Long-term Dense Anticipation
Olga Zatsarynna, Emad Bahrami, Yazan Abu Farha et al.
PISR: Polarimetric Neural Implicit Surface Reconstruction for Textureless and Specular Objects
Guangcheng Chen, Yicheng He, Li He et al.
ProTIP: Probabilistic Robustness Verification on Text-to-Image Diffusion Models against Stochastic Perturbation
Yi Zhang, Yun Tang, Wenjie Ruan et al.
Reconstruction and Simulation of Elastic Objects with Spring-Mass 3D Gaussians
Licheng Zhong, Hong-Xing Yu, Jiajun Wu et al.
S-JEPA: A Joint Embedding Predictive Architecture for Skeletal Action Recognition
Mohamed Abdelfattah, Alexandre ALahi
Improving Unsupervised Domain Adaptation: A Pseudo-Candidate Set Approach
Aveen Dayal, Rishabh Lalla, Linga Reddy Cenkeramaddi et al.
PoseAugment: Generative Human Pose Data Augmentation with Physical Plausibility for IMU-based Motion Capture
Zhuojun Li, Chun Yu, Chen Liang et al.
BeNeRF:Neural Radiance Fields from a Single Blurry Image and Event Stream
Wenpu Li, Pian Wan, Peng Wang et al.
Enhancing Optimization Robustness in 1-bit Neural Networks through Stochastic Sign Descent
NianHui Guo, Hong Guo, Christoph Meinel et al.
Motion Keyframe Interpolation for Any Human Skeleton using Point Cloud-based Human Motion Data Homogenisation
Clinton Mo, Kun Hu, Chengjiang Long et al.
HyTAS: A Hyperspectral Image Transformer Architecture Search Benchmark and Analysis
Fangqin Zhou, Mert Kilickaya, Joaquin Vanschoren et al.
Dual-Path Adversarial Lifting for Domain Shift Correction in Online Test-time Adaptation
Yushun Tang, Shuoshuo Chen, Zhihe Lu et al.
Cross-Domain Learning for Video Anomaly Detection with Limited Supervision
Yashika Jain, Ali Dabouei, Min Xu
When and How do negative prompts take effect?
Yuanhao Ban, Ruochen Wang, Tianyi Zhou et al.
AttnZero: Efficient Attention Discovery for Vision Transformers
Lujun Li, Zimian Wei, Peijie Dong et al.
Accelerating Image Super-Resolution Networks with Pixel-Level Classification
Jinho Jeong, Jinwoo Kim, Younghyun Jo et al.
3DEgo: 3D Editing on the Go!
Umar Khalid, Hasan Iqbal, Azib Farooq et al.
RING-NeRF : Rethinking Inductive Biases for Versatile and Efficient Neural Fields
Doriand Petit, Steve Bourgeois, Dumitru Pavel et al.
Scissorhands: Scrub Data Influence via Connection Sensitivity in Networks
Jing Wu, Mehrtash Harandi
Real-data-driven 2000 FPS Color Video from Mosaicked Chromatic Spikes
Siqi Yang, Zhaojun Huang, Yakun Chang et al.
Cross-Platform Video Person ReID: A New Benchmark Dataset and Adaptation Approach
Shizhou Zhang, Wenlong Luo, De Cheng et al.
CoherentGS: Sparse Novel View Synthesis with Coherent 3D Gaussians
Avinash Paliwal, Wei Ye, Jinhui Xiong et al.
Modeling Label Correlations with Latent Context for Multi-Label Recognition
Zhao-Min Chen, Quan Cui, Ruoxi Deng et al.
Linearly Controllable GAN: Unsupervised Feature Categorization and Decomposition for Image Generation and Manipulation
Sehyung Lee, Mijung Kim, Yeongnam Chae et al.
SEDiff: Structure Extraction for Domain Adaptive Depth Estimation via Denoising Diffusion Models
Dongseok Shim, Hyoun Jin Kim
Towards Reliable Advertising Image Generation Using Human Feedback
Zhenbang Du, Wei Feng, Haohan Wang et al.
Few-shot Class Incremental Learning with Attention-Aware Self-Adaptive Prompt
Chenxi Liu, Zhenyi Wang, Tianyi Xiong et al.
Decomposition Betters Tracking Everything Everywhere
Rui Li, Dong Liu
R3DS: Reality-linked 3D Scenes for Panoramic Scene Understanding
Qirui Wu, Sonia Raychaudhuri, Daniel Ritchie et al.
AddressCLIP: Empowering Vision-Language Models for City-wide Image Address Localization
Shixiong Xu, Chenghao Zhang, Lubin Fan et al.
Lost in Translation: Modern Neural Networks Still Struggle With Small Realistic Image Transformations
Ofir Shifman, Yair Weiss
Controlling the World by Sleight of Hand
Sruthi Sudhakar, Ruoshi Liu, Basile Van Hoorick et al.
Pseudo-Labelling Should Be Aware of Disguising Channel Activations
Changrui Chen, Kurt Debattista, Jungong Han
TPA3D: Triplane Attention for Fast Text-to-3D Generation
Bin-Shih Wu, HONG-EN CHEN, Sheng-Yu Huang et al.
Lost in Translation: Latent Concept Misalignment in Text-to-Image Diffusion Models
Juntu Zhao, Junyu Deng, Yixin Ye et al.
Weakly Supervised 3D Object Detection via Multi-Level Visual Guidance
Kuan-Chih Huang, Yi-Hsuan Tsai, Ming-Hsuan Yang
Guide-and-Rescale: Self-Guidance Mechanism for Effective Tuning-Free Real Image Editing
Vadim Titov, Madina Khalmatova, Alexandra Ivanova et al.
SplatFields: Neural Gaussian Splats for Sparse 3D and 4D Reconstruction
Marko Mihajlovic, Sergey Prokudin, Siyu Tang et al.
CPT-VR: Improving Surface Rendering via Closest Point Transform with View-Reflection Appearance
Zhipeng Hu, Yongqiang Zhang, Chen Liu et al.
MapDistill: Boosting Efficient Camera-based HD Map Construction via Camera-LiDAR Fusion Model Distillation
Xiaoshuai Hao, Ruikai Li, Hui Zhang et al.
D-SCo: Dual-Stream Conditional Diffusion for Monocular Hand-Held Object Reconstruction
Bowen Fu, Gu Wang, Chenyangguang Zhang et al.
Decoupling Common and Unique Representations for Multimodal Self-supervised Learning
Yi Wang, Conrad M Albrecht, Nassim Ait Ali Braham et al.
GKGNet: Group K-Nearest Neighbor based Graph Convolutional Network for Multi-Label Image Recognition
Ruijie Yao, Sheng Jin, Lumin Xu et al.
E.T. the Exceptional Trajectory: Text-to-camera-trajectory generation with character awareness
Robin Courant, Nicolas Dufour, Xi WANG et al.
EgoPoser: Robust Real-Time Egocentric Pose Estimation from Sparse and Intermittent Observations Everywhere
Jiaxi Jiang, Paul Streli, Manuel Meier et al.
SceneGraphLoc: Cross-Modal Coarse Visual Localization on 3D Scene Graphs
Yang Miao, Francis Engelmann, Olga Vysotska et al.
Synchronous Diffusion for Unsupervised Smooth Non-Rigid 3D Shape Matching
Dongliang Cao, Zorah Laehner, Florian Bernard
Projecting Points to Axes: Oriented Object Detection via Point-Axis Representation
Zeyang Zhao, Qilong Xue, Yifan Bai et al.
Eyes Closed, Safety On: Protecting Multimodal LLMs via Image-to-Text Transformation
Yunhao Gou, Kai Chen, Zhili LIU et al.
uCAP: An Unsupervised Prompting Method for Vision-Language Models
A. Tuan Nguyen, Kai Sheng Tai, Bor-Chun Chen et al.
QUAR-VLA: Vision-Language-Action Model for Quadruped Robots
Pengxiang Ding, Han Zhao, Wenjie Zhang et al.
RISurConv: Rotation Invariant Surface Attention-Augmented Convolutions for 3D Point Cloud Classification and Segmentation
Zhiyuan Zhang, Licheng Yang, Zhiyu Xiang
CLIP-Guided Generative Networks for Transferable Targeted Adversarial Attacks
Hao Fang, Jiawei Kong, Bin Chen et al.
Rethinking and Improving Visual Prompt Selection for In-Context Learning Segmentation Framework
Wei Suo, Lanqing Lai, Mengyang Sun et al.
MMBENCH: Is Your Multi-Modal Model an All-around Player?
Yuan Liu, Haodong Duan, Yuanhan Zhang et al.
3DSA:Multi-View 3D Human Pose Estimation With 3D Space Attention Mechanisms
Po Han Chen, Chia-Chi Tsai
Unsupervised Exposure Correction
Ruodai Cui, Li Niu, Guosheng Hu
FMBoost: Boosting Latent Diffusion with Flow Matching
Johannes Schusterbauer-Fischer, Ming Gui, Pingchuan Ma et al.
GlobalPointer: Large-Scale Plane Adjustment with Bi-Convex Relaxation
Bangyan Liao, Zhenjun Zhao, Lu Chen et al.
3D Congealing: 3D-Aware Image Alignment in the Wild
Yunzhi Zhang, Zizhang Li, Amit Raj et al.
Semi-Supervised Video Desnowing Network via Temporal Decoupling Experts and Distribution-Driven Contrastive Regularization
Hongtao Wu, Yijun Yang, Angelica I Aviles-Rivero et al.
Semi-Supervised Teacher-Reference-Student Architecture for Action Quality Assessment
Wulian Yun, Mengshi Qi, Fei Peng et al.
Occluded Gait Recognition with Mixture of Experts: An Action Detection Perspective
Panjian Huang, Yunjie Peng, Saihui Hou et al.
Radiative Gaussian Splatting for Efficient X-ray Novel View Synthesis
Yuanhao Cai, Yixun Liang, Jiahao Wang et al.
Weak-to-Strong Compositional Learning from Generative Models for Language-based Object Detection
Kwanyong Park, Kuniaki Saito, Donghyun Kim
TurboEdit: Real-time text-based disentangled real image editing
Zongze Wu, Nicholas I Kolkin, Jonathan Brandt et al.
Head360: Learning a Parametric 3D Full-Head for Free-View Synthesis in 360°
Yuxiao He, Yiyu Zhuang, Yanwen Wang et al.
TCC-Det: Temporarily consistent cues for weakly-supervised 3D detection
Jan Skvrna, Lukas Neumann
Correspondences of the Third Kind: Camera Pose Estimation from Object Reflection
Kohei Yamashita, Vincent Lepetit, Ko Nishino
Robust Fitting on a Gate Quantum Computer
Frances Yang, Michele Sasdelli, Tat-Jun Chin
Defect Spectrum: A Granular Look of Large-scale Defect Datasets with Rich Semantics
Shuai Yang, ZhiFei Chen, Pengguang Chen et al.
Self-supervised Shape Completion via Involution and Implicit Correspondences
Mengya Liu, Ajad Chhatkuli, Janis Postels et al.
A Geometric Distortion Immunized Deep Watermarking Framework with Robustness Generalizability
Linfeng Ma, Han Fang, Tianyi Wei et al.
Energy-induced Explicit quantification for Multi-modality MRI fusion
Xiaoming Qi, Yuan Zhang, Tong Wang et al.