Most Cited ECCV "conditional outcome invariance" Papers
Encapsulating Knowledge in One Prompt
Qi Li, Runpeng Yu, Xinchao Wang
Upper-body Hierarchical Graph for Skeleton Based Emotion Recognition in Assistive Driving
Jiehui Wu, Jiansheng Chen, Qifeng Luo et al.
ItTakesTwo: Leveraging Peer Representations for Semi-supervised LiDAR Semantic Segmentation
Yuyuan Liu, Yuanhong Chen, Hu Wang et al.
Retargeting Visual Data with Deformation Fields
Tim Elsner, Julia Berger, Tong Wu et al.
ShoeModel: Learning to Wear on the User-specified Shoes via Diffusion Model
Wenyu Li, Binghui Chen, Yifeng Geng et al.
HPFF: Hierarchical Locally Supervised Learning with Patch Feature Fusion
Junhao Su, Chenghao He, Feiyu Zhu et al.
Overcoming Distribution Mismatch in Quantizing Image Super-Resolution Networks
Cheeun Hong, Kyoung Mu Lee
Imaging with Confidence: Uncertainty Quantification for High-dimensional Undersampled MR Images
Frederik Hoppe, Claudio Mayrink Verdun, Hannah Sophie Laus et al.
SelfSwapper: Self-Supervised Face Swapping via Shape Agnostic Masked AutoEncoder
Jaeseong Lee, Junha Hyung, Sohyun Jeong et al.
MIGS: Multi-Identity Gaussian Splatting via Tensor Decomposition
Aggelina Chatziagapi, Grigorios Chrysos, Dimitris Samaras
Light-in-Flight for a World-in-Motion
Jongho Lee, Ryan J Suess, Mohit Gupta
Equivariant Spatio-Temporal Self-Supervision for LiDAR Object Detection
Deepti Hegde, Suhas Lohit, Kuan-Chuan Peng et al.
Learning with Unmasked Tokens Drives Stronger Vision Learners
Taekyung Kim, Sanghyuk Chun, Byeongho Heo et al.
MRSP: Learn Multi-Representations of Single Primitive for Compositional Zero-Shot Learning
Dongyao Jiang, Hui Chen, Haodong Jing et al.
Zero-shot Text-guided Infinite Image Synthesis with LLM guidance
Soyeong Kwon, TAEGYEONG LEE, Taehwan Kim
Data-to-Model Distillation: Data-Efficient Learning Framework
Ahmad Sajedi, Samir Khaki, Lucy Z. Liu et al.
Learned HDR Image Compression for Perceptually Optimal Storage and Display
Peibei Cao, HAOYU CHEN, Jingzhe Ma et al.
Boosting Gaze Object Prediction via Pixel-level Supervision from Vision Foundation Model
Yang Jin, Lei Zhang, Shi Yan et al.
GMM-IKRS: Gaussian Mixture Models for Interpretable Keypoint Refinement and Scoring
Emanuele Santellani, Martin Zach, Christian Sormann et al.
Non-transferable Pruning
Ruyi Ding, Lili Su, A. Adam Ding et al.
Walker: Self-supervised Multiple Object Tracking by Walking on Temporal Object Appearance Graphs
Mattia Segu, Luigi Piccinelli, Siyuan Li et al.
Look Hear: Gaze Prediction for Speech-directed Human Attention
Sounak Mondal, Seoyoung Ahn, Zhibo Yang et al.
An Adaptive Screen-Space Meshing Approach for Normal Integration
Moritz Heep, Eduard Zell
Visual Grounding for Object-Level Generalization in Reinforcement Learning
Haobin Jiang, Zongqing Lu
WAS: Dataset and Methods for Artistic Text Segmentation
Xudong Xie, Yuzhe Li, Yang Liu et al.
Efficient Active Domain Adaptation for Semantic Segmentation by Selecting Information-rich Superpixels
Yuan Gao, Zilei Wang, Yixin Zhang et al.
NAMER: Non-Autoregressive Modeling for Handwritten Mathematical Expression Recognition
Chenyu Liu, Jia Pan, Jinshui Hu et al.
A New Dataset and Framework for Real-World Blurred Images Super-Resolution
Rui Qin, Ming Sun, Chao Zhou et al.
Stepwise Multi-grained Boundary Detector for Point-supervised Temporal Action Localization
Mengnan Liu, Le Wang, Sanping Zhou et al.
Deep Polarization Cues for Single-shot Shape and Subsurface Scattering Estimation
chenhao li, Trung Thanh Ngo, Hajime Nagahara
Sparse Refinement for Efficient High-Resolution Semantic Segmentation
Zhijian Liu, Zhuoyang Zhang, Samir Khaki et al.
Human-in-the-Loop Visual Re-ID for Population Size Estimation
Gustavo Perez, Daniel Sheldon, Grant Van Horn et al.
Towards Architecture-Agnostic Untrained Networks Priors for Image Reconstruction with Frequency Regularization
Yilin Liu, Yunkui Pang, Jiang Li et al.
UNIKD: UNcertainty-Filtered Incremental Knowledge Distillation for Neural Implicit Representation
Mengqi GUO, Chen Li, Hanlin Chen et al.
Adversarial Robustification via Text-to-Image Diffusion Models
Daewon Choi, Jongheon Jeong, Huiwon Jang et al.
Formula-Supervised Visual-Geometric Pre-training
Ryosuke Yamada, Kensho Hara, Hirokatsu Kataoka et al.
Characterizing Model Robustness via Natural Input Gradients
Adrian Rodriguez-Munoz, Tongzhou Wang, Antonio Torralba
Sync from the Sea: Retrieving Alignable Videos from Large-Scale Datasets
Ishan Rajendrakumar Dave, Fabian Caba, Shah Mubarak et al.
MaRINeR: Enhancing Novel Views by Matching Rendered Images with Nearby References
Lukas Bösiger, Mihai Dusmanu, Marc Pollefeys et al.
SpatialFormer: Towards Generalizable Vision Transformers with Explicit Spatial Understanding
Han Xiao, Wenzhao Zheng, Sicheng Zuo et al.
Quantization-Friendly Winograd Transformations for Convolutional Neural Networks
Vladimir Protsenko, Vladimir Kryzhanovskiy, Alexander Filippov
Rethinking Video-Text Understanding: Retrieval from Counterfactually Augmented Data
Wufei Ma, Kai Li, Zhongshi Jiang et al.
Unified Local-Cloud Decision-Making via Reinforcement Learning
Kathakoli Sengupta, Zhongkai Shangguan, Sandesh Bharadwaj et al.
Global-to-Pixel Regression for Human Mesh Recovery
Yabo Xiao, MINGSHU HE, Dongdong Yu
Interactive 3D Object Detection with Prompts
Ruifei Zhang, Xiangru Lin, Wei Zhang et al.
Topology-Preserving Downsampling of Binary Images
Chia-Chia Chen, Chi-Han Peng
NGP-RT: Fusing Multi-Level Hash Features with Lightweight Attention for Real-Time Novel View Synthesis
Yubin Hu, Xiaoyang Guo, Yang Xiao et al.
ArtVLM: Attribute Recognition Through Vision-Based Prefix Language Modeling
William Zhu, Keren Ye, Junjie Ke et al.
Alignist: CAD-Informed Orientation Distribution Estimation by Fusing Shape and Correspondences
Shishir Reddy Vutukur, Junwen Huang, Rasmus Laurvig Haugaard et al.
Towards Model-Agnostic Dataset Condensation by Heterogeneous Models
Jun-Yeong Moon, Jung Uk Kim, Gyeong-Moon Park
Synchronization of Projective Transformations
Rakshith Madhavan, Andrea Fusiello, Federica Arrigoni
PAV: Personalized Head Avatar from Unstructured Video Collection
Akin Caliskan, Berkay Kicanaoglu, H K
Stable Preference: Redefining training paradigm of human preference model for Text-to-Image Synthesis
Hanting Li, Hongjing Niu, Feng Zhao
Learning Trimodal Relation for Audio-Visual Question Answering with Missing Modality
Kyu Ri Park, Hong Joo Lee, Jung Uk Kim
Effective Lymph Nodes Detection in CT Scans Using Location Debiased Query Selection and Contrastive Query Representation in Transformer
Qinji Yu, Yirui Wang, Ke Yan et al.
ComFusion: Enhancing Personalized Generation by Instance-Scene Compositing and Fusion
Yan Hong, Yuxuan Duan, Bo Zhang et al.
SeiT++: Masked Token Modeling Improves Storage-efficient Training
Minhyun Lee, Song Park, Byeongho Heo et al.
DetailSemNet: Elevating Signature Verification through Detail-Semantic Integration
Meng-Cheng Shih, Tsai-Ling Huang, Yu-Heng Shih et al.
SDPT: Synchronous Dual Prompt Tuning for Fusion-based Visual-Language Pre-trained Models
Yang Zhou, Yongjian Wu, Jiya Saiyin et al.
Coarse-to-Fine Implicit Representation Learning for 3D Hand-Object Reconstruction from a Single RGB-D Image
Xingyu Liu, Pengfei Ren, Jingyu Wang et al.
Platypus: A Generalized Specialist Model for Reading Text in Various Forms
Peng Wang, Zhaohai Li, Jun Tang et al.
Beyond Viewpoint: Robust 3D Object Recognition under Arbitrary Views through Joint Multi-Part Representation
Linlong Fan, Ye Huang, Yanqi Ge et al.
Deep Online Probability Aggregation Clustering
Yuxuan Yan, Na Lu, Ruofan Yan
FedVAD: Enhancing Federated Video Anomaly Detection with GPT-Driven Semantic Distillation
Fan Qi, Ruijie Pan, Huaiwen Zhang et al.
Single-Mask Inpainting for Voxel-based Neural Radiance Fields
Jiafu Chen, Tianyi Chu, Jiakai Sun et al.
ViG-Bias: Visually Grounded Bias Discovery and Mitigation
Badr-Eddine Marani, Mohamed HANINI, Nihitha Malayarukil et al.
POA: Pre-training Once for Models of All Sizes
Yingying Zhang, Xin Guo, Jiangwei Lao et al.
VF-NeRF: Viewshed Fields for Rigid NeRF Registration
Leo Segre, Shai Avidan
A Rotation-invariant Texture ViT for Fine-Grained Recognition of Esophageal Cancer Endoscopic Ultrasound Images
Tianyi Liu, Shuaishuai S Zhuang, Jiacheng Nie et al.
LEROjD: Lidar Extended Radar-Only Object Detection
Patrick Palmer, Martin Krüger, Stefan Schütte et al.
LNL+K: Enhancing Learning with Noisy Labels Through Noise Source Knowledge Integration
Siqi Wang, Bryan Plummer
Flowed Time of Flight Radiance Fields
Mikhail Okunev, Marc Mapeke, Benjamin Attal et al.
Confidence-Based Iterative Generation for Real-World Image Super-Resolution
Jialun Peng, Xin Luo, Jingjing Fu et al.
Frugal 3D Point Cloud Model Training via Progressive Near Point Filtering and Fused Aggregation
Donghyun Lee, Yejin Lee, Jae W. Lee et al.
MagicMirror: Fast and High-Quality Avatar Generation with Constrained Search Space
Armand Comas Massague, Di Qiu, Menglei Chai et al.
Out-of-Bounding-Box Triggers: A Stealthy Approach to Cheat Object Detectors
Tao Lin, lijia Yu, Gaojie Jin et al.
GTMS: A Gradient-driven Tree-guided Mask-free Referring Image Segmentation Method
Haoxin Lyu, Tianxiong Zhong, Sanyuan Zhao
Rotated Orthographic Projection for Self-Supervised 3D Human Pose Estimation
YAO YAO, Yixuan Pan, Wenjun Shi et al.
MO-EMT-NAS: Multi-Objective Continuous Transfer of Architectural Knowledge Between Tasks from Different Datasets
Peng Liao, Xilu Wang, Yaochu Jin et al.
Adaptive Annealing for Robust Averaging
Sidhartha Chitturi, Venu Madhav Govindu
MaxMI: A Maximal Mutual Information Criterion for Manipulation Concept Discovery
Pei Zhou, Yanchao Yang
Synthesizing Time-varying BRDFs via Latent Space
Takuto Narumoto, Hiroaki Santo, Fumio Okura
MTaDCS: Moving Trace and Feature Density-based Confidence Sample Selection under Label Noise
Qingzheng Huang, Xilin He, Xiaole Xian et al.
Removing Rows and Columns of Tokens in Vision Transformer enables Faster Dense Prediction without Retraining
Diwei Su, cheng fei, Jianxu Luo
Decomposition of Neural Discrete Representations for Large-Scale 3D Mapping
Minseong Park, Suhan Woo, Euntai Kim
Online Video Quality Enhancement with Spatial-Temporal Look-up Tables
Zefan Qu, Xinyang Jiang, Yifan Yang et al.
Local Occupancy-Enhanced Object Grasping with Multiple Triplanar Projection
Kangqi Ma, Hao Dong, Yadong Mu
Region-Native Visual Tokenization
Mengyu Wang, Yuyao Huang, Henghui Ding et al.
OP-Align: Object-level and Part-level Alignment for Self-supervised Category-level Articulated Object Pose Estimation
Yuchen Che, Ryo Furukawa, Asako Kanezaki
Enhanced Sparsification via Stimulative Training
Shengji Tang, Weihao Lin, Hancheng Ye et al.
Weighting Pseudo-Labels via High-Activation Feature Index Similarity and Object Detection for Semi-Supervised Segmentation
Prantik Howlader, Hieu Le, Dimitris Samaras
SIMBA: Split Inference - Mechanisms, Benchmarks and Attacks
Abhishek Singh, Vivek Sharma, Rohan Sukumaran et al.
MetaAT: Active Testing for Label-Efficient Evaluation of Dense Recognition Tasks
Sanbao Su, Xin Li, Thang Doan et al.
SNP: Structured Neuron-level Pruning to Preserve Attention Scores
Kyunghwan Shim, Jaewoong Yun, Shinkook Choi
DiffSurf: A Transformer-based Diffusion Model for Generating and Reconstructing 3D Surfaces in Pose
Yoshiyasu Yusuke, Leyuan Sun
Learned Rate Control for Frame-Level Adaptive Neural Video Compression via Dynamic Neural Network
Chenhao Zhang, WEI GAO
SparseRadNet: Sparse Perception Neural Network on Subsampled Radar Data
Jialong Wu, Mirko Meuter, Markus Schoeler et al.
Learning Where to Look: Self-supervised Viewpoint Selection for Active Localization using Geometrical Information
Luca Di Giammarino, Boyang Sun, Giorgio Grisetti et al.
Distributed Semantic Segmentation with Efficient Joint Source and Task Decoding
Danish Nazir, Timo Bartels, Jan Piewek et al.
FroSSL: Frobenius Norm Minimization for Efficient Multiview Self-Supervised Learning
Oscar Skean, Aayush Dhakal, Nathan Jacobs et al.
CARFF: Conditional Auto-encoded Radiance Field for 3D Scene Forecasting
Jiezhi Yang, Khushi P Desai, Charles Packer et al.
Unleashing Text-to-Image Diffusion Prior for Zero-Shot Image Captioning
Jianjie Luo, Jingwen Chen, Yehao Li et al.
REDIR: Refocus-free Event-based De-occlusion Image Reconstruction
Qi Guo, Hailong Shi, Huan Li et al.
MC-PanDA: Mask Confidence for Panoptic Domain Adaptation
Ivan Martinovic, Josip Šarić, Siniša Šegvić
Photon Inhibition for Energy-Efficient Single-Photon Imaging
Lucas Koerner, Shantanu Gupta, Atul N Ingle et al.
FAMOUS: High-Fidelity Monocular 3D Human Digitization Using View Synthesis
Vishnu Mani Hema, Shubhra Aich, Christian Haene et al.
From Pixels to Objects: A Hierarchical Approach for Part and Object Segmentation Using Local and Global Aggregation
Yunfei Xie, Cihang Xie, Alan Yuille et al.
MonoTTA: Fully Test-Time Adaptation for Monocular 3D Object Detection
Hongbin Lin, Yifan Zhang, SHUAICHENG NIU et al.
FAFA: Frequency-Aware Flow-Aided Self-Supervision for Underwater Object Pose Estimation
Jingyi Tang, Gu Wang, Zeyu Chen et al.
Point-supervised Panoptic Segmentation via Estimating Pseudo Labels from Learnable Distance
Jing Li, Junsong Fan, Zhaoxiang Zhang
CSOT: Cross-Scan Object Transfer for Semi-Supervised LiDAR Object Detection
Jinglin Zhan, Tiejun Liu, Rengang Li et al.
GRACE: Graph-Based Contextual Debiasing for Fair Visual Question Answering
Yifeng Zhang, Ming Jiang, Qi Zhao
Efficient Unsupervised Visual Representation Learning with Explicit Cluster Balancing
Ioannis Maniadis Metaxas, Georgios Tzimiropoulos, ioannis Patras
Fine-grained Dynamic Network for Generic Event Boundary Detection
Ziwei Zheng, Lijun He, Le Yang et al.
VP-SAM: Taming Segment Anything Model for Video Polyp Segmentation via Disentanglement and Spatio-temporal Side Network
Zhixue Fang, Yuzhi Liu, Huisi Wu et al.
Spatio-Temporal Proximity-Aware Dual-Path Model for Panoramic Activity Recognition
Sumin Lee, Yooseung Wang, Sangmin Woo et al.
Divide and Fuse: Body Part Mesh Recovery from Partially Visible Human Images
Tianyu Luan, Zhongpai Gao, Luyuan Xie et al.
Efficient Cascaded Multiscale Adaptive Network for Image Restoration
Yichen Zhou, Pan Zhou, Teck Khim Ng
LineFit: A Geometric Approach for Fitting Line Segments in Images
Marion BOYER, David Youssefi, Florent Lafarge
RoofDiffusion: Constructing Roofs from Severely Corrupted Point Data via Diffusion
Kyle Lo, Jorg Peters, Eric Spellman
COSMU: Complete 3D human shape from monocular unconstrained images
Marco Pesavento, Marco Volino, Adrian Hilton
Do Generalised Classifiers really work on Human Drawn Sketches?
Hmrishav Bandyopadhyay, Pinaki Nath Chowdhury, Aneeshan Sain et al.
Multi-scale Cross Distillation for Object Detection in Aerial Images
Kun Wang, Zi Wang, Zhang Li et al.
Adapting to Shifting Correlations with Unlabeled Data Calibration
Minh Nguyen, Alan Q Wang, Heejong Kim et al.
Learning-based Axial Video Motion Magnification
Kwon Byung-Ki, HYUNBIN OH, Kim Jun-Seong et al.
OAT: Object-Level Attention Transformer for Gaze Scanpath Prediction
Yini Fang, Jingling Yu, Haozheng Zhang et al.
Multi-RoI Human Mesh Recovery with Camera Consistency and Contrastive Losses
Yongwei Nie, Changzhen Liu, Chengjiang Long et al.
Single-Photon 3D Imaging with Equi-Depth Photon Histograms
Kaustubh Sadekar, David Maier, Atul Ingle
Toward INT4 Fixed-Point Training via Exploring Quantization Error for Gradients
Dohyung Kim, Junghyup Lee, Jeimin Jeon et al.
Multi-Task Domain Adaptation for Language Grounding with 3D Objects
Penglei SUN, Yaoxian Song, Xinglin Pan et al.
Self-Supervised Video Copy Localization with Regional Token Representation
Minlong Lu, Yichen Lu, Siwei Nie et al.
Leveraging scale- and orientation-covariant features for planar motion estimation
Marcus Valtonen Örnhag, Alberto Jaenal
CARB-Net: Camera-Assisted Radar-Based Network for Vulnerable Road User Detection
Wei-Yu Lee, Martin Dimitrievski, David Van Hamme et al.
Remove Projective LiDAR Depthmap Artifacts via Exploiting Epipolar Geometry
Shengjie Zhu, Girish Chandar Ganesan, Abhinav Kumar et al.
SUP-NeRF: A Streamlined Unification of Pose Estimation and NeRF for Monocular 3D Object Reconstruction
Yuliang Guo, Abhinav Kumar, Cheng Zhao et al.
An Optimization Framework to Enforce Multi-View Consistency for Texturing 3D Meshes
Zhengyi Zhao, Chen Song, Xiaodong Gu et al.
Differentiable Convex Polyhedra Optimization from Multi-view Images
Daxuan Ren, Haiyi Mei, Hezi Shi et al.
Efficient Neural Video Representation with Temporally Coherent Modulation
Seungjun Shin, Suji Kim, Dokwan Oh
Stripe Observation Guided Inference Cost-free Attention Mechanism
Zhongzhan Huang, Shanshan Zhong, Wushao Wen et al.
EpipolarGAN: Omnidirectional Image Synthesis with Explicit Camera Control
Christopher May, Daniel Aliaga
Forget More to Learn More: Domain-specific Feature Unlearning for Semi-supervised and Unsupervised Domain Adaptation
Hritam Basak, Zhaozheng Yin
Towards Robust Full Low-bit Quantization of Super Resolution Networks
Denis Makhov, Irina Zhelavskaya, Ruslan Ostapets et al.
Learning to Build by Building Your Own Instructions
Aaron Walsman, Muru Zhang, Adam Fishman et al.
Flatness-aware Sequential Learning Generates Resilient Backdoors
Hoang Pham, The-Anh Ta, Anh Tran et al.
Rebalancing Using Estimated Class Distribution for Imbalanced Semi-Supervised Learning under Class Distribution Mismatch
Taemin Park, Hyuck Lee, Heeyoung Kim
Exploring Active Learning in Meta-Learning: Enhancing Context Set Labeling
Wonho Bae, Jing Wang, Danica J. Sutherland
ActionSwitch: Class-agnostic Detection of Simultaneous Actions in Streaming Videos
Kang Hyolim, Jeongseok Hyun, Joungbin An et al.
Not Just Change the Labels, Learn the Features: Watermarking Deep Neural Networks with Multi-View Data
Yuxuan Li, Sarthak Kumar Maharana, Yunhui Guo
Fast Registration of Photorealistic Avatars for VR Facial Animation
Chaitanya Patel, Shaojie Bai, Te-Li Wang et al.
On-the-fly Category Discovery for LiDAR Semantic Segmentation
HYEONSEONG KIM, Sung-Hoon Yoon, Minseok Kim et al.
Inter-Class Topology Alignment for Efficient Black-Box Substitute Attacks
lingzhuang meng, Mingwen Shao, Yuanjian Qiao et al.
Random Walk on Pixel Manifolds for Anomaly Segmentation of Complex Driving Scenes
Zelong Zeng, Kaname Tomite
BugNIST - a Large Volumetric Dataset for Detection under Domain Shift
Patrick Jensen, Vedrana Dahl, Rebecca Engberg et al.
Resolving Scale Ambiguity in Multi-view 3D Reconstruction using Dual-Pixel Sensors
Kohei Ashida, Hiroaki Santo, Fumio Okura et al.
UniVoxel: Fast Inverse Rendering by Unified Voxelization of Scene Representation
Shuang Wu, Songlin Tang, Guangming Lu et al.
Bones Can't Be Triangles: Accurate and Efficient Vertebrae Keypoint Estimation through Collaborative Error Revision
Jinhee Kim, Taesung Kim, Choo Jaegul
PACE: Pose Annotations in Cluttered Environments
Yang You, kai xiong, Zhening Yang et al.
Harmonizing knowledge Transfer in Neural Network with Unified Distillation
yaomin huang, faming Fang, Zaoming Yan et al.
Occlusion Handling in 3D Human Pose Estimation with Perturbed Positional Encoding
niloofar azizi, Mohsen Fayyaz, Horst Bischof
Learning a Dynamic Privacy-preserving Camera Robust to Inversion Attacks
Jiacheng Cheng, Xiang Dai, Jia Wan et al.
TreeSBA: Tree-Transformer for Self-Supervised Sequential Brick Assembly
Mengqi GUO, Chen Li, Yuyang Zhao et al.
On Spectral Properties of Gradient-based Explanation Methods
Amir Mehrpanah, Erik Englesson, Hossein Azizpour
DεpS: Delayed ε-Shrinking for Faster Once-For-All Training
Aditya Annavajjala, Alind Khare, Animesh Agrawal et al.
Towards Certifiably Robust Face Recognition
Seunghun Paik, Dongsoo Kim, Chanwoo Hwang et al.
Catastrophic Overfitting: A Potential Blessing in Disguise
MN Zhao, Lihe Zhang, Yuqiu Kong et al.
Object-Oriented Anchoring and Modal Alignment in Multimodal Learning
Shibin Mei, Bingbing Ni, Hang Wang et al.
Data Collection-free Masked Video Modeling
Yuchi Ishikawa, Masayoshi Kondo, Yoshimitsu Aoki
Training A Secure Model against Data-Free Model Extraction
Zhenyi Wang, Li Shen, junfeng guo et al.
ELSE: Efficient Deep Neural Network Inference through Line-based Sparsity Exploration
Zeqi Zhu, Alberto Garcia-Ortiz, Luc Waeijen et al.
Deep Nets with Subsampling Layers Unwittingly Discard Useful Activations at Test-Time
Chiao-An Yang, Ziwei Liu, Raymond Yeh
Classification Matters: Improving Video Action Detection with Class-Specific Attention
Jinsung Lee, Taeoh Kim, Inwoong Lee et al.
Image-Feature Weak-to-Strong Consistency: An Enhanced Paradigm for Semi-Supervised Learning
Zhiyu Wu, Jin shi Cui
Depth-guided NeRF Training via Earth Mover’s Distance
Anita Rau, Josiah Aklilu, Floyd C Holsinger et al.
Exploiting Supervised Poison Vulnerability to Strengthen Self-Supervised Defense
Jeremy Styborski, Mingzhi Lyu, YI HUANG et al.
Time-Efficient and Identity-Consistent Virtual Try-On Using A Variant of Altered Diffusion Models
Phuong Dam, Jihoon Jeong, Anh Tran et al.
Beyond the Data Imbalance: Employing the Heterogeneous Datasets for Vehicle Maneuver Prediction
Hyeongseok Jeon, Sanmin Kim, Abi Rahman Syamil et al.
Transferable 3D Adversarial Shape Completion using Diffusion Models
Xuelong Dai, Bin Xiao
Multi-Granularity Sparse Relationship Matrix Prediction Network for End-to-End Scene Graph Generation
lei wang, Zejian Yuan, Badong Chen
An Optimal Control View of LoRA and Binary Controller Design for Vision Transformers
CHI Zhang, Jingpu Cheng, Qianxiao Li
A Secure Image Watermarking Framework with Statistical Guarantees via Adversarial Attacks on Secret Key Networks
Feiyu CHEN, Wei Lin, Ziquan Liu et al.
Consistent 3D Line Mapping
Xulong Bai, Hainan Cui, Shuhan Shen
Integration of Global and Local Representations for Fine-grained Cross-modal Alignment
Seungwan Jin, Hoyoung Choi, Taehyung Noh et al.
Multiscale Graph Texture Network
Ravishankar Evani, Deepu Rajan, Shangbo Mao
Face Reconstruction Transfer Attack as Out-of-Distribution Generalization
Yoon Gyo Jung, Jaewoo Park, Xingbo Dong et al.
Wavelength-Embedding-guided Filter-Array Transformer for Spectral Demosaicing
haijin zeng, Hiep Luong, Wilfried Philips
MetaAug: Meta-Data Augmentation for Post-Training Quantization
Cuong Pham, Hoang Anh Dung, Cuong Cao Nguyen et al.
Towards compact reversible image representations for neural style transfer
Xiyao Liu, Siyu Yang, Jian Zhang et al.
Text Motion Translator: A Bi-Directional Model for Enhanced 3D Human Motion Generation from Open-Vocabulary Descriptions
Yijun Qian, Jack Urbanek, Alexander Hauptmann et al.
POCA: Post-training Quantization with Temporal Alignment for Codec Avatars
Jian Meng, Yuecheng Li, CHENGHUI Li et al.
Bayesian Detector Combination for Object Detection with Crowdsourced Annotations
Zhi Qin Tan, Olga Isupova, Gustavo Carneiro et al.
An Information Theoretical View for Out-Of-Distribution Detection
Jinjing Hu, Wenrui Liu, Hong Chang et al.
A Probability-guided Sampler for Neural Implicit Surface Rendering
Gonçalo José Dias Pais, Valter André Piedade, Moitreya Chatterjee et al.
LASS3D: Language-Assisted Semi-Supervised 3D Semantic Segmentation with Progressive Unreliable Data Exploitation
Jianan Li, Qiulei Dong
FairViT: Fair Vision Transformer via Adaptive Masking
Bowei Tian, Ruijie Du, Yanning Shen
Source-Free Domain-Invariant Performance Prediction
Ekaterina Khramtsova, Mahsa Baktashmotlagh, Guido Zuccon et al.
Optimizing Factorized Encoder Models: Time and Memory Reduction for Scalable and Efficient Action Recognition
Shreyank Narayana Gowda, Anurag Arnab, Jonathan Huang
Freeview Sketching: View-Aware Fine-Grained Sketch-Based Image Retrieval
Aneeshan Sain, Pinaki Nath Chowdhury, Subhadeep Koley et al.
Distractor-Free Novel View Synthesis via Exploiting Memorization Effect in Optimization
Yukun Wang, Kunhong Li, Minglin Chen et al.
GroundUp: Rapid Sketch-Based 3D City Massing
Gizem Esra Unlu, Mohamed Sayed, Yulia Gryaditskaya et al.
Semicalibrated Relative Pose from an Affine Correspondence and Monodepth
Petr Hrubý, Marc Pollefeys, Daniel Barath