Most Cited 2024 "optimal partitioning trees" Papers
PDiscoFormer: Relaxing Part Discovery Constraints with Vision Transformers
Ananthu Aniraj, Cassio F. Dantas, Dino Ienco et al.
Multi-Level Cross-Modal Alignment for Image Clustering
Liping Qiu, Qin Zhang, Xiaojun Chen et al.
FedNS: A Fast Sketching Newton-Type Algorithm for Federated Learning
Jian Li, Yong Liu, Wei Wang et al.
Prompt Augmentation for Self-supervised Text-guided Image Manipulation
Rumeysa Bodur, Binod Bhattarai, Tae-Kyun Kim
Segment, Lift and Fit: Automatic 3D Shape Labeling from 2D Prompts
Jianhao Li, Tianyu Sun, Zhongdao Wang et al.
Liberating Seen Classes: Boosting Few-Shot and Zero-Shot Text Classification via Anchor
Han Liu, Siyang Zhao, Xiaotong Zhang et al.
Analyzing and Improving Optimal-Transport-based Adversarial Networks
Jaemoo Choi, Jaewoong Choi, Myungjoo Kang
Geometry-aware Reconstruction and Fusion-refined Rendering for Generalizable Neural Radiance Fields
Tianqi Liu, Xinyi Ye, Min Shi et al.
LMT-GP: Combined Latent Mean-Teacher and Gaussian Process for Semi-supervised Low-light Image Enhancement
Ye Yu, Fengxin Chen, Jun Yu et al.
Revisiting Sampson Approximations for Geometric Estimation Problems
Felix Rydell, Angelica Torres, Viktor Larsson
Spherical World-Locking for Audio-Visual Localization in Egocentric Videos
Heeseung Yun, Ruohan Gao, Ishwarya Ananthabhotla et al.
Diffusion Reflectance Map: Single-Image Stochastic Inverse Rendering of Illumination and Reflectance
Yuto Enyo, Ko Nishino
Learning to Rank Patches for Unbiased Image Redundancy Reduction
Yang Luo, Zhineng Chen, Peng Zhou et al.
SENCR: A Span Enhanced Two-Stage Network with Counterfactual Rethinking for Chinese NER
Hang Zheng, Qingsong Li, Shen Chen et al.
Learning to Localize Actions in Instructional Videos with LLM-Based Multi-Pathway Text-Video Alignment
Yuxiao Chen, Kai Li, Wentao Bao et al.
Fully Geometric Panoramic Localization
Junho Kim, Jiwon Jeong, Young Min Kim
Out-of-Variable Generalisation for Discriminative Models
Siyuan Guo, Jonas Wildberger, Bernhard Schoelkopf
Gen-Z: Generative Zero-Shot Text Classification with Contextualized Label Descriptions
Sachin Kumar, Chan Young Park, Yulia Tsvetkov
DHR: Dual Features-Driven Hierarchical Rebalancing in Inter- and Intra-Class Regions for Weakly-Supervised Semantic Segmentation
Sanghyun Jo, Fei Pan, In-Jae Yu et al.
Pano-NeRF: Synthesizing High Dynamic Range Novel Views with Geometry from Sparse Low Dynamic Range Panoramic Images
Zhan Lu, Qian Zheng, Boxin Shi et al.
Self-Prompt Mechanism for Few-Shot Image Recognition
Mingchen Song, Huiqiang Wang, Guoqiang Zhong
CMA: A Chromaticity Map Adapter for Robust Detection of Screen-Recapture Document Images
Changsheng Chen, Liangwei Lin, Yongqi Chen et al.
OpticalDR: A Deep Optical Imaging Model for Privacy-Protective Depression Recognition
Yuchen Pan, Junjun Jiang, Kui Jiang et al.
Alice Benchmarks: Connecting Real World Re-Identification with the Synthetic
Xiaoxiao Sun, Yue Yao, Shengjin Wang et al.
Geometry Fidelity for Spherical Images
Anders Christensen, Nooshin Mojab, Khushman Patel et al.
Incorporating Geo-Diverse Knowledge into Prompting for Increased Geographical Robustness in Object Recognition
Kyle Buettner, Sina Malakouti, Xiang Li et al.
Constructing Concept-based Models to Mitigate Spurious Correlations with Minimal Human Effort
Jeeyung Kim, Ze Wang, Qiang Qiu
Entropy-MCMC: Sampling from Flat Basins with Ease
Bolian Li, Ruqi Zhang
3D-Aware Face Editing via Warping-Guided Latent Direction Learning
Yuhao Cheng, Zhuo Chen, Xingyu Ren et al.
Multi-View Representation is What You Need for Point-Cloud Pre-Training
Siming Yan, Chen Song, Youkang Kong et al.
Merging and Splitting Diffusion Paths for Semantically Coherent Panoramas
Fabio Quattrini, Vittorio Pippi, Silvia Cascianelli et al.
Adaptive Multi-task Learning for Few-shot Object Detection
Yan Ren, Yanling Li, Wai-Kin Adams Kong
PlaSma: Procedural Knowledge Models for Language-based Planning and Re-Planning
Faeze Brahman, Chandra Bhagavatula, Valentina Pyatkin et al.
Flexible Depth Completion for Sparse and Varying Point Densities
Jinhyung Park, Yu-Jhe Li, Kris Kitani
FairWASP: Fast and Optimal Fair Wasserstein Pre-processing
Zikai Xiong, Niccolo Dalmasso, Alan Mishler et al.
Operational Open-Set Recognition and PostMax Refinement
Steve Cruz, Ryan Rabinowitz, Manuel Günther et al.
X-InstructBLIP: A Framework for Aligning Image, 3D, Audio, Video to LLMs and its Emergent Cross-modal Reasoning
Artemis Panagopoulou, Le Xue, Ning Yu et al.
Conceptual Codebook Learning for Vision-Language Models
Yi Zhang, Ke Yu, Siqi Wu et al.
CNC-Net: Self-Supervised Learning for CNC Machining Operations
Mohsen Yavartanoo, Sangmin Hong, Reyhaneh Neshatavar et al.
Towards Making Learnware Specification and Market Evolvable
Jian-Dong Liu, Zhi-Hao Tan, Zhi-Hua Zhou
Learning to Produce Semi-dense Correspondences for Visual Localization
Khang Truong Giang, Soohwan Song, Sungho Jo
Weak-to-Strong 3D Object Detection with X-Ray Distillation
Alexander Gambashidze, Aleksandr Dadukin, Maksim Golyadkin et al.
LLMCO4MR: LLMs-aided Neural Combinatorial Optimization for Ancient Manuscript Restoration from Fragments with Case Studies on Dunhuang
Yuqing Zhang, Hangqi Li, Shengyu Zhang et al.
Epistemic Uncertainty Quantification For Pre-Trained Neural Networks
Hanjing Wang, Qiang Ji
MeshVPR: Citywide Visual Place Recognition Using 3D Meshes
Gabriele Berton, Lorenz Junglas, Riccardo Zaccone et al.
Time Fairness in Online Knapsack Problems
Adam Lechowicz, Rik Sengupta, Bo Sun et al.
Blind Face Restoration under Extreme Conditions: Leveraging 3D-2D Prior Fusion for Superior Structural and Texture Recovery
Zhengrui Chen, Liying Lu, Ziyang Yuan et al.
Error Feedback Reloaded: From Quadratic to Arithmetic Mean of Smoothness Constants
Peter Richtarik, Elnur Gasanov, Konstantin Burlachenko
CoG-DQA: Chain-of-Guiding Learning with Large Language Models for Diagram Question Answering
Shaowei Wang, Lingling Zhang, Longji Zhu et al.
Topic-VQ-VAE: Leveraging Latent Codebooks for Flexible Topic-Guided Document Generation
YoungJoon Yoo, Jongwon Choi
Cross-view and Cross-pose Completion for 3D Human Understanding
Matthieu Armando, Salma Galaaoui, Fabien Baradel et al.
Improving Out-of-Distribution Generalization in Graphs via Hierarchical Semantic Environments
Yinhua Piao, Sangseon Lee, Yijingxiu Lu et al.
Long-range Turbulence Mitigation: A Large-scale Dataset and A Coarse-to-fine Framework
Shengqi Xu, Run Sun, Yi Chang et al.
Beyond the Label Itself: Latent Labels Enhance Semi-supervised Point Cloud Panoptic Segmentation
Yujun Chen, Xin Tan, Zhizhong Zhang et al.
Boosting Residual Networks with Group Knowledge
Shengji Tang, Peng Ye, Baopu Li et al.
Error Norm Truncation: Robust Training in the Presence of Data Noise for Text Generation Models
Tianjian Li, Haoran Xu, Philipp Koehn et al.
Rethinking LiDAR Domain Generalization: Single Source as Multiple Density Domains
Jaeyeul Kim, Jungwan Woo, Jeonghoon Kim et al.
Fixed Point Diffusion Models
Luke Melas-Kyriazi, Xingjian Bai
In-Context Matting
He Guo, Zixuan Ye, Zhiguo Cao et al.
Finding NeMo: Negative-mined Mosaic Augmentation for Referring Image Segmentation
Seongsu Ha, Chaeyun Kim, Donghwa Kim et al.
DEVIAS: Learning Disentangled Video Representations of Action and Scene
Kyungho Bae, Youngrae Kim, Geo Ahn et al.
Scores for Learning Discrete Causal Graphs with Unobserved Confounders
Alexis Bellot, Junzhe Zhang, Elias Bareinboim
Long-Term Typhoon Trajectory Prediction: A Physics-Conditioned Approach Without Reanalysis Data
Young-Jae Park, Minseok Seo, Doyi Kim et al.
Task-Free Dynamic Sparse Vision Transformer for Continual Learning
Fei Ye, Adrian Bors
Partial Label Learning with a Partner
Chongjie Si, Zekun Jiang, Xuehui Wang et al.
Koopman-based generalization bound: New aspect for full-rank weights
Yuka Hashimoto, Sho Sonoda, Isao Ishikawa et al.
Quanta Video Restoration
Prateek Chennuri, Yiheng Chi, Enze Jiang et al.
DSD$^2$: Can We Dodge Sparse Double Descent and Compress the Neural Network Worry-Free?
Victor Quetu, Enzo Tartaglione
Learning to Solve Bilevel Programs with Binary Tender
Bo Zhou, Ruiwei Jiang, Siqian Shen
Defense without Forgetting: Continual Adversarial Defense with Anisotropic & Isotropic Pseudo Replay
Yuhang Zhou, Zhongyun Hua
Spatial Voting with Incomplete Voter Information
Aviram Imber, Jonas Israel, Markus Brill et al.
TiMix: Text-Aware Image Mixing for Effective Vision-Language Pre-training
Chaoya Jiang, Wei Ye, Haiyang Xu et al.
Investigating Style Similarity in Diffusion Models
Gowthami Somepalli, Anubhav Anubhav, Kamal Gupta et al.
Multi-View People Detection in Large Scenes via Supervised View-Wise Contribution Weighting
Qi Zhang, Yunfei Gong, Daijie Chen et al.
Enhancing Zero-Shot Multi-Speaker TTS with Negated Speaker Representations
Yejin Jeon, Yunsu Kim, Gary Geunbae Lee
A Cross-View Hierarchical Graph Learning Hypernetwork for Skill Demand-Supply Joint Prediction
Wenshuo Chao, Zhaopeng Qiu, Likang Wu et al.
Existence Is Chaos: Enhancing 3D Human Motion Prediction with Uncertainty Consideration
Zhihao Wang, Yulin Zhou, Ningyu Zhang et al.
RadarMOSEVE: A Spatial-Temporal Transformer Network for Radar-Only Moving Object Segmentation and Ego-Velocity Estimation
Changsong Pang, Xieyuanli Chen, Yimin Liu et al.
From Graphs to Hypergraphs: Hypergraph Projection and its Reconstruction
Yanbang Wang, Jon Kleinberg
DOGE-Train: Discrete Optimization on GPU with End-to-End Training
Ahmed Abbas, P. Swoboda
Semantic Flow: Learning Semantic Fields of Dynamic Scenes from Monocular Videos
Fengrui Tian, Yueqi Duan, Angtian Wang et al.
Distributionally Robust Optimization with Bias and Variance Reduction
Ronak Mehta, Vincent Roulet, Krishna Pillutla et al.
Racing Control Variable Genetic Programming for Symbolic Regression
Nan Jiang, Yexiang Xue
BBScore: A Brownian Bridge Based Metric for Assessing Text Coherence
Zhecheng Sheng, Tianhao Zhang, Chen Jiang et al.
Neural Time-Reversed Generalized Riccati Equation
Uncertainty Quantification in Heterogeneous Treatment Effect Estimation with Gaussian-Process-Based Partially Linear Model
Shunsuke Horii, Yoichi Chikahara
CMDA: Cross-Modal and Domain Adversarial Adaptation for LiDAR-Based 3D Object Detection
Gyusam Chang, Wonseok Roh, Sujin Jang et al.
TransGOP: Transformer-Based Gaze Object Prediction
Binglu Wang, Chenxi Guo, Yang Jin et al.
TetraSphere: A Neural Descriptor for O(3)-Invariant Point Cloud Analysis
Pavlo Melnyk, Andreas Robinson, Michael Felsberg et al.
3D-GOI: 3D GAN Omni-Inversion for Multifaceted and Multi-object Editing
Haoran Li, Long Ma, Haolin Shi et al.
Data-Free Hard-Label Robustness Stealing Attack
Xiaojian Yuan, Kejiang Chen, Wen Huang et al.
Neighborhood-Enhanced 3D Human Pose Estimation with Monocular LiDAR in Long-Range Outdoor Scenes
Jingyi Zhang, Qihong Mao, Guosheng Hu et al.
Entropy Induced Pruning Framework for Convolutional Neural Networks
Yiheng Lu, Ziyu Guan, Yaming Yang et al.
UVAGaze: Unsupervised 1-to-2 Views Adaptation for Gaze Estimation
Ruicong Liu, Feng Lu
FaceRSA: RSA-Aware Facial Identity Cryptography Framework
Zhongyi Zhang, Tianyi Wei, Wenbo Zhou et al.
This Probably Looks Exactly Like That: An Invertible Prototypical Network
Zachariah Carmichael, Timothy Redgrave, Daniel Gonzalez Cedre et al.
Complexity of Credulous and Skeptical Acceptance in Epistemic Argumentation Framework
Gianvincenzo Alfano, Sergio Greco, Francesco Parisi et al.
Occlusion-Aware Seamless Segmentation
Yihong Cao, Jiaming Zhang, Hao Shi et al.
Bidirectional Uncertainty-Based Active Learning for Open-Set Annotation
ChenChen Zong, Ye-Wen Wang, Kun-Peng Ning et al.
Video Frame Prediction from a Single Image and Events
Juanjuan Zhu, Zhexiong Wan, Yuchao Dai
Boosting Order-Preserving and Transferability for Neural Architecture Search: a Joint Architecture Refined Search and Fine-tuning Approach
Beichen Zhang, Xiaoxing Wang, Xiaohan Qin et al.
The Lipschitz-Variance-Margin Tradeoff for Enhanced Randomized Smoothing
Blaise Delattre, Alexandre Araujo, Quentin Barthélemy et al.
Generative Model-Based Feature Knowledge Distillation for Action Recognition
Guiqin Wang, Peng Zhao, Yanjiang Shi et al.
Decoupled Marked Temporal Point Process using Neural Ordinary Differential Equations
Yujee Song, Donghyun Lee, Rui Meng et al.
Quantum Interference Model for Semantic Biases of Glosses in Word Sense Disambiguation
Junwei Zhang, Ruifang He, Fengyu Guo et al.
Scaling Up Semi-supervised Learning with Unconstrained Unlabelled Data
Shuvendu Roy, Ali Etemad
DreamView: Injecting View-specific Text Guidance into Text-to-3D Generation
Junkai Yan, Yipeng Gao, Qize Yang et al.
Two-Stage Active Learning for Efficient Temporal Action Segmentation
Yuhao Su, Ehsan Elhamifar
Watching it in Dark: A Target-aware Representation Learning Framework for High-Level Vision Tasks in Low Illumination
Yunan Li, Yihao Zhang, Shoude Li et al.
HyperSDFusion: Bridging Hierarchical Structures in Language and Geometry for Enhanced 3D Text2Shape Generation
Zhiying Leng, Tolga Birdal, Xiaohui Liang et al.
Domain Generalization of 3D Object Detection by Density-Resampling
Shuangzhi Li, Lei Ma, Xingyu Li
DME: Unveiling the Bias for Better Generalized Monocular Depth Estimation
Songsong Yu, Yifan Wang, Yunzhi Zhuge et al.
Noise-assisted Prompt Learning for Image Forgery Detection and Localization
Dong Li, Jiaying Zhu, Xueyang Fu et al.
Component Fourier Neural Operator for Singularly Perturbed Differential Equations
Ye Li, Ting Du, Yiwen Pang et al.
Pseudo-keypoint RKHS Learning for Self-supervised 6DoF Pose Estimation
Yangzheng Wu, Michael Alan Greenspan
Edge-Guided Fusion and Motion Augmentation for Event-Image Stereo
Fengan Zhao, Qianang Zhou, Junlin Xiong
Advancing Video Synchronization with Fractional Frame Analysis: Introducing a Novel Dataset and Model
Yuxuan Liu, Haizhou Ai, Junliang Xing et al.
ADMap: Anti-disturbance Framework for Vectorized HD Map Construction
Haotian Hu, Fanyi Wang, Yaonong Wang et al.
Domain-Adaptive 2D Human Pose Estimation via Dual Teachers in Extremely Low-Light Conditions
Yihao Ai, Yifei Qi, Bo Wang et al.
Test-Time Stain Adaptation with Diffusion Models for Histopathology Image Classification
Cheng-Chang Tsai, Yuan-Chih Chen, Chun-Shien Lu
FedCompetitors: Harmonious Collaboration in Federated Learning with Competing Participants
Shanli Tan, Hao Cheng, Xiaohu Wu et al.
Unsupervised Extractive Summarization with Learnable Length Control Strategies
Renlong Jie, Xiaojun Meng, Xin Jiang et al.
Unsupervised Object Interaction Learning with Counterfactual Dynamics Models
Jongwook Choi, Sungtae Lee, Xinyu Wang et al.
LiDAR-based All-weather 3D Object Detection via Prompting and Distilling 4D Radar
Yujeong Chae, Hyeonseong Kim, Changgyoon Oh et al.
Viewpoint textual inversion: discovering scene representations and 3D view control in 2D diffusion models
James Burgess, Kuan-Chieh Wang, Serena Yeung-Levy
PA2D-MORL: Pareto Ascent Directional Decomposition Based Multi-Objective Reinforcement Learning
Tianmeng Hu, Biao Luo
AlignDiff: Aligning Diffusion Models for General Few-Shot Segmentation
Ri-Zhao Qiu, Yu-Xiong Wang, Kris Hauser
The Curse of Diversity in Ensemble-Based Exploration
Zhixuan Lin, Pierluca D'Oro, Evgenii Nikishin et al.
CMD: A Cross Mechanism Domain Adaptation Dataset for 3D Object Detection
Jinhao Deng, Wei Ye, Hai Wu et al.
RRL: Recommendation Reverse Learning
Xiaoyu You, Jianwei Xu, Mi Zhang et al.
GenQ: Quantization in Low Data Regimes with Generative Synthetic Data
Yuhang Li, Youngeun Kim, Donghyun Lee et al.
Reconciling Spatial and Temporal Abstractions for Goal Representation
Mehdi Zadem, Sergio Mover, Sao Mai Nguyen
Interleaving One-Class and Weakly-Supervised Models with Adaptive Thresholding for Unsupervised Video Anomaly Detection
Yongwei Nie, Hao Huang, Chengjiang Long et al.
VisFocus: Prompt-Guided Vision Encoders for OCR-Free Dense Document Understanding
Ofir Abramovich, Niv Nayman, Sharon Fogel et al.
E3M: Zero-Shot Spatio-Temporal Video Grounding with Expectation-Maximization Multimodal Modulation
Peijun Bao, Zihao Shao, Wenhan Yang et al.
Dual-Enhanced Coreset Selection with Class-wise Collaboration for Online Blurry Class Incremental Learning
Yutian Luo, Shiqi Zhao, Haoran Wu et al.
Dealing with Numeric and Metric Time Constraints in PDDL3 via Compilation to Numeric Planning
Luigi Bonassi, Alfonso Emilio Gerevini, Enrico Scala
ML-SemReg: Boosting Point Cloud Registration with Multi-level Semantic Consistency
Shaocheng Yan, Pengcheng Shi, Jiayuan Li
Reprojection Errors as Prompts for Efficient Scene Coordinate Regression
Ting-Ru Liu, Hsuan-Kung Yang, Jou-Min Liu et al.
Temporal As a Plugin: Unsupervised Video Denoising with Pre-Trained Image Denoisers
Zixuan Fu, Lanqing Guo, Chong Wang et al.
Adaptive Multi-modal Fusion of Spatially Variant Kernel Refinement with Diffusion Model for Blind Image Super-Resolution
Junxiong Lin, Yan Wang, Zeng Tao et al.
FREST: Feature RESToration for Semantic Segmentation under Multiple Adverse Conditions
Sohyun Lee, Namyup Kim, Sungyeon Kim et al.
Region-Aware Sequence-to-Sequence Learning for Hyperspectral Denoising
JiaHua Xiao, Yang Liu, Xing Wei
GAReT: Cross-view Video Geolocalization with Adapters and Auto-Regressive Transformers
Manu S Pillai, Mamshad Nayeem Rizve, Mubarak Shah
EgoPoseFormer: A Simple Baseline for Stereo Egocentric 3D Human Pose Estimation
Chenhongyi Yang, Anastasia Tkach, Shreyas Hampali et al.
Generalizable Face Landmarking Guided by Conditional Face Warping
Jiayi Liang, Haotian Liu, Hongteng Xu et al.
Unleashing Network Potentials for Semantic Scene Completion
Fengyun Wang, Qianru Sun, Dong Zhang et al.
Decongestion by Representation: Learning to Improve Economic Welfare in Marketplaces
Omer Nahum, Gali Noti, David Parkes et al.
Active Coarse-to-Fine Segmentation of Moveable Parts from Real Images
Ruiqi Wang, Akshay Gadi Patil, Fenggen Yu et al.
LiveHPS++: Robust and Coherent Motion Capture in Dynamic Free Environment
Yiming Ren, Xiao Han, Yichen Yao et al.
cDP-MIL: Robust Multiple Instance Learning via Cascaded Dirichlet Process
Yihang Chen, Tsai Hor Chan, Guosheng Yin et al.
WebRPG: Automatic Web Rendering Parameters Generation for Visual Presentation
Zirui Shao, Feiyu Gao, Hangdi Xing et al.
SpecFormer: Guarding Vision Transformer Robustness via Maximum Singular Value Penalization
Xixu Hu, Runkai Zheng, Jindong Wang et al.
Understanding Multi-compositional learning in Vision and Language models via Category Theory
Sotirios Panagiotis Takis Chytas, Hyunwoo J. Kim, Vikas Singh
Reliable Spatial-Temporal Voxels For Multi-Modal Test-Time Adaptation
Haozhi Cao, Yuecong Xu, Jianfei Yang et al.
Exploring Reliable Matching with Phase Enhancement for Night-time Semantic Segmentation
Yuwen Pan, Rui Sun, Naisong Luo et al.
Revisit Event Generation Model: Self-Supervised Learning of Event-to-Video Reconstruction with Implicit Neural Representations
Zipeng Wang, Yunfan Lu, Lin Wang
The Gaussian Discriminant Variational Autoencoder (GdVAE): A Self-Explainable Model with Counterfactual Explanations
Anselm Haselhoff, Kevin Trelenberg, Fabian Küppers et al.
S^3D-NeRF: Single-Shot Speech-Driven Neural Radiance Field for High Fidelity Talking Head Synthesis
Dongze Li, Kang Zhao, Wei Wang et al.
Temporal Residual Guided Diffusion Framework for Event-Driven Video Reconstruction
Lin Zhu, Yunlong Zheng, Yijun Zhang et al.
Pre-training LiDAR-based 3D Object Detectors through Colorization
Tai-Yu Pan, Chenyang Ma, Tianle Chen et al.
DSMix: Distortion-Induced Saliency Map Based Pre-training for No-Reference Image Quality Assessment
Jinsong Shi, Xiaojiang Peng et al.
AlignZeg: Mitigating Objective Misalignment for Zero-shot Semantic Segmentation
Jiannan Ge, Lingxi Xie, Hongtao Xie et al.
AddMe: Zero-shot Group-photo Synthesis by Inserting People into Scenes
Dongxu Yue, Maomao Li, Yunfei Liu et al.
Efficient Hyperparameter Optimization with Adaptive Fidelity Identification
Jiantong Jiang, Zeyi Wen, Atif Mansoor et al.
Efficient Depth-Guided Urban View Synthesis
Sheng Miao, Jiaxin Huang, Dongfeng Bai et al.
Projecting Trackable Thermal Patterns for Dynamic Computer Vision
Mark Sheinin, Aswin C. Sankaranarayanan, Srinivasa G. Narasimhan
Probabilistic Sampling of Balanced K-Means using Adiabatic Quantum Computing
Jan-Nico Zaech, Martin Danelljan, Tolga Birdal et al.
Spike-Temporal Latent Representation for Energy-Efficient Event-to-Video Reconstruction
Jianxiong Tang, Jian-Huang Lai, Lingxiao Yang et al.
Towards Cheaper Inference in Deep Networks with Lower Bit-Width Accumulators
Yaniv Blumenfeld, Itay Hubara, Daniel Soudry
SCOMatch: Alleviating Overtrusting in Open-set Semi-supervised Learning
Zerun Wang, Liuyu Xiang, Lang Huang et al.
Towards a Theoretical Understanding of Why Local Search Works for Clustering with Fair-Center Representation
Zhen Zhang, Junfeng Yang, Limei Liu et al.
Pseudo Label Refinery for Unsupervised Domain Adaptation on Cross-dataset 3D Object Detection
Zhanwei Zhang, Minghao Chen, Shuai Xiao et al.
Fully Exploiting Every Real Sample: SuperPixel Sample Gradient Model Stealing
Yunlong Zhao, Xiaoheng Deng, Yijing Liu et al.
Adversarial Feature Map Pruning for Backdoor
Dong Huang, Qingwen Bu
FaceLift: Semi-supervised 3D Facial Landmark Localization
David Ferman, Pablo Garrido, Gaurav Bharaj
EFormer: Enhanced Transformer towards Semantic-Contour Features of Foreground for Portraits Matting
Zitao Wang, Qiguang Miao, Yue Xi et al.
VSRD: Instance-Aware Volumetric Silhouette Rendering for Weakly Supervised 3D Object Detection
Zihua Liu, Hiroki Sakuma, Masatoshi Okutomi
Spot the Error: Non-autoregressive Graphic Layout Generation with Wireframe Locator
Jieru Lin, Danqing Huang, Tiejun Zhao et al.
PMAC: Personalized Multi-Agent Communication
Xiangrui Meng, Ying Tan
Text Grouping Adapter: Adapting Pre-trained Text Detector for Layout Analysis
Tianci Bi, Xiaoyi Zhang, Zhizheng Zhang et al.
Normalizing Flows on the Product Space of SO(3) Manifolds for Probabilistic Human Pose Modeling
Olaf Dünkel, Tim Salzmann, Florian Pfaff
Stationary Representations: Optimally Approximating Compatibility and Implications for Improved Model Replacements
Niccolò Biondi, Federico Pernici, Simone Ricci et al.
Cheaper and Faster: Distributed Deep Reinforcement Learning with Serverless Computing
Hanfei Yu, Jian Li, Yang Hua et al.
ContactGen: Contact-Guided Interactive 3D Human Generation for Partners
Dongjun Gu, Jaehyeok Shim, Jaehoon Jang et al.
Unraveling the Enigma of Double Descent: An In-depth Analysis through the Lens of Learned Feature Space
Yufei Gu, Xiaoqing Zheng, Tomaso Aste
Correspondence-Free SE(3) Point Cloud Registration in RKHS via Unsupervised Equivariant Learning
Ray Zhang, Zheming Zhou, Min Sun et al.
NECA: Neural Customizable Human Avatar
Junjin Xiao, Qing Zhang, Zhan Xu et al.
Cached Transformers: Improving Transformers with Differentiable Memory Cache
Zhaoyang Zhang, Wenqi Shao, Yixiao Ge et al.
Unsegment Anything by Simulating Deformation
Jiahao Lu, Xingyi Yang, Xinchao Wang
Using My Artistic Style? You Must Obtain My Authorization
Xiuli Bi, Haowei Liu, Weisheng Li et al.
Modeling Knowledge Graphs with Composite Reasoning
Wanyun Cui, Linqiu Zhang
ACT-Diffusion: Efficient Adversarial Consistency Training for One-step Diffusion Models
Fei Kong, Jinhao Duan, Lichao Sun et al.
Impartial Adversarial Distillation: Addressing Biased Data-Free Knowledge Distillation via Adaptive Constrained Optimization
Dongping Liao, Xitong Gao, Chengzhong Xu
Intensity-Robust Autofocus for Spike Camera
Changqing Su, Zhiyuan Ye, Yongsheng Xiao et al.
nuCraft: Crafting High Resolution 3D Semantic Occupancy for Unified 3D Scene Understanding
Benjin Zhu, Zhe Wang, Hongsheng Li
Permutation-Based Hypothesis Testing for Neural Networks
Francesca Mandel, Ian Barnett
EBMDock: Neural Probabilistic Protein-Protein Docking via a Differentiable Energy Model
Huaijin Wu, Wei Liu, Yatao Bian et al.
Stable Video Portraits
Mirela Ostrek, Justus Thies
Active Generation for Image Classification
Tao Huang, Jiaqi Liu, Shan You et al.