Most Cited 2024 "equivariant neural fields" Papers
12,324 papers found • Page 28 of 62
Conference
Learning Invariant Inter-pixel Correlations for Superpixel Generation
Sen Xu, Shikui Wei, Tao Ruan et al.
Be-Your-Outpainter: Mastering Video Outpainting through Input-Specific Adaptation
Fu-Yun Wang, Xiaoshi Wu, Zhaoyang Huang et al.
Reinforcement Learning as a Parsimonious Alternative to Prediction Cascades: A Case Study on Image Segmentation
Bharat Srikishan, Anika Tabassum, Srikanth Allu et al.
Visual Redundancy Removal for Composite Images: A Benchmark Dataset and a Multi-Visual-Effects Driven Incremental Method
Miaohui Wang, Rong Zhang, Lirong Huang et al.
Beyond Pixels: Semi-Supervised Semantic Segmentation with a Multi-scale Patch-based Multi-Label Classifier
Prantik Howlader, Srijan Das, Hieu Le et al.
Independency Adversarial Learning for Cross-Modal Sound Separation
Zhenkai Lin, Yanli Ji, Yang Yang
Confucius: Iterative Tool Learning from Introspection Feedback by Easy-to-Difficult Curriculum
Zhengliang Shi, Shen Gao, Minghang Zhu et al.
Enhancing Semi-supervised Domain Adaptation via Effective Target Labeling
Jiujun He, Bin Liu, Guosheng Yin
AdversariaLeak: External Information Leakage Attack Using Adversarial Samples on Face Recognition Systems
Roye Katzav, Amit Giloni, Edita Grolman et al.
H2GFormer: Horizontal-to-Global Voxel Transformer for 3D Semantic Scene Completion
Yu Wang, Chao Tong
Jointly Improving the Sample and Communication Complexities in Decentralized Stochastic Minimax Optimization
Xuan Zhang, Gabriel Mancino-Ball, Necdet Serhat Aybat et al.
FT-GAN: Fine-Grained Tune Modeling for Chinese Opera Synthesis
Meizhen Zheng, Peng Bai, Xiaodong Shi et al.
Voxel or Pillar: Exploring Efficient Point Cloud Representation for 3D Object Detection
Yuhao Huang, Sanping Zhou, Junjie Zhang et al.
Structural Information Enhanced Graph Representation for Link Prediction
Lei Shi, Bin Hu, Deng Zhao et al.
Restoring Images in Adverse Weather Conditions via Histogram Transformer
Shangquan Sun, Wenqi Ren, Xinwei Gao et al.
VideoAgent: A Memory-augmented Multimodal Agent for Video Understanding
Yue Fan, Xiaojian Ma, Rujie Wu et al.
TrackNeRF: Bundle Adjusting NeRF from Sparse and Noisy Views via Feature Tracks
Jinjie Mai, Wenxuan Zhu, Sara Rojas Martinez et al.
Rejection Sampling IMLE: Designing Priors for Better Few-Shot Image Synthesis
Chirag Vashist, Shichong Peng, Ke Li
LGMRec: Local and Global Graph Learning for Multimodal Recommendation
Zhiqiang Guo, Jianjun Li, Guohui Li et al.
Data-Driven Knowledge-Aware Inference of Private Information in Continuous Double Auctions
Lvye Cui, Haoran Yu
Linear-Time Algorithms for Front-Door Adjustment in Causal Graphs
Marcel Wienöbst, Benito van der Zander, Maciej Liskiewicz
Discriminatively Fuzzy Multi-View K-means Clustering with Local Structure Preserving
Jun Yin, Shiliang Sun, Lai Wei et al.
Robust Blind Text Image Deblurring via Maximum Consensus Framework
Zijian Min, Gundu Hassan, GeunSik Jo
Phoneme Hallucinator: One-Shot Voice Conversion via Set Expansion
Siyuan Shan, Yang Li, Amartya Banerjee et al.
Continuous-Time Graph Representation with Sequential Survival Process
Abdulkadir Celikkanat, Nikolaos Nakis, Morten Mørup
MONTRAGE: Monitoring Training for Attribution of Generative Diffusion Models
Jonathan Brokman, Omer Hofman, Roman Vainshtein et al.
Understanding Distributed Representations of Concepts in Deep Neural Networks without Supervision
Wonjoon Chang, Dahee Kwon, Jaesik Choi
How Far Can a 1-Pixel Camera Go? Solving Vision Tasks using Photoreceptors and Computationally Designed Visual Morphology
Andrei Atanov, Rishubh Singh, Jiawei Fu et al.
Unsupervised Continual Anomaly Detection with Contrastively-Learned Prompt
Jiaqi Liu, Kai Wu, Qiang Nie et al.
Separate the Wheat from the Chaff: Model Deficiency Unlearning via Parameter-Efficient Module Operation
Xinshuo Hu, Dongfang Li, Zihao Zheng et al.
ProAgent: Building Proactive Cooperative Agents with Large Language Models
Ceyao Zhang, Kaijie Yang, Siyi Hu et al.
Encoding Constraints as Binary Constraint Networks Satisfying BTP
MLPHand: Real Time Multi-View 3D Hand Reconstruction via MLP Modeling
Jian Yang, Jiakun Li, Guoming Li et al.
Distractors-Immune Representation Learning with Cross-modal Contrastive Regularization for Change Captioning
Yunbin Tu, Liang Li, Li Su et al.
DreamIdentity: Enhanced Editability for Efficient Face-Identity Preserved Image Generation
Zhuowei Chen, Shancheng Fang, Wei Liu et al.
F3-Pruning: A Training-Free and Generalized Pruning Strategy towards Faster and Finer Text-to-Video Synthesis
Sitong Su, Jianzhi Liu, Lianli Gao et al.
Adaptive Feature Imputation with Latent Graph for Deep Incomplete Multi-View Clustering
Jingyu Pu, Chenhang Cui, Xinyue Chen et al.
Can Textual Semantics Mitigate Sounding Object Segmentation Preference?
Yaoting Wang, Peiwen Sun, Yuanchao Li et al.
Variational Hybrid-Attention Framework for Multi-Label Few-Shot Aspect Category Detection
Cheng Peng, Ke Chen, Lidan Shou et al.
SNN-PDE: Learning Dynamic PDEs from Data with Simplicial Neural Networks
Jae Choi, Yuzhou Chen, Huikyo Lee et al.
DreamScene: 3D Gaussian-based Text-to-3D Scene Generation via Formation Pattern Sampling
Haoran Li, Haolin Shi, Wenli Zhang et al.
Resilience of Entropy Model in Distributed Neural Networks
Milin Zhang, Mohammad Abdi, Shahriar Rifat et al.
Dynamic Reactive Spiking Graph Neural Network
Block Image Compressive Sensing with Local and Global Information Interaction
Xiaoyu Kong, Yongyong Chen, Feng Zheng et al.
StockMixer: A Simple Yet Strong MLP-Based Architecture for Stock Price Forecasting
Jinyong Fan, Yanyan Shen
A Comprehensive Study of Multimodal Large Language Models for Image Quality Assessment
Tianhe Wu, Kede Ma, Jie Liang et al.
Efficient Nonparametric Tensor Decomposition for Binary and Count Data
Zerui Tao, Toshihisa Tanaka, Qibin Zhao
Identification of Causal Structure in the Presence of Missing Data with Additive Noise Model
Zhengrui Chen, Liying Lu, Ziyang Yuan et al.
TCNet: Continuous Sign Language Recognition from Trajectories and Correlated Regions
Learning Domain-Independent Heuristics for Grounded and Lifted Planning
Lost and Found: Overcoming Detector Failures in Online Multi-Object Tracking
Lorenzo Vaquero, Yihong XU, Xavier Alameda-Pineda et al.
Prompt-Driven Contrastive Learning for Transferable Adversarial Attacks
Hunmin Yang, Jongoh Jeong, Kuk-Jin Yoon
OntoFact: Unveiling Fantastic Fact-Skeleton of LLMs via Ontology-Driven Reinforcement Learning
Ziyu Shang, Ke Wenjun, Nana Xiu et al.
Exploring Temporal Feature Correlation for Efficient and Stable Video Semantic Segmentation
Matthieu Lin, Jenny Sheng, Yubin Hu et al.
DeblurSR: Event-Based Motion Deblurring under the Spiking Representation
Chen Song, Chandrajit Bajaj, Qixing Huang
Boosting Multiple Instance Learning Models for Whole Slide Image Classification: A Model-Agnostic Framework Based on Counterfactual Inference
Weiping Lin, Zhenfeng Zhuang, Lequan Yu et al.
OCEAN-MBRL: Offline Conservative Exploration for Model-Based Offline Reinforcement Learning
Fan Wu, Rui Zhang, Qi Yi et al.
Continual Learning and Unknown Object Discovery in 3D Scenes via Self-Distillation
Mohamed El Amine Boudjoghra, Jean Lahoud, Salman Khan et al.
Semantic Complete Scene Forecasting from a 4D Dynamic Point Cloud Sequence
Zifan Wang, Zhuorui Ye, Haoran Wu et al.
Authors
- Xinshu Li, Lina Yao
Improving Panoptic Narrative Grounding by Harnessing Semantic Relationships and Visual Confirmation
TAG: Text Prompt Augmentation for Zero-Shot Out-of-Distribution Detection
Xixi Liu, Christopher Zach
PLOT: Text-based Person Search with Part Slot Attention for Corresponding Part Discovery
Jicheol Park, Dongwon Kim, Boseung Jeong et al.
Occluded Gait Recognition with Mixture of Experts: An Action Detection Perspective
Panjian Huang, Yunjie Peng, Saihui Hou et al.
Causal Strategic Learning with Competitive Selection
Causal Walk: Debiasing Multi-Hop Fact Verification with Front-Door Adjustment
Close, But Not There: Boosting Geographic Distance Sensitivity in Visual Place Recognition
Sergio Izquierdo, Javier Civera
Video-Language Aligned Transformer for Video Question Answering
PoetryDiffusion: Towards Joint Semantic and Metrical Manipulation in Poetry Generation
DoubleTake: Geometry Guided Depth Estimation
Mohamed Sayed, Filippo Aleotti, Jamie Watson et al.
Efficient Asynchronous Federated Learning with Prospective Momentum Aggregation and Fine-Grained Correction
HEAP: Unsupervised Object Discovery and Localization with Contrastive Grouping
On the Structural Hardness of Answer Set Programming: Can Structure Efficiently Confine the Power of Disjunctions?
Markus Hecher, Rafael Kiesel
Multi-View Randomized Kernel Classification via Nonconvex Optimization
Oulu Remote-photoplethysmography Physical Domain Attacks Database (ORPDAD)
Marko Savic, Guoying Zhao
Self-Rectifying Diffusion Sampling with Perturbed-Attention Guidance
Donghoon Ahn, Hyoungwon Cho, Jaewon Min et al.
Are You Concerned about Limited Function Evaluations: Data-Augmented Pareto Set Learning for Expensive Multi-Objective Optimization
Enhancing Representation of Spiking Neural Networks via Similarity-Sensitive Contrastive Learning
Learning Coalition Structures with Games
On Unsupervised Domain Adaptation: Pseudo Label Guided Mixup for Adversarial Prompt Tuning
Distribution-Conditioned Adversarial Variational Autoencoder for Valid Instrumental Variable Generation
Leveraging Representations from Intermediate Encoder-blocks for Synthetic Image Detection
Christos Koutlis, Symeon Papadopoulos
Solving Motion Planning Tasks with a Scalable Generative Model
Yihan Hu, Siqi Chai, Zhening Yang et al.
Transfer and Alignment Network for Generalized Category Discovery
Wenbin An, Feng Tian, Wenkai Shi et al.
FedCD: Federated Semi-supervised Learning with Class Awareness Balance via Dual Teachers
Yuzhi Liu, Huisi Wu, Jing Qin
Semi-supervised TEE Segmentation via Interacting with SAM Equipped with Noise-Resilient Prompting
Sen Deng, Yidan Feng, Haoneng Lin et al.
Prompting Multi-Modal Image Segmentation with Semantic Grouping
TMFormer: Token Merging Transformer for Brain Tumor Segmentation with Missing Modalities
Zheyu Zhang, Gang Yang, Yueyi Zhang et al.
Multiscale Attention Wavelet Neural Operator for Capturing Steep Trajectories in Biochemical Systems
Jiayang Su, Junbo Ma, Songyang Tong et al.
An Effective Augmented Lagrangian Method for Fine-Grained Multi-View Optimization
Yuze Tan, Hecheng Cai, Shudong Huang et al.
Versatile Incremental Learning: Towards Class and Domain-Agnostic Incremental Learning
Minyeong Park, Jae-Ho Lee, Gyeong-Moon Park
Parrot Captions Teach CLIP to Spot Text
Yiqi Lin, Conghui He, Alex Jinpeng Wang et al.
Primitive-Based 3D Human-Object Interaction Modelling and Programming
Siqi Liu, Yong-Lu Li, Zhou FANG et al.
Collaborative Synthesis of Patient Records through Multi-Visit Health State Inference
Hongda Sun, Hongzhan Lin, Rui Yan
Decentralized Sum-of-Nonconvex Optimization
Zhuanghua Liu, Bryan Kian Hsiang Low
All Beings Are Equal in Open Set Recognition
Chaohua Li, Enhao Zhang, Chuanxing Geng et al.
PRP Rebooted: Advancing the State of the Art in FOND Planning
Christian Muise, Sheila McIlraith, J. Christopher Beck
LAPT: Label-driven Automated Prompt Tuning for OOD Detection with Vision-Language Models
Yabin Zhang, Wenjie Zhu, Chenhang He et al.
ExpeL: LLM Agents Are Experiential Learners
Andrew Zhao, Daniel Huang, Quentin Xu et al.
Multi-Cross Sampling and Frequency-Division Reconstruction for Image Compressed Sensing
Heping Song, Jingyao Gong, Hongying Meng et al.
Electron Microscopy Images as Set of Fragments for Mitochondrial Segmentation
Naisong Luo, Rui Sun, Yuwen Pan et al.
Graph-Based Prediction and Planning Policy Network (GP3Net) for Scalable Self-Driving in Dynamic Environments Using Deep Reinforcement Learning
Jayabrata Chowdhury, Venkataramanan Shivaraman, Suresh Sundaram et al.
MedBench: A Large-Scale Chinese Benchmark for Evaluating Medical Large Language Models
Yan Cai, Linlin Wang, Ye Wang et al.
Sampling for Beyond-Worst-Case Online Ranking
Qingyun Chen, Sungjin Im, Benjamin Moseley et al.
PMET: Precise Model Editing in a Transformer
Xiaopeng Li, Shasha Li, Shezheng Song et al.
Learning Neural Deformation Representation for 4D Dynamic Shape Generation
Gyojin Han, Jiwan Hur, Jaehyun Choi et al.
ColorMAE: Exploring data-independent masking strategies in Masked AutoEncoders
Carlos Hinojosa, Shuming Liu, Bernard Ghanem
Explaining Reinforcement Learning Agents through Counterfactual Action Outcomes
Yotam Amitai, Yael Friedler, Ofra Amir
CLIP-Gaze: Towards General Gaze Estimation via Visual-Linguistic Model
Pengwei Yin, Guanzhong Zeng, Jingjing Wang et al.
Point Deformable Network with Enhanced Normal Embedding for Point Cloud Analysis
Xingyilang Yin, Xi Yang, Liangchen Liu et al.
Agile-Quant: Activation-Guided Quantization for Faster Inference of LLMs on the Edge
Xuan Shen, Peiyan Dong, Lei Lu et al.
Chains of Diffusion Models
Yanheng Wei, Lianghua Huang, Zhi-Fan Wu et al.
Robust Nearest Neighbors for Source-Free Domain Adaptation under Class Distribution Shift
Antonio Tejero-de-Pablos, Riku Togashi, Mayu Otani et al.
A Direct Approach to Viewing Graph Solvability
Federica Arrigoni, Andrea Fusiello, Tomas Pajdla
Selective Deep Autoencoder for Unsupervised Feature Selection
Wael Hassanieh, Abdallah Chehade
Variable Importance in High-Dimensional Settings Requires Grouping
Yifan Lu, Ziqi Zhang, Chunfeng Yuan et al.
Physics-informed Knowledge Transfer for Underwater Monocular Depth Estimation
Jinghe Yang, Mingming Gong, Ye Pu
EG-NAS: Neural Architecture Search with Fast Evolutionary Exploration
Cross-Modal Feature Distribution Calibration for Few-Shot Visual Question Answering
Learning Equilibrium Transformation for Gamut Expansion and Color Restoration
JUN XIAO, Changjian Shui, Zhi-Song Liu et al.
SFC: Shared Feature Calibration in Weakly Supervised Semantic Segmentation
Set Prediction Guided by Semantic Concepts for Diverse Video Captioning
Pseudo-Label Calibration Semi-supervised Multi-Modal Entity Alignment
Luyao Wang, Pengnian Qi, Xigang Bao et al.
GGRt: Towards Generalizable 3D Gaussians without Pose Priors in Real-Time
Hao Li, Yuanyuan Gao, Dingwen Zhang et al.
Weighted Ensemble Models Are Strong Continual Learners
Imad Eddine Marouf, Subhankar Roy, Enzo Tartaglione et al.
Three Things We Need to Know About Transferring Stable Diffusion to Visual Dense Prediciton Tasks
Manyuan Zhang, Guanglu Song, Xiaoyu Shi et al.
Cocktail Universal Adversarial Attack on Deep Neural Networks
Shaoxin Li, Xiaofeng Liao, Xin Che et al.
Parrot: Pareto-optimal Multi-Reward Reinforcement Learning Framework for Text-to-Image Generation
Seung Hyun Lee, Yinxiao Li, Junjie Ke et al.
Semi-Supervised Teacher-Reference-Student Architecture for Action Quality Assessment
Wulian Yun, Mengshi Qi, Fei Peng et al.
CPT-VR: Improving Surface Rendering via Closest Point Transform with View-Reflection Appearance
Zhipeng Hu, Yongqiang Zhang, Chen Liu et al.
Orthogonal Dictionary Guided Shape Completion Network for Point Cloud
Pingping Cai, Deja Scott, Xiaoguang Li et al.
Progressive High-Frequency Reconstruction for Pan-Sharpening with Implicit Neural Representation
Ge Meng, Jingjia Huang, Yingying Wang et al.
LPViT: Low-Power Semi-structured Pruning for Vision Transformers
KAIXIN Xu, Zhe Wang, Chunyun Chen et al.
What to Remember: Self-Adaptive Continual Learning for Audio Deepfake Detection
XiaoHui Zhang, Jiangyan Yi, Chenglong Wang et al.
Can You Rely on Synthetic Labellers in Preference-Based Reinforcement Learning? It’s Complicated
Katherine Metcalf, Miguel Sarabia, Masha Fedzechkina et al.
GAD-PVI: A General Accelerated Dynamic-Weight Particle-Based Variational Inference Framework
Fangyikang Wang, Huminhao Zhu, Chao Zhang et al.
Beyond Grounding: Extracting Fine-Grained Event Hierarchies across Modalities
Hammad Ayyubi, Christopher Thomas, Lovish Chum et al.
Towards Open-World Object-based Anomaly Detection via Self-Supervised Outlier Synthesis
Brian Isaac Medina, Yona Falinie Abdul Gaus, Neelanjan Bhowmik et al.
Object-Conditioned Energy-Based Attention Map Alignment in Text-to-Image Diffusion Models
Yasi Zhang, Peiyu Yu, Ying Nian Wu
Underspecification in Language Modeling Tasks: A Causality-Informed Study of Gendered Pronoun Resolution
Emily McMilin
Enhancing Bilingual Lexicon Induction via Bi-directional Translation Pair Retrieving
Ding Qiuyu, Hailong Cao, Tiejun Zhao
Graph Reasoning Transformers for Knowledge-Aware Question Answering
Ruilin Zhao, Feng Zhao, Liang Hu et al.
Multi-Modal Hallucination Control by Visual Information Grounding
Alessandro Favero, Luca Zancato, Matthew Trager et al.
Earthfarsser: Versatile Spatio-Temporal Dynamical Systems Modeling in One Model
Hao Wu, Yuxuan Liang, Wei Xiong et al.
HoloADMM: High-Quality Holographic Complex Field Recovery
Mazen Mel, Paul Springer, Pietro Zanuttigh et al.
Unsqueeze [CLS] Bottleneck to Learn Rich Representations
Qing Su, Shihao Ji
Learning without Exact Guidance: Updating Large-scale High-resolution Land Cover Maps from Low-resolution Historical Labels
Zhuohong Li, Wei He, Jiepan Li et al.
Boosting Image Quality Assessment through Efficient Transformer Adaptation with Local Feature Enhancement
Kangmin Xu, Liang Liao, Jing Xiao et al.
Segment Any Event Streams via Weighted Adaptation of Pivotal Tokens
Zhiwen Chen, Zhiyu Zhu, Yifan Zhang et al.
IBD-SLAM: Learning Image-Based Depth Fusion for Generalizable SLAM
Minghao Yin, Shangzhe Wu, Kai Han
MAPSeg: Unified Unsupervised Domain Adaptation for Heterogeneous Medical Image Segmentation Based on 3D Masked Autoencoding and Pseudo-Labeling
Xuzhe Zhang, Yuhao Wu, Elsa Angelini et al.
Anomaly Heterogeneity Learning for Open-set Supervised Anomaly Detection
Jiawen Zhu, Choubo Ding, Yu Tian et al.
Market-GAN: Adding Control to Financial Market Data Generation with Semantic Context
Haochong Xia, Shuo Sun, Xinrun Wang et al.
Fast Adaptation for Human Pose Estimation via Meta-Optimization
Shengxiang Hu, Huaijiang Sun, Bin Li et al.
Selective Hourglass Mapping for Universal Image Restoration Based on Diffusion Model
Dian Zheng, Xiao-Ming Wu, Shuzhou Yang et al.
Semantic Segmentation in Multiple Adverse Weather Conditions with Domain Knowledge Retention
Xin Yang, Wending Yan, Yuan Yuan et al.
Understanding the Generalization of Pretrained Diffusion Models on Out-of-Distribution Data
Sai Niranjan Ramachandran, Rudrabha Mukhopadhyay, Madhav Agarwal et al.
Purified and Unified Steganographic Network
GuoBiao Li, Sheng Li, Zicong Luo et al.
Explicitly Perceiving and Preserving the Local Geometric Structures for 3D Point Cloud Attack
Daizong Liu, Wei Hu
KVQ: Kwai Video Quality Assessment for Short-form Videos
Yiting Lu, Xin Li, Yajing Pei et al.
Consistent Prompting for Rehearsal-Free Continual Learning
Zhanxin Gao, Jun Cen, Xiaobin Chang
ModWaveMLP: MLP-Based Mode Decomposition and Wavelet Denoising Model to Defeat Complex Structures in Traffic Forecasting
Ke Sun, Pei Liu, Pengfei Li et al.
FedHide: Federated Learning by Hiding in the Neighbors
Hyunsin Park, Sungrack Yun
Mean-Shift Feature Transformer
Takumi Kobayashi
Tactile-Augmented Radiance Fields
Yiming Dou, Fengyu Yang, Yi Liu et al.
SpikingResformer: Bridging ResNet and Vision Transformer in Spiking Neural Networks
Xinyu Shi, Zecheng Hao, Zhaofei Yu
Improving Transferability for Cross-Domain Trajectory Prediction via Neural Stochastic Differential Equation
Daehee Park, Jaewoo Jeong, Kuk-Jin Yoon
One-Shot Open Affordance Learning with Foundation Models
Gen Li, Deqing Sun, Laura Sevilla-Lara et al.
Embodied Multi-Modal Agent trained by an LLM from a Parallel TextWorld
Yijun Yang, Tianyi Zhou, kanxue Li et al.
Hypercorrelation Evolution for Video Class-Incremental Learning
Sen Liang, Kai Zhu, Wei Zhai et al.
Inter-X: Towards Versatile Human-Human Interaction Analysis
Liang Xu, Xintao Lv, Yichao Yan et al.
Benchmarking Audio Visual Segmentation for Long-Untrimmed Videos
Chen Liu, Peike Li, Qingtao Yu et al.
Information Design for Congestion Games with Unknown Demand
Svenja M. Griesbach, Martin Hoefer, Max Klimm et al.
Towards Image Ambient Lighting Normalization
Florin-Alexandru Vasluianu, Tim Seizinger, Zongwei Wu et al.
CadVLM: Bridging Language and Vision in the Generation of Parametric CAD Sketches
Sifan Wu, Amir Hosein Khasahmadi, Mor Katz et al.
DomainFusion: Generalizing To Unseen Domains with Latent Diffusion Models
Yuyang Huang, Yabo Chen, Yuchen Liu et al.
Tree-D Fusion: Simulation-Ready Tree Dataset from Single Images with Diffusion Priors
Jae Joong Lee, Bosheng Li, Sara Beery et al.
DiVAS: Video and Audio Synchronization with Dynamic Frame Rates
Clara Maria Fernandez Labrador, Mertcan Akcay, Eitan Abecassis et al.
Holodeck: Language Guided Generation of 3D Embodied AI Environments
Yue Yang, Fan-Yun Sun, Luca Weihs et al.
PartDistill: 3D Shape Part Segmentation by Vision-Language Model Distillation
Ardian Umam, Cheng-Kun Yang, Min-Hung Chen et al.
Detector-Free Structure from Motion
Xingyi He, Jiaming Sun, Yifan Wang et al.
Rethinking Human Motion Prediction with Symplectic Integral
Haipeng Chen, Kedi L yu, Zhenguang Liu et al.
Double Buffers CEM-TD3: More Efficient Evolution and Richer Exploration
Sheng Zhu, Chun Shen, Shuai Lü et al.
iToF-flow-based High Frame Rate Depth Imaging
Yu Meng, Zhou Xue, Xu Chang et al.
AttriHuman-3D: Editable 3D Human Avatar Generation with Attribute Decomposition and Indexing
Fan Yang, Tianyi Chen, XIAOSHENG HE et al.
C3: High-Performance and Low-Complexity Neural Compression from a Single Image or Video
Hyunjik Kim, Matthias Bauer, Lucas Theis et al.
Learning Coupled Dictionaries from Unpaired Data for Image Super-Resolution
Longguang Wang, Juncheng Li, Yingqian Wang et al.
L4D-Track: Language-to-4D Modeling Towards 6-DoF Tracking and Shape Reconstruction in 3D Point Cloud Stream
Jingtao Sun, Yaonan Wang, Mingtao Feng et al.
On the Affinity, Rationality, and Diversity of Hierarchical Topic Modeling
Xiaobao Wu, Fengjun Pan, Thong Nguyen et al.
Adaptive Slot Attention: Object Discovery with Dynamic Slot Number
Ke Fan, Zechen Bai, Tianjun Xiao et al.
Rethinking Prior Information Generation with CLIP for Few-Shot Segmentation
Jin Wang, Bingfeng Zhang, Jian Pang et al.
LiSA: LiDAR Localization with Semantic Awareness
Bochun Yang, Zijun Li, Wen Li et al.
MmAP: Multi-Modal Alignment Prompt for Cross-Domain Multi-Task Learning
Yi Xin, Junlong Du, Qiang Wang et al.
Teaching Large Language Models to Translate with Comparison
Jiali Zeng, Fandong Meng, Yongjing Yin et al.
CausalPC: Improving the Robustness of Point Cloud Classification by Causal Effect Identification
Yuanmin Huang, Mi Zhang, Daizong Ding et al.
Adapting to Length Shift: FlexiLength Network for Trajectory Prediction
Yi Xu, Yun Fu
Diffeomorphic Template Registration for Atmospheric Turbulence Mitigation
Dong Lao, Congli Wang, Alex Wong et al.
Instruct-Imagen: Image Generation with Multi-modal Instruction
Hexiang Hu, Kelvin C.K. Chan, Yu-Chuan Su et al.
Rapid Motor Adaptation for Robotic Manipulator Arms
Yichao Liang, Kevin Ellis, João F. Henriques
PM-INR: Prior-Rich Multi-Modal Implicit Large-Scale Scene Neural Representation
Yiying Yang, Fukun Yin, Wen Liu et al.