Most Cited 2024 "student learning trajectories" Papers
12,324 papers found • Page 27 of 62
Conference
Dense Hand-Object(HO) GraspNet with Full Grasping Taxonomy and Dynamics
Woojin Cho, Jihyun Lee, Minjae Yi et al.
TexDreamer: Towards Zero-Shot High-Fidelity 3D Human Texture Generation
Yufei Liu, Junwei Zhu, Junshu Tang et al.
Look Around and Learn: Self-Training Object Detection by Exploration
Gianluca Scarpellini, Stefano Rosa, Pietro Morerio et al.
Correspondences of the Third Kind: Camera Pose Estimation from Object Reflection
Kohei Yamashita, Vincent Lepetit, Ko Nishino
Eyes Closed, Safety On: Protecting Multimodal LLMs via Image-to-Text Transformation
Yunhao Gou, Kai Chen, Zhili LIU et al.
Polyper: Boundary Sensitive Polyp Segmentation
Hao Shao, Yang Zhang, Qibin Hou
Mean Teacher DETR with Masked Feature Alignment: A Robust Domain Adaptive Detection Transformer Framework
Weixi Weng, Chun Yuan
Semantic-Aware Transformation-Invariant RoI Align
Guo-Ye Yang, Kiyohiro Nakayama, Zi-Kai Xiao et al.
CR-SAM: Curvature Regularized Sharpness-Aware Minimization
Tao Wu, Tie Luo, Donald Wunsch
U-trustworthy Models. Reliability, Competence, and Confidence in Decision-Making
Ritwik Vashistha, Arya Farahi
Improving Hyperbolic Representations via Gromov-Wasserstein Regularization
yifei Yang, Wonjun Lee, Dongmian Zou et al.
Privacy-Preserving Adaptive Re-Identification without Image Transfer
Hamza Rami, Jhony H. Giraldo, Nicolas Winckler et al.
When CEGAR Meets Regression: A Love Story in Optimal Classical Planning
Martín Pozo, Alvaro Torralba, Carlos Linares Lopez
Tackling Vision Language Tasks through Learning Inner Monologues
Diji Yang, Kezhen Chen, Jinmeng Rao et al.
Recurrent Graph Neural Networks and Their Connections to Bisimulation and Logic
Maximilian Pflüger, David Tena Cucala, Egor Kostylev
Motion Keyframe Interpolation for Any Human Skeleton using Point Cloud-based Human Motion Data Homogenisation
Clinton Mo, Kun Hu, Chengjiang Long et al.
GroupDiff: Diffusion-based Group Portrait Editing
Yuming Jiang, Nanxuan Zhao, Qing Liu et al.
Knowledge-Aware Neuron Interpretation for Scene Classification
Yong Guan, Freddy Lecue, Jiaoyan Chen et al.
Prompt to Transfer: Sim-to-Real Transfer for Traffic Signal Control with Prompt Learning
Longchao Da, Minquan Gao, Hua Wei et al.
Towards the Robustness of Differentially Private Federated Learning
Tao Qi, Huili Wang, Yongfeng Huang
OSFFNet: Omni-Stage Feature Fusion Network for Lightweight Image Super-resolution
Yang Wang, Tao Zhang
Motion Aware Event Representation-driven Image Deblurring
Zhijing Sun, Xueyang Fu, Longzhuo Huang et al.
Robustness Verification of Deep Reinforcement Learning Based Control Systems Using Reward
Dapeng Zhi, Peixin Wang, Cheng Chen et al.
Weakly-Supervised Mirror Detection via Scribble Annotations
Mingfeng Zha, Yunqiang Pei, Guoqing Wang et al.
LogoStyleFool: Vitiating Video Recognition Systems via Logo Style Transfer
Yuxin Cao, Ziyu Zhao, Xi Xiao et al.
Follow the Rules: Reasoning for Video Anomaly Detection with Large Language Models
Yuchen Yang, Kwonjoon Lee, Behzad Dariush et al.
Counterfactual-Enhanced Information Bottleneck for Aspect-Based Sentiment Analysis
Mingshan Chang, Min Yang, Qingshan Jiang et al.
Plug and Play: A Representation Enhanced Domain Adapter for Collaborative Perception
TIANYOU LUO, Quan Yuan, Yuchen Xia et al.
DiffPMAE: Diffusion Masked Autoencoders for Point Cloud Reconstruction
YANLONG LI, Chamara Madarasingha, Kanchana Thilakarathna
3DFG-PIFu: 3D Feature Grids for Human Digitization from Sparse Views
Kennard Yanting Chan, Fayao Liu, Guosheng Lin et al.
Elevating All Zero-Shot Sketch-Based Image Retrieval Through Multimodal Prompt Learning
Mainak Singha, Ankit Jha, Divyam Gupta et al.
Task-Free Continual Generation and Representation Learning via Dynamic Expansionable Memory Cluster
Fei Ye, Adrian Bors
Learning Image Demoireing from Unpaired Real Data
Yunshan Zhong, Zhou Yuyao, Yuxin Zhang et al.
Fine-Tuning Large Language Model Based Explainable Recommendation with Explainable Quality Reward
Mengyuan Yang, Mengying Zhu, Yan Wang et al.
Reinforcement Learning via Auxillary Task Distillation
Abhinav Narayan Harish, Larry Heck, Josiah P Hanna et al.
LAFA: Multimodal Knowledge Graph Completion with Link Aware Fusion and Aggregation
Bin Shang, Yinliang Zhao, Jun Liu et al.
A Goal Interaction Graph Planning Framework for Conversational Recommendation
Xiaotong Zhang, Xuefang Jia, Han Liu et al.
On the Vulnerability of Skip Connections to Model Inversion Attacks
Jun Hao Koh, Sy-Tuyen Ho, Ngoc-Bao Nguyen et al.
An Embedding-Unleashing Video Polyp Segmentation Framework via Region Linking and Scale Alignment
Zhixue Fang, Xinrong Guo, Jingyin Lin et al.
ECHO-GL: Earnings Calls-Driven Heterogeneous Graph Learning for Stock Movement Prediction
Mengpu Liu, Mengying Zhu, Xiuyuan Wang et al.
Generalizable Symbolic Optimizer Learning
Xiaotian Song, Peng Zeng, Yanan Sun et al.
OLAF: A Plug-and-Play Framework for Enhanced Multi-object Multi-part Scene Parsing
Pranav Gupta, Rishubh Singh, Pradeep Shenoy et al.
End-to-End Learning of LTLf Formulae by Faithful LTLf Encoding
Hai Wan, Pingjia Liang, Jianfeng Du et al.
FeatWalk: Enhancing Few-Shot Classification through Local View Leveraging
Dalong Chen, Jianjia Zhang, Wei-shi Zheng et al.
SHaRPose: Sparse High-Resolution Representation for Human Pose Estimation
Xiaoqi An, Lin Zhao, Chen Gong et al.
Approximating the Shapley Value without Marginal Contributions
Patrick Kolpaczki, Viktor Bengs, Maximilian Muschalik et al.
Improving Neural Network Generalization on Data-Limited Regression with Doubly-Robust Boosting
An Empirical Study and Analysis of Text-to-Image Generation Using Large Language Model-Powered Textual Representation
Zhiyu Tan, Mengping Yang, Luozheng Qin et al.
DPA-Net: Structured 3D Abstraction from Sparse Views via Differentiable Primitive Assembly
Fenggen Yu, Yiming Qian, Xu Zhang et al.
Thompson Sampling for Real-Valued Combinatorial Pure Exploration of Multi-Armed Bandit
Shintaro Nakamura, Masashi Sugiyama
Implicit Modeling of Non-rigid Objects with Cross-Category Signals
Yuchun Liu, Benjamin Planche, Meng Zheng et al.
Towards Improved Proxy-Based Deep Metric Learning via Data-Augmented Domain Adaptation
Li Ren, Chen Chen, Liqiang Wang et al.
NeBLa: Neural Beer-Lambert for 3D Reconstruction of Oral Structures from Panoramic Radiographs
Sihwa Park, Seongjun Kim, Doeyoung Kwon et al.
Seeing Dark Videos via Self-Learned Bottleneck Neural Representation
Haofeng Huang, Wenhan Yang, Lingyu Duan et al.
REPrune: Channel Pruning via Kernel Representative Selection
Mincheol Park, Dongjin Kim, Cheonjun Park et al.
BAGS: Blur Agnostic Gaussian Splatting through Multi-Scale Kernel Modeling
Cheng Peng, Yutao Tang, Yifan Zhou et al.
Uncertainty Calibration with Energy Based Instance-wise Scaling in the Wild Dataset
Mijoo Kim, Junseok Kwon
Category Adaptation Meets Projected Distillation in Generalized Continual Category Discovery
Grzegorz Rypesc, Daniel Marczak, Sebastian Cygert et al.
Collaborative Control for Geometry-Conditioned PBR Image Generation
Shimon Vainer, Mark Boss, Mathias Parger et al.
Learning Representation for Multitask Learning through Self-Supervised Auxiliary Learning
Seokwon Shin, Hyungrok Do, Youngdoo Son
CLIP-Guided Federated Learning on Heterogeneity and Long-Tailed Data
Jiangming Shi, Shanshan Zheng, Xiangbo Yin et al.
DR-Label: Label Deconstruction and Reconstruction of GNN Models for Catalysis Systems
Bowen Wang, Chen Liang, Jiaze Wang et al.
GSDD: Generative Space Dataset Distillation for Image Super-resolution
Haiyu Zhang, Shaolin Su, Yu Zhu et al.
LRSLAM: Low-rank Representation of Signed Distance Fields in Dense Visual SLAM System
Hongbeen Park, Minjeong Park, Giljoo Nam et al.
GRiT: A Generative Region-to-text Transformer for Object Understanding
Jialian Wu, Jianfeng Wang, Zhengyuan Yang et al.
Deep Unfolded Network with Intrinsic Supervision for Pan-Sharpening
Parallel Beam Search Algorithms for Domain-Independent Dynamic Programming
Vincent Conitzer, Yueqian Wang, Yuxuan Wang et al.
ST-LDM: A Universal Framework for Text-Grounded Object Generation in Real Images
Xiangtian Xue, Jiasong Wu, Youyong Kong et al.
STAIR: Spatial-Temporal Reasoning with Auditable Intermediate Results for Video Question Answering
The Complexity of Computing Robust Mediated Equilibria in Ordinal Games
Multilevel Attention Network with Semi-supervised Domain Adaptation for Drug-Target Prediction
ZhouSan Xie, Shikui Tu, Lei Xu
Meta-optimized Angular Margin Contrastive Framework for Video-Language Representation Learning
Thanh Thong Nguyen, Yi Bin, Xiaobao Wu et al.
Comprehensive Visual Grounding for Video Description
Wenhui Jiang, Yibo Cheng, Liu Linxin et al.
Causal Adversarial Perturbations for Individual Fairness and Robustness in Heterogeneous Data Spaces
Ahmad-Reza Ehyaei, Kiarash Mohammadi, Amir-Hossein Karimi et al.
Visual Hallucination Elevates Speech Recognition
Fang Zhang, Yongxin Zhu, Xiangxiang Wang et al.
Intra- and Inter-group Optimal Transport for User-Oriented Fairness in Recommender Systems
Zhongxuan Han, Chaochao Chen, Xiaolin Zheng et al.
Ex2Eg-MAE: A Framework for Adaptation of Exocentric Video Masked Autoencoders for Egocentric Social Role Understanding
Minh Tran, Yelin Kim, Che-Chun Su et al.
Directed Diffusion: Direct Control of Object Placement through Attention Guidance
Wan-Duo Ma, Avisek Lahiri, J. P. Lewis et al.
KD-Club: An Efficient Exact Algorithm with New Coloring-Based Upper Bound for the Maximum K-defective Clique Problem
Jiongzhi Zheng, Mingming Jin, Kun He
W2P: Switching from Weak Supervision to Partial Supervision for Semantic Segmentation
Fangyuan Zhang, Tianxiang Pan, Junhai Yong et al.
A Fast Exact Solver with Theoretical Analysis for the Maximum Edge-Weighted Clique Problem
Lu Liu, Mingyu Xiao, Yi Zhou
Fractional Deep Reinforcement Learning for Age-Minimal Mobile Edge Computing
Lyudong Jin, Ming Tang, Meng Zhang et al.
CoPL: Contextual Prompt Learning for Vision-Language Understanding
Koustava Goswami, Srikrishna Karanam, Prateksha Udhayanan et al.
Terrain Diffusion Network: Climatic-Aware Terrain Generation with Geological Sketch Guidance
Zexin Hu, Kun Hu, Clinton Mo et al.
Object-Aware Query Perturbation for Cross-Modal Image-Text Retrieval
Naoya Sogi, Takashi Shibata, Makoto Terao
AdvDiff: Generating Unrestricted Adversarial Examples using Diffusion Models
Xuelong Dai, Kaisheng Liang, Bin Xiao
Discriminative Forests Improve Generative Diversity for Generative Adversarial Networks
Junjie Chen, Jiahao Li, Chen Song et al.
Progressive Feature Self-Reinforcement for Weakly Supervised Semantic Segmentation
Jingxuan He, Lechao Cheng, Chaowei Fang et al.
Piecewise Linear Transformation – Propagating Aleatoric Uncertainty in Neural Networks
Thomas Krapf, Michael Hagn, Paul Miethaner et al.
Multimodal Cross-Domain Few-Shot Learning for Egocentric Action Recognition
Masashi Hatano, Ryo Hachiuma, Ryo Fujii et al.
Social Physics Informed Diffusion Model for Crowd Simulation
Hongyi Chen, Jingtao Ding, Yong Li et al.
Textual Knowledge Matters: Cross-Modality Co-Teaching for Generalized Visual Class Discovery
Haiyang Zheng, Pu Nan, Wenjing Li et al.
SyFormer: Structure-Guided Synergism Transformer for Large-Portion Image Inpainting
Jie Wu, Yuchao Feng, Honghui Xu et al.
Curved Representation Space of Vision Transformers
Juyeop Kim, Junha Park, Songkuk Kim et al.
LDMVFI: Video Frame Interpolation with Latent Diffusion Models
Duolikun Danier, Fan Zhang, David Bull
Step Vulnerability Guided Mean Fluctuation Adversarial Attack against Conditional Diffusion Models
Hongwei Yu, Jiansheng Chen, Xinlong Ding et al.
Fast and Controllable Post-training Sparsity: Learning Optimal Sparsity Allocation with Global Constraint in Minutes
Ruihao Gong, Yang Yong, Zining Wang et al.
AdaDiff: Accelerating Diffusion Models through Step-Wise Adaptive Computation
Shengkun Tang, Yaqing Wang, Caiwen Ding et al.
Textual Grounding for Open-vocabulary Visual Information Extraction in Layout-diversified Documents
MENGJUN CHENG, Chengquan Zhang, Chang Liu et al.
ND-MRM: Neuronal Diversity Inspired Multisensory Recognition Model
Qixin Wang, Chaoqiong Fan, Tianyuan Jia et al.
Mining Fine-Grained Image-Text Alignment for Zero-Shot Captioning via Text-Only Training
Longtian Qiu, Shan Ning, Xuming He
Influential Exemplar Replay for Incremental Learning in Recommender Systems
Xinni Zhang, Yankai Chen, Chenhao Ma et al.
Evaluating the Adversarial Robustness of Semantic Segmentation: Trying Harder Pays Off
Levente Ferenc Halmosi, Bálint Mohos, Márk Jelasity
Triple Feature Disentanglement for One-Stage Adaptive Object Detection
Haoan Wang, Shilong Jia, Tieyong Zeng et al.
Object-Aware Domain Generalization for Object Detection
WooJu Lee, Dasol Hong, Hyungtae Lim et al.
Integrated Decision Gradients: Compute Your Attributions Where the Model Makes Its Decision
Weibo Gao, Qi Liu, Hao Wang et al.
VITA: ‘Carefully Chosen and Weighted Less’ Is Better in Medication Recommendation
Recurrent Partial Kernel Network for Efficient Optical Flow Estimation
A Non-parametric Graph Clustering Framework for Multi-View Data
Shengju Yu, Siwei Wang, Zhibin Dong et al.
A Diffusion Model for Simulation Ready Coronary Anatomy with Morpho-skeletal Control
Karim Kadry, Shreya Gupta, Jonas Sogbadji et al.
FreestyleRet: Retrieving Images from Style-Diversified Queries
Hao Li, Yanhao Jia, Peng Jin et al.
Diffusion-Generated Pseudo-Observations for High-Quality Sparse-View Reconstruction
Xinhang Liu, Jiaben Chen, Shiu-Hong Kao et al.
Spectral Subsurface Scattering for Material Classification
Haejoon Lee, Aswin C. Sankaranarayanan
High-Resolution and Few-shot View Synthesis from Asymmetric Dual-lens Inputs
Ruikang Xu, Mingde Yao, Yue Li et al.
MapDistill: Boosting Efficient Camera-based HD Map Construction via Camera-LiDAR Fusion Model Distillation
Xiaoshuai Hao, Ruikai Li, Hui Zhang et al.
Learning to Generate Conditional Tri-plane for 3D-aware Expression Controllable Portrait Animation
Taekyung Ki, Dongchan Min, Gyeongsu Chae
Uncovering and Mitigating the Hidden Chasm: A Study on the Text-Text Domain Gap in Euphemism Identification
A Compiler for Weak Decomposable Negation Normal Form
Petr Illner, Petr Kucera
Combinatorial Stochastic-Greedy Bandit
Fares Fourati, Christopher John Quinn, Mohamed-Slim Alouini et al.
Optimization-based Uncertainty Attribution Via Learning Informative Perturbations
Hanjing Wang, Bashirul Azam Biswas, Qiang Ji
TAPTR: Tracking Any Point with Transformers as Detection
Hongyang Li, Hao Zhang, Shilong Liu et al.
Non-stationary Projection-Free Online Learning with Dynamic and Adaptive Regret Guarantees
Yibo Wang, Wenhao Yang, Wei Jiang et al.
A New Benchmark and Model for Challenging Image Manipulation Detection
Zhenfei Zhang, Mingyang Li, Ming-Ching Chang
Attacks on Continual Semantic Segmentation by Perturbing Incremental Samples
Zhidong Yu, Wei Yang, Xike Xie et al.
To Supervise or Not to Supervise: Understanding and Addressing the Key Challenges of Point Cloud Transfer Learning
Souhail Hadgi, Lei Li, Maks Ovsjanikov
Unveiling Details in the Dark: Simultaneous Brightening and Zooming for Low-Light Image Enhancement
Ziyu Yue, Jiaxin Gao, Zhixun Su
3D Reconstruction of Objects in Hands without Real World 3D Supervision
Aditya Prakash, Matthew Chang, Matthew Jin et al.
Text2Place: Affordance-aware Text Guided Human Placement
Rishubh Parihar, Harsh Gupta, Sachidanand VS et al.
PreLAR: World Model Pre-training with Learnable Action Representation
Lixuan Zhang, Meina Kan, Shiguang Shan et al.
Non-flat ABA Is an Instance of Bipolar Argumentation
Markus Ulbricht, Nico Potyka, Anna Rapberger et al.
EDA: Evolving and Distinct Anchors for Multimodal Motion Prediction
Longzhong Lin, Xuewu Lin, Tianwei Lin et al.
CMG-Net: Robust Normal Estimation for Point Clouds via Chamfer Normal Distance and Multi-Scale Geometry
Yingrui Wu, Mingyang Zhao, Keqiang Li et al.
Learning Reduced Fluid Dynamics
Zherong Pan, Xifeng Gao, Kui Wu
Beyond Expected Return: Accounting for Policy Reproducibility When Evaluating Reinforcement Learning Algorithms
Manon Flageat, Bryan Lim, Antoine Cully
Quantifying and Analyzing Entity-Level Memorization in Large Language Models
Zhenhong Zhou, Jiuyang Xiang, Chaomeng Chen et al.
N-gram Unsupervised Compoundation and Feature Injection for Better Symbolic Music Understanding
Jinhao Tian, Zuchao Li, Jiajia Li et al.
Domain-Hallucinated Updating for Multi-Domain Face Anti-spoofing
Chengyang Hu, Ke-Yue Zhang, Taiping Yao et al.
De-biased Attention Supervision for Text Classification with Causality
Yiquan Wu, Yifei Liu, Ziyu Zhao et al.
SeA: Semantic Adversarial Augmentation for Last Layer Features from Unsupervised Representation Learning
Qi Qian, Yuanhong Xu, JUHUA HU
GENIXER: Empowering Multimodal Large Language Models as a Powerful Data Generator
Hengyuan Zhao, Pan Zhou, Mike Zheng Shou
Pose Guided Fine-Grained Sign Language Video Generation
Tongkai Shi, Lianyu Hu, Fanhua Shang et al.
MGQFormer: Mask-Guided Query-Based Transformer for Image Manipulation Localization
Kunlun Zeng, Ri Cheng, Weimin Tan et al.
An Optimal Transport View for Subspace Clustering and Spectral Clustering
Yuguang Yan, Zhihao Xu, Canlin Yang et al.
iNeMo: Incremental Neural Mesh Models for Robust Class-Incremental Learning
Tom Fischer, Yaoyao Liu, Artur Jesslen et al.
Relightable 3D Gaussians: Realistic Point Cloud Relighting with BRDF Decomposition and Ray Tracing
Jian Gao, chun gu, Youtian Lin et al.
Null Space Matters: Range-Null Decomposition for Consistent Multi-Contrast MRI Reconstruction
Jiacheng Chen, Jiawei Jiang, Fei Wu et al.
ConsistNER: Towards Instructive NER Demonstrations for LLMs with the Consistency of Ontology and Context
Chenxiao Wu, Ke Wenjun, Peng Wang et al.
LATTE3D: Large-scale Amortized Text-To-Enhanced3D Synthesis
Kevin Xie, Tianshi Cao, Jonathan P Lorraine et al.
A Closer Look at GAN Priors: Exploiting Intermediate Features for Enhanced Model Inversion Attacks
Yixiang Qiu, Hao Fang, Hongyao Yu et al.
FFT-Based Dynamic Token Mixer for Vision
Yuki Tatsunami, Masato Taki
BlazeBVD: Make Scale-Time Equalization Great Again for Blind Video Deflickering
Xinmin Qiu, Congying Han, Zicheng Zhang et al.
Eliminating Feature Ambiguity for Few-Shot Segmentation
Qianxiong Xu, Guosheng Lin, Chen Change Loy et al.
QAGait: Revisit Gait Recognition from a Quality Perspective
Zengbin Wang, Saihui Hou, Man Zhang et al.
AnimateMe: 4D Facial Expressions via Diffusion Models
Dimitrios Gerogiannis, Foivos Paraperas Papantoniou, Rolandos Alexandros Potamias et al.
Data Shunt: Collaboration of Small and Large Models for Lower Costs and Better Performance
Dong Chen, Yueting Zhuang, Shuo Zhang et al.
SkyScript: A Large and Semantically Diverse Vision-Language Dataset for Remote Sensing
Zhecheng Wang, Rajanie Prabha, Tianyuan Huang et al.
Frozen CLIP Transformer Is an Efficient Point Cloud Encoder
Xiaoshui Huang, Zhou Huang, Sheng Li et al.
Curved Diffusion: A Generative Model With Optical Geometry Control
Andrey Voynov, Amir Hertz, Moab Arar et al.
FMRNet: Image Deraining via Frequency Mutual Revision
Kui Jiang, Junjun Jiang, Xianming Liu et al.
Adversarial Robust Safeguard for Evading Deep Facial Manipulation
Jiazhi Guan, Yi Zhao, Zhuoer Xu et al.
Get Your Embedding Space in Order: Domain-Adaptive Regression for Forest Monitoring
Sizhuo Li, Dimitri Gominski, Martin Brandt et al.
SAFARI: Adaptive Sequence Transformer for Weakly Supervised Referring Expression Segmentation
Sayan Nag, Koustava Goswami, Srikrishna Karanam
The Fabrication of Reality and Fantasy: Scene Generation with LLM-Assisted Prompt Interpretation
Yi Yao, Chan-Feng Hsu, Jhe-Hao Lin et al.
MyVLM: Personalizing VLMs for User-Specific Queries
Yuval Alaluf, Elad Richardson, Sergey Tulyakov et al.
The Causal Impact of Credit Lines on Spending Distributions
Yijun Li, Cheuk Hang Leung, Xiangqian Sun et al.
AdaFormer: Efficient Transformer with Adaptive Token Sparsification for Image Super-resolution
Xiaotong Luo, zekun Ai, Qiuyuan Liang et al.
Collaborative Weakly Supervised Video Correlation Learning for Procedure-Aware Instructional Video Analysis
Tianyao He, Huabin Liu, Yuxi Li et al.
Model Counting and Sampling via Semiring Extensions
Andreas Goral, Joachim Giesen, Mark Blacher et al.
ASWT-SGNN: Adaptive Spectral Wavelet Transform-Based Self-Supervised Graph Neural Network
Ruyue Liu, Rong Yin, Yong Liu et al.
Fast Inter-frame Motion Prediction for Compressed Dynamic Point Cloud Attribute Enhancement
Wang Liu, Wei Gao, Xingming Mu
SUF: Stabilized Unconstrained Fine-Tuning for Offline-to-Online Reinforcement Learning
Jiaheng Feng, Mingxiao Feng, Haolin Song et al.
STAS: Spatial-Temporal Return Decomposition for Multi-Agent Reinforcement Learning
Sirui Chen, Zhaowei Zhang, Yaodong Yang et al.
Accelerating Text-to-Image Editing via Cache-Enabled Sparse Diffusion Inference
Zihao Yu, Haoyang Li, Fangcheng Fu et al.
Generalize for Future: Slow and Fast Trajectory Learning for CTR Prediction
Jian Zhu, Congcong Liu, Xue Jiang et al.
CLIP-DPO: Vision-Language Models as a Source of Preference for Fixing Hallucinations in LVLMs
Yassine Ouali, Adrian Bulat, Brais Martinez et al.
Few-shot Defect Image Generation based on Consistency Modeling
Qingfeng Shi, Jing Wei, Fei Shen et al.
MAD-DR: Map Compression for Visual Localization with Matchness Aware Descriptor Dimension Reduction
Qiang Wang
MELO: Enhancing Model Editing with Neuron-Indexed Dynamic LoRA
Lang Yu, Qin Chen, Jie Zhou et al.
CAMEL: Capturing Metaphorical Alignment with Context Disentangling for Multimodal Emotion Recognition
Linhao Zhang, Li Jin, Guangluan Xu et al.
VELMA: Verbalization Embodiment of LLM Agents for Vision and Language Navigation in Street View
Raphael Schumann, Wanrong Zhu, Weixi Feng et al.
Motion-Guided Latent Diffusion for Temporally Consistent Real-world Video Super-resolution
Xi Yang, Chenhang He, Jianqi Ma et al.
MetaMix: Meta-State Precision Searcher for Mixed-Precision Activation Quantization
Han-Byul Kim, Joo Hyung Lee, Sungjoo Yoo et al.
Shaping Up SHAP: Enhancing Stability through Layer-Wise Neighbor Selection
Gwladys Kelodjou, Laurence Rozé, Véronique Masson et al.
Optimised Storage for Datalog Reasoning
Xinyue Zhang, Pan Hu, Yavor Nenov et al.
LayerDiff: Exploring Text-guided Multi-layered Composable Image Synthesis via Layer-Collaborative Diffusion Model
Runhui Huang, Kaixin Cai, Jianhua Han et al.
RANRAC: Robust Neural Scene Representations via Random Ray Consensus
Benno Buschmann, Andreea Dogaru, Elmar Eisemann et al.
COD: Learning Conditional Invariant Representation for Domain Adaptation Regression
Hao-Ran Yang, Chuan-Xian Ren, You-Wei Luo
LLM as Copilot for Coarse-grained Vision-and-Language Navigation
Yanyuan Qiao, Qianyi Liu, Jiajun Liu et al.
G2fR: Frequency Regularization in Grid-based Feature Encoding Neural Radiance Fields
Shuxiang Xie, Shuyi Zhou, Ken Sakurada et al.
Critic-Guided Decision Transformer for Offline Reinforcement Learning
Yuanfu Wang, Chao Yang, Ying Wen et al.
Exploring One-Shot Semi-supervised Federated Learning with Pre-trained Diffusion Models
Mingzhao Yang, Shangchao Su, Bin Li et al.
All Should Be Equal in the Eyes of LMs: Counterfactually Aware Fair Text Generation
Pragyan Banerjee, Abhinav Java, Surgan Jandial et al.
WAVE: Warping DDIM Inversion Features for Zero-shot Text-to-Video Editing
Yutang Feng, Sicheng Gao, Yuxiang Bao et al.
Online Markov Decision Processes Configuration with Continuous Decision Space
Davide Maran, Pierriccardo Olivieri, Francesco Emanuele Stradi et al.
Spiking Wavelet Transformer
Yuetong Fang, Ziqing Wang, Lingfeng Zhang et al.
Physically Plausible Color Correction for Neural Radiance Fields
Qi Zhang, Ying Feng, HONGDONG LI
Data Distribution Distilled Generative Model for Generalized Zero-Shot Recognition
Yijie Wang, Mingjian Hong, Luwen Huangfu et al.
Safe Abductive Learning in the Presence of Inaccurate Rules
Xiao-Wen Yang, Jie-Jing Shao, Wei-Wei Tu et al.
FedGCR: Achieving Performance and Fairness for Federated Learning with Distinct Client Types via Group Customization and Reweighting
Shu-Ling Cheng, Chin-Yuan Yeh, Ting-An Chen et al.
Augmented Commonsense Knowledge for Remote Object Grounding
Bahram Mohammadi, Yicong Hong, Yuankai Qi et al.