Tat-Seng Chua

47
Papers
578
Total Citations

Papers (47)

RLHF-V: Towards Trustworthy MLLMs via Behavior Alignment from Fine-grained Correctional Human Feedback

CVPR 2024
344
citations

Towards 3D Molecule-Text Interpretation in Language Models

ICLR 2024
73
citations

Towards Semantic Equivalence of Tokenization in Multimodal LLM

ICLR 2025
57
citations

Language Representations Can be What Recommenders Need: Findings and Potentials

ICLR 2025
23
citations

GOODAT: Towards Test-Time Graph Out-of-Distribution Detection

AAAI 2024arXiv
20
citations

Temporally and Distributionally Robust Optimization for Cold-Start Recommendation

AAAI 2024arXiv
18
citations

LightPROF: A Lightweight Reasoning Framework for Large Language Model on Knowledge Graph

AAAI 2025
14
citations

Benchmarking Multimodal CoT Reward Model Stepwise by Visual Program

ICCV 2025
10
citations

Boosting Virtual Agent Learning and Reasoning: A Step-Wise, Multi-Dimensional, and Generalist Reward Model with Benchmark

ICML 2025
4
citations

JavisGPT: A Unified Multi-modal LLM for Sounding-Video Comprehension and Generation

NeurIPS 2025arXiv
4
citations

L-MTP: Leap Multi-Token Prediction Beyond Adjacent Context for Large Language Models

NeurIPS 2025
3
citations

IGD: Token Decisiveness Modeling via Information Gain in LLMs for Personalized Recommendation

NeurIPS 2025
3
citations

Uncertainty-Driven Expert Control: Enhancing the Reliability of Medical Vision-Language Models

ICCV 2025
2
citations

Attend and Enrich: Enhanced Visual Prompt for Zero-Shot Learning

AAAI 2025
1
citations

Optimize Incompatible Parameters Through Compatibility-aware Knowledge Integration

AAAI 2025
1
citations

Neural Causal Graph for Interpretable and Intervenable Classification

ICLR 2025
1
citations

Learning Image and User Features for Recommendation in Social Networks

ICCV 2015
0
citations

Discovering Spatio-Temporal Rationales for Video Question Answering

ICCV 2023arXiv
0
citations

Gradient-Regulated Meta-Prompt Learning for Generalizable Vision-Language Models

ICCV 2023arXiv
0
citations

Visual Relation Grounding in Videos

ECCV 2020
0
citations

Fine-Grained Scene Graph Generation with Data Transfer

ECCV 2022
0
citations

Video Graph Transformer for Video Question Answering

ECCV 2022
0
citations

Combating Multimodal LLM Hallucination via Bottom-Up Holistic Reasoning

AAAI 2025
0
citations

Causal-Entity Reflected Egocentric Traffic Accident Video Synthesis

ICCV 2025
0
citations

Improving Expressive Power of Spectral Graph Neural Networks with Eigenvalue Correction

AAAI 2024arXiv
0
citations

Auto-Encoding Morph-Tokens for Multimodal LLM

ICML 2024
0
citations

NExT-GPT: Any-to-Any Multimodal LLM

ICML 2024
0
citations

Momentor: Advancing Video Large Language Model with Fine-Grained Temporal Reasoning

ICML 2024
0
citations

NExT-Chat: An LMM for Chat, Detection and Segmentation

ICML 2024
0
citations

Online Collaborative Learning for Open-Vocabulary Visual Classifiers

CVPR 2016
0
citations

Visual Translation Embedding Network for Visual Relation Detection

CVPR 2017arXiv
0
citations

SCA-CNN: Spatial and Channel-Wise Attention in Convolutional Networks for Image Captioning

CVPR 2017
0
citations

Meta-Transfer Learning for Few-Shot Learning

CVPR 2019
0
citations

Hyperbolic Visual Embedding Learning for Zero-Shot Recognition

CVPR 2020
0
citations

SESS: Self-Ensembling Semi-Supervised 3D Object Detection

CVPR 2020arXiv
0
citations

NExT-QA: Next Phase of Question-Answering to Explaining Temporal Actions

CVPR 2021
0
citations

Few-Shot 3D Point Cloud Semantic Segmentation

CVPR 2021arXiv
0
citations

Invariant Grounding for Video Question Answering

CVPR 2022
0
citations

Learning to Self-Train for Semi-Supervised Few-Shot Classification

NeurIPS 2019
0
citations

Neural Sparse Voxel Fields

NeurIPS 2020
0
citations

Towards Multi-Grained Explainability for Graph Neural Networks

NeurIPS 2021
0
citations

Incorporating Bias-aware Margins into Contrastive Loss for Collaborative Filtering

NeurIPS 2022
0
citations

LasUIE: Unifying Information Extraction with Latent Adaptive Structure-aware Generative Language Model

NeurIPS 2022
0
citations

Empowering Collaborative Filtering with Principled Adversarial Contrastive Loss

NeurIPS 2023
0
citations

VPGTrans: Transfer Visual Prompt Generator across LLMs

NeurIPS 2023
0
citations

Rethinking Tokenizer and Decoder in Masked Graph Modeling for Molecules

NeurIPS 2023
0
citations

Imagine That! Abstract-to-Intricate Text-to-Image Synthesis with Scene Graph Hallucination Diffusion

NeurIPS 2023
0
citations