Tat-Seng Chua
47
Papers
578
Total Citations
Papers (47)
RLHF-V: Towards Trustworthy MLLMs via Behavior Alignment from Fine-grained Correctional Human Feedback
CVPR 2024
344
citations
Towards 3D Molecule-Text Interpretation in Language Models
ICLR 2024
73
citations
Towards Semantic Equivalence of Tokenization in Multimodal LLM
ICLR 2025
57
citations
Language Representations Can be What Recommenders Need: Findings and Potentials
ICLR 2025
23
citations
GOODAT: Towards Test-Time Graph Out-of-Distribution Detection
AAAI 2024
20
citations
Temporally and Distributionally Robust Optimization for Cold-Start Recommendation
AAAI 2024
18
citations
LightPROF: A Lightweight Reasoning Framework for Large Language Model on Knowledge Graph
AAAI 2025
14
citations
Benchmarking Multimodal CoT Reward Model Stepwise by Visual Program
ICCV 2025
10
citations
Boosting Virtual Agent Learning and Reasoning: A Step-Wise, Multi-Dimensional, and Generalist Reward Model with Benchmark
ICML 2025
4
citations
JavisGPT: A Unified Multi-modal LLM for Sounding-Video Comprehension and Generation
NeurIPS 2025
4
citations
L-MTP: Leap Multi-Token Prediction Beyond Adjacent Context for Large Language Models
NeurIPS 2025
3
citations
IGD: Token Decisiveness Modeling via Information Gain in LLMs for Personalized Recommendation
NeurIPS 2025
3
citations
Uncertainty-Driven Expert Control: Enhancing the Reliability of Medical Vision-Language Models
ICCV 2025
2
citations
Attend and Enrich: Enhanced Visual Prompt for Zero-Shot Learning
AAAI 2025
1
citations
Optimize Incompatible Parameters Through Compatibility-aware Knowledge Integration
AAAI 2025
1
citations
Neural Causal Graph for Interpretable and Intervenable Classification
ICLR 2025
1
citations
Learning Image and User Features for Recommendation in Social Networks
ICCV 2015
0
citations
Discovering Spatio-Temporal Rationales for Video Question Answering
ICCV 2023
0
citations
Gradient-Regulated Meta-Prompt Learning for Generalizable Vision-Language Models
ICCV 2023
0
citations
Visual Relation Grounding in Videos
ECCV 2020
0
citations
Fine-Grained Scene Graph Generation with Data Transfer
ECCV 2022
0
citations
Video Graph Transformer for Video Question Answering
ECCV 2022
0
citations
Combating Multimodal LLM Hallucination via Bottom-Up Holistic Reasoning
AAAI 2025
0
citations
Causal-Entity Reflected Egocentric Traffic Accident Video Synthesis
ICCV 2025
0
citations
Improving Expressive Power of Spectral Graph Neural Networks with Eigenvalue Correction
AAAI 2024
0
citations
Auto-Encoding Morph-Tokens for Multimodal LLM
ICML 2024
0
citations
NExT-GPT: Any-to-Any Multimodal LLM
ICML 2024
0
citations
Momentor: Advancing Video Large Language Model with Fine-Grained Temporal Reasoning
ICML 2024
0
citations
NExT-Chat: An LMM for Chat, Detection and Segmentation
ICML 2024
0
citations
Online Collaborative Learning for Open-Vocabulary Visual Classifiers
CVPR 2016
0
citations
Visual Translation Embedding Network for Visual Relation Detection
CVPR 2017
0
citations
SCA-CNN: Spatial and Channel-Wise Attention in Convolutional Networks for Image Captioning
CVPR 2017
0
citations
Meta-Transfer Learning for Few-Shot Learning
CVPR 2019
0
citations
Hyperbolic Visual Embedding Learning for Zero-Shot Recognition
CVPR 2020
0
citations
SESS: Self-Ensembling Semi-Supervised 3D Object Detection
CVPR 2020
0
citations
NExT-QA: Next Phase of Question-Answering to Explaining Temporal Actions
CVPR 2021
0
citations
Few-Shot 3D Point Cloud Semantic Segmentation
CVPR 2021
0
citations
Invariant Grounding for Video Question Answering
CVPR 2022
0
citations
Learning to Self-Train for Semi-Supervised Few-Shot Classification
NeurIPS 2019
0
citations
Neural Sparse Voxel Fields
NeurIPS 2020
0
citations
Towards Multi-Grained Explainability for Graph Neural Networks
NeurIPS 2021
0
citations
Incorporating Bias-aware Margins into Contrastive Loss for Collaborative Filtering
NeurIPS 2022
0
citations
LasUIE: Unifying Information Extraction with Latent Adaptive Structure-aware Generative Language Model
NeurIPS 2022
0
citations
Empowering Collaborative Filtering with Principled Adversarial Contrastive Loss
NeurIPS 2023
0
citations
VPGTrans: Transfer Visual Prompt Generator across LLMs
NeurIPS 2023
0
citations
Rethinking Tokenizer and Decoder in Masked Graph Modeling for Molecules
NeurIPS 2023
0
citations
Imagine That! Abstract-to-Intricate Text-to-Image Synthesis with Scene Graph Hallucination Diffusion
NeurIPS 2023
0
citations