Guo Chen

Papers: 10

Total Citations: 3,218

Affiliations: 1

Affiliations

Nanjing University

Papers (10)

InternVL: Scaling up Vision Foundation Models and Aligning for Generic Visual-Linguistic Tasks

MVBench: A Comprehensive Multi-modal Video Understanding Benchmark

EgoExoLearn: A Dataset for Bridging Asynchronous Ego- and Exo-centric View of Procedural Activities in Real World

CG-Bench: Clue-grounded Question Answering Benchmark for Long Video Understanding

Modeling Fine-Grained Hand-Object Dynamics for Egocentric Video Representation Learning

EgoExoBench: A Benchmark for First- and Third-person View Video Understanding in MLLMs

Retrieval-Augmented Egocentric Video Captioning

AVSegFormer: Audio-Visual Segmentation with Transformer

NeuralIndicator: Implicit Surface Reconstruction from Neural Indicator Priors

Memory-and-Anticipation Transformer for Online Action Understanding