Jie Tang

30
Papers
1,836
Total Citations

Papers (30)

CogVideoX: Text-to-Video Diffusion Models with An Expert Transformer

ICLR 2025
1,318
citations

LVBench: An Extreme Long Video Understanding Benchmark

ICCV 2025
208
citations

KoLA: Carefully Benchmarking World Knowledge of Large Language Models

ICLR 2024
85
citations

VisualAgentBench: Towards Large Multimodal Models as Visual Foundation Agents

ICLR 2025
67
citations

Bilateral Propagation Network for Depth Completion

CVPR 2024
51
citations

Scaling Speech-Text Pre-training with Synthetic Interleaved Data

ICLR 2025
39
citations

CATANet: Efficient Content-Aware Token Aggregation for Lightweight Image Super-Resolution

CVPR 2025
23
citations

Sketch and Refine: Towards Fast and Accurate Lane Detection

AAAI 2024
20
citations

SPaR: Self-Play with Tree-Search Refinement to Improve Instruction-Following in Large Language Models

ICLR 2025
12
citations

TriSampler: A Better Negative Sampling Principle for Dense Retrieval

AAAI 2024
12
citations

Small Language Model Makes an Effective Long Text Extractor

AAAI 2025
1
citation

Towards Efficient Exact Optimization of Language Model Alignment

ICML 2024
0
citations

AutoLUT: LUT-Based Image Super-Resolution with Automatic Sampling and Adaptive Residual Learning

CVPR 2025
0
citations

MotionBench: Benchmarking and Improving Fine-grained Video Motion Understanding for Vision Language Models

CVPR 2025
0
citations

VPO: Aligning Text-to-Video Generation Models with Prompt Optimization

ICCV 2025
0
citations

CogAgent: A Visual Language Model for GUI Agents

CVPR 2024
0
citations

Residual Feature Aggregation Network for Image Super-Resolution

CVPR 2020
0
citations

BodyGAN: General-Purpose Controllable Neural Human Body Generation

CVPR 2022
0
citations

Robust Object Modeling for Visual Tracking

ICCV 2023
0
citations

Bandit Learning with Implicit Feedback

NeurIPS 2018
0
citations

CogLTX: Applying BERT to Long Texts

NeurIPS 2020
0
citations

A Matrix Chernoff Bound for Markov Chains and Its Application to Co-occurrence Matrices

NeurIPS 2020
0
citations

Graph Random Neural Networks for Semi-Supervised Learning on Graphs

NeurIPS 2020
0
citations

CogView: Mastering Text-to-Image Generation via Transformers

NeurIPS 2021
0
citations

Adaptive Diffusion in Graph Neural Networks

NeurIPS 2021
0
citations

A Hierarchical Reinforcement Learning Based Optimization Framework for Large-scale Dynamic Pickup and Delivery Problems

NeurIPS 2021
0
citations

UFC-BERT: Unifying Multi-Modal Controls for Conditional Image Synthesis

NeurIPS 2021
0
citations

CogView2: Faster and Better Text-to-Image Generation via Hierarchical Transformers

NeurIPS 2022
0
citations

Video PreTraining (VPT): Learning to Act by Watching Unlabeled Online Videos

NeurIPS 2022
0
citations

ImageReward: Learning and Evaluating Human Preferences for Text-to-Image Generation

NeurIPS 2023
0
citations