Jie Chen

Google Scholar OpenReview

74

Papers

2,570

Total Citations

2

h-index

Papers (74)

DETRs Beat YOLOs on Real-time Object Detection

ParCo: Part-Coordinating Text-to-Motion Synthesis

Adversarial Diffusion Compression for Real-World Image Super-Resolution

Hyperspectral Image Reconstruction via Combinatorial Embedding of Cross-Channel Spatio-Spectral Clues

Multimodal Large Language Models for Inverse Molecular Design with Retrosynthetic Planning

Local Action-Guided Motion Diffusion Model for Text-to-Motion Generation

CF-NeRF: Camera Parameter Free Neural Radiance Fields with Incremental Learning

iSegMan: Interactive Segment-and-Manipulate 3D Gaussians

Foundation Molecular Grammar: Multi-Modal Foundation Models Induce Interpretable Molecular Graph Languages

DASH: 4D Hash Encoding with Self-Supervised Decomposition for Real-Time Dynamic Scene Rendering

Tune-Your-Style: Intensity-tunable 3D Style Transfer with Gaussian Splatting

Cross-View Graph Consistency Learning for Invariant Graph Representations

Efficient Spiking Point Mamba for Point Cloud Analysis

FaceChain-SuDe: Building Derived Class to Inherit Category Attributes for One-shot Subject-Driven Generation

Directed Graph Grammars for Sequence-based Learning

GPEN: Global Position Encoding Network for Enhanced Subgraph Representation Learning

Boundary Exploration for Bayesian Optimization With Unknown Physical Constraints

Representing Molecules as Random Walks Over Interpretable Grammars

Self-Learning Scene-Specific Pedestrian Detectors Using a Progressive Latent Model

SRN: Side-output Residual Network for Object Symmetry Detection in the Wild

Robust Video Content Alignment and Compensation for Rain Removal in a CNN Framework

Light Field Spatial Super-Resolution via Deep Combinatorial Geometry Embedding and Structural Consistency Regularization

AD-Cluster: Augmented Discriminative Clustering for Domain Adaptive Person Re-Identification

Learning a Weakly-Supervised Video Actor-Action Segmentation Model With a Wise Selection

CoLA: Weakly-Supervised Temporal Action Localization With Snippet Contrastive Learning

Discover Cross-Modality Nuances for Visible-Infrared Person Re-Identification

Geometry-Aware Guided Loss for Deep Crack Recognition

Training-Free Transformer Architecture Search

ViSTA: Vision and Scene Text Aggregation for Cross-Modal Retrieval

ACSeg: Adaptive Conceptualization for Unsupervised Semantic Segmentation

A Unified Pyramid Recurrent Network for Video Frame Interpolation

Out-of-Candidate Rectification for Weakly Supervised Semantic Segmentation

Video-Text As Game Players: Hierarchical Banzhaf Interaction for Cross-Modal Representation Learning

From Node Interaction To Hop Interaction: New Effective and Scalable Graph Learning Paradigm

High-Frequency Stereo Matching Network

Fuzzy Positive Learning for Semi-Supervised Semantic Segmentation

Out-of-Distributed Semantic Pruning for Robust Semi-Supervised Learning

Attention on Attention for Image Captioning

CDNet: Centripetal Direction Network for Nuclear Instance Segmentation

ReCU: Reviving the Dead Weights in Binary Neural Networks

The Devil is in the Crack Orientation: A New Perspective for Crack Detection

LaPE: Layer-adaptive Position Embedding for Vision Transformers with Independent Layer Normalization

Towards Real-World Burst Image Super-Resolution: Benchmark and Method

TopoSeg: Topology-Aware Nuclear Instance Segmentation

Learning to Distill Global Representation for Sparse-View CT

DiffusionRet: Generative Text-Video Retrieval with Diffusion Model

Deep Multiview Clustering by Contrasting Cluster Assignments

Deep Spatial-angular Regularization for Compressive Light Field Reconstruction over Coded Apertures

Temporal-MPI: Enabling Multi-Plane Images for Dynamic Scene Modelling via Temporal Basis Learning

Locality Guidance for Improving Vision Transformers on Tiny Datasets

When Active Learning Meets Implicit Semantic Data Augmentation

NDF: Neural Deformable Fields for Dynamic Human Modelling

Solving Most Systems of Random Quadratic Equations

Multi-granularity Interaction Simulation for Unsupervised Interactive Segmentation

Temporal-aware Query Routing for Real-time Video Instance Segmentation

CLEP: A Novel Contrastive Learning Method for Evolutionary Reentrancy Vulnerability Detection

Defense Against Model Stealing Based on Account-Aware Distribution Discrepancy

Adversarial Learning Under Hybrid Perturbations for Robust Acute Lymphoblastic Leukemia Classification

Aligning Instance Brownian Bridge with Texts for Open-Vocabulary Video Instance Segmentation

DigitalLLaVA: Incorporating Digital Cognition Capability for Physical World Comprehension in Multimodal LLMs

Attack-inspired Calibration Loss for Calibrating Crack Recognition

Parallel Vertex Diffusion for Unified Visual Grounding

Practical Privacy-Preserving MLaaS: When Compressive Sensing Meets Generative Networks

Secure Distributed Sparse Gaussian Process Models Using Multi-Key Homomorphic Encryption

GraCo: Granularity-Controllable Interactive Segmentation

Mind Marginal Non-Crack Regions: Clustering-Inspired Representation Learning for Crack Segmentation

Constrained Generation of Semantically Valid Graphs via Regularizing Variational Autoencoders

Adaptively Aligned Image Captioning via Adaptive Attention Time

Online Convex Optimization Over Erdos-Renyi Random Networks

CentripetalText: An Efficient Text Instance Representation for Scene Text Detection

Expectation-Maximization Contrastive Learning for Compact Video-and-Language Representations

Discover and Align Taxonomic Context Priors for Open-world Semi-Supervised Learning

Deep Speech 2 : End-to-End Speech Recognition in English and Mandarin

DAG-GNN: DAG Structure Learning with Graph Neural Networks