科研成果

Towards Brigding the Gap Between Large-scale Pretraining and Efficient Finetuning for Humanoid Control

STVG-R1: Incentivizing Instance-Level Reasoning and Grounding in Videos via Reinforcement Learning

MVR:Multi-view Video Reward Shaping for Reinforcement Learning

MILR: Improving Multimodal Image Generation via Test-Time Latent Reasoning

When Large Multimodal Models Confront Evolving Knowledge: Challenges and Explorations

Learning What Matters Now: Dynamic Preference Inference under Contextual Shifts