北京通用人工智能研究院BIGAI

为机器立心
为人文赋理

Constrained Update Projection Approach to Safe Policy Optimization

Towards Human-Level Bimanual Dexterous Manipulation with Reinforcement Learning

Online tensor low-rank

Online Tensor Low-Rank Representation for Streaming Data Clustering

MATE: Benchmarking Multi-Agent Reinforcement Learning in Distributed Target Coverage Control

TarGF Learning Target Gradient Field for Object Rearrangement

TarGF: Learning Target Gradient Field for Object Rearrangement

NIPS22_EgoTaskQA

EgoTaskQA: Understanding Human Tasks in Egocentric Videos