You can not select more than 25 topics Topics must start with a letter or number, can include dashes ('-') and can be up to 35 characters long.
 
 

1.8 KiB

BFM-Zero: Promptable Behavioral Foundation Model for Humanoid Control

Source: https://arxiv.org/abs/2511.04131 Fetched: 2026-02-13 Type: Research Paper


Paper Information

  • arXiv ID: 2511.04131
  • Authors: Yitang Li, Zhengyi Luo, Tonghe Zhang, Cunxi Dai, and nine additional collaborators
  • Submission Date: November 6, 2025
  • Field: Computer Science - Robotics

Abstract

BFM-Zero is a framework for training humanoid robots using unsupervised reinforcement learning. The system enables a single policy to handle multiple tasks through a shared latent representation without requiring retraining.

Core Methodology

Latent Space Architecture

The framework "learns an effective shared latent representation that embeds motions, goals, and rewards into a common space." This unified representation enables versatile control approaches.

Technical Foundations

Rather than traditional on-policy RL methods, BFM-Zero builds on unsupervised reinforcement learning and Forward-Backward models, providing what the authors describe as an "objective-centric, explainable, and smooth latent representation of whole-body motions."

Sim-to-Real Transfer

Critical components include reward shaping, domain randomization, and history-dependent asymmetric learning to bridge simulation-reality gaps.

Real-World Implementation on Unitree G1

Hardware Platform

Experiments deployed on a Unitree G1 humanoid robot.

Inference Capabilities

The promptable policy supports multiple downstream applications: zero-shot motion tracking, goal-directed reaching, reward optimization, and few-shot adaptation without retraining.

Significance

BFM-Zero provides a unified foundation model approach to humanoid control, enabling versatile task execution through promptable interfaces rather than task-specific training.