PAPER_TITLE

FIRST_AUTHOR_LAST, FIRST_AUTHOR_FIRST; SECOND_AUTHOR_LAST, SECOND_AUTHOR_FIRST

Learning 3D Compliant Flow Matching Policies
from Force & Demonstration-Guided Simulation Data

Tianyu Li^* Yihan Li^* Zizhe Zhang Nadia Figueroa

^* Equal Contribution · University of Pennsylvania

ICRA 2026

Paper Code (Coming Soon)

Data Collection Deployment

Demo

∞

Sim Data

Zero-Shot Sim2Real

Vision

Force

Proprio

Compliant
Output

Overview

What We Built

This framework uses a lightweight scheme to generate force-informed training data entirely in simulation from a single human demo. With this data, we train point cloud + force attending policies with passive impedance outputs. At execution, our passive impedance controller ensures compliant, safe interactions. Together, these components enable robots to generalize from simple geometry in sim to unseen real-world objects and spatial configurations.

Results

See It In Action

Abstract

While visuomotor policy has made advancements in recent years, contact-rich tasks still remain a challenge. Robotic manipulation tasks that require continuous contact demand explicit handling of compliance and force. However, most visuomotor policies ignore compliance, overlooking the importance of physical interaction with the real world, often leading to excessive contact forces or fragile behavior under uncertainty. Introducing force information into vision-based imitation learning could help improve awareness of contacts, but could also require a lot of data to perform well. One remedy for data scarcity is to generate data in simulation, yet computationally taxing processes are required to generate data good enough not to suffer from the Sim2Real gap. In this work, we introduce a framework for generating force-informed data in simulation, instantiated by a single human demonstration, and show how coupling with a compliant policy improves the performance of a visuomotor policy learned from synthetic data. We validate our approach on real-robot tasks, including non-prehensile block flipping and a bi-manual object moving, where the learned policy exhibits reliable contact maintenance and adaptation to novel conditions.

Method

Framework Pipeline

**3D Compliant Visuomotor Policy Learning Framework:** Starting from a single simulation demonstration, we generate point cloud and force data by introducing virtual targets and applying Laplacian editing beyond the original demonstration. This augmented data is used to train a flow-matching policy that receives point cloud and force input and predicts actions, including an impedance parameter. At rollout time, the policy's output is synthesized into a state velocity field, which is then executed using a Passive Impedance Controller to ensure compliant behaviors. *While our data is only generated with one simple geometry for both tasks, the trained policies produce generalizable capabilities beyond the original shapes by using our framework.*

Data Generation

Trajectory Modulation Strategies

From a single demonstration, we generate diverse training data by combining Laplacian editing with force-informed virtual targets — no additional human demos or training needed. Each strategy adds a different axis of variation entirely in simulation.

Original reference trajectory — (a) Original

Force-informed trajectory — (b) Force-Informed

Laplacian editing trajectory — (c) Laplacian Editing

Force + Laplacian trajectory — (d) Force + Laplacian

Experiments

Non-Prehensile Block Flipping

Trained on a single simple-geometry demo in simulation. Evaluated zero-shot on unseen real objects and spatial configurations.

Object Generalization

Six real-world objects with varied shapes, masses, and surface properties — each flipped with the same policy.

Spatial Generalization

The same policy handling varied initial positions and orientations across the workspace.

Experiments

Bi-Manual Object Moving

Two arms coordinate through contact forces to transport objects — generalizing across object geometry and start/goal layouts.

Object Generalization

Different object shapes moved bi-manually with compliant contact maintenance throughout.

Spatial Generalization

Varied start and goal configurations across the table surface.

In The Wild

Human Intervention & Stacked Objects

Testing robustness beyond the evaluation set — the policy handles mid-execution human interference and attempts to flip stacked objects it was never trained on.

Analysis

Learned Impedance Behavior

Force and impedance profiles from Block Flipping. As the contact force builds along the y-axis during the flip, the learned compliance gain increases in response. The policy becomes softer to maintain safe, continuous contact without explicit rules.

Acknowledgments

We thank our anonymous reviewers, who provided thorough and fair feedback that improved the quality of our paper. This work was supported by the National Science Foundation (NSF) Foundational Research in Robotics (FRR) program under NSF CAREER Award Grant No. FRR-2443721.

Citation

BibTeX

@article{li2025flow,
  title={Flow with the Force Field: Learning 3D Compliant Flow Matching Policies from Force and Demonstration-Guided Simulation Data},
  author={Li, Tianyu and Li, Yihan and Zhang, Zizhe and Figueroa, Nadia},
  journal={arXiv preprint arXiv:2510.02738},
  year={2025}
}

Learning 3D Compliant Flow Matching Policies from Force & Demonstration-Guided Simulation Data