Guiding Data Collection via Factored Scaling Curves

Published 12 May 2025 in cs.RO, cs.AI, and cs.LG | (2505.07728v1)

Abstract: Generalist imitation learning policies trained on large datasets show great promise for solving diverse manipulation tasks. However, to ensure generalization to different conditions, policies need to be trained with data collected across a large set of environmental factor variations (e.g., camera pose, table height, distractors) $-$ a prohibitively expensive undertaking, if done exhaustively. We introduce a principled method for deciding what data to collect and how much to collect for each factor by constructing factored scaling curves (FSC), which quantify how policy performance varies as data scales along individual or paired factors. These curves enable targeted data acquisition for the most influential factor combinations within a given budget. We evaluate the proposed method through extensive simulated and real-world experiments, across both training-from-scratch and fine-tuning settings, and show that it boosts success rates in real-world tasks in new environments by up to 26% over existing data-collection strategies. We further demonstrate how factored scaling curves can effectively guide data collection using an offline metric, without requiring real-world evaluation at scale.

Abstract PDF Upgrade to Chat

Authors (10)

Summary

Overview of "Guiding Data Collection via Factored Scaling Curves"

The paper "Guiding Data Collection via Factored Scaling Curves" presents a novel method to address the challenge of deciding what data to collect and in what quantity by leveraging factored scaling curves in imitation learning contexts, particularly for robotic manipulation tasks. This approach is motivated by the high demand for diverse data in training policies that generalize well to varying environmental conditions, such as different camera poses, table heights, or the presence of distractors. The study recognizes that exhaustive data collection to cover all potential variations is prohibitively costly in terms of both time and resources.

Factored Scaling Curves

The central concept introduced is that of factored scaling curves, which provide a principled way to ascertain how policy performance improves as additional data is collected across specific environmental factor variations. By quantifying the effect of data scale on policy efficacy in varied conditions, these curves facilitate targeted data acquisition, emphasizing the most impactful factors within a given budget.

Methodology and Results

The authors demonstrate their methodology through both simulated and real-world experiments. These experiments are conducted in two primary settings: training from scratch and fine-tuning pre-trained Vision-Language-Action (VLA) models. Notably, the application of factored scaling curves led to a substantial increase in success rates—up to 26% over existing strategies—by effectively guiding data collection. This is achieved through optimizing the allocation of data budgets to critical factor combinations, based on insights derived from scaling curves.

Furthermore, the approach's predictive capacity extends beyond real-world evaluation, utilizing an offline metric based on policy embedding similarity. This metric forecasts the policy performance improvements achievable through additional data collection, thus offering a cost-efficient alternative by eschewing the need for extensive real-world trials.

Theoretical and Practical Implications

The theoretical contributions of this paper lie in the development of a systematic framework for prioritizing data collection efforts. Practically, this translates to a more efficient use of resources in robotic learning, allowing practitioners to focus efforts where they will yield the highest returns in terms of policy performance. The use of factored scaling curves exemplifies a methodical strategy to navigate the trade-offs inherent in data-driven AI model development, especially under constraints typical in real-world applications.

Future Directions

The authors suggest several potential future research avenues. One area includes refining the scaling law extrapolation to enhance predictive accuracy for larger datasets and longer horizons. Another lies in exploring alternative proxy metrics to refine data collection guidelines further without compromising on policy performance improvements. There is also room for applying the approach to broader domains within AI and robotics, where data collection efficiency is critical.

In summary, this paper contributes a compelling method for data collection in robotic imitation learning through the innovative use of factored scaling curves, enhancing both the theoretical understanding and practical execution of efficient data acquisition.