Overparametrization helps offline-to-online generalization of closed-loop control from pixels

Lechner M, Hasani R, Amini A, Wang TH, Henzinger TA, Rus D. 2024. Overparametrization helps offline-to-online generalization of closed-loop control from pixels. Proceedings of the 2024 IEEE International Conference on Robotics and Automation. ICRA: Conference on Robotics and Automation, 2774–2782.

Download
No fulltext has been uploaded. References only!

Conference Paper | Published | English

Scopus indexed
Author
Lechner, MathiasISTA; Hasani, Ramin; Amini, Alexander; Wang, Tsun Hsuan; Henzinger, Thomas AISTA ; Rus, Daniela
Abstract
There is an ever-growing zoo of modern neural network models that can efficiently learn end-to-end control from visual observations. These advanced deep models, ranging from convolutional to Vision Transformers, from small to gigantic networks, have been extensively tested on offline image classification tasks. In this paper, we study these vision models with respect to the open-loop training to closed-loop generalization abilities, i.e., deployment realizes a causal feedback loop that is not present during training. This causality gap typically emerges in robotics applications such as autonomous driving, where a network is trained to imitate the control commands of a human. In this setting, two situations arise: 1) Closed-loop testing in-distribution, where the test environment shares properties with those of offline training data. 2) Closed-loop testing under distribution shifts and out-of-distribution. Contrary to recently reported results, we show that under proper training guidelines, all vision architectures perform indistinguishably well on in-distribution deployment, resolving the causality gap. In situation 2, We observe that scale is the strongest factor in improving closed-loop generalization regardless of the choice of the model architecture. Our results predict the trend that in the future we will see larger and larger models being used in offline-training-online-deployment imitation learning tasks in robotic applications.
Publishing Year
Date Published
2024-08-08
Proceedings Title
Proceedings of the 2024 IEEE International Conference on Robotics and Automation
Acknowledgement
This work was partially supported in parts by the ERC-2020-AdG 101020093. Additionally, it was partially sponsored by the United States Air Force Research Laboratory and the United States Air Force Artificial Intelligence Accelerator and was accomplished under Cooperative Agreement Number FA8750-19-2-1000. The views and conclusions contained in this document are those of the authors and should not be interpreted as representing the official policies, either expressed or implied, of the United States Air Force or the U.S. Government. The U.S. Government is authorized to reproduce and distribute reprints for Government purposes notwithstanding any copyright notation herein. This work was further supported by The Boeing Company and the Office of Naval Research (ONR) Grant N00014-18-1-2830.
Page
2774-2782
Conference
ICRA: Conference on Robotics and Automation
Conference Location
Yokohama, Japan
Conference Date
2024-05-13 – 2024-05-17
ISSN
IST-REx-ID

Cite this

Lechner M, Hasani R, Amini A, Wang TH, Henzinger TA, Rus D. Overparametrization helps offline-to-online generalization of closed-loop control from pixels. In: Proceedings of the 2024 IEEE International Conference on Robotics and Automation. Institute of Electrical and Electronics Engineers; 2024:2774-2782. doi:10.1109/ICRA57147.2024.10610284
Lechner, M., Hasani, R., Amini, A., Wang, T. H., Henzinger, T. A., & Rus, D. (2024). Overparametrization helps offline-to-online generalization of closed-loop control from pixels. In Proceedings of the 2024 IEEE International Conference on Robotics and Automation (pp. 2774–2782). Yokohama, Japan: Institute of Electrical and Electronics Engineers. https://doi.org/10.1109/ICRA57147.2024.10610284
Lechner, Mathias, Ramin Hasani, Alexander Amini, Tsun Hsuan Wang, Thomas A Henzinger, and Daniela Rus. “Overparametrization Helps Offline-to-Online Generalization of Closed-Loop Control from Pixels.” In Proceedings of the 2024 IEEE International Conference on Robotics and Automation, 2774–82. Institute of Electrical and Electronics Engineers, 2024. https://doi.org/10.1109/ICRA57147.2024.10610284.
M. Lechner, R. Hasani, A. Amini, T. H. Wang, T. A. Henzinger, and D. Rus, “Overparametrization helps offline-to-online generalization of closed-loop control from pixels,” in Proceedings of the 2024 IEEE International Conference on Robotics and Automation, Yokohama, Japan, 2024, pp. 2774–2782.
Lechner M, Hasani R, Amini A, Wang TH, Henzinger TA, Rus D. 2024. Overparametrization helps offline-to-online generalization of closed-loop control from pixels. Proceedings of the 2024 IEEE International Conference on Robotics and Automation. ICRA: Conference on Robotics and Automation, 2774–2782.
Lechner, Mathias, et al. “Overparametrization Helps Offline-to-Online Generalization of Closed-Loop Control from Pixels.” Proceedings of the 2024 IEEE International Conference on Robotics and Automation, Institute of Electrical and Electronics Engineers, 2024, pp. 2774–82, doi:10.1109/ICRA57147.2024.10610284.

Export

Marked Publications

Open Data ISTA Research Explorer

Search this title in

Google Scholar
ISBN Search