{"date_created":"2023-09-13T12:43:14Z","day":"23","date_updated":"2023-09-13T12:44:00Z","status":"public","publication":"ICML 2021 Workshop on Unsupervised Reinforcement Learning","_id":"14332","citation":{"ama":"Träuble F, Dittadi A, Wuthrich M, et al. Representation learning for out-of-distribution generalization in reinforcement learning. In: <i>ICML 2021 Workshop on Unsupervised Reinforcement Learning</i>. ; 2021.","ieee":"F. Träuble <i>et al.</i>, “Representation learning for out-of-distribution generalization in reinforcement learning,” in <i>ICML 2021 Workshop on Unsupervised Reinforcement Learning</i>, Virtual, 2021.","apa":"Träuble, F., Dittadi, A., Wuthrich, M., Widmaier, F., Gehler, P. V., Winther, O., … Bauer, S. (2021). Representation learning for out-of-distribution generalization in reinforcement learning. In <i>ICML 2021 Workshop on Unsupervised Reinforcement Learning</i>. Virtual.","ista":"Träuble F, Dittadi A, Wuthrich M, Widmaier F, Gehler PV, Winther O, Locatello F, Bachem O, Schölkopf B, Bauer S. 2021. Representation learning for out-of-distribution generalization in reinforcement learning. ICML 2021 Workshop on Unsupervised Reinforcement Learning. ICML: International Conference on Machine Learning.","short":"F. Träuble, A. Dittadi, M. Wuthrich, F. Widmaier, P.V. Gehler, O. Winther, F. Locatello, O. Bachem, B. Schölkopf, S. Bauer, in:, ICML 2021 Workshop on Unsupervised Reinforcement Learning, 2021.","chicago":"Träuble, Frederik, Andrea Dittadi, Manuel Wuthrich, Felix Widmaier, Peter Vincent Gehler, Ole Winther, Francesco Locatello, Olivier Bachem, Bernhard Schölkopf, and Stefan Bauer. “Representation Learning for Out-of-Distribution Generalization in Reinforcement Learning.” In <i>ICML 2021 Workshop on Unsupervised Reinforcement Learning</i>, 2021.","mla":"Träuble, Frederik, et al. “Representation Learning for Out-of-Distribution Generalization in Reinforcement Learning.” <i>ICML 2021 Workshop on Unsupervised Reinforcement Learning</i>, 2021."},"type":"conference","title":"Representation learning for out-of-distribution generalization in reinforcement learning","extern":"1","language":[{"iso":"eng"}],"conference":{"location":"Virtual","name":"ICML: International Conference on Machine Learning","start_date":"2021-07-23","end_date":"2021-07-23"},"oa_version":"None","department":[{"_id":"FrLo"}],"year":"2021","author":[{"first_name":"Frederik","full_name":"Träuble, Frederik","last_name":"Träuble"},{"last_name":"Dittadi","first_name":"Andrea","full_name":"Dittadi, Andrea"},{"full_name":"Wuthrich, Manuel","first_name":"Manuel","last_name":"Wuthrich"},{"first_name":"Felix","full_name":"Widmaier, Felix","last_name":"Widmaier"},{"last_name":"Gehler","full_name":"Gehler, Peter Vincent","first_name":"Peter Vincent"},{"last_name":"Winther","first_name":"Ole","full_name":"Winther, Ole"},{"last_name":"Locatello","first_name":"Francesco","full_name":"Locatello, Francesco","orcid":"0000-0002-4850-0683","id":"26cfd52f-2483-11ee-8040-88983bcc06d4"},{"first_name":"Olivier","full_name":"Bachem, Olivier","last_name":"Bachem"},{"last_name":"Schölkopf","first_name":"Bernhard","full_name":"Schölkopf, Bernhard"},{"first_name":"Stefan","full_name":"Bauer, Stefan","last_name":"Bauer"}],"quality_controlled":"1","abstract":[{"lang":"eng","text":"Learning data representations that are useful for various downstream tasks is a cornerstone of artificial intelligence. While existing methods are typically evaluated on downstream tasks such as classification or generative image quality, we propose to assess representations through their usefulness in downstream control tasks, such as reaching or pushing objects. By training over 10,000 reinforcement learning policies, we extensively evaluate to what extent different representation properties affect out-of-distribution (OOD) generalization. Finally, we demonstrate zero-shot transfer of these policies from simulation to the real world, without any domain randomization or fine-tuning. This paper aims to establish the first systematic characterization of the usefulness of learned representations for real-world OOD downstream tasks."}],"publication_status":"published","date_published":"2021-07-23T00:00:00Z","article_processing_charge":"No","month":"07","user_id":"2DF688A6-F248-11E8-B48F-1D18A9856A87"}