<?xml version="1.0" encoding="UTF-8"?>

<modsCollection xmlns:xlink="http://www.w3.org/1999/xlink" xmlns:xsi="http://www.w3.org/2001/XMLSchema-instance" xmlns="http://www.loc.gov/mods/v3" xsi:schemaLocation="http://www.loc.gov/mods/v3 http://www.loc.gov/standards/mods/v3/mods-3-3.xsd">
<mods version="3.3">

<genre>conference paper</genre>

<titleInfo><title>The role of pretrained representations for the OOD generalization of  reinforcement learning agents</title></titleInfo>


<note type="publicationStatus">published</note>


<note type="qualityControlled">yes</note>

<name type="personal">
  <namePart type="given">Andrea</namePart>
  <namePart type="family">Dittadi</namePart>
  <role><roleTerm type="text">author</roleTerm> </role></name>
<name type="personal">
  <namePart type="given">Frederik</namePart>
  <namePart type="family">Träuble</namePart>
  <role><roleTerm type="text">author</roleTerm> </role></name>
<name type="personal">
  <namePart type="given">Manuel</namePart>
  <namePart type="family">Wüthrich</namePart>
  <role><roleTerm type="text">author</roleTerm> </role></name>
<name type="personal">
  <namePart type="given">Felix</namePart>
  <namePart type="family">Widmaier</namePart>
  <role><roleTerm type="text">author</roleTerm> </role></name>
<name type="personal">
  <namePart type="given">Peter</namePart>
  <namePart type="family">Gehler</namePart>
  <role><roleTerm type="text">author</roleTerm> </role></name>
<name type="personal">
  <namePart type="given">Ole</namePart>
  <namePart type="family">Winther</namePart>
  <role><roleTerm type="text">author</roleTerm> </role></name>
<name type="personal">
  <namePart type="given">Francesco</namePart>
  <namePart type="family">Locatello</namePart>
  <role><roleTerm type="text">author</roleTerm> </role><identifier type="local">26cfd52f-2483-11ee-8040-88983bcc06d4</identifier><description xsi:type="identifierDefinition" type="orcid">0000-0002-4850-0683</description></name>
<name type="personal">
  <namePart type="given">Olivier</namePart>
  <namePart type="family">Bachem</namePart>
  <role><roleTerm type="text">author</roleTerm> </role></name>
<name type="personal">
  <namePart type="given">Bernhard</namePart>
  <namePart type="family">Schölkopf</namePart>
  <role><roleTerm type="text">author</roleTerm> </role></name>
<name type="personal">
  <namePart type="given">Stefan</namePart>
  <namePart type="family">Bauer</namePart>
  <role><roleTerm type="text">author</roleTerm> </role></name>







<name type="corporate">
  <namePart></namePart>
  <identifier type="local">FrLo</identifier>
  <role>
    <roleTerm type="text">department</roleTerm>
  </role>
</name>



<name type="conference">
  <namePart>ICLR: International Conference on Learning Representations</namePart>
</name>






<abstract lang="eng">Building sample-efficient agents that generalize out-of-distribution (OOD) in real-world settings remains a fundamental unsolved problem on the path towards achieving higher-level cognition. One particularly promising approach is to begin with low-dimensional, pretrained representations of our world, which should facilitate efficient downstream learning and generalization. By training 240 representations and over 10,000 reinforcement learning (RL) policies on a simulated robotic setup, we evaluate to what extent different properties of
pretrained VAE-based representations affect the OOD generalization of downstream agents. We observe that many agents are surprisingly robust to realistic distribution shifts, including the challenging sim-to-real case. In addition, we find that the generalization performance of a simple downstream proxy task reliably predicts the generalization performance of our RL agents
under a wide range of OOD settings. Such proxy tasks can thus be used to select pretrained representations that will lead to agents that generalize.</abstract>

<originInfo><dateIssued encoding="w3cdtf">2022</dateIssued><place><placeTerm type="text">Virtual</placeTerm></place>
</originInfo>
<language><languageTerm authority="iso639-2b" type="code">eng</languageTerm>
</language>



<relatedItem type="host"><titleInfo><title>10th International Conference on Learning Representations</title></titleInfo>
  <identifier type="arXiv">2107.05686</identifier>
<part>
</part>
</relatedItem>

<note type="extern">yes</note>
<extension>
<bibliographicCitation>
<ista>Dittadi A, Träuble F, Wüthrich M, Widmaier F, Gehler P, Winther O, Locatello F, Bachem O, Schölkopf B, Bauer S. 2022. The role of pretrained representations for the OOD generalization of  reinforcement learning agents. 10th International Conference on Learning Representations. ICLR: International Conference on Learning Representations.</ista>
<mla>Dittadi, Andrea, et al. “The Role of Pretrained Representations for the OOD Generalization of  Reinforcement Learning Agents.” &lt;i&gt;10th International Conference on Learning Representations&lt;/i&gt;, 2022.</mla>
<apa>Dittadi, A., Träuble, F., Wüthrich, M., Widmaier, F., Gehler, P., Winther, O., … Bauer, S. (2022). The role of pretrained representations for the OOD generalization of  reinforcement learning agents. In &lt;i&gt;10th International Conference on Learning Representations&lt;/i&gt;. Virtual.</apa>
<ieee>A. Dittadi &lt;i&gt;et al.&lt;/i&gt;, “The role of pretrained representations for the OOD generalization of  reinforcement learning agents,” in &lt;i&gt;10th International Conference on Learning Representations&lt;/i&gt;, Virtual, 2022.</ieee>
<ama>Dittadi A, Träuble F, Wüthrich M, et al. The role of pretrained representations for the OOD generalization of  reinforcement learning agents. In: &lt;i&gt;10th International Conference on Learning Representations&lt;/i&gt;. ; 2022.</ama>
<chicago>Dittadi, Andrea, Frederik Träuble, Manuel Wüthrich, Felix Widmaier, Peter Gehler, Ole Winther, Francesco Locatello, Olivier Bachem, Bernhard Schölkopf, and Stefan Bauer. “The Role of Pretrained Representations for the OOD Generalization of  Reinforcement Learning Agents.” In &lt;i&gt;10th International Conference on Learning Representations&lt;/i&gt;, 2022.</chicago>
<short>A. Dittadi, F. Träuble, M. Wüthrich, F. Widmaier, P. Gehler, O. Winther, F. Locatello, O. Bachem, B. Schölkopf, S. Bauer, in:, 10th International Conference on Learning Representations, 2022.</short>
</bibliographicCitation>
</extension>
<recordInfo><recordIdentifier>14174</recordIdentifier><recordCreationDate encoding="w3cdtf">2023-08-22T14:02:13Z</recordCreationDate><recordChangeDate encoding="w3cdtf">2023-09-11T09:48:36Z</recordChangeDate>
</recordInfo>
</mods>
</modsCollection>
