Skip to content

Commit

Permalink
Update index.html
Browse files Browse the repository at this point in the history
  • Loading branch information
Joe-Vincent authored May 10, 2024
1 parent 05ac53d commit 9d905b4
Showing 1 changed file with 2 additions and 0 deletions.
2 changes: 2 additions & 0 deletions docs/index.html
Original file line number Diff line number Diff line change
Expand Up @@ -199,7 +199,9 @@ <h2 class="title is-3">Comparing Policies</h2>
Here we apply our statistical bounds to the recent results from the <a href="https://arxiv.org/abs/2307.15818" target="_blank">RT-2 paper</a>, where the authors compare their RT-2 policy to a VC-1 policy in three settings designed to test emergent capabilities in symbol understanding, reasoning, and human recognition.
For each setting we find the 95% confidence intervals for policy success rate are disjiont, and we conclude with 95% confidence that RT-2 outperforms VC-1.
</p>
<div style="text-align: center;">
<img src="static/images/policy_comparison.png" alt="Confidence intervals for policy success rates" width="75%">
</div>
</div>
</div>
</div>
Expand Down

0 comments on commit 9d905b4

Please sign in to comment.