Link to paper: https://arxiv.org/pdf/1805.00899.pdf

the problem

<aside> 💡 self play zero-sum debate game = see which agents have best points (most true and informative information)

</aside>

for context:

introduction

<aside> 💡 a debate approach to alignment

How it works

Untitled