Тёмный

Models that prove their own correctness 

Simons Institute
Подписаться 63 тыс.
Просмотров 77
50% 1

Orr Paradise (UC Berkeley)
simons.berkele...
Alignment, Trust, Watermarking, and Copyright Issues in LLMs
This talk introduces Self-Proving models, a new class of models that formally prove the correctness of their outputs via an Interactive Proof system. After reviewing some related literature, I will formally define Self-Proving models and their per-input (worst-case) guarantees. I will then present algorithms for learning these models and explain how the complexity of the proof system affects the complexity of the learning algorithms. Finally, I will show experiments where Self-Proving models are trained to compute the Greatest Common Divisor of two integers, and to prove the correctness of their results to a simple verifier. Joint work with Noga Amit, Shafi Goldwasser, and Guy N. Rothblum.

Опубликовано:

 

21 окт 2024

Поделиться:

Ссылка:

Скачать:

Готовим ссылку...

Добавить в:

Мой плейлист
Посмотреть позже
Комментарии    
Далее
The ChatGPT Paradox: Impressive Yet Incomplete
1:08:22
Просмотров 27 тыс.
A visual guide to Bayesian thinking
11:25
Просмотров 1,8 млн
Ребенок по калькуляции 😂
00:32
What makes learning to control easy or hard?
1:18:31
Think Fast, Talk Smart: Communication Techniques
58:20
The Magic Of ARM w/ Casey Muratori
1:25:01
Просмотров 92 тыс.
Stop Button Solution? - Computerphile
23:45
Просмотров 480 тыс.
Invention and History of the Bubble Chamber
57:31
Просмотров 13 тыс.
Ребенок по калькуляции 😂
00:32