Formal Proof

TagLast edit: 26 Sep 2021 22:04 UTC by Pablo

A Formal Proof is a finite sequence of steps from axiom(s) or previous derived proof(s) which strictly follow the allowed rules of inference of the mathematical system in which it exists. They are used to establish statements as true within a mathematical framework in a way which can be independently verified with extremely high certainty, with the most reliable flavor of proof being machine-checked proofs generated by proof assistants since they have even less room for human error.

Proofs, Implications, and Models

Eliezer Yudkowsky30 Oct 2012 13:02 UTC

127 points

218 comments12 min readLW link

Compact Proofs of Model Performance via Mechanistic Interpretability

LawrenceC, rajashree, Adrià Garriga-alonso and Jason Gross

24 Jun 2024 19:27 UTC

92 points

3 comments8 min readLW link

(arxiv.org)

Eleuther releases Llemma: An Open Language Model For Mathematics

mako yass17 Oct 2023 20:03 UTC

22 points

0 comments1 min readLW link

(blog.eleuther.ai)

[Question] What Programming Language Characteristics Would Allow Provably Safe AI?

Davidmanheim28 Aug 2019 10:46 UTC

4 points

9 comments1 min readLW link

A List of things I might do with a Proof Oracle

Logan Zoellner5 Feb 2023 18:14 UTC

−14 points

13 comments3 min readLW link

Squeezing foundations research assistance out of formal logic narrow AI.

Donald Hobson8 Mar 2023 9:38 UTC

16 points

1 comment2 min readLW link

Roadmap for a collaborative prototype of an Open Agency Architecture

Deger Turan10 May 2023 17:41 UTC

30 points

0 comments12 min readLW link

I bet $500 on AI winning the IMO gold medal by 2026

azsantosk11 May 2023 14:46 UTC

37 points

27 comments1 min readLW link

Interview Daniel Murfet on Universal Phenomena in Learning Machines

Alexander Gietelink Oldenziel6 Feb 2023 0:00 UTC

45 points

1 comment16 min readLW link

Social Choice Theory and Logical Handshakes

StrivingForLegibility29 Dec 2023 3:49 UTC

14 points

0 comments4 min readLW link

Fundamentals of Formalisation Level 5: Formal Proof

philip_b9 Jul 2018 20:55 UTC

13 points

0 comments1 min readLW link

Infra-Domain proofs 1

Diffractor28 Mar 2021 9:16 UTC

13 points

0 comments23 min readLW link

Infra-Domain Proofs 2

Diffractor28 Mar 2021 9:15 UTC

13 points

0 comments21 min readLW link

Allowing a formal proof system to self improve while avoiding Lobian obstacles.

Donald Hobson23 Jan 2019 23:04 UTC

6 points

4 comments2 min readLW link

[Math] Towards Proof Writing as a Skill In Itself

Andrew Quinn13 Jun 2018 4:39 UTC

25 points

8 comments2 min readLW link

The value of learning mathematical proof

JonahS2 Jun 2015 3:15 UTC

8 points

42 comments1 min readLW link

An Illustrated Proof of the No Free Lunch Theorem

lifelonglearner8 Jun 2020 1:54 UTC

19 points

0 comments1 min readLW link

(mlu.red)

An example of self-fulfilling spurious proofs in UDT

cousin_it25 Mar 2012 11:47 UTC

33 points

43 comments2 min readLW link

Planning to build a cryptographic box with perfect secrecy

Lysandre Terrisse31 Dec 2023 9:31 UTC

39 points

6 comments11 min readLW link

Weak HCH accesses EXP

evhub22 Jul 2020 22:36 UTC

16 points

0 comments3 min readLW link

Alignment proposals and complexity classes

evhub16 Jul 2020 0:27 UTC

40 points

26 comments13 min readLW link

LBIT Proofs 5: Propositions 29-38

Diffractor16 Dec 2020 3:35 UTC

8 points

0 comments21 min readLW link

LBIT Proofs 1: Propositions 1-9

Diffractor16 Dec 2020 3:48 UTC

7 points

0 comments25 min readLW link

LBIT Proofs 6: Propositions 39-47

Diffractor16 Dec 2020 3:33 UTC

7 points

0 comments23 min readLW link

LBIT Proofs 2: Propositions 10-18

Diffractor16 Dec 2020 3:45 UTC

7 points

0 comments20 min readLW link

Proofs Section 2.3 (Updates, Decision Theory)

Diffractor27 Aug 2020 7:49 UTC

8 points

0 comments31 min readLW link

Proofs Section 2.2 (Isomorphism to Expectations)

Diffractor27 Aug 2020 7:52 UTC

8 points

0 comments46 min readLW link

A proof of Löb’s theorem in Haskell

cousin_it19 Sep 2014 13:01 UTC

52 points

8 comments3 min readLW link

Counterfactual Induction (Algorithm Sketch, Fixpoint proof)

Diffractor17 Dec 2019 5:04 UTC

5 points

2 comments7 min readLW link

Logical inductor limits are dense under pointwise convergence

SamEisenstat6 Oct 2016 8:07 UTC

5 points

0 comments6 min readLW link

Formalized math: dream vs reality

cousin_it9 Jul 2009 20:51 UTC

19 points

10 comments2 min readLW link

Progress on automated mathematical theorem proving?

JonahS3 Jul 2013 18:40 UTC

26 points

65 comments1 min readLW link

Proofs Section 1.1 (Initial results to LF-duality)

Diffractor27 Aug 2020 7:59 UTC

8 points

0 comments20 min readLW link

Proofs Section 1.2 (Mixtures, Updates, Pushforwards)

Diffractor27 Aug 2020 7:57 UTC

8 points

0 comments14 min readLW link

Proofs Section 2.1 (Theorem 1, Lemmas)

Diffractor27 Aug 2020 7:54 UTC

8 points

0 comments36 min readLW link

LBIT Proofs 4: Propositions 22-28

Diffractor16 Dec 2020 3:38 UTC

7 points

0 comments17 min readLW link

LBIT Proofs 7: Propositions 48-52

Diffractor16 Dec 2020 3:31 UTC

7 points

0 comments20 min readLW link

LBIT Proofs 8: Propositions 53-58

Diffractor16 Dec 2020 3:29 UTC

7 points

0 comments18 min readLW link

LBIT Proofs 3: Propositions 19-22

Diffractor16 Dec 2020 3:40 UTC

8 points

0 comments17 min readLW link

Question/Issue with the 5/10 Problem

acgt29 Nov 2021 10:45 UTC

6 points

3 comments3 min readLW link

Speedrunning 4 mistakes you make when your alignment strategy is based on formal proof

Quinn16 Feb 2023 1:13 UTC

62 points

18 comments2 min readLW link

A list of core AI safety problems and how I hope to solve them

davidad26 Aug 2023 15:12 UTC

162 points

27 comments5 min readLW link

Towards Guaranteed Safe AI: A Framework for Ensuring Robust and Reliable AI Systems

Joar Skalse17 May 2024 19:13 UTC

65 points

10 comments2 min readLW link

Video Intro to Guaranteed Safe AI

Mike Vaiana, Diogo de Lucena and AE Studio

11 Jul 2024 17:53 UTC

27 points

0 comments1 min readLW link

(youtu.be)

Davidad’s Bold Plan for Alignment: An In-Depth Explanation

Charbel-Raphaël and Gabin

19 Apr 2023 16:09 UTC

157 points

33 comments21 min readLW link

No comments.