Lemmata
Rate My Rate My Proof
Greg Burnham
Mar 7
An idea for a benchmark to see how well LLMs can spot flawed reasoning