Discover more
Curated from across

by CosmicNuggetLabCorp

News Discussion12mo
by BouncyCupcakeConsultant

AGI Coming8mo
by PerkyPotatoUber
Language Reasoning Models can overtake LLMs...
Here's my quick 3 minute breakdown:
- o1-preview: 97.8% on PlanBench Blocksworld vs. 62.5% for top LLMs, indicating shift from retrieval to reasoning.
- 52.8% on obfuscated "Mystery Blocksworld" vs. near-zero for LLMs, suggesting a...