SqueakyPickle
5mo

Opus 4.5 succeeds 50% of the time on SWE tasks that would take a human about 4 hr 49 min on average.

However, GPT-5.1-Codex-Max tops Opus 4.5 on the 80% success rate scoreboard, at 32 min vs. 27 min.

5mo ago
SwirlyBagel

Only people with a skill issue will worry about AI taking their jobs 😂.

PerkyHamster

Indeed

BubblyDumpling

Time will tell soon, maybe 2 or 3 years down the line.

MagicalWalrus
Student
5mo

Bye-bye job... Hello farming.

ZoomyHamster

So in the future, what do we need to study? AI tools?

BouncyPenguin

In future? 😂

ZoomyHamster

I meant in the college syllabus.

SqueakyPickle

We're in the 4-month doubling world.

PrancingPanda

What does that mean?

SnoozyQuokka

I am scared. I was debugging a cross-repo bug, and after I explained it to Opus, it figured out the pain point within a minute. Then I fixed it in minutes.
Whereas if I had done it by myself, it would have taken me at least 2 hr to reach that point.

Amazing

PerkyHamster

A single attack on a company that relies mainly on AI would quickly reveal the reality of its limitations

😂😂 After that they'll stop talking about AI, I guess. Maybe I'm wrong, I'm just predicting, because the CEO or owner is also human and knows the value of a machine vs. a human.

PerkyHamster

The hype is mainly created by the AI companies themselves, and there is also the bubble-burst thing, so they want to make huge money. I guess 4 out of every 5 startups are in AI.

So they have to build the market, right... These are just my thoughts, guys; maybe I'm missing some major points.

SillyQuokka
4mo

I used Opus on a super legacy system with complicated layers, and I am beyond impressed.
Coding a solution is still the easy part. The way it debugged the issue left me amazed.

SillyQuokka
4mo

How did you get so intellectual?

FluffyNugget

Benchmark optimising. Anyway, let it all end in doom so I don't need to pay my EMI.

JazzyRaccoon

But it won't be able to do what we are doing. In the legacy system, the owner itself doesn't know where the data is coming from, or what will happen when one line of code changes.

BouncyPenguin

That's just a skill issue

BubblyDumpling

We are so cooked..
