2026-04-13 15:52:30

Claude Opus 4.6 seems to have really become less intelligent lately...

Last week it was still firmly in second place on the BridgeBench hallucination benchmark, with an accuracy of 83.3%.
When it was retested on April 12th, it had fallen straight to 10th place, with an accuracy of only 68.3% and a hallucination rate up 98%.
A comparison chart shows a very obvious gap between the two runs.
Many people have recently felt that it has become noticeably dumber at writing code and reasoning: it forgets instructions quickly and produces more nonsense.

