Futures
Access hundreds of perpetual contracts
TradFi
Gold
One platform for global traditional assets
Options
Hot
Trade European-style vanilla options
Unified Account
Maximize your capital efficiency
Demo Trading
Introduction to Futures Trading
Learn the basics of futures trading
Futures Events
Join events to earn rewards
Demo Trading
Use virtual funds to practice risk-free trading
Launch
CandyDrop
Collect candies to earn airdrops
Launchpool
Quick staking, earn potential new tokens
HODLer Airdrop
Hold GT and get massive airdrops for free
Launchpad
Be early to the next big token project
Alpha Points
Trade on-chain assets and earn airdrops
Futures Points
Earn futures points and claim airdrop rewards
Confidential AI Benchmark (ARC-AGI-X): Little impact on the crypto market
Headline
Wharton School scholar Ethan Mollick proposes a “confidential” ARC-AGI-X benchmark to more fairly assess AI models
Summary
Ethan Mollick (Wharton associate professor, author of “Co-Intelligence,” 2024 TIME100 AI honoree) proposed the idea of the “ARC-AGI-X” benchmark on social media: to have a trusted third party host the tests, with the questions and formats kept confidential, while the leaderboard is public but the test content remains secret, preventing models from training specifically on the test questions. His core idea is to improve evaluation methods to truly measure the progress of general intelligence, rather than continuing to reward scale stacking and “answer cramming.”
Analysis
The existing ARC-AGI benchmark was proposed by François Chollet in 2019 and tests “fluid intelligence” through novel grid puzzles. Human accuracy exceeds 85%, while AI systems (even ARC-AGI-3 in 2026) still fall below 50%. The reasons for the gap include:
Mollick’s approach is to use a “confidential question pool + external expert validation” to prevent “teaching to the test,” forcing models to genuinely improve in reasoning and generalization. This addresses an old problem: public question pools make models “appear stronger,” but they may not have genuinely transferable abilities.
The results of the 2025 ARC Prize also highlight this:
Possible impacts include:
Key Information:
Impact Assessment
Conclusion: For crypto traders and short-term funds, this topic is currently irrelevant; the real beneficiaries are researchers focused on AI evaluation and model capability verification. If you are an active trader in the crypto market, there is no need for immediate action; long-term investors can passively track and wait for signals indicating “AI evaluation mechanisms impacting the crypto AI sector.”