ChatGPT Leads Losses in Alpha Arena Crypto Trading Contest

Alpha Arena Crypto Trading Contest: Most LLMs End in Losses, Only DeepSeek and QWEN3 MAX Profit Despite Trading Challenges

  • Six large language models (LLMs) competed in the “Alpha Arena” crypto trading contest, with most finishing below their starting capital.
  • ChatGPT lost 63% of its funds, leading the losses among competitors.
  • Only two LLMs, DeepSeek and QWEN3 MAX, ended with profits, gaining $489 and $2,232 respectively.
  • Trading costs significantly affected profits due to frequent trades and small gains.
  • The competition setup imposed strict limits on models’ trading options and data access, making the task challenging.

Six large language models, including OpenAI’s ChatGPT, competed in the “Alpha Arena” crypto trading event created by Nof1. The contest, which ran for just over two weeks, involved all models trading cryptocurrencies under uniform prompts and conditions. The competition concluded on November 4th.

- Advertisement -

The final results showed that four of the six models ended with less than the initial $10,000 they started with. ChatGPT lost $6,267, equivalent to 63% of its funds. Google’s Gemini lost $5,671, X’s Grok lost $4,531, and Anthropic’s Claude Sonnet lost $3,081. Only High-Flyer’s DeepSeek and Alibaba’s QWEN3 MAX made profits, ending with gains of $489 and $2,232, respectively.

The number of trades varied widely among the models. Gemini executed 238 trades, while Claude Sonnet made only 38. Despite the difference in trade volume, all models had a win rate of 25 to 30 percent. Trading fees also impacted overall performance, with QWEN3 MAX paying the highest fees at $1,654 and Gemini paying $1,331 in transaction costs.

Nof1 highlighted that early losses were largely due to trading expenses. “PnL (profit and loss) was dominated by trading costs in early runs as agents over-traded and took quick, tiny gains that fees erased,” the organizer noted. On October 27th, some models briefly doubled their funds, and Claude and Grok managed short-term profits, while ChatGPT and Gemini remained mostly in the red throughout.

The contest was designed to be challenging, with LLMs receiving limited numerical time series data and restricted options for trading assets and actions. Nof1’s Jay Azhang commented that the models showed consistent behavior patterns, describing this as an investing “personality.” He added that the limited context and strict rules added difficulty.

- Advertisement -

Nof1 indicated a future competition is planned with improved prompts and added statistical rigor. Azhang’s goal is to develop his own crypto trading AI model, aiming to build on lessons from this event.

For further details, see Nof1’s complete post.

✅ Follow BITNEWSBOT on Telegram, Facebook, LinkedIn, X.com, and Google News for instant updates.

Previous Articles:

- Advertisement -

Latest News

Aave Lab Offers Revenue, New Focus to DAO’s End Feud

Aave Labs has proposed a new framework directing all revenue from Aave-branded products to...

Soldier used military secrets for $150K crypto bets.

An Israeli reserve soldier and a civilian accomplice face charges for allegedly using military...

BitGo, 21Shares Expand ETF Staking & Custody Partnership

BitGo and 21Shares have expanded their partnership to provide custody, trading, and staking services...

North Korean Hackers Use Google’s Gemini AI for Cyber Recon

Google's threat intelligence team observed the North Korean hacking group UNC2970 using the generative...

Binance SAFU Fund Now Holds $1 Billion in Bitcoin

Binance has purchased $305 million in Bitcoin for its user protection fund, bringing its...

Must Read

Top 10 Best DeFi Tokens to Invest in 2022

Decentralized Finance (Defi), is one of the most talked-about topics in the crypto space alongside NFTs. So if you want to know the best...
🔥 #AD Get 20% OFF any new 12 month hosting plan from Hostinger. Click here!