Together Evaluations now benchmarks proprietary AI models from OpenAI, Anthropic, and Google against open-source alternatives, claiming 10x cost savings. (Read Together Evaluations now benchmarks proprietary AI models from OpenAI, Anthropic, and Google against open-source alternatives, claiming 10x cost savings. (Read

Together AI Opens Evaluations to OpenAI, Anthropic, Google Models

2 min read

Together AI Opens Evaluations to OpenAI, Anthropic, Google Models

Lawrence Jengar Feb 02, 2026 20:01

Together Evaluations now benchmarks proprietary AI models from OpenAI, Anthropic, and Google against open-source alternatives, claiming 10x cost savings.

Together AI Opens Evaluations to OpenAI, Anthropic, Google Models

Together AI has expanded its Evaluations platform to support direct benchmarking against proprietary models from OpenAI, Anthropic, and Google—a move that could reshape how enterprises make AI infrastructure decisions.

The update, announced February 3, enables side-by-side comparisons between open-source models and closed-source alternatives including GPT-5, Claude Sonnet 4.5, and Gemini 2.5 Pro. For AI-focused crypto projects and decentralized compute networks, this creates a standardized framework for proving cost-efficiency claims.

What's Actually New

Together Evaluations now accepts models from three major providers as both evaluation targets and judges:

OpenAI: GPT-5, GPT-5.2
Anthropic: Claude Sonnet 4.5, Claude Haiku 4.5, Claude Opus 4.5
Google: Gemini 2.5 Pro, Gemini 2.5 Flash

The platform also supports any OpenAI Chat Completions-compatible URL, meaning self-hosted and decentralized inference endpoints can plug directly into the benchmarking system.

The Cost Argument Gets Data

Together AI published accompanying research showing fine-tuned open-source judges (GPT-OSS 120B, Qwen3 235B) outperforming GPT-5.2 as evaluators—62.63% accuracy versus 61.62%—while running at reportedly 10x lower cost and 15x higher speed.

That's a specific, testable claim. For decentralized AI networks competing on inference pricing, having a neutral benchmarking platform that accepts custom endpoints could prove valuable for customer acquisition.

The company, founded in 2020 and known for research innovations like FlashAttention-3, has positioned itself as infrastructure-agnostic. Its platform already offers access to over 200 open-source models with claimed 4x faster inference and 11x lower cost compared to GPT-4o, according to December 2024 benchmarks.

Why This Matters for Crypto AI

Several blockchain-based AI projects—from decentralized GPU marketplaces to inference networks—have struggled to prove their cost advantages aren't just marketing. A third-party evaluation framework that accepts any compatible endpoint changes that dynamic.

The Evaluations API runs on Together's Batch API at roughly 50% lower cost than real-time inference, making large-scale model comparisons economically viable for smaller teams.

Together AI remains a private company with no associated token. But its tooling increasingly touches the infrastructure layer where crypto AI projects compete—and now those projects have a standardized way to benchmark against the incumbents they're trying to displace.

Image source: Shutterstock
  • together ai
  • ai infrastructure
  • llm benchmarking
  • open source ai
  • enterprise ai
Disclaimer: The articles reposted on this site are sourced from public platforms and are provided for informational purposes only. They do not necessarily reflect the views of MEXC. All rights remain with the original authors. If you believe any content infringes on third-party rights, please contact service@support.mexc.com for removal. MEXC makes no guarantees regarding the accuracy, completeness, or timeliness of the content and is not responsible for any actions taken based on the information provided. The content does not constitute financial, legal, or other professional advice, nor should it be considered a recommendation or endorsement by MEXC.

You May Also Like

Will Bitcoin Soar or Stumble Next?

Will Bitcoin Soar or Stumble Next?

The post Will Bitcoin Soar or Stumble Next? appeared on BitcoinEthereumNews.com. With the Federal Reserve’s forthcoming decision on interest rates causing speculation, Bitcoin‘s value remains stable at $115,400. China’s surprising maneuvers in the financial landscape have shifted expected market trends, prompting deeper examination by investors into analysts’ past evaluations regarding rate reductions. Continue Reading:Will Bitcoin Soar or Stumble Next? Source: https://en.bitcoinhaber.net/will-bitcoin-soar-or-stumble-next
Share
BitcoinEthereumNews2025/09/18 03:09
Which Is Set To Become The Next 50x Gainer In 2025?

Which Is Set To Become The Next 50x Gainer In 2025?

The post Which Is Set To Become The Next 50x Gainer In 2025? appeared on BitcoinEthereumNews.com. Crypto News 19 September 2025 | 21:10 Recent crypto market momentum has investors weighing the prospects of established tokens like DOGE and HBAR against rising challengers. DOGE trades close to $0.28, bolstered by the launch of the first U.S. Dogecoin ETF, while HBAR holds steady near $0.24 amid growing speculation around ETF inclusion and strong on-chain activity. Yet, much of the buzz has shifted to Layer Brett (LBRETT), now in presale at $0.0058 and already surpassing $3.8 million raised. With its blend of meme appeal, real utility, and high staking rewards, many investors see Layer Brett as the project with the clearest shot at becoming crypto’s next 50x gainer in 2025. Layer Brett – Is it the future? While DOGE and HBAR stabilize and flirt with resistance zones, Layer Brett is staking its claim as a potentially more aggressive play. With presale pricing at $0.0058 USD for $LBRETT and over $3.7 million USD raised so far, the project is constructing an Ethereum Layer 2 meme-utility token that emphasizes performance, speed, and rewards. Layer Brett’s narrative is not just hype. Its roadmap includes bridging solutions, staking from day one, and a community-driven model. These technical underpinnings give Layer Brett a sharper edge and help it stand out in the race for meme-utility tokens. If its execution aligns with its promise, it may offer more upside than DOGE or HBAR in the medium term. DOGE vs HBAR DOGE (Dogecoin) remains a foundational meme coin with one of the most active communities in crypto. Recent news shows DOGE has benefited from an ETF approval in the U.S., which has validated its institutional presence. Though DOGE continues to trade in a range near $0.25-$0.30, whales are reallocating portions of portfolios into meme-utility and presale tokens. Its upside is seen as more moderate compared to…
Share
BitcoinEthereumNews2025/09/20 03:46
Victra Named 2025 Recipient of Verizon’s Best Build Compliance Award

Victra Named 2025 Recipient of Verizon’s Best Build Compliance Award

Verizon Recognizes Victra for Industry-Leading Excellence in Store Design and Brand Compliance. RALEIGH, N.C., Feb. 3, 2026 /PRNewswire/ — Verizon has named Victra
Share
AI Journal2026/02/03 20:49