Fun

Google’s new Gemini AI model dominates benchmarks, beats GPT-4o and Claude-3

News Feed - 2024-08-02 06:08:03

Tristan Greene2 hours agoGoogle’s new Gemini AI model dominates benchmarks, beats GPT-4o and Claude-3This is the first time Google’s taken the top slot on the Chatbot Arena leaderboard.531 Total viewsListen to article 0:00NewsOwn this piece of crypto historyCollect this article as NFTCOINTELEGRAPH IN YOUR SOCIAL FEEDFollow ourSubscribe onThere’s a new top dog in the world of generative artificial intelligence benchmarks and its name is Gemini 1.5 Pro. 


The previous champ, OpenAI’s ChatGPT-4o, was finally surpassed on Aug. 1 when Google quietly launched an experimental release of its latest model.


Gemini’s latest update arrived without fanfare and is currently labelled as experimental. But it quickly gained the attention of the AI community across social media as reports began to trickle in that it was surpassing its rivals on benchmark scores.Artificial intelligence benchmarks


OpenAI’s ChatGPT has been the standard bearer for generative AI since the launch of GPT-3. Its latest model, GPT-4o, and its closest competitor, Anthropic’s Claude-3, have reigned supreme above most other models in most common benchmarks for the past year or so with little in the way of competition.Source:Large Model Systems Organization.


One of the most popular benchmarks is called the LMSYS Chatbot Arena. It tests models on a variety of tasks and assigns an overall competency score. GPT-4o received a score of 1,286 while Claude-3 earned a respectable 1,271.


A previous version of Gemini 1.5 Pro scored 1,261. But the experimental version (Gemini 1.5 Pro 0801) released on Aug 1 scored a whopping 1,300.


This indicates that it’s overall more capable than its competitors, but benchmarks aren’t necessarily an accurate representation of what an AI model can and can’t do.Community excitement


Without deeper comparisons available, we’re entering an era where the AI chatbot market has matured enough to offer multiple options. It’s ultimately up to end-users to determine which AI model works best for them.


Anecdotally, there’s been a wave of excitement over the latest version of Gemini with users on social media calling it “insanely good.” One Redditor went so far as to write that it “blows 4o out of the water.”


It’s unclear at this time if the experimental version of Gemini 1.5 Pro will end up being the default going forward. While it remains generally available as of the time of this article’s publication, the fact that it’s in what"s considered an early release or testing phase indicates that it’s possible the model could be rescinded or changed for safety or alignment reasons.


Related:Google announces safety, transparency advancements in AI models# Google# Technology# AI# ChatGPT# OpenAIAdd reaction

News Feed

XRP Price Prediction: Analyst Gives Reasons For Why $10,000 Is A Feasible Price Target
Este artículo también está disponible en español. Crypto analyst Vincent has given reasons why the XRP price could rally to as high as $10,000 at some point. This comes a
Bitcoin seen following stocks as BTC price gains 2.5% to attack $61K
William Suberg13 hours agoBitcoin seen following stocks as BTC price gains 2.5% to attack $61KBitcoin stands to gain from increasingly risk-on macro sentiment, but can it shift a stubborn BTC price range?1820 Total views
Putin Signs Law Prohibiting Payments With Digital Assets in Russia
Putin Signs Law Prohibiting Payments With Digital Assets in Russia President Vladimir Putin of Russia has signed into law a bill banning payments with digital financial assets. The
Terraform Labs Donates 12 Million LUNA to Luna Foundation Guard
Terraform Labs Donates 12 Million LUNA to Luna Foundation Guard Terraform Labs, the company behind the creation of Terra, has announced a new donation to support the ecosystem of t
Future Outlook For HBAR: Insights From Hedera Q3 Surge And Price Projections
Este artículo también está disponible en español. Decentralized ledger platform Hedera has posted a solid set of third quarter (Q3) results, in line with broader market t
5 Presales to Make Crazy Gains in 2025 as Bitcoin Looks to Breakout
Este artículo también está disponible en español. Despite hovering under the $100K price level, analysts believe Bitcoin is still very much in bullish territory. Undeterr
JPMorgan Praises Bitcoin Then Pushes JPM Coin, Sets Up Dedicated Crypto Unit
JPMorgan Praises Bitcoin Then Pushes JPM Coin, Sets Up Dedicated Crypto Unit After JPMorgan analysts praised bitcoin, saying that the price of the cryptocurrency
Tom Mitchelhill3 hours agoOpenSea investor marks down stake in platform by 90%: ReportOpenSea’s co-lead investor, Coatue Management, marked down its investment from $120 million to $13 million.2610 Total views45 Total
Former RBI Governor and IMF Chief Economist Sees Value in Bitcoin and Facebook Libra
Former RBI Governor and IMF Chief Economist Sees Value in Bitcoin and Facebook LibraRaghuram Rajan, former governor of the Reserve Bank of India (RBI) and chief economist at the Int
Picpay to Offer Cryptocurrency Services in Brazil to More Than 60 Million Customers
Picpay to Offer Cryptocurrency Services in Brazil to More Than 60 Million Customers Picpay, one of the most popular payments fintech companies in Brazil, has announced that it will
FTX Co-Founder’s Alleged Extravagance Comes to Light in Bankruptcy Court Documents
FTX Co-Founder"s Alleged Extravagance Comes to Light in Bankruptcy Court Documents Following the court filing that shows FTX co-founder Sam Bankman-Fried (SBF) wants access to FTX
Ethereum Exchange Balances Decline To 18.8M ETH: Smart Money Drains Supply
Reason to trust Strict editorial policy that focuses on accuracy, relevance, and impartiality Created by industry experts and meticulously reviewed The highest standards in reporting and pu