Fun

Google’s new Gemini AI model dominates benchmarks, beats GPT-4o and Claude-3

News Feed - 2024-08-02 06:08:03

Tristan Greene2 hours agoGoogle’s new Gemini AI model dominates benchmarks, beats GPT-4o and Claude-3This is the first time Google’s taken the top slot on the Chatbot Arena leaderboard.531 Total viewsListen to article 0:00NewsOwn this piece of crypto historyCollect this article as NFTCOINTELEGRAPH IN YOUR SOCIAL FEEDFollow ourSubscribe onThere’s a new top dog in the world of generative artificial intelligence benchmarks and its name is Gemini 1.5 Pro. 


The previous champ, OpenAI’s ChatGPT-4o, was finally surpassed on Aug. 1 when Google quietly launched an experimental release of its latest model.


Gemini’s latest update arrived without fanfare and is currently labelled as experimental. But it quickly gained the attention of the AI community across social media as reports began to trickle in that it was surpassing its rivals on benchmark scores.Artificial intelligence benchmarks


OpenAI’s ChatGPT has been the standard bearer for generative AI since the launch of GPT-3. Its latest model, GPT-4o, and its closest competitor, Anthropic’s Claude-3, have reigned supreme above most other models in most common benchmarks for the past year or so with little in the way of competition.Source:Large Model Systems Organization.


One of the most popular benchmarks is called the LMSYS Chatbot Arena. It tests models on a variety of tasks and assigns an overall competency score. GPT-4o received a score of 1,286 while Claude-3 earned a respectable 1,271.


A previous version of Gemini 1.5 Pro scored 1,261. But the experimental version (Gemini 1.5 Pro 0801) released on Aug 1 scored a whopping 1,300.


This indicates that it’s overall more capable than its competitors, but benchmarks aren’t necessarily an accurate representation of what an AI model can and can’t do.Community excitement


Without deeper comparisons available, we’re entering an era where the AI chatbot market has matured enough to offer multiple options. It’s ultimately up to end-users to determine which AI model works best for them.


Anecdotally, there’s been a wave of excitement over the latest version of Gemini with users on social media calling it “insanely good.” One Redditor went so far as to write that it “blows 4o out of the water.”


It’s unclear at this time if the experimental version of Gemini 1.5 Pro will end up being the default going forward. While it remains generally available as of the time of this article’s publication, the fact that it’s in what"s considered an early release or testing phase indicates that it’s possible the model could be rescinded or changed for safety or alignment reasons.


Related:Google announces safety, transparency advancements in AI models# Google# Technology# AI# ChatGPT# OpenAIAdd reaction

News Feed

Grayscale launches investment fund for MakerDAO token
Alex O’Donnell10 hours agoGrayscale launches investment fund for MakerDAO tokenGrayscale also launched funds for protocols Bittensor and Sui in August.1271 Total views3 Total sharesListen to article 0:00NewsOwn this pi
Institutional adoption in blockchain and crypto at its highest point, says BlockDaemon strategist
Tristan Greene4 hours agoInstitutional adoption in blockchain and crypto at its highest point, says BlockDaemon strategistBarnaby Hodgkins is bullish on mass adoption, Ethereum ETFs, and the future of the industry.946 To
Luna Foundation Guard Raises $1 Billion to Safeguard UST Dollar Peg
Luna Foundation Guard Raises $1 Billion to Safeguard UST Dollar Peg The Luna Foundation Guard (LFG) has raised $1 billion in a private token sale to allow the group to safeguard th
William Suberg13 hours agoBitcoin trader reveals ‘important’ BTC price zone as bulls hold $29.3KBitcoin traders continue to battle for control of a rangebound market — but some nearby BTC price levels are more sign
Report: The Oldest Bank in America, BNY Mellon Can Now Custody Bitcoin and Ethereum
Report: The Oldest Bank in America, BNY Mellon Can Now Custody Bitcoin and Ethereum America’s oldest bank, the Bank of New York Mellon Corporation, commonly known as BNY Mel
Billionaire Mark Cuban Expects SEC to Impose ‘Nightmare’ Crypto Registration Rules
Billionaire Mark Cuban Expects SEC to Impose "Nightmare" Crypto Registration Rules Shark Tank star and the owner of the NBA team Dallas Mavericks, Mark Cuban, has warned that the U
Alexander Vinnik Serves Prison Term in France but No Freedom in Sight
Alexander Vinnik Serves Prison Term in France but No Freedom in Sight Having served his five-year prison sentence in France, Russian IT and crypto specialist Alexander Vinnik is no
If Ethereum Holds $2,200 Price Could Recover Fast – Analyst Sets Price Target
Este artículo también está disponible en español. Ethereum is trading below the $2,300 mark after failing to hold key demand levels last week. The price has faced intense
Price analysis 6/12: BTC, ETH, BNB, SOL, XRP, DOGE, TON, SHIB, ADA, AVAX
Rakesh Upadhyay8 hours agoPrice analysis 6/12: BTC, ETH, BNB, SOL, XRP, DOGE, TON, SHIB, ADA, AVAXFavorable CPI data have helped Bitcoin reclaim the crucial $69,000 level, signaling that a move to $72,000 is possible.243
India Unveils Guidelines for Crypto Advertising
India Unveils Guidelines for Crypto Advertising The Advertising Standards Council of India has released guidelines for the advertising and promotion of crypto assets and related se
Survey: Investors Expect Bitcoin’s Price to Fall to $10,000
Survey: Investors Expect Bitcoin"s Price to Fall to $10,000 A new survey shows that the majority of nearly 1,000 investors who responded expect bitcoin’s price to drop to $1
Central Bank of Ukraine Supports Crypto Industry, Fears Cryptocurrency
Central Bank of Ukraine Supports Crypto Industry, Fears Cryptocurrency The National Bank of Ukraine recognizes the benefits of endorsing crypto innovations but also fears cryptocur