Fun

Google’s new Gemini AI model dominates benchmarks, beats GPT-4o and Claude-3

News Feed - 2024-08-02 06:08:03

Tristan Greene2 hours agoGoogle’s new Gemini AI model dominates benchmarks, beats GPT-4o and Claude-3This is the first time Google’s taken the top slot on the Chatbot Arena leaderboard.531 Total viewsListen to article 0:00NewsOwn this piece of crypto historyCollect this article as NFTCOINTELEGRAPH IN YOUR SOCIAL FEEDFollow ourSubscribe onThere’s a new top dog in the world of generative artificial intelligence benchmarks and its name is Gemini 1.5 Pro. 


The previous champ, OpenAI’s ChatGPT-4o, was finally surpassed on Aug. 1 when Google quietly launched an experimental release of its latest model.


Gemini’s latest update arrived without fanfare and is currently labelled as experimental. But it quickly gained the attention of the AI community across social media as reports began to trickle in that it was surpassing its rivals on benchmark scores.Artificial intelligence benchmarks


OpenAI’s ChatGPT has been the standard bearer for generative AI since the launch of GPT-3. Its latest model, GPT-4o, and its closest competitor, Anthropic’s Claude-3, have reigned supreme above most other models in most common benchmarks for the past year or so with little in the way of competition.Source:Large Model Systems Organization.


One of the most popular benchmarks is called the LMSYS Chatbot Arena. It tests models on a variety of tasks and assigns an overall competency score. GPT-4o received a score of 1,286 while Claude-3 earned a respectable 1,271.


A previous version of Gemini 1.5 Pro scored 1,261. But the experimental version (Gemini 1.5 Pro 0801) released on Aug 1 scored a whopping 1,300.


This indicates that it’s overall more capable than its competitors, but benchmarks aren’t necessarily an accurate representation of what an AI model can and can’t do.Community excitement


Without deeper comparisons available, we’re entering an era where the AI chatbot market has matured enough to offer multiple options. It’s ultimately up to end-users to determine which AI model works best for them.


Anecdotally, there’s been a wave of excitement over the latest version of Gemini with users on social media calling it “insanely good.” One Redditor went so far as to write that it “blows 4o out of the water.”


It’s unclear at this time if the experimental version of Gemini 1.5 Pro will end up being the default going forward. While it remains generally available as of the time of this article’s publication, the fact that it’s in what"s considered an early release or testing phase indicates that it’s possible the model could be rescinded or changed for safety or alignment reasons.


Related:Google announces safety, transparency advancements in AI models# Google# Technology# AI# ChatGPT# OpenAIAdd reaction

News Feed

Bitcoin Wobbles? Metaplanet Buys Big, Breaks $1 Billion Mark
Reason to trust Strict editorial policy that focuses on accuracy, relevance, and impartiality Created by industry experts and meticulously reviewed The highest standards in reporting and pu
Bitcoin’s Third Largest Wallet Changed Hands, but Onchain Data Shows It’s Likely the Same Owner
Bitcoin’s Third Largest Wallet Changed Hands, but Onchain Data Shows It’s Likely the Same Owner Last year and during the first half of 2022, speculators assumed the third-large
Solana Forms Higher Low: Charging Toward Range Highs?
Reason to trust Strict editorial policy that focuses on accuracy, relevance, and impartiality Created by industry experts and meticulously reviewed The highest standards in reporting and pu
Mastercard Debuts Blockchain Surveillance Tool for Banks and Crypto-Centric Card Issuers
Mastercard Debuts Blockchain Surveillance Tool for Banks and Crypto-Centric Card Issuers On Tuesday, the multinational financial services corporation Mastercard revealed that it is
Calm Before The Surge? Bitcoin Price Stability Signals Sustainable Rally Ahead
Reason to trust Strict editorial policy that focuses on accuracy, relevance, and impartiality Created by industry experts and meticulously reviewed The highest standards in reporting and pu
Bitcoin Price Flashes Major Buy Signal On The 4-Hour TD Sequential Chart, Where To Enter?
Este artículo también está disponible en español. A crypto analyst has shared a TD Sequential chart indicating that the Bitcoin price is flashing a major buy signalin the
Leading DeFi Charity Platform MUNCH Unites the Crypto World With New ‘Charity Circle’
Leading DeFi Charity Platform MUNCH Unites the Crypto World With New ‘Charity Circle’ press release PRESS RELEASE. Charitable company MUNCH will lead a philan
Amaka Nwaokocha22 minutes agoDigital yuan integration introduced to Chinese business air travelThe Civil Aviation Administration and China Merchants Bank said passengers will be able to utilize the digital currency to ac
Bitcoin Price To $130,000 By January, Here’s The Roadmap
Este artículo también está disponible en español. According to a technical analysis from analyst Xanrox, the Bitcoin price is on the road to reaching the $130,000 mark in
Jesse Coghlan6 hours agoDropbox ditches unlimited storage offering, blaming crypto cloud minersThe storage platform turned to metered storage after discovering its Advanced plan was being used by some for crypto mining a
Former FTX Executive Accused of Fueling a Charity Through Discounted FTT Purchase
Former FTX Executive Accused of Fueling a Charity Through Discounted FTT Purchase A former executive of FTX allegedly earned profits for a charity by purchasing discounted FTX toke
Study: Over 13% of All Proceeds of Crimes in Bitcoin Passed Through Privacy Wallets in 2020
Study: Over 13% of All Proceeds of Crimes in Bitcoin Passed Through Privacy Wallets in 2020 According to a study published by the blockchain analysis firm Ellipt