Fun

Groq AI's LPU: The breakthrough answer to ChatGPT's GPU woes?

News Feed - 2024-02-22 07:02:14

Savannah Fortis13 hours agoGroq AI"s LPU: The breakthrough answer to ChatGPT"s GPU woes?Groq"s LPU chip emerges as a potential solution to the challenges faced by AI developers relying on GPUs, sparking comparisons with ChatGPT.1270 Total views1 Total sharesListen to article 0:00NewsOwn this piece of crypto historyCollect this article as NFTJoin us on social networksThe latest artificial intelligence (AI) tool to capture the public’s attention is the Groq LPU Inference Engine, which became an overnight sensation on social media after its public benchmark tests went viral, outperforming the top models by other Big Tech companies. 


Groq, not to be confused with Elon Musk’s AI model called Grok, is, in fact, not a model itself but a chip system through which a model can run.


The team behind Groq developed its own “software-defined” AI chip which they called a language processing unit (LPU), developed for inference purposes. The LPU allows Groq to generate roughly 500 tokens per second.


Comparatively, the publicly available AI model ChatGPT-3.5, which runs off of scarce and costly graphics processing units (GPUs), can generate around 40 tokens per second. Comparisons between Groq and other AI systems have been flooding the X platform.Groq is a Radically Different kind of AI architecture

Among the new crop of AI chip startups, Groq stands out with a radically different approach centered around its compiler technology for optimizing a minimalist yet high-performance architecture. Groq's secret sauce is this… pic.twitter.com/Z70sihHNbx— Carlos E. Perez (@IntuitMachine) February 20, 2024


Cointelegraph heard from Mark Heaps, the Chief Evangelist at Groq, to better understand the tool and how it can potentially transform how AI systems operate. 


Heaps said that the founder of Groq, Jonathan Ross, initially wanted to create a system technology that would prevent AI from being “divided between the haves and have nots.”


At the time tensor processing units (TPUs) were only available to Google for their own systems, however, LPUs were born because:“[Ross] and the team wanted anyone in the world to be able to access this level of compute for AI to find innovative new solutions for the world.”


The Groq executive explained that the LPU is a “software-first designed hardware solution,” by which the nature of the design simplifies the way data travels — not only over the chip but from chip to chip and throughout a network. 


“Not needing schedulers, CUDA libraries, Kernels, and more improves not only performance but the Developer experience,” he said.“Imagine commuting to work and every red light turned green right as you hit it because it knew when you"d be there. Or the fact is, you wouldn"t need traffic lights at all. That"s what it"s like when data travels through our LPU.”


Related:Microsoft to invest 3 billion euros into AI development in Germany


A current issue plaguing developers in the industry is the scarcity and cost of powerful GPUs — such as Nvidia’s A100 and H100 chips — needed to run AI models.


However, Heaps said they don’t have the same issues as their chip is made using 14nm silicon. “This size of die has been used for 10 years in chip design,” he said, “and is very affordable, and readily available. Our next chip will be 4nm and also made in the United States.”


He said GPU systems still have a place when talking about running smaller-scale hardware deployments. However, the choice of GPU vs. LPU comes down to multiple factors including the workload and model.“If we"re talking about a large-scale system, serving thousands of users with high utilization of a large language model, our numbers show that [LPUs] are more efficient on power.”


LPU usage remains to be implemented by many of the major developers in the space. Heaps said several factors result in this, one of which being the relatively new “explosion of LLMs” over the last year.


“Folks still wanted a one-size-fits-all solution like a GPU which they can use for both their training and inference. Now the emerging market has forced people to find differentiation and a general solution won"t help them accomplish that.”


Aside from the product itself, Heaps also touched on the elephant in the room — the name “Groq.”


Groq was created in 2016 with the name trademarked shortly after. However, Elon Musk’s chatbot, Grok, only appeared on the scene in November 2023, becoming widely recognized in the AI space in a short time.


Heaps said there have been “Elon fans” who have assumed they tried to “take the name” or that it was a sort of marketing strategy. However, once the company’s history became known he said, “then folks [got] a little quieter.”“It was challenging a few months ago when their LLM was getting a lot of press, but right now I think people are taking notice of Groq, with a Q.”


Magazine:Google to fix diversity-borked Gemini AI, ChatGPT goes insane: AI Eye# Business# Technology# AI# GPU# ChatGPTAdd reactionAdd reaction

News Feed

Bitcoin’s 3% Price Rise Neutralizes Bearish Setup
View Bitcoin has again bounced up from $7,800 support, neutralizing the immediate bearish setup. A break above $8,820 is needed to invalidate the lower-highs setup and confirm a bull reversal. A bullish close, if confirm
XRP Forms A Bullish Pattern In 4-Hour Chart – Analyst Expects $4.20 After Breakout
Este artículo también está disponible en español. XRP is currently at a critical juncture, trading at a key level after breaking its all-time high just eight days ago. De
Ezra Reguerra8 minutes agoCouple mistakenly sent $10.5M by Crypto.com to face October plea hearingThevamanogari Manivel was sentenced to 18 months of community corrections with six months of unpaid community work while h
Trezor to simplify self-custody with onboarding sessions and new wallet
Helen Partz1 minute agoTrezor to simplify self-custody with onboarding sessions and new walletSelf-custody raises concerns about the burden of holding the private key, which Trezor wants to solve with the help of a dedic
A Look at ‘Individual X’ and the Seized Stash of Silk Road Bitcoins Worth $1 Billion
A Look at "Individual X" and the Seized Stash of Silk Road Bitcoins Worth $1 Billion On November 3, 2020, the cryptocurrency community noticed that one of the la
BlackRock Bitcoin ETF posts September's biggest daily inflow of over $180M
Ciaran Lyons12 hours agoBlackRock Bitcoin ETF posts September"s biggest daily inflow of over $180MBlackRock"s Bitcoin ETF saw the highest daily inflow of any fund this month on Sept. 25, amid a wider five-day inflow stre
How to Track, Get and Set the Best Transaction Fees with Bitcoin and Bitcoin Cash
How to Track, Get and Set the Best Transaction Fees with Bitcoin and Bitcoin Cash Once set up with a bitcoin or bitcoin cash wallet and some coins, using and sending them is pret
Colombian Money Laundering Watchdog Postpones Crypto Transaction Reporting Resolution
Colombian Money Laundering Watchdog Postpones Crypto Transaction Reporting Resolution The UIAF, which is the Colombian money laundering watchdog, has postponed the date on which ex
Tristan Greene5 hours agoBitbuy enters strategic partnership with Canadian crypto ATM firm LocalcoinLocalcoin ATM will also be expanding its range of cryptocurrency offerings and launching a wallet app.1205 Total views32
Biggest Movers: AVAX Hits 2-Week Low as DOT Extends Recent Losses
Biggest Movers: AVAX Hits 2-Week Low as DOT Extends Recent Losses Avalanche fell to a two-week low on Tuesday, as the token broke out of a key support point. Prices fell below a fl
Anthony Clarke9 hours agoBoosting blockchain adoption by keeping tech on the back endBuilders are increasingly looking to streamline their applications with more familiar interfaces to onboard new users.395 Total views16
Play-to-Earn on Playdapp’s Flagship RPG “Along With the Gods: Knights of the Dawn” in 7 Days
Play-to-Earn on Playdapp’s Flagship RPG “Along With the Gods: Knights of the Dawn” in 7 Days press release PRESS RELEASE. Fast-rising Blockchain gaming platform PlayDapp has r