Fun

Anthropic launches $15K jailbreak bounty program for its unreleased next-gen AI

News Feed - 2024-08-10 06:08:17

Tristan Greene2 hours agoAnthropic launches $15K jailbreak bounty program for its unreleased next-gen AIThe program will be open to a limited number of participants initially but will expand at a later date.404 Total viewsListen to article 0:00NewsOwn this piece of crypto historyCollect this article as NFTCOINTELEGRAPH IN YOUR SOCIAL FEEDFollow ourSubscribe onArtificial intelligence firm Anthropic announced the launch of an expanded bug bounty program on Aug.8, with rewards as high as $15,000 for participants who can “jailbreak” the company’s unreleased, “next generation” AI model. 


Anthropic’s flagship AI model, Claude-3, is a generative AI system similar to OpenAI’s ChatGPT and Google’s Gemini. As part of the company’s efforts to ensure that Claude and its other models are capable of operating safely, it conducts what’s called “red teaming.”Red teaming


Red teaming is basically just trying to break something on purpose. In Claude’s case, the point of red teaming is to try and figure out all of the ways that it could be prompted, forced, or otherwise perturbed into generating unwanted outputs.


During red teaming efforts, engineers might rephrase questions or reframe a query in order to trick the AI into outputting information it’s been programmed to avoid.


For example, an AI system trained on data gathered from the internet is likely to contain personally identifiable information on numerous people. As part of its safety policy, Anthropic has put guardrails in place to prevent Claude and its other models from outputting that information.


As AI models become more robust and capable of imitating human communication, the task of trying to figure out every possible unwanted output becomes exponentially challenging.Bug bounty


Anthropic has implemented several novel safety interventions in its models, including its “Constitutional AI” paradigm, but it’s always nice to get fresh eyes on a long-standing issue.


According to a company blog post, it’s latest initiative will expand on existing bug bounty programs to focus on universal jailbreak attacks:“These are exploits that could allow consistent bypassing of AI safety guardrails across a wide range of areas. By targeting universal jailbreaks, we aim to address some of the most significant vulnerabilities in critical, high-risk domains such as CBRN (chemical, biological, radiological, and nuclear) and cybersecurity.”


The company is only accepting a limited number of participants and encourages AI researchers with experience and those who “have demonstrated expertise in identifying jailbreaks in language models” to apply by Friday, Aug. 16.


Not everyone who applies will be selected, but the company plans to “expand this initiative more broadly in the future.”


Those who are selected will receive early access to an unreleased “next generation” AI model for red-teaming purposes.


Related:Tech firms pen letter to EU requesting more time to comply with AI Act# Technology# AIAdd reaction

News Feed

Bitget Announces Winners of Hero Trader Awards 2022
Bitget Announces Winners of Hero Trader Awards 2022 press release PRESS RELEASE.VICTORIA, Seychelles— Bitget, a leading cryptocurrency derivatives exchange, has announced the
Bitcoin Enters Oversold Levels, Analyst Warns This Is Bearish, Not Bullish
Reason to trust Strict editorial policy that focuses on accuracy, relevance, and impartiality Created by industry experts and meticulously reviewed The highest standards in reporting and pu
Grayscale’s GBTC Bitcoin holdings have fallen 33% since its conversion
Martin Young5 hours agoGrayscale’s GBTC Bitcoin holdings have fallen 33% since its conversionGrayscale held around 620,000 BTC at the time its GBTC fund was converted into an ETF. Today it’s sitting at around 420,680
Shark Tank’s Mark Cuban Says Bitcoin Is a Store of Value but ‘More Religion Than Solution to Any Problem’
Shark Tank’s Mark Cuban Says Bitcoin Is a Store of Value but "More Religion Than Solution to Any Problem" Shark Tank star and the Dallas Mavericks’ owner
Mad Money’s Jim Cramer Advises How to Invest in Bitcoin, When to Sell
Mad Money"s Jim Cramer Advises How to Invest in Bitcoin, When to Sell Mad Money host Jim Cramer has some advice on how to invest in bitcoin and when is a good ti
MiCA regulation takes shape under EBA’s newest guidelines
Ana Paula Pereira2 hours agoMiCA regulation takes shape under EBA’s newest guidelinesThe European Banking Authority has introduced a series of technical standards and guidelines for token issuers as MiCA implementation
Helen Partz10 hours agoBinance CEO warns of phishing scams as Uniswap founder gets hackedThe number of social engineering attacks in the cryptocurrency industry has been rising, with major execs getting hacked recently.1
Bitcoin, Ethereum Technical Analysis: BTC Falls Below $25,000 Following Recent Surge
Bitcoin, Ethereum Technical Analysis: BTC Falls Below $25,000 Following Recent Surge Bitcoin fell below the $25,000 mark on Friday, as markets moved into consolidation, following r
Amaka Nwaokocha1 hour agoElon Musk’s X platform faces backlash over XRP account suspensionCrypto Eri, a prominent figure in the cryptocurrency community, contacted Elon Musk on X, seeking clarification about the accoun
Cardano: Elliot Wave Predicts 50% Crash For ADA Price, Is It Time To Get Out
Reason to trust Strict editorial policy that focuses on accuracy, relevance, and impartiality Created by industry experts and meticulously reviewed The highest standards in reporting and pu
Arizona primary involving crypto Super PAC’s $1.3M is a squeaker
Turner Wright5 hours agoArizona primary involving crypto Super PAC’s $1.3M is a squeakerThe primary between two Democrats in Arizona’s 3rd Congressional District will likely go to a recount, with money from crypto in
St. Kitts and Nevis to Explore Possibility of Making Bitcoin Cash Legal Tender by March 2023
St. Kitts and Nevis to Explore Possibility of Making Bitcoin Cash Legal Tender by March 2023 St. Kitts and Nevis will explore the possibility of making bitcoin cash legal tender by