Fun

Anthropic launches $15K jailbreak bounty program for its unreleased next-gen AI

News Feed - 2024-08-10 06:08:17

Tristan Greene2 hours agoAnthropic launches $15K jailbreak bounty program for its unreleased next-gen AIThe program will be open to a limited number of participants initially but will expand at a later date.404 Total viewsListen to article 0:00NewsOwn this piece of crypto historyCollect this article as NFTCOINTELEGRAPH IN YOUR SOCIAL FEEDFollow ourSubscribe onArtificial intelligence firm Anthropic announced the launch of an expanded bug bounty program on Aug.8, with rewards as high as $15,000 for participants who can “jailbreak” the company’s unreleased, “next generation” AI model. 


Anthropic’s flagship AI model, Claude-3, is a generative AI system similar to OpenAI’s ChatGPT and Google’s Gemini. As part of the company’s efforts to ensure that Claude and its other models are capable of operating safely, it conducts what’s called “red teaming.”Red teaming


Red teaming is basically just trying to break something on purpose. In Claude’s case, the point of red teaming is to try and figure out all of the ways that it could be prompted, forced, or otherwise perturbed into generating unwanted outputs.


During red teaming efforts, engineers might rephrase questions or reframe a query in order to trick the AI into outputting information it’s been programmed to avoid.


For example, an AI system trained on data gathered from the internet is likely to contain personally identifiable information on numerous people. As part of its safety policy, Anthropic has put guardrails in place to prevent Claude and its other models from outputting that information.


As AI models become more robust and capable of imitating human communication, the task of trying to figure out every possible unwanted output becomes exponentially challenging.Bug bounty


Anthropic has implemented several novel safety interventions in its models, including its “Constitutional AI” paradigm, but it’s always nice to get fresh eyes on a long-standing issue.


According to a company blog post, it’s latest initiative will expand on existing bug bounty programs to focus on universal jailbreak attacks:“These are exploits that could allow consistent bypassing of AI safety guardrails across a wide range of areas. By targeting universal jailbreaks, we aim to address some of the most significant vulnerabilities in critical, high-risk domains such as CBRN (chemical, biological, radiological, and nuclear) and cybersecurity.”


The company is only accepting a limited number of participants and encourages AI researchers with experience and those who “have demonstrated expertise in identifying jailbreaks in language models” to apply by Friday, Aug. 16.


Not everyone who applies will be selected, but the company plans to “expand this initiative more broadly in the future.”


Those who are selected will receive early access to an unreleased “next generation” AI model for red-teaming purposes.


Related:Tech firms pen letter to EU requesting more time to comply with AI Act# Technology# AIAdd reaction

News Feed

Harvard built hacker-proof quantum network in Boston using existing fiber cable
Tristan Greene4 hours agoHarvard built hacker-proof quantum network in Boston using existing fiber cableAccording to the scientists, the 22-mile distance between nodes is the longest quantum fiber network to date.652 Tot
Berkshire’s Charlie Munger Likes the Fed, Hates Bitcoin Promoters, Calls Tesla’s Success a Miracle
Berkshire"s Charlie Munger Likes the Fed, Hates Bitcoin Promoters, Calls Tesla"s Success a Miracle Berkshire Hathaway Vice Chairman Charlie Munger, Warren Buffett’s right-han
LPNT’s Strategy For Increasing LPN TOKEN’s Utilization, Circulation, Demand and Supply
LPNT’s Strategy For Increasing LPN TOKEN’s Utilization, Circulation, Demand and Supply PRESS RELEASE. Forex trading is the world’s largest decentralize
CoinDeal Obtains in-Principle Approval for Maltese Class 4 VFA License
CoinDeal Obtains in-Principle Approval for Maltese Class 4 VFA License PRESS RELEASE. CoinDeal is pleased to announce that it is now the first publicly known com
Italian gov’t to ramp up surveillance of crypto market
Vince Quill6 hours agoItalian gov’t to ramp up surveillance of crypto marketThe latest draft policy stipulated fines between 5,000 and 5 million euros ($5,400–$5.4 million) for market manipulation and other financial
Report: Nigerian Central Bank Spent Over $1.8 Billion Managing Local Currency
Report: Nigerian Central Bank Spent Over $1.8 Billion Managing Local Currency During her appearance before Nigerian lawmakers, Aisha Ahmad, the deputy governor of the Central Bank
Dash Nigeria Takes Digital Currency Education Campaign to Regulators and Key Institutions
Dash Nigeria Takes Digital Currency Education Campaign to Regulators and Key InstitutionsWith sophisticated fraud schemes seemingly overwhelming the African crypto market, there is
Ana Paula Pereira7 hours agoCrypto will transcend international currencies — BlackRock CEOLarry Fink states that global investors are increasingly eager to add crypto assets to their portfolios.3779 Total views158 Tota
Gareth Jenkinson1 hour agoSam Bankman-Fried $500M Anthropic stake irrelevant to case, prosecutors sayUnited States prosecutors argue that the potential for FTX investors to be made whole through the high valuation of Ant
Bitcoin Falls Beneath $40K, Dragging Crypto Economy Below $2 Trillion
Bitcoin Falls Beneath $40K, Dragging Crypto Economy Below $2 Trillion On Thursday evening around 10 p.m. (EST), the price of bitcoin fell beneath the $40K zone for the first time s
3 Ways Staking Will Upend the Economics of Ethereum
The Takeaway New analysis of the economic model behind ethereum 2.0 suggests validators can expect to earn 4.6–10.3 percent in annualized rewards at the start. The hardware cost for running ethereum 2.0 validator softw
Ethereum Fees Drop 35% Since Last Week, Average ETH Gas Fee Still Above $30 per Transfer
Ethereum Fees Drop 35% Since Last Week, Average ETH Gas Fee Still Above $30 per Transfer According to statistics, Ethereum network transaction fees have dropped 35% from the transf