
Anthropic launches $15K jailbreak bounty program for its unreleased next-gen AI

News Feed - 2024-08-10 06:08:17

Tristan Greene

The program will be open to a limited number of participants initially but will expand at a later date.

Artificial intelligence firm Anthropic announced the launch of an expanded bug bounty program on Aug. 8, with rewards as high as $15,000 for participants who can “jailbreak” the company’s unreleased, “next generation” AI model.


Anthropic’s flagship AI model, Claude 3, is a generative AI system similar to OpenAI’s ChatGPT and Google’s Gemini. As part of the company’s efforts to ensure that Claude and its other models are capable of operating safely, it conducts what’s called “red teaming.”

Red teaming


Red teaming means deliberately trying to break a system. In Claude’s case, the point is to find the ways the model could be prompted, forced, or otherwise perturbed into generating unwanted outputs.


During red teaming efforts, engineers might rephrase questions or reframe a query in order to trick the AI into outputting information it’s been programmed to avoid.


For example, an AI system trained on data gathered from the internet is likely to contain personally identifiable information on numerous people. As part of its safety policy, Anthropic has put guardrails in place to prevent Claude and its other models from outputting that information.


As AI models become more robust and capable of imitating human communication, the task of trying to figure out every possible unwanted output becomes exponentially more challenging.

Bug bounty


Anthropic has implemented several novel safety interventions in its models, including its “Constitutional AI” paradigm, but it’s always nice to get fresh eyes on a long-standing issue.


According to a company blog post, its latest initiative will expand on existing bug bounty programs to focus on universal jailbreak attacks: “These are exploits that could allow consistent bypassing of AI safety guardrails across a wide range of areas. By targeting universal jailbreaks, we aim to address some of the most significant vulnerabilities in critical, high-risk domains such as CBRN (chemical, biological, radiological, and nuclear) and cybersecurity.”


The company is only accepting a limited number of participants and encourages AI researchers with experience and those who “have demonstrated expertise in identifying jailbreaks in language models” to apply by Friday, Aug. 16.


Not everyone who applies will be selected, but the company plans to “expand this initiative more broadly in the future.”


Those who are selected will receive early access to an unreleased “next generation” AI model for red-teaming purposes.


