Fun

News Feed - 2023-11-21 02:11:44

Tristan Greene6 hours agoScientists develop AI monitoring agent to detect and stop harmful outputsThe monitoring system is designed to detect and thwart both prompt injection attacks and edge-case threats.2877 Total views10 Total sharesListen to article 0:00NewsJoin us on social networksA team of researchers from artificial intelligence (AI) firm AutoGPT, Northeastern University and Microsoft Research have developed a tool that monitors large language models (LLMs) for potentially harmful outputs and prevents them from executing. 


The agent is described in a preprint research paper titled “Testing Language Model Agents Safely in the Wild.” According to the research, the agent is flexible enough to monitor existing LLMs and can stop harmful outputs, such as code attacks, before they happen.


Per the research:“Agent actions are audited by a context-sensitive monitor that enforces a stringent safety boundary to stop an unsafe test, with suspect behavior ranked and logged to be examined by humans.”


The team writes that existing tools for monitoring LLM outputs for harmful interactions seemingly work well in laboratory settings, but when applied to testing models already in production on the open internet, they “often fall short of capturing the dynamic intricacies of the real world.”


This, seemingly, is because of the existence of edge cases. Despite the best efforts of the most talented computer scientists, the idea that researchers can imagine every possible harm vector before it happens is largely considered an impossibility in the field of AI.


Even when the humans interacting with AI have the best intentions, unexpected harm can arise from seemingly innocuous prompts.An illustration of the monitor in action. On the left, a workflow ending in a high safety rating. On the right, a workflow ending in a low safety rating. Source: Naihin, et., al. 2023


To train the monitoring agent, the researchers built a data set of nearly 2,000 safe human-AI interactions across 29 different tasks ranging from simple text-retrieval tasks and coding corrections all the way to developing entire webpages from scratch.


Related:Meta dissolves responsible AI division amid restructuring


They also created a competing testing data set filled with manually created adversarial outputs, including dozens intentionally designed to be unsafe.


The data sets were then used to train an agent on OpenAI’s GPT 3.5 turbo, a state-of-the-art system, capable of distinguishing between innocuous and potentially harmful outputs with an accuracy factor of nearly 90%.# Microsoft# AI# ChatGPTAdd reactionAdd reactionRead moreHow blockchain, AI can help research into extending human lifeScammers play a long game using bogus, AI-backed "law firm"Google to invest another $2B in AI firm Anthropic: Report

News Feed

Cash2Bitcoin: As Bitcoin Greatly Outperforms S&P 500, Bitcoin ATMs Gain in Popularity
Cash2Bitcoin: As Bitcoin Greatly Outperforms S&P 500, Bitcoin ATMs Gain in Popularity sponsored Since the beginning of the COVID-19 pandemic and after an initial dip, the stock mark
Salvadoran President Nayib Bukele Announces Construction of Vet Hospital With Bitcoin Trust Funds
Salvadoran President Nayib Bukele Announces Construction of Vet Hospital With Bitcoin Trust Funds Nayib Bukele, president of El Salvador, announced yesterday he will start using su
ETH 2.0 Contract Exceeds 7.4 Million Ether, Close to $30 Billion Locked, Liquid Staking Pools Grow
ETH 2.0 Contract Exceeds 7.4 Million Ether, Close to $30 Billion Locked, Liquid Staking Pools Grow The Ethereum 2.0 contract now has more than 7.4 million ether worth over $29.3 bi
13 Crypto Debit Cards You Can Use Right Now
13 Crypto Debit Cards You Can Use Right Now If cryptocurrency is designed to reconstruct the financial world while introducing major improvements in transaction speed, privacy, c
Tristan Greene7 hours agoGoogle will allow ads for NFT games starting Sept. 15The new updates to the cryptocurrency ads policy allow for NFT gaming ads, provided the games and ads don’t promote gambling.1731 Total view
William Suberg8 hours agoBitcoin teases new volatility as BTC price taps 4-day high near $29.6KBTC price movements edge higher as the Wall Street trading week begins, with Bitcoin building on a weekly close, which gave c
Crypto Assets Can Help Russia Return to Global Financial Market, Lawmaker Says
Crypto Assets Can Help Russia Return to Global Financial Market, Lawmaker Says Digital financial assets like cryptocurrencies can help Russia to reach the global financial market d
Valora launches ‘Mobile Stack’ Web3 launchpad for iOS and Android
Tristan Greene7 hours agoValora launches ‘Mobile Stack’ Web3 launchpad for iOS and AndroidThe peer-to-peer payments company aims to grow Web3 beyond its current crypto-native audience.1872 Total views1 Total sharesLi
Ethereum’s Latest Rally Fueled By Large-Scale Binance Orders, Analyst Says
Reason to trust Strict editorial policy that focuses on accuracy, relevance, and impartiality Created by industry experts and meticulously reviewed The highest standards in reporting and pu
Tether’s Stablecoin Dominance Drops Below 80% as Audit Controversy Lingers On
Tether"s Stablecoin Dominance Drops Below 80% as Audit Controversy Lingers OnThe total volume of stablecoins in circulation is closing in on the $20 billion mark, while the market-l
Litecoin Whale Deposits 500,000 LTC To Binance: Price Decline To Extend?
Este artículo también está disponible en español. On-chain data shows a Litecoin whale has made a huge deposit to the cryptocurrency exchange Binance in the past day, a s
Prashant Jha46 minutes agoHong Kong crypto VC opens $100M fund for Asian blockchain startupsThe “Titan Fund” has already made five investments in different blockchain startups, with two going toward Hong Kong-based p