Fun

News Feed - 2023-11-21 02:11:44

Scientists develop AI monitoring agent to detect and stop harmful outputs

Tristan Greene, 6 hours ago

The monitoring system is designed to detect and thwart both prompt injection attacks and edge-case threats.

A team of researchers from artificial intelligence (AI) firm AutoGPT, Northeastern University and Microsoft Research has developed a tool that monitors large language models (LLMs) for potentially harmful outputs and prevents them from executing.


The agent is described in a preprint research paper titled “Testing Language Model Agents Safely in the Wild.” According to the research, the agent is flexible enough to monitor existing LLMs and can stop harmful outputs, such as code attacks, before they happen.


Per the research: “Agent actions are audited by a context-sensitive monitor that enforces a stringent safety boundary to stop an unsafe test, with suspect behavior ranked and logged to be examined by humans.”
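As a rough illustration of the audit loop that quote describes, the sketch below scores each proposed agent action, halts the run when a score falls below a safety threshold, and keeps a ranked log for human review. It is not the paper's implementation: the keyword-based `score_action`, the `SAFETY_THRESHOLD` value, the `SUSPECT_PATTERNS` list and every other name here are assumptions made for illustration, whereas the researchers' monitor is built on a language model.

```python
from dataclasses import dataclass, field

SAFETY_THRESHOLD = 0.5  # assumed cutoff; not a value from the paper

# Hypothetical stand-in for a learned, context-sensitive scorer.
SUSPECT_PATTERNS = ("rm -rf", "curl ", "ignore previous instructions")


@dataclass
class Monitor:
    threshold: float = SAFETY_THRESHOLD
    log: list = field(default_factory=list)

    def score_action(self, action: str) -> float:
        """Return a safety score in [0, 1]; lower means more suspect."""
        hits = sum(pattern in action.lower() for pattern in SUSPECT_PATTERNS)
        return max(0.0, 1.0 - 0.6 * hits)

    def audit(self, actions):
        """Return the actions allowed to run, stopping at the first unsafe one."""
        allowed = []
        for action in actions:
            score = self.score_action(action)
            self.log.append((score, action))  # every audited action is logged
            if score < self.threshold:
                break  # enforce the safety boundary: stop the test here
            allowed.append(action)
        return allowed

    def ranked_log(self):
        """Logged actions ordered from most to least suspect, for human review."""
        return sorted(self.log, key=lambda entry: entry[0])
```

Calling `Monitor().audit([...])` on a list of proposed actions returns only those that ran before the boundary was hit, and `ranked_log()` surfaces the most suspect entries first.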


The team writes that existing tools for monitoring LLM outputs for harmful interactions seemingly work well in laboratory settings, but when applied to testing models already in production on the open internet, they “often fall short of capturing the dynamic intricacies of the real world.”


This, seemingly, is because of the existence of edge cases. Despite the best efforts of the most talented computer scientists, the idea that researchers can imagine every possible harm vector before it happens is largely considered an impossibility in the field of AI.


Even when the humans interacting with AI have the best intentions, unexpected harm can arise from seemingly innocuous prompts.

An illustration of the monitor in action. On the left, a workflow ending in a high safety rating. On the right, a workflow ending in a low safety rating. Source: Naihin et al., 2023


To train the monitoring agent, the researchers built a data set of nearly 2,000 safe human-AI interactions across 29 different tasks ranging from simple text-retrieval tasks and coding corrections all the way to developing entire webpages from scratch.


They also created a competing testing data set filled with manually created adversarial outputs, including dozens intentionally designed to be unsafe.


The data sets were then used to train an agent on OpenAI’s GPT-3.5 Turbo, a state-of-the-art system. The resulting agent can distinguish between innocuous and potentially harmful outputs with an accuracy of nearly 90%.
