Fun

News Feed - 2023-11-21 02:11:44

Tristan Greene6 hours agoScientists develop AI monitoring agent to detect and stop harmful outputsThe monitoring system is designed to detect and thwart both prompt injection attacks and edge-case threats.2877 Total views10 Total sharesListen to article 0:00NewsJoin us on social networksA team of researchers from artificial intelligence (AI) firm AutoGPT, Northeastern University and Microsoft Research have developed a tool that monitors large language models (LLMs) for potentially harmful outputs and prevents them from executing. 


The agent is described in a preprint research paper titled “Testing Language Model Agents Safely in the Wild.” According to the research, the agent is flexible enough to monitor existing LLMs and can stop harmful outputs, such as code attacks, before they happen.


Per the research:“Agent actions are audited by a context-sensitive monitor that enforces a stringent safety boundary to stop an unsafe test, with suspect behavior ranked and logged to be examined by humans.”


The team writes that existing tools for monitoring LLM outputs for harmful interactions seemingly work well in laboratory settings, but when applied to testing models already in production on the open internet, they “often fall short of capturing the dynamic intricacies of the real world.”


This, seemingly, is because of the existence of edge cases. Despite the best efforts of the most talented computer scientists, the idea that researchers can imagine every possible harm vector before it happens is largely considered an impossibility in the field of AI.


Even when the humans interacting with AI have the best intentions, unexpected harm can arise from seemingly innocuous prompts.An illustration of the monitor in action. On the left, a workflow ending in a high safety rating. On the right, a workflow ending in a low safety rating. Source: Naihin, et., al. 2023


To train the monitoring agent, the researchers built a data set of nearly 2,000 safe human-AI interactions across 29 different tasks ranging from simple text-retrieval tasks and coding corrections all the way to developing entire webpages from scratch.


Related:Meta dissolves responsible AI division amid restructuring


They also created a competing testing data set filled with manually created adversarial outputs, including dozens intentionally designed to be unsafe.


The data sets were then used to train an agent on OpenAI’s GPT 3.5 turbo, a state-of-the-art system, capable of distinguishing between innocuous and potentially harmful outputs with an accuracy factor of nearly 90%.# Microsoft# AI# ChatGPTAdd reactionAdd reactionRead moreHow blockchain, AI can help research into extending human lifeScammers play a long game using bogus, AI-backed "law firm"Google to invest another $2B in AI firm Anthropic: Report

News Feed

Mt. Gox shifts $2.5B in Bitcoin to unknown wallet, repayments top 40%
Tom Mitchelhill3 hours agoMt. Gox shifts $2.5B in Bitcoin to unknown wallet, repayments top 40%Mt. Gox transferred 37,477 BTC to a new wallet, while data shows that 40% of creditor repayments have now been distributed.23
Opera Mini’s crypto wallet MiniPay now offers USDT and USDC
Helen Partz2 hours agoOpera Mini’s crypto wallet MiniPay now offers USDT and USDCSince launching in September 2023, Opera Mini’s MiniPay app has amassed three million users.3085 Total views14 Total sharesListen to ar
Jamaican Central Bank Says It Has ‘Successfully Completed CBDC Pilot’
Jamaican Central Bank Says It Has "Successfully Completed CBDC Pilot" The Jamaican central bank successfully completed the pilot testing of its central bank digital currency, a sta
Yashu Gola9 hours agoWhy is the crypto market down today?The crypto market is down today as traders assess the latest Curve Finance hack and the SEC"s potential to target all altcoins in the future.311059 Total views526
Report: Nearly 13,000 Chinese Social Media Accounts Promoting Virtual Currency Closed
Report: Nearly 13,000 Chinese Social Media Accounts Promoting Virtual Currency Closed Nearly 13,000 Chinese social media accounts that allegedly promoted virtual currency investmen
Brian Quarmby3 hours agoBTC hodlers outperformed crypto funds by 69% in H1: 21e6 CapitalAccording to 21e6 Capital AG, crypto funds generally outperformed the price gains of BTC in previous bull runs, but they ultimately
William Suberg17 hours agoBitcoin price dives 2% on US jobs data as Fed rate hike bets heat upBitcoin briefly heads back down to $27,000 thanks to unexpected non-farm payroll numbers, with BTC price staging a strong reco
Belgian Banking Group KBC Creates Blockchain-Based Coin
Belgian Banking Group KBC Creates Blockchain-Based Coin KBC Group, a major European banking and insurance institution headquartered in Belgium, has launched a token based on a bloc
Tokenized RWAs are ’a $30-trillion opportunity’ — Polygon exec
Alex O’Donnell7 hours agoTokenized RWAs are ’a $30-trillion opportunity’ — Polygon execHigh-net-worth individuals and private equity funds will drive adoption, said Colin Butler.1615 Total views6 Total sharesList
Alice Ivey10 hours agoHow to protect your privacy onlineDiscover effective strategies to maintain online privacy and learn how to safeguard your personal information while navigating the digital landscape.557 Total views
US City Installs Crypto ATM at Airport After Accepting Cryptocurrency for Payments
US City Installs Crypto ATM at Airport After Accepting Cryptocurrency for Payments The U.S. city of Williston in North Dakota is installing a cryptocurrency ATM at its internationa
Biggest Movers: DOGE, SHIB Slip Following ECB Rate Hike
Biggest Movers: DOGE, SHIB Slip Following ECB Rate Hike Dogecoin and shiba inu fell by as much as 5% in today’s session, as markets reacted to the European Central Bank (ECB) rat