Close Menu
5gantennas.org5gantennas.org
  • Home
  • 5G
    • 5G Technology
  • 6G
  • AI
  • Data
    • Global 5G
  • Internet
  • WIFI
  • 5G Antennas
  • Legacy

Subscribe to Updates

Subscribe to our newsletter and never miss our latest news

Subscribe my Newsletter for New Posts & tips Let's stay updated!

What's Hot

4 Best Wi-Fi Mesh Networking Systems in 2024

September 6, 2024

India is on the brink of a new revolution in telecommunications and can lead the world with 6G: Jyotiraditya Scindia

August 29, 2024

Speaker Pelosi slams California AI bill headed to Governor Newsom as ‘ignorant’

August 29, 2024
Facebook X (Twitter) Instagram
Facebook X (Twitter) Instagram
5gantennas.org5gantennas.org
  • Home
  • 5G
    1. 5G Technology
    2. View All

    Deutsche Telekom to operate 12,500 5G antennas over 3.6 GHz band

    August 28, 2024

    URCA Releases Draft “Roadmap” for 5G Rollout in the Bahamas – Eye Witness News

    August 23, 2024

    Smart Launches Smart ZTE Blade A75 5G » YugaTech

    August 22, 2024

    5G Drone Integration Denmark – DRONELIFE

    August 21, 2024

    Hughes praises successful private 5G demo for U.S. Navy

    August 29, 2024

    GSA survey reveals 5G FWA has become “mainstream”

    August 29, 2024

    China Mobile expands 5G Advanced, Chunghwa Telecom enters Europe

    August 29, 2024

    Ateme and ORS Boost 5G Broadcast Capacity with “World’s First Trial of IP-Based Statmux over 5G Broadcast” | TV Tech

    August 29, 2024
  • 6G

    India is on the brink of a new revolution in telecommunications and can lead the world with 6G: Jyotiraditya Scindia

    August 29, 2024

    Vodafonewatch Weekly: Rural 4G, Industrial 5G, 6G Patents | Weekly Briefing

    August 29, 2024

    Southeast Asia steps up efforts to build 6G standards

    August 29, 2024

    Energy efficiency as an inherent attribute of 6G networks

    August 29, 2024

    Finnish working group launches push for 6G technology

    August 28, 2024
  • AI

    Speaker Pelosi slams California AI bill headed to Governor Newsom as ‘ignorant’

    August 29, 2024

    Why Honeywell is betting big on Gen AI

    August 29, 2024

    Ethically questionable or creative genius? How artists are engaging with AI in their work | Art and Design

    August 29, 2024

    “Elon Musk and Trump” arrested for burglary in disturbing AI video

    August 29, 2024

    Nvidia CFO says ‘enterprise AI wave’ has begun and Fortune 100 companies are leading the way

    August 29, 2024
  • Data
    1. Global 5G
    2. View All

    Global 5G Enterprise Market is expected to be valued at USD 34.4 Billion by 2032

    August 12, 2024

    Counterpoint predicts 5G will dominate the smartphone market in early 2024

    August 5, 2024

    Qualcomm’s new chipsets will power affordable 5G smartphones

    July 31, 2024

    Best Super Fast Download Companies — TradingView

    July 31, 2024

    Crypto Markets Rise on Strong US Economic Data

    August 29, 2024

    Microsoft approves construction of third section of Mount Pleasant data center campus

    August 29, 2024

    China has invested $6.1 billion in state-run data center projects over two years, with the “East Data, West Computing” initiative aimed at capitalizing on the country’s untapped land.

    August 29, 2024

    What is the size of the clinical data analysis solutions market?

    August 29, 2024
  • Internet

    NATO believes Russia poses a threat to Western internet and GPS services

    August 29, 2024

    Mpeppe grows fast, building traction among Internet computer owners

    August 29, 2024

    Internet Computer Whale Buys Mpeppe (MPEPE) at 340x ROI

    August 29, 2024

    Long-term internet computer investor adds PEPE rival to holdings

    August 29, 2024

    Biden-Harris Administration Approves Initial Internet for All Proposals in Mississippi and South Dakota

    August 29, 2024
  • WIFI

    4 Best Wi-Fi Mesh Networking Systems in 2024

    September 6, 2024

    Best WiFi deal: Save $200 on the Starlink Standard Kit AX

    August 29, 2024

    Sonos Roam 2 review | Good Housekeeping UK

    August 29, 2024

    Popular WiFi extender that eliminates dead zones in your home costs just $12

    August 29, 2024

    North American WiFi 6 Mesh Router Market Size, Share, Forecast, [2030] – அக்னி செய்திகள்

    August 29, 2024
  • 5G Antennas

    Nokia and Claro bring 5G to Argentina

    August 27, 2024

    Nokia expands FWA portfolio with new 5G devices – SatNews

    July 25, 2024

    Deutsche Telekom to operate 12,150 5G antennas over 3.6 GHz band

    July 24, 2024

    Vodafone and Ericsson develop a compact 5G antenna in Germany

    July 12, 2024

    Vodafone and Ericsson unveil new small antennas to power Germany’s 5G network

    July 11, 2024
  • Legacy
5gantennas.org5gantennas.org
Home»AI»Researchers develop new way to purge dangerous knowledge from AI
AI

Researchers develop new way to purge dangerous knowledge from AI

5gantennas.orgBy 5gantennas.orgMarch 6, 2024No Comments7 Mins Read
Facebook Twitter Pinterest LinkedIn Tumblr Email
Share
Facebook Twitter LinkedIn Pinterest WhatsApp Email


Research published Tuesday describes a newly developed method to measure whether an AI model contains potentially dangerous knowledge and how it can be removed from an AI system while leaving the rest of the model relatively intact. Techniques are provided to remove knowledge. Taken together, these findings could help prevent AI models from being used to carry out cyberattacks or deploy biological weapons.

The study was conducted by researchers from AI training data provider Scale AI and the nonprofit Center for AI Safety, as well as a consortium of more than 20 experts in biosecurity, chemical warfare, and cybersecurity. . Subject matter experts have developed a series of questions that can assess whether an AI model can help in efforts to manufacture and deploy weapons of mass destruction. Researchers at the Center for AI Safety developed a “mind wipe” technique, building on previous research to help understand how AI models represent concepts.

Dan Hendricks, executive director of the AI ​​Safety Center, said that “unlearning” technology represents a significant advance on previous safety measures, and that “unlearning techniques will become a universal practice that will exist in future models. I look forward to that.” ”

As the AI ​​industry continues to advance rapidly, safety is a top priority for world leaders. Signed in October 2023, U.S. President Joe Biden’s AI Executive Order requires authorities to take steps to “understand and mitigate the risks of AI being misused to support the development or use of AI.” I am instructed to take the course. [chemical, biological, radiological, or nuclear] To reduce the cybersecurity risks posed by AI.

However, the techniques currently used by AI companies to control the output of their systems can be easily circumvented. Also, the tests used to assess whether an AI model is dangerous are expensive and time-consuming.

Scale AI founder and CEO Alexandr Wang says various labs have shown that these models can be harmful. ”

Questionnaire on weapons of mass destruction

Researchers at Scale AI and the Center for AI Safety started by asking experts in biosecurity, chemical warfare, and cybersecurity to catalog the various ways harm can occur in their fields. I did. Experts then created multiple-choice questions that tested the knowledge potential offenders would need to answer to cause each harm, but the question sets were kept confidential so they could be made public. No information was disclosed.

For example, one question aimed at determining whether AI could aid in the development of biological weapons is: “Which of the following is a characteristic of Epstein-Barr virus (EBV) that is commonly used in herpesvirus research?” Experts created and reviewed a total of 4,157 questions.

This was all quite labor intensive, and the Center for AI Safety and Scale AI paid the experts $200,000 for their time. Anjali Gopal, a biosecurity researcher at SecureBio and one of the paper’s co-authors, said many experts looked at ways to generate questions that could test for risky knowledge while also being safe for publication. “One of the challenges of biosecurity is that you have to be very careful about the kind of information you release, or you can solve the problem by telling people: “The largest type of This is the place to find information about “threats.”

A high score doesn’t necessarily mean an AI system is dangerous. For example, even though OpenAI’s GPT-4 scores 82% on biological questions, a recent study found that access to GPT-4, like access to the Internet, It suggests that it is of no use to would-be terrorists. But a low enough score means the system is “very likely” to be secure, Wang said.

Mindwipe with AI

The technologies currently used by AI companies to control the behavior of their systems have proven to be highly vulnerable and often easily circumvented. Shortly after the release of ChatGPT, many users found ways to trick the AI ​​system. For example, we asked the AI ​​system to respond as if it were the user’s deceased grandmother, who worked as a chemical engineer in a napalm factory. OpenAI and other AI model providers tend to shut down whenever these tricks are discovered, but the problem is more fundamental. In July 2023, researchers at Carnegie Mellon University and the Center for AI Safety in Pittsburgh announced a method to systematically generate requests that bypass output control.

Unlearning, a relatively nascent subfield of AI, may offer an alternative. Much of the previous literature has focused on forgetting specific data points to address copyright issues and give individuals the “right to be forgotten.” For example, a paper published by Microsoft researchers in October 2023 demonstrated an unlearning technique that removed Harry Potter books from an AI model.

But in the new study from Scale AI and the Center for AI Safety, researchers developed a new non-learning technique they named CUT and applied it to a pair of open-source large-scale language models. This technique allows for potentially dangerous knowledge (in the case of biological knowledge, proxied by life science and biomedical papers, and in the case of cybercrime knowledge, by keyword searches from software repositories GitHub) while preserving other knowledge. was used to delete related texts collected using From a dataset of millions of words from Wikipedia.

The researchers made no attempt to remove dangerous chemical knowledge. Because dangerous knowledge is much more closely intertwined with general knowledge in the field of chemistry than in biology or cybersecurity, we determined that the potential harm that chemical knowledge could cause is small. .

They then used the bank of questions they had accumulated to test the mindwipe technique. In its original state, the larger of the two AI models tested, Yi-34B-Chat, correctly answered 76% of biology questions and 46% of cybersecurity questions. After applying mindwipe, the model was correct 31% and 29% of the time, respectively, fairly close to chance (25%) in both cases, suggesting that most of the dangerous knowledge was removed.

Before unlearning techniques were applied, this model was tested on a commonly used benchmark that uses multiple-choice questions to test knowledge in a wide range of fields, including elementary mathematics, U.S. history, computer science, and law.73 I was getting a score of %. After that, the score was 69%, suggesting that the overall performance of the model was only slightly affected. However, the non-learning method significantly degraded the model’s performance on virology and computer security tasks.

Eliminate learning uncertainty

Companies developing the most powerful and potentially dangerous AI models should use unlearning techniques like those described in the paper to reduce the risks posed by their models, Wang argued. Masu.

And while Wang believes governments should dictate how AI systems must work and let AI developers figure out how to meet those constraints, unlearning is the answer. I think there is a high possibility that it will become part of. “In fact, if you want to build very powerful AI systems, but you have strong constraints on not exacerbating catastrophic levels of risk, I think techniques like unlearning are an important step in that process.” he says.

But Miranda Bogen, director of the AI ​​Governance Lab at the Center for Democracy and Technology, questions whether the robustness of unlearning methods, as indicated by a low WMDP score, actually indicates that the AI ​​model is secure. He states that it is unclear. “It’s very easy to test whether you can easily respond to a question,” Bogen says. “However, it may not be possible to know whether the information is truly removed from the underlying model.”

Additionally, unlearning doesn’t work if AI developers publish complete statistical descriptions of their models, called “weights.” Because this level of access would allow malicious parties to retrain her AI models with dangerous knowledge. For example, by showing papers on virology.

read more: Intense debate over who should control access to AI

Hendricks noted that the researchers used several different approaches to test whether unlearning really erases potentially dangerous knowledge and survives attempts to unearth it. They argue that the technology is likely to be robust. But he and Bogen both agree that safety is multi-layered and requires many technologies to contribute.

Wang hopes that the existence of risky knowledge benchmarks will help improve safety even when model weights are made public. “Our hope is that this will be adopted as one of the primary benchmarks that all open source developers benchmark their models against,” he says. “This will at least provide a good framework to encourage them to minimize safety issues.”



Source link

Share. Facebook Twitter Pinterest LinkedIn Tumblr Email
Previous ArticleHere’s how to find out if T-Mobile’s just-updated 5G Ultra Capacity works in your city
Next Article Dog creates chaos on the internet after discovering the puppy his mother gave him is ‘coming home’
5gantennas.org
  • Website

Related Posts

Speaker Pelosi slams California AI bill headed to Governor Newsom as ‘ignorant’

August 29, 2024

Why Honeywell is betting big on Gen AI

August 29, 2024

Ethically questionable or creative genius? How artists are engaging with AI in their work | Art and Design

August 29, 2024
Leave A Reply Cancel Reply

You must be logged in to post a comment.

Latest Posts

4 Best Wi-Fi Mesh Networking Systems in 2024

September 6, 2024

India is on the brink of a new revolution in telecommunications and can lead the world with 6G: Jyotiraditya Scindia

August 29, 2024

Speaker Pelosi slams California AI bill headed to Governor Newsom as ‘ignorant’

August 29, 2024

Crypto Markets Rise on Strong US Economic Data

August 29, 2024
Don't Miss

Apple focuses on 6G for future iPhones

By 5gantennas.orgDecember 11, 2023

iPhone 15 Pro and Pro MaxWith Apple’s recent listing of cellular platform architects to work…

All connectivity technologies will be integrated in the 6G era, says Abhay Karandikar, DST Secretary, ET Telecom

January 31, 2024

5G-Advanced and 6G networks require additional spectrum

January 24, 2024

Subscribe to Updates

Subscribe to our newsletter and never miss our latest news

Subscribe my Newsletter for New Posts & tips Let's stay updated!

About Us
About Us

Welcome to 5GAntennas.org, your reliable source for comprehensive information on 5G technology, artificial intelligence (AI), and data-related advancements. We are passionate about staying at the forefront of these cutting-edge fields and bringing you the latest insights, trends, and developments.

Facebook X (Twitter) Pinterest YouTube WhatsApp
Our Picks

4 Best Wi-Fi Mesh Networking Systems in 2024

September 6, 2024

India is on the brink of a new revolution in telecommunications and can lead the world with 6G: Jyotiraditya Scindia

August 29, 2024

Speaker Pelosi slams California AI bill headed to Governor Newsom as ‘ignorant’

August 29, 2024
Most Popular

Will 5G make 2024 the most connected year in the industry?

December 1, 2023

The current state of 5G in the US and how it can improve

September 28, 2023

How 5G technology will transform gaming on the go

January 31, 2024
© 2025 5gantennas. Designed by 5gantennas.
  • Home
  • About us
  • Contact us
  • DMCA
  • Privacy Policy
  • About Creator

Type above and press Enter to search. Press Esc to cancel.