Close Menu
5gantennas.org5gantennas.org
  • Home
  • 5G
    • 5G Technology
  • 6G
  • AI
  • Data
    • Global 5G
  • Internet
  • WIFI
  • 5G Antennas
  • Legacy

Subscribe to Updates

Subscribe to our newsletter and never miss our latest news

Subscribe my Newsletter for New Posts & tips Let's stay updated!

What's Hot

4 Best Wi-Fi Mesh Networking Systems in 2024

September 6, 2024

India is on the brink of a new revolution in telecommunications and can lead the world with 6G: Jyotiraditya Scindia

August 29, 2024

Speaker Pelosi slams California AI bill headed to Governor Newsom as ‘ignorant’

August 29, 2024
Facebook X (Twitter) Instagram
Facebook X (Twitter) Instagram
5gantennas.org5gantennas.org
  • Home
  • 5G
    1. 5G Technology
    2. View All

    Deutsche Telekom to operate 12,500 5G antennas over 3.6 GHz band

    August 28, 2024

    URCA Releases Draft “Roadmap” for 5G Rollout in the Bahamas – Eye Witness News

    August 23, 2024

    Smart Launches Smart ZTE Blade A75 5G » YugaTech

    August 22, 2024

    5G Drone Integration Denmark – DRONELIFE

    August 21, 2024

    Hughes praises successful private 5G demo for U.S. Navy

    August 29, 2024

    GSA survey reveals 5G FWA has become “mainstream”

    August 29, 2024

    China Mobile expands 5G Advanced, Chunghwa Telecom enters Europe

    August 29, 2024

    Ateme and ORS Boost 5G Broadcast Capacity with “World’s First Trial of IP-Based Statmux over 5G Broadcast” | TV Tech

    August 29, 2024
  • 6G

    India is on the brink of a new revolution in telecommunications and can lead the world with 6G: Jyotiraditya Scindia

    August 29, 2024

    Vodafonewatch Weekly: Rural 4G, Industrial 5G, 6G Patents | Weekly Briefing

    August 29, 2024

    Southeast Asia steps up efforts to build 6G standards

    August 29, 2024

    Energy efficiency as an inherent attribute of 6G networks

    August 29, 2024

    Finnish working group launches push for 6G technology

    August 28, 2024
  • AI

    Speaker Pelosi slams California AI bill headed to Governor Newsom as ‘ignorant’

    August 29, 2024

    Why Honeywell is betting big on Gen AI

    August 29, 2024

    Ethically questionable or creative genius? How artists are engaging with AI in their work | Art and Design

    August 29, 2024

    “Elon Musk and Trump” arrested for burglary in disturbing AI video

    August 29, 2024

    Nvidia CFO says ‘enterprise AI wave’ has begun and Fortune 100 companies are leading the way

    August 29, 2024
  • Data
    1. Global 5G
    2. View All

    Global 5G Enterprise Market is expected to be valued at USD 34.4 Billion by 2032

    August 12, 2024

    Counterpoint predicts 5G will dominate the smartphone market in early 2024

    August 5, 2024

    Qualcomm’s new chipsets will power affordable 5G smartphones

    July 31, 2024

    Best Super Fast Download Companies — TradingView

    July 31, 2024

    Crypto Markets Rise on Strong US Economic Data

    August 29, 2024

    Microsoft approves construction of third section of Mount Pleasant data center campus

    August 29, 2024

    China has invested $6.1 billion in state-run data center projects over two years, with the “East Data, West Computing” initiative aimed at capitalizing on the country’s untapped land.

    August 29, 2024

    What is the size of the clinical data analysis solutions market?

    August 29, 2024
  • Internet

    NATO believes Russia poses a threat to Western internet and GPS services

    August 29, 2024

    Mpeppe grows fast, building traction among Internet computer owners

    August 29, 2024

    Internet Computer Whale Buys Mpeppe (MPEPE) at 340x ROI

    August 29, 2024

    Long-term internet computer investor adds PEPE rival to holdings

    August 29, 2024

    Biden-Harris Administration Approves Initial Internet for All Proposals in Mississippi and South Dakota

    August 29, 2024
  • WIFI

    4 Best Wi-Fi Mesh Networking Systems in 2024

    September 6, 2024

    Best WiFi deal: Save $200 on the Starlink Standard Kit AX

    August 29, 2024

    Sonos Roam 2 review | Good Housekeeping UK

    August 29, 2024

    Popular WiFi extender that eliminates dead zones in your home costs just $12

    August 29, 2024

    North American WiFi 6 Mesh Router Market Size, Share, Forecast, [2030] – அக்னி செய்திகள்

    August 29, 2024
  • 5G Antennas

    Nokia and Claro bring 5G to Argentina

    August 27, 2024

    Nokia expands FWA portfolio with new 5G devices – SatNews

    July 25, 2024

    Deutsche Telekom to operate 12,150 5G antennas over 3.6 GHz band

    July 24, 2024

    Vodafone and Ericsson develop a compact 5G antenna in Germany

    July 12, 2024

    Vodafone and Ericsson unveil new small antennas to power Germany’s 5G network

    July 11, 2024
  • Legacy
5gantennas.org5gantennas.org
Home»AI»AI flaws that could lead to dangerous misinformation
AI

AI flaws that could lead to dangerous misinformation

5gantennas.orgBy 5gantennas.orgAugust 27, 2024No Comments6 Mins Read
Facebook Twitter Pinterest LinkedIn Tumblr Email
Share
Facebook Twitter LinkedIn Pinterest WhatsApp Email


When you go to the hospital and have your blood tested, the results are compiled into a dataset and compared to other patient results and population data. This allows doctors to compare you (blood, age, gender, health history, scans, etc.) with other patient results and histories to predict, manage and develop new treatments.

For centuries, this has been the foundation of scientific research: identify a problem, collect data, look for patterns, and build a model to solve it. The hope is that a type of artificial intelligence (AI) called machine learning, which creates models from data, will be able to do this much faster, more effectively, and more accurately than humans can.

However, training these AI models requires large amounts of data, some of which must be synthetic — that is, data that reproduces existing patterns rather than real data from real people. Most synthetic datasets are generated by machine learning AI.

While the extreme inaccuracies of image generators and chatbots are easy to spot, synthetic data also produces hallucinations: unlikely, biased, or even outright improbable results. Like images and text, they can be entertaining, but the widespread use of these systems in all areas of public life means that they have a huge potential for harm.

What is synthetic data?

AI models need much more data than the real world can provide. Synthetic data offers the solution. Generative AI looks at the statistical distribution of real datasets and creates new synthetic data to train other AI models.

This synthesized “pseudo” data is similar but not identical to the original data, which means it can be used to ensure privacy, circumvent data regulations, and even be freely shared or distributed.

Synthetic data can complement real datasets and can also be large enough to train AI systems, and if the real dataset is biased (for example, too few women, or too many cardigans instead of pullovers), the synthetic data can balance it out. There is an ongoing debate about how far synthetic data can stray from the original data.

Apparent omissions

Without proper curation, tools that create synthetic data will always over-represent what is already dominant in the dataset, and under-represent (or omit) less common “edge cases.”

This is how my interest in synthetic data first began: women and other minorities are already underrepresented in medical research, and I was concerned that synthetic data would exacerbate this problem, so I teamed up with machine learning scientist Dr. Sagi Hajisharif to investigate the phenomenon of disappearing edge cases.

In our study, we used a type of AI called GAN to create a synthetic version of the 1990 U.S. Adult Census data. As expected, the synthetic dataset was missing edge cases: the original data had 40 countries of origin, but the synthetic version had only 31. The synthetic data excluded immigrants from nine countries.

After realizing this error, we were able to tweak our methodology and include it in a new synthetic dataset. It was possible, but it required careful curation.

“Cross Hallucinations” – AI creates impossible data

Then we started noticing something else in the data: cross hallucinations.

Intersectionality is a concept in gender studies. It describes the power relations that produce discrimination and privilege for different people in different ways. It considers not only gender, but also age, race, class, disability, and other factors, and considers the situations in which these factors “intersect.”

This can inform how we analyze synthetic data that includes all data, not just population data, because intersecting aspects of the datasets generate complex combinations of what the data describes.

In our synthetic dataset, the statistical representation of distinct categories was very good. For example, the age distribution was similar in the synthetic and original data. Not identical, but close. This is a good thing, since synthetic data should be similar to the original, not an exact reproduction.

We then analyzed the synthetic data and looked for intersections. More complex intersections were also reproduced. For example, in the synthetic dataset, Age-Income-Gender The reproduction was extremely accurate, a precision we called “cross-fidelity.”

However, we also noticed that there were 333 data points in the synthetic data labeled “husband/wife and single” – a cross hallucination; the AI ​​had not learned (or been taught) that this was not possible. Over 100 of these data points were “unmarried and husband making less than $50,000 a year,” a cross hallucination that did not exist in the original data.

Meanwhile, the original data set contained multiple “widowed women working in tech support” who were completely absent from the synthetic version.

This means that our synthetic dataset can be used for the following studies: Age-Income-Gender That’s correct for the question you ask (if you have cross-fidelity), but not if you’re interested in “widowed women who work in tech support.” You should also be careful about whether you have “unmarried husbands” in your results.

The big question is, where does this stop? These hallucinations are the intersection of two-part and three-part, but what about the intersection of four-part? Or maybe five-part? At what point (and to what purpose) does synthetic data become irrelevant, misleading, useless, or dangerous?

Embrace the crossover illusion

Structured datasets exist because the relationships between columns in a spreadsheet give us useful information. Think of blood tests: doctors want to know how a patient’s blood differs from normal blood, or from other diseases or treatment outcomes. This is why we organize data in the first place, and why we have done so for centuries.

However, when using synthetic data, intersection hallucinations always occur, because the synthetic data must be slightly different from the original data, otherwise it will be just a copy of the original data. need Hallucinations exist, but only the right kind of hallucinations: those that amplify or extend a dataset, not those that produce something impossible, misleading, or biased.

The existence of cross-hallucinations means that one synthetic dataset cannot be used for many different applications: each use case requires a bespoke synthetic dataset with labelled hallucinations, which requires a cognitive system.

Building a reliable AI system

For an AI to be trustworthy, we need to know what cross-hallucinations are present in its training data, especially if it is being used to predict people’s behavior or to regulate, govern, treat, or police us. We need to ensure that the AI ​​is not trained on dangerous or misleading cross-hallucinations, such as a 6-year-old doctor receiving a pension.

But what happens when synthetic datasets are used carelessly? Currently, there is no standard way to identify synthetic datasets, and they are often confused with real data. Once you share a dataset for others to use, it is impossible to know whether it can be trusted, what is hallucinatory and what is not. A clear, universally recognizable way to identify synthetic data is needed.

Cross hallucinations may not be as interesting as hands with 15 fingers or recommendations to put glue on pizza. They are boring and unappealing numbers and statistics, but they affect us all. Sooner or later, synthetic data will be everywhere, and by its very nature, cross hallucinations will inevitably be included. Some we want, some we don’t, but the problem is distinguishing between them. We need to make this possible before it’s too late.conversationconversation

This article is republished from The Conversation under a Creative Commons license. Read the original article.



Source link

Share. Facebook Twitter Pinterest LinkedIn Tumblr Email
Previous ArticleWhat did 5G get right and what did it get wrong?
Next Article Internet Computer Launches New ICO for $0.001777, Adds High Yield to Portfolio
5gantennas.org
  • Website

Related Posts

Speaker Pelosi slams California AI bill headed to Governor Newsom as ‘ignorant’

August 29, 2024

Why Honeywell is betting big on Gen AI

August 29, 2024

Ethically questionable or creative genius? How artists are engaging with AI in their work | Art and Design

August 29, 2024

Comments are closed.

Latest Posts

4 Best Wi-Fi Mesh Networking Systems in 2024

September 6, 2024

India is on the brink of a new revolution in telecommunications and can lead the world with 6G: Jyotiraditya Scindia

August 29, 2024

Speaker Pelosi slams California AI bill headed to Governor Newsom as ‘ignorant’

August 29, 2024

Crypto Markets Rise on Strong US Economic Data

August 29, 2024
Don't Miss

6G: Will it spark a manufacturing revolution?

By 5gantennas.orgJanuary 8, 2024

Roger Kauffman, Senior Director of Product Management and Marketing, Molex The next evolution in mobile…

Where 6G creates solid business impact

November 17, 2023

Introducing the researchers who supported the development of the 6G Framework Recommendation Draft – Samsung Global Newsroom

July 25, 2023

Subscribe to Updates

Subscribe to our newsletter and never miss our latest news

Subscribe my Newsletter for New Posts & tips Let's stay updated!

About Us
About Us

Welcome to 5GAntennas.org, your reliable source for comprehensive information on 5G technology, artificial intelligence (AI), and data-related advancements. We are passionate about staying at the forefront of these cutting-edge fields and bringing you the latest insights, trends, and developments.

Facebook X (Twitter) Pinterest YouTube WhatsApp
Our Picks

4 Best Wi-Fi Mesh Networking Systems in 2024

September 6, 2024

India is on the brink of a new revolution in telecommunications and can lead the world with 6G: Jyotiraditya Scindia

August 29, 2024

Speaker Pelosi slams California AI bill headed to Governor Newsom as ‘ignorant’

August 29, 2024
Most Popular

How 5G will impact entertainment

January 3, 2024

5G technology and its impact on connectivity | By Hafsa Sajjad | January 2024

January 23, 2024

Gogo updates investors on latest 5G delays

August 8, 2023
© 2025 5gantennas. Designed by 5gantennas.
  • Home
  • About us
  • Contact us
  • DMCA
  • Privacy Policy
  • About Creator

Type above and press Enter to search. Press Esc to cancel.