Tue Jul 18

FTC probe of OpenAI: Consumer protection is the opening salvo of US AI regulation

Written by Anjana Susarla, Professor of Information Systems, Michigan State University

The Federal Trade Commission has launched an investigation of ChatGPT maker OpenAI for potential violations of consumer protection laws ^[1]. The FTC sent the company a 20-page demand for information in the week of July 10, 2023. The move comes as European regulators have begun to take action ^[2], and Congress is working on legislation ^[3] to regulate the artificial intelligence industry.

The FTC has asked OpenAI to provide details of all complaints the company has received from users regarding “false, misleading, disparaging, or harmful ^[4]” statements put out by OpenAI, and whether OpenAI engaged in unfair or deceptive practices relating to risks of harm to consumers, including reputational harm. The agency has asked detailed questions about how OpenAI obtains its data, how it trains its models, the processes it uses for human feedback, risk assessment and mitigation, and its mechanisms for privacy protection.

As a researcher of social media and AI ^[5], I recognize the immensely transformative potential of generative AI models, but I believe that these systems pose risks ^[6]. In particular, in the context of consumer protection, these models can produce errors, exhibit biases and violate personal data privacy.

Hidden power

At the heart of chatbots such as ChatGPT and image generation tools such as DALL-E lies the power of generative AI models that can create realistic content from text, images, audio and video inputs. These tools can be accessed through a browser or a smartphone app.

Since these AI models have no predefined use ^[7], they can be fine-tuned for a wide range of applications in a variety of domains ranging from finance to biology. The models, trained on vast quantities of data, can be adapted for different tasks with little to no coding and sometimes as easily as by describing a task in simple language.

Given that AI models such as GPT-3 and GPT-4 were developed by private organizations using proprietary data sets, the public doesn’t know the nature of the data used to train them ^[8]. The opacity of training data and the complexity of the model architecture – GPT-3 was trained on over 175 billion variables or “parameters”^[9] – make it difficult for anyone to audit these models. Consequently, it’s difficult to prove that the way they are built or trained causes harm ^[10].

Hallucinations

In language model AIs, a hallucination is a confident response that is inaccurate and seemingly not justified by a model’s training data ^[11]. Even some generative AI models that were designed to be less prone to hallucinations have amplified them ^[12].

There is a danger that generative AI models can produce incorrect or misleading information that can end up being damaging to users. A study investigating ChatGPT’s ability to generate factually correct scientific writing in the medical field found that ChatGPT ended up either generating citations to nonexistent papers or reporting nonexistent results ^[13]. My collaborators and I found similar patterns ^[14] in our investigations.

Such hallucinations can cause real damage when the models are used without adequate supervision. For example, ChatGPT falsely claimed that a professor it named had been accused of sexual harassment ^[15]. And a radio host has filed a defamation lawsuit against OpenAI ^[16] regarding ChatGPT falsely claiming that there was a legal complaint against him for embezzlement.

Bias and discrimination

Without adequate safeguards or protections, generative AI models trained on vast quantities of data collected from the internet can end up replicating existing societal biases. For example, organizations that use generative AI models to design recruiting campaigns could end up unintentionally discriminating against some groups of people.

When a journalist asked DALL-E 2 to generate images of “a technology journalist writing an article about a new AI system that can create remarkable and strange images,” it generated only pictures of men ^[17]. An AI portrait app exhibited several sociocultural biases ^[18], for example by lightening the skin color of an actress.

Data privacy

Another major concern, especially pertinent to the FTC investigation, is the risk of privacy breaches where the AI may end up revealing sensitive or confidential information. A hacker could gain access to sensitive information about people whose data was used to train an AI model.

Researchers have cautioned about risks from manipulations called prompt injection attacks, which can trick generative AI into giving out information that it shouldn’t ^[19]. “Indirect prompt injection” attacks could trick AI models ^[20] with steps such as sending someone a calendar invitation with instructions for their digital assistant to export the recipient’s data and send it to the hacker.

A man in a business suit stands with his right hand raised in a wood-paneled room.

OpenAI CEO Sam Altman testified before a Senate Judiciary subcommittee on May 16, 2023. AI regulation legislation is in the works, but the FTC beat Congress to the punch. AP Photo/Patrick Semansky ^[21]

Some solutions

The European Commission has published ethical guidelines for trustworthy AI ^[22] that include an assessment checklist for six different aspects of AI systems: human agency and oversight; technical robustness and safety; privacy and data governance; transparency, diversity, nondiscrimination and fairness; societal and environmental well-being; and accountability.

Better documentation of AI developers’ processes can help in highlighting potential harms. For example, researchers of algorithmic fairness have proposed model cards ^[23], which are similar to nutritional labels for food. Data statements ^[24] and datasheets ^[25], which characterize data sets used to train AI models, would serve a similar role.

Amazon Web Services, for instance, introduced AI service cards that describe the uses and limitations of some models it provides ^[26]. The cards describe the models’ capabilities, training data and intended uses.

The FTC’s inquiry hints that this type of disclosure may be a direction that U.S. regulators take. Also, if the FTC finds OpenAI has violated consumer protection laws, it could fine the company or put it under a consent decree.

References

^{^} potential violations of consumer protection laws (www.washingtonpost.com)
^{^} have begun to take action (www.europarl.europa.eu)
^{^} Congress is working on legislation (www.washingtonpost.com)
^{^} false, misleading, disparaging, or harmful (www.washingtonpost.com)
^{^} As a researcher of social media and AI (scholar.google.com)
^{^} pose risks (doi.org)
^{^} these AI models have no predefined use (ainowinstitute.org)
^{^} the nature of the data used to train them (www.judiciary.senate.gov)
^{^} trained on over 175 billion variables or “parameters” (developer.nvidia.com)
^{^} difficult to prove that the way they are built or trained causes harm (www.wired.com)
^{^} inaccurate and seemingly not justified by a model’s training data (doi.org)
^{^} have amplified them (dx.doi.org)
^{^} generating citations to nonexistent papers or reporting nonexistent results (doi.org)
^{^} found similar patterns (doi.org)
^{^} falsely claimed that a professor it named had been accused of sexual harassment (www.washingtonpost.com)
^{^} defamation lawsuit against OpenAI (news.bloomberglaw.com)
^{^} it generated only pictures of men (spectrum.ieee.org)
^{^} exhibited several sociocultural biases (doi.org)
^{^} can trick generative AI into giving out information that it shouldn’t (doi.org)
^{^} could trick AI models (doi.org)
^{^} AP Photo/Patrick Semansky (newsroom.ap.org)
^{^} ethical guidelines for trustworthy AI (digital-strategy.ec.europa.eu)
^{^} have proposed model cards (doi.org)
^{^} Data statements (doi.org)
^{^} datasheets (doi.org)
^{^} some models it provides (www.deeplearning.ai)

Authors: Anjana Susarla, Professor of Information Systems, Michigan State University

FTC probe of OpenAI: Consumer protection is the opening salvo of US AI regulation

Hidden power

Hallucinations

Bias and discrimination

Data privacy

Some solutions

References

Cold weather brings itchy, irritated, dry and scaly skin – here's how to treat eczema and other skin conditions and when to see a doctor

Jobs are up, wages less so – and lower purchasing power could still lead the US into a recession

Inflation is heating up again, putting pressure on Trump to cool it on tariffs

Why sending a belated gift is not as bad as you probably think − and late is better than never

Why using fear to promote COVID-19 vaccination and mask wearing could backfire

El Salvador voters set to trade democracy for promise of security in presidential election

John Fetterman might be the first to try to bare his legs in the Senate, but shorts have been ticking people off for almost a century

Map wars in the Middle East: How cartographers charted and helped shape a regional conflict

There's a way for modern medicine to cure diseases even when the treatments aren't profitable

Sketchy darknet websites are taking advantage of the COVID-19 pandemic – buyer beware