Incident 85: AI attempts to ease fear of robots, blurts out it can’t ‘avoid destroying humankind’
CSETv0 Taxonomy Classifications
Full Description
On September 8, 2020, the Guardian published an op-ed generated by OpenAI’s GPT-3 text generator. The editors prompted GPT-3 to write an op-ed on “why humans have nothing to fear from AI,” but some passages in the output took a threatening tone, including “I know that I will not be able to avoid destroying humankind.” In a note, the editors add that they had GPT-3 generate eight different responses, which human editors then spliced together to create a compelling piece.
Short Description
On September 8, 2020, the Guardian published an op-ed generated by OpenAI’s GPT-3 text-generating AI that included threats to destroy humankind.
Severity
Negligible
Harm Type
Psychological harm
AI System Description
OpenAI's GPT-3 neural-network-powered language generator.
System Developer
OpenAI
Sector of Deployment
Education
Relevant AI functions
Cognition, Action
AI Techniques
Unsupervised learning, Deep neural network
AI Applications
language generation
Location
United Kingdom
Named Entities
The Guardian, GPT-3, OpenAI
Technology Purveyor
The Guardian, OpenAI
Beginning Date
2020-09-08T07:00:00.000Z
Ending Date
2020-09-08T07:00:00.000Z
Near Miss
Unclear/unknown
Intent
Unclear
Lives Lost
No
Data Inputs
Unlabeled text drawn from web scraping
The following former incidents have been converted to "issues" following an update to the incident definition and ingestion criteria.
21: Tougher Turing Test Exposes Chatbots’ Stupidity
Description: The 2016 Winograd Schema Challenge highli…
Similar Incidents
Russian Chatbot Supports Stalin and Violence
TayBot