paint-brush
Giving Credit Where Credit Is Due: News Outlet Sues AI for Removing Copyright Informationby@legalpdf
New Story

Giving Credit Where Credit Is Due: News Outlet Sues AI for Removing Copyright Information

by Legal PDF: Tech Court CasesAugust 13th, 2024
Read on Terminal Reader
Read this story w/o Javascript
tldt arrow

Too Long; Didn't Read

Defendants, such as OpenAI and Microsoft, are accused of intentionally removing copyright management information—like author names, titles, and copyright notices—from Plaintiff’s works when creating training sets for ChatGPT and Copilot. This omission allegedly allowed the AI models to use copyrighted journalism without proper attribution, potentially infringing on copyright laws.
featured image - Giving Credit Where Credit Is Due: News Outlet Sues AI for Removing Copyright Information
Legal PDF: Tech Court Cases HackerNoon profile picture

The Center for Investigative Reporting Inc. v. OpenAI Court Filing, retrieved on June 27, 2024, is part of HackerNoon’s Legal PDF Series. You can jump to any part in this filing here. This part is 8 of 18.

96. ChatGPT and Copilot do not have any independent knowledge of the information provided in their responses. Rather, to service Defendants’ paying customers, ChatGPT and Copilot instead repackage, among other material, the copyrighted journalism work product developed by Plaintiff, and others, at often considerable expense.


97. When providing responses, ChatGPT and Copilot give the impression that they are an all-knowing, “intelligent” source of the information being provided, when in reality, the responses are frequently based on copyrighted works of journalism that ChatGPT and Copilot simply mimic.


98. If ChatGPT and Copilot were trained on works of journalism that included the original author, title, copyright notice, and terms of use information, they would have learned to communicate that information when providing responses to users unless Defendants trained them otherwise.


99. Based on the information described above, thousands of Plaintiff’s copyrighted works were included in Defendants’ training sets without the author, title, copyright notice, and terms of use information that Plaintiff conveyed in publishing them.


100. Based on the information above, including the OpenAI Defendants’ admission to using the Dragnet and Newspaper extraction methods, which remove author, title, copyright notice, and terms of use information from copyright-protected news articles published online, the OpenAI Defendants intentionally removed author, title, copyright notice, and terms of use information from Plaintiff’s copyrighted works in creating ChatGPT training sets.


Continue Reading Here.


About HackerNoon Legal PDF Series: We bring you the most important technical and insightful public domain court case filings.


This court case retrieved on June 27, 2024, motherjones.com is part of the public domain. The court-created documents are works of the federal government, and under copyright law, are automatically placed in the public domain and may be shared without legal restriction.