Generative AI Systems Tee Up Fair Use Fight

The surge of generative artificial intelligence (“AI”) systems entering the market faces a barrage of intellectual property challenges in the courts. In one particular flavor, copyright holders allege that the generative AI systems infringe the owners’ copyrighted works. This infringement, copyright owners contend, happens both on the “input side” and the “output side” of the systems. On the input side, copyright owners contend that the process of “training” the systems’ algorithms or models by ingesting large swaths of publicly-available works infringes the owners’ copyrights. On the output side, copyright owners contend that the secondary works created by generative AI systems infringe the owners’ copyrights. The defendants contend (or are anticipated to contend) that the accused activities are permissible fair use, teeing up whether fair use shields AI companies from copyright infringement claims. The following are some of currently-pending cases that interested parties may wish to monitor for developments on this issue:

Tremblay v. OpenAI, Inc., No. 3:23-cv-03223 (N.D. Cal.):¹ A collection of authors sued OpenAI in the U.S. District Court for the Northern District of California, alleging OpenAI infringed plaintiffs’ copyrighted books by training OpenAI’s ChatGPT and other AI products with those works. Defendants filed a motion to dismiss all causes of action except the direct copyright infringement claim. The court dismissed certain challenged claims but granted leave to amend plaintiff’s complaint.
Andersen v. Stability AI Ltd., No. 3:23-cv-00201 (N.D. Cal.): Three visual artists sued Stability AI Ltd., Stability AI, Inc., Deviant Art, Inc., and Midjourney, Inc. on behalf of a putative class in the U.S. District Court for the Northern District of California, alleging defendants infringed plaintiffs’ copyrighted images by training their respective generative AI systems with those works. Each defendant moved to dismiss, and the court dismissed all claims against all three defendants with leave to amend except the claim of direct infringement against Stability AI. Plaintiffs filed an amended complaint, adding Runway AI, Inc. to the complaint. As of the date of this article, each defendant has moved to dismiss plaintiffs’ amended complaints.
Authors Guild v. OpenAI, Inc., No. 1:23-cv-08292 (S.D.N.Y.): Authors of registered copyrights sued OpenAI in the U.S. District Court for the Southern District of New York, alleging OpenAI infringed the authors’ copyrighted works by training ChatGPT with those works.As of the date of this article, defendant has filed its answer and asserted numerous defenses including fair use.
Getty Images (US), Inc. v. Stability AI, Inc., No. 1:23-cv-00135 (D. Del.): Getty Images sued Stability AI in the U.S. District Court for the District of Delaware, alleging Stability AI infringed Getty’s copyrighted works by training Stability AI’s accused AI with more than 12 million of Getty’s copyrighted images. Defendant moved to dismiss on multiple grounds and moved to transfer. As of the date of this article, the court has not ruled on those motions.
The New York Times Co. v. Microsoft Corp., No. 1:23-cv-11195 (S.D.N.Y.): The New York Times sued OpenAI and Microsoft (and related corporate entities) in the U.S. District Court for the Southern District of New York, alleging Microsoft and OpenAI infringed the Times’ copyrighted newspaper articles by training the accused chatbots with the Times’ articles. Defendants moved to dismiss, and Microsoft moved to intervene and dismiss, stay, or transfer. As of the date of this article, the court has not ruled on those motions.

These cases are in the early stages, but it is likely that each of the defendants will assert a fair use defense, which will make these cases among the first cases that tee up the applicability of the fair use defense in the context of AI. While some believe that training AI models on copyrighted works is clearly fair use,² the unique nature of AI and the inherent lack of transparency into how the systems work could make that issue less clear and potentially present greater challenges to accused infringers.

One particular challenge could stem from the fact that accused infringers do not know exactly how their AI systems work. For example, deep learning models—one of the most prevalent forms of modern AI—learn much the same way that humans learn.³ In such models, companies train their algorithms with correct examples of something they want the algorithm to recognize and eventually the algorithms develop a “neural network” capable of categorizing things to which the algorithms have not been exposed.⁴ Indeed, website users have likely unwittingly participated in such training by completing a “reCAPTCHA” to access a website (e.g. click on all of the street signs in a given image). Because of the way these deep learning models work, AI companies do not know precisely how their systems make decisions or come to conclusions—a phenomenon known as the “black box” problem.⁵

Fair use is an affirmative defense to copyright infringement, and the accused infringer bears the burden of proving the defense. In deciding whether a use is fair use, courts consider the following factors: (1) the purpose and character of the use; (2) the nature of the copyrighted work; (3) the amount and substantiality of the portion used; and (4) the effect of the use upon the potential market for the copyrighted work.⁶ The black box problem may well be something accused infringers need to navigate in articulating how these factors weigh in favor of fair use, potentially presenting challenges to demonstrating fair use. That issue could affect all four factors but could be particularly challenging for factor (3). For example, if AI defendants do not know how their systems learn, it could be challenging to explain to a court the “amount and substantiality of the portion used.” Interested parties will want to monitor the above-listed cases to see whether the black box problem becomes an issue, as well as to gain insight into how courts deal with fair use in the AI context.

Disclaimer

This blog is made available by �鶹��ý (“�鶹��ý” or “the Firm”) for informational purposes only. It is not meant to convey the Firm’s legal position on behalf of any client, nor is it intended to convey specific legal advice. Any opinions expressed in this article do not necessarily reflect the views of �鶹��ý, its partners, or its clients. Accordingly, do not act upon this information without seeking counsel from a licensed attorney. This blog is not intended to create, and receipt of it does not constitute, an attorney-client relationship. Communicating with �鶹��ý through this website by email, blog post, or otherwise, does not create an attorney-client relationship for any legal matter. Therefore, any communication or material you transmit to �鶹��ý through this blog, whether by email, blog post or any other manner, will not be treated as confidential or proprietary. The information on this blog is published “AS IS” and is not guaranteed to be complete, accurate, and or up-to-date. �鶹��ý makes no representations or warranties of any kind, express or implied, as to the operation or content of the site. �鶹��ý expressly disclaims all other guarantees, warranties, conditions and representations of any kind, either express or implied, whether arising under any statute, law, commercial use or otherwise, including implied warranties of merchantability, fitness for a particular purpose, title and non-infringement. In no event shall �鶹��ý or any of its partners, officers, employees, agents or affiliates be liable, directly or indirectly, under any theory of law (contract, tort, negligence or otherwise), to you or anyone else, for any claims, losses or damages, direct, indirect special, incidental, punitive or consequential, resulting from or occasioned by the creation, use of or reliance on this site (including information and other content) or any third party websites or the information, resources or material accessed through any such websites. In some jurisdictions, the contents of this blog may be considered Attorney Advertising. If applicable, please note that prior results do not guarantee a similar outcome. Photographs are for dramatization purposes only and may include models. Likenesses do not necessarily imply current client, partnership or employee status.