Back to Articles
Major Publishers Sue Meta for Copyright Infringement Over AI Training

The Guardian

ENRICHED

Details

Date Published
5 May 2026
Priority Score
2
Australian
No
Created
5 May 2026, 06:00 pm

Authors (1)

Description

Hachette, Macmillan and others allege that Meta pirated millions of works from textbooks to novels for Llama model

Summary

Five major publishers, including Elsevier and Macmillan, have filed a class-action lawsuit against Meta alleging the unauthorized use of millions of copyrighted works to train Llama large language models. The litigation centers on the 'fair use' doctrine and could fundamentally reshape the data acquisition norms for frontier AI development. While primarily a copyright dispute, the case impacts AI safety by potentially restricting the datasets available for training high-capability models and forcing greater transparency in AI supply chains. The outcome will likely influence global AI governance frameworks regarding the balance between intellectual property rights and the rapid advancement of generative AI capabilities.

Body

The Meta CEO, Mark Zuckerberg, testifies during a Senate judiciary committee hearing in Washington DC on 31 January 2024. Photograph: Brendan Smialowski/AFP via Getty ImagesView image in fullscreenThe Meta CEO, Mark Zuckerberg, testifies during a Senate judiciary committee hearing in Washington DC on 31 January 2024. Photograph: Brendan Smialowski/AFP via Getty ImagesMajor publishers sue Meta for copyright infringement over AI trainingHachette, Macmillan and others allege that Meta pirated millions of works from textbooks to novels for Llama modelFive major publishers sued Meta Platforms in Manhattan federal court on Tuesday, alleging that the tech giant misused their books and journal articles to train its artificial intelligence models.Elsevier, Cengage, Hachette, Macmillan and McGraw Hill, as well as author Scott Turow, alleged in the proposed class-action complaint that Meta pirated millions of their works and used them without permission to train its Llama large language models to respond to human prompts.“Meta’s mass-scale infringement isn’t public progress, and AI will never be properly realized if tech companies prioritize pirate sites over scholarship and imagination,” Maria Pallante, the president of the Association of American Publishers, said in a statement.Meta has denied any wrongdoing.Families sue OpenAI over failure to report Canada mass shooter’s behavior on ChatGPTRead more“AI is powering transformative innovations, productivity and creativity for individuals and companies, and courts have rightly found that training AI on copyrighted material can qualify as fair use,” a Meta spokesperson responded in a statement on Tuesday. “We will fight this lawsuit aggressively.“The publishers allege that Meta pirated works ranging from textbooks to scientific articles to novels including The Fifth Season by NK Jemisin and The Wild Robot by Peter Brown for its AI training. They asked the court for permission to represent a larger class of copyright owners and an unspecified amount of monetary damages.The lawsuit opens a new front in the ongoing copyright battle between creators and tech companies over AI training, in which dozens of authors, news outlets, visual artists and other plaintiffs have sued companies including Meta, OpenAI and Anthropic for infringement. All of the pending cases are likely to revolve around whether AI systems make fair use of copyrighted material by using it to create new, transformative content. The first two judges to consider the matter issued diverging rulings last year. Amazon- and Google-backed Anthropic was the first major AI company to settle one of the cases, agreeing last year to pay a group of authors $1.5bn to resolve a class-action lawsuit that could have cost the company billions more in damages for alleged piracy. The New York Times has sued OpenAI and Microsoft for copyright infringement as well.Explore more on these topicsMetaAI (artificial intelligence)PublishingLaw (US)New YorknewsShareReuse this content