Anthropic reaches settlement with authors in landmark copyright case

Skye Jacobs · Aug 28, 2025

What just happened? Anthropic has settled its copyright dispute with authors, but key legal questions about AI training remain unresolved. The case suggests that future litigation may focus less on whether AI use is transformative and more on how copyrighted data is obtained.

Anthropic has reached a settlement in a sweeping copyright class action that alleged the company unlawfully collected and stored millions of pirated books to train its AI assistant, Claude. The agreement, revealed in a court filing before US District Judge William Alsup, avoids what could have been a trial with financial exposure on an unprecedented scale in the AI industry.

The litigation, certified earlier this summer as the largest copyright class action on record, drew national attention when Alsup concluded in June that Anthropic may have downloaded approximately seven million books from shadow library sites. Plaintiffs argued that this conduct amounted to wholesale piracy, distinguishable from the narrower question of whether training an AI model on copyrighted works constitutes infringement.

In a pivotal ruling, Alsup split the issues. He held that using copyrighted works to train Anthropic's models qualified as fair use, particularly given the transformative, non-expressive nature of model training. However, he rejected Anthropic's claim that its method of collecting and storing the texts – maintaining them in a "central library" – was protected. That practice, he ruled, fell outside fair use and exposed the company to direct liability for infringement.

The ruling suggests that fair use may shield AI training, but not the unauthorized creation and storage of copyrighted archives.

This bifurcated ruling set up a damages trial scheduled for December, where Anthropic faced statutory damages of up to $150,000 per infringed title. With millions of works in dispute, its potential liability reached into the hundreds of billions of dollars. Some industry analyses suggested exposure could have exceeded $1 trillion if maximum statutory damages applied.

The terms of the settlement remain confidential, and Anthropic declined to comment. Judge Alsup has ordered the parties to seek preliminary approval by September 5. Plaintiffs' counsel Justin Nelson described the resolution as "historic" and promised details in the coming weeks.

The settlement comes just two months after the unusually broad class certification ruling. Although the case began with three named plaintiffs – Andrea Bartz, Charles Graeber, and Kirk Wallace Johnson – Alsup's order opened the door for up to seven million potential rights holders to join. This dramatically magnified Anthropic's risk profile, pushing the litigation into existential territory for both the company and the broader AI industry.

Edward Lee, a copyright law specialist at Santa Clara University, called the development "a stunning turn of events," noting that Anthropic had pursued an aggressive litigation strategy until recently and had even assembled a new trial team. "The company had few defenses left at trial, given Judge Alsup's rulings," Lee told Wired.

This litigation represents one of the earliest comprehensive tests of how copyright law applies to AI training. While Alsup's June opinion supports the argument that ingesting copyrighted works for training may qualify as fair use, he simultaneously drew a bright line around the method of acquisition and storage. That distinction is likely to influence how companies structure their data sourcing practices going forward.

Notably, the case leaves unresolved whether companies can rely on commercially licensed material, as opposed to publicly scraped content, without legal risk. The ruling suggests fair use may protect training, but not the creation of unauthorized archives of copyrighted works for that purpose.

The case also underscores the practical risks associated with class certification in mass copyright disputes. Once Alsup certified a potential plaintiff pool of seven million, the specter of catastrophic liability effectively altered Anthropic's litigation calculus.

While the settlement terms remain sealed, its impact will ripple across the AI sector. The agreement avoids a damages determination that could have bankrupted one of the industry's most prominent young firms, founded in 2021 by former OpenAI researchers and backed heavily by Amazon and Alphabet.

For copyright owners, the settlement marks an early victory in securing concessions from an AI provider without the delays and uncertainty of trial. For technology companies, it serves both as a warning and a blueprint: courts may tolerate the use of copyrighted content for training, but not the method by which that data is acquired.

Permalink to story:

Anthropic reaches settlement with authors in landmark copyright case

Squid Surprise · Aug 28, 2025

"This litigation represents one of the earliest comprehensive tests of how copyright law applies to AI training."

They actually WON that part... the real flaw is that they got all the books via piracy... considering that "regular Joes" who get hit with that tend to face jail time, I'm wondering if that was on the table...

Pirating over 7 million books and using them for monetary gain should be rather cut-and-dry in the legal system... guess everything changes if you use "AI" - and happen to be a big company

dmatheson · Aug 31, 2025

Artists, as usual, get screwed in the findings. It should be up to the artist to decide if they want to allow their work to benefit corporate AI and it's not like they don't have the money to license the work. All this says is that as long as you don't outright steal the copyrighted material (in other words, don't make a copy to your disk, instead process it token by token at the illegal source), it's totally OK.

Anthropic reaches settlement with authors in landmark copyright case

Skye Jacobs

Posts: 1,994 +58

What just happened? Anthropic has settled its copyright dispute with authors, but key legal questions about AI training remain unresolved. The case suggests that future litigation may focus less on whether AI use is transformative and more on how copyrighted data is obtained.

Squid Surprise

Posts: 7,879 +8,175

dmatheson

Posts: 65 +24

Similar threads

Latest posts