Judge Decides AI Firms Can Utilize Certain Copyrighted Materials for Their Training Data

This week, a federal judge handed AI companies a major win, potentially setting a legal precedent for the industry to plunder copyrighted materials to train their large language models.

Anthropic, the large AI company backed by Amazon, has been in a pitched legal battle with a group of writers and journalists who sued the company last summer and accused it of illegally using their works to train the company’s flagship chatbot, Claude. The legality of the AI industry’s entire business model has long depended on the question of whether it is kosher to hoover up large amounts of copyrighted data from all over the web and then feed it into an algorithm to produce “original” text. Anthropic has maintained that its use of the writers’ work falls under fair use and is therefore legal. This week, the federal judge presiding over the case, William Alsup, partially agreed.

In his ruling, Alsup claimed that, by training its LLM without the authors’ permission, Anthropic did not infringe on copyrighted materials because the work it produced was, in his eyes, original. He claimed that the company’s algorithms have…

“…not reproduced to the public a given work’s creative elements, nor even one author’s identifiable expressive style…Yes, Claude has outputted grammar, composition, and style that the underlying LLM distilled from thousands of works. But if someone were to read all the modern-day classics because of their exceptional expression, memorize them, and then emulate a blend of their best writing, would that violate the Copyright Act? Of course not.”

Alsup’s ruling departs sharply from the writers’ complaint, which accused Anthropic of “strip-mining” human expression and ingenuity for the sake of corporate profits. The ruling is just one judge’s opinion, but critics fear it could easily set a precedent for other legal decisions across the country. AI companies have been sued dozens of times by creatives on similar grounds.

While Alsup’s decision may signal broader victories for the AI industry, it isn’t exactly what you would call a win for Anthropic. That’s because Alsup also ruled that the specific way in which Anthropic nabbed some of the copyrighted materials for its LLM—by downloading over 7 million pirated books—could be illegal, and would require a separate trial. “We will have a trial on the pirated copies used to create Anthropic’s central library and the resulting damages,” Alsup wrote. “That Anthropic later bought a copy of a book [that] it earlier stole off the internet will not absolve it of liability for theft, but it may affect the extent of statutory damages.”

When reached for comment by Gizmodo, Anthropic provided the following statement: “We are pleased that the Court recognized that using ‘works to train LLMs was transformative — spectacularly so.’ Consistent with copyright’s purpose in enabling creativity and fostering scientific progress, ‘Anthropic’s LLMs trained upon works not to race ahead and replicate or supplant them — but to turn a hard corner and create something different.’”

Alsup has presided over several prominent cases involving large tech companies, including Uber, DoorDash, and Waymo. More recently, Alsup ordered the Trump administration to reinstate thousands of fired probationary workers who were pushed out by Elon Musk’s DOGE initiative.
