While the court found that the second and third factors weighed in the defendant’s favor, because the headnotes were not as original or creative as fictional works and the defendant’s AI product did not directly reproduce the copied headnotes, the court found that the two most important factors weighed in the plaintiff’s favor. Specifically, the court found that the purpose of the challenged use was commercial, and that the defendant intended to compete directly with the incumbent plaintiff from the legal research market and enter the market of providing AI-powered legal research tools.
The court also noted that it did not matter if the plaintiff also intended to train its own AI tools on its headnotes, as the effect on the potential market was enough for the fourth factor to weigh in the plaintiff’s favor.
Importantly, the court took care to distinguish the AI tools in question from generative AI tools. The court found it important that the AI tools being challenged were trained on copyrighted works and only returned relevant judicial opinions based on its training to a user’s queries—not generate new output in response to prompts. The court explicitly left open the question of whether the fair use defense could succeed where generative AI tools are at issue and whether generative AI would change the analysis of a direct competitor creating content from an incumbent’s content.
Takeaways
This ruling reinforces copyright protections for curated legal research materials and signals stricter scrutiny for AI-driven data scraping in competitive industries. Additionally, it highlights a growing legal challenge for AI developers relying on proprietary datasets for training models.