
The Llama Open Source and Copyright Debate (2026)

Updated June 5, 2026

Two questions follow Meta Llama everywhere it goes. Is it really open source? And did Meta break the law by training it on copyrighted books? Both questions have generated heated public argument, a standards-body rebuke, and a federal lawsuit that is still partly unresolved. This breakdown separates what has been decided from what is still in dispute, attributes every criticism to the party that made it, and pairs each criticism with Meta's stated position.

The single most important distinction in this article is also the easiest one to get wrong. On June 25, 2025, a federal judge ruled that Meta's use of the plaintiff authors' books to train Llama qualified as fair use. That is a court decision, not an allegation. But a separate allegation in the same case, that Meta downloaded and shared pirated copies of those books to obtain the training data, was not resolved by that ruling and continues to be litigated. Winning the training question does not mean Meta was cleared of everything.

Figure	What it refers to	Source
700M	Monthly active user threshold in the Llama Community License; above it, a separate commercial license must be requested from Meta	Llama Community License
~50 words	The court found Meta's guardrails kept Llama from reproducing more than roughly this many words of any book	Kadrey v. Meta, Jun 25 2025 ruling
Jul 7 2023	Date the authors filed Kadrey v. Meta in the Northern District of California	Case 3:23-cv-03417
Jun 25 2025	Date the court granted summary judgment for Meta on fair use of the books for training	Kadrey v. Meta court record

What the Llama Community License Actually Says

Before weighing the open source debate, it helps to read the license on its own terms. The Llama Community License grants a royalty-free, limited license to use, reproduce, and modify the Llama materials. It is permissive in many respects, and most developers and businesses can use Llama freely. The conditions below are the ones that shape the open source argument, because they are restrictions that a standard open source license would not impose.

License term	What it means in practice
Royalty-free grant	You may use, reproduce, and modify the Llama materials at no cost, subject to the conditions below.
700 million MAU threshold	If your products had more than 700 million monthly active users in the calendar month before the relevant Llama version was released, you must request a commercial license from Meta, granted at Meta's sole discretion.
Attribution	You must include a copy of the license and prominently display "Built with Llama" on related materials.
Derivative naming	Any other AI model you create using the Llama materials must include Llama at the start of its name.
No improving other models	You may not use Llama or its outputs to train or improve any large language model other than Llama or a Llama derivative. Distilling Llama outputs into a competitor's model is treated as a breach.
Acceptable Use Policy	Prohibits illegal use, child sexual abuse material, chemical, biological, radiological, nuclear, or high-yield explosive weapons, defamation, and unauthorized medical or legal advice, among other uses.
EU multimodal carve-out	For Llama 4, the multimodal rights are not granted to individuals or companies domiciled in the European Union.

License terms are version-specific. The 700 million monthly active user threshold and the Llama 4 European Union multimodal carve-out apply to particular Llama versions. Always read the license attached to the exact version you intend to deploy.
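The conditions in the table can be read as a short checklist. The Python sketch below is one hypothetical way to encode a few of them for an internal review. The `Deployment` record, its field names, and the `license_flags` function are illustrative assumptions, not anything Meta publishes, and the values should be confirmed against the license attached to the exact Llama version in use.

```python
from dataclasses import dataclass
from typing import List, Optional

# Threshold as described in the table above; confirm it for your Llama version.
MAU_THRESHOLD = 700_000_000


@dataclass
class Deployment:
    """Hypothetical record of the facts a few license conditions turn on."""
    # Monthly active users in the calendar month before the Llama version's release.
    mau_month_before_release: int
    # Name of any AI model created using the Llama materials, if there is one.
    derivative_model_name: Optional[str] = None
    displays_built_with_llama: bool = False
    ships_license_copy: bool = False


def license_flags(d: Deployment) -> List[str]:
    """Return a list of conditions that appear to need attention."""
    flags = []
    # "More than 700 million" is read here as strictly greater than.
    if d.mau_month_before_release > MAU_THRESHOLD:
        flags.append("request a commercial license from Meta")
    if d.derivative_model_name is not None and not d.derivative_model_name.startswith("Llama"):
        flags.append("derivative model name must start with 'Llama'")
    if not d.displays_built_with_llama:
        flags.append("display 'Built with Llama' on related materials")
    if not d.ships_license_copy:
        flags.append("include a copy of the license")
    return flags
```

A deployment with exactly 700 million monthly active users, a derivative named `Llama-Acme-7B` (a made-up name), the attribution notice, and a license copy would return an empty list under this sketch. The sketch covers only the mechanical conditions; the Acceptable Use Policy and the restriction on improving other models need human review.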

Is Llama Open Source? The Contested Label

Meta consistently describes Llama as open source, and that framing is central to how the company positions the project. The dispute is not about whether the weights are downloadable, which they are. The dispute is about whether a model released under the conditions above can accurately be called open source. Several standards bodies and academics say it cannot, and they have said so on the record.

The criticism, attributed

The Open Source Initiative. In July 2023, the executive director of the Open Source Initiative, Stefano Maffulli, argued that calling Llama open source is polluting the term. The Open Source Initiative's position is that the Llama Community License violates the Open Source Definition because it discriminates against persons or groups, through the 700 million monthly active user restriction, and against fields of endeavor, through the Acceptable Use Policy. In October 2024, the Open Source Initiative published its Open Source AI Definition, which requires a level of transparency about training data that the Open Source Initiative says Meta does not provide.

Mark Dingemanse, Radboud University. In July 2023, the linguist Mark Dingemanse called the open source labeling positively misleading, pointing to the absence of released source and the undocumented training data behind the model.

Nature, November 2024. In an analysis published in Nature, authors Widder, Whittaker, and Myers West described models marketed this way as examples of openwashing, arguing that such systems are better understood as closed than as open.

The Free Software Foundation. In January 2025, the Free Software Foundation classified the Llama 3.1 license as a nonfree software license.

Because of these objections, many reviewers and outlets now prefer the terms open weight or source-available to describe Llama. These terms acknowledge that the weights can be downloaded and run while signaling that the broader freedoms of open source, and full training-data transparency, are not present. It is worth being precise here: open weight is the critics' preferred framing, not an undisputed fact. Meta markets Llama as open source and disagrees with that reframing.

Meta's position

Meta's defense of the open source label is public and consistent. In a July 2024 open letter, Mark Zuckerberg argued that open source AI is the path that best prevents a concentration of power in the hands of a few companies, framing Meta's release strategy as a benefit to the broader ecosystem. When the Open Source Initiative's definition drew renewed attention, a Meta spokesperson told The Verge in October 2024 that Meta disagrees with the Open Source Initiative's definition of open source AI. Meta's view is that its release approach delivers the practical benefits people associate with open source, even if it does not satisfy every condition the Open Source Initiative sets out.

Both sides agree on the underlying mechanics: the weights are available, and the training data is not fully disclosed. The disagreement is over what to call that arrangement. Reasonable readers can weigh the standards bodies' definitions against Meta's stated rationale and decide which framing they find more persuasive.

Kadrey v. Meta: The Copyright Case

On July 7, 2023, authors Richard Kadrey, Sarah Silverman, and Christopher Golden filed suit against Meta in the United States District Court for the Northern District of California. The case is captioned Kadrey v. Meta Platforms, case number 3:23-cv-03417. The plaintiffs alleged that Meta trained Llama on pirated books drawn from two sources: Books3, a component of the dataset known as The Pile that was sourced from a site called Bibliotik, and the shadow library LibGen.

It is important to separate the distinct claims in the case, because they have reached very different stages.

What was dismissed

In November 2023, Judge Vince Chhabria dismissed the plaintiffs' claims that Llama's outputs themselves infringe their copyrights and that Llama as a model is itself an infringing work. The plaintiffs were given leave to amend. This early ruling narrowed the case but did not end it.

The June 25, 2025 fair-use ruling

On June 25, 2025, Judge Chhabria granted summary judgment for Meta on the question of fair use of the books for training. The court ruled that the use was highly transformative, and it found that Meta's guardrails kept Llama from reproducing more than roughly 50 words of any of the books, so the plaintiffs had not proven that Llama acts as a market substitute for the works. Meta's position throughout was that training is transformative and does not reproduce, or provide meaningful access to, the books.

The judge was unusually explicit that the ruling was narrow. Chhabria stressed that the decision turned on the specific record before him and that future plaintiffs who develop stronger evidence of market harm could prevail on similar facts. In other words, the ruling is a win for Meta on these plaintiffs' evidence, not a blanket declaration that training on copyrighted books is always fair use.

What is still being litigated

The fair-use ruling did not resolve a separate allegation: that Meta acquired the training data by torrenting pirated copies of books from LibGen, both downloading them and seeding them to other users. That claim, focused on the method of obtaining the data rather than the act of training, continues. A Fourth Amended Complaint was filed in April 2026. As of this writing, that part of the dispute is unresolved and remains an allegation, not a finding. Litigation status last checked June 2026; consult the court docket for any later ruling or settlement.

⚖️ Settled by the court

Using the authors' books to train Llama was ruled fair use on June 25, 2025. The court called it highly transformative and found no proven market substitution, while stressing the ruling was narrow.

Court decision

🔄 Still being litigated

The separate allegation that Meta downloaded and seeded pirated books from LibGen to obtain the data continues. A Fourth Amended Complaint was filed in April 2026. This remains an unresolved allegation.

Open allegation

💬 Contested label

Whether Llama is open source is a matter of definition. Meta says yes; the Open Source Initiative and the Free Software Foundation say no, and critics prefer open weight or source-available. No court has ruled on the label.

Public dispute

A common misreading of the June 2025 ruling is that Meta was cleared of all wrongdoing in the case. That is not accurate. The fair-use ruling addressed the use of the books for training. The allegation about how Meta obtained the books through LibGen was not decided and is still active.

Timeline: How Both Debates Unfolded

July 2023

OSI criticism begins and the lawsuit is filed

The Open Source Initiative's Stefano Maffulli argues that calling Llama open source pollutes the term. In the same month, on July 7, authors Kadrey, Silverman, and Golden file Kadrey v. Meta in the Northern District of California.

November 2023

Partial dismissal

Judge Vince Chhabria dismisses the claims that Llama's outputs infringe and that Llama itself is an infringing work, with leave to amend.

July 2024

Zuckerberg's open source letter

Mark Zuckerberg publishes an open letter defending open source AI as a way to prevent a concentration of power, framing Meta's release strategy as a benefit to the wider ecosystem.

October 2024

OSI publishes the Open Source AI Definition

The Open Source Initiative releases its Open Source AI Definition, which requires training-data transparency the Open Source Initiative says Meta does not provide. A Meta spokesperson tells The Verge that Meta disagrees with the definition.

January 2025

FSF classifies the license as nonfree

The Free Software Foundation classifies the Llama 3.1 license as a nonfree software license.

June 25, 2025

Fair-use ruling on training

Judge Chhabria grants summary judgment for Meta on fair use of the books for training, calling it highly transformative and finding no proven market substitution, while stressing the ruling is narrow.

April 2026

Fourth Amended Complaint filed

The plaintiffs file a Fourth Amended Complaint. The separate allegation that Meta torrented and seeded pirated copies from LibGen to obtain the data continues to be litigated.

Why These Debates Matter for Anyone Using Llama

For teams evaluating Llama, the two debates have practical consequences that go beyond terminology and headlines.

The label affects compliance and procurement

If your organization has a policy that requires genuinely open source components, or that maps to the Open Source Initiative's definition, then the open weight distinction is not academic. Under the Open Source Initiative's and Free Software Foundation's reasoning, Llama would not qualify as open source for that policy, and the 700 million monthly active user clause, the derivative naming requirement, and the restriction on improving other models are real contractual obligations. Reading the license attached to your specific Llama version is the safe path.
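For a procurement workflow, the definitional dispute can be reduced to a lookup keyed by whichever standard the policy names. The sketch below is a hypothetical illustration only: the `POSITIONS` table restates the positions attributed in this article, and the function name and keys are assumptions rather than an established tool or an official classification.

```python
# Positions as attributed in this article, keyed by the party whose
# standard a policy might reference. True means "treats Llama as open source".
POSITIONS = {
    "osi": False,   # OSI: the license violates the Open Source Definition
    "fsf": False,   # FSF: classified the Llama 3.1 license as nonfree
    "meta": True,   # Meta: markets Llama as open source
}


def llama_passes_policy(required_standard: str) -> bool:
    """Return whether Llama would satisfy a policy tied to the named standard."""
    key = required_standard.strip().lower()
    if key not in POSITIONS:
        # Unknown standards are rejected rather than guessed at.
        raise ValueError(f"no recorded position for standard: {required_standard!r}")
    return POSITIONS[key]
```

Under this sketch, a policy that maps to the Open Source Initiative's definition would reject Llama, while a policy that accepts the vendor's own label would pass it. Raising an error on an unrecognized standard keeps the gate from silently approving a component nobody has classified.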

The copyright case is still developing

The fair-use ruling is a meaningful data point for anyone weighing the legal posture of models trained on large text corpora, but Judge Chhabria's own caution means it is not a settled, portable precedent. The surviving allegation about how the training data was obtained is also unresolved. Organizations with low risk tolerance should treat the legal landscape around training-data provenance as still in motion rather than closed.

⚖️ The ruling was narrow by the judge's own words

Judge Chhabria stressed that the June 25, 2025 fair-use decision rested on these plaintiffs' specific record and that better evidence of market harm could let future plaintiffs win. Do not treat it as a blanket ruling that training on copyrighted books is always fair use.

🔄 The torrenting claim is unresolved

The allegation that Meta downloaded and seeded pirated copies from LibGen to obtain the data continues, with a Fourth Amended Complaint filed in April 2026. It is an allegation, not a finding. Meta argues training is transformative and does not reproduce or provide access to the books.

🏷️ Open source is a contested label, not a verdict

Meta calls Llama open source; the Open Source Initiative and the Free Software Foundation say it is not, and critics prefer open weight or source-available. No court has ruled on the label. Present it as a definitional dispute with both positions, not as a resolved fact.

Frequently Asked Questions

Is Llama open source?

It depends on whose definition you use. Meta markets Llama as open source, and Mark Zuckerberg's July 2024 letter defends that framing as a way to prevent a concentration of power. The Open Source Initiative disagrees: in July 2023, its executive director Stefano Maffulli said calling Llama open source is polluting the term, and the Open Source Initiative argues the license violates the Open Source Definition by discriminating against persons or groups and against fields of endeavor. The Free Software Foundation classified the Llama 3.1 license as nonfree in January 2025. Many critics prefer open weight or source-available. A Meta spokesperson told The Verge in October 2024 that Meta disagrees with the Open Source Initiative's definition.

Did Meta win the Llama copyright case?

Partly, and the case is not over. On June 25, 2025, Judge Vince Chhabria granted summary judgment for Meta on fair use of the authors' books for training Llama, calling it highly transformative and noting that Meta's guardrails kept Llama from reproducing more than roughly 50 words of any book, so the plaintiffs had not proven market substitution. Chhabria stressed the ruling was narrow and that future plaintiffs with stronger evidence of market harm could win. A separate allegation, that Meta downloaded and seeded pirated copies from LibGen to obtain the data, was not resolved by that ruling and continues to be litigated, with a Fourth Amended Complaint filed in April 2026.

What is the 700 million MAU threshold in the Llama license?

The Llama Community License grants a royalty-free, limited license to use, reproduce, and modify the Llama materials. However, if a licensee's products had more than 700 million monthly active users in the calendar month before the relevant version was released, the licensee must request a separate commercial license from Meta, which Meta may grant at its sole discretion. For most organizations this threshold is never reached, but it is one of the conditions critics cite when arguing the license is not open source.

What does the Llama license require you to do?

Distributors must include a copy of the license and prominently display the phrase "Built with Llama", and any derivative model's name must begin with Llama. The license also prohibits using Llama or its outputs to train or improve any non-Llama large language model. The Acceptable Use Policy bars illegal use, child sexual abuse material, chemical, biological, radiological, nuclear, or high-yield explosive weapons, defamation, and unauthorized medical or legal advice, among other uses. For Llama 4, multimodal rights are not granted to individuals or companies domiciled in the European Union.

Why do critics call Llama open weight instead of open source?

Critics argue that releasing model weights under a restrictive license is not the same as open source. The Open Source Initiative's Open Source AI Definition, published in October 2024, requires training-data transparency that Meta does not provide. Mark Dingemanse of Radboud University called the open source labeling positively misleading in July 2023, citing the absence of source and undocumented training data. A November 2024 analysis in Nature by Widder, Whittaker, and Myers West described such systems as openwashing. Because the weights are downloadable but the training data and full license freedoms are not, many reviewers reframed Llama as open weight or source-available. Meta disputes this reframing.

