Papers
Topics
Authors
Recent
Assistant
AI Research Assistant
Well-researched responses based on relevant abstracts and paper content.
Custom Instructions Pro
Preferences or requirements that you'd like Emergent Mind to consider when generating responses.
Gemini 2.5 Flash
Gemini 2.5 Flash 134 tok/s
Gemini 2.5 Pro 41 tok/s Pro
GPT-5 Medium 28 tok/s Pro
GPT-5 High 42 tok/s Pro
GPT-4o 92 tok/s Pro
Kimi K2 187 tok/s Pro
GPT OSS 120B 431 tok/s Pro
Claude Sonnet 4.5 37 tok/s Pro
2000 character limit reached

Apache License 2.0

Updated 1 July 2025
  • The Apache License 2.0 is a permissive free software license granting broad rights to use, modify, and distribute code, while explicitly addressing patent grants and requiring attribution.
  • Widely adopted in commercial and research software ecosystems, it is favored for its business-friendly terms and compatibility with various open and closed source projects.
  • Unlike copyleft licenses (like GPL), it allows derivative works to be relicensed under different terms, but it presents challenges for project sustainability and navigating AI-driven code reuse.

The Apache License, Version 2.0 (commonly "Apache 2" or "ALv2") is a widely adopted, permissive free software license that grants recipients broad rights to use, modify, and distribute software, subject to specific conditions oriented toward attribution, patent protection, and legal clarity. Emerging as a response to the evolution of open source software distribution and the need for robust patent terms, Apache 2.0 has played a central role in enabling collaborative and commercial software ecosystems, research reproducibility, and open scientific infrastructure.

The Apache 2 License is defined by a combination of permissiveness and explicit legal safeguards that distinguish it within the family of open source licenses. Its critical features include:

  • Broad Grant of Rights: Recipients are given perpetual, worldwide, royalty-free, non-exclusive rights to use, copy, modify, distribute, and sublicense the software. This is comparable to MIT or BSD licenses but governed by more detailed terms.
  • Explicit Patent Grant: Each contributor grants downstream users a “perpetual, worldwide, non-exclusive, no-charge, royalty-free, irrevocable patent license,” addressing a risk noted for earlier permissive licenses where contributors could later assert patent claims.
  • Attribution and Notice Requirements: Redistributions must retain the license text and any NOTICE file contributed by upstream authors.
  • No Copyleft: There is no requirement for derivative works to be licensed under Apache 2.0; downstream users can relicense modified versions, including under proprietary terms, provided the attribution, notice, and other basic conditions are preserved.
  • Modification Tracking: Significant changes to the licensed files must be indicated.
  • Trademark and Warranty Clauses: The license does not grant trademark rights and explicitly disclaims any warranties, providing legal protection to contributors.

These conditions can be represented elegantly using formal notation:

Rrecv={use,modify,distribute,sublicense}Cattribution,Cpreserve notices,Cstate changes,Cno patent aggressionR_{\text{recv}} = \{\text{use}, \text{modify}, \text{distribute}, \text{sublicense}\} \mid C_{\text{attribution}}, C_{\text{preserve notices}}, C_{\text{state changes}}, C_{\text{no patent aggression}}

with revocation triggered if a user initiates patent litigation:

If Upatent litigationC    LU,C=R\text{If } U \to_{\text{patent litigation}} C \implies L_{U,C} = \cancel{R}

(Coleman, 2014)

2. Historical Evolution and Position in the Open Source Landscape

Apache 2.0 emerged as part of a larger proliferation of open source licenses following the OSI's foundation in 1998. Its introduction was motivated by the need to clarify rights—especially patent rights—and to enhance compatibility for both open and closed ecosystems (Sherlock et al., 2018).

Empirical studies show that around 2001, licensing trends in open source underwent a reversal: permissive licenses such as Apache 2 overtook restrictive “copyleft” licenses (e.g., GPL) as the dominant framework in project growth. This shift coincided with the rise of commercially sponsored open-source foundations like the Apache Software Foundation, which leveraged permissive licenses to enable collaborative industry investment and rapid adoption (Hoffmann et al., 2014).

Period Dominant Growth Confidence Key Factor
1995–2001 Restrictive >95% Volunteer-driven, community ethos
2001–2007 Permissive ~75% Corporate/commercial ecosystems

((Hoffmann et al., 2014), Table 4)

3. Comparative Analysis with Other Open Source Licenses

The Apache 2 License is commonly contrasted with other licenses on multiple axes:

  • MIT/BSD/ISC: These share the permissive model but lack explicit patent grants. They are seen as maximizing "freedom from restriction" (Coleman, 2014).
  • GPL Family (v2, v3): GPL imposes strong copyleft, requiring derivative works to remain under the GPL and (from v3) introduces limited patent terms. Apache 2 is more business-friendly and compatible with GPLv3 but not GPLv2 without further permissions.
  • MPL, LGPL: Classified as “weak copyleft,” these require inherited licensing only for parts of the code; Apache 2 does not impose such requirements (Gomez-Diaz, 2014).
License Commercial Use Copyleft Patent Grant License Inheritance
Apache 2 Yes No Yes No
MIT/BSD Yes No No* No
GPLv3 Yes Yes Limited Yes

*Patent rights are implied but not explicit.

(Coleman, 2014, Gomez-Diaz, 2014)

4. Adoption, Ecosystem Prevalence, and Empirical Studies

Studies of license frequency and software repositories document Apache 2.0 as one of the most prevalent licenses, especially in enterprise and widely used package ecosystems. For example, in Maven (Java), Apache 2 is the most common license, covering around 41% of all packages, and in Cargo (Rust), it accounts for about 28% (Pfeiffer, 2022). Large-scale analyses of code repositories confirm its robust, reliable detection and rare mutation compared to other licenses (Zacchiroli, 2022, González-Barahona et al., 2023).

Ecosystem Prevalence of Apache 2.0 Noted Risks
Maven ~41% (most common) Low; mainly GPL-2 only
Cargo ~28% Low
PyPI ~8% Not significant

(Pfeiffer, 2022)

Empirical studies of copy-based code reuse in open source find that Apache 2-licensed files are frequently copied between projects, with attendant challenges regarding the preservation of license and notice information (Jahanshahi et al., 2023).

5. Implications for Research Software, Reproducibility, and Policy

In scientific research, Apache 2.0 has been widely adopted due to its maximally permissive and business-friendly terms, facilitating both broad reuse and compliance with open access mandates (Grange et al., 2019, Grange et al., 2020). Policies such as the ASTRON Open Source Policy and large infrastructure projects (e.g., SocioXplorer) use Apache 2 as their default, citing its role in promoting reproducibility, attribution, and open collaboration.

Best practices in research settings emphasize:

  • Early and explicit license selection to avoid later complications (Grange et al., 2020),
  • Use of DOIs or citation mechanisms attached to released software (Grange et al., 2019),
  • Central management of source code and consistent retention of license and NOTICE files (Grange et al., 2019).

6. Contemporary Challenges: Monetization, Sustainability, and AI

Recent studies identify sustainability challenges associated with permissive licensing. While Apache 2.0 facilitates wide adoption, it offers limited mechanisms for monetization, sometimes leading to volunteer burnout or funding crises (Kula et al., 4 Mar 2025). In high-profile cases, projects originally under Apache 2.0 have shifted to more restrictive “source-available” models to seek better community support or direct funding.

The rise of AI models trained on permissively licensed code introduces new complexities. Concerns about unclear copyright provenance in generated code and the possibility of widespread, untraceable reuse challenge the traditional mechanisms of attribution and license enforcement. Several communities have begun to restrict the use of AI-generated code for these reasons (Kula et al., 4 Mar 2025).

License Type Monetization Path AI-Era Challenge
Permissive Indirect (support, donations) Unclear code provenance
Restrictive Dual-licensing, stronger enforcement Compatibility with AI training
Source-Avail. Direct license fees, delayed FOSS Defining “open” for AI models

(Kula et al., 4 Mar 2025)

7. Summary Table: Rights and Obligations under Apache 2

Aspect Obligation/Right
Source availability Not required for proprietary derivatives
Modification Permitted, commercial or proprietary use allowed
Patent rights Explicit grant, termination upon patent litigation
Trademark Not granted
Attribution/Notice Must preserve license and NOTICE files
Distribution Allowed, under original or new license, preserving requirements

(Sherlock et al., 2018)

Conclusion

The Apache License, Version 2.0 stands as a foundational legal framework for the modern open source ecosystem. Its explicit patent clauses, permissive stance on reuse and derivative works, and business-friendly conditions support both collaborative research and enterprise adoption at scale. Studies consistently identify Apache 2.0 as a license that minimizes legal friction, maximizes compatibility across codebases and ecosystems, and underpins major research and commercial software infrastructure. At the same time, its lack of mechanisms for enforcing reciprocal community support raises critical questions about long-term project sustainability in an era shaped by artificial intelligence, changing funding models, and the evolving open science landscape.

Forward Email Streamline Icon: https://streamlinehq.com

Follow Topic

Get notified by email when new papers are published related to Apache 2 Licence.

Don't miss out on important new AI/ML research

See which papers are being discussed right now on X, Reddit, and more:

“Emergent Mind helps me see which AI papers have caught fire online.”

Philip

Philip

Creator, AI Explained on YouTube