PEP: 459
Title: Standard Metadata Extensions for Python Software Packages
Version:
This PEP describes several standard extensions to the Python metadata.
Like all metadata extensions, each standard extension format is independently versioned. Changing any of the formats requires an update to this PEP, but does not require an update to the core packaging metadata.
Note
These extensions may eventually be separated out into their own PEPs, but we're already suffering from PEP overload in the packaging metadata space.
This PEP was initially created by slicing out large sections of earlier drafts of PEP 426 and making them extensions, so some of the specifics may still be rough in the new context.
The python
project on the Python Package Index refers to the CPython
reference interpreter. This namespace is used as the namespace for the
standard metadata extensions.
The currently defined standard extensions are:
python.details
python.project
python.integrator
python.exports
python.commands
python.constraints
All standard extensions are currently at version 1.0
, and thus the
extension_metadata
field may be omitted without losing access to any
functionality.
The python.details
extension allows for more information to be provided
regarding the software distribution.
The python.details
extension contains four custom subfields:
license
: the copyright license for the distributionkeywords
: package index keywords for the distributionclassifiers
: package index Trove classifiers for the distributiondocument_names
: the names of additional metadata files
All of these fields are optional. Automated tools MUST operate correctly if a distribution does not provide them, including failing cleanly when an operation depending on one of these fields is requested.
A short string summarising the license used for this distribution.
Note that distributions that provide this field should still specify any applicable license Trove classifiers in the Classifiers field. Even when an appropriate Trove classifier is available, the license summary can be a good way to specify a particular version of that license, or to indicate any variations or exception to the license.
This field SHOULD contain fewer than 512 characters and MUST contain fewer than 2048.
This field SHOULD NOT contain any line breaks.
The full license text SHOULD be included as a separate file in the source archive for the distribution. See Document names for details.
Example:
"license": "GPL version 3, excluding DRM provisions"
A list of additional keywords to be used to assist searching for the distribution in a larger catalog.
Example:
"keywords": ["comfy", "chair", "cushions", "too silly", "monty python"]
A list of strings, with each giving a single classification value for the distribution. Classifiers are described in PEP 301 [2].
Example:
"classifiers": [ "Development Status :: 4 - Beta", "Environment :: Console (Text Based)", "License :: OSI Approved :: GNU General Public License v3 (GPLv3)" ]
Filenames for supporting documents included in the distribution's
dist-info
metadata directory.
The following supporting documents can be named:
description
: a file containing a long description of the distributionlicense
: a file with the full text of the distribution's licensechangelog
: a file describing changes made to the distribution
Supporting documents MUST be included directly in the dist-info
directory. Directory separators are NOT permitted in document names.
The markup format (if any) for the file is indicated by the file extension. This allows index servers and other automated tools to render included text documents correctly and provide feedback on rendering errors, rather than having to guess the intended format.
If the filename has no extension, or the extension is not recognised, the default rendering format MUST be plain text.
The following markup renderers SHOULD be used for the specified file extensions:
- Plain text:
.txt
, no extension, unknown extension - reStructured Text:
.rst
- Markdown:
.md
- AsciiDoc:
.adoc
,.asc
,.asciidoc
- HTML:
.html
,.htm
Automated tools MAY render one or more of the specified formats as plain text and MAY render other markup formats beyond those listed.
Automated tools SHOULD NOT make any assumptions regarding the maximum length of supporting document content, except as necessary to protect the integrity of a service.
Example:
"document_names": { "description": "README.rst", "license": "LICENSE.rst", "changelog": "NEWS" }
The python.project
extension allows for more information to be provided
regarding the creation and maintenance of the distribution.
The python.project
extension contains three custom subfields:
contacts
: key contact points for the distributioncontributors
: other contributors to the distributionproject_urls
: relevant URLs for the distribution
Details on individuals and organisations are recorded as mappings with the following subfields:
name
: the name of an individual or groupemail
: an email address (this may be a mailing list)url
: a URL (such as a profile page on a source code hosting service)role
: one of"author"
,"maintainer"
or"contributor"
The name
subfield is required, the other subfields are optional.
If no specific role is stated, the default is contributor
.
Email addresses must be in the form local-part@domain
where the
local-part may be up to 64 characters long and the entire email address
contains no more than 254 characters. The formal specification of the
format is in RFC 5322 (sections 3.2.3 and 3.4.1) and RFC 5321, with a more
readable form given in the informational RFC 3696 and the associated errata.
The defined contributor roles are as follows:
author
: the original creator of a distributionmaintainer
: the current lead contributor for a distribution, when they are not the original creatorcontributor
: any other individuals or organizations involved in the creation of the distribution
Contact and contributor metadata is optional. Automated tools MUST operate correctly if a distribution does not provide it, including failing cleanly when an operation depending on one of these fields is requested.
A list of contributor entries giving the recommended contact points for getting more information about the project.
The example below would be suitable for a project that was in the process of handing over from the original author to a new lead maintainer, while operating as part of a larger development group.
Example:
"contacts": [ { "name": "Python Packaging Authority/Distutils-SIG", "email": "[email protected]", "url": "https://bitbucket.org/pypa/" }, { "name": "Samantha C.", "role": "maintainer", "email": "[email protected]" }, { "name": "Charlotte C.", "role": "author", "email": "[email protected]" } ]
A list of contributor entries for other contributors not already listed as current project points of contact. The subfields within the list elements are the same as those for the main contact field.
Example:
"contributors": [ {"name": "John C."}, {"name": "Erik I."}, {"name": "Terry G."}, {"name": "Mike P."}, {"name": "Graeme C."}, {"name": "Terry J."} ]
A mapping of arbitrary text labels to additional URLs relevant to the project.
While projects are free to choose their own labels and specific URLs, it is RECOMMENDED that home page, source control, issue tracker and documentation links be provided using the labels in the example below.
URL labels MUST be treated as case insensitive by automated tools, but they are not required to be valid Python identifiers. Any legal JSON string is permitted as a URL label.
Example:
"project_urls": { "Documentation": "https://distlib.readthedocs.org", "Home": "https://bitbucket.org/pypa/distlib", "Repository": "https://bitbucket.org/pypa/distlib/src", "Tracker": "https://bitbucket.org/pypa/distlib/issues" }
Structurally, this extension is largely identical to the python.project
extension (the extension name is the only difference).
However, where the project
metadata refers to the upstream creators
of the software, the integrator
metadata refers to the downstream
redistributor of a modified version.
If the software is being redistributed unmodified, then typically this extension will not be used. However, if the software has been patched (for example, backporting compatible fixes from a later version, or addressing a platform compatibility issue), then this extension SHOULD be used, and a local version label added to the distribution's version identifier.
If there are multiple redistributors in the chain, each one just overwrites this extension with their particular metadata.
Most Python distributions expose packages and modules for import through the Python module namespace. Distributions may also expose other interfaces when installed.
The python.exports
extension contains three custom subfields:
modules
: modules exported by the distributionnamespaces
: namespace packages that the distribution contributes toexports
: other Python interfaces exported by the distribution
An export specifier is a string consisting of a fully qualified name, as well as an optional extra name enclosed in square brackets. This gives the following four possible forms for an export specifier:
module module:name module[requires_extra] module:name[requires_extra]
Note
The jsonschema file currently restricts qualified names using the Python 2 ASCII identifier rules. This may need to be reconsidered given the more relaxed identifier rules in Python 3.
The meaning of the subfields is as follows:
module
: the module providing the exportname
: if applicable, the qualified name of the export within the modulerequires_extra
: indicates the export will only work correctly if the additional dependencies named in the given extra are available in the installed environment
Note
I tried this as a mapping with subfields, and it made the examples below unreadable. While this PEP is mostly for tool use, readability still matters to some degree for debugging purposes, and because I expect snippets of the format to be reused elsewhere.
A list of qualified names of modules and packages that the distribution provides for import.
Note
The jsonschema file currently restricts qualified names using the Python 2 ASCII identifier rules. This may need to be reconsidered given the more relaxed identifier rules in Python 3.
For names that contain dots, the portion of the name before the final dot MUST appear either in the installed module list or in the namespace package list.
To help avoid name conflicts, it is RECOMMENDED that distributions provide a single top level module or package that matches the distribution name (or a lower case equivalent). This requires that the distribution name also meet the requirements of a Python identifier, which are stricter than those for distribution names). This practice will also make it easier to find authoritative sources for modules.
Index servers SHOULD allow multiple distributions to publish the same modules, but MAY notify distribution authors of potential conflicts.
Installation tools SHOULD report an error when asked to install a distribution that provides a module that is also provided by a different, previously installed, distribution.
Note that attempting to import some declared modules may result in an exception if the appropriate extras are not installed.
Example:
"modules": ["chair", "chair.cushions", "python_sketches.nobody_expects"]
Note
Making this a list of export specifiers instead would allow a distribution to declare when a particular module requires a particular extra in order to run correctly. On the other hand, there's an argument to be made that that is the point where it starts to become worthwhile to split out a separate distribution rather than using extras.
A list of qualified names of namespace packages that the distribution contributes modules to.
Note
The jsonschema file currently restricts qualified names using the Python 2 ASCII identifier rules. This may need to be reconsidered given the more relaxed identifier rules in Python 3.
On versions of Python prior to Python 3.3 (which provides native namespace
package support), installation tools SHOULD emit a suitable __init__.py
file to properly initialise the namespace rather than using a distribution
provided file.
Installation tools SHOULD emit a warning and MAY emit an error if a distribution declares a namespace package that conflicts with the name of an already installed module or vice-versa.
Example:
"namespaces": ["python_sketches"]
The exports
field is a mapping containing prefixed names as keys. Each
key identifies an export group containing one or more exports published by
the distribution.
Export group names are defined by distributions that will then make use of the published export information in some way. The primary use case is for distributions that support a plugin model: defining an export group allows other distributions to indicate which plugins they provide, how they can be imported and accessed, and which additional dependencies (if any) are needed for the plugin to work correctly.
To reduce the chance of name conflicts, export group names SHOULD use a prefix that corresponds to a module name in the distribution that defines the meaning of the export group. This practice will also make it easier to find authoritative documentation for export groups.
Each individual export group is then a mapping of arbitrary non-empty string keys to export specifiers. The meaning of export names within an export group is up to the distribution that defines the export group. Creating an appropriate definition for the export name format can allow the importing distribution to determine whether or not an export is relevant without needing to import every exporting module.
Example:
"exports": { "nose.plugins.0.10": { "chairtest": "chair:NosePlugin" } }
The python.commands
extension contains three custom subfields:
wrap_console
: console wrapper scripts to be generated by the installerwrap_gui
: GUI wrapper scripts to be generated by the installerprebuilt
: scripts created by the distribution's build process and installed directly to the configured scripts directory
wrap_console
and wrap_gui
are both mappings of script names to
export specifiers. The script names must follow the same naming rules as
distribution names.
The export specifiers for wrapper scripts must refer to either a package
with a __main__ submodule (if no name
subfield is given in the export
specifier) or else to a callable inside the named module.
Installation tools should generate appropriate wrappers as part of the installation process.
Note
Still needs more detail on what "appropriate wrappers" means. For now, refer to what setuptools and zc.buildout generate as wrapper scripts.
prebuilt
is a list of script paths, relative to the scripts directory in
a wheel file or following installation. They are provided for informational
purpose only - installing them is handled through the normal processes for
files created when building a distribution.
Build tools SHOULD mark this extension as requiring handling by installers.
Index servers SHOULD allow multiple distributions to publish the same commands, but MAY notify distribution authors of potential conflicts.
Installation tools SHOULD report an error when asked to install a distribution that provides a command that is also provided by a different, previously installed, distribution.
Example:
"python.commands": { "installer_must_handle": true, "wrap_console": [{"chair": "chair:run_cli"}], "wrap_gui": [{"chair-gui": "chair:run_gui"}], "prebuilt": ["reduniforms"] }
The python.constraints
extension contains two custom subfields:
environments
: supported installation environmentsextension_metadata
: required exact matches in extension metadata fields published by other installed components
Build tools SHOULD mark this extension as requiring handling by installers.
Index servers SHOULD allow distributions to be uploaded with constraints that cannot be satisfied using that index, but MAY notify distribution authors of any such potential compatibility issues.
Installation tools SHOULD report an error if constraints are specified by the distribution and the target installation environment fails to satisfy them, MUST at least emit a warning, and MAY allow the user to force the installation to proceed regardless.
Example:
"python.constraints": { "installer_must_handle": true, "environments": ["python_version >= 2.6"], "extension_metadata": { "fortranlib": { "fortranlib.compatibility": { "fortran_abi": "openblas-g77" } } } }
The environments
subfield is a list of strings specifying the
environments that the distribution explicitly supports. An environment is
considered supported if it matches at least one of the environment markers
given.
If this field is not given in the metadata, it is assumed that the distribution supports any platform supported by Python.
Individual entries are environment markers, as described in PEP 426.
The two main uses of this field are to declare which versions of Python and which underlying operating systems are supported.
Examples indicating supported Python versions:
# Supports Python 2.6+ "environments": ["python_version >= '2.6'"] # Supports Python 2.6+ (for 2.x) or 3.3+ (for 3.x) "environments": ["python_version >= '3.3'", "'3.0' > python_version >= '2.6'"]
Examples indicating supported operating systems:
# Windows only "environments": ["sys_platform == 'win32'"] # Anything except Windows "environments": ["sys_platform != 'win32'"] # Linux or BSD only "environments": ["'linux' in sys_platform", "'bsd' in sys_platform"]
Example where the supported Python version varies by platform:
# The standard library's os module has long supported atomic renaming # on POSIX systems, but only gained atomic renaming on Windows in Python # 3.3. A distribution that needs atomic renaming support for reliable # operation might declare the following supported environments. "environment": ["python_version >= '2.6' and sys_platform != 'win32'", "python_version >= '3.3' and sys_platform == 'win32'"]
The extension_metadata
subfield is a mapping from distribution names
to extension metadata snippets that are expected to exactly match the
metadata of the named distribution in the target installation environment.
Each submapping then consists of a mapping from metadata extension names to the exact expected values of a subset of fields.
For example, a distribution called fortranlib
may publish a different
FORTRAN ABI depending on how it is built, and any related projects that are
installed into the same runtime environment should use matching build
options. This can be handled by having the base distribution publish a
custom extension that indicates the build option that was used to create
the binary extensions:
"extensions": { "fortranlib.compatibility": { "fortran_abi": "openblas-g77" } }
Other distributions that contain binary extensions that need to be compatible with the base distribution would then define a suitable constraint in their own metadata:
"python.constraints": { "installer_must_handle": true, "extension_metadata": { "fortranlib": { "fortranlib.compatibility": { "fortran_abi": "openblas-g77" } } } }
This constraint specifies that:
fortranlib
must be installed (this should also be expressed as a normal dependency so that installers ensure it is satisfied)- The installed version of
fortranlib
must include the customfortranlib.compatibility
extension in its published metadata - The
fortan_abi
subfield of that extension must have the exact valueopenblas-g77
.
If all of these conditions are met (the distribution is installed, the specified extension is included in the metadata, the specified subfields have the exact specified value), then the constraint is considered to be satisfied.
Note
The primary intended use case here is allowing C extensions with additional ABI compatibility requirements to declare those in a way that any installation tool can enforce without needing to understand the details. In particular, many NumPy based scientific libraries need to be built using a consistent set of FORTRAN libraries, hence the "fortranlib" example.
This is the reason there's no support for pattern matching or boolean logic: even the "simple" version of this extension is relatively complex, and there's currently no compelling rationale for making it more complicated than it already is.
This document has been placed in the public domain.