Skip to content
Open
Show file tree
Hide file tree
Changes from all commits
Commits
File filter

Filter by extension

Filter by extension

Conversations
Failed to load comments.
Loading
Jump to
Jump to file
Failed to load files.
Loading
Diff view
Diff view
1 change: 1 addition & 0 deletions .gitignore
Original file line number Diff line number Diff line change
Expand Up @@ -48,6 +48,7 @@
*.s
*.so
*.so.dbg
*.spdx.json
*.su
*.symtypes
*.tab.[ch]
Expand Down
1 change: 1 addition & 0 deletions Documentation/tools/index.rst
Original file line number Diff line number Diff line change
Expand Up @@ -12,6 +12,7 @@ more additions are needed here:

rtla/index
rv/index
sbom/sbom

.. only:: subproject and html

Expand Down
206 changes: 206 additions & 0 deletions Documentation/tools/sbom/sbom.rst
Original file line number Diff line number Diff line change
@@ -0,0 +1,206 @@
.. SPDX-License-Identifier: GPL-2.0-only OR MIT
.. Copyright (C) 2025 TNG Technology Consulting GmbH

KernelSbom
==========

Introduction
------------

KernelSbom is a Python script ``scripts/sbom/sbom.py`` that can be
executed after a successful kernel build. When invoked, KernelSbom
analyzes all files involved in the build and generates Software Bill of
Materials (SBOM) documents in SPDX 3.0.1 format.
The generated SBOM documents capture:

* **Final output artifacts**, typically the kernel image and modules
* **All source files** that contributed to the build with metadata
and licensing information
* **Details of the build process**, including intermediate artifacts
and the build commands linking source files to the final output
artifacts

KernelSbom is originally developed in the
`KernelSbom repository <https://github.com/TNG/KernelSbom>`_.

Requirements
------------

Python 3.10 or later. No libraries or other dependencies are required.

Basic Usage
-----------

Run the ``make sbom`` target.
For example::

$ make defconfig O=kernel_build
$ make sbom O=kernel_build -j$(nproc)

This will trigger a kernel build. After all build outputs have been
generated, KernelSbom produces three SPDX documents in the root
directory of the object tree:

* ``sbom-source.spdx.json``
Describes all source files involved in the build and
associates each file with its corresponding license expression.

* ``sbom-output.spdx.json``
Captures all final build outputs (kernel image and ``.ko`` module files)
and includes build metadata such as environment variables and
a hash of the ``.config`` file used for the build.

* ``sbom-build.spdx.json``
Imports files from the source and output documents and describes every
intermediate build artifact. For each artifact, it records the exact
build command used and establishes the relationship between
input files and generated outputs.

When invoking the sbom target, it is recommended to perform
out-of-tree builds using ``O=<objtree>``. KernelSbom classifies files as
source files when they are located in the source tree and not in the
object tree. For in-tree builds, where the source and object trees are
the same directory, this distinction can no longer be made reliably.
In that case, KernelSbom does not generate a dedicated source SBOM.
Instead, source files are included in the build SBOM.

Standalone Usage
----------------

KernelSbom can also be used as a standalone script to generate
SPDX documents for specific build outputs. For example, after a
successful x86 kernel build, KernelSbom can generate SPDX documents
for the ``bzImage`` kernel image::

$ SRCARCH=x86 python3 scripts/sbom/sbom.py \
--src-tree . \
--obj-tree ./kernel_build \
--roots arch/x86/boot/bzImage \
--generate-spdx \
--generate-used-files \
--prettify-json \
--debug

Note that when KernelSbom is invoked outside of the ``make`` process,
the environment variables used during compilation are not available and
therefore cannot be included in the generated SPDX documents. It is
recommended to set at least the ``SRCARCH`` environment variable to the
architecture for which the build was performed.

For a full list of command-line options, run::

$ python3 scripts/sbom/sbom.py --help

Output Format
-------------

KernelSbom generates documents conforming to the
`SPDX 3.0.1 specification <https://spdx.github.io/spdx-spec/v3.0.1/>`_
serialized as JSON-LD.

To reduce file size, the output documents use the JSON-LD ``@context``
to define custom prefixes for ``spdxId`` values. While this is compliant
with the SPDX specification, only a limited number of tools in the
current SPDX ecosystem support custom JSON-LD contexts. To use such
tools with the generated documents, the custom JSON-LD context must
be expanded before providing the documents.
See https://lists.spdx.org/g/Spdx-tech/message/6064 for more information.

How it Works
------------

KernelSbom operates in two major phases:

1. **Generate the cmd graph**, an acyclic directed dependency graph.
2. **Generate SPDX documents** based on the cmd graph.

KernelSbom begins from the root artifacts specified by the user, e.g.,
``arch/x86/boot/bzImage``. For each root artifact, it collects all
dependencies required to build that artifact. The dependencies come
from multiple sources:

* **.cmd files**: The primary source is the ``.cmd`` file of the
generated artifact, e.g., ``arch/x86/boot/.bzImage.cmd``. These files
contain the exact command used to build the artifact and often include
an explicit list of input dependencies. By parsing the ``.cmd``
file, the full list of dependencies can be obtained.

* **.incbin statements**: The second source are include binary
``.incbin`` statements in ``.S`` assembly files.

* **Hardcoded dependencies**: Unfortunately, not all build dependencies
can be found via ``.cmd`` files and ``.incbin`` statements. Some build
dependencies are directly defined in Makefiles or Kbuild files.
Parsing these files is considered too complex for the scope of this
project. Instead, the remaining gaps of the graph are filled using a
list of manually defined dependencies, see
``scripts/sbom/sbom/cmd_graph/hardcoded_dependencies.py``. This list is
known to be incomplete. However, analysis of the cmd graph indicates a
~99% completeness. For more information about the completeness analysis,
see `KernelSbom #95 <https://github.com/TNG/KernelSbom/issues/95>`_.

Given the list of dependency files, KernelSbom recursively processes
each file, expanding the dependency chain all the way to the version
controlled source files. The result is a complete dependency graph
where nodes represent files, and edges represent "file A was used to
build file B" relationships.

Using the cmd graph, KernelSbom produces three SPDX documents.
For every file in the graph, KernelSbom:

* Parses ``SPDX-License-Identifier`` headers,
* Computes file hashes,
* Estimates the file type based on extension and path,
* Records build relationships between files.

Each root output file is additionally associated with an SPDX Package
element that captures version information, license data, and copyright.

Advanced Usage
--------------

Including Kernel Modules
~~~~~~~~~~~~~~~~~~~~~~~~

The list of all ``.ko`` kernel modules produced during a build can be
extracted from the ``modules.order`` file within the object tree.
For example::

$ echo "arch/x86/boot/bzImage" > sbom-roots.txt
$ sed 's/\.o$/.ko/' ./kernel_build/modules.order >> sbom-roots.txt

Then use the generated roots file::

$ SRCARCH=x86 python3 scripts/sbom/sbom.py \
--src-tree . \
--obj-tree ./kernel_build \
--roots-file sbom-roots.txt \
--generate-spdx

Equal Source and Object Trees
~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~

When the source tree and object tree are identical (for example, when
building in-tree), source files can no longer be reliably distinguished
from generated files.
In this scenario, KernelSbom does not produce a dedicated
``sbom-source.spdx.json`` document. Instead, both source files and build
artifacts are included together in ``sbom-build.spdx.json``, and
``sbom.used-files.txt`` lists all files referenced in the build document.

Unknown Build Commands
~~~~~~~~~~~~~~~~~~~~~~

Because the kernel supports a wide range of configurations and versions,
KernelSbom may encounter build commands in ``.cmd`` files that it does
not yet support. By default, KernelSbom will fail if an unknown build
command is encountered.

If you still wish to generate SPDX documents despite unsupported
commands, you can use the ``--do-not-fail-on-unknown-build-command``
option. KernelSbom will continue and produce the documents, although
the resulting SBOM will be incomplete.

This option should only be used when the missing portion of the
dependency graph is small and an incomplete SBOM is acceptable for
your use case.
6 changes: 6 additions & 0 deletions MAINTAINERS
Original file line number Diff line number Diff line change
Expand Up @@ -23365,6 +23365,12 @@ R: Marc Murphy <marc.murphy@sancloud.com>
S: Supported
F: arch/arm/boot/dts/ti/omap/am335x-sancloud*

SBOM
M: Luis Augenstein <luis.augenstein@tngtech.com>
M: Maximilian Huber <maximilian.huber@tngtech.com>
S: Maintained
F: scripts/sbom/

SC1200 WDT DRIVER
M: Zwane Mwaikambo <zwanem@gmail.com>
S: Maintained
Expand Down
11 changes: 9 additions & 2 deletions Makefile
Original file line number Diff line number Diff line change
Expand Up @@ -772,7 +772,7 @@ endif
# in addition to whatever we do anyway.
# Just "make" or "make all" shall build modules as well

ifneq ($(filter all modules nsdeps compile_commands.json clang-%,$(MAKECMDGOALS)),)
ifneq ($(filter all modules nsdeps compile_commands.json clang-% sbom,$(MAKECMDGOALS)),)
KBUILD_MODULES := y
endif

Expand Down Expand Up @@ -1612,7 +1612,7 @@ CLEAN_FILES += vmlinux.symvers modules-only.symvers \
modules.builtin.ranges vmlinux.o.map vmlinux.unstripped \
compile_commands.json rust/test \
rust-project.json .vmlinux.objs .vmlinux.export.c \
.builtin-dtbs-list .builtin-dtb.S
.builtin-dtbs-list .builtin-dtb.S sbom-*.spdx.json

# Directories & files removed with 'make mrproper'
MRPROPER_FILES += include/config include/generated \
Expand Down Expand Up @@ -1728,6 +1728,7 @@ help:
@echo ''
@echo 'Tools:'
@echo ' nsdeps - Generate missing symbol namespace dependencies'
@echo ' sbom - Generate Software Bill of Materials'
@echo ''
@echo 'Kernel selftest:'
@echo ' kselftest - Build and run kernel selftest'
Expand Down Expand Up @@ -2108,6 +2109,12 @@ nsdeps: export KBUILD_NSDEPS=1
nsdeps: modules
$(Q)$(CONFIG_SHELL) $(srctree)/scripts/nsdeps

# Script to generate .spdx.json SBOM documents describing the build
# ---------------------------------------------------------------------------
PHONY += sbom
sbom: all
$(Q)$(MAKE) $(build)=scripts/sbom

# Clang Tooling
# ---------------------------------------------------------------------------

Expand Down
40 changes: 40 additions & 0 deletions scripts/sbom/Makefile
Original file line number Diff line number Diff line change
@@ -0,0 +1,40 @@
# SPDX-License-Identifier: GPL-2.0-only OR MIT
# Copyright (C) 2025 TNG Technology Consulting GmbH

SBOM_SOURCE_FILE := $(objtree)/sbom-source.spdx.json
SBOM_BUILD_FILE := $(objtree)/sbom-build.spdx.json
SBOM_OUTPUT_FILE := $(objtree)/sbom-output.spdx.json
SBOM_ROOTS_FILE := $(objtree)/sbom-roots.txt


ifeq ($(srctree),$(objtree))
SBOM_TARGETS := $(SBOM_BUILD_FILE) $(SBOM_OUTPUT_FILE)
else
SBOM_TARGETS := $(SBOM_SOURCE_FILE) $(SBOM_BUILD_FILE) $(SBOM_OUTPUT_FILE)
endif

SBOM_DEPS := $(objtree)/$(KBUILD_IMAGE) $(objtree)/include/generated/autoconf.h
ifdef CONFIG_MODULES
SBOM_DEPS += $(objtree)/modules.order
endif

$(SBOM_TARGETS) &: $(SBOM_DEPS)
$(Q)echo " GEN $(notdir $(SBOM_TARGETS))"

$(Q)printf "%s\n" "$(KBUILD_IMAGE)" > $(SBOM_ROOTS_FILE)
$(Q)if [ "$(CONFIG_MODULES)" = "y" ]; then \
sed 's/\.o$$/.ko/' $(objtree)/modules.order >> $(SBOM_ROOTS_FILE); \
fi

$(Q)$(PYTHON3) $(srctree)/scripts/sbom/sbom.py \
--src-tree $(abspath $(srctree)) \
--obj-tree $(abspath $(objtree)) \
--roots-file $(SBOM_ROOTS_FILE) \
--output-directory $(abspath $(objtree)) \
--generate-spdx \
--package-license "GPL-2.0 WITH Linux-syscall-note" \
--package-version "$(KERNELVERSION)"

$(Q)rm $(SBOM_ROOTS_FILE)

$(obj)/: $(SBOM_TARGETS)
Loading