Add paper on the flattened tree and dtc presented at linux.conf.au 2006 by
way of some more documentation.
This commit is contained in:
parent
f192a7ae0b
commit
f5aa792d81
2 changed files with 640 additions and 0 deletions
43
Documentation/dtc-paper.bib
Normal file
43
Documentation/dtc-paper.bib
Normal file
|
@ -0,0 +1,43 @@
|
|||
@STRING{pub-IEEE = "IEEE Computer Society"}
|
||||
@STRING{pub-IEEE:adr = "345 E. 47th St, New York, NY 10017, USA"}
|
||||
|
||||
@BOOK{IEEE1275,
|
||||
key = "IEEE1275",
|
||||
title = "{IEEE} {S}tandard for {B}oot ({I}nitialization {C}onfiguration) {F}irmware: {C}ore {R}equirements and {P}ractices",
|
||||
publisher = pub-IEEE,
|
||||
address = pub-IEEE:adr,
|
||||
series = "IEEE Std 1275-1994",
|
||||
year = 1994,
|
||||
}
|
||||
|
||||
@BOOK{IEEE1275-pci,
|
||||
key = "IEEE1275-pci",
|
||||
title = "{PCI} {B}us {B}inding to: {IEEE} {S}td 1275-1994 {S}tandard for {B}oot ({I}nitialization {C}onfiguration) {F}irmware",
|
||||
publisher = pub-IEEE,
|
||||
address = pub-IEEE:adr,
|
||||
note = "Revision 2.1",
|
||||
year = 1998,
|
||||
}
|
||||
|
||||
@MISC{noof1,
|
||||
author = "Benjamin Herrenschmidt",
|
||||
title = "Booting the {L}inux/ppc kernel without {O}pen {F}irmware",
|
||||
month = may,
|
||||
year = 2005,
|
||||
note = "v0.1, \url{http://ozlabs.org/pipermail/linuxppc64-dev/2005-May/004073.html}",
|
||||
}
|
||||
|
||||
@MISC{noof5,
|
||||
author = "Benjamin Herrenschmidt",
|
||||
title = "Booting the {L}inux/ppc kernel without {O}pen {F}irmware",
|
||||
month = nov,
|
||||
year = 2005,
|
||||
note = "v0.5, \url{http://ozlabs.org/pipermail/linuxppc64-dev/2005-December/006994.html}",
|
||||
}
|
||||
|
||||
@MISC{dtcgit,
|
||||
author = "David Gibson et al.",
|
||||
title = "\dtc{}",
|
||||
howpublished = "git tree",
|
||||
note = "\url{http://ozlabs.org/~dgibson/dtc/dtc.git}",
|
||||
}
|
597
Documentation/dtc-paper.tex
Normal file
597
Documentation/dtc-paper.tex
Normal file
|
@ -0,0 +1,597 @@
|
|||
\documentclass[a4paper,twocolumn]{article}
|
||||
|
||||
\usepackage{abstract}
|
||||
\usepackage{xspace}
|
||||
\usepackage{amssymb}
|
||||
\usepackage{latexsym}
|
||||
\usepackage{tabularx}
|
||||
\usepackage[T1]{fontenc}
|
||||
\usepackage{calc}
|
||||
\usepackage{listings}
|
||||
\usepackage{color}
|
||||
\usepackage{url}
|
||||
|
||||
\title{Device trees everywhere}
|
||||
|
||||
\author{David Gibson \texttt{<{dwg}{@}{au1.ibm.com}>}\\
|
||||
Benjamin Herrenschmidt \texttt{<{benh}{@}{kernel.crashing.org}>}\\
|
||||
\emph{OzLabs, IBM Linux Technology Center}}
|
||||
|
||||
\newcommand{\R}{\textsuperscript{\textregistered}\xspace}
|
||||
\newcommand{\tm}{\textsuperscript{\texttrademark}\xspace}
|
||||
\newcommand{\tge}{$\geqslant$}
|
||||
%\newcommand{\ditto}{\textquotedbl\xspace}
|
||||
|
||||
\newcommand{\fixme}[1]{$\bigstar$\emph{\textbf{\large #1}}$\bigstar$\xspace}
|
||||
|
||||
\newcommand{\ppc}{\mbox{PowerPC}\xspace}
|
||||
\newcommand{\of}{Open Firmware\xspace}
|
||||
\newcommand{\benh}{Ben Herrenschmidt\xspace}
|
||||
\newcommand{\kexec}{\texttt{kexec()}\xspace}
|
||||
\newcommand{\dtbeginnode}{\texttt{OF\_DT\_BEGIN\_NODE\xspace}}
|
||||
\newcommand{\dtendnode}{\texttt{OF\_DT\_END\_NODE\xspace}}
|
||||
\newcommand{\dtprop}{\texttt{OF\_DT\_PROP\xspace}}
|
||||
\newcommand{\dtend}{\texttt{OF\_DT\_END\xspace}}
|
||||
\newcommand{\dtc}{\texttt{dtc}\xspace}
|
||||
\newcommand{\phandle}{\texttt{linux,phandle}\xspace}
|
||||
\begin{document}
|
||||
|
||||
\maketitle
|
||||
|
||||
\begin{abstract}
|
||||
We present a method for booting a \ppc{}\R Linux\R kernel on an
|
||||
embedded machine. To do this, we supply the kernel with a compact
|
||||
flattened-tree representation of the system's hardware based on the
|
||||
device tree supplied by Open Firmware on IBM\R servers and Apple\R
|
||||
Power Macintosh\R machines.
|
||||
|
||||
The ``blob'' representing the device tree can be created using \dtc
|
||||
--- the Device Tree Compiler --- that turns a simple text
|
||||
representation of the tree into the compact representation used by
|
||||
the kernel. The compiler can produce either a binary ``blob'' or an
|
||||
assembler file ready to be built into a firmware or bootwrapper
|
||||
image.
|
||||
|
||||
This flattened-tree approach is now the only supported method of
|
||||
booting a \texttt{ppc64} kernel without Open Firmware, and we plan
|
||||
to make it the only supported method for all \texttt{powerpc}
|
||||
kernels in the future.
|
||||
\end{abstract}
|
||||
|
||||
\section{Introduction}
|
||||
|
||||
\subsection{OF and the device tree}
|
||||
|
||||
Historically, ``everyday'' \ppc machines have booted with the help of
|
||||
\of (OF), a firmware environment defined by IEEE1275 \cite{IEEE1275}.
|
||||
Among other boot-time services, OF maintains a device tree that
|
||||
describes all of the system's hardware devices and how they're
|
||||
connected. During boot, before taking control of memory management,
|
||||
the Linux kernel uses OF calls to scan the device tree and transfer it
|
||||
to an internal representation that is used at run time to look up
|
||||
various device information.
|
||||
|
||||
The device tree consists of nodes representing devices or
|
||||
buses\footnote{Well, mostly. There are a few special exceptions.}.
|
||||
Each node contains \emph{properties}, name--value pairs that give
|
||||
information about the device. The values are arbitrary byte strings,
|
||||
and for some properties, they contain tables or other structured
|
||||
information.
|
||||
|
||||
\subsection{The bad old days}
|
||||
|
||||
Embedded systems, by contrast, usually have a minimal firmware that
|
||||
might supply a few vital system parameters (size of RAM and the like),
|
||||
but nothing as detailed or complete as the OF device tree. This has
|
||||
meant that the various 32-bit \ppc embedded ports have required a
|
||||
variety of hacks spread across the kernel to deal with the lack of
|
||||
device tree. These vary from specialised boot wrappers to parse
|
||||
parameters (which are at least reasonably localised) to
|
||||
CONFIG-dependent hacks in drivers to override normal probe logic with
|
||||
hardcoded addresses for a particular board. As well as being ugly of
|
||||
itself, such CONFIG-dependent hacks make it hard to build a single
|
||||
kernel image that supports multiple embedded machines.
|
||||
|
||||
Until relatively recently, the only 64-bit \ppc machines without OF
|
||||
were legacy (pre-POWER5\R) iSeries\R machines. iSeries machines often
|
||||
only have virtual IO devices, which makes it quite simple to work
|
||||
around the lack of a device tree. Even so, the lack means the iSeries
|
||||
boot sequence must be quite different from the pSeries or Macintosh,
|
||||
which is not ideal.
|
||||
|
||||
The device tree also presents a problem for implementing \kexec. When
|
||||
the kernel boots, it takes over full control of the system from OF,
|
||||
even re-using OF's memory. So, when \kexec comes to boot another
|
||||
kernel, OF is no longer around for the second kernel to query.
|
||||
|
||||
\section{The Flattened Tree}
|
||||
|
||||
In May 2005 \benh implemented a new approach to handling the device
|
||||
tree that addresses all these problems. When booting on OF systems,
|
||||
the first thing the kernel runs is a small piece of code in
|
||||
\texttt{prom\_init.c}, which executes in the context of OF. This code
|
||||
walks the device tree using OF calls, and transcribes it into a
|
||||
compact, flattened format. The resulting device tree ``blob'' is then
|
||||
passed to the kernel proper, which eventually unflattens the tree into
|
||||
its runtime form. This blob is the only data communicated between the
|
||||
\texttt{prom\_init.c} bootstrap and the rest of the kernel.
|
||||
|
||||
When OF isn't available, either because the machine doesn't have it at
|
||||
all or because \kexec has been used, the kernel instead starts
|
||||
directly from the entry point taking a flattened device tree. The
|
||||
device tree blob must be passed in from outside, rather than generated
|
||||
by part of the kernel from OF. For \kexec, the userland
|
||||
\texttt{kexec} tools build the blob from the runtime device tree
|
||||
before invoking the new kernel. For embedded systems the blob can
|
||||
come either from the embedded bootloader, or from a specialised
|
||||
version of the \texttt{zImage} wrapper for the system in question.
|
||||
|
||||
\subsection{Properties of the flattened tree}
|
||||
|
||||
The flattened tree format should be easy to handle, both for the
|
||||
kernel that parses it and the bootloader that generates it. In
|
||||
particular, the following properties are desirable:
|
||||
|
||||
\begin{itemize}
|
||||
\item \emph{relocatable}: the bootloader or kernel should be able to
|
||||
move the blob around as a whole, without needing to parse or adjust
|
||||
its internals. In practice that means we must not use pointers
|
||||
within the blob.
|
||||
\item \emph{insert and delete}: sometimes the bootloader might want to
|
||||
make tweaks to the flattened tree, such as deleting or inserting a
|
||||
node (or whole subtree). It should be possible to do this without
|
||||
having to effectively regenerate the whole flattened tree. In
|
||||
practice this means limiting the use of internal offsets in the blob
|
||||
that need recalculation if a section is inserted or removed with
|
||||
\texttt{memmove()}.
|
||||
\item \emph{compact}: embedded systems are frequently short of
|
||||
resources, particularly RAM and flash memory space. Thus, the tree
|
||||
representation should be kept as small as conveniently possible.
|
||||
\end{itemize}
|
||||
|
||||
\subsection{Format of the device tree blob}
|
||||
\label{sec:format}
|
||||
|
||||
\begin{figure}[htb!]
|
||||
\centering
|
||||
\footnotesize
|
||||
\begin{tabular}{r|c|l}
|
||||
\multicolumn{1}{r}{\textbf{Offset}}& \multicolumn{1}{c}{\textbf{Contents}} \\\cline{2-2}
|
||||
\texttt{0x00} & \texttt{0xd00dfeed} & magic number \\\cline{2-2}
|
||||
\texttt{0x04} & \emph{totalsize} \\\cline{2-2}
|
||||
\texttt{0x08} & \emph{off\_struct} & \\\cline{2-2}
|
||||
\texttt{0x0C} & \emph{off\_strs} & \\\cline{2-2}
|
||||
\texttt{0x10} & \emph{off\_rsvmap} & \\\cline{2-2}
|
||||
\texttt{0x14} & \emph{version} \\\cline{2-2}
|
||||
\texttt{0x18} & \emph{last\_comp\_ver} & \\\cline{2-2}
|
||||
\texttt{0x1C} & \emph{boot\_cpu\_id} & \tge v2 only\\\cline{2-2}
|
||||
\texttt{0x20} & \emph{size\_strs} & \tge v3 only\\\cline{2-2}
|
||||
\multicolumn{1}{r}{\vdots} & \multicolumn{1}{c}{\vdots} & \\\cline{2-2}
|
||||
\emph{off\_rsvmap} & \emph{address0} & memory reserve \\
|
||||
+ \texttt{0x04} & ...& table \\\cline{2-2}
|
||||
+ \texttt{0x08} & \emph{len0} & \\
|
||||
+ \texttt{0x0C} & ...& \\\cline{2-2}
|
||||
\vdots & \multicolumn{1}{c|}{\vdots} & \\\cline{2-2}
|
||||
& \texttt{0x00000000}- & end marker\\
|
||||
& \texttt{00000000} & \\\cline{2-2}
|
||||
& \texttt{0x00000000}- & \\
|
||||
& \texttt{00000000} & \\\cline{2-2}
|
||||
\multicolumn{1}{r}{\vdots} & \multicolumn{1}{c}{\vdots} & \\\cline{2-2}
|
||||
\emph{off\_strs} & \texttt{'n' 'a' 'm' 'e'} & strings block \\
|
||||
+ \texttt{0x04} & \texttt{~0~ 'm' 'o' 'd'} & \\
|
||||
+ \texttt{0x08} & \texttt{'e' 'l' ~0~ \makebox[\widthof{~~~}]{\textrm{...}}} & \\
|
||||
\vdots & \multicolumn{1}{c|}{\vdots} & \\\cline{2-2}
|
||||
\multicolumn{1}{r}{+ \emph{size\_strs}} \\
|
||||
\multicolumn{1}{r}{\vdots} & \multicolumn{1}{c}{\vdots} & \\\cline{2-2}
|
||||
\emph{off\_struct} & \dtbeginnode & structure block \\\cline{2-2}
|
||||
+ \texttt{0x04} & \texttt{'/' ~0~ ~0~ ~0~} & root node\\\cline{2-2}
|
||||
+ \texttt{0x08} & \dtprop & \\\cline{2-2}
|
||||
+ \texttt{0x0C} & \texttt{0x00000005} & ``\texttt{model}''\\\cline{2-2}
|
||||
+ \texttt{0x10} & \texttt{0x00000008} & \\\cline{2-2}
|
||||
+ \texttt{0x14} & \texttt{'M' 'y' 'B' 'o'} & \\
|
||||
+ \texttt{0x18} & \texttt{'a' 'r' 'd' ~0~} & \\\cline{2-2}
|
||||
\vdots & \multicolumn{1}{c|}{\vdots} & \\\cline{2-2}
|
||||
& \texttt{\dtendnode} \\\cline{2-2}
|
||||
& \texttt{\dtend} \\\cline{2-2}
|
||||
\multicolumn{1}{r}{\vdots} & \multicolumn{1}{c}{\vdots} & \\\cline{2-2}
|
||||
\multicolumn{1}{r}{\emph{totalsize}} \\
|
||||
\end{tabular}
|
||||
\caption{Device tree blob layout}
|
||||
\label{fig:blob-layout}
|
||||
\end{figure}
|
||||
|
||||
The format for the blob we devised, was first described on the
|
||||
\texttt{linuxppc64-dev} mailing list in \cite{noof1}. The format has
|
||||
since evolved through various revisions, and the current version is
|
||||
included as part of the \dtc (see \S\ref{sec:dtc}) git tree,
|
||||
\cite{dtcgit}.
|
||||
|
||||
Figure \ref{fig:blob-layout} shows the layout of the blob of data
|
||||
containing the device tree. It has three sections of variable size:
|
||||
the \emph{memory reserve table}, the \emph{structure block} and the
|
||||
\emph{strings block}. A small header gives the blob's size and
|
||||
version and the locations of the three sections, plus a handful of
|
||||
vital parameters used during early boot.
|
||||
|
||||
The memory reserve map section gives a list of regions of memory that
|
||||
the kernel must not use\footnote{Usually such ranges contain some data
|
||||
structure initialised by the firmware that must be preserved by the
|
||||
kernel.}. The list is represented as a simple array of (address,
|
||||
size) pairs of 64 bit values, terminated by a zero size entry. The
|
||||
strings block is similarly simple, consisting of a number of
|
||||
null-terminated strings appended together, which are referenced from
|
||||
the structure block as described below.
|
||||
|
||||
The structure block contains the device tree proper. Each node is
|
||||
introduced with a 32-bit \dtbeginnode tag, followed by the node's name
|
||||
as a null-terminated string, padded to a 32-bit boundary. Then
|
||||
follows all of the properties of the node, each introduced with a
|
||||
\dtprop tag, then all of the node's subnodes, each introduced with
|
||||
their own \dtbeginnode tag. The node ends with an \dtendnode tag, and
|
||||
after the \dtendnode for the root node is an \dtend tag, indicating
|
||||
the end of the whole tree\footnote{This is redundant, but included for
|
||||
ease of parsing.}. The structure block starts with the \dtbeginnode
|
||||
introducing the description of the root node (named \texttt{/}).
|
||||
|
||||
Each property, after the \dtprop, has a 32-bit value giving an offset
|
||||
from the beginning of the strings block at which the property name is
|
||||
stored. Because it's common for many nodes to have properties with
|
||||
the same name, this approach can substantially reduce the total size
|
||||
of the blob. The name offset is followed by the length of the
|
||||
property value (as a 32-bit value) and then the data itself padded to
|
||||
a 32-bit boundary.
|
||||
|
||||
\subsection{Contents of the tree}
|
||||
\label{sec:treecontents}
|
||||
|
||||
Having seen how to represent the device tree structure as a flattened
|
||||
blob, what actually goes into the tree? The short answer is ``the
|
||||
same as an OF tree''. On OF systems, the flattened tree is
|
||||
transcribed directly from the OF device tree, so for simplicity we
|
||||
also use OF conventions for the tree on other systems.
|
||||
|
||||
In many cases a flat tree can be simpler than a typical OF provided
|
||||
device tree. The flattened tree need only provide those nodes and
|
||||
properties that the kernel actually requires; the flattened tree
|
||||
generally need not include devices that the kernel can probe itself.
|
||||
For example, an OF device tree would normally include nodes for each
|
||||
PCI device on the system. A flattened tree need only include nodes
|
||||
for the PCI host bridges; the kernel will scan the buses thus
|
||||
described to find the subsidiary devices. The device tree can include
|
||||
nodes for devices where the kernel needs extra information, though:
|
||||
for example, for ISA devices on a subsidiary PCI/ISA bridge, or for
|
||||
devices with unusual interrupt routing.
|
||||
|
||||
Where they exist, we follow the IEEE1275 bindings that specify how to
|
||||
describe various buses in the device tree (for example,
|
||||
\cite{IEEE1275-pci} describe how to represent PCI devices). The
|
||||
standard has not been updated for a long time, however, and lacks
|
||||
bindings for many modern buses and devices. In particular, embedded
|
||||
specific devices such as the various System-on-Chip buses are not
|
||||
covered. We intend to create new bindings for such buses, in keeping
|
||||
with the general conventions of IEEE1275 (a simple such binding for a
|
||||
System-on-Chip bus was included in \cite{noof5} a revision of
|
||||
\cite{noof1}).
|
||||
|
||||
One complication arises for representing ``phandles'' in the flattened
|
||||
tree. In OF, each node in the tree has an associated phandle, a
|
||||
32-bit integer that uniquely identifies the node\footnote{In practice
|
||||
usually implemented as a pointer or offset within OF memory.}. This
|
||||
handle is used by the various OF calls to query and traverse the tree.
|
||||
Sometimes phandles are also used within the tree to refer to other
|
||||
nodes in the tree. For example, devices that produce interrupts
|
||||
generally have an \texttt{interrupt-parent} property giving the
|
||||
phandle of the interrupt controller that handles interrupts from this
|
||||
device. Parsing these and other interrupt related properties allows
|
||||
the kernel to build a complete representation of the system's
|
||||
interrupt tree, which can be quite different from the tree of bus
|
||||
connections.
|
||||
|
||||
In the flattened tree, a node's phandle is represented by a special
|
||||
\phandle property. When the kernel generates a flattened tree from
|
||||
OF, it adds a \phandle property to each node, containing the phandle
|
||||
retrieved from OF. When the tree is generated without OF, however,
|
||||
only nodes that are actually referred to by phandle need to have this
|
||||
property.
|
||||
|
||||
Another complication arises because nodes in an OF tree have two
|
||||
names. First they have the ``unit name'', which is how the node is
|
||||
referred to in an OF path. The unit name generally consists of a
|
||||
device type followed by an \texttt{@} followed by a \emph{unit
|
||||
address}. For example \texttt{/memory@0} is the full path of a memory
|
||||
node at address 0, \texttt{/ht@0,f2000000/pci@1} is the path of a PCI
|
||||
bus node, which is under a HyperTransport\tm bus node. The form of
|
||||
the unit address is bus dependent, but is generally derived from the
|
||||
node's \texttt{reg} property. In addition, nodes have a property,
|
||||
\texttt{name}, whose value is usually equal to the first path of the
|
||||
unit name. For example, the nodes in the previous example would have
|
||||
\texttt{name} properties equal to \texttt{memory} and \texttt{pci},
|
||||
respectively. To save space in the blob, the current version of the
|
||||
flattened tree format only requires the unit names to be present.
|
||||
When the kernel unflattens the tree, it automatically generates a
|
||||
\texttt{name} property from the node's path name.
|
||||
|
||||
\section{The Device Tree Compiler}
|
||||
\label{sec:dtc}
|
||||
|
||||
\begin{figure}[htb!]
|
||||
\centering
|
||||
\begin{lstlisting}[frame=single,basicstyle=\footnotesize\ttfamily,
|
||||
tabsize=3,numbers=left,xleftmargin=2em]
|
||||
/memreserve/ 0x20000000-0x21FFFFFF;
|
||||
|
||||
/ {
|
||||
model = "MyBoard";
|
||||
compatible = "MyBoardFamily";
|
||||
#address-cells = <2>;
|
||||
#size-cells = <2>;
|
||||
|
||||
cpus {
|
||||
#address-cells = <1>;
|
||||
#size-cells = <0>;
|
||||
PowerPC,970@0 {
|
||||
device_type = "cpu";
|
||||
reg = <0>;
|
||||
clock-frequency = <5f5e1000>;
|
||||
timebase-frequency = <1FCA055>;
|
||||
linux,boot-cpu;
|
||||
i-cache-size = <10000>;
|
||||
d-cache-size = <8000>;
|
||||
};
|
||||
};
|
||||
|
||||
memory@0 {
|
||||
device_type = "memory";
|
||||
memreg: reg = <00000000 00000000
|
||||
00000000 20000000>;
|
||||
};
|
||||
|
||||
mpic@0x3fffdd08400 {
|
||||
/* Interrupt controller */
|
||||
/* ... */
|
||||
};
|
||||
|
||||
pci@40000000000000 {
|
||||
/* PCI host bridge */
|
||||
/* ... */
|
||||
};
|
||||
|
||||
chosen {
|
||||
bootargs = "root=/dev/sda2";
|
||||
linux,platform = <00000600>;
|
||||
interrupt-controller =
|
||||
< &/mpic@0x3fffdd08400 >;
|
||||
};
|
||||
};
|
||||
\end{lstlisting}
|
||||
\caption{Example \dtc source}
|
||||
\label{fig:dts}
|
||||
\end{figure}
|
||||
|
||||
As we've seen, the flattened device tree format provides a convenient
|
||||
way of communicating device tree information to the kernel. It's
|
||||
simple for the kernel to parse, and simple for bootloaders to
|
||||
manipulate. On OF systems, it's easy to generate the flattened tree
|
||||
by walking the OF maintained tree. However, for embedded systems, the
|
||||
flattened tree must be generated from scratch.
|
||||
|
||||
Embedded bootloaders are generally built for a particular board. So,
|
||||
it's usually possible to build the device tree blob at compile time
|
||||
and include it in the bootloader image. For minor revisions of the
|
||||
board, the bootloader can contain code to make the necessary tweaks to
|
||||
the tree before passing it to the booted kernel.
|
||||
|
||||
The device trees for embedded boards are usually quite simple, and
|
||||
it's possible to hand construct the necessary blob by hand, but doing
|
||||
so is tedious. The ``device tree compiler'', \dtc{}\footnote{\dtc can
|
||||
be obtained from \cite{dtcgit}.}, is designed to make creating device
|
||||
tree blobs easier by converting a text representation of the tree
|
||||
into the necessary blob.
|
||||
|
||||
\subsection{Input and output formats}
|
||||
|
||||
As well as the normal mode of compiling a device tree blob from text
|
||||
source, \dtc can convert a device tree between a number of
|
||||
representations. It can take its input in one of three different
|
||||
formats:
|
||||
\begin{itemize}
|
||||
\item source, the normal case. The device tree is described in a text
|
||||
form, described in \S\ref{sec:dts}.
|
||||
\item blob (\texttt{dtb}), the flattened tree format described in
|
||||
\S\ref{sec:format}. This mode is useful for checking a pre-existing
|
||||
device tree blob.
|
||||
\item filesystem (\texttt{fs}), input is a directory tree in the
|
||||
layout of \texttt{/proc/device-tree} (roughly, a directory for each
|
||||
node in the device tree, a file for each property). This is useful
|
||||
for building a blob for the device tree in use by the currently
|
||||
running kernel.
|
||||
\end{itemize}
|
||||
|
||||
In addition, \dtc can output the tree in one of three different
|
||||
formats:
|
||||
\begin{itemize}
|
||||
\item blob (\texttt{dtb}), as in \S\ref{sec:format}. The most
|
||||
straightforward use of \dtc is to compile from ``source'' to
|
||||
``blob'' format.
|
||||
\item source (\texttt{dts}), as in \S\ref{sec:dts}. If used with blob
|
||||
input, this allows \dtc to act as a ``decompiler''.
|
||||
\item assembler source (\texttt{asm}). \dtc can produce an assembler
|
||||
file, which will assemble into a \texttt{.o} file containing the
|
||||
device tree blob, with symbols giving the beginning of the blob and
|
||||
its various subsections. This can then be linked directly into a
|
||||
bootloader or firmware image.
|
||||
\end{itemize}
|
||||
|
||||
For maximum applicability, \dtc can both read and write any of the
|
||||
existing revisions of the blob format. When reading, \dtc takes the
|
||||
version from the blob header, and when writing it takes a command line
|
||||
option specifying the desired version. It automatically makes any
|
||||
necessary adjustments to the tree that are necessary for the specified
|
||||
version. For example, formats before 0x10 require each node to have
|
||||
an explicit \texttt{name} property. When \dtc creates such a blob, it
|
||||
will automatically generate \texttt{name} properties from the unit
|
||||
names.
|
||||
|
||||
\subsection{Source format}
|
||||
\label{sec:dts}
|
||||
|
||||
The ``source'' format for \dtc is a text description of the device
|
||||
tree in a vaguely C-like form. Figure \ref{fig:dts} shows an
|
||||
example. The file starts with \texttt{/memreserve/} directives, which
|
||||
gives address ranges to add to the output blob's memory reserve table,
|
||||
then the device tree proper is described.
|
||||
|
||||
Nodes of the tree are introduced with the node name, followed by a
|
||||
\texttt{\{} ... \texttt{\};} block containing the node's properties
|
||||
and subnodes. Properties are given as just {\emph{name} \texttt{=}
|
||||
\emph{value}\texttt{;}}. The property values can be given in any
|
||||
of three forms:
|
||||
\begin{itemize}
|
||||
\item \emph{string} (for example, \texttt{"MyBoard"}). The property
|
||||
value is the given string, including terminating NULL. C-style
|
||||
escapes (\verb+\t+, \verb+\n+, \verb+\0+ and so forth) are allowed.
|
||||
\item \emph{cells} (for example, \texttt{<0 8000 f0000000>}). The
|
||||
property value is made up of a list of 32-bit ``cells'', each given
|
||||
as a hex value.
|
||||
\item \emph{bytestring} (for example, \texttt{[1234abcdef]}). The
|
||||
property value is given as a hex bytestring.
|
||||
\end{itemize}
|
||||
|
||||
Cell properties can also contain \emph{references}. Instead of a hex
|
||||
number, the source can give an ampersand (\texttt{\&}) followed by the
|
||||
full path to some node in the tree. For example, in Figure
|
||||
\ref{fig:dts}, the \texttt{/chosen} node has an
|
||||
\texttt{interrupt-controller} property referring to the interrupt
|
||||
controller described by the node \texttt{/mpic@0x3fffdd08400}. In the
|
||||
output tree, the value of the referenced node's phandle is included in
|
||||
the property. If that node doesn't have an explicit phandle property,
|
||||
\dtc will automatically create a unique phandle for it. This approach
|
||||
makes it easy to create interrupt trees without having to explicitly
|
||||
assign and remember phandles for the various interrupt controller
|
||||
nodes.
|
||||
|
||||
The \dtc source can also include ``labels'', which are placed on a
|
||||
particular node or property. For example, Figure \ref{fig:dts} has a
|
||||
label ``\texttt{memreg}'' on the \texttt{reg} property of the node
|
||||
\texttt{/memory@0}. When using assembler output, corresponding labels
|
||||
in the output are generated, which will assemble into symbols
|
||||
addressing the part of the blob with the node or property in question.
|
||||
This is useful for the common case where an embedded board has an
|
||||
essentially fixed device tree with a few variable properties, such as
|
||||
the size of memory. The bootloader for such a board can have a device
|
||||
tree linked in, including a symbol referring to the right place in the
|
||||
blob to update the parameter with the correct value determined at
|
||||
runtime.
|
||||
|
||||
\subsection{Tree checking}
|
||||
|
||||
Between reading in the device tree and writing it out in the new
|
||||
format, \dtc performs a number of checks on the tree:
|
||||
\begin{itemize}
|
||||
\item \emph{syntactic structure}: \dtc checks that node and property
|
||||
names contain only allowed characters and meet length restrictions.
|
||||
It checks that a node does not have multiple properties or subnodes
|
||||
with the same name.
|
||||
\item \emph{semantic structure}: In some cases, \dtc checks that
|
||||
properties whose contents are defined by convention have appropriate
|
||||
values. For example, it checks that \texttt{reg} properties have a
|
||||
length that makes sense given the address forms specified by the
|
||||
\texttt{\#address-cells} and \texttt{\#size-cells} properties. It
|
||||
checks that properties such as \texttt{interrupt-parent} contain a
|
||||
valid phandle.
|
||||
\item \emph{Linux requirements}: \dtc checks that the device tree
|
||||
contains those nodes and properties that are required by the Linux
|
||||
kernel to boot correctly.
|
||||
\end{itemize}
|
||||
|
||||
These checks are useful to catch simple problems with the device tree,
|
||||
rather than having to debug the results on an embedded kernel. With
|
||||
the blob input mode, it can also be used for diagnosing problems with
|
||||
an existing blob.
|
||||
|
||||
\section{Future Work}
|
||||
|
||||
\subsection{Board ports}
|
||||
|
||||
The flattened device tree has always been the only supported way to
|
||||
boot a \texttt{ppc64} kernel on an embedded system. With the merge of
|
||||
\texttt{ppc32} and \texttt{ppc64} code it has also become the only
|
||||
supported way to boot any merged \texttt{powerpc} kernel, 32-bit or
|
||||
64-bit. In fact, the old \texttt{ppc} architecture exists mainly just
|
||||
to support the old ppc32 embedded ports that have not been migrated
|
||||
to the flattened device tree approach. We plan to remove the
|
||||
\texttt{ppc} architecture eventually, which will mean porting all the
|
||||
various embedded boards to use the flattened device tree.
|
||||
|
||||
\subsection{\dtc features}
|
||||
|
||||
While it is already quite usable, there are a number of extra features
|
||||
that \dtc could include to make creating device trees more convenient:
|
||||
\begin{itemize}
|
||||
\item \emph{better tree checking}: Although \dtc already performs a
|
||||
number of checks on the device tree, they are rather haphazard. In
|
||||
many cases \dtc will give up after detecting a minor error early and
|
||||
won't pick up more interesting errors later on. There is a
|
||||
\texttt{-f} parameter that forces \dtc to generate an output tree
|
||||
even if there are errors. At present, this needs to be used more
|
||||
often than one might hope, because \dtc is bad at deciding which
|
||||
errors should really be fatal, and which rate mere warnings.
|
||||
\item \emph{binary include}: Occasionally, it is useful for the device
|
||||
tree to incorporate as a property a block of binary data for some
|
||||
board-specific purpose. For example, many of Apple's device trees
|
||||
incorporate bytecode drivers for certain platform devices. \dtc's
|
||||
source format ought to allow this by letting a property's value be
|
||||
read directly from a binary file.
|
||||
\item \emph{macros}: it might be useful for \dtc to implement some
|
||||
sort of macros so that a tree containing a number of similar devices
|
||||
(for example, multiple identical ethernet controllers or PCI buses)
|
||||
can be written more quickly. At present, this can be accomplished
|
||||
in part by running the source file through CPP before compiling with
|
||||
\dtc. It's not clear whether ``native'' support for macros would be
|
||||
more useful.
|
||||
\end{itemize}
|
||||
|
||||
\bibliographystyle{amsplain}
|
||||
\bibliography{dtc-paper}
|
||||
|
||||
\section*{About the authors}
|
||||
|
||||
David Gibson has been a member of the IBM Linux Technology Center,
|
||||
working from Canberra, Australia, since 2001. Recently he has worked
|
||||
on Linux hugepage support and performance counter support for ppc64,
|
||||
as well as the device tree compiler. In the past, he has worked on
|
||||
bringup for various ppc and ppc64 embedded systems, the orinoco
|
||||
wireless driver, ramfs, and a userspace checkpointing system
|
||||
(\texttt{esky}).
|
||||
|
||||
Benjamin Herrenschmidt was a MacOS developer for about 10 years, but
|
||||
ultimately saw the light and installed Linux on his Apple PowerPC
|
||||
machine. After writing a bootloader, BootX, for it in 1998, he
|
||||
started contributing to the PowerPC Linux port in various areas,
|
||||
mostly around the support for Apple machines. He became official
|
||||
PowerMac maintainer in 2001. In 2003, he joined the IBM Linux
|
||||
Technology Center in Canberra, Australia, where he ported the 64 bit
|
||||
PowerPC kernel to Apple G5 machines and the Maple embedded board,
|
||||
among others things. He's a member of the ppc64 development ``team''
|
||||
and one of his current goals is to make the integration of embedded
|
||||
platforms smoother and more maintainable than in the 32-bit PowerPC
|
||||
kernel.
|
||||
|
||||
\section*{Legal Statement}
|
||||
|
||||
This work represents the view of the author and does not necessarily
|
||||
represent the view of IBM.
|
||||
|
||||
IBM, \ppc, \ppc Architecture, POWER5, pSeries and iSeries are
|
||||
trademarks or registered trademarks of International Business Machines
|
||||
Corporation in the United States and/or other countries.
|
||||
|
||||
Apple and Power Macintosh are a registered trademarks of Apple
|
||||
Computer Inc. in the United States, other countries, or both.
|
||||
|
||||
Linux is a registered trademark of Linus Torvalds.
|
||||
|
||||
Other company, product, and service names may be trademarks or service
|
||||
marks of others.
|
||||
|
||||
\end{document}
|
Loading…
Reference in a new issue