diff --git a/en_US.ISO8859-1/articles/committers-guide/article.xml b/en_US.ISO8859-1/articles/committers-guide/article.xml
index e05587a219..db04ddd5f0 100644
--- a/en_US.ISO8859-1/articles/committers-guide/article.xml
+++ b/en_US.ISO8859-1/articles/committers-guide/article.xml
@@ -1,5655 +1,5655 @@
 <?xml version="1.0" encoding="iso-8859-1"?>
 <!DOCTYPE article PUBLIC "-//FreeBSD//DTD DocBook XML V5.0-Based Extension//EN"
 	"http://www.FreeBSD.org/XML/share/xml/freebsd50.dtd" [
 <!ENTITY ga "Google Analytics">
 ]>
 
 <article xmlns="http://docbook.org/ns/docbook"
   xmlns:xlink="http://www.w3.org/1999/xlink" version="5.0"
   xml:lang="en">
 
   <info>
     <title>Committer's Guide</title>
 
     <author>
       <orgname>The &os; Documentation Project</orgname>
     </author>
 
     <copyright>
       <year>1999</year>
       <year>2000</year>
       <year>2001</year>
       <year>2002</year>
       <year>2003</year>
       <year>2004</year>
       <year>2005</year>
       <year>2006</year>
       <year>2007</year>
       <year>2008</year>
       <year>2009</year>
       <year>2010</year>
       <year>2011</year>
       <year>2012</year>
       <year>2013</year>
       <year>2014</year>
       <year>2015</year>
       <year>2016</year>
       <year>2017</year>
       <year>2018</year>
       <year>2019</year>
       <year>2020</year>
       <holder>The &os; Documentation Project</holder>
     </copyright>
 
     <legalnotice xml:id="trademarks" role="trademarks">
       &tm-attrib.freebsd;
       &tm-attrib.coverity;
       &tm-attrib.ibm;
       &tm-attrib.intel;
       &tm-attrib.sparc;
       &tm-attrib.general;
     </legalnotice>
 
     <pubdate>$FreeBSD$</pubdate>
 
     <releaseinfo>$FreeBSD$</releaseinfo>
 
     <abstract>
       <para>This document provides information for the &os;
 	committer community.  All new committers should read this
 	document before they start, and existing committers are
 	strongly encouraged to review it from time to time.</para>
 
       <para>Almost all &os; developers have commit rights to one or
 	more repositories.  However, a few developers do not, and some
 	of the information here applies to them as well.  (For
 	instance, some people only have rights to work with the
 	Problem Report database.)  Please see
 	<xref linkend="non-committers"/> for more information.</para>
 
       <para>This document may also be of interest to members of the
 	&os; community who want to learn more about how the project
 	works.</para>
     </abstract>
   </info>
 
   <sect1 xml:id="admin">
     <title>Administrative Details</title>
 
     <informaltable frame="none" orient="port" pgwide="1">
       <tgroup cols="2">
 	<colspec colwidth="20*"/>
 	<colspec colwidth="80*"/>
 	<tbody>
 	  <row>
 	    <entry><emphasis>Login Methods</emphasis></entry>
 	    <entry>&man.ssh.1;, protocol 2 only</entry>
 	  </row>
 
 	  <row>
 	    <entry><emphasis>Main Shell Host</emphasis></entry>
 	    <entry><systemitem
 		class="fqdomainname">freefall.FreeBSD.org</systemitem></entry>
 	  </row>
 
 	  <row>
 	    <entry><emphasis>SMTP Host</emphasis></entry>
 	    <entry>
 	      <literal><systemitem
 		class="fqdomainname">smtp.FreeBSD.org</systemitem>:587</literal>
 	      (see also <xref linkend="smtp-setup"/>).</entry>
 	  </row>
 
 	  <row>
 	    <entry><emphasis><literal>src/</literal> Subversion
 		Root</emphasis></entry>
 	    <entry><literal>svn+ssh://</literal><systemitem
 		class="fqdomainname">repo.FreeBSD.org</systemitem><filename>/base</filename>
 	      (see also <xref
 		linkend="svn-getting-started-base-layout"/>).</entry>
 	  </row>
 
 	  <row>
 	    <entry><emphasis><literal>doc/</literal> Subversion
 		Root</emphasis></entry>
 	    <entry><literal>svn+ssh://</literal><systemitem
 		class="fqdomainname">repo.FreeBSD.org</systemitem><filename>/doc</filename>
 	      (see also <xref
 		linkend="svn-getting-started-doc-layout"/>).</entry>
 	  </row>
 
 	  <row>
 	    <entry><emphasis><literal>ports/</literal> Subversion
 		Root</emphasis></entry>
 
 	    <entry><literal>svn+ssh://</literal><systemitem
 		class="fqdomainname">repo.FreeBSD.org</systemitem><filename>/ports</filename>
 	      (see also <xref
 		linkend="svn-getting-started-ports-layout"/>).</entry>
 	  </row>
 
 	  <row>
 	    <entry><emphasis>Internal Mailing Lists</emphasis></entry>
 	    <entry>developers (technically called all-developers),
 	      doc-developers, doc-committers, ports-developers,
 	      ports-committers, src-developers, src-committers.  (Each
 	      project repository has its own -developers and
 	      -committers mailing lists.  Archives for these lists can
 	      be found in the files
 	      <filename>/local/mail/<replaceable>repository-name</replaceable>-developers-archive</filename>
 	      and
 	      <filename>/local/mail/<replaceable>repository-name</replaceable>-committers-archive</filename>
 	      on the <systemitem
 		class="fqdomainname">FreeBSD.org</systemitem>
 	      cluster.)</entry>
 	  </row>
 
 
 	  <row>
 	    <entry><emphasis>Core Team monthly
 		reports</emphasis></entry>
 	    <entry><filename>/home/core/public/monthly-reports</filename>
 	      on the <systemitem
 		class="fqdomainname">FreeBSD.org</systemitem>
 	      cluster.</entry>
 	  </row>
 
 	  <row>
 	    <entry><emphasis>Ports Management Team monthly
 		reports</emphasis></entry>
 	    <entry><filename>/home/portmgr/public/monthly-reports</filename>
 	      on the <systemitem
 		class="fqdomainname">FreeBSD.org</systemitem>
 	      cluster.</entry>
 	  </row>
 
 	  <row>
 	    <entry><emphasis>Noteworthy <literal>src/</literal> SVN
 		Branches</emphasis></entry>
 	    <entry>
 	      <literal>stable/</literal><replaceable>n</replaceable>
 	      (<replaceable>n</replaceable>-STABLE),
 	      <literal>head</literal> (-CURRENT)</entry>
 	  </row>
 	</tbody>
       </tgroup>
     </informaltable>
 
     <para>&man.ssh.1; is required to connect to the project hosts.
       For more information, see <xref linkend="ssh.guide"/>.</para>
 
     <para>Useful links:</para>
 
     <itemizedlist>
       <listitem>
 	<para><link xlink:href="&url.base;/internal/">&os;
 	    Project Internal Pages</link></para>
       </listitem>
 
       <listitem>
 	<para><link
 	    xlink:href="&url.base;/internal/machines.html">&os;
 	    Project Hosts</link></para>
       </listitem>
 
       <listitem>
 	<para><link xlink:href="&url.base;/administration.html">&os;
 	    Project Administrative Groups</link></para>
       </listitem>
     </itemizedlist>
   </sect1>
 
   <sect1 xml:id="pgpkeys">
     <title>Open<acronym>PGP</acronym> Keys for &os;</title>
 
     <para>Cryptographic keys conforming to the
       Open<acronym>PGP</acronym> (<emphasis>Pretty Good
       Privacy</emphasis>) standard are used by the &os; project to
       authenticate committers.  Messages carrying important
       information like public <acronym>SSH</acronym> keys can be
       signed with the Open<acronym>PGP</acronym> key to prove that
       they are really from the committer.  See
       <link xlink:href="http://www.nostarch.com/pgp_ml.htm">PGP &amp;
 	GPG: Email for the Practical Paranoid by Michael Lucas</link>
       and <link
 	xlink:href="http://en.wikipedia.org/wiki/Pretty_Good_Privacy"></link>
       for more information.</para>
 
     <sect2 xml:id="pgpkeys-creating">
       <title>Creating a Key</title>
 
       <para>Existing keys can be used, but should be checked with
 	<filename>doc/head/share/pgpkeys/checkkey.sh</filename>
 	first.  In this case, make sure the key has a &os; user
 	ID.</para>
 
       <para>For those who do not yet have an
 	Open<acronym>PGP</acronym> key, or need a new key to meet &os;
 	security requirements, here we show how to generate
 	one.</para>
 
       <procedure xml:id="pgpkeys-create-steps">
 
 	<step>
 	  <para>Install
 	    <filename role="package">security/gnupg</filename>.  Enter
 	    these lines in <filename>~/.gnupg/gpg.conf</filename> to
 	    set minimum acceptable defaults:</para>
 
 	  <programlisting>fixed-list-mode
 keyid-format 0xlong
 personal-digest-preferences SHA512 SHA384 SHA256 SHA224
 default-preference-list SHA512 SHA384 SHA256 SHA224 AES256 AES192 AES CAST5 BZIP2 ZLIB ZIP Uncompressed
 use-agent
 verify-options show-uid-validity
 list-options show-uid-validity
 sig-notation issuer-fpr@notations.openpgp.fifthhorseman.net=%g
 cert-digest-algo SHA512</programlisting>
 	</step>
 
 	<step>
 	  <para>Generate a key:</para>
 
 	  <screen>&prompt.user; <userinput>gpg --full-gen-key</userinput>
 gpg (GnuPG) 2.1.8; Copyright (C) 2015 Free Software Foundation, Inc.
 This is free software: you are free to change and redistribute it.
 There is NO WARRANTY, to the extent permitted by law.
 
 Warning: using insecure memory!
 Please select what kind of key you want:
    (1) RSA and RSA (default)
    (2) DSA and Elgamal
    (3) DSA (sign only)
    (4) RSA (sign only)
 Your selection? <userinput>1</userinput>
 RSA keys may be between 1024 and 4096 bits long.
 What keysize do you want? (2048) <userinput>2048</userinput>  <co xml:id="co-pgp-bits"/>
 Requested keysize is 2048 bits
 Please specify how long the key should be valid.
 	 0 = key does not expire
       &lt;n&gt;  = key expires in n days
       &lt;n&gt;w = key expires in n weeks
       &lt;n&gt;m = key expires in n months
       &lt;n&gt;y = key expires in n years
 Key is valid for? (0) <userinput>3y</userinput>  <co xml:id="co-pgp-expire"/>
 Key expires at Wed Nov  4 17:20:20 2015 MST
 Is this correct? (y/N) <userinput>y</userinput>
 
 GnuPG needs to construct a user ID to identify your key.
 
 Real name: <userinput><replaceable>Chucky Daemon</replaceable></userinput> <co xml:id="co-pgp-realname"/>
 Email address: <userinput><replaceable>notreal@example.com</replaceable></userinput>
 Comment:
 You selected this USER-ID:
     "<replaceable>Chucky Daemon &lt;notreal@example.com&gt;</replaceable>"
 
 Change (N)ame, (C)omment, (E)mail or (O)kay/(Q)uit? <userinput>o</userinput>
 You need a Passphrase to protect your secret key.</screen>
 
 	  <calloutlist>
 	    <callout arearefs="co-pgp-bits">
 	      <para>2048-bit keys with a three-year expiration provide
 		adequate protection at present (2013-12).  <link
 		  xlink:href="http://danielpocock.com/rsa-key-sizes-2048-or-4096-bits"/>
 		describes the situation in more detail.</para>
 	    </callout>
 
 	    <callout arearefs="co-pgp-expire">
 	      <para>A three year key lifespan is short enough to
 		obsolete keys weakened by advancing computer power,
 		but long enough to reduce key management
 		problems.</para>
 	    </callout>
 
 	    <callout arearefs="co-pgp-realname">
 	      <para>Use your real name here, preferably matching that
 		shown on government-issued <acronym>ID</acronym> to
 		make it easier for others to verify your identity.
 		Text that may help others identify you can be entered
 		in the <literal>Comment</literal> section.</para>
 	    </callout>
 	  </calloutlist>
 
 	  <para>After the email address is entered, a passphrase is
 	    requested.  Methods of creating a secure passphrase are
 	    contentious.  Rather than suggest a single way, here are
 	    some links to sites that describe various methods: <link
 	      xlink:href="http://world.std.com/~reinhold/diceware.html"></link>,
 	    <link
 	      xlink:href="http://www.iusmentis.com/security/passphrasefaq/"></link>,
 	    <link xlink:href="http://xkcd.com/936/"></link>,
 	    <link
 	      xlink:href="http://en.wikipedia.org/wiki/Passphrase"></link>.</para>
 	</step>
       </procedure>
 
       <para>Protect the private key and passphrase.  If either the
 	private key or passphrase may have been compromised or
 	disclosed, immediately notify
 	<email>accounts@FreeBSD.org</email> and revoke the key.</para>
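 
       <para>As a rough sketch of the revocation step, a revocation
 	certificate can be generated with
 	<command>gpg --gen-revoke</command> and then imported into
 	the local keyring.  The address
 	<replaceable>notreal@example.com</replaceable> is the
 	placeholder identity from the example above, and the output
 	file name <filename>revoke.asc</filename> is only an
 	assumption:</para>
 
       <screen>&prompt.user; <userinput>gpg --output <replaceable>revoke.asc</replaceable> --gen-revoke <replaceable>notreal@example.com</replaceable></userinput>
 &prompt.user; <userinput>gpg --import <replaceable>revoke.asc</replaceable></userinput></screen>
 
       <para>The revoked public key still has to be redistributed,
 	for example to a keyserver, before others can see the
 	revocation.</para>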
 
       <para>Committing the new key is shown in
 	<xref linkend="commit-steps"/>.</para>
     </sect2>
   </sect1>
 
   <sect1 xml:id="kerberos-ldap">
     <title>Kerberos and LDAP Web Password for &os; Cluster</title>
 
     <para>The &os; cluster requires a Kerberos password to access
       certain services.  The Kerberos password also serves as the
       LDAP web password, since LDAP is proxying to Kerberos in the
       cluster.  Some of the services
       which require this include:</para>
 
     <itemizedlist>
       <listitem>
 	<para><link
 	    xlink:href="https://bugs.freebsd.org/bugzilla">Bugzilla</link></para>
       </listitem>
       <listitem>
 	<para><link
 	    xlink:href="https://ci.freebsd.org">Jenkins</link></para>
       </listitem>
     </itemizedlist>
 
     <para>To create a new Kerberos account in the &os; cluster, or to
       reset a Kerberos password for an existing account using a random
       password generator:</para>
 
     <screen>&prompt.user; <userinput>ssh kpasswd.freebsd.org</userinput></screen>
 
     <note>
       <para>This must be done from a machine outside of the &os;.org
 	cluster.</para>
     </note>
 
     <para>A Kerberos password can also be set manually
       by logging into <systemitem
 	class="fqdomainname">freefall.FreeBSD.org</systemitem> and
       running:</para>
 
     <screen>&prompt.user; <userinput>kpasswd</userinput></screen>
 
     <note>
       <para>Unless the Kerberos-authenticated services
 	of the &os;.org cluster have been used previously,
 	<errorname>Client unknown</errorname> will be shown.  This
 	error means that the
 	<command>ssh kpasswd.freebsd.org</command> method shown above
 	must be used first to initialize the Kerberos account.</para>
     </note>
 
   </sect1>
 
   <sect1 xml:id="committer.types">
     <title>Commit Bit Types</title>
 
     <para>The &os; repository has a number of components which, when
       combined, support the basic operating system source,
       documentation, third party application ports infrastructure, and
       various maintained utilities.  When &os; commit bits are
       allocated, the areas of the tree where the bit may be used are
       specified.  Generally, the areas associated with a bit reflect
       who authorized the allocation of the commit bit.  Additional
       areas of authority may be added at a later date: when this
       occurs, the committer should follow normal commit bit allocation
       procedures for that area of the tree, seeking approval from the
       appropriate entity and possibly getting a mentor for that area
       for some period of time.</para>
 
     <informaltable frame="none" pgwide="1">
       <tgroup cols="3">
 	<tbody>
 	  <row>
 	    <entry><emphasis>Committer Type</emphasis></entry>
 	    <entry><emphasis>Responsible</emphasis></entry>
 	    <entry><emphasis>Tree Components</emphasis></entry>
 	  </row>
 
 	  <row>
 	    <entry>src</entry>
 	    <entry>core@</entry>
 	    <entry>src/, doc/ subject to appropriate review</entry>
 	  </row>
 
 	  <row>
 	    <entry>doc</entry>
 	    <entry>doceng@</entry>
 	    <entry>doc/, ports/, src/ documentation</entry>
 	  </row>
 
 	  <row>
 	    <entry>ports</entry>
 	    <entry>portmgr@</entry>
 	    <entry>ports/</entry>
 	  </row>
 	</tbody>
       </tgroup>
     </informaltable>
 
     <para>Commit bits allocated prior to the development of the notion
       of areas of authority may be appropriate for use in many parts
       of the tree.  However, common sense dictates that a committer
       who has not previously worked in an area of the tree seek review
       prior to committing, seek approval from the appropriate
       responsible party, and/or work with a mentor.  Since the rules
       regarding code maintenance differ by area of the tree, this is
       as much for the benefit of the committer working in an area of
       less familiarity as it is for others working on the tree.</para>
 
     <para>Committers are encouraged to seek review for their work as
       part of the normal development process, regardless of the area
       of the tree where the work is occurring.</para>
 
     <sect2>
       <title>Policy for Committer Activity in Other Trees</title>
 
       <itemizedlist>
 	<listitem>
 	  <para>All committers may modify
 	    <filename>base/head/share/misc/committers-*.dot</filename>,
 	    <filename>base/head/usr.bin/calendar/calendars/calendar.freebsd</filename>,
 	    and
 	    <filename>ports/head/astro/xearth/files</filename>.</para>
 	</listitem>
 
 	<listitem>
 	  <para>doc committers may commit
 	    documentation changes to <filename>src</filename>
 	    files, such as man pages, READMEs, fortune databases,
 	    calendar files, and comment fixes without approval from a
 	    src committer, subject to the normal care and tending of
 	    commits.</para>
 	</listitem>
 
 	<listitem>
 	  <para>Any committer may make changes to any other tree
 	    with an <quote>Approved by</quote> from a non-mentored
 	    committer with the appropriate bit.</para>
 	</listitem>
 
 	<listitem>
 	  <para>Committers can acquire an additional bit by the usual
 	    process of finding a mentor who will propose them to core,
 	    doceng, or portmgr, as appropriate.  When approved, they
 	    will be added to 'access' and the normal mentoring period
 	    will ensue, which will involve a continuing of
 	    <quote>Approved by</quote> for some period.</para>
 	</listitem>
 
 	<listitem>
 	  <para><quote>Approved by</quote> is only acceptable from
 	    non-mentored src committers; mentored committers can
 	    provide a <quote>Reviewed by</quote> but not an
 	    <quote>Approved by</quote>.</para>
 	</listitem>
       </itemizedlist>
     </sect2>
   </sect1>
 
   <sect1 xml:id="subversion-primer">
     <title>Subversion Primer</title>
 
     <para>New committers are assumed to already be familiar with the
       basic operation of Subversion.  If not, start by reading the
       <link xlink:href="http://svnbook.red-bean.com/">Subversion
 	Book</link>.</para>
 
     <sect2 xml:id="svn-intro">
       <title>Introduction</title>
 
       <para>The &os; source repository switched from
 	<acronym>CVS</acronym> to Subversion on May 31st, 2008.  The
 	first real <acronym>SVN</acronym> commit is
 	<emphasis>r179447</emphasis>.</para>
 
       <para>The &os; <literal>doc/www</literal> repository switched
 	from <acronym>CVS</acronym> to Subversion on May 19th, 2012.
 	The first real <acronym>SVN</acronym> commit is
 	<emphasis>r38821</emphasis>.</para>
 
       <para>The &os; <literal>ports</literal> repository switched
 	from <acronym>CVS</acronym> to Subversion on July 14th, 2012.
 	The first real <acronym>SVN</acronym> commit is
 	<emphasis>r300894</emphasis>.</para>
 
       <para>Subversion can be installed from the &os; Ports
 	Collection by issuing this command:</para>
 
       <screen>&prompt.root; <userinput>pkg install subversion</userinput></screen>
 
     </sect2>
 
     <sect2 xml:id="svn-getting-started">
       <title>Getting Started</title>
 
       <para>There are a few ways to obtain a working copy of the tree
 	from Subversion.  This section will explain them.</para>
 
       <sect3 xml:id="svn-getting-started-direct-checkout">
 	<title>Direct Checkout</title>
 
 	<para>The first is to check out directly from the main
 	  repository.  For the <literal>src</literal> tree,
 	  use:</para>
 
 	<screen>&prompt.user; <userinput>svn checkout svn+ssh://repo.freebsd.org/base/head /usr/src</userinput></screen>
 
 	<para>For the <literal>doc</literal> tree, use:</para>
 
 	<screen>&prompt.user; <userinput>svn checkout svn+ssh://repo.freebsd.org/doc/head /usr/doc</userinput></screen>
 
 	<para>For the <literal>ports</literal> tree, use:</para>
 
 	<screen>&prompt.user; <userinput>svn checkout svn+ssh://repo.freebsd.org/ports/head /usr/ports</userinput></screen>
 
 	<note>
 	  <para>Though the remaining examples in this document are
 	    written with the workflow of working with the
 	    <literal>src</literal> tree in mind, the underlying
 	    concepts are the same for working with the
 	    <literal>doc</literal> and the <literal>ports</literal>
 	    tree.
 	    Ports related Subversion operations are listed in
 	    <xref linkend="ports"/>.</para>
 	</note>
 
 	<para>The first command above checks out a
 	  <literal>CURRENT</literal> source tree as
 	  <filename><replaceable>/usr/src/</replaceable></filename>,
 	  which can be any target directory on the local filesystem.
 	  Omitting the final argument of that command causes the
 	  working copy to be named <quote>head</quote>, which can
 	  safely be renamed afterwards.</para>
 
 	<para><literal>svn+ssh</literal> means the
 	  <acronym>SVN</acronym> protocol tunnelled over
 	  <acronym>SSH</acronym>.  The name of the server is
 	  <literal>repo.freebsd.org</literal>, <literal>base</literal>
 	  is the path to the repository, and <literal>head</literal>
 	  is the subdirectory within the repository.</para>
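 
 	<para>To check which repository <acronym>URL</acronym> an
 	  existing working copy was checked out from,
 	  <command>svn info</command> can be used.  This sketch
 	  assumes the working copy lives in
 	  <filename>/usr/src</filename>, as in the examples
 	  above:</para>
 
 	<screen>&prompt.user; <userinput>svn info <replaceable>/usr/src</replaceable></userinput></screen>
 
 	<para>The <literal>URL</literal> and
 	  <literal>Repository Root</literal> fields of the output
 	  should correspond to the components just described.</para>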
 
 	<para>If your &os; login name is different from the login
 	  name used on the local machine, either include it in
 	  the <acronym>URL</acronym> (for example
 	  <literal>svn+ssh://jarjar@repo.freebsd.org/base/head</literal>),
 	  or add an entry to <filename>~/.ssh/config</filename>
 	  in the form:</para>
 
 	<programlisting>Host repo.freebsd.org
 	User jarjar</programlisting>
 
 	<para>This is the simplest method, but it is hard to tell just
 	  yet how much load it will place on the repository.</para>
 
 	<note>
 	  <para><command>svn diff</command> does not require
 	    access to the server, because <acronym>SVN</acronym>
 	    stores a reference copy of every file in the working
 	    copy.  This, however, means that Subversion working
 	    copies are very large.</para>
 	</note>
       </sect3>
 
       <sect3 xml:id="svn-getting-started-base-layout">
 	<title><literal>RELENG_*</literal> Branches and General
 	  Layout</title>
 
 	<para>In <literal>svn+ssh://repo.freebsd.org/base</literal>,
 	  <emphasis>base</emphasis> refers to the source tree.
 	  Similarly, <emphasis>ports</emphasis> refers to the ports
 	  tree, and so on.  These are separate repositories with their
 	  own change number sequences, access controls and commit
 	  mail.</para>
 
 	<para>For the base repository, HEAD refers to the -CURRENT
 	  tree.  For example, <filename>head/bin/ls</filename> is what
 	  would go into <filename>/usr/src/bin/ls</filename> in a
 	  release.  Some key locations are:</para>
 
 	<itemizedlist>
 	  <listitem>
 	    <para><emphasis>/head/</emphasis> which corresponds to
 	      <literal>HEAD</literal>, also known as
 	      <literal>-CURRENT</literal>.</para>
 	  </listitem>
 
 	  <listitem>
 	    <para><emphasis>/stable/<replaceable>n</replaceable></emphasis>
 	      which corresponds to
 	      <literal>RELENG_<replaceable>n</replaceable></literal>.</para>
 	  </listitem>
 
 	  <listitem>
 	    <para><emphasis>/releng/<replaceable>n.n</replaceable></emphasis>
 	      which corresponds to
 	      <literal>RELENG_<replaceable>n_n</replaceable></literal>.</para>
 	  </listitem>
 
 	  <listitem>
 	    <para><emphasis>/release/<replaceable>n.n.n</replaceable></emphasis>
 	      which corresponds to
 	      <literal>RELENG_<replaceable>n_n_n</replaceable>_RELEASE</literal>.</para>
 	  </listitem>
 
 	  <listitem>
 	    <para><emphasis>/vendor*</emphasis> is the vendor branch
 	      import work area.  This directory itself does not
 	      contain branches, however its subdirectories do.  This
 	      contrasts with the <emphasis>stable</emphasis>,
 	      <emphasis>releng</emphasis> and
 	      <emphasis>release</emphasis> directories.</para>
 	  </listitem>
 
 	  <listitem>
 	    <para><emphasis>/projects</emphasis> and
 	      <emphasis>/user</emphasis> feature a branch work area.
 	      As above, the
 	      <emphasis>/user</emphasis> directory does not contain
 	      branches itself.</para>
 	  </listitem>
 	</itemizedlist>
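 
 	<para>As an illustration of this layout, a stable branch and
 	  a release engineering branch could be checked out next to
 	  each other.  The branch numbers and target directories
 	  shown here are only examples, not recommendations:</para>
 
 	<screen>&prompt.user; <userinput>svn checkout svn+ssh://repo.freebsd.org/base/stable/<replaceable>12</replaceable> <replaceable>/usr/src-stable</replaceable></userinput>
 &prompt.user; <userinput>svn checkout svn+ssh://repo.freebsd.org/base/releng/<replaceable>12.1</replaceable> <replaceable>/usr/src-releng</replaceable></userinput></screen>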
       </sect3>
 
       <sect3 xml:id="svn-getting-started-doc-layout">
 	<title>&os; Documentation Project Branches and
 	  Layout</title>
 
 	<para>In <literal>svn+ssh://repo.freebsd.org/doc</literal>,
 	  <emphasis>doc</emphasis> refers to the repository root of
 	  the documentation source tree.</para>
 
 	<para>In general, most &os; Documentation Project work will be
 	  done within the <filename>head/</filename> branch of the
 	  documentation source tree.</para>
 
 	<para>&os; documentation is written and/or translated to
 	  various languages, each in a separate
 	  directory in the <filename>head/</filename>
 	  branch.</para>
 
 	<para>Each translation set contains several subdirectories for
 	  the various parts of the &os; Documentation Project.  A few
 	  noteworthy directories are:</para>
 
 	<itemizedlist>
 	  <listitem>
 	    <para><emphasis>/articles/</emphasis> contains the source
 	      code for articles written by various &os;
 	      contributors.</para>
 	  </listitem>
 
 	  <listitem>
 	    <para><emphasis>/books/</emphasis> contains the source
 	      code for the different books, such as the
 	      &os;&nbsp;Handbook.</para>
 	  </listitem>
 
 	  <listitem>
 	    <para><emphasis>/htdocs/</emphasis> contains the source
 	      code for the &os;&nbsp;website.</para>
 	  </listitem>
 	</itemizedlist>
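 
 	<para>One way to explore this layout without a full checkout
 	  is <command>svn list</command>.  The language directory
 	  <filename>en_US.ISO8859-1</filename> is used here only as
 	  an example of a translation set:</para>
 
 	<screen>&prompt.user; <userinput>svn list svn+ssh://repo.freebsd.org/doc/head/<replaceable>en_US.ISO8859-1</replaceable>/</userinput></screen>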
       </sect3>
 
       <sect3 xml:id="svn-getting-started-ports-layout">
 	<title>&os; Ports Tree Branches and Layout</title>
 
 	<para>In <literal>svn+ssh://repo.freebsd.org/ports</literal>,
 	  <emphasis>ports</emphasis> refers to the repository root of
 	  the ports tree.</para>
 
 	<para>In general, most &os; port work will be done within the
 	  <filename>head/</filename> branch of the ports tree, which
 	  is the actual ports tree used to install software.  Some
 	  other key locations are:</para>
 
 	<itemizedlist>
 	  <listitem>
 	    <para><emphasis>/branches/RELENG_<replaceable>n_n_n</replaceable></emphasis>
 	      which corresponds to
 	      <literal>RELENG_<replaceable>n_n_n</replaceable></literal>
 	      is used to merge back security updates in preparation
 	      for a release.</para>
 	  </listitem>
 
 	  <listitem>
 	    <para><emphasis>/tags/RELEASE_<replaceable>n_n_n</replaceable></emphasis>
 	      which corresponds to
 	      <literal>RELEASE_<replaceable>n_n_n</replaceable></literal>
 	      represents a release tag of the ports tree.</para>
 	  </listitem>
 
 	  <listitem>
 	    <para><emphasis>/tags/RELEASE_<replaceable>n</replaceable>_EOL</emphasis>
 	      represents the end of life tag of a specific &os;
 	      branch.</para>
 	  </listitem>
 	</itemizedlist>
       </sect3>
     </sect2>
 
     <sect2 xml:id="svn-daily-use">
       <title>Daily Use</title>
 
       <para>This section will explain how to perform common day-to-day
 	operations with Subversion.</para>
 
       <sect3 xml:id="svn-daily-use-help">
 	<title>Help</title>
 
 	<para><acronym>SVN</acronym> has built-in help documentation.
 	  It can be accessed by typing:</para>
 
 	<screen>&prompt.user; <userinput>svn help</userinput></screen>
 
 	<para>Additional information can be found in the
 	  <link xlink:href="http://svnbook.red-bean.com/">Subversion
 	    Book</link>.</para>
       </sect3>
 
       <sect3 xml:id="svn-daily-use-checkout">
 	<title>Checkout</title>
 
 	<para>As seen earlier, to check out the &os; head
 	  branch:</para>
 
 	<screen>&prompt.user; <userinput>svn checkout svn+ssh://repo.freebsd.org/base/head /usr/src</userinput></screen>
 
 	<para>At some point, more than just <literal>HEAD</literal>
 	  will probably be useful, for instance when merging changes
 	  to stable/7.  Therefore, it may be useful to have a partial
 	  checkout of the complete tree (a full checkout would be very
 	  painful).</para>
 
 	<para>To do this, first check out the root of the
 	  repository:</para>
 
 	<screen>&prompt.user; <userinput>svn checkout --depth=immediates svn+ssh://repo.freebsd.org/base</userinput></screen>
 
 	<para>This will give <literal>base</literal> with all the
 	  files it contains (at the time of writing, just
 	  <filename>ROADMAP.txt</filename>) and empty subdirectories
 	  for <literal>head</literal>, <literal>stable</literal>,
 	  <literal>vendor</literal> and so on.</para>
 
 	<para>Expanding the working copy is possible.  Just change the
 	  depth of the various subdirectories:</para>
 
 	<screen>&prompt.user; <userinput>svn up --set-depth=infinity base/head</userinput>
 &prompt.user; <userinput>svn up --set-depth=immediates base/release base/releng base/stable</userinput></screen>
 
 	<para>The above commands will pull down a full copy of
 	  <literal>head</literal>, plus empty copies of every
 	  <literal>release</literal> tag, every
 	  <literal>releng</literal> branch, and every
 	  <literal>stable</literal> branch.</para>
 
 	<para>If at a later date merging to
 	  <literal>7-STABLE</literal> is required, expand the working
 	  copy:</para>
 
 	<screen>&prompt.user; <userinput>svn up --set-depth=infinity base/stable/7</userinput></screen>
 
 	<para>Subtrees do not have to be expanded completely.  For
 	  instance, expand only <literal>stable/7/sys</literal> at
 	  first, and later expand the rest of
 	  <literal>stable/7</literal>:</para>
 
 	<screen>&prompt.user; <userinput>svn up --set-depth=infinity base/stable/7/sys</userinput>
 &prompt.user; <userinput>svn up --set-depth=infinity base/stable/7</userinput></screen>
 
 	<para>Updating the tree with <command>svn update</command>
 	  will only update what was previously asked for (in this
 	  case, <literal>head</literal> and
 	  <literal>stable/7</literal>); it will not pull down the
 	  whole tree.</para>
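 
 	<para>To see how far a given directory of such a sparse
 	  working copy has been expanded,
 	  <command>svn info</command> can be asked about it.  This is
 	  only a sketch, and it assumes the
 	  <filename>base</filename> working copy created above.  For
 	  a directory whose depth is not <literal>infinity</literal>,
 	  the output is expected to include a
 	  <literal>Depth</literal> line:</para>
 
 	<screen>&prompt.user; <userinput>svn info <replaceable>base/stable</replaceable></userinput></screen>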
       </sect3>
 
       <sect3 xml:id="svn-daily-use-anonymous-checkout">
 	<title>Anonymous Checkout</title>
 
 	<para>It is possible to anonymously check out the &os;
 	  repository with Subversion.  This will give access to a
 	  read-only tree that can be updated, but not committed back
 	  to the main repository.  To do this, use:</para>
 
 	<screen>&prompt.user; <userinput>svn co https://svn.FreeBSD.org/base/head /usr/src</userinput></screen>
 
 	<para>More details on using Subversion this way can be found
 	  in <link xlink:href="&url.books.handbook;/svn.html">Using
 	    Subversion</link>.</para>
       </sect3>
 
       <sect3 xml:id="svn-daily-use-updating-the-tree">
 	<title>Updating the Tree</title>
 
 	<para>To update a working copy to either the latest revision,
 	  or a specific revision:</para>
 
 	<screen>&prompt.user; <userinput>svn update</userinput>
 &prompt.user; <userinput>svn update -<replaceable>r12345</replaceable></userinput></screen>
       </sect3>
 
       <sect3 xml:id="svn-daily-use-status">
 	<title>Status</title>
 
 	<para>To view the local changes that have been made to the
 	  working copy:</para>
 
 	<screen>&prompt.user; <userinput>svn status</userinput></screen>
 
 	<para>To show both local changes and files that are out of
 	  date, use:</para>
 
 	<screen>&prompt.user; <userinput>svn status --show-updates</userinput></screen>
       </sect3>
 
       <sect3 xml:id="svn-daily-use-editing-and-committing">
 	<title>Editing and Committing</title>
 
 	<para><acronym>SVN</acronym> does not need to
 	  be told in advance about file editing.</para>
 
 	<para>To commit all changes in
 	  the current directory and all subdirectories:</para>
 
 	<screen>&prompt.user; <userinput>svn commit</userinput></screen>
 
 	<para>To commit all changes in, for example,
 	  <filename><replaceable>lib/libfetch/</replaceable></filename>
 	  and
 	  <filename><replaceable>usr.bin/fetch/</replaceable></filename>
 	  in a single operation:</para>
 
 	<screen>&prompt.user; <userinput>svn commit <replaceable>lib/libfetch</replaceable> <replaceable>usr.bin/fetch</replaceable></userinput></screen>
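 
 	<para>Before committing, it can help to review what is about
 	  to be sent.  A minimal sketch, using one of the example
 	  directories from above as a placeholder path:</para>
 
 	<screen>&prompt.user; <userinput>svn status <replaceable>lib/libfetch</replaceable></userinput>
 &prompt.user; <userinput>svn diff <replaceable>lib/libfetch</replaceable></userinput></screen>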
 
 	<para>There is also a commit wrapper for the ports tree that
 	  handles the properties and sanity-checks the
 	  changes:</para>
 
 	<screen>&prompt.user; <userinput>/usr/ports/Tools/scripts/psvn commit</userinput></screen>
       </sect3>
 
       <sect3 xml:id="svn-daily-use-adding-and-removing">
 	<title>Adding and Removing Files</title>
 
 	<note>
 	  <para>Before adding files, get a copy of <link
 	      xlink:href="https://people.FreeBSD.org/~peter/auto-props.txt">auto-props.txt</link>
 	    (there is also a <link
 	      xlink:href="https://people.FreeBSD.org/~beat/cvs2svn/auto-props.txt">
 	      ports tree specific version</link>) and add it to
 	    <filename>~/.subversion/config</filename> according to the
 	    instructions in the file.  If you added something before
 	    reading this, use <command>svn rm --keep-local</command>
 	    on the files that were just added, fix your config file,
 	    and add them again.  The initial config file is created
 	    when you first run an <command>svn</command> command, even
 	    something as simple as <command>svn help</command>.</para>
 	</note>
 
 	<para>Files are added to a
 	  <acronym>SVN</acronym> repository with <command>svn
 	    add</command>.  To add a file named
 	  <emphasis>foo</emphasis>, edit it, then:</para>
 
 	<screen>&prompt.user; <userinput>svn add <replaceable>foo</replaceable></userinput></screen>
 
 	<note>
 	  <para>Most new source files should include a
 	    <literal>&dollar;&os;&dollar;</literal> string near the
 	    start of the file.  On commit, <command>svn</command> will
 	    expand the <literal>&dollar;&os;&dollar;</literal> string,
 	    adding the file path, revision number, date and time of
 	    commit, and the username of the committer.  Files which
 	    cannot be modified may be committed without the
 	    <literal>&dollar;&os;&dollar;</literal> string.</para>
 	</note>
 
 	<para>Files can be removed with <command>svn
 	    remove</command>:</para>
 
 	<screen>&prompt.user; <userinput>svn remove <replaceable>foo</replaceable></userinput></screen>
 
 	<para>Subversion does not require deleting the file before
 	  using <command>svn rm</command>, and indeed complains if
 	  that happens.</para>
 
 	<para>It is possible to add directories with
 	  <command>svn add</command>:</para>
 
 	<screen>&prompt.user; <userinput>mkdir <replaceable>bar</replaceable></userinput>
 &prompt.user; <userinput>svn add <replaceable>bar</replaceable></userinput></screen>
 
 	<para>Although <command>svn mkdir</command> makes this easier
 	  by combining the creation of the directory and the adding of
 	  it:</para>
 
 	<screen>&prompt.user; <userinput>svn mkdir <replaceable>bar</replaceable></userinput></screen>
 
 	<para>Like files, directories are removed with
 	  <command>svn rm</command>.  There is no separate command
 	  specifically for removing directories.</para>
 
 	<screen>&prompt.user; <userinput>svn rm <replaceable>bar</replaceable></userinput></screen>
       </sect3>
 
       <sect3 xml:id="svn-daily-use-copying-and-moving">
 	<title>Copying and Moving Files</title>
 
 	<para>This command creates a copy of
 	  <filename>foo.c</filename> named <filename>bar.c</filename>,
 	  with the new file also under version control and with the
 	  full history of <filename>foo.c</filename>:</para>
 
 	<screen>&prompt.user; <userinput>svn copy <replaceable>foo.c</replaceable> <replaceable>bar.c</replaceable></userinput></screen>
 
 	<para>This is usually preferred to copying the file with
 	  <command>cp</command> and adding it to the repository with
 	  <command>svn add</command>, because a file added that way
 	  does not inherit the original one's history.</para>
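 
 	<para>As a hedged sketch, once the copy has been committed the
 	  inherited history can be checked by comparing the log of the
 	  new file with and without <option>--stop-on-copy</option>,
 	  which stops the log at the point where the file was copied.
 	  The file name is the placeholder used above:</para>
 
 	<screen>&prompt.user; <userinput>svn log <replaceable>bar.c</replaceable></userinput>
 &prompt.user; <userinput>svn log --stop-on-copy <replaceable>bar.c</replaceable></userinput></screen>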
 
 	<para>To move and rename a file:</para>
 
 	<screen>&prompt.user; <userinput>svn move <replaceable>foo.c</replaceable> <replaceable>bar.c</replaceable></userinput></screen>
       </sect3>
 
       <sect3 xml:id="svn-daily-use-log-and-annotate">
 	<title>Log and Annotate</title>
 
 	<para><command>svn log</command> shows revisions and commit
 	  messages, most recent first, for files or directories.  When
 	  used on a directory, all revisions that affected the
 	  directory and files within that directory are shown.</para>
 
 	<para><command>svn annotate</command>, or equally <command>svn
 	    praise</command> or <command>svn blame</command>, shows
 	  the most recent revision number and who committed that
 	  revision for each line of a file.</para>
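 
 	<para>A short sketch of typical invocations follows.  The
 	  paths are only illustrative.  <option>-l</option> limits
 	  the number of log entries shown, and <option>-v</option>
 	  also lists the paths changed in each revision:</para>
 
 	<screen>&prompt.user; <userinput>svn log -v -l <replaceable>5</replaceable> <replaceable>lib/libfetch</replaceable></userinput>
 &prompt.user; <userinput>svn blame <replaceable>lib/libfetch/fetch.c</replaceable></userinput></screen>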
       </sect3>
 
       <sect3 xml:id="svn-daily-use-diffs">
 	<title>Diffs</title>
 
 	<para><command>svn diff</command> displays changes to the
 	  working copy.  Diffs generated by <acronym>SVN</acronym> are
 	  unified and include new files by default in the diff
 	  output.</para>
 
 	<para><command>svn diff</command> can show the changes between
 	  two revisions of the same file:</para>
 
 	<screen>&prompt.user; <userinput>svn diff -r179453:179454 ROADMAP.txt</userinput></screen>
 
 	<para>It can also show all changes for a specific changeset.
 	  This command shows what changes were made to the
 	  current directory and all subdirectories in changeset
 	  179454:</para>
 
 	<screen>&prompt.user; <userinput>svn diff -c179454 .</userinput></screen>
       </sect3>
 
       <sect3 xml:id="svn-daily-use-reverting">
 	<title>Reverting</title>
 
 	<para>Local changes (including additions and deletions) can be
 	  reverted using <command>svn revert</command>.  It does not
 	  update out-of-date files, but just replaces them with
 	  pristine copies of the original version.</para>
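 
 	<para>For illustration, with
 	  <replaceable>foo.c</replaceable> as a placeholder file name,
 	  a single file can be reverted, or, with
 	  <option>-R</option>, a whole directory tree.  Reverted
 	  changes are discarded, so reviewing them with
 	  <command>svn diff</command> first is prudent:</para>
 
 	<screen>&prompt.user; <userinput>svn diff <replaceable>foo.c</replaceable></userinput>
 &prompt.user; <userinput>svn revert <replaceable>foo.c</replaceable></userinput>
 &prompt.user; <userinput>svn revert -R .</userinput></screen>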
       </sect3>
 
       <sect3 xml:id="svn-daily-use-conflicts">
 	<title>Conflicts</title>
 
 	<para>If an <command>svn update</command> resulted in a merge
 	  conflict, Subversion will remember which files have
 	  conflicts and refuse to commit any changes to those files
 	  until explicitly told that the conflicts have been resolved.
 	  The simple, not yet deprecated procedure is:</para>
 
 	<screen>&prompt.user; <userinput>svn resolved <replaceable>foo</replaceable></userinput></screen>
 
 	<para>However, the preferred procedure is:</para>
 
 	<screen>&prompt.user; <userinput>svn resolve --accept=working <replaceable>foo</replaceable></userinput></screen>
 
 	<para>The two examples are equivalent.  Possible values for
 	  <literal>--accept</literal> are:</para>
 
 	<itemizedlist>
 	  <listitem>
 	    <para><literal>working</literal>: use the version in your
 	      working directory (which one presumes has been edited to
 	      resolve the conflicts).</para>
 	  </listitem>
 
 	  <listitem>
 	    <para><literal>base</literal>: use a pristine copy of the
 	      version you had before <command>svn update</command>,
 	      discarding your own changes, the conflicting changes,
 	      and possibly other intervening changes as well.</para>
 	  </listitem>
 
 	  <listitem>
 	    <para><literal>mine-full</literal>: use what you had
 	      before <command>svn update</command>, including your own
 	      changes, but discarding the conflicting changes, and
 	      possibly other intervening changes as well.</para>
 	  </listitem>
 
 	  <listitem>
 	    <para><literal>theirs-full</literal>: use the version that
 	      was retrieved when you did
 	      <command>svn update</command>, discarding your own
 	      changes.</para>
 	  </listitem>
 	</itemizedlist>
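 
 	<para>Put together, a typical sequence might look like this
 	  sketch, where <replaceable>foo</replaceable> is a
 	  placeholder for the conflicted file and
 	  <command>vi</command> stands in for any editor.  Files with
 	  text conflicts are marked with a <literal>C</literal> in the
 	  first column of the <command>svn status</command>
 	  output:</para>
 
 	<screen>&prompt.user; <userinput>svn update</userinput>
 &prompt.user; <userinput>svn status</userinput>
 &prompt.user; <userinput>vi <replaceable>foo</replaceable></userinput>
 &prompt.user; <userinput>svn resolve --accept=working <replaceable>foo</replaceable></userinput>
 &prompt.user; <userinput>svn commit <replaceable>foo</replaceable></userinput></screen>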
       </sect3>
     </sect2>
 
     <sect2>
       <title>Advanced Use</title>
 
       <sect3 xml:id="svn-advanced-use-sparse-checkouts">
 	<title>Sparse Checkouts</title>
 
 	<para><acronym>SVN</acronym> allows
 	  <emphasis>sparse</emphasis>, or partial checkouts of a
 	  directory by adding <option>--depth</option> to a
 	  <command>svn checkout</command>.</para>
 
 	<para>Valid arguments to <option>--depth</option>
 	  are:</para>
 
 	<itemizedlist>
 	  <listitem>
 	    <para><literal>empty</literal>: the directory itself
 	      without any of its contents.</para>
 	  </listitem>
 
 	  <listitem>
 	    <para><literal>files</literal>: the directory and any
 	      files it contains.</para>
 	  </listitem>
 
 	  <listitem>
 	    <para><literal>immediates</literal>: the directory and any
 	      files and directories it contains, but none of the
 	      subdirectories' contents.</para>
 	  </listitem>
 
 	  <listitem>
 	    <para><literal>infinity</literal>: the directory and
 	      everything below it.</para>
 	  </listitem>
 	</itemizedlist>
 
 	<para>The <literal>--depth</literal> option applies to many
 	  other commands, including <command>svn commit</command>,
 	  <command>svn revert</command>, and <command>svn
 	    diff</command>.</para>
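 
 	<para>As a minimal sketch, assuming the working copy is to
 	  live in <replaceable>~/freebsd</replaceable>, this checks
 	  out only the top-level directory of the repository, without
 	  any of its contents:</para>
 
 	<screen>&prompt.user; <userinput>svn checkout --depth=empty svn+ssh://repo.freebsd.org/base <replaceable>~/freebsd</replaceable></userinput></screen>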
 
 	<para>Since <literal>--depth</literal> is sticky, there is a
 	  <literal>--set-depth</literal> option for <command>svn
 	    update</command> that will change the selected depth.
 	  Thus, given the working copy produced by the previous
 	  example:</para>
 
 	<screen>&prompt.user; <userinput>cd <replaceable>~/freebsd</replaceable></userinput>
 &prompt.user; <userinput>svn update --set-depth=immediates .</userinput></screen>
 
 	<para>The above command will populate the working copy in
 	  <replaceable>~/freebsd</replaceable> with
 	  <filename>ROADMAP.txt</filename> and empty subdirectories,
 	  and nothing will happen when <command>svn update</command>
 	  is executed on the subdirectories.  However, this
 	  command will set the depth for
 	  <replaceable>head</replaceable> (in this case) to infinity,
 	  and fully populate it:</para>
 
 	<screen>&prompt.user; <userinput>svn update --set-depth=infinity <replaceable>head</replaceable></userinput></screen>
       </sect3>
 
       <sect3 xml:id="svn-advanced-use-direct-operation">
 	<title>Direct Operation</title>
 
 	<para>Certain operations can be performed directly on the
 	  repository without touching the working copy.  Specifically,
 	  this applies to any operation that does not require editing
 	  a file, including:</para>
 
 	<itemizedlist>
 	  <listitem>
 	    <para><literal>log</literal>,
 	      <literal>diff</literal></para>
 	  </listitem>
 
 	  <listitem>
 	    <para><literal>mkdir</literal></para>
 	  </listitem>
 
 	  <listitem>
 	    <para><literal>remove</literal>, <literal>copy</literal>,
 	      <literal>rename</literal></para>
 	  </listitem>
 
 	  <listitem>
 	    <para><literal>propset</literal>,
 	      <literal>propedit</literal>,
 	      <literal>propdel</literal></para>
 	  </listitem>
 
 	  <listitem>
 	    <para><literal>merge</literal></para>
 	  </listitem>
 	</itemizedlist>
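 
 	<para>For instance, the log of a branch can be read straight
 	  from the repository, without a working copy.  This is only
 	  a sketch, and the limit of five entries is arbitrary:</para>
 
 	<screen>&prompt.user; <userinput>svn log -l <replaceable>5</replaceable> svn+ssh://repo.freebsd.org/base/head</userinput></screen>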
 
 	<para>Branching is very fast.  This command would be
 	  used to branch <literal>RELENG_8</literal>:</para>
 
 	<screen>&prompt.user; <userinput>svn copy svn+ssh://repo.freebsd.org/base/head svn+ssh://repo.freebsd.org/base/stable/8</userinput></screen>
 
 	<para>This is equivalent to these commands,
 	  which take minutes or hours as opposed to seconds,
 	  depending on your network connection:</para>
 
 	<screen>&prompt.user; <userinput>svn checkout --depth=immediates svn+ssh://repo.freebsd.org/base</userinput>
 &prompt.user; <userinput>cd base</userinput>
 &prompt.user; <userinput>svn update --set-depth=infinity head</userinput>
 &prompt.user; <userinput>svn copy head stable/8</userinput>
 &prompt.user; <userinput>svn commit stable/8</userinput></screen>
       </sect3>
 
       <sect3 xml:id="svn-advanced-use-merging">
 	<title>Merging with <acronym>SVN</acronym></title>
 
 	<para>This section deals with merging code from one branch to
 	  another (typically, from head to a stable branch).</para>
 
 	<note>
 	  <para>In all examples below, <literal>&dollar;FSVN</literal>
 	    refers to the location of the &os; Subversion repository,
 	    <literal>svn+ssh://repo.freebsd.org/base/</literal>.</para>
 	</note>
 
 	<sect4>
 	  <title>About Merge Tracking</title>
 
 	  <para>From the user's perspective, merge tracking
 	    information (or mergeinfo) is stored in a property called
 	    <literal>svn:mergeinfo</literal>, which is a
 	    comma-separated list of revisions and ranges of revisions
 	    that have been merged.  When set on a file, it applies
 	    only to that file.  When set on a directory, it applies to
 	    that directory and its descendants (files and directories)
 	    except for those that have their own
 	    <literal>svn:mergeinfo</literal>.</para>
 
 	  <para>It is <emphasis>not</emphasis> inherited.  For
 	    instance, <filename>stable/6/contrib/openpam/</filename>
 	    does not implicitly inherit mergeinfo from
 	    <filename>stable/6/</filename>, or
 	    <filename>stable/6/contrib/</filename>.
 	    Doing so would make partial checkouts very hard to manage.
 	    Instead, mergeinfo is explicitly propagated down the tree.
 	    For merging something into
 	    <filename>branch/foo/bar/</filename>,
 	    these rules apply:</para>
 
 	  <orderedlist>
 	    <listitem>
 	      <para>If
 		<filename>branch/foo/bar/</filename>
 		does not already have a mergeinfo record, but a direct
 		ancestor (for instance,
 		<filename>branch/foo/</filename>)
 		does, then that record will be propagated down to
 		<filename>branch/foo/bar/</filename>
 		before information about the current merge is
 		recorded.</para>
 	    </listitem>
 
 	    <listitem>
 	      <para>Information about the current merge will
 		<emphasis>not</emphasis> be propagated back up to that
 		ancestor.</para>
 	    </listitem>
 
 	    <listitem>
 	      <para>If a direct descendant of
 		<filename>branch/foo/bar/</filename> (for instance,
 		<filename>branch/foo/bar/baz/</filename>) already has
 		a mergeinfo record, information about the current
 		merge will be propagated down to it.</para>
 	    </listitem>
 	  </orderedlist>
 
 	  <para>If you consider the case where a revision changes
 	    several separate parts of the tree (for example,
 	    <filename>branch/foo/bar/</filename> and
 	    <filename>branch/foo/quux/</filename>), but you only want
 	    to merge some of it (for example,
 	    <filename>branch/foo/bar/</filename>), you will see that
 	    these rules make sense.  If mergeinfo were propagated up,
 	    it would seem like that revision had also been merged to
 	    <filename>branch/foo/quux/</filename>, when in fact it had
 	    not been.</para>
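 
 	  <para>To see which mergeinfo is actually recorded, the
 	    property can be printed for a single directory, or
 	    recursively for a directory and everything below it.  This
 	    is only a sketch, and the paths are illustrative, taken
 	    from the example above:</para>
 
 	  <screen>&prompt.user; <userinput>svn propget svn:mergeinfo <replaceable>stable/6/contrib/openpam</replaceable></userinput>
 &prompt.user; <userinput>svn propget svn:mergeinfo -R <replaceable>stable/6</replaceable></userinput></screen>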
 	</sect4>
 
 	<sect4 xml:id="merge-source">
 	  <title>Selecting the Source and Target Branch
 	    When Merging</title>
 
 	  <para>Merging to <literal>stable/</literal> branches should
 	    originate from <literal>head/</literal>.  For
 	    example:</para>
 
 	  <screen>&prompt.user; svn merge -c <replaceable>r123456</replaceable> ^/head/ stable/<replaceable>11</replaceable>
 &prompt.user; svn commit stable/<replaceable>11</replaceable></screen>
 
 	  <para>Merges to <literal>releng/</literal> branches should
 	    always originate from the corresponding
 	    <literal>stable/</literal> branch.  For example:</para>
 
 	  <screen>&prompt.user; svn merge -c <replaceable>r123456</replaceable> ^/stable/<replaceable>11</replaceable> releng/<replaceable>11.0</replaceable>
 &prompt.user; svn commit releng/<replaceable>11.0</replaceable></screen>
 
 	  <note>
 	    <para>Committers are only permitted to commit to the
 	      <literal>releng/</literal> branches during a release
 	      cycle after receiving approval from the Release
 	      Engineering Team, after which only the Security Officer
 	      may commit to a <literal>releng/</literal> branch for
 	      a Security Advisory or Errata Notice.</para>
 	  </note>
 
 	  <para>All merges are
 	    merged to and committed from the root of the
 	    branch.  All merges look like:</para>
 
 	  <screen>&prompt.user; svn merge -c <replaceable>r123456</replaceable> ^/head/ <replaceable>checkout</replaceable>
 &prompt.user; svn commit <replaceable>checkout</replaceable></screen>
 
 	  <para>Note that <replaceable>checkout</replaceable> must be
 	    a complete checkout of the branch to which the merge
 	    occurs.</para>
 
 	  <screen>&prompt.user; svn merge -c <replaceable>r123456</replaceable> ^/stable/<replaceable>10</replaceable> releng/<replaceable>10.0</replaceable></screen>
 	</sect4>
 
 	<sect4>
 	  <title>Preparing the Merge Target</title>
 
-	  <para>Because of the mergeinfo propagation issues described
+	  <para>Due to the mergeinfo propagation issues described
 	    earlier, it is very important to never merge changes
 	    into a sparse working copy.  Always use a full
 	    checkout of the branch being merged into.  For instance,
 	    when merging from HEAD to 7, use a full checkout
 	    of stable/7:</para>
 
 	  <screen>&prompt.user; <userinput>cd stable/7</userinput>
 &prompt.user; <userinput>svn up --set-depth=infinity</userinput></screen>
 
 	  <para>The target directory must also be up-to-date and must
 	    not contain any uncommitted changes or stray files.</para>
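 
 	  <para>One way to verify this, sketched here for the
 	    <filename>stable/7</filename> checkout used above, is to
 	    update and then ask for the status.  Uncommitted changes
 	    and stray files show up as lines of output from
 	    <command>svn status</command>, while a clean working copy
 	    produces none:</para>
 
 	  <screen>&prompt.user; <userinput>svn update</userinput>
 &prompt.user; <userinput>svn status</userinput></screen>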
 	</sect4>
 
 	<sect4>
 	  <title>Identifying Revisions</title>
 
 	  <para>Identifying revisions to be merged is a must.  If the
 	    target already has complete mergeinfo, ask
 	    <acronym>SVN</acronym> for a list:</para>
 
 	  <screen>&prompt.user; <userinput>cd stable/6/contrib/openpam</userinput>
 &prompt.user; <userinput>svn mergeinfo --show-revs=eligible $FSVN/head/contrib/openpam</userinput></screen>
 
 	  <para>If the target does not have complete mergeinfo, check
 	    the log for the merge source.</para>
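 
 	  <para>A sketch of such a log query, reusing the path from
 	    the example above.  The limit of twenty entries is
 	    arbitrary, and <option>-v</option> lists the paths touched
 	    by each revision:</para>
 
 	  <screen>&prompt.user; <userinput>svn log -v -l <replaceable>20</replaceable> $FSVN/head/contrib/openpam</userinput></screen>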
 	</sect4>
 
 	<sect4>
 	  <title>Merging</title>
 
 	  <para>Now, let us start merging!</para>
 
 	  <sect5>
 	    <title>The Principles</title>
 
 	    <para>For example, to merge:</para>
 
 	    <itemizedlist>
 	      <listitem>
 		<para>revision <literal>&dollar;R</literal></para>
 	      </listitem>
 
 	      <listitem>
 		<para>in directory &dollar;target in stable branch
 		  &dollar;B</para>
 	      </listitem>
 
 	      <listitem>
 		<para>from directory &dollar;source in head</para>
 	      </listitem>
 
 	      <listitem>
 		<para>&dollar;FSVN is
 		  <literal>svn+ssh://repo.freebsd.org/base</literal></para>
 	      </listitem>
 	    </itemizedlist>
 
 	    <para>Assuming that revisions &dollar;P and &dollar;Q have
 	      already been merged, and that the current directory is
 	      an up-to-date working copy of stable/&dollar;B, the
 	      existing mergeinfo looks like this:</para>
 
 	    <screen>&prompt.user; <userinput>svn propget svn:mergeinfo -R $target</userinput>
 $target - /head/$source:$P,$Q</screen>
 
 	    <para>Merging is done like so:</para>
 
 	    <screen>&prompt.user; <userinput>svn merge -c$R $FSVN/head/$source $target</userinput></screen>
 
 	    <para>Checking the results of this is possible with
 	      <command>svn diff</command>.</para>
 
 	    <para>The svn:mergeinfo now looks like:</para>
 
 	    <screen>&prompt.user; <userinput>svn propget svn:mergeinfo -R $target</userinput>
 $target - /head/$source:$P,$Q,$R</screen>
 
 	    <para>If the results are not exactly as shown, assistance
 	      may be required before committing as mistakes may have
 	      been made, or there may be something wrong with the
 	      existing mergeinfo, or there may be a bug in
 	      Subversion.</para>
 	  </sect5>
 
 	  <sect5>
 	    <title>Practical Example</title>
 
 	    <para>As a practical example, consider this
 	      scenario.  The changes to <filename>netmap.4</filename>
 	      in r238987 are to be merged from CURRENT to 9-STABLE.
 	      The file resides in
 	      <filename>head/share/man/man4</filename>.  According
 	      to <xref linkend="svn-advanced-use-merging"/>, this is
 	      also where to do the merge.  Note that in this example
 	      all paths are relative to the top of the svn repository.
 	      For more information on the directory layout, see <xref
 		linkend="svn-getting-started-base-layout"/>.</para>
 
 	    <para>The first step is to inspect the existing
 	      mergeinfo.</para>
 
 	    <screen>&prompt.user; <userinput>svn propget svn:mergeinfo -R stable/9/share/man/man4</userinput></screen>
 
 	    <para>Take a quick note of how it looks before moving on
 	      to the next step, the actual merge:</para>
 
 	    <screen>&prompt.user; <userinput>svn merge -c r238987 svn+ssh://repo.freebsd.org/base/head/share/man/man4 stable/9/share/man/man4</userinput>
 --- Merging r238987 into 'stable/9/share/man/man4':
 U    stable/9/share/man/man4/netmap.4
 --- Recording mergeinfo for merge of r238987 into
 'stable/9/share/man/man4':
  U   stable/9/share/man/man4</screen>
 
 	    <para>Check that the revision number of the merged
 	      revision has been added.  Once this is verified, the
 	      only thing left is the actual commit.</para>
 
 	    <screen>&prompt.user; <userinput>svn commit stable/9/share/man/man4</userinput></screen>
 	  </sect5>
 	</sect4>
 
 	<sect4>
 	  <title>Precautions Before Committing</title>
 
 	  <para>As always, build world (or appropriate parts of
 	    it).</para>
 
 	  <para>Check the changes with <command>svn diff</command> and
 	    <command>svn stat</command>.  Make sure all the files that
 	    should have been added or deleted were in fact added or
 	    deleted.</para>
 
 	  <para>Take a closer look at any property change (marked by an
 	    <literal>M</literal> in the second column of <command>svn
 	      stat</command>).  Normally, no svn:mergeinfo properties
 	    should be anywhere except the target directory (or
 	    directories).</para>
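 
 	  <para>As a rough aid, items with property changes can be
 	    picked out of the status output with
 	    <command>grep</command>.  This sketch assumes it is run
 	    from the root of the merge target.  The pattern matches
 	    any line whose second column is
 	    <literal>M</literal>:</para>
 
 	  <screen>&prompt.user; <userinput>svn stat | grep '^.M'</userinput></screen>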
 
 	  <para>If something looks fishy, ask for help.</para>
 	</sect4>
 
 	<sect4>
 	  <title>Committing</title>
 
 	  <para>Make sure to commit a top level directory to have the
 	    mergeinfo included as well.  Do not specify individual
 	    files on the command line.  For more information about
 	    committing files in general, see the relevant section of
 	    this primer.</para>
 	</sect4>
       </sect3>
 
       <sect3 xml:id="svn-advanced-use-vendor-imports">
 	<title>Vendor Imports with <acronym>SVN</acronym></title>
 
 	<important>
 	  <para>Please read this entire section before starting a
 	    vendor import.</para>
 	</important>
 
 	<note>
 	  <para>Patches to vendor code fall into two
 	    categories:</para>
 
 	  <itemizedlist>
 	    <listitem>
 	      <para>Vendor patches: these are patches that have been
 		issued by the vendor, or that have been extracted from
 		the vendor's version control system, which address
 		issues which cannot wait until the
 		next vendor release.</para>
 	    </listitem>
 
 	    <listitem>
 	      <para>&os; patches: these are patches that modify the
 		vendor code to address &os;-specific issues.</para>
 	    </listitem>
 	  </itemizedlist>
 
 	  <para>The nature of a patch dictates where it should be
 	    committed:</para>
 
 	  <itemizedlist>
 	    <listitem>
 	      <para>Vendor patches must be committed to the vendor
 		branch, and merged from there to head.  If the patch
 		addresses an issue in a new release that is currently
 		being imported, it <emphasis>must not</emphasis> be
 		committed along with the new release: the release must
 		be imported and tagged first, then the patch can be
 		applied and committed.  There is no need to re-tag the
 		vendor sources after committing the patch.</para>
 	    </listitem>
 
 	    <listitem>
 	      <para>&os; patches are committed directly to
 		head.</para>
 	    </listitem>
 	  </itemizedlist>
 	</note>
 
 	<sect4>
 	  <title>Preparing the Tree</title>
 
 	  <para>If importing for the first time after the switch to
 	    Subversion, flattening and cleaning up the vendor tree is
 	    necessary, as well as bootstrapping the merge history in
 	    the main tree.</para>
 
 	  <sect5>
 	    <title>Flattening</title>
 
 	    <para>During the conversion from <acronym>CVS</acronym> to
 	      Subversion, vendor branches were imported with the same
 	      layout as the main tree.  This means that the
 	      <literal>pf</literal> vendor sources ended up in
 	      <filename>vendor/pf/dist/contrib/pf</filename>.  The
 	      vendor source is best directly in
 	      <filename>vendor/pf/dist</filename>.</para>
 
 	    <para>To flatten the <literal>pf</literal> tree:</para>
 
 	    <screen>&prompt.user; <userinput>cd <replaceable>vendor/pf/dist/contrib/pf</replaceable></userinput>
 &prompt.user; <userinput>svn mv $(svn list) ../..</userinput>
 &prompt.user; <userinput>cd ../..</userinput>
 &prompt.user; <userinput>svn rm contrib</userinput>
 &prompt.user; <userinput>svn propdel -R svn:mergeinfo .</userinput>
 &prompt.user; <userinput>svn commit</userinput></screen>
 
 	    <para>The <literal>propdel</literal> bit is necessary
 	      because starting with 1.5, Subversion will automatically
 	      add <literal>svn:mergeinfo</literal> to any directory
 	      that is copied or moved.  In this case, as nothing is
 	      being merged from the deleted tree, these properties
 	      just get in the way.</para>
 
 	    <para>Tags may be flattened as well (3, 4, 3.5 etc.); the
 	      procedure is exactly the same, only changing
 	      <literal>dist</literal> to <literal>3.5</literal> or
 	      similar, and putting the <command>svn commit</command>
 	      off until the end of the process.</para>
 	  </sect5>
 
 	  <sect5>
 	    <title>Cleaning Up</title>
 
 	    <para>The <literal>dist</literal> tree can be cleaned up
 	      as necessary.  Disabling keyword expansion is
 	      recommended, as it makes no sense on unmodified vendor
 	      code and in some cases it can even be harmful.
 	      <application>OpenSSH</application>, for example,
 	      includes two files that originated with &os; and still
 	      contain the original version tags.  To do this:</para>
 
 	    <screen>&prompt.user; <userinput>svn propdel svn:keywords -R .</userinput>
 &prompt.user; <userinput>svn commit</userinput></screen>
 	  </sect5>
 
 	  <sect5>
 	    <title>Bootstrapping Merge History</title>
 
 	    <para>If importing for the first time after the switch to
 	      Subversion, bootstrap <literal>svn:mergeinfo</literal>
 	      on the target directory in the main tree to the revision
 	      that corresponds to the last related change to the
 	      vendor tree, prior to importing new sources:</para>
 
 	    <screen>&prompt.user; <userinput>cd <replaceable>head/contrib/pf</replaceable></userinput>
 &prompt.user; <userinput>svn merge --record-only svn+ssh://repo.freebsd.org/base/<replaceable>vendor/pf/dist@180876</replaceable> .</userinput>
 &prompt.user; <userinput>svn commit</userinput></screen>
 	  </sect5>
 	</sect4>
 
 	<sect4>
 	  <title>Importing New Sources</title>
 
 	  <para>With two commits&mdash;one for the import itself and
 	    one for the tag&mdash;this step can optionally be repeated
 	    for every upstream release between the last import and the
 	    current import.</para>
 
 	  <sect5>
 	    <title>Preparing the Vendor Sources</title>
 
 	    <para>Subversion is able to store a
 	      full distribution in the vendor tree.  So, import
 	      everything, but merge only what is required.</para>
 
 	    <para>An <command>svn add</command> is required to add any
 	      files that were added since the last vendor import, and
 	      <command>svn rm</command> is required to remove any that
 	      were removed since.  Preparing sorted lists of the
 	      contents of the vendor tree and of the sources that are
 	      about to be imported is recommended, to facilitate the
 	      process.</para>
 
 	    <screen>&prompt.user; <userinput>cd <replaceable>vendor/pf/dist</replaceable></userinput>
 &prompt.user; <userinput>svn list -R | grep -v '/$' | sort &gt;../old</userinput>
 &prompt.user; <userinput>cd <replaceable>../pf-4.3</replaceable></userinput>
 &prompt.user; <userinput>find . -type f | cut -c 3- | sort &gt;../new</userinput></screen>
 
 	    <para>With these two files,
 	      <command>comm -23 ../old ../new</command> will list
 	      removed files (files only in <filename>old</filename>),
 	      while <command>comm -13 ../old ../new</command> will
 	      list added files (files only in
 	      <filename>new</filename>).</para>
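 
 	    <para>As a small illustration with made-up file names,
 	      run in an empty scratch directory, suppose the old list
 	      contains <filename>a.c</filename> and
 	      <filename>b.c</filename> and the new list contains
 	      <filename>b.c</filename> and <filename>c.c</filename>.
 	      Both lists must be sorted for
 	      <command>comm</command> to work:</para>
 
 	    <screen>&prompt.user; <userinput>printf 'a.c\nb.c\n' &gt;old</userinput>
 &prompt.user; <userinput>printf 'b.c\nc.c\n' &gt;new</userinput>
 &prompt.user; <userinput>comm -23 old new</userinput>
 a.c
 &prompt.user; <userinput>comm -13 old new</userinput>
 c.c</screen>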
 	  </sect5>
 
 	  <sect5>
 	    <title>Importing into the Vendor Tree</title>
 
 	    <para>Now, the sources must be copied into
 	      <filename><replaceable>dist</replaceable></filename> and
 	      the <command>svn add</command> and
 	      <command>svn rm</command> commands are used as
 	      needed:</para>
 
 	    <screen>&prompt.user; <userinput>cd <replaceable>vendor/pf/pf-4.3</replaceable></userinput>
 &prompt.user; <userinput>tar cf - . | tar xf - -C ../dist</userinput>
 &prompt.user; <userinput>cd <replaceable>../dist</replaceable></userinput>
 &prompt.user; <userinput>comm -23 ../old ../new | xargs svn rm</userinput>
 &prompt.user; <userinput>comm -13 ../old ../new | xargs svn add --parents</userinput></screen>
 
 	    <para>If any directories were removed, they will have to
 	      be <command>svn rm</command>ed manually.  Nothing will
 	      break if they are not, but they will remain in the
 	      tree.</para>
 
 	    <para>Check properties on any new files.  All text files
 	      should have <literal>svn:eol-style</literal> set to
 	      <literal>native</literal>.  All binary files should have
 	      <literal>svn:mime-type</literal> set to
 	      <literal>application/octet-stream</literal> unless there
 	      is a more appropriate media type.  Executable files
 	      should have <literal>svn:executable</literal> set to
 	      <literal>*</literal>.  No other properties should exist
 	      on any file in the tree.</para>
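 
 	    <para>The commands below sketch how these properties can
 	      be inspected and set.  The file names are hypothetical
 	      and stand for a text file, a binary file, and an
 	      executable script respectively:</para>
 
 	    <screen>&prompt.user; <userinput>svn proplist -v -R .</userinput>
 &prompt.user; <userinput>svn propset svn:eol-style native <replaceable>README</replaceable></userinput>
 &prompt.user; <userinput>svn propset svn:mime-type application/octet-stream <replaceable>data.bin</replaceable></userinput>
 &prompt.user; <userinput>svn propset svn:executable '*' <replaceable>configure</replaceable></userinput></screen>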
 
 	    <para>Committing is now possible.  However, it is good
 	      practice to make sure that everything is okay by using
 	      the <command>svn stat</command> and
 	      <command>svn diff</command> commands.</para>
 	  </sect5>
 
 	  <sect5>
 	    <title>Tagging</title>
 
 	    <para>Once committed, vendor releases are tagged for
 	      future reference.  The best and quickest way to do this
 	      is directly in the repository:</para>
 
 	    <screen>&prompt.user; <userinput>svn cp svn+ssh://repo.freebsd.org/base/<replaceable>vendor/pf/dist</replaceable> svn+ssh://repo.freebsd.org/base/<replaceable>vendor/pf/4.3</replaceable></userinput></screen>
 
 	    <para>Once that is complete, <command>svn up</command> the
 	      working copy of
 	      <filename><replaceable>vendor/pf</replaceable></filename>
 	      to get the new tag, although this is rarely
 	      needed.</para>
 
 	    <para>If creating the tag in the working copy of the tree,
 	      the resulting <literal>svn:mergeinfo</literal>
 	      properties must be removed:</para>
 
 	    <screen>&prompt.user; <userinput>cd <replaceable>vendor/pf</replaceable></userinput>
 &prompt.user; <userinput>svn cp dist 4.3</userinput>
 &prompt.user; <userinput>svn propdel svn:mergeinfo -R 4.3</userinput></screen>
 	  </sect5>
 	</sect4>
 
 	<sect4>
 	  <title>Merging to Head</title>
 
 	  <screen>&prompt.user; <userinput>cd <replaceable>head/contrib/pf</replaceable></userinput>
 &prompt.user; <userinput>svn up</userinput>
 &prompt.user; <userinput>svn merge --accept=postpone svn+ssh://repo.freebsd.org/base/<replaceable>vendor/pf/dist</replaceable> .</userinput></screen>
 
 	  <para>The <literal>--accept=postpone</literal> tells
 	    Subversion not to complain about merge
 	    conflicts as they will be handled manually.</para>
 
 	  <tip xml:id="svn-advanced-use-vendor-imports-pre-svn">
 	    <para>The <command>cvs2svn</command> changeover occurred
 	      on June 3, 2008.  When performing vendor merges for
 	      packages which were already present and converted by the
 	      <command>cvs2svn</command> process, the command used to
 	      merge
 	      <filename>/vendor/<replaceable>package_name</replaceable>/dist</filename>
 	      to
 	      <filename>/head/<replaceable>package_location</replaceable></filename>
 	      (for example,
 	      <filename>head/contrib/sendmail</filename>) must use
 	      <option>-c <replaceable>REV</replaceable></option> to
 	      indicate the revision to merge from the
 	      <filename>/vendor</filename> tree.  For example:</para>
 
 	    <screen>&prompt.user; <userinput>svn checkout svn+ssh://repo.freebsd.org/base/head/contrib/<replaceable>sendmail</replaceable></userinput>
 &prompt.user; <userinput>cd sendmail</userinput>
 &prompt.user; <userinput>svn merge -c r<replaceable>261190</replaceable> '^/vendor/<replaceable>sendmail/dist</replaceable>' .</userinput></screen>
 
 	    <para><literal>^</literal> is an alias for the
 	      repository root.</para>
 	  </tip>
 
 	  <note>
 	    <para>If using the <application>Zsh</application> shell,
 	      the <literal>^</literal> must be escaped with
 	      <literal>\</literal> or quoted.</para>
 	  </note>
 
 	  <para>It is necessary to resolve any merge conflicts.</para>
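 
 	  <para>As a rough sketch of that step, conflicted paths are
 	    flagged with <literal>C</literal> in the first column of
 	    <command>svn status</command> output.  After a file has
 	    been edited by hand, it can be marked as resolved.  The
 	    file name <filename>pf.c</filename> below is only a
 	    placeholder:</para>
 
 	  <screen>&prompt.user; <userinput>svn status</userinput>
 &prompt.user; <userinput>svn resolve --accept=working <replaceable>pf.c</replaceable></userinput></screen>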
 
 	  <para>Make sure that any files that were added or removed in
 	    the vendor tree have been properly added or removed in the
 	    main tree.  To check diffs against the vendor
 	    branch:</para>
 
 	  <screen>&prompt.user; <userinput>svn diff --no-diff-deleted --old=svn+ssh://repo.freebsd.org/base/<replaceable>vendor/pf/dist</replaceable> --new=.</userinput></screen>
 
 	  <para>The <literal>--no-diff-deleted</literal> tells
 	    Subversion not to complain about files that are in the
 	    vendor tree but not in the main tree, that is, things
 	    that would have been removed before the vendor import,
 	    like the vendor's makefiles and configure
 	    scripts.</para>
 
 	  <para>Using <acronym>CVS</acronym>, once a file was off the
 	    vendor branch, it could not be put back.  With
 	    Subversion, there is no concept of on or off the vendor
 	    branch.  If a file previously had local modifications,
 	    all that has to be done to keep it from showing up in
 	    diffs against the vendor tree is to remove any
 	    left-over cruft like &os; version tags, which is much
 	    easier.</para>
 
 	  <para>If any changes are required for the world to build
 	    with the new sources, make them now, and keep testing
 	    until everything builds and runs perfectly.</para>
 	</sect4>
 
 	<sect4>
 	  <title>Committing the Vendor Import</title>
 
 	  <para>Committing is now possible!  Everything must be
 	    committed in one go.  If done properly, the tree will move
 	    from a consistent state with old code, to a consistent
 	    state with new code.</para>
 	</sect4>
 
 	<sect4>
 	  <title>From Scratch</title>
 
 	  <sect5>
 	    <title>Importing into the Vendor Tree</title>
 
 	    <para>This section is an example of importing and tagging
 	      <application>byacc</application> into
 	      <filename>head</filename>.</para>
 
 	    <para>First, prepare the directory in
 	      <filename>vendor</filename>:</para>
 
 	    <screen>&prompt.user; <userinput>svn co --depth immediates <replaceable>$FSVN/vendor</replaceable></userinput>
 &prompt.user; <userinput>cd <replaceable>vendor</replaceable></userinput>
 &prompt.user; <userinput>svn mkdir <replaceable>byacc</replaceable></userinput>
 &prompt.user; <userinput>svn mkdir <replaceable>byacc/dist</replaceable></userinput></screen>
 
 	    <para>Now, import the sources into the
 	      <filename>dist</filename> directory.
 	      Once the files are in place, <command>svn add</command>
 	      the new ones, then <command>svn commit</command> and tag
 	      the imported version.  To save time and bandwidth,
 	      direct remote committing and tagging is possible:</para>
 
 	    <screen>&prompt.user; <userinput>svn cp -m <replaceable>"Tag byacc 20120115"</replaceable> <replaceable>$FSVN/vendor/byacc/dist</replaceable> <replaceable>$FSVN/vendor/byacc/20120115</replaceable></userinput></screen>
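 
 	    <para>The working-copy route described above, which
 	      precedes the tag, might look like the following sketch.
 	      It is run from the <filename>vendor</filename> working
 	      copy; the tarball path and the commit message are
 	      assumptions made for illustration:</para>
 
 	    <screen>&prompt.user; <userinput>tar -xf <replaceable>/tmp/byacc-20120115.tgz</replaceable> -C <replaceable>byacc/dist</replaceable> --strip-components 1</userinput>
 &prompt.user; <userinput>svn add --force <replaceable>byacc/dist</replaceable></userinput>
 &prompt.user; <userinput>svn commit -m <replaceable>"Import byacc 20120115"</replaceable> <replaceable>byacc</replaceable></userinput></screen>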
 	  </sect5>
 
 	  <sect5>
 	    <title>Merging to <literal>head</literal></title>
 
 	    <para>Because these sources are new to
 	      <filename>head</filename>, copy them instead of
 	      merging:</para>
 
 	    <screen>&prompt.user; <userinput>svn cp -m <replaceable>"Import byacc to contrib"</replaceable> <replaceable>$FSVN/vendor/byacc/dist</replaceable> <replaceable>$FSVN/head/contrib/byacc</replaceable></userinput></screen>
 
 	    <para>Working normally on newly imported sources is still
 	      possible.</para>
 	  </sect5>
 	</sect4>
       </sect3>
 
       <sect3 xml:id="svn-advanced-use-reverting-a-commit">
 	<title>Reverting a Commit</title>
 
 	<para>Reverting a commit to a previous version is fairly
 	  easy:</para>
 
 	<screen>&prompt.user; <userinput>svn merge -r179454:179453 ROADMAP.txt</userinput>
 &prompt.user; <userinput>svn commit</userinput></screen>
 
 	<para>Change number syntax, with negative meaning a reverse
 	  change, can also be used:</para>
 
 	<screen>&prompt.user; <userinput>svn merge -c -179454 ROADMAP.txt</userinput>
 &prompt.user; <userinput>svn commit</userinput></screen>
 
 	<para>This can also be done directly in the repository:</para>
 
 	<screen>&prompt.user; <userinput>svn merge -r179454:179453 svn+ssh://repo.freebsd.org/base/ROADMAP.txt</userinput></screen>
 
 	<note>
 	  <para>It is important to ensure that the mergeinfo
 	    is correct when reverting a file to permit
 	    <command>svn mergeinfo --show-revs=eligible</command> to
 	    work as expected.</para>
 	</note>
 
 	<para>Reverting the deletion of a file is slightly different.
 	  Copying the version of the file that predates the deletion
 	  is required.  For example, to restore a file that was
 	  deleted in revision N, restore version N-1:</para>
 
 	<screen>&prompt.user; <userinput>svn copy svn+ssh://repo.freebsd.org/base/ROADMAP.txt@179454 .</userinput>
 &prompt.user; <userinput>svn commit</userinput></screen>
 
 	<para>or, equally:</para>
 
 	<screen>&prompt.user; <userinput>svn copy svn+ssh://repo.freebsd.org/base/ROADMAP.txt@179454 svn+ssh://repo.freebsd.org/base</userinput></screen>
 
 	<para>Do <emphasis>not</emphasis> simply recreate the file
 	  manually and <command>svn add</command> it&mdash;this will
 	  cause history to be lost.</para>
       </sect3>
 
       <sect3 xml:id="svn-advanced-use-fixing-mistakes">
 	<title>Fixing Mistakes</title>
 
 	<para>While we can do surgery in an emergency, do not plan on
 	  having mistakes fixed behind the scenes.  Plan on mistakes
 	  remaining in the logs forever.  Be sure to check the output
 	  of <command>svn status</command> and <command>svn
 	    diff</command> before committing.</para>
 
 	<para>Mistakes will happen, but
 	  they can generally be fixed without
 	  disruption.</para>
 
 	<para>Take a case of adding a file in the wrong location.  The
 	  right thing to do is to <command>svn move</command> the file
 	  to the correct location and commit.  This causes just a
 	  couple of lines of metadata in the repository journal, and
 	  the logs are all linked up correctly.</para>
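 
 	<para>A minimal sketch of such a fix, using hypothetical
 	  paths for the misplaced file and its intended
 	  location:</para>
 
 	<screen>&prompt.user; <userinput>svn move <replaceable>usr.bin/foo/bar.c</replaceable> <replaceable>usr.bin/baz/bar.c</replaceable></userinput>
 &prompt.user; <userinput>svn commit</userinput></screen>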
 
 	<para>The wrong thing to do is to delete the file and then
 	  <command>svn add</command> an independent copy in the
 	  correct location.  Instead of a couple of lines of text, the
 	  repository journal grows an entire new copy of the file.
 	  This is a waste.</para>
       </sect3>
 
       <sect3 xml:id="svn-getting-started-checkout-from-a-mirror">
 	<title>Using a Subversion Mirror</title>
 
 	<para>There is a serious disadvantage to working from a
 	  mirror: every time something is to be committed, a
 	  <command>svn relocate</command> to the main repository has
 	  to be done, followed by a <command>svn relocate</command>
 	  back to the mirror after the commit.  Also, since
 	  <command>svn relocate</command> only works between
 	  repositories that have the same UUID, some hacking of the
 	  local repository's UUID has to occur before it is possible
 	  to start using it.</para>
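 
 	<para>As an illustration only, and assuming the working copy
 	  was checked out from the
 	  <literal>svn://svn.freebsd.org/base</literal> mirror, the
 	  round trip for a single commit might look like this:</para>
 
 	<screen>&prompt.user; <userinput>svn relocate svn://svn.freebsd.org/base svn+ssh://repo.freebsd.org/base</userinput>
 &prompt.user; <userinput>svn commit</userinput>
 &prompt.user; <userinput>svn relocate svn+ssh://repo.freebsd.org/base svn://svn.freebsd.org/base</userinput></screen>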
 
 	<sect4 xml:id="svn-advanced-checkout-from-mirror">
 	  <title>Checkout from a Mirror</title>
 
 	  <para>Check out a working copy from a mirror by
 	    substituting the mirror's <acronym>URL</acronym> for
 	    <literal>svn+ssh://repo.freebsd.org/base</literal>.  This
 	    can be an official mirror or a mirror maintained by using
 	    <command>svnsync</command>.</para>
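 
 	  <para>For example, assuming the
 	    <literal>svn://svn.freebsd.org/base</literal> mirror and
 	    a target directory chosen only for illustration:</para>
 
 	  <screen>&prompt.user; <userinput>svn checkout svn://svn.freebsd.org/base/head <replaceable>/usr/src</replaceable></userinput></screen>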
 	</sect4>
 
 	<sect4 xml:id="svn-advanced-use-setting-up-svnsync">
 	  <title>Setting up a <application>svnsync</application>
 	    Mirror</title>
 
 	  <para>Avoid setting up a <application>svnsync</application>
 	    mirror unless there is a very good reason for it.  Most
 	    of the time a <command>git</command> mirror is a better
 	    alternative.  Starting a fresh mirror from scratch takes
 	    a long time.
 	    Expect a minimum of 10 hours for high speed connectivity.
 	    If international links are involved, expect this to take
 	    four to ten times longer.</para>
 
 	  <para>One way to limit the time required is to grab a <link
 	      xlink:href="https://download.freebsd.org/ftp/development/subversion/">seed
 	      file</link>.  It is large (~1GB) but will consume less
 	    network traffic and take less time to fetch than svnsync
 	    will.</para>
 
 	  <para>Extract the file and update it:</para>
 
 	  <screen>&prompt.user; <userinput>tar xf svnmirror-base-r261170.tar.xz</userinput>
 &prompt.user; <userinput>svnsync sync file:///home/svnmirror/base</userinput></screen>
 
 	  <para>Now, set that up to run from &man.cron.8;, do
 	    checkouts locally, set up a svnserve server for local
 	    machines to talk to, etc.</para>
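 
 	  <para>One possible arrangement, sketched here under the
 	    assumptions that Subversion is installed under
 	    <filename>/usr/local</filename> and that an hourly sync
 	    is acceptable, is a user crontab entry such as:</para>
 
 	  <programlisting>0 * * * * /usr/local/bin/svnsync sync file:///home/svnmirror/base</programlisting>
 
 	  <para>together with a <command>svnserve</command> daemon
 	    rooted at the parent directory of the mirror:</para>
 
 	  <screen>&prompt.user; <userinput>svnserve -d -r /home/svnmirror</userinput></screen>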
 
 	  <para>The seed mirror is set to fetch from
 	    <literal>svn://svn.freebsd.org/base</literal>.  The
 	    configuration for the mirror is stored in
 	    <literal>revprop 0</literal> on the local mirror.  To see
 	    the configuration, try:</para>
 
 	  <screen>&prompt.user; <userinput>svn proplist -v --revprop -r 0 file:///home/svnmirror/base</userinput></screen>
 
 	  <para>Use <literal>svn propset</literal> to change
 	    things.</para>
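 
 	  <para>For instance, <command>svnsync</command> keeps the
 	    source <acronym>URL</acronym> in the
 	    <literal>svn:sync-from-url</literal> revision property.
 	    The following is a sketch of pointing the mirror at a
 	    different source; the new <acronym>URL</acronym> is a
 	    placeholder, and the mirror's
 	    <filename>pre-revprop-change</filename> hook is assumed
 	    to permit the change:</para>
 
 	  <screen>&prompt.user; <userinput>svn propset --revprop -r 0 svn:sync-from-url <replaceable>svn://mirror.example.org/base</replaceable> file:///home/svnmirror/base</userinput></screen>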
 	</sect4>
       </sect3>
 
       <sect3 xml:id="svn-advanced-use-committing-high-ascii-data">
 	<title>Committing High-<acronym>ASCII</acronym> Data</title>
 
 	<para>Files that have high-<acronym>ASCII</acronym> bits are
 	  considered binary files in <acronym>SVN</acronym>, so the
 	  pre-commit checks fail and indicate that the
 	  <literal>mime-type</literal> property should be set to
 	  <literal>application/octet-stream</literal>.  However, the
 	  use of this is discouraged, so please do not set it.  The
 	  best approach is to avoid high-<acronym>ASCII</acronym>
 	  data altogether, so that the file can be read everywhere
 	  with any text editor.  If that is not possible, do not
 	  change the mime-type; instead, set the
 	  <literal>fbsd:notbinary</literal> property with
 	  <literal>propset</literal>:</para>
 
 	<screen>&prompt.user; <userinput>svn propset fbsd:notbinary yes foo.data</userinput></screen>
       </sect3>
 
       <sect3 xml:id="svn-advanced-use-maintaining-a-project-branch">
 	<title>Maintaining a Project Branch</title>
 
 	<para>A project branch is one that is synced to head (or
 	  another branch) and is used to develop a project before
 	  committing it back to head.  In <acronym>SVN</acronym>,
 	  <quote>dolphin</quote> branching is used for this.  A
 	  <quote>dolphin</quote> branch is one that diverges for a
 	  while and is finally committed back to the original branch.
 	  During development, code migrates in one direction only
 	  (from head to the branch).  No code is committed back to
 	  head until the end.  After the branch is committed back at
 	  the end, it is dead (although a new branch with the same
 	  name can be created after the dead one is deleted).</para>
 
 	<para>As per <link
 	    xlink:href="https://people.FreeBSD.org/~peter/svn_notes.txt">https://people.FreeBSD.org/~peter/svn_notes.txt</link>,
 	  work that is intended to be merged back into HEAD should be
 	  in <filename>base/projects/</filename>.  If the
 	  work is beneficial to the &os; community in some way
 	  but not intended to be merged directly back into HEAD then
 	  the proper location is
 	  <filename>base/user/<replaceable>username</replaceable>/</filename>.
 	  <link
 	    xlink:href="https://svnweb.freebsd.org/base/projects/GUIDELINES.txt">This
 	    page</link> contains further details.</para>
 
 	<para>To create a project branch:</para>
 
 	<screen>&prompt.user; <userinput>svn copy svn+ssh://repo.freebsd.org/base/head svn+ssh://repo.freebsd.org/base/projects/spif</userinput></screen>
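 
 	<para>The merge commands that follow run inside a working
 	  copy of the branch.  One way to obtain it, assuming the
 	  local directory name
 	  <filename>copy_of_spif</filename>:</para>
 
 	<screen>&prompt.user; <userinput>svn checkout svn+ssh://repo.freebsd.org/base/projects/spif <replaceable>copy_of_spif</replaceable></userinput></screen>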
 
 	<para>To merge changes from HEAD back into the project
 	  branch:</para>
 
 	<screen>&prompt.user; <userinput>cd copy_of_spif</userinput>
 &prompt.user; <userinput>svn merge svn+ssh://repo.freebsd.org/base/head</userinput>
 &prompt.user; <userinput>svn commit</userinput></screen>
 
 	<para>It is important to resolve any merge conflicts before
 	  committing.</para>
 	<!--
 	<para>To collapse everything back at the end:</para>
 
 	<screen>&prompt.user; <userinput>svn write me</userinput></screen>
 
 	-->
       </sect3>
     </sect2>
 
     <sect2>
       <title>Some Tips</title>
 
       <para>In commit logs etc., <quote>rev 179872</quote> is
 	spelled <quote>r179872</quote> as per convention.</para>
 
       <para>Speeding up svn is possible by adding these entries to
 	<filename>~/.ssh/config</filename>:</para>
 
       <screen>Host *
 ControlPath ~/.ssh/sockets/master-%l-%r@%h:%p
 ControlMaster auto
 ControlPersist yes</screen>
 
       <para>and then typing</para>
 
       <screen><userinput>mkdir ~/.ssh/sockets</userinput></screen>
 
       <para>Checking out a working copy with a stock Subversion client
 	without &os;-specific patches
 	(<varname>OPTIONS_SET=FREEBSD_TEMPLATE</varname>) will mean
 	that <literal>&dollar;FreeBSD&dollar;</literal> tags will not
 	be expanded.  Once the correct version has been installed,
 	trick Subversion into expanding them like so:</para>
 
       <screen>&prompt.user; <userinput>svn propdel -R svn:keywords .</userinput>
 &prompt.user; <userinput>svn revert -R .</userinput></screen>
 
       <para>This will wipe out uncommitted patches.</para>
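 
       <para>One way to keep local changes, sketched here with
 	<filename>/tmp/local.diff</filename> as an arbitrary file
 	name, is to save them to a patch file before running the
 	two commands above:</para>
 
       <screen>&prompt.user; <userinput>svn diff &gt; <replaceable>/tmp/local.diff</replaceable></userinput></screen>
 
       <para>and to reapply them once the revert has
 	finished:</para>
 
       <screen>&prompt.user; <userinput>svn patch <replaceable>/tmp/local.diff</replaceable></userinput></screen>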
 
       <para>It is possible to automatically fill the
 	<literal>Sponsored by</literal> and
 	<literal>MFC after</literal> commit log fields by setting the
 	<literal>freebsd-sponsored-by</literal> and
 	<literal>freebsd-mfc-after</literal> fields in the
 	<literal>[miscellany]</literal> section of the
 	<filename>~/.subversion/config</filename> configuration file.
 	For example:</para>
 
       <programlisting>freebsd-sponsored-by = The FreeBSD Foundation
 freebsd-mfc-after = 2 weeks</programlisting>
     </sect2>
   </sect1>
 
   <sect1 xml:id="conventions">
     <title>Setup, Conventions, and Traditions</title>
 
     <para>There are a number of things to do as a new developer.
       The first set of steps is specific to committers only.  These
       steps must be done by a mentor for those who are not
       committers.</para>
 
     <sect2 xml:id="conventions-committers">
       <title>For New Committers</title>
 
       <para>Those who have been given commit rights to the &os;
 	repositories must follow these steps.</para>
 
       <itemizedlist xml:id="commit-notes">
 	<listitem>
 	  <para>Get mentor approval before committing each of these
 	    changes!</para>
 	</listitem>
 
 	<listitem>
 	  <para>The <filename>.ent</filename> and
 	    <filename>.xml</filename> files mentioned below exist in
 	    the &os; Documentation Project SVN repository at
 	    <literal>svn+ssh://repo.FreeBSD.org/doc/</literal>.</para>
 	</listitem>
 
 	<listitem>
 	  <para>New files that do not have the
 	    <literal>FreeBSD=%H</literal>
 	    <command>svn:keywords</command> property will be rejected
 	    when attempting to commit them to the repository.  Be sure
 	    to read
 	    <xref linkend="svn-daily-use-adding-and-removing"/>
 	    regarding adding and removing files.  Verify that
 	    <filename>~/.subversion/config</filename> contains the
 	    necessary <quote>auto-props</quote> entries from
 	    <filename>auto-props.txt</filename> mentioned
 	    there.</para>
 	</listitem>
 
 	<listitem>
 	  <para>All <filename>src</filename> commits go to
 	    &os.current; first before being merged to &os.stable;.
 	    The &os.stable; branch must maintain
 	    <acronym>ABI</acronym> and <acronym>API</acronym>
 	    compatibility with earlier versions of that branch.  Do
 	    not merge changes that break this compatibility.</para>
 	</listitem>
       </itemizedlist>
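 
       <para>As a rough illustration of the
 	<quote>auto-props</quote> item above, the relevant part of
 	<filename>~/.subversion/config</filename> has the following
 	shape.  The single pattern shown is an assumption made for
 	illustration; take the real entries from
 	<filename>auto-props.txt</filename>:</para>
 
       <programlisting>[miscellany]
 enable-auto-props = yes
 
 [auto-props]
 *.c = svn:eol-style=native; svn:keywords=FreeBSD=%H; svn:mime-type=text/plain</programlisting>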
 
       <procedure xml:id="commit-steps">
 	<title>Steps for New Committers</title>
 
 	<step>
 	  <title>Add an Author Entity</title>
 
 	  <para><filename>doc/head/share/xml/authors.ent</filename>
 	    &mdash; Add an author entity.  Later steps depend on this
 	    entity, and missing this step will cause the
 	    <filename>doc/</filename> build to fail.  This is a
 	    relatively easy task, but remains a good first test of
 	    version control skills.</para>
 	</step>
 
 	<step>
 	  <title>Update the List of Developers and
 	    Contributors</title>
 
 	  <para><filename>doc/head/en_US.ISO8859-1/articles/contributors/contrib.committers.xml</filename>
 	    &mdash;
 	    Add an entry to the <quote>Developers</quote> section
 	    of the <link
 	      xlink:href="&url.articles.contributors;/staff-committers.html">Contributors
 	      List</link>.  Entries are sorted by last name.</para>
 
 	  <para><filename>doc/head/en_US.ISO8859-1/articles/contributors/contrib.additional.xml</filename>
 	    &mdash; <emphasis>Remove</emphasis> the entry from the
 	    <quote>Additional Contributors</quote> section.  Entries
 	    are sorted by first name.</para>
 	</step>
 
 	<step>
 	  <title>Add a News Item</title>
 
 	  <para><filename>doc/head/share/xml/news.xml</filename>
 	    &mdash; Add an entry.  Look for the other entries that
 	    announce new committers and follow the format.  Use the
 	    date from the commit bit approval email from
 	    <email>core@FreeBSD.org</email>.</para>
 	</step>
 
 	<step>
 	  <title>Add a <acronym>PGP</acronym> Key</title>
 
 	  <para><filename>doc/head/share/pgpkeys/pgpkeys.ent</filename>
 	    and
 	    <filename>doc/head/share/pgpkeys/pgpkeys-developers.xml</filename>
 	    - Add your <acronym>PGP</acronym> or
 	    Gnu<acronym>PG</acronym> key.  Those who do not yet have a
 	    key should see <xref linkend="pgpkeys-creating"/>.</para>
 
 	  <para>&a.des.email; has written a shell script
 	    (<filename>doc/head/share/pgpkeys/addkey.sh</filename>) to
 	    make this easier.  See the <link
 	      xlink:href="http://svnweb.FreeBSD.org/doc/head/share/pgpkeys/README">README</link>
 	    file for more information.</para>
 
 	  <para>Use
 	    <filename>doc/head/share/pgpkeys/checkkey.sh</filename> to
 	    verify that keys meet minimal best-practices
 	    standards.</para>
 
 	  <para>After adding and checking a key, add both updated
 	    files to source control and then commit them.  Entries in
 	    this file are sorted by last name.</para>
 
 	  <note>
 	    <para>It is very important to have a current
 	      <acronym>PGP</acronym>/Gnu<acronym>PG</acronym> key in
 	      the repository.  The key may be required for positive
 	      identification of a committer.  For example, the
 	      &a.admins; might need it for account recovery.  A
 	      complete keyring of <systemitem
 		class="fqdomainname">FreeBSD.org</systemitem> users is
 	      available for download from <link
 		xlink:href="&url.base;/doc/pgpkeyring.txt">https://www.FreeBSD.org/doc/pgpkeyring.txt</link>.</para>
 	  </note>
 	</step>
 
 	<step>
 	  <title>Update Mentor and Mentee Information</title>
 
 	  <para><filename>base/head/share/misc/committers-<replaceable>repository</replaceable>.dot</filename>
 	    &mdash; Add an entry to the current committers section,
 	    where <replaceable>repository</replaceable> is
 	    <literal>doc</literal>, <literal>ports</literal>, or
 	    <literal>src</literal>, depending on the commit privileges
 	    granted.</para>
 
 	  <para>Add an entry for each additional mentor/mentee
 	    relationship in the bottom section.</para>
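 
 	  <para>The entries are Graphviz statements.  The following
 	    sketch uses placeholder names and assumes the node and
 	    edge layout of the entries already in the file; check
 	    the neighboring entries before committing:</para>
 
 	  <programlisting><replaceable>newuser</replaceable> [label="<replaceable>New User</replaceable>\n<replaceable>newuser</replaceable>@FreeBSD.org\n<replaceable>2020/01/01</replaceable>"]
 
 <replaceable>mentor</replaceable> -&gt; <replaceable>newuser</replaceable></programlisting>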
 	</step>
 
 	<step>
 	  <title>Generate a <application>Kerberos</application>
 	    Password</title>
 
 	  <para>See <xref linkend="kerberos-ldap"/> to generate or
 	    set a <application>Kerberos</application> password for
 	    use with other &os; services like the bug tracking
 	    database.</para>
 	</step>
 
 	<step>
 	  <title>Optional: Enable Wiki Account</title>
 
 	  <para><link xlink:href="https://wiki.freebsd.org">&os;
 	      Wiki</link> Account &mdash; A wiki account allows
 	    sharing projects and ideas.  Those who do not yet have an
 	    account can follow instructions on the <link
 	    xlink:href="https://wiki.freebsd.org/AboutWiki">AboutWiki
 	    Page</link> to obtain one.  Contact
 	    <email>wiki-admin@FreeBSD.org</email> if you need help
 	    with your Wiki account.</para>
 	</step>
 
 	<step>
 	  <title>Optional: Update Wiki Information</title>
 
 	  <para>Wiki Information - After gaining access to the wiki,
 	    some people add entries to the <link
 	      xlink:href="https://wiki.freebsd.org/HowWeGotHere">How
 	      We Got Here</link>, <link
 	      xlink:href="https://wiki.freebsd.org/IRC/Nicknames">IRC
 	      Nicks</link>, and <link
 	      xlink:href="https://wiki.freebsd.org/Community/Dogs">
 	      Dogs of FreeBSD</link> pages.</para>
 	</step>
 
 	<step>
 	  <title>Optional: Update Ports with Personal
 	    Information</title>
 
 	  <para><filename>ports/astro/xearth/files/freebsd.committers.markers</filename>
 	    and
 	    <filename>src/usr.bin/calendar/calendars/calendar.freebsd</filename>
 	    - Some people add entries for themselves to these files to
 	    show where they are located or the date of their
 	    birthday.</para>
 	</step>
 
 	<step>
 	  <title>Optional: Prevent Duplicate Mailings</title>
 
 	  <para>Subscribers to &a.svn-src-all.name;,
 	    &a.svn-ports-all.name; or &a.svn-doc-all.name; might wish
 	    to unsubscribe to avoid receiving duplicate copies of
 	    commit messages and followups.</para>
 	</step>
       </procedure>
     </sect2>
 
     <sect2 xml:id="conventions-everyone">
       <title>For Everyone</title>
 
       <procedure xml:id="conventions-everyone-steps">
 	<step>
 	  <para>Introduce yourself to the other developers, otherwise
 	    no one will have any idea who you are or what you are
 	    working on.  The introduction need not be a comprehensive
 	    biography, just write a paragraph or two about who you
 	    are, what you plan to be working on as a developer in
 	    &os;, and who will be your mentor.  Email this to the
 	    &a.developers; and you will be on your way!</para>
 	</step>
 
 	<step>
 	  <para>Log into <systemitem>freefall.FreeBSD.org</systemitem>
 	    and create a
 	    <filename>/var/forward/<replaceable>user</replaceable></filename>
 	    (where <replaceable>user</replaceable> is your username)
 	    file containing the e-mail address where you want mail
 	    addressed to
 	    <replaceable>yourusername</replaceable>@FreeBSD.org to be
 	    forwarded.  This includes all of the commit messages as
 	    well as any other mail addressed to the &a.committers; and
 	    the &a.developers;.  Really large mailboxes which have
 	    taken up permanent residence on
 	    <systemitem>freefall</systemitem> may get truncated
 	    without warning if space needs to be freed, so forward it
 	    or save it elsewhere.</para>
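 
 	  <para>For example, with a placeholder destination
 	    address:</para>
 
 	  <screen>&prompt.user; <userinput>echo <replaceable>user@example.org</replaceable> &gt; /var/forward/<replaceable>user</replaceable></userinput></screen>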
 
 	  <note>
 	    <para>If your e-mail system uses SPF with strict rules,
 	      you should whitelist <systemitem
 		class="fqdomainname">mx2.FreeBSD.org</systemitem> from
 	      SPF checks.</para>
 	  </note>
 
 	  <para>Due to the severe load dealing with SPAM places on the
 	    central mail servers that do the mailing list processing,
 	    the front-end server performs some basic checks and will
 	    drop some messages based on these checks.  At the moment,
 	    proper DNS information for the connecting host is the only
 	    check in place, but that may change.  Some people blame
 	    these checks for bouncing valid email.  To have these
 	    checks turned off for your email, create a file
 	    named <filename>~/.spam_lover</filename>
 	    on <systemitem
 	      class="fqdomainname">freefall.FreeBSD.org</systemitem>.</para>
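 
 	  <para>Assuming that the mere presence of the file is what
 	    matters, an empty file can be created with:</para>
 
 	  <screen>&prompt.user; <userinput>touch ~/.spam_lover</userinput></screen>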
 	</step>
       </procedure>
 
       <note>
 	<para>Those who are developers but not committers will
 	  not be subscribed to the committers or developers mailing
 	  lists.  The subscriptions are derived from the access
 	  rights.</para>
       </note>
 
       <sect3 xml:id="smtp-setup">
 	<title>SMTP Access Setup</title>
 
 	<para>To send e-mail messages through the
 	  FreeBSD.org infrastructure, follow the instructions
 	  below:</para>
 
 	<procedure>
 	  <step>
 	    <para>Point your mail client at
 	      <literal><systemitem
 		class="fqdomainname">smtp.FreeBSD.org</systemitem>:587</literal>.</para></step>
 
 	  <step>
 	    <para>Enable STARTTLS.</para>
 	  </step>
 
 	  <step>
 	    <para>Ensure your <literal>From:</literal> address is set
 	      to
 	      <literal><replaceable>yourusername</replaceable>@FreeBSD.org</literal>.</para>
 	  </step>
 
 	  <step>
 	    <para>For authentication, you can use your &os; Kerberos
 	      username and password (see <xref
 		linkend="kerberos-ldap"/>).  The
 	      <literal><replaceable>yourusername</replaceable>/mail</literal>
 	      principal is preferred, as it is only valid for
 	      authenticating to mail resources.</para>
 
 	    <note>
 	      <para>Do not include <literal>@FreeBSD.org</literal>
 		when entering in your username.</para>
 	    </note>
 	  </step>
 	</procedure>
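 
 	<para>Before configuring a mail client, the connection and
 	  STARTTLS support can be checked by hand.  This is a sketch
 	  that assumes <command>openssl</command> is available on
 	  the local machine:</para>
 
 	<screen>&prompt.user; <userinput>openssl s_client -starttls smtp -connect smtp.FreeBSD.org:587</userinput></screen>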
 
 	<note>
 	  <title>Additional Notes</title>
 
 	  <itemizedlist>
 	    <listitem>
 	      <para>The server will only accept mail from
 		<literal><replaceable>yourusername</replaceable>@FreeBSD.org</literal>.
 		If you are authenticated as one user, you are not
 		permitted to send mail from another.</para>
 	    </listitem>
 
 	    <listitem>
 	      <para>A header will be appended with the SASL username:
 		(<literal>Authenticated sender:
 		<replaceable>username</replaceable></literal>).</para></listitem>
 
 	    <listitem>
 	      <para>The host has various rate limits in place to cut down
 		on brute force attempts.</para>
 	    </listitem>
 	  </itemizedlist>
 	</note>
 
 	<sect4 xml:id="smtp-setup-local-mta">
 	  <title>Using a Local MTA to Forward Emails to the
 	    &os;.org SMTP Service</title>
 
 	  <para>It is also possible to use a local
 	    <acronym>MTA</acronym> to forward locally sent emails to
 	    the &os;.org SMTP servers.</para>
 
 	  <example xml:id="smtp-setup-local-postfix">
 	    <title>Using <application>Postfix</application></title>
 
 	    <para>To tell a local Postfix instance that anything from
 	      <literal><replaceable>yourusername</replaceable>@FreeBSD.org</literal>
 	      should be forwarded to the &os;.org servers, add this to
 	      your <filename>main.cf</filename>:</para>
 
 	    <programlisting>sender_dependent_relayhost_maps = hash:/usr/local/etc/postfix/relayhost_maps
 smtp_sasl_auth_enable = yes
 smtp_sasl_security_options = noanonymous
 smtp_sasl_password_maps = hash:/usr/local/etc/postfix/sasl_passwd
 smtp_use_tls = yes</programlisting>
 
 	    <para>Create
 	      <filename>/usr/local/etc/postfix/relayhost_maps</filename>
 	      with the following content:</para>
 
 	    <programlisting><replaceable>yourusername</replaceable>@FreeBSD.org  [smtp.freebsd.org]:587</programlisting>
 
 	    <para>Create
 	      <filename>/usr/local/etc/postfix/sasl_passwd</filename>
 	      with the following content:</para>
 
 	    <programlisting>[smtp.freebsd.org]:587          <replaceable>yourusername</replaceable>:<replaceable>yourpassword</replaceable></programlisting>
 
 	    <para>If the email server is used by other people, you
 	      may want to prevent them from sending e-mails from your
 	      address.  To achieve this, add this to your
 	      <filename>main.cf</filename>:</para>
 
 	    <programlisting>smtpd_sender_login_maps = hash:/usr/local/etc/postfix/sender_login_maps
 smtpd_sender_restrictions = reject_known_sender_login_mismatch</programlisting>
 
 	    <para>Create
 	      <filename>/usr/local/etc/postfix/sender_login_maps</filename>
 	      with the following content:</para>
 
 	    <programlisting><replaceable>yourusername</replaceable>@FreeBSD.org <replaceable>yourlocalusername</replaceable></programlisting>
 
 	    <para>Where <replaceable>yourlocalusername</replaceable>
 	      is the <acronym>SASL</acronym> username used to connect
 	      to the local instance of
 	      <application>Postfix</application>.</para>
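 
 	    <para>Postfix reads <literal>hash:</literal> maps from
 	      their compiled form, so after creating or editing the
 	      files above, rebuild them with
 	      <command>postmap</command> and reload Postfix.  The
 	      following is a sketch, run as
 	      <systemitem class="username">root</systemitem>; the
 	      <command>chmod</command> is a suggested precaution
 	      because <filename>sasl_passwd</filename> holds a
 	      password:</para>
 
 	    <screen>&prompt.root; <userinput>chmod 600 /usr/local/etc/postfix/sasl_passwd</userinput>
 &prompt.root; <userinput>postmap /usr/local/etc/postfix/relayhost_maps</userinput>
 &prompt.root; <userinput>postmap /usr/local/etc/postfix/sasl_passwd</userinput>
 &prompt.root; <userinput>postmap /usr/local/etc/postfix/sender_login_maps</userinput>
 &prompt.root; <userinput>postfix reload</userinput></screen>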
 	  </example>
 	</sect4>
       </sect3>
     </sect2>
 
     <sect2 xml:id="mentors">
       <title>Mentors</title>
 
       <para>All new developers have a mentor assigned to them for
 	the first few months.  A mentor is responsible for teaching
 	the mentee the rules and conventions of the project and
 	guiding their first steps in the developer community.  The
 	mentor is also personally responsible for the mentee's actions
 	during this initial period.</para>
 
       <para>For committers: do not commit anything without first
 	getting mentor approval.  Document that approval with an
 	<literal>Approved by:</literal> line in the commit
 	message.</para>
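 
       <para>A sketch of such a log message, with a placeholder
 	summary and mentor username, and with the
 	<literal>(mentor)</literal> suffix assumed from common
 	practice in existing commit logs:</para>
 
       <programlisting>Fix a typo in a comment.
 
 Approved by:	<replaceable>mentorname</replaceable> (mentor)</programlisting>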
 
       <para>When the mentor decides that a mentee has learned the
 	ropes and is ready to commit on their own, the mentor
 	announces it with a commit to
 	<filename>conf/mentors</filename>.  This file is in the
 	<filename>svnadmin</filename> branch of each
 	repository:</para>
 
       <informaltable frame="none">
 	<tgroup cols="2">
 	  <tbody>
 	    <row>
 	      <entry><literal>src</literal></entry>
 	      <entry><filename>base/svnadmin/conf/mentors</filename></entry>
 	    </row>
 
 	    <row>
 	      <entry><literal>doc</literal></entry>
 	      <entry><filename>doc/svnadmin/conf/mentors</filename></entry>
 	    </row>
 
 	    <row>
 	      <entry><literal>ports</literal></entry>
 	      <entry><filename>ports/svnadmin/conf/mentors</filename></entry>
 	    </row>
 	  </tbody>
 	</tgroup>
       </informaltable>
 
       <para>New committers should aim to complete enough commits that
 	their mentor is comfortable releasing them from mentorship
 	within the first year.  If they are still under mentorship
 	at that point, the appropriate management body (core,
 	doceng, or portmgr) should attempt to ensure that there are
 	no barriers preventing completion.  If the committer is
 	unable to satisfy their mentor of readiness by a year and a
 	half, their commit bit may be converted to project
 	membership.</para>
     </sect2>
   </sect1>
 
   <sect1 xml:id="pre-commit-review">
     <title>Pre-Commit Review</title>
 
     <para>Code review is one way to increase the quality of software.
       The following guidelines apply to commits to the
       <literal>head</literal> (-CURRENT) branch of the
       <literal>src</literal> repository.  Other branches and the
       <literal>ports</literal> and <literal>doc</literal> trees have
       their own review policies, but these guidelines generally apply
       to commits requiring review:</para>
     <itemizedlist>
       <listitem>
 	<para>All non-trivial changes should be reviewed before they
 	  are committed to the repository.</para>
       </listitem>
 
       <listitem>
 	<para>Reviews may be conducted by email, in
 	  <application>Bugzilla</application>, in
 	  <application>Phabricator</application>, or by another
 	  mechanism.  Where possible, reviews should be public.</para>
       </listitem>
 
       <listitem>
 	<para>The developer responsible for a code change is also
 	  responsible for making all necessary review-related
 	  changes.</para>
       </listitem>
 
       <listitem>
 	<para>Code review can be an iterative process, which continues
 	  until the patch is ready to be committed.  Specifically,
 	  once a patch is sent out for review, it should receive an
 	  explicit <quote>looks good</quote> before it is committed.
 	  So long as it is explicit, this can take whatever form makes
 	  sense for the review method.</para>
       </listitem>
 
       <listitem>
 	<para>Timeouts are not a substitute for review.</para>
       </listitem>
     </itemizedlist>
 
     <para>Sometimes code reviews will take longer than you would hope
       for, especially for larger features.  Accepted ways to speed up
       review times for your patches are:</para>
 
     <itemizedlist>
       <listitem>
 	<para>Review other people's patches.  If you help out,
 	  everybody will be more willing to do the same for you;
 	  goodwill is our currency.</para>
       </listitem>
 
       <listitem>
 	<para>Ping the patch.  If it is urgent, provide reasons why
 	  it is important to you to get this patch landed and ping
 	  it every couple of days.  If it is not urgent, the common
 	  courtesy ping rate is one week.  Remember that you are
 	  asking for valuable time from other professional
 	  developers.</para>
       </listitem>
 
       <listitem>
 	<para>Ask for help on mailing lists, IRC, etc.  Others
 	  may be able to either help you directly, or suggest a
 	  reviewer.</para>
       </listitem>
 
       <listitem>
 	<para>Split your patch into multiple smaller patches that
 	  build on each other.  The smaller your patch, the higher
 	  the probability that somebody will take a quick look at
 	  it.</para>
 
 	<para>When making large changes, it is helpful to keep this
 	  in mind from the beginning of the effort as breaking large
 	  changes into smaller ones is often difficult after the
 	  fact.</para>
       </listitem>
     </itemizedlist>
 
     <para>Developers should participate in code reviews as both
       reviewers and reviewees.  If someone is kind enough to review
       your code, you should return the favor for someone else.
       Note that while anyone is welcome to review and give feedback
       on a patch, only an appropriate subject-matter expert can
       approve a change.  This will usually be a committer who works
       with the code in question on a regular basis.</para>
 
     <para>In some cases, no subject-matter expert may be available.
       In those cases, a review by an experienced developer is
       sufficient when coupled with appropriate testing.</para>
   </sect1>
 
   <sect1 xml:id="commit-log-message">
     <title>Commit Log Messages</title>
 
     <para>This section contains some suggestions and traditions for
       how commit logs are formatted.</para>
 
     <para>As well as including an informative message with each
       commit, some additional information may be needed.</para>
 
     <para>This information consists of one or more lines
       containing the key word or phrase, a colon, tabs for formatting,
       and then the additional information.</para>
 
     <para>The key words or phrases are:</para>
 
     <informaltable frame="none" pgwide="1">
       <tgroup cols="2">
 	<tbody>
 	  <row>
 	    <entry><literal>PR:</literal></entry>
 	    <entry>The problem report (if any) which is affected
 	      (typically, by being closed) by this commit.
 	      Multiple PRs may be specified on one line, separated by
 	      commas or spaces.</entry>
 	  </row>
 
 	  <row>
 	    <entry><literal>Submitted by:</literal></entry>
 	    <entry>
 	      <para>The name and e-mail address of the person
 		that submitted the fix; for developers, just the
 		username on the &os; cluster.</para>
 
 	      <para>If the submitter is the maintainer of the port
 		being committed, include "(maintainer)"
 		after the email address.</para>
 
 	      <para>Avoid obfuscating the email address of the
 		submitter as this adds additional work when searching
 		logs.</para>
 	    </entry>
 	  </row>
 
 	  <row>
 	    <entry><literal>Reviewed by:</literal></entry>
 	    <entry>The name and e-mail address of the person or
 	      people that reviewed the change; for developers,
 	      just the username on the &os; cluster.  If a
 	      patch was submitted to a mailing list for review,
 	      and the review was favorable, then just include
 	      the list name.</entry>
 	  </row>
 
 	  <row>
 	    <entry><literal>Approved by:</literal></entry>
 	    <entry><para>The name and e-mail address of the person or
 	      people that approved the change; for developers, just
 	      the username on the &os; cluster.  It is customary to
 	      get prior approval for a commit if it is to an area of
 	      the tree to which you do not usually commit.  In
 	      addition, during the run up to a new release all commits
 	      <emphasis>must</emphasis> be approved by the release
 	      engineering team.</para>
 
 	    <para>While under mentorship, get mentor approval before
 	      the commit.  Enter the mentor's username in this field,
 	      and note that they are a mentor:</para>
 
 	    <screen>Approved by: <userinput><replaceable>username-of-mentor</replaceable> <literal>(mentor)</literal></userinput></screen>
 
 	    <para>If a team approved these commits then include the
 	      team name followed by the username of the approver in
 	      parentheses.  For example:</para>
 
 	    <screen>Approved by: <userinput><literal>re</literal> (<replaceable>username</replaceable>)</userinput></screen></entry>
 	  </row>
 
 	  <row>
 	    <entry><literal>Obtained from:</literal></entry>
 	    <entry>The name of the project (if any) from which
 	      the code was obtained.  Do not use this line for the
 	      name of an individual person.</entry>
 	  </row>
 
 	  <row>
 	    <entry><literal>Sponsored by:</literal></entry>
 	    <entry>Sponsoring organizations for this change, if any.
 	      Separate multiple organizations with commas.  If only a
 	      portion of the work was sponsored, or different amounts
 	      of sponsorship were provided to different authors,
 	      please give appropriate credit in parentheses after each
 	      sponsor name.  For example, <literal>Example.com (alice,
 		code refactoring), Wormulon (bob), Momcorp
 		(cindy)</literal> shows that Alice was sponsored by
 	      Example.com to do code refactoring, while Wormulon
 	      sponsored Bob's work and Momcorp sponsored Cindy's work.
 	      Other authors were either not sponsored or chose not to
 	      list sponsorship.</entry>
 	  </row>
 
 	  <row>
 	    <entry><literal>MFC after:</literal></entry>
 	    <entry>To receive an e-mail reminder to
 	      <acronym>MFC</acronym> at a later date, specify the
 	      number of days, weeks, or months after which an
 	      <acronym>MFC</acronym> is planned.</entry>
 	  </row>
 
 	  <row>
 	    <entry><literal>MFC to:</literal></entry>
 	    <entry>If the commit should be merged to a subset of
 	      stable branches, specify the branch names.</entry>
 	  </row>
 
 	  <row>
 	    <entry><literal>MFC with:</literal></entry>
 	    <entry>If the commit should be merged together with
 	      a previous one in a single
 	      <acronym>MFC</acronym> commit (for example, where
 	      this commit corrects a bug in the previous change),
 	      specify the corresponding revision number.</entry>
 	  </row>
 
 	  <row>
 	    <entry><literal>Relnotes:</literal></entry>
 	    <entry>If the change is a candidate for inclusion in
 	      the release notes for the next release from the branch,
 	      set to <literal>yes</literal>.</entry>
 	  </row>
 
 	  <row>
 	    <entry><literal>Security:</literal></entry>
 	    <entry>If the change is related to a security
 	      vulnerability or security exposure, include one or more
 	      references or a description of the issue.  If possible,
 	      include a VuXML URL or a CVE ID.</entry>
 	  </row>
 
 	  <row>
 	    <entry><literal>Event:</literal></entry>
 	    <entry>The description for the event where this commit was
 	      made.  If this is a recurring event, add the year or
 	      even the month to it.  For example, this could be
 	      <literal>FooBSDcon 2019</literal>.  The idea behind this
 	      line is to put recognition to conferences, gatherings,
 	      and other types of meetups and to show that these are
 	      useful to have.  Please do not use the
 	      <literal>Sponsored by:</literal> line for this as that
 	      is meant for organizations sponsoring certain features
 	      or developers working on them.</entry>
 	  </row>
 
 	  <row>
 	    <entry><literal>Differential Revision:</literal></entry>
 	    <entry>The full URL of the Phabricator review.  This line
 	      <emphasis>must be the last line</emphasis>.  For
 	      example:
 	      <literal>https://reviews.freebsd.org/D1708</literal>.</entry>
 	  </row>
 	</tbody>
       </tgroup>
     </informaltable>
 
     <example>
       <title>Commit Log for a Commit Based on a PR</title>
 
       <para>The commit is based on a patch from a PR submitted by John
 	Smith.  The commit message <quote>PR</quote> and
 	<quote>Submitted by</quote> fields are filled.</para>
 
       <programlisting>...
 
 	    PR:                    12345
 	    Submitted by:	   John Smith &lt;John.Smith@example.com&gt;</programlisting>
     </example>
 
     <example>
       <title>Commit Log for a Commit Needing Review</title>
 
       <para>The virtual memory system is being changed.  The patches
 	were posted to the appropriate mailing list (in this
 	case, <literal>freebsd-arch</literal>) and the changes have
 	been approved.</para>
 
       <programlisting>...
 
 	    Reviewed by:       -arch</programlisting>
     </example>
 
     <example>
       <title>Commit Log for a Commit Needing Approval</title>
 
       <para>Commit a port, after working with
 	the listed MAINTAINER, who said to go ahead and
 	commit.</para>
 
       <programlisting>...
 
 	    Approved by:	    <replaceable>abc</replaceable> (maintainer)</programlisting>
 
       <para>Where <replaceable>abc</replaceable> is the account name
 	of the person who approved.</para>
     </example>
 
     <example>
       <title>Commit Log for a Commit Bringing in Code from
 	OpenBSD</title>
 
       <para>Committing some code based on work done in the
 	OpenBSD project.</para>
 
       <programlisting>...
 
 	    Obtained from:      OpenBSD</programlisting>
     </example>
 
     <example>
       <title>Commit Log for a Change to &os.current; with a Planned
 	Commit to &os.stable; to Follow at a Later Date</title>
 
       <para>Committing some code which will be merged from
 	&os.current; into the &os.stable; branch after two
 	weeks.</para>
 
       <programlisting>...
 
 MFC after:      <replaceable>2 weeks</replaceable></programlisting>
 
       <para>Where <replaceable>2</replaceable> is the number of days,
 	weeks, or months after which an <acronym>MFC</acronym> is
 	planned.  The <replaceable>weeks</replaceable> option may be
 	<literal>day</literal>, <literal>days</literal>,
 	<literal>week</literal>, <literal>weeks</literal>,
 	<literal>month</literal>, <literal>months</literal>.</para>
     </example>
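 
     <example>
       <title>Commit Log for a Follow-up Fix to Be Merged with an
 	Earlier Commit</title>
 
       <para>A sketch of the <literal>MFC with:</literal> line
 	described in the table above, for a commit that corrects a
 	bug in an earlier change.  The revision number
 	<replaceable>r123456</replaceable> and the three-day delay
 	are placeholders chosen for illustration, not project
 	requirements.</para>
 
       <programlisting>...
 
 MFC after:      <replaceable>3 days</replaceable>
 MFC with:       <replaceable>r123456</replaceable></programlisting>
     </example>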
 
     <para>It is often necessary to combine these.</para>
 
     <para>Consider the situation where a user has submitted a PR
       containing code from the NetBSD project.  Looking at the PR, the
       developer sees it is not an area of the tree they normally work
       in, so they have the change reviewed by the
       <literal>arch</literal> mailing list.  Since the change is
       complex, the developer opts to <acronym>MFC</acronym> after one
       month to allow adequate testing.</para>
 
     <para>The extra information to include in the commit would look
       something like:</para>
 
     <example>
       <title>Example Combined Commit Log</title>
 
       <programlisting>PR:                 54321
 Submitted by:       John Smith &lt;John.Smith@example.com&gt;
 Reviewed by:        -arch
 Obtained from:      NetBSD
 MFC after:          1 month
 Relnotes:           yes</programlisting>
     </example>
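 
     <example>
       <title>Commit Log for a Commit Reviewed in Phabricator</title>
 
       <para>A sketch that combines the
 	<literal>Sponsored by:</literal> and
 	<literal>Differential Revision:</literal> lines described in
 	the table above.  The reviewer username
 	<replaceable>carol</replaceable> is a placeholder, and the
 	sponsor string and review URL simply reuse the sample values
 	given in the table.  The
 	<literal>Differential Revision:</literal> line is placed
 	last, as that field requires.</para>
 
       <programlisting>Reviewed by:            <replaceable>carol</replaceable>
 Sponsored by:           Example.com (alice, code refactoring)
 Differential Revision:  https://reviews.freebsd.org/D1708</programlisting>
     </example>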
   </sect1>
 
   <sect1 xml:id="pref-license">
     <title>Preferred License for New Files</title>
 
     <para>The &os; Project's full license policy can be found at <link
 	xlink:href="&url.base;/internal/software-license.html">https://www.FreeBSD.org/internal/software-license.html</link>.
       The rest of this section is intended to help you get started.
       As a rule, when in doubt, ask.  It is much easier to give advice
       than to fix the source tree.</para>
 
     <para>The &os; Project suggests and uses this
       text as the preferred license scheme:</para>
 
     <programlisting>/*-
  * SPDX-License-Identifier: BSD-2-Clause-FreeBSD
  *
  * Copyright (c) [year] [your name]
  *
  * Redistribution and use in source and binary forms, with or without
  * modification, are permitted provided that the following conditions
  * are met:
  * 1. Redistributions of source code must retain the above copyright
  *    notice, this list of conditions and the following disclaimer.
  * 2. Redistributions in binary form must reproduce the above copyright
  *    notice, this list of conditions and the following disclaimer in the
  *    documentation and/or other materials provided with the distribution.
  *
  * THIS SOFTWARE IS PROVIDED BY THE AUTHOR AND CONTRIBUTORS ``AS IS'' AND
  * ANY EXPRESS OR IMPLIED WARRANTIES, INCLUDING, BUT NOT LIMITED TO, THE
  * IMPLIED WARRANTIES OF MERCHANTABILITY AND FITNESS FOR A PARTICULAR PURPOSE
  * ARE DISCLAIMED.  IN NO EVENT SHALL THE AUTHOR OR CONTRIBUTORS BE LIABLE
  * FOR ANY DIRECT, INDIRECT, INCIDENTAL, SPECIAL, EXEMPLARY, OR CONSEQUENTIAL
  * DAMAGES (INCLUDING, BUT NOT LIMITED TO, PROCUREMENT OF SUBSTITUTE GOODS
  * OR SERVICES; LOSS OF USE, DATA, OR PROFITS; OR BUSINESS INTERRUPTION)
  * HOWEVER CAUSED AND ON ANY THEORY OF LIABILITY, WHETHER IN CONTRACT, STRICT
  * LIABILITY, OR TORT (INCLUDING NEGLIGENCE OR OTHERWISE) ARISING IN ANY WAY
  * OUT OF THE USE OF THIS SOFTWARE, EVEN IF ADVISED OF THE POSSIBILITY OF
  * SUCH DAMAGE.
  *
  * [id for your version control system, if any]
  */</programlisting>
 
     <para>The &os; project strongly discourages the so-called
       "advertising clause" in new code.  Due to the large number of
       contributors to the &os; project, complying with this clause for
       many commercial vendors has become difficult.  If you have code
       in the tree with the advertising clause, please consider
       removing it.  In fact, please consider using the above license
       for your code.</para>
 
     <para>The &os; project discourages completely new licenses and
       variations on the standard licenses.  New licenses require the
       approval of the &a.core; to reside in the
       main repository.  The more different licenses that are used in
       the tree, the more problems that this causes to those wishing to
       utilize this code, typically from unintended consequences from a
       poorly worded license.</para>
 
     <para>Project policy dictates that code under some non-BSD
       licenses must be placed only in specific sections of the
       repository, and in some cases, compilation must be conditional
       or even disabled by default.  For example, the GENERIC kernel
       must be compiled under only licenses identical to or
       substantially similar to the BSD license.  GPL, APSL, CDDL, etc.,
       licensed software must not be compiled into GENERIC.</para>
 
     <para>Developers are reminded that in open source, getting "open"
       right is just as important as getting "source" right, as
       improper handling of intellectual property has serious
       consequences.  Any questions or concerns should immediately be
       brought to the attention of the core team.</para>
   </sect1>
 
   <sect1 xml:id="tracking.license.grants">
     <title>Keeping Track of Licenses Granted to the &os;
       Project</title>
 
     <para>Various software or data exist in the repositories where
       the &os; project has been granted a special license to be able
       to use them.  A case in point is the Terminus fonts for use
       with &man.vt.4;.  Here the author Dimitar Zhekov has allowed us
       to use the "Terminus BSD Console" font under a 2-clause BSD
       license rather than the regular Open Font License he normally
       uses.</para>
 
     <para>It is clearly sensible to keep a record of any such
       license grants.  To that end, the &a.core; has decided to keep
       an archive of them.  Whenever the &os; project is granted a
       special license we require the &a.core; to be notified.  Any
       developers involved in arranging such a license grant, please
       send details to the &a.core; including:</para>
 
     <itemizedlist>
       <listitem>
 	<para>Contact details for people or organizations granting the
 	  special license.</para>
       </listitem>
 
       <listitem>
 	<para>What files, directories etc. in the repositories are
 	  covered by the license grant including the revision numbers
 	  where any specially licensed material was committed.</para>
       </listitem>
 
       <listitem>
 	<para>The date the license comes into effect from.  Unless
 	  otherwise agreed, this will be the date the license was
 	  issued by the authors of the software in question.</para>
       </listitem>
 
       <listitem>
 	<para>The license text.</para>
       </listitem>
 
       <listitem>
 	<para>A note of any restrictions, limitations or exceptions
 	  that apply specifically to &os;'s usage of the licensed
 	  material.</para>
       </listitem>
 
       <listitem>
 	<para>Any other relevant information.</para>
       </listitem>
     </itemizedlist>
 
     <para>Once the &a.core; is satisfied that all the necessary
       details have been gathered and are correct, the secretary will
       send a PGP-signed acknowledgement of receipt including the
       license details.  This receipt will be persistently archived and
       serve as our permanent record of the license grant.</para>
 
     <para>The license archive should contain only details of license
       grants; this is not the place for any discussions around
       licensing or other subjects.  Access to data within the license
       archive will be available on request to the &a.core;.</para>
   </sect1>
 
   <sect1 xml:id="developer.relations">
     <title>Developer Relations</title>
 
     <para>When working directly on your own code or on code which is
       already well established as your responsibility, then there is
       probably little need to check with other committers before
       jumping in with a commit.  The same applies when working on a
       bug in an area of the system which is clearly orphaned (and
       there are a few such areas, to our shame).  When modifying
       parts of the system which are maintained, formally or
       informally, consider asking for review just as a developer
       would have before becoming a
       committer.  For ports, contact the listed
       <varname>MAINTAINER</varname> in the
       <filename>Makefile</filename>.</para>
 
     <para>To determine if an area of the tree is maintained, check the
       MAINTAINERS file at the root of the tree.  If nobody is listed,
       scan the revision history to see who has committed
       changes in the past.  An example script that lists each person
       who has committed to a given file along with the number of
       commits each person has made can be found on
       <systemitem>freefall</systemitem> at
       <filename>~eadler/bin/whodid</filename>.  If queries go
       unanswered or the committer otherwise indicates a lack of
       interest in the area affected, go ahead and commit it.</para>
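 
     <para>The per-committer count mentioned above can also be
       approximated with standard tools.  The following is not the
       <filename>whodid</filename> script itself, only a rough sketch
       that assumes a <application>Subversion</application> working
       copy and uses <replaceable>path/to/file</replaceable> as a
       placeholder:</para>
 
     <screen>&prompt.user; <userinput>svn log -q <replaceable>path/to/file</replaceable> | grep '^r' | cut -d '|' -f 2 | sort | uniq -c | sort -rn</userinput></screen>
 
     <para>Each output line shows a commit count followed by a
       committer's username, with the most frequent committers listed
       first.</para>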
 
     <important>
       <para>Avoid sending private emails to maintainers.  Other people
 	might be interested in the conversation, not just the final
 	output.</para>
     </important>
 
     <para>If there is any doubt about a commit for any reason at all,
       have it reviewed before
       committing.  Better to have it flamed then and there rather than
       when it is part of the repository.  If a commit does result in
       controversy erupting, it may be advisable to consider backing
       the change out again until the matter is settled.  Remember,
       with a version control system we can always change it
       back.</para>
 
     <para>Do not impugn the intentions of others.  If they see a
       different solution to a problem, or even a different problem, it
       is probably not because they are stupid, because they have
       questionable parentage, or because they are trying to destroy
       hard work, personal image, or &os;, but basically because they
       have a different outlook on the world.  Different is
       good.</para>
 
     <para>Disagree honestly.  Argue your position from its merits,
       be honest about any shortcomings it may have, and consider
       their solution, or even their vision of the problem,
       with an open mind.</para>
 
     <para>Accept correction.  We are all fallible.  When you have made
       a mistake, apologize and get on with life.  Do not beat
       yourself up, and certainly do not beat up others for your mistake.
       Do not waste time on embarrassment or recrimination, just fix
       the problem and move on.</para>
 
     <para>Ask for help.  Seek out (and give) peer reviews.  One of
       the ways open source software is supposed to excel is in the
       number of eyeballs applied to it; this does not apply if nobody
       will review code.</para>
   </sect1>
 
   <sect1 xml:id="if-in-doubt">
     <title>If in Doubt...</title>
 
     <para>When unsure about something, whether it be a
       technical issue or a project convention, be sure to ask.  If you
       stay silent you will never make progress.</para>
 
     <para>If it relates to a technical issue, ask on the public
       mailing lists.  Avoid the temptation to email the individual
       person that knows the answer.  This way everyone will be able to
       learn from the question and the answer.</para>
 
     <para>For project-specific or administrative questions,
       ask, in order:</para>
 
     <itemizedlist>
       <listitem>
 	<para>Your mentor or former mentor.</para>
       </listitem>
 
       <listitem>
 	<para>An experienced committer on IRC, email, etc.</para>
       </listitem>
 
       <listitem>
 	<para>Any team with a "hat", as they can give you a
 	  definitive answer.</para>
       </listitem>
 
       <listitem>
 	<para>If still not sure, ask on &a.developers;.</para>
       </listitem>
     </itemizedlist>
 
     <para>Once your question is answered, if no one pointed you to
       documentation that spelled out the answer to your question,
       document it, as others will have the same question.</para>
   </sect1>
 
   <sect1 xml:id="bugzilla">
     <title>Bugzilla</title>
 
     <para>The &os; Project utilizes
       <application>Bugzilla</application> for tracking bugs and change
       requests.  If you commit a fix or suggestion found in the PR
       database, be sure to close the corresponding PR.  It is also
       considered nice if you take time to close any PRs associated
       with your commits, if appropriate.</para>
 
     <para>Committers with
       non-<systemitem class="domainname">&os;.org</systemitem>
       Bugzilla accounts can have the old account merged with the
       <systemitem class="domainname">&os;.org</systemitem> account by
       following these steps:</para>
 
     <procedure>
       <step>
 	<para>Log in using your old account.</para>
       </step>
 
       <step>
 	<para>Open a new bug.  Choose <literal>Services</literal> as the
 	  Product, and <literal>Bug Tracker</literal> as the
 	  Component.  In the bug description, list the accounts you
 	  wish to be merged.</para>
       </step>
 
       <step>
 	<para>Log in using your <systemitem
 	    class="domainname">&os;.org</systemitem> account and post
 	  a comment to the newly opened bug to confirm ownership.  See <xref
 	    linkend="kerberos-ldap"/> for more details on how to
 	  generate or set a password for your <systemitem
 	    class="domainname">&os;.org</systemitem> account.</para>
       </step>
 
       <step>
 	<para>If there are more than two accounts to merge, post
 	  comments from each of them.</para>
       </step>
     </procedure>
 
     <para>You can find out more about
       <application>Bugzilla</application> at:</para>
 
     <itemizedlist>
       <listitem>
 	<para><link
 	    xlink:href="&url.articles.pr-guidelines;/index.html">&os;
 	    Problem Report Handling Guidelines</link></para>
       </listitem>
 
       <listitem>
 	<para><link
 	    xlink:href="&url.base;/support.html">https://www.FreeBSD.org/support.html</link></para>
       </listitem>
     </itemizedlist>
   </sect1>
 
   <sect1 xml:id="phabricator">
     <title>Phabricator</title>
 
     <para>The &os; Project utilizes <link
 	xlink:href="https://reviews.freebsd.org">Phabricator</link>
       for code review requests.  See the <link
 	xlink:href="https://wiki.freebsd.org/CodeReview">CodeReview</link>
       wiki page for details.</para>
 
     <para>Committers with
       non-<systemitem class="domainname">&os;.org</systemitem>
       Phabricator accounts can have the old account renamed to the
       <systemitem class="domainname">&os;.org</systemitem> account by
       following these steps:</para>
 
     <procedure>
       <step>
 	<para>Change your <application>Phabricator</application>
 	  account email to your <systemitem
 	    class="domainname">&os;.org</systemitem> email.</para>
       </step>
 
       <step>
 	<para>Open a new bug on our bug tracker using your <systemitem
 	    class="domainname">&os;.org</systemitem> account; see
 	  <xref linkend="bugzilla"/> for more information.  Choose
 	  <literal>Services</literal> as the Product, and
 	  <literal>Code Review</literal> as the Component.  In the bug
 	  description, request that your
 	  <application>Phabricator</application> account be renamed,
 	  and provide a link to your
 	  <application>Phabricator</application> user.  For example,
 	  <literal>https://reviews.freebsd.org/p/<replaceable>bob_example.com</replaceable>/</literal>.</para>
       </step>
     </procedure>
 
     <important>
       <para><application>Phabricator</application> accounts cannot be
 	merged; please do not open a new account.</para>
     </important>
   </sect1>
 
   <sect1 xml:id="people">
     <title>Who's Who</title>
 
     <para>Besides the repository meisters, there are other &os;
       project members and teams whom you will probably get to know in
       your role as a committer.  Briefly, and by no means
       all-inclusively, these are:</para>
 
     <variablelist>
       <varlistentry>
 	<term>&a.doceng;</term>
 
 	<listitem>
 	  <para>doceng is the group responsible for the documentation
 	    build infrastructure, approving new documentation
 	    committers, and ensuring that the &os; website and
 	    documentation on the FTP site is up to date with respect
 	    to the <application>subversion</application> tree.  It is
 	    not a conflict resolution body.
 	    The vast majority of documentation related discussion
 	    takes place on the &a.doc;.  More details regarding the
 	    doceng team can be found in its <link
 	      xlink:href="https://www.FreeBSD.org/internal/doceng.html">charter</link>.
 	    Committers interested in contributing to the documentation
 	    should familiarize themselves with the <link
 	      xlink:href="&url.books.fdp-primer;/index.html">Documentation
 	      Project Primer</link>.</para>
 	</listitem>
       </varlistentry>
 
       <varlistentry>
 	<term>&a.re.members.email;</term>
 
 	<listitem>
 	  <para>These are the members of the &a.re;.  This team is
 	    responsible for setting release deadlines and controlling
 	    the release process.  During code freezes, the release
 	    engineers have final authority on all changes to the
 	    system for whichever branch is pending release status.  If
 	    there is something you want merged from &os.current; to
 	    &os.stable; (whatever values those may have at any given
 	    time), these are the people to talk to about it.</para>
 	</listitem>
       </varlistentry>
 
       <varlistentry>
 	<term>&a.so.email;</term>
 
 	<listitem>
 	  <para>&a.so; is the
 	    <link xlink:href="&url.base;/security/">&os; Security
 	      Officer</link> and oversees the
 	    &a.security-officer;.</para>
 	</listitem>
       </varlistentry>
 
       <varlistentry>
 	<term>&a.wollman.email;</term>
 
 	<listitem>
 	  <para>If you need advice on obscure network internals or
 	    are not sure of some potential change to the networking
 	    subsystem you have in mind, Garrett is someone to talk
 	    to.  Garrett is also very knowledgeable on the various
 	    standards applicable to &os;.</para>
 	</listitem>
       </varlistentry>
 
       <varlistentry>
 	<term>&a.committers;</term>
 
 	<listitem>
 	  <para>&a.svn-src-all.name;, &a.svn-ports-all.name; and
 	    &a.svn-doc-all.name; are the mailing lists that the
 	    version control system uses to send commit messages to.
 	    <emphasis>Never</emphasis> send email directly
 	    to these lists.  Only send replies to these lists
 	    when they are short and are directly related to a
 	    commit.</para>
 	</listitem>
       </varlistentry>
 
       <varlistentry>
 	<term>&a.developers;</term>
 
 	<listitem>
 	  <para>All committers are subscribed to -developers.  This
 	    list was created to be a forum for the committers'
 	    <quote>community</quote> issues.  Examples are Core
 	    voting, announcements, etc.</para>
 
 	  <para>The &a.developers; is for the exclusive use of &os;
 	    committers.  To develop &os;, committers must
 	    have the ability to openly discuss matters that will be
 	    resolved before they are publicly announced.  Frank
 	    discussions of work in progress are not suitable for open
 	    publication and may harm &os;.</para>
 
 	  <para>All &os; committers are expected not to
 	    publish or forward messages from the
 	    &a.developers; outside the list membership without
 	    permission of all of the authors.  Violators will be
 	    removed from the
 	    &a.developers;, resulting in a suspension of commit
 	    privileges.  Repeated or flagrant violations may result in
 	    permanent revocation of commit privileges.</para>
 
 	  <para>This list is <emphasis>not</emphasis> intended as a
 	    place for code reviews or for any technical discussion.
 	    In fact using it as such hurts the &os; Project as it
 	    gives a sense of a closed list where general decisions
 	    affecting all of the &os; using community are made without
 	    being <quote>open</quote>.  Last, but not least,
 	    <emphasis>never, never ever, email the &a.developers; and
 	    CC:/BCC: another &os; list</emphasis>.  Never, ever email
 	    another &os; email list and CC:/BCC: the &a.developers;.
 	    Doing so can greatly diminish the benefits of this
 	    list.</para>
 	</listitem>
       </varlistentry>
     </variablelist>
   </sect1>
 
   <sect1 xml:id="ssh.guide">
     <title>SSH Quick-Start Guide</title>
 
     <procedure>
       <step>
 	<para>If you do not wish to type your password in every time
 	  you use &man.ssh.1;, and you use keys to
 	  authenticate, &man.ssh-agent.1; is there for your
 	  convenience.  If you want to use &man.ssh-agent.1;, make
 	  sure that you run it before running other applications.  X
 	  users, for example, usually do this from their
 	  <filename>.xsession</filename> or
 	  <filename>.xinitrc</filename>.  See &man.ssh-agent.1; for
 	  details.</para>
       </step>
 
       <step>
 	<para>Generate a key pair using &man.ssh-keygen.1;.  The key
 	  pair will wind up in your
 	  <filename>$HOME/.ssh/</filename>
 	  directory.</para>
 
 	<important>
 	  <para>Only <acronym>ECDSA</acronym>,
 	    <acronym>Ed25519</acronym> or <acronym>RSA</acronym> keys
 	    are supported.</para>
 	</important>
       </step>
 
       <step>
 	<para>Send your public key
 	  (<filename>$HOME/.ssh/id_ecdsa.pub</filename>,
 	  <filename>$HOME/.ssh/id_ed25519.pub</filename>, or
 	  <filename>$HOME/.ssh/id_rsa.pub</filename>)
 	  to the person setting you up as a committer so it can be put
 	  into
 	  <filename><replaceable>yourlogin</replaceable></filename>
 	  in
 	  <filename>/etc/ssh-keys/</filename> on
 	  <systemitem>freefall</systemitem>.</para>
       </step>
     </procedure>
 
     <para>Now &man.ssh-add.1; can be used for
       authentication once per session.  It prompts for
       the private key's pass phrase, and then stores it in the
       authentication agent (&man.ssh-agent.1;).  Use <command>ssh-add
 	-d</command> to remove keys stored in the agent.</para>
 
     <para>Test with a simple remote command: <command>ssh
 	freefall.FreeBSD.org ls /usr</command>.</para>
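 
     <para>As a rough illustration, the steps above might look like
       this from a Bourne-style shell.  The
       <acronym>Ed25519</acronym> key type is only an example choice
       among the supported types, and <command>csh</command> users
       would start the agent with <command>ssh-agent -c</command>
       instead of <command>ssh-agent -s</command>:</para>
 
     <screen>&prompt.user; <userinput>eval "$(ssh-agent -s)"</userinput>
 &prompt.user; <userinput>ssh-keygen -t ed25519</userinput>
 &prompt.user; <userinput>ssh-add</userinput>
 &prompt.user; <userinput>ssh-add -l</userinput>
 &prompt.user; <userinput>ssh freefall.FreeBSD.org ls /usr</userinput></screen>
 
     <para>Here <command>ssh-add -l</command> lists the keys
       currently held by the agent, which is a quick way to confirm
       that the pass phrase was accepted before testing the remote
       login.</para>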
 
     <para>For more information, see
       <package>security/openssh-portable</package>,
       &man.ssh.1;, &man.ssh-add.1;, &man.ssh-agent.1;,
       &man.ssh-keygen.1;, and &man.scp.1;.</para>
 
     <para>For information on adding, changing, or removing &man.ssh.1;
       keys, see <uri
 	xlink:href="https://wiki.freebsd.org/clusteradm/ssh-keys">this
 	article</uri>.</para>
   </sect1>
 
   <sect1 xml:id="coverity">
     <title>&coverity; Availability for &os; Committers</title>
 
     <para>All &os; developers can obtain access to
       <application>Coverity</application> analysis results of all &os;
       Project software.  Anyone interested in the results of the
       automated <application>Coverity</application> runs can sign
       up at <uri
 	xlink:href="http://scan.coverity.com/">Coverity
 	Scan</uri>.</para>
 
     <para>The &os; wiki includes a mini-guide for developers who are
       interested in working with the &coverity; analysis reports: <uri
 	xlink:href="https://wiki.freebsd.org/CoverityPrevent">https://wiki.freebsd.org/CoverityPrevent</uri>.
       Please note that this mini-guide is only readable by &os;
       developers, so if you cannot access this page, you will have to
       ask someone to add you to the appropriate Wiki access
       list.</para>
 
     <para>Finally, &os; developers who plan to use &coverity; are
       encouraged to ask for more details and usage information by
       posting questions to the &os; developers mailing list.</para>
   </sect1>
 
   <sect1 xml:id="rules">
     <title>The &os; Committers' Big List of Rules</title>
 
     <para>Everyone involved with the &os; project is expected to
       abide by the <emphasis>Code of Conduct</emphasis> available from
       <link xlink:href="&url.base;/internal/code-of-conduct.html"
       >https://www.FreeBSD.org/internal/code-of-conduct.html</link>.
       As committers, you form the public face of the project, and how
       you behave has a vital impact on the public perception of it.
       This guide expands on the parts of the
       <emphasis>Code of Conduct</emphasis> specific to
       committers.</para>
 
     <orderedlist>
       <listitem>
 	<para>Respect other committers.</para>
       </listitem>
 
       <listitem>
 	<para>Respect other contributors.</para>
       </listitem>
 
       <listitem>
 	<para>Discuss any significant change
 	  <emphasis>before</emphasis> committing.</para>
       </listitem>
 
       <listitem>
 	<para>Respect existing maintainers (if listed in the
 	  <varname>MAINTAINER</varname> field in
 	  <filename>Makefile</filename> or in
 	  <filename>MAINTAINERS</filename> in the top-level
 	  directory).</para>
       </listitem>
 
       <listitem>
 	<para>Any disputed change must be backed out pending
 	  resolution of the dispute if requested by a maintainer.
 	  Security related changes may override a maintainer's wishes
 	  at the Security Officer's discretion.</para>
       </listitem>
 
       <listitem>
 	<para>Changes go to &os.current; before &os.stable; unless
 	  specifically permitted by the release engineer or unless
 	  they are not applicable to &os.current;.  Any non-trivial or
 	  non-urgent change which is applicable should also be allowed
 	  to sit in &os.current; for at least 3 days before merging so
 	  that it can be given sufficient testing.  The release
 	  engineer has the same authority over the &os.stable; branch
 	  as outlined for the maintainer in rule #5.</para>
       </listitem>
 
       <listitem>
 	<para>Do not fight in public with other committers; it looks
 	  bad.</para>
       </listitem>
 
       <listitem>
 	<para>Respect all code freezes and read the
 	  <literal>committers</literal> and
 	  <literal>developers</literal> mailing lists in a timely
 	  manner so you know when a code freeze is in effect.</para>
       </listitem>
 
       <listitem>
 	<para>When in doubt on any procedure, ask first!</para>
       </listitem>
 
       <listitem>
 	<para>Test your changes before committing them.</para>
       </listitem>
 
       <listitem>
 	<para>Do not commit to contributed software without
 	  <emphasis>explicit</emphasis> approval from the respective
 	  maintainers.</para>
       </listitem>
     </orderedlist>
 
     <para>As noted, breaking some of these rules can be grounds for
       suspension or, upon repeated offense, permanent removal of
       commit privileges.  Individual members of core have the power to
       temporarily suspend commit privileges until core as a whole has
       the chance to review the issue.  In case of an
       <quote>emergency</quote> (a committer doing damage to the
       repository), a temporary suspension may also be done by the
       repository meisters.  Only a 2/3 majority of core has the
       authority to suspend commit privileges for longer than a week or
       to remove them permanently.  This rule does not exist to set
       core up as a bunch of cruel dictators who can dispose of
       committers as casually as empty soda cans, but to give the
       project a kind of safety fuse.  If someone is out of control, it
       is important to be able to deal with this immediately rather
       than be paralyzed by debate.  In all cases, a committer whose
       privileges are suspended or revoked is entitled to a
       <quote>hearing</quote> by core, the total duration of the
       suspension being determined at that time.  A committer whose
       privileges are suspended may also request a review of the
       decision after 30 days and every 30 days thereafter (unless the
       total suspension period is less than 30 days).  A committer
       whose privileges have been revoked entirely may request a review
       after a period of 6 months has elapsed.  This review policy is
       <emphasis>strictly informal</emphasis> and, in all cases, core
       reserves the right to either act on or disregard requests for
       review if they feel their original decision to be the right
       one.</para>
 
     <para>In all other aspects of project operation, core is a subset
       of committers and is bound by the
       <emphasis>same rules</emphasis>.  Being in core does not give
       anyone special dispensation to step outside any of the lines
       painted here; core's
       <quote>special powers</quote> only kick in when it acts as a
       group, not on an individual basis.  As individuals, the core
       team members are all committers first and core second.</para>
 
     <sect2>
       <title>Details</title>
 
       <orderedlist>
 	<listitem xml:id="respect">
 	  <para>Respect other committers.</para>
 
 	  <para>This means that you need to treat other committers as
 	    the peer-group developers that they are.  Despite our
 	    occasional attempts to prove the contrary, one does not
 	    get to be a committer by being stupid and nothing rankles
 	    more than being treated that way by one of your peers.
 	    Whether we always feel respect for one another or not (and
 	    everyone has off days), we still have to
 	    <emphasis>treat</emphasis> other committers with respect
 	    at all times, on public forums and in private
 	    email.</para>
 
 	  <para>Being able to work together long term is this
 	    project's greatest asset, one far more important than any
 	    set of changes to the code, and turning arguments about
 	    code into issues that affect our long-term ability to work
 	    harmoniously together is just not worth the trade-off by
 	    any conceivable stretch of the imagination.</para>
 
 	  <para>To comply with this rule, do not send email when you
 	    are angry or otherwise behave in a manner which is likely
 	    to strike others as needlessly confrontational.  First
 	    calm down, then think about how to communicate in the most
 	    effective fashion for convincing the other people that
 	    your side of the argument is correct.  Do not just blow off
 	    some steam so you can feel better in the short term at the
 	    cost of a long-term flame war.  Not only is this very bad
 	    <quote>energy economics</quote>, but repeated displays of
 	    public aggression which impair our ability to work well
 	    together will be dealt with severely by the project
 	    leadership and may result in suspension or termination of
 	    your commit privileges.  The project leadership will take
 	    into account both public and private communications
 	    brought before it.  It will not seek the disclosure of
 	    private communications, but it will take them into account
 	    if they are volunteered by the committers involved in the
 	    complaint.</para>
 
 	  <para>All of this is never an option which the project's
 	    leadership enjoys in the slightest, but unity comes first.
 	    No amount of code or good advice is worth trading that
 	    away.</para>
 	</listitem>
 
 	<listitem>
 	  <para>Respect other contributors.</para>
 
 	  <para>You were not always a committer.  At one time you were
 	    a contributor.  Remember that at all times.  Remember what
 	    it was like trying to get help and attention.  Do not
 	    forget that your work as a contributor was very important
 	    to you.  Remember what it was like.  Do not discourage,
 	    belittle, or demean contributors.  Treat them with
 	    respect.  They are our committers in waiting.  They are
 	    every bit as important to the project as committers.
 	    Their contributions are as valid and as important as your
 	    own.  After all, you made many contributions before you
 	    became a committer.  Always remember that.</para>
 
 	  <para>Consider the points raised under
 	    <xref linkend="respect"/> and apply them also to
 	    contributors.</para>
 	</listitem>
 
 	<listitem>
 	  <para>Discuss any significant change
 	    <emphasis>before</emphasis> committing.</para>
 
 	  <para>The repository is not where changes are initially
 	    submitted for correctness or argued over; that happens
 	    first on the mailing lists or through the Phabricator
 	    service.  The commit will only happen once something
 	    resembling consensus has been reached.  This does not mean
 	    that permission is required before correcting every
 	    obvious syntax error or manual page misspelling, just that
 	    it is good to develop a feel for when a proposed change is
 	    not quite such a no-brainer and requires some feedback
 	    first.  People really do not mind sweeping changes if the
 	    result is something clearly better than what they had
 	    before; they just do not like being
 	    <emphasis>surprised</emphasis> by those changes.  The very
 	    best way of making sure that things are on the right track
 	    is to have code reviewed by one or more other
 	    committers.</para>
 
 	  <para>When in doubt, ask for review!</para>
 	</listitem>
 
 	<listitem>
 	  <para>Respect existing maintainers if listed.</para>
 
 	  <para>Many parts of &os; are not <quote>owned</quote> in
 	    the sense that any specific individual will jump up and
 	    yell if you commit a change to <quote>their</quote> area,
 	    but it still pays to check first.  One convention we use
 	    is to put a maintainer line in the
 	    <filename>Makefile</filename> for any package or subtree
 	    which is being actively maintained by one or more people;
 	    see <link
 	      xlink:href="&url.books.developers-handbook;/policies.html">https://www.FreeBSD.org/doc/en_US.ISO8859-1/books/developers-handbook/policies.html</link>
 	    for documentation on this.  Where sections of code have
 	    several maintainers, commits to affected areas by one
 	    maintainer need to be reviewed by at least one other
 	    maintainer.  In cases where the
 	    <quote>maintainer-ship</quote> of something is not clear,
 	    look at the repository logs for the files
 	    in question and see if someone has been working recently
 	    or predominantly in that area.</para>
 	</listitem>
 
 	<listitem>
 	  <para>Any disputed change must be backed out pending
 	    resolution of the dispute if requested by a maintainer.
 	    Security related changes may override a maintainer's
 	    wishes at the Security Officer's discretion.</para>
 
 	  <para>This may be hard to swallow in times of conflict (when
 	    each side is convinced that they are in the right, of
 	    course) but a version control system makes it unnecessary
 	    to have an ongoing dispute raging when it is far easier to
 	    simply reverse the disputed change, get everyone calmed
 	    down again and then try to figure out what is the best way
 	    to proceed.  If the change turns out to be the best thing
 	    after all, it can be easily brought back.  If it turns out
 	    not to be, then the users did not have to live with the
 	    bogus change in the tree while everyone was busily
 	    debating its merits.  People <emphasis>very</emphasis>
 	    rarely call for back-outs in the repository since
 	    discussion generally exposes bad or controversial changes
 	    before the commit even happens, but on such rare occasions
 	    the back-out should be done without argument so that we
 	    can get immediately on to the topic of figuring out
 	    whether it was bogus or not.</para>
 	</listitem>
 
 	<listitem>
 	  <para>Changes go to &os.current; before &os.stable; unless
 	    specifically permitted by the release engineer or unless
 	    they are not applicable to &os.current;.  Any non-trivial
 	    or non-urgent change which is applicable should also be
 	    allowed to sit in &os.current; for at least 3 days before
 	    merging so that it can be given sufficient testing.  The
 	    release engineer has the same authority over the
 	    &os.stable; branch as outlined in rule #5.</para>
 
 	  <para>This is another <quote>do not argue about it</quote>
 	    issue since it is the release engineer who is ultimately
 	    responsible (and gets beaten up) if a change turns out to
 	    be bad.  Please respect this and give the release engineer
 	    your full cooperation when it comes to the &os.stable;
 	    branch.  The management of &os.stable; may frequently seem
 	    to be overly conservative to the casual observer, but also
 	    bear in mind the fact that conservatism is supposed to be
 	    the hallmark of &os.stable; and different rules apply
 	    there than in &os.current;.  There is also really no point
 	    in having &os.current; be a testing ground if changes are
 	    merged over to &os.stable; immediately.  Changes need a
 	    chance to be tested by the &os.current; developers, so
 	    allow some time to elapse before merging unless the
 	    &os.stable; fix is critical, time sensitive or so obvious
 	    as to make further testing unnecessary (spelling fixes to
 	    manual pages, obvious bug/typo fixes, etc.).  In other
 	    words, apply common sense.</para>
 
 	  <para>Changes to the security branches (for example,
 	    <literal>releng/9.3</literal>) must be approved by a
 	    member of the &a.security-officer;, or in some cases, by a
 	    member of the &a.re;.</para>
 	</listitem>
 
 	<listitem>
 	  <para>Do not fight in public with other committers; it looks
 	    bad.</para>
 
 	  <para>This project has a public image to uphold and that
 	    image is very important to all of us, especially if we are
 	    to continue to attract new members.  There will be
 	    occasions when, despite everyone's very best attempts at
 	    self-control, tempers are lost and angry words are
 	    exchanged.  The best thing that can be done in such cases
 	    is to minimize the effects of this until everyone has
 	    cooled back down.  Do not air
 	    angry words in public and do not forward private
 	    correspondence or other private communications to public
 	    mailing lists, mail aliases, instant messaging channels or
 	    social media sites.  What people say one-to-one is often
 	    much less sugar-coated than what they would say in public,
 	    and such communications therefore have no place there;
 	    they only serve to inflame an already bad situation.  If
 	    the person sending a flame-o-gram at least had the
 	    grace to send it privately, then have the grace to keep it
 	    private yourself.  If you feel you are being unfairly
 	    treated by another developer, and it is causing you
 	    anguish, bring the matter up with core rather than taking
 	    it public.  Core will do its best to play peacemaker and
 	    get things back to sanity.  In cases where the dispute
 	    involves a change to the codebase and the participants do
 	    not appear to be reaching an amicable agreement, core may
 	    appoint a mutually-agreeable third party to resolve the
 	    dispute.  All parties involved must then agree to be bound
 	    by the decision reached by this third party.</para>
 	</listitem>
 
 	<listitem>
 	  <para>Respect all code freezes and read the
 	    <literal>committers</literal> and
 	    <literal>developers</literal> mailing lists on a timely
 	    basis so you know when a code freeze is in effect.</para>
 
 	  <para>Committing unapproved changes during a code freeze is
 	    a really big mistake and committers are expected to keep
 	    up-to-date on what is going on before jumping in after a
 	    long absence and committing 10 megabytes worth of
 	    accumulated stuff.  People who abuse this on a regular
 	    basis will have their commit privileges suspended until
 	    they get back from the &os; Happy Reeducation Camp we
 	    run in Greenland.</para>
 	</listitem>
 
 	<listitem>
 	  <para>When in doubt on any procedure, ask first!</para>
 
 	  <para>Many mistakes are made because someone is in a hurry
 	    and just assumes they know the right way of doing
 	    something.  If you have not done it before, chances are
 	    good that you do not actually know the way we do things
 	    and really need to ask first or you are going to
 	    completely embarrass yourself in public.  There is no
 	    shame in asking
 	    <quote>how in the heck do I do this?</quote>  We already
 	    know you are an intelligent person; otherwise, you would
 	    not be a committer.</para>
 	</listitem>
 
 	<listitem>
 	  <para>Test your changes before committing them.</para>
 
 	  <para>This may sound obvious, but if it really were so
 	    obvious then we probably would not see so many cases of
 	    people clearly not doing this.  If your changes are to the
 	    kernel, make sure you can still compile both GENERIC and
 	    LINT.  If your changes are anywhere else, make sure you
 	    can still make world.  If your changes are to a branch,
 	    make sure your testing occurs with a machine which is
 	    running that code.  If you have a change which also may
 	    break another architecture, be sure to test on all
 	    supported architectures.  Please refer to the
 	    <link xlink:href="https://www.FreeBSD.org/internal/">&os;
 	      Internal Page</link> for a list of available resources.
 	    As other architectures are added to the &os; supported
 	    platforms list, the appropriate shared testing resources
 	    will be made available.</para>
 	</listitem>
 
 	<listitem>
 	  <para>Do not commit to contributed software without
 	    <emphasis>explicit</emphasis> approval from the respective
 	    maintainers.</para>
 
 	  <para>Contributed software is anything under the
 	    <filename>src/contrib</filename>,
 	    <filename>src/crypto</filename>, or
 	    <filename>src/sys/contrib</filename> trees.</para>
 
 	  <para>The trees mentioned above are for contributed software
 	    usually imported onto a vendor branch.  Committing
 	    something there may cause unnecessary headaches
 	    when importing newer versions of the software.  As a
 	    general rule, consider sending patches upstream to the
 	    vendor.  Patches may be committed to &os; first with
 	    permission of the maintainer.</para>
 
 	  <para>Reasons for modifying upstream software range from
 	    wanting strict control over a tightly coupled dependency
 	    to lack of portability in the canonical repository's
 	    distribution of their code.  Regardless of the reason,
 	    effort to minimize the maintenance burden of a fork is
 	    helpful to fellow maintainers.  Avoid committing trivial
 	    or cosmetic changes to files since they make every merge
 	    thereafter more difficult: such patches need to be
 	    manually re-verified on every import.</para>
 
 	  <para>If a particular piece of software lacks a maintainer,
 	    you are encouraged to take up ownership.  If you are unsure
 	    of the current maintainership, email &a.arch; and
 	    ask.</para>
 	</listitem>
       </orderedlist>
     </sect2>
 
     <sect2>
       <title>Policy on Multiple Architectures</title>
 
       <para>&os; has added several new architecture ports during
 	recent release cycles and is truly no longer an &i386; centric
 	operating system.  In an effort to make it easier to keep
 	&os; portable across the platforms we support, core has
 	developed this mandate:</para>
 
       <blockquote>
 	<para>Our 32-bit reference platform is &arch.i386;, and our
 	  64-bit reference platform is &arch.amd64;.  Major design
 	  work (including major API and ABI changes) must prove
 	  itself on at least one 32-bit and at least one 64-bit
 	  platform, preferably the primary reference platforms,
 	  before it may be committed to the source tree.</para>
       </blockquote>
 
       <para>The &arch.i386; and &arch.amd64; platforms were chosen
 	due to being more readily available to developers and as
 	representatives of more diverse processor and system designs:
 	big versus little endian, register file versus register stack,
 	different DMA and cache implementations, hardware page tables
 	versus software TLB management, etc.</para>
 
       <para>We will continue to re-evaluate this policy as cost and
 	availability of the 64-bit platforms change.</para>
 
       <para>Developers should also be aware of our Tier Policy for
 	the long term support of hardware architectures.  The rules
 	here are intended to provide guidance during the development
 	process, and are distinct from the requirements for features
 	and architectures listed in that section.  The Tier rules for
 	feature support on architectures at release-time are more
 	strict than the rules for changes during the development
 	process.</para>
     </sect2>
 
     <sect2>
       <title>Other Suggestions</title>
 
       <para>When committing documentation changes, use a spell checker
 	before committing.  For all XML docs, verify that the
 	formatting directives are correct by running
 	<command>make lint</command> and
 	<package>textproc/igor</package>.</para>
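 
       <para>As a minimal sketch, the checks could be run like this
 	from the directory that contains the document.  The file name
 	<filename>article.xml</filename> is only an example, and
 	<package>textproc/igor</package> is assumed to be installed
 	already:</para>
 
       <screen>&prompt.user; <userinput>make lint</userinput>
 &prompt.user; <userinput>igor article.xml</userinput></screen>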
 
       <para>For manual pages, run <package>sysutils/manck</package>
 	and <package>textproc/igor</package>
 	over the manual page to verify all of the cross
 	references and file references are correct and that the man
 	page has all of the appropriate <varname>MLINK</varname>s
 	installed.</para>
 
       <para>Do not mix style fixes with new functionality.  A style
 	fix is any change which does not modify the functionality of
 	the code.  Mixing the changes obfuscates the functionality
 	change when asking for differences between revisions, which
 	can hide any new bugs.  Do not include whitespace changes with
 	content changes in commits to <filename>doc/</filename>.
 	The extra clutter in the diffs
 	makes the translators' job much more difficult.  Instead, make
 	any style or whitespace changes in separate commits that are
 	clearly labeled as such in the commit message.</para>
     </sect2>
 
     <sect2>
       <title>Deprecating Features</title>
 
       <para>When it is necessary to remove functionality from software
 	in the base system, follow these guidelines
 	whenever possible:</para>
 
       <orderedlist>
 	<listitem>
 	  <para>Mention is made in the manual page and possibly the
 	    release notes that the option, utility, or interface is
 	    deprecated.  Use of the deprecated feature generates a
 	    warning.</para>
 	</listitem>
 
 	<listitem>
 	  <para>The option, utility, or interface is preserved until
 	    the next major (point zero) release.</para>
 	</listitem>
 
 	<listitem>
 	  <para>The option, utility, or interface is removed and no
 	    longer documented.  It is now obsolete.  It is also
 	    generally a good idea to note its removal in the release
 	    notes.</para>
 	</listitem>
       </orderedlist>
     </sect2>
 
     <sect2>
       <title>Privacy and Confidentiality</title>
 
       <orderedlist>
 	<listitem>
 	  <para>Most &os; business is done in public.</para>
 
 	  <para>&os; is an <emphasis>open</emphasis> project, which
 	    means that not only can anyone use the source code, but
 	    that most of the development process is open to public
 	    scrutiny.</para>
 	</listitem>
 
 	<listitem>
 	  <para>Certain sensitive matters must remain private or
 	    held under embargo.</para>
 
 	  <para>There unfortunately cannot be complete transparency.
 	    As a &os; developer you will have a certain degree of
 	    privileged access to information.  Consequently you are
 	    expected to respect certain requirements for
 	    confidentiality.  Sometimes the need for confidentiality
 	    comes from external collaborators or has a specific time
 	    limit.  Mostly though, it is a matter of not releasing
 	    private communications.</para>
 	</listitem>
 
 	<listitem>
 	  <para>The Security Officer has sole control over the
 	    release of security advisories.</para>
 
 	  <para>Where there are security problems that affect many
 	    different operating systems, &os; frequently depends on
 	    early access to be able to prepare advisories for
 	    coordinated release.  Unless &os; developers can be
 	    trusted to maintain security, such early access will not
 	    be made available.  The Security Officer is responsible
 	    for controlling pre-release access to information about
 	    vulnerabilities, and for timing the release of all
 	    advisories.  The Security Officer may request help, under
 	    condition of confidentiality, from any developer with
 	    relevant knowledge to prepare security fixes.</para>
 	</listitem>
 
 	<listitem>
 	  <para>Communications with Core are kept confidential for as
 	    long as necessary.</para>
 
 	  <para>Communications to core will initially be treated as
 	    confidential.  Eventually however, most of Core's business
 	    will be summarized into the monthly or quarterly core
 	    reports.  Care will be taken to avoid publicizing any
 	    sensitive details.  Records of some particularly sensitive
 	    subjects may not be reported on at all and will be
 	    retained only in Core's private archives.</para>
 	</listitem>
 
 	<listitem>
 	  <para>Non-disclosure Agreements may be required for access
 	    to certain commercially sensitive data.</para>
 
 	  <para>Access to certain commercially sensitive data may
 	    only be available under a Non-Disclosure Agreement.  The
 	    FreeBSD Foundation legal staff must be consulted before
 	    any binding agreements are entered into.</para>
 	</listitem>
 
 	<listitem>
 	  <para>Private communications must not be made
 	    public without permission.</para>
 
 	  <para>Beyond the specific requirements above there is a
 	    general expectation not to publish private communications
 	    between developers without the consent of all parties
 	    involved.  Ask permission before forwarding a message to
 	    a public mailing list, or posting it to a forum or website
 	    that can be accessed by anyone other than the original
 	    correspondents.</para>
 	</listitem>
 
 	<listitem>
 	  <para>Communications on project-only or restricted access
 	    channels must be kept private.</para>
 
 	  <para>As with personal communications, certain internal
 	    communications channels, including &os; committer-only
 	    mailing lists and restricted-access IRC channels, are
 	    considered private communications.  Permission is
 	    required to publish material from these
 	    sources.</para>
 	</listitem>
 
 	<listitem>
 	  <para>Core may approve publication.</para>
 
 	  <para>Where it is impractical to obtain permission due to
 	    the number of correspondents or where permission to
 	    publish is unreasonably withheld, Core may approve release
 	    of such private matters that merit more general
 	    publication.</para>
 	</listitem>
       </orderedlist>
     </sect2>
   </sect1>
 
   <sect1 xml:id="archs">
     <title>Support for Multiple Architectures</title>
 
     <para>&os; is a highly portable operating system intended to
       function on many different types of hardware architectures.
       Maintaining clean separation of Machine Dependent (MD) and
       Machine Independent (MI) code, as well as minimizing MD code, is
       an important part of our strategy to remain agile with regard
       to current hardware trends.  Each new hardware architecture
       supported by &os; adds substantially to the cost of code
       maintenance, toolchain support, and release engineering.  It
       also dramatically increases the cost of effective testing of
       kernel changes.  As such, there is strong motivation to
       differentiate between classes of support for various
       architectures while remaining strong in a few key architectures
       that are seen as the &os; <quote>target audience</quote>.</para>
 
     <sect2>
       <title>Statement of General Intent</title>
 
       <para>The &os; Project targets <quote>production quality commercial
 	off-the-shelf (COTS) workstation, server, and high-end
 	embedded systems</quote>.  By retaining a focus on a narrow set of
 	architectures of interest in these environments, the &os;
 	Project is able to maintain high levels of quality, stability,
 	and performance, as well as minimize the load on various
 	support teams on the project, such as the ports team,
 	documentation team, security officer, and release engineering
 	teams.  Diversity in hardware support broadens the options for
 	&os; consumers by offering new features and usage
 	opportunities, but these benefits must always
 	be carefully considered in terms of the real-world maintenance
 	cost associated with additional platform support.</para>
 
       <para>The &os; Project differentiates platform targets into four
 	tiers.  Each tier includes a list of guarantees consumers may
 	rely on as well as obligations by the Project and developers
 	to fulfill those guarantees.  These lists define the minimum
 	guarantees for each tier.  The Project and developers may
 	provide additional levels of support beyond the minimum
 	guarantees for a given tier, but such additional support is
 	not guaranteed.  Each platform target is assigned to a
 	specific tier for each stable branch.  As a result, a platform
 	target might be assigned to different tiers on concurrent
 	stable branches.</para>
     </sect2>
 
     <sect2>
       <title>Platform Targets</title>
 
       <para>Support for a hardware platform consists of two
 	components: kernel support and userland Application Binary
 	Interfaces (ABIs).  Kernel platform support includes things
 	needed to run a &os; kernel on a hardware platform such as
 	machine-dependent virtual memory management and device
 	drivers.  A userland ABI specifies an interface for user
 	processes to interact with a &os; kernel and base system
 	libraries.  A userland ABI includes system call interfaces,
 	the layout and semantics of public data structures, and the
 	layout and semantics of arguments passed to subroutines.  Some
 	components of an ABI may be defined by specifications such as
 	the layout of C++ exception objects or calling conventions for
 	C functions.</para>
 
       <para>A &os; kernel also uses an ABI (sometimes referred to as
 	the Kernel Binary Interface (KBI)) which includes the
 	semantics and layouts of public data structures and the layout
 	and semantics of arguments to public functions within the
 	kernel itself.</para>
 
       <para>A &os; kernel may support multiple userland ABIs.  For
 	example, &os;'s amd64 kernel supports &os; amd64 and i386
 	userland ABIs as well as Linux x86_64 and i386 userland ABIs.
 	A &os; kernel should support a <quote>native</quote> ABI as
 	the default ABI.  The <quote>native</quote> ABI generally
 	shares certain properties with the kernel ABI such as the C
 	calling convention, sizes of basic types, etc.</para>
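 
       <para>As a rough illustration, the native architecture and
 	the userland architectures accepted by a running kernel can
 	usually be inspected from the shell.  This is only a sketch:
 	it assumes a reasonably recent &os; system that provides the
 	<varname>kern.supported_archs</varname> sysctl, and the
 	output depends on the kernel in use.</para>
 
       <screen>&prompt.user; <userinput>uname -p</userinput>
 &prompt.user; <userinput>sysctl kern.supported_archs</userinput></screen>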
 
       <para>Tiers are defined for both kernels and userland ABIs.  In
 	the common case, a platform's kernel and &os; ABIs are
 	assigned to the same tier.</para>
     </sect2>
 
     <sect2>
       <title>Tier 1: Fully-Supported Architectures</title>
 
       <para>Tier 1 platforms are the most mature &os; platforms.
 	They are supported by the security officer, release
 	engineering, and port management teams.  Tier 1 architectures
 	are expected to be Production Quality with respect to all
 	aspects of the &os; operating system, including installation
 	and development environments.</para>
 
       <para>The &os; Project provides the following guarantees to
 	consumers of Tier 1 platforms:</para>
 
       <itemizedlist>
 	<listitem>
 	  <para>Official &os; release images will be provided by the
 	    release engineering team.</para>
 	</listitem>
 	<listitem>
 	  <para>Binary updates and source patches for Security
 	    Advisories and Errata Notices will be provided for
 	    supported releases.</para>
 	</listitem>
 	<listitem>
 	  <para>Source patches for Security Advisories will be
 	    provided for supported branches.</para>
 	</listitem>
 	<listitem>
 	  <para>Binary updates and source patches for cross-platform
 	    Security Advisories will typically be provided at the time
 	    of the announcement.</para>
 	</listitem>
 	<listitem>
 	  <para>Changes to userland ABIs will generally include
 	    compatibility shims to ensure correct operation of
 	    binaries compiled against any stable branch where the
 	    platform is Tier 1.  These shims might not be enabled in
 	    the default install.  If compatibility shims are not
 	    provided for an ABI change, the lack of shims will be
 	    clearly documented in the release notes.</para>
 	</listitem>
 	<listitem>
 	  <para>Changes to certain portions of the kernel ABI will
 	    include compatibility shims to ensure correct operation of
 	    kernel modules compiled against the oldest supported
 	    release on the branch.  Note that not all parts of the
 	    kernel ABI are protected.</para>
 	</listitem>
 	<listitem>
 	  <para>Official binary packages for third party software will
 	    be provided by the ports team.  For embedded
 	    architectures, these packages may be cross-built from a
 	    different architecture.</para>
 	</listitem>
 	<listitem>
 	  <para>Most relevant ports should either build or have the
 	    appropriate filters to prevent inappropriate ones from
 	    building.</para>
 	</listitem>
 	<listitem>
 	  <para>New features which are not inherently
 	    platform-specific will be fully functional on all Tier 1
 	    architectures.</para>
 	</listitem>
 	<listitem>
 	  <para>Features and compatibility shims used by binaries
 	    compiled against older stable branches may be removed in
 	    newer major versions.  Such removals will be clearly
 	    documented in the release notes.</para>
 	</listitem>
 	<listitem>
 	  <para>Tier 1 platforms should be fully documented.  Basic
 	    operations will be documented in the &os; Handbook.</para>
 	</listitem>
 	<listitem>
 	  <para>Tier 1 platforms will be included in the source
 	    tree.</para>
 	</listitem>
 	<listitem>
 	  <para>Tier 1 platforms should be self-hosting either via the
 	    in-tree toolchain or an external toolchain.  If an
 	    external toolchain is required, official binary packages
 	    for an external toolchain will be provided.</para>
 	</listitem>
       </itemizedlist>
 
       <para>To maintain maturity of Tier 1 platforms, the &os; Project
 	will maintain the following resources to support
 	development:</para>
 
       <itemizedlist>
 	<listitem>
 	  <para>Build and test automation support either in the
 	    FreeBSD.org cluster or some other location easily
 	    available for all developers.  Embedded platforms may
 	    substitute an emulator available in the FreeBSD.org
 	    cluster for actual hardware.</para>
 	</listitem>
 	<listitem>
 	  <para>Inclusion in the <userinput>make universe</userinput>
 	    and <userinput>make tinderbox</userinput> targets.</para>
 	</listitem>
 	<listitem>
 	  <para>Dedicated hardware in one of the &os; clusters for
 	    package building (either natively or via
 	    qemu-user).</para>
 	</listitem>
       </itemizedlist>
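 
       <para>As a sketch of how a developer might exercise these
 	targets before committing a wide-reaching change, the
 	commands below are run from the top of a source tree.  The
 	path <filename>/usr/src</filename> is an assumption, and a
 	full run builds every supported platform, so it can take many
 	hours.</para>
 
       <screen>&prompt.user; <userinput>cd /usr/src</userinput>
 &prompt.user; <userinput>make targets</userinput>
 &prompt.user; <userinput>make universe</userinput>
 &prompt.user; <userinput>make tinderbox</userinput></screen>
 
       <para>Here <userinput>make targets</userinput> lists the
 	platform targets known to the build, while
 	<userinput>make tinderbox</userinput> performs the same
 	builds as <userinput>make universe</userinput> but also
 	reports the targets that failed.</para>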
 
       <para>Collectively, developers are required to provide the
 	following to maintain the Tier 1 status of a platform:</para>
 
       <itemizedlist>
 	<listitem>
 	  <para>Changes to the source tree should not knowingly break
 	    the build of a Tier 1 platform.</para>
 	</listitem>
 	<listitem>
 	  <para>Tier 1 architectures must have a mature, healthy
 	    ecosystem of users and active developers.</para>
 	</listitem>
 	<listitem>
 	  <para>Developers should be able to build packages on
 	    commonly available, non-embedded Tier 1 systems.  This can
 	    mean either native builds if non-embedded systems are
 	    commonly available for the platform in question, or it can
 	    mean cross-builds hosted on some other Tier 1
 	    architecture.</para>
 	</listitem>
 	<listitem>
 	  <para>Changes cannot break the userland ABI.  If an ABI
 	    change is required, ABI compatibility for existing
 	    binaries should be provided via use of symbol versioning
 	    or shared library version bumps.</para>
 	</listitem>
 	<listitem>
 	  <para>Changes merged to stable branches cannot break the
 	    protected portions of the kernel ABI.  If a kernel ABI
 	    change is required, the change should be modified to
 	    preserve functionality of existing kernel modules.</para>
 	</listitem>
       </itemizedlist>
     </sect2>
 
     <sect2>
       <title>Tier 2: Developmental and Niche Architectures</title>
 
       <para>Tier 2 platforms are functional, but less mature &os;
 	platforms.  They are not supported by the security officer,
 	release engineering, and port management teams.</para>
 
       <para>Tier 2 platforms may be Tier 1 platform candidates that
 	are still under active development.  Architectures reaching
 	end of life may also be moved from Tier 1 status to Tier 2
 	status as the availability of resources to continue to
 	maintain the system in a Production Quality state diminishes.
 	Well-supported niche architectures may also be Tier 2.</para>
 
       <para>The &os; Project provides the following guarantees to
 	consumers of Tier 2 platforms:</para>
 
       <itemizedlist>
 	<listitem>
 	  <para>The ports infrastructure should include basic support
 	    for Tier 2 architectures sufficient to support building
 	    ports and packages.  This includes support for basic
 	    packages such as ports-mgmt/pkg, but there is no guarantee
 	    that arbitrary ports will be buildable or
 	    functional.</para>
 	</listitem>
 	<listitem>
 	  <para>New features which are not inherently
 	    platform-specific should be feasible on all Tier 2
 	    architectures, even if they are not yet implemented
 	    there.</para>
 	</listitem>
 	<listitem>
 	  <para>Tier 2 platforms will be included in the source
 	    tree.</para>
 	</listitem>
 	<listitem>
 	  <para>Tier 2 platforms should be self-hosting either via the
 	    in-tree toolchain or an external toolchain.  If an
 	    external toolchain is required, official binary packages
 	    for an external toolchain will be provided.</para>
 	</listitem>
 	<listitem>
 	  <para>Tier 2 platforms should provide functional kernels and
 	    userlands even if an official release distribution is not
 	    provided.</para>
 	</listitem>
       </itemizedlist>
 
       <para>To maintain maturity of Tier 2 platforms, the &os; Project
 	will maintain the following resources to support
 	development:</para>
 
       <itemizedlist>
 	<listitem>
 	  <para>Inclusion in the <userinput>make universe</userinput>
 	    and <userinput>make tinderbox</userinput> targets.</para>
 	</listitem>
       </itemizedlist>
 
       <para>Collectively, developers are required to provide the
 	following to maintain the Tier 2 status of a platform:</para>
 
       <itemizedlist>
 	<listitem>
 	  <para>Changes to the source tree should not knowingly break
 	    the build of a Tier 2 platform.</para>
 	</listitem>
 	<listitem>
 	  <para>Tier 2 architectures must have an active ecosystem of
 	    users and developers.</para>
 	</listitem>
 	<listitem>
 	  <para>While changes are permitted to break the userland ABI,
 	    the ABI should not be broken gratuitously.  Significant
 	    userland ABI changes should be restricted to major
 	    versions.</para>
 	</listitem>
 	<listitem>
 	  <para>New features that are not yet implemented on Tier 2
 	    architectures should provide a means of disabling them on
 	    those architectures.</para>
 	</listitem>
       </itemizedlist>
     </sect2>
 
     <sect2>
       <title>Tier 3: Experimental Architectures</title>
 
       <para>Tier 3 platforms have at least partial &os; support.  They
 	are <emphasis>not</emphasis> supported by the security
 	officer, release engineering, and port management
 	teams.</para>
 
       <para>Tier 3 platforms are architectures in the early stages of
 	development, for non-mainstream hardware platforms, or which
 	are considered legacy systems unlikely to see broad future
 	use.  Initial support for Tier 3 platforms may exist in a
 	separate repository rather than the main source
 	repository.</para>
 
       <para>The &os; Project provides no guarantees to consumers of
 	Tier 3 platforms and is not committed to maintaining resources
 	to support development.  Tier 3 platforms may not always be
 	buildable, nor are any kernel or userland ABIs considered
 	stable.</para>
     </sect2>
 
     <sect2>
       <title>Tier 4: Unsupported Architectures</title>
 
       <para>Tier 4 platforms are not supported in any form by the
 	project.</para>
 
       <para>All systems not otherwise classified are Tier 4 systems.
 	When a platform transitions to Tier 4, all support for the
 	platform is removed from the source and ports trees.  Note
 	that ports support should remain as long as the platform is
 	supported in a branch supported by ports.</para>
     </sect2>
 
     <sect2>
       <title>Policy on Changing the Tier of an Architecture</title>
 
       <para>Systems may only be moved from one tier to another by
 	approval of the &os; Core Team, which shall make that decision
 	in collaboration with the Security Officer, Release
 	Engineering, and ports management teams.  For a platform to be
 	promoted to a higher tier, any missing support guarantees must
 	be satisfied before the promotion is completed.</para>
     </sect2>
   </sect1>
 
   <sect1 xml:id="ports">
     <title>Ports Specific FAQ</title>
 
     <qandaset>
       <qandadiv xml:id="ports-qa-adding">
 	<title>Adding a New Port</title>
 
 	<qandaentry xml:id="ports-qa-add-new">
 	  <question>
 	    <para>How do I add a new port?</para>
 	  </question>
 
 	  <answer>
 	    <para>First, please read the section about repository
 	      copies.</para>
 
 	    <para>The easiest way to add a new port is the
 	      <command>addport</command> script located in the
 	      <filename>ports/Tools/scripts</filename> directory.  It
 	      adds a port from the directory specified, determining
 	      the category automatically from the port
 	      <filename>Makefile</filename>.  It also adds an entry to
 	      the port's category <filename>Makefile</filename>.  It
 	      was written by &a.mharo.email;, &a.will.email;, and
 	      &a.garga.email;.  When sending questions about this
 	      script to the &a.ports;, please also CC &a.crees.email;,
 	      the current maintainer.</para>
 	  </answer>
 	</qandaentry>
 
 	<qandaentry xml:id="ports-qa-add-new-extra">
 	  <question>
 	    <para>Any other things I need to know when I add a new
 	      port?</para>
 	  </question>
 
 	  <answer>
 	    <para>Check the port to make sure it compiles and
 	      packages correctly.  This is the recommended
 	      sequence:</para>
 
 	    <screen>&prompt.root; <userinput>make install</userinput>
 &prompt.root; <userinput>make package</userinput>
 &prompt.root; <userinput>make deinstall</userinput>
 &prompt.root; <userinput>pkg add <replaceable>package you built above</replaceable></userinput>
 &prompt.root; <userinput>make deinstall</userinput>
 &prompt.root; <userinput>make reinstall</userinput>
 &prompt.root; <userinput>make package</userinput></screen>
 
 	    <para>The <link
 		xlink:href="&url.books.porters-handbook;/index.html">Porters
 		Handbook</link> contains more detailed
 	      instructions.</para>
 
 	    <para>Use &man.portlint.1; to check the syntax of the
 	      port.  You do not necessarily have to eliminate all
 	      warnings but make sure you have fixed the simple
 	      ones.</para>
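 
 	    <para>A typical invocation, run from the port's directory,
 	      might look like the following sketch.  The directory
 	      names are placeholders, and the <option>-A</option> flag
 	      (enable the additional checks) is taken from
 	      &man.portlint.1;; consult the manual page for the exact
 	      meaning of each flag in the installed version.</para>
 
 	    <screen>&prompt.user; <userinput>cd /usr/ports/<replaceable>category</replaceable>/<replaceable>portname</replaceable></userinput>
 &prompt.user; <userinput>portlint -A</userinput></screen>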
 
 	    <para>If the port came from a submitter who has not
 	      contributed to the Project before, add that person's
 	      name to the <link
 		xlink:href="&url.articles.contributors;/contrib-additional.html">Additional
 		Contributors</link> section of the &os;
 	      Contributors List.</para>
 
 	    <para>Close the PR if the port came in as a PR.  To close
 	      a PR, change the state to <literal>Issue
 		Resolved</literal> and set the resolution to
 	      <literal>Fixed</literal>.</para>
 	  </answer>
 	</qandaentry>
       </qandadiv>
 
       <qandadiv xml:id="ports-qa-removing">
 	<title>Removing an Existing Port</title>
 
 	<qandaentry xml:id="ports-qa-remove-one">
 	  <question>
 	    <para>How do I remove an existing port?</para>
 	  </question>
 
 	  <answer>
 	    <para>First, please read the section about repository
 	      copies.  Before you remove the port, you have to verify
 	      there are no other ports depending on it.</para>
 
 	    <itemizedlist>
 	      <listitem>
 		<para>Make sure there is no dependency on the port
 		  in the ports collection:</para>
 
 		<itemizedlist>
 		  <listitem>
 		    <para>The port's PKGNAME appears in exactly
 		      one line in a recent INDEX file.</para>
 		  </listitem>
 
 		  <listitem>
 		    <para>No other port contains any reference
 		      to the port's directory or PKGNAME in its
 		      Makefile.</para>
 
 		    <tip>
 		      <para>When using <application>Git</application>,
 			consider using <command>git grep</command>, which
 			is much faster than <command>grep
 			  -r</command>.</para>
 		    </tip>
 		  </listitem>
 		</itemizedlist>
 	      </listitem>
 
 	      <listitem>
 		<para>Then, remove the port:</para>
 
 		<procedure>
 		  <step>
 		    <para>Remove the port's files and directory with
 		      <command>svn remove</command>.</para>
 		  </step>
 
 		  <step>
 		    <para>Remove the <varname>SUBDIR</varname> listing
 		      of the port in the parent directory
 		      <filename>Makefile</filename>.</para>
 		  </step>
 
 		  <step>
 		    <para>Add an entry to
 		      <filename>ports/MOVED</filename>.</para>
 		  </step>
 
 		  <step>
 		    <para>Search for entries in
 		      <filename>ports/security/vuxml/vuln.xml</filename>
 		      and adjust them accordingly.  In particular,
 		      check for previous packages with the new name
 		      whose version range could include the new
 		      port.</para>
 		  </step>
 
 		  <step>
 		    <para>Remove the port from
 		      <filename>ports/LEGAL</filename> if it is
 		      there.</para>
 		  </step>
 		</procedure>
 	      </listitem>
 	    </itemizedlist>
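 
 	    <para>The checks and the first removal step above can be
 	      sketched as follows.  This is an illustration only:
 	      <replaceable>category</replaceable>,
 	      <replaceable>portname</replaceable>, and the
 	      <filename>INDEX-<replaceable>N</replaceable></filename>
 	      file name are placeholders, and the search patterns may
 	      need tightening so that they do not match ports with
 	      similar names.  The first <command>grep</command>
 	      should count a single matching line, and the second
 	      should report no hits outside the port's own
 	      directory.</para>
 
 	    <screen>&prompt.user; <userinput>cd /usr/ports</userinput>
 &prompt.user; <userinput>grep -c '<replaceable>portname</replaceable>' INDEX-<replaceable>N</replaceable></userinput>
 &prompt.user; <userinput>grep -r --include='Makefile*' '<replaceable>category</replaceable>/<replaceable>portname</replaceable>' .</userinput>
 &prompt.user; <userinput>svn remove <replaceable>category</replaceable>/<replaceable>portname</replaceable></userinput></screen>
 
 	    <para>An entry in <filename>ports/MOVED</filename> for a
 	      removed port leaves the second field (the new location)
 	      empty.  The date and reason below are
 	      placeholders:</para>
 
 	    <programlisting><replaceable>category</replaceable>/<replaceable>portname</replaceable>||<replaceable>YYYY-MM-DD</replaceable>|<replaceable>Reason for removal</replaceable></programlisting>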
 
 	    <para>Alternatively, you can use the
 	      <command>rmport</command> script, from
 	      <filename>ports/Tools/scripts</filename>.  This script
 	      was written by &a.vd.email;.  When sending questions
 	      about this script to the &a.ports;, please also CC
 	      &a.crees.email;, the current maintainer.</para>
 	  </answer>
 	</qandaentry>
       </qandadiv>
 
       <qandadiv xml:id="ports-qa-re-adding">
 	<title>Re-adding a Deleted Port</title>
 
 	<qandaentry xml:id="ports-qa-resurrect">
 	  <question>
 	    <para>How do I re-add a deleted port?</para>
 	  </question>
 
 	  <answer>
 	    <para>This is essentially the reverse of deleting a
 	      port.</para>
 
 	    <important>
 	      <para>Do not use <command>svn add</command> to add the
 		port.  Follow these steps.  If they are unclear or
 		are not working, ask for help; do not just
 		<command>svn add</command> the port.</para>
 	    </important>
 
 	    <procedure>
 	      <step>
 		<para>Figure out when the port was removed.  Use this
 		  <link
 		    xlink:href="https://people.FreeBSD.org/~crees/removed_ports/index.xml">list</link>,
 		  or look for the port on <link
 		    xlink:href="http://www.freshports.org/">freshports</link>,
 		  and then copy the last living revision of the
 		  port:</para>
 
 		<screen>&prompt.user; <userinput>cd /usr/ports/<replaceable>category</replaceable></userinput>
 &prompt.user; <userinput>svn cp 'svn+ssh://repo.freebsd.org/ports/head/<replaceable>category</replaceable>/<replaceable>portname</replaceable>/@<replaceable>XXXXXX</replaceable>' <replaceable>portname</replaceable></userinput></screen>
 
 		<para>Pick the revision that is just before the
 		  removal.  For example, if the revision where it was
 		  removed is <literal>269874</literal>, use
 		  <literal>269873</literal>.</para>
 
 		<para>It is also possible to specify a date.  In that
 		  case, pick a date that is before the removal but
 		  after the last commit to the port.</para>
 
 		<screen>&prompt.user; <userinput>cd /usr/ports/<replaceable>category</replaceable></userinput>
 &prompt.user; <userinput>svn cp 'svn+ssh://repo.freebsd.org/ports/head/<replaceable>category</replaceable>/<replaceable>portname</replaceable>/@{<replaceable>YYYY-MM-DD</replaceable>}' <replaceable>portname</replaceable></userinput></screen>
 	      </step>
 
 	      <step>
 		<para>Make the changes necessary to get the port
 		  working again.  If it was deleted because the
 		  distfiles are no longer available, either
 		  volunteer to host the distfiles, or find someone
 		  else to do so.</para>
 	      </step>
 
 	      <step>
 		<para>If some files have been added, or were removed
 		  during the resurrection process, use <command>svn
 		    add</command> or <command>svn remove</command> to
 		  make sure all the files in the port will be
 		  committed.</para>
 	      </step>
 
 	      <step>
 		<para>Restore the <varname>SUBDIR</varname> listing of
 		  the port in the parent directory
 		  <filename>Makefile</filename>, keeping the entries
 		  sorted.</para>
 	      </step>
 
 	      <step>
 		<para>Delete the port entry from
 		  <filename>ports/MOVED</filename>.</para>
 	      </step>
 
 	      <step>
 		<para>If the port had an entry in
 		  <filename>ports/LEGAL</filename>, restore it.</para>
 	      </step>
 
 	      <step>
 		<para><command>svn commit</command> these changes,
 		  preferably in one step.</para>
 	      </step>
 	    </procedure>
 
 	    <tip>
 	      <para>The <command>addport</command> script mentioned in
 		<xref linkend="ports-qa-adding"/> now detects when the
 		port to add has previously existed, and attempts to
 		handle all except the <filename>ports/LEGAL</filename>
 		step automatically.</para>
 	    </tip>
 	  </answer>
 	</qandaentry>
       </qandadiv>
 
       <qandadiv xml:id="ports-qa-repocopies">
 	<title>Repository Copies</title>
 
 	<qandaentry xml:id="ports-qa-repocopy-when">
 	  <question>
 	    <para>When do we need a repository copy?</para>
 	  </question>
 
 	  <answer>
 	    <para>When you want to add a port that is related to any
 	      port that is already in the tree in a separate
 	      directory, you have to do a repository copy.  Here
 	      <wordasword>related</wordasword> means it is a different
 	      version or a slightly modified version.  Examples are
 	      <filename>print/ghostscript*</filename> (different
 	      versions) and <filename>x11-wm/windowmaker*</filename>
 	      (English-only and internationalized version).</para>
 
 	    <para>Another example is when a port is moved from one
 	      subdirectory to another, or when the name of a directory
 	      must be changed because the authors renamed their
 	      software even though it is a descendant of a port
 	      already in the tree.</para>
 	  </answer>
 	</qandaentry>
 
 	<qandaentry xml:id="ports-qa-repocopy-how">
 	  <question>
 	    <para>What do I need to do?</para>
 	  </question>
 
 	  <answer>
 	    <para>With Subversion, a repo copy can be done by any
 	      committer:</para>
 
 	    <itemizedlist>
 	      <listitem>
 		<para>Doing a repo copy:</para>
 
 		<procedure>
 		  <step>
 		    <para>Verify that the target directory does
 		      not exist.</para>
 		  </step>
 
 		  <step>
 		    <para>Use <command>svn up</command> to make
 		      certain the original files, directories, and
 		      checkout information are current.</para>
 		  </step>
 
 		  <step>
 		    <para>Use <command>svn move</command> or
 		      <command>svn copy</command> to do the repo
 		      copy.</para>
 		  </step>
 
 		  <step>
 		    <para>Upgrade the copied port to the new version.
 		      Remember to add or change the
 		      <varname>PKGNAMEPREFIX</varname> or
 		      <varname>PKGNAMESUFFIX</varname> so there are no
 		      duplicate ports with the same name.  In some
 		      rare cases it may be necessary to change the
 		      <varname>PORTNAME</varname> instead of adding
 		      <varname>PKGNAMEPREFIX</varname> or
 		      <varname>PKGNAMESUFFIX</varname>, but this is
 		      only done when it is really needed &mdash; for
 		      example, using an existing port as the base for
 		      a very similar program with a different name, or
 		      upgrading a port to a new upstream version which
 		      actually changes the distribution name, like the
 		      transition from
 		      <filename>textproc/libxml</filename> to
 		      <filename>textproc/libxml2</filename>.  In most
 		      cases, adding or changing
 		      <varname>PKGNAMEPREFIX</varname> or
 		      <varname>PKGNAMESUFFIX</varname>
 		      suffices.</para>
 		  </step>
 
 		  <step>
 		    <para>Add the new subdirectory to the
 		      <varname>SUBDIR</varname> listing in the parent
 		      directory <filename>Makefile</filename>.  You
 		      can run <command>make checksubdirs</command> in
 		      the parent directory to check this.</para>
 		  </step>
 
 		  <step>
 		    <para>If the port changed categories, modify the
 		      <varname>CATEGORIES</varname> line of the port's
 		      <filename>Makefile</filename> accordingly.</para>
 		  </step>
 
 		  <step>
 		    <para>Add an entry to
 		      <filename>ports/MOVED</filename>, if you remove
 		      the original port.</para>
 		  </step>
 
 		  <step>
 		    <para>Commit all changes in one commit.</para>
 		  </step>
 		</procedure>
 	      </listitem>
 
 	      <listitem>
 		<para>When removing a port:</para>
 
 		<procedure>
 		  <step>
 		    <para>Perform a thorough check of the ports
 		      collection for any dependencies on the old port
 		      location/name, and update them.  Running
 		      <command>grep</command> on
 		      <filename>INDEX</filename> is not enough because
 		      some ports have dependencies enabled by
 		      compile-time options.  A full
 		      <command>grep -r</command> of the ports
 		      collection is recommended.</para>
 		  </step>
 
 		  <step>
 		    <para>Remove the old port and the
 		      old <varname>SUBDIR</varname> entry.</para>
 		  </step>
 
 		  <step>
 		    <para>Add an entry to
 		      <filename>ports/MOVED</filename>.</para>
 		  </step>
 		</procedure>
 	      </listitem>
 
 	      <listitem>
 		<para>After repo moves (<quote>rename</quote>
 		  operations where a port is copied and the old
 		  location is removed):</para>
 
 		<procedure>
 		  <step>
 		    <para>Follow the same steps that are outlined in
 		      the previous two entries, to activate the new
 		      location of the port and remove the old
 		      one.</para>
 		  </step>
 		</procedure>
 	      </listitem>
 	    </itemizedlist>
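 
 	    <para>Put together, a repo copy might look like the
 	      sketch below.  The names
 	      <replaceable>category</replaceable>,
 	      <replaceable>oldport</replaceable>, and
 	      <replaceable>newport</replaceable> are placeholders, and
 	      the example assumes a Subversion checkout in
 	      <filename>/usr/ports</filename>.  The edits to the new
 	      port, the category <filename>Makefile</filename>, and
 	      <filename>MOVED</filename> happen between the copy and
 	      the <command>make checksubdirs</command> step.  For a
 	      rename, use <command>svn move</command> in place of
 	      <command>svn copy</command>.</para>
 
 	    <screen>&prompt.user; <userinput>cd /usr/ports</userinput>
 &prompt.user; <userinput>svn up <replaceable>category</replaceable></userinput>
 &prompt.user; <userinput>svn copy <replaceable>category</replaceable>/<replaceable>oldport</replaceable> <replaceable>category</replaceable>/<replaceable>newport</replaceable></userinput>
 &prompt.user; <userinput>make -C <replaceable>category</replaceable> checksubdirs</userinput>
 &prompt.user; <userinput>svn status</userinput>
 &prompt.user; <userinput>svn commit <replaceable>category</replaceable> MOVED</userinput></screen>
 
 	    <para>When the original port is removed as part of the
 	      operation, the <filename>ports/MOVED</filename> entry
 	      names both locations.  The date and reason are
 	      placeholders:</para>
 
 	    <programlisting><replaceable>category</replaceable>/<replaceable>oldport</replaceable>|<replaceable>category</replaceable>/<replaceable>newport</replaceable>|<replaceable>YYYY-MM-DD</replaceable>|<replaceable>Reason for the move</replaceable></programlisting>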
 	  </answer>
 	</qandaentry>
       </qandadiv>
 
       <qandadiv xml:id="ports-qa-freeze">
 	<title>Ports Freeze</title>
 
 	<qandaentry xml:id="ports-qa-freeze-what">
 	  <question>
 	    <para>What is a <quote>ports freeze</quote>?</para>
 	  </question>
 
 	  <answer>
 	    <para>A <quote>ports freeze</quote> was a restricted state
 	      the ports tree was put in before a release.  It was used
 	      to ensure a higher quality for the packages shipped with
 	      a release.  It usually lasted a couple of weeks.  During
 	      that time, build problems were fixed, and the release
 	      packages were built.  This practice is no longer used,
 	      as the packages for the releases are built from the
 	      current stable, quarterly branch.</para>
 
 	    <para>For more information on how to merge commits to the
 	      quarterly branch, see <xref
 		linkend="ports-qa-misc-request-mfh"/>.</para>
 	  </answer>
 	</qandaentry>
       </qandadiv>
 
       <qandadiv xml:id="ports-qa-quarterly">
 	<title>Quarterly Branches</title>
 
 	<qandaentry xml:id="ports-qa-misc-request-mfh">
 	  <question>
 	    <para>What is the procedure to request authorization for
 	      merging a commit to the quarterly branch?</para>
 	  </question>
 
 	  <answer>
 	    <para>When doing the commit, add the branch name to the
 	      <literal>MFH:</literal> line, for example:</para>
 
 	    <programlisting>MFH:	<replaceable>2014Q1</replaceable></programlisting>
 
 	    <para>It will automatically notify the &a.ports-secteam;
 	      and the &a.portmgr;.  They will then decide if the
 	      commit can be merged and answer with the
 	      procedure.</para>
 
 	    <para>If the commit has already been made, send an email
 	      to the &a.ports-secteam; and the &a.portmgr; with the
 	      revision number and a small description of why the
 	      commit needs to be merged.</para>
 
 	    <tip>
 	      <para>If the MFH is covered by a blanket approval,
 		please explain why with a couple of words on the
 		<literal>MFH</literal> line, so that the reviewing
 		team can skip this commit and save time.  For
 		example:</para>
 
 	      <programlisting>MFH:  <replaceable>2014Q1 (runtime fix)</replaceable>
 MFH:  <replaceable>2014Q1 (browser blanket)</replaceable></programlisting>
 
 	      <para>The list of blanket approvals is available in
 		<xref linkend="ports-qa-blanket"/>.</para>
 	    </tip>
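 
 	    <para>For context, a complete commit message carrying an
 	      <literal>MFH:</literal> line might look like the sketch
 	      below.  The port name, summary, and branch are made up
 	      for illustration; only the <literal>MFH:</literal> line
 	      follows the format described above.</para>
 
 	    <programlisting><replaceable>category/portname</replaceable>: <replaceable>Fix crash at startup</replaceable>
 
 MFH:		<replaceable>2014Q1 (runtime fix)</replaceable></programlisting>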
 	  </answer>
 	</qandaentry>
 
 	<qandaentry xml:id="ports-qa-blanket">
 	  <question>
 	    <para>Are there any changes that can be merged without
 	      asking for approval?</para>
 	  </question>
 
 	  <answer>
 	    <para>The following blanket approvals for merging to the
 	      quarterly branches are in effect:</para>
 
 	    <note>
 	      <para>This blanket approval also applies to direct
 		commits for ports that have been removed from
 		<literal>head</literal>.</para>
 	    </note>
 
 	    <important>
 	      <para>These fixes <emphasis>must</emphasis> be
 		tested on the quarterly branch.</para>
 	    </important>
 
 	    <itemizedlist>
 
 	      <listitem>
 		<para>Fixes that do not result in a change in contents
 		  of the resulting package.  For example:</para>
 
 		<itemizedlist>
 		  <listitem>
 		    <para><filename>pkg-descr</filename>:
 		      <literal>WWW:</literal> URL updates (existing
 		      404, moved or incorrect)</para>
 		  </listitem>
 		</itemizedlist>
 	      </listitem>
 
 	      <listitem>
 		<para>Build, runtime or packaging fixes, if the
 		  quarterly branch version is currently broken.</para>
 	      </listitem>
 
 	      <listitem>
 		<para>Missing dependencies (detected, linked against
 		  but not registered via
 		  <varname><replaceable>*</replaceable>_DEPENDS</varname>).</para>
 	      </listitem>
 
 	      <listitem>
 		<para>Fixing <link
 		    xlink:href="&url.books.porters-handbook;/uses-shebangfix.html">shebangs</link>,
 		  stripping installed libraries and binaries, and
 		  plist fixes.</para>
 	      </listitem>
 
 	      <listitem>
 		<para>Backport of security and reliability fixes which
 		  only result in <varname>PORTREVISION</varname> bumps
 		  and no changes to enabled features.  For example,
 		  adding a patch fixing a buffer overflow.</para>
 	      </listitem>
 
 	      <listitem>
 		<para>Minor version changes that do nothing but fix
 		  security or crash-related issues.</para>
 	      </listitem>
 
 	      <listitem>
 		<para>Adding/fixing
 		  <varname>CONFLICTS</varname>.</para>
 	      </listitem>
 
 	      <listitem>
 		<para>Web Browsers, browser plugins, and their
 		  required dependencies.</para>
 	      </listitem>
 	    </itemizedlist>
 
 	    <important>
 	      <para>Commits that are not covered by these blanket
 		approvals always require explicit approval of either
 		&a.ports-secteam; or &a.portmgr;.</para>
 	    </important>
 	  </answer>
 	</qandaentry>
 
 	<qandaentry xml:id="ports-qa-misc-commit-mfh">
 	  <question>
 	    <para>What is the procedure for merging commits to the
 	      quarterly branch?</para>
 	  </question>
 
 	  <answer>
 	    <para>A script is provided to automate merging a specific
 	      commit: <filename>ports/Tools/scripts/mfh</filename>.
 	      It is used as follows:</para>
 
 	    <screen>&prompt.user; <userinput>/usr/ports/Tools/scripts/mfh 380362</userinput>
  U   2015Q1
 Checked out revision 380443.
 A    2015Q1/security
 Updating '2015Q1/security/rubygem-sshkit':
 A    2015Q1/security/rubygem-sshkit
 A    2015Q1/security/rubygem-sshkit/Makefile
 A    2015Q1/security/rubygem-sshkit/distinfo
 A    2015Q1/security/rubygem-sshkit/pkg-descr
 Updated to revision 380443.
 --- Merging r380362 into '2015Q1':
 U    2015Q1/security/rubygem-sshkit/Makefile
 U    2015Q1/security/rubygem-sshkit/distinfo
 --- Recording mergeinfo for merge of r380362 into '2015Q1':
  U   2015Q1
 --- Recording mergeinfo for merge of r380362 into '2015Q1/security':
  G   2015Q1/security
 --- Eliding mergeinfo from '2015Q1/security':
  U   2015Q1/security
 --- Recording mergeinfo for merge of r380362 into '2015Q1/security/rubygem-sshkit':
  G   2015Q1/security/rubygem-sshkit
 --- Eliding mergeinfo from '2015Q1/security/rubygem-sshkit':
  U   2015Q1/security/rubygem-sshkit
  M      2015Q1
 M       2015Q1/security/rubygem-sshkit/Makefile
 M       2015Q1/security/rubygem-sshkit/distinfo
 Index: 2015Q1/security/rubygem-sshkit/Makefile
 ===================================================================
 --- 2015Q1/security/rubygem-sshkit/Makefile     (revision 380443)
 +++ 2015Q1/security/rubygem-sshkit/Makefile     (working copy)
 @@ -2,7 +2,7 @@
  # $FreeBSD$
 
  PORTNAME=      sshkit
 -PORTVERSION=   1.6.1
 +PORTVERSION=   1.7.0
  CATEGORIES=    security rubygems
  MASTER_SITES=  RG
 
 Index: 2015Q1/security/rubygem-sshkit/distinfo
 ===================================================================
 --- 2015Q1/security/rubygem-sshkit/distinfo     (revision 380443)
 +++ 2015Q1/security/rubygem-sshkit/distinfo     (working copy)
 @@ -1,2 +1,2 @@
 -SHA256 (rubygem/sshkit-1.6.1.gem) = 8ca67e46bb4ea50fdb0553cda77552f3e41b17a5aa919877d93875dfa22c03a7
 -SIZE (rubygem/sshkit-1.6.1.gem) = 135680
 +SHA256 (rubygem/sshkit-1.7.0.gem) = 90effd1813363bae7355f4a45ebc8335a8ca74acc8d0933ba6ee6d40f281a2cf
 +SIZE (rubygem/sshkit-1.7.0.gem) = 136192
 Index: 2015Q1
 ===================================================================
 --- 2015Q1      (revision 380443)
 +++ 2015Q1      (working copy)
 
 Property changes on: 2015Q1
 ___________________________________________________________________
 Modified: svn:mergeinfo
    Merged /head:r380362
 Do you want to commit? (no = start a shell) [y/n]</screen>
 
 	    <para>At that point, the script will either open a shell
 	      for you to fix things, or open your text editor with the
 	      commit message all prepared and then commit the
 	      merge.</para>
 
 	    <para>The script assumes that you can connect to
 	      <literal>repo.FreeBSD.org</literal> with
 	      <application>SSH</application> directly, so if your
 	      local login name differs from your &os; cluster
 	      account, you need a few lines in your
 	      <filename>~/.ssh/config</filename>:</para>
 
 	    <programlisting>Host *.freebsd.org
     User <replaceable>freebsd-login</replaceable></programlisting>
 
 	    <tip>
 	      <para>The script is also able to merge more than one
 		revision at a time.  If there have been other updates
 		to the port since the branch was created that have not
 		been merged because they were not security related,
 		add the different revisions <emphasis>in the order
 		  they were committed</emphasis> on the
 		<command>mfh</command> line.  The new commit log
 		message will contain the combined log messages from
 		all the original commits.  These messages
 		<emphasis>must</emphasis> be edited to show what is
 		actually being done with the new commit.</para>
 
 	      <screen>&prompt.user; <userinput>/usr/ports/Tools/scripts/mfh r407208 r407713 r407722 r408567 r408943 r410728</userinput></screen>
 	    </tip>
 
 	    <note>
 	      <para>The mfh script can also take an optional first
 		argument, the branch where the merge is being done.
 		Only the latest quarterly branch is supported, so
 		specifying the branch is discouraged.  To be safe, the
 		script will give a warning if the quarterly branch is
 		not the latest:</para>
 
 	      <screen>&prompt.user; <userinput>/usr/ports/Tools/scripts/mfh 2016Q1 r407208 r407713</userinput>
 /!\ The latest branch is 2016Q2, do you really want to commit to 2016Q1? [y/n]</screen>
 	    </note>
 	  </answer>
 	</qandaentry>
       </qandadiv>
 
       <qandadiv xml:id="ports-qa-new-category">
 	<title>Creating a New Category</title>
 
 	<qandaentry xml:id="ports-qa-new-category-how">
 	  <question>
 	    <para>What is the procedure for creating a new
 	      category?</para>
 	  </question>
 
 	  <answer>
 	    <para>Please see <link
 		xlink:href="&url.books.porters-handbook;/makefile-categories.html#proposing-categories">
 		Proposing a New Category</link> in the Porter's
 	      Handbook.  Once that procedure has been followed and the
 	      PR has been assigned to the &a.portmgr;, it is their
 	      decision whether or not to approve it.  If they do, it
 	      is their responsibility to:</para>
 
 	    <procedure>
 	      <step>
 		<para>Perform any needed moves.  (This only applies
 		  to physical categories.)</para>
 	      </step>
 
 	      <step>
 		<para>Update the <varname>VALID_CATEGORIES</varname>
 		  definition in
 		  <filename>ports/Mk/bsd.port.mk</filename>.</para>
 	      </step>
 
 	      <step>
 		<para>Assign the PR back to you.</para>
 	      </step>
 	    </procedure>
 	  </answer>
 	</qandaentry>
 
 	<qandaentry xml:id="ports-qa-new-category-physical">
 	  <question>
 	    <para>What do I need to do to implement a new physical
 	      category?</para>
 	  </question>
 
 	  <answer>
 	    <procedure>
 	      <step>
 		<para>Update each moved port's
 		  <filename>Makefile</filename>.  Do not connect the
 		  new category to the build yet.</para>
 
 		<para>To do this, you will need to:</para>
 
 		<procedure>
 		  <step>
 		    <para>Change the port's
 		      <varname>CATEGORIES</varname> (this was the
 		      point of the exercise, remember?).  List the
 		      new category
 		      <emphasis>first</emphasis>.  This will help to
 		      ensure that the <varname>PKGORIGIN</varname> is
 		      correct.</para>
 		  </step>
 
 		  <step>
 		    <para>Run a <command>make describe</command>.
 		      Since the top-level
 		      <command>make index</command> that you will be
 		      running in a few steps is an iteration of
 		      <command>make describe</command> over the entire
 		      ports hierarchy, catching any errors here will
 		      save you having to re-run that step later
 		      on.</para>
 		  </step>
 
 		  <step>
 		    <para>If you want to be really thorough, now
 		      might be a good time to run
 		      &man.portlint.1;.</para>
 		  </step>
 		</procedure>
 	      </step>
 
 	      <step>
 		<para>Check that the <varname>PKGORIGIN</varname>s are
 		  correct.  The ports system uses each port's
 		  <varname>CATEGORIES</varname> entry to create its
 		  <varname>PKGORIGIN</varname>, which is used to
 		  connect installed packages to the port directory
 		  they were built from.  If this entry is wrong,
 		  common port tools like &man.pkg.version.1; and
 		  &man.portupgrade.1; fail.</para>
 
 		<para>To do this, use the
 		  <filename>chkorigin.sh</filename> tool:
 		  <command>env
 		  PORTSDIR=<replaceable>/path/to/ports</replaceable>
 		  sh -e
 		  <replaceable>/path/to/ports</replaceable>/Tools/scripts/chkorigin.sh</command>.
 		  This will check <emphasis>every</emphasis> port in
 		  the ports tree, even those not connected to the
 		  build, so you can run it directly after the move
 		  operation.  Hint: do not forget to look at the
 		  <varname>PKGORIGIN</varname>s of any slave ports of
 		  the ports you just moved!</para>
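 
 		<para>As an illustrative sketch, assuming the ports tree
 		  is checked out in <filename>/usr/ports</filename>
 		  (substitute the actual location), the check could be
 		  run like this:</para>
 
 		<screen>&prompt.user; <userinput>env PORTSDIR=/usr/ports sh -e /usr/ports/Tools/scripts/chkorigin.sh</userinput></screen>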
 	      </step>
 
 	      <step>
 		<para>On your own local system, test the proposed
 		  changes: first, comment out the
 		  <varname>SUBDIR</varname> entries in the old ports'
 		  categories' <filename>Makefile</filename>s; then
 		  enable building the new category in
 		  <filename>ports/Makefile</filename>.  Run
 		  <command>make checksubdirs</command> in the affected
 		  category directories to check the
 		  <varname>SUBDIR</varname> entries.  Next, in the
 		  <filename>ports/</filename>
 		  directory, run <command>make index</command>.  This
 		  can take over 40 minutes even on modern systems;
 		  however, it is a necessary step to prevent problems
 		  for other people.</para>
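 
 		<para>A sketch of that sequence, assuming the tree lives
 		  in <filename>/usr/ports</filename> and using the
 		  placeholder category names
 		  <replaceable>oldcategory</replaceable> and
 		  <replaceable>newcategory</replaceable>:</para>
 
 		<screen>&prompt.user; <userinput>cd /usr/ports/<replaceable>oldcategory</replaceable> &amp;&amp; make checksubdirs</userinput>
 &prompt.user; <userinput>cd /usr/ports/<replaceable>newcategory</replaceable> &amp;&amp; make checksubdirs</userinput>
 &prompt.user; <userinput>cd /usr/ports &amp;&amp; make index</userinput></screen>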
 	      </step>
 
 	      <step>
 		<para>Once this is done, you can commit the updated
 		  <filename>ports/Makefile</filename> to connect the
 		  new category to the build and also commit the
 		  <filename>Makefile</filename> changes for the old
 		  category or categories.</para>
 	      </step>
 
 	      <step>
 		<para>Add appropriate entries to
 		  <filename>ports/MOVED</filename>.</para>
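 
 		<para>Each entry is a single line of four
 		  <literal>|</literal>-separated fields: the old origin,
 		  the new origin, the date of the move, and the reason.
 		  The entry below is purely hypothetical (the port,
 		  categories, and date are invented for illustration);
 		  the comments at the top of
 		  <filename>ports/MOVED</filename> remain the
 		  authoritative description of the format:</para>
 
 		<programlisting>oldcategory/example-port|newcategory/example-port|2020-01-15|Moved to the new newcategory category</programlisting>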
 	      </step>
 
 	      <step>
 		<para>Update the documentation by modifying:</para>
 
 		<itemizedlist>
 		  <listitem>
 		    <para>the <link
 			xlink:href="&url.books.porters-handbook;/makefile-categories.html#PORTING-CATEGORIES">list
 			of categories</link> in the Porter's
 		      Handbook</para>
 		  </listitem>
 
 		  <listitem>
 		    <para><filename>doc/en_US.ISO8859-1/htdocs/ports</filename>.
 		      Note that these are now displayed by sub-groups,
 		      as specified in
 		      <filename>doc/en_US.ISO8859-1/htdocs/ports/categories.descriptions</filename>.</para>
 		  </listitem>
 		</itemizedlist>
 
 		<para>(Note: these are in the docs, not the ports,
 		  repository).  If you are not a docs committer, you
 		  will need to submit a PR for this.</para>
 	      </step>
 
 	      <step>
 		<para>Only once all the above have been done, and no
 		  one is any longer reporting problems with the new
 		  ports, should the old ports be deleted from their
 		  previous locations in the repository.</para>
 	      </step>
 	    </procedure>
 
 	    <para>It is not necessary to manually update the
 	      <link xlink:href="&url.base;/ports/index.html">ports web
 		pages</link> to reflect the new category.  This is
 	      done automatically via the change to
 	      <filename>en_US.ISO8859-1/htdocs/ports/categories</filename>
 	      and the automated rebuild of
 	      <filename>INDEX</filename>.</para>
 	  </answer>
 	</qandaentry>
 
 	<qandaentry xml:id="ports-qa-new-category-virtual">
 	  <question>
 	    <para>What do I need to do to implement a new virtual
 	      category?</para>
 	  </question>
 
 	  <answer>
 	    <para>This is much simpler than a physical category.  Only
 	      a few modifications are needed:</para>
 
 	    <itemizedlist>
 	      <listitem>
 		<para>the <link
 		    xlink:href="&url.books.porters-handbook;/makefile-categories.html#PORTING-CATEGORIES">list
 		    of categories</link> in the Porter's
 		  Handbook</para>
 	      </listitem>
 
 	      <listitem>
 		<para><filename>en_US.ISO8859-1/htdocs/ports/categories</filename></para>
 	      </listitem>
 	    </itemizedlist>
 	  </answer>
 	</qandaentry>
       </qandadiv>
 
       <qandadiv xml:id="ports-qa-misc-questions">
 	<title>Miscellaneous Questions</title>
 
 	<qandaentry xml:id="ports-qa-misc-blanket-approval">
 	  <question>
 	    <para>Are there changes that can be committed without
 	      asking the maintainer for approval?</para>
 	  </question>
 
 	  <answer>
 	    <para>Blanket approval for most ports applies to these
 	      types of fixes:</para>
 
 	    <itemizedlist>
 	      <listitem>
 		<para>Most infrastructure changes to a port (that is,
 		  modernizing, but not changing the functionality).
 		  For example, the blanket covers converting to new
 		  <varname>USES</varname> macros, enabling verbose
 		  builds, and switching to new ports system
 		  syntaxes.</para>
 	      </listitem>
 
 	      <listitem>
 		<para>Trivial and <emphasis>tested</emphasis> build
 		  and runtime fixes.</para>
 	      </listitem>
 
 	      <listitem>
 		<para>Documentation or metadata changes to ports,
 		  like <filename>pkg-descr</filename> or
 		  <varname>COMMENT</varname>.</para>
 	      </listitem>
 	    </itemizedlist>
 
 	    <important>
 	      <para>Exceptions to this are anything maintained by the
 		&a.portmgr;, or the &a.security-officer;.  No
 		unauthorized commits may ever be made to ports
 		maintained by those groups.</para>
 	    </important>
 
 	  </answer>
 	</qandaentry>
 
 	<qandaentry xml:id="ports-qa-misc-correctly-building">
 	  <question>
 	    <para>How do I know if my port is building correctly or
 	      not?</para>
 	  </question>
 
 	  <answer>
 	    <para>The packages are built multiple times each week.  If
 	      a port fails, the maintainer will receive an email from
 	      <literal>pkg-fallout@FreeBSD.org</literal>.</para>
 
 	    <para>Reports for all the package builds (official,
 	      experimental, and non-regression) are aggregated at
 	      <link
 		xlink:href="https://pkg-status.freebsd.org/">pkg-status.FreeBSD.org</link>.</para>
 	  </answer>
 	</qandaentry>
 
 	<qandaentry xml:id="ports-qa-misc-INDEX">
 	  <question>
 	    <para>I added a new port.  Do I need to add it to the
 	      <filename>INDEX</filename>?</para>
 	  </question>
 
 	  <answer>
 	    <para>No.  The file can either be generated by running
 	      <command>make index</command>, or a pre-generated
 	      version can be downloaded with
 	      <command>make fetchindex</command>.</para>
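 
 	    <para>For example, assuming the ports tree is in
 	      <filename>/usr/ports</filename> and the account used
 	      can write to it, the pre-generated version could be
 	      fetched with:</para>
 
 	    <screen>&prompt.user; <userinput>cd /usr/ports &amp;&amp; make fetchindex</userinput></screen>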
 	  </answer>
 	</qandaentry>
 
 	<qandaentry xml:id="ports-qa-misc-no-touch">
 	  <question>
 	    <para>Are there any other files I am not allowed to
 	      touch?</para>
 	  </question>
 
 	  <answer>
 	    <para>Any file directly under <filename>ports/</filename>,
 	      or any file under a subdirectory that starts with an
 	      uppercase letter (<filename>Mk/</filename>,
 	      <filename>Tools/</filename>, etc.).  In particular, the
 	      &a.portmgr; is very protective of
 	      <filename>ports/Mk/bsd.port*.mk</filename> so do not
 	      commit changes to those files unless you want to face
 	      their wrath.</para>
 	  </answer>
 	</qandaentry>
 
 	<qandaentry xml:id="ports-qa-misc-updated-distfile">
 	  <question>
 	    <para>What is the proper procedure for updating the
 	      checksum for a port distfile when the file changes
 	      without a version change?</para>
 	  </question>
 
 	  <answer>
 	    <para>When the checksum for a distribution file is updated
 	      because the author changed the file without changing
 	      its version, the commit message includes a
 	      summary of the relevant diffs between the original and
 	      new distfile to ensure that the distfile has not been
 	      corrupted or maliciously altered.  If the current
 	      version of the port has been in the ports tree for a
 	      while, a copy of the old distfile will usually be
 	      available on the ftp servers; otherwise the author or
 	      maintainer should be contacted to find out why the
 	      distfile has changed.</para>
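 
 	    <para>One possible way to produce such a summary is to
 	      unpack both versions side by side and compare them.
 	      This is only a sketch: the file and directory names
 	      below are placeholders, and it assumes copies of both
 	      the old and the new distfile are at hand:</para>
 
 	    <screen>&prompt.user; <userinput>mkdir old new</userinput>
 &prompt.user; <userinput>tar -xf <replaceable>example-1.0.tar.gz.orig</replaceable> -C old</userinput>
 &prompt.user; <userinput>tar -xf <replaceable>example-1.0.tar.gz</replaceable> -C new</userinput>
 &prompt.user; <userinput>diff -ruN old new | less</userinput></screen>
 
 	    <para>After reviewing the differences, running
 	      <command>make makesum</command> in the port directory
 	      regenerates <filename>distinfo</filename>.</para>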
 	  </answer>
 	</qandaentry>
 
 	<qandaentry xml:id="ports-exp-run">
 	  <question>
 	    <para>How can an experimental test build of the ports tree
 	      (<emphasis>exp-run</emphasis>) be requested?</para>
 	  </question>
 
 	  <answer>
 	    <para>An exp-run must be completed before patches with a
 	      significant ports impact are committed.  The patch can
 	      be against the ports tree or the base system.</para>
 
 	    <para>Full package builds will be done with the patches
 	      provided by the submitter, and the submitter is required
 	      to fix detected problems (<emphasis>fallout</emphasis>)
 	      before commit.</para>
 
 	    <procedure>
 	      <step>
 		<para>Go to the <link
 		    xlink:href="https://bugs.freebsd.org/submit">Bugzilla
 		    new <acronym>PR</acronym> page</link>.</para>
 	      </step>
 
 	      <step>
 		<para>Select the product your patch is about.</para>
 	      </step>
 
 	      <step>
 		<para>Fill in the bug report as normal.  Remember to
 		  attach the patch.</para>
 	      </step>
 
 	      <step>
 		<para>If at the top it says <quote>Show Advanced
 		    Fields</quote>, click on it.  It will now say
 		  <quote>Hide Advanced Fields</quote>.  Many new
 		  fields will be available.  If it already says
 		  <quote>Hide Advanced Fields</quote>, no need to do
 		  anything.</para>
 	      </step>
 
 	      <step>
 		<para>In the <quote>Flags</quote> section, set the
 		  <quote>exp-run</quote> one to <literal>?</literal>.
 		  As for all other fields, hovering the mouse over any
 		  field shows more details.</para>
 	      </step>
 
 	      <step>
 		<para>Submit.  Wait for the build to run.</para>
 	      </step>
 
 	      <step>
 		<para>&a.portmgr; will reply with any resulting
 		  fallout.</para>
 	      </step>
 
 	      <step>
 		<para>Depending on the fallout:</para>
 
 		<stepalternatives>
 		  <step>
 		    <para>If there is no fallout, the procedure stops
 		      here, and the change can be committed, pending
 		      any other approval required.</para>
 		  </step>
 
 		  <step>
 		    <substeps>
 		      <step>
 			<para>If there is fallout, it
 			  <emphasis>must</emphasis> be fixed, either
 			  by fixing the ports directly in the ports
 			  tree, or adding to the submitted
 			  patch.</para>
 		      </step>
 
 		      <step>
 			<para>When this is done, go back to step 6
 			  saying the fallout was fixed and wait for
 			  the exp-run to be run again.  Repeat as long
 			  as there are broken ports.</para>
 		      </step>
 		    </substeps>
 		  </step>
 		</stepalternatives>
 	      </step>
 	    </procedure>
 	  </answer>
 	</qandaentry>
       </qandadiv>
     </qandaset>
   </sect1>
 
   <sect1 xml:id="non-committers">
     <title>Issues Specific to Developers Who Are Not
       Committers</title>
 
     <para>A few people who have access to the &os; machines do not
       have commit bits.  Almost all of this document will apply to
       these developers as well (except things specific to commits and
       the mailing list memberships that go with them).  In particular,
       we recommend that you read:</para>
 
     <itemizedlist>
       <listitem>
 	<para><link linkend="admin">Administrative
 	    Details</link></para>
       </listitem>
 
       <listitem>
 	<para><link
 	    linkend="conventions-everyone">Conventions</link></para>
 
 	<note>
 	  <para>Get your mentor to add you to the
 	    <quote>Additional Contributors</quote>
 	    (<filename>doc/en_US.ISO8859-1/articles/contributors/contrib.additional.xml</filename>),
 	    if you are not already listed there.</para>
 	</note>
       </listitem>
 
       <listitem>
 	<para><link linkend="developer.relations">Developer
 	    Relations</link></para>
       </listitem>
 
       <listitem>
 	<para><link linkend="ssh.guide">SSH Quick-Start
 	    Guide</link></para>
       </listitem>
 
       <listitem>
 	<para><link linkend="rules">The &os; Committers' Big List
 	    of Rules</link></para>
       </listitem>
     </itemizedlist>
 
   </sect1>
 
   <sect1 xml:id="google-analytics">
     <title>Information About &ga;</title>
 
     <para>As of December 12, 2012, &ga; was enabled on the
       &os;&nbsp;Project website to collect anonymized statistics
       about how the site is used.  The information collected
       helps the &os;&nbsp;Documentation Project identify
       problems on the &os; website.</para>
 
     <sect2 xml:id="google-analytics-policy">
       <title>&ga; General Policy</title>
 
       <para>The &os;&nbsp;Project takes visitor privacy very
 	seriously.  As such, the &os;&nbsp;Project website honors the
 	<quote>Do Not Track</quote> header <emphasis>before</emphasis>
 	fetching the tracking code from Google.  For more information,
 	please see the
 	<link xlink:href="https://www.FreeBSD.org/privacy.html">&os;
 	  Privacy Policy</link>.</para>
 
       <para>&ga; access is <emphasis>not</emphasis> arbitrarily
 	allowed &mdash; access must be requested, voted on by the
 	&a.doceng;, and explicitly granted.</para>
 
       <para>Requests for &ga; data must include a specific purpose.
 	For example, a valid reason for requesting access would be
 	<quote>to see the most frequently used web browsers when
 	  viewing &os; web pages to ensure page rendering speeds are
 	  acceptable.</quote></para>
 
       <para>Conversely, <quote>to see what web browsers are most
 	  frequently used</quote> (without stating
 	<emphasis>why</emphasis>) would be rejected.</para>
 
       <para>All requests must include the timeframe for which the data
 	would be required.  For example, it must be explicitly stated
 	if the requested data would be needed for a timeframe covering
 	a span of 3 weeks, or if the request would be one-time
 	only.</para>
 
       <para>Any request for &ga; data without a clear, reasonable
 	reason beneficial to the &os;&nbsp;Project will be
 	rejected.</para>
     </sect2>
 
     <sect2 xml:id="google-analytics-data">
       <title>Data Available Through &ga;</title>
 
       <para>A few examples of the types of &ga; data available
 	include:</para>
 
       <itemizedlist>
 	<listitem>
 	  <para>Commonly used web browsers</para>
 	</listitem>
 
 	<listitem>
 	  <para>Page load times</para>
 	</listitem>
 
 	<listitem>
 	  <para>Site access by language</para>
 	</listitem>
       </itemizedlist>
     </sect2>
   </sect1>
 
   <sect1 xml:id="misc">
     <title>Miscellaneous Questions</title>
 
     <qandaset>
       <qandaentry>
 	<question>
 	  <para>How do I add a new file to a branch?</para>
 	</question>
 
 	<answer>
 	  <para>To add a file to a branch, simply check out or update
 	    to the branch you want to add to and then add the file
 	    using the add operation as you normally would.  This works
 	    fine for the <literal>doc</literal> and
 	    <literal>ports</literal> trees.  The
 	    <literal>src</literal> tree uses SVN and requires more
 	    care because of the <literal>mergeinfo</literal>
 	    properties.  See the
 	    <link linkend="subversion-primer">Subversion Primer</link>
 	    for details on how to perform an MFC.</para>
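 
 	  <para>As a rough sketch for a working copy of a
 	    <literal>doc</literal> or <literal>ports</literal>
 	    branch (the file name is a placeholder, and the commands
 	    are run from the top of the working copy):</para>
 
 	  <screen>&prompt.user; <userinput>svn update</userinput>
 &prompt.user; <userinput>svn add <replaceable>path/to/newfile</replaceable></userinput>
 &prompt.user; <userinput>svn commit <replaceable>path/to/newfile</replaceable></userinput></screen>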
 	</answer>
       </qandaentry>
 
       <qandaentry>
 	<question>
 	  <para>How do I access <systemitem
 	      class="fqdomainname">people.FreeBSD.org</systemitem> to
 	    put up personal or project information?</para>
 	</question>
 
 	<answer>
 	  <para><systemitem
 	      class="fqdomainname">people.FreeBSD.org</systemitem> is
 	    the same as <systemitem
 	      class="fqdomainname">freefall.FreeBSD.org</systemitem>.
 	    Just create a <filename>public_html</filename> directory.
 	    Anything you place in that directory will automatically be
 	    visible under <uri
 	      xlink:href="https://people.FreeBSD.org/">https://people.FreeBSD.org/</uri>.</para>
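 
 	  <para>A minimal sketch, assuming the cluster login is
 	    <replaceable>freebsd-login</replaceable> and that
 	    <filename>index.html</filename> is a page prepared
 	    locally (both names are placeholders):</para>
 
 	  <screen>&prompt.user; <userinput>ssh <replaceable>freebsd-login</replaceable>@freefall.FreeBSD.org mkdir -p public_html</userinput>
 &prompt.user; <userinput>scp index.html <replaceable>freebsd-login</replaceable>@freefall.FreeBSD.org:public_html/</userinput></screen>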
 	</answer>
       </qandaentry>
 
       <qandaentry>
 	<question>
 	  <para>Where are the mailing list archives stored?</para>
 	</question>
 
 	<answer>
 	  <para>The mailing lists are archived under
 	    <filename>/local/mail</filename> on <systemitem
 	      class="fqdomainname"
 	      >freefall.FreeBSD.org</systemitem>.</para>
 	</answer>
       </qandaentry>
 
       <qandaentry>
 	<question>
 	  <para>I would like to mentor a new committer.  What process
 	    do I need to follow?</para>
 	</question>
 
 	<answer>
 	  <para>See the <link
 	      xlink:href="https://www.freebsd.org/internal/new-account.html">New
 	      Account Creation Procedure</link> document on the
 	    internal pages.</para>
 	</answer>
       </qandaentry>
     </qandaset>
   </sect1>
 
   <sect1 xml:id="benefits">
     <title>Benefits and Perks for &os; Committers</title>
 
     <sect2 xml:id="benefits-recognition">
       <title>Recognition</title>
 
       <para>Recognition as a competent software engineer is the
 	longest lasting value.  In addition, getting a chance to work
 	with some of the best people that every engineer would dream
 	of meeting is a great perk!</para>
     </sect2>
 
     <sect2 xml:id="benefits-freebsdmall">
       <title>FreeBSD Mall</title>
 
       <para>&os; committers can get a free 4-CD or DVD set at
 	conferences from
 	<link xlink:href="http://www.freebsdmall.com">&os; Mall,
 	  Inc.</link>.</para>
     </sect2>
 
     <sect2 xml:id="benefits-irc">
       <title><acronym>IRC</acronym></title>
 
       <para>Developers may request a cloaked hostmask
 	for their account on the Freenode IRC network in the form
 	of
 	<literal>freebsd/developer/</literal><replaceable>freefall
 	  name</replaceable> or
 	<literal>freebsd/developer/</literal><replaceable>NickServ
 	  name</replaceable>.  To request a cloak, send an email to
 	&a.irc.email; with your requested hostmask and NickServ
 	account name.</para>
     </sect2>
 
     <sect2 xml:id="benefits-gandi">
       <title><systemitem
 	  class="domainname">Gandi.net</systemitem></title>
 
       <para>Gandi provides website hosting, cloud computing, domain
 	registration, and X.509 certificate services.</para>
 
       <para>Gandi offers an E-rate discount to all &os; developers.
 	Send mail to <email>non-profit@gandi.net</email> using your
 	<literal>@freebsd.org</literal> mail address, and indicate
 	your Gandi handle.</para>
     </sect2>
   </sect1>
 </article>
diff --git a/en_US.ISO8859-1/articles/contributors/contrib.develinmemoriam.xml b/en_US.ISO8859-1/articles/contributors/contrib.develinmemoriam.xml
index cb5064aa5c..eda3c20709 100644
--- a/en_US.ISO8859-1/articles/contributors/contrib.develinmemoriam.xml
+++ b/en_US.ISO8859-1/articles/contributors/contrib.develinmemoriam.xml
@@ -1,179 +1,179 @@
 <?xml version="1.0" encoding="iso-8859-1"?>
 <!-- $FreeBSD$ -->
 <itemizedlist xmlns="http://docbook.org/ns/docbook"
   xmlns:xlink="http://www.w3.org/1999/xlink" version="5.0">
 
   <listitem>
     <para>Bruce D. Evans (1991 - 2019; RIP 2019)</para>
 
     <para>Bruce was a programming giant who made FreeBSD his
       home.</para>
 
     <para>Back before FreeBSD and Linux there was Minix, a toy "unix"
       written by Andy Tanenbaum, released in 1987, sold with complete
       sources on three floppy disks, for $99.</para>
 
     <para>Bruce ported Minix to the i386 around 1989.</para>
 
     <para>Linus Torvalds used Minix/386 to develop his own kernel, and
       Bruce was the first person he thanked in the
       release-announcement.</para>
 
     <para>When Bill Jolitz released 386BSD 0.1 in 1992, Bruce was
       listed as a contributor.</para>
 
     <para>Bruce co-founded the FreeBSD project, and served on core.0,
       but he was never partisan, and over the years many other
       projects have benefitted from his patches, advice and
       wisdom.</para>
 
     <para>Code reviews from Bruce came in three flavours, "mild",
       "brucified" and "brucifiction", but they were never personal: It
       was always only about the code, the mistakes, the sloppy
       thinking, the missing historical context, the ambiguous
       standards - and the style(9) transgressions.</para>
 
-    <para>Because Bruce gave more code reviews than anybody else in
+    <para>As Bruce gave more code reviews than anybody else in
       the history of the FreeBSD project, the commit logs hide the
       true scale of his impact until you pay attention to
       "Submitted by", "Reviewed by" and "Pointed out by".</para>
 
     <para>Being hard of hearing, Bruce did not attend
       conferences.</para>
 
     <para>The notable exception was the 1999 BSDcon in California,
       where his core team colleagues greeted him with "We're not
       worthy!" in Wayne's World fashion.</para>
 
     <para>Twenty years later we're still not.</para>
   </listitem>
 
   <listitem>
     <para>Kurt Lidl (2015 - 2019; RIP 2019)</para>
 
     <para>Kurt first got involved with BSD while it was still a
       project at the University of California at Berkeley.  Shortly
       after personalized license plates became available in Maryland,
       he got "BSDWZRD".</para>
 
     <para>He began contributing to FreeBSD shortly after the
       conception of the project.  He became a FreeBSD source committer
       in October 2015.</para>
 
     <para>Kurt's most well known FreeBSD project was
       &man.blacklistd.8; which blocks and releases ports on demand to
       avoid DoS abuse.  He has also made many other bug fixes and
       enhancements to DTrace, boot loaders, and other bits and pieces
       of the FreeBSD infrastructure.</para>
 
     <para>Earlier work included the game XTank, authorship of RFC 2516
       <link xlink:href="https://tools.ietf.org/html/rfc2516">"A Method
 	for Transmitting PPP Over Ethernet (PPPoE)"</link>, and the
       USENIX paper <link
 	xlink:href="https://www.usenix.org/conference/usenix-winter-1994-technical-conference/drinking-firehose-multicast-usenet-news">"Drinking from the Firehose: Multicast USENET News"</link>.</para>
   </listitem>
 
   <listitem>
     <para>Frank Durda IV (1995 - 2003; RIP 2018)</para>
 
     <para>Frank had been around the project since the
       very early days, contributing code to the 1.x line
       before becoming a committer.</para>
   </listitem>
 
   <listitem>
     <para>Andrey A. Chernov (1993 - 2017; RIP 2017)</para>
 
     <para>Andrey's contributions to &os; cannot be overstated.  Having
       been involved for a long time, there is hardly an area which he
       did not touch.</para>
   </listitem>
 
   <listitem>
     <para>J&uuml;rgen Lock (2006 - 2015; RIP 2015)</para>
 
     <para>J&uuml;rgen made a number of contributions to &os;,
       including work on libvirt, the graphics stack, and QEMU.
       J&uuml;rgen's contributions and helpfulness were appreciated by
       people around the world.  That work continues to improve the
       lives of thousands every day.</para>
   </listitem>
 
   <listitem>
     <para>&a.alexbl.email; (2006 - 2011; RIP 2012)</para>
 
     <para><link
 	xlink:href="http://www.legacy.com/obituaries/sfgate/obituary.aspx?pid=159801494">Alexander</link>
       was best known as a major contributor to &os;'s
       <application>Python</application> ports and a founding member of
       &a.python; as well as his work on
       <application>XMMS2</application>.</para>
   </listitem>
 
   <listitem>
     <para>&a.jb.email; (1997 - 2009; RIP 2009)</para>
 
     <para><link
 	xlink:href="http://hub.opensolaris.org/bin/view/Community+Group+ogb/In+Memoriam">John</link>
       made major contributions to FreeBSD, the best known of which is
       the import of the &man.dtrace.1; code.  John's unique sense of
       humor and plain-spokenness either ruffled feathers or made him
       quick friends.  At the end of his life, he had moved to a rural
       area and was attempting to live with as minimal impact to the
       planet as possible, while at the same time still working in the
       high-tech area.</para>
   </listitem>
 
   <listitem>
     <para>&a.jmz.email; (1994 - 2009; RIP 2009)</para>
 
     <para><link
 	xlink:href="http://www.obs-besancon.fr/article.php3?id_article=323">Jean-Marc</link>
       was an astrophysicist who made important contributions to the
       modeling of the atmospheres of both planets and comets at
       <link xlink:href="http://www.obs-besancon.fr/">l'Observatoire de
 	Besan&ccedil;on</link> in Besan&ccedil;on, France.  While
       there, he participated in the conception and construction of the
       Vega tricanal spectrometer that studied Halley's Comet.  He had
       also been a long-time contributor to FreeBSD.</para>
   </listitem>
 
   <listitem>
     <para>&a.itojun.email; (1997 - 2001; RIP 2008)</para>
 
     <para>Known to everyone as <link
 	xlink:href="http://astralblue.livejournal.com/350702.html">itojun</link>,
       Jun-ichiro Hagino was a core researcher at the
       <link xlink:href="http://www.kame.net/">KAME Project</link>,
       which aimed to provide IPv6 and IPsec technology in freely
       redistributable form.  Much of this code was incorporated into
       FreeBSD.  Without his efforts, the state of IPv6 on the Internet
       would be much different.</para>
   </listitem>
 
   <listitem>
     <para>&a.cg.email; (1999 - 2005; RIP 2005)</para>
 
     <para><link xlink:href="http://www.dbsi.org/cam/">Cameron</link>
       was a unique individual who contributed to the project despite
       serious physical disabilities.  He was responsible for a
       complete rewrite of our sound system during the late 1990s.
       Many of those who corresponded with him had no idea of his
       limited mobility, due to his cheerful spirit and willingness to
       help others.</para>
   </listitem>
 
   <listitem>
     <para>&a.alane.email; (2002 - 2003; RIP 2003)</para>
 
     <para><link
 	xlink:href="http://freebsd.kde.org/memoriam/alane.php">Alan</link>
       was a major contributor to the KDE on FreeBSD group.  In
       addition, he maintained many other difficult and time-consuming
       ports such as <application>autoconf</application>,
       <application>CUPS</application>, and
       <application>python</application>.  Alan's path was not an easy
       one but his passion for FreeBSD, and dedication to programming
       excellence, won him many friends.</para>
   </listitem>
 </itemizedlist>
diff --git a/en_US.ISO8859-1/articles/ldap-auth/article.xml b/en_US.ISO8859-1/articles/ldap-auth/article.xml
index d1957adb4f..26ecdf7f29 100644
--- a/en_US.ISO8859-1/articles/ldap-auth/article.xml
+++ b/en_US.ISO8859-1/articles/ldap-auth/article.xml
@@ -1,972 +1,972 @@
 <?xml version="1.0" encoding="iso-8859-1"?>
 <!DOCTYPE article PUBLIC "-//FreeBSD//DTD DocBook XML V5.0-Based Extension//EN"
 	"http://www.FreeBSD.org/XML/share/xml/freebsd50.dtd">
 <article xmlns="http://docbook.org/ns/docbook"
   xmlns:xlink="http://www.w3.org/1999/xlink" version="5.0"
   xml:lang="en">
   <info>
     <title>LDAP Authentication</title>
 
     <authorgroup>
       <author>
 	<personname>
 	  <firstname>Toby</firstname>
 	  <surname>Burress</surname>
 	</personname>
 	<affiliation>
 	  <address>
 	    <email>kurin@causa-sui.net</email>
 	  </address>
 	</affiliation>
       </author>
     </authorgroup>
 
     <copyright>
       <year>2007</year>
       <year>2008</year>
       <holder>The FreeBSD Documentation Project</holder>
     </copyright>
 
     <legalnotice xml:id="trademarks" role="trademarks">
       &tm-attrib.freebsd;
       &tm-attrib.general;
     </legalnotice>
 
     <pubdate>$FreeBSD$</pubdate>
 
     <releaseinfo>$FreeBSD$</releaseinfo>
 
     <abstract>
       <para>This document is intended as a guide for the configuration
 	of an LDAP server (principally an
 	<application>OpenLDAP</application> server) for authentication
 	on &os;.  This is useful for situations where many servers
 	need the same user accounts, for example as a replacement for
 	<application>NIS</application>.</para>
     </abstract>
   </info>
 
   <sect1 xml:id="preface">
     <title>Preface</title>
 
     <para>This document is intended to give the reader enough of an
       understanding of LDAP to configure an LDAP server.  This
       document will attempt to provide an explanation of
       <package>net/nss_ldap</package> and
       <package>security/pam_ldap</package>, which client machines
       use to work with the LDAP server.</para>
 
     <para>When finished, the reader should be able to configure and
       deploy a &os; server that can host an LDAP directory, and to
       configure and deploy a &os; server which can authenticate
       against an LDAP directory.</para>
 
     <para>This article is not intended to be an exhaustive account of
       the security, robustness, or best practice considerations for
       configuring LDAP or the other services discussed herein.  While
       the author takes care to do everything correctly, they do not
       address security issues beyond a general scope.  This article
       should be considered to lay the theoretical groundwork only, and
       any actual implementation should be accompanied by careful
       requirement analysis.</para>
   </sect1>
 
   <sect1 xml:id="ldap">
     <title>Configuring LDAP</title>
 
     <para>LDAP stands for <quote>Lightweight Directory Access
 	Protocol</quote> and is a subset of the X.500 Directory Access
       Protocol.  Its most recent specifications are in <link
 	xlink:href="http://www.ietf.org/rfc/rfc4510.txt">RFC4510</link>
       and friends.  Essentially it is a database that expects to be
       read from more often than it is written to.</para>
 
     <para>The LDAP server <link
 	xlink:href="http://www.openldap.org/">OpenLDAP</link> will be
       used in the examples in this document; while the principles here
       should be generally applicable to many different servers, most
       of the concrete administration is
       <application>OpenLDAP</application>-specific.  There are several
       server versions in ports, for example
       <package>net/openldap24-server</package>.  Client machines will
       need the corresponding <package>net/openldap24-client</package>
       libraries.</para>
 
     <para>There are (basically) two areas of the LDAP service which
       need configuration.  The first is setting up a server to receive
       connections properly, and the second is adding entries to the
       server's directory so that &os; tools know how to interact with
       it.</para>
 
     <sect2 xml:id="ldap-connect">
       <title>Setting Up the Server for Connections</title>
 
       <note>
 	<para>This section is specific to
 	  <application>OpenLDAP</application>.  If you are using
 	  another server, you will need to consult that server's
 	  documentation.</para>
       </note>
 
       <sect3 xml:id="ldap-connect-install">
 	<title>Installing <application>OpenLDAP</application></title>
 
 	<para>First, install
 	  <application>OpenLDAP</application>:</para>
 
 	<example xml:id="oldap-install">
 	  <title>Installing
 	    <application>OpenLDAP</application></title>
 
 	  <screen>&prompt.root; <userinput>cd /usr/ports/net/openldap24-server</userinput>
 &prompt.root; <userinput>make install clean</userinput></screen>
 	</example>
 
 	<para>This installs the <command>slapd</command> and
 	  <command>slurpd</command> binaries, along with the required
 	  <application>OpenLDAP</application> libraries.</para>
       </sect3>
 
       <sect3 xml:id="ldap-connect-config">
 	<title>Configuring <application>OpenLDAP</application></title>
 
 	<para>Next we must configure
 	  <application>OpenLDAP</application>.</para>
 
 	<para>You will want to require encryption in your connections
 	  to the LDAP server; otherwise your users' passwords will be
 	  transferred in plain text, which is considered insecure.
 	  The tools we will be using support two very similar kinds of
 	  encryption, SSL and TLS.</para>
 
 	<para>TLS stands for <quote>Transport Layer
 	    Security</quote>.  Services that employ TLS tend to
 	  connect on the <emphasis>same</emphasis> ports as the same
 	  services without TLS; thus an SMTP server which supports TLS
 	  will listen for connections on port 25, and an LDAP server
 	  will listen on 389.</para>
 
 	<para>SSL stands for <quote>Secure Sockets Layer</quote>, and
 	  services that implement SSL do <emphasis>not</emphasis>
 	  listen on the same ports as their non-SSL counterparts.
 	  Thus SMTPS listens on port 465 (not 25), HTTPS listens on
 	  443, and LDAPS on 636.</para>
 
 	<para>SSL uses a different port than TLS because
 	  a TLS connection begins as plain text, and switches to
 	  encrypted traffic after the <literal>STARTTLS</literal>
 	  directive.  SSL connections are encrypted from the
 	  beginning.  Other than that there are no substantial
 	  differences between the two.</para>
 
 	<note>
 	  <para>We will adjust <application>OpenLDAP</application> to
 	    use TLS, as SSL is considered deprecated.</para>
 	</note>
 
 	<para>Once <application>OpenLDAP</application> is installed
 	  via ports, the following configuration parameters in
 	  <filename>/usr/local/etc/openldap/slapd.conf</filename> will
 	  enable TLS:</para>
 
 	<programlisting>security ssf=128
 
 TLSCertificateFile /path/to/your/cert.crt
 TLSCertificateKeyFile /path/to/your/cert.key
 TLSCACertificateFile /path/to/your/cacert.crt</programlisting>
 
 
 	<para>Here, <literal>ssf=128</literal> tells
 	  <application>OpenLDAP</application> to require 128-bit
 	  encryption for all connections, both search and update.
 	  This parameter may be configured based on the security needs
 	  of your site, but you rarely need to weaken it, as most LDAP
 	  client libraries support strong encryption.</para>
 
 	<para>The <filename>cert.crt</filename>,
 	  <filename>cert.key</filename>, and
 	  <filename>cacert.crt</filename> files are necessary for
 	  clients to authenticate <emphasis>you</emphasis> as the
 	  valid LDAP server.  If you simply want a server that runs,
 	  you can create a self-signed certificate with
 	  OpenSSL:</para>
 
 	<example xml:id="genrsa">
 	  <title>Generating an RSA Key</title>
 
 	  <screen>&prompt.user; <userinput>openssl genrsa -out cert.key 1024</userinput>
 Generating RSA private key, 1024 bit long modulus
 ....................++++++
 ...++++++
 e is 65537 (0x10001)
 &prompt.user; <userinput>openssl req -new -key cert.key -out cert.csr</userinput></screen>
 	</example>
 
 	<para>At this point you should be prompted for some values.
 	  You may enter whatever values you like; however, it is
 	  important the <quote>Common Name</quote> value be the fully
 	  qualified domain name of the
 	  <application>OpenLDAP</application> server.  In our case,
 	  and the examples here, the server is
 	  <replaceable>server.example.org</replaceable>.  Incorrectly
 	  setting this value will cause clients to fail when making
 	  connections.  This can be the cause of great frustration, so
 	  ensure that you follow these steps closely.</para>
 
 	<para>Finally, the certificate signing request needs to be
 	  signed:</para>
 
 	<example xml:id="self-sign">
 	  <title>Self-signing the Certificate</title>
 
 	  <screen>&prompt.user; <userinput>openssl x509 -req -in cert.csr -days 365 -signkey cert.key -out cert.crt</userinput>
 Signature ok
 subject=/C=AU/ST=Some-State/O=Internet Widgits Pty Ltd
 Getting Private key</screen>
 	</example>
 
 	<para>This will create a self-signed certificate that can be
 	  used for the directives in <filename>slapd.conf</filename>,
 	  where <filename>cert.crt</filename> and
 	  <filename>cacert.crt</filename> are the same file.  If you
 	  are going to use many <application>OpenLDAP</application>
 	  servers (for replication via <literal>slurpd</literal>) you
 	  will want to see <xref linkend="ssl-ca"/> to generate a CA
 	  key and use it to sign individual server
 	  certificates.</para>
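 
 	<para>Before pointing <command>slapd</command> at the new
 	  certificate, it can be worth confirming that the subject
 	  carries the expected Common Name and that the validity
 	  period is what you intended.  The following is a minimal
 	  check, assuming the file names used in the examples
 	  above:</para>
 
 	<screen>&prompt.user; <userinput>openssl x509 -in cert.crt -noout -subject -dates</userinput></screen>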
 
 	<para>Once this is done, put the following in
 	  <filename>/etc/rc.conf</filename>:</para>
 
 	<programlisting>slapd_enable="YES"</programlisting>
 
 	<para>Then run <userinput>/usr/local/etc/rc.d/slapd
 	  start</userinput>.  This should start
 	  <application>OpenLDAP</application>.  Confirm that it is
 	  listening on port 389 with:</para>
 
 	<screen>&prompt.user; <userinput>sockstat -4 -p 389</userinput>
 ldap     slapd      3261  7  tcp4   *:389                 *:*</screen>
       </sect3>
 
       <sect3 xml:id="ldap-connect-client">
 	<title>Configuring the Client</title>
 
 	<para>Install the <package>net/openldap24-client</package>
 	  port for the <application>OpenLDAP</application> libraries.
 	  The client machines will always have
 	  <application>OpenLDAP</application> libraries since that is
 	  all <package>security/pam_ldap</package> and
 	  <package>net/nss_ldap</package> support, at least for the
 	  moment.</para>
 
 	<para>The configuration file for the
 	  <application>OpenLDAP</application> libraries is
 	  <filename>/usr/local/etc/openldap/ldap.conf</filename>.
 	  Edit this file to contain the following values:</para>
 
 	<programlisting>base dc=example,dc=org
 uri ldap://server.example.org/
 ssl start_tls
 tls_cacert /path/to/your/cacert.crt</programlisting>
 
 	<note>
 	  <para>It is important that your clients have access to
 	    <filename>cacert.crt</filename>, otherwise they will not
 	    be able to connect.</para>
 	</note>
 
 	<note>
 	  <para>There are two files called
 	    <filename>ldap.conf</filename>.  The first is this file,
 	    which is for the <application>OpenLDAP</application>
 	    libraries and defines how to talk to the server.  The
 	    second is <filename>/usr/local/etc/ldap.conf</filename>,
 	    and is for <application>pam_ldap</application>.</para>
 	</note>
 
 	<para>At this point you should be able to run
 	  <userinput>ldapsearch -Z</userinput> on the client machine;
 	  <option>-Z</option> means <quote>use TLS</quote>.  If you
 	  encounter an error, then something is configured incorrectly; most
 	  likely it is your certificates.  Use &man.openssl.1;'s
 	  <command>s_client</command> and <command>s_server</command>
 	  to ensure you have them configured and signed
 	  properly.</para>
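 
 	<para>As a rough sketch of such a check, the commands below
 	  first verify the server certificate against the CA
 	  certificate and then attempt a <literal>STARTTLS</literal>
 	  handshake with the server.  The
 	  <option>-starttls ldap</option> option is an assumption
 	  about your <application>OpenSSL</application> version; it
 	  is only present in newer releases (1.1.1 and later).  The
 	  paths are the placeholders used earlier, and the first
 	  command must be run where both files are available:</para>
 
 	<screen>&prompt.user; <userinput>openssl verify -CAfile /path/to/your/cacert.crt /path/to/your/cert.crt</userinput>
 &prompt.user; <userinput>openssl s_client -connect server.example.org:389 -starttls ldap -CAfile /path/to/your/cacert.crt</userinput></screen>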
       </sect3>
     </sect2>
 
     <sect2 xml:id="ldap-database">
       <title>Entries in the Database</title>
 
       <para>Authentication against an LDAP directory is generally
 	accomplished by attempting to bind to the directory as the
 	connecting user.  This is done by establishing a
 	<quote>simple</quote> bind on the directory with the user name
 	supplied.  If there is an entry with the
 	<literal>uid</literal> equal to the user name and that entry's
 	<literal>userPassword</literal> attribute matches the password
 	supplied, then the bind is successful.</para>
 
       <para>The first thing we have to do is figure out where in
 	the directory our users will live.</para>
 
       <para>The base entry for our database is
 	<literal>dc=example,dc=org</literal>.  The default location
 	for users that most clients seem to expect is something like
 	<literal>ou=people,<replaceable>base</replaceable></literal>,
 	so that is what will be used here.  However, keep in mind that
 	this is configurable.</para>
 
       <para>So the LDIF entry for the <literal>people</literal>
 	organizational unit will look like:</para>
 
       <programlisting>dn: ou=people,dc=example,dc=org
 objectClass: top
 objectClass: organizationalUnit
 ou: people</programlisting>
 
       <para>All users will be created as subentries of this
 	organizational unit.</para>
 
       <para>Some thought might be given to the object class your users
 	will belong to.  Most tools by default will use
 	<literal>person</literal>, which is fine if you simply want to
 	provide entries against which to authenticate.  However, if
 	you are going to store user information in the LDAP database
 	as well, you will probably want to use
 	<literal>inetOrgPerson</literal>, which has many useful
 	attributes.  In either case, the relevant schemas need to be
 	loaded in <filename>slapd.conf</filename>.</para>
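 
       <para>As an illustration, the lines below load the schemas
 	that define the object classes used in this article
 	(<literal>person</literal>, <literal>inetOrgPerson</literal>,
 	<literal>posixAccount</literal>, and
 	<literal>posixGroup</literal>).  The paths are an assumption
 	based on the usual layout of the
 	<application>OpenLDAP</application> port; adjust them if
 	your installation differs:</para>
 
       <programlisting>include /usr/local/etc/openldap/schema/core.schema
 include /usr/local/etc/openldap/schema/cosine.schema
 include /usr/local/etc/openldap/schema/inetorgperson.schema
 include /usr/local/etc/openldap/schema/nis.schema</programlisting>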
 
       <para>For this example we will use the <literal>person</literal>
 	object class.  If you are using
 	<literal>inetOrgPerson</literal>, the steps are basically
 	identical, except that the <literal>sn</literal> attribute is
 	required.</para>
 
       <para>To add a user <literal>tuser</literal>, the LDIF would
 	be:</para>
 
       <programlisting>dn: uid=tuser,ou=people,dc=example,dc=org
 objectClass: person
 objectClass: posixAccount
 objectClass: shadowAccount
 objectClass: top
 uidNumber: 10000
 gidNumber: 10000
 homeDirectory: /home/tuser
 loginShell: /bin/csh
 uid: tuser
 cn: tuser</programlisting>
 
       <para>I start my LDAP users' UIDs at 10000 to avoid collisions
 	with system accounts; you can configure whatever number you
 	wish here, as long as it is less than 65536.</para>
 
       <para>We also need group entries.  They are as configurable as
 	user entries, but we will use the defaults below:</para>
 
       <programlisting>dn: ou=groups,dc=example,dc=org
 objectClass: top
 objectClass: organizationalUnit
 ou: groups
 
 dn: cn=tuser,ou=groups,dc=example,dc=org
 objectClass: posixGroup
 objectClass: top
 gidNumber: 10000
 cn: tuser</programlisting>
 
       <para>To enter these into your database, you can use
 	<command>slapadd</command> or <command>ldapadd</command> on a
 	file containing these entries.  Alternatively, you can use
 	<package>sysutils/ldapvi</package>.</para>
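 
       <para>For instance, assuming the entries above were saved in
 	a file named <filename>entries.ldif</filename> and that the
 	directory administrator is
 	<literal>cn=Manager,dc=example,dc=org</literal> (both names
 	are placeholders, not taken from this article's
 	configuration), they could be loaded over TLS with:</para>
 
       <screen>&prompt.user; <userinput>ldapadd -ZZ -x -D "cn=Manager,dc=example,dc=org" -W -f entries.ldif</userinput></screen>
 
       <para>Here <option>-ZZ</option> requires a successful
 	<literal>STARTTLS</literal>, <option>-x</option> selects a
 	simple bind, and <option>-W</option> prompts for the
 	password instead of taking it on the command line.</para>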
 
       <para>The <command>ldapsearch</command> utility on the client
 	machine should now return these entries.  If it does, your
 	database is properly configured to be used as an LDAP
 	authentication server.</para>
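 
       <para>A quick way to check is to search for the test user
 	directly.  This sketch assumes anonymous reads are
 	permitted and uses the base and user name from the examples
 	above:</para>
 
       <screen>&prompt.user; <userinput>ldapsearch -Z -x -b "ou=people,dc=example,dc=org" "(uid=tuser)"</userinput></screen>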
     </sect2>
   </sect1>
 
   <sect1 xml:id="client">
     <title>Client Configuration</title>
 
     <para>The client should already have
       <application>OpenLDAP</application> libraries from <xref
 	linkend="ldap-connect-client"/>, but if you are installing
       several client machines you will need to install
       <package>net/openldap24-client</package> on each of them.</para>
 
     <para>&os; requires two ports to be installed to authenticate
       against an LDAP server, <package>security/pam_ldap</package> and
       <package>net/nss_ldap</package>.</para>
 
     <sect2 xml:id="client-auth">
       <title>Authentication</title>
 
       <para><package>security/pam_ldap</package> is configured via
 	<filename>/usr/local/etc/ldap.conf</filename>.</para>
 
       <note>
 	<para>This is a <emphasis>different file</emphasis> than the
 	  <application>OpenLDAP</application> library functions'
 	  configuration file,
 	  <filename>/usr/local/etc/openldap/ldap.conf</filename>;
 	  however, it takes many of the same options; in fact it is a
 	  superset of that file.  For the rest of this section,
 	  references to <filename>ldap.conf</filename> will mean
 	  <filename>/usr/local/etc/ldap.conf</filename>.</para>
       </note>
 
       <para>Thus, we will want to copy all of our original
 	configuration parameters from
 	<filename>openldap/ldap.conf</filename> to the new
 	<filename>ldap.conf</filename>.  Once this is done, we want to
 	tell <package>security/pam_ldap</package> what to look for on
 	the directory server.</para>
 
       <para>We are identifying our users with the
 	<literal>uid</literal> attribute.  To configure this (though
 	it is the default), set the
 	<literal>pam_login_attribute</literal> directive in
 	<filename>ldap.conf</filename>:</para>
 
       <example xml:id="set-pam-login-attr">
 	<title>Setting <literal>pam_login_attribute</literal></title>
 
 	<programlisting>pam_login_attribute uid</programlisting>
       </example>
 
       <para>With this set, <package>security/pam_ldap</package> will
 	search the entire LDAP directory under <literal>base</literal>
 	for the value
 	<literal>uid=<replaceable>username</replaceable></literal>.
 	If it finds one and only one entry, it will attempt to bind as
 	that user with the password it was given.  If it binds
 	correctly, then it will allow access.  Otherwise it will
 	fail.</para>
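 
       <para>The same two steps can be reproduced by hand when
 	troubleshooting.  The commands below are only an
 	approximation of what <package>security/pam_ldap</package>
 	does internally; they search for the entry and then attempt
 	a bind as that user, using the example user
 	<literal>tuser</literal>.  The attribute list
 	<literal>1.1</literal> asks the server to return no
 	attributes, so only the matching DN is printed:</para>
 
       <screen>&prompt.user; <userinput>ldapsearch -Z -x -b "dc=example,dc=org" "(uid=tuser)" 1.1</userinput>
 &prompt.user; <userinput>ldapwhoami -Z -x -D "uid=tuser,ou=people,dc=example,dc=org" -W</userinput></screen>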
 
       <para>Users whose shell is not in
 	<filename>/etc/shells</filename> will not be able to log in.
 	This is particularly important when
 	<application>Bash</application> is set as the user shell on
 	the LDAP server.  <application>Bash</application> is not
 	included with a default installation of &os;.  When installed
 	from a package or port, it is located at
 	<filename>/usr/local/bin/bash</filename>.  Verify that the
 	path to the shell on the server is set correctly:</para>
 
       <screen>&prompt.user; <userinput>getent passwd <replaceable>username</replaceable></userinput></screen>
 
       <para>There are two choices when the output shows
 	<literal>/bin/bash</literal> in the last column.  The first is
 	to change the user's entry on the LDAP server to
 	<filename>/usr/local/bin/bash</filename>.  The second option
 	is to create a symlink on the LDAP client computer so
 	<application>Bash</application> is found at the correct
 	location:</para>
 
       <screen>&prompt.root; <userinput>ln -s /usr/local/bin/bash /bin/bash</userinput></screen>
 
       <para>Make sure that <filename>/etc/shells</filename> contains
 	entries for both <literal>/usr/local/bin/bash</literal> and
 	<literal>/bin/bash</literal>.  The user will then be able to
 	log in to the system with <application>Bash</application> as
 	their shell.</para>
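 
       <para>For example, if both paths are in use, the relevant
 	lines of <filename>/etc/shells</filename> would read:</para>
 
       <programlisting>/bin/bash
 /usr/local/bin/bash</programlisting>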
 
       <sect3 xml:id="client-auth-pam">
 	<title>PAM</title>
 
 	<para>PAM, which stands for <quote>Pluggable Authentication
 	    Modules</quote>, is the method by which &os; authenticates
 	  most of its sessions.  To tell &os; we wish to use an LDAP
 	  server, we will have to add a line to the appropriate PAM
 	  file.</para>
 
 	<para>Most of the time the appropriate PAM file is
 	  <filename>/etc/pam.d/sshd</filename>, if you want to use
 	  <application>SSH</application> (remember to set the relevant
 	  options in <filename>/etc/ssh/sshd_config</filename>,
 	  otherwise <application>SSH</application> will not use
 	  PAM).</para>
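 
 	<para>As a sketch, the setting that usually matters in
 	  <filename>/etc/ssh/sshd_config</filename> is
 	  <literal>UsePAM</literal>.  It is assumed here that the
 	  option may have been turned off locally; if it is already
 	  enabled on your system, no change is needed:</para>
 
 	<programlisting>UsePAM yes</programlisting>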
 
 	<para>To use PAM for authentication, add the line</para>
 
 	<programlisting>auth  sufficient  /usr/local/lib/pam_ldap.so  no_warn</programlisting>
 
 	<para>Exactly where this line shows up in the file and which
 	  options appear in the fourth column determine the exact
 	  behavior of the authentication mechanism; see
 	  &man.pam.d.5;.</para>
 
 	<para>With this configuration you should be able to
 	  authenticate a user against an LDAP directory.
 	  <application>PAM</application> will perform a bind with your
 	  credentials, and if successful will tell
 	  <application>SSH</application> to allow access.</para>
 
 	<para>However, it is not a good idea to allow
 	  <emphasis>every</emphasis> user in the directory into
 	  <emphasis>every</emphasis> client machine.  With the current
 	  configuration, all that a user needs to log into a machine
 	  is an LDAP entry.  Fortunately there are a few ways to
 	  restrict user access.</para>
 
 	<para><filename>ldap.conf</filename> supports a
 	  <literal>pam_groupdn</literal> directive; every account that
 	  connects to this machine needs to be a member of the group
 	  specified here.  For example, if you have</para>
 
 	<programlisting>pam_groupdn cn=servername,ou=accessgroups,dc=example,dc=org</programlisting>
 
 	<para>in <filename>ldap.conf</filename>, then only members of
 	  that group will be able to log in.  There are a few things
 	  to bear in mind, however.</para>
 
 	<para>Members of this group are specified in one or more
 	  <literal>memberUid</literal> attributes, and each attribute
 	  must have the full distinguished name of the member.  So
 	  <literal>memberUid: someuser</literal> will not work; it
 	  must be:</para>
 
 	<programlisting>memberUid: uid=someuser,ou=people,dc=example,dc=org</programlisting>
 
 	<para>Additionally, this directive is not checked in PAM
 	  during authentication; it is checked during account
 	  management, so you will need a second line in your PAM files
 	  under <literal>account</literal>.  This will require, in
 	  turn, <emphasis>every</emphasis> user to be listed in the
 	  group, which is not necessarily what we want.  To avoid
 	  blocking users that are not in LDAP, you should enable the
 	  <literal>ignore_unknown_user</literal> option.  Finally,
 	  you should set the
 	  <literal>ignore_authinfo_unavail</literal> option so that
 	  you are not locked out of every computer when the LDAP
 	  server is unavailable.</para>
 
 	<para>Your <filename>pam.d/sshd</filename> might then end up
 	  looking like this:</para>
 
 	<example xml:id="pam">
 	  <title>Sample <filename>pam.d/sshd</filename></title>
 
 	  <programlisting>auth            required        pam_nologin.so          no_warn
 auth            sufficient      pam_opie.so             no_warn no_fake_prompts
 auth            requisite       pam_opieaccess.so       no_warn allow_local
 auth            sufficient      /usr/local/lib/pam_ldap.so      no_warn
 auth            required        pam_unix.so             no_warn try_first_pass
 
 account         required        pam_login_access.so
 account         required        /usr/local/lib/pam_ldap.so      no_warn ignore_authinfo_unavail ignore_unknown_user</programlisting>
 	</example>
 
 	<note>
 	  <para>Since we are adding these lines specifically to
 	    <filename>pam.d/sshd</filename>, this will only have an
 	    effect on <application>SSH</application> sessions.  LDAP
 	    users will be unable to log in at the console.  To change
 	    this behavior, examine the other files in
 	    <filename>/etc/pam.d</filename> and modify them
 	    accordingly.</para>
 	</note>
       </sect3>
     </sect2>
 
     <sect2 xml:id="client-nss">
       <title>Name Service Switch</title>
 
       <para><application>NSS</application> is the service that maps
 	attributes to names.  So, for example, if a file is owned by
 	user <literal>1001</literal>, an application will query
 	<application>NSS</application> for the name of
 	<literal>1001</literal>, and it might get
 	<literal>bob</literal> or <literal>ted</literal> or whatever
 	the user's name is.</para>
 
       <para>Now that our user information is kept in LDAP, we need to
 	tell <application>NSS</application> to look there when
 	queried.</para>
 
       <para>The <package>net/nss_ldap</package> port does this.  It
 	uses the same configuration file as
 	<package>security/pam_ldap</package>, and should not need any
 	extra parameters once it is installed.  Instead, what is left
 	is simply to edit <filename>/etc/nsswitch.conf</filename> to
 	take advantage of the directory.  Simply replace the following
 	lines:</para>
 
       <programlisting>group: compat
 passwd: compat</programlisting>
 
       <para>with</para>
 
       <programlisting>group: files ldap
 passwd: files ldap</programlisting>
 
       <para>This will allow you to map usernames to UIDs and UIDs to
 	usernames.</para>
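 
       <para>To confirm that lookups now reach the directory, query
 	the example user and group through
 	<application>NSS</application>.  The names are the ones
 	created earlier in this article, and each command should
 	report the values stored in LDAP:</para>
 
       <screen>&prompt.user; <userinput>getent passwd tuser</userinput>
 &prompt.user; <userinput>getent group tuser</userinput>
 &prompt.user; <userinput>id tuser</userinput></screen>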
 
       <para>Congratulations!  You should now have working LDAP
 	authentication.</para>
     </sect2>
 
     <sect2 xml:id="caveats">
       <title>Caveats</title>
 
       <para>Unfortunately, as of the time this was written &os; did
 	not support changing user passwords with &man.passwd.1;.
-	Because of this, most administrators are left to implement a
+	As a result of this, most administrators are left to implement a
 	solution themselves.  I provide some examples here.  Note that
 	if you write your own password change script, there are some
 	security issues you should be made aware of; see <xref
 	  linkend="security-passwd"/>.</para>
 
       <example xml:id="chpw-shell">
 	<title>Shell Script for Changing Passwords</title>
 
 	<programlisting><![CDATA[#!/bin/sh
 
 stty -echo
 read -p "Old Password: " oldp; echo
 read -p "New Password: " np1; echo
 read -p "Retype New Password: " np2; echo
 stty echo
 
 if [ "$np1" != "$np2" ]; then
   echo "Passwords do not match."
   exit 1
 fi
 
 ldappasswd -D uid="$USER",ou=people,dc=example,dc=org \
   -w "$oldp" \
   -a "$oldp" \
   -s "$np1"]]></programlisting>
       </example>
 
       <caution>
 	<para>This script does hardly any error checking, but more
 	  importantly, it is very cavalier about how it stores your
 	  passwords.  If you do anything like this, at least adjust
 	  the <literal>security.bsd.see_other_uids</literal> sysctl
 	  value:</para>
 
 	<screen>&prompt.root; <userinput>sysctl security.bsd.see_other_uids=0</userinput></screen>
       </caution>
 
       <para>A more flexible (and probably more secure) approach can be
 	used by writing a custom program, or even a web interface.
 	The following is part of a <application>Ruby</application>
 	library that can change LDAP passwords.  It sees use both on
 	the command line, and on the web.</para>
 
       <example xml:id="chpw-ruby">
 	<title>Ruby Script for Changing Passwords</title>
 
 	<programlisting><![CDATA[require 'ldap'
 require 'base64'
 require 'digest'
 require 'password' # ruby-password
 
 ldap_server = "ldap.example.org"
 luser = "uid=#{ENV['USER']},ou=people,dc=example,dc=org"
 
 # get the new password, check it, and create a salted hash from it
 def get_password
   pwd1 = Password.get("New Password: ")
   pwd2 = Password.get("Retype New Password: ")
 
   raise if pwd1 != pwd2
   pwd1.check # check password strength
 
   salt = rand.to_s.gsub(/0\./, '')
   pass = pwd1.to_s
   hash = "{SSHA}" + Base64.strict_encode64(Digest::SHA1.digest("#{pass}#{salt}") + salt)
   return hash
 end
 
 oldp = Password.get("Old Password: ")
 newp = get_password
 
 # We'll just replace it.  That we can bind proves that we either know
 # the old password or are an admin.
 
 replace = LDAP::Mod.new(LDAP::LDAP_MOD_REPLACE | LDAP::LDAP_MOD_BVALUES,
                         "userPassword",
                         [newp])
 
 conn = LDAP::SSLConn.new(ldap_server, 389, true)
 conn.set_option(LDAP::LDAP_OPT_PROTOCOL_VERSION, 3)
 conn.bind(luser, oldp)
 conn.modify(luser, [replace])]]></programlisting>
       </example>
 
       <para>Although not guaranteed to be free of security holes (the
 	password is kept in memory, for example) this is cleaner and
 	more flexible than a simple <command>sh</command>
 	script.</para>
     </sect2>
   </sect1>
 
   <sect1 xml:id="secure">
     <title>Security Considerations</title>
 
     <para>Now that your machines (and possibly other services) are
       authenticating against your LDAP server, this server needs to be
       protected at least as well as
       <filename>/etc/master.passwd</filename> would be on a regular
       server, and possibly even more so since a broken or cracked LDAP
       server would break every client service.</para>
 
     <para>Remember, this section is not exhaustive.  You should
       continually review your configuration and procedures for
       improvements.</para>
 
     <sect2 xml:id="secure-readonly">
       <title>Setting Attributes Read-only</title>
 
       <para>Several attributes in LDAP should be read-only.  If left
 	writable by the user, for example, a user could change his
 	<literal>uidNumber</literal> attribute to <literal>0</literal>
 	and get <systemitem class="username">root</systemitem>
 	access!</para>
 
       <para>To begin with, the <literal>userPassword</literal>
 	attribute should not be world-readable.  By default, anyone
 	who can connect to the LDAP server can read this attribute.
 	To disable this, put the following in
 	<filename>slapd.conf</filename>:</para>
 
       <example xml:id="hide-userpass">
 	<title>Hide Passwords</title>
 
 	<programlisting>access to dn.subtree="ou=people,dc=example,dc=org"
   attrs=userPassword
   by self write
   by anonymous auth
   by * none
 
 access to *
   by self write
   by * read</programlisting>
       </example>
 
       <para>This will disallow reading of the
 	<literal>userPassword</literal> attribute, while still
 	allowing users to change their own passwords.</para>
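 
       <para>One way to check the effect, assuming
 	<command>slapd</command> has been restarted with the new
 	configuration, is to request the attribute anonymously.
 	The entry for the example user should be returned without a
 	<literal>userPassword</literal> value:</para>
 
       <screen>&prompt.user; <userinput>ldapsearch -Z -x -b "ou=people,dc=example,dc=org" "(uid=tuser)" userPassword</userinput></screen>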
 
       <para>Additionally, you will want to keep users from changing
 	some of their own attributes.  By default, users can change
 	any attribute, such as <literal>uidNumber</literal>, except
 	for those that the LDAP schemas themselves protect from
 	changes.  To close this hole, modify the above to:</para>
 
       <example xml:id="attrib-readonly">
 	<title>Read-only Attributes</title>
 
 	<programlisting>access to dn.subtree="ou=people,dc=example,dc=org"
   attrs=userPassword
   by self write
   by anonymous auth
   by * none
 
 access to attrs=homeDirectory,uidNumber,gidNumber
   by * read
 
 access to *
   by self write
   by * read</programlisting>
       </example>
 
       <para>This will stop users from being able to masquerade as
 	other users.</para>
     </sect2>
 
     <sect2 xml:id="secure-root">
       <title><systemitem class="username">root</systemitem> Account
 	Definition</title>
 
       <para>Often the <systemitem class="username">root</systemitem>
 	or manager account for the LDAP service will be defined in the
 	configuration file.  <application>OpenLDAP</application>
 	supports this, for example, and it works, but it can lead to
 	trouble if <filename>slapd.conf</filename> is compromised.  It
 	may be better to use this only to bootstrap yourself into
 	LDAP, and then define a <systemitem
 	  class="username">root</systemitem> account there.</para>
 
       <para>Even better is to define accounts that have limited
 	permissions, and omit a <systemitem
 	  class="username">root</systemitem> account entirely.  For
 	example, users that can add or remove user accounts are added
 	to one group, but they cannot themselves change the membership
 	of this group.  Such a security policy would help mitigate the
 	effects of a leaked password.</para>
 
       <sect3 xml:id="manager-acct">
 	<title>Creating a Management Group</title>
 
 	<para>Say you want your IT department to be able to change
 	  home directories for users, but you do not want all of them
 	  to be able to add or remove users.  The way to do this is to
 	  add a group for these admins:</para>
 
 	<example xml:id="manager-acct-dn">
 	  <title>Creating a Management Group</title>
 
 	  <programlisting>dn: cn=homemanagement,dc=example,dc=org
 objectClass: top
 objectClass: posixGroup
 cn: homemanagement
 # gidNumber is required for posixGroup
 gidNumber: 121
 memberUid: uid=tuser,ou=people,dc=example,dc=org
 memberUid: uid=user2,ou=people,dc=example,dc=org</programlisting>
 	</example>
 
 	<para>And then change the permissions attributes in
 	  <filename>slapd.conf</filename>:</para>
 
 	<example xml:id="management-acct-acl">
 	  <title>ACLs for a Home Directory Management Group</title>
 
 	  <programlisting>access to dn.subtree="ou=people,dc=example,dc=org"
   attr=homeDirectory
   by dn="cn=homemanagement,dc=example,dc=org"
   dnattr=memberUid write</programlisting>
 	</example>
 
 	<para>Now <systemitem class="username">tuser</systemitem> and
 	  <systemitem class="username">user2</systemitem> can change
 	  other users' home directories.</para>
 
 	<para>In this example we have given a subset of administrative
 	  power to certain users without giving them power in other
 	  domains.  The idea is that soon no single user account has
 	  the power of a <systemitem
 	    class="username">root</systemitem> account, but every
 	  power root had is held by at least one user.  The <systemitem
 	    class="username">root</systemitem> account then becomes
 	  unnecessary and can be removed.</para>
       </sect3>
     </sect2>
 
     <sect2 xml:id="security-passwd">
       <title>Password Storage</title>
 
       <para>By default <application>OpenLDAP</application> will store
 	the value of the <literal>userPassword</literal> attribute as
 	it stores any other data: in the clear.  Most of the time it
 	is base 64 encoded, which provides enough protection to keep
 	an honest administrator from knowing your password, but little
 	else.</para>
 
       <para>It is a good idea, then, to store passwords in a more
 	secure format, such as SSHA (salted SHA).  This is done by
 	whatever program you use to change users' passwords.</para>
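 
       <para>As a minimal sketch, assuming the
 	<command>slappasswd</command> utility that ships with
 	<application>OpenLDAP</application> is installed, an SSHA
 	value can also be generated by hand and then stored in
 	<literal>userPassword</literal>:</para>
 
       <screen>&prompt.user; <userinput>slappasswd -h '{SSHA}'</userinput></screen>
 
       <para>The utility prompts for the password and prints the
 	hashed value, prefixed with
 	<literal>{SSHA}</literal>.</para>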
     </sect2>
   </sect1>
 
   <appendix xml:id="useful">
     <title>Useful Aids</title>
 
     <para>There are a few other programs that might be useful,
       particularly if you have many users and do not want to configure
       everything manually.</para>
 
     <para><package>security/pam_mkhomedir</package> is a PAM module
       that always succeeds; its purpose is to create home directories
       for users which do not have them.  If you have dozens of client
       servers and hundreds of users, it is much easier to use this and
       set up skeleton directories than to prepare every home
       directory.</para>
 
     <para><package>sysutils/cpu</package> is a &man.pw.8;-like utility
       that can be used to manage users in the LDAP directory.  You can
       call it directly, or wrap scripts around it.  It can handle both
       TLS (with the <option>-x</option> flag) and SSL
       (directly).</para>
 
     <para><package>sysutils/ldapvi</package> is a great utility for
       editing LDAP values in an LDIF-like syntax.  The directory (or
       subsection of the directory) is presented in the editor chosen
       by the <envar>EDITOR</envar> environment variable.  This makes
       it easy to enable large-scale changes in the directory without
       having to write a custom tool.</para>
 
     <para><package>security/openssh-portable</package> has the ability
       to contact an LDAP server to verify
       <application>SSH</application> keys.  This is extremely nice if
       you have many servers and do not want to copy your public keys
       across all of them.</para>
   </appendix>
 
   <appendix xml:id="ssl-ca">
     <title><application>OpenSSL</application> Certificates for
       LDAP</title>
 
     <para>If you are hosting two or more LDAP servers, you will
       probably not want to use self-signed certificates, since each
       client will have to be configured to work with each certificate.
       While this is possible, it is not nearly as simple as creating
       your own certificate authority, and signing your servers'
       certificates with that.</para>
 
     <para>The steps here are presented as they are with very little
       attempt at explaining what is going on&mdash;further explanation
       can be found in &man.openssl.1; and its friends.</para>
 
     <para>To create a certificate authority, we simply need a
       self-signed certificate and key.  The steps for this are,
       again:</para>
 
     <example xml:id="make-cert">
       <title>Creating a Certificate</title>
 
       <screen>&prompt.user; <userinput>openssl genrsa -out root.key 2048</userinput>
 &prompt.user; <userinput>openssl req -new -key root.key -out root.csr</userinput>
 &prompt.user; <userinput>openssl x509 -req -days 1024 -in root.csr -signkey root.key -out root.crt</userinput></screen>
     </example>
 
     <para>These will be your root CA key and certificate.  You will
       probably want to encrypt the key and store it in a cool, dry
       place; anyone with access to it can masquerade as one of your
       LDAP servers.</para>
 
     <para>Next, using the first two steps above, create a key
       <filename>ldap-server-one.key</filename> and certificate signing
       request <filename>ldap-server-one.csr</filename>.  Once you sign
       the signing request with <filename>root.key</filename>, you will
       be able to use <filename>ldap-server-one.*</filename> on your
       LDAP servers.</para>
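 
     <para>Spelled out, and assuming the same file names as in the
       text, the two commands might look like this (the 2048-bit key
       size is an assumption of this sketch, not a requirement):</para>
 
     <screen>&prompt.user; <userinput>openssl genrsa -out ldap-server-one.key 2048</userinput>
 &prompt.user; <userinput>openssl req -new -key ldap-server-one.key -out ldap-server-one.csr</userinput></screen>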
 
     <note>
       <para>Do not forget to use the fully qualified domain name for
 	the <quote>common name</quote> attribute when generating the
 	certificate signing request; otherwise clients will reject a
 	connection with you, and it can be very tricky to
 	diagnose.</para>
     </note>
 
     <para>To sign the request, use <option>-CA</option> and
       <option>-CAkey</option> instead of <option>-signkey</option>;
       <option>-CAcreateserial</option> creates the serial number
       file for the CA if it does not exist yet:</para>
 
     <example xml:id="ca-sign">
       <title>Signing as a Certificate Authority</title>
 
       <screen>&prompt.user; <userinput>openssl x509 -req -days 1024 \
 -in ldap-server-one.csr -CA root.crt -CAkey root.key \
 -CAcreateserial -out ldap-server-one.crt</userinput></screen>
     </example>
 
     <para>The resulting file will be the certificate that you can use
       on your LDAP servers.</para>
 
     <para>Finally, for clients to trust all your servers, distribute
       <filename>root.crt</filename> (the
       <emphasis>certificate</emphasis>, not the key!) to each client,
       and specify it in the <literal>TLS_CACERT</literal> directive
       in <filename>ldap.conf</filename>.</para>
   </appendix>
 </article>
diff --git a/en_US.ISO8859-1/articles/linux-emulation/article.xml b/en_US.ISO8859-1/articles/linux-emulation/article.xml
index d3e6c8742e..489b88168c 100644
--- a/en_US.ISO8859-1/articles/linux-emulation/article.xml
+++ b/en_US.ISO8859-1/articles/linux-emulation/article.xml
@@ -1,2545 +1,2545 @@
 <?xml version="1.0" encoding="iso-8859-1"?>
 <!DOCTYPE article PUBLIC "-//FreeBSD//DTD DocBook XML V5.0-Based Extension//EN"
 	"http://www.FreeBSD.org/XML/share/xml/freebsd50.dtd">
 <!-- $FreeBSD$ -->
 <!-- The FreeBSD Documentation Project -->
 <article xmlns="http://docbook.org/ns/docbook"
   xmlns:xlink="http://www.w3.org/1999/xlink" version="5.0"
   xml:lang="en">
   <info>
     <title>&linux; emulation in &os;</title>
 
     <author>
       <personname>
 	<firstname>Roman</firstname>
 	<surname>Divacky</surname>
       </personname>
       <affiliation>
 	<address>
 	  <email>rdivacky@FreeBSD.org</email>
 	</address>
       </affiliation>
     </author>
 
     <legalnotice xml:id="trademarks" role="trademarks">
       &tm-attrib.adobe;
       &tm-attrib.ibm;
       &tm-attrib.freebsd;
       &tm-attrib.linux;
       &tm-attrib.netbsd;
       &tm-attrib.realnetworks;
       &tm-attrib.oracle;
       &tm-attrib.sun;
       &tm-attrib.general;
     </legalnotice>
 
     <pubdate>$FreeBSD$</pubdate>
 
     <releaseinfo>$FreeBSD$</releaseinfo>
 
     <abstract>
       <para>This master's thesis deals with updating the &linux;
 	emulation layer (the so-called
 	<firstterm>Linuxulator</firstterm>).  The task was to update
 	the layer to match the functionality of &linux; 2.6.  As a
 	reference implementation, the &linux; 2.6.16 kernel was
 	chosen.  The concept is loosely based on the NetBSD
 	implementation.  Most of the work was done in the summer of
 	2006 as a part of the Google Summer of Code students program.
 	The focus was on bringing the <firstterm>NPTL</firstterm> (new
 	&posix; thread library) support into the emulation layer,
 	including <firstterm>TLS</firstterm> (thread local storage),
 	<firstterm>futexes</firstterm> (fast user space mutexes),
 	<firstterm>PID mangling</firstterm>, and some other minor
 	things.  Many small problems were identified and fixed in the
 	process.  My work was integrated into the main &os; source
 	repository and will be shipped in the upcoming 7.0R release.
 	We, the emulation development team, are working on making the
 	&linux; 2.6 emulation the default emulation layer in
 	&os;.</para>
     </abstract>
   </info>
 
   <sect1 xml:id="intro">
     <title>Introduction</title>
 
     <para>In the last few years the open source &unix; based operating
       systems started to be widely deployed on server and client
       machines.  Among these operating systems I would like to point
       out two: &os;, for its BSD heritage, time proven code base and
       many interesting features, and &linux;, for its wide user base,
       enthusiastic open developer community and support from large
       companies.  &os; tends to be used on server class machines
       serving heavy duty networking tasks with less usage on desktop
       class machines for ordinary users.  &linux; has the same usage
       on servers, but it is used much more by home based users.
       This leads to a situation where there are many binary only
       programs available for &linux; that lack support for
       &os;.</para>
 
     <para>Naturally, a need for the ability to run &linux; binaries on
       a &os; system arises and this is what this thesis deals with:
       the emulation of the &linux; kernel in the &os; operating
       system.</para>
 
     <para>During the Summer of 2006 Google Inc. sponsored a project
       which focused on extending the &linux; emulation layer (the so
       called Linuxulator) in &os; to include &linux; 2.6 facilities.
       This thesis is written as a part of this project.</para>
   </sect1>
 
   <sect1 xml:id="inside">
     <title>A look inside&hellip;</title>
 
     <para>In this section we are going to describe every operating
       system in question: how they deal with syscalls, trapframes
       etc., all the low-level stuff.  We also describe the way they
       understand common &unix; primitives like what a PID is, what a
       thread is, etc.  In the third subsection we talk about how
       &unix; on &unix; emulation could be done in general.</para>
 
     <sect2 xml:id="what-is-unix">
       <title>What is &unix;</title>
 
       <para>&unix; is an operating system with a long history that has
 	influenced almost every other operating system currently in
 	use.  Starting in the 1960s, its development continues to this
 	day (although in different projects).  &unix; development soon
 	forked into two main branches: the BSD and System III/V
 	families.  They influenced each other while growing toward a
 	common &unix; standard.  Among the contributions originating
 	in BSD we can name virtual memory, TCP/IP networking, FFS, and
 	many others.  The System V branch contributed SysV
 	interprocess communication primitives, copy-on-write, etc.
 	&unix; itself does not exist any more but its ideas have been
 	used by many other operating systems worldwide, thus forming
 	the so-called &unix;-like operating systems.  These days the
 	most influential ones are &linux;, Solaris, and possibly (to
 	some extent) &os;.  There are in-company &unix; derivatives
 	(AIX, HP-UX etc.), but these have been more and more migrated
 	to the aforementioned systems.  Let us summarize typical
 	&unix; characteristics.</para>
     </sect2>
 
     <sect2 xml:id="tech-details">
       <title>Technical details</title>
 
       <para>Every running program constitutes a process that
 	represents a state of the computation.  A running process is
 	divided between kernel-space and user-space.  Some operations
 	can be done only from kernel space (dealing with hardware
 	etc.), but the process should spend most of its lifetime in
 	the user space.  The kernel is where the management of the
 	processes, hardware, and low-level details take place.  The
 	kernel provides a standard unified &unix; API to the user
 	space.  The most important ones are covered below.</para>
 
       <sect3 xml:id="kern-proc-comm">
 	<title>Communication between kernel and user space
 	  process</title>
 
 	<para>The common &unix; API defines a syscall as a way to issue
 	  commands from a user space process to the kernel.  The most
 	  common implementation is either by using an interrupt or a
 	  specialized instruction (think of the
 	  <literal>SYSENTER</literal>/<literal>SYSCALL</literal>
 	  instructions for ia32).  Syscalls are defined by a number.
 	  For example in &os;, the syscall number&nbsp;85 is the
 	  &man.swapon.2; syscall and the syscall number&nbsp;132 is
 	  &man.mkfifo.2;.  Some syscalls need parameters, which are
 	  passed from the user-space to the kernel-space in various
 	  ways (implementation dependent).  Syscalls are
 	  synchronous.</para>
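 
 	<para>As a small illustration (a sketch, not part of the
 	  original thesis), a syscall can be issued by its number
 	  through the generic &man.syscall.2; wrapper.  The example
 	  assumes the <literal>SYS_getpid</literal> constant from
 	  <filename>sys/syscall.h</filename>, which should be
 	  available on both &os; and &linux;:</para>
 
 	<programlisting>#include &lt;sys/syscall.h&gt;
 #include &lt;stdio.h&gt;
 #include &lt;unistd.h&gt;
 
 int
 main(void)
 {
 	/* Issue getpid by number instead of through its libc wrapper. */
 	long raw = syscall(SYS_getpid);
 
 	printf("syscall number %d returned %ld, getpid() returned %ld\n",
 	    SYS_getpid, raw, (long)getpid());
 
 	/* Both paths end in the same kernel entry, so the values match. */
 	return (raw == (long)getpid() ? 0 : 1);
 }</programlisting>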
 
 	<para>Another possible way to communicate is by using a
 	  <firstterm>trap</firstterm>.  Traps occur asynchronously
 	  after some event occurs (division by zero, page fault etc.).
 	  A trap can be transparent for a process (page fault) or can
 	  result in a reaction like sending a
 	  <firstterm>signal</firstterm> (division by zero).</para>
       </sect3>
 
       <sect3 xml:id="proc-proc-comm">
 	<title>Communication between processes</title>
 
 	<para>There are other APIs (System V IPC, shared memory etc.)
 	  but the single most important API is signals.  Signals are
 	  sent by processes or by the kernel and received by
 	  processes.  Some signals can be ignored or handled by a user
 	  supplied routine, some result in a predefined action that
 	  cannot be altered or ignored.</para>
       </sect3>
 
       <sect3 xml:id="proc-mgmt">
 	<title>Process management</title>
 
 	<para>The kernel creates the first process in the system (the
 	  so-called init).  Every running process can create its
 	  identical copy using the &man.fork.2; syscall.  Some
 	  slightly modified versions of this syscall were introduced
 	  but the basic semantics are the same.  Every running process
 	  can morph into some other process using the &man.exec.3;
 	  syscall.  Some modifications of this syscall were introduced
 	  but all serve the same basic purpose.  Processes end their
 	  lives by calling the &man.exit.2; syscall.  Every process is
 	  identified by a unique number called PID.  Every process has
 	  a defined parent (identified by its PID).</para>
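 
 	<para>The life cycle described above can be sketched in a few
 	  lines of C.  This is only an illustration; the path
 	  <filename>/bin/echo</filename> is an assumption about the
 	  host system:</para>
 
 	<programlisting>#include &lt;sys/types.h&gt;
 #include &lt;sys/wait.h&gt;
 #include &lt;stdio.h&gt;
 #include &lt;unistd.h&gt;
 
 int
 main(void)
 {
 	int status;
 	pid_t pid;
 
 	pid = fork();			/* create a copy of this process */
 	if (pid == -1) {
 		perror("fork");
 		return (1);
 	}
 	if (pid == 0) {
 		/* Child: morph into another program. */
 		execl("/bin/echo", "echo", "hello from the child",
 		    (char *)NULL);
 		_exit(127);		/* reached only if execl() failed */
 	}
 
 	/* Parent: wait until the child has exited. */
 	if (waitpid(pid, &amp;status, 0) == -1) {
 		perror("waitpid");
 		return (1);
 	}
 	printf("child %ld of parent %ld exited with status %d\n",
 	    (long)pid, (long)getpid(), WEXITSTATUS(status));
 	return (0);
 }</programlisting>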
       </sect3>
 
       <sect3 xml:id="thread-mgmt">
 	<title>Thread management</title>
 
 	<para>Traditional &unix; does not define any API nor
 	  implementation for threading, while &posix; defines its
 	  threading API but leaves the implementation undefined.
 	  Traditionally there were two ways of implementing threads:
 	  handling them as separate processes (1:1 threading) or
 	  enveloping the whole thread group in one process and
 	  managing the threading in userspace (1:N threading).
 	  Comparing the main features of each approach:</para>
 
 	<para>1:1 threading</para>
 
 	<itemizedlist>
 	  <listitem>
 	    <para>- heavyweight threads</para>
 	  </listitem>
 	  <listitem>
 	    <para>- the scheduling cannot be altered by the user
 	      (slightly mitigated by the &posix; API)</para>
 	  </listitem>
 	  <listitem>
 	    <para>+ no syscall wrapping necessary</para>
 	  </listitem>
 	  <listitem>
 	    <para>+ can utilize multiple CPUs</para>
 	  </listitem>
 	</itemizedlist>
 
 	<para>1:N threading</para>
 
 	<itemizedlist>
 	  <listitem>
 	    <para>+ lightweight threads</para>
 	  </listitem>
 	  <listitem>
 	    <para>+ scheduling can be easily altered by the
 	      user</para>
 	  </listitem>
 	  <listitem>
 	    <para>- syscalls must be wrapped</para>
 	  </listitem>
 	  <listitem>
 	    <para>- cannot utilize more than one CPU</para>
 	  </listitem>
 	</itemizedlist>
       </sect3>
     </sect2>
 
     <sect2 xml:id="what-is-freebsd">
       <title>What is &os;?</title>
 
       <para>The &os; project is one of the oldest open source
 	operating systems currently available for daily use.  It is a
 	direct descendant of the genuine &unix; so it could be claimed
 	that it is a true &unix; although licensing issues do not
 	permit that.  The start of the project dates back to the early
 	1990s when a crew of fellow BSD users patched the 386BSD
 	operating system.  Based on this patchkit a new operating
 	system arose named &os; for its liberal license.  Another
 	group created the NetBSD operating system with different goals
 	in mind.  We will focus on &os;.</para>
 
       <para>&os; is a modern &unix;-based operating system with all
 	the features of &unix;.  Preemptive multitasking, multiuser
 	facilities, TCP/IP networking, memory protection, symmetric
 	multiprocessing support, virtual memory with merged VM and
 	buffer cache, they are all there.  One of the interesting and
 	extremely useful features is the ability to emulate other
 	&unix;-like operating systems.  As of December&nbsp;2006 and
 	7-CURRENT development, the following emulation functionalities
 	are supported:</para>
 
       <itemizedlist>
 	<listitem>
 	  <para>&os;/i386 emulation on &os;/amd64</para>
 	</listitem>
 	<listitem>
 	  <para>&os;/i386 emulation on &os;/ia64</para>
 	</listitem>
 	<listitem>
 	  <para>&linux;-emulation of &linux; operating system on
 	    &os;</para>
 	</listitem>
 	<listitem>
 	  <para>NDIS-emulation of Windows networking drivers
 	    interface</para>
 	</listitem>
 	<listitem>
 	  <para>NetBSD-emulation of NetBSD operating system</para>
 	</listitem>
 	<listitem>
 	  <para>PECoff-support for PECoff &os; executables</para>
 	</listitem>
 	<listitem>
 	  <para>SVR4-emulation of System V revision 4 &unix;</para>
 	</listitem>
       </itemizedlist>
 
       <para>Actively developed emulations are the &linux; layer and
 	various &os;-on-&os; layers.  Others are not supposed to work
 	properly nor be usable these days.</para>
 
       <sect3 xml:id="freebsd-tech-details">
 	<title>Technical details</title>
 
 	<para>&os; is a traditional flavor of &unix; in the sense of
 	  dividing the run of processes into two halves: kernel space
 	  and user space run.  There are two types of process entry to
 	  the kernel: a syscall and a trap.  There is only one way to
 	  return.  In the subsequent sections we will describe the
 	  three gates to/from the kernel.  The whole description
 	  applies to the i386 architecture as the Linuxulator only
 	  exists there but the concept is similar on other
 	  architectures.  The information was taken from [1] and the
 	  source code.</para>
 
 	<sect4 xml:id="freebsd-sys-entries">
 	  <title>System entries</title>
 
 	  <para>&os; has an abstraction called an execution class
 	    loader, which is a wedge into the &man.execve.2; syscall.
 	    This employs a structure <literal>sysentvec</literal>,
 	    which describes an executable ABI.  It contains things
 	    like an errno translation table, a signal translation
 	    table, and various functions to serve syscall needs (stack
 	    fixup, coredumping, etc.).  Every ABI the &os; kernel wants
 	    to support must define this structure, as it is used later
 	    in the syscall processing code and at some other places.
 	    System entries are handled by trap handlers, where we can
 	    access both the kernel-space and the user-space at
 	    once.</para>
 	</sect4>
 
 	<sect4 xml:id="freebsd-syscalls">
 	  <title>Syscalls</title>
 
 	  <para>Syscalls on &os; are issued by executing interrupt
 	    <literal>0x80</literal> with register
 	    <varname>%eax</varname> set to a desired syscall number
 	    with arguments passed on the stack.</para>
 
 	  <para>When a process issues an interrupt
 	    <literal>0x80</literal>, the <literal>int0x80</literal>
 	    syscall trap handler is invoked (defined in
 	    <filename>sys/i386/i386/exception.s</filename>), which
 	    prepares arguments (i.e. copies them on to the stack) for
 	    a call to a C function &man.syscall.2; (defined in
 	    <filename>sys/i386/i386/trap.c</filename>), which
 	    processes the passed in trapframe.  The processing
 	    consists of preparing the syscall (depending on the
 	    <literal>sysentvec</literal> entry), determining whether
 	    the syscall is a 32-bit or a 64-bit one (which changes the
 	    size of the parameters), and then copying the parameters,
 	    including the syscall.  Next, the actual syscall function
 	    is executed with processing of the return code (special
 	    cases for <literal>ERESTART</literal> and
 	    <literal>EJUSTRETURN</literal> errors).  Finally a
 	    <literal>userret()</literal> is scheduled, switching the
 	    process back to the user-space.  The parameters to the
 	    actual syscall handler are passed in the form of
 	    <literal>struct thread *td</literal>, <literal>struct
 	      syscall_args *</literal> arguments where the second
 	    parameter is a pointer to the copied in structure of
 	    parameters.</para>
 	</sect4>
 
 	<sect4 xml:id="freebsd-traps">
 	  <title>Traps</title>
 
 	  <para>Handling of traps in &os; is similar to the handling
 	    of syscalls.  Whenever a trap occurs, an assembler handler
 	    is called.  It is chosen between
 	    <literal>alltraps</literal>,
 	    <literal>alltraps_with_regs_pushed</literal> or
 	    <literal>calltrap</literal> depending on the type of the
 	    trap.  This handler prepares arguments for a call to a C
 	    function <literal>trap()</literal> (defined in
 	    <filename>sys/i386/i386/trap.c</filename>), which then
 	    processes the trap.  After the processing it
 	    might send a signal to the process and/or exit to userland
 	    using <literal>userret()</literal>.</para>
 	</sect4>
 
 	<sect4 xml:id="freebsd-exits">
 	  <title>Exits</title>
 
 	  <para>Exits from kernel to userspace happen using the
 	    assembler routine <literal>doreti</literal> regardless of
 	    whether the kernel was entered via a trap or via a
 	    syscall.  This restores the program status from the stack
 	    and returns to the userspace.</para>
 	</sect4>
 
 	<sect4 xml:id="freebsd-unix-primitives">
 	  <title>&unix; primitives</title>
 
 	  <para>The &os; operating system adheres to the traditional
 	    &unix; scheme, where every process has a unique
 	    identification number, the so-called
 	    <firstterm>PID</firstterm> (Process ID).  PID numbers are
 	    allocated either linearly or randomly ranging from
 	    <literal>0</literal> to <literal>PID_MAX</literal>.  The
 	    allocation of PID numbers is done using linear searching
 	    of PID space.  Every thread in a process receives the same
 	    PID number as a result of the &man.getpid.2; call.</para>
 
 	  <para>There are currently two ways to implement threading in
 	    &os;.  The first is M:N threading and the second is the
 	    1:1 threading model.  The default library used is M:N
 	    threading (<literal>libpthread</literal>) and you can
 	    switch at runtime to 1:1 threading
 	    (<literal>libthr</literal>).  The plan is to switch to the
 	    1:1 library by default soon.  Although those two libraries
 	    use the same kernel primitives, they are accessed through
 	    different APIs.  The M:N library uses the
 	    <literal>kse_*</literal> family of syscalls while the 1:1
 	    library uses the <literal>thr_*</literal> family of
-	    syscalls.  Because of this, there is no general concept of
+	    syscalls.  Due to this, there is no general concept of
 	    thread ID shared between kernel and userspace.  Of course,
 	    both threading libraries implement the pthread thread ID
 	    API.  Every kernel thread (as described by <literal>struct
 	      thread</literal>) has a <literal>td_tid</literal>
 	    identifier but this is not directly accessible from
 	    userland and solely serves the kernel's needs.  It is also
 	    used by the 1:1 threading library as pthread's thread ID
 	    but handling of this is internal to the library and cannot
 	    be relied on.</para>
 
 	  <para>As stated previously there are two implementations of
 	    threading in &os;.  The M:N library divides the work
 	    between kernel space and userspace.  A thread is an entity
 	    that gets scheduled in the kernel but it can represent a
 	    varying number of userspace threads.  M userspace threads
 	    get mapped to N kernel threads thus saving resources while
 	    keeping the ability to exploit multiprocessor parallelism.
 	    Further information about the implementation can be
 	    obtained from the man page or [1].  The 1:1 library
 	    directly maps a userland thread to a kernel thread thus
 	    greatly simplifying the scheme.  Neither of these designs
 	    implements a fairness mechanism (such a mechanism was
 	    implemented but it was removed recently because it caused
 	    serious slowdown and made the code more difficult to deal
 	    with).</para>
 	</sect4>
       </sect3>
     </sect2>
 
     <sect2 xml:id="what-is-linux">
       <title>What is &linux;</title>
 
       <para>&linux; is a &unix;-like kernel originally developed by
 	Linus Torvalds, and now being contributed to by a massive
 	crowd of programmers all around the world.  From its humble
 	beginnings to today, with wide support from companies such as
 	IBM or Google, &linux; is associated with its fast
 	development pace, full hardware support and benevolent
 	dictator model of organization.</para>
 
       <para>&linux; development started in 1991 as a hobbyist project
 	at the University of Helsinki in Finland.  Since then it has
 	obtained all the features of a modern &unix;-like OS:
 	multiprocessing, multiuser support, virtual memory,
 	networking, basically everything is there.  There are also
 	highly advanced features like virtualization etc.</para>
 
       <para>As of 2006 &linux; seems to be the most widely used open
 	source operating system with support from independent software
 	vendors like Oracle, RealNetworks, Adobe, etc.  Most of the
 	commercial software distributed for &linux; can only be
 	obtained in a binary form so recompilation for other operating
 	systems is impossible.</para>
 
       <para>Most of the &linux; development happens in a
 	<application>Git</application> version control system.
 	<application>Git</application> is a distributed system so
 	there is no central source of the &linux; code, but some
 	branches are considered prominent and official.  The version
 	number scheme implemented by &linux; consists of four numbers
 	A.B.C.D.  Currently development happens in 2.6.C.D, where C
 	represents major version, where new features are added or
 	changed while D is a minor version for bugfixes only.</para>
 
       <para>More information can be obtained from [3].</para>
 
       <sect3 xml:id="linux-tech-details">
 	<title>Technical details</title>
 
 	<para>&linux; follows the traditional &unix; scheme of
 	  dividing the run of a process in two halves: the kernel and
 	  user space.  The kernel can be entered in two ways: via a
 	  trap or via a syscall.  The return is handled only in one
 	  way.  The further description applies to &linux;&nbsp;2.6 on
 	  the &i386; architecture.  This information was taken from
 	  [2].</para>
 
 	<sect4 xml:id="linux-syscalls">
 	  <title>Syscalls</title>
 
 	  <para>Syscalls in &linux; are performed (in userspace) using
 	    <literal>syscallX</literal> macros where X substitutes a
 	    number representing the number of parameters of the given
 	    syscall.  This macro expands to code that loads the
 	    <varname>%eax</varname> register with the number of the
 	    syscall and executes interrupt <literal>0x80</literal>.
 	    After this, the syscall return macro is invoked, which
 	    translates negative return values to positive
 	    <literal>errno</literal> values and sets
 	    <literal>res</literal> to <literal>-1</literal> in case of
 	    an error.  Whenever the interrupt <literal>0x80</literal>
 	    is called the process enters the kernel in the system call
 	    trap handler.  This routine saves all registers on the
 	    stack and calls the selected syscall entry.  Note that the
 	    &linux; calling convention expects parameters to the
 	    syscall to be passed via registers as shown here:</para>
 
 	  <orderedlist>
 	    <listitem>
 	      <para>parameter -&gt; <varname>%ebx</varname></para>
 	    </listitem>
 	    <listitem>
 	      <para>parameter -&gt; <varname>%ecx</varname></para>
 	    </listitem>
 	    <listitem>
 	      <para>parameter -&gt; <varname>%edx</varname></para>
 	    </listitem>
 	    <listitem>
 	      <para>parameter -&gt; <varname>%esi</varname></para>
 	    </listitem>
 	    <listitem>
 	      <para>parameter -&gt; <varname>%edi</varname></para>
 	    </listitem>
 	    <listitem>
 	      <para>parameter -&gt; <varname>%ebp</varname></para>
 	    </listitem>
 	  </orderedlist>
 
 	  <para>There are some exceptions to this, where &linux; uses
 	    a different calling convention (most notably the
 	    <literal>clone</literal> syscall).</para>
 	</sect4>
 
 	<sect4 xml:id="linux-traps">
 	  <title>Traps</title>
 
 	  <para>The trap handlers are introduced in
 	    <filename>arch/i386/kernel/traps.c</filename> and most of
 	    these handlers live in
 	    <filename>arch/i386/kernel/entry.S</filename>, where
 	    handling of the traps happens.</para>
 	</sect4>
 
 	<sect4 xml:id="linux-exits">
 	  <title>Exits</title>
 
 	  <para>Return from the syscall is managed by
 	    <literal>syscall_exit</literal>, which checks for the
 	    process having unfinished work, then checks whether we
 	    used user-supplied selectors.  If this happens stack
 	    fixing is applied and finally the registers are restored
 	    from the stack and the process returns to the
 	    userspace.</para>
 	</sect4>
 
 	<sect4 xml:id="linux-unix-primitives">
 	  <title>&unix; primitives</title>
 
 	  <para>In the 2.6 version, the &linux; operating system
 	    redefined some of the traditional &unix; primitives,
 	    notably PID, TID and thread.  PID is defined not to be
 	    unique for every process, so for some processes (threads)
 	    &man.getpid.2; returns the same value.  Unique
 	    identification of a process is provided by TID.  This is
 	    because <firstterm>NPTL</firstterm> (New &posix; Thread
 	    Library) defines threads to be normal processes (so called
 	    1:1 threading).  Spawning a new process in
 	    &linux;&nbsp;2.6 happens using the
 	    <literal>clone</literal> syscall (fork variants are
 	    reimplemented using it).  This clone syscall defines a set
 	    of flags that affect behavior of the cloning process
 	    regarding thread implementation.  The semantics are a bit
 	    fuzzy as there is no single flag telling the syscall to
 	    create a thread.</para>
 
 	  <para>Implemented clone flags are:</para>
 
 	  <itemizedlist>
 	    <listitem>
 	      <para><literal>CLONE_VM</literal> - processes share
 		their memory space</para>
 	    </listitem>
 	    <listitem>
 	      <para><literal>CLONE_FS</literal> - share umask, cwd and
 		namespace</para>
 	    </listitem>
 	    <listitem>
 	      <para><literal>CLONE_FILES</literal> - share open
 		files</para>
 	    </listitem>
 	    <listitem>
 	      <para><literal>CLONE_SIGHAND</literal> - share signal
 		handlers and blocked signals</para>
 	    </listitem>
 	    <listitem>
 	      <para><literal>CLONE_PARENT</literal> - share
 		parent</para>
 	    </listitem>
 	    <listitem>
 	      <para><literal>CLONE_THREAD</literal> - be thread
 		(further explanation below)</para>
 	    </listitem>
 	    <listitem>
 	      <para><literal>CLONE_NEWNS</literal> - new
 		namespace</para>
 	    </listitem>
 	    <listitem>
 	      <para><literal>CLONE_SYSVSEM</literal> - share SysV undo
 		structures</para>
 	    </listitem>
 	    <listitem>
 	      <para><literal>CLONE_SETTLS</literal> - set up TLS at
 		supplied address</para>
 	    </listitem>
 	    <listitem>
 	      <para><literal>CLONE_PARENT_SETTID</literal> - set TID
 		in the parent</para>
 	    </listitem>
 	    <listitem>
 	      <para><literal>CLONE_CHILD_CLEARTID</literal> - clear
 		TID in the child</para>
 	    </listitem>
 	    <listitem>
 	      <para><literal>CLONE_CHILD_SETTID</literal> - set TID in
 		the child</para>
 	    </listitem>
 	  </itemizedlist>
 
 	  <para><literal>CLONE_PARENT</literal> sets the real parent
 	    to the parent of the caller.  This is useful for threads
 	    because if thread A creates thread B we want thread B to
 	    be parented to the parent of the whole thread group.
 	    <literal>CLONE_THREAD</literal> does exactly the same
 	    thing as <literal>CLONE_PARENT</literal>,
 	    <literal>CLONE_VM</literal> and
 	    <literal>CLONE_SIGHAND</literal>, rewrites PID to be the
 	    same as PID of the caller, sets exit signal to be none and
 	    enters the thread group.  <literal>CLONE_SETTLS</literal>
 	    sets up GDT entries for TLS handling.  The
 	    <literal>CLONE_*_*TID</literal> set of flags stores the
 	    TID at a user-supplied address or clears that address to
 	    0.</para>
 
 	  <para>As you can see, <literal>CLONE_THREAD</literal>
 	    does most of the work and does not seem to fit the scheme
 	    very well.  The original intention is unclear (even to the
 	    authors, according to comments in the code) but I think
 	    originally there was one threading flag, which was then
 	    parcelled out among many other flags, but this separation
 	    was never fully finished.  It is also unclear what this
 	    partition is good for, as glibc does not use it, so only
 	    hand-written use of clone permits a programmer to
 	    access these features.</para>
 
 	  <para>For non-threaded programs the PID and TID are the
 	    same.  For threaded programs the PID and TID of the first
 	    thread are the same, and every created thread shares the
 	    same PID and gets assigned a unique TID (because
 	    <literal>CLONE_THREAD</literal> is passed in).  The parent
 	    is also shared by all processes forming this threaded
 	    program.</para>
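 
 	  <para>The PID and TID assignment described above can be
 	    modelled in a few lines of C.  This is only a sketch, not
 	    &linux; kernel code: <literal>struct task</literal>,
 	    <function>task_fork</function> and
 	    <function>task_new_thread</function> are hypothetical
 	    names, and fresh identifiers are passed in by the caller
 	    to keep the example deterministic.</para>
 
```c
/* Hypothetical model of the identifiers attached to one schedulable task. */
struct task {
	int pid;	/* shared by every thread of a program */
	int tid;	/* unique for every task */
	int ppid;	/* parent, shared by the whole thread group */
};

/* A fork-style clone: the child starts a new group, so PID == TID. */
struct task
task_fork(const struct task *parent, int fresh_id)
{
	struct task child = { fresh_id, fresh_id, parent->pid };

	return (child);
}

/* A CLONE_THREAD-style clone: PID and parent are inherited, TID is new. */
struct task
task_new_thread(const struct task *creator, int fresh_id)
{
	struct task thread = { creator->pid, fresh_id, creator->ppid };

	return (thread);
}
```
 
 	  <para>Note that a thread created by another thread still
 	    ends up with the PID and parent of the first thread, which
 	    is the effect attributed to
 	    <literal>CLONE_THREAD</literal> above.</para>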
 
 	  <para>The code that implements &man.pthread.create.3; in
 	    NPTL defines the clone flags like this:</para>
 
 	  <programlisting>int clone_flags = (CLONE_VM | CLONE_FS | CLONE_FILES | CLONE_SIGNAL
                    | CLONE_SETTLS | CLONE_PARENT_SETTID
                    | CLONE_CHILD_CLEARTID | CLONE_SYSVSEM
 #if __ASSUME_NO_CLONE_DETACHED == 0
                    | CLONE_DETACHED
 #endif
                    | 0);</programlisting>
 
 	  <para><literal>CLONE_SIGNAL</literal> is defined
 	    as:</para>
 
 	  <programlisting>#define CLONE_SIGNAL (CLONE_SIGHAND | CLONE_THREAD)</programlisting>
 
 	  <para>The final 0 in the flag expression means that no
 	    signal is sent when any of the threads exits.</para>
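 
 	  <para>To see what this flag set amounts to, the following
 	    sketch ORs the same flags together.  The numeric values
 	    are those of the &linux; clone flags; the
 	    <literal>LINUX_</literal> prefix and the two helper
 	    functions are assumptions made for this example, and
 	    <literal>CLONE_DETACHED</literal> is left out.</para>
 
```c
/* Numeric values of the Linux clone flags used by NPTL. */
#define LINUX_CLONE_VM			0x00000100
#define LINUX_CLONE_FS			0x00000200
#define LINUX_CLONE_FILES		0x00000400
#define LINUX_CLONE_SIGHAND		0x00000800
#define LINUX_CLONE_THREAD		0x00010000
#define LINUX_CLONE_SYSVSEM		0x00040000
#define LINUX_CLONE_SETTLS		0x00080000
#define LINUX_CLONE_PARENT_SETTID	0x00100000
#define LINUX_CLONE_CHILD_CLEARTID	0x00200000
#define LINUX_CLONE_SIGNAL	(LINUX_CLONE_SIGHAND | LINUX_CLONE_THREAD)

/* The low byte of the flags word carries the exit signal. */
#define LINUX_CSIGNAL			0x000000ff

/* Hypothetical helper: the flag set passed for a new thread. */
unsigned long
nptl_thread_flags(void)
{
	return (LINUX_CLONE_VM | LINUX_CLONE_FS | LINUX_CLONE_FILES |
	    LINUX_CLONE_SIGNAL | LINUX_CLONE_SETTLS |
	    LINUX_CLONE_PARENT_SETTID | LINUX_CLONE_CHILD_CLEARTID |
	    LINUX_CLONE_SYSVSEM | 0);
}

/* Hypothetical helper: extract the exit signal from a flags word. */
unsigned long
clone_exit_signal(unsigned long flags)
{
	return (flags & LINUX_CSIGNAL);
}
```
 
 	  <para>With these values the combined word is
 	    <literal>0x3d0f00</literal>, and its exit signal part is
 	    0.</para>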
 	</sect4>
       </sect3>
     </sect2>
 
     <sect2 xml:id="what-is-emu">
       <title>What is emulation</title>
 
       <para>According to a dictionary definition, emulation is the
 	ability of a program or device to imitate another program or
 	device.  This is achieved by providing the same reaction to a
 	given stimulus as the emulated object.  In practice, the
 	software world mostly sees three types of emulation: a
 	program used to emulate a machine (QEMU, various game console
 	emulators etc.), software emulation of a hardware facility
 	(OpenGL emulators, floating point unit emulation etc.) and
 	operating system emulation (either in the kernel of the
 	operating system or as a userspace program).</para>
 
       <para>Emulation is usually used where using the original
 	component is not feasible or not possible at all.  For
 	example, someone might want to use a program developed for a
 	different operating system than the one they use.  Then
 	emulation comes in handy.  Sometimes there is no other way
 	but to use emulation, e.g. when the hardware device you are
 	trying to use does not exist (yet or anymore).  This happens
 	often when porting an operating system to a new
 	(non-existent) platform.  Sometimes it is just cheaper to
 	emulate.</para>
 
       <para>From an implementation point of view, there are two
 	main approaches to emulation.  You can emulate the whole
 	thing: accepting the possible inputs of the original object,
 	maintaining inner state and emitting the correct output based
 	on the state and/or input.  This kind of emulation does not
 	require any special conditions and can basically be
 	implemented anywhere for any device or program.  The drawback
 	is that implementing such emulation is quite difficult,
 	time-consuming and error-prone.  In some cases we can use a
 	simpler approach.  Imagine you want to emulate a printer that
 	prints from left to right on a printer that prints from right
 	to left.  Obviously there is no need for a complex emulation
 	layer; simply reversing the printed text is sufficient.
 	Sometimes the emulating environment is very similar to the
 	emulated one, so just a thin translation layer is necessary
 	to provide fully working emulation.  This is much less
 	demanding to implement, and so less time-consuming and
 	error-prone than the previous approach, but the necessary
 	condition is that the two environments are similar enough.
 	A third approach combines the previous two.  Most of the time
 	the objects do not provide the same capabilities, so when
 	emulating the more powerful one on the less powerful one we
 	have to emulate the missing features with the full emulation
 	described above.</para>
 
       <para>This master's thesis deals with emulation of &unix; on
 	&unix;, which is exactly the case where a thin layer of
 	translation is sufficient to provide full emulation.  The
 	&unix; API consists of a set of syscalls, which are usually
 	self-contained and do not affect any global kernel
 	state.</para>
 
       <para>There are a few syscalls that affect inner state but this
 	can be dealt with by providing some structures that maintain
 	the extra state.</para>
 
       <para>No emulation is perfect and emulations tend to lack some
 	parts, but this usually does not cause any serious drawbacks.
 	Imagine a game console emulator that emulates everything but
 	music output.  No doubt the games are playable and one
 	can use the emulator.  It might not be as comfortable as the
 	original game console but it is an acceptable compromise
 	between price and comfort.</para>
 
       <para>The same goes for the &unix; API.  Most programs can live
 	with a very limited set of working syscalls.  Those syscalls
 	tend to be the oldest ones (&man.read.2;/&man.write.2;, the
 	&man.fork.2; family, &man.signal.3; handling, &man.exit.3;,
 	the &man.socket.2; API), hence they are easy to emulate
 	because their semantics are shared among all &unix;es that
 	exist today.</para>
     </sect2>
   </sect1>
 
   <sect1 xml:id="freebsd-emulation">
     <title>Emulation</title>
 
     <sect2>
       <title>How emulation works in &os;</title>
 
       <para>As stated earlier, &os; supports running binaries from
 	several other &unix;es.  This works because &os; has an
 	abstraction called the execution class loader.  This wedges
 	into the &man.execve.2; syscall, so when &man.execve.2; is
 	about to execute a binary it examines its type.</para>
 
       <para>There are basically two types of binaries in &os;:
 	shell-like text scripts, which are identified by
 	<literal>#!</literal> as their first two characters, and normal
 	(typically <firstterm>ELF</firstterm>) binaries, which are a
 	representation of a compiled executable object.  The vast
 	majority (one could say all of them) of binaries in &os; are
 	of type ELF.  ELF files contain a header, which specifies
 	the OS ABI for this ELF file.  By reading this information,
 	the operating system can accurately determine what type of
 	binary the given file is.</para>
 
       <para>Every OS ABI must be registered in the &os; kernel.  This
 	applies to the &os; native OS ABI, as well.  So when
 	&man.execve.2; executes a binary it iterates through the list
 	of registered ABIs and when it finds the right one it starts
 	to use the information contained in the OS ABI description
 	(its syscall table, <literal>errno</literal> translation
 	table, etc.).  So every time the process calls a syscall, it
 	uses its own set of syscalls instead of some global one.  This
 	provides a very elegant and easy way of supporting
 	execution of various binary formats.</para>
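 
       <para>The lookup described above can be sketched as follows.
 	This is a much simplified userspace illustration, not the
 	kernel's image activator: <literal>struct abi_desc</literal>,
 	<literal>registered_abis</literal> and
 	<function>abi_lookup</function> are hypothetical names, and
 	the only thing inspected is the OS ABI byte of the ELF
 	identification array.</para>
 
```c
#include <stddef.h>

#define EI_OSABI		7	/* index of the OS ABI byte in e_ident */
#define ELFOSABI_LINUX		3
#define ELFOSABI_FREEBSD	9

/* Hypothetical, much simplified stand-in for an OS ABI description. */
struct abi_desc {
	unsigned char	 osabi;		/* value expected in e_ident[EI_OSABI] */
	const char	*name;
};

/* Every ABI, including the native one, is registered in the same list. */
static const struct abi_desc registered_abis[] = {
	{ ELFOSABI_FREEBSD, "FreeBSD ELF" },
	{ ELFOSABI_LINUX, "Linux ELF" },
};

/* Walk the list and return the matching description, or NULL. */
const struct abi_desc *
abi_lookup(const unsigned char *e_ident)
{
	size_t i;

	for (i = 0; i < sizeof(registered_abis) / sizeof(registered_abis[0]);
	    i++)
		if (registered_abis[i].osabi == e_ident[EI_OSABI])
			return (&registered_abis[i]);
	return (NULL);
}
```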
 
       <para>The nature of emulation of different OSes (and also some
 	other subsystems) led developers to invent an event handler
 	mechanism.  There are various places in the kernel where a
 	list of event handlers is called.  Every subsystem can
 	register an event handler and the handlers are called
 	accordingly.  For example, when a process exits, a handler is
 	called that cleans up whatever the subsystem needs to have
 	cleaned up.</para>
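 
       <para>A minimal userspace sketch of such a mechanism is shown
 	here.  It is not the kernel's implementation; all names
 	(<function>exit_handler_register</function>,
 	<function>process_exit_event</function>,
 	<function>emul_exit_cleanup</function>) are made up for the
 	illustration, and the fixed-size array stands in for whatever
 	list the kernel really keeps.</para>
 
```c
#include <stddef.h>

#define MAX_HANDLERS	8

typedef void (*exit_handler_t)(int pid, void *arg);

struct handler_entry {
	exit_handler_t	 fn;
	void		*arg;
};

static struct handler_entry exit_handlers[MAX_HANDLERS];
static size_t n_exit_handlers;

/* A subsystem registers its handler once, typically at load time. */
int
exit_handler_register(exit_handler_t fn, void *arg)
{
	if (n_exit_handlers == MAX_HANDLERS)
		return (-1);
	exit_handlers[n_exit_handlers].fn = fn;
	exit_handlers[n_exit_handlers].arg = arg;
	n_exit_handlers++;
	return (0);
}

/* Called at the "process exits" point: run every registered handler. */
void
process_exit_event(int pid)
{
	size_t i;

	for (i = 0; i < n_exit_handlers; i++)
		exit_handlers[i].fn(pid, exit_handlers[i].arg);
}

/* Example subscriber: an emulation layer noting which process to clean. */
void
emul_exit_cleanup(int pid, void *arg)
{
	int *last_cleaned = arg;

	*last_cleaned = pid;
}
```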
 
       <para>These simple facilities provide basically everything
 	needed for the emulation infrastructure; in fact, they are
 	the only things necessary to implement the &linux; emulation
 	layer.</para>
     </sect2>
 
     <sect2 xml:id="freebsd-common-primitives">
       <title>Common primitives in the &os; kernel</title>
 
       <para>Emulation layers need some support from the operating
 	system.  I am going to describe some of the supported
 	primitives in the &os; operating system.</para>
 
       <sect3 xml:id="freebsd-locking-primitives">
 	<title>Locking primitives</title>
 
 	<para>Contributed by: &a.attilio.email;</para>
 
 	<para>The &os; synchronization primitive set is based on the
 	  idea of supplying a rather large number of different
 	  primitives, so that the most suitable one can be used in
 	  each particular situation.</para>
 
 	<para>From a high-level point of view, you can consider three
 	  kinds of synchronization primitives in the &os;
 	  kernel:</para>
 
 	<itemizedlist>
 	  <listitem>
 	    <para>atomic operations and memory barriers</para>
 	  </listitem>
 	  <listitem>
 	    <para>locks</para>
 	  </listitem>
 	  <listitem>
 	    <para>scheduling barriers</para>
 	  </listitem>
 	</itemizedlist>
 
 	<para>Descriptions of the three families follow.  For
 	  every lock, you should really check the linked manpage
 	  (where possible) for more detailed explanations.</para>
 
 	<sect4 xml:id="freebsd-atomic-op">
 	  <title>Atomic operations and memory barriers</title>
 
 	  <para>Atomic operations are implemented through a set of
 	    functions performing simple arithmetic on memory operands
 	    in an atomic way with respect to external events
 	    (interrupts, preemption, etc.).  Atomic operations can
 	    guarantee atomicity only on small data types (on the
 	    order of magnitude of the architecture's
 	    <literal>long</literal> C data type), so they should
 	    rarely be used directly in end-level code, except for very
 	    simple operations (like setting a flag in a bitmap, for
 	    example).  In fact, it is rather easy and common to
 	    write incorrect semantics based on atomic operations
 	    alone (usually referred to as lock-less).  The &os;
 	    kernel offers a way to perform atomic operations in
 	    conjunction with a memory barrier.  The memory barriers
 	    guarantee that an atomic operation happens in some
 	    specified order with respect to other memory accesses.
 	    For example, if we need an atomic operation to happen
 	    only after all other pending writes (in terms of
 	    instructions reordering buffer activity) are completed,
 	    we need to explicitly use a memory barrier in conjunction
 	    with this atomic operation.  So it is easy to understand
 	    why memory barriers play a key role in building
 	    higher-level locks (such as refcounts, mutexes, etc.).
 	    For a detailed explanation of atomic operations, please
 	    refer to &man.atomic.9;.  It is worth noting, however,
 	    that atomic operations (and memory barriers as well)
 	    should ideally only be used for building front-ending
 	    locks (such as mutexes).</para>
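 
 	  <para>The two ideas, an atomic flag update and an atomic
 	    operation combined with a barrier, can be illustrated in
 	    userspace with C11 <literal>stdatomic.h</literal>.  This
 	    is an assumption made for the sake of a runnable example;
 	    kernel code would use the &man.atomic.9; functions
 	    instead, and <function>flag_set</function>,
 	    <function>publish</function> and
 	    <function>consume</function> are hypothetical
 	    names.</para>
 
```c
#include <stdatomic.h>

static int payload;		/* ordinary data, written before 'ready' */
static atomic_int ready;	/* flag guarding the payload */

/* Atomically set one bit in a bitmap; returns the previous bitmap. */
unsigned int
flag_set(atomic_uint *map, unsigned int bit)
{
	return (atomic_fetch_or_explicit(map, 1u << bit,
	    memory_order_relaxed));
}

/*
 * Store with release semantics: the write to 'payload' is guaranteed
 * to be visible before the store to 'ready' can be observed.
 */
void
publish(int value)
{
	payload = value;
	atomic_store_explicit(&ready, 1, memory_order_release);
}

/* Load with acquire semantics; returns 1 and the payload once published. */
int
consume(int *out)
{
	if (atomic_load_explicit(&ready, memory_order_acquire) == 0)
		return (0);
	*out = payload;
	return (1);
}
```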
 	</sect4>
 
 	<sect4 xml:id="freebsd-refcounts">
 	  <title>Refcounts</title>
 
 	  <para>Refcounts are interfaces for handling reference
 	    counters.  They are implemented through atomic operations
 	    and are intended to be used only in cases where the
 	    reference counter is the only thing to be protected, so
 	    that even something like a spin mutex is discouraged.
 	    Using the refcount interface for structures where a mutex
 	    is already used is often wrong, since the reference
 	    counter should probably be handled in the already
 	    protected paths.  A manpage discussing refcount does not
 	    currently exist; check
 	    <filename>sys/refcount.h</filename> for an overview of the
 	    existing API.</para>
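 
 	  <para>The following userspace sketch shows the pattern with
 	    C11 atomics.  It is not the code from
 	    <filename>sys/refcount.h</filename>;
 	    <literal>struct object</literal> and the
 	    <function>obj_*</function> functions are hypothetical and
 	    only mirror the usual init/acquire/release shape of such
 	    an interface.</para>
 
```c
#include <stdatomic.h>

/* Hypothetical object whose only shared state is its reference count. */
struct object {
	atomic_uint	refs;
	int		destroyed;
};

/* The creator holds the first reference. */
void
obj_init(struct object *o)
{
	atomic_init(&o->refs, 1);
	o->destroyed = 0;
}

/* Take an additional reference. */
void
obj_acquire(struct object *o)
{
	atomic_fetch_add_explicit(&o->refs, 1, memory_order_relaxed);
}

/* Drop a reference; returns 1 when the last one went away. */
int
obj_release(struct object *o)
{
	if (atomic_fetch_sub_explicit(&o->refs, 1,
	    memory_order_acq_rel) == 1) {
		o->destroyed = 1;	/* stand-in for freeing the object */
		return (1);
	}
	return (0);
}
```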
 	</sect4>
 
 	<sect4 xml:id="freebsd-locks">
 	  <title>Locks</title>
 
 	  <para>The &os; kernel has many classes of locks.  Every lock
 	    is defined by some particular properties, but probably the
 	    most important is the event linked to contending holders
 	    (or in other terms, the behavior of threads unable to
 	    acquire the lock).  &os;'s locking scheme presents three
 	    different behaviors for contenders:</para>
 
 	  <orderedlist>
 	    <listitem>
 	      <para>spinning</para>
 	    </listitem>
 	    <listitem>
 	      <para>blocking</para>
 	    </listitem>
 	    <listitem>
 	      <para>sleeping</para>
 	    </listitem>
 	  </orderedlist>
 
 	  <note>
 	    <para>The numbers are not arbitrary; they are the lock
 	      levels that the ordering rule described later refers
 	      to.</para>
 	  </note>
 	</sect4>
 
 	<sect4 xml:id="freebsd-spinlocks">
 	  <title>Spinning locks</title>
 
 	  <para>Spin locks let waiters spin until they can
 	    acquire the lock.  An important matter to deal with is
 	    that a thread contending on a spin lock is not
 	    descheduled.  Since the &os; kernel is preemptive, this
 	    exposes spin locks to the risk of deadlocks, which can be
 	    avoided only by disabling interrupts while they are held.
 	    For this and other reasons (like the lack of priority
 	    propagation support, poor load balancing between
 	    CPUs, etc.), spin locks are intended to protect
 	    very small paths of code, or ideally not to be used at all
 	    unless explicitly required (explained later).</para>
 	</sect4>
 
 	<sect4 xml:id="freebsd-blocking">
 	  <title>Blocking</title>
 
 	  <para>Block locks let waiters be descheduled and blocked
 	    until the lock owner drops the lock and wakes up one or
 	    more contenders.  To avoid starvation issues,
 	    blocking locks do priority propagation from the waiters to
 	    the owner.  Block locks must be implemented through the
 	    turnstile interface and are intended to be the most used
 	    kind of lock in the kernel, unless particular conditions
 	    call for something else.</para>
 	</sect4>
 
 	<sect4 xml:id="freebsd-sleeping">
 	  <title>Sleeping</title>
 
 	  <para>Sleep locks let waiters be descheduled and fall
 	    asleep until the lock holder drops the lock and wakes up
 	    one or more waiters.  Since sleep locks are intended to
 	    protect large paths of code and to cater for asynchronous
 	    events, they do not do any form of priority propagation.
 	    They must be implemented through the &man.sleepqueue.9;
 	    interface.</para>
 
 	  <para>The order used to acquire locks is very important, not
 	    only because of the possibility of deadlock due to lock
 	    order reversals, but also because lock acquisition should
 	    follow specific rules linked to the nature of the locks.
 	    Looking at the numbered list above, the practical rule is
 	    that if a thread holds a lock of level n (where the level
 	    is the number listed next to the kind of lock) it is not
 	    allowed to acquire a lock of a higher level, since this
 	    would break the specified semantics for a path.  For
 	    example, if a thread holds a block lock (level 2), it is
 	    allowed to acquire a spin lock (level 1) but not a sleep
 	    lock (level 3), since block locks are intended to protect
 	    smaller paths than sleep locks (these rules do not apply
 	    to atomic operations or scheduling barriers,
 	    however).</para>
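 
 	  <para>The level rule can be written down as a one-line
 	    check.  The enum and the function name are hypothetical;
 	    the kernel does not expose such a helper, and the sketch
 	    only encodes the rule as stated in the text (a held lock
 	    of level n permits acquiring locks of level n or
 	    lower).</para>
 
```c
/* Levels follow the numbering of the contender behaviors. */
enum lock_level {
	LVL_SPIN = 1,
	LVL_BLOCK = 2,
	LVL_SLEEP = 3
};

/* Returns 1 when acquiring 'wanted' while holding 'held' obeys the rule. */
int
lock_order_ok(enum lock_level held, enum lock_level wanted)
{
	return (wanted <= held);
}
```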
 
 	  <para>This is a list of locks with their respective
 	    behaviors:</para>
 
 	  <itemizedlist>
 	    <listitem>
 	      <para>spin mutex - spinning - &man.mutex.9;</para>
 	    </listitem>
 	    <listitem>
 	      <para>sleep mutex - blocking - &man.mutex.9;</para>
 	    </listitem>
 	    <listitem>
 	      <para>pool mutex - blocking - &man.mtx.pool.9;</para>
 	    </listitem>
 	    <listitem>
 	      <para>sleep family - sleeping - &man.sleep.9; (pause,
 		tsleep, msleep, msleep_spin, msleep_rw, msleep_sx)</para>
 	    </listitem>
 	    <listitem>
 	      <para>condvar - sleeping - &man.condvar.9;</para>
 	    </listitem>
 	    <listitem>
 	      <para>rwlock - blocking - &man.rwlock.9;</para>
 	    </listitem>
 	    <listitem>
 	      <para>sxlock - sleeping - &man.sx.9;</para>
 	    </listitem>
 	    <listitem>
 	      <para>lockmgr - sleeping - &man.lockmgr.9;</para>
 	    </listitem>
 	    <listitem>
 	      <para>semaphores - sleeping - &man.sema.9;</para>
 	    </listitem>
 	  </itemizedlist>
 
 	  <para>Among these locks only mutexes, sxlocks, rwlocks and
 	    lockmgrs are intended to handle recursion, but currently
 	    recursion is only supported by mutexes and
 	    lockmgrs.</para>
 	</sect4>
 
 	<sect4 xml:id="freebsd-scheduling">
 	  <title>Scheduling barriers</title>
 
 	  <para>Scheduling barriers are intended to be used to drive
 	    the scheduling of threads.  They consist mainly of
 	    three different stubs:</para>
 
 	  <itemizedlist>
 	    <listitem>
 	      <para>critical sections (and preemption)</para>
 	    </listitem>
 	    <listitem>
 	      <para>sched_bind</para>
 	    </listitem>
 	    <listitem>
 	      <para>sched_pin</para>
 	    </listitem>
 	  </itemizedlist>
 
 	  <para>Generally, these should be used only in a particular
 	    context, and even if they can often replace locks, they
 	    should be avoided because they do not allow even simple
 	    problems to be diagnosed with locking debugging tools
 	    (such as &man.witness.4;).</para>
 	</sect4>
 
 	<sect4 xml:id="freebsd-critical">
 	  <title>Critical sections</title>
 
 	  <para>The &os; kernel has been made preemptive basically to
 	    deal with interrupt threads.  In fact, to avoid
 	    high interrupt latency, time-sharing priority threads can
 	    be preempted by interrupt threads (this way, they do
 	    not need to wait to be scheduled as the normal path
 	    would require).  Preemption, however, introduces new race
 	    points that need to be handled as well.  Often, the
 	    simplest way to deal with preemption is to disable it
 	    completely.  A critical section defines a piece
 	    of code (delimited by the pair of functions
 	    &man.critical.enter.9; and &man.critical.exit.9;) where
 	    preemption is guaranteed not to happen (until the
 	    protected code is fully executed).  This can often replace
 	    a lock effectively but should be used carefully in order
 	    not to lose the whole advantage that preemption
 	    brings.</para>
 	</sect4>
 
 	<sect4 xml:id="freebsd-schedpin">
 	  <title>sched_pin/sched_unpin</title>
 
 	  <para>Another way to deal with preemption is the
 	    <function>sched_pin()</function> interface.  If a piece of
 	    code is enclosed in the <function>sched_pin()</function>
 	    and <function>sched_unpin()</function> pair of functions,
 	    it is guaranteed that the respective thread, even if it
 	    can be preempted, will always be executed on the same
 	    CPU.  Pinning is very effective in the particular case
 	    when we have to access per-CPU data and we assume
 	    other threads will not change those data.  Under that
 	    assumption, a critical section would be too strong a
 	    condition for our code.</para>
 	</sect4>
 
 	<sect4 xml:id="freebsd-schedbind">
 	  <title>sched_bind/sched_unbind</title>
 
 	  <para><function>sched_bind</function> is an API used to
 	    bind a thread to a particular CPU for as long as it
 	    executes the code, until a
 	    <function>sched_unbind</function> function call
 	    unbinds it.  This feature has a key role in situations
 	    where you cannot trust the current state of the CPUs (for
 	    example, at very early stages of boot), as you want to
 	    prevent your thread from migrating to inactive CPUs.  Since
 	    <function>sched_bind</function> and
 	    <function>sched_unbind</function> manipulate internal
 	    scheduler structures, they need to be enclosed in
 	    <function>sched_lock</function> acquisition/release when
 	    used.</para>
 	</sect4>
       </sect3>
 
       <sect3 xml:id="freebsd-proc">
 	<title>Proc structure</title>
 
 	<para>Emulation layers sometimes require some
 	  additional per-process data.  They can manage separate
 	  structures (a list, a tree etc.) containing these data for
 	  every process but this tends to be slow and memory
 	  consuming.  To solve this problem the &os;
 	  <literal>proc</literal> structure contains
 	  <literal>p_emuldata</literal>, which is a void pointer to
 	  some emulation layer specific data.  This
 	  <literal>proc</literal> entry is protected by the proc
 	  mutex.</para>
 
 	<para>The &os; <literal>proc</literal> structure contains a
 	  <literal>p_sysent</literal> entry that identifies which ABI
 	  this process is running.  In fact, it is a pointer to the
 	  <literal>sysentvec</literal> described above.  So by
 	  comparing this pointer to the address where the
 	  <literal>sysentvec</literal> structure for the given ABI is
 	  stored we can effectively determine whether the process
 	  belongs to our emulation layer.  The code typically looks
 	  like:</para>
 
 	<programlisting>if (__predict_true(p-&gt;p_sysent != &amp;elf_linux_sysvec))
 	  return;</programlisting>
 
 	<para>As you can see, we effectively use the
 	  <literal>__predict_true</literal> modifier to collapse the
 	  most common case (&os; process) to a simple return operation
 	  thus preserving high performance.  This code should be
 	  turned into a macro because currently it is not very
 	  flexible, i.e. we do not support &linux;64 emulation nor
 	  A.OUT &linux; processes on i386.</para>
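 
 	<para>One possible shape for such a macro is sketched here
 	  with mock structures so that it can be compiled in
 	  userspace.  <literal>IS_LINUX_PROC</literal>,
 	  <function>emul_hook</function> and the
 	  <literal>linux_aout_sysvec</literal> object are assumptions
 	  made for the example, not existing kernel names, and the
 	  structures carry only the fields the check needs.</para>
 
```c
#include <stddef.h>

/* Mock versions of the kernel structures, reduced to what is needed. */
struct sysentvec {
	const char	*sv_name;
};

struct proc {
	struct sysentvec	*p_sysent;
	void			*p_emuldata;
};

struct sysentvec elf_freebsd_sysvec = { "FreeBSD ELF" };
struct sysentvec elf_linux_sysvec = { "Linux ELF" };
struct sysentvec linux_aout_sysvec = { "Linux a.out" };	/* hypothetical */

#ifndef __predict_true
#define __predict_true(exp)	__builtin_expect((exp), 1)
#endif

/* Hypothetical macro covering every Linux flavour in one place. */
#define IS_LINUX_PROC(p)						\
	((p)->p_sysent == &elf_linux_sysvec ||				\
	    (p)->p_sysent == &linux_aout_sysvec)

/* Returns 0 for foreign processes (the common case), 1 for our own. */
int
emul_hook(struct proc *p)
{
	if (__predict_true(!IS_LINUX_PROC(p)))
		return (0);
	/* Emulation specific work would go here. */
	return (1);
}
```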
       </sect3>
 
       <sect3 xml:id="freebsd-vfs">
 	<title>VFS</title>
 
 	<para>The &os; VFS subsystem is very complex but the &linux;
 	  emulation layer uses just a small subset via a well-defined
 	  API.  It can operate either on vnodes or on file handlers.
 	  A vnode is a virtual node, i.e. a representation of a
 	  node in VFS.  The other representation is a file handler,
 	  which represents an opened file from the perspective of a
 	  process.  A file handler can represent a socket or an
 	  ordinary file.  A file handler contains a pointer to its
 	  vnode.  More than one file handler can point to the same
 	  vnode.</para>
 
 	<sect4 xml:id="freebsd-namei">
 	  <title>namei</title>
 
 	  <para>The &man.namei.9; routine is the central entry point
 	    to pathname lookup and translation.  It traverses the path
 	    component by component from the starting point to the end
 	    point using the lookup function, which is internal to VFS.
 	    The &man.namei.9; routine can cope with symlinks and with
 	    absolute and relative paths.  When a path is looked up
 	    using &man.namei.9; it is entered into the name cache.
 	    This behavior can be suppressed.  This routine is used all
 	    over the kernel and its performance is critical.</para>
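 
 	  <para>The component-wise traversal can be pictured with a
 	    small userspace helper.  This is not how
 	    &man.namei.9; is implemented;
 	    <function>next_component</function> is a hypothetical
 	    function that only splits a path string, leaving out
 	    vnodes, symlinks and the name cache entirely.</para>
 
```c
#include <stddef.h>
#include <string.h>

/*
 * Copy the next path component into buf and advance *pathp past it.
 * Returns 1 on success, 0 at the end of the path and -1 when the
 * component does not fit into buf.
 */
int
next_component(const char **pathp, char *buf, size_t buflen)
{
	const char *p = *pathp;
	size_t len;

	while (*p == '/')		/* skip separators */
		p++;
	if (*p == '\0')
		return (0);
	len = strcspn(p, "/");		/* length up to the next separator */
	if (len >= buflen)
		return (-1);
	memcpy(buf, p, len);
	buf[len] = '\0';
	*pathp = p + len;
	return (1);
}
```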
 	</sect4>
 
 	<sect4 xml:id="freebsd-vn">
 	  <title>vn_fullpath</title>
 
 	  <para>The &man.vn.fullpath.9; function makes a best effort
 	    to traverse the VFS name cache and return a path for a
 	    given (locked) vnode.  This process is unreliable but works
 	    just fine for the most common cases.  It is unreliable
 	    because it relies on the VFS cache (it does not traverse
 	    the on-medium structures), it does not work with
 	    hardlinks, etc.  This routine is used in several places in
 	    the Linuxulator.</para>
 	</sect4>
 
 	<sect4 xml:id="freebsd-vnode">
 	  <title>Vnode operations</title>
 
 	  <itemizedlist>
 	    <listitem>
 	      <para><function>fgetvp</function> - given a thread and a
 		file descriptor number it returns the associated
 		vnode</para>
 	    </listitem>
 	    <listitem>
 	      <para>&man.vn.lock.9; - locks a vnode</para>
 	    </listitem>
 	    <listitem>
 	      <para><function>vn_unlock</function> - unlocks a
 		vnode</para>
 	    </listitem>
 	    <listitem>
 	      <para>&man.VOP.READDIR.9; - reads a directory referenced
 		by a vnode</para>
 	    </listitem>
 	    <listitem>
 	      <para>&man.VOP.GETATTR.9; - gets attributes of a file or
 		a directory referenced by a vnode</para>
 	    </listitem>
 	    <listitem>
 	      <para>&man.VOP.LOOKUP.9; - looks up a path to a given
 		directory</para>
 	    </listitem>
 	    <listitem>
 	      <para>&man.VOP.OPEN.9; - opens a file referenced by a
 		vnode</para>
 	    </listitem>
 	    <listitem>
 	      <para>&man.VOP.CLOSE.9; - closes a file referenced by a
 		vnode</para>
 	    </listitem>
 	    <listitem>
 	      <para>&man.vput.9; - decrements the use count for a
 		vnode and unlocks it</para>
 	    </listitem>
 	    <listitem>
 	      <para>&man.vrele.9; - decrements the use count for a
 		vnode</para>
 	    </listitem>
 	    <listitem>
 	      <para>&man.vref.9; - increments the use count for a
 		vnode</para>
 	    </listitem>
 	  </itemizedlist>
 	</sect4>
 
 	<sect4 xml:id="freebsd-file-handler">
 	  <title>File handler operations</title>
 
 	  <itemizedlist>
 	    <listitem>
 	      <para><function>fget</function> - given a thread and a
 		file descriptor number it returns the associated file
 		handler and references it</para>
 	    </listitem>
 	    <listitem>
 	      <para><function>fdrop</function> - drops a reference to
 		a file handler</para>
 	    </listitem>
 	    <listitem>
 	      <para><function>fhold</function> - references a file
 		handler</para>
 	    </listitem>
 	  </itemizedlist>
 	</sect4>
       </sect3>
     </sect2>
   </sect1>
 
   <sect1 xml:id="md">
     <title>&linux; emulation layer - MD part</title>
 
     <para>This section deals with the implementation of the &linux;
       emulation layer in the &os; operating system.  It first
       describes the machine dependent part, covering how and where
       the interaction between userland and kernel is implemented.
       It talks about syscalls, signals, ptrace, traps and stack
       fixup.  This part discusses i386 but it is written generally,
       so other architectures should not differ very much.  The next
       part is the machine independent part of the Linuxulator.
       This section only covers i386 and ELF handling.  A.OUT is
       obsolete and untested.</para>
 
     <sect2 xml:id="syscall-handling">
       <title>Syscall handling</title>
 
       <para>Syscall handling is mostly written in
 	<filename>linux_sysvec.c</filename>, which covers most of the
 	routines pointed to by the <literal>sysentvec</literal>
 	structure.  When a &linux; process running on &os; issues a
 	syscall, the general syscall routine calls the &linux;
 	prepsyscall routine for the &linux; ABI.</para>
 
       <sect3 xml:id="linux-prepsyscall">
 	<title>&linux; prepsyscall</title>
 
 	<para>&linux; passes arguments to syscalls via registers (that
 	  is why it is limited to 6 parameters on i386) while &os;
 	  uses the stack.  The &linux; prepsyscall routine must copy
 	  parameters from registers to the stack.  The order of the
 	  registers is: <varname>%ebx</varname>,
 	  <varname>%ecx</varname>, <varname>%edx</varname>,
 	  <varname>%esi</varname>, <varname>%edi</varname>,
 	  <varname>%ebp</varname>.  The catch is that this is true for
 	  only <emphasis>most</emphasis> of the syscalls.  Some (most
 	  notably <function>clone</function>) use a different order,
 	  but luckily this is easy to fix by inserting a dummy
 	  parameter in the <function>linux_clone</function>
 	  prototype.</para>
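 
 	<para>The register-to-argument copy can be sketched like this.
 	  The structure is a cut-down, hypothetical stand-in for the
 	  i386 trap frame and <function>prepsyscall_sketch</function>
 	  is not the kernel routine; the only fact added beyond the
 	  text is that &linux; on i386 passes the syscall number in
 	  <varname>%eax</varname>.</para>
 
```c
#define LINUX_MAXARGS	6

/* Hypothetical subset of the saved user registers. */
struct trapframe_sketch {
	unsigned int tf_eax;	/* syscall number */
	unsigned int tf_ebx;
	unsigned int tf_ecx;
	unsigned int tf_edx;
	unsigned int tf_esi;
	unsigned int tf_edi;
	unsigned int tf_ebp;
};

/*
 * Copy the register arguments, in the Linux order, into the argument
 * array the generic syscall code expects; returns the syscall number.
 */
unsigned int
prepsyscall_sketch(const struct trapframe_sketch *tf,
    unsigned int args[LINUX_MAXARGS])
{
	args[0] = tf->tf_ebx;
	args[1] = tf->tf_ecx;
	args[2] = tf->tf_edx;
	args[3] = tf->tf_esi;
	args[4] = tf->tf_edi;
	args[5] = tf->tf_ebp;
	return (tf->tf_eax);
}
```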
       </sect3>
 
       <sect3 xml:id="syscall-writing">
 	<title>Syscall writing</title>
 
 	<para>Every syscall implemented in the Linuxulator must have
 	  its prototype with various flags in
 	  <filename>syscalls.master</filename>.  The form of the file
 	  is:</para>
 
 	<programlisting>...
 2	AUE_FORK	STD		{ int linux_fork(void); }
 ...
 6	AUE_CLOSE	NOPROTO	{ int close(int fd); }
 ...</programlisting>
 
 	<para>The first column represents the syscall number.  The
 	  second column is for auditing support.  The third column
 	  represents the syscall type.  It is one of
 	  <literal>STD</literal>, <literal>OBSOL</literal>,
 	  <literal>NOPROTO</literal> or <literal>UNIMPL</literal>.
 	  <literal>STD</literal> is a standard syscall with a full
 	  prototype and implementation.  <literal>OBSOL</literal> is
 	  obsolete and defines just the prototype.
 	  <literal>NOPROTO</literal> means that the syscall is
 	  implemented elsewhere, so the ABI prefix is not prepended,
 	  etc.  <literal>UNIMPL</literal> means that the syscall will
 	  be substituted with the <function>nosys</function> syscall
 	  (a syscall that just prints a message about the syscall not
 	  being implemented and returns
 	  <literal>ENOSYS</literal>).</para>
 
 	<para>From <filename>syscalls.master</filename> a script
 	  generates three files: <filename>linux_syscall.h</filename>,
 	  <filename>linux_proto.h</filename> and
 	  <filename>linux_sysent.c</filename>.  The
 	  <filename>linux_syscall.h</filename> contains definitions of
 	  syscall names and their numerical value, e.g.:</para>
 
 	<programlisting>...
 #define LINUX_SYS_linux_fork 2
 ...
 #define LINUX_SYS_close 6
 ...</programlisting>
 
 	<para>The <filename>linux_proto.h</filename> contains
 	  structure definitions of arguments to every syscall,
 	  e.g.:</para>
 
 	<programlisting>struct linux_fork_args {
   register_t dummy;
 };</programlisting>
 
 	<para>And finally, <filename>linux_sysent.c</filename>
 	  contains the structure describing the system entry table,
 	  used to actually dispatch a syscall, e.g.:</para>
 
 	<programlisting>{ 0, (sy_call_t *)linux_fork, AUE_FORK, NULL, 0, 0 }, /* 2 = linux_fork */
 { AS(close_args), (sy_call_t *)close, AUE_CLOSE, NULL, 0, 0 }, /* 6 = close */</programlisting>
 
 	<para>As you can see, <function>linux_fork</function> is
 	  implemented in the Linuxulator itself, so the definition is
 	  of <literal>STD</literal> type and has no arguments, which is
 	  reflected by the dummy argument structure.  On the other
 	  hand, <function>close</function> is just an alias for the
 	  real &os; &man.close.2;, so it has no &linux; argument
 	  structure associated and in the system entry table it is not
 	  prefixed with <literal>linux</literal>, as it calls the real
 	  &man.close.2; in the kernel.</para>
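 
 	<para>How such a table is used for dispatch can be shown with
 	  a toy version.  Everything carrying the
 	  <literal>_sketch</literal> suffix is hypothetical and far
 	  simpler than the real <literal>sysent</literal> entries
 	  (no thread argument, no audit event); the slot numbers 2
 	  and 6 follow the examples above.</para>
 
```c
#include <errno.h>
#include <stddef.h>

typedef int sy_call_sketch_t(void *args);

struct sysent_sketch {
	int			 sy_narg;	/* number of arguments */
	sy_call_sketch_t	*sy_call;	/* implementing function */
};

struct close_args_sketch {
	int fd;
};

/* Implemented by the emulation layer itself, takes no arguments. */
int
linux_fork_sketch(void *args)
{
	(void)args;
	return (0);
}

/* Stand-in for the native close: only validates the descriptor. */
int
close_sketch(void *args)
{
	struct close_args_sketch *uap = args;

	return (uap->fd >= 0 ? 0 : EBADF);
}

/* What an unimplemented slot falls back to. */
int
nosys_sketch(void *args)
{
	(void)args;
	return (ENOSYS);
}

static const struct sysent_sketch sysent_table[] = {
	[2] = { 0, linux_fork_sketch },
	[6] = { 1, close_sketch },
};

/* Index the table by syscall number and call the entry found there. */
int
dispatch_sketch(unsigned int code, void *args)
{
	if (code >= sizeof(sysent_table) / sizeof(sysent_table[0]) ||
	    sysent_table[code].sy_call == NULL)
		return (nosys_sketch(args));
	return (sysent_table[code].sy_call(args));
}
```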
       </sect3>
 
       <sect3 xml:id="dummy-syscalls">
 	<title>Dummy syscalls</title>
 
 	<para>The &linux; emulation layer is not complete, as some
 	  syscalls are not implemented properly and some are not
 	  implemented at all.  The emulation layer employs a facility
 	  to mark unimplemented syscalls with the
 	  <literal>DUMMY</literal> macro.  These dummy definitions
 	  reside in <filename>linux_dummy.c</filename> in the form of
 	  <literal>DUMMY(syscall);</literal>, which is then translated
 	  into the various syscall auxiliary files, and the
 	  implementation consists of printing a message saying that
 	  this syscall is not implemented.  The
 	  <literal>UNIMPL</literal> prototype is not used because we
 	  want to be able to identify the name of the syscall that
 	  was called, in order to know which syscalls are more
 	  important to implement.</para>
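 
 	<para>A macro of this kind could look roughly as follows.
 	  This is a userspace approximation and not the definition
 	  used by the Linuxulator: the generated functions take no
 	  arguments, the <literal>ENOSYS</literal> return value is an
 	  assumption, and <literal>example_call</literal> is a made-up
 	  syscall name.</para>
 
```c
#include <errno.h>
#include <stdio.h>

/*
 * Generate a stub named linux_<s> that reports its own name.  The
 * trailing struct declaration lets the use site end with a semicolon.
 */
#define DUMMY(s)							\
int									\
linux_##s(void)								\
{									\
	fprintf(stderr, "linux: syscall %s not implemented\n", #s);	\
	return (ENOSYS);						\
}									\
struct dummy_hack

DUMMY(example_call);
```
 
 	<para>Because the name is stringified inside the macro, the
 	  message identifies the exact syscall that was attempted,
 	  which is the property the text asks for.</para>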
       </sect3>
     </sect2>
 
     <sect2 xml:id="signal-handling">
       <title>Signal handling</title>
 
       <para>Signal handling is generally done in the &os; kernel for
 	all binary compatibility layers, with a call to a compat-dependent
 	layer.  The &linux; compatibility layer defines the
 	<function>linux_sendsig</function> routine for this
 	purpose.</para>
 
       <sect3 xml:id="linux-sendsig">
 	<title>&linux; sendsig</title>
 
 	<para>This routine first checks whether the signal has been
 	  installed with <literal>SA_SIGINFO</literal>, in which case
 	  it calls the <function>linux_rt_sendsig</function> routine
 	  instead.  Otherwise, it allocates (or reuses an already
 	  existing) signal handler context, then builds a list of
 	  arguments for the signal handler.  It translates the signal
 	  number based on the signal translation table, assigns a
 	  handler, and translates the sigset.  Then it saves context for the
 	  <function>sigreturn</function> routine (various registers,
 	  translated trap number and signal mask).  Finally, it copies
 	  out the signal context to userspace and prepares the context
 	  for the actual signal handler to run.</para>
       </sect3>
 
       <sect3 xml:id="linux-rt-sendsig">
 	<title>linux_rt_sendsig</title>
 
 	<para>This routine is similar to
 	  <function>linux_sendsig</function>; only the signal context
 	  preparation is different.  It adds
 	  <literal>siginfo</literal>, <literal>ucontext</literal>, and
 	  some &posix; parts.  It might be worth considering whether
 	  these two functions could be merged, with the benefit of
 	  less code duplication and possibly even faster
 	  execution.</para>
       </sect3>
 
       <sect3 xml:id="linux-sigreturn">
 	<title>linux_sigreturn</title>
 
 	<para>This syscall is used to return from the signal handler.
 	  It does some security checks and restores the original
 	  process context.  It also unmasks the signal in the process
 	  signal mask.</para>
       </sect3>
     </sect2>
 
     <sect2 xml:id="ptrace">
       <title>Ptrace</title>
 
       <para>Many &unix; derivatives implement the &man.ptrace.2; syscall
 	in order to allow various tracing and debugging features.
 	This facility enables the tracing process to obtain various
 	information about the traced process, such as register dumps or any
 	memory from the process address space, and also to trace
 	the process, for example by stepping an instruction or between system
 	entries (syscalls and traps).  &man.ptrace.2; also lets you
 	set various information in the traced process (registers
 	etc.).  &man.ptrace.2; is a &unix;-wide standard implemented
 	in most &unix;es around the world.</para>
 
       <para>&linux; emulation in &os; implements the &man.ptrace.2;
 	facility in <filename>linux_ptrace.c</filename>.  This file
 	contains the routines for converting registers between &linux;
 	and &os; and the actual &man.ptrace.2; syscall emulation.  The syscall
 	is a long switch block that implements its counterpart in &os;
 	for every &man.ptrace.2; command.  The &man.ptrace.2; commands
 	are mostly equal between &linux; and &os;, so usually just a
 	small modification is needed.  For example,
 	<literal>PT_GETREGS</literal> in &linux; operates on direct
 	data while &os; uses a pointer to the data, so after performing
 	a (native) &man.ptrace.2; syscall, a copyout must be done to
 	preserve &linux; semantics.</para>
 
       <para>The &man.ptrace.2; implementation in the Linuxulator has some
 	known weaknesses.  Panics have been seen when using
 	<command>strace</command> (which is a &man.ptrace.2; consumer)
 	in the Linuxulator environment.  Also,
 	<literal>PT_SYSCALL</literal> is not implemented.</para>
     </sect2>
 
     <sect2 xml:id="traps">
       <title>Traps</title>
 
       <para>Whenever a &linux; process running in the emulation layer
 	traps, the trap itself is handled transparently, with the only
 	exception being the trap translation.  &linux; and &os; differ
 	in opinion on what a trap is, so this is dealt with here.  The
 	code is actually very short:</para>
 
       <programlisting>static int
 translate_traps(int signal, int trap_code)
 {
 
   if (signal != SIGBUS)
     return signal;
 
   switch (trap_code) {
 
     case T_PROTFLT:
     case T_TSSFLT:
     case T_DOUBLEFLT:
     case T_PAGEFLT:
       return SIGSEGV;
 
     default:
       return signal;
   }
 }</programlisting>
     </sect2>
 
     <sect2 xml:id="stack-fixup">
       <title>Stack fixup</title>
 
       <para>The RTLD run-time link-editor expects so-called AUX tags
 	on the stack during an <function>execve</function>, so a fixup must
 	be done to ensure this.  Of course, every RTLD system is
 	different, so the emulation layer must provide its own stack
 	fixup routine to do this, and so does the Linuxulator.  The
 	<function>elf_linux_fixup</function> routine simply copies out AUX
 	tags to the stack and adjusts the stack of the user space
 	process to point right after those tags, so that RTLD works
 	as it expects.</para>
     </sect2>
 
     <sect2 xml:id="aout-support">
       <title>A.OUT support</title>
 
       <para>The &linux; emulation layer on i386 also supports &linux;
 	A.OUT binaries.  Pretty much everything described in the
 	previous sections must be implemented for A.OUT support
 	(besides trap translation and signal sending).  The support
 	for A.OUT binaries is no longer maintained; in particular, the 2.6
 	emulation does not work with it.  This does not cause any
 	problem, as the linux-base in ports probably does not support
 	A.OUT binaries at all.  This support will probably be removed
 	in the future.  Most of the code necessary for loading &linux;
 	A.OUT binaries is in the <filename>imgact_linux.c</filename>
 	file.</para>
     </sect2>
   </sect1>
 
   <sect1 xml:id="mi">
     <title>&linux; emulation layer - MI part</title>
 
     <para>This section talks about the machine-independent part of the
       Linuxulator.  It covers the emulation infrastructure needed for
       &linux; 2.6 emulation, the thread local storage (TLS)
       implementation (on i386) and futexes.  Then we talk briefly
       about some syscalls.</para>
 
     <sect2 xml:id="nptl-desc">
       <title>Description of NPTL</title>
 
       <para>One of the major areas of progress in development of
 	&linux; 2.6 was threading.  Prior to 2.6, the &linux;
 	threading support was implemented in the
 	<application>linuxthreads</application> library.  The library
 	was a partial implementation of &posix; threading.  The
 	threading was implemented using separate processes for each
 	thread using the <function>clone</function> syscall to let
 	them share the address space (and other things).  The main
 	weaknesses of this approach were that every thread had a
 	different PID, signal handling was broken (from the pthreads
 	perspective), etc.  Also, the performance was not very good
 	(use of <literal>SIGUSR</literal> signals for thread
 	synchronization, kernel resource consumption, etc.), so to
 	overcome these problems a new threading system was developed
 	and named NPTL.</para>
 
       <para>The NPTL library focused on two things, but a third thing
 	came along so it is usually considered a part of NPTL.  Those
 	two things were embedding of threads into a process structure
 	and futexes.  The additional third thing was TLS, which is not
 	directly required by NPTL but the whole NPTL userland library
 	depends on it.  Those improvements yielded much improved
 	performance and standards conformance.  NPTL is the standard
 	threading library in &linux; systems these days.</para>
 
       <para>The &os; Linuxulator implementation approaches NPTL in
 	three main areas: TLS, futexes, and PID mangling, which is
 	meant to simulate &linux; threads.  Further sections
 	describe each of these areas.</para>
     </sect2>
 
     <sect2 xml:id="linux26-emu">
       <title>&linux; 2.6 emulation infrastructure</title>
 
       <para>These sections deal with the way &linux; threads are
 	managed and how we simulate that in &os;.</para>
 
       <sect3 xml:id="linux26-runtime">
 	<title>Runtime determining of 2.6 emulation</title>
 
 	<para>The &linux; emulation layer in &os; supports runtime
 	  setting of the emulated version.  This is done via
 	  &man.sysctl.8;, namely
 	  <literal>compat.linux.osrelease</literal>.  Setting this
 	  &man.sysctl.8; affects runtime behavior of the emulation
 	  layer.  When set to 2.6.x it sets the value of
 	  <literal>linux_use_linux26</literal> while setting to
 	  something else keeps it unset.  This variable (plus
 	  per-prison variables of the very same kind) determines
 	  whether 2.6 infrastructure (mainly PID mangling) is used in
 	  the code or not.  The version setting is done system-wide
 	  and this affects all &linux; processes.  The &man.sysctl.8;
 	  should not be changed when running any &linux; binary as it
 	  might harm things.</para>
       </sect3>
 
       <sect3 xml:id="linux-proc-thread">
 	<title>&linux; processes and thread identifiers</title>
 
 	<para>The semantics of &linux; threading are a little
 	  confusing and use entirely different nomenclature from &os;.
 	  A process in &linux; consists of a <literal>struct
 	    task</literal> embedding two identifier fields: PID and
 	  TGID.  PID is <emphasis>not</emphasis> a process ID but
 	  a thread ID.  The TGID identifies a thread group, in other
 	  words a process.  For a single-threaded process the PID equals
 	  the TGID.</para>
 
 	<para>The thread in NPTL is just an ordinary process that
 	  happens to have TGID not equal to PID and have a group
 	  leader not equal to itself (and shared VM etc. of course).
 	  Everything else happens in the same way as to an ordinary
 	  process.  There is no separation of a shared status to some
 	  external structure like in &os;.  This creates some
 	  duplication of information and possible data inconsistency.
 	  The &linux; kernel seems to use task -&gt; group information
 	  in some places and task information elsewhere and it is
 	  really not very consistent and looks error-prone.</para>
 
 	<para>Every NPTL thread is created by a call to the
 	  <function>clone</function> syscall with a specific set of
 	  flags (more in the next subsection).  The NPTL implements
 	  strict 1:1 threading.</para>
 
 	<para>In &os; we emulate NPTL threads with ordinary &os;
 	  processes that share VM space, etc. and the PID gymnastic is
 	  just mimicked in the emulation specific structure attached
 	  to the process.  The structure attached to the process looks
 	  like:</para>
 
 	<programlisting>struct linux_emuldata {
   pid_t pid;
 
   int *child_set_tid; /* in clone(): Child's TID to set on clone */
   int *child_clear_tid; /* in clone(): Child's TID to clear on exit */
 
   struct linux_emuldata_shared *shared;
 
   int pdeath_signal; /* parent death signal */
 
   LIST_ENTRY(linux_emuldata) threads; /* list of linux threads */
 };</programlisting>
 
 	<para>The PID is used to identify the &os; process that
 	  attaches this structure.  The
 	  <function>child_set_tid</function> and
 	  <function>child_clear_tid</function> are used for TID
 	  address copyout when a process is created and when it exits.  The
 	  <varname>shared</varname> pointer points to a structure
 	  shared among threads.  The <varname>pdeath_signal</varname>
 	  variable identifies the parent death signal and the
 	  <varname>threads</varname> pointer is used to link this
 	  structure to the list of threads.  The
 	  <literal>linux_emuldata_shared</literal> structure looks
 	  like:</para>
 
 	<programlisting>struct linux_emuldata_shared {
 
   int refs;
 
   pid_t group_pid;
 
   LIST_HEAD(, linux_emuldata) threads; /* head of list of linux threads */
 };</programlisting>
 
 	<para>The <varname>refs</varname> is a reference counter being
 	  used to determine when we can free the structure to avoid
 	  memory leaks.  The <varname>group_pid</varname> is to
 	  identify PID ( = TGID) of the whole process ( = thread
 	  group).  The <varname>threads</varname> pointer is the head
 	  of the list of threads in the process.</para>
 
 	<para>The <literal>linux_emuldata</literal> structure can be
 	  obtained from the process using
 	  <function>em_find</function>.  The prototype of the function
 	  is:</para>
 
 	<programlisting>struct linux_emuldata *em_find(struct proc *, int locked);</programlisting>
 
 	<para>Here, <varname>proc</varname> is the process we want the
 	  emuldata structure from and the <varname>locked</varname>
 	  parameter determines whether we want to lock or not.  The
 	  accepted values are <literal>EMUL_DOLOCK</literal> and
 	  <literal>EMUL_DOUNLOCK</literal>.  More about locking
 	  later.</para>
       </sect3>
 
       <sect3 xml:id="pid-mangling">
 	<title>PID mangling</title>
 
-	<para>Because of the described different view knowing what a
+	<para>Because &os; and &linux; differ in their view of what a
 	  process ID and a thread ID are, we have
 	  to translate between the two views somehow.  We do it by PID mangling.
 	  This means that we fake what a PID (=TGID) and TID (=PID) is
 	  between kernel and userland.  The rule of thumb is that in the
 	  kernel (in the Linuxulator) PID = PID and TGID = shared -&gt;
 	  group_pid and to userland we present <literal>PID = shared
 	    -&gt; group_pid</literal> and <literal>TID = proc -&gt;
 	    p_pid</literal>.  The PID member of the
 	  <literal>linux_emuldata</literal> structure is a &os;
 	  PID.</para>
 
 	<para>The above mainly affects the <function>getpid</function>,
 	  <function>getppid</function> and <function>gettid</function>
 	  syscalls, where we use PID or TGID as appropriate.  In the copyout
 	  of TIDs in <function>child_clear_tid</function> and
 	  <function>child_set_tid</function> we copy out the &os;
 	  PID.</para>
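 
 	<para>The userland side of this rule can be sketched as
 	  follows.  This is a simplified illustration, not the
 	  Linuxulator code: the <literal>ex_</literal> structures keep
 	  only the fields needed here, and the helper names are
 	  hypothetical.</para>
 
 	<programlisting>typedef int ex_pid_t; /* hypothetical stand-in for pid_t */
 
 struct ex_emuldata_shared {
   ex_pid_t group_pid; /* TGID of the whole thread group */
 };
 
 struct ex_emuldata {
   ex_pid_t pid; /* FreeBSD PID of this process */
   struct ex_emuldata_shared *shared;
 };
 
 /* What Linux userland sees as its PID: the thread group ID. */
 static ex_pid_t
 ex_linux_getpid(const struct ex_emuldata *em)
 {
 
   return (em-&gt;shared-&gt;group_pid);
 }
 
 /* What Linux userland sees as its TID: the FreeBSD PID. */
 static ex_pid_t
 ex_linux_gettid(const struct ex_emuldata *em)
 {
 
   return (em-&gt;pid);
 }</programlisting>
 
 	<para>For the group leader both helpers return the same
 	  value, which matches the single-threaded case where the PID
 	  equals the TGID.</para>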
       </sect3>
 
       <sect3 xml:id="clone-syscall">
 	<title>Clone syscall</title>
 
 	<para>The <function>clone</function> syscall is the way
 	  threads are created in &linux;.  The syscall prototype looks
 	  like this:</para>
 
 	<programlisting>int linux_clone(l_int flags, void *stack, void *parent_tidptr, int dummy,
 void * child_tidptr);</programlisting>
 
 	<para>The <varname>flags</varname> parameter tells the syscall
 	  how exactly the processes should be cloned.  As described
 	  above, &linux; can create processes sharing various things
 	  independently, for example two processes can share file
 	  descriptors but not VM, etc.  The last byte of the
 	  <varname>flags</varname> parameter is the exit signal of the
 	  newly created process.  The <varname>stack</varname>
 	  parameter, if non-<literal>NULL</literal>, tells where the
 	  thread stack is; if it is <literal>NULL</literal> we are
 	  supposed to copy-on-write the calling process stack (i.e. do
 	  what the normal &man.fork.2; routine does).  The
 	  <varname>parent_tidptr</varname> parameter is used as an
 	  address for copying out the process PID (i.e. thread ID) once
 	  the process is sufficiently instantiated but is not runnable
 	  yet.  The <varname>dummy</varname> parameter is here because
 	  of the very strange calling convention of this syscall on
 	  i386.  It uses the registers directly and does not let the
 	  compiler do it, which results in the need for a dummy parameter.
 	  The <varname>child_tidptr</varname> parameter is used as an
 	  address for copying out the PID once the process has finished
 	  forking and when the process exits.</para>
 
 	<para>The syscall itself proceeds by setting corresponding
 	  flags depending on the flags passed in.  For example,
 	  <literal>CLONE_VM</literal> maps to RFMEM (sharing of VM),
 	  etc.  The only nit here is <literal>CLONE_FS</literal> and
 	  <literal>CLONE_FILES</literal>, because &os; does not allow
 	  setting these separately, so we fake it by not setting RFFDG
 	  (copying of fd table and other fs information) if either of
 	  these is defined.  This does not cause any problems, because
 	  those flags are always set together.  After setting the
 	  flags the process is forked using the internal
 	  <function>fork1</function> routine; the process is
 	  instrumented not to be put on a run queue, i.e. not to be
 	  set runnable.  After the forking is done we possibly
 	  reparent the newly created process to emulate
 	  <literal>CLONE_PARENT</literal> semantics.  The next part is
 	  creating the emulation data.  Threads in &linux; do not
 	  signal their parents, so we set the exit signal to 0 to
 	  disable this.  After that, <varname>child_set_tid</varname> and
 	  <varname>child_clear_tid</varname> are set, enabling the
 	  functionality later in the code.  At this point we copy out
 	  the PID to the address specified by
 	  <varname>parent_tidptr</varname>.  The process
 	  stack is set by simply rewriting the thread frame
 	  <varname>%esp</varname> register (<varname>%rsp</varname> on
 	  amd64).  The next part is setting up TLS for the newly created
 	  process.  After this, &man.vfork.2; semantics might be
 	  emulated, and finally the newly created process is put on a
 	  run queue and its PID is copied out to the parent process via the
 	  <function>clone</function> return value.</para>
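 
 	<para>The flag translation step can be sketched as follows.
 	  This is a minimal illustration under stated assumptions: the
 	  <literal>EX_CLONE_*</literal> and
 	  <literal>EX_CSIGNAL</literal> constants use the &linux;
 	  values, while <literal>EX_RFMEM</literal> and
 	  <literal>EX_RFFDG</literal> are placeholder values rather
 	  than the real &os; ones, and the function name is
 	  hypothetical.</para>
 
 	<programlisting>#define EX_CSIGNAL     0x000000ff /* exit signal: last byte of flags */
 #define EX_CLONE_VM    0x00000100
 #define EX_CLONE_FS    0x00000200
 #define EX_CLONE_FILES 0x00000400
 
 /* Placeholder values standing in for the rfork-style flags. */
 #define EX_RFFDG 0x01 /* copy the fd table and fs information */
 #define EX_RFMEM 0x02 /* share the VM */
 
 /*
  * Translate clone flags into fork flags and extract the exit signal.
  */
 static int
 ex_clone_to_rfork(int flags, int *exit_signal)
 {
   int ff = 0;
 
   *exit_signal = flags &amp; EX_CSIGNAL;
   if (flags &amp; EX_CLONE_VM)
     ff |= EX_RFMEM;
   /* Copy fd/fs state only if neither sharing flag was requested. */
   if ((flags &amp; (EX_CLONE_FS | EX_CLONE_FILES)) == 0)
     ff |= EX_RFFDG;
   return (ff);
 }</programlisting>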
 
 	<para>The <function>clone</function> syscall can be, and in
 	  fact is, used for emulating the classic &man.fork.2; and
 	  &man.vfork.2; syscalls.  Newer glibc, when running on a 2.6 kernel,
 	  uses <function>clone</function> to implement the &man.fork.2;
 	  and &man.vfork.2; syscalls.</para>
       </sect3>
 
       <sect3 xml:id="locking">
 	<title>Locking</title>
 
 	<para>The locking is implemented to be per-subsystem because
 	  we do not expect a lot of contention on these.  There are
 	  two locks: <literal>emul_lock</literal> used to protect
 	  manipulating of <literal>linux_emuldata</literal> and
 	  <literal>emul_shared_lock</literal> used to manipulate
 	  <literal>linux_emuldata_shared</literal>.  The
 	  <literal>emul_lock</literal> is a nonsleepable blocking
 	  mutex while <literal>emul_shared_lock</literal> is a
-	  sleepable blocking <literal>sx_lock</literal>.  Because of
+	  sleepable blocking <literal>sx_lock</literal>.  Due to
 	  the per-subsystem locking we can coalesce some locks and
 	  that is why <function>em_find</function> offers the non-locking
 	  access.</para>
       </sect3>
     </sect2>
 
     <sect2 xml:id="tls">
       <title>TLS</title>
 
       <para>This section deals with TLS, also known as thread local
 	storage.</para>
 
       <sect3 xml:id="trheading-intro">
 	<title>Introduction to threading</title>
 
 	<para>Threads in computer science are entities within a
 	  process that can be scheduled independently from each other.
 	  The threads in the process share process wide data (file
 	  descriptors, etc.) but also have their own stack for their
 	  own data.  Sometimes there is a need for process-wide data
 	  specific to a given thread.  Imagine a name of the thread in
 	  execution or something like that.  The traditional &unix;
 	  threading API, <application>pthreads</application> provides
 	  a way to do it via &man.pthread.key.create.3;,
 	  &man.pthread.setspecific.3; and &man.pthread.getspecific.3;
 	  where a thread can create a key to the thread local data and
 	  use &man.pthread.getspecific.3; or
 	  &man.pthread.setspecific.3; to manipulate those data.  You
 	  can easily see that this is not the most comfortable way
 	  this could be accomplished.  So various producers of C/C++
 	  compilers introduced a better way.  They defined a new
 	  modifier keyword <literal>__thread</literal> that specifies that a variable is
 	  thread specific.  A new method of accessing such variables
 	  was developed as well (at least on i386).  The
 	  <application>pthreads</application> method tends to be
 	  implemented in userspace as a trivial lookup table.  The
 	  performance of such a solution is not very good.  So the new
 	  method uses (on i386) segment registers to address a
 	  segment, where TLS area is stored so the actual accessing of
 	  a thread variable is just appending the segment register to
 	  the address thus addressing via it.  The segment registers
 	  are usually <varname>%gs</varname> and
 	  <varname>%fs</varname> acting like segment selectors.  Every
 	  thread has its own area where the thread local data are
 	  stored and the segment must be loaded on every context
 	  switch.  This method is very fast and used almost
 	  exclusively in the whole i386 &unix; world.  Both &os; and
 	  &linux; implement this approach and it yields very good
 	  results.  The only drawback is the need to reload the
 	  segment on every context switch, which can slow down context
 	  switches.  &os; tries to avoid this overhead by using only 1
 	  segment descriptor for this while &linux; uses 3.
 	  Interestingly, almost nothing uses more than 1
 	  descriptor (only <application>Wine</application> seems to
 	  use 2) so &linux; pays this unnecessary price for context
 	  switches.</para>
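 
 	<para>The two styles can be compared in a short sketch.  It
 	  assumes a compiler that supports the GCC-style
 	  <literal>__thread</literal> keyword; the
 	  <literal>ex_</literal> names are hypothetical.  Each thread
 	  stores its own name both through a
 	  <application>pthreads</application> key and through a
 	  thread-local variable, and neither store is visible to the
 	  other thread.</para>
 
 	<programlisting>#include &lt;pthread.h&gt;
 #include &lt;stddef.h&gt;
 
 static pthread_key_t ex_name_key;    /* pthreads way: a key per datum */
 static __thread const char *ex_name; /* compiler way: a thread-local variable */
 
 static void *
 ex_worker(void *arg)
 {
 
   ex_name = arg;                         /* plain assignment */
   pthread_setspecific(ex_name_key, arg); /* library call */
   return (pthread_getspecific(ex_name_key));
 }
 
 /* Returns 1 when each thread saw only its own values, 0 otherwise. */
 int
 ex_tls_demo(void)
 {
   static char main_name[] = "main", worker_name[] = "worker";
   pthread_t t;
   void *seen = NULL;
 
   if (pthread_key_create(&amp;ex_name_key, NULL) != 0)
     return (0);
   ex_name = main_name;
   pthread_setspecific(ex_name_key, main_name);
   if (pthread_create(&amp;t, NULL, ex_worker, worker_name) != 0)
     return (0);
   pthread_join(t, &amp;seen);
   /* The stores made by the worker did not disturb our copies. */
   return (seen == worker_name &amp;&amp; ex_name == main_name &amp;&amp;
       pthread_getspecific(ex_name_key) == main_name);
 }</programlisting>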
       </sect3>
 
       <sect3 xml:id="i386-segs">
 	<title>Segments on i386</title>
 
 	<para>The i386 architecture implements so-called segments.
 	  A segment is a description of an area of memory: the base
 	  address (bottom) of the memory area, the end of it
 	  (ceiling), type, protection, etc.  The memory described by a
 	  segment can be accessed using segment selector registers
 	  (<varname>%cs</varname>, <varname>%ds</varname>,
 	  <varname>%ss</varname>, <varname>%es</varname>,
 	  <varname>%fs</varname>, <varname>%gs</varname>).  For
 	  example, let us suppose we have a segment whose base address
 	  is 0x1234, with some length, and this code:</para>
 
 	<programlisting>mov %edx,%gs:0x10</programlisting>
 
 	<para>This will store the content of the
 	  <varname>%edx</varname> register into memory location
 	  0x1244.  Some segment registers have a special use, for
 	  example <varname>%cs</varname> is used for code segment and
 	  <varname>%ss</varname> is used for stack segment but
 	  <varname>%fs</varname> and <varname>%gs</varname> are
 	  generally unused.  Segments are either stored in a global
 	  GDT table or in a local LDT table.  LDT is accessed via an
 	  entry in the GDT.  The LDT can store more types of segments.
 	  LDT can be per process.  Both tables define up to 8191
 	  entries.</para>
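 
 	<para>The address arithmetic from the example above (base
 	  0x1234 plus offset 0x10 gives 0x1244) can be written out as
 	  a small sketch.  The structure is a simplification with
 	  hypothetical names; a real segment descriptor also carries
 	  the type and protection bits mentioned earlier.</para>
 
 	<programlisting>#include &lt;stdint.h&gt;
 
 /* Simplified segment: only the fields needed for the example. */
 struct ex_segment {
   uint32_t base;  /* linear address of the first byte */
   uint32_t limit; /* highest valid offset inside the segment */
 };
 
 /*
  * Translate a segment-relative offset to a linear address.
  * Returns 0 on success and -1 if the offset is outside the segment.
  */
 static int
 ex_seg_translate(const struct ex_segment *seg, uint32_t off, uint32_t *linear)
 {
 
   if (off &gt; seg-&gt;limit)
     return (-1);
   *linear = seg-&gt;base + off;
   return (0);
 }</programlisting>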
       </sect3>
 
       <sect3 xml:id="linux-i386">
 	<title>Implementation on &linux; i386</title>
 
 	<para>There are two main ways of setting up TLS in &linux;.
 	  It can be set when cloning a process using the
 	  <function>clone</function> syscall or the process can call
 	  <function>set_thread_area</function>.  When a process passes the
 	  <literal>CLONE_SETTLS</literal> flag to
 	  <function>clone</function>, the kernel expects the memory
 	  pointed to by the <varname>%esi</varname> register to contain a &linux;
 	  user space representation of a segment, which gets
 	  translated to the machine representation of a segment and
 	  loaded into a GDT slot.  The GDT slot can be specified with
 	  a number, or -1 can be used, meaning that the system itself
 	  should choose the first free slot.  In practice, the vast
 	  majority of programs use only one TLS entry and do not
 	  care about the number of the entry.  We exploit this in the
 	  emulation and in fact depend on it.</para>
       </sect3>
 
       <sect3 xml:id="tls-emu">
 	<title>Emulation of &linux; TLS</title>
 
 	<sect4 xml:id="tls-i386">
 	  <title>i386</title>
 
 	  <para>Loading of TLS for the current thread happens by
 	    calling <function>set_thread_area</function> while loading
 	    TLS for a second process in <function>clone</function> is
 	    done in the separate block in <function>clone</function>.
 	    Those two functions are very similar.  The only difference
 	    being the actual loading of the GDT segment, which happens
 	    on the next context switch for the newly created process
 	    while <function>set_thread_area</function> must load this
 	    directly.  The code basically does this.  It copies the
 	    &linux; form segment descriptor from the userland.  The
 	    code checks for the number of the descriptor but because
 	    this differs between &os; and &linux; we fake it a little.
 	    We only support indexes of 6, 3 and -1.  The 6 is genuine
 	    &linux; number, 3 is genuine &os; one and -1 means
 	    autoselection.  Then we set the descriptor number to
 	    constant 3 and copy out this to the userspace.  We rely on
 	    the userspace process using the number from the descriptor
 	    but this works most of the time (have never seen a case
 	    where this did not work) as the userspace process
 	    typically passes in 1.  Then we convert the descriptor
 	    from the &linux; form to a machine-dependent form (i.e.
 	    operating system independent form) and copy this to the
 	    &os; defined segment descriptor.  Finally we can load it.
 	    We assign the descriptor to the thread's PCB (process control
 	    block) and load the <varname>%gs</varname> segment using
 	    <function>load_gs</function>.  This loading must be done
 	    in a critical section so that nothing can interrupt us.
 	    The <literal>CLONE_SETTLS</literal> case works exactly
 	    like this, except that the loading using
 	    <function>load_gs</function> is not performed.  The
 	    segment used for this (segment number 3) is shared for
 	    this use between &os; processes and &linux; processes so
 	    the &linux; emulation layer does not add any overhead over
 	    plain &os;.</para>
 	</sect4>
 
 	<sect4 xml:id="tls-amd64">
 	  <title>amd64</title>
 
 	  <para>The amd64 implementation is similar to the i386 one,
 	    but there was initially no 32-bit segment descriptor used
 	    for this purpose (hence not even native 32-bit TLS users
 	    worked), so we had to add such a segment and implement its
 	    loading on every context switch (when a flag signaling use
 	    of 32-bit is set).  Apart from this, the TLS loading is
 	    exactly the same; only the segment numbers are different
 	    and the descriptor format and the loading differ
 	    slightly.</para>
 	</sect4>
       </sect3>
     </sect2>
 
     <sect2 xml:id="futexes">
       <title>Futexes</title>
 
       <sect3 xml:id="sync-intro">
 	<title>Introduction to synchronization</title>
 
 	<para>Threads need some kind of synchronization and &posix;
 	  provides some of them: mutexes for mutual exclusion,
 	  read-write locks for mutual exclusion with a biased ratio of
 	  reads and writes, and condition variables for signaling a
 	  status change.  It is interesting to note that the &posix;
 	  threading API lacks support for semaphores.  The
 	  implementations of those synchronization routines are heavily
 	  dependent on the type of threading support we have.  In the pure
 	  1:M (userspace) model the implementation can be done solely
 	  in userspace and thus be very fast (the condition variables
 	  will probably end up being implemented using signals, i.e.
 	  not fast) and simple.  In the 1:1 model, the situation is also
 	  quite clear: the threads must be synchronized using kernel
 	  facilities (which is very slow because a syscall must be
 	  performed).  The mixed M:N scenario just combines the first
 	  and second approach or relies solely on the kernel.  Thread
 	  synchronization is a vital part of thread-enabled
 	  programming and its performance can affect the resulting program
 	  a lot.  Recent benchmarks on the &os; operating system showed
 	  that an improved sx_lock implementation yielded a 40% speedup
 	  in <firstterm>ZFS</firstterm> (a heavy sx user).  This is
 	  in-kernel stuff but it shows clearly how important the
 	  performance of synchronization primitives is.</para>
 
 	<para>Threaded programs should be written with as little
 	  contention on locks as possible.  Otherwise, instead of
-	  doing useful work the thread just waits on a lock.  Because
+	  doing useful work the thread just waits on a lock.  As a result
 	  of this, most well-written threaded programs show little
 	  lock contention.</para>
       </sect3>
 
       <sect3 xml:id="futex-intro">
 	<title>Futexes introduction</title>
 
 	<para>&linux; implements 1:1 threading, i.e. it has to use
 	  in-kernel synchronization primitives.  As stated earlier,
 	  well-written threaded programs have little lock contention.
 	  So a typical sequence could be performed as two atomic
 	  increments/decrements of the mutex reference counter, which is very
 	  fast, as presented by the following example:</para>
 
 	<programlisting>pthread_mutex_lock(&amp;mutex);
 ....
 pthread_mutex_unlock(&amp;mutex);</programlisting>
 
 	<para>1:1 threading forces us to perform two syscalls for
 	  those mutex calls, which is very slow.</para>
 
 	<para>The solution &linux;&nbsp;2.6 implements is called
 	  futexes.  Futexes implement the check for contention in
 	  userspace and call kernel primitives only in a case of
 	  contention.  Thus the typical case takes place without any
 	  kernel intervention.  This yields reasonably fast and
 	  flexible synchronization primitives implementation.</para>
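 
 	<para>The userspace fast path can be sketched with C11
 	  atomics.  This is a well-known three-state mutex pattern
 	  shown only as an illustration; it is not the glibc or the
 	  Linuxulator implementation.  The
 	  <function>ex_futex_wait</function> and
 	  <function>ex_futex_wake</function> functions are hypothetical
 	  stand-ins for the kernel calls: they only count how often the
 	  kernel would have been entered, and the wait stand-in yields
 	  instead of sleeping.</para>
 
 	<programlisting>#include &lt;sched.h&gt;
 #include &lt;stdatomic.h&gt;
 
 static atomic_int ex_syscalls; /* number of simulated kernel entries */
 
 /* Hypothetical stand-in for FUTEX_WAIT: count the call and yield. */
 static void
 ex_futex_wait(atomic_int *uaddr, int val)
 {
 
   (void)uaddr;
   (void)val;
   atomic_fetch_add(&amp;ex_syscalls, 1);
   sched_yield();
 }
 
 /* Hypothetical stand-in for FUTEX_WAKE: count the call. */
 static void
 ex_futex_wake(atomic_int *uaddr, int count)
 {
 
   (void)uaddr;
   (void)count;
   atomic_fetch_add(&amp;ex_syscalls, 1);
 }
 
 /* Mutex states: 0 = unlocked, 1 = locked, 2 = locked, maybe waiters. */
 static void
 ex_mutex_lock(atomic_int *m)
 {
   int c = 0;
 
   /* Fast path: an uncontended lock is a single atomic operation. */
   if (atomic_compare_exchange_strong(m, &amp;c, 1))
     return;
   /* Slow path: mark the lock contended and wait in the kernel. */
   if (c != 2)
     c = atomic_exchange(m, 2);
   while (c != 0) {
     ex_futex_wait(m, 2);
     c = atomic_exchange(m, 2);
   }
 }
 
 static void
 ex_mutex_unlock(atomic_int *m)
 {
 
   /* Fast path: nobody waited, so no wakeup is needed. */
   if (atomic_fetch_sub(m, 1) == 1)
     return;
   atomic_store(m, 0);
   ex_futex_wake(m, 1);
 }</programlisting>
 
 	<para>In the uncontended case a lock and unlock pair touches
 	  only the atomic variable and the counter of simulated kernel
 	  entries stays at zero.</para>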
       </sect3>
 
       <sect3 xml:id="futex-api">
 	<title>Futex API</title>
 
 	<para>The futex syscall looks like this:</para>
 
 	<programlisting>int futex(void *uaddr, int op, int val, struct timespec *timeout, void *uaddr2, int val3);</programlisting>
 
 	<para>In this example <varname>uaddr</varname> is an address
 	  of the mutex in userspace, <varname>op</varname> is an
 	  operation we are about to perform and the other parameters
 	  have per-operation meaning.</para>
 
 	<para>Futexes implement the following operations:</para>
 
 	<itemizedlist>
 	  <listitem>
 	    <para><literal>FUTEX_WAIT</literal></para>
 	  </listitem>
 	  <listitem>
 	    <para><literal>FUTEX_WAKE</literal></para>
 	  </listitem>
 	  <listitem>
 	    <para><literal>FUTEX_FD</literal></para>
 	  </listitem>
 	  <listitem>
 	    <para><literal>FUTEX_REQUEUE</literal></para>
 	  </listitem>
 	  <listitem>
 	    <para><literal>FUTEX_CMP_REQUEUE</literal></para>
 	  </listitem>
 	  <listitem>
 	    <para><literal>FUTEX_WAKE_OP</literal></para>
 	  </listitem>
 	</itemizedlist>
 
 	<sect4 xml:id="futex-wait">
 	  <title>FUTEX_WAIT</title>
 
 	  <para>This operation verifies that on address
 	    <varname>uaddr</varname> the value <varname>val</varname>
 	    is written.  If not, <literal>EWOULDBLOCK</literal> is
 	    returned, otherwise the thread is queued on the futex and
 	    gets suspended.  If the argument
 	    <varname>timeout</varname> is non-zero it specifies the
 	    maximum time for the sleeping, otherwise the sleeping is
 	    infinite.</para>
 	</sect4>
 
 	<sect4 xml:id="futex-wake">
 	  <title>FUTEX_WAKE</title>
 
 	  <para>This operation takes a futex at
 	    <varname>uaddr</varname> and wakes up the first
 	    <varname>val</varname> threads queued on this
 	    futex.</para>
 	</sect4>
 
 	<sect4 xml:id="futex-fd">
 	  <title>FUTEX_FD</title>
 
 	  <para>This operation associates a file descriptor with a
 	    given futex.</para>
 	</sect4>
 
 	<sect4 xml:id="futex-requeue">
 	  <title>FUTEX_REQUEUE</title>
 
 	  <para>This operation takes <varname>val</varname> threads
 	    queued on futex at <varname>uaddr</varname>, wakes them
 	    up, and takes <varname>val2</varname> next threads and
 	    requeues them on futex at
 	    <varname>uaddr2</varname>.</para>
 	</sect4>
 
 	<sect4 xml:id="futex-cmp-requeue">
 	  <title>FUTEX_CMP_REQUEUE</title>
 
 	  <para>This operation does the same as
 	    <literal>FUTEX_REQUEUE</literal> but it checks that
 	    <varname>val3</varname> equals <varname>val</varname>
 	    first.</para>
 	</sect4>
 
 	<sect4 xml:id="futex-wake-op">
 	  <title>FUTEX_WAKE_OP</title>
 
 	  <para>This operation performs an atomic operation on
 	    <varname>val3</varname> (which contains some other encoded
 	    value) and <varname>uaddr</varname>.  Then it wakes up
 	    <varname>val</varname> threads on futex at
 	    <varname>uaddr</varname> and if the atomic operation
 	    returned a positive number it wakes up
 	    <varname>val2</varname> threads on futex at
 	    <varname>uaddr2</varname>.</para>
 
 	  <para>The operations implemented in
 	    <literal>FUTEX_WAKE_OP</literal>:</para>
 
 	  <itemizedlist>
 	    <listitem>
 	      <para><literal>FUTEX_OP_SET</literal></para>
 	    </listitem>
 	    <listitem>
 	      <para><literal>FUTEX_OP_ADD</literal></para>
 	    </listitem>
 	    <listitem>
 	      <para><literal>FUTEX_OP_OR</literal></para>
 	    </listitem>
 	    <listitem>
 	      <para><literal>FUTEX_OP_AND</literal></para>
 	    </listitem>
 	    <listitem>
 	      <para><literal>FUTEX_OP_XOR</literal></para>
 	    </listitem>
 	  </itemizedlist>
 
 	  <note>
 	    <para>There is no <varname>val2</varname> parameter in the
 	      futex prototype.  The <varname>val2</varname> is taken
 	      from the <varname>struct timespec *timeout</varname>
 	      parameter for operations
 	      <literal>FUTEX_REQUEUE</literal>,
 	      <literal>FUTEX_CMP_REQUEUE</literal> and
 	      <literal>FUTEX_WAKE_OP</literal>.</para>
 	  </note>
 	</sect4>
       </sect3>
 
       <sect3 xml:id="futex-emu">
 	<title>Futex emulation in &os;</title>
 
 	<para>The futex emulation in &os; is taken from NetBSD and
 	  further extended by us.  It is placed in
 	  <filename>linux_futex.c</filename> and
 	  <filename>linux_futex.h</filename> files.  The
 	  <literal>futex</literal> structure looks like:</para>
 
 	<programlisting>struct futex {
   void *f_uaddr;
   int f_refcount;
 
   LIST_ENTRY(futex) f_list;
 
   TAILQ_HEAD(lf_waiting_proc, waiting_proc) f_waiting_proc;
 };</programlisting>
 
 	<para>And the structure <literal>waiting_proc</literal>
 	  is:</para>
 
 	<programlisting>struct waiting_proc {
 
   struct thread *wp_t;
 
   struct futex *wp_new_futex;
 
   TAILQ_ENTRY(waiting_proc) wp_list;
 };</programlisting>
 
 	<sect4 xml:id="futex-get">
 	  <title>futex_get / futex_put</title>
 
 	  <para>A futex is obtained using the
 	    <function>futex_get</function> function, which searches a
 	    linear list of futexes and returns the found one or
 	    creates a new futex.  To release a futex we call the
 	    <function>futex_put</function> function, which decrements
 	    the reference counter of the futex and frees the futex
 	    when the count reaches zero.</para>
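 
 	  <para>The lookup-or-create and reference-counting pattern
 	    described above can be sketched in userland C as follows.
 	    This is a minimal illustration, not the code from
 	    <filename>linux_futex.c</filename>: it omits all locking
 	    and the waiter queue, and the list head name
 	    <varname>futex_list</varname> is an assumption made for
 	    the example.</para>
 
 	  <programlisting><![CDATA[
```c
#include <assert.h>
#include <stdlib.h>
#include <sys/queue.h>

/* Simplified futex record: only the fields needed for lookup. */
struct futex {
	void *f_uaddr;			/* user address identifying the futex */
	int f_refcount;			/* number of current users */
	LIST_ENTRY(futex) f_list;	/* linkage on the global list */
};

/* Hypothetical global list of all known futexes. */
static LIST_HEAD(futex_list, futex) futex_list =
    LIST_HEAD_INITIALIZER(futex_list);

/* Return the futex for uaddr, creating it on first use. */
struct futex *
futex_get(void *uaddr)
{
	struct futex *f;

	/* Linear search of the list, as described in the text. */
	LIST_FOREACH(f, &futex_list, f_list) {
		if (f->f_uaddr == uaddr) {
			f->f_refcount++;
			return (f);
		}
	}

	f = calloc(1, sizeof(*f));
	if (f == NULL)
		return (NULL);
	f->f_uaddr = uaddr;
	f->f_refcount = 1;
	LIST_INSERT_HEAD(&futex_list, f, f_list);
	return (f);
}

/* Drop one reference; free the futex when the last user is gone. */
void
futex_put(struct futex *f)
{
	assert(f->f_refcount > 0);
	if (--f->f_refcount == 0) {
		LIST_REMOVE(f, f_list);
		free(f);
	}
}
```
]]></programlisting>
 
 	  <para>In this sketch two calls to
 	    <function>futex_get</function> with the same address
 	    return the same structure with a reference count of two,
 	    and the structure is freed only after the second
 	    <function>futex_put</function>.</para>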
 	</sect4>
 
 	<sect4 xml:id="futex-sleep">
 	  <title>futex_sleep</title>
 
 	  <para>When a futex queues a thread for sleeping, it creates
 	    a <literal>waiting_proc</literal> structure, puts this
 	    structure on the list inside the futex structure, and then
 	    performs a &man.tsleep.9; to suspend the thread.  The
 	    sleep can be timed out.  After &man.tsleep.9; returns (the
 	    thread was woken up or it timed out) the
 	    <literal>waiting_proc</literal> structure is removed from
 	    the list and is destroyed.  All this is done in the
 	    <function>futex_sleep</function> function.  If we were
 	    woken up by <function>futex_wake</function> with
 	    <varname>wp_new_futex</varname> set, we go back to sleep
 	    on that futex.  This way the actual requeueing is done in
 	    this function.</para>
 	</sect4>
 
 	<sect4 xml:id="futex-wake-2">
 	  <title>futex_wake</title>
 
 	  <para>Waking up a thread sleeping on a futex is performed in
 	    the <function>futex_wake</function> function.  First in
 	    this function we mimic the strange &linux; behavior, where
 	    it wakes up N threads for all operations, the only
 	    exception is that the REQUEUE operations are performed on
 	    N+1 threads.  But this usually does not make any
 	    difference as we are waking up all threads.  Next in the
 	    function in the loop we wake up n threads, after this we
 	    check if there is a new futex for requeueing.  If so, we
 	    requeue up to n2 threads on the new futex.  This
 	    cooperates with <function>futex_sleep</function>.</para>
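 
 	  <para>The wake-then-requeue order can be modelled with two
 	    queues.  The sketch below is a simplification under
 	    stated assumptions: it moves waiters between queues
 	    directly, whereas the emulation described here marks the
 	    waiter with <varname>wp_new_futex</varname> and lets
 	    <function>futex_sleep</function> do the move.  The names
 	    <literal>waiter</literal>, <literal>waitq</literal> and
 	    <function>wake_and_requeue</function> are hypothetical,
 	    and no locking is shown.</para>
 
 	  <programlisting><![CDATA[
```c
#include <assert.h>
#include <stddef.h>
#include <sys/queue.h>

/* Hypothetical stand-in for a sleeping thread. */
struct waiter {
	int woken;			/* set when the waiter is released */
	TAILQ_ENTRY(waiter) link;
};
TAILQ_HEAD(waitq, waiter);

/*
 * Wake up to n waiters from src, then move up to n2 of the
 * remaining waiters to dst (dst may be NULL for a plain wake).
 * Returns the number of waiters woken.
 */
int
wake_and_requeue(struct waitq *src, int n, struct waitq *dst, int n2)
{
	struct waiter *w;
	int woken = 0;

	assert(src != NULL);

	/* First pass: release the oldest waiters. */
	while (woken < n && (w = src->tqh_first) != NULL) {
		TAILQ_REMOVE(src, w, link);
		w->woken = 1;
		woken++;
	}

	/* Second pass: requeue, keeping the original order. */
	while (dst != NULL && n2-- > 0 && (w = src->tqh_first) != NULL) {
		TAILQ_REMOVE(src, w, link);
		TAILQ_INSERT_TAIL(dst, w, link);
	}
	return (woken);
}
```
]]></programlisting>
 
 	  <para>Requeued waiters stay asleep; only their queue
 	    changes, which is why a requeue is cheaper than waking
 	    every thread and letting them contend again.</para>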
 	</sect4>
 
 	<sect4 xml:id="futex-wake-op-2">
 	  <title>futex_wake_op</title>
 
 	  <para>The <literal>FUTEX_WAKE_OP</literal> operation is
 	    quite complicated.  First we obtain two futexes at
 	    addresses <varname>uaddr</varname> and
 	    <varname>uaddr2</varname>, then we perform the atomic
 	    operation using <varname>val3</varname> and
 	    <varname>uaddr2</varname>.  Then <varname>val</varname>
 	    waiters on the first futex are woken up, and if the atomic
 	    operation condition holds we wake up
 	    <varname>val2</varname> (i.e., <varname>timeout</varname>)
 	    waiters on the second futex.</para>
 	</sect4>
 
 	<sect4 xml:id="futex-atomic-op">
 	  <title>futex atomic operation</title>
 
 	  <para>The atomic operation takes two parameters
 	    <varname>encoded_op</varname> and
 	    <varname>uaddr</varname>.  The encoded operation encodes
 	    the operation itself, comparing value, operation argument,
 	    and comparing argument.  The pseudocode for the operation
 	    is like this one:</para>
 
 	  <programlisting>oldval = *uaddr2
 *uaddr2 = oldval OP oparg</programlisting>
 
 	  <para>And this is done atomically.  First the number at
 	    <varname>uaddr</varname> is copied in and the operation
 	    is performed.  The code handles page faults, and if no
 	    page fault occurs <varname>oldval</varname> is compared
 	    to the <varname>cmparg</varname> argument using the cmp
 	    comparator.</para>
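 
 	  <para>A non-atomic userland sketch of the decode, operate
 	    and compare steps may make this clearer.  The bit layout
 	    and the constant values below follow the &linux; futex
 	    header as best recalled (note that &linux; names the
 	    fourth operation <literal>ANDN</literal>, an AND with the
 	    complemented argument); treat them as assumptions and
 	    check the header before relying on them.  The function
 	    name <function>futex_op_sketch</function> is
 	    hypothetical, and the real code must perform the
 	    read-modify-write atomically and cope with page
 	    faults.</para>
 
 	  <programlisting><![CDATA[
```c
#include <assert.h>
#include <stddef.h>
#include <stdint.h>

/* Illustrative constants; values assumed to mirror the Linux encoding. */
enum { OP_SET, OP_ADD, OP_OR, OP_ANDN, OP_XOR };
enum { CMP_EQ, CMP_NE, CMP_LT, CMP_LE, CMP_GT, CMP_GE };

/* Sign-extend the low 12 bits of v. */
static int
sext12(uint32_t v)
{
	v &= 0xfff;
	return ((v & 0x800) ? (int)v - 0x1000 : (int)v);
}

/*
 * Non-atomic sketch: apply the encoded operation to *uaddr2 and
 * report whether the OLD value satisfies the encoded comparison.
 * Returns 1 or 0, or -1 (leaving *uaddr2 untouched) for an
 * unknown op or cmp.  The "oparg is a shift count" flag in the
 * top bit is ignored here.
 */
int
futex_op_sketch(uint32_t encoded_op, int *uaddr2)
{
	int op = (encoded_op >> 28) & 0x7;
	int cmp = (encoded_op >> 24) & 0xf;
	int oparg = sext12(encoded_op >> 12);
	int cmparg = sext12(encoded_op);
	int oldval, newval, result;

	assert(uaddr2 != NULL);
	oldval = *uaddr2;

	switch (op) {
	case OP_SET:	newval = oparg; break;
	case OP_ADD:	newval = oldval + oparg; break;
	case OP_OR:	newval = oldval | oparg; break;
	case OP_ANDN:	newval = oldval & ~oparg; break;
	case OP_XOR:	newval = oldval ^ oparg; break;
	default:	return (-1);
	}

	switch (cmp) {
	case CMP_EQ:	result = (oldval == cmparg); break;
	case CMP_NE:	result = (oldval != cmparg); break;
	case CMP_LT:	result = (oldval < cmparg); break;
	case CMP_LE:	result = (oldval <= cmparg); break;
	case CMP_GT:	result = (oldval > cmparg); break;
	case CMP_GE:	result = (oldval >= cmparg); break;
	default:	return (-1);
	}

	*uaddr2 = newval;
	return (result);
}
```
]]></programlisting>
 
 	  <para>The comparison result is what decides whether the
 	    waiters on the second futex are woken up in
 	    <literal>FUTEX_WAKE_OP</literal>.</para>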
 	</sect4>
 
 	<sect4 xml:id="futex-locking">
 	  <title>Futex locking</title>
 
 	  <para>The futex implementation uses two locks: an
 	    <function>sx_lock</function> protecting the lists and a
 	    global lock (either Giant or another
 	    <function>sx_lock</function>).  Every operation is
 	    performed locked from the start to the very end.</para>
 	</sect4>
       </sect3>
     </sect2>
 
     <sect2 xml:id="syscall-impl">
       <title>Various syscalls implementation</title>
 
       <para>In this section I am going to describe some smaller
 	syscalls that are worth mentioning because their
 	implementation is not obvious or those syscalls are
 	interesting from another point of view.</para>
 
       <sect3 xml:id="syscall-at">
 	<title>*at family of syscalls</title>
 
 	<para>During development of the &linux; 2.6.16 kernel, the
 	  *at syscalls were added.  Those syscalls
 	  (<function>openat</function> for example) work exactly like
 	  their at-less counterparts with the slight exception of the
 	  <varname>dirfd</varname> parameter.  This parameter changes
 	  the directory relative to which a relative pathname is
 	  resolved.  When the <varname>filename</varname> parameter
 	  is absolute, <varname>dirfd</varname> is ignored, but when
 	  the path to the file is relative, it comes into play.  The
 	  <varname>dirfd</varname> parameter is a file descriptor of
 	  some directory or <literal>AT_FDCWD</literal>.  So for
 	  example the <function>openat</function> syscall can be like
 	  this:</para>
 
 	<programlisting>file descriptor 123 = /tmp/foo/, current working directory = /tmp/
 
 openat(123, "/tmp/bah", flags, mode)	/* opens /tmp/bah */
 openat(123, "bah", flags, mode)		/* opens /tmp/foo/bah */
 openat(AT_FDCWD, "bah", flags, mode)	/* opens /tmp/bah */
 openat(stdio, "bah", flags, mode)	/* returns error because stdio is not a directory */</programlisting>
 
 	<para>This infrastructure is necessary to avoid races when
 	  opening files outside the working directory.  Imagine that a
 	  process consists of two threads, thread&nbsp;A and
 	  thread&nbsp;B.  Thread&nbsp;A issues
 	  <literal>open("/tmp/foo/bah", flags, mode)</literal> and
 	  before returning it gets preempted and thread&nbsp;B runs.
 	  Thread&nbsp;B does not care about the needs of thread&nbsp;A
 	  and renames or removes <filename>/tmp/foo/</filename>.  We
 	  got a race.  To avoid this we can open
 	  <filename>/tmp/foo</filename> and use it as
 	  <varname>dirfd</varname> for <function>openat</function>
 	  syscall.  This also enables the user to implement
 	  per-thread working directories.</para>
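 
 	<para>From userland, the pattern of opening the directory
 	  once and then resolving names relative to it looks roughly
 	  like the following.  This is a small sketch using the
 	  POSIX <function>open</function> and
 	  <function>openat</function> calls; the helper name
 	  <function>create_in_dir</function> is made up for the
 	  example.</para>
 
 	<programlisting><![CDATA[
```c
#define _POSIX_C_SOURCE 200809L	/* for openat() and O_DIRECTORY */

#include <assert.h>
#include <fcntl.h>
#include <stddef.h>
#include <unistd.h>

/*
 * Create (or truncate) "name" inside the directory at dirpath.
 * The name is resolved relative to a directory descriptor rather
 * than by walking the full path again.  Returns the new file
 * descriptor, or -1 on error.
 */
int
create_in_dir(const char *dirpath, const char *name)
{
	int dirfd, fd;

	assert(dirpath != NULL && name != NULL);

	dirfd = open(dirpath, O_RDONLY | O_DIRECTORY);
	if (dirfd == -1)
		return (-1);

	/* A later rename of dirpath does not affect this lookup. */
	fd = openat(dirfd, name, O_WRONLY | O_CREAT | O_TRUNC, 0600);
	close(dirfd);
	return (fd);
}
```
]]></programlisting>
 
 	<para>Once <varname>dirfd</varname> is open, another thread
 	  renaming the directory no longer changes which directory
 	  the relative name is looked up in.</para>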
 
 	<para>The &linux; family of *at syscalls contains:
 	  <function>linux_openat</function>,
 	  <function>linux_mkdirat</function>,
 	  <function>linux_mknodat</function>,
 	  <function>linux_fchownat</function>,
 	  <function>linux_futimesat</function>,
 	  <function>linux_fstatat64</function>,
 	  <function>linux_unlinkat</function>,
 	  <function>linux_renameat</function>,
 	  <function>linux_linkat</function>,
 	  <function>linux_symlinkat</function>,
 	  <function>linux_readlinkat</function>,
 	  <function>linux_fchmodat</function> and
 	  <function>linux_faccessat</function>.  All these are
 	  implemented using the modified &man.namei.9; routine and a
 	  simple wrapping layer.</para>
 
 	<sect4 xml:id="implementation">
 	  <title>Implementation</title>
 
 	  <para>The implementation is done by altering the
 	    &man.namei.9; routine (described above) to take an additional
 	    parameter <varname>dirfd</varname> in its
 	    <literal>nameidata</literal> structure, which specifies
 	    the starting point of the pathname lookup instead of using
 	    the current working directory every time.  The resolution
 	    of <varname>dirfd</varname> from file descriptor number to
 	    a vnode is done in native *at syscalls.  When
 	    <varname>dirfd</varname> is <literal>AT_FDCWD</literal>
 	    the <varname>dvp</varname> entry in
 	    <literal>nameidata</literal> structure is
 	    <literal>NULL</literal> but when <varname>dirfd</varname>
 	    is a different number we obtain a file for this file
 	    descriptor, check whether this file is valid and, if
 	    there is a vnode attached to it, get that vnode.  Then we
 	    check that this vnode is a directory.  In the actual
 	    &man.namei.9; routine we simply substitute the
 	    <varname>dvp</varname> vnode for <varname>dp</varname>
 	    variable in the &man.namei.9; function, which determines
 	    the starting point.  The &man.namei.9; is not used
 	    directly but via a trace of different functions on various
 	    levels.  For example the <function>openat</function> goes
 	    like this:</para>
 
 	  <programlisting>openat() --&gt; kern_openat() --&gt; vn_open() --&gt; namei()</programlisting>
 
 	  <para>For this reason <function>kern_open</function> and
 	    <function>vn_open</function> must be altered to
 	    incorporate the additional <varname>dirfd</varname>
 	    parameter.  No compat layer is created for those because
 	    there are not many users of this and the users can be
 	    easily converted.  This general implementation enables
 	    &os; to implement its own *at syscalls.  This is being
 	    discussed right now.</para>
 	</sect4>
       </sect3>
 
       <sect3 xml:id="ioctl">
 	<title>Ioctl</title>
 
 	<para>The ioctl interface is quite fragile due to its
 	  generality.  We have to bear in mind that devices differ
 	  between &linux; and &os; so some care must be applied to
 	  make ioctl emulation work right.  The ioctl handling is
 	  implemented in <filename>linux_ioctl.c</filename>, where
 	  <function>linux_ioctl</function> function is defined.  This
 	  function simply iterates over sets of ioctl handlers to find
 	  a handler that implements a given command.  The ioctl
 	  syscall has three parameters, the file descriptor, command
 	  and an argument.  The command is a 16-bit number, which in
 	  theory is divided into high 8&nbsp;bits determining class of
 	  the ioctl command and low 8&nbsp;bits, which are the actual
 	  command within the given set.  The emulation takes advantage
 	  of this division.  We implement handlers for each set, like
 	  <function>sound_handler</function> or
 	  <function>disk_handler</function>.  Each handler has a
 	  maximum command and a minimum command defined, which is used
 	  for determining what handler is used.  There are slight
 	  problems with this approach because &linux; does not use the
 	  set division consistently so sometimes ioctls for a
 	  different set are inside a set they should not belong to
 	  (SCSI generic ioctls inside cdrom set, etc.).  &os;
 	  currently does not implement many &linux; ioctls (compared
 	  to NetBSD, for example) but the plan is to port those from
 	  NetBSD.  The trend is to use &linux; ioctls even in the
 	  native &os; drivers because of the easy porting of
 	  applications.</para>
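 
 	<para>The range-based handler lookup described above can be
 	  sketched as follows.  The structure layout, the handler
 	  names and the command ranges are all invented for
 	  illustration and are not taken from
 	  <filename>linux_ioctl.c</filename>.</para>
 
 	<programlisting><![CDATA[
```c
#include <assert.h>
#include <stddef.h>

typedef int ioctl_fn(unsigned int cmd, void *arg);

/* One handler covers a contiguous range of command numbers. */
struct ioctl_handler {
	unsigned int low;	/* minimum command handled */
	unsigned int high;	/* maximum command handled */
	ioctl_fn *func;
};

/* Placeholder handlers; real ones would translate the request. */
static int
disk_sketch(unsigned int cmd, void *arg)
{
	(void)cmd; (void)arg;
	return (1);
}

static int
sound_sketch(unsigned int cmd, void *arg)
{
	(void)cmd; (void)arg;
	return (2);
}

/* Hypothetical ranges: the high byte selects the class. */
static const struct ioctl_handler handlers[] = {
	{ 0x0300, 0x03ff, disk_sketch },
	{ 0x5000, 0x50ff, sound_sketch },
};

/* Return the handler whose range contains cmd, or NULL. */
const struct ioctl_handler *
find_handler(unsigned int cmd)
{
	size_t i;

	for (i = 0; i < sizeof(handlers) / sizeof(handlers[0]); i++) {
		if (cmd >= handlers[i].low && cmd <= handlers[i].high)
			return (&handlers[i]);
	}
	return (NULL);
}

/* Dispatch cmd to its handler; -1 means no handler claims it. */
int
dispatch_ioctl(unsigned int cmd, void *arg)
{
	const struct ioctl_handler *h = find_handler(cmd);

	if (h == NULL)
		return (-1);
	assert(h->func != NULL);
	return (h->func(cmd, arg));
}
```
]]></programlisting>
 
 	<para>A range check rather than an exact match on the class
 	  byte is what lets a handler also claim the stray commands
 	  that &linux; places in the wrong set.</para>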
       </sect3>
 
       <sect3 xml:id="debugging">
 	<title>Debugging</title>
 
 	<para>Every syscall should be debuggable.  For this purpose we
 	  introduce a small infrastructure.  We have the ldebug
 	  facility, which tells whether a given syscall should be
 	  debugged (settable via a sysctl).  For printing we have LMSG
 	  and ARGS macros.  Those are used for altering a printable
 	  string for uniform debugging messages.</para>
       </sect3>
     </sect2>
   </sect1>
 
   <sect1 xml:id="conclusion">
     <title>Conclusion</title>
 
     <sect2 xml:id="results">
       <title>Results</title>
 
       <para>As of April 2007 the &linux; emulation layer is capable of
 	emulating the &linux;&nbsp;2.6.16 kernel quite well.  The
 	remaining problems concern futexes, unfinished *at family of
 	syscalls, problematic signal delivery, missing
 	<function>epoll</function> and <function>inotify</function>
 	and probably some bugs we have not discovered yet.  Despite
 	this we are capable of running basically all the &linux;
 	programs included in &os; Ports&nbsp;Collection with
 	Fedora&nbsp;Core&nbsp;4 at 2.6.16 and there are some
 	rudimentary reports of success with Fedora&nbsp;Core&nbsp;6 at
 	2.6.16.  The Fedora&nbsp;Core&nbsp;6 linux_base was recently
 	committed enabling some further testing of the emulation layer
 	and giving us some more hints where we should put our effort
 	in implementing missing stuff.</para>
 
       <para>We are able to run the most used applications like
 	<package>www/linux-firefox</package>,
 	<package>net-im/skype</package> and some games from the
 	Ports&nbsp;Collection.  Some of the programs exhibit bad
 	behavior under 2.6 emulation but this is currently under
 	investigation and hopefully will be fixed soon.  The only big
 	application that is known not to work is the &linux; &java;
 	Development Kit and this is because of the requirement of
 	<function>epoll</function> facility which is not directly
 	related to the &linux; kernel 2.6.</para>
 
       <para>We hope to enable 2.6.16 emulation by default some time
 	after &os; 7.0 is released at least to expose the 2.6
 	emulation parts for some wider testing.  Once this is done we
 	can switch to Fedora&nbsp;Core&nbsp;6 linux_base, which is the
 	ultimate plan.</para>
     </sect2>
 
     <sect2 xml:id="future-work">
       <title>Future work</title>
 
       <para>Future work should focus on fixing the remaining issues
 	with futexes, implementing the rest of the *at family of
 	syscalls, fixing the signal delivery and possibly implementing
 	the <function>epoll</function> and
 	<function>inotify</function> facilities.</para>
 
       <para>We hope to be able to run the most important programs
 	flawlessly soon, so we will be able to switch to the 2.6
 	emulation by default and make the Fedora&nbsp;Core&nbsp;6 the
 	default linux_base because our currently used
 	Fedora&nbsp;Core&nbsp;4 is not supported any more.</para>
 
       <para>The other possible goal is to share our code with NetBSD
 	and DragonFly BSD.  NetBSD has some support for 2.6 emulation
 	but it is far from finished and not really tested.
 	DragonFly BSD has expressed some interest in porting the 2.6
 	improvements.</para>
 
       <para>Generally, as &linux; develops we would like to keep up
 	with their development, implementing newly added syscalls.
 	Splice comes to mind first.  Some already implemented syscalls
 	are also heavily crippled, for example
 	<function>mremap</function> and others.  Some performance
 	improvements can also be made, finer grained locking and
 	others.</para>
     </sect2>
 
     <sect2 xml:id="team">
       <title>Team</title>
 
       <para>I cooperated on this project with (in alphabetical
 	order):</para>
 
       <itemizedlist>
 	<listitem>
 	  <para>&a.jhb.email;</para>
 	</listitem>
 	<listitem>
 	  <para>&a.kib.email;</para>
 	</listitem>
 	<listitem>
 	  <para>Emmanuel Dreyfus</para>
 	</listitem>
 	<listitem>
 	  <para>Scot Hetzel</para>
 	</listitem>
 	<listitem>
 	  <para>&a.jkim.email;</para>
 	</listitem>
 	<listitem>
 	  <para>&a.netchild.email;</para>
 	</listitem>
 	<listitem>
 	  <para>&a.ssouhlal.email;</para>
 	</listitem>
 	<listitem>
 	  <para>Li Xiao</para>
 	</listitem>
 	<listitem>
 	  <para>&a.davidxu.email;</para>
 	</listitem>
       </itemizedlist>
 
       <para>I would like to thank all those people for their advice,
 	code reviews and general support.</para>
     </sect2>
   </sect1>
 
   <sect1 xml:id="literatures">
     <title>Literature</title>
 
     <orderedlist>
       <listitem>
 	<para>Marshall Kirk McKusick, George V. Neville-Neil.  The
 	  Design and Implementation of the &os; Operating System.
 	  Addison-Wesley, 2005.</para>
       </listitem>
       <listitem>
 	<para><uri
 	    xlink:href="https://tldp.org">https://tldp.org</uri></para>
       </listitem>
       <listitem>
 	<para><uri
 	    xlink:href="https://www.kernel.org">https://www.kernel.org</uri></para>
       </listitem>
     </orderedlist>
   </sect1>
 </article>
diff --git a/en_US.ISO8859-1/articles/serial-uart/article.xml b/en_US.ISO8859-1/articles/serial-uart/article.xml
index e57b052b9e..2ddbfbe2aa 100644
--- a/en_US.ISO8859-1/articles/serial-uart/article.xml
+++ b/en_US.ISO8859-1/articles/serial-uart/article.xml
@@ -1,2433 +1,2433 @@
 <?xml version="1.0" encoding="iso-8859-1"?>
 <!DOCTYPE article PUBLIC "-//FreeBSD//DTD DocBook XML V5.0-Based Extension//EN"
 	"http://www.FreeBSD.org/XML/share/xml/freebsd50.dtd">
 <article xmlns="http://docbook.org/ns/docbook" xmlns:xlink="http://www.w3.org/1999/xlink" version="5.0" xml:lang="en">
   <info><title>Serial and UART Tutorial</title>
     
 
     <authorgroup>
       <author><personname><firstname>Frank</firstname><surname>Durda</surname></personname><affiliation>
           <address><email>uhclem@FreeBSD.org</email></address>
         </affiliation></author>
     </authorgroup>
 
     <legalnotice xml:id="trademarks" role="trademarks">
       &tm-attrib.freebsd;
       &tm-attrib.microsoft;
       &tm-attrib.general;
     </legalnotice>
 
     <pubdate>$FreeBSD$</pubdate>
 
     <releaseinfo>$FreeBSD$</releaseinfo>
 
     <abstract>
       <para>This article talks about using serial hardware with FreeBSD.</para>
     </abstract>
   </info>
 
   <sect1 xml:id="uart">
     <title>The UART: What it is and how it works</title>
 
       <para><emphasis>Copyright &copy; 1996 &a.uhclem.email;, All Rights
         Reserved.  13 January 1996.</emphasis></para>
 
       <para>The Universal Asynchronous Receiver/Transmitter (UART)
 	controller is the key component of the serial communications
 	subsystem of a computer.  The UART takes bytes of data and
 	transmits the individual bits in a sequential fashion.  At the
 	destination, a second UART re-assembles the bits into complete
 	bytes.</para>
 
       <para>Serial transmission is commonly used with modems and for
 	non-networked communication between computers, terminals and
 	other devices.</para>
 
       <para>There are two primary forms of serial transmission:
 	Synchronous and Asynchronous.  Depending on the modes that are
 	supported by the hardware, the name of the communication
 	sub-system will usually include an <literal>A</literal> if it
 	supports Asynchronous communications, and an
 	<literal>S</literal> if it supports Synchronous
 	communications.  Both forms are described below.</para>
 
       <para>Some common acronyms are:</para>
 
       <blockquote>
 	<para>UART Universal Asynchronous Receiver/Transmitter</para>
       </blockquote>
 
       <blockquote>
 	<para>USART Universal Synchronous-Asynchronous
 	  Receiver/Transmitter</para>
       </blockquote>
 
       <sect2>
         <title>Synchronous Serial Transmission</title>
 
 	<para>Synchronous serial transmission requires that the sender
 	  and receiver share a clock with one another, or that the
 	  sender provide a strobe or other timing signal so that the
 	  receiver knows when to <quote>read</quote> the next bit of
 	  the data.  In most forms of serial Synchronous
 	  communication, if there is no data available at a given
 	  instant to transmit, a fill character must be sent instead
 	  so that data is always being transmitted.  Synchronous
 	  communication is usually more efficient because only data
 	  bits are transmitted between sender and receiver, but it
 	  can be more costly if extra wiring and circuits are
 	  required to share a clock signal between the sender and
 	  receiver.</para>
 
 	<para>A form of Synchronous transmission is used with printers
 	  and fixed disk devices in that the data is sent on one set
 	  of wires while a clock or strobe is sent on a different
 	  wire. Printers and fixed disk devices are not normally
 	  serial devices because most fixed disk interface standards
 	  send an entire word of data for each clock or strobe signal
 	  by using a separate wire for each bit of the word.  In the
 	  PC industry, these are known as Parallel devices.</para>
 
 	<para>The standard serial communications hardware in the PC
 	  does not support Synchronous operations.  This mode is
 	  described here for comparison purposes only.</para>
       </sect2>
 
       <sect2>
         <title>Asynchronous Serial Transmission</title>
 
 	<para>Asynchronous transmission allows data to be transmitted
 	  without the sender having to send a clock signal to the
 	  receiver.  Instead, the sender and receiver must agree on
 	  timing parameters in advance and special bits are added to
 	  each word which are used to synchronize the sending and
 	  receiving units.</para>
 
 	<para>When a word is given to the UART for Asynchronous
 	  transmissions, a bit called the "Start Bit" is added to the
 	  beginning of each word that is to be transmitted.  The Start
 	  Bit is used to alert the receiver that a word of data is
 	  about to be sent, and to force the clock in the receiver
 	  into synchronization with the clock in the transmitter.
 	  These two clocks must be accurate enough to not have the
 	  frequency drift by more than 10% during the transmission of
 	  the remaining bits in the word.  (This requirement was set
 	  in the days of mechanical teleprinters and is easily met by
 	  modern electronic equipment.)</para>
 
 	<para>After the Start Bit, the individual bits of the word of
 	  data are sent, with the Least Significant Bit (LSB) being
 	  sent first.  Each bit in the transmission is transmitted for
 	  exactly the same amount of time as all of the other bits,
 	  and the receiver <quote>looks</quote> at the wire at
 	  approximately halfway through the period assigned to each
 	  bit to determine if the bit is a <literal>1</literal> or a
 	  <literal>0</literal>.  For example, if it takes two seconds
 	  to send each bit, the receiver will examine the signal to
 	  determine if it is a <literal>1</literal> or a
 	  <literal>0</literal> after one second has passed, then it
 	  will wait two seconds and then examine the value of the next
 	  bit, and so on.</para>
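 
 	<para>The sampling schedule in this example is simple
 	  arithmetic: every bit is examined halfway through its
 	  period.  A small sketch, with the hypothetical helper name
 	  <function>sample_time</function>:</para>
 
 	<programlisting><![CDATA[
```c
#include <assert.h>

/*
 * Time, measured from the leading edge of the Start Bit, at which
 * the receiver samples bit number n of the frame (n = 0 is the
 * Start Bit itself).  Each bit is sampled halfway through its
 * period.
 */
double
sample_time(double bit_period, int n)
{
	assert(bit_period > 0.0 && n >= 0);
	return (bit_period * (n + 0.5));
}
```
]]></programlisting>
 
 	<para>With the two-second bit period used above,
 	  <literal>sample_time(2.0, 0)</literal> is one second and
 	  <literal>sample_time(2.0, 1)</literal> is three seconds,
 	  that is, two seconds later.</para>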
 
 	<para>The sender does not know when the receiver has
 	  <quote>looked</quote> at the value of the bit.  The sender
 	  only knows when the clock says to begin transmitting the
 	  next bit of the word.</para>
 
 	<para>When the entire data word has been sent, the transmitter
 	  may add a Parity Bit that the transmitter generates.  The
 	  Parity Bit may be used by the receiver to perform simple
 	  error checking.  Then at least one Stop Bit is sent by the
 	  transmitter.</para>
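 
 	<para>The order of bits on the wire can be illustrated with a
 	  short routine that lays out one frame.  This is only a
 	  sketch of the framing rules described above, not of any
 	  particular UART; the names <function>uart_frame</function>
 	  and <literal>enum parity</literal> are invented for the
 	  example, exactly one Stop Bit is assumed, and the usual
 	  meaning of even and odd parity (the Parity Bit makes the
 	  count of <literal>1</literal> bits even or odd) is
 	  used.</para>
 
 	<programlisting><![CDATA[
```c
#include <assert.h>
#include <stddef.h>
#include <stdint.h>

enum parity { PARITY_NONE, PARITY_EVEN, PARITY_ODD };

/*
 * Write the line states for one asynchronous frame into bits[]
 * (0 = Space, 1 = Mark) and return how many were written:
 * Start Bit, data bits LSB first, optional Parity Bit, one Stop
 * Bit.  bits[] must have room for data_bits + 3 entries.
 */
size_t
uart_frame(uint8_t word, int data_bits, enum parity par, uint8_t *bits)
{
	size_t n = 0;
	int i, ones = 0;

	assert(data_bits >= 5 && data_bits <= 8);
	assert(bits != NULL);

	bits[n++] = 0;				/* Start Bit */
	for (i = 0; i < data_bits; i++) {
		uint8_t b = (word >> i) & 1;	/* LSB first */
		ones += b;
		bits[n++] = b;
	}
	if (par == PARITY_EVEN)
		bits[n++] = ones & 1;		/* total of 1s becomes even */
	else if (par == PARITY_ODD)
		bits[n++] = !(ones & 1);	/* total of 1s becomes odd */
	bits[n++] = 1;				/* Stop Bit */
	return (n);
}
```
]]></programlisting>
 
 	<para>For an 8-bit word with parity the frame is eleven bit
 	  times long, so only eight of every eleven bit times carry
 	  data.</para>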
 
 	<para>When the receiver has received all of the bits in the
 	  data word, it may check for the Parity Bits (both sender and
 	  receiver must agree on whether a Parity Bit is to be used),
 	  and then the receiver looks for a Stop Bit.  If the Stop Bit
 	  does not appear when it is supposed to, the UART considers
 	  the entire word to be garbled and will report a Framing
 	  Error to the host processor when the data word is read.  The
 	  usual cause of a Framing Error is that the sender and
 	  receiver clocks were not running at the same speed, or that
 	  the signal was interrupted.</para>
 
 	<para>Regardless of whether the data was received correctly or
 	  not, the UART automatically discards the Start, Parity and
 	  Stop bits.  If the sender and receiver are configured
 	  identically, these bits are not passed to the host.</para>
 
 	<para>If another word is ready for transmission, the Start Bit
 	  for the new word can be sent as soon as the Stop Bit for the
 	  previous word has been sent.</para>
 
-	<para>Because asynchronous data is <quote>self
+	<para>As asynchronous data is <quote>self
 	  synchronizing</quote>, if there is no data to transmit, the
 	  transmission line can be idle.</para>
       </sect2>
 
       <sect2>
         <title>Other UART Functions</title>
 
 	<para>In addition to the basic job of converting data from
 	  parallel to serial for transmission and from serial to
 	  parallel on reception, a UART will usually provide
 	  additional circuits for signals that can be used to indicate
 	  the state of the transmission media, and to regulate the
 	  flow of data in the event that the remote device is not
 	  prepared to accept more data.  For example, when the device
 	  connected to the UART is a modem, the modem may report the
 	  presence of a carrier on the phone line while the computer
 	  may be able to instruct the modem to reset itself or to not
 	  take calls by raising or lowering one or more of these
 	  extra signals. The function of each of these additional
 	  signals is defined in the EIA RS232-C standard.</para>
       </sect2>
 
       <sect2>
         <title>The RS232-C and V.24 Standards</title>
 
 	<para>In most computer systems, the UART is connected to
 	  circuitry that generates signals that comply with the EIA
 	  RS232-C specification.  There is also a CCITT standard named
 	  V.24 that mirrors the specifications included in
 	  RS232-C.</para>
 
 	<sect3>
 	  <title>RS232-C Bit Assignments (Marks and Spaces)</title>
 
 	  <para>In RS232-C, a value of <literal>1</literal> is called
 	    a <literal>Mark</literal> and a value of
 	    <literal>0</literal> is called a <literal>Space</literal>.
 	    When a communication line is idle, the line is said to be
 	    <quote>Marking</quote>, or transmitting continuous
 	    <literal>1</literal> values.</para>
 
 	  <para>The Start bit always has a value of
 	    <literal>0</literal> (a Space).  The Stop Bit always has a
 	    value of <literal>1</literal> (a Mark).  This means that
 	    there will always be a Mark (1) to Space (0) transition on
 	    the line at the start of every word, even when multiple
 	    words are transmitted back to back.  This guarantees that
 	    sender and receiver can resynchronize their clocks
 	    regardless of the content of the data bits that are being
 	    transmitted.</para>
 
 	  <para>The idle time between Stop and Start bits does not
 	    have to be an exact multiple (including zero) of the bit
 	    rate of the communication link, but most UARTs are
 	    designed this way for simplicity.</para>
 
 	  <para>In RS232-C, the "Marking" signal (a
 	    <literal>1</literal>) is represented by a voltage between
 	    -2 VDC and -12 VDC, and a "Spacing" signal (a
 	    <literal>0</literal>) is represented by a voltage between
 	    0 and +12 VDC.  The transmitter is supposed to send +12
 	    VDC or -12 VDC, and the receiver is supposed to allow for
 	    some voltage loss in long cables.  Some transmitters in
 	    low power devices (like portable computers) sometimes use
 	    only +5 VDC and -5 VDC, but these values are still
 	    acceptable to an RS232-C receiver, provided that the cable
 	    lengths are short.</para>
 	</sect3>
 
 	<sect3>
 	  <title>RS232-C Break Signal</title>
 
 	  <para>RS232-C also specifies a signal called a
 	    <literal>Break</literal>, which is caused by sending
 	    continuous Spacing values (no Start or Stop bits).  When
 	    there is no electricity present on the data circuit, the
 	    line is considered to be sending
 	    <literal>Break</literal>.</para>
 
 	  <para>The <literal>Break</literal> signal must be of a
 	    duration longer than the time it takes to send a complete
 	    byte plus Start, Stop and Parity bits.  Most UARTs can
 	    distinguish between a Framing Error and a Break, but if
 	    the UART cannot do this, the Framing Error detection can
 	    be used to identify Breaks.</para>
 
 	  <para>In the days of teleprinters, when numerous printers
 	    around the country were wired in series (such as news
 	    services), any unit could cause a <literal>Break</literal>
 	    by temporarily opening the entire circuit so that no
 	    current flowed.  This was used to allow a location with
 	    urgent news to interrupt some other location that was
 	    currently sending information.</para>
 
 	  <para>In modern systems there are two types of Break
 	    signals. If the Break is longer than 1.6 seconds, it is
 	    considered a "Modem Break", and some modems can be
 	    programmed to terminate the conversation and go on-hook or
 	    enter the modem's command mode when the modem detects this
 	    signal.  If the Break is shorter than 1.6 seconds, it
 	    signifies a Data Break and it is up to the remote computer
 	    to respond to this signal.  Sometimes this form of Break
 	    is used as an Attention or Interrupt signal and sometimes
 	    is accepted as a substitute for the ASCII CONTROL-C
 	    character.</para>
 
 	  <para>Marks and Spaces are also equivalent to
 	    <quote>Holes</quote> and <quote>No Holes</quote> in paper
 	    tape systems.</para>
 
 	  <note>
 	    <para>Breaks cannot be generated from paper tape or from
 	      any other byte value, since bytes are always sent with
 	      Start and Stop bits.  The UART is usually capable of
 	      generating the continuous Spacing signal in response to
 	      a special command from the host processor.</para>
 	  </note>
 	</sect3>
 
 	<sect3>
 	  <title>RS232-C DTE and DCE Devices</title>
 
 	  <para>The RS232-C specification defines two types of
 	    equipment: the Data Terminal Equipment (DTE) and the Data
 	    Communications Equipment (DCE).  Usually, the DTE device is the
 	    terminal (or computer), and the DCE is a modem.  Across
 	    the phone line at the other end of a conversation, the
 	    receiving modem is also a DCE device and the computer that
 	    is connected to that modem is a DTE device.  The DCE
 	    device receives signals on the pins that the DTE device
 	    transmits on, and vice versa.</para>
 
 	  <para>When two devices that are both DTE or both DCE must be
 	    connected together without a modem or a similar media
 	    translator between them, a NULL modem must be used.  The
 	    NULL modem electrically re-arranges the cabling so that
 	    the transmitter output is connected to the receiver input
 	    on the other device, and vice versa.  Similar translations
 	    are performed on all of the control signals so that each
 	    device will see what it thinks are DCE (or DTE) signals
 	    from the other device.</para>
 
 	  <para>The number of signals generated by the DTE and DCE
 	    devices is not symmetrical.  The DTE device generates
 	    fewer signals for the DCE device than the DTE device
 	    receives from the DCE.</para>
 	</sect3>
 
 	<sect3>
 	  <title>RS232-C Pin Assignments</title>
 
 	  <para>The EIA RS232-C specification (and the ITU equivalent,
 	    V.24) calls for a twenty-five pin connector (usually a
 	    DB25) and defines the purpose of most of the pins in that
 	    connector.</para>
 
 	  <para>In the IBM Personal Computer and similar systems, a
 	    subset of RS232-C signals is provided via nine-pin
 	    connectors (DB9).  The signals that are not included on
 	    the PC connector deal mainly with synchronous operation,
 	    and this transmission mode is not supported by the UART
 	    that IBM selected for use in the IBM PC.</para>
 
 	  <para>Depending on the computer manufacturer, a DB25, a DB9,
 	    or both types of connector may be used for RS232-C
 	    communications.  (The IBM PC also uses a DB25 connector
 	    for the parallel printer interface, which causes some
 	    confusion.)</para>
 
 	  <para>Below is a table of the RS232-C signal assignments in
 	    the DB25 and DB9 connectors.</para>
 
 	  <informaltable frame="none" pgwide="1">
 	    <tgroup cols="7">
 	      <thead>
 	        <row>
 		  <entry>DB25 RS232-C Pin</entry> <entry>DB9 IBM PC
 		  Pin</entry> <entry>EIA Circuit Symbol</entry>
 		  <entry>CCITT Circuit Symbol</entry> <entry>Common
 		  Name</entry> <entry>Signal Source</entry>
 		  <entry>Description</entry>
 		</row>
 	      </thead>
 
 	      <tbody>
 		<row>
 		  <entry>1</entry>
 		  <entry>-</entry>
 		  <entry>AA</entry>
 		  <entry>101</entry>
 		  <entry>PG/FG</entry>
 		  <entry>-</entry>
 		  <entry>Frame/Protective Ground</entry>
 		</row>
 
 		<row>
 		  <entry>2</entry>
 		  <entry>3</entry>
 		  <entry>BA</entry>
 		  <entry>103</entry>
 		  <entry>TD</entry>
 		  <entry>DTE</entry>
 		  <entry>Transmit Data</entry>
 		</row>
 
 		<row>
 		  <entry>3</entry>
 		  <entry>2</entry>
 		  <entry>BB</entry>
 		  <entry>104</entry>
 		  <entry>RD</entry>
 		  <entry>DCE</entry>
 		  <entry>Receive Data</entry>
 		</row>
 
 		<row>
 		  <entry>4</entry>
 		  <entry>7</entry>
 		  <entry>CA</entry>
 		  <entry>105</entry>
 		  <entry>RTS</entry>
 		  <entry>DTE</entry>
 		  <entry>Request to Send</entry>
 		</row>
 
 		<row>
 		  <entry>5</entry>
 		  <entry>8</entry>
 		  <entry>CB</entry>
 		  <entry>106</entry>
 		  <entry>CTS</entry>
 		  <entry>DCE</entry>
 		  <entry>Clear to Send</entry>
 		</row>
 
 		<row>
 		  <entry>6</entry>
 		  <entry>6</entry>
 		  <entry>CC</entry>
 		  <entry>107</entry>
 		  <entry>DSR</entry>
 		  <entry>DCE</entry>
 		  <entry>Data Set Ready</entry>
 		</row>
 
 		<row>
 		  <entry>7</entry>
 		  <entry>5</entry>
 		  <entry>AB</entry>
 		  <entry>102</entry>
 		  <entry>SG/GND</entry>
 		  <entry>-</entry>
 		  <entry>Signal Ground</entry>
 		</row>
 
 		<row>
 		  <entry>8</entry>
 		  <entry>1</entry>
 		  <entry>CF</entry>
 		  <entry>109</entry>
 		  <entry>DCD/CD</entry>
 		  <entry>DCE</entry>
 		  <entry>Data Carrier Detect</entry>
 		</row>
 
 		<row>
 		  <entry>9</entry>
 		  <entry>-</entry>
 		  <entry>-</entry>
 		  <entry>-</entry>
 		  <entry>-</entry>
 		  <entry>-</entry>
 		  <entry>Reserved for Test</entry>
 		</row>
 
 		<row>
 		  <entry>10</entry>
 		  <entry>-</entry>
 		  <entry>-</entry>
 		  <entry>-</entry>
 		  <entry>-</entry>
 		  <entry>-</entry>
 		  <entry>Reserved for Test</entry>
 		</row>
 
 		<row>
 		  <entry>11</entry>
 		  <entry>-</entry>
 		  <entry>-</entry>
 		  <entry>-</entry>
 		  <entry>-</entry>
 		  <entry>-</entry>
 		  <entry>Reserved for Test</entry>
 		</row>
 
 		<row>
 		  <entry>12</entry>
 		  <entry>-</entry>
 		  <entry>SCF</entry>
 		  <entry>122</entry>
 		  <entry>SRLSD</entry>
 		  <entry>DCE</entry>
 		  <entry>Sec. Recv. Line Signal Detector</entry>
 		</row>
 
 		<row>
 		  <entry>13</entry>
 		  <entry>-</entry>
 		  <entry>SCB</entry>
 		  <entry>121</entry>
 		  <entry>SCTS</entry>
 		  <entry>DCE</entry>
 		  <entry>Secondary Clear to Send</entry>
 		</row>
 
 		<row>
 		  <entry>14</entry>
 		  <entry>-</entry>
 		  <entry>SBA</entry>
 		  <entry>118</entry>
 		  <entry>STD</entry>
 		  <entry>DTE</entry>
 		  <entry>Secondary Transmit Data</entry>
 		</row>
 
 		<row>
 		  <entry>15</entry>
 		  <entry>-</entry>
 		  <entry>DB</entry>
 		  <entry>114</entry>
 		  <entry>TSET</entry>
 		  <entry>DCE</entry>
 		  <entry>Trans. Sig. Element Timing</entry>
 		</row>
 
 		<row>
 		  <entry>16</entry>
 		  <entry>-</entry>
 		  <entry>SBB</entry>
 		  <entry>119</entry>
 		  <entry>SRD</entry>
 		  <entry>DCE</entry>
 		  <entry>Secondary Received Data</entry>
 		</row>
 
 		<row>
 		  <entry>17</entry>
 		  <entry>-</entry>
 		  <entry>DD</entry>
 		  <entry>115</entry>
 		  <entry>RSET</entry>
 		  <entry>DCE</entry>
 		  <entry>Receiver Signal Element Timing</entry>
 		</row>
 
 		<row>
 		  <entry>18</entry>
 		  <entry>-</entry>
 		  <entry>-</entry>
 		  <entry>141</entry>
 		  <entry>LOOP</entry>
 		  <entry>DTE</entry>
 		  <entry>Local Loopback</entry>
 		</row>
 
 		<row>
 		  <entry>19</entry>
 		  <entry>-</entry>
 		  <entry>SCA</entry>
 		  <entry>120</entry>
 		  <entry>SRS</entry>
 		  <entry>DTE</entry>
 		  <entry>Secondary Request to Send</entry>
 		</row>
 
 		<row>
 		  <entry>20</entry>
 		  <entry>4</entry>
 		  <entry>CD</entry>
 		  <entry>108.2</entry>
 		  <entry>DTR</entry>
 		  <entry>DTE</entry>
 		  <entry>Data Terminal Ready</entry>
 		</row>
 
 		<row>
 		  <entry>21</entry>
 		  <entry>-</entry>
 		  <entry>-</entry>
 		  <entry>-</entry>
 		  <entry>RDL</entry>
 		  <entry>DTE</entry>
 		  <entry>Remote Digital Loopback</entry>
 		</row>
 
 		<row>
 		  <entry>22</entry>
 		  <entry>9</entry>
 		  <entry>CE</entry>
 		  <entry>125</entry>
 		  <entry>RI</entry>
 		  <entry>DCE</entry>
 		  <entry>Ring Indicator</entry>
 		</row>
 
 		<row>
 		  <entry>23</entry>
 		  <entry>-</entry>
 		  <entry>CH</entry>
 		  <entry>111</entry>
 		  <entry>DSRS</entry>
 		  <entry>DTE</entry>
 		  <entry>Data Signal Rate Selector</entry>
 		</row>
 
 		<row>
 		  <entry>24</entry>
 		  <entry>-</entry>
 		  <entry>DA</entry>
 		  <entry>113</entry>
 		  <entry>TSET</entry>
 		  <entry>DTE</entry>
 		  <entry>Trans. Sig. Element Timing</entry>
 		</row>
 
 		<row>
 		  <entry>25</entry>
 		  <entry>-</entry>
 		  <entry>-</entry>
 		  <entry>142</entry>
 		  <entry>-</entry>
 		  <entry>DCE</entry>
 		  <entry>Test Mode</entry>
 		</row>
 	      </tbody>
 	    </tgroup>
 	  </informaltable>
 	</sect3>
       </sect2>
 
       <sect2>
 	<title>Bits, Baud and Symbols</title>
 
 	<para>Baud is a measurement of transmission speed in
-	  asynchronous communication.  Because of advances in modem
+	  asynchronous communication.  Due to advances in modem
 	  communication technology, this term is frequently misused
 	  when describing the data rates in newer devices.</para>
 
 	<para>Traditionally, a Baud Rate represents the number of bits
 	  that are actually being sent over the media, not the amount
 	  of data that is actually moved from one DTE device to the
 	  other. The Baud count includes the overhead bits Start, Stop
 	  and Parity that are generated by the sending UART and
 	  removed by the receiving UART.  This means that seven-bit
 	  words of data actually take 10 bits to be completely
 	  transmitted when Parity and one Start and one Stop bit are
 	  used.  Therefore, a modem capable of moving 300 bits per
 	  second from one place to another can normally only move 30
 	  such 7-bit words per second.</para>
 
 	<para>If 8-bit data words are used and Parity bits are also
 	  used, the data rate falls to 27.27 words per second, because
 	  it now takes 11 bits to send the eight-bit words, and the
 	  modem still only sends 300 bits per second.</para>
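 
 	<para>The arithmetic above can be sketched in a few lines of
 	  Python.  This is only an illustration: the function name
 	  <literal>words_per_second</literal> and its parameters are
 	  assumptions made for this example and do not come from any
 	  existing tool.</para>

```python
def words_per_second(line_bps, data_bits, parity=True, start_bits=1, stop_bits=1):
    """Return how many data words per second fit on an asynchronous link."""
    # Every word carries its framing overhead: Start, optional Parity, Stop.
    bits_per_word = start_bits + data_bits + (1 if parity else 0) + stop_bits
    return line_bps / bits_per_word


if __name__ == "__main__":
    print(words_per_second(300, 7))            # 7 data bits plus parity: 10 bits per word
    print(round(words_per_second(300, 8), 2))  # 8 data bits plus parity: 11 bits per word
```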
 
 	<para>The formula for converting bytes per second into a baud
 	  rate and vice versa was simple until error-correcting modems
 	  came along.  These modems receive the serial stream of bits
 	  from the UART in the host computer (even when internal
 	  modems are used the data is still frequently serialized) and
 	  convert the bits back into bytes.  These bytes are then
 	  combined into packets and sent over the phone line using a
 	  Synchronous transmission method.  This means that the Stop,
 	  Start, and Parity bits added by the UART in the DTE (the
 	  computer) are removed by the sending modem before
 	  transmission.  When these bytes are received by the
 	  remote modem, the remote modem adds Start, Stop and Parity
 	  bits to the words, converts them to a serial format and then
 	  sends them to the receiving UART in the remote computer, which
 	  then strips the Start, Stop and Parity bits.</para>
 
 	<para>The reason all these extra conversions are done is so
 	  that the two modems can perform error correction, which
 	  means that the receiving modem is able to ask the sending
 	  modem to resend a block of data that was not received with
 	  the correct checksum.  This checking is handled by the
 	  modems, and the DTE devices are usually unaware that the
 	  process is occurring.</para>
 
 	<para>By stripping the Start, Stop and Parity bits, the
 	  additional bits of data that the two modems must share
 	  between themselves to perform error-correction are mostly
 	  concealed from the effective transmission rate seen by the
 	  sending and receiving DTE equipment.  For example, if a
 	  modem sends ten 7-bit words to another modem without
 	  including the Start, Stop and Parity bits, the sending modem
 	  will be able to add 30 bits of its own information that the
 	  receiving modem can use to do error-correction without
 	  impacting the transmission speed of the real data.</para>
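 
 	<para>The bit budget in this example can be checked with a
 	  short Python sketch.  The function name
 	  <literal>framing_bits_freed</literal> is hypothetical, and
 	  the sketch assumes one Start, one Parity, and one Stop bit
 	  per word, as in the example above.</para>

```python
def framing_bits_freed(words, start_bits=1, parity_bits=1, stop_bits=1):
    """Bits no longer sent between the modems once framing is stripped."""
    return words * (start_bits + parity_bits + stop_bits)


if __name__ == "__main__":
    words, data_bits = 10, 7
    freed = framing_bits_freed(words)
    framed_total = words * data_bits + freed
    # The modem can spend the freed bits on its own error-correction data
    # while staying within the original framed bit count.
    print(freed, framed_total)
```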
 
 	<para>The use of the term Baud is further confused by modems
 	  that perform compression.  A single 8-bit word passed over
 	  the telephone line might represent a dozen words that were
 	  transmitted to the sending modem.  The receiving modem will
 	  expand the data back to its original content and pass that
 	  data to the receiving DTE.</para>
 
 	<para>Modern modems also include buffers that allow the rate
 	  at which bits move across the phone line (DCE to DCE) to
 	  differ from the rate at which bits move between
 	  the DTE and DCE on both ends of the conversation.  Normally
 	  the speed between the DTE and DCE is higher than the DCE to
 	  DCE speed because of the use of compression by the
 	  modems.</para>
 
-	<para>Because the number of bits needed to describe a byte
+	<para>As the number of bits needed to describe a byte
 	  varies during the trip between the two machines, and
 	  different bits-per-second speeds are used on
 	  the DTE-DCE and DCE-DCE links, the usage of the term Baud to
 	  describe the overall communication speed causes problems and
 	  can misrepresent the true transmission speed.  So Bits Per
 	  Second (bps) is the correct term to use to describe the
 	  transmission rate seen at the DCE to DCE interface and Baud
 	  or Bits Per Second are acceptable terms to use when a
 	  connection is made between two systems with a wired
 	  connection, or if a modem is in use that is not performing
 	  error-correction or compression.</para>
 
 	<para>Modern high speed modems (2400, 9600, 14,400, and
 	  19,200bps) in reality still operate at or below 2400 baud,
 	  or more accurately, 2400 Symbols per second.  High speed
 	  modems are able to encode more bits of data into each Symbol
 	  using a technique called Constellation Stuffing, which is
 	  why the effective bits per second rate of the modem is
 	  higher, but the modem continues to operate within the
 	  limited audio bandwidth that the telephone system provides.
 	  Modems operating at 28,800 and higher speeds have variable
 	  Symbol rates, but the technique is the same.</para>
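 
 	<para>The relationship between the bit rate and the Symbol
 	  rate can be made concrete with a short Python sketch.  The
 	  function name <literal>bits_per_symbol</literal> is
 	  hypothetical, and the figure of 2400 Symbols per second is
 	  simply the upper bound quoted above; a real modem may use a
 	  lower Symbol rate at some of these speeds.</para>

```python
def bits_per_symbol(bits_per_second, symbols_per_second):
    """Average number of data bits carried by each Symbol on the line."""
    return bits_per_second / symbols_per_second


if __name__ == "__main__":
    for bps in (2400, 9600, 14400, 19200):
        # Assumes the line runs at exactly 2400 Symbols per second.
        print(bps, bits_per_symbol(bps, 2400))
```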
       </sect2>
 
       <sect2>
 	<title>The IBM Personal Computer UART</title>
 
 	<para>Starting with the original IBM Personal Computer, IBM
 	  selected the National Semiconductor INS8250 UART for use in
 	  the IBM PC Parallel/Serial Adapter.  Subsequent generations
 	  of compatible computers from IBM and other vendors continued
 	  to use the INS8250 or improved versions of the National
 	  Semiconductor UART family.</para>
 
 	<sect3>
 	  <title>National Semiconductor UART Family Tree</title>
 
 	  <para>There have been several versions and subsequent
 	    generations of the INS8250 UART.  Each major version is
 	    described below.</para>
 
 	    <!-- This should really be a graphic -->
 	  <programlisting>INS8250  -&gt; INS8250B
   \
    \
     \-&gt; INS8250A -&gt; INS82C50A
              \
               \
                \-&gt; NS16450 -&gt; NS16C450
                         \
                          \
                           \-&gt; NS16550 -&gt; NS16550A -&gt; PC16550D</programlisting>
 
 	  <variablelist>
 	    <varlistentry>
 	      <term>INS8250</term>
 
 	      <listitem>
 		<para>This part was used in the original IBM PC and
 		  IBM PC/XT.  The original name for this part was the
 		  INS8250 ACE (Asynchronous Communications Element)
 		  and it is made from NMOS technology.</para>
 
 		<para>The 8250 uses eight I/O ports and has a one-byte
 		  send and a one-byte receive buffer.  This original
 		  UART has several race conditions and other
 		  flaws. The original IBM BIOS includes code to work
 		  around these flaws, but this made the BIOS dependent
 		  on the flaws being present, so subsequent parts like
 		  the 8250A, 16450 or 16550 could not be used in the
 		  original IBM PC or IBM PC/XT.</para>
 	      </listitem>
 	    </varlistentry>
 
 	    <varlistentry>
 	      <term>INS8250-B</term>
 
 	      <listitem>
 		<para>This is a slower-speed version of the INS8250 made
 		  from NMOS technology.  It contains the same problems
 		  as the original INS8250.</para>
 	      </listitem>
 	    </varlistentry>
 
 	    <varlistentry>
 	      <term>INS8250A</term>
 
 	      <listitem>
 		<para>An improved version of the INS8250 using XMOS
 		  technology with various functional flaws
 		  corrected. The INS8250A was used initially in PC
 		  clone computers by vendors who used
-		  <quote>clean</quote> BIOS designs. Because of the
+		  <quote>clean</quote> BIOS designs. Due to the
 		  corrections in the chip, this part could not be used
 		  with a BIOS compatible with the INS8250 or
 		  INS8250B.</para>
 	      </listitem>
 	    </varlistentry>
 
 	    <varlistentry>
 	      <term>INS82C50A</term>
 
 	      <listitem>
 		<para>This is a CMOS version (low power consumption)
 		  of the INS8250A and has similar functional
 		  characteristics.</para>
 	      </listitem>
 	    </varlistentry>
 
 	    <varlistentry>
 	      <term>NS16450</term>
 
 	      <listitem>
 		<para>Same as INS8250A with improvements so it can be
 		  used with faster CPU bus designs.  IBM used this
 		  part in the IBM AT and updated the IBM BIOS to no
 		  longer rely on the bugs in the INS8250.</para>
               </listitem>
 	    </varlistentry>
 
 	    <varlistentry>
 	      <term>NS16C450</term>
 
 	      <listitem>
 		<para>This is a CMOS version (low power consumption)
 		  of the NS16450.</para>
               </listitem>
 	    </varlistentry>
 
 	    <varlistentry>
 	      <term>NS16550</term>
 
 	      <listitem>
 		<para>Same as NS16450 with a 16-byte send and receive
 		  buffer, but the buffer design was flawed and could
 		  not be used reliably.</para>
 	      </listitem>
 	    </varlistentry>
 
 	    <varlistentry>
 	      <term>NS16550A</term>
 
 	      <listitem>
 		<para>Same as NS16550 with the buffer flaws
 		  corrected. The 16550A and its successors have become
 		  the most popular UART design in the PC industry,
 		  mainly due to their ability to reliably handle higher
 		  data rates on operating systems with sluggish
 		  interrupt response times.</para>
 	      </listitem>
 	    </varlistentry>
 
 	    <varlistentry>
 	      <term>NS16C552</term>
 
 	      <listitem>
 		<para>This component consists of two NS16C550A CMOS
 		  UARTs in a single package.</para>
 	      </listitem>
 	    </varlistentry>
 
 	    <varlistentry>
 	      <term>PC16550D</term>
 
 	      <listitem>
 		<para>Same as NS16550A with subtle flaws
 		  corrected. This is revision D of the 16550 family
 		  and is the latest design available from National
 		  Semiconductor.</para>
 	      </listitem>
 	    </varlistentry>
 	  </variablelist>
 	</sect3>
 
 	<sect3>
 	  <title>The NS16550AF and the PC16550D are the same thing</title>
 
 	  <para>National reorganized their part numbering system a few
 	    years ago, and the NS16550AFN no longer exists by that
 	    name.  (If you have an NS16550AFN, look at the date code on
 	    the part, which is a four digit number that usually starts
 	    with a nine.  The first two digits of the number are the
 	    year, and the last two digits are the week in that year
 	    when the part was packaged.  Any NS16550AFN is
 	    probably a few years old.)</para>
 
 	  <para>The new numbers are like PC16550DV, with minor
 	    differences in the suffix letters depending on the package
 	    material and its shape.  (A description of the numbering
 	    system can be found below.)</para>
 
 	  <para>It is important to understand that in some stores, you
 	    may pay &#36;15(US) for an NS16550AFN made in 1990, while
 	    the next bin holds new PC16550DN parts that include the
 	    minor fixes National has made since the AFN part was in
 	    production.  The PC16550DN was probably made in the past
 	    six months, and because these parts are readily available
 	    it costs half as much as the NS16550AFN (as low as
 	    &#36;5(US) in volume).</para>
 
 	  <para>As the supply of NS16550AFN chips continues to shrink,
 	    the price will probably continue to increase until more
 	    people discover and accept that the PC16550DN really has
 	    the same function as the old part number.</para>
 	</sect3>
 
 	<sect3>
 	  <title>National Semiconductor Part Numbering System</title>
 
 	  <para>The older NS<replaceable>nnnnnrqp</replaceable> part
 	    numbers are now of the format
 	    PC<replaceable>nnnnnrgp</replaceable>.</para>
 
 	  <para>The <replaceable>r</replaceable> is the revision
 	    field.  The current revision of the 16550 from National
 	    Semiconductor is <literal>D</literal>.</para>
 
 	  <para>The <replaceable>p</replaceable> is the package-type
 	    field.  The types are:</para>
 
 	  <informaltable frame="none" pgwide="1">
 	    <tgroup cols="3">
 	      <tbody>
 		<row>
 		  <entry>"F"</entry>
 		  <entry>QFP</entry>
 		  <entry>(quad flat pack) L lead type</entry>
 		</row>
 
 		<row>
 		  <entry>"N"</entry>
 		  <entry>DIP</entry>
 		  <entry>(dual inline package) through hole straight lead
 		    type</entry>
 		</row>
 
 		<row>
 		  <entry>"V"</entry>
 		  <entry>PLCC</entry>
 		  <entry>(plastic leaded chip carrier) J lead type</entry>
 		</row>
 	      </tbody>
 	    </tgroup>
 	  </informaltable>
 
 	  <para>The <replaceable>g</replaceable> is the product grade
 	    field.  If an <literal>I</literal> precedes the
 	    package-type letter, it indicates an
 	    <quote>industrial</quote> grade part, which has higher
 	    specs than a standard part but not as high as a Military
 	    Specification (Milspec) component.  This is an optional
 	    field.</para>
 
 	  <para>So what we used to call an NS16550AFN (DIP Package) is
 	    now called a PC16550DN or PC16550DIN.</para>
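 
 	  <para>The field layout described above can be illustrated
 	    with a small Python sketch that splits a new-style part
 	    number into its fields.  The function name
 	    <literal>parse_part_number</literal>, the regular
 	    expression, and the assumption that the device number is
 	    always five digits are illustrative only; they are based
 	    on the examples in this section rather than on a National
 	    Semiconductor specification.</para>

```python
import re

# Package-type letters and their descriptions, following the table above.
PACKAGES = {
    "F": "quad flat pack",
    "N": "dual inline package",
    "V": "chip carrier (J lead)",
}

# PC + five digits + revision letter + optional grade "I" + package letter.
PART_RE = re.compile(r"PC(\d{5})([A-Z])(I?)([FNV])")


def parse_part_number(part):
    """Split a PC-style part number into its fields, or return None."""
    match = PART_RE.fullmatch(part)
    if match is None:
        return None
    number, revision, grade, package = match.groups()
    return {
        "number": number,
        "revision": revision,
        "industrial": grade == "I",
        "package": PACKAGES[package],
    }


if __name__ == "__main__":
    print(parse_part_number("PC16550DN"))
    print(parse_part_number("PC16550DIN"))
```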
         </sect3>
       </sect2>
 
       <sect2>
 	<title>Other Vendors and Similar UARTs</title>
 
 	<para>Over the years, the 8250, 8250A, 16450 and 16550 have
 	  been licensed or copied by other chip vendors.  In the case
 	  of the 8250, 8250A and 16450, the exact circuit (the
 	  <quote>megacell</quote>) was licensed to many vendors,
 	  including Western Digital and Intel. Other vendors
 	  reverse-engineered the part or produced emulations that had
 	  similar behavior.</para>
 
 	<para>In internal modems, the modem designer will frequently
 	  emulate the 8250A/16450 with the modem microprocessor, and
 	  the emulated UART will frequently have a hidden buffer
-	  consisting of several hundred bytes.  Because of the size of
+	  consisting of several hundred bytes.  Due to the size of
 	  the buffer, these emulations can be as reliable as a 16550A
 	  in their ability to handle high speed data.  However, most
 	  operating systems will still report that the UART is only an
 	  8250A or 16450, and may not make effective use of the extra
 	  buffering present in the emulated UART unless special
 	  drivers are used.</para>
 
 	<para>Some modem makers are driven by market forces to abandon
 	  a design that has hundreds of bytes of buffer and instead
 	  use a 16550A UART so that the product will compare favorably
 	  in market comparisons even though the effective performance
 	  may be lowered by this action.</para>
 
 	<para>A common misconception is that all parts with
 	  <quote>16550A</quote> written on them are identical in
 	  performance.  There are differences, and in some cases,
 	  outright flaws in most of these 16550A clones.</para>
 
 	<para>When the NS16550 was developed, National
 	  Semiconductor obtained several patents on the design and
 	  also limited licensing, making it harder for other
-	  vendors to provide a chip with similar features.  Because of
+	  vendors to provide a chip with similar features.  As a result of
 	  the patents, reverse-engineered designs and emulations had
 	  to avoid infringing the claims covered by the patents.
 	  Consequently, these copies almost never perform exactly the
 	  same as the NS16550A or PC16550D, which are the parts most
 	  computer and modem makers want to buy but are sometimes
 	  unwilling to pay the price required to get the genuine
 	  part.</para>
 
 	<para>Some of the differences in the clone 16550A parts are
 	  unimportant, while others can prevent the device from being
 	  used at all with a given operating system or driver.  These
 	  differences may show up when using other drivers, or when
 	  particular combinations of events occur that were not well
 	  tested or considered in the &windows; driver.  This is because
 	  most modem vendors and 16550-clone makers use the Microsoft
 	  drivers from &windows; for Workgroups 3.11 and the &microsoft;
 	  &ms-dos; utility as the primary tests for compatibility with
 	  the NS16550A.  This overly simplistic criterion means that if a
 	  different operating system is used, problems could appear
 	  due to subtle differences between the clones and genuine
 	  components.</para>
 
 	<para>National Semiconductor has made available a program
 	  named <application>COMTEST</application> that performs
 	  compatibility tests independent of any OS drivers.  It
 	  should be remembered that the purpose of this type of
 	  program is to demonstrate the flaws in the products of the
 	  competition, so the program will report major as well as
 	  extremely subtle differences in behavior in the part being
 	  tested.</para>
 
 	<para>In a series of tests performed by the author of this
 	  document in 1994, components made by National Semiconductor,
 	  TI, StarTech, and CMD as well as megacells and emulations
 	  embedded in internal modems were tested with COMTEST.  A
 	  difference count for some of these components is listed
-	  below. Because these tests were performed in 1994, they may
+	  below. Since these tests were performed in 1994, they may
 	  not reflect the current performance of the given product
 	  from a vendor.</para>
 
 	<para>It should be noted that COMTEST normally aborts when an
 	  excessive number or certain types of problems have been
 	  detected.  As part of this testing, COMTEST was modified so
 	  that it would not abort no matter how many differences were
 	  encountered.</para>
 
 	  <informaltable frame="none" pgwide="1">
 	    <tgroup cols="3">
 	      <thead>
 		<row>
 		  <entry>Vendor</entry>
 		  <entry>Part Number</entry>
 		  <entry>Errors (aka "differences" reported)</entry>
 		</row>
 	      </thead>
 
 	      <tbody>
 		<row>
 		  <entry>National</entry>
 		  <entry>(PC16550DV)</entry>
 		  <entry>0</entry>
 		</row>
 
 		<row>
 		  <entry>National</entry>
 		  <entry>(NS16550AFN)</entry>
 		  <entry>0</entry>
 		</row>
 
 		<row>
 		  <entry>National</entry>
 		  <entry>(NS16C552V)</entry>
 		  <entry>0</entry>
 		</row>
 
 		<row>
 		  <entry>TI</entry>
 		  <entry>(TL16550AFN)</entry>
 		  <entry>3</entry>
 		</row>
 
 		<row>
 		  <entry>CMD</entry>
 		  <entry>(16C550PE)</entry>
 		  <entry>19</entry>
 		</row>
 
 		<row>
 		  <entry>StarTech</entry>
 		  <entry>(ST16C550J)</entry>
 		  <entry>23</entry>
 		</row>
 
 		<row>
 		  <entry>Rockwell</entry>
 		  <entry>Reference modem with internal 16550 or an
 		    emulation (RC144DPi/C3000-25)</entry>
 		  <entry>117</entry>
 		</row>
 
 		<row>
 		  <entry>Sierra</entry>
 		  <entry>Modem with an internal 16550
 		    (SC11951/SC11351)</entry>
 		  <entry>91</entry>
 		</row>
 	      </tbody>
 	    </tgroup>
 	  </informaltable>
 
 	<note>
 	  <para>To date, the author of this document has not found any
 	    non-National parts that report zero differences using the
 	    COMTEST program.  It should also be noted that National
 	    has had five versions of the 16550 over the years and the
 	    newest parts behave a bit differently than the classic
 	    NS16550AFN that is considered the benchmark for
 	    functionality.  COMTEST appears to turn a blind eye to the
 	    differences within the National product line and reports
 	    no errors on the National parts (except for the original
 	    16550) even when there are official errata that describe
 	    bugs in the A, B and C revisions of the parts, so this
 	    bias in COMTEST must be taken into account.</para>
 	</note>
 
 	<para>It is important to understand that a simple count of
 	  differences from COMTEST does not reveal a lot about which
 	  differences are important and which are not.  For example,
 	  about half of the differences reported in the two modems
 	  listed above that have internal UARTs were caused by the
 	  clone UARTs not supporting five- and six-bit character
 	  modes.  The real 16550, 16450, and 8250 UARTs all support
 	  these modes and COMTEST checks the functionality of these
 	  modes so over fifty differences are reported.  However,
 	  almost no modern modem supports five- or six-bit characters,
 	  particularly those with error-correction and compression
 	  capabilities.  This means that the differences related to
 	  five- and six-bit character modes can be discounted.</para>
 
 	<para>Many of the differences COMTEST reports have to do with
 	  timing.  In many of the clone designs, when the host reads
 	  from one port, the status bits in some other port may not
 	  update in the same amount of time (some faster, some slower)
 	  as a <emphasis>real</emphasis> NS16550AFN and COMTEST looks
 	  for these differences.  This means that the number of
 	  differences can be misleading in that one device may only
 	  have one or two differences but they are extremely serious,
 	  and some other device that updates the status registers
 	  faster or slower than the reference part (that would
 	  probably never affect the operation of a properly written
 	  driver) could have dozens of differences reported.</para>
 
 	<para>COMTEST can be used as a screening tool to alert the
 	  administrator to the presence of potentially incompatible
 	  components that might cause problems or have to be handled
 	  as a special case.</para>
 
 	<para>If you run COMTEST on a 16550 that is in a modem, or on
 	  a serial port that has a modem attached, you need to first
 	  issue an ATE0&amp;W command to the modem so that the modem
 	  will not echo any of the test characters.  If you forget to
 	  do this, COMTEST will report at least this one
 	  difference:</para>
 
 	<screen>Error (6)...Timeout interrupt failed: IIR = c1  LSR = 61</screen>
       </sect2>
 
       <sect2>
 	<title>8250/16450/16550 Registers</title>
 
 	<para>The 8250/16450/16550 UART occupies eight contiguous I/O
 	  port addresses.  In the IBM PC, there are two defined
 	  locations for these eight ports and they are known
 	  collectively as <filename>COM1</filename> and <filename>COM2</filename>.  The makers of PC-clones and
 	  add-on cards have created two additional areas known as <filename>COM3</filename>
 	  and <filename>COM4</filename>, but these extra COM ports conflict with other
 	  hardware on some systems.  The most common conflict is with
 	  video adapters that provide IBM 8514 emulation.</para>
 
 	<para><filename>COM1</filename> is located from 0x3f8 to 0x3ff and normally uses
 	  IRQ 4.  <filename>COM2</filename> is located from 0x2f8 to 0x2ff and normally uses
 	  IRQ 3.  <filename>COM3</filename> is located from 0x3e8 to 0x3ef and has no
 	  standardized IRQ.  <filename>COM4</filename> is located from 0x2e8 to 0x2ef and has
 	  no standardized IRQ.</para>
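 
 	<para>The address assignments above can be captured in a
 	  small lookup table.  The following Python sketch is only
 	  an illustration; the names <literal>COM_PORTS</literal>,
 	  <literal>port_range</literal>, and
 	  <literal>register_address</literal> are hypothetical and
 	  do not correspond to any driver interface.</para>

```python
# Base I/O address and usual IRQ for each port; None means no standard IRQ.
COM_PORTS = {
    "COM1": (0x3F8, 4),
    "COM2": (0x2F8, 3),
    "COM3": (0x3E8, None),
    "COM4": (0x2E8, None),
}

UART_PORT_COUNT = 8  # the UART occupies eight contiguous I/O ports


def port_range(name):
    """Return the first and last I/O address used by the named port."""
    base, _irq = COM_PORTS[name]
    return base, base + UART_PORT_COUNT - 1


def register_address(name, offset):
    """Return the I/O address of the register at the given offset (0 to 7)."""
    if offset not in range(UART_PORT_COUNT):
        raise ValueError("offset must be between 0 and 7")
    return COM_PORTS[name][0] + offset


if __name__ == "__main__":
    for name in sorted(COM_PORTS):
        first, last = port_range(name)
        print(name, hex(first), hex(last), COM_PORTS[name][1])
```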
 
 	<para>A description of the I/O ports of the 8250/16450/16550
 	  UART is provided below.</para>
 
         <informaltable frame="none" pgwide="1">
 	  <tgroup cols="3">
 	    <thead>
 	      <row>
 		<entry>I/O Port</entry>
 		<entry>Access Allowed</entry>
 		<entry>Description</entry>
 	      </row>
 	    </thead>
 
 	    <tbody>
 	      <row>
 		<entry>+0x00</entry>
 		<entry>write (DLAB==0)</entry>
 		<entry><para>Transmit Holding Register
 		    (THR).</para><para>Information written to this port is
 		    treated as data words and will be transmitted by the
 		    UART.</para></entry>
 	      </row>
 
 	      <row>
 		<entry>+0x00</entry>
 		<entry>read (DLAB==0)</entry>
 		<entry><para>Receive Buffer Register (RBR).</para><para>Any
 		    data words received by the UART from the serial link are
 		    accessed by the host by reading this
 		    port.</para></entry>
 	      </row>
 
 	      <row>
 		<entry>+0x00</entry>
 		<entry>write/read (DLAB==1)</entry>
 		<entry><para>Divisor Latch LSB (DLL)</para><para>The
 		      master input clock (in the IBM PC, the master clock is
 		      1.8432MHz) is divided by this value, and the resulting
 		      clock determines the baud rate of the UART.  This
 		      register holds bits 0 through 7 of the
 		      divisor.</para></entry>
 	      </row>
 
 	      <row>
 		<entry>+0x01</entry>
 		<entry>write/read (DLAB==1)</entry>
 		<entry><para>Divisor Latch MSB (DLH)</para><para>The
 		      master input clock (in the IBM PC, the master clock is
 		      1.8432MHz) is divided by this value, and the resulting
 		      clock determines the baud rate of the UART.  This
 		      register holds bits 8 through 15 of the
 		      divisor.</para></entry>
 	      </row>
 
 	      <row>
 		<entry>+0x01</entry>
 		<entry>write/read (DLAB==0)</entry>
 		<entrytbl cols="2">
 		  <colspec colnum="1" colname="col1"/>
 		  <colspec colnum="2" colname="col2"/>
 		  <spanspec namest="col1" nameend="col2" spanname="1to2"/>
 
 		  <tbody>
 		    <row>
 			<entry spanname="1to2"><para>Interrupt Enable Register
 			    (IER)</para><para>The 8250/16450/16550 UART
 			    classifies events into one of four categories.
 			    Each category can be configured to generate an
 			    interrupt when any of the events occurs.  The
 			    8250/16450/16550 UART generates a single external
 			    interrupt signal regardless of how many events in
 			    the enabled categories have occurred.  It is up to
 			    the host processor to respond to the interrupt and
 			    then poll the enabled interrupt categories
 			    (usually all categories have interrupts enabled)
 			    to determine the true cause(s) of the
 			    interrupt.</para></entry>
 		    </row>
 
 		    <row>
 		      <entry>Bit 7</entry>
 		      <entry>Reserved, always 0.</entry>
 		    </row>
 
 		    <row>
 		      <entry>Bit 6</entry>
 		      <entry>Reserved, always 0.</entry>
 		    </row>
 
 		    <row>
 		      <entry>Bit 5</entry>
 		      <entry>Reserved, always 0.</entry>
 		    </row>
 
 		    <row>
 		      <entry>Bit 4</entry>
 		      <entry>Reserved, always 0.</entry>
 		    </row>
 
 		    <row>
 		      <entry>Bit 3</entry>
 		      <entry>Enable Modem Status Interrupt (EDSSI). Setting
 			this bit to "1" allows the UART to generate an
 			interrupt when a change occurs on one or more of the
 			status lines.</entry>
 		    </row>
 
 		    <row>
 		      <entry>Bit 2</entry>
 		      <entry>Enable Receiver Line Status Interrupt (ELSI).
 			Setting this bit to "1" causes the UART to generate
 			an interrupt when an error (or a BREAK signal)
 			has been detected in the incoming data.</entry>
 		    </row>
 
 		    <row>
 		      <entry>Bit 1</entry>
 		      <entry>Enable Transmitter Holding Register Empty
 			Interrupt (ETBEI).  Setting this bit to "1" causes the
 			UART to generate an interrupt when the UART has room
 			for one or more additional characters that are to be
 			transmitted.</entry>
 		    </row>
 
 		    <row>
 		      <entry>Bit 0</entry>
 		      <entry>Enable Received Data Available Interrupt
 			(ERBFI).  Setting this bit to "1" causes the UART to
 			generate an interrupt when the UART has received
 			enough characters to exceed the trigger level of the
 			FIFO, or the FIFO timer has expired (stale data), or
 			a single character has been received when the FIFO
 			is disabled.</entry>
 		    </row>
 		  </tbody>
 		</entrytbl>
 	      </row>
 
 	      <row>
 		<entry>+0x02</entry>
 		<entry>write</entry>
 		<entrytbl cols="4">
 		  <colspec colnum="1" colname="col1"/>
 		  <colspec colnum="2" colname="col2"/>
 		  <colspec colnum="3" colname="col3"/>
 		  <colspec colnum="4" colname="col4"/>
 		  <spanspec namest="col1" nameend="col4" spanname="1to4"/>
 		  <spanspec namest="col2" nameend="col4" spanname="2to4"/>
 
 		  <tbody>
 		    <row>
 		      <entry spanname="1to4">FIFO Control Register (FCR)
 			(This port does not exist on the 8250 and 16450
 			UART.)</entry>
 		    </row>
 
 		    <row>
 		      <entry>Bit 7</entry>
 		      <entry spanname="2to4">Receiver Trigger Bit #1</entry>
 		    </row>
 
 		    <row>
 		      <entry>Bit 6</entry>
 		      <entry spanname="2to4"><para>Receiver Trigger Bit
 			#0</para><para>These two bits control at what
 			point the receiver is to generate an interrupt
 			when the FIFO is active.</para></entry>
 		    </row>
 
 		    <row>
 		      <entry colname="col2">7</entry>
 		      <entry colname="col3">6</entry>
 		      <entry colname="col4">How many words are received
 			before an interrupt is generated</entry>
 		    </row>
 
 		    <row>
 		      <entry colname="col2">0</entry>
 		      <entry colname="col3">0</entry>
 		      <entry colname="col4">1</entry>
 		    </row>
 
 		    <row>
 		      <entry colname="col2">0</entry>
 		      <entry colname="col3">1</entry>
 		      <entry colname="col4">4</entry>
 		    </row>
 
 		    <row>
 		      <entry colname="col2">1</entry>
 		      <entry colname="col3">0</entry>
 		      <entry colname="col4">8</entry>
 		    </row>
 
 		    <row>
 		      <entry colname="col2">1</entry>
 		      <entry colname="col3">1</entry>
 		      <entry colname="col4">14</entry>
 		    </row>
 
 		    <row>
 		      <entry>Bit 5</entry>
 		      <entry spanname="2to4">Reserved, always 0.</entry>
 		    </row>
 
 		    <row>
 		      <entry>Bit 4</entry>
 		      <entry spanname="2to4">Reserved, always 0.</entry>
 		    </row>
 
 		    <row>
 		      <entry>Bit 3</entry>
 		      <entry spanname="2to4">DMA Mode Select.  If Bit 0 is
 			set to "1" (FIFOs enabled), setting this bit changes
 			the operation of the -RXRDY and -TXRDY signals from
 			Mode 0 to Mode 1.</entry>
 		    </row>
 
 		    <row>
 		      <entry>Bit 2</entry>
 		      <entry spanname="2to4">Transmit FIFO Reset.  When a
 			"1" is written to this bit, the contents of the FIFO
 			are discarded.  Any word currently being transmitted
 			will be sent intact.  This function is useful in
 			aborting transfers.</entry>
 		    </row>
 
 		    <row>
 		      <entry>Bit 1</entry>
 		      <entry spanname="2to4">Receiver FIFO Reset.  When a
 			"1" is written to this bit, the contents of the FIFO
 			are discarded.  Any word currently being assembled
 			in the shift register will be received
 			intact.</entry>
 		    </row>
 
 		    <row>
 		      <entry>Bit 0</entry>
 		      <entry spanname="2to4">16550 FIFO Enable.  When set,
 			both the transmit and receive FIFOs are enabled.
 			Any contents in the holding register, shift
 			registers or FIFOs are lost when FIFOs are enabled
 			or disabled.</entry>
 		    </row>
 		  </tbody>
 		</entrytbl>
 	      </row>
 
 	      <row>
 		<entry>+0x02</entry>
 		<entry>read</entry>
 		<entrytbl cols="6">
 		  <colspec colnum="1" colname="col1"/>
 		  <colspec colnum="2" colname="col2"/>
 		  <colspec colnum="3" colname="col3"/>
 		  <colspec colnum="4" colname="col4"/>
 		  <colspec colnum="5" colname="col5"/>
 		  <colspec colnum="6" colname="col6"/>
 		  <spanspec namest="col1" nameend="col6" spanname="1to6"/>
 		  <spanspec namest="col2" nameend="col6" spanname="2to6"/>
 
 		  <tbody>
 		    <row>
 		      <entry spanname="1to6">Interrupt Identification
 			Register</entry>
 		    </row>
 
 		    <row>
 		      <entry>Bit 7</entry>
 		      <entry spanname="2to6">FIFOs enabled.  On the
 			8250/16450 UART, this bit is zero.</entry>
 		    </row>
 
 		    <row>
 		      <entry>Bit 6</entry>
 		      <entry spanname="2to6">FIFOs enabled.  On the
 			8250/16450 UART, this bit is zero.</entry>
 		    </row>
 
 		    <row>
 		      <entry>Bit 5</entry>
 		      <entry spanname="2to6">Reserved, always 0.</entry>
 		    </row>
 
 		    <row>
 		      <entry>Bit 4</entry>
 		      <entry spanname="2to6">Reserved, always 0.</entry>
 		    </row>
 
 		    <row>
 		      <entry>Bit 3</entry>
 		      <entry spanname="2to6">Interrupt ID Bit #2.  On the
 			8250/16450 UART, this bit is zero.</entry>
 		    </row>
 
 		    <row>
 		      <entry>Bit 2</entry>
 		      <entry spanname="2to6">Interrupt ID Bit #1</entry>
 		    </row>
 
 		    <row>
 		      <entry>Bit 1</entry>
 		      <entry spanname="2to6">Interrupt ID Bit #0.  These
 			three bits combine to report the category of
 			event that caused the interrupt that is in
 			progress.  These categories have priorities,
 			so if multiple categories of events occur at
 			the same time, the UART will report the more
 			important events first and the host must
 			resolve the events in the order they are
 			reported.  All events that caused the current
 			interrupt must be resolved before any new
 			interrupts will be generated.  (This is a
 			limitation of the PC architecture.)</entry>
 		    </row>
 
 		    <row>
 		      <entry colname="col2">2</entry>
 		      <entry colname="col3">1</entry>
 		      <entry colname="col4">0</entry>
 		      <entry colname="col5">Priority</entry>
 		      <entry colname="col6">Description</entry>
 		    </row>
 
 		    <row>
 		      <entry colname="col2">0</entry>
 		      <entry colname="col3">1</entry>
 		      <entry colname="col4">1</entry>
 		      <entry colname="col5">First</entry>
 		      <entry colname="col6">Received Error (OE, PE, BI, or
 			FE)</entry>
 		    </row>
 
 		    <row>
 		      <entry colname="col2">0</entry>
 		      <entry colname="col3">1</entry>
 		      <entry colname="col4">0</entry>
 		      <entry colname="col5">Second</entry>
 		      <entry colname="col6">Received Data Available</entry>
 		    </row>
 
 		    <row>
 		      <entry colname="col2">1</entry>
 		      <entry colname="col3">1</entry>
 		      <entry colname="col4">0</entry>
 		      <entry colname="col5">Second</entry>
 		      <entry colname="col6">Trigger level identification
 			(Stale data in receive buffer)</entry>
 		    </row>
 
 		    <row>
 		      <entry colname="col2">0</entry>
 		      <entry colname="col3">0</entry>
 		      <entry colname="col4">1</entry>
 		      <entry colname="col5">Third</entry>
 		      <entry colname="col6">Transmitter has room for more
 			words (THRE)</entry>
 		    </row>
 
 		    <row>
 		      <entry colname="col2">0</entry>
 		      <entry colname="col3">0</entry>
 		      <entry colname="col4">0</entry>
 		      <entry colname="col5">Fourth</entry>
 		      <entry colname="col6">Modem Status Change (-CTS, -DSR,
 			-RI, or -DCD)</entry>
 		    </row>
 
 		    <row>
 		      <entry>Bit 0</entry>
 		      <entry spanname="2to6">Interrupt Pending Bit.  If this
 			bit is set to "0", then at least one interrupt is
 			pending.</entry>
 		    </row>
 		  </tbody>
 		</entrytbl>
 	        </row>
 
 		<row>
 		  <entry>+0x03</entry>
 		  <entry>write/read</entry>
 		  <entrytbl cols="5">
 		    <colspec colnum="1" colname="col1"/>
 		    <colspec colnum="2" colname="col2"/>
 		    <colspec colnum="3" colname="col3"/>
 		    <colspec colnum="4" colname="col4"/>
 		    <colspec colnum="5" colname="col5"/>
 		    <spanspec namest="col1" nameend="col5" spanname="1to5"/>
 		    <spanspec namest="col2" nameend="col5" spanname="2to5"/>
 		    <spanspec namest="col4" nameend="col5" spanname="4to5"/>
 
 		    <tbody>
 		      <row>
 			<entry spanname="1to5">Line Control Register
 			  (LCR)</entry>
 		      </row>
 
 		      <row>
 			<entry>Bit 7</entry>
 			<entry spanname="2to5">Divisor Latch Access Bit
 			  (DLAB).  When set, access to the data
 			  transmit/receive register (THR/RBR) and the
 			  Interrupt Enable Register (IER) is disabled.  Any
 			  access to these ports is now redirected to the
 			  Divisor Latch Registers.  Setting this bit, loading
 			  the Divisor Registers, and clearing DLAB should be
 			  done with interrupts disabled.</entry>
 		      </row>
 
 		      <row>
 			<entry>Bit 6</entry>
 			<entry spanname="2to5">Set Break.  When set to "1",
 			  the transmitter begins to transmit continuous
 			  Spacing until this bit is set to "0".  This
 			  overrides any bits of characters that are being
 			  transmitted.</entry>
 		      </row>
 
 		      <row>
 			<entry>Bit 5</entry>
 			<entry spanname="2to5">Stick Parity.  When parity is
 			  enabled, setting this bit causes parity to always be
 			  "1" or "0", based on the value of Bit 4.</entry>
 		      </row>
 
 		      <row>
 			<entry>Bit 4</entry>
 			<entry spanname="2to5">Even Parity Select (EPS). When
 			  parity is enabled and Bit 5 is "0", setting this bit
 			  causes even parity to be transmitted and expected.
 			  Otherwise, odd parity is used.</entry>
 		      </row>
 
 		      <row>
 			<entry>Bit 3</entry>
 			<entry spanname="2to5">Parity Enable (PEN).  When set
 			  to "1", a parity bit is inserted between the last
 			  bit of the data and the Stop Bit.  The UART will
 			  also expect parity to be present in the received
 			  data.</entry>
 		      </row>
 
 		      <row>
 			<entry>Bit 2</entry>
 			<entry spanname="2to5">Number of Stop Bits (STB). If
 			  set to "1" and using 5-bit data words, 1.5 Stop Bits
 			  are transmitted and expected in each data word.  For
 			  6, 7 and 8-bit data words, 2 Stop Bits are
 			  transmitted and expected.  When this bit is set to
 			  "0", one Stop Bit is used on each data word.</entry>
 		      </row>
 
 		      <row>
 			<entry>Bit 1</entry>
 			<entry spanname="2to5">Word Length Select Bit #1
 			  (WLSB1)</entry>
 		      </row>
 
 		      <row>
 			<entry>Bit 0</entry>
 			<entry spanname="2to5">Word Length Select Bit #0
 			  (WLSB0)</entry>
 		      </row>
 
 		      <row>
 			<entry spanname="2to5">Together these
 			  bits specify the number of bits in each data
 			  word.</entry>
 		      </row>
 
 		      <row>
 			<entry colname="col2">1</entry>
 			<entry colname="col3">0</entry>
 			<entry spanname="4to5">Word
 			  Length</entry>
 		      </row>
 
 		      <row>
 			<entry colname="col2">0</entry>
 			<entry colname="col3">0</entry>
 			<entry spanname="4to5">5 Data
 			  Bits</entry>
 		      </row>
 
 		      <row>
 			<entry colname="col2">0</entry>
 			<entry colname="col3">1</entry>
 			<entry spanname="4to5">6 Data
 			  Bits</entry>
 		      </row>
 
 		      <row>
 			<entry colname="col2">1</entry>
 			<entry colname="col3">0</entry>
 			<entry spanname="4to5">7 Data
 			  Bits</entry>
 		      </row>
 
 		      <row>
 			<entry colname="col2">1</entry>
 			<entry colname="col3">1</entry>
 			<entry spanname="4to5">8 Data
 			  Bits</entry>
 		      </row>
 		    </tbody>
 		  </entrytbl>
 		</row>
 
 		<row>
 		  <entry>+0x04</entry>
 		  <entry>write/read</entry>
 		  <entrytbl cols="2">
 		    <colspec colnum="1" colname="col1"/>
 		    <colspec colnum="2" colname="col2"/>
 		    <spanspec namest="col1" nameend="col2" spanname="1to2"/>
 
 		    <tbody>
 		      <row>
 			<entry spanname="1to2">Modem Control Register
 			  (MCR)</entry>
 		      </row>
 
 		      <row>
 			<entry>Bit 7</entry>
 			<entry>Reserved, always 0.</entry>
 		      </row>
 
 		      <row>
 			<entry>Bit 6</entry>
 			<entry>Reserved, always 0.</entry>
 		      </row>
 
 		      <row>
 			<entry>Bit 5</entry>
 			<entry>Reserved, always 0.</entry>
 		      </row>
 
 		      <row>
 			<entry>Bit 4</entry>
 			<entry>Loop-Back Enable.  When set to "1", the UART
 			  transmitter and receiver are internally connected
 			  together to allow diagnostic operations.  In
 			  addition, the UART modem control outputs are
 			  connected to the UART modem control inputs.  CTS is
 			  connected to RTS, DTR is connected to DSR, OUT 1 is
 			  connected to RI, and OUT 2 is connected to
 			  DCD.</entry>
 		      </row>
 
 		      <row>
 			<entry>Bit 3</entry>
 			<entry>OUT 2.  An auxiliary output that the host
 			  processor may set high or low.  In the IBM PC serial
 			  adapter (and most clones), OUT 2 is used to
 			  tri-state (disable) the interrupt signal from the
 			  8250/16450/16550 UART.</entry>
 		      </row>
 
 		      <row>
 			<entry>Bit 2</entry>
 			<entry>OUT 1.  An auxiliary output that the host
 			  processor may set high or low.  This output is not
 			  used on the IBM PC serial adapter.</entry>
 		      </row>
 
 		      <row>
 			<entry>Bit 1</entry>
 			<entry>Request to Send (RTS).  When set to "1", the
 			  output of the UART -RTS line is Low
 			  (Active).</entry>
 		      </row>
 
 		      <row>
 			<entry>Bit 0</entry>
 			<entry>Data Terminal Ready (DTR).  When set to "1",
 			  the output of the UART -DTR line is Low
 			  (Active).</entry>
 		      </row>
 		    </tbody>
 		  </entrytbl>
 		</row>
 
 		<row>
 		  <entry>+0x05</entry>
 		  <entry>write/read</entry>
 		  <entrytbl cols="2">
 		    <colspec colnum="1" colname="col1"/>
 		    <colspec colnum="2" colname="col2"/>
 		    <spanspec namest="col1" nameend="col2" spanname="1to2"/>
 
 		    <tbody>
 		      <row>
 			<entry spanname="1to2">Line Status Register
 			  (LSR)</entry>
 		      </row>
 
 		      <row>
 			<entry>Bit 7</entry>
 			<entry>Error in Receiver FIFO.  On the 8250/16450
 			  UART, this bit is zero.  This bit is set to "1" when
 			  any of the bytes in the FIFO have one or more of the
 			  following error conditions: PE, FE, or BI.</entry>
 		      </row>
 
 		      <row>
 			<entry>Bit 6</entry>
 			<entry>Transmitter Empty (TEMT).  When set to "1",
 			  there are no words remaining in the transmit FIFO
 			  or the transmit shift register.  The transmitter is
 			  completely idle.</entry>
 		      </row>
 
 		      <row>
 			<entry>Bit 5</entry>
 			<entry>Transmitter Holding Register Empty (THRE).
 			  When set to "1", the FIFO (or holding register) now
 			  has room for at least one additional word to
 			  transmit.  The transmitter may still be transmitting
 			  when this bit is set to "1".</entry>
 		      </row>
 
 		      <row>
 			<entry>Bit 4</entry>
 			<entry>Break Interrupt (BI).  The receiver has
 			  detected a Break signal.</entry>
 		      </row>
 
 		      <row>
 			<entry>Bit 3</entry>
 			<entry>Framing Error (FE).  A Start Bit was detected
 			  but the Stop Bit did not appear at the expected
 			  time.  The received word is probably
 			  garbled.</entry>
 		      </row>
 
 		      <row>
 			<entry>Bit 2</entry>
 			<entry>Parity Error (PE).  The parity bit was
 			  incorrect for the word received.</entry>
 		      </row>
 
 		      <row>
 			<entry>Bit 1</entry>
 			<entry>Overrun Error (OE).  A new word was received
 			  and there was no room in the receive buffer.  The
 			  newly-arrived word in the shift register is
 			  discarded.  On 8250/16450 UARTs, the word in the
 			  holding register is discarded and the newly-arrived
 			  word is put in the holding register.</entry>
 		      </row>
 
 		      <row>
 			<entry>Bit 0</entry>
 			<entry>Data Ready (DR).  One or more words are in the
 			  receive FIFO that the host may read.  A word must be
 			  completely received and moved from the shift
 			  register into the FIFO (or holding register for
 			  8250/16450 designs) before this bit is set.</entry>
 		      </row>
 		    </tbody>
 		  </entrytbl>
 		</row>
 
 		<row>
 		  <entry>+0x06</entry>
 		  <entry>write/read</entry>
 		  <entrytbl cols="2">
 		    <colspec colnum="1" colname="col1"/>
 		    <colspec colnum="2" colname="col2"/>
 		    <spanspec namest="col1" nameend="col2" spanname="1to2"/>
 
 		    <tbody>
 		      <row>
 			<entry spanname="1to2">Modem Status Register
 			  (MSR)</entry>
 		      </row>
 
 		      <row>
 			<entry>Bit 7</entry>
 			<entry>Data Carrier Detect (DCD).  Reflects the state
 			  of the DCD line on the UART.</entry>
 		      </row>
 
 		      <row>
 			<entry>Bit 6</entry>
 			<entry>Ring Indicator (RI).  Reflects the state of the
 			  RI line on the UART.</entry>
 		      </row>
 
 		      <row>
 			<entry>Bit 5</entry>
 			<entry>Data Set Ready (DSR).  Reflects the state of
 			  the DSR line on the UART.</entry>
 		      </row>
 
 		      <row>
 			<entry>Bit 4</entry>
 			<entry>Clear To Send (CTS).  Reflects the state of the
 			  CTS line on the UART.</entry>
 		      </row>
 
 		      <row>
 			<entry>Bit 3</entry>
 			<entry>Delta Data Carrier Detect (DDCD).  Set to "1"
 			  if the -DCD line has changed state one or more
 			  times since the last time the MSR was read by the
 			  host.</entry>
 		      </row>
 
 		      <row>
 			<entry>Bit 2</entry>
 			<entry>Trailing Edge Ring Indicator (TERI).  Set to
 			  "1" if the -RI line has had a low to high transition
 			  since the last time the MSR was read by the
 			  host.</entry>
 		      </row>
 
 		      <row>
 			<entry>Bit 1</entry>
 			<entry>Delta Data Set Ready (DDSR).  Set to "1" if the
 			  -DSR line has changed state one or more times
 			  since the last time the MSR was read by the
 			  host.</entry>
 		      </row>
 
 		      <row>
 			<entry>Bit 0</entry>
 			<entry>Delta Clear To Send (DCTS).  Set to "1" if the
 			  -CTS line has changed state one or more times
 			  since the last time the MSR was read by the
 			  host.</entry>
 		      </row>
 		    </tbody>
 		  </entrytbl>
 		</row>
 
 		<row>
 		  <entry>+0x07</entry>
 		  <entry>write/read</entry>
 		  <entry>Scratch Register (SCR).  This register performs no
 		    function in the UART.  Any value can be written by the
 		    host to this location and read by the host later
 		    on.</entry>
 		</row>
 	      </tbody>
 	    </tgroup>
 	  </informaltable>
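
 	<para>The divisor latch and Line Control Register entries in
 	  the table can be tied together with a little arithmetic.
 	  The following shell sketch is illustrative only.  The
 	  function names are hypothetical, and it assumes the usual
 	  8250/16450/16550 arrangement in which the UART divides the
 	  master clock by 16 times the divisor, so that on the IBM PC
 	  the baud rate is 1843200 / (16 * divisor).  It only prints
 	  the values a driver would load and does not touch any
 	  hardware.</para>

 	<programlisting># Master clock of the IBM PC serial adapter in Hz (1.8432MHz).
 UART_CLOCK=1843200

 # Print the 16-bit divisor for the baud rate given in $1.
 # Assumption: the UART samples at 16 times the baud rate.
 uart_divisor() {
     echo $(( UART_CLOCK / (16 * $1) ))
 }

 # Print the DLL (bits 0 through 7) and DLH (bits 8 through 15)
 # values for the baud rate given in $1.
 uart_latch_bytes() {
     divisor=$(uart_divisor "$1")
     printf 'DLL=0x%02x DLH=0x%02x\n' $(( divisor % 256 )) $(( divisor / 256 ))
 }

 # Print the LCR value for 8 data bits, no parity, one Stop Bit:
 # bits 1 and 0 are "1", bits 3 and 2 are "0".  $1 is the DLAB bit.
 uart_lcr_8n1() {
     printf '0x%02x\n' $(( $1 * 128 + 3 ))
 }

 uart_latch_bytes 9600     # DLL=0x0c DLH=0x00
 uart_lcr_8n1 1            # 0x83, DLAB set while loading the divisor
 uart_lcr_8n1 0            # 0x03, DLAB cleared for normal operation</programlisting>

 	<para>A driver following this sketch would write the first
 	  LCR value, load DLL and DLH, and then write the second LCR
 	  value, all with interrupts disabled as noted in the DLAB
 	  description.</para>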
       </sect2>
 
       <sect2>
 	<title>Beyond the 16550A UART</title>
 
 	<para>Although National Semiconductor has not offered any
 	  components compatible with the 16550 that provide additional
 	  features, various other vendors have.  Some of these
 	  components are described below.  It should be understood
 	  that to effectively utilize these improvements, drivers may
 	  have to be provided by the chip vendor since most of the
 	  popular operating systems do not support features beyond
 	  those provided by the 16550.</para>
 
 	  <variablelist>
 	    <varlistentry>
 	      <term>ST16650</term>
 
 	      <listitem>
 		<para>By default this part is similar to the NS16550A, but an
 		  extended 32-byte send and receive buffer can be optionally
 		  enabled.  Made by StarTech.</para>
 	      </listitem>
 	    </varlistentry>
 
 	    <varlistentry>
 	      <term>TIL16660</term>
 
 	      <listitem>
 		<para>By default this part behaves similarly to the NS16550A,
 		  but an extended 64-byte send and receive buffer can be
 		  optionally enabled.  Made by Texas Instruments.</para>
 	      </listitem>
 	    </varlistentry>
 
 	    <varlistentry>
 	      <term>Hayes ESP</term>
 
 	      <listitem>
 		<para>This proprietary plug-in card contains a 2048-byte send
 		  and receive buffer, and supports data rates to
 		  230.4Kbit/sec.  Made by Hayes.</para>
 	      </listitem>
 	    </varlistentry>
 	  </variablelist>
 
 	  <para>In addition to these <quote>dumb</quote> UARTs, many vendors
 	    produce intelligent serial communication boards.  This type of
 	    design usually provides a microprocessor that interfaces with
 	    several UARTs, processes and buffers the data, and then alerts the
-	    main PC processor when necessary.  Because the UARTs are not
+	    main PC processor when necessary.  As the UARTs are not
 	    directly accessed by the PC processor in this type of
 	    communication system, it is not necessary for the vendor to use
 	    UARTs that are compatible with the 8250, 16450, or the 16550 UART.
 	    This leaves the designer free to choose components that may have better
 	    performance characteristics.</para>
       </sect2>
     </sect1>
 
     <sect1 xml:id="sio">
       <title>Configuring the <filename>sio</filename> driver</title>
 
       <para>The <filename>sio</filename> driver provides support
 	for NS8250-, NS16450-, NS16550- and NS16550A-based EIA RS-232C
 	(CCITT V.24) communications interfaces.  Several multiport
 	cards are supported as well.  See the &man.sio.4; manual page
 	for detailed technical documentation.</para>
 
       <sect2>
 	<title>Digi International (DigiBoard) PC/8</title>
 
 	<para><emphasis>Contributed by &a.awebster.email;.  26 August
 	  1995.</emphasis></para>
 
         <para>Here is a config snippet from a machine with a Digi
 	  International PC/8 with 16550.  It has 8 modems connected to
 	  these 8 lines, and they work just great.  Do not forget to
 	  add <literal>options COM_MULTIPORT</literal> or it will not
 	  work very well!</para>
 
 	<programlisting>device          sio4    at isa? port 0x100 flags 0xb05
 device          sio5    at isa? port 0x108 flags 0xb05
 device          sio6    at isa? port 0x110 flags 0xb05
 device          sio7    at isa? port 0x118 flags 0xb05
 device          sio8    at isa? port 0x120 flags 0xb05
 device          sio9    at isa? port 0x128 flags 0xb05
 device          sio10   at isa? port 0x130 flags 0xb05
 device          sio11   at isa? port 0x138 flags 0xb05 irq 9</programlisting>
 
 	<para>The trick in setting this up is that the most significant
 	  byte of the flags represents the last SIO port, in this case
 	  11 (0xb), so the flags are 0xb05.</para>
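
 	<para>As a quick check of that arithmetic, the flags value can
 	  be computed in the shell.  This is only a sketch: the
 	  function name <literal>sio_flags</literal> is hypothetical,
 	  and the low byte <literal>0x05</literal> is simply taken
 	  from the listing above.</para>

 	<programlisting># Print the sio flags for a multiport board whose last
 # (master) port is sio$1.  The low byte stays at 0x05.
 sio_flags() {
     printf '0x%x\n' $(( $1 * 256 + 5 ))
 }

 sio_flags 11    # prints 0xb05</programlisting>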
       </sect2>
 
       <sect2>
 	<title>Boca 16</title>
 
 	<para><emphasis>Contributed by &a.whiteside.email;.  26 August
 	  1995.</emphasis></para>
 
 	<para>Getting a Boca 16 port board to work with FreeBSD
 	  is pretty straightforward, but you will need a couple of
 	  things:</para>
 
 	<orderedlist>
 	  <listitem>
 	    <para>You either need the kernel sources installed so you
 		can recompile the necessary options or you will need
 		someone else to compile it for you.  The 2.0.5 default
 		kernel does <emphasis>not</emphasis> come with
 		multiport support enabled and you will need to add a
 		device entry for each port anyway.</para>
 	  </listitem>
 
 	  <listitem>
 	    <para>You will need to know the interrupt and IO
 	      setting for your Boca Board so you can set these options
 	      properly in the kernel.</para>
 	  </listitem>
 	</orderedlist>
 
 	<para>One important note &mdash; the actual UART chips for the
 	  Boca 16 are in the connector box, not on the internal board
 	  itself.  So if you have it unplugged, probes of those ports
 	  will fail.  I have never tested booting with the box
 	  unplugged and plugging it back in, and I suggest you do not
 	  either.</para>
 
         <para>If you do not already have a custom kernel
 	  configuration file set up, refer to the <link xlink:href="&url.books.handbook;/kernelconfig.html">Kernel
 	  Configuration</link> chapter of the FreeBSD Handbook for
 	  general procedures.  The following are the specifics for the
 	  Boca 16 board and assume you are using the kernel name
 	  MYKERNEL and editing with vi.</para>
 
 	<procedure>
 	  <step>
 	    <para>Add the line
 
 	      <programlisting>options COM_MULTIPORT</programlisting>
 
 	      to the config file.</para>
 	  </step>
 
 	  <step>
 	    <para>Where the current <literal>device
 	      sio<replaceable>n</replaceable></literal> lines are, you
 	      will need to add 16 more devices.  The
 	      following example is for a Boca Board with an interrupt
 	      of 3 and a base IO address of 100h.  The IO address for
 	      each port is 8 (hexadecimal) higher than that of the
 	      previous port, giving the addresses 100h, 108h, 110h,
 	      and so on.</para>
 
 	    <programlisting>device sio1 at isa? port 0x100 flags 0x1005
 device sio2 at isa? port 0x108 flags 0x1005
 device sio3 at isa? port 0x110 flags 0x1005
 device sio4 at isa? port 0x118 flags 0x1005
 &hellip;
 device sio15 at isa? port 0x170 flags 0x1005
 device sio16 at isa? port 0x178 flags 0x1005 irq 3</programlisting>
 
 	    <para>The flags entry <emphasis>must</emphasis> be changed
 	      from this example unless you are using the exact same
 	      sio assignments.  Flags are set according to
 	      0x<replaceable>M</replaceable><replaceable>YY</replaceable>
 	      where <replaceable>M</replaceable> indicates the minor
 	      number of the master port (the last port on a Boca 16)
 	      and <replaceable>YY</replaceable> indicates whether FIFO
 	      is enabled or disabled (enabled), whether IRQ sharing is
 	      used (yes), and whether there is an AST/4 compatible IRQ
 	      control register (no).  In this example, <literal>flags
 	      0x1005</literal> indicates that the master port
 	      is sio16.  If I added another board and assigned sio17
 	      through sio32, the flags for all 16 ports on
 	      <emphasis>that</emphasis> board would be 0x2005, where
 	      20 indicates the minor number of the master port.  Do
 	      not change the 05 setting.</para>
 	  </step>
 
 	  <step>
 	    <para>Save and complete the kernel configuration,
 	      recompile, install and reboot.  Presuming you have
 	      successfully installed the recompiled kernel and have it
 	      set to the correct address and IRQ, your boot message
 	      should indicate the successful probe of the Boca ports
 	      as follows (the sio numbers, IO addresses and IRQ could
 	      be different on your system):</para>
 
 	    <screen>sio1 at 0x100-0x107 flags 0x1005 on isa
 sio1: type 16550A (multiport)
 sio2 at 0x108-0x10f flags 0x1005 on isa
 sio2: type 16550A (multiport)
 sio3 at 0x110-0x117 flags 0x1005 on isa
 sio3: type 16550A (multiport)
 sio4 at 0x118-0x11f flags 0x1005 on isa
 sio4: type 16550A (multiport)
 sio5 at 0x120-0x127 flags 0x1005 on isa
 sio5: type 16550A (multiport)
 sio6 at 0x128-0x12f flags 0x1005 on isa
 sio6: type 16550A (multiport)
 sio7 at 0x130-0x137 flags 0x1005 on isa
 sio7: type 16550A (multiport)
 sio8 at 0x138-0x13f flags 0x1005 on isa
 sio8: type 16550A (multiport)
 sio9 at 0x140-0x147 flags 0x1005 on isa
 sio9: type 16550A (multiport)
 sio10 at 0x148-0x14f flags 0x1005 on isa
 sio10: type 16550A (multiport)
 sio11 at 0x150-0x157 flags 0x1005 on isa
 sio11: type 16550A (multiport)
 sio12 at 0x158-0x15f flags 0x1005 on isa
 sio12: type 16550A (multiport)
 sio13 at 0x160-0x167 flags 0x1005 on isa
 sio13: type 16550A (multiport)
 sio14 at 0x168-0x16f flags 0x1005 on isa
 sio14: type 16550A (multiport)
 sio15 at 0x170-0x177 flags 0x1005 on isa
 sio15: type 16550A (multiport)
 sio16 at 0x178-0x17f irq 3 flags 0x1005 on isa
 sio16: type 16550A (multiport master)</screen>
 
 	    <para>If the messages go by too fast to see,
 
 	    <screen>&prompt.root; <userinput>dmesg | more</userinput></screen>
 	      will show you the boot messages.</para>
 	  </step>
 
 	  <step>
 	    <para>Next, appropriate entries in
 	      <filename>/dev</filename> for the devices must be made
 	      using the <filename>/dev/MAKEDEV</filename>
 	      script.  This step can be omitted if you are running
 	      FreeBSD&nbsp;5.X with a kernel that has &man.devfs.5;
 	      support compiled in.</para>
 
 	    <para>If you do need to create the <filename>/dev</filename>
 	      entries, run the following as <systemitem class="username">root</systemitem>:</para>
 
 	    <screen>&prompt.root; <userinput>cd /dev</userinput>
 &prompt.root; <userinput>./MAKEDEV tty1</userinput>
 &prompt.root; <userinput>./MAKEDEV cua1</userinput>
 <emphasis>(everything in between)</emphasis>
 &prompt.root; <userinput>./MAKEDEV ttyg</userinput>
 &prompt.root; <userinput>./MAKEDEV cuag</userinput></screen>
 
 	    <para>If you do not want or need call-out devices for some
 	      reason, you can dispense with making the
 	      <filename>cua*</filename> devices.</para>
 	  </step>
 
 	  <step>
 	    <para>If you want a quick and sloppy way to make sure the
 	      devices are working, you can simply plug a modem into
 	      each port and (as root)
 
             <screen>&prompt.root; <userinput>echo at &gt; ttyd*</userinput></screen>
 	      for each device you have made.  You
 	      <emphasis>should</emphasis> see the RX lights flash for each
 	      working port.</para>
 	  </step>
 	</procedure>
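
 	<para>Typing sixteen nearly identical
 	  <literal>device</literal> lines by hand is error-prone, so
 	  it may help to generate them.  The following shell sketch
 	  is not part of the original procedure: the function name
 	  <literal>boca16_config</literal> is hypothetical, and it
 	  assumes the layout described above, namely ports spaced 8
 	  addresses apart, the last port acting as master, and the
 	  low flags byte left at <literal>0x05</literal>.</para>

 	<programlisting># Print the device lines for one 16-port board.
 # $1 = first sio unit, $2 = base IO address, $3 = IRQ.
 boca16_config() {
     first=$1
     base=$2
     irq=$3
     last=$(( $first + 15 ))
     # Master port number in the high byte, 0x05 in the low byte.
     flags=$(( $last * 256 + 5 ))
     n=0
     while [ "$n" -lt 16 ]; do
         line=$(printf 'device sio%d at isa? port 0x%x flags 0x%x' \
             $(( $first + $n )) $(( $base + 8 * $n )) "$flags")
         # Only the last (master) port carries the IRQ.
         if [ "$n" -eq 15 ]; then
             line="$line irq $irq"
         fi
         echo "$line"
         n=$(( $n + 1 ))
     done
 }

 boca16_config 1 0x100 3</programlisting>

 	<para>With these arguments the first and last lines printed
 	  match the first and last lines of the example configuration
 	  shown in the procedure.</para>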
       </sect2>
 
       <sect2>
 	<title>Support for Cheap Multi-UART Cards</title>
 
 	<para><emphasis>Contributed by Helge Oldach
 	  <email>hmo@sep.hamburg.com</email>, September
 	  1999</emphasis></para>
 
 	<para>Ever wondered about FreeBSD support for your $20
 	  multi-I/O card with two (or more) COM ports, sharing IRQs?
 	  Here is how:</para>
 
 	<para>Usually the only option to support this kind of board
 	  is to use a distinct IRQ for each port.  For example, if
 	  your CPU board has an on-board <filename>COM1</filename>
 	  port (aka <filename>sio0</filename>&ndash;I/O address
 	  0x3F8 and IRQ 4) and you have an extension board with two
 	  UARTs, you will commonly need to configure them as
 	  <filename>COM2</filename> (aka
 	  <filename>sio1</filename>&ndash;I/O address 0x2F8 and
 	  IRQ 3), and the third port (aka
 	  <filename>sio2</filename>) as I/O 0x3E8 and IRQ 5.
 	  Obviously this is a waste of IRQ resources, as it should be
 	  basically possible to run both extension board ports using a
 	  single IRQ with the <literal>COM_MULTIPORT</literal>
 	  configuration described in the previous sections.</para>
 
 	<para>Such cheap I/O boards commonly have a 4 by 3 jumper
 	  matrix for the COM ports, similar to the following:</para>
 
 <programlisting>            o  o  o  *
 Port A               |
             o  *  o  *
 Port B         |
             o  *  o  o
 IRQ         2  3  4  5</programlisting>
 
 	<para>Shown here is port A wired for IRQ 5 and port B wired
 	  for IRQ 3.  The IRQ columns on your specific board may
 	  vary&mdash;other boards may supply jumpers for IRQs 3, 4, 5,
 	  and 7 instead.</para>
 
 	<para>One could conclude that wiring both ports for IRQ 3
 	  using a handcrafted wire-made jumper covering all three
 	  connection points in the IRQ 3 column would solve the issue,
 	  but no.  You cannot duplicate IRQ 3 because the output
 	  drivers of each UART are wired in a <quote>totem
 	  pole</quote> fashion, so if one of the UARTs drives IRQ 3,
 	  the output signal will not be what you would expect.
 	  Depending on the implementation of the extension board or
 	  your motherboard, the IRQ 3 line will continuously stay up,
 	  or always stay low.</para>
 
 	<para>You need to decouple the IRQ drivers for the two UARTs,
 	  so that the IRQ line of the board goes up if (and only
 	  if) one of the UARTs asserts an IRQ, and stays low otherwise.
 	  The solution, proposed by Joerg Wunsch
 	  <email>j@ida.interface-business.de</email>, is to solder up a
 	  wired-OR consisting of two diodes (germanium or
 	  Schottky types strongly preferred) and a 1 kOhm resistor.
 	  Here is the schematic, starting from the 4 by 3 jumper field
 	  above:</para>
 
 <programlisting>                          Diode
                 +----------&gt;|-------+
                /                    |
             o  *  o  o              |     1 kOhm
 Port A                              +----|######|-------+
             o  *  o  o              |                   |
 Port B          `-------------------+                 ==+==
             o  *  o  o              |                 Ground
                 \                   |
                  +---------&gt;|-------+
 IRQ         2  3  4  5    Diode</programlisting>
 
 	<para>The cathodes of the diodes are connected to a common
 	  point, together with a 1 kOhm pull-down resistor.  It is
 	  essential to connect the resistor to ground to avoid
 	  floating of the IRQ line on the bus.</para>
 
 	<para>Now we are ready to configure a kernel.  Staying with
 	  this example, we would configure:</para>
 
 	<programlisting># standard on-board COM1 port
 device          sio0    at isa? port "IO_COM1" flags 0x10
 # patched-up multi-I/O extension board
 options         COM_MULTIPORT
 device          sio1    at isa? port "IO_COM2" flags 0x205
 device          sio2    at isa? port "IO_COM3" flags 0x205 irq 3</programlisting>
 
 	<para>Note that the <literal>flags</literal> setting for
 	  <filename>sio1</filename> and
 	  <filename>sio2</filename> is truly essential; refer to
 	  &man.sio.4; for details.  (Generally, the
 	  <literal>2</literal> in the <literal>flags</literal>
 	  attribute refers to <filename>sio2</filename>, which holds
 	  the IRQ, and you surely want a <literal>5</literal> low
 	  nibble.)  With kernel verbose mode turned on, this should
 	  yield something similar to this:</para>
 
 	<screen>sio0: irq maps: 0x1 0x11 0x1 0x1
 sio0 at 0x3f8-0x3ff irq 4 flags 0x10 on isa
 sio0: type 16550A
 sio1: irq maps: 0x1 0x9 0x1 0x1
 sio1 at 0x2f8-0x2ff flags 0x205 on isa
 sio1: type 16550A (multiport)
 sio2: irq maps: 0x1 0x9 0x1 0x1
 sio2 at 0x3e8-0x3ef irq 3 flags 0x205 on isa
 sio2: type 16550A (multiport master)</screen>
 
 	<para>Though <filename>/sys/i386/isa/sio.c</filename> is
 	  somewhat cryptic with its use of the <quote>irq maps</quote>
 	  array above, the basic idea is that you observe
 	  <literal>0x1</literal> in the first, third, and fourth
 	  place.  This means that the corresponding IRQ was set upon
 	  output and cleared after, which is just what we would
 	  expect. If your kernel does not display this behavior, most
 	  likely there is something wrong with your wiring.</para>
       </sect2>
     </sect1>
 
     <sect1 xml:id="cy">
       <title>Configuring the <filename>cy</filename> driver</title>
 
       <para><emphasis>Contributed by Alex Nash.  6 June
         1996.</emphasis></para>
 
       <para>The Cyclades multiport cards use the
 	<filename>cy</filename> driver instead of the usual
 	<filename>sio</filename> driver used by other multiport
 	cards.  Configuration is a simple matter of:</para>
 
 	<procedure>
 	  <step>
 	    <para>Add the <filename>cy</filename> device to your
 	      kernel configuration (note that your irq and iomem
 	      settings may differ).</para>
 
 	    <programlisting>device cy0 at isa? irq 10 iomem 0xd4000 iosiz 0x2000</programlisting>
 	  </step>
 
 	  <step>
 	    <para>Rebuild and install the new kernel.</para>
 	  </step>
 
 	  <step>
 	    <para>Make the device nodes by typing (the following
 	      example assumes an 8-port board)<footnote>
 	        <para>You can omit this part if you are running FreeBSD&nbsp;5.X
 		  with &man.devfs.5;.</para>
 	      </footnote>:</para>
 
 	    <screen>&prompt.root; <userinput>cd /dev</userinput>
 &prompt.root; <userinput>for i in 0 1 2 3 4 5 6 7;do ./MAKEDEV cuac$i ttyc$i;done</userinput></screen>
 	  </step>
 
 	  <step>
 	    <para>If appropriate, add dialup entries to
 	      <filename>/etc/ttys</filename> by duplicating serial
 	      device (<literal>ttyd</literal>) entries and using
 	      <literal>ttyc</literal> in place of
 	      <literal>ttyd</literal>.  For example:</para>
 
 	    <programlisting>ttyc0   "/usr/libexec/getty std.38400"  unknown on insecure
 ttyc1   "/usr/libexec/getty std.38400"  unknown on insecure
 ttyc2   "/usr/libexec/getty std.38400"  unknown on insecure
 &hellip;
 ttyc7   "/usr/libexec/getty std.38400"  unknown on insecure</programlisting>
 	  </step>
 
 	  <step>
 	    <para>Reboot with the new kernel.</para>
 	  </step>
 	</procedure>
     </sect1>
 
     <sect1>
       <title>Configuring the <filename>si</filename> driver</title>
 
       <para><emphasis>Contributed by &a.nsayer.email;. 25 March
 	1998.</emphasis></para>
 
       <para>The Specialix SI/XIO and SX multiport cards use the
 	<filename>si</filename> driver. A single machine can have
 	up to 4 host cards. The following host cards are
 	supported:</para>
 
 	<itemizedlist>
 	  <listitem><para>ISA SI/XIO host card (2 versions)</para></listitem>
 	  <listitem><para>EISA SI/XIO host card</para></listitem>
 	  <listitem><para>PCI SI/XIO host card</para></listitem>
 	  <listitem><para>ISA SX host card</para></listitem>
 	  <listitem><para>PCI SX host card</para></listitem>
 	</itemizedlist>
 
       <para>Although the SX and SI/XIO host cards look markedly
 	different, their functionality is basically the same. The
 	host cards do not use I/O locations, but instead require a 32K
 	chunk of memory. The factory configuration for ISA cards
 	places this at <literal>0xd0000-0xd7fff</literal>.  They also
 	require an IRQ. PCI cards will, of course, auto-configure
 	themselves.</para>
 
       <para>You can attach up to 4 external modules to each host
 	card. The external modules contain either 4 or 8 serial
 	ports. They come in the following varieties:</para>
 
 	<itemizedlist>
 	  <listitem><para>SI 4 or 8 port modules. Up to 57600 bps on each port
 	      supported.</para></listitem>
 
 	  <listitem><para>XIO 8 port modules. Up to 115200 bps on each port
 	      supported. One type of XIO module has 7 serial and 1 parallel
 	      port.</para></listitem>
 
 	  <listitem><para>SXDC 8 port modules. Up to 921600 bps on each port
 	      supported. Like XIO, a module is available with one parallel
 	      port as well.</para></listitem>
 	</itemizedlist>
 
       <para>To configure an ISA host card, add the following line to
 	your kernel configuration file, changing the numbers as
 	appropriate:</para>
 
       <programlisting>device si0 at isa? iomem 0xd0000 irq 11</programlisting>
 
       <para>Valid IRQ numbers are 9, 10, 11, 12 and 15 for SX ISA host
 	cards and 11, 12 and 15 for SI/XIO ISA host cards.</para>
 
       <para>To configure an EISA or PCI host card, use this line:</para>
 
       <programlisting>device si0</programlisting>
 
       <para>After adding the configuration entry, rebuild and
 	install your new kernel.</para>
 
       <note>
       <para>The following step is not necessary if you are using
           &man.devfs.5; in FreeBSD&nbsp;5.<replaceable>X</replaceable>.</para>
       </note>
 
       <para>After rebooting with the new kernel, you need to make the
 	device nodes in <filename>/dev</filename>.  The <filename>MAKEDEV</filename> script
 	will take care of this for you.  Count how many total ports
 	you have and type:</para>
 
       <screen>&prompt.root; <userinput>cd /dev</userinput>
 &prompt.root; <userinput>./MAKEDEV ttyA<replaceable>nn</replaceable> cuaA<replaceable>nn</replaceable></userinput></screen>
 
       <para>(where <replaceable>nn</replaceable> is the number of
 	ports)</para>
 
       <para>If you want login prompts to appear on these ports, you
 	will need to add lines like this to
 	<filename>/etc/ttys</filename>:</para>
 
       <programlisting>ttyA01  "/usr/libexec/getty std.9600"   vt100   on insecure</programlisting>
 
       <para>Change the terminal type as appropriate. For modems,
 	<userinput>dialup</userinput> or
 	<userinput>unknown</userinput> is fine.</para>
 
   </sect1>
 
 </article>
diff --git a/en_US.ISO8859-1/articles/solid-state/article.xml b/en_US.ISO8859-1/articles/solid-state/article.xml
index 232a5d59f4..8de18f207c 100644
--- a/en_US.ISO8859-1/articles/solid-state/article.xml
+++ b/en_US.ISO8859-1/articles/solid-state/article.xml
@@ -1,498 +1,498 @@
 <?xml version="1.0" encoding="iso-8859-1"?>
 <!DOCTYPE article PUBLIC "-//FreeBSD//DTD DocBook XML V5.0-Based Extension//EN"
 	"http://www.FreeBSD.org/XML/share/xml/freebsd50.dtd">
 <!-- Copyright (c) 2001 The FreeBSD Documentation Project
 
      Redistribution and use in source (SGML DocBook) and 'compiled' forms
      (SGML, HTML, PDF, PostScript, RTF and so forth) with or without
      modification, are permitted provided that the following conditions
      are met:
 
       1. Redistributions of source code (SGML DocBook) must retain the above
          copyright notice, this list of conditions and the following
          disclaimer as the first lines of this file unmodified.
 
       2. Redistributions in compiled form (transformed to other DTDs,
          converted to PDF, PostScript, RTF and other formats) must reproduce
          the above copyright notice, this list of conditions and the
          following disclaimer in the documentation and/or other materials
          provided with the distribution.
 
      THIS DOCUMENTATION IS PROVIDED BY THE FREEBSD DOCUMENTATION PROJECT "AS
      IS" AND ANY EXPRESS OR IMPLIED WARRANTIES, INCLUDING, BUT NOT LIMITED TO,
      THE IMPLIED WARRANTIES OF MERCHANTABILITY AND FITNESS FOR A PARTICULAR
      PURPOSE ARE DISCLAIMED. IN NO EVENT SHALL NIK CLAYTON BE LIABLE FOR ANY
      DIRECT, INDIRECT, INCIDENTAL, SPECIAL, EXEMPLARY, OR CONSEQUENTIAL
      DAMAGES (INCLUDING, BUT NOT LIMITED TO, PROCUREMENT OF SUBSTITUTE GOODS
      OR SERVICES; LOSS OF USE, DATA, OR PROFITS; OR BUSINESS INTERRUPTION)
      HOWEVER CAUSED AND ON ANY THEORY OF LIABILITY, WHETHER IN CONTRACT,
      STRICT LIABILITY, OR TORT (INCLUDING NEGLIGENCE OR OTHERWISE) ARISING IN
      ANY WAY OUT OF THE USE OF THIS DOCUMENTATION, EVEN IF ADVISED OF THE
      POSSIBILITY OF SUCH DAMAGE.
 
      $FreeBSD$
 -->
 <article xmlns="http://docbook.org/ns/docbook"
   xmlns:xlink="http://www.w3.org/1999/xlink" version="5.0"
   xml:lang="en"> <info>
     <title>&os; and Solid State Devices</title>
 
     <authorgroup>
       <author>
 	<personname>
 	  <firstname>John</firstname>
 	  <surname>Kozubik</surname>
 	</personname>
 	<affiliation>
 	  <address>
 	    <email>john@kozubik.com</email>
 	  </address>
 	</affiliation>
       </author>
     </authorgroup>
 
     <copyright>
       <year>2001</year>
       <year>2009</year>
       <holder>The FreeBSD Documentation Project</holder>
     </copyright>
 
     <legalnotice xml:id="trademarks" role="trademarks">
       &tm-attrib.freebsd;
       &tm-attrib.general;
     </legalnotice>
 
     &legalnotice;
 
     <pubdate>$FreeBSD$</pubdate>
 
     <releaseinfo>$FreeBSD$</releaseinfo>
 
     <abstract>
       <para>This article covers the use of solid state disk devices in
 	&os; to create embedded systems.</para>
 
       <para>Embedded systems have the advantage of increased stability
 	due to the lack of integral moving parts (hard drives).
 	Account must be taken, however, for the generally low disk
 	space available in the system and the durability of the
 	storage medium.</para>
 
       <para>Specific topics to be covered include the types and
 	attributes of solid state media suitable for disk use in &os;,
 	kernel options that are of interest in such an environment,
 	the <filename>rc.initdiskless</filename> mechanisms that
 	automate the initialization of such systems and the need for
 	read-only filesystems, and building filesystems from scratch.
 	The article will conclude with some general strategies for
 	small and read-only &os; environments.</para>
     </abstract>
   </info>
 
   <sect1 xml:id="intro">
     <title>Solid State Disk Devices</title>
 
     <para>The scope of this article will be limited to solid state
       disk devices made from flash memory.  Flash memory is a solid
       state memory (no moving parts) that is non-volatile (the memory
       maintains data even after all power sources have been
       disconnected).  Flash memory can withstand tremendous physical
       shock and is reasonably fast (the flash memory solutions covered
       in this article are slightly slower than an EIDE hard disk for
       write operations, and much faster for read operations).  One
       very important aspect of flash memory, the ramifications of
       which will be discussed later in this article, is that each
       sector has a limited rewrite capacity.  You can only write,
       erase, and write again to a sector of flash memory a certain
       number of times before the sector becomes permanently unusable.
       Although many flash memory products automatically map bad
       blocks, and although some even distribute write operations
       evenly throughout the unit, the fact remains that there exists a
       limit to the amount of writing that can be done to the device.
       Competitive units have between 1,000,000 and 10,000,000 writes
       per sector in their specification.  This figure varies with
       the temperature of the environment.</para>
 
     <para>Specifically, we will be discussing ATA compatible
       compact-flash units, which are quite popular as storage media
       for digital cameras.  Of particular interest is the fact that
       they pin out directly to the IDE bus and are compatible with the
       ATA command set.  Therefore, with a very simple and low-cost
       adaptor, these devices can be attached directly to an IDE bus in
       a computer.  Once implemented in this manner, operating systems
       such as &os; see the device as a normal hard disk (albeit
       small).</para>
 
     <para>Other solid state disk solutions do exist, but their
       expense, obscurity, and relative difficulty of use place them
       beyond the scope of this article.</para>
   </sect1>
 
   <sect1 xml:id="kernel">
     <title>Kernel Options</title>
 
     <para>A few kernel options are of specific interest to those
       creating an embedded &os; system.</para>
 
     <para>All embedded &os; systems that use flash memory as a system
       disk will be interested in memory disks and memory filesystems.
-      Because of the limited number of writes that can be done to
+      As a result of the limited number of writes that can be done to
       flash memory, the disk and the filesystems on the disk will most
       likely be mounted read-only.  In this environment, filesystems
       such as <filename>/tmp</filename> and <filename>/var</filename>
       are mounted as memory filesystems to allow the system to create
       logs and update counters and temporary files.  Memory
       filesystems are a critical component to a successful solid state
       &os; implementation.</para>
 
     <para>You should make sure the following lines exist in your
       kernel configuration file:</para>
 
     <programlisting>options         MFS             # Memory Filesystem
 options         MD_ROOT         # md device usable as a potential root device
 pseudo-device   md              # memory disk</programlisting>
   </sect1>
 
   <sect1 xml:id="ro-fs">
     <title>The <literal>rc</literal> Subsystem and Read-Only
       Filesystems</title>
 
     <para>The post-boot initialization of an embedded &os; system is
       controlled by <filename>/etc/rc.initdiskless</filename>.</para>
 
     <para><filename>/etc/rc.d/var</filename> mounts
       <filename>/var</filename> as a memory filesystem, makes a
       configurable list of directories in <filename>/var</filename>
       with the &man.mkdir.1; command, and changes modes on some of
       those directories.  In the execution of
       <filename>/etc/rc.d/var</filename>, one other
       <filename>rc.conf</filename> variable comes into play &ndash;
       <literal>varsize</literal>.  A <filename>/var</filename>
       partition is created by <filename>/etc/rc.d/var</filename> based
       on the value of this variable in
       <filename>rc.conf</filename>:</para>
 
     <programlisting>varsize=8192</programlisting>
 
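     <para>Remember that this value is in sectors by default.  Assuming
       512-byte sectors, the <literal>8192</literal> above corresponds
       to a 4&nbsp;MB <filename>/var</filename>.</para>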
 
     <para>The fact that <filename>/var</filename> is a read-write
       filesystem is an important distinction, as the
       <filename>/</filename> partition (and any other partitions you
       may have on your flash media) should be mounted read-only.
       Remember that in <xref linkend="intro"/> we detailed the
       limitations of flash memory, specifically the limited write
       capability.  The importance of not mounting filesystems on flash
       media read-write, and the importance of not using a swap file,
       cannot be overstated.  A swap file on a busy system can burn
       through a piece of flash media in less than one year.  Heavy
       logging or temporary file creation and destruction can do the
       same.  Therefore, in addition to removing the
       <literal>swap</literal> entry from your
       <filename>/etc/fstab</filename>, you should also change the
       Options field for each filesystem to <literal>ro</literal> as
       follows:</para>
 
     <programlisting># Device                Mountpoint      FStype  Options         Dump    Pass#
 /dev/ad0s1a             /               ufs     ro              1       1</programlisting>
 
     <para>A few applications in the average system will immediately
       begin to fail as a result of this change.  For instance, cron
       will not run properly as a result of missing cron tabs in the
       <filename>/var</filename> created by
       <filename>/etc/rc.d/var</filename>, and syslog and dhcp will
       encounter problems as well as a result of the read-only
       filesystem and missing items in the <filename>/var</filename>
       that <filename>/etc/rc.d/var</filename> has created.  These are
       only temporary problems though, and are addressed, along with
       solutions for running other common software packages, in
       <xref linkend="strategies"/>.</para>
 
     <para>An important thing to remember is that a filesystem that was
       mounted read-only with <filename>/etc/fstab</filename> can be
       made read-write at any time by issuing the command:</para>
 
     <screen>&prompt.root; <userinput>/sbin/mount -uw <replaceable>partition</replaceable></userinput></screen>
 
     <para>and can be toggled back to read-only with the
       command:</para>
 
     <screen>&prompt.root; <userinput>/sbin/mount -ur <replaceable>partition</replaceable></userinput></screen>
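 
     <para>Put together, a maintenance session on the root filesystem
       might look like the following sketch.  The choice of
       <filename>/</filename> as the partition and the file being
       edited are only examples:</para>
 
     <screen>&prompt.root; <userinput>/sbin/mount -uw /</userinput>
 &prompt.root; <userinput>vi /etc/rc.conf</userinput>
 &prompt.root; <userinput>/sbin/mount -ur /</userinput></screen>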
   </sect1>
 
   <sect1>
     <title>Building a File System from Scratch</title>
 
-    <para>Because ATA compatible compact-flash cards are seen by &os;
+    <para>Since ATA compatible compact-flash cards are seen by &os;
       as normal IDE hard drives, you could theoretically install &os;
       from the network using the kern and mfsroot floppies or from a
       CD.</para>
 
     <para>However, even a small installation of &os; using normal
       installation procedures can produce a system that takes up more
-      than 200 megabytes.  Because most people will be using smaller
+      than 200 megabytes.  Most people will be using smaller
       flash memory devices (128 megabytes is considered fairly large -
-      32 or even 16 megabytes is common) an installation using normal
+      32 or even 16 megabytes is common), so an installation using normal
       mechanisms is not possible&mdash;there is simply not enough disk
       space for even the smallest of conventional
       installations.</para>
 
     <para>The easiest way to overcome this space limitation is to
       install &os; using conventional means to a normal hard disk.
       After the installation is complete, pare down the operating
       system to a size that will fit onto your flash media, then tar
       the entire filesystem.  The following steps will guide you
       through the process of preparing a piece of flash memory for
       your tarred filesystem.  Remember, because a normal installation
       is not being performed, operations such as partitioning,
       labeling, file-system creation, etc. need to be performed by
       hand.  In addition to the kern and mfsroot floppy disks, you
       will also need to use the fixit floppy.</para>
 
     <procedure>
       <step>
 	<title>Partitioning Your Flash Media Device</title>
 
 	<para>After booting with the kern and mfsroot floppies, choose
 	  <literal>custom</literal> from the installation menu.  In
 	  the custom installation menu, choose
 	  <literal>partition</literal>.  In the partition menu, you
 	  should delete all existing partitions using
 	  <keycap>d</keycap>.  After deleting all existing
 	  partitions, create a partition using <keycap>c</keycap>
 	  and accept the default value for the size of the
 	  partition.  When asked for the type of the partition, make
 	  sure the value is set to <literal>165</literal>.  Now write
 	  this partition table to the disk by pressing
 	  <keycap>w</keycap> (this is a hidden option on this
 	  screen).  If you are using an ATA compatible compact flash
 	  card, you should choose the &os; Boot Manager.  Now press
 	  <keycap>q</keycap> to quit the partition menu.  You
 	  will be shown the boot manager menu once more - repeat the
 	  choice you made earlier.</para>
       </step>
 
       <step>
 	<title>Creating Filesystems on Your Flash Memory
 	  Device</title>
 
 	<para>Exit the custom installation menu, and from the main
 	  installation menu choose the <literal>fixit</literal>
 	  option.  After entering the fixit environment, enter the
 	  following command:</para>
 
 	<screen>&prompt.root; <userinput>disklabel -e /dev/ad0c</userinput></screen>
 
 	<para>At this point you will have entered the vi editor under
 	  the auspices of the disklabel command.  Next, you need to
 	  add an <literal>a:</literal> line at the end of the file.
 	  This <literal>a:</literal> line should look like:</para>
 
 	<programlisting>a:      <replaceable>123456</replaceable>  0       4.2BSD  0       0</programlisting>
 
 	<para>Where <replaceable>123456</replaceable> is a number that
 	  is exactly the same as the number in the existing
 	  <literal>c:</literal> entry for size.  Basically you are
 	  duplicating the existing <literal>c:</literal> line as an
 	  <literal>a:</literal> line, making sure that fstype is
 	  <literal>4.2BSD</literal>.  Save the file and exit.</para>
 
 	<screen>&prompt.root; <userinput>disklabel -B -r /dev/ad0c</userinput>
 &prompt.root; <userinput>newfs /dev/ad0a</userinput></screen>
       </step>
 
       <step>
 	<title>Placing Your Filesystem on the Flash Media</title>
 
 	<para>Mount the newly prepared flash media:</para>
 
 	<screen>&prompt.root; <userinput>mount /dev/ad0a /flash</userinput></screen>
 
 	<para>Bring this machine up on the network so we may transfer
 	  our tar file and explode it onto our flash media filesystem.
 	  One example of how to do this is:</para>
 
 	<screen>&prompt.root; <userinput>ifconfig xl0 192.168.0.10 netmask 255.255.255.0</userinput>
 &prompt.root; <userinput>route add default 192.168.0.1</userinput></screen>
 
 	<para>Now that the machine is on the network, transfer your
 	  tar file.  You may be faced with a bit of a dilemma at this
 	  point - if your flash memory part is 128 megabytes, for
 	  instance, and your tar file is larger than 64 megabytes, you
 	  cannot have your tar file on the flash media at the same
 	  time as you explode it - you will run out of
 	  space.  One solution to this problem, if you are using FTP,
 	  is to untar the file while it is transferred over FTP.  If
 	  you perform your transfer in this manner, you will never
 	  have the tar file and the tar contents on your disk at the
 	  same time:</para>
 
 	<screen><prompt>ftp&gt;</prompt> <userinput>get tarfile.tar "| tar xvf -"</userinput></screen>
 
 	<para>If your tarfile is gzipped, you can accomplish this as
 	  well:</para>
 
 	<screen><prompt>ftp&gt;</prompt> <userinput>get tarfile.tar "| zcat | tar xvf -"</userinput></screen>
 
 	<para>After the contents of your tarred filesystem are on your
 	  flash memory filesystem, you can unmount the flash memory
 	  and reboot:</para>
 
 	<screen>&prompt.root; <userinput>cd /</userinput>
 &prompt.root; <userinput>umount /flash</userinput>
 &prompt.root; <userinput>exit</userinput></screen>
 
 	<para>Assuming that you configured your filesystem correctly
 	  when it was built on the normal hard disk (with your
 	  filesystems mounted read-only, and with the necessary
 	  options compiled into the kernel) you should now be
 	  successfully booting your &os; embedded system.</para>
       </step>
     </procedure>
   </sect1>
 
   <sect1 xml:id="strategies">
     <title>System Strategies for Small and Read-Only
       Environments</title>
 
     <para>In <xref linkend="ro-fs"/>, it was pointed out that the
       <filename>/var</filename> filesystem constructed by
       <filename>/etc/rc.d/var</filename> and the presence of a
       read-only root filesystem cause problems with many common
       software packages used with &os;.  In this section, suggestions
       for successfully running cron, syslog, ports installations, and
       the Apache web server will be provided.</para>
 
     <sect2>
       <title>Cron</title>
 
       <para>Upon boot, <filename>/var</filename> gets populated by
 	<filename>/etc/rc.d/var</filename> using the list from
 	<filename>/etc/mtree/BSD.var.dist</filename>, so the
 	<filename>cron</filename>, <filename>cron/tabs</filename>,
 	<filename>at</filename>, and a few other standard directories
 	get created.</para>
 
       <para>However, this does not solve the problem of maintaining
 	cron tabs across reboots.  When the system reboots, the
 	<filename>/var</filename> filesystem that is in memory will
 	disappear and any cron tabs you may have had in it will also
 	disappear.  Therefore, one solution would be to create cron
 	tabs for the users that need them, mount your
 	<filename>/</filename> filesystem as read-write and copy those
 	cron tabs to somewhere safe, like
 	<filename>/etc/tabs</filename>, then add a line to the end of
 	<filename>/etc/rc.initdiskless</filename> that copies those
 	cron tabs into <filename>/var/cron/tabs</filename> after that
 	directory has been created during system initialization.  You
 	may also need to add a line that changes modes and permissions
 	on the directories you create and the files you copy with
 	<filename>/etc/rc.initdiskless</filename>.</para>
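 
       <para>As a rough sketch, the lines added to the end of
 	<filename>/etc/rc.initdiskless</filename> could look like
 	this.  The <filename>/etc/tabs</filename> location follows
 	the example above; the modes shown are assumptions and should
 	be checked against a conventionally installed system:</para>
 
       <programlisting># Restore saved cron tabs into the memory-backed /var.
 # /etc/tabs is the example location; the modes are assumptions.
 if [ -d /etc/tabs ]; then
 	cp -p /etc/tabs/* /var/cron/tabs/
 	chmod 0700 /var/cron/tabs
 	chmod 0600 /var/cron/tabs/*
 fi</programlisting>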
     </sect2>
 
     <sect2>
       <title>Syslog</title>
 
       <para><filename>syslog.conf</filename> specifies the locations
 	of certain log files that exist in
 	<filename>/var/log</filename>.  These files are not created by
 	<filename>/etc/rc.d/var</filename> upon system initialization.
 	Therefore, somewhere in <filename>/etc/rc.d/var</filename>,
 	after the section that creates the directories in
 	<filename>/var</filename>, you will need to add something like
 	this:</para>
 
       <screen>&prompt.root; <userinput>touch /var/log/security /var/log/maillog /var/log/cron /var/log/messages</userinput>
 &prompt.root; <userinput>chmod 0644 /var/log/*</userinput></screen>
     </sect2>
 
     <sect2>
       <title>Ports Installation</title>
 
       <para>Before discussing the changes necessary to successfully
 	use the ports tree, a reminder is necessary regarding the
 	read-only nature of your filesystems on the flash media.
 	Since they are read-only, you will need to temporarily mount
 	them read-write using the mount syntax shown in <xref
 	  linkend="ro-fs"/>.  You should always remount those
 	filesystems read-only when you are done with any maintenance -
 	unnecessary writes to the flash media could considerably
 	shorten its lifespan.</para>
 
       <para>To make it possible to enter a ports directory and
 	successfully run <command>make</command>
 	<buildtarget>install</buildtarget>, we must create a packages
 	directory on a non-memory filesystem that will keep track of
-	our packages across reboots.  Because it is necessary to mount
+	our packages across reboots.  As it is necessary to mount
 	your filesystems as read-write for the installation of a
 	package anyway, it is sensible to assume that an area on the
 	flash media can also be used to store package
 	information.</para>
 
       <para>First, create a package database directory.  This is
 	normally in <filename>/var/db/pkg</filename>, but we cannot
 	place it there as it will disappear every time the system is
 	booted.</para>
 
       <screen>&prompt.root; <userinput>mkdir /etc/pkg</userinput></screen>
 
       <para>Now, add a line to <filename>/etc/rc.d/var</filename> that
 	links the <filename>/etc/pkg</filename> directory to
 	<filename>/var/db/pkg</filename>.  An example:</para>
 
       <screen>&prompt.root; <userinput>ln -s /etc/pkg /var/db/pkg</userinput></screen>
 
       <para>Now, any time that you mount your filesystems as
 	read-write and install a package, the <command>make</command>
 	<buildtarget>install</buildtarget> will work, and package
 	information will be written successfully to
 	<filename>/etc/pkg</filename> (because the filesystem will, at
 	that time, be mounted read-write) which will always be
 	available to the operating system as
 	<filename>/var/db/pkg</filename>.</para>
     </sect2>
 
     <sect2>
       <title>Apache Web Server</title>
 
       <note>
 	<para>The steps in this section are only necessary if Apache
 	  is set up to write its pid or log information outside of
 	  <filename>/var</filename>.  By default, Apache keeps its pid
 	  file in <filename>/var/run/httpd.pid</filename> and its log
 	  files in <filename>/var/log</filename>.</para>
       </note>
 
       <para>It is now assumed that Apache keeps its log files in a
 	directory
 	<filename><replaceable>apache_log_dir</replaceable></filename>
 	outside of <filename>/var</filename>.  When this directory
 	lives on a read-only filesystem, Apache will not be able to
 	save any log files, and may have problems working.  If so, it
 	is necessary to add a new directory to the list of directories
 	in <filename>/etc/rc.d/var</filename> to create in
 	<filename>/var</filename>, and to link
 	<filename><replaceable>apache_log_dir</replaceable></filename>
 	to <filename>/var/log/apache</filename>.  It is also necessary
 	to set permissions and ownership on this new directory.</para>
 
       <para>First, add the directory <literal>log/apache</literal> to
 	the list of directories to be created in
 	<filename>/etc/rc.d/var</filename>.</para>
 
       <para>Second, add these commands to
 	<filename>/etc/rc.d/var</filename> after the directory
 	creation section:</para>
 
       <screen>&prompt.root; <userinput>chmod 0774 /var/log/apache</userinput>
 &prompt.root; <userinput>chown nobody:nobody /var/log/apache</userinput></screen>
 
       <para>Finally, remove the existing
 	<filename><replaceable>apache_log_dir</replaceable></filename>
 	directory, and replace it with a link:</para>
 
       <screen>&prompt.root; <userinput>rm -rf <replaceable>apache_log_dir</replaceable></userinput>
 &prompt.root; <userinput>ln -s /var/log/apache <replaceable>apache_log_dir</replaceable></userinput></screen>
     </sect2>
   </sect1>
 </article>
diff --git a/en_US.ISO8859-1/articles/vm-design/article.xml b/en_US.ISO8859-1/articles/vm-design/article.xml
index 2cf7e001eb..79b56d296c 100644
--- a/en_US.ISO8859-1/articles/vm-design/article.xml
+++ b/en_US.ISO8859-1/articles/vm-design/article.xml
@@ -1,899 +1,899 @@
 <?xml version="1.0" encoding="iso-8859-1"?>
 <!DOCTYPE article PUBLIC "-//FreeBSD//DTD DocBook XML V5.0-Based Extension//EN"
 	"http://www.FreeBSD.org/XML/share/xml/freebsd50.dtd">
 <!-- $FreeBSD$ -->
 <!-- FreeBSD Documentation Project -->
 <article xmlns="http://docbook.org/ns/docbook" xmlns:xlink="http://www.w3.org/1999/xlink" version="5.0" xml:lang="en">
   <info><title>Design elements of the &os; VM system</title>
     
 
     <authorgroup>
       <author><personname><firstname>Matthew</firstname><surname>Dillon</surname></personname><affiliation>
 	  <address>
 	    <email>dillon@apollo.backplane.com</email>
 	  </address>
 	</affiliation></author>
     </authorgroup>
 
     <legalnotice xml:id="trademarks" role="trademarks">
       &tm-attrib.freebsd;
       &tm-attrib.linux;
       &tm-attrib.microsoft;
       &tm-attrib.opengroup;
       &tm-attrib.general;
     </legalnotice>
 
     <pubdate>$FreeBSD$</pubdate>
 
     <releaseinfo>$FreeBSD$</releaseinfo>
 
     <abstract>
       <para>The title is really just a fancy way of saying that I am going to
 	attempt to describe the whole VM enchilada, hopefully in a way that
 	everyone can follow.  For the last year I have concentrated on a number
 	of major kernel subsystems within &os;, with the VM and Swap
 	subsystems being the most interesting and NFS being <quote>a necessary
 	chore</quote>.  I rewrote only small portions of the code.  In the VM
 	arena the only major rewrite I have done is to the swap subsystem.
 	Most of my work was cleanup and maintenance, with only moderate code
 	rewriting and no major algorithmic adjustments within the VM
 	subsystem.  The bulk of the VM subsystem's theoretical base remains
 	unchanged and a lot of the credit for the modernization effort in the
 	last few years belongs to John Dyson and David Greenman.  Not being a
 	historian like Kirk I will not attempt to tag all the various features
 	with people's names, since I will invariably get it wrong.</para>
     </abstract>
 
     <legalnotice xml:id="legalnotice">
       <para>This article was originally published in the January 2000 issue of
 	<link xlink:href="http://www.daemonnews.org/">DaemonNews</link>.  This
 	version of the article may include updates from Matt and other authors
 	to reflect changes in &os;'s VM implementation.</para>
     </legalnotice>
   </info>
 
   <sect1 xml:id="introduction">
     <title>Introduction</title>
 
     <para>Before moving along to the actual design let's spend a little time
       on the necessity of maintaining and modernizing any long-living
       codebase.  In the programming world, algorithms tend to be more
       important than code and it is precisely due to BSD's academic roots that
       a great deal of attention was paid to algorithm design from the
       beginning.  More attention paid to the design generally leads to a clean
       and flexible codebase that can be fairly easily modified, extended, or
       replaced over time.  While BSD is considered an <quote>old</quote>
       operating system by some people, those of us who work on it tend to view
       it more as a <quote>mature</quote> codebase which has various components
       modified, extended, or replaced with modern code.  It has evolved, and
       &os; is at the bleeding edge no matter how old some of the code might
       be.  This is an important distinction to make and one that is
       unfortunately lost to many people.  The biggest error a programmer can
       make is to not learn from history, and this is precisely the error that
       many other modern operating systems have made.  &windowsnt; is the best example
       of this, and the consequences have been dire.  Linux also makes this
       mistake to some degree&mdash;enough that we BSD folk can make small
       jokes about it every once in a while, anyway.  Linux's problem is simply
       one of a lack of experience and history to compare ideas against, a
       problem that is easily and rapidly being addressed by the Linux
       community in the same way it has been addressed in the BSD
       community&mdash;by continuous code development.  The &windowsnt; folk, on the
       other hand, repeatedly make the same mistakes solved by &unix; decades ago
       and then spend years fixing them. Over and over again.  They have a
       severe case of <quote>not designed here</quote> and <quote>we are always
       right because our marketing department says so</quote>.  I have little
       tolerance for anyone who cannot learn from history.</para>
 
     <para>Much of the apparent complexity of the &os; design, especially in
       the VM/Swap subsystem, is a direct result of having to solve serious
       performance issues that occur under various conditions.  These issues
       are not due to bad algorithmic design but instead arise from
       environmental factors.  In any direct comparison between platforms,
       these issues become most apparent when system resources begin to get
       stressed.  As I describe &os;'s VM/Swap subsystem the reader should
       always keep two points in mind:</para>
 
     <orderedlist>
       <listitem>
         <para>The most important aspect of performance design is what is
           known as <quote>Optimizing the Critical Path</quote>.  It is often
           the case that performance optimizations add a little bloat to the
           code in order to make the critical path perform better.</para>
       </listitem>
 
       <listitem>
         <para>A solid, generalized design outperforms a heavily-optimized
           design over the long run.  While a generalized design may end up
           being slower than a heavily-optimized design when they are
           first implemented, the generalized design tends to be easier to
           adapt to changing conditions and the heavily-optimized design
           winds up having to be thrown away.</para>
       </listitem>
     </orderedlist>
 
     <para>Any codebase that will survive and be maintainable for
       years must therefore be designed properly from the beginning even if it
       costs some performance.  Twenty years ago people were still arguing that
       programming in assembly was better than programming in a high-level
       language because it produced code that was ten times as fast.  Today,
       the fallacy of that argument is obvious, as are
       the parallels to algorithmic design and code generalization.</para>
   </sect1>
 
   <sect1 xml:id="vm-objects">
     <title>VM Objects</title>
 
     <para>The best way to begin describing the &os; VM system is to look at
       it from the perspective of a user-level process.  Each user process sees
       a single, private, contiguous VM address space containing several types
       of memory objects.  These objects have various characteristics.  Program
       code and program data are effectively a single memory-mapped file (the
       binary file being run), but program code is read-only while program data
       is copy-on-write.  Program BSS is just memory allocated and filled with
       zeros on demand, called demand zero page fill.  Arbitrary files can be
       memory-mapped into the address space as well, which is how the shared
       library mechanism works.  Such mappings can require modifications to
       remain private to the process making them.  The fork system call adds an
       entirely new dimension to the VM management problem on top of the
       complexity already given.</para>
 
     <para>A program binary data page (which is a basic copy-on-write page)
       illustrates the complexity.  A program binary contains a preinitialized
       data section which is initially mapped directly from the program file.
       When a program is loaded into a process's VM space, this area is
       initially memory-mapped and backed by the program binary itself,
       allowing the VM system to free/reuse the page and later load it back in
       from the binary.  The moment a process modifies this data, however, the
       VM system must make a private copy of the page for that process.  Since
       the private copy has been modified, the VM system may no longer free it,
       because there is no longer any way to restore it later on.</para>
 
     <para>You will notice immediately that what was originally a simple file
       mapping has become much more complex.  Data may be modified on a
       page-by-page basis whereas the file mapping encompasses many pages at
       once.  The complexity further increases when a process forks.  When a
       process forks, the result is two processes&mdash;each with its own
       private address space, including any modifications made by the original
       process prior to the call to <function>fork()</function>.  It would be
       silly for the VM system to make a complete copy of the data at the time
       of the <function>fork()</function> because it is quite possible that at
       least one of the two processes will only need to read from that page
       from then on, allowing the original page to continue to be used.  What
       was a private page is made copy-on-write again, since each process
       (parent and child) expects their own personal post-fork modifications to
       remain private to themselves and not affect the other.</para>
 
     <para>&os; manages all of this with a layered VM Object model.  The
       original binary program file winds up being the lowest VM Object layer.
       A copy-on-write layer is pushed on top of that to hold those pages which
       had to be copied from the original file.  If the program modifies a data
       page belonging to the original file the VM system takes a fault and
       makes a copy of the page in the higher layer.  When a process forks,
       additional VM Object layers are pushed on.  This might make a little
       more sense with a fairly basic example.  A <function>fork()</function>
       is a common operation for any *BSD system, so this example will consider
       a program that starts up, and forks.  When the process starts, the VM
       system creates an object layer, let's call this A:</para>
 
     <mediaobject>
       <imageobject>
         <imagedata fileref="fig1"/>
       </imageobject>
 
       <textobject>
 	<literallayout class="monospaced">+---------------+
 |       A       |
 +---------------+</literallayout>
       </textobject>
 
       <textobject>
 	<phrase>A picture</phrase>
       </textobject>
     </mediaobject>
 
     <para>A represents the file&mdash;pages may be paged in and out of the
       file's physical media as necessary.  Paging in from the disk is
       reasonable for a program, but we really do not want to page back out and
       overwrite the executable.  The VM system therefore creates a second
       layer, B, that will be physically backed by swap space:</para>
 
     <mediaobject>
       <imageobject>
         <imagedata fileref="fig2"/>
       </imageobject>
 
       <textobject>
 	<literallayout class="monospaced">+---------------+
 |       B       |
 +---------------+
 |       A       |
 +---------------+</literallayout>
       </textobject>
     </mediaobject>
 
     <para>On the first write to a page after this, a new page is created in B,
       and its contents are initialized from A.  All pages in B can be paged in
       or out to a swap device.  When the program forks, the VM system creates
       two new object layers&mdash;C1 for the parent, and C2 for the
       child&mdash;that rest on top of B:</para>
 
     <mediaobject>
       <imageobject>
         <imagedata fileref="fig3"/>
       </imageobject>
 
       <textobject>
 	<literallayout class="monospaced">+-------+-------+
 |   C1  |   C2  |
 +-------+-------+
 |       B       |
 +---------------+
 |       A       |
 +---------------+</literallayout>
       </textobject>
     </mediaobject>
 
     <para>In this case, let's say a page in B is modified by the original
       parent process.  The process will take a copy-on-write fault and
       duplicate the page in C1, leaving the original page in B untouched.
       Now, let's say the same page in B is modified by the child process.  The
       process will take a copy-on-write fault and duplicate the page in C2.
       The original page in B is now completely hidden since both C1 and C2
       have a copy and B could theoretically be destroyed if it does not
       represent a <quote>real</quote> file; however, this sort of optimization is not
       trivial to make because it is so fine-grained.  &os; does not make
       this optimization.  Now, suppose (as is often the case) that the child
       process does an <function>exec()</function>.  Its current address space
       is usually replaced by a new address space representing a new file.  In
       this case, the C2 layer is destroyed:</para>
 
     <mediaobject>
       <imageobject>
         <imagedata fileref="fig4"/>
       </imageobject>
 
       <textobject>
 	<literallayout class="monospaced">+-------+
 |   C1  |
 +-------+-------+
 |       B       |
 +---------------+
 |       A       |
 +---------------+</literallayout>
       </textobject>
     </mediaobject>
 
     <para>In this case, the number of children of B drops to one, and all
       accesses to B now go through C1.  This means that B and C1 can be
       collapsed together.  Any pages in B that also exist in C1 are deleted
       from B during the collapse.  Thus, even though the optimization in the
       previous step could not be made, we can recover the dead pages when
       either of the processes exits or calls <function>exec()</function>.</para>
 
     <para>This model creates a number of potential problems.  The first is that
       you can wind up with a relatively deep stack of layered VM Objects which
       can cost scanning time and memory when you take a fault.  Deep
       layering can occur when processes fork and then fork again (either
       parent or child).  The second problem is that you can wind up with dead,
       inaccessible pages deep in the stack of VM Objects.  In our last example
       if both the parent and child processes modify the same page, they both
       get their own private copies of the page and the original page in B is
       no longer accessible by anyone.  That page in B can be freed.</para>
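
     <para>The layering described above can be sketched as a toy model in C.
       This is only an illustration under simplifying assumptions, not the
       kernel's code: the names <literal>toy_object</literal>,
       <literal>toy_lookup</literal>, and <literal>toy_write</literal> are
       hypothetical, and pages are tiny fixed-size buffers.  A read walks
       down the chain of layers until one holds the page, and a write first
       copies the page into the topmost layer.</para>

     <programlisting><![CDATA[
```c
#include <assert.h>
#include <stddef.h>
#include <string.h>

#define NPAGES   4   /* pages per object in this toy model */
#define PAGESIZE 8   /* bytes per page in this toy model */

/* Hypothetical, simplified stand-in for one VM Object layer. */
struct toy_object {
    struct toy_object *backing;   /* next lower layer, or NULL */
    int present[NPAGES];          /* 1 if this layer holds the page */
    char data[NPAGES][PAGESIZE];
};

/* Read access: walk down the chain until some layer holds the page. */
const char *
toy_lookup(const struct toy_object *top, int pg)
{
    const struct toy_object *o;

    assert(pg >= 0 && pg < NPAGES);
    for (o = top; o != NULL; o = o->backing)
        if (o->present[pg])
            return o->data[pg];
    return NULL;                  /* no layer holds this page */
}

/* Write access: copy the page into the top layer, then modify the copy. */
void
toy_write(struct toy_object *top, int pg, const char *text)
{
    size_t n;

    assert(pg >= 0 && pg < NPAGES);
    if (!top->present[pg]) {
        const char *src = toy_lookup(top, pg);

        if (src != NULL)
            memcpy(top->data[pg], src, PAGESIZE);   /* copy-on-write */
        else
            memset(top->data[pg], 0, PAGESIZE);     /* demand zero fill */
        top->present[pg] = 1;
    }
    n = strlen(text);
    if (n > PAGESIZE - 1)
        n = PAGESIZE - 1;
    memcpy(top->data[pg], text, n);
    top->data[pg][n] = '\0';
}
```
 ]]></programlisting>

     <para>With layers A, B, C1, and C2 linked as in the figures, a write
       through C1 followed by a write through C2 leaves the copy in B
       untouched but reachable from neither process, which is the dead page
       situation described above.</para>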
 
     <para>&os; solves the deep layering problem with a special optimization
       called the <quote>All Shadowed Case</quote>.  This case occurs if either
       C1 or C2 takes enough COW faults to completely shadow all pages in B.
       Let's say that C1 achieves this.  C1 can now bypass B entirely, so rather
       than have C1-&gt;B-&gt;A and C2-&gt;B-&gt;A we now have C1-&gt;A and C2-&gt;B-&gt;A.  But
       look what also happened&mdash;now B has only one reference (C2), so we
       can collapse B and C2 together.  The end result is that B is deleted
       entirely and we have C1-&gt;A and C2-&gt;A.  It is often the case that B will
       contain a large number of pages and neither C1 nor C2 will be able to
       completely overshadow it.  If we fork again and create a set of D
       layers, however, it is much more likely that one of the D layers will
       eventually be able to completely overshadow the much smaller dataset
       represented by C1 or C2.  The same optimization will work at any point in
       the graph and the grand result of this is that even on a heavily forked
       machine VM Object stacks tend to not get much deeper than 4.  This is
       true of both the parent and the children and true whether the parent is
       doing the forking or whether the children cascade forks.</para>
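
     <para>A minimal sketch of the bypass test follows.  It assumes a
       hypothetical <literal>toy_object</literal> with a per-page presence
       array and a reference count.  The real kernel structures and checks
       are more involved, so treat this purely as an illustration of the
       idea.</para>

     <programlisting><![CDATA[
```c
#include <assert.h>
#include <stddef.h>

#define NPAGES 4

/* Hypothetical, simplified VM Object layer. */
struct toy_object {
    struct toy_object *backing;   /* next lower layer, or NULL */
    int refs;                     /* number of layers resting on this one */
    int present[NPAGES];          /* 1 if this layer holds the page */
};

/* Returns 1 if every page held by 'lower' is also held by 'upper'. */
int
toy_all_shadowed(const struct toy_object *upper,
    const struct toy_object *lower)
{
    int pg;

    for (pg = 0; pg < NPAGES; pg++)
        if (lower->present[pg] && !upper->present[pg])
            return 0;
    return 1;
}

/*
 * If 'obj' completely shadows its backing layer, re-point it at the
 * layer below and drop its reference.  Returns 1 if a bypass happened.
 */
int
toy_try_bypass(struct toy_object *obj)
{
    struct toy_object *b = obj->backing;

    if (b == NULL || !toy_all_shadowed(obj, b))
        return 0;
    obj->backing = b->backing;
    if (b->backing != NULL)
        b->backing->refs++;
    b->refs--;
    assert(b->refs >= 0);
    return 1;
}
```
 ]]></programlisting>

     <para>Once C1 bypasses B in this model, B's reference count drops to
       one, which is the condition under which B and C2 can be
       collapsed.</para>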
 
     <para>The dead page problem still exists in the case where C1 or C2 does not
       completely overshadow B.  Due to our other optimizations this case does
       not represent much of a problem and we simply allow the pages to be
       dead.  If the system runs low on memory it will swap them out, eating a
       little swap, but that is it.</para>
 
     <para>The advantage to the VM Object model is that
       <function>fork()</function> is extremely fast, since no real data
       copying need take place.  The disadvantage is that you can build a
       relatively complex VM Object layering that slows page fault handling
       down a little, and you spend memory managing the VM Object structures.
       The optimizations &os; makes prove to reduce the problems enough
       that they can be ignored, leaving no real disadvantage.</para>
   </sect1>
 
   <sect1 xml:id="swap-layers">
     <title>SWAP Layers</title>
 
     <para>Private data pages are initially either copy-on-write or zero-fill
       pages.  When a change, and therefore a copy, is made, the original
       backing object (usually a file) can no longer be used to save a copy of
       the page when the VM system needs to reuse it for other purposes.  This
       is where SWAP comes in.  SWAP is allocated to create backing store for
       memory that does not otherwise have it.  &os; allocates the swap
       management structure for a VM Object only when it is actually needed.
       However, the swap management structure has had problems
       historically:</para>
 
     <itemizedlist>
       <listitem>
         <para>Under &os; 3.X the swap management structure preallocates an
           array that encompasses the entire object requiring swap backing
           store&mdash;even if only a few pages of that object are
           swap-backed.  This creates a kernel memory fragmentation problem
           when large objects are mapped, or processes with large resident set
           sizes (RSS) fork.</para>
       </listitem>
 
       <listitem>
         <para>Also, in order to keep track of swap space, a <quote>list of
           holes</quote> is kept in kernel memory, and this tends to get
           severely fragmented as well.  Since the <quote>list of
           holes</quote> is a linear list, the swap allocation and freeing
           performance is a non-optimal O(n)-per-page.</para>
       </listitem>
 
       <listitem>
         <para>It requires kernel memory allocations to take place during
           the swap freeing process, and that creates low memory deadlock
           problems.</para>
       </listitem>
 
       <listitem>
         <para>The problem is further exacerbated by holes created due to
           the interleaving algorithm.</para>
       </listitem>
 
       <listitem>
         <para>Also, the swap block map can become fragmented fairly easily
           resulting in non-contiguous allocations.</para>
       </listitem>
 
       <listitem>
         <para>Kernel memory must also be allocated on the fly for additional
           swap management structures when a swapout occurs.</para>
       </listitem>
     </itemizedlist>
 
     <para>It is evident from that list that there was plenty of room for
        improvement.  For &os; 4.X, I completely rewrote the swap
        subsystem:</para>
 
     <itemizedlist>
       <listitem>
         <para>Swap management structures are allocated through a hash
           table rather than a linear array, giving them a fixed allocation
           size and much finer granularity.</para>
       </listitem>
 
       <listitem>
         <para>Rather than using a linearly linked list to keep track of
           swap space reservations, it now uses a bitmap of swap blocks
           arranged in a radix tree structure with free-space hinting in
           the radix node structures.  This effectively makes swap
           allocation and freeing an O(1) operation.</para>
       </listitem>
 
       <listitem>
         <para>The entire radix tree bitmap is also preallocated in
           order to avoid having to allocate kernel memory during critical
           low memory swapping operations.  After all, the system tends to
           swap when it is low on memory so we should avoid allocating
           kernel memory at such times in order to avoid potential
           deadlocks.</para>
       </listitem>
 
       <listitem>
         <para>To reduce fragmentation the radix tree is capable
           of allocating large contiguous chunks at once, skipping over
           smaller fragmented chunks.</para>
       </listitem>
     </itemizedlist>
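
     <para>The combination of a preallocated bitmap and free-space hints
       can be illustrated with a much smaller, two-level toy structure.
       This is a sketch under stated assumptions, not the actual swap
       allocator: the names (<literal>toy_swapmap</literal>,
       <literal>toy_swap_alloc</literal>, <literal>toy_swap_free</literal>)
       are hypothetical, the <quote>tree</quote> has only one level of
       hints above the leaf bitmaps, and allocations may not cross a leaf
       boundary.</para>

     <programlisting><![CDATA[
```c
#include <assert.h>
#include <stdint.h>

#define LEAF_BITS 32
#define NLEAVES   8   /* 8 * 32 = 256 swap blocks in this toy model */

/* Fixed size and preallocated: nothing is allocated at alloc/free time. */
struct toy_swapmap {
    uint32_t leaf[NLEAVES];    /* bit set = block is free */
    int      nfree[NLEAVES];   /* hint: number of free blocks per leaf */
};

static uint32_t
toy_mask(int count)
{
    return (count == LEAF_BITS) ? UINT32_MAX
                                : ((UINT32_C(1) << count) - 1);
}

void
toy_swap_init(struct toy_swapmap *m)
{
    int i;

    for (i = 0; i < NLEAVES; i++) {
        m->leaf[i] = UINT32_MAX;
        m->nfree[i] = LEAF_BITS;
    }
}

/* Allocate 'count' contiguous blocks; returns first block or -1. */
int
toy_swap_alloc(struct toy_swapmap *m, int count)
{
    int i, shift;
    uint32_t mask;

    assert(count >= 1 && count <= LEAF_BITS);
    mask = toy_mask(count);
    for (i = 0; i < NLEAVES; i++) {
        if (m->nfree[i] < count)
            continue;          /* the hint lets us skip this leaf */
        for (shift = 0; shift + count <= LEAF_BITS; shift++) {
            if ((m->leaf[i] & (mask << shift)) == (mask << shift)) {
                m->leaf[i] &= ~(mask << shift);
                m->nfree[i] -= count;
                return i * LEAF_BITS + shift;
            }
        }
    }
    return -1;
}

/* Free 'count' blocks starting at 'blk' (all within one leaf). */
void
toy_swap_free(struct toy_swapmap *m, int blk, int count)
{
    int i = blk / LEAF_BITS, shift = blk % LEAF_BITS;
    uint32_t mask;

    assert(count >= 1 && shift + count <= LEAF_BITS);
    mask = toy_mask(count);
    assert((m->leaf[i] & (mask << shift)) == 0);   /* must be in use */
    m->leaf[i] |= mask << shift;
    m->nfree[i] += count;
}
```
 ]]></programlisting>

     <para>A request for a large run skips leaves whose hint shows too
       little free space, and within a leaf it passes over holes that are
       too small, which mirrors the behaviour described in the list
       above.</para>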
 
     <para>I did not take the final step of having an
       <quote>allocating hint pointer</quote> that would trundle
       through a portion of swap as allocations were made in order to further
       guarantee contiguous allocations or at least locality of reference, but
       I ensured that such an addition could be made.</para>
   </sect1>
 
   <sect1 xml:id="freeing-pages">
     <title>When to free a page</title>
 
     <para>Since the VM system uses all available memory for disk caching,
       there are usually very few truly-free pages.  The VM system depends on
       being able to properly choose pages which are not in use to reuse for
       new allocations.  Selecting the optimal pages to free is possibly the
       single most important function any VM system can perform because if it
       makes a poor selection, the VM system may be forced to unnecessarily
       retrieve pages from disk, seriously degrading system performance.</para>
 
     <para>How much overhead are we willing to suffer in the critical path to
       avoid freeing the wrong page?  Each wrong choice we make will cost us
       hundreds of thousands of CPU cycles and a noticeable stall of the
       affected processes, so we are willing to endure a significant amount of
       overhead in order to be sure that the right page is chosen.  This is why
       &os; tends to outperform other systems when memory resources become
       stressed.</para>
 
     <para>The free page determination algorithm is built upon a history of the
       use of memory pages.  To acquire this history, the system takes advantage
       of a page-used bit feature that most hardware page tables have.</para>
 
     <para>In any case, the page-used bit is cleared and at some later point
       the VM system comes across the page again and sees that the page-used
       bit has been set.  This indicates that the page is still being actively
       used.  If the bit is still clear it is an indication that the page is not
       being actively used.  By testing this bit periodically, a use history (in
       the form of a counter) for the physical page is developed.  When the VM
       system later needs to free up some pages, checking this history becomes
       the cornerstone of determining the best candidate page to reuse.</para>
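
     <para>The sampling loop can be sketched as follows.  The structure
       and the constants are assumptions chosen for illustration
       (<literal>toy_page</literal>, <literal>TOY_ADVANCE</literal>,
       <literal>TOY_DECLINE</literal>, and <literal>TOY_MAX</literal> are
       hypothetical names), and a plain integer field stands in for the
       hardware page-used bit.</para>

     <programlisting><![CDATA[
```c
#include <assert.h>

#define TOY_MAX     64   /* assumed cap on the history counter */
#define TOY_ADVANCE 3    /* assumed gain when the page was used */
#define TOY_DECLINE 1    /* assumed decay when it was not */

struct toy_page {
    int referenced;   /* stands in for the hardware page-used bit */
    int act_count;    /* use history accumulated over several scans */
};

/*
 * One periodic scan of a page: sample and clear the page-used bit and
 * age the counter.  Returns 1 when the counter has decayed to zero,
 * meaning the page looks like a good candidate for reuse.
 */
int
toy_scan_page(struct toy_page *p)
{
    assert(p->act_count >= 0 && p->act_count <= TOY_MAX);
    if (p->referenced) {
        p->referenced = 0;
        p->act_count += TOY_ADVANCE;
        if (p->act_count > TOY_MAX)
            p->act_count = TOY_MAX;
    } else if (p->act_count > 0) {
        p->act_count -= TOY_DECLINE;
    }
    return p->act_count == 0;
}
```
 ]]></programlisting>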
 
     <sidebar>
       <title>What if the hardware has no page-used bit?</title>
 
       <para>For those platforms that do not have this feature, the system
 	actually emulates a page-used bit.  It unmaps or protects a page,
 	forcing a page fault if the page is accessed again.  When the page
 	fault is taken, the system simply marks the page as having been used
 	and unprotects the page so that it may be used.  While taking such page
 	faults just to determine if a page is being used appears to be an
 	expensive proposition, it is much less expensive than reusing the page
 	for some other purpose only to find that a process needs it back and
 	then have to go to disk.</para>
     </sidebar>
 
     <para>&os; makes use of several page queues to further refine the
       selection of pages to reuse as well as to determine when dirty pages
       must be flushed to their backing store.  Since page tables are dynamic
       entities under &os;, it costs virtually nothing to unmap a page from
       the address space of any processes using it.  When a page candidate has
       been chosen based on the page-use counter, this is precisely what is
       done.  The system must make a distinction between clean pages which can
       theoretically be freed up at any time, and dirty pages which must first
       be written to their backing store before being reusable.  When a page
       candidate has been found it is moved to the inactive queue if it is
       dirty, or the cache queue if it is clean.  A separate algorithm based on
       the dirty-to-clean page ratio determines when dirty pages in the
       inactive queue must be flushed to disk.  Once this is accomplished, the
       flushed pages are moved from the inactive queue to the cache queue.  At
       this point, pages in the cache queue can still be reactivated by a VM
       fault at relatively low cost.  However, pages in the cache queue are
       considered to be <quote>immediately freeable</quote> and will be reused
       in an LRU (least-recently used) fashion when the system needs to
       allocate new memory.</para>
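
     <para>The queue transitions described above can be summarized in a
       short sketch.  The queue names follow the text, but the types and
       functions (<literal>toy_page</literal>,
       <literal>toy_deactivate</literal>, <literal>toy_launder</literal>,
       <literal>toy_reactivate</literal>) are hypothetical and leave out
       the ratio-based policy that decides when flushing happens.</para>

     <programlisting><![CDATA[
```c
#include <assert.h>

enum toy_queue { TOY_ACTIVE, TOY_INACTIVE, TOY_CACHE };

struct toy_page {
    int dirty;
    enum toy_queue queue;
};

/* A page chosen for reuse: dirty pages go to inactive, clean to cache. */
void
toy_deactivate(struct toy_page *p)
{
    assert(p->queue == TOY_ACTIVE);
    p->queue = p->dirty ? TOY_INACTIVE : TOY_CACHE;
}

/* Flush a dirty inactive page; returns 1 if the page was flushed. */
int
toy_launder(struct toy_page *p)
{
    if (p->queue != TOY_INACTIVE || !p->dirty)
        return 0;
    /* A real system would write the page to its backing store here. */
    p->dirty = 0;
    p->queue = TOY_CACHE;
    return 1;
}

/* A fault on a page that is still queued brings it back cheaply. */
void
toy_reactivate(struct toy_page *p)
{
    p->queue = TOY_ACTIVE;
}
```
 ]]></programlisting>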
 
     <para>It is important to note that the &os; VM system attempts to
       separate clean and dirty pages for the express reason of avoiding
       unnecessary flushes of dirty pages (which eat I/O bandwidth), and it does
       not move pages between the various page queues gratuitously when the
       memory subsystem is not being stressed.  This is why you will see some
       systems with very low cache queue counts and high active queue counts
       when running <command>systat -vm</command>.  As the VM system
       becomes more stressed, it makes a greater effort to maintain the various
       page queues at the levels determined to be the most effective.</para>
 
     <para>An urban
       myth has circulated for years that Linux did a better job avoiding
       swapouts than &os;, but this in fact is not true.  What was actually
       occurring was that &os; was proactively paging out unused pages in
       order to make room for more disk cache while Linux was keeping unused
       pages in core and leaving less memory available for cache and process
       pages.  I do not know whether this is still true today.</para>
   </sect1>
 
   <sect1 xml:id="prefault-optimizations">
     <title>Pre-Faulting and Zeroing Optimizations</title>
 
     <para>Taking a VM fault is not expensive if the underlying page is already
       in core and can simply be mapped into the process, but it can become
       expensive if you take a whole lot of them on a regular basis.  A good
       example of this is running a program such as &man.ls.1; or &man.ps.1;
       over and over again.  If the program binary is mapped into memory but
       not mapped into the page table, then all the pages that will be accessed
       by the program will have to be faulted in every time the program is run.
       This is unnecessary when the pages in question are already in the VM
       Cache, so &os; will attempt to pre-populate a process's page tables
       with those pages that are already in the VM Cache.  One thing that
       &os; does not yet do is pre-copy-on-write certain pages on exec.  For
       example, if you run the &man.ls.1; program while running <command>vmstat
 	1</command> you will notice that it always takes a certain number of
       page faults, even when you run it over and over again.  These are
       zero-fill faults, not program code faults (which were pre-faulted in
       already).  Pre-copying pages on exec or fork is an area that could use
       more study.</para>
 
     <para>A large percentage of page faults that occur are zero-fill faults.
       You can usually see this by observing the <command>vmstat -s</command>
       output.  These occur when a process accesses pages in its BSS area.  The
       BSS area is expected to be initially zero but the VM system does not
       bother to allocate any memory at all until the process actually accesses
       it.  When a fault occurs the VM system must not only allocate a new page,
       it must zero it as well.  To optimize the zeroing operation the VM system
       has the ability to pre-zero pages and mark them as such, and to request
       pre-zeroed pages when zero-fill faults occur.  The pre-zeroing occurs
       whenever the CPU is idle but the number of pages the system pre-zeros is
       limited in order to avoid blowing away the memory caches.  This is an
       excellent example of adding complexity to the VM system in order to
       optimize the critical path.</para>
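
     <para>As a rough sketch of the idea, the following toy code keeps a
       <quote>pre-zeroed</quote> flag per free page.  Idle-time work zeros
       a bounded number of pages, and the zero-fill fault path prefers a
       page that is already zeroed.  All names and sizes here are
       hypothetical, and the sketch ignores locking and cache effects
       entirely.</para>

     <programlisting><![CDATA[
```c
#include <assert.h>
#include <stddef.h>
#include <string.h>

#define TOY_PAGE_SIZE 64

struct toy_page {
    unsigned char data[TOY_PAGE_SIZE];
    int is_free;
    int prezeroed;
};

/* Idle-time work: zero at most 'limit' free pages; returns how many. */
int
toy_idle_prezero(struct toy_page *pages, int npages, int limit)
{
    int i, done = 0;

    for (i = 0; i < npages && done < limit; i++) {
        if (pages[i].is_free && !pages[i].prezeroed) {
            memset(pages[i].data, 0, TOY_PAGE_SIZE);
            pages[i].prezeroed = 1;
            done++;
        }
    }
    return done;
}

/*
 * Zero-fill fault: hand out a pre-zeroed page if one exists, otherwise
 * zero a free page on the spot.  *zeroed_now reports whether the fault
 * path itself had to do the zeroing.  Returns NULL if no page is free.
 */
struct toy_page *
toy_zero_fill_fault(struct toy_page *pages, int npages, int *zeroed_now)
{
    struct toy_page *fallback = NULL;
    int i;

    *zeroed_now = 0;
    for (i = 0; i < npages; i++) {
        if (!pages[i].is_free)
            continue;
        if (pages[i].prezeroed) {
            pages[i].is_free = 0;
            pages[i].prezeroed = 0;
            return &pages[i];
        }
        if (fallback == NULL)
            fallback = &pages[i];
    }
    if (fallback != NULL) {
        memset(fallback->data, 0, TOY_PAGE_SIZE);
        fallback->is_free = 0;
        *zeroed_now = 1;
    }
    return fallback;
}
```
 ]]></programlisting>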
   </sect1>
 
   <sect1 xml:id="page-table-optimizations">
     <title>Page Table Optimizations</title>
 
     <para>The page table optimizations make up the most contentious part of
       the &os; VM design and they have shown some strain with the advent of
       serious use of <function>mmap()</function>.  I think this is actually a
       feature of most BSDs though I am not sure when it was first introduced.
       There are two major optimizations.  The first is that hardware page
       tables do not contain persistent state but instead can be thrown away at
       any time with only a minor amount of management overhead.  The second is
       that every active page table entry in the system has a governing
       <literal>pv_entry</literal> structure which is tied into the
       <literal>vm_page</literal> structure.  &os; can simply iterate
       through those mappings that are known to exist while Linux must check
       all page tables that <emphasis>might</emphasis> contain a specific
       mapping to see if it does, which can incur O(n^2) overhead in certain
       situations.  It is because of this that &os; tends to make better
       choices on which pages to reuse or swap when memory is stressed, giving
       it better performance under load. However, &os; requires kernel
       tuning to accommodate large-shared-address-space situations such as
       those that can occur in a news system because it may run out of
       <literal>pv_entry</literal> structures.</para>
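
     <para>The benefit of a per-page list of mappings can be shown with a
       small sketch.  The structures below are hypothetical stand-ins
       (<literal>toy_pv_entry</literal> and
       <literal>toy_vm_page</literal>), not the kernel's definitions.  The
       point is only that removing a page from every address space costs
       time proportional to the number of mappings that actually
       exist.</para>

     <programlisting><![CDATA[
```c
#include <assert.h>
#include <stddef.h>

/* Hypothetical record of one mapping of a physical page. */
struct toy_pv_entry {
    int pid;                     /* which process maps the page */
    unsigned long va;            /* at which virtual address */
    int mapped;                  /* stands in for a live page table entry */
    struct toy_pv_entry *next;
};

/* Hypothetical physical page with its list of known mappings. */
struct toy_vm_page {
    struct toy_pv_entry *pv_list;
};

/* Record a new mapping of the page. */
void
toy_pv_insert(struct toy_vm_page *pg, struct toy_pv_entry *pv)
{
    pv->mapped = 1;
    pv->next = pg->pv_list;
    pg->pv_list = pv;
}

/*
 * Unmap the page everywhere by walking only the mappings known to
 * exist.  Returns the number of mappings removed.
 */
int
toy_remove_all(struct toy_vm_page *pg)
{
    struct toy_pv_entry *pv;
    int n = 0;

    while ((pv = pg->pv_list) != NULL) {
        pg->pv_list = pv->next;
        pv->mapped = 0;
        pv->next = NULL;
        n++;
    }
    return n;
}
```
 ]]></programlisting>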
 
     <para>Both Linux and &os; need work in this area.  &os; is trying to
       maximize the advantage of a potentially sparse active-mapping model (not
       all processes need to map all pages of a shared library, for example),
       whereas Linux is trying to simplify its algorithms.  &os; generally
       has the performance advantage here at the cost of wasting a little extra
       memory, but &os; breaks down in the case where a large file is
       massively shared across hundreds of processes.  Linux, on the other hand,
       breaks down in the case where many processes are sparsely-mapping the
       same shared library and also runs non-optimally when trying to determine
       whether a page can be reused or not.</para>
   </sect1>
 
   <sect1 xml:id="page-coloring-optimizations">
     <title>Page Coloring</title>
 
     <para>We will end with the page coloring optimizations.  Page coloring is a
       performance optimization designed to ensure that accesses to contiguous
       pages in virtual memory make the best use of the processor cache.  In
       ancient times (i.e., 10+ years ago) processor caches tended to map
       virtual memory rather than physical memory.  This led to a huge number of
       problems including having to clear the cache on every context switch in
       some cases, and problems with data aliasing in the cache.  Modern
       processor caches map physical memory precisely to solve those problems.
       This means that two side-by-side pages in a process's address space may
       not correspond to two side-by-side pages in the cache.  In fact, if you
       are not careful side-by-side pages in virtual memory could wind up using
       the same page in the processor cache&mdash;leading to cacheable data
       being thrown away prematurely and reducing CPU performance.  This is true
       even with multi-way set-associative caches (though the effect is
       mitigated somewhat).</para>
 
     <para>&os;'s memory allocation code implements page coloring
       optimizations, which means that the memory allocation code will attempt
       to locate free pages that are contiguous from the point of view of the
       cache.  For example, if page 16 of physical memory is assigned to page 0
       of a process's virtual memory and the cache can hold 4 pages, the page
       coloring code will not assign page 20 of physical memory to page 1 of a
       process's virtual memory.  It would, instead, assign page 21 of physical
       memory.  The page coloring code attempts to avoid assigning page 20
       because this maps over the same cache memory as page 16 and would result
       in non-optimal caching.  This code adds a significant amount of
       complexity to the VM memory allocation subsystem as you can well
       imagine, but the result is well worth the effort.  Page coloring makes VM
       memory as deterministic as physical memory with regard to cache
       performance.</para>
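
     <para>The arithmetic behind this example can be sketched in a few
       lines of Python.  This is only an illustration under the
       assumptions of the example (a direct-mapped cache whose size is
       counted in whole pages); the names
       <literal>page_color</literal> and <literal>pick_page</literal>
       are hypothetical and do not correspond to functions in the &os;
       kernel.</para>

```python
def page_color(page_number, cache_pages):
    """Return the cache slot (color) that a physical page number falls into."""
    return page_number % cache_pages


def pick_page(free_pages, prev_physical, cache_pages):
    """Pick a free physical page whose color follows the previous page's color.

    Falls back to the first free page when no page of the wanted color is free.
    """
    wanted = (page_color(prev_physical, cache_pages) + 1) % cache_pages
    for page in free_pages:
        if page_color(page, cache_pages) == wanted:
            return page
    return free_pages[0]
```

     <para>With a four-page cache, physical pages 16 and 20 both have
       color 0 while page 21 has color 1, so
       <literal>pick_page([20, 21, 22], 16, 4)</literal> selects page
       21.</para>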
   </sect1>
 
   <sect1 xml:id="conclusion">
     <title>Conclusion</title>
 
     <para>Virtual memory in modern operating systems must address a number of
       different issues efficiently and for many different usage patterns.  The
       modular and algorithmic approach that BSD has historically taken allows
       us to study and understand the current implementation as well as
       relatively cleanly replace large sections of the code.  There have been a
       number of improvements to the &os; VM system in the last several
       years, and work is ongoing.</para>
   </sect1>
 
   <sect1 xml:id="allen-briggs-qa">
     <title>Bonus QA session by Allen Briggs
       <email>briggs@ninthwonder.com</email></title>
 
     <qandaset>
       <qandaentry>
 	<question>
 	  <para>What is <quote>the interleaving algorithm</quote> that you
 	    refer to in your listing of the ills of the &os; 3.X swap
 	    arrangements?</para>
 	</question>
 
 	<answer>
 	  <para>&os; uses a fixed swap interleave which defaults to 4.  This
 	    means that &os; reserves space for four swap areas even if you
 	    only have one, two, or three.  Since swap is interleaved the linear
 	    address space representing the <quote>four swap areas</quote> will be
 	    fragmented if you do not actually have four swap areas.  For
 	    example, if you have two swap areas A and B, &os;'s address
 	    space representation for that swap area will be interleaved in
 	    blocks of 16 pages:</para>
 
 	  <literallayout>A B C D A B C D A B C D A B C D</literallayout>
 
 	  <para>&os; 3.X uses a <quote>sequential list of free
 	    regions</quote> approach to accounting for the free swap areas.
 	    The idea is that large blocks of free linear space can be
 	    represented with a single list node
 	    (<filename>kern/subr_rlist.c</filename>).  But due to the
 	    fragmentation the sequential list winds up being insanely
 	    fragmented.  In the above example, completely unused swap will
 	    have A and B shown as <quote>free</quote> and C and D shown as
 	    <quote>all allocated</quote>.  Each A-B sequence requires a list
 	    node to account for because C and D are holes, so the list node
 	    cannot be combined with the next A-B sequence.</para>
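
 	  <para>The effect of the holes can be reproduced with a small
 	    Python sketch.  It assumes the layout described above (four
 	    interleave slots, 16 pages per block) and simply counts the
 	    runs of usable pages; <literal>area_of</literal> and
 	    <literal>free_regions</literal> are hypothetical names, not
 	    the 3.X kernel code.</para>

```python
STRIPE_PAGES = 16   # pages per interleave block, as in the text
INTERLEAVE = 4      # fixed number of swap-area slots (A, B, C, D)


def area_of(page):
    """Return the slot (0 for A, 1 for B, ...) that holds a linear swap page."""
    return (page // STRIPE_PAGES) % INTERLEAVE


def free_regions(total_pages, configured_areas):
    """Return (start, length) runs of pages that belong to configured areas."""
    regions = []
    run_start = None
    for page in range(total_pages):
        usable = area_of(page) in configured_areas
        if usable and run_start is None:
            run_start = page
        elif not usable and run_start is not None:
            regions.append((run_start, page - run_start))
            run_start = None
    if run_start is not None:
        regions.append((run_start, total_pages - run_start))
    return regions
```

 	  <para>Over 256 pages, four configured areas give one region
 	    of 256 pages, while two configured areas give four separate
 	    regions of 32 pages each, one list node per A-B
 	    sequence.</para>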
 
 	  <para>Why do we interleave our swap space instead of just tack swap
-	    areas onto the end and do something fancier?  Because it is a whole
+	    areas onto the end and do something fancier?  It is a whole
 	    lot easier to allocate linear swaths of an address space and have
 	    the result automatically be interleaved across multiple disks than
 	    it is to try to put that sophistication elsewhere.</para>
 
 	  <para>The fragmentation causes other problems.  Being a linear list
 	    under 3.X, and having such a huge amount of inherent
 	    fragmentation, allocating and freeing swap winds up being an O(N)
 	    algorithm instead of an O(1) algorithm.  Combine that with other
 	    factors (heavy swapping) and you start getting into O(N^2) and
 	    O(N^3) levels of overhead, which is bad.  The 3.X system may also
 	    need to allocate KVM during a swap operation to create a new list
 	    node which can lead to a deadlock if the system is trying to
 	    pageout pages in a low-memory situation.</para>
 
 	  <para>Under 4.X we do not use a sequential list.  Instead we use a
 	    radix tree and bitmaps of swap blocks rather than ranged list
 	    nodes.  We take the hit of preallocating all the bitmaps required
 	    for the entire swap area up front but it winds up wasting less
 	    memory due to the use of a bitmap (one bit per block) instead of a
 	    linked list of nodes.  The use of a radix tree instead of a
 	    sequential list gives us nearly O(1) performance no matter how
 	    fragmented the tree becomes.</para>
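
 	  <para>The idea of a preallocated bitmap with a summary level
 	    above it can be sketched as follows.  This is a deliberately
 	    simplified two-level structure, not the radix tree used by
 	    the kernel; the class name <literal>SwapBitmap</literal> and
 	    the chunk size are assumptions made for the
 	    illustration.</para>

```python
class SwapBitmap:
    """Free-block map: one flag per block plus a free counter per chunk."""

    def __init__(self, nblocks, chunk=32):
        self.chunk = chunk
        self.free = [True] * nblocks   # leaf level, allocated once up front
        nchunks = (nblocks + chunk - 1) // chunk
        # Upper level: number of free blocks in each chunk.
        self.free_in_chunk = [
            len(self.free[i * chunk:(i + 1) * chunk]) for i in range(nchunks)
        ]

    def alloc(self):
        """Return the lowest free block number, or None if swap is full."""
        for index, count in enumerate(self.free_in_chunk):
            if count == 0:
                continue   # skip a full chunk without touching its flags
            block = self.free.index(True, index * self.chunk)
            self.free[block] = False
            self.free_in_chunk[index] -= 1
            return block
        return None

    def release(self, block):
        """Mark a block free again; no new node has to be allocated."""
        if not self.free[block]:
            self.free[block] = True
            self.free_in_chunk[block // self.chunk] += 1
```

 	  <para>Because every structure is allocated when the map is
 	    created, neither <literal>alloc</literal> nor
 	    <literal>release</literal> needs memory at swap time, which
 	    is the property that avoids the low-memory deadlock
 	    described above.</para>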
 	</answer>
       </qandaentry>
 
       <qandaentry>
 	<question>
 	  <para>How is the separation of clean and dirty (inactive) pages
 	    related to the situation where you see low cache queue counts and
 	    high active queue counts in <command>systat -vm</command>?  Do the
 	    systat stats roll the active and dirty pages together for the
 	    active queue count?</para>
 
 	  <para>I do not get the following:</para>
 
 	  <blockquote>
 	    <para>It is important to note that the &os; VM system attempts
 	      to separate clean and dirty pages for the express reason of
 	      avoiding unnecessary flushes of dirty pages (which eats I/O
 	      bandwidth), nor does it move pages between the various page
 	      queues gratuitously when the memory subsystem is not being
 	      stressed.  This is why you will see some systems with very low
 	      cache queue counts and high active queue counts when doing a
 	      <command>systat -vm</command> command.</para>
 	  </blockquote>
 	</question>
 
 	<answer>
 	  <para>Yes, that is confusing.  The relationship is
 	    <quote>goal</quote> versus <quote>reality</quote>.  Our goal is to
 	    separate the pages but the reality is that if we are not in a
 	    memory crunch, we do not really have to.</para>
 
 	  <para>What this means is that &os; will not try very hard to
 	    separate out dirty pages (inactive queue) from clean pages (cache
 	    queue) when the system is not being stressed, nor will it try to
 	    deactivate pages (active queue -&gt; inactive queue) when the system
 	    is not being stressed, even if they are not being used.</para>
 	</answer>
       </qandaentry>
 
       <qandaentry>
 	<question>
 	  <para>In the &man.ls.1; / <command>vmstat 1</command> example,
 	    would not some of the page faults be data page faults (COW from
 	    executable file to private page)?  I.e., I would expect the page
 	    faults to be some zero-fill and some program data.  Or are you
 	    implying that &os; does do pre-COW for the program data?</para>
 	</question>
 
 	<answer>
 	  <para>A COW fault can be either zero-fill or program-data.  The
 	    mechanism is the same either way because the backing program-data
 	    is almost certainly already in the cache.  I am indeed lumping the
 	    two together.  &os; does not pre-COW program data or zero-fill,
 	    but it <emphasis>does</emphasis> pre-map pages that exist in its
 	    cache.</para>
 	</answer>
       </qandaentry>
 
       <qandaentry>
 	<question>
 	  <para>In your section on page table optimizations, can you give a
 	    little more detail about <literal>pv_entry</literal> and
 	    <literal>vm_page</literal> (or should vm_page be
 	    <literal>vm_pmap</literal>&mdash;as in 4.4, cf. pp. 180-181 of
 	    McKusick, Bostic, Karel, Quarterman)?  Specifically, what kind of
 	    operation/reaction would require scanning the mappings?</para>
 
 	  <para>How does Linux do in the case where &os; breaks down
 	    (sharing a large file mapping over many processes)?</para>
 	</question>
 
 	<answer>
 	  <para>A <literal>vm_page</literal> represents an (object,index#)
 	    tuple.  A <literal>pv_entry</literal> represents a hardware page
 	    table entry (pte).  If you have five processes sharing the same
 	    physical page, and three of those processes' page tables actually
 	    map the page, that page will be represented by a single
 	    <literal>vm_page</literal> structure and three
 	    <literal>pv_entry</literal> structures.</para>
 
 	  <para><literal>pv_entry</literal> structures only represent pages
 	    mapped by the MMU (one <literal>pv_entry</literal> represents one
 	    pte).  This means that when we need to remove all hardware
 	    references to a <literal>vm_page</literal> (in order to reuse the
 	    page for something else, page it out, clear it, dirty it, and so
 	    forth) we can simply scan the linked list of
 	    <literal>pv_entry</literal>'s associated with that
 	    <literal>vm_page</literal> to remove or modify the pte's from
 	    their page tables.</para>
 
 	  <para>Under Linux there is no such linked list.  In order to remove
 	    all the hardware page table mappings for a
 	    <literal>vm_page</literal>, Linux must index into every VM object
 	    that <emphasis>might</emphasis> have mapped the page.  For
 	    example, if you have 50 processes all mapping the same shared
 	    library and want to get rid of page X in that library, you need to
 	    index into the page table for each of those 50 processes even if
 	    only 10 of them have actually mapped the page.  So Linux is
 	    trading off the simplicity of its design against performance.
 	    Many VM algorithms which are O(1) or (small N) under &os; wind
 	    up being O(N), O(N^2), or worse under Linux.  Since the pte's
 	    representing a particular page in an object tend to be at the same
 	    offset in all the page tables they are mapped in, reducing the
 	    number of accesses into the page tables at the same pte offset
 	    will often avoid blowing away the L1 cache line for that offset,
 	    which can lead to better performance.</para>
 
 	  <para>&os; has added complexity (the <literal>pv_entry</literal>
 	    scheme) in order to increase performance (to limit page table
 	    accesses to <emphasis>only</emphasis> those pte's that need to be
 	    modified).</para>
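
 	  <para>The difference between the two approaches can be modeled
 	    in Python.  The sketch below is only a toy model: page tables
 	    are plain dictionaries, and <literal>VmPage</literal>,
 	    <literal>map_page</literal>,
 	    <literal>unmap_with_pv_list</literal>, and
 	    <literal>unmap_by_scanning</literal> are hypothetical names
 	    that stand in for the real data structures.</para>

```python
class VmPage:
    """One physical page plus a list standing in for its pv_entry chain."""

    def __init__(self):
        self.pv_list = []   # one entry per page table that really maps the page


def map_page(page, page_table, va):
    """Install a mapping and record it on the page's pv list."""
    page_table[va] = page
    page.pv_list.append((page_table, va))


def unmap_with_pv_list(page):
    """Remove every mapping by walking only the page's own pv list."""
    visited = 0
    while page.pv_list:
        page_table, va = page.pv_list.pop()
        del page_table[va]
        visited += 1
    return visited


def unmap_by_scanning(page, page_tables, va):
    """Remove every mapping by probing each page table that might hold it."""
    visited = 0
    for page_table in page_tables:
        visited += 1
        if page_table.get(va) is page:
            del page_table[va]
    return visited
```

 	  <para>With 50 page tables of which 10 map the page, the pv
 	    list walk touches 10 page tables and the scan touches all
 	    50.</para>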
 
 	  <para>But &os; has a scaling problem that Linux does not in that
 	    there are a limited number of <literal>pv_entry</literal>
 	    structures and this causes problems when you have massive sharing
 	    of data.  In this case you may run out of
 	    <literal>pv_entry</literal> structures even though there is plenty
 	    of free memory available.  This can be fixed easily enough by
 	    bumping up the number of <literal>pv_entry</literal> structures in
 	    the kernel config, but we really need to find a better way to do
 	    it.</para>
 
 	  <para>With regard to the memory overhead of a page table versus the
 	    <literal>pv_entry</literal> scheme: Linux uses
 	    <quote>permanent</quote> page tables that are not thrown away, but
 	    does not need a <literal>pv_entry</literal> for each potentially
 	    mapped pte.  &os; uses <quote>throw away</quote> page tables but
 	    adds in a <literal>pv_entry</literal> structure for each
 	    actually-mapped pte.  I think memory utilization winds up being
 	    about the same, giving &os; an algorithmic advantage with its
 	    ability to throw away page tables at will with very low
 	    overhead.</para>
 	</answer>
       </qandaentry>
 
       <qandaentry>
 	<question>
 	  <para>Finally, in the page coloring section, it might help to have a
 	    little more description of what you mean here.  I did not quite
 	    follow it.</para>
 	</question>
 
 	<answer>
 	  <para>Do you know how an L1 hardware memory cache works?  I will
 	    explain: Consider a machine with 16MB of main memory but only 128K
 	    of L1 cache.  Generally the way this cache works is that each 128K
 	    block of main memory uses the <emphasis>same</emphasis> 128K of
 	    cache.  If you access offset 0 in main memory and then offset
 	    128K in main memory you can wind up throwing away the
 	    cached data you read from offset 0!</para>
 
 	  <para>Now, I am simplifying things greatly.  What I just described
 	    is what is called a <quote>direct mapped</quote> hardware memory
 	    cache.  Most modern caches are what are called
 	    2-way-set-associative or 4-way-set-associative caches.  The
 	    set-associativity allows you to access up to N different memory
 	    regions that overlap the same cache memory without destroying the
 	    previously cached data.  But only N.</para>
 
 	  <para>So if I have a 4-way set associative cache I can access offset
 	    0, offset 128K, 256K and offset 384K and still be able to access
 	    offset 0 again and have it come from the L1 cache.  If I then
 	    access offset 512K, however, one of the four previously cached
 	    data objects will be thrown away by the cache.</para>
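
 	  <para>This behavior can be checked with a toy simulation.  The
 	    sketch below follows the simplified model used in this
 	    answer, where addresses 128K apart compete for the same
 	    cache memory; the least-recently-used replacement policy and
 	    the class name <literal>SetAssociativeCache</literal> are
 	    assumptions, since real hardware differs in
 	    detail.</para>

```python
from collections import OrderedDict

WAY_SIZE = 128 * 1024   # addresses this far apart compete for the same slot


class SetAssociativeCache:
    """Toy N-way cache that tracks which addresses occupy each slot."""

    def __init__(self, ways):
        self.ways = ways
        self.sets = {}   # slot offset: OrderedDict of addresses, oldest first

    def access(self, address):
        """Return True on a hit, False on a miss (evicting the oldest if full)."""
        entries = self.sets.setdefault(address % WAY_SIZE, OrderedDict())
        if address in entries:
            entries.move_to_end(address)   # mark as most recently used
            return True
        if len(entries) == self.ways:
            entries.popitem(last=False)    # assumed LRU replacement
        entries[address] = True
        return False
```

 	  <para>With <literal>ways</literal> set to 4, offsets 0, 128K,
 	    256K, and 384K all miss once and offset 0 then hits; a
 	    further access to offset 512K evicts one of the four.  With
 	    <literal>ways</literal> set to 1 (direct mapped), offset 128K
 	    already evicts offset 0.</para>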
 
 	  <para>It is extremely important&hellip;
 	    <emphasis>extremely</emphasis> important for most of a processor's
 	    memory accesses to be able to come from the L1 cache, because the
 	    L1 cache operates at the processor frequency.  The moment you have
 	    an L1 cache miss and have to go to the L2 cache or to main memory,
 	    the processor will stall and potentially sit twiddling its fingers
 	    for <emphasis>hundreds</emphasis> of instructions worth of time
 	    waiting for a read from main memory to complete.  Main memory (the
 	    dynamic ram you stuff into a computer) is
 	    <emphasis>slow</emphasis>, when compared to the speed of a modern
 	    processor core.</para>
 
 	  <para>Ok, so now onto page coloring: All modern memory caches are
 	    what are known as <emphasis>physical</emphasis> caches.  They
 	    cache physical memory addresses, not virtual memory addresses.
 	    This allows the cache to be left alone across a process context
 	    switch, which is very important.</para>
 
 	  <para>But in the &unix; world you are dealing with virtual address
 	    spaces, not physical address spaces.  Any program you write will
 	    see the virtual address space given to it.  The actual
 	    <emphasis>physical</emphasis> pages underlying that virtual
 	    address space are not necessarily physically contiguous!  In fact,
 	    you might have two pages that are side by side in a process's
 	    address space which wind up being at offset 0 and offset 128K in
 	    <emphasis>physical</emphasis> memory.</para>
 
 	  <para>A program normally assumes that two side-by-side pages will be
 	    optimally cached.  That is, that you can access data objects in
 	    both pages without having them blow away each other's cache entry.
 	    But this is only true if the physical pages underlying the virtual
 	    address space are contiguous (insofar as the cache is
 	    concerned).</para>
 
 	  <para>This is what page coloring does.  Instead of assigning
 	    <emphasis>random</emphasis> physical pages to virtual addresses,
 	    which may result in non-optimal cache performance, page coloring
 	    assigns <emphasis>reasonably-contiguous</emphasis> physical pages
 	    to virtual addresses.  Thus programs can be written under the
 	    assumption that the characteristics of the underlying hardware
 	    cache are the same for their virtual address space as they would
 	    be if the program had been run directly in a physical address
 	    space.</para>
 
 	  <para>Note that I say <quote>reasonably</quote> contiguous rather
 	    than simply <quote>contiguous</quote>.  From the point of view of a
 	    128K direct mapped cache, the physical address 0 is the same as
 	    the physical address 128K.  So two side-by-side pages in your
 	    virtual address space may wind up being offset 128K and offset
 	    132K in physical memory, but could also easily be offset 128K and
 	    offset 4K in physical memory and still retain the same cache
 	    performance characteristics.  So page coloring does
 	    <emphasis>not</emphasis> have to assign truly contiguous pages of
 	    physical memory to contiguous pages of virtual memory; it just
 	    needs to make sure it assigns contiguous pages from the point of
 	    view of cache performance and operation.</para>
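
 	  <para>The equivalence of those two placements can be verified
 	    with a short Python sketch.  It assumes the 128K direct
 	    mapped cache of the example and 4K pages (implied by the
 	    offsets 128K and 132K); <literal>cache_page</literal> and
 	    <literal>conflict_free</literal> are hypothetical helper
 	    names.</para>

```python
CACHE_SIZE = 128 * 1024
PAGE_SIZE = 4 * 1024    # assumed page size


def cache_page(physical_address):
    """Return which page-sized slot of the direct-mapped cache an address uses."""
    return (physical_address % CACHE_SIZE) // PAGE_SIZE


def conflict_free(physical_addresses):
    """Return True if no two of the given pages land in the same cache slot."""
    slots = [cache_page(address) for address in physical_addresses]
    return len(slots) == len(set(slots))
```

 	  <para>Offsets 128K and 132K use cache slots 0 and 1, and so do
 	    offsets 128K and 4K, so both placements are conflict free.
 	    Offsets 0 and 128K both use slot 0 and therefore
 	    collide.</para>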
 	</answer>
       </qandaentry>
     </qandaset>
   </sect1>
 </article>
diff --git a/en_US.ISO8859-1/books/arch-handbook/boot/chapter.xml b/en_US.ISO8859-1/books/arch-handbook/boot/chapter.xml
index 65f8c6cfd0..798b7bc6d9 100644
--- a/en_US.ISO8859-1/books/arch-handbook/boot/chapter.xml
+++ b/en_US.ISO8859-1/books/arch-handbook/boot/chapter.xml
@@ -1,2396 +1,2396 @@
 <?xml version="1.0" encoding="iso-8859-1"?>
 <!--
 The FreeBSD Documentation Project
 
 Copyright (c) 2002 Sergey Lyubka <devnull@uptsoft.com>
 All rights reserved
 Copyright (c) 2014 Sergio Andrés Gómez del Real <Sergio.G.delReal@gmail.com>
 All rights reserved
 $FreeBSD$
 -->
 
 <chapter xmlns="http://docbook.org/ns/docbook"
   xmlns:xlink="http://www.w3.org/1999/xlink" version="5.0"
   xml:id="boot">
 
   <info>
     <title>Bootstrapping and Kernel Initialization</title>
 
     <authorgroup>
       <author>
 	<personname>
 	  <firstname>Sergey</firstname>
 	  <surname>Lyubka</surname>
 	</personname>
 
 	<contrib>Contributed by </contrib>
       </author>
       <!-- devnull@uptsoft.com  12 Jun 2002 -->
     </authorgroup>
 
     <authorgroup>
       <author>
 	<personname>
 	  <firstname>Sergio Andr&eacute;s</firstname>
 	  <surname>G&oacute;mez del Real</surname>
 	</personname>
 
 	<contrib>Updated and enhanced by </contrib>
       </author>
       <!-- Sergio.G.DelReal@gmail.com  Jan 2014 -->
     </authorgroup>
   </info>
 
   <sect1 xml:id="boot-synopsis">
     <title>Synopsis</title>
 
     <indexterm><primary>BIOS</primary></indexterm>
     <indexterm><primary>firmware</primary></indexterm>
     <indexterm><primary>POST</primary></indexterm>
     <indexterm><primary>IA-32</primary></indexterm>
     <indexterm><primary>booting</primary></indexterm>
     <indexterm><primary>system initialization</primary></indexterm>
     <para>This chapter is an overview of the boot and system
       initialization processes, starting from the
       <acronym>BIOS</acronym> (firmware) <acronym>POST</acronym>, to
       the first user process creation.  Since the initial
       steps of system startup are very architecture dependent, the
       IA-32 architecture is used as an example.</para>
 
     <para>The &os; boot process can be surprisingly complex.  After
       control is passed from the <acronym>BIOS</acronym>, a
       considerable amount of low-level configuration must be done
       before the kernel can be loaded and executed.  This setup must
       be done in a simple and flexible manner, allowing the user a
       great deal of customization possibilities.</para>
   </sect1>
 
   <sect1 xml:id="boot-overview">
     <title>Overview</title>
 
     <para>The boot process is an extremely machine-dependent
       activity.  Not only must code be written for every computer
       architecture, but there may also be multiple types of booting on
       the same architecture.  For example, a directory listing of
       <filename>/usr/src/sys/boot</filename>
       reveals a great amount of architecture-dependent code.  There is
       a directory for each of the various supported architectures.  In
       the x86-specific <filename>i386</filename>
       directory, there are subdirectories for different boot standards
       like <filename>mbr</filename> (Master Boot Record),
       <filename>gpt</filename> (<acronym>GUID</acronym> Partition
       Table), and <filename>efi</filename> (Extensible Firmware
       Interface).  Each boot standard has its own conventions and data
       structures.  The example that follows shows booting an x86
       computer from an <acronym>MBR</acronym> hard drive with the &os;
       <filename>boot0</filename> multi-boot loader stored in the very
       first sector.  That boot code starts the &os; three-stage boot
       process.</para>
 
     <para>The key to understanding this process is that it is a series
       of stages of increasing complexity.  These stages are
       <filename>boot1</filename>, <filename>boot2</filename>, and
       <filename>loader</filename> (see &man.boot.8; for more detail).
       The boot system executes each stage in sequence.  The last
       stage, <filename>loader</filename>, is responsible for loading
       the &os; kernel.  Each stage is examined in the following
       sections.</para>
 
     <para>Here is an example of the output generated by the
       different boot stages.  Actual output
       may differ from machine to machine:</para>
 
     <informaltable frame="none" pgwide="0">
       <tgroup cols="2">
 	<tbody>
 	  <row>
 	    <entry>&os; Component</entry>
 	    <entry>Output (may vary)</entry>
 	  </row>
 
 	  <row>
 	    <entry><literal>boot0</literal></entry>
 	    <entry><screen>F1    FreeBSD
 F2    BSD
 F5    Disk 2</screen></entry>
 	  </row>
 
 	  <row>
 	    <entry><literal>boot2</literal>
 	      <footnote><para>This prompt will appear if the user
 		  presses a key just after selecting an OS to boot at
 		  the <literal>boot0</literal>
 		  stage.</para></footnote></entry>
 	    <entry><screen>&gt;&gt;FreeBSD/i386 BOOT
 Default: 1:ad(1,a)/boot/loader
 boot:</screen></entry>
 	  </row>
 
 	  <row>
 	    <entry><filename>loader</filename></entry>
 	    <entry><screen>BTX loader 1.00 BTX version is 1.02
 Consoles: internal video/keyboard
 BIOS drive C: is disk0
 BIOS 639kB/2096064kB available memory
 
 FreeBSD/x86 bootstrap loader, Revision 1.1
 Console internal video/keyboard
 (root@snap.freebsd.org, Thu Jan 16 22:18:05 UTC 2014)
 Loading /boot/defaults/loader.conf
 /boot/kernel/kernel text=0xed9008 data=0x117d28+0x176650 syms=[0x8+0x137988+0x8+0x1515f8]</screen></entry>
 	  </row>
 
 	  <row>
 	    <entry>kernel</entry>
 	    <entry><screen>Copyright (c) 1992-2013 The FreeBSD Project.
 Copyright (c) 1979, 1980, 1983, 1986, 1988, 1989, 1991, 1992, 1993, 1994
         The Regents of the University of California. All rights reserved.
 FreeBSD is a registered trademark of The FreeBSD Foundation.
 FreeBSD 10.0-RELEASE #0 r260789: Thu Jan 16 22:34:59 UTC 2014
     root@snap.freebsd.org:/usr/obj/usr/src/sys/GENERIC amd64
 FreeBSD clang version 3.3 (tags/RELEASE_33/final 183502) 20130610</screen></entry>
 	  </row>
 	</tbody>
       </tgroup>
     </informaltable>
   </sect1>
 
   <sect1 xml:id="boot-bios">
     <title>The <acronym>BIOS</acronym></title>
 
     <para>When the computer powers on, the processor's registers are
       set to some predefined values.  One of the registers is the
       <emphasis>instruction pointer</emphasis> register, and its value
       after a power on is well defined: it is a 32-bit value of
       <literal>0xfffffff0</literal>.  The instruction pointer register
       (also known as the Program Counter) points to code to be
       executed by the processor.  Another important register is the
       <literal>cr0</literal> 32-bit control register, and its value
       just after a reboot is <literal>0</literal>.  One of
       <literal>cr0</literal>'s bits, the PE (Protection Enabled) bit,
       indicates whether the processor is running in 32-bit protected
       mode or 16-bit real mode.  Since this bit is cleared at boot
       time, the processor boots in 16-bit real mode.  Real mode means,
       among other things, that linear and physical addresses are
       identical.  The processor does not start
       immediately in 32-bit protected mode because of backwards compatibility.
       In particular, the boot process relies on the services provided
       by the <acronym>BIOS</acronym>, and the <acronym>BIOS</acronym>
       itself works in legacy, 16-bit code.</para>
 
     <para>The value of <literal>0xfffffff0</literal> is slightly less
       than 4&nbsp;GB, so unless the machine has 4&nbsp;GB of physical
       memory, it cannot point to a valid memory address.  The
       computer's hardware translates this address so that it points to
       a <acronym>BIOS</acronym> memory block.</para>
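
     <para>The claim about the reset address is simple arithmetic, shown
       here as a Python sketch.  The helper names are hypothetical, and
       <literal>is_backed_by_ram</literal> assumes that physical memory
       is one contiguous range starting at address 0, which ignores
       memory holes on real machines.</para>

```python
RESET_VECTOR = 0xFFFFFFF0
FOUR_GB = 2 ** 32


def bytes_below_4gb(address):
    """Return how far an address lies below the 4 GB boundary."""
    return FOUR_GB - address


def is_backed_by_ram(address, ram_bytes):
    """Return True if the address falls inside ram_bytes of contiguous RAM."""
    return address in range(ram_bytes)
```

     <para>The reset vector lies 16 bytes below 4&nbsp;GB, so under
       this simplified model it is not backed by <acronym>RAM</acronym>
       on a machine with, say, 2&nbsp;GB of memory.</para>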
 
     <para>The <acronym>BIOS</acronym> (Basic Input Output
       System) is a chip on the motherboard that has a relatively small
       amount of read-only memory (<acronym>ROM</acronym>).  This
       memory contains various low-level routines that are specific to
       the hardware supplied with the motherboard.  The processor will
       first jump to the address <literal>0xfffffff0</literal>, which
       really resides in the <acronym>BIOS</acronym>'s memory.  Usually
       this address contains a jump instruction to the
       <acronym>BIOS</acronym>'s <acronym>POST</acronym> routines.</para>
 
     <para>The <acronym>POST</acronym> (Power On Self Test)
       is a set of routines including the memory check, system bus
       check, and other low-level initialization so the
       <acronym>CPU</acronym> can set up the computer properly.  The
       important step of this stage is determining the boot device.
       Modern <acronym>BIOS</acronym> implementations permit the
       selection of a boot device, allowing booting from a floppy,
       <acronym>CD-ROM</acronym>, hard disk, or other devices.</para>
 
     <para>The very last thing in the <acronym>POST</acronym> is the
       <literal>INT 0x19</literal> instruction.  The
       <literal>INT 0x19</literal> handler reads 512 bytes from the
       first sector of the boot device into the memory at address
       <literal>0x7c00</literal>.  The term
       <emphasis>first sector</emphasis> originates from hard drive
       architecture, where the magnetic plate is divided into a number
       of concentric tracks.  Tracks are numbered, and every track is
       divided into a number (conventionally 63) of sectors.  Track numbers
       start at 0, but sector numbers start from 1.  Track 0 is the
       outermost on the magnetic plate, and sector 1, the first sector,
       has a special purpose.  It is also called the
       <acronym>MBR</acronym>, or Master Boot Record.  The remaining
       sectors on the first track are never used.</para>
 
     <para>This sector is our boot-sequence starting point.  As we will
       see, this sector contains a copy of our
       <filename>boot0</filename> program.  A jump is made by the
       <acronym>BIOS</acronym> to address <literal>0x7c00</literal> so
       it starts executing.</para>
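
     <para>The numbering convention described above (tracks counted
       from 0, sectors counted from 1) is what makes the first sector
       block number 0 in a linear scheme.  The following Python sketch
       shows the usual conversion; the function name
       <literal>chs_to_lba</literal> and the geometry used in the
       example are assumptions for illustration only.</para>

```python
def chs_to_lba(cylinder, head, sector, heads_per_cylinder, sectors_per_track):
    """Convert a cylinder/head/sector address to a zero-based block number.

    Cylinders and heads count from 0; sectors count from 1.
    """
    if sector == 0:
        raise ValueError("sector numbers start at 1")
    return (cylinder * heads_per_cylinder + head) * sectors_per_track + (sector - 1)
```

     <para>For a hypothetical geometry of 16 heads and 63 sectors per
       track, cylinder 0, head 0, sector 1 (the
       <acronym>MBR</acronym>) is block 0, and cylinder 0, head 1,
       sector 1 is block 63.</para>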
   </sect1>
 
   <sect1 xml:id="boot-boot0">
     <title>The Master Boot Record (<literal>boot0</literal>)</title>
 
     <indexterm><primary>MBR</primary></indexterm>
 
     <para>After control is received from the <acronym>BIOS</acronym>
       at memory address <literal>0x7c00</literal>,
       <filename>boot0</filename> starts executing.  It is the first
       piece of code under &os; control.  The task of
       <filename>boot0</filename> is quite simple: scan the partition
       table and let the user choose which partition to boot from.  The
       Partition Table is a special, standard data structure embedded
       in the <acronym>MBR</acronym> (hence embedded in
       <filename>boot0</filename>) describing the four standard PC
       <quote>partitions</quote>
       <footnote>
 	<para><link
 	    xlink:href="http://en.wikipedia.org/wiki/Master_boot_record"></link></para></footnote>.
       <filename>boot0</filename> resides in the filesystem as
       <filename>/boot/boot0</filename>.  It is a small 512-byte file,
       and it is exactly what &os;'s installation procedure wrote to
       the hard disk's <acronym>MBR</acronym> if you chose the
       <quote>bootmanager</quote> option at installation time.  Indeed,
       <filename>boot0</filename> <emphasis>is</emphasis> the
       <acronym>MBR</acronym>.</para>
 
     <para>As mentioned previously, the <literal>INT 0x19</literal>
       instruction causes the <literal>INT 0x19</literal> handler to
       load an <acronym>MBR</acronym> (<filename>boot0</filename>) into
       memory at address <literal>0x7c00</literal>.  The source file
       for <filename>boot0</filename> can be found in
       <filename>sys/boot/i386/boot0/boot0.S</filename>, an
       awesome piece of code written by Robert Nordier.</para>
 
     <para>A special structure starting from offset
       <literal>0x1be</literal> in the <acronym>MBR</acronym> is called
       the <emphasis>partition table</emphasis>.  It has four records
       of 16 bytes each, called <emphasis>partition records</emphasis>,
       which represent how the hard disk is partitioned, or, in &os;'s
       terminology, sliced.  One byte of those 16 says whether a
       partition (slice) is bootable or not.  Exactly one record must
       have that flag set; otherwise, <filename>boot0</filename>'s code
       will refuse to proceed.</para>
 
     <para>A partition record has the following fields:</para>
 
     <itemizedlist>
       <listitem>
 	<para>the 1-byte filesystem type</para>
       </listitem>
 
       <listitem>
 	<para>the 1-byte bootable flag</para>
       </listitem>
 
       <listitem>
 	<para>the 6-byte descriptor in CHS format</para>
       </listitem>
 
       <listitem>
 	<para>the 8-byte descriptor in LBA format</para>
       </listitem>
     </itemizedlist>
 
     <para>A partition record descriptor contains information about
       where exactly the partition resides on the drive.  Both
       descriptors, <acronym>LBA</acronym> and <acronym>CHS</acronym>,
       describe the same information, but in different ways:
       <acronym>LBA</acronym> (Logical Block Addressing) has the
       starting sector for the partition and the partition's length,
       while <acronym>CHS</acronym> (Cylinder Head Sector) has
       coordinates for the first and last sectors of the partition.
       The partition table ends with the special signature
       <literal>0xaa55</literal>.</para>
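
     <para>As an illustration, the partition table can be decoded from
       a raw sector with a few lines of Python.  This is a sketch, not
       the logic of <filename>boot0</filename>: the byte positions
       inside each record follow the conventional
       <acronym>MBR</acronym> layout (flag, first
       <acronym>CHS</acronym>, type, last <acronym>CHS</acronym>,
       starting sector, length), the value <literal>0x80</literal> is
       the conventional <quote>bootable</quote> flag, and the function
       name <literal>parse_partition_table</literal> is
       hypothetical.</para>

```python
PARTITION_TABLE_OFFSET = 0x1BE
RECORD_SIZE = 16
SIGNATURE_OFFSET = 510


def parse_partition_table(sector):
    """Return the four partition records of a 512-byte MBR sector as dicts."""
    if len(sector) != 512:
        raise ValueError("an MBR is exactly 512 bytes")
    if int.from_bytes(sector[SIGNATURE_OFFSET:], "little") != 0xAA55:
        raise ValueError("missing 0xaa55 signature")
    records = []
    for index in range(4):
        start = PARTITION_TABLE_OFFSET + index * RECORD_SIZE
        raw = sector[start:start + RECORD_SIZE]
        records.append({
            "bootable": raw[0] == 0x80,       # 1-byte bootable flag
            "chs_first": raw[1:4],            # CHS of the first sector
            "type": raw[4],                   # 1-byte filesystem type
            "chs_last": raw[5:8],             # CHS of the last sector
            "lba_start": int.from_bytes(raw[8:12], "little"),
            "sector_count": int.from_bytes(raw[12:16], "little"),
        })
    return records
```

     <para>Four records of 16 bytes starting at offset
       <literal>0x1be</literal> end exactly at offset 510, where the
       two signature bytes are stored.</para>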
 
     <para>The <acronym>MBR</acronym> must fit into 512 bytes, a single
       disk sector.  This program uses low-level <quote>tricks</quote>
       like taking advantage of the side effects of certain
       instructions and reusing register values from previous
       operations to make the most out of the fewest possible
       instructions.  Care must also be taken when handling the
       partition table, which is embedded in the <acronym>MBR</acronym>
       itself.  For these reasons, be very careful when modifying
       <filename>boot0.S</filename>.</para>
 
     <para>Note that the <filename>boot0.S</filename> source file
       is assembled <quote>as is</quote>: instructions are translated
       one by one to binary, with no additional information (no
       <acronym>ELF</acronym> file format, for example).  This kind of
       low-level control is achieved at link time through special
       control flags passed to the linker.  For example, the text
       section of the program is set to be located at address
       <literal>0x600</literal>.  In practice this means that
       <filename>boot0</filename> must be loaded to memory address
       <literal>0x600</literal> in order to function properly.</para>
 
     <para>It is worth looking at the <filename>Makefile</filename> for
       <filename>boot0</filename>
       (<filename>sys/boot/i386/boot0/Makefile</filename>), as it
       defines some of the run-time behavior of
       <filename>boot0</filename>.  For instance, if a terminal
       connected to the serial port (COM1) is used for I/O, the macro
       <literal>SIO</literal> must be defined
       (<literal>-DSIO</literal>).  <literal>-DPXE</literal> enables
       boot through <acronym>PXE</acronym> by pressing
       <keycap>F6</keycap>.  Additionally, the program defines a set of
       <emphasis>flags</emphasis> that allow further modification of
       its behavior.  All of this is illustrated in the
       <filename>Makefile</filename>.  For example, look at the
       linker directives which command the linker to start the text
       section at address <literal>0x600</literal>, and to build the
       output file <quote>as is</quote> (strip out any file
       formatting):</para>
 
     <figure xml:id="boot-boot0-makefile-as-is">
       <title><filename>sys/boot/i386/boot0/Makefile</filename></title>
 
       <programlisting>      BOOT_BOOT0_ORG?=0x600
       LDFLAGS=-e start -Ttext ${BOOT_BOOT0_ORG} \
       -Wl,-N,-S,--oformat,binary</programlisting>
     </figure>
 
     <para>Let us now begin our study of the <acronym>MBR</acronym>, or
       <filename>boot0</filename>, starting where execution
       begins.</para>
 
     <note>
       <para>Some modifications have been made to some instructions in
 	favor of better exposition.  For example, some macros are
 	expanded, and some macro tests are omitted when the result of
 	the test is known.  This applies to all of the code examples
 	shown.</para>
     </note>
 
     <figure xml:id="boot-boot0-entrypoint">
       <title><filename>sys/boot/i386/boot0/boot0.S</filename></title>
 
       <programlisting>start:
       cld			# String ops inc
       xorw %ax,%ax		# Zero
       movw %ax,%es		# Address
       movw %ax,%ds		#  data
       movw %ax,%ss		# Set up
       movw $0x7c00,%sp		#  stack</programlisting>
     </figure>
 
     <para>This first block of code is the entry point of the program.
       It is where the <acronym>BIOS</acronym> transfers control.
       First, it makes sure that the string operations autoincrement
       their pointer operands (the <literal>cld</literal> instruction)
       <footnote>
 	<para>When in doubt, we refer the reader to the official Intel
 	  manuals, which describe the exact semantics for each
 	  instruction: <link
 	    xlink:href="http://www.intel.com/content/www/us/en/processors/architectures-software-developer-manuals.html"></link>.</para></footnote>.
       Then, as it makes no assumption about the state of the segment
       registers, it initializes them.  Finally, it sets the stack
       pointer register (<literal>%sp</literal>) to address
       <literal>0x7c00</literal>, so we have a working stack.</para>
 
     <para>The next block is responsible for the relocation and
       subsequent jump to the relocated code.</para>
 
     <figure xml:id="boot-boot0-relocation">
       <title><filename>sys/boot/i386/boot0/boot0.S</filename></title>
 
       <programlisting>      movw $0x7c00,%si	# Source
       movw $0x600,%di		# Destination
       movw $512,%cx		# Byte count
       rep			# Relocate
       movsb			#  code
       movw %di,%bp		# Address variables
       movb $16,%cl		# Bytes to clear
       rep			# Zero
       stosb			#  them
       incb -0xe(%di)		# Set the S field to 1
       jmp main-0x7c00+0x600	# Jump to relocated code</programlisting>
     </figure>
 
-    <para>Because <filename>boot0</filename> is loaded by the
+    <para>As <filename>boot0</filename> is loaded by the
       <acronym>BIOS</acronym> to address <literal>0x7c00</literal>, it
       copies itself to address <literal>0x600</literal> and then
       transfers control there (recall that it was linked to execute at
       address <literal>0x600</literal>).  The source address,
       <literal>0x7c00</literal>, is copied to register
       <literal>%si</literal>, and the destination address,
       <literal>0x600</literal>, to register <literal>%di</literal>.
       The number of bytes to copy, <literal>512</literal> (the
       program's size), is copied to register <literal>%cx</literal>.
       Next, the <literal>rep</literal> instruction repeats the
       instruction that follows, that is, <literal>movsb</literal>, the
       number of times dictated by the <literal>%cx</literal> register.
       The <literal>movsb</literal> instruction copies the byte pointed
       to by <literal>%si</literal> to the address pointed to by
       <literal>%di</literal>.  This is repeated another 511 times.  On
       each repetition, both the source and destination registers,
       <literal>%si</literal> and <literal>%di</literal>, are
       incremented by one.  Thus, upon completion of the 512-byte copy,
       <literal>%di</literal> has the value
       <literal>0x600</literal>+<literal>512</literal>=
       <literal>0x800</literal>, and <literal>%si</literal> has the
       value <literal>0x7c00</literal>+<literal>512</literal>=
       <literal>0x7e00</literal>; we have thus completed the code
       <emphasis>relocation</emphasis>.</para>
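 
     <para>The copy can be modelled outside the boot environment.  The
       following Python sketch is only an illustration and not part of
       <filename>boot0</filename>: the helper names
       <literal>rep_movsb</literal> and <literal>relocate</literal> are
       hypothetical, and a 64&nbsp;KB <literal>bytearray</literal>
       stands in for real-mode memory.  It reproduces the final
       register values derived above.</para>
 
     <programlisting><![CDATA[
```python
# Sketch of the self-relocation performed with "rep movsb".
# The bytearray is an assumed stand-in for real-mode memory (segment 0).

LOAD = 0x7c00    # where the BIOS places the sector
ORIGIN = 0x600   # where boot0 is linked to run
SIZE = 512       # one disk sector


def rep_movsb(mem, si, di, cx):
    """Copy cx bytes from mem[si] to mem[di], one byte per step."""
    while cx:
        mem[di] = mem[si]
        si += 1          # direction flag cleared by cld: pointers go up
        di += 1
        cx -= 1
    return si, di, cx


def relocate(sector):
    """Place a sector at LOAD, copy it to ORIGIN, return (mem, si, di)."""
    mem = bytearray(0x10000)
    mem[LOAD:LOAD + SIZE] = sector
    si, di, _ = rep_movsb(mem, LOAD, ORIGIN, SIZE)
    return mem, si, di


if __name__ == "__main__":
    _, si, di = relocate(bytes(range(256)) * 2)
    print(hex(si), hex(di))
```
]]></programlisting>
 
     <para>For any 512-byte input, <literal>si</literal> should end at
       <literal>0x7e00</literal> and <literal>di</literal> at
       <literal>0x800</literal>, matching the arithmetic in the
       text.</para>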
 
     <para>Next, the destination register
       <literal>%di</literal> is copied to <literal>%bp</literal>.
       <literal>%bp</literal> gets the value <literal>0x800</literal>.
       The value <literal>16</literal> is copied to
       <literal>%cl</literal> in preparation for a new string operation
       (like our previous <literal>movsb</literal>).  Now,
       <literal>stosb</literal> is executed 16 times.  This instruction
       copies a <literal>0</literal> value to the address pointed to by
       the destination register (<literal>%di</literal>, which is
       <literal>0x800</literal>), and increments it.  This is repeated
       another 15 times, so <literal>%di</literal> ends up with value
       <literal>0x810</literal>.  Effectively, this clears the address
       range <literal>0x800</literal>-<literal>0x80f</literal>.  This
       range is used as a (fake) partition table for writing the
       <acronym>MBR</acronym> back to disk.  Finally, the sector field
       for the <acronym>CHS</acronym> addressing of this fake partition
       is given the value 1 and a jump is made to the main function
       from the relocated code.  Note that until this jump to the
       relocated code, any reference to an absolute address was
       avoided.</para>
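 
     <para>The clearing step and the increment that follows it can be
       sketched in the same way.  This is a hedged model, not code from
       <filename>boot0</filename>: <literal>build_fake_entry</literal>
       is a hypothetical name, and the sketch assumes that
       <literal>%al</literal> is still zero from the earlier
       <literal>xorw</literal>.</para>
 
     <programlisting><![CDATA[
```python
# Sketch of "rep stosb" followed by "incb -0xe(%di)".
# mem is an assumed stand-in for real-mode memory.

def build_fake_entry(mem, di=0x800, count=16):
    """Zero count bytes starting at di, then set the sector field to 1."""
    bp = di                     # movw %di,%bp
    for _ in range(count):      # rep stosb with %al == 0
        mem[di] = 0
        di += 1
    mem[di - 0xe] += 1          # incb -0xe(%di): the byte at bp + 2
    return bp, di


if __name__ == "__main__":
    memory = bytearray(0x10000)
    bp, di = build_fake_entry(memory)
    print(hex(bp), hex(di), memory[0x800:0x810].hex())
```
]]></programlisting>
 
     <para>The incremented byte sits at
       <literal>0x810</literal>-<literal>0xe</literal>=
       <literal>0x802</literal>, that is, two bytes into the cleared
       range.</para>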
 
     <para>The following code block tests whether the drive number
       provided by the <acronym>BIOS</acronym> should be used, or
       the one stored in <filename>boot0</filename>.</para>
 
     <figure xml:id="boot-boot0-drivenumber">
       <title><filename>sys/boot/i386/boot0/boot0.S</filename></title>
 
       <programlisting>main:
       testb $SETDRV,-69(%bp)	# Set drive number?
       jnz disable_update	# Yes
       testb %dl,%dl		# Drive number valid?
       js save_curdrive		# Possibly (0x80 set)</programlisting>
     </figure>
 
     <para>This code tests the <literal>SETDRV</literal> bit
       (<literal>0x20</literal>) in the <emphasis>flags</emphasis>
       variable.  Recall that register <literal>%bp</literal> points to
       address <literal>0x800</literal>, so the test is performed
       on the <emphasis>flags</emphasis> variable at address
       <literal>0x800</literal>-<literal>69</literal>=
       <literal>0x7bb</literal>.  This is an example of the type of
       modifications that can be done to <filename>boot0</filename>.
       The <literal>SETDRV</literal> flag is not set by default, but it
       can be set in the <filename>Makefile</filename>.  When set, the
       drive number stored in the <acronym>MBR</acronym> is used
       instead of the one provided by the <acronym>BIOS</acronym>.  We
       assume the defaults, and that the <acronym>BIOS</acronym>
       provided a valid drive number, so we jump to
       <literal>save_curdrive</literal>.</para>
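 
     <para>The address arithmetic and the bit test are small enough to
       check by hand, but a short Python sketch makes them explicit.
       The function names are hypothetical; only the constants
       <literal>0x20</literal>, <literal>0x800</literal> and
       <literal>-69</literal> come from the text above.</para>
 
     <programlisting><![CDATA[
```python
SETDRV = 0x20        # bit tested by "testb $SETDRV,-69(%bp)"
BP = 0x800           # value left in %bp by the relocation code
FLAGS_OFFSET = -69   # displacement used in the listing


def flags_address(bp=BP):
    """Address of the flags variable relative to %bp."""
    return bp + FLAGS_OFFSET


def use_stored_drive(flags):
    """True when SETDRV is set, so the drive number kept in the MBR wins."""
    return bool(flags & SETDRV)


if __name__ == "__main__":
    print(hex(flags_address()), use_stored_drive(0x00))
```
]]></programlisting>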
 
     <para>The next block saves the drive number provided by the
       <acronym>BIOS</acronym>, and calls <literal>putn</literal> to
       print a new line on the screen.</para>
 
     <figure xml:id="boot-boot0-savedrivenumber">
       <title><filename>sys/boot/i386/boot0/boot0.S</filename></title>
 
       <programlisting>save_curdrive:
       movb %dl, (%bp)		# Save drive number
       pushw %dx			# Also in the stack
 #ifdef	TEST	/* test code, print internal bios drive */
       rolb $1, %dl
       movw $drive, %si
       call putkey
 #endif
       callw putn		# Print a newline</programlisting>
     </figure>
 
     <para>Note that we assume <varname>TEST</varname> is not defined,
       so the conditional code in it is not assembled and will not
       appear in our executable <filename>boot0</filename>.</para>
 
     <para>Our next block implements the actual scanning of the
       partition table.  It prints to the screen the partition type for
       each of the four entries in the partition table.  It compares
       each type with a list of well-known operating system file
       systems.  Examples of recognized partition types are
       <acronym>NTFS</acronym> (&windows;, ID 0x7),
       <literal>ext2fs</literal> (&linux;, ID 0x83), and, of course,
       <literal>ffs</literal>/<literal>ufs2</literal> (&os;, ID 0xa5).
       The implementation is fairly simple.</para>
 
     <figure xml:id="boot-boot0-partition-scan">
       <title><filename>sys/boot/i386/boot0/boot0.S</filename></title>
 
       <programlisting>      movw $(partbl+0x4),%bx	# Partition table (+4)
       xorw %dx,%dx		# Item number
 
 read_entry:
       movb %ch,-0x4(%bx)	# Zero active flag (ch == 0)
       btw %dx,_FLAGS(%bp)	# Entry enabled?
       jnc next_entry		# No
       movb (%bx),%al		# Load type
       test %al, %al		# skip empty partition
       jz next_entry
       movw $bootable_ids,%di	# Lookup tables
       movb $(TLEN+1),%cl	# Number of entries
       repne			# Locate
       scasb			#  type
       addw $(TLEN-1), %di	# Adjust
       movb (%di),%cl		# Partition
       addw %cx,%di		#  description
       callw putx		# Display it
 
 next_entry:
       incw %dx			# Next item
       addb $0x10,%bl		# Next entry
       jnc read_entry		# Till done</programlisting>
     </figure>
 
     <para>It is important to note that the active flag for each entry
       is cleared, so after the scanning, <emphasis>no</emphasis>
       partition entry is active in our memory copy of
       <filename>boot0</filename>.  Later, the active flag will be set
       for the selected partition.  This ensures that only one active
       partition exists if the user chooses to write the changes back
       to disk.</para>
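 
     <para>The scan can be paraphrased in a few lines of Python.  This
       is a simplified sketch under stated assumptions: the table is
       taken to start at offset <literal>0x1be</literal> of the sector,
       the per-entry enable bits in the <emphasis>flags</emphasis>
       variable are ignored, and <literal>KNOWN_TYPES</literal> lists
       only the three IDs mentioned above.  The names
       <literal>scan</literal> and <literal>KNOWN_TYPES</literal> are
       hypothetical.</para>
 
     <programlisting><![CDATA[
```python
PARTBL = 0x1be      # assumed offset of the partition table in the sector
ENTRY_SIZE = 0x10   # bytes per entry ("addb $0x10,%bl")
# Illustrative subset of type IDs, taken from the text.
KNOWN_TYPES = {0x07: "NTFS", 0x83: "ext2fs", 0xa5: "FreeBSD"}


def scan(sector):
    """Clear every active flag; return [(item, type, name)] for used entries."""
    found = []
    for item in range(4):
        base = PARTBL + item * ENTRY_SIZE
        sector[base] = 0                  # zero the active flag
        ptype = sector[base + 4]          # load the type byte
        if ptype == 0:                    # skip empty entries
            continue
        found.append((item, ptype, KNOWN_TYPES.get(ptype, "unknown")))
    return found


if __name__ == "__main__":
    image = bytearray(512)
    image[PARTBL] = 0x80
    image[PARTBL + 4] = 0xa5
    print(scan(image))
```
]]></programlisting>
 
     <para>After the call, no entry in the in-memory copy is marked
       active, which mirrors the behavior described above.</para>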
 
     <para>The next block tests for other drives.  At startup,
       the <acronym>BIOS</acronym> writes the number of drives present
       in the computer to address <literal>0x475</literal>.  If there
       are any other drives present, <filename>boot0</filename> prints
       the current drive to screen.  The user may command
       <filename>boot0</filename> to scan partitions on another drive
       later.</para>
 
     <figure xml:id="boot-boot0-test-drives">
       <title><filename>sys/boot/i386/boot0/boot0.S</filename></title>
 
       <programlisting>      popw %ax			# Drive number
       subb $0x7f,%al		# Does next
       cmpb 0x475,%al		#  drive exist? (from BIOS?)
       jb print_drive		# Yes
       decw %ax			# Already drive 0?
       jz print_prompt		# Yes</programlisting>
     </figure>
 
     <para>We make the assumption that a single drive is present, so
       the jump to <literal>print_drive</literal> is not performed.  We
       also assume nothing strange happened, so we jump to
       <literal>print_prompt</literal>.</para>
 
     <para>This next block just prints out a prompt followed by the
       default option:</para>
 
     <figure xml:id="boot-boot0-prompt">
       <title><filename>sys/boot/i386/boot0/boot0.S</filename></title>
 
       <programlisting>print_prompt:
       movw $prompt,%si		# Display
       callw putstr		#  prompt
       movb _OPT(%bp),%dl	# Display
       decw %si			#  default
       callw putkey		#  key
       jmp start_input		# Skip beep</programlisting>
     </figure>
 
     <para>Finally, a jump is performed to
       <literal>start_input</literal>, where the
       <acronym>BIOS</acronym> services are used to start a timer and
       to read user input from the keyboard; if the timer expires,
       the default option will be selected:</para>
 
     <figure xml:id="boot-boot0-start-input">
       <title><filename>sys/boot/i386/boot0/boot0.S</filename></title>
 
       <programlisting>start_input:
       xorb %ah,%ah		# BIOS: Get
       int $0x1a			#  system time
       movw %dx,%di		# Ticks when
       addw _TICKS(%bp),%di	#  timeout
 read_key:
       movb $0x1,%ah		# BIOS: Check
       int $0x16			#  for keypress
       jnz got_key		# Have input
       xorb %ah,%ah		# BIOS: int 0x1a, 00
       int $0x1a			#  get system time
       cmpw %di,%dx		# Timeout?
       jb read_key		# No</programlisting>
     </figure>
 
     <para>An interrupt is requested with number
       <literal>0x1a</literal> and argument <literal>0</literal> in
       register <literal>%ah</literal>.  The <acronym>BIOS</acronym>
       has a predefined set of services, requested by applications as
       software-generated interrupts through the <literal>int</literal>
       instruction and receiving arguments in registers (in this case,
       <literal>%ah</literal>).  Here, particularly, we are requesting
       the number of clock ticks since last midnight; this value is
       maintained by the <acronym>BIOS</acronym>, which increments a
       counter on every interrupt from the system timer.  The
       <acronym>BIOS</acronym> programs this timer to tick at roughly
       18.2&nbsp;Hz at startup.  When the request is satisfied, a
       32-bit result is returned by the <acronym>BIOS</acronym> in
       registers <literal>%cx</literal> and <literal>%dx</literal>
       (low-order word in <literal>%dx</literal>).  This result (the
       <literal>%dx</literal> part) is copied to register
       <literal>%di</literal>, and the value of the
       <varname>TICKS</varname> variable is added to
       <literal>%di</literal>.  This variable resides in
       <filename>boot0</filename> at offset <literal>_TICKS</literal>
       (a negative value) from register <literal>%bp</literal> (which,
       recall, points to <literal>0x800</literal>).  The default value
       of this variable is <literal>0xb6</literal> (182 in decimal).
       Now, the idea is that <filename>boot0</filename> constantly
       requests the time from the <acronym>BIOS</acronym>, and when the
       value returned in register <literal>%dx</literal> is no longer
       less than the value stored in <literal>%di</literal>, the time is up
       and the default selection will be made.  Since the timer ticks
       18.2 times per second, this condition will be met after 10
       seconds (this default behavior can be changed in the
       <filename>Makefile</filename>).  Until this time has passed,
       <filename>boot0</filename> continually asks the
       <acronym>BIOS</acronym> for any user input; this is done through
       <literal>int 0x16</literal>, argument <literal>1</literal> in
       <literal>%ah</literal>.</para>
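 
     <para>The timeout arithmetic can be checked with a short Python
       sketch.  It is an illustration only: the function names are
       hypothetical, and the tick rate of 18.2 per second and the
       default of <literal>0xb6</literal> ticks are the values quoted
       in the text.  The sketch also models the fact that
       <literal>addw</literal> and <literal>cmpw</literal> operate on
       16-bit values.</para>
 
     <programlisting><![CDATA[
```python
TICKS_PER_SECOND = 18.2   # tick rate quoted in the text
DEFAULT_TICKS = 0xb6      # default value of the TICKS variable


def timeout_seconds(ticks=DEFAULT_TICKS):
    """Length of the wait in seconds for a given tick budget."""
    return ticks / TICKS_PER_SECOND


def deadline(dx, ticks=DEFAULT_TICKS):
    """addw _TICKS(%bp),%di: 16-bit addition, so the sum wraps at 0x10000."""
    return (dx + ticks) & 0xffff


def timed_out(dx, di):
    """cmpw %di,%dx / jb read_key: polling continues while dx is below di."""
    return dx >= di


if __name__ == "__main__":
    print(timeout_seconds(), hex(deadline(0x1000)))
```
]]></programlisting>
 
     <para>Because only the low-order word of the tick count is used,
       the deadline wraps around when the sum exceeds
       <literal>0xffff</literal>; the sketch makes that case easy to
       experiment with.</para>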
 
     <para>Whether a key was pressed or the time expired, subsequent
       code validates the selection.  Based on the selection, the
       register <literal>%si</literal> is set to point to the
       appropriate partition entry in the partition table.  This new
       selection overrides the previous default one.  Indeed, it
       becomes the new default.  Finally, the ACTIVE flag of the
       selected partition is set.  If updates were enabled at compile time,
       the in-memory version of <filename>boot0</filename> with these
       modified values is written back to the <acronym>MBR</acronym> on
       disk.  We leave the details of this implementation to the
       reader.</para>
 
     <para>We now end our study with the last code block from the
       <filename>boot0</filename> program:</para>
 
     <figure xml:id="boot-boot0-check-bootable">
       <title><filename>sys/boot/i386/boot0/boot0.S</filename></title>
 
       <programlisting>      movw $0x7c00,%bx		# Address for read
       movb $0x2,%ah		# Read sector
       callw intx13		#  from disk
       jc beep			# If error
       cmpw $0xaa55,0x1fe(%bx)	# Bootable?
       jne beep			# No
       pushw %si			# Save ptr to selected part.
       callw putn		# Leave some space
       popw %si			# Restore, next stage uses it
       jmp *%bx			# Invoke bootstrap</programlisting>
     </figure>
 
     <para>Recall that <literal>%si</literal> points to the selected
       partition entry.  This entry tells us where the partition begins
       on disk.  We assume, of course, that the partition selected is
       actually a &os; slice.</para>
 
     <note>
       <para>From now on, we will favor the use of the technically
 	more accurate term <quote>slice</quote> rather than
 	<quote>partition</quote>.</para>
     </note>
 
     <para>The transfer buffer is set to <literal>0x7c00</literal>
       (register <literal>%bx</literal>), and a read for the first
       sector of the &os; slice is requested by calling
       <literal>intx13</literal>.  We assume that everything went okay,
       so a jump to <literal>beep</literal> is not performed.  In
       particular, the new sector read must end with the magic sequence
       <literal>0xaa55</literal>.  Finally, the value in
       <literal>%si</literal> (the pointer to the selected partition
       table entry) is preserved for use by the next stage, and a jump is
       performed to address <literal>0x7c00</literal>, where execution
       of our next stage (the just-read block) is started.</para>
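 
     <para>The signature test is easy to reproduce.  The sketch below
       is a hedged illustration with a hypothetical helper name,
       <literal>is_bootable</literal>; it compares the little-endian
       word at offset <literal>0x1fe</literal> of a sector with
       <literal>0xaa55</literal>, as the <literal>cmpw</literal>
       instruction in the listing does.</para>
 
     <programlisting><![CDATA[
```python
SIGNATURE = 0xaa55
SIG_OFFSET = 0x1fe


def is_bootable(sector):
    """Compare the little-endian word at offset 0x1fe with 0xaa55."""
    word = int.from_bytes(sector[SIG_OFFSET:SIG_OFFSET + 2], "little")
    return word == SIGNATURE


if __name__ == "__main__":
    image = bytearray(512)
    image[0x1fe] = 0x55      # low byte is stored first
    image[0x1ff] = 0xaa
    print(is_bootable(image))
```
]]></programlisting>
 
     <para>Note that the bytes appear on disk as
       <literal>0x55</literal> followed by <literal>0xaa</literal>,
       since the word is stored in little-endian order.</para>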
   </sect1>
 
   <sect1 xml:id="boot-boot1">
     <title><literal>boot1</literal> Stage</title>
 
     <para>So far we have gone through the following sequence:</para>
 
     <itemizedlist>
       <listitem>
 	<para>The <acronym>BIOS</acronym> did some early hardware
 	  initialization, including the <acronym>POST</acronym>.  The
 	  <acronym>MBR</acronym> (<filename>boot0</filename>) was
 	  loaded from absolute disk sector one to address
 	  <literal>0x7c00</literal>.  Execution control was passed to
 	  that location.</para>
       </listitem>
 
       <listitem>
 	<para><filename>boot0</filename> relocated itself to the
 	  location it was linked to execute
 	  (<literal>0x600</literal>), followed by a jump to continue
 	  execution at the appropriate place.  Finally,
 	  <filename>boot0</filename> loaded the first disk sector from
 	  the &os; slice to address <literal>0x7c00</literal>.
 	  Execution control was passed to that location.</para>
       </listitem>
     </itemizedlist>
 
     <para><filename>boot1</filename> is the next step in the
       boot-loading sequence.  It is the first of three boot stages.
       Note that we have been dealing exclusively
       with disk sectors.  Indeed, the <acronym>BIOS</acronym> loads
       the absolute first sector, while <filename>boot0</filename>
       loads the first sector of the &os; slice.  Both loads are to
       address <literal>0x7c00</literal>.  We can conceptually think of
       these disk sectors as containing the files
       <filename>boot0</filename> and <filename>boot1</filename>,
       respectively, but in reality this is not entirely true for
       <filename>boot1</filename>.  Strictly speaking, unlike
       <filename>boot0</filename>, <filename>boot1</filename> is not
       part of the boot blocks
       <footnote>
 	<para>There is a file <filename>/boot/boot1</filename>, but it
 	  is not written to the beginning of the &os; slice.
 	  Instead, it is concatenated with <filename>boot2</filename>
 	  to form <filename>boot</filename>, which
 	  <emphasis>is</emphasis> written to the beginning of the &os;
 	  slice and read at boot time.</para></footnote>.
       Instead, a single, full-blown file, <filename>boot</filename>
       (<filename>/boot/boot</filename>), is what ultimately is
       written to disk.  This file is a combination of
       <filename>boot1</filename>, <filename>boot2</filename> and the
       <quote>Boot Extender</quote> (or <acronym>BTX</acronym>).
       This single file is greater in size than a single sector
       (greater than 512 bytes).  Fortunately,
       <filename>boot1</filename> occupies <emphasis>exactly</emphasis>
       the first 512 bytes of this single file, so when
       <filename>boot0</filename> loads the first sector of the &os;
       slice (512 bytes), it is actually loading
       <filename>boot1</filename> and transferring control to
       it.</para>
 
     <para>The main task of <filename>boot1</filename> is to load the
       next boot stage.  This next stage is somewhat more complex.  It
       is composed of a server called the <quote>Boot Extender</quote>,
       or <acronym>BTX</acronym>, and a client, called
       <filename>boot2</filename>.  As we will see, the last boot
       stage, <filename>loader</filename>, is also a client of the
       <acronym>BTX</acronym> server.</para>
 
     <para>Let us now look in detail at what exactly is done by
       <filename>boot1</filename>, starting, as we did for
       <filename>boot0</filename>, at its entry point:</para>
 
     <figure xml:id="boot-boot1-entry">
       <title><filename>sys/boot/i386/boot2/boot1.S</filename></title>
 
       <programlisting>start:
 	jmp main</programlisting>
     </figure>
 
     <para>The entry point at <literal>start</literal> simply jumps
       past a special data area to the label <literal>main</literal>,
       which in turn looks like this:</para>
 
     <figure xml:id="boot-boot1-main">
       <title><filename>sys/boot/i386/boot2/boot1.S</filename></title>
 
       <programlisting>main:
       cld			# String ops inc
       xor %cx,%cx		# Zero
       mov %cx,%es		# Address
       mov %cx,%ds		#  data
       mov %cx,%ss		# Set up
       mov $start,%sp		#  stack
       mov %sp,%si		# Source
       mov $0x700,%di		# Destination
       incb %ch			# Word count
       rep			# Copy
       movsw			#  code</programlisting>
     </figure>
 
     <para>Just like <filename>boot0</filename>, this
       code relocates <filename>boot1</filename>,
       this time to memory address <literal>0x700</literal>.  However,
       unlike <filename>boot0</filename>, it does not jump there.
       <filename>boot1</filename> is linked to execute at
       address <literal>0x7c00</literal>, effectively where it was
       loaded in the first place.  The reason for this relocation will
       be discussed shortly.</para>
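 
     <para>The word count in this listing is built indirectly:
       <literal>%cx</literal> is zeroed and then its high byte is
       incremented.  The following Python sketch, with hypothetical
       helper names, works through that step and the resulting copy
       length.  It assumes that <literal>start</literal> is at
       <literal>0x7c00</literal>, as stated above.</para>
 
     <programlisting><![CDATA[
```python
def word_count_after_incb_ch(cx=0):
    """incb %ch adds 1 to the high byte of %cx, i.e. 0x100 to the register."""
    ch = ((cx >> 8) + 1) & 0xff
    return (ch << 8) | (cx & 0xff)


def rep_movsw_span(si, di, cx):
    """Return the final (si, di) after copying cx two-byte words."""
    return si + 2 * cx, di + 2 * cx


if __name__ == "__main__":
    count = word_count_after_incb_ch()
    print(hex(count), [hex(v) for v in rep_movsw_span(0x7c00, 0x700, count)])
```
]]></programlisting>
 
     <para>A count of <literal>0x100</literal> words corresponds to 512
       bytes, so the copy covers
       <literal>0x700</literal>-<literal>0x8ff</literal>.</para>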
 
     <para>Next comes a loop that looks for the &os; slice.  Although
       <filename>boot0</filename> loaded <filename>boot1</filename>
       from the &os; slice, no information was passed to it about this
       <footnote>
 	<para>Actually we did pass a pointer to the slice entry in
 	  register <literal>%si</literal>.  However,
 	  <filename>boot1</filename> does not assume that it was
 	  loaded by <filename>boot0</filename> (perhaps some other
 	  <acronym>MBR</acronym> loaded it, and did not pass this
 	  information), so it assumes nothing.</para></footnote>,
       so <filename>boot1</filename> must rescan the
       partition table to find where the &os; slice starts.  Therefore
       it rereads the <acronym>MBR</acronym>:</para>
 
     <figure xml:id="boot-boot1-find-freebsd">
       <title><filename>sys/boot/i386/boot2/boot1.S</filename></title>
 
       <programlisting>      mov $part4,%si		# Partition
       cmpb $0x80,%dl		# Hard drive?
       jb main.4			# No
       movb $0x1,%dh		# Block count
       callw nread		# Read MBR</programlisting>
     </figure>
 
     <para>In the code above, register <literal>%dl</literal>
       maintains information about the boot device.  This is passed on
       by the <acronym>BIOS</acronym> and preserved by the
       <acronym>MBR</acronym>.  Numbers <literal>0x80</literal> and
       greater tell us that we are dealing with a hard drive, so a
       call is made to <literal>nread</literal>, where the
       <acronym>MBR</acronym> is read.  Arguments to
       <literal>nread</literal> are passed through
       <literal>%si</literal> and <literal>%dh</literal>.  The memory
       address at label <literal>part4</literal> is copied to
       <literal>%si</literal>.  This memory address holds a
       <quote>fake partition</quote> to be used by
       <literal>nread</literal>.  The following is the data in the fake
       partition:</para>
 
     <figure xml:id="boot-boot2-make-fake-partition">
       <title><filename>sys/boot/i386/boot2/boot1.S</filename></title>
 
       <programlisting>      part4:
 	.byte 0x80, 0x00, 0x01, 0x00
 	.byte 0xa5, 0xfe, 0xff, 0xff
 	.byte 0x00, 0x00, 0x00, 0x00
 	.byte 0x50, 0xc3, 0x00, 0x00</programlisting>
     </figure>
 
     <para>In particular, the <acronym>LBA</acronym> for this fake
       partition is hardcoded to zero.  This is used as an argument to
       the <acronym>BIOS</acronym> for reading absolute sector one from
       the hard drive.  Alternatively, CHS addressing could be used.
       In this case, the fake partition holds cylinder 0, head 0 and
       sector 1, which is equivalent to absolute sector one.</para>
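 
     <para>To make the field values explicit, the sketch below decodes
       the sixteen bytes at <literal>part4</literal>.  It assumes the
       conventional <acronym>MBR</acronym> partition entry layout,
       which is not spelled out in the text, and
       <literal>decode_entry</literal> is a hypothetical helper
       name.</para>
 
     <programlisting><![CDATA[
```python
# The sixteen bytes of the fake partition, copied from the listing.
PART4 = bytes([
    0x80, 0x00, 0x01, 0x00,
    0xa5, 0xfe, 0xff, 0xff,
    0x00, 0x00, 0x00, 0x00,
    0x50, 0xc3, 0x00, 0x00,
])


def decode_entry(entry):
    """Split a 16-byte partition entry into fields (assumed MBR layout)."""
    return {
        "flag": entry[0],
        "start_head": entry[1],
        "start_sector": entry[2] & 0x3f,                        # low six bits
        "start_cylinder": ((entry[2] & 0xc0) << 2) | entry[3],  # ten bits
        "type": entry[4],
        "lba_start": int.from_bytes(entry[8:12], "little"),
        "lba_count": int.from_bytes(entry[12:16], "little"),
    }


if __name__ == "__main__":
    for name, value in decode_entry(PART4).items():
        print(name, hex(value))
```
]]></programlisting>
 
     <para>Under that layout, the entry decodes to a starting
       <acronym>LBA</acronym> of zero and a starting
       <acronym>CHS</acronym> address of cylinder 0, head 0, sector 1,
       in agreement with the description above.</para>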
 
     <para>Let us now proceed to take a look at
       <literal>nread</literal>:</para>
 
     <figure xml:id="boot-boot1-nread">
       <title><filename>sys/boot/i386/boot2/boot1.S</filename></title>
 
       <programlisting>nread:
       mov $0x8c00,%bx		# Transfer buffer
       mov 0x8(%si),%ax		# Get
       mov 0xa(%si),%cx		#  LBA
       push %cs			# Read from
       callw xread.1		#  disk
       jnc return		# If success, return</programlisting>
     </figure>
 
     <para>Recall that <literal>%si</literal> points to the fake
       partition.  The word
       <footnote>
 	<para>In the context of 16-bit real mode, a word is 2
 	  bytes.</para></footnote>
       at offset <literal>0x8</literal> is copied to register
       <literal>%ax</literal> and the word at offset
       <literal>0xa</literal> to <literal>%cx</literal>.  Together they
       are interpreted by the <acronym>BIOS</acronym> as the lower four
       bytes of the LBA to be read (the upper four bytes are assumed to
       be zero).
       Register <literal>%bx</literal> holds the memory address where
       the <acronym>MBR</acronym> will be loaded.  The instruction
       pushing <literal>%cs</literal> onto the stack is very
       interesting.  In this context, it accomplishes nothing.
       However, as we will see shortly, <filename>boot2</filename>, in
       conjunction with the <acronym>BTX</acronym> server, also uses
       <literal>xread.1</literal>.  This mechanism will be discussed in
       the next section.</para>
 
     <para>The code at <literal>xread.1</literal> further calls
       the <literal>read</literal> function, which actually calls the
       <acronym>BIOS</acronym> asking for the disk sector:</para>
 
     <figure xml:id="boot-boot1-xread1">
       <title><filename>sys/boot/i386/boot2/boot1.S</filename></title>
 
       <programlisting>xread.1:
 	pushl $0x0		#  absolute
 	push %cx		#  block
 	push %ax		#  number
 	push %es		# Address of
 	push %bx		#  transfer buffer
 	xor %ax,%ax		# Number of
 	movb %dh,%al		#  blocks to
 	push %ax		#  transfer
 	push $0x10		# Size of packet
 	mov %sp,%bp		# Packet pointer
 	callw read		# Read from disk
 	lea 0x10(%bp),%sp	# Clear stack
 	lret			# To far caller</programlisting>
     </figure>
 
     <para>Note the long return instruction at the end of this block.
       This instruction pops out the <literal>%cs</literal> register
       pushed by <literal>nread</literal>, and returns.  Finally,
       <literal>nread</literal> also returns.</para>
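 
     <para>The pushes in <literal>xread.1</literal> leave a 16-byte
       packet on the stack, with the last value pushed at the lowest
       address.  The sketch below lays out the same bytes in Python.
       It is an illustration with a hypothetical function name,
       <literal>disk_address_packet</literal>, and the field meanings
       are read off the comments in the listing.</para>
 
     <programlisting><![CDATA[
```python
import struct


def disk_address_packet(lba_low, lba_high, segment, offset, count):
    """Return the 16 bytes built by xread.1, lowest address first."""
    lba = (lba_high << 16) | lba_low   # %cx:%ax; upper 32 bits pushed as 0
    return struct.pack(
        "<BBHHHQ",
        0x10,      # size of packet ("push $0x10", low byte)
        0,         # high byte of the same pushed word
        count,     # number of blocks to transfer (from %dh)
        offset,    # transfer buffer offset (%bx)
        segment,   # transfer buffer segment (%es)
        lba,       # absolute block number
    )


if __name__ == "__main__":
    print(disk_address_packet(0, 0, 0, 0x8c00, 1).hex())
```
]]></programlisting>
 
     <para>For the <acronym>MBR</acronym> read discussed above, the
       block number is zero, the count is one, and the buffer is at
       <literal>0x8c00</literal> in segment zero.</para>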
 
     <para>With the <acronym>MBR</acronym> loaded to memory, the actual
       loop for searching the &os; slice begins:</para>
 
     <figure xml:id="boot-boot1-find-part">
       <title><filename>sys/boot/i386/boot2/boot1.S</filename></title>
 
       <programlisting>	mov $0x1,%cx		 # Two passes
 main.1:
 	mov $0x8dbe,%si # Partition table
 	movb $0x1,%dh		 # Partition
 main.2:
 	cmpb $0xa5,0x4(%si)	 # Our partition type?
 	jne main.3		 # No
 	jcxz main.5		 # If second pass
 	testb $0x80,(%si)	 # Active?
 	jnz main.5		 # Yes
 main.3:
 	add $0x10,%si		 # Next entry
 	incb %dh		 # Partition
 	cmpb $0x5,%dh		 # In table?
 	jb main.2		 # Yes
 	dec %cx			 # Do two
 	jcxz main.1		 #  passes</programlisting>
     </figure>
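 
     <para>The control flow of the listing above can be paraphrased in
       Python.  This is a sketch rather than a translation: the table
       is reduced to four <literal>(flag, type)</literal> pairs, and
       <literal>find_slice</literal> is a hypothetical name.  The first
       pass accepts only an active entry of type
       <literal>0xa5</literal>; the second pass accepts any entry of
       that type.</para>
 
     <programlisting><![CDATA[
```python
FREEBSD_TYPE = 0xa5
ACTIVE = 0x80


def find_slice(entries):
    """Return the 1-based number of the chosen slice, or None.

    entries is a list of four (flag, type) pairs.
    """
    for second_pass in (False, True):
        for number, (flag, ptype) in enumerate(entries, start=1):
            if ptype != FREEBSD_TYPE:
                continue                      # jne main.3
            if second_pass or flag & ACTIVE:  # jcxz main.5 / jnz main.5
                return number
    return None


if __name__ == "__main__":
    table = [(0x00, 0x83), (0x00, 0xa5), (0x80, 0xa5), (0x00, 0x00)]
    print(find_slice(table))
```
]]></programlisting>
 
     <para>With the sample table in the sketch, the active entry in
       position three is preferred over the inactive one in position
       two.</para>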
 
     <para>If a &os; slice is identified, execution continues at
       <literal>main.5</literal>.  Note that when a &os; slice is found
       <literal>%si</literal> points to the appropriate entry in the
       partition table, and <literal>%dh</literal> holds the partition
       number.  We assume that a &os; slice is found, so we continue
       execution at <literal>main.5</literal>:</para>
 
     <figure xml:id="boot-boot1-main5">
       <title><filename>sys/boot/i386/boot2/boot1.S</filename></title>
 
       <programlisting>main.5:
 	mov %dx,0x900			   # Save args
 	movb $0x10,%dh			   # Sector count
 	callw nread			   # Read disk
 	mov $0x9000,%bx			   # BTX
 	mov 0xa(%bx),%si		   # Get BTX length and set
 	add %bx,%si			   #  %si to start of boot2.bin
 	mov $0xc000,%di			   # Client page 2
 	mov $0xa200,%cx			   # Byte
 	sub %si,%cx			   #  count
 	rep				   # Relocate
 	movsb				   #  client</programlisting>
     </figure>
 
     <para>Recall that at this point, register <literal>%si</literal>
       points to the &os; slice entry in the <acronym>MBR</acronym>
       partition table, so a call to <literal>nread</literal> will
       effectively read sectors at the beginning of this partition.
       The argument passed on register <literal>%dh</literal> tells
       <literal>nread</literal> to read 16 disk sectors.  Recall that
       the first 512 bytes, or the first sector of the &os; slice,
       coincides with the <filename>boot1</filename> program.  Also
       recall that the file written to the beginning of the &os;
       slice is not <filename>/boot/boot1</filename>, but
       <filename>/boot/boot</filename>.  Let us look at the size of
       these files in the filesystem:</para>
 
     <screen xml:id="boot-boot1-filesize">-r--r--r--  1 root  wheel   512B Jan  8 00:15 /boot/boot0
 -r--r--r--  1 root  wheel   512B Jan  8 00:15 /boot/boot1
 -r--r--r--  1 root  wheel   7.5K Jan  8 00:15 /boot/boot2
 -r--r--r--  1 root  wheel   8.0K Jan  8 00:15 /boot/boot</screen>
 
     <para>Both <filename>boot0</filename> and
       <filename>boot1</filename> are 512 bytes each, so they fit
       <emphasis>exactly</emphasis> in one disk sector.
       <filename>boot2</filename> is much bigger, holding both
       the <acronym>BTX</acronym> server and the
       <filename>boot2</filename> client.  Finally, a file called
       simply <filename>boot</filename> is 512 bytes larger than
       <filename>boot2</filename>.  This file is a
       concatenation of <filename>boot1</filename> and
       <filename>boot2</filename>.  As already noted,
       <filename>boot0</filename> is the file written to the absolute
       first disk sector (the <acronym>MBR</acronym>), and
       <filename>boot</filename> is the file written to the beginning
       of the &os; slice; <filename>boot1</filename> and
       <filename>boot2</filename> are <emphasis>not</emphasis> written
       to disk.  The command used to concatenate
       <filename>boot1</filename> and <filename>boot2</filename> into a
       single <filename>boot</filename> is merely
       <command>cat boot1 boot2 &gt; boot</command>.</para>
 
     <para>So <filename>boot1</filename> occupies exactly the first 512
       bytes of <filename>boot</filename> and, because
       <filename>boot</filename> is written to the beginning of the
       &os; slice, <filename>boot1</filename> fits exactly in its
-      first sector.  Because <literal>nread</literal> reads the first
+      first sector.  When <literal>nread</literal> reads the first
       16 sectors of the &os; slice, it effectively reads the entire
       <filename>boot</filename> file
       <footnote>
 	<para>512*16=8192 bytes, exactly the size of
 	  <filename>boot</filename></para></footnote>.
       We will see more details about how <filename>boot</filename> is
       formed from <filename>boot1</filename> and
       <filename>boot2</filename> in the next section.</para>
 
     <para>Recall that <literal>nread</literal> uses memory address
       <literal>0x8c00</literal> as the transfer buffer to hold the
       sectors read.  This address is conveniently chosen.  Indeed,
       because <filename>boot1</filename> belongs to the first 512
       bytes, it ends up in the address range
       <literal>0x8c00</literal>-<literal>0x8dff</literal>.  The 512
       bytes that follow (range
       <literal>0x8e00</literal>-<literal>0x8fff</literal>) are used to
       store the <emphasis>bsdlabel</emphasis>
       <footnote>
 	<para>Historically known as <quote>disklabel</quote>.  If you
 	  ever wondered where &os; stored this information, it is in
 	  this region.  See &man.bsdlabel.8;</para></footnote>.</para>
 
     <para>Starting at address <literal>0x9000</literal> is the
       beginning of the <acronym>BTX</acronym> server, and immediately
       following is the <filename>boot2</filename> client.  The
       <acronym>BTX</acronym> server acts as a kernel, and executes in
       protected mode in the most privileged level.  In contrast, the
       <acronym>BTX</acronym> clients (<filename>boot2</filename>, for
       example), execute in user mode.  We will see how this is
       accomplished in the next section.  The code after the call to
       <literal>nread</literal> locates the beginning of
       <filename>boot2</filename> in the memory buffer, and copies it
       to memory address <literal>0xc000</literal>.  This is because
       the <acronym>BTX</acronym> server arranges
       <filename>boot2</filename> to execute in a segment starting at
       <literal>0xa000</literal>.  We explore this in detail in the
       following section.</para>
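
     <para>The layout of the transfer buffer can be checked with a
       little arithmetic.  The following is only a sketch: the
       function name <literal>buf_addr</literal> is made up for
       illustration, while the buffer address and sector size are the
       ones quoted above.</para>

 ```c
 #include <assert.h>
 #include <stdint.h>

 #define LOAD_BUF	0x8c00u	/* nread transfer buffer (from the text) */
 #define SECTOR		512u	/* bytes per disk sector */

 /*
  * Memory address at which byte `off' of the boot file lands.
  * buf_addr is a hypothetical helper, not part of boot1.
  */
 uint32_t
 buf_addr(uint32_t off)
 {
 	return (LOAD_BUF + off);
 }
 ```

     <para>With these values, sector 0 (<filename>boot1</filename>)
       starts at <literal>0x8c00</literal>, sector 1 (the bsdlabel) at
       <literal>0x8e00</literal>, and sector 2 (the start of the
       <acronym>BTX</acronym> server) at
       <literal>0x9000</literal>.</para>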
 
     <para>The last code block of <filename>boot1</filename> enables
       access to memory above 1MB
       <footnote>
 	<para>This is necessary for legacy reasons.  Interested
 	  readers should see <link
 	    xlink:href="http://en.wikipedia.org/wiki/A20_line"/>.</para></footnote>
 	and concludes with a jump to the starting point of the
       <acronym>BTX</acronym> server:</para>
 
     <figure xml:id="boot-boot1-seta20">
       <title><filename>sys/boot/i386/boot2/boot1.S</filename></title>
 
       <programlisting>seta20:
 	cli			# Disable interrupts
 seta20.1:
 	dec %cx			# Timeout?
 	jz seta20.3		# Yes
 
 	inb $0x64,%al		# Get status
 	testb $0x2,%al		# Busy?
 	jnz seta20.1		# Yes
 	movb $0xd1,%al		# Command: Write
 	outb %al,$0x64		#  output port
 seta20.2:
 	inb $0x64,%al		# Get status
 	testb $0x2,%al		# Busy?
 	jnz seta20.2		# Yes
 	movb $0xdf,%al		# Enable
 	outb %al,$0x60		#  A20
 seta20.3:
 	sti			# Enable interrupts
 	jmp 0x9010		# Start BTX</programlisting>
     </figure>
 
     <para>Note that right before the jump, interrupts are
       enabled.</para>
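
     <para>To see why the A20 line matters, consider a simplified
       model in which a disabled A20 gate forces address bit 20 to
       zero.  The sketch below is not taken from the &os; sources;
       <literal>bus_addr</literal> is a hypothetical name used only
       for illustration.</para>

 ```c
 #include <assert.h>
 #include <stdint.h>

 #define A20_BIT	(1u << 20)	/* address line 20 */

 /*
  * Address that reaches the memory bus for a given CPU address.
  * Simplified model: with the gate disabled, bit 20 is masked off,
  * so addresses at or just above 1MB wrap around to low memory.
  */
 uint32_t
 bus_addr(uint32_t addr, int a20_enabled)
 {
 	return (a20_enabled ? addr : (addr & ~A20_BIT));
 }
 ```

     <para>In this model, address <literal>0x100000</literal> aliases
       address <literal>0</literal> until the gate is enabled, which
       is why <literal>seta20</literal> runs before any code relies on
       memory above 1MB.</para>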
   </sect1>
 
   <sect1 xml:id="btx-server">
     <title>The <acronym>BTX</acronym> Server</title>
 
     <para>Next in our boot sequence is the
       <acronym>BTX</acronym> Server.  Let us quickly remember how we
       got here:</para>
 
     <itemizedlist>
       <listitem>
 	<para>The <acronym>BIOS</acronym> loads the absolute sector
 	  one (the <acronym>MBR</acronym>, or
 	  <filename>boot0</filename>), to address
 	  <literal>0x7c00</literal> and jumps there.</para>
       </listitem>
 
       <listitem>
 	<para><filename>boot0</filename> relocates itself to
 	  <literal>0x600</literal>, the address it was linked to
 	  execute, and jumps over there.  It then reads the first
 	  sector of the &os; slice (which consists of
 	  <filename>boot1</filename>) into address
 	  <literal>0x7c00</literal> and jumps over there.</para>
       </listitem>
 
       <listitem>
 	<para><filename>boot1</filename> loads the first 16 sectors
 	  of the &os; slice into address <literal>0x8c00</literal>.
 	  These 16 sectors, or 8192 bytes, are the whole file
 	  <filename>boot</filename>.  The file is a
 	  concatenation of <filename>boot1</filename> and
 	  <filename>boot2</filename>.  <filename>boot2</filename>, in
 	  turn, contains the <acronym>BTX</acronym> server and the
 	  <filename>boot2</filename> client.  Finally, a jump is made
 	  to address <literal>0x9010</literal>, the entry point of the
 	  <acronym>BTX</acronym> server.</para>
       </listitem>
     </itemizedlist>
 
     <para>Before studying the <acronym>BTX</acronym> Server in detail,
       let us further review how the single, all-in-one
       <filename>boot</filename> file is created.  The way
       <filename>boot</filename> is built is defined in its
       <filename>Makefile</filename>
       (<filename>/usr/src/sys/boot/i386/boot2/Makefile</filename>).
       Let us look at the rule that creates the
       <filename>boot</filename> file:</para>
 
     <figure xml:id="boot-boot1-make-boot">
       <title><filename>sys/boot/i386/boot2/Makefile</filename></title>
 
       <programlisting>      boot: boot1 boot2
 	cat boot1 boot2 > boot</programlisting>
     </figure>
 
     <para>This tells us that <filename>boot1</filename> and
       <filename>boot2</filename> are needed, and the rule simply
       concatenates them to produce a single file called
       <filename>boot</filename>.  The rules for creating
       <filename>boot1</filename> are also quite simple:</para>
 
     <figure xml:id="boot-boot1-make-boot1">
       <title><filename>sys/boot/i386/boot2/Makefile</filename></title>
 
       <programlisting>      boot1: boot1.out
 	objcopy -S -O binary boot1.out boot1
 
       boot1.out: boot1.o
 	ld -e start -Ttext 0x7c00 -o boot1.out boot1.o</programlisting>
     </figure>
 
     <para>To apply the rule for creating
       <filename>boot1</filename>, <filename>boot1.out</filename> must
       be resolved.  This, in turn, depends on the existence of
       <filename>boot1.o</filename>.  This last file is simply the
       result of assembling our familiar <filename>boot1.S</filename>,
       without linking.  Now, the rule for creating
       <filename>boot1.out</filename> is applied.  This tells us that
       <filename>boot1.o</filename> should be linked with
       <literal>start</literal> as its entry point, and starting at
       address <literal>0x7c00</literal>.  Finally,
       <filename>boot1</filename> is created from
       <filename>boot1.out</filename> applying the appropriate rule.
       This rule is the <filename>objcopy</filename> command applied to
       <filename>boot1.out</filename>.  Note the flags passed to
       <filename>objcopy</filename>: <literal>-S</literal> tells it to
       strip all relocation and symbolic information;
       <literal>-O binary</literal> indicates the output format, that
       is, a simple, unformatted binary file.</para>
 
     <para>Having <filename>boot1</filename>, let us take a look at how
       <filename>boot2</filename> is constructed:</para>
 
     <figure xml:id="boot-boot1-make-boot2">
       <title><filename>sys/boot/i386/boot2/Makefile</filename></title>
 
       <programlisting>      boot2: boot2.ld
 	@set -- `ls -l boot2.ld`; x=$$((7680-$$5)); \
 	    echo "$$x bytes available"; test $$x -ge 0
 	dd if=boot2.ld of=boot2 obs=7680 conv=osync
 
       boot2.ld: boot2.ldr boot2.bin ../btx/btx/btx
 	btxld -v -E 0x2000 -f bin -b ../btx/btx/btx -l boot2.ldr \
 	    -o boot2.ld -P 1 boot2.bin
 
       boot2.ldr:
 	dd if=/dev/zero of=boot2.ldr bs=512 count=1
 
       boot2.bin: boot2.out
 	objcopy -S -O binary boot2.out boot2.bin
 
       boot2.out: ../btx/lib/crt0.o boot2.o sio.o
 	ld -Ttext 0x2000 -o boot2.out
 
       boot2.o: boot2.s
 	${CC} ${ACFLAGS} -c boot2.s
 
       boot2.s: boot2.c boot2.h ${.CURDIR}/../../common/ufsread.c
 	${CC} ${CFLAGS} -S -o boot2.s.tmp ${.CURDIR}/boot2.c
 	sed -e '/align/d' -e '/nop/d' "MISSING" boot2.s.tmp > boot2.s
 	rm -f boot2.s.tmp
 
       boot2.h: boot1.out
 	${NM} -t d ${.ALLSRC} | awk '/([0-9])+ T xread/ \
 	    { x = $$1 - ORG1; \
 	    printf("#define XREADORG %#x\n", REL1 + x) }' \
 	    ORG1=`printf "%d" ${ORG1}` \
 	    REL1=`printf "%d" ${REL1}` > ${.TARGET}</programlisting>
     </figure>
 
     <para>The mechanism for building <filename>boot2</filename> is
       far more elaborate.  Let us point out the most relevant facts.
       The dependency list is as follows:</para>
 
     <figure xml:id="boot-boot1-make-boot2-more">
       <title><filename>sys/boot/i386/boot2/Makefile</filename></title>
 
       <programlisting>      boot2: boot2.ld
       boot2.ld: boot2.ldr boot2.bin ${BTXDIR}/btx/btx
       boot2.bin: boot2.out
       boot2.out: ${BTXDIR}/lib/crt0.o boot2.o sio.o
       boot2.o: boot2.s
       boot2.s: boot2.c boot2.h ${.CURDIR}/../../common/ufsread.c
       boot2.h: boot1.out</programlisting>
     </figure>
 
     <para>Note that initially there is no header file
       <filename>boot2.h</filename>, but its creation depends on
       <filename>boot1.out</filename>, which we already have.  The rule
       for its creation is a bit terse, but the important thing is that
       the output, <filename>boot2.h</filename>, is something like
       this:</para>
 
     <figure xml:id="boot-boot1-make-boot2h">
       <title><filename>sys/boot/i386/boot2/boot2.h</filename></title>
 
       <programlisting>#define XREADORG 0x725</programlisting>
     </figure>
 
     <para>Recall that <filename>boot1</filename> was relocated (i.e.,
       copied from <literal>0x7c00</literal> to
       <literal>0x700</literal>).  This relocation will now make sense,
       because as we will see, the <acronym>BTX</acronym> server
       reclaims some memory, including the space where
       <filename>boot1</filename> was originally loaded.  However, the
       <acronym>BTX</acronym> server needs access to
       <filename>boot1</filename>'s <literal>xread</literal> function;
       this function, according to the output of
       <filename>boot2.h</filename>, is at location
       <literal>0x725</literal>.  Indeed, the
       <acronym>BTX</acronym> server uses the
       <literal>xread</literal> function from
       <filename>boot1</filename>'s relocated code.  This function is
       now accessible from within the <filename>boot2</filename>
       client.</para>
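
     <para>The computation performed by the
       <filename>boot2.h</filename> rule can be sketched in C.  The
       values of <literal>ORG1</literal> and <literal>REL1</literal>
       are assumed here to be the link and relocation addresses
       mentioned in the text, and the linked address
       <literal>0x7c25</literal> for <literal>xread</literal> is
       merely inferred from the sample output above.</para>

 ```c
 #include <assert.h>
 #include <stdint.h>

 /* Assumed values of the Makefile variables of the same names. */
 #define ORG1	0x7c00u		/* address boot1 is linked at */
 #define REL1	0x700u		/* address boot1 relocates itself to */

 /*
  * Address of a boot1 symbol inside the relocated copy.
  * relocated is a hypothetical helper mirroring the awk expression
  * "REL1 + ($1 - ORG1)".
  */
 uint32_t
 relocated(uint32_t linked_addr)
 {
 	return (REL1 + (linked_addr - ORG1));
 }
 ```

     <para>Under these assumptions,
       <literal>relocated(0x7c25)</literal> yields
       <literal>0x725</literal>, the value of
       <literal>XREADORG</literal> shown earlier.</para>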
 
     <para>We next build <filename>boot2.s</filename> from files
       <filename>boot2.h</filename>, <filename>boot2.c</filename> and
       <filename>/usr/src/sys/boot/common/ufsread.c</filename>.  The
       rule for this is to compile the code in
       <filename>boot2.c</filename> (which includes
       <filename>boot2.h</filename> and <filename>ufsread.c</filename>)
       into assembly code.  Having <filename>boot2.s</filename>, the
       next rule assembles <filename>boot2.s</filename>, creating the
       object file <filename>boot2.o</filename>.  The
       next rule directs the linker to link various files
       (<filename>crt0.o</filename>,
       <filename>boot2.o</filename> and <filename>sio.o</filename>).
       Note that the output file, <filename>boot2.out</filename>, is
       linked to execute at address <literal>0x2000</literal>.  Recall
       that <filename>boot2</filename> will be executed in user mode,
       within a special user segment set up by the
       <acronym>BTX</acronym> server.  This segment starts at
       <literal>0xa000</literal>.  Also, remember that the
       <filename>boot2</filename> portion of <filename>boot</filename>
       was copied to address <literal>0xc000</literal>, that is, offset
       <literal>0x2000</literal> from the start of the user segment, so
       <filename>boot2</filename> will work properly when we transfer
       control to it.  Next, <filename>boot2.bin</filename> is created
       from <filename>boot2.out</filename> by stripping its symbols and
       format information; <filename>boot2.bin</filename> is a
       <emphasis>raw</emphasis> binary.  Now, note that a file
       <filename>boot2.ldr</filename> is
       created as a 512-byte file full of zeros.  This space is
       reserved for the bsdlabel.</para>
 
     <para>Now that we have files <filename>boot1</filename>,
       <filename>boot2.bin</filename> and
       <filename>boot2.ldr</filename>, only the
       <acronym>BTX</acronym> server is missing before creating the
       all-in-one <filename>boot</filename> file.  The
       <acronym>BTX</acronym> server is located in
       <filename>/usr/src/sys/boot/i386/btx/btx</filename>; it has its
       own <filename>Makefile</filename> with its own set of rules for
       building.  The important thing to notice is that it is also
       compiled as a <emphasis>raw</emphasis> binary, and that it is
       linked to execute at address <literal>0x9000</literal>.  The
       details can be found in
       <filename>/usr/src/sys/boot/i386/btx/btx/Makefile</filename>.</para>
 
     <para>Having the files that comprise the <filename>boot</filename>
       program, the final step is to <emphasis>merge</emphasis> them.
       This is done by a special program called
       <filename>btxld</filename> (source located in
       <filename>/usr/src/usr.sbin/btxld</filename>).  Some arguments
       to this program include the name of the output file
       (<filename>boot</filename>), its entry point
       (<literal>0x2000</literal>) and its file format
       (raw binary).  The various files are
       finally merged by this utility into the file
       <filename>boot</filename>, which consists of
       <filename>boot1</filename>, <filename>boot2</filename>, the
       <literal>bsdlabel</literal> and the
       <acronym>BTX</acronym> server.  This file, which takes
       exactly 16 sectors, or 8192 bytes, is what is
       actually written to the beginning of the &os; slice
       during installation.  Let us now proceed to study the
       <acronym>BTX</acronym> server program.</para>
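
     <para>The size budget enforced by the <filename>boot2</filename>
       rule in the <filename>Makefile</filename> can be restated as a
       short C sketch.  The function names are invented for
       illustration; the constants are the ones that appear in the
       <filename>Makefile</filename> and in the file listing.</para>

 ```c
 #include <assert.h>

 #define SECTOR		512L	/* bytes per sector */
 #define BOOT1_SIZE	SECTOR	/* boot1 is exactly one sector */
 #define BOOT2_SIZE	7680L	/* boot2 is padded to this size by dd */

 /* Free bytes left in boot2; negative means boot2.ld is too big. */
 long
 bytes_available(long boot2_ld_size)
 {
 	return (BOOT2_SIZE - boot2_ld_size);
 }

 /* Size of the concatenation of boot1 and the padded boot2. */
 long
 boot_size(void)
 {
 	return (BOOT1_SIZE + BOOT2_SIZE);
 }
 ```

     <para>The result of <literal>boot_size()</literal> is 8192
       bytes, or 16 sectors, which matches the number of sectors read
       by <literal>nread</literal>.</para>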
 
     <para>The <acronym>BTX</acronym> server prepares a simple
       environment and switches from 16-bit real mode to 32-bit
       protected mode, right before passing control to the client.
       This includes initializing and updating the following data
       structures:</para>
 
     <indexterm><primary>virtual v86 mode</primary></indexterm>
     <itemizedlist>
       <listitem>
 	<para>Modifies the
 	  <literal>Interrupt Vector Table (IVT)</literal>.  The
 	  <acronym>IVT</acronym> provides exception and interrupt
 	  handlers for Real-Mode code.</para>
       </listitem>
 
       <listitem>
 	<para>The <literal>Interrupt Descriptor Table (IDT)</literal>
 	  is created.  Entries are provided for processor exceptions,
 	  hardware interrupts, two system calls and V86 interface.
 	  The IDT provides exception and interrupt handlers for
 	  Protected-Mode code.</para>
       </listitem>
 
       <listitem>
 	<para>A <literal>Task-State Segment (TSS)</literal> is
 	  created.  This is necessary because the processor works in
 	  the <emphasis>least</emphasis> privileged level when
 	  executing the client (<filename>boot2</filename>), but in
 	  the <emphasis>most</emphasis> privileged level when
 	  executing the <acronym>BTX</acronym> server.</para>
       </listitem>
 
       <listitem>
 	<para>The <acronym>GDT</acronym> (Global Descriptor Table) is
 	  set up.  Entries (descriptors) are provided for
 	  supervisor code and data, user code and data, and real-mode
 	  code and data.
 	  <footnote>
 	    <para>Real-mode code and data are necessary when switching
 	      back to real mode from protected mode, as suggested by
 	      the Intel manuals.</para></footnote></para>
       </listitem>
     </itemizedlist>
 
     <para>Let us now start studying the actual implementation.  Recall
       that <filename>boot1</filename> made a jump to address
       <literal>0x9010</literal>, the <acronym>BTX</acronym> server's
       entry point.  Before studying program execution there,
       note that the <acronym>BTX</acronym> server has a special header
       at address range <literal>0x9000-0x900f</literal>, right before
       its entry point.  This header is defined as follows:</para>
 
     <figure xml:id="btx-header">
       <title><filename>sys/boot/i386/btx/btx/btx.S</filename></title>
 
       <programlisting>start:						# Start of code
 /*
  * BTX header.
  */
 btx_hdr:	.byte 0xeb			# Machine ID
 		.byte 0xe			# Header size
 		.ascii "BTX"			# Magic
 		.byte 0x1			# Major version
 		.byte 0x2			# Minor version
 		.byte BTX_FLAGS			# Flags
 		.word PAG_CNT-MEM_ORG>>0xc	# Paging control
 		.word break-start		# Text size
 		.long 0x0			# Entry address</programlisting>
     </figure>
 
     <para>Note the first two bytes are <literal>0xeb</literal> and
       <literal>0xe</literal>.  In the IA-32 architecture, these two
       bytes are interpreted as a relative jump past the header into
       the entry point, so in theory, <filename>boot1</filename> could
       jump here (address <literal>0x9000</literal>) instead of address
       <literal>0x9010</literal>.  Note that the last field in the
       <acronym>BTX</acronym> header is a pointer to the client's
       (<filename>boot2</filename>) entry point.  This field is patched
       at link time.</para>
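
     <para>As a rough illustration, the header can be described by a
       C structure.  The field names below are made up; only the
       sizes and order follow the assembly listing.  The helper
       <literal>short_jmp_target</literal> is also hypothetical and
       models how the processor decodes a two-byte short jump.</para>

 ```c
 #include <assert.h>
 #include <stddef.h>
 #include <stdint.h>

 /* Illustrative view of the 16-byte BTX header. */
 struct btx_hdr {
 	uint8_t		machid;		/* 0xeb: opcode of a short jump */
 	uint8_t		hdrsz;		/* 0x0e: also the jump displacement */
 	uint8_t		magic[3];	/* "BTX" */
 	uint8_t		majver;		/* major version */
 	uint8_t		minver;		/* minor version */
 	uint8_t		flags;		/* BTX_FLAGS */
 	uint16_t	pagctl;		/* paging control */
 	uint16_t	textsz;		/* text size */
 	uint32_t	entry;		/* client entry point */
 };

 /* Target of a two-byte "jmp rel8" instruction located at addr. */
 uint32_t
 short_jmp_target(uint32_t addr, int8_t rel8)
 {
 	return (addr + 2 + rel8);
 }
 ```

     <para>On common ABIs this structure is 16 bytes long with
       <literal>entry</literal> at offset <literal>0xc</literal>, and
       a short jump at <literal>0x9000</literal> with displacement
       <literal>0xe</literal> lands at
       <literal>0x9010</literal>.</para>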
 
     <para>Immediately following the header is the
       <acronym>BTX</acronym> server's entry point:</para>
 
     <figure xml:id="btx-init">
       <title><filename>sys/boot/i386/btx/btx/btx.S</filename></title>
 
       <programlisting>/*
  * Initialization routine.
  */
 init:		cli				# Disable interrupts
 		xor %ax,%ax			# Zero/segment
 		mov %ax,%ss			# Set up
 		mov $0x1800,%sp		#  stack
 		mov %ax,%es			# Address
 		mov %ax,%ds			#  data
 		pushl $0x2			# Clear
 		popfl				#  flags</programlisting>
     </figure>
 
     <para>This code disables interrupts, sets up a working stack
       (starting at address <literal>0x1800</literal>) and clears the
       flags in the EFLAGS register.  Note that the
       <literal>popfl</literal> instruction pops out a doubleword (4
       bytes) from the stack and places it in the EFLAGS register.
-      Because the value actually popped is <literal>2</literal>, the
+      As the value actually popped is <literal>2</literal>, the
       EFLAGS register is effectively cleared (IA-32 requires that bit
       1 of the EFLAGS register always be 1).</para>
 
     <para>Our next code block clears (sets to <literal>0</literal>)
       the memory range <literal>0x5e00-0x8fff</literal>.  This range
       is where the various data structures will be created:</para>
 
     <figure xml:id="btx-clear-mem">
       <title><filename>sys/boot/i386/btx/btx/btx.S</filename></title>
 
       <programlisting>/*
  * Initialize memory.
  */
 		mov $0x5e00,%di		# Memory to initialize
 		mov $(0x9000-0x5e00)/2,%cx	# Words to zero
 		rep				# Zero-fill
 		stosw				#  memory</programlisting>
     </figure>
 
     <para>Recall that <filename>boot1</filename> was originally loaded
       to address <literal>0x7c00</literal>, so, with this memory
       initialization, that copy effectively disappeared.  However,
       also recall that <filename>boot1</filename> was relocated to
       <literal>0x700</literal>, so <emphasis>that</emphasis> copy is
       still in memory, and the <acronym>BTX</acronym> server will make
       use of it.</para>
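
     <para>The effect of the zero-fill can be verified with a small
       sketch.  The helper names are hypothetical; the range limits
       are the operands used in the listing above.</para>

 ```c
 #include <assert.h>
 #include <stdint.h>

 #define CLR_START	0x5e00u	/* first address zeroed */
 #define CLR_END	0x9000u	/* first address NOT zeroed */

 /* Number of 16-bit words written by "rep stosw". */
 uint32_t
 words_cleared(void)
 {
 	return ((CLR_END - CLR_START) / 2);
 }

 /* Nonzero if addr lies inside the zeroed range. */
 int
 is_cleared(uint32_t addr)
 {
 	return (addr >= CLR_START && addr < CLR_END);
 }
 ```

     <para>Here <literal>is_cleared(0x7c00)</literal> is true, while
       <literal>is_cleared(0x700)</literal> and
       <literal>is_cleared(0x9000)</literal> are false: the original
       copy of <filename>boot1</filename> is wiped, but the relocated
       copy and the <acronym>BTX</acronym> server itself are
       not.</para>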
 
     <para>Next, the real-mode <acronym>IVT</acronym> (Interrupt Vector
       Table) is updated.  The <acronym>IVT</acronym> is an array of
       segment/offset pairs for exception and interrupt handlers.  The
       <acronym>BIOS</acronym> normally maps hardware interrupts to
       interrupt vectors <literal>0x8</literal> to
       <literal>0xf</literal> and <literal>0x70</literal> to
       <literal>0x77</literal> but, as will be seen, the 8259A
       Programmable Interrupt Controller, the chip controlling the
       actual mapping of hardware interrupts to interrupt vectors, is
       programmed to remap these interrupt vectors from
       <literal>0x8-0xf</literal> to <literal>0x20-0x27</literal> and
       from <literal>0x70-0x77</literal> to
       <literal>0x28-0x2f</literal>.  Thus, interrupt handlers are
       provided for interrupt vectors <literal>0x20-0x2f</literal>.
       The <acronym>BIOS</acronym>-provided handlers are not used
       directly because they work in 16-bit real mode, but not in
       32-bit protected mode.  Processor mode will be switched to
       32-bit protected mode shortly.  However, the
       <acronym>BTX</acronym> server sets up a mechanism to effectively
       use the handlers provided by the <acronym>BIOS</acronym>:</para>
 
     <figure xml:id="btx-ivt">
       <title><filename>sys/boot/i386/btx/btx/btx.S</filename></title>
 
       <programlisting>/*
  * Update real mode IDT for reflecting hardware interrupts.
  */
 		mov $intr20,%bx			# Address first handler
 		mov $0x10,%cx			# Number of handlers
 		mov $0x20*4,%di			# First real mode IDT entry
 init.0:		mov %bx,(%di)			# Store IP
 		inc %di				# Address next
 		inc %di				#  entry
 		stosw				# Store CS
 		add $4,%bx			# Next handler
 		loop init.0			# Next IRQ</programlisting>
     </figure>
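
     <para>The address arithmetic in this loop follows from the
       real-mode <acronym>IVT</acronym> layout, in which every vector
       occupies four bytes: a 16-bit offset followed by a 16-bit
       segment.  A minimal sketch, using made-up names, might look
       like this:</para>

 ```c
 #include <assert.h>
 #include <stdint.h>

 /* One real-mode IVT entry: offset (IP) first, then segment (CS). */
 struct ivt_entry {
 	uint16_t	off;
 	uint16_t	seg;
 };

 /* Memory address of the IVT entry for interrupt vector vec. */
 uint32_t
 ivt_entry_addr(uint8_t vec)
 {
 	return ((uint32_t)vec * 4);
 }

 /* Linear address of a real-mode segment:offset pair. */
 uint32_t
 real_mode_linear(uint16_t seg, uint16_t off)
 {
 	return (((uint32_t)seg << 4) + off);
 }
 ```

     <para>The entry for vector <literal>0x20</literal> therefore
       lives at address <literal>0x80</literal>, which is the value
       <literal>0x20*4</literal> loaded into <literal>%di</literal>,
       and the sixteenth entry (vector <literal>0x2f</literal>) lives
       at <literal>0xbc</literal>.</para>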
 
     <para>The next block creates the <acronym>IDT</acronym> (Interrupt
       Descriptor Table).  The <acronym>IDT</acronym> is analogous, in
       protected mode, to the <acronym>IVT</acronym> in real mode.
       That is, the <acronym>IDT</acronym> describes the various
       exception and interrupt handlers used when the processor is
       executing in protected mode.  In essence, it also consists of an
       array of segment/offset pairs, although the structure is
       somewhat more complex, because segments in protected mode are
       different than in real mode, and various protection mechanisms
       apply:</para>
 
     <figure xml:id="btx-idt">
       <title><filename>sys/boot/i386/btx/btx/btx.S</filename></title>
 
       <programlisting>/*
  * Create IDT.
  */
 		mov $0x5e00,%di			# IDT's address
 		mov $idtctl,%si			# Control string
 init.1:		lodsb				# Get entry
 		cbw				#  count
 		xchg %ax,%cx			#  as word
 		jcxz init.4			# If done
 		lodsb				# Get segment
 		xchg %ax,%dx			#  P:DPL:type
 		lodsw				# Get control
 		xchg %ax,%bx			#  set
 		lodsw				# Get handler offset
 		mov $SEL_SCODE,%dh		# Segment selector
 init.2:		shr %bx				# Handle this int?
 		jnc init.3			# No
 		mov %ax,(%di)			# Set handler offset
 		mov %dh,0x2(%di)		#  and selector
 		mov %dl,0x5(%di)		# Set P:DPL:type
 		add $0x4,%ax			# Next handler
 init.3:		lea 0x8(%di),%di		# Next entry
 		loop init.2			# Till set done
 		jmp init.1			# Continue</programlisting>
     </figure>
 
     <para>Each entry in the <literal>IDT</literal> is 8 bytes long.
       Besides the segment/offset information, they also describe the
       segment type, privilege level, and whether the segment is
       present in memory or not.  The construction is such that
       interrupt vectors from <literal>0</literal> to
       <literal>0xf</literal> (exceptions) are handled by function
       <literal>intx00</literal>; vector <literal>0x10</literal> (also
       an exception) is handled by <literal>intx10</literal>; hardware
       interrupts, which are later configured to start at interrupt
       vector <literal>0x20</literal> all the way to interrupt vector
       <literal>0x2f</literal>, are handled by function
       <literal>intx20</literal>.  Lastly, interrupt vector
       <literal>0x30</literal>, which is used for system calls, is
       handled by <literal>intx30</literal>, and vectors
       <literal>0x31</literal> and <literal>0x32</literal> are handled
       by <literal>intx31</literal>.  It must be noted that only
       descriptors for interrupt vectors <literal>0x30</literal>,
       <literal>0x31</literal> and <literal>0x32</literal> are given
       privilege level 3, the same privilege level as the
       <filename>boot2</filename> client, which means the client can
       execute a software-generated interrupt to these vectors through
       the <literal>int</literal> instruction without failing (this is
       the way <filename>boot2</filename> uses the services provided by
       the <acronym>BTX</acronym> server).  Also, note that
       <emphasis>only</emphasis> software-generated interrupts are
       protected from code executing in lesser privilege levels.
       Hardware-generated interrupts and processor-generated exceptions
       are <emphasis>always</emphasis> handled adequately, regardless
       of the actual privileges involved.</para>
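
     <para>The privilege check applied to software interrupts can be
       sketched as follows.  The macro and function names are
       hypothetical, and the attribute bytes <literal>0x8e</literal>
       and <literal>0xee</literal> are used only as typical IA-32
       encodings of a present 32-bit interrupt gate with DPL 0 and
       DPL 3; they are not quoted from the listing above.</para>

 ```c
 #include <assert.h>
 #include <stdint.h>

 /* Fields of the P:DPL:type byte at offset 5 of an IDT entry. */
 #define GATE_P(b)	(((b) >> 7) & 1)	/* present bit */
 #define GATE_DPL(b)	(((b) >> 5) & 3)	/* descriptor privilege level */

 /*
  * Nonzero if code running at privilege level cpl may reach the
  * gate with a software "int" instruction (requires CPL <= DPL).
  */
 int
 int_allowed(unsigned cpl, uint8_t attr)
 {
 	return (cpl <= (unsigned)GATE_DPL(attr));
 }
 ```

     <para>With these encodings, a client at privilege level 3 passes
       the check for a DPL 3 gate (<literal>0xee</literal>) but not
       for a DPL 0 gate (<literal>0x8e</literal>).</para>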
 
     <para>The next step is to initialize the <acronym>TSS</acronym>
       (Task-State Segment).  The <acronym>TSS</acronym> is a hardware
       feature that helps the operating system or executive software
       implement multitasking functionality through process
       abstraction.  The IA-32 architecture demands the creation and
       use of <emphasis>at least</emphasis> one <acronym>TSS</acronym>
       if multitasking facilities are used or different privilege
-      levels are defined.  Because the <filename>boot2</filename>
+      levels are defined.  Since the <filename>boot2</filename>
       client is executed in privilege level 3, but the
       <acronym>BTX</acronym> server runs in privilege level 0, a
       <acronym>TSS</acronym> must be defined:</para>
 
     <figure xml:id="btx-tss">
       <title><filename>sys/boot/i386/btx/btx/btx.S</filename></title>
 
       <programlisting>/*
  * Initialize TSS.
  */
 init.4:		movb $_ESP0H,TSS_ESP0+1(%di)	# Set ESP0
 		movb $SEL_SDATA,TSS_SS0(%di)	# Set SS0
 		movb $_TSSIO,TSS_MAP(%di)	# Set I/O bit map base</programlisting>
     </figure>
 
     <para>Note that a value is given for the Privilege Level 0 stack
       pointer and stack segment in the <acronym>TSS</acronym>.  This
       is needed because, if an interrupt or exception is received
       while executing <filename>boot2</filename> in Privilege Level 3,
       a change to Privilege Level 0 is automatically performed by the
       processor, so a new working stack is needed.  Finally, the I/O
       Map Base Address field of the <acronym>TSS</acronym> is given a
       value, which is a 16-bit offset from the beginning of the
       <acronym>TSS</acronym> to the I/O Permission Bitmap and the
       Interrupt Redirection Bitmap.</para>
 
     <para>After the <acronym>IDT</acronym> and <acronym>TSS</acronym>
       are created, the processor is ready to switch to protected mode.
       This is done in the next block:</para>
 
     <figure xml:id="btx-prot">
       <title><filename>sys/boot/i386/btx/btx/btx.S</filename></title>
 
       <programlisting>/*
  * Bring up the system.
  */
 		mov $0x2820,%bx			# Set protected mode
 		callw setpic			#  IRQ offsets
 		lidt idtdesc			# Set IDT
 		lgdt gdtdesc			# Set GDT
 		mov %cr0,%eax			# Switch to protected
 		inc %ax				#  mode
 		mov %eax,%cr0			#
 		ljmp $SEL_SCODE,$init.8		# To 32-bit code
 		.code32
 init.8:		xorl %ecx,%ecx			# Zero
 		movb $SEL_SDATA,%cl		# To 32-bit
 		movw %cx,%ss			#  stack</programlisting>
     </figure>
 
     <para>First, a call is made to <literal>setpic</literal> to
       program the 8259A <acronym>PIC</acronym> (Programmable Interrupt
       Controller).  This chip is connected to multiple hardware
       interrupt sources.  Upon receiving an interrupt from a device,
       it signals the processor with the appropriate interrupt vector.
       This can be customized so that specific interrupts are
       associated with specific interrupt vectors, as explained before.
       Next, the <acronym>IDTR</acronym> (Interrupt Descriptor Table
       Register) and <acronym>GDTR</acronym> (Global Descriptor Table
       Register) are loaded with the instructions
       <literal>lidt</literal> and <literal>lgdt</literal>,
       respectively.  These registers are loaded with the base address
       and limit address for the <acronym>IDT</acronym> and
       <acronym>GDT</acronym>.  The following three instructions set
       the Protection Enable (PE) bit of the <literal>%cr0</literal>
       register.  This effectively switches the processor to 32-bit
       protected mode.  Next, a long jump is made to
       <literal>init.8</literal> using segment selector SEL_SCODE,
       which selects the Supervisor Code Segment.  The processor is
       effectively executing in CPL 0, the most privileged level, after
       this jump.  Finally, the Supervisor Data Segment is selected for
       the stack by assigning the segment selector SEL_SDATA to the
       <literal>%ss</literal> register.  This data segment also has a
       privilege level of <literal>0</literal>.</para>
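
     <para>Two details of this block lend themselves to a short
       sketch: the interrupt vector bases passed to
       <literal>setpic</literal> and the setting of the PE bit.  The
       code below assumes that the low byte of <literal>%bx</literal>
       is the base for IRQs 0-7 and the high byte the base for IRQs
       8-15, which agrees with the remapping described earlier; the
       function names are invented.</para>

 ```c
 #include <assert.h>
 #include <stdint.h>

 #define PIC_BASES	0x2820u	/* value loaded into %bx before setpic */
 #define CR0_PE		0x1u	/* Protection Enable bit of %cr0 */

 /* Interrupt vector delivered for hardware IRQ line irq (0-15). */
 uint8_t
 irq_to_vector(unsigned irq)
 {
 	uint8_t master = PIC_BASES & 0xff;	/* assumed: %bl, IRQ 0-7 */
 	uint8_t slave = PIC_BASES >> 8;		/* assumed: %bh, IRQ 8-15 */

 	return (irq < 8 ? master + irq : slave + (irq - 8));
 }

 /* Value of %cr0 with protected mode enabled. */
 uint32_t
 enable_pe(uint32_t cr0)
 {
 	return (cr0 | CR0_PE);
 }
 ```

     <para>Note that the assembly uses <literal>inc</literal> rather
       than a bitwise OR; the two are equivalent only because the PE
       bit is known to be clear while the processor is still in real
       mode.</para>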
 
     <para>Our last code block is responsible for loading the
       <acronym>TR</acronym> (Task Register) with the segment selector
       for the <acronym>TSS</acronym> we created earlier, and setting
       the User Mode environment before passing execution control to
       the <filename>boot2</filename> client.</para>
 
     <figure xml:id="btx-end">
       <title><filename>sys/boot/i386/btx/btx/btx.S</filename></title>
 
       <programlisting>/*
  * Launch user task.
  */
 		movb $SEL_TSS,%cl		# Set task
 		ltr %cx				#  register
 		movl $0xa000,%edx		# User base address
 		movzwl %ss:BDA_MEM,%eax		# Get free memory
 		shll $0xa,%eax			# To bytes
 		subl $ARGSPACE,%eax		# Less arg space
 		subl %edx,%eax			# Less base
 		movb $SEL_UDATA,%cl		# User data selector
 		pushl %ecx			# Set SS
 		pushl %eax			# Set ESP
 		push $0x202			# Set flags (IF set)
 		push $SEL_UCODE			# Set CS
 		pushl btx_hdr+0xc		# Set EIP
 		pushl %ecx			# Set GS
 		pushl %ecx			# Set FS
 		pushl %ecx			# Set DS
 		pushl %ecx			# Set ES
 		pushl %edx			# Set EAX
 		movb $0x7,%cl			# Set remaining
 init.9:		push $0x0			#  general
 		loop init.9			#  registers
 		popa				#  and initialize
 		popl %es			# Initialize
 		popl %ds			#  user
 		popl %fs			#  segment
 		popl %gs			#  registers
 		iret				# To user mode</programlisting>
     </figure>
 
     <para>Note that the client's environment includes a stack segment
       selector and stack pointer (registers <literal>%ss</literal> and
       <literal>%esp</literal>).  Indeed, once the
       <acronym>TR</acronym> is loaded with the
       <acronym>TSS</acronym> segment selector (instruction
       <literal>ltr</literal>), the stack
       pointer is calculated and pushed onto the stack along with the
       stack's segment selector.  Next, the value
       <literal>0x202</literal> is pushed onto the stack; it is the
       value that the EFLAGS will get when control is passed to the
       client.  Also, the User Mode code segment selector and the
       client's entry point are pushed.  Recall that this entry
       point is patched in the <acronym>BTX</acronym> header at link
       time.  Finally, segment selectors (stored in register
       <literal>%ecx</literal>) for the segment registers
       <literal>%gs, %fs, %ds and %es</literal> are pushed onto the
       stack, along with the value at <literal>%edx</literal>
       (<literal>0xa000</literal>).  Keep in mind the various values
       that have been pushed onto the stack (they will be popped out
       shortly).  Next, values for the remaining general purpose
       registers are also pushed onto the stack (note the
       <literal>loop</literal> that pushes the value
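       <literal>0</literal> seven times).  Now the values are popped
       off the stack.  First, the <literal>popa</literal> instruction
       pops the eight most recently pushed values: the seven zeros and
       the value of <literal>%edx</literal>.  They are stored in the
       general purpose registers in the order
       <literal>%edi, %esi, %ebp, %ebx, %edx, %ecx, %eax</literal>,
       with the slot that corresponds to <literal>%esp</literal>
       discarded, so <literal>%eax</literal> receives
       <literal>0xa000</literal>.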
       Then, the various segment selectors pushed are popped into the
       various segment registers.  Five values still remain on the
       stack.  They are popped when the <literal>iret</literal>
       instruction is executed.  This instruction first pops
       the value that was pushed from the <acronym>BTX</acronym>
       header.  This value is a pointer to <filename>boot2</filename>'s
       entry point.  It is placed in the register
       <literal>%eip</literal>, the instruction pointer register.
       Next, the segment selector for the User Code Segment is popped
       and copied to register <literal>%cs</literal>.  Remember that
       this segment's privilege level is 3, the least privileged
       level.  This means that we must provide values for the stack of
       this privilege level.  This is why the processor, besides
       further popping the value for the EFLAGS register, does two more
       pops out of the stack.  These values go to the stack
       pointer (<literal>%esp</literal>) and the stack segment
       (<literal>%ss</literal>).  Now, execution continues at
       <literal>boot2</literal>'s entry point.</para>
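 
     <para>The five values consumed by <literal>iret</literal> can be
       pictured as a structure whose first member lies at the top of
       the stack.  The following C sketch only illustrates the pop
       order described above:
       <literal>struct iret_frame</literal>, its member names, and
       <function>iret_frame_words()</function> are hypothetical and
       do not appear in the <acronym>BTX</acronym> sources.</para>
 
     <programlisting>#include &lt;stddef.h&gt;
 #include &lt;stdint.h&gt;
 
 /*
  * Hypothetical model of the stack as seen by iret when it returns to a
  * less privileged level.  The first member sits at the lowest address
  * (the top of the stack) and is popped first.
  */
 struct iret_frame {
 	uint32_t	eip;		/* client entry point */
 	uint32_t	cs;		/* User Code Segment selector */
 	uint32_t	eflags;		/* 0x202 in the BTX case */
 	uint32_t	esp;		/* user stack pointer */
 	uint32_t	ss;		/* user stack segment selector */
 };
 
 /* Number of 32-bit values popped for an inter-privilege return. */
 static size_t
 iret_frame_words(void)
 {
 	return (sizeof(struct iret_frame) / sizeof(uint32_t));
 }</programlisting>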
 
     <para>It is important to note how the User Code Segment is
       defined.  This segment's <emphasis>base address</emphasis> is
       set to <literal>0xa000</literal>.  This means that code memory
       addresses are <emphasis>relative</emphasis> to address 0xa000;
       if code being executed is fetched from address
       <literal>0x2000</literal>, the <emphasis>actual</emphasis>
       memory addressed is
       <literal>0xa000+0x2000=0xc000</literal>.</para>
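 
     <para>The translation above is a single addition.  A minimal
       sketch, assuming a flat 32-bit address space and ignoring the
       segment limit check that the processor also performs
       (<function>linear_address()</function> is a hypothetical helper,
       not part of <acronym>BTX</acronym>):</para>
 
     <programlisting>#include &lt;stdint.h&gt;
 
 /* Linear address = segment base + offset within the segment. */
 static uint32_t
 linear_address(uint32_t base, uint32_t offset)
 {
 	return (base + offset);
 }</programlisting>
 
     <para>With a base of <literal>0xa000</literal> and an offset of
       <literal>0x2000</literal>, the helper returns
       <literal>0xc000</literal>.</para>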
   </sect1>
 
   <sect1 xml:id="boot2">
     <title><application>boot2</application> Stage</title>
 
     <para><literal>boot2</literal> defines an important structure,
       <literal>struct bootinfo</literal>.  This structure is
       initialized by <literal>boot2</literal> and passed to the
       loader, and then further to the kernel.  Some fields of this
       structure are set by <literal>boot2</literal>, the rest by the
       loader.  This structure, among other information, contains the
       kernel filename, <acronym>BIOS</acronym> hard disk geometry,
       <acronym>BIOS</acronym> drive number for the boot device,
       physical memory available, and the <literal>envp</literal>
       pointer.  Its definition is:</para>
 
     <programlisting><filename>/usr/include/machine/bootinfo.h:</filename>
 struct bootinfo {
 	u_int32_t	bi_version;
 	u_int32_t	bi_kernelname;		/* represents a char * */
 	u_int32_t	bi_nfs_diskless;	/* struct nfs_diskless * */
 				/* End of fields that are always present. */
 #define	bi_endcommon	bi_n_bios_used
 	u_int32_t	bi_n_bios_used;
 	u_int32_t	bi_bios_geom[N_BIOS_GEOM];
 	u_int32_t	bi_size;
 	u_int8_t	bi_memsizes_valid;
 	u_int8_t	bi_bios_dev;		/* bootdev BIOS unit number */
 	u_int8_t	bi_pad[2];
 	u_int32_t	bi_basemem;
 	u_int32_t	bi_extmem;
 	u_int32_t	bi_symtab;		/* struct symtab * */
 	u_int32_t	bi_esymtab;		/* struct symtab * */
 				/* Items below only from advanced bootloader */
 	u_int32_t	bi_kernend;		/* end of kernel space */
 	u_int32_t	bi_envp;		/* environment */
 	u_int32_t	bi_modulep;		/* preloaded modules */
 };</programlisting>
 
     <para><literal>boot2</literal> enters a loop waiting for user
       input, then calls <function>load()</function>.  If the user
       does not press a key, the loop ends on a timeout, and
       <function>load()</function> loads the default file
       (<filename>/boot/loader</filename>).  The functions
       <function>ino_t lookup(char *filename)</function> and
       <function>int xfsread(ino_t inode, void *buf, size_t
       nbyte)</function> are used to read the content of a file into
       memory.  <filename>/boot/loader</filename> is an
       <acronym>ELF</acronym> binary whose
       <acronym>ELF</acronym> header is preceded by
       <filename>a.out</filename>'s <literal>struct
       exec</literal> structure.  <function>load()</function> scans the
       loader's <acronym>ELF</acronym> header, loads the content of
       <filename>/boot/loader</filename> into memory, and passes
       execution to the loader's entry point:</para>
 
     <programlisting><filename>sys/boot/i386/boot2/boot2.c:</filename>
     __exec((caddr_t)addr, RB_BOOTINFO | (opts &amp; RBX_MASK),
 	   MAKEBOOTDEV(dev_maj[dsk.type], 0, dsk.slice, dsk.unit, dsk.part),
 	   0, 0, 0, VTOP(&amp;bootinfo));</programlisting>
   </sect1>
 
   <sect1 xml:id="boot-loader">
     <title><application>loader</application> Stage</title>
 
     <para><application>loader</application> is a
       <acronym>BTX</acronym> client as well.  I will not describe it
       here in detail; there is a comprehensive manual page written by
       Mike Smith, &man.loader.8;.  The underlying mechanisms and
       <acronym>BTX</acronym> were discussed above.</para>
 
     <para>The main task of the loader is to boot the kernel.  Once
       the kernel is loaded into memory, the loader calls it:</para>
 
     <programlisting><filename>sys/boot/common/boot.c:</filename>
     /* Call the exec handler from the loader matching the kernel */
     module_formats[km-&gt;m_loader]-&gt;l_exec(km);</programlisting>
   </sect1>
 
   <sect1 xml:id="boot-kernel">
     <title>Kernel Initialization</title>
 
     <para>Let us take a look at the command that links the kernel.
       This will help identify the exact location where the loader
       passes execution to the kernel.  This location is the kernel's
       actual entry point.</para>
 
     <programlisting><filename>sys/conf/Makefile.i386:</filename>
 ld -elf -Bdynamic -T /usr/src/sys/conf/ldscript.i386  -export-dynamic \
 -dynamic-linker /red/herring -o kernel -X locore.o \
 &lt;lots of kernel .o files&gt;</programlisting>
 
     <indexterm><primary>ELF</primary></indexterm>
     <para>A few interesting things can be seen here.  First, the
       kernel is a dynamically linked ELF binary, but the dynamic
       linker for the kernel is <filename>/red/herring</filename>,
       which is definitely a bogus file.  Second, the file
       <filename>sys/conf/ldscript.i386</filename> gives an idea of
       which <application>ld</application> options are used when
       linking a kernel.  Reading through the first few lines, the
       string</para>
 
     <programlisting><filename>sys/conf/ldscript.i386:</filename>
 ENTRY(btext)</programlisting>
 
     <para>says that a kernel's entry point is the symbol
       <literal>btext</literal>.  This symbol is defined in
       <filename>locore.s</filename>:</para>
 
     <programlisting><filename>sys/i386/i386/locore.s:</filename>
 	.text
 /**********************************************************************
  *
  * This is where the bootblocks start us, set the ball rolling...
  *
  */
 NON_GPROF_ENTRY(btext)</programlisting>
 
     <para>First, the register EFLAGS is set to a predefined value of
       0x00000002.  Then all the segment registers are
       initialized:</para>
 
     <programlisting><filename>sys/i386/i386/locore.s:</filename>
 /* Don't trust what the BIOS gives for eflags. */
 	pushl	$PSL_KERNEL
 	popfl
 
 /*
  * Don't trust what the BIOS gives for %fs and %gs.  Trust the bootstrap
  * to set %cs, %ds, %es and %ss.
  */
 	mov	%ds, %ax
 	mov	%ax, %fs
 	mov	%ax, %gs</programlisting>
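 
     <para>The two EFLAGS values seen so far differ only in the
       interrupt enable flag: <literal>0x202</literal>, given to the
       <acronym>BTX</acronym> client, has it set, while the kernel's
       <literal>0x00000002</literal> has it clear.  A minimal sketch
       that decodes this, using hypothetical macro names rather than
       the kernel's own <literal>PSL_*</literal> definitions:</para>
 
     <programlisting>#include &lt;stdbool.h&gt;
 #include &lt;stdint.h&gt;
 
 #define	EFLAGS_RESERVED1	0x00000002u	/* bit 1, always reads as 1 */
 #define	EFLAGS_IF		0x00000200u	/* bit 9, interrupt enable */
 
 /* Report whether maskable interrupts are enabled in an EFLAGS value. */
 static bool
 interrupts_enabled(uint32_t eflags)
 {
 	return ((eflags &amp; EFLAGS_IF) != 0);
 }</programlisting>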
 
     <para>btext calls the routines
       <function>recover_bootinfo()</function>,
       <function>identify_cpu()</function>,
       <function>create_pagetables()</function>, which are also defined
       in <filename>locore.s</filename>.  Here is a description of what
       they do:</para>
 
     <informaltable frame="none" pgwide="1">
       <tgroup cols="2" align="left">
 	<tbody>
 	  <row>
 	    <entry><function>recover_bootinfo</function></entry>
 	    <entry>This routine parses the parameters to the kernel
 	      passed from the bootstrap.  The kernel may have been
 	      booted in 3 ways: by the loader, described above, by the
 	      old disk boot blocks, or by the old diskless boot
 	      procedure.  This function determines the booting method,
 	      and stores the <literal>struct bootinfo</literal>
 	      structure into the kernel memory.</entry>
 	  </row>
 
 	  <row>
 	    <entry><function>identify_cpu</function></entry>
 	    <entry>This function tries to find out what CPU it is
 	      running on, storing the value found in a variable
 	      <varname>_cpu</varname>.</entry>
 	  </row>
 
 	  <row>
 	    <entry><function>create_pagetables</function></entry>
 	    <entry>This function allocates and fills out a Page Table
 	      Directory at the top of the kernel memory area.</entry>
 	  </row>
 	</tbody>
       </tgroup>
     </informaltable>
 
     <para>The next step is to enable VME, if the CPU supports
       it:</para>
 
     <programlisting>	testl	$CPUID_VME, R(_cpu_feature)
 	jz	1f
 	movl	%cr4, %eax
 	orl	$CR4_VME, %eax
 	movl	%eax, %cr4</programlisting>
 
     <para>Then paging is enabled:</para>
 
     <programlisting>/* Now enable paging */
 	movl	R(_IdlePTD), %eax
 	movl	%eax,%cr3			/* load ptd addr into mmu */
 	movl	%cr0,%eax			/* get control word */
 	orl	$CR0_PE|CR0_PG,%eax		/* enable paging */
 	movl	%eax,%cr0			/* and let's page NOW! */</programlisting>
 
     <para>Because paging is now enabled, the next three lines of code
       perform a jump so that execution continues in the virtualized
       address space:</para>
 
     <programlisting>	pushl	$begin				/* jump to high virtualized address */
 	ret
 
 /* now running relocated at KERNBASE where the system is linked to run */
 begin:</programlisting>
 
     <para>The function <function>init386()</function> is called with
       a pointer to the first free physical page, followed by
       <function>mi_startup()</function>.
       <function>init386()</function> is an architecture-dependent
       initialization function, and <function>mi_startup()</function>
       is an architecture-independent one (the
       <literal>mi_</literal> prefix stands for Machine Independent).
       The kernel never returns from
       <function>mi_startup()</function>; calling it completes the
       boot process:</para>
 
     <programlisting><filename>sys/i386/i386/locore.s:</filename>
 	movl	physfree, %esi
 	pushl	%esi				/* value of first for init386(first) */
 	call	_init386			/* wire 386 chip for unix operation */
 	call	_mi_startup			/* autoconfiguration, mountroot etc */
 	hlt		/* never returns to here */</programlisting>
 
     <sect2>
       <title><function>init386()</function></title>
 
       <para><function>init386()</function> is defined in
 	<filename>sys/i386/i386/machdep.c</filename> and performs
 	low-level initialization specific to the i386 chip.  The
 	switch to protected mode was performed by the loader.  The
 	loader has created the very first task, in which the kernel
 	continues to operate.  Before looking at the code, consider
 	the tasks the processor must complete to initialize protected
 	mode execution:</para>
 
       <itemizedlist>
 	<listitem>
 	  <para>Initialize the kernel tunable parameters, passed from
 	    the bootstrapping program.</para>
 	</listitem>
 
 	<listitem>
 	  <para>Prepare the GDT.</para>
 	</listitem>
 
 	<listitem>
 	  <para>Prepare the IDT.</para>
 	</listitem>
 
 	<listitem>
 	  <para>Initialize the system console.</para>
 	</listitem>
 
 	<listitem>
 	  <para>Initialize the DDB, if it is compiled into the
 	    kernel.</para>
 	</listitem>
 
 	<listitem>
 	  <para>Initialize the TSS.</para>
 	</listitem>
 
 	<listitem>
 	  <para>Prepare the LDT.</para>
 	</listitem>
 
 	<listitem>
 	  <para>Set up proc0's pcb.</para>
 	</listitem>
       </itemizedlist>
 
       <indexterm><primary>parameters</primary></indexterm>
       <para><function>init386()</function> initializes the tunable
 	parameters passed from bootstrap by setting the environment
 	pointer (envp) and calling <function>init_param1()</function>.
 	The envp pointer has been passed from loader in the
 	<literal>bootinfo</literal> structure:</para>
 
       <programlisting><filename>sys/i386/i386/machdep.c:</filename>
 		kern_envp = (caddr_t)bootinfo.bi_envp + KERNBASE;
 
 	/* Init basic tunables, hz etc */
 	init_param1();</programlisting>
 
       <para><function>init_param1()</function> is defined in
 	<filename>sys/kern/subr_param.c</filename>.  That file has a
 	number of sysctls, and two functions,
 	<function>init_param1()</function> and
 	<function>init_param2()</function>, that are called from
 	<function>init386()</function>:</para>
 
       <programlisting><filename>sys/kern/subr_param.c:</filename>
 	hz = HZ;
 	TUNABLE_INT_FETCH("kern.hz", &amp;hz);</programlisting>
 
       <para>TUNABLE_&lt;typename&gt;_FETCH is used to fetch the value
 	from the environment:</para>
 
       <programlisting><filename>/usr/src/sys/sys/kernel.h:</filename>
 #define	TUNABLE_INT_FETCH(path, var)	getenv_int((path), (var))</programlisting>
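 
       <para>The effect of such a fetch can be sketched in user space.
 	The sketch assumes that the environment is a block of
 	NUL-terminated <literal>name=value</literal> strings ending
 	with an empty string, and that the variable is left untouched
 	when the name is absent.  <function>fetch_int()</function> is
 	a hypothetical stand-in, not the kernel's
 	<function>getenv_int()</function>:</para>
 
       <programlisting>#include &lt;stdlib.h&gt;
 #include &lt;string.h&gt;
 
 /*
  * Look up "name" in a block of NUL-terminated "name=value" strings that
  * ends with an empty string.  On success store the value and return 1;
  * otherwise return 0 and leave *var unchanged.
  */
 static int
 fetch_int(const char *env, const char *name, int *var)
 {
 	size_t len = strlen(name);
 	const char *p, *value;
 	char *end;
 	long val;
 
 	for (p = env; *p != '\0'; p += strlen(p) + 1) {
 		if (strncmp(p, name, len) != 0 || p[len] != '=')
 			continue;
 		value = p + len + 1;
 		val = strtol(value, &amp;end, 0);
 		if (end == value || *end != '\0')
 			return (0);	/* not a plain integer */
 		*var = (int)val;
 		return (1);
 	}
 	return (0);
 }</programlisting>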
 
       <para>Sysctl <literal>kern.hz</literal> is the system clock
 	tick rate.  Additionally, these sysctls are set by
 	<function>init_param1()</function>: <literal>kern.maxswzone,
 	kern.maxbcache, kern.maxtsiz, kern.dfldsiz, kern.maxdsiz,
 	kern.dflssiz, kern.maxssiz, kern.sgrowsiz</literal>.</para>
 
       <indexterm>
 	<primary>Global Descriptors Table (GDT)</primary>
       </indexterm>
 
       <para>Then <function>init386()</function> prepares the Global
 	Descriptors Table (GDT).  Every task on an x86 is running in
 	its own virtual address space, and this space is addressed by
 	a segment:offset pair.  Say, for instance, the current
 	instruction to be executed by the processor lies at CS:EIP,
 	then the linear virtual address for that instruction would be
 	<quote>the virtual address of code segment CS</quote> + EIP.
 	For convenience, segments begin at virtual address 0 and end
 	at the 4&nbsp;GB boundary.  Therefore, the instruction's linear
 	virtual address for this example would just be the value of
 	EIP.  Segment registers such as CS and DS hold selectors,
 	that is, indexes into the GDT (to be more precise, an index is
 	not a selector itself, but the INDEX field of a selector).
 	FreeBSD's GDT holds descriptors for 15 selectors per
 	CPU:</para>
 
       <programlisting><filename>sys/i386/i386/machdep.c:</filename>
 union descriptor gdt[NGDT * MAXCPU];	/* global descriptor table */
 
 <filename>sys/i386/include/segments.h:</filename>
 /*
  * Entries in the Global Descriptor Table (GDT)
  */
 #define	GNULL_SEL	0	/* Null Descriptor */
 #define	GCODE_SEL	1	/* Kernel Code Descriptor */
 #define	GDATA_SEL	2	/* Kernel Data Descriptor */
 #define	GPRIV_SEL	3	/* SMP Per-Processor Private Data */
 #define	GPROC0_SEL	4	/* Task state process slot zero and up */
 #define	GLDT_SEL	5	/* LDT - eventually one per process */
 #define	GUSERLDT_SEL	6	/* User LDT */
 #define	GTGATE_SEL	7	/* Process task switch gate */
 #define	GBIOSLOWMEM_SEL	8	/* BIOS low memory access (must be entry 8) */
 #define	GPANIC_SEL	9	/* Task state to consider panic from */
 #define GBIOSCODE32_SEL	10	/* BIOS interface (32bit Code) */
 #define GBIOSCODE16_SEL	11	/* BIOS interface (16bit Code) */
 #define GBIOSDATA_SEL	12	/* BIOS interface (Data) */
 #define GBIOSUTIL_SEL	13	/* BIOS interface (Utility) */
 #define GBIOSARGS_SEL	14	/* BIOS interface (Arguments) */</programlisting>
 
       <para>Note that those #defines are not selectors themselves, but
 	just the INDEX field of a selector, so they are exactly the
 	indices into the GDT.  For example, the actual selector for the
 	kernel code (GCODE_SEL) has the value 0x08: index 1 shifted
 	left by three bits, with the table indicator and requested
 	privilege level bits set to zero.</para>
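 
       <para>The relation between an index and a selector can be
 	sketched as follows.  The layout (INDEX in bits 3 to 15, table
 	indicator in bit 2, requested privilege level in bits 0 and 1)
 	is the x86 selector format; the macro and function names are
 	hypothetical and are not the ones used in
 	<filename>segments.h</filename>:</para>
 
       <programlisting>#include &lt;stdint.h&gt;
 
 #define	SEL_RPL_KERNEL	0u	/* requested privilege level 0 */
 #define	SEL_RPL_USER	3u	/* requested privilege level 3 */
 #define	SEL_TI_GDT	0u	/* table indicator clear: GDT */
 #define	SEL_TI_LDT	4u	/* table indicator set (bit 2): LDT */
 
 /* Build a 16-bit selector from its INDEX, TI and RPL fields. */
 static uint16_t
 make_selector(unsigned int index, unsigned int ti, unsigned int rpl)
 {
 	return ((uint16_t)((index &lt;&lt; 3) | ti | rpl));
 }
 
 /* Extract the INDEX field (bits 3..15) from a selector. */
 static unsigned int
 selector_index(uint16_t sel)
 {
 	return ((unsigned int)sel &gt;&gt; 3);
 }</programlisting>
 
       <para>Under these assumptions,
 	<literal>make_selector(1, SEL_TI_GDT, SEL_RPL_KERNEL)</literal>
 	yields 0x08, the kernel code selector mentioned above.</para>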
 
       <indexterm><primary>Interrupt Descriptor Table
 	  (IDT)</primary></indexterm>
       <para>The next step is to initialize the Interrupt Descriptor
 	Table (IDT).  This table is referenced by the processor when a
 	software or hardware interrupt occurs.  For example, to make a
 	system call, a user application issues the
 	<literal>INT 0x80</literal> instruction.  This is a software
 	interrupt, so the processor's hardware looks up a record with
 	index 0x80 in the IDT.  This record points to the routine that
 	handles this interrupt; in this particular case, that is
 	the kernel's syscall gate.  The IDT may have a maximum of 256
 	(0x100) records.  The kernel allocates NIDT records for the
 	IDT, where NIDT is the maximum (256):</para>
 
       <programlisting><filename>sys/i386/i386/machdep.c:</filename>
 static struct gate_descriptor idt0[NIDT];
 struct gate_descriptor *idt = &amp;idt0[0];	/* interrupt descriptor table */</programlisting>
 
       <para>For each interrupt, an appropriate handler is set.  The
 	syscall gate for <literal>INT 0x80</literal> is set as
 	well:</para>
 
       <programlisting><filename>sys/i386/i386/machdep.c:</filename>
 	setidt(0x80, &amp;IDTVEC(int0x80_syscall),
 			SDT_SYS386TGT, SEL_UPL, GSEL(GCODE_SEL, SEL_KPL));</programlisting>
 
       <para>So when a userland application issues the
 	<literal>INT 0x80</literal> instruction, control will transfer
 	to the function <function>_Xint0x80_syscall</function>, which
 	is in the kernel code segment and will be executed with
 	supervisor privileges.</para>
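 
       <para>The lookup that the processor performs can be modelled,
 	very loosely, as an array of handlers indexed by vector
 	number.  This is a user-space sketch with hypothetical names;
 	real gate descriptors also carry a code segment selector, a
 	gate type, and a privilege level, all of which are omitted
 	here:</para>
 
       <programlisting>#include &lt;stddef.h&gt;
 
 #define	NVEC	256			/* maximum number of vectors */
 
 typedef int (*handler_t)(void);
 
 static handler_t vec_table[NVEC];	/* all entries start out NULL */
 
 /* Install a handler for a vector; out-of-range vectors are ignored. */
 static void
 set_handler(unsigned int vec, handler_t h)
 {
 	if (vec &lt; NVEC)
 		vec_table[vec] = h;
 }
 
 /* Call the handler for a vector, or return -1 if none is installed. */
 static int
 dispatch(unsigned int vec)
 {
 	if (vec &gt;= NVEC || vec_table[vec] == NULL)
 		return (-1);
 	return (vec_table[vec]());
 }
 
 /* Stand-in for the system call entry point; returns its own vector. */
 static int
 syscall_gate(void)
 {
 	return (0x80);
 }</programlisting>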
 
       <para>Console and DDB are then initialized:</para>
       <indexterm><primary>DDB</primary></indexterm>
 
       <programlisting><filename>sys/i386/i386/machdep.c:</filename>
 	cninit();
 /* skipped */
 #ifdef DDB
 	kdb_init();
 	if (boothowto &amp; RB_KDB)
 		Debugger("Boot flags requested debugger");
 #endif</programlisting>
 
       <para>The Task State Segment (TSS) is another x86 protected
 	mode structure.  The hardware uses the TSS to store task
 	information when a task switch occurs.</para>
 
       <para>The Local Descriptors Table (LDT) is used to reference
 	userland code and data.  Several selectors are defined to
 	point into the LDT, namely the system call gates and the user
 	code and data selectors:</para>
 
       <programlisting><filename>/usr/include/machine/segments.h:</filename>
 #define	LSYS5CALLS_SEL	0	/* forced by intel BCS */
 #define	LSYS5SIGR_SEL	1
 #define	L43BSDCALLS_SEL	2	/* notyet */
 #define	LUCODE_SEL	3
 #define	LSOL26CALLS_SEL	4	/* Solaris &gt;= 2.6 system call gate */
 #define	LUDATA_SEL	5
 /* separate stack, es,fs,gs sels ? */
 /* #define	LPOSIXCALLS_SEL	5*/	/* notyet */
 #define LBSDICALLS_SEL	16	/* BSDI system call gate */
 #define NLDT		(LBSDICALLS_SEL + 1)</programlisting>
 
       <para>Next, proc0's Process Control Block
 	(<literal>struct pcb</literal>) structure is initialized.
 	proc0 is a <literal>struct proc</literal> structure that
 	describes a kernel process.  It is always present while the
 	kernel is running, therefore it is declared as global:</para>
 
       <programlisting><filename>sys/kern/kern_init.c:</filename>
     struct	proc proc0;</programlisting>
 
       <para>The structure <literal>struct pcb</literal> is a part of a
 	proc structure.  It is defined in
 	<filename>/usr/include/machine/pcb.h</filename> and holds
 	process information specific to the i386 architecture, such
 	as register values.</para>
     </sect2>
 
     <sect2>
       <title><function>mi_startup()</function></title>
 
       <para>This function performs a bubble sort of all the system
 	initialization objects and then calls the entry of each object
 	one by one:</para>
 
       <programlisting><filename>sys/kern/init_main.c:</filename>
 	for (sipp = sysinit; *sipp; sipp++) {
 
 		/* ... skipped ... */
 
 		/* Call function */
 		(*((*sipp)-&gt;func))((*sipp)-&gt;udata);
 		/* ... skipped ... */
 	}</programlisting>
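 
       <para>The sort-then-call behaviour can be sketched in user
 	space.  The sketch assumes that objects are ordered by
 	subsystem ID first and by order within a subsystem second;
 	<literal>struct sysinit_model</literal>,
 	<function>run_sysinits()</function>, and the
 	<function>record()</function> callback are hypothetical and
 	are not taken from <filename>init_main.c</filename>:</para>
 
       <programlisting>#include &lt;stddef.h&gt;
 
 struct sysinit_model {
 	unsigned int	subsystem;	/* major ordering key */
 	unsigned int	order;		/* minor ordering key */
 	void		(*func)(const void *);
 	const void	*udata;
 };
 
 static char	trace[16];		/* records the call order */
 static size_t	trace_len;
 
 /* Example callback: append the first character of udata to trace. */
 static void
 record(const void *udata)
 {
 	if (trace_len &lt; sizeof(trace))
 		trace[trace_len++] = *(const char *)udata;
 }
 
 /* Bubble sort the set by (subsystem, order), then call each entry. */
 static void
 run_sysinits(struct sysinit_model **set, size_t n)
 {
 	struct sysinit_model *tmp;
 	size_t i, j;
 
 	for (i = 0; i + 1 &lt; n; i++) {
 		for (j = 0; j + 1 &lt; n - i; j++) {
 			if (set[j]-&gt;subsystem &lt; set[j + 1]-&gt;subsystem ||
 			    (set[j]-&gt;subsystem == set[j + 1]-&gt;subsystem &amp;&amp;
 			    set[j]-&gt;order &lt;= set[j + 1]-&gt;order))
 				continue;	/* already in order */
 			tmp = set[j];
 			set[j] = set[j + 1];
 			set[j + 1] = tmp;
 		}
 	}
 	for (i = 0; i &lt; n; i++)
 		set[i]-&gt;func(set[i]-&gt;udata);
 }</programlisting>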
 
       <para>Although the sysinit framework is described in the <link
 	  xlink:href="&url.doc.langbase;/books/developers-handbook">Developers'
 	  Handbook</link>, I will discuss the internals of it.</para>
 
       <indexterm><primary>sysinit objects</primary></indexterm>
       <para>Every system initialization object (sysinit object) is
 	created by calling a SYSINIT() macro.  Take as an example
 	the <literal>announce</literal> sysinit object.  This object
 	prints the copyright message:</para>
 
       <programlisting><filename>sys/kern/init_main.c:</filename>
 static void
 print_caddr_t(void *data __unused)
 {
 	printf("%s", (char *)data);
 }
 SYSINIT(announce, SI_SUB_COPYRIGHT, SI_ORDER_FIRST, print_caddr_t, copyright)</programlisting>
 
       <para>The subsystem ID for this object is SI_SUB_COPYRIGHT
 	(0x0800001), which comes right after the SI_SUB_CONSOLE
 	(0x0800000).  So, the copyright message will be printed out
 	first, just after the console initialization.</para>
 
       <para>Let us take a look at what exactly the macro
 	<literal>SYSINIT()</literal> does.  It expands to a
 	<literal>C_SYSINIT()</literal> macro.  The
 	<literal>C_SYSINIT()</literal> macro then expands to a static
 	<literal>struct sysinit</literal> structure declaration with
 	another <literal>DATA_SET</literal> macro call:</para>
 
       <programlisting><filename>/usr/include/sys/kernel.h:</filename>
 #define C_SYSINIT(uniquifier, subsystem, order, func, ident)	\
 	static struct sysinit uniquifier ## _sys_init = {	\
 		subsystem,					\
 		order,						\
 		func,						\
 		ident						\
 	};							\
 	DATA_SET(sysinit_set,uniquifier ## _sys_init);
 
 #define	SYSINIT(uniquifier, subsystem, order, func, ident)	\
 	C_SYSINIT(uniquifier, subsystem, order,			\
 	(sysinit_cfunc_t)(sysinit_nfunc_t)func, (void *)ident)</programlisting>
 
       <para>The <literal>DATA_SET()</literal> macro expands to a
 	<literal>MAKE_SET()</literal>, and that macro is the point
 	where all the sysinit magic is hidden:</para>
 
       <programlisting><filename>/usr/include/linker_set.h:</filename>
 #define MAKE_SET(set, sym)						\
 	static void const * const __set_##set##_sym_##sym = &amp;sym;	\
 	__asm(".section .set." #set ",\"aw\"");				\
 	__asm(".long " #sym);						\
 	__asm(".previous")
 #endif
 #define TEXT_SET(set, sym) MAKE_SET(set, sym)
 #define DATA_SET(set, sym) MAKE_SET(set, sym)</programlisting>
 
       <para>In our case, the following declaration will occur:</para>
 
       <programlisting>static struct sysinit announce_sys_init = {
 	SI_SUB_COPYRIGHT,
 	SI_ORDER_FIRST,
 	(sysinit_cfunc_t)(sysinit_nfunc_t)  print_caddr_t,
 	(void *) copyright
 };
 
 static void const *const __set_sysinit_set_sym_announce_sys_init =
     &amp;announce_sys_init;
 __asm(".section .set.sysinit_set" ",\"aw\"");
 __asm(".long " "announce_sys_init");
 __asm(".previous");</programlisting>
 
       <para>The first <literal>__asm</literal> statement will create
 	an ELF section within the kernel's executable.  This will
 	happen at kernel link time.  The section will have the name
 	<literal>.set.sysinit_set</literal>.  The content of this
 	section is one 32-bit value, the address of the
 	<literal>announce_sys_init</literal> structure, and that is
 	what the second <literal>__asm</literal> statement emits.  The
 	third <literal>__asm</literal> statement marks the end of the
 	section.  If a directive with the same section name occurred
 	before, the content, that is, the 32-bit value, will be
 	appended to the existing section, thus forming an array of
 	32-bit pointers.</para>
 
       <para>Running <application>objdump</application> on a kernel
 	binary, you may notice the presence of such small
 	sections:</para>
 
       <screen>&prompt.user; <userinput>objdump -h /kernel</userinput>
   7 .set.cons_set 00000014  c03164c0  c03164c0  002154c0  2**2
                   CONTENTS, ALLOC, LOAD, DATA
   8 .set.kbddriver_set 00000010  c03164d4  c03164d4  002154d4  2**2
                   CONTENTS, ALLOC, LOAD, DATA
   9 .set.scrndr_set 00000024  c03164e4  c03164e4  002154e4  2**2
                   CONTENTS, ALLOC, LOAD, DATA
  10 .set.scterm_set 0000000c  c0316508  c0316508  00215508  2**2
                   CONTENTS, ALLOC, LOAD, DATA
  11 .set.sysctl_set 0000097c  c0316514  c0316514  00215514  2**2
                   CONTENTS, ALLOC, LOAD, DATA
  12 .set.sysinit_set 00000664  c0316e90  c0316e90  00215e90  2**2
                   CONTENTS, ALLOC, LOAD, DATA</screen>
 
       <para>This screen dump shows that the size of the
 	<literal>.set.sysinit_set</literal> section is 0x664 bytes, so
 	<literal>0x664/sizeof(void *)</literal> sysinit objects are
 	compiled into the kernel.  With 4-byte pointers on i386, that
 	is 1636/4 = 409 objects.  The other sections, such as
 	<literal>.set.sysctl_set</literal>, represent other linker
 	sets.</para>
 
       <para>By defining a variable of type <literal>struct
 	  linker_set</literal> the content of
 	<literal>.set.sysinit_set</literal> section will be
 	<quote>collected</quote> into that variable:</para>
 
       <programlisting><filename>sys/kern/init_main.c:</filename>
       extern struct linker_set sysinit_set; /* XXX */</programlisting>
 
       <para>The <literal>struct linker_set</literal> is defined as
 	follows:</para>
 
       <programlisting><filename>/usr/include/linker_set.h:</filename>
 struct linker_set {
 	int	ls_length;
 	void	*ls_items[1];		/* really ls_length of them, trailing NULL */
 };</programlisting>
 
       <para>The first field is equal to the number of sysinit
 	objects, and the second field is a NULL-terminated array
 	of pointers to them.</para>
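 
       <para>The relation between the length and the NULL terminator
 	can be checked with a small user-space model.  For simplicity,
 	the hypothetical <literal>struct linker_set_model</literal>
 	below points at its item array instead of embedding it, so it
 	is not layout-compatible with the real
 	<literal>struct linker_set</literal>:</para>
 
       <programlisting>#include &lt;stddef.h&gt;
 
 struct linker_set_model {
 	int		ls_length;	/* number of items */
 	const void	**ls_items;	/* ls_length pointers, then NULL */
 };
 
 /* Count the items by walking the array up to the trailing NULL. */
 static int
 count_items(const struct linker_set_model *set)
 {
 	int n = 0;
 
 	while (set-&gt;ls_items[n] != NULL)
 		n++;
 	return (n);
 }</programlisting>
 
       <para>For a well-formed set, the count obtained this way equals
 	<varname>ls_length</varname>.</para>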
 
       <para>Returning to the <function>mi_startup()</function>
 	discussion, it should now be clear how the sysinit objects
 	are organized.  The <function>mi_startup()</function>
 	function sorts them and calls each.  The very last object is
 	the system scheduler:</para>
 
       <programlisting><filename>/usr/include/sys/kernel.h:</filename>
 enum sysinit_sub_id {
 	SI_SUB_DUMMY		= 0x0000000,	/* not executed; for linker*/
 	SI_SUB_DONE		= 0x0000001,	/* processed*/
 	SI_SUB_CONSOLE		= 0x0800000,	/* console*/
 	SI_SUB_COPYRIGHT	= 0x0800001,	/* first use of console*/
 ...
 	SI_SUB_RUN_SCHEDULER	= 0xfffffff	/* scheduler: no return*/
 };</programlisting>
 
       <para>The system scheduler sysinit object is defined in the file
 	<filename>sys/vm/vm_glue.c</filename>, and the entry point for
 	that object is <function>scheduler()</function>.  That
 	function is actually an infinite loop, and it represents a
 	process with PID 0, the swapper process.  The proc0 structure,
 	mentioned before, is used to describe it.</para>
 
       <para>The first user process, called <emphasis>init</emphasis>,
 	is created by the sysinit object
 	<literal>init</literal>:</para>
 
       <programlisting><filename>sys/kern/init_main.c:</filename>
 static void
 create_init(const void *udata __unused)
 {
 	int error;
 	int s;
 
 	s = splhigh();
 	error = fork1(&amp;proc0, RFFDG | RFPROC, &amp;initproc);
 	if (error)
 		panic("cannot fork init: %d\n", error);
 	initproc-&gt;p_flag |= P_INMEM | P_SYSTEM;
 	cpu_set_fork_handler(initproc, start_init, NULL);
 	remrunqueue(initproc);
 	splx(s);
 }
 SYSINIT(init,SI_SUB_CREATE_INIT, SI_ORDER_FIRST, create_init, NULL)</programlisting>
 
       <para><function>create_init()</function> allocates a new
 	process by calling <function>fork1()</function>, but does not
 	mark it runnable.  When this new process is scheduled for
 	execution by the scheduler,
 	<function>start_init()</function> is called.  That
 	function is defined in <filename>init_main.c</filename>.  It
 	tries to load and exec the <filename>init</filename> binary,
 	probing <filename>/sbin/init</filename> first, then
 	<filename>/sbin/oinit</filename>,
 	<filename>/sbin/init.bak</filename>, and finally
 	<filename>/stand/sysinstall</filename>:</para>
 
       <programlisting><filename>sys/kern/init_main.c:</filename>
 static char init_path[MAXPATHLEN] =
 #ifdef	INIT_PATH
     __XSTRING(INIT_PATH);
 #else
     "/sbin/init:/sbin/oinit:/sbin/init.bak:/stand/sysinstall";
 #endif</programlisting>
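 
       <para>Walking such a colon-separated list one component at a
 	time can be sketched as follows.
 	<function>path_component()</function> is a hypothetical
 	user-space helper; it is not how
 	<function>start_init()</function> itself is written:</para>
 
       <programlisting>#include &lt;stddef.h&gt;
 #include &lt;string.h&gt;
 
 /*
  * Copy the idx-th colon-separated component of "path" into "buf".
  * Return 1 on success, or 0 if there is no such component or it does
  * not fit into "buf".
  */
 static int
 path_component(const char *path, size_t idx, char *buf, size_t buflen)
 {
 	const char *start = path;
 	const char *end;
 	size_t len;
 
 	for (;;) {
 		end = strchr(start, ':');
 		len = (end != NULL) ? (size_t)(end - start) : strlen(start);
 		if (idx == 0)
 			break;
 		if (end == NULL)
 			return (0);	/* ran out of components */
 		start = end + 1;
 		idx--;
 	}
 	if (len + 1 &gt; buflen)
 		return (0);
 	memcpy(buf, start, len);
 	buf[len] = '\0';
 	return (1);
 }</programlisting>
 
       <para>A caller would request component 0, then 1, and so on,
 	trying each returned path until one can be executed or the
 	helper returns 0.</para>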
     </sect2>
   </sect1>
 </chapter>
diff --git a/en_US.ISO8859-1/books/arch-handbook/driverbasics/chapter.xml b/en_US.ISO8859-1/books/arch-handbook/driverbasics/chapter.xml
index 6e5551873b..9826e3a1d9 100644
--- a/en_US.ISO8859-1/books/arch-handbook/driverbasics/chapter.xml
+++ b/en_US.ISO8859-1/books/arch-handbook/driverbasics/chapter.xml
@@ -1,423 +1,423 @@
 <?xml version="1.0" encoding="iso-8859-1"?>
 <!--
      The FreeBSD Documentation Project
 
      $FreeBSD$
 -->
 
 <chapter xmlns="http://docbook.org/ns/docbook"
   xmlns:xlink="http://www.w3.org/1999/xlink" version="5.0"
   xml:id="driverbasics">
 
   <info>
     <title>Writing FreeBSD Device Drivers</title>
 
     <authorgroup>
       <author>
 	<personname>
 	  <firstname>Murray</firstname>
 	  <surname>Stokely</surname>
 	</personname>
 
 	<contrib>Written by </contrib>
       </author>
     </authorgroup>
 
     <authorgroup>
       <author>
 	<personname>
 	  <firstname>J&ouml;rg</firstname>
 	  <surname>Wunsch</surname>
 	</personname>
 
 	<contrib>Based on intro(4) manual page by </contrib>
       </author>
     </authorgroup>
   </info>
 
   <sect1 xml:id="driverbasics-intro">
     <title>Introduction</title>
 
     <indexterm><primary>device driver</primary></indexterm>
     <indexterm><primary>pseudo-device</primary></indexterm>
 
     <para>This chapter provides a brief introduction to writing device
       drivers for FreeBSD.  A device in this context is a term used
       mostly for hardware-related stuff that belongs to the system,
       like disks, printers, or a graphics display with its keyboard.
       A device driver is the software component of the operating
       system that controls a specific device.  There are also
       so-called pseudo-devices where a device driver emulates the
       behavior of a device in software without any particular
       underlying hardware.  Device drivers can be compiled into the
       system statically or loaded on demand through the dynamic kernel
       linker facility <literal>kld</literal>.</para>
 
     <indexterm><primary>device nodes</primary></indexterm>
 
     <para>Most devices in a &unix;-like operating system are accessed
       through device nodes, sometimes also called special files.
       These files are usually located under the directory
       <filename>/dev</filename> in the filesystem hierarchy.</para>
 
     <para>Device drivers can roughly be broken down into two
       categories: character and network device drivers.</para>
 
   </sect1>
 
   <sect1 xml:id="driverbasics-kld">
     <title>Dynamic Kernel Linker Facility - KLD</title>
 
     <indexterm>
       <primary>kernel linking</primary>
       <secondary>dynamic</secondary>
     </indexterm>
     <indexterm>
       <primary>kernel loadable modules (KLD)</primary>
     </indexterm>
 
     <para>The kld interface allows system administrators to
       dynamically add and remove functionality from a running system.
       This allows device driver writers to load their changes into
       a running kernel without rebooting each time to test
       them.</para>
 
     <indexterm>
       <primary>kernel modules</primary>
       <secondary>loading</secondary>
     </indexterm>
     <indexterm>
       <primary>kernel modules</primary>
       <secondary>unloading</secondary>
     </indexterm>
     <indexterm>
       <primary>kernel modules</primary>
       <secondary>listing</secondary>
     </indexterm>
 
     <para>The kld interface is used through:</para>
 
     <itemizedlist>
       <listitem>
 	<simpara><command>kldload</command> - loads a new kernel
 	  module</simpara></listitem>
       <listitem>
 	<simpara><command>kldunload</command> - unloads a kernel
 	  module</simpara></listitem>
       <listitem>
 	<simpara><command>kldstat</command> - lists loaded
 	  modules</simpara></listitem>
     </itemizedlist>
 
     <para>Skeleton layout of a kernel module:</para>
 
     <programlisting>/*
  * KLD Skeleton
  * Inspired by Andrew Reiter's Daemonnews article
  */
 
 #include &lt;sys/types.h&gt;
 #include &lt;sys/module.h&gt;
 #include &lt;sys/systm.h&gt;  /* uprintf */
 #include &lt;sys/errno.h&gt;
 #include &lt;sys/param.h&gt;  /* defines used in kernel.h */
 #include &lt;sys/kernel.h&gt; /* types used in module initialization */
 
 /*
  * Load handler that deals with the loading and unloading of a KLD.
  */
 
 static int
 skel_loader(struct module *m, int what, void *arg)
 {
   int err = 0;
 
   switch (what) {
   case MOD_LOAD:                /* kldload */
     uprintf("Skeleton KLD loaded.\n");
     break;
   case MOD_UNLOAD:
     uprintf("Skeleton KLD unloaded.\n");
     break;
   default:
     err = EOPNOTSUPP;
     break;
   }
   return (err);
 }
 
 /* Declare this module to the rest of the kernel */
 
 static moduledata_t skel_mod = {
   "skel",
   skel_loader,
   NULL
 };
 
 DECLARE_MODULE(skeleton, skel_mod, SI_SUB_KLD, SI_ORDER_ANY);</programlisting>
 
 
     <sect2>
       <title>Makefile</title>
 
       <para>&os; provides a system makefile to simplify compiling a
 	kernel module.</para>
 
       <programlisting>SRCS=skeleton.c
 KMOD=skeleton
 
 .include &lt;bsd.kmod.mk&gt;</programlisting>
 
       <para>Running <command>make</command> with this makefile
 	will create a file <filename>skeleton.ko</filename> that can
 	be loaded into the kernel by typing:</para>
 
       <screen>&prompt.root; <userinput>kldload -v ./skeleton.ko</userinput></screen>
     </sect2>
   </sect1>
 
   <sect1 xml:id="driverbasics-char">
     <title>Character Devices</title>
 
     <indexterm>
       <primary>character devices</primary>
     </indexterm>
     <para>A character device driver is one that transfers data
       directly to and from a user process.  This is the most common
       type of device driver and there are plenty of simple examples in
       the source tree.</para>
 
     <para>This simple example pseudo-device remembers whatever values
       are written to it and can then echo them back when
       read.</para>
 
     <example>
       <title>Example of a Sample Echo Pseudo-Device Driver for
 	&os;&nbsp;10.X - 12.X</title>
 
       <programlisting>/*
  * Simple Echo pseudo-device KLD
  *
  * Murray Stokely
  * Søren (Xride) Straarup
  * Eitan Adler
  */
 
 #include &lt;sys/types.h&gt;
 #include &lt;sys/module.h&gt;
 #include &lt;sys/systm.h&gt;  /* uprintf */
 #include &lt;sys/param.h&gt;  /* defines used in kernel.h */
 #include &lt;sys/kernel.h&gt; /* types used in module initialization */
 #include &lt;sys/conf.h&gt;   /* cdevsw struct */
 #include &lt;sys/uio.h&gt;    /* uio struct */
 #include &lt;sys/malloc.h&gt;
 
 #define BUFFERSIZE 255
 
 /* Function prototypes */
 static d_open_t      echo_open;
 static d_close_t     echo_close;
 static d_read_t      echo_read;
 static d_write_t     echo_write;
 
 /* Character device entry points */
 static struct cdevsw echo_cdevsw = {
 	.d_version = D_VERSION,
 	.d_open = echo_open,
 	.d_close = echo_close,
 	.d_read = echo_read,
 	.d_write = echo_write,
 	.d_name = "echo",
 };
 
 struct s_echo {
 	char msg[BUFFERSIZE + 1];
 	int len;
 };
 
 /* vars */
 static struct cdev *echo_dev;
 static struct s_echo *echomsg;
 
 MALLOC_DECLARE(M_ECHOBUF);
 MALLOC_DEFINE(M_ECHOBUF, "echobuffer", "buffer for echo module");
 
 /*
  * This function is called by the kld[un]load(2) system calls to
  * determine what actions to take when a module is loaded or unloaded.
  */
 static int
 echo_loader(struct module *m __unused, int what, void *arg __unused)
 {
 	int error = 0;
 
 	switch (what) {
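 	case MOD_LOAD:                /* kldload */
 		/*
 		 * Allocate the buffer before creating the device node so
 		 * that the entry points never see a NULL echomsg.
 		 */
 		echomsg = malloc(sizeof(*echomsg), M_ECHOBUF, M_WAITOK |
 		    M_ZERO);
 		error = make_dev_p(MAKEDEV_CHECKNAME | MAKEDEV_WAITOK,
 		    &amp;echo_dev,
 		    &amp;echo_cdevsw,
 		    0,
 		    UID_ROOT,
 		    GID_WHEEL,
 		    0600,
 		    "echo");
 		if (error != 0) {
 			free(echomsg, M_ECHOBUF);
 			echomsg = NULL;
 			break;
 		}
 		printf("Echo device loaded.\n");
 		break;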
 	case MOD_UNLOAD:
 		destroy_dev(echo_dev);
 		free(echomsg, M_ECHOBUF);
 		printf("Echo device unloaded.\n");
 		break;
 	default:
 		error = EOPNOTSUPP;
 		break;
 	}
 	return (error);
 }
 
 static int
 echo_open(struct cdev *dev __unused, int oflags __unused, int devtype __unused,
     struct thread *td __unused)
 {
 	int error = 0;
 
 	uprintf("Opened device \"echo\" successfully.\n");
 	return (error);
 }
 
 static int
 echo_close(struct cdev *dev __unused, int fflag __unused, int devtype __unused,
     struct thread *td __unused)
 {
 
 	uprintf("Closing device \"echo\".\n");
 	return (0);
 }
 
 /*
  * The read function just takes the buf that was saved via
  * echo_write() and returns it to userland for accessing.
  * uio(9)
  */
 static int
 echo_read(struct cdev *dev __unused, struct uio *uio, int ioflag __unused)
 {
 	size_t amt;
 	int error;
 
 	/*
 	 * How big is this read operation?  Either as big as the user wants,
 	 * or as big as the remaining data.  Note that the 'len' does not
 	 * include the trailing null character.
 	 */
 	amt = MIN(uio-&gt;uio_resid, uio-&gt;uio_offset &gt;= echomsg-&gt;len + 1 ? 0 :
 	    echomsg-&gt;len + 1 - uio-&gt;uio_offset);
 
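 	if (amt == 0)
 		return (0);
 
 	/* Copy from the current offset so that partial reads continue. */
 	error = uiomove(echomsg-&gt;msg + uio-&gt;uio_offset, amt, uio);
 	if (error != 0)
 		uprintf("uiomove failed!\n");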
 
 	return (error);
 }
 
 /*
  * echo_write takes in a character string and saves it
  * to buf for later accessing.
  */
 static int
 echo_write(struct cdev *dev __unused, struct uio *uio, int ioflag __unused)
 {
 	size_t amt;
 	int error;
 
 	/*
 	 * We either write from the beginning or are appending -- do
 	 * not allow random access.
 	 */
 	if (uio-&gt;uio_offset != 0 &amp;&amp; (uio-&gt;uio_offset != echomsg-&gt;len))
 		return (EINVAL);
 
 	/* This is a new message, reset length */
 	if (uio-&gt;uio_offset == 0)
 		echomsg-&gt;len = 0;
 
 	/* Copy the string in from user memory to kernel memory */
 	amt = MIN(uio-&gt;uio_resid, (BUFFERSIZE - echomsg-&gt;len));
 
 	error = uiomove(echomsg-&gt;msg + uio-&gt;uio_offset, amt, uio);
 
 	/* Now we need to null terminate and record the length */
 	echomsg-&gt;len = uio-&gt;uio_offset;
 	echomsg-&gt;msg[echomsg-&gt;len] = 0;
 
 	if (error != 0)
 		uprintf("Write failed: bad address!\n");
 	return (error);
 }
 
 DEV_MODULE(echo, echo_loader, NULL);</programlisting>
     </example>
 
     <para>With this driver loaded try:</para>
 
     <screen>&prompt.root; <userinput>echo -n "Test Data" &gt; /dev/echo</userinput>
 &prompt.root; <userinput>cat /dev/echo</userinput>
 Opened device "echo" successfully.
 Test Data
 Closing device "echo".</screen>
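 
     <para>The same test can be driven from a small userland program.
       The following is only a sketch and not part of the driver: it
       assumes that the module above is loaded, that
       <filename>/dev/echo</filename> exists, and that the program
       is run as root, since the device node is created with mode
       0600.  The device is opened separately for writing and for
       reading so that each operation starts at offset 0.</para>
 
     <programlisting>#include &lt;err.h&gt;
 #include &lt;fcntl.h&gt;
 #include &lt;stdio.h&gt;
 #include &lt;string.h&gt;
 #include &lt;unistd.h&gt;
 
 int
 main(void)
 {
 	const char msg[] = "Test Data";
 	char buf[256];
 	ssize_t n;
 	int fd;
 
 	/* Store the message in the driver. */
 	if ((fd = open("/dev/echo", O_WRONLY)) == -1)
 		err(1, "open for writing");
 	if (write(fd, msg, strlen(msg)) == -1)
 		err(1, "write");
 	close(fd);
 
 	/* Read it back; leave room to terminate the string locally. */
 	if ((fd = open("/dev/echo", O_RDONLY)) == -1)
 		err(1, "open for reading");
 	if ((n = read(fd, buf, sizeof(buf) - 1)) == -1)
 		err(1, "read");
 	buf[n] = '\0';
 	close(fd);
 
 	printf("read back: %s\n", buf);
 	return (0);
 }</programlisting>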
 
     <para>Real hardware devices are described in the next
       chapter.</para>
   </sect1>
 
   <sect1 xml:id="driverbasics-block">
     <title>Block Devices (Are Gone)</title>
 
     <indexterm><primary>block devices</primary></indexterm>
 
     <para>Other &unix; systems may support a second type of disk
       device known as block devices.  Block devices are disk devices
       for which the kernel provides caching.  This caching makes
       block devices almost unusable, or at least dangerously
       unreliable.  The caching will reorder the sequence of write
       operations, depriving the application of the ability to know the
       exact disk contents at any one instant in time.</para>
 
     <para>This makes predictable and reliable crash recovery of
       on-disk data structures (filesystems, databases, etc.)
       impossible.  Since writes may be delayed, the kernel has no
       way to report to the application which particular write
       operation encountered a write error, which further compounds
       the consistency problem.</para>
 
     <para>For this reason, no serious applications rely on block
       devices, and in fact, almost all applications which access
       disks directly take great pains to specify that character
-      (or <quote>raw</quote>) devices should always be used.  Because
+      (or <quote>raw</quote>) devices should always be used.  As
       the implementation of the aliasing of each disk (partition) to
       two devices with different semantics significantly complicated
-      the relevant kernel code &os; dropped support for cached disk
+      the relevant kernel code, &os; dropped support for cached disk
       devices as part of the modernization of the disk I/O
       infrastructure.</para>
   </sect1>
 
   <sect1 xml:id="driverbasics-net">
     <title>Network Drivers</title>
 
     <indexterm>
       <primary>network devices</primary>
     </indexterm>
     <para>Drivers for network devices do not use device nodes in order
       to be accessed.  Their selection is based on other decisions
       made inside the kernel, and instead of calling open(), a
       program generally gains access to a network device through
       the system call socket(2).</para>
 
     <para>For more information see ifnet(9), the source of the
       loopback device, and Bill Paul's network drivers.</para>
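 
     <para>As an illustration of access through socket(2) rather than
       through a device node, the following userland sketch queries
       the flags of the loopback interface.  The interface name
       <literal>lo0</literal> and the choice of the
       <literal>SIOCGIFFLAGS</literal> request are assumptions made
       for this example only.</para>
 
     <programlisting>#include &lt;sys/types.h&gt;
 #include &lt;sys/ioctl.h&gt;
 #include &lt;sys/socket.h&gt;
 #include &lt;net/if.h&gt;
 
 #include &lt;err.h&gt;
 #include &lt;stdio.h&gt;
 #include &lt;string.h&gt;
 #include &lt;unistd.h&gt;
 
 int
 main(void)
 {
 	struct ifreq ifr;
 	int s;
 
 	/* Any socket will do; it only serves as a handle for the ioctl. */
 	if ((s = socket(AF_INET, SOCK_DGRAM, 0)) == -1)
 		err(1, "socket");
 
 	memset(&amp;ifr, 0, sizeof(ifr));
 	strlcpy(ifr.ifr_name, "lo0", sizeof(ifr.ifr_name));
 
 	/* The interface is selected by name, not by a /dev node. */
 	if (ioctl(s, SIOCGIFFLAGS, &amp;ifr) == -1)
 		err(1, "ioctl(SIOCGIFFLAGS)");
 
 	printf("lo0 is %s\n", (ifr.ifr_flags &amp; IFF_UP) ? "up" : "down");
 	close(s);
 	return (0);
 }</programlisting>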
   </sect1>
 </chapter>
diff --git a/en_US.ISO8859-1/books/arch-handbook/isa/chapter.xml b/en_US.ISO8859-1/books/arch-handbook/isa/chapter.xml
index 97bd2822c5..04de498a3f 100644
--- a/en_US.ISO8859-1/books/arch-handbook/isa/chapter.xml
+++ b/en_US.ISO8859-1/books/arch-handbook/isa/chapter.xml
@@ -1,2514 +1,2514 @@
 <?xml version="1.0" encoding="iso-8859-1"?>
 <!--
      The FreeBSD Documentation Project
 
      $FreeBSD$
 -->
 <chapter xmlns="http://docbook.org/ns/docbook" xmlns:xlink="http://www.w3.org/1999/xlink" version="5.0" xml:id="isa-driver">
   <info><title>ISA Device Drivers</title>
     <authorgroup>
       <author><personname><firstname>Sergey</firstname><surname>Babkin</surname></personname><contrib>Written by </contrib></author>
     </authorgroup>
     <authorgroup>
       <author><personname><firstname>Murray</firstname><surname>Stokely</surname></personname><contrib>Modifications for Handbook made by </contrib></author>
       <author><personname><firstname>Valentino</firstname><surname>Vaschetto</surname></personname></author>
       <author><personname><firstname>Wylie</firstname><surname>Stilwell</surname></personname></author>
     </authorgroup>
   </info>
 
   
 
   <sect1 xml:id="isa-driver-synopsis">
     <title>Synopsis</title>
 
     <indexterm><primary>ISA</primary></indexterm>
     <indexterm><primary>device driver</primary><secondary>ISA</secondary></indexterm>
 
     <para>This chapter introduces the issues relevant to writing a
       driver for an ISA device.  The pseudo-code presented here is
       rather detailed and reminiscent of the real code but is still
       only pseudo-code. It avoids the details irrelevant to the
       subject of the discussion. The real-life examples can be found
       in the source code of real drivers. In particular the drivers
       <literal>ep</literal> and <literal>aha</literal> are good sources of information.</para>
   </sect1>
 
   <sect1 xml:id="isa-driver-basics">
     <title>Basic Information</title>
 
     <para>A typical ISA driver would need the following include
       files:</para>
 
 <programlisting>#include &lt;sys/module.h&gt;
 #include &lt;sys/bus.h&gt;
 #include &lt;machine/bus.h&gt;
 #include &lt;machine/resource.h&gt;
 #include &lt;sys/rman.h&gt;
 
 #include &lt;isa/isavar.h&gt;
 #include &lt;isa/pnpvar.h&gt;</programlisting>
 
     <para>They describe the things specific to the ISA and generic
       bus subsystem.</para>
 
     <indexterm><primary>object-oriented</primary></indexterm>
     <para>The bus subsystem is implemented in an object-oriented
       fashion: its main structures are accessed by associated method
       functions.</para>
 
     <indexterm><primary>bus methods</primary></indexterm>
     <para>The list of bus methods implemented by an ISA driver is like
       one for any other bus. For a hypothetical driver named <quote>xxx</quote>
       they would be:</para>
 
     <itemizedlist>
       <listitem>
         <para><function>static void xxx_isa_identify (driver_t *,
           device_t);</function> Normally used for bus drivers, not
           device drivers. But for ISA devices this method may have
           special use: if the device provides some device-specific
           (non-PnP) way to auto-detect devices this routine may
           implement it.</para>
       </listitem>
 
       <listitem>
 	<para><function>static int xxx_isa_probe (device_t
           dev);</function> Probe for a device at a known (or PnP)
           location. This routine can also accommodate device-specific
           auto-detection of parameters for partially configured
           devices.</para>
       </listitem>
 
       <listitem>
 	<para><function>static int xxx_isa_attach (device_t
           dev);</function> Attach and initialize device.</para>
       </listitem>
 
       <listitem>
 	<para><function>static int xxx_isa_detach (device_t
           dev);</function> Detach device before unloading the driver
           module.</para>
       </listitem>
 
       <listitem>
         <para><function>static int xxx_isa_shutdown (device_t
           dev);</function> Execute shutdown of the device before
           system shutdown.</para>
       </listitem>
 
       <listitem>
 	<para><function>static int xxx_isa_suspend (device_t
           dev);</function> Suspend the device before the system goes
           to the power-save state. May also abort transition to the
           power-save state.</para>
       </listitem>
 
       <listitem>
 	<para><function>static int xxx_isa_resume (device_t
  	  dev);</function> Resume the device activity after return
  	  from power-save state.</para>
       </listitem>
 
     </itemizedlist>
 
     <para><function>xxx_isa_probe()</function> and
       <function>xxx_isa_attach()</function> are mandatory; the rest of
       the routines are optional, depending on the device's
       needs.</para>
 
     <para>The driver is linked to the system with the following set of
       descriptions.</para>
 
 <programlisting>    /* table of supported bus methods */
     static device_method_t xxx_isa_methods[] = {
         /* list all the bus method functions supported by the driver */
         /* omit the unsupported methods */
         DEVMETHOD(device_identify,  xxx_isa_identify),
         DEVMETHOD(device_probe,     xxx_isa_probe),
         DEVMETHOD(device_attach,    xxx_isa_attach),
         DEVMETHOD(device_detach,    xxx_isa_detach),
         DEVMETHOD(device_shutdown,  xxx_isa_shutdown),
         DEVMETHOD(device_suspend,   xxx_isa_suspend),
         DEVMETHOD(device_resume,    xxx_isa_resume),
 
         DEVMETHOD_END
     };
 
     static driver_t xxx_isa_driver = {
         "xxx",
         xxx_isa_methods,
         sizeof(struct xxx_softc),
     };
 
 
     static devclass_t xxx_devclass;
 
     DRIVER_MODULE(xxx, isa, xxx_isa_driver, xxx_devclass,
         load_function, load_argument);</programlisting>
 
       <indexterm><primary>softc</primary></indexterm>
 
       <para>Here struct <varname remap="structname">xxx_softc</varname> is a
         device-specific structure that contains private driver data
         and descriptors for the driver's resources.  The bus code
         automatically allocates one softc descriptor per device as
         needed.</para>
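 
       <para>A softc for the hypothetical <quote>xxx</quote> driver
         might look like the following sketch.  All field names are
         assumptions chosen for illustration; a real driver keeps
         whatever state its hardware requires.</para>
 
 <programlisting>    struct xxx_softc {
         device_t         dev;        /* back pointer to the device */
         struct resource *port_res;   /* allocated I/O port range */
         int              port_rid;   /* resource ID of the port range */
         struct resource *irq_res;    /* allocated interrupt */
         int              irq_rid;    /* resource ID of the interrupt */
     };</programlisting>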
 
       <indexterm><primary>kernel module</primary></indexterm>
 
       <para>If the driver is implemented as a loadable module then
         <function>load_function()</function> is called to do
         driver-specific initialization or clean-up when the driver is
         loaded or unloaded and load_argument is passed as one of its
         arguments.  If the driver does not support dynamic loading (in
         other words it must always be linked into the kernel) then these
         values should be set to 0 and the last definition would look
         like:</para>
 
       <programlisting> DRIVER_MODULE(xxx, isa, xxx_isa_driver,
        xxx_devclass, 0, 0);</programlisting>
 
       <indexterm><primary>PnP</primary></indexterm>
 
       <para>If the driver is for a device which supports PnP then a
         table of supported PnP IDs must be defined.  The table
         consists of a list of PnP IDs supported by this driver and
         human-readable descriptions of the hardware types and models
         having these IDs. It looks like:</para>
 
 <programlisting>    static struct isa_pnp_id xxx_pnp_ids[] = {
         /* a line for each supported PnP ID */
         { 0x12345678,   "Our device model 1234A" },
         { 0x12345679,   "Our device model 1234B" },
         { 0,        NULL }, /* end of table */
     };</programlisting>
 
       <para>If the driver does not support PnP devices it still needs
         an empty PnP ID table, like:</para>
 
 <programlisting>    static struct isa_pnp_id xxx_pnp_ids[] = {
         { 0,        NULL }, /* end of table */
     };</programlisting>
 
     </sect1>
 
     <sect1 xml:id="isa-driver-device-t">
       <title><varname remap="structname">device_t</varname> Pointer</title>
 
       <para><varname remap="structname">device_t</varname> is the pointer type for
 	the device structure. Here we consider only the methods
 	interesting from the device driver writer's standpoint.  The
 	methods to manipulate values in the device structure
 	are:</para>
 
       <itemizedlist>
 
         <listitem><para><function>device_t
 	  device_get_parent(dev)</function> Get the parent bus of a
 	  device.</para></listitem>
 
         <listitem><para><function>driver_t
 	  device_get_driver(dev)</function> Get pointer to its driver
 	  structure.</para></listitem>
 
 	<listitem><para><function>char
 	  *device_get_name(dev)</function> Get the driver name, such
 	  as <literal>"xxx"</literal> for our example.</para></listitem>
 
 	<listitem><para><function>int device_get_unit(dev)</function>
 	  Get the unit number (units are numbered from 0 for the
 	  devices associated with each driver).</para></listitem>
 
 	<listitem><para><function>char
 	  *device_get_nameunit(dev)</function> Get the device name
 	  including the unit number, such as <quote>xxx0</quote>, <quote>xxx1</quote> and so
 	  on.</para></listitem>
 
 	<listitem><para><function>char
 	  *device_get_desc(dev)</function> Get the device
 	  description. Normally it describes the exact model of device
 	  in human-readable form.</para></listitem>
 
 	<listitem><para><function>device_set_desc(dev,
 	  desc)</function> Set the description. This makes the device
 	  description point to the string desc, which must not be
 	  deallocated or changed after that.</para></listitem>
 
 	<listitem><para><function>device_set_desc_copy(dev,
 	  desc)</function> Set the description. The description is
 	  copied into an internal dynamically allocated buffer, so the
 	  string desc may be changed afterwards without adverse
 	  effects.</para></listitem>
 
 	<listitem><para><function>void
 	  *device_get_softc(dev)</function> Get pointer to the device
 	  descriptor (struct <varname remap="structname">xxx_softc</varname>)
 	  associated with this device.</para></listitem>
 
 	<listitem><para><function>u_int32_t
 	  device_get_flags(dev)</function> Get the flags specified for
 	  the device in the configuration file.</para></listitem>
 
       </itemizedlist>
 
       <para>A convenience function <function>device_printf(dev, fmt,
 	...)</function> may be used to print messages from the
 	device driver. It automatically prepends the device name with
 	unit number and a colon to the message.</para>
 
       <para>The device_t methods are implemented in the file
         <filename>kern/subr_bus.c</filename>.</para>
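 
       <para>As a minimal sketch of how these accessors fit together,
         an attach routine for the hypothetical <quote>xxx</quote>
         driver could begin as follows.  The softc field
         <literal>dev</literal> is an assumption made for this
         example.</para>
 
 <programlisting>    static int
     xxx_isa_attach(device_t dev)
     {
         struct xxx_softc *sc = device_get_softc(dev);
 
         sc-&gt;dev = dev;    /* hypothetical softc field */
 
         /* printed with the device name and unit number prepended */
         device_printf(dev, "unit %d, flags 0x%x\n",
             device_get_unit(dev), device_get_flags(dev));
 
         return (0);
     }</programlisting>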
 
     </sect1>
 
     <sect1 xml:id="isa-driver-config">
       <title>Configuration File and the Order of Identifying and Probing
 	During Auto-Configuration</title>
 
       <indexterm><primary>ISA</primary><secondary>probing</secondary></indexterm>
 
       <para>The ISA devices are described in the kernel configuration file
   	like:</para>
 
       <programlisting>device xxx0 at isa? port 0x300 irq 10 drq 5
        iomem 0xd0000 flags 0x1 sensitive</programlisting>
 
       <indexterm><primary>IRQ</primary></indexterm>
 
       <para>The values of port, IRQ and so on are converted to the
 	resource values associated with the device. They are optional,
 	depending on the device's needs and abilities for
 	auto-configuration. For example, some devices do not need DRQ
 	at all and some allow the driver to read the IRQ setting from
 	the device configuration ports. If a machine has multiple ISA
 	buses the exact bus may be specified in the configuration
 	line, like <literal>isa0</literal> or <literal>isa1</literal>, otherwise the device would be
 	searched for on all the ISA buses.</para>
 
       <para><literal>sensitive</literal> is a resource requesting that this device must
 	be probed before all non-sensitive devices. It is supported
 	but does not seem to be used in any current driver.</para>
 
       <para>For legacy ISA devices in many cases the drivers are still
 	able to detect the configuration parameters. But each device
 	to be configured in the system must have a config line. If two
 	devices of some type are installed in the system but there is
 	only one configuration line for the corresponding driver, for example:
 	<programlisting>device xxx0 at isa?</programlisting> then only
 	one device will be configured.</para>
 
       <para>But for the devices supporting automatic identification by
 	the means of Plug-n-Play or some proprietary protocol one
 	configuration line is enough to configure all the devices in
 	the system, like the one above or just simply:</para>
 
       <programlisting>device xxx at isa?</programlisting>
 
       <para>If a driver supports both auto-identified and legacy
 	devices and both kinds are installed at once in one machine
 	then it is enough to describe in the config file the legacy
 	devices only. The auto-identified devices will be added
 	automatically.</para>
 
       <para>When an ISA bus is auto-configured the events happen as
   	follows:</para>
 
       <para>All the drivers' identify routines (including the PnP
 	identify routine which identifies all the PnP devices) are
 	called in random order.  As they identify the devices they add
 	them to the list on the ISA bus.  Normally the drivers'
 	identify routines associate their drivers with the new
 	devices. The PnP identify routine does not know about the
 	other drivers yet so it does not associate any with the new
 	devices it adds.</para>
 
       <para>The PnP devices are put to sleep using the PnP protocol to
         prevent them from being probed as legacy devices.</para>
 
       <para>The probe routines of non-PnP devices marked as
         <literal>sensitive</literal> are called.  If the probe for a
         device succeeds, the attach routine is called for it.</para>
 
       <para>The probe and attach routines of all non-PnP devices are
         called likewise.</para>
 
       <para>The PnP devices are brought back from the sleep state and
         assigned the resources they request: I/O and memory address
         ranges, IRQs and DRQs, all of them not conflicting with the
         attached legacy devices.</para>
 
       <para>Then for each PnP device the probe routines of all the
         present ISA drivers are called. The first one that claims the
         device gets attached.  It is possible that multiple drivers
         would claim the device with different priority; in this case, the
         highest-priority driver wins.  The probe routines must call
         <function>ISA_PNP_PROBE()</function> to compare the actual PnP
         ID with the list of the IDs supported by the driver and if the
         ID is not in the table return failure. That means that
         absolutely every driver, even the ones not supporting any PnP
         devices must call <function>ISA_PNP_PROBE()</function>, at
         least with an empty PnP ID table to return failure on unknown
         PnP devices.</para>
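 
       <para>The start of such a probe routine can be sketched as
         follows.  This sketch assumes that
         <function>ISA_PNP_PROBE()</function> returns
         <literal>ENXIO</literal> for a PnP device whose ID is not in
         the table, and that <literal>xxx_pnp_ids</literal> is the
         table shown earlier; the device-specific checks are
         omitted.</para>
 
 <programlisting>    static int
     xxx_isa_probe(device_t dev)
     {
         int error;
 
         /* compare the PnP ID of the device with our table */
         error = ISA_PNP_PROBE(device_get_parent(dev), dev, xxx_pnp_ids);
         if (error == ENXIO)
             return (ENXIO);    /* a PnP device, but not one of ours */
 
         /* device-specific checks for a supported device go here */
 
         return (0);
     }</programlisting>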
 
       <para>The probe routine returns a positive value (the error
         code) on error, zero or negative value on success.</para>
 
       <para>The negative return values are used when a PnP device
         supports multiple interfaces. For example, an older
         compatibility interface and a newer advanced interface which
         are supported by different drivers. Then both drivers would
         detect the device. The driver which returns a higher value in
         the probe routine takes precedence (in other words, the driver
         returning 0 has highest precedence, returning -1 is next,
         returning -2 is after it and so on). As a result, the devices
         which support only the old interface will be handled by the
         old driver (which should return -1 from the probe routine)
         while the devices supporting the new interface as well will be
         handled by the new driver (which should return 0 from the
         probe routine). If multiple drivers return the same value then
         the one called first wins. So if a driver returns value 0 it
         may be sure that it won the priority arbitration.</para>
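 
       <para>The arbitration described above can be sketched with two
         probe routines.  The driver names
         <literal>xxxold</literal> and <literal>xxxnew</literal> are
         hypothetical, and the PnP ID and hardware checks are
         omitted.</para>
 
 <programlisting>    /* old driver: supports only the compatibility interface */
     static int
     xxxold_isa_probe(device_t dev)
     {
         /* ... PnP ID and hardware checks omitted ... */
         return (-1);    /* claim the device, but yield to a better match */
     }
 
     /* new driver: supports the advanced interface */
     static int
     xxxnew_isa_probe(device_t dev)
     {
         /* ... PnP ID and hardware checks omitted ... */
         return (0);     /* highest precedence */
     }</programlisting>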
 
       <para>The device-specific identify routines can also assign not
         a driver but a class of drivers to the device. Then all the
         drivers in the class are probed for this device, like the case
         with PnP. This feature is not implemented in any existing
         driver and is not considered further in this document.</para>
 
-      <para>Because the PnP devices are disabled when probing the
+      <para>As the PnP devices are disabled when probing the
         legacy devices they will not be attached twice (once as legacy
         and once as PnP).  But in case of device-dependent identify
         routines it is the responsibility of the driver to make sure
         that the same device will not be attached by the driver twice:
         once as legacy user-configured and once as
         auto-identified.</para>
 
       <para>Another practical consequence for the auto-identified
         devices (both PnP and device-specific) is that flags
         cannot be passed to them from the kernel configuration file.
         Such drivers must either not use the flags at all, use the
         flags from device unit 0 for all the auto-identified devices,
         or use the sysctl interface instead of flags.</para>
 
       <para>Other unusual configurations may be accommodated by
         accessing the configuration resources directly with the
         <function>resource_query_*()</function> and
         <function>resource_*_value()</function> families of
         functions. Their implementations
         are located in <filename>kern/subr_bus.c</filename>. The old IDE disk driver
         <filename>i386/isa/wd.c</filename> contains examples of such use. But the standard
         means of configuration must always be preferred. Leave parsing
         the configuration resources to the bus configuration
         code.</para>
 
     </sect1>
 
     <sect1 xml:id="isa-driver-resources">
       <title>Resources</title>
 
       <indexterm><primary>resources</primary></indexterm>
       <indexterm><primary>device driver</primary><secondary>resources</secondary></indexterm>
 
       <para>The information that a user enters into the kernel
         configuration file is processed and passed to the kernel as
         configuration resources. This information is parsed by the bus
         configuration code and transformed into a value of structure
         device_t and the bus resources associated with it. The drivers
         may access the configuration resources directly using
         functions <function>resource_*</function> for more complex cases of
         configuration. However, generally this is neither needed nor recommended,
         so this issue is not discussed further here.</para>
 
       <para>The bus resources are associated with each device. They
         are identified by type and number within the type. For the ISA
         bus the following types are defined:</para>
 
       <indexterm><primary>DMA channel</primary></indexterm>
 
       <itemizedlist>
 	<listitem>
 	  <para><emphasis>SYS_RES_IRQ</emphasis> - interrupt
 	    number</para>
 	</listitem>
 
 	<listitem>
 	  <para><emphasis>SYS_RES_DRQ</emphasis> - ISA DMA channel
 	    number</para>
 	</listitem>
 
 	<listitem>
 	  <para><emphasis>SYS_RES_MEMORY</emphasis> - range of
 	    device memory mapped into the system memory space
 	  </para>
 	</listitem>
 
 	<listitem>
 	  <para><emphasis>SYS_RES_IOPORT</emphasis> - range of
 	    device I/O registers</para>
         </listitem>
       </itemizedlist>
 
       <para>The enumeration within types starts from 0, so if a device
         has two memory regions it would have resources of type
         <literal>SYS_RES_MEMORY</literal> numbered 0 and 1.  The resource type has
         nothing to do with the C language type: all the resource
         values have the C language type <literal>unsigned long</literal> and must be
         cast as necessary. The resource numbers do not have to be
         contiguous, although for ISA they normally would be. The
         permitted resource numbers for ISA devices are:</para>
 
       <programlisting>          IRQ: 0-1
           DRQ: 0-1
           MEMORY: 0-3
           IOPORT: 0-7</programlisting>
 
       <para>All the resources are represented as ranges, with a start
         value and count.  For IRQ and DRQ resources the count would
         normally be equal to 1. The values for memory refer to the
         physical addresses.</para>
 
       <para>Three types of activities can be performed on
         resources:</para>
 
       <itemizedlist>
 	<listitem><para>set/get</para></listitem>
 	<listitem><para>allocate/release</para></listitem>
 	<listitem><para>activate/deactivate</para></listitem>
       </itemizedlist>
 
       <para>Setting sets the range used by the resource. Allocation
         reserves the requested range so that no other driver can
         reserve it, after checking that no other driver has reserved
         this range already. Activation makes the resource accessible
         to the driver by doing whatever is necessary for that (for
         example, for memory it would be mapping into the kernel
         virtual address space).</para>
 
       <para>The functions to manipulate resources are:</para>
 
       <itemizedlist>
 	<listitem>
 	  <para><function>int bus_set_resource(device_t dev, int type,
             int rid, u_long start, u_long count)</function></para>
 
           <para>Set a range for a resource. Returns 0 if successful,
             error code otherwise.  Normally, this function will
             return an error only if one of <literal>type</literal>,
             <literal>rid</literal>, <literal>start</literal> or
             <literal>count</literal> has a value that falls out of the
             permitted range.</para>
 
           <itemizedlist>
             <listitem>
               <para> dev - driver's device</para>
             </listitem>
             <listitem>
               <para> type - type of resource, SYS_RES_* </para>
             </listitem>
             <listitem>
               <para> rid - resource number (ID) within type </para>
             </listitem>
             <listitem>
               <para> start, count - resource range </para>
             </listitem>
           </itemizedlist>
         </listitem>
 
         <listitem>
           <para><function>int bus_get_resource(device_t dev, int type,
           int rid, u_long *startp, u_long *countp)</function></para>
 
           <para>Get the range of resource. Returns 0 if successful,
             error code if the resource is not defined yet.</para>
         </listitem>
 
         <listitem>
 	  <para><function>u_long bus_get_resource_start(device_t dev,
             int type, int rid)</function>
             <function>u_long bus_get_resource_count(device_t
             dev, int type, int rid)</function></para>
 
           <para>Convenience functions to get only the start or
             count. Return 0 in case of error, so if the resource start
             has 0 among the legitimate values it would be impossible
             to tell if the value is 0 or an error occurred.  Luckily,
             no ISA resources for add-on drivers may have a start value
             equal to 0.</para>
         </listitem>
 
         <listitem>
           <para><function>void bus_delete_resource(device_t dev, int
             type, int rid)</function></para>
           <para> Delete a resource, make it undefined.</para>
         </listitem>
 
         <listitem>
           <para><function>struct resource *
             bus_alloc_resource(device_t dev, int type, int *rid,
             u_long start, u_long end, u_long count, u_int
             flags)</function></para>
 
           <para>Allocate a resource as a range of count values not
             allocated by anyone else, somewhere between start and
             end.  Alas, alignment is not supported.  If the resource
             was not set yet, it is automatically created.  The special
             values of start 0 and end ~0 (all ones) mean that the
             fixed values previously set by
             <function>bus_set_resource()</function> must be used
             instead: start and count as they were set, and
             end=(start+count-1), since the range is inclusive.  In
             this case, if the resource was not defined before, an
             error is returned.  Although rid is
             passed by reference, it is not set anywhere by the resource
             allocation code of the ISA bus.  (Other buses may use a
             different approach and modify it.)</para>
         </listitem>
       </itemizedlist>
 
       <para>Flags are a bitmap; the flags of interest to the caller
         are:</para>
 
       <itemizedlist>
         <listitem>
           <para><emphasis>RF_ACTIVE</emphasis> - causes the resource
             to be automatically activated after allocation.</para>
         </listitem>
 
         <listitem>
           <para><emphasis>RF_SHAREABLE</emphasis> - resource may be
             shared at the same time by multiple drivers.</para>
         </listitem>
 
         <listitem>
           <para><emphasis>RF_TIMESHARE</emphasis> - resource may be
             time-shared by multiple drivers, i.e., allocated at the
             same time by many but activated only by one at any given
             moment of time.</para>
         </listitem>
       </itemizedlist>
 
       <para><function>bus_alloc_resource()</function> returns
         <literal>NULL</literal> on error.  The allocated values may be
         obtained from the returned handle using the
         <function>rman_get_*()</function> methods.</para>
 
       <itemizedlist>
         <listitem>
           <para><function>int bus_release_resource(device_t dev, int
             type, int rid, struct resource *r)</function></para>
 
           <para>Release the resource; r is the handle returned by
             <function>bus_alloc_resource()</function>.  Returns 0 on
             success, error code otherwise.</para>
         </listitem>
 
         <listitem>
           <para><function>int bus_activate_resource(device_t dev, int
             type, int rid, struct resource *r)</function></para>
 
           <para><function>int bus_deactivate_resource(device_t dev, int
             type, int rid, struct resource *r)</function></para>
 
           <para>Activate or deactivate a resource.  Return 0 on success,
             error code otherwise.  If the resource is time-shared and
             currently activated by another driver then
             <literal>EBUSY</literal> is returned.</para>
         </listitem>
 
         <listitem>
           <para><function>int bus_setup_intr(device_t dev, struct
             resource *r, int flags, driver_intr_t *handler, void *arg,
             void **cookiep)</function></para>
 
           <para><function>int bus_teardown_intr(device_t dev, struct
             resource *r, void *cookie)</function></para>
 
           <para>Associate or de-associate the interrupt handler with a
             device.  Return 0 on success, error code otherwise.</para>
         </listitem>
 
         <listitem>
           <para>r - the activated resource handle describing the
             IRQ</para>
 	  <para>flags - the interrupt priority level, one of:</para>
 
           <itemizedlist>
             <listitem>
               <para><function>INTR_TYPE_TTY</function> - terminals and
                 other similar character-type devices.  To mask them
                 use <function>spltty()</function>.</para>
             </listitem>
             <listitem>
               <para><function>(INTR_TYPE_TTY |
                 INTR_TYPE_FAST)</function> - terminal-type devices
                 with a small input buffer, where delayed handling
                 loses input data (such as the old-fashioned serial
                 ports).  To mask them use
                 <function>spltty()</function>.</para>
             </listitem>
             <listitem>
               <para><function>INTR_TYPE_BIO</function> - block-type
                 devices, except those on the CAM controllers. To mask
                 them use <function>splbio()</function>.</para>
             </listitem>
             <listitem>
               <para><function>INTR_TYPE_CAM</function> - CAM (Common
                 Access Method) bus controllers. To mask them use
                 <function>splcam()</function>.</para>
              </listitem>
              <listitem>
                <para><function>INTR_TYPE_NET</function> - network
                 interface controllers. To mask them use
                 <function>splimp()</function>.</para>
              </listitem>
              <listitem>
                <para><function>INTR_TYPE_MISC</function> -
                 miscellaneous devices.  There is no other way to mask
                 them than by <function>splhigh()</function> which
                 masks all interrupts.</para>
              </listitem>
           </itemizedlist>
         </listitem>
       </itemizedlist>
 
       <para>While an interrupt handler executes, all the other
         interrupts matching its priority level are masked.  The
         only exception is the MISC level, for which no other interrupts
         are masked and which is not masked by any other
         interrupt.</para>
 
       <itemizedlist>
         <listitem>
           <para><emphasis>handler</emphasis> - pointer to the handler
             function.  The type driver_intr_t is defined as
             <function>void driver_intr_t(void *)</function>.</para>
         </listitem>
         <listitem>
           <para><emphasis>arg</emphasis> - the argument passed to the
             handler to identify this particular device.  It is cast
             from void* to any real type by the handler.  The old
             convention for the ISA interrupt handlers was to use the
             unit number as the argument; the new (recommended)
             convention is to use a pointer to the device softc
             structure.</para>
         </listitem>
         <listitem>
           <para><emphasis>cookie[p]</emphasis> - the value received
             from <function>setup()</function> is used to identify the
             handler when passed to
             <function>teardown()</function>.</para>
         </listitem>
       </itemizedlist>
 
       <para>A number of methods are defined to operate on the resource
         handles (struct resource *).  Those of interest to the device
         driver writers are:</para>
 
       <itemizedlist>
         <listitem>
           <para><function>u_long rman_get_start(r)</function>,
             <function>u_long rman_get_end(r)</function> - get the
             start and end of the allocated resource range.</para>
         </listitem>
         <listitem>
           <para><function>void *rman_get_virtual(r)</function> - get
             the virtual address of an activated memory resource.</para>
         </listitem>
       </itemizedlist>
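 
       <para>As a rough illustration, the calls described above might
         be combined in an attach routine along the following lines.
         This is only a sketch that follows the prototypes exactly as
         listed in this section (other &os; releases may declare them
         differently).  The <literal>xxx_</literal> names and the
         softc layout are hypothetical, and error handling is kept
         minimal.</para>
 
       <programlisting>#include &lt;sys/param.h&gt;
 #include &lt;sys/systm.h&gt;
 #include &lt;sys/bus.h&gt;
 #include &lt;machine/bus.h&gt;
 #include &lt;machine/resource.h&gt;
 #include &lt;sys/rman.h&gt;
 
 /* Hypothetical per-device state; all xxx_ names are illustrative. */
 struct xxx_softc {
     struct resource *port_res;     /* I/O port range */
     struct resource *irq_res;      /* interrupt line */
     int              port_rid;
     int              irq_rid;
     void            *intr_cookie;  /* kept for bus_teardown_intr() */
 };
 
 static void
 xxx_intr(void *arg)
 {
     struct xxx_softc *sc = arg;    /* softc pointer, as recommended */
 
     /* ... service the device described by sc ... */
     (void)sc;
 }
 
 static int
 xxx_attach(device_t dev)
 {
     struct xxx_softc *sc = device_get_softc(dev);
     int error;
 
     /* start 0 and end ~0: use the range set by bus_set_resource() */
     sc-&gt;port_rid = 0;
     sc-&gt;port_res = bus_alloc_resource(dev, SYS_RES_IOPORT,
         &amp;sc-&gt;port_rid, 0, ~0, 0, RF_ACTIVE);
     if (sc-&gt;port_res == NULL)
         return (ENXIO);
 
     sc-&gt;irq_rid = 0;
     sc-&gt;irq_res = bus_alloc_resource(dev, SYS_RES_IRQ,
         &amp;sc-&gt;irq_rid, 0, ~0, 0, RF_ACTIVE);
     if (sc-&gt;irq_res == NULL) {
         error = ENXIO;
         goto fail_port;
     }
 
     /* The softc pointer is handed back to xxx_intr() as its argument. */
     error = bus_setup_intr(dev, sc-&gt;irq_res, INTR_TYPE_MISC,
         xxx_intr, sc, &amp;sc-&gt;intr_cookie);
     if (error != 0)
         goto fail_irq;
 
     device_printf(dev, "ports 0x%lx-0x%lx\n",
         rman_get_start(sc-&gt;port_res), rman_get_end(sc-&gt;port_res));
     return (0);
 
 fail_irq:
     bus_release_resource(dev, SYS_RES_IRQ, sc-&gt;irq_rid, sc-&gt;irq_res);
 fail_port:
     bus_release_resource(dev, SYS_RES_IOPORT, sc-&gt;port_rid, sc-&gt;port_res);
     return (error);
 }</programlisting>
 
       <para>A matching detach routine would call
         <function>bus_teardown_intr()</function> with the saved cookie
         before releasing the two resources.</para>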
 
     </sect1>
 
     <sect1 xml:id="isa-driver-busmem">
       <title>Bus Memory Mapping</title>
 
       <para>In many cases data is exchanged between the driver and the
         device through memory.  Two variants are possible:</para>
 
       <para>(a) memory is located on the device card</para>
       <para>(b) memory is the main memory of the computer</para>
 
       <para>In case (a) the driver always copies the data back and
         forth between the on-card memory and the main memory as
         necessary. To map the on-card memory into the kernel virtual
         address space the physical address and length of the on-card
         memory must be defined as a <literal>SYS_RES_MEMORY</literal> resource. That
         resource can then be allocated and activated, and its virtual
         address obtained using
         <function>rman_get_virtual()</function>.  The older drivers
         used the function <function>pmap_mapdev()</function> for this
         purpose, which should not be used directly any more. Now it is
         one of the internal steps of resource activation.</para>
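 
       <para>The steps for case (a) could be sketched as follows.
         The address and size of the memory window, the
         <literal>xxx_</literal> names and the small helper structure
         are assumptions made for this example only; the calls follow
         the prototypes given in the previous section.</para>
 
       <programlisting>#include &lt;sys/param.h&gt;
 #include &lt;sys/systm.h&gt;
 #include &lt;sys/bus.h&gt;
 #include &lt;machine/bus.h&gt;
 #include &lt;machine/resource.h&gt;
 #include &lt;sys/rman.h&gt;
 
 /* Hypothetical values: a 16KB window of on-card memory at 0xd0000. */
 #define XXX_MEM_START  0xd0000ul
 #define XXX_MEM_SIZE   0x4000ul
 
 struct xxx_mem {
     struct resource *res;  /* handle from bus_alloc_resource() */
     int              rid;
     void            *va;   /* kernel virtual address of the card memory */
 };
 
 static int
 xxx_map_card_memory(device_t dev, struct xxx_mem *m)
 {
     int error;
 
     /* Define the physical range as memory resource number 0. */
     error = bus_set_resource(dev, SYS_RES_MEMORY, 0,
         XXX_MEM_START, XXX_MEM_SIZE);
     if (error != 0)
         return (error);
 
     /* Allocate and activate it; activation performs the mapping. */
     m-&gt;rid = 0;
     m-&gt;res = bus_alloc_resource(dev, SYS_RES_MEMORY, &amp;m-&gt;rid,
         0, ~0, 0, RF_ACTIVE);
     if (m-&gt;res == NULL)
         return (ENXIO);
 
     m-&gt;va = rman_get_virtual(m-&gt;res);
     return (0);
 }</programlisting>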
 
       <para>Most ISA cards have their memory configured at a
         physical location somewhere in the range 640KB-1MB.  Some
         ISA cards require larger memory ranges, which should be
         placed somewhere under 16MB (because of the 24-bit address
         limitation on the ISA bus).  In that case, if the machine has
         more memory than the start address of the device memory (in
         other words, they overlap), a memory hole must be configured at
         the address range used by the devices.  Many BIOSes allow
         configuration of a memory hole of 1MB starting at 14MB or
         15MB.  FreeBSD can handle the memory holes properly if the BIOS
         reports them properly (this feature may be broken on old BIOSes).</para>
 
       <para>In case (b) just the address of the data is sent to
         the device, and the device uses DMA to actually access the
         data in the main memory.  Two limitations are present.  First,
         ISA cards can only access memory below 16MB.  Second,
         pages that are contiguous in virtual address space may not be
         contiguous in physical address space, so the device may have
         to do scatter/gather operations.  The bus subsystem provides
         ready solutions for some of these problems; the rest has to be
         done by the drivers themselves.</para>
 
       <para>Two structures are used for DMA memory allocation:
         <varname>bus_dma_tag_t</varname> and
         <varname>bus_dmamap_t</varname>.  A tag describes the
         properties required for the DMA memory.  A map represents a
         memory block allocated according to these properties.
         Multiple maps may be associated with the same tag.</para>
 
       <para>Tags are organized into a tree-like hierarchy with
         inheritance of the properties.  A child tag inherits all the
         requirements of its parent tag, and may make them stricter
         but never looser.</para>
 
       <para>Normally one top-level tag (with no parent) is created for
         each device unit.  If multiple memory areas with different
         requirements are needed for each device then a tag for each of
         them may be created as a child of the parent tag.</para>
 
       <para>The tags can be used to create a map in two ways.</para>
 
       <para>First, a chunk of contiguous memory conformant with the
         tag requirements may be allocated (and later may be
         freed). This is normally used to allocate relatively
         long-living areas of memory for communication with the
         device. Loading of such memory into a map is trivial: it is
         always considered as one chunk in the appropriate physical
         memory range.</para>
 
       <para>Second, an arbitrary area of virtual memory may be loaded
         into a map.  Each page of this memory is checked for
         conformance to the tag requirements.  If it conforms, it is
         left at its original location.  If it does not, a fresh
         conformant <quote>bounce page</quote> is allocated and used
         as intermediate storage.  When writing, the data from the
         non-conformant original pages is copied to their bounce pages
         first and then transferred from the bounce pages to the
         device.  When reading, the data goes from the device to the
         bounce pages and is then copied to the non-conformant
         original pages.  The process of copying between the original
         and bounce pages is called synchronization.  This is normally
         used on a per-transfer basis: the buffer for each transfer is
         loaded, the transfer is done, and the buffer is
         unloaded.</para>
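 
       <para>The per-transfer cycle might look roughly like the
         following sketch of a write to the device.  It uses the
         functions listed later in this section, with the prototypes
         as given there.  The <literal>xxx_</literal> names, the softc
         layout and the two hardware helper functions are
         hypothetical, and the tag and map are assumed to have been
         created earlier.</para>
 
       <programlisting>#include &lt;sys/param.h&gt;
 #include &lt;sys/systm.h&gt;
 #include &lt;sys/bus.h&gt;
 #include &lt;machine/bus.h&gt;
 
 struct xxx_softc {
     bus_dma_tag_t dmat;   /* tag created earlier */
     bus_dmamap_t  map;    /* map from bus_dmamap_create() */
 };
 
 /* Hypothetical hardware helpers, not part of any real API. */
 void xxx_hw_add_segment(struct xxx_softc *, bus_addr_t, bus_size_t);
 void xxx_hw_start_write(struct xxx_softc *);
 
 /* Runs once the buffer is loaded, possibly later than the load call. */
 static void
 xxx_write_loaded(void *arg, bus_dma_segment_t *seg, int nseg, int error)
 {
     struct xxx_softc *sc = arg;
     int i;
 
     if (error != 0)
         return;   /* e.g. EFBIG; a real driver would report this */
 
     for (i = 0; i &lt; nseg; i++)
         xxx_hw_add_segment(sc, seg[i].ds_addr, seg[i].ds_len);
 
     /* Copy non-conformant pages to their bounce pages. */
     bus_dmamap_sync(sc-&gt;dmat, sc-&gt;map, BUS_DMASYNC_PREWRITE);
     xxx_hw_start_write(sc);
 }
 
 static int
 xxx_write(struct xxx_softc *sc, void *buf, bus_size_t len)
 {
     int error;
 
     error = bus_dmamap_load(sc-&gt;dmat, sc-&gt;map, buf, len,
         xxx_write_loaded, sc, 0);
     /* EINPROGRESS means the callback was queued, not that it failed. */
     return (error == EINPROGRESS ? 0 : error);
 }
 
 /* Called from the interrupt handler when the device reports completion. */
 static void
 xxx_write_done(struct xxx_softc *sc)
 {
     bus_dmamap_sync(sc-&gt;dmat, sc-&gt;map, BUS_DMASYNC_POSTWRITE);
     bus_dmamap_unload(sc-&gt;dmat, sc-&gt;map);
 }</programlisting>
 
       <para>Starting the transfer from inside the callback keeps the
         immediate and the queued cases on the same code path.</para>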
 
       <para>The functions working on the DMA memory are:</para>
 
       <itemizedlist>
         <listitem>
         <para><function>int bus_dma_tag_create(bus_dma_tag_t parent,
           bus_size_t alignment, bus_size_t boundary, bus_addr_t
           lowaddr, bus_addr_t highaddr, bus_dma_filter_t *filter, void
           *filterarg, bus_size_t maxsize, int nsegments, bus_size_t
           maxsegsz, int flags, bus_dma_tag_t *dmat)</function></para>
 
         <para>Create a new tag. Returns 0 on success, the error code
           otherwise.</para>
 
         <itemizedlist>
 	  <listitem>
             <para><emphasis>parent</emphasis> - parent tag, or NULL to
               create a top-level tag.</para>
 	  </listitem>
 
 	  <listitem>
             <para><emphasis>alignment</emphasis> -
               required physical alignment of the memory area to be
               allocated for this tag. Use value 1 for <quote>no specific
               alignment</quote>. Applies only to the future
               <function>bus_dmamem_alloc()</function> but not
               <function>bus_dmamap_create()</function> calls.</para>
 	  </listitem>
 
 	  <listitem>
               <para><emphasis>boundary</emphasis> - physical address
               boundary that must not be crossed when allocating the
               memory. Use value 0 for <quote>no boundary</quote>. Applies only to
               the future <function>bus_dmamem_alloc()</function> but
               not <function>bus_dmamap_create()</function> calls.
               Must be a power of 2.  If the memory is planned to be used
               in non-cascaded DMA mode (i.e., the DMA addresses will be
               supplied not by the device itself but by the ISA DMA
               controller) then the boundary must be no larger than
               64KB (64*1024) due to the limitations of the DMA
               hardware.</para>
           </listitem>
 
           <listitem>
             <para><emphasis>lowaddr, highaddr</emphasis> - the names
               are slightly misleading; these values are used to limit
               the permitted range of physical addresses used to
               allocate the memory.  The exact meaning varies depending
               on the planned future use:</para>
 
             <itemizedlist>
               <listitem>
                 <para>For <function>bus_dmamem_alloc()</function> all
                   the addresses from 0 to lowaddr-1 are considered
                   permitted, the higher ones are forbidden.</para>
               </listitem>
 
               <listitem>
                 <para>For <function>bus_dmamap_create()</function> all
                   the addresses outside the inclusive range [lowaddr;
                   highaddr] are considered accessible. The addresses
                   of pages inside the range are passed to the filter
                   function which decides if they are accessible. If no
                   filter function is supplied then the whole range is
                   considered inaccessible.</para>
               </listitem>
 
               <listitem>
                 <para>For the ISA devices the normal values (with no
                   filter function) are:</para>
                 <para>lowaddr = BUS_SPACE_MAXADDR_24BIT</para>
                 <para>highaddr = BUS_SPACE_MAXADDR</para>
               </listitem>
             </itemizedlist>
 
           </listitem>
 
           <listitem>
             <para><emphasis>filter, filterarg</emphasis> - the filter
               function and its argument. If NULL is passed for filter
               then the whole range [lowaddr; highaddr] is considered
               inaccessible when doing
               <function>bus_dmamap_create()</function>.  Otherwise the
               physical address of each attempted page in range
               [lowaddr; highaddr] is passed to the filter function
               which decides if it is accessible. The prototype of the
               filter function is: <function>int filterfunc(void *arg,
               bus_addr_t paddr)</function>. It must return 0 if the
               page is accessible, non-zero otherwise.</para>
           </listitem>
 
 	  <listitem>
             <para><emphasis>maxsize</emphasis> - the maximal size of
               memory (in bytes) that may be allocated through this
               tag. In case it is difficult to estimate or could be
               arbitrarily big, the value for ISA devices would be
               <literal>BUS_SPACE_MAXSIZE_24BIT</literal>.</para>
           </listitem>
 
 	  <listitem>
             <para><emphasis>nsegments</emphasis> - maximal number of
               scatter-gather segments supported by the device.  If
               unrestricted then the value
               <literal>BUS_SPACE_UNRESTRICTED</literal> should be
               used.  This value is recommended for the parent
               tags; the actual restrictions would then be specified
               for the descendant tags.  Tags with nsegments equal to
               <literal>BUS_SPACE_UNRESTRICTED</literal> may not be
               used to actually load maps; they may be used only as
               parent tags.  The practical limit for nsegments seems to
               be about 250-300; higher values will cause kernel stack
               overflow (the hardware cannot normally support that many
               scatter-gather buffers anyway).</para>
           </listitem>
 
 	  <listitem>
             <para><emphasis>maxsegsz</emphasis> - maximal size of a
               scatter-gather segment supported by the device.  The
               maximal value for an ISA device would be
               <literal>BUS_SPACE_MAXSIZE_24BIT</literal>.</para>
           </listitem>
 
 	  <listitem>
             <para><emphasis>flags</emphasis> - a bitmap of flags. The
               only interesting flags are:</para>
 
 	    <itemizedlist>
 	      <listitem>
                 <para><emphasis>BUS_DMA_ALLOCNOW</emphasis> - requests
                   to allocate all the potentially needed bounce pages
                   when creating the tag.</para>
               </listitem>
 
 	      <listitem>
 	        <para><emphasis>BUS_DMA_ISA</emphasis> - mysterious
                   flag used only on Alpha machines. It is not defined
                   for the i386 machines.  Probably it should be used
                   by all the ISA drivers for Alpha machines but it
                   looks like there are no such drivers yet.</para>
               </listitem>
 	    </itemizedlist>
 	  </listitem>
 
           <listitem>
             <para><emphasis>dmat</emphasis> - pointer to the storage
               for the new tag to be returned.</para>
           </listitem>
 
 	</itemizedlist>
 
       </listitem>
 
       <listitem> <!-- Second entry in list alpha -->
         <para><function>int bus_dma_tag_destroy(bus_dma_tag_t
 	  dmat)</function></para>
 
         <para>Destroy a tag. Returns 0 on success, the error code
 	  otherwise.</para>
 
         <para>dmat - the tag to be destroyed.</para>
 
       </listitem>
 
       <listitem> <!-- Third entry in list alpha -->
         <para><function>int bus_dmamem_alloc(bus_dma_tag_t dmat,
           void** vaddr, int flags, bus_dmamap_t
           *mapp)</function></para>
 
         <para>Allocate an area of contiguous memory described by the
           tag.  The size of the memory to be allocated is the tag's
           maxsize.  Returns 0 on success, the error code otherwise.
           The result still has to be loaded by
           <function>bus_dmamap_load()</function> before being used to get
           the physical address of the memory.</para>
 
             <itemizedlist>
               <listitem>
                 <para>
                   <emphasis>dmat</emphasis> - the tag
                 </para>
               </listitem>
               <listitem>
                 <para>
                   <emphasis>vaddr</emphasis> - pointer to the storage
                   for the kernel virtual address of the allocated area
                   to be returned.
                  </para>
               </listitem>
               <listitem>
                 <para>
                   <emphasis>flags</emphasis> - a bitmap of flags. The only interesting flag is:
                 </para>
                 <itemizedlist>
                   <listitem>
                     <para>
                       <emphasis>BUS_DMA_NOWAIT</emphasis> - if the
                       memory is not immediately available return the
                       error. If this flag is not set then the routine
                       is allowed to sleep until the memory
                       becomes available.
                     </para>
                   </listitem>
                 </itemizedlist>
               </listitem>
               <listitem>
                 <para>
                   <emphasis>mapp</emphasis> - pointer to the storage
                   for the new map to be returned.
                 </para>
               </listitem>
             </itemizedlist>
           </listitem>
 
           <listitem> <!-- Fourth entry in list alpha -->
             <para>
               <function>void bus_dmamem_free(bus_dma_tag_t dmat, void
               *vaddr, bus_dmamap_t map)</function>
             </para>
             <para>
               Free the memory allocated by
               <function>bus_dmamem_alloc()</function>. At present,
               freeing of the memory allocated with ISA restrictions is
-              not implemented.  Because of this the recommended model
+              not implemented.  Due to this the recommended model
               of use is to keep and re-use the allocated areas for as
               long as possible. Do not lightly free some area and then
               shortly allocate it again. That does not mean that
               <function>bus_dmamem_free()</function> should not be
               used at all: hopefully it will be properly implemented
               soon.
             </para>
 
             <itemizedlist>
               <listitem>
                 <para><emphasis>dmat</emphasis> - the tag
                 </para>
               </listitem>
               <listitem>
                 <para>
                   <emphasis>vaddr</emphasis> - the kernel virtual
                   address of the memory
                 </para>
               </listitem>
               <listitem>
                 <para>
                   <emphasis>map</emphasis> - the map of the memory (as
                   returned from
                   <function>bus_dmamem_alloc()</function>)
                 </para>
               </listitem>
             </itemizedlist>
           </listitem>
 
           <listitem> <!-- The fifth entry in list alpha -->
             <para>
               <function>int bus_dmamap_create(bus_dma_tag_t dmat, int
               flags, bus_dmamap_t *mapp)</function>
             </para>
             <para>
               Create a map for the tag, to be used in
               <function>bus_dmamap_load()</function> later.  Returns 0
               on success, the error code otherwise.
             </para>
             <itemizedlist>
               <listitem>
                 <para>
                   <emphasis>dmat</emphasis> - the tag
                 </para>
               </listitem>
               <listitem>
                 <para>
                   <emphasis>flags</emphasis> - theoretically, a bitmap
                   of flags.  No flags are defined yet, so at present
                   it will always be 0.
                 </para>
               </listitem>
               <listitem>
                 <para>
                   <emphasis>mapp</emphasis> - pointer to the storage
                   for the new map to be returned
                 </para>
               </listitem>
             </itemizedlist>
           </listitem>
 
           <listitem> <!-- Sixth entry in the alpha list -->
             <para>
               <function>int bus_dmamap_destroy(bus_dma_tag_t dmat,
               bus_dmamap_t map)</function>
             </para>
             <para>
               Destroy a map. Returns 0 on success, the error code otherwise.
             </para>
 
             <itemizedlist>
               <listitem>
                 <para>
                   <emphasis>dmat</emphasis> - the tag with which the
                   map is associated
                 </para>
               </listitem>
               <listitem>
                 <para>
                   <emphasis>map</emphasis> - the map to be destroyed
                 </para>
               </listitem>
             </itemizedlist>
           </listitem>
 
           <listitem> <!-- Seventh entry in list alpha -->
             <para>
               <function>int bus_dmamap_load(bus_dma_tag_t dmat,
               bus_dmamap_t map, void *buf, bus_size_t buflen,
               bus_dmamap_callback_t *callback, void *callback_arg, int
               flags)</function>
             </para>
             <para>
               Load a buffer into the map (the map must be previously
               created by <function>bus_dmamap_create()</function> or
               <function>bus_dmamem_alloc()</function>).  All the pages
               of the buffer are checked for conformance to the tag
               requirements, and bounce pages are allocated for those
               that do not conform.  An array of physical segment
               descriptors is built and passed to the callback
               routine, which is then expected to
               handle it in some way.  The number of bounce buffers in
               the system is limited, so if bounce buffers are
               needed but not immediately available, the request is
               queued and the callback is called when the bounce
               buffers become available.  Returns 0 if the callback
               was executed immediately or <errorname>EINPROGRESS</errorname> if the request
               was queued for future execution.  In the latter case the
               synchronization with the queued callback routine is the
               responsibility of the driver.
             </para>
             <!--<blockquote>-->
             <itemizedlist>
               <listitem>
                 <para>
                   <emphasis>dmat</emphasis> - the tag
                 </para>
               </listitem>
               <listitem>
                 <para>
                   <emphasis>map</emphasis> - the map
                 </para>
               </listitem>
               <listitem>
                 <para>
                   <emphasis>buf</emphasis> - kernel virtual address of
                   the buffer
                 </para>
               </listitem>
               <listitem>
                 <para>
                   <emphasis>buflen</emphasis> - length of the buffer
                 </para>
               </listitem>
               <listitem>
                 <para>
                   <emphasis>callback, callback_arg</emphasis> - the
                   callback function and its argument
                 </para>
               </listitem>
             </itemizedlist>
             <!--</blockquote>-->
             <para>
               The prototype of the callback function is:
             </para>
             <para>
               <function>void callback(void *arg, bus_dma_segment_t
               *seg, int nseg, int error)</function>
             </para>
             <!--     <blockquote> -->
             <itemizedlist>
               <listitem>
                 <para>
                   <emphasis>arg</emphasis> - the same as callback_arg
                   passed to <function>bus_dmamap_load()</function>
                 </para>
               </listitem>
               <listitem>
                 <para>
                   <emphasis>seg</emphasis> - array of the segment
                   descriptors
                 </para>
               </listitem>
               <listitem>
                 <para>
                   <emphasis>nseg</emphasis> - number of descriptors in
                   array
                 </para>
               </listitem>
               <listitem>
                 <para>
                   <emphasis>error</emphasis> - indication of the
                   segment number overflow: if it is set to <errorname>EFBIG</errorname> then
                   the buffer did not fit into the maximal number of
                   segments permitted by the tag. In this case only the
                   permitted number of descriptors will be in the
                   array. Handling of this situation is up to the
                   driver: depending on the desired semantics it can
                   either consider this an error or split the buffer in
                   two and handle the second part separately
                 </para>
               </listitem>
             </itemizedlist>
             <!--     </blockquote>  -->
             <para>
               Each entry in the segments array contains the fields:
             </para>
 
             <!--   <blockquote> -->
             <itemizedlist>
               <listitem>
                 <para>
                   <emphasis>ds_addr</emphasis> - physical bus address
                   of the segment
                 </para>
               </listitem>
               <listitem>
                 <para>
                   <emphasis>ds_len</emphasis> - length of the segment
                 </para>
               </listitem>
             </itemizedlist>
             <!--   </blockquote>-->
           </listitem>
 
           <listitem> <!-- Eighth entry in alpha list -->
             <para>
               <function>void bus_dmamap_unload(bus_dma_tag_t dmat,
               bus_dmamap_t map)</function>
             </para>
             <para>Unload the map.
             </para>
             <!--  <blockquote>  -->
             <itemizedlist>
               <listitem>
                 <para>
                   <emphasis>dmat</emphasis> - tag
                 </para>
               </listitem>
               <listitem>
                 <para>
                   <emphasis>map</emphasis> - loaded map
                 </para>
               </listitem>
             </itemizedlist>
             <!--  </blockquote>  -->
           </listitem>
 
           <listitem> <!-- Ninth entry list alpha -->
             <para>
               <function>void bus_dmamap_sync (bus_dma_tag_t dmat,
               bus_dmamap_t map, bus_dmasync_op_t op)</function>
             </para>
             <para>
               Synchronize a loaded buffer with its bounce pages before
               and after the physical transfer to or from the device.
               This is the function that does all the necessary copying
               of data between the original buffer and its mapped
               version.  The buffers must be synchronized both before
               and after doing the transfer.
             </para>
             <!--  <blockquote> -->
             <itemizedlist>
               <listitem>
                 <para>
                   <emphasis>dmat</emphasis> - tag
                 </para>
               </listitem>
               <listitem>
                 <para>
                   <emphasis>map</emphasis> - loaded map
                 </para>
               </listitem>
               <listitem>
                 <para>
                   <emphasis>op</emphasis> - type of synchronization
                   operation to perform:
                 </para>
               </listitem>
             </itemizedlist>
             <!-- <blockquote> -->
             <itemizedlist>
               <listitem>
                 <para>
                   <function>BUS_DMASYNC_PREREAD</function> - before
                   reading from device into buffer
                 </para>
               </listitem>
               <listitem>
                 <para>
                   <function>BUS_DMASYNC_POSTREAD</function> - after
                   reading from device into buffer
                 </para>
               </listitem>
               <listitem>
                 <para>
                   <function>BUS_DMASYNC_PREWRITE</function> - before
                   writing the buffer to device
                 </para>
               </listitem>
               <listitem>
                 <para>
                   <function>BUS_DMASYNC_POSTWRITE</function> - after
                   writing the buffer to device
                 </para>
               </listitem>
             </itemizedlist>
           </listitem>
         </itemizedlist>   <!-- End of list alpha -->
 <!-- </blockquote>
 </blockquote> -->
 
         <para>
           As of now PREREAD and POSTWRITE are null operations but that
           may change in the future, so they must not be ignored in the
           driver. Synchronization is not needed for the memory
           obtained from <function>bus_dmamem_alloc()</function>.
         </para>
         <para>
           Before calling the callback function from
           <function>bus_dmamap_load()</function> the segment array is
           stored on the stack, where it is pre-allocated for the
-          maximal number of segments allowed by the tag. Because of
+          maximal number of segments allowed by the tag. As a result of
           this the practical limit for the number of segments on i386
           architecture is about 250-300 (the kernel stack is 4KB minus
           the size of the user structure, size of a segment array
-          entry is 8 bytes, and some space must be left). Because the
+          entry is 8 bytes, and some space must be left). Since the
           array is allocated based on the maximal number, this value
           must not be set higher than really needed. Fortunately, for
           most hardware the maximal supported number of segments is
           much lower. But if the driver wants to handle buffers with a
           very large number of scatter-gather segments it should do
           that in portions: load part of the buffer, transfer it to
           the device, load the next part of the buffer, and so on.
         </para>
         <para>
           Another practical consequence is that the number of segments
           may limit the size of the buffer. If all the pages in the
           buffer happen to be physically non-contiguous then the
           maximal supported buffer size for that fragmented case would
           be (nsegments * page_size). For example, if a maximal number
           of 10 segments is supported then on i386, with its 4KB pages,
           the maximal guaranteed supported buffer size would be 40KB. If
           a larger size is desired then special tricks should be used in
           the driver.
         </para>
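         <para>
           The worst-case arithmetic can be illustrated with a small
           stand-alone C sketch. It is not kernel code and does not use
           the bus_dma interface; the helper names
           <function>max_guaranteed_buffer()</function> and
           <function>portions_needed()</function> are hypothetical and
           exist only for this illustration. The sketch assumes that
           the buffer starts on a page boundary.
         </para>

```c
#include <assert.h>
#include <stddef.h>

/*
 * Worst case: every page of the buffer is physically non-contiguous,
 * so each page consumes one scatter-gather segment.  Assumes the
 * buffer starts on a page boundary.  (Hypothetical helper, not part
 * of the bus_dma interface.)
 */
static size_t
max_guaranteed_buffer(size_t nsegments, size_t page_size)
{
	assert(page_size > 0);
	return (nsegments * page_size);
}

/*
 * Number of load/transfer rounds needed when a buffer of buf_len
 * bytes is handled in portions no larger than the guaranteed size.
 */
static size_t
portions_needed(size_t buf_len, size_t nsegments, size_t page_size)
{
	size_t portion = max_guaranteed_buffer(nsegments, page_size);

	assert(portion > 0);
	return ((buf_len + portion - 1) / portion);	/* round up */
}
```

         <para>
           With 10 segments and 4096-byte pages the first helper yields
           40960 bytes, which matches the 40KB figure for i386; a
           buffer one byte larger would need two portions.
         </para>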
         <para>
           If the hardware does not support scatter-gather at all or
           the driver wants to support some buffer size even if it is
           heavily fragmented then the solution is to allocate a
           contiguous buffer in the driver and use it as intermediate
           storage if the original buffer does not fit.
         </para>
         <para>
           The typical call sequences for a map depend on how the map
           is used.  The characters -&gt; are used to show the flow of
           time.
         </para>
         <para>
           For a buffer which stays practically fixed during all the
           time between attachment and detachment of a device:</para>
         <para>
           bus_dmamem_alloc -&gt; bus_dmamap_load -&gt; ...use buffer...
           -&gt; bus_dmamap_unload -&gt; bus_dmamem_free
         </para>
 
         <para>For a buffer that changes frequently and is passed from
         outside the driver:
 
 	<!-- XXX is this correct? -->
         <programlisting>          bus_dmamap_create -&gt;
           -&gt; bus_dmamap_load -&gt; bus_dmamap_sync(PRE...) -&gt; do transfer -&gt;
           -&gt; bus_dmamap_sync(POST...) -&gt; bus_dmamap_unload -&gt;
           ...
           -&gt; bus_dmamap_load -&gt; bus_dmamap_sync(PRE...) -&gt; do transfer -&gt;
           -&gt; bus_dmamap_sync(POST...) -&gt; bus_dmamap_unload -&gt;
           -&gt; bus_dmamap_destroy        </programlisting>
 
         </para>
         <para>
           When loading a map created by
           <function>bus_dmamem_alloc()</function> the passed address
           and size of the buffer must be the same as used in
           <function>bus_dmamem_alloc()</function>. In this case it is
           guaranteed that the whole buffer will be mapped as one
           segment (so the callback may be based on this assumption)
           and the request will be executed immediately (EINPROGRESS
           will never be returned).  All the callback needs to do in
           this case is to save the physical address.
         </para>
         <para>
           A typical example would be:
         </para>
 
         <programlisting>          static void
         alloc_callback(void *arg, bus_dma_segment_t *seg, int nseg, int error)
         {
           if (error != 0)
             return; /* mapping failed, seg[] is not valid */
           *(bus_addr_t *)arg = seg[0].ds_addr;
         }
 
           ...
           int error;
           struct somedata {
             ....
           };
           struct somedata *vsomedata; /* virtual address */
           bus_addr_t psomedata; /* physical bus-relative address */
           bus_dma_tag_t tag_somedata;
           bus_dmamap_t map_somedata;
           ...
 
           error = bus_dma_tag_create(parent_tag, alignment,
            boundary, lowaddr, highaddr, /*filter*/ NULL, /*filterarg*/ NULL,
            /*maxsize*/ sizeof(struct somedata), /*nsegments*/ 1,
            /*maxsegsz*/ sizeof(struct somedata), /*flags*/ 0,
            &#38;tag_somedata);
           if (error)
              return error;
 
           error = bus_dmamem_alloc(tag_somedata, (void **)&#38;vsomedata,
              /*flags*/ 0, &#38;map_somedata);
           if (error)
              return error;
 
           error = bus_dmamap_load(tag_somedata, map_somedata, (void *)vsomedata,
              sizeof (struct somedata), alloc_callback,
              (void *) &#38;psomedata, /*flags*/ 0);
           if (error)
              return error;        </programlisting>
 
         <para>
           This looks a bit long and complicated, but that is the way to
           do it. The practical consequence is: if multiple memory areas
           are always allocated together it would be a really good idea
           to combine them all into one structure and allocate them as
           one (if the alignment and boundary limitations permit).
         </para>
         <para>
           When loading an arbitrary buffer into the map created by
           <function>bus_dmamap_create()</function> special measures
           must be taken to synchronize with the callback in case it
           would be delayed. The code would look like:
         </para>
 
         <programlisting>          {
            int s;
            int error;
 
            s = splsoftvm();
            error = bus_dmamap_load(
                dmat,
                dmamap,
                buffer_ptr,
                buffer_len,
                callback,
                /*callback_arg*/ buffer_descriptor,
                /*flags*/0);
            if (error == EINPROGRESS) {
                /*
                 * Do whatever is needed to ensure synchronization
                 * with callback. Callback is guaranteed not to be started
                 * until we do splx() or tsleep().
                 */
            }
            splx(s);
           }        </programlisting>
 
         <para>
           Two possible approaches for the processing of requests are:
         </para>
         <para>
           1. If requests are completed by marking them explicitly as
           done (such as the CAM requests) then it would be simpler to
           put all the further processing into the callback function,
           which would mark the request when it is done. Then not much
           extra synchronization is needed. For flow control reasons
           it may be a good idea to freeze the request queue until
           this request gets completed.
         </para>
         <para>
           2. If requests are completed when the function returns (such
           as classic read or write requests on character devices) then
           a synchronization flag should be set in the buffer
           descriptor and <function>tsleep()</function> called.  Later
           when the callback gets called it will do its processing and
           check this synchronization flag. If it is set then the
           callback should issue a wakeup. In this approach the
           callback function could either do all the needed processing
           (just like the previous case) or simply save the segments
           array in the buffer descriptor. Then after the callback
           completes the calling function could use this saved segment
           array and do all the processing.
 
         </para>
      </sect1>
 <!--_________________________________________________________________________-->
 <!--~~~~~~~~~~~~~~~~~~~~END OF SECTION~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-->
 
      <sect1 xml:id="isa-driver-dma">
         <title>DMA</title>
         <!-- Section Marked up by Wylie -->
 
         <indexterm><primary>Direct Memory Access (DMA)</primary></indexterm>
 
         <para>
           Direct Memory Access (DMA) is implemented in the ISA bus
           through the DMA controller (actually, two of them, but that
           is an irrelevant detail).  To make the early ISA devices
           simple and cheap, the logic of bus control and address
           generation was concentrated in the DMA controller.
           Fortunately, FreeBSD provides a set of functions that mostly
           hide the annoying details of the DMA controller from the
           device drivers.
         </para>
 
         <para>
           The simplest case is for the fairly intelligent
           devices. Like the bus master devices on PCI they can
           generate the bus cycles and memory addresses all by
           themselves. The only thing they really need from the DMA
           controller is bus arbitration, so for this purpose they
           pretend to be cascaded slave DMA controllers. The only
           thing needed from the system DMA controller is to enable the
           cascaded mode on a DMA channel by calling the following
           function when attaching the driver:
         </para>
 
         <para>
           <function>void isa_dmacascade(int channel_number)</function>
         </para>
 
         <para>
           All the further activity is done by programming the
           device. When detaching the driver no DMA-related functions
           need to be called.
         </para>
 
         <para>
           For the simpler devices things get more complicated. The
           functions used are:
         </para>
 
         <itemizedlist>
 
           <listitem>
           <para>
             <function>int isa_dma_acquire(int channel_number)</function>
           </para>
           <para>
                 Reserve a DMA channel. Returns 0 on success or EBUSY
                 if the channel was already reserved by this or a
                 different driver. Most of the ISA devices are not able
                 to share DMA channels anyway, so normally this
                 function is called when attaching a device. This
                 reservation was made redundant by the modern interface
                 of bus resources but still must be used in addition to
                 the latter. If it is not called, other DMA routines
                 will panic later.
           </para>
         </listitem>
 
         <listitem>
           <para>
             <function>int isa_dma_release(int channel_number)</function>
           </para>
           <para>
                 Release a previously reserved DMA channel. No
                 transfers must be in progress when the channel is
                 released (in addition the device must not try to
                 initiate transfer after the channel is released).
           </para>
         </listitem>
 
         <listitem>
           <para>
             <function>void isa_dmainit(int chan, u_int
             bouncebufsize)</function>
           </para>
           <para>
                 Allocate a bounce buffer for use with the specified
                 channel. The requested size of the buffer cannot exceed
                 64KB. This bounce buffer will be automatically used
                 later if a transfer buffer happens to be not
                 physically contiguous, or outside of the memory
                 accessible by the ISA bus, or crossing the 64KB
                 boundary. If the transfers will always be done from
                 buffers which conform to these conditions (such as
                 those allocated by
                 <function>bus_dmamem_alloc()</function> with proper
                 limitations) then <function>isa_dmainit()</function>
                 does not have to be called. But it is quite convenient
                 to transfer arbitrary data using the DMA controller.
                 The bounce buffer will automatically take care of the
                 scatter-gather issues.
           </para>
  <!-- <blockquote> -->
           <itemizedlist>
                 <listitem>
                   <para>
                     <emphasis>chan</emphasis> - channel number
                   </para>
                 </listitem>
                 <listitem>
                   <para>
                     <emphasis>bouncebufsize</emphasis> - size of the
                     bounce buffer in bytes
                   </para>
                 </listitem>
           </itemizedlist>
 <!-- </blockquote> -->
 <!--</para> -->
         </listitem>
 
         <listitem>
           <para>
             <function>void isa_dmastart(int flags, caddr_t addr, u_int
             nbytes, int chan)</function>
           </para>
           <para>
                 Prepare to start a DMA transfer. This function must be
                 called to set up the DMA controller before actually
                 starting the transfer on the device. It checks that the
                 buffer is contiguous and falls into the ISA memory
                 range; if not, the bounce buffer is automatically
                 used. If a bounce buffer is required but was not set up
                 by <function>isa_dmainit()</function>, or is too small
                 for the requested transfer size, then the system will
                 panic. In case of a write request with a bounce buffer
                 the data will be automatically copied to the bounce
                 buffer.
           </para>
           <itemizedlist>
             <listitem>
               <para>
                 <emphasis>flags</emphasis> - a bitmask determining the
                 type of operation to be done. The direction bits B_READ
                 and B_WRITE are mutually exclusive.
               </para>
               <!--   <blockquote>  -->
               <itemizedlist>
                 <listitem>
                   <para>
                     B_READ - read from the ISA bus into memory
                   </para>
                 </listitem>
                 <listitem>
                   <para>
                     B_WRITE - write from the memory to the ISA bus
                   </para>
                 </listitem>
                 <listitem>
                   <para>
                     B_RAW - if set then the DMA controller will
                     remember the buffer and after the end of the
                     transfer will automatically re-initialize itself to
                     repeat the transfer of the same buffer again (of
                     course, the driver may change the data in the
                     buffer before initiating another transfer in the
                     device). If not set then the parameters will work
                     only for one transfer, and
                     <function>isa_dmastart()</function> will have to be
                     called again before initiating the next
                     transfer. Using B_RAW makes sense only if the
                     bounce buffer is not used.
                   </para>
                 </listitem>
               </itemizedlist>
               <!--   </blockquote>  -->
             </listitem>
             <listitem>
               <para>
                 <emphasis>addr</emphasis> - virtual address of the
                 buffer
               </para>
             </listitem>
             <listitem>
               <para>
                 <emphasis>nbytes</emphasis> - length of the buffer. Must
                 be less than or equal to 64KB. A length of 0 is not
                 allowed: the DMA controller will understand it as 64KB
                 while the kernel code will understand it as 0, and that
                 would cause unpredictable effects. For channels number
                 4 and higher the length must be even because these
                 channels transfer 2 bytes at a time. In case of an odd
                 length the last byte will not be transferred.
               </para>
             </listitem>
             <listitem>
               <para>
                 <emphasis>chan</emphasis> - channel number
               </para>
             </listitem>
           </itemizedlist>
         </listitem>
 
         <listitem>
           <para>
             <function>void isa_dmadone(int flags, caddr_t addr, int
             nbytes, int chan)</function>
           </para>
           <para>
             Synchronize the memory after the device reports that the
             transfer is done. If that was a read operation with a
             bounce buffer then the data will be copied from the bounce
             buffer to the original buffer. Arguments are the same as
             for <function>isa_dmastart()</function>. Flag B_RAW is
             permitted but it does not affect
             <function>isa_dmadone()</function> in any way.
           </para>
         </listitem>
 
         <listitem>
           <para>
             <function>int isa_dmastatus(int channel_number)</function>
           </para>
           <para>
             Returns the number of bytes left to be transferred in the
             current transfer.  In case the flag B_READ was set in
             <function>isa_dmastart()</function> the number returned
             will never be equal to zero. At the end of transfer it
             will be automatically reset back to the length of
             buffer. The normal use is to check the number of bytes
             left after the device signals that the transfer is
             completed.  If the number of bytes is not 0 then something
             probably went wrong with that transfer.
           </para>
         </listitem>
 
         <listitem>
           <para>
             <function>int isa_dmastop(int channel_number)</function>
           </para>
           <para>
             Aborts the current transfer and returns the number of
             bytes left untransferred.
           </para>
         </listitem>
        </itemizedlist>
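         <para>
           The length and boundary rules described for
           <function>isa_dmastart()</function> can be summarized in a
           stand-alone C sketch. It is only a model of the constraints
           stated in this section, not the kernel implementation; the
           helper names <function>isa_dma_len_ok()</function> and
           <function>crosses_64k()</function> are hypothetical. The
           sketch deliberately rejects odd lengths on channels 4 and
           higher instead of silently dropping the last byte.
         </para>

```c
#include <assert.h>
#include <stdbool.h>
#include <stdint.h>

#define ISA_DMA_MAX_LEN	(64u * 1024u)	/* 64KB limit stated in the text */

/* Hypothetical helper: true if nbytes is acceptable for this channel. */
static bool
isa_dma_len_ok(int chan, uint32_t nbytes)
{
	if (nbytes == 0 || nbytes > ISA_DMA_MAX_LEN)
		return (false);
	if (chan >= 4 && (nbytes & 1) != 0)
		return (false);	/* these channels move 2 bytes at a time */
	return (true);
}

/*
 * Hypothetical helper: true if the physical range
 * [phys, phys + nbytes) crosses a 64KB boundary, which is one of
 * the conditions that forces the use of a bounce buffer.
 */
static bool
crosses_64k(uint32_t phys, uint32_t nbytes)
{
	uint64_t last;

	assert(nbytes > 0);
	last = (uint64_t)phys + nbytes - 1;	/* address of the last byte */
	return ((phys / ISA_DMA_MAX_LEN) != (last / ISA_DMA_MAX_LEN));
}
```

         <para>
           A driver could run checks of this kind on its own buffers
           before deciding whether a bounce buffer set up by
           <function>isa_dmainit()</function> is needed.
         </para>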
      </sect1>
 
      <sect1 xml:id="isa-driver-probe">
      <title>xxx_isa_probe</title>
      <!-- Section marked up by Wylie -->
 
         <para>
           This function probes if a device is present. If the driver
           supports auto-detection of some part of device configuration
           (such as interrupt vector or memory address) this
           auto-detection must be done in this routine.
         </para>
 
         <para>
           As for any other bus, if the device cannot be detected, or
           is detected but failed the self-test, or some other problem
           happened, then it returns a positive error value. The
           value <errorname>ENXIO</errorname> must be returned if the device is not
           present. Other error values may mean other conditions. Zero
           or negative values mean success. Most of the drivers return
           zero as success.
         </para>
 
         <para>
           The negative return values are used when a PnP device
           supports multiple interfaces. For example, an older
           compatibility interface and a newer advanced interface which
           are supported by different drivers. Then both drivers would
           detect the device. The driver which returns a higher value
           in the probe routine takes precedence (in other words, the
           driver returning 0 has highest precedence, one returning -1
           is next, one returning -2 is after it and so on). As a result,
           the devices which support only the old interface will be
           handled by the old driver (which should return -1 from the
           probe routine) while the devices supporting the new
           interface as well will be handled by the new driver (which
           should return 0 from the probe routine).
         </para>
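         <para>
           The precedence rule can be modeled in a few lines of
           stand-alone C. This is only an illustration of the rule as
           described here, not the bus code that actually selects a
           driver; <literal>struct probe_result</literal> and
           <function>pick_driver()</function> are hypothetical names,
           and the choice to let the first driver win a tie is an
           assumption of the sketch.
         </para>

```c
#include <assert.h>
#include <stddef.h>

/* Hypothetical record of one driver's probe result for a device. */
struct probe_result {
	const char	*driver;
	int		 value;	/* > 0: error, <= 0: success with priority */
};

/*
 * Return the index of the winning driver, or -1 if every probe
 * failed.  Among successful probes the highest value wins, so 0
 * beats -1 and -1 beats -2.  On a tie the earlier entry is kept.
 */
static int
pick_driver(const struct probe_result *res, size_t n)
{
	int best = -1;

	for (size_t i = 0; i < n; i++) {
		if (res[i].value > 0)
			continue;	/* positive values are errors */
		if (best < 0 || res[i].value > res[best].value)
			best = (int)i;
	}
	return (best);
}
```

         <para>
           For a device supporting both interfaces the new driver
           (returning 0) is picked over the old one (returning -1); if
           the new driver fails its probe the old driver still wins.
         </para>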
 
         <para>
           The device descriptor struct xxx_softc is allocated by the
           system before calling the probe routine. If the probe
           routine returns an error the descriptor will be
           automatically deallocated by the system. So if a probing
           error occurs the driver must make sure that all the
           resources it used during probe are deallocated and that
           nothing keeps the descriptor from being safely
           deallocated. If the probe completes successfully the
           descriptor will be preserved by the system and later passed
           to the routine <function>xxx_isa_attach()</function>. If a
           driver returns a negative value it cannot be sure that it
           will have the highest priority and that its attach routine
           will be called. So in this case it also must release all the
           resources before returning and if necessary allocate them
           again in the attach routine. When
           <function>xxx_isa_probe()</function> returns 0 releasing the
           resources before returning is also a good idea and a
           well-behaved driver should do so. But in cases where there is
           some problem with releasing the resources the driver is
           allowed to keep resources between returning 0 from the probe
           routine and execution of the attach routine.
         </para>
 
         <para>
           A typical probe routine starts with getting the device
           descriptor and unit:
         </para>
 
         <programlisting>         struct xxx_softc *sc = device_get_softc(dev);
           int unit = device_get_unit(dev);
           int pnperror;
           int error = 0;
 
           sc-&gt;dev = dev; /* link it back */
           sc-&gt;unit = unit;        </programlisting>
 
         <para>
           Then check for the PnP devices. The check is carried out by
           a table containing the list of PnP IDs supported by this
           driver and human-readable descriptions of the device models
           corresponding to these IDs.
         </para>
 
         <programlisting>
         pnperror = ISA_PNP_PROBE(device_get_parent(dev), dev, xxx_pnp_ids);
         if (pnperror == ENXIO)
             return ENXIO;
         </programlisting>
 
         <para>
           The logic of ISA_PNP_PROBE is the following: If this card
           (device unit) was not detected as PnP then ENOENT will be
           returned. If it was detected as PnP but its detected ID does
           not match any of the IDs in the table then ENXIO is
           returned. Finally, if it has PnP support and it matches one
           of the IDs in the table, 0 is returned and the appropriate
           description from the table is set by
           <function>device_set_desc()</function>.
         </para>
 
         <para>
           If a driver supports only PnP devices then the condition
           would look like:
         </para>
 
         <programlisting>          if(pnperror != 0)
               return pnperror;        </programlisting>
 
         <para>
           No special treatment is required for the drivers which do not
           support PnP because they pass an empty PnP ID table and will
           always get ENXIO if called on a PnP card.
         </para>
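         <para>
           The three outcomes can be made concrete with a small
           user-space model. It does not call
           <function>ISA_PNP_PROBE</function>; the type
           <literal>struct pnp_id_model</literal>, the function
           <function>pnp_probe_model()</function>, and the convention
           that a device ID of 0 means <quote>not detected as
           PnP</quote> are all assumptions made for this sketch.
         </para>

```c
#include <assert.h>
#include <errno.h>
#include <stddef.h>
#include <stdint.h>

/* Hypothetical table entry; an id of 0 terminates the table. */
struct pnp_id_model {
	uint32_t	 id;
	const char	*desc;
};

/*
 * Model of the decision described in the text:
 *   ENOENT - the card was not detected as PnP (device_id == 0 here)
 *   ENXIO  - PnP card, but its ID is not in the driver's table
 *   0      - match; *desc is set to the description from the table
 */
static int
pnp_probe_model(uint32_t device_id, const struct pnp_id_model *ids,
    const char **desc)
{
	if (device_id == 0)
		return (ENOENT);
	for (; ids != NULL && ids->id != 0; ids++) {
		if (ids->id == device_id) {
			*desc = ids->desc;
			return (0);
		}
	}
	return (ENXIO);
}
```

         <para>
           A driver without PnP support corresponds to a table that
           holds only the terminating entry, so the model returns
           <errorname>ENXIO</errorname> for every PnP card, as
           described above.
         </para>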
 
         <para>
           The probe routine normally needs at least some minimal set
           of resources, such as I/O port number to find the card and
           probe it. Depending on the hardware the driver may be able
           to discover the other necessary resources automatically. The
           PnP devices have all the resources pre-set by the PnP
           subsystem, so the driver does not need to discover them by
           itself.
         </para>
 
         <para>
           Typically the minimal information required to get access to
           the device is the I/O port number. Some devices then allow
           the rest of the information to be read from the device
           configuration registers (though not all devices do that).
           So first we try to get the port start value:
         </para>
 
         <programlisting>
           sc-&gt;port0 = bus_get_resource_start(dev, SYS_RES_IOPORT, 0 /*rid*/);
           if (sc-&gt;port0 == 0)
               return ENXIO;</programlisting>
 
         <para>
           The base port address is saved in the structure softc for
           future use.  If it will be used very often then calling the
           resource function each time would be prohibitively slow. If
           we do not get a port we just return an error.  Some device
           drivers can instead be clever and try to probe all the
           possible ports, like this:
         </para>
 
         <programlisting>
           /* table of all possible base I/O port addresses for this device */
           static struct xxx_allports {
               u_short port; /* port address */
               short used; /* flag: if this port is already used by some unit */
           } xxx_allports[] = {
               { 0x300, 0 },
               { 0x320, 0 },
               { 0x340, 0 },
               { 0, 0 } /* end of table */
           };
 
           ...
           int port, i;
           ...
 
           port =  bus_get_resource_start(dev, SYS_RES_IOPORT, 0 /*rid*/);
           if(port !=0 ) {
               for(i=0; xxx_allports[i].port!=0; i++) {
                   if(xxx_allports[i].used || xxx_allports[i].port != port)
                       continue;
 
                   /* found it */
                   xxx_allports[i].used = 1;
                   /* do probe on a known port */
                   return xxx_really_probe(dev, port);
               }
               return ENXIO; /* port is unknown or already used */
           }
 
           /* we get here only if we need to guess the port */
           for(i=0; xxx_allports[i].port!=0; i++) {
               if(xxx_allports[i].used)
                   continue;
 
               /* mark as used - even if we find nothing at this port
                * at least we won't probe it in future
                */
                xxx_allports[i].used = 1;
 
               error = xxx_really_probe(dev, xxx_allports[i].port);
               if(error == 0) /* found a device at that port */
                   return 0;
           }
           /* probed all possible addresses, none worked */
           return ENXIO;</programlisting>
 
         <para>
           Of course, normally the driver's
           <function>identify()</function> routine should be used for
           such things. But there may be one valid reason why it may be
           better to be done in <function>probe()</function>: if this
           probe would drive some other sensitive device crazy.  The
           probe routines are ordered with consideration of the
           <literal>sensitive</literal> flag: the sensitive devices get probed first and
           the rest of the devices later.  But the
           <function>identify()</function> routines are called before
           any probes, so they show no respect to the sensitive devices
           and may upset them.
         </para>
 
         <para>
           Now that we have the starting port we need to set the port
           count (except for PnP devices) because the kernel does not
           have this information in the configuration file.
         </para>
 
         <programlisting>
          if (pnperror /* only for non-PnP devices */
            &#38;&#38; bus_set_resource(dev, SYS_RES_IOPORT, 0, sc-&gt;port0,
                XXX_PORT_COUNT) != 0)
              return ENXIO;</programlisting>
 
         <para>
           Finally allocate and activate a piece of port address space
           (special values of start and end mean <quote>use those we set by
           <function>bus_set_resource()</function></quote>):
         </para>
 
         <programlisting>
           sc-&gt;port0_rid = 0;
           sc-&gt;port0_r = bus_alloc_resource(dev, SYS_RES_IOPORT,
               &#38;sc-&gt;port0_rid,
               /*start*/ 0, /*end*/ ~0, /*count*/ 0, RF_ACTIVE);
 
           if(sc-&gt;port0_r == NULL)
               return ENXIO;</programlisting>
 
         <para>
           Now having access to the port-mapped registers we can poke
           the device in some way and check if it reacts like it is
           expected to. If it does not then there is probably some
           other device or no device at all at this address.
         </para>
 
         <para>
           Normally drivers do not set up the interrupt handlers until
           the attach routine. Instead they do probes in the polling
           mode using the <function>DELAY()</function> function for
           timeout. The probe routine must never hang forever; all the
           waits for the device must be done with timeouts. If the
           device does not respond within the time it is probably broken
           or misconfigured and the driver must return an error. When
           determining the timeout interval give the device some extra
           time to be on the safe side: although
           <function>DELAY()</function> is supposed to delay for the
           same amount of time on any machine it has some margin of
           error, depending on the exact CPU.
         </para>
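         <para>
           A bounded polling loop of this kind might look like the
           following stand-alone sketch. It is a user-space model, not
           driver code: <function>delay_model()</function> stands in
           for <function>DELAY()</function>, and
           <function>poll_ready()</function> and
           <function>ready_after()</function> are hypothetical names.
           Returning <errorname>ETIMEDOUT</errorname> is a choice made
           for the sketch; a real probe routine would translate a
           timeout into whatever error it reports for a missing or
           broken device.
         </para>

```c
#include <assert.h>
#include <errno.h>
#include <stdbool.h>

/* Stand-in for the kernel's DELAY(); a real driver would wait here. */
static void
delay_model(int usec)
{
	(void)usec;
}

/*
 * Poll ready() every step_us microseconds for at most timeout_us.
 * Returns 0 as soon as the device answers, ETIMEDOUT otherwise, so
 * the caller can never hang forever.
 */
static int
poll_ready(bool (*ready)(void *), void *arg, int timeout_us, int step_us)
{
	assert(step_us > 0);
	for (int waited = 0; waited <= timeout_us; waited += step_us) {
		if (ready(arg))
			return (0);
		delay_model(step_us);
	}
	return (ETIMEDOUT);
}

/* Example predicate: reports ready once *arg polls have been used up. */
static bool
ready_after(void *arg)
{
	int *polls_left = arg;

	if (*polls_left == 0)
		return (true);
	(*polls_left)--;
	return (false);
}
```

         <para>
           Choosing a generous <literal>timeout_us</literal> gives the
           device the extra time recommended above without risking an
           endless wait.
         </para>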
 
         <para>
           If the probe routine wants to check that the interrupts
           really work it may configure and probe the interrupts too.
           But that is not recommended.
         </para>
 
         <programlisting>
           /* implemented in some very device-specific way */
           if ((error = xxx_probe_ports(sc)) != 0)
               goto bad; /* will deallocate the resources before returning */
         </programlisting>
 
         <para>
           The function <function>xxx_probe_ports()</function> may also
           set the device description depending on the exact model of
           device it discovers.  But if there is only one supported
           device model this can just as well be done in a hardcoded way.
           Of course, for the PnP devices the PnP support sets the
           description from the table automatically.
         </para>
 
 
         <programlisting>          if(pnperror)
               device_set_desc(dev, "Our device model 1234");
         </programlisting>
 
         <para>
           Then the probe routine should either discover the ranges of
           all the resources by reading the device configuration
           registers or make sure that they were set explicitly by the
           user. We will illustrate this with the example of on-board
           memory. The probe routine should be as non-intrusive as
           possible, so allocating and checking the rest of the
           resources (besides the ports) is better left to the attach
           routine.
         </para>
 
         <para>
           The memory address may be specified in the kernel
           configuration file, or on some devices it may be
           pre-configured in non-volatile configuration registers.  If
           both sources are available and differ, which one should
           be used?  If the user bothered to set the address
           explicitly in the kernel configuration file, they probably
           know what they are doing, so that address should take
           precedence. An example implementation could be:
         </para>
         <programlisting>
           /* try to find out the config address first */
           sc-&gt;mem0_p = bus_get_resource_start(dev, SYS_RES_MEMORY, 0 /*rid*/);
           if(sc-&gt;mem0_p == 0) { /* nope, not specified by user */
               sc-&gt;mem0_p = xxx_read_mem0_from_device_config(sc);

               if(sc-&gt;mem0_p == 0)
                   /* can't get it from device config registers either */
                   goto bad;
           } else {
               if(xxx_set_mem0_address_on_device(sc) &lt; 0)
                   goto bad; /* device does not support that address */
           }
 
           /* just like the port, set the memory size,
            * for some devices the memory size would not be constant
            * but should be read from the device configuration registers instead
            * to accommodate different models of devices. Another option would
            * be to let the user set the memory size as "msize" configuration
            * resource which will be automatically handled by the ISA bus.
            */
           if(pnperror) { /* only for non-PnP devices */
               sc-&gt;mem0_size = bus_get_resource_count(dev, SYS_RES_MEMORY, 0 /*rid*/);
               if(sc-&gt;mem0_size == 0) /* not specified by user */
                   sc-&gt;mem0_size = xxx_read_mem0_size_from_device_config(sc);
 
               if(sc-&gt;mem0_size == 0) {
                   /* suppose this is a very old model of device without
                    * auto-configuration features and the user gave no preference,
                    * so assume the minimalistic case
                    * (of course, the real value will vary with the driver)
                    */
                   sc-&gt;mem0_size = 8*1024;
               }
 
               if(xxx_set_mem0_size_on_device(sc) &lt; 0)
                   goto bad; /* device does not support that size */
 
               if(bus_set_resource(dev, SYS_RES_MEMORY, /*rid*/0,
                       sc-&gt;mem0_p, sc-&gt;mem0_size)&lt;0)
                   goto bad;
           } else {
               sc-&gt;mem0_size = bus_get_resource_count(dev, SYS_RES_MEMORY, 0 /*rid*/);
           }        </programlisting>
 
         <para>
           Resources for IRQ and DRQ are easy to check by analogy.
         </para>
 
         <para>
           If all went well then release all the resources and return success.
         </para>
 
         <programlisting>          xxx_free_resources(sc);
           return 0;</programlisting>
 
         <para>
           Finally, handle the troublesome situations. All the
           resources should be deallocated before returning. We make
           use of the fact that the softc structure is zeroed out
           before it is passed to us, so we can tell whether a
           resource was allocated: if it was, its descriptor is non-zero.
         </para>
 
         <programlisting>          bad:
 
           xxx_free_resources(sc);
           if(error)
               return error;
           else /* exact error is unknown */
               return ENXIO;</programlisting>
 
         <para>
           That would be all for the probe routine. Freeing of
           resources is done from multiple places, so it is moved to a
           function which may look like:
         </para>
 
         <programlisting>          static void
           xxx_free_resources(struct xxx_softc *sc)
           {
               /* check every resource and free if not zero */
 
               /* interrupt handler */
               if(sc-&gt;intr_r) {
                   bus_teardown_intr(sc-&gt;dev, sc-&gt;intr_r, sc-&gt;intr_cookie);
                   bus_release_resource(sc-&gt;dev, SYS_RES_IRQ, sc-&gt;intr_rid,
                       sc-&gt;intr_r);
                   sc-&gt;intr_r = 0;
               }
 
               /* all kinds of memory maps we could have allocated */
               if(sc-&gt;data_p) {
                   bus_dmamap_unload(sc-&gt;data_tag, sc-&gt;data_map);
                   sc-&gt;data_p = 0;
               }
               if(sc-&gt;data) { /* sc-&gt;data_map may be legitimately equal to 0 */
                   /* the map will also be freed */
                   bus_dmamem_free(sc-&gt;data_tag, sc-&gt;data, sc-&gt;data_map);
                   sc-&gt;data = 0;
               }
               if(sc-&gt;data_tag) {
                   bus_dma_tag_destroy(sc-&gt;data_tag);
                   sc-&gt;data_tag = 0;
               }
 
               /* ... free other maps and tags if we have them ... */
 
               if(sc-&gt;parent_tag) {
                   bus_dma_tag_destroy(sc-&gt;parent_tag);
                   sc-&gt;parent_tag = 0;
               }
 
               /* release all the bus resources */
               if(sc-&gt;mem0_r) {
                   bus_release_resource(sc-&gt;dev, SYS_RES_MEMORY, sc-&gt;mem0_rid,
                       sc-&gt;mem0_r);
                   sc-&gt;mem0_r = 0;
               }
               ...
               if(sc-&gt;port0_r) {
                   bus_release_resource(sc-&gt;dev, SYS_RES_IOPORT, sc-&gt;port0_rid,
                       sc-&gt;port0_r);
                   sc-&gt;port0_r = 0;
               }
           }</programlisting>
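 
         <para>
           The <quote>non-zero means allocated</quote> convention used
           by <function>xxx_free_resources()</function> can be tried
           outside the kernel with a small sketch. The two-field softc,
           the release counter and
           <function>xxx_attach_sketch()</function> are hypothetical
           stand-ins for the real bus calls; the point is only that
           clearing each descriptor after releasing it makes the
           cleanup function safe to call from any failure point, and
           even twice.
         </para>

         <programlisting><![CDATA[
```c
#include <errno.h>
#include <stddef.h>
#include <string.h>

/* Hypothetical miniature softc: a non-NULL pointer means "allocated". */
struct xxx_softc {
	void *port0_r;
	void *mem0_r;
};

static int xxx_releases;	/* counts simulated bus_release_resource() calls */

static void
xxx_release(void **res)
{
	if (*res != NULL) {
		xxx_releases++;
		*res = NULL;	/* clearing makes a repeated call a no-op */
	}
}

static void
xxx_free_resources(struct xxx_softc *sc)
{
	xxx_release(&sc->mem0_r);
	xxx_release(&sc->port0_r);
}

/* Simulated attach that can fail after the first allocation. */
static int
xxx_attach_sketch(struct xxx_softc *sc, int mem_fails)
{
	static int port_token, mem_token;	/* dummy objects to point at */

	memset(sc, 0, sizeof(*sc));	/* the bus code hands over a zeroed softc */

	sc->port0_r = &port_token;
	if (mem_fails)
		goto bad;
	sc->mem0_r = &mem_token;
	return (0);

bad:
	xxx_free_resources(sc);	/* releases only what was really allocated */
	return (ENXIO);
}
```
]]></programlisting>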
 
      </sect1>
 
      <sect1 xml:id="isa-driver-attach">
      <title>xxx_isa_attach</title>
      <!-- Section Marked up by Wylie -->
 
         <para>The attach routine actually connects the driver to the
         system if the probe routine returned success and the system
         has chosen to attach that driver.  If the probe routine
         returned 0, the attach routine may expect to receive the
         device structure softc intact, as it was set by the probe
         routine. Also, if the probe routine returns 0, it may expect
         that the attach routine for this device will be called at
         some point in the future. If the probe routine returns a
         negative value, the driver may make neither of these
         assumptions.
         </para>
 
         <para>The attach routine returns 0 if it completed successfully or
           an error code otherwise.
         </para>
 
         <para>The attach routine starts just like the probe routine,
           by getting some frequently used data into more accessible
           variables.
         </para>
 
         <programlisting>          struct xxx_softc *sc = device_get_softc(dev);
           int unit = device_get_unit(dev);
           int error = 0;</programlisting>
 
         <para>Then allocate and activate all the necessary
-          resources. Because normally the port range will be released
+          resources. As normally the port range will be released
           before returning from probe, it has to be allocated
           again. We expect that the probe routine has properly set all
           the resource ranges and saved them in the structure
           softc. If the probe routine has left some resource allocated
           then it does not need to be allocated again (which would be
           considered an error).
         </para>
 
         <programlisting>          sc-&gt;port0_rid = 0;
           sc-&gt;port0_r = bus_alloc_resource(dev, SYS_RES_IOPORT,  &#38;sc-&gt;port0_rid,
               /*start*/ 0, /*end*/ ~0, /*count*/ 0, RF_ACTIVE);
 
           if(sc-&gt;port0_r == NULL)
               return ENXIO;
 
           /* on-board memory */
           sc-&gt;mem0_rid = 0;
           sc-&gt;mem0_r = bus_alloc_resource(dev, SYS_RES_MEMORY,  &#38;sc-&gt;mem0_rid,
               /*start*/ 0, /*end*/ ~0, /*count*/ 0, RF_ACTIVE);
 
           if(sc-&gt;mem0_r == NULL)
               goto bad;
 
           /* get its virtual address */
           sc-&gt;mem0_v = rman_get_virtual(sc-&gt;mem0_r);</programlisting>
 
         <para>The DMA request channel (DRQ) is allocated likewise. To
           initialize it use functions of the
           <function>isa_dma*()</function> family. For example:
         </para>
 
         <para><function>isa_dmacascade(sc-&gt;drq0);</function></para>
 
         <para>The interrupt request line (IRQ) is a bit
           special. Besides being allocated, it must have the driver's
           interrupt handler associated with it. Historically, in the
           old ISA drivers the argument passed by the system to the
           interrupt handler was the device unit number. In modern
           drivers the convention is to pass a pointer to the
           structure softc. The main reason is that when the softc
           structures are allocated dynamically, getting the unit
           number from softc is easy while getting softc from the unit
           number is difficult. This convention also makes the drivers
           for different buses look more uniform and allows them to
           share code: each bus gets its own probe, attach, detach and
           other bus-specific routines while the bulk of the driver
           code may be shared among them.
         </para>
 
         <programlisting>
           sc-&gt;intr_rid = 0;
           sc-&gt;intr_r = bus_alloc_resource(dev, SYS_RES_IRQ,  &#38;sc-&gt;intr_rid,
               /*start*/ 0, /*end*/ ~0, /*count*/ 0, RF_ACTIVE);
 
           if(sc-&gt;intr_r == NULL)
               goto bad;
 
           /*
            * XXX_INTR_TYPE is supposed to be defined depending on the type of
            * the driver, for example as INTR_TYPE_CAM for a CAM driver
            */
           error = bus_setup_intr(dev, sc-&gt;intr_r, XXX_INTR_TYPE,
               (driver_intr_t *) xxx_intr, (void *) sc, &#38;sc-&gt;intr_cookie);
           if(error)
               goto bad;
 
         </programlisting>
 
 
         <para>If the device needs to perform DMA to main memory then
           this memory should be allocated as described before:
         </para>
 
         <programlisting>          error=bus_dma_tag_create(NULL, /*alignment*/ 4,
               /*boundary*/ 0, /*lowaddr*/ BUS_SPACE_MAXADDR_24BIT,
               /*highaddr*/ BUS_SPACE_MAXADDR, /*filter*/ NULL, /*filterarg*/ NULL,
               /*maxsize*/ BUS_SPACE_MAXSIZE_24BIT,
               /*nsegments*/ BUS_SPACE_UNRESTRICTED,
               /*maxsegsz*/ BUS_SPACE_MAXSIZE_24BIT, /*flags*/ 0,
               &#38;sc-&gt;parent_tag);
           if(error)
               goto bad;
 
           /* many things get inherited from the parent tag
            * sc-&gt;data is supposed to point to the structure with the shared data,
            * for example for a ring buffer it could be:
            * struct {
            *   u_short rd_pos;
            *   u_short wr_pos;
            *   char    bf[XXX_RING_BUFFER_SIZE];
            * } *data;
            */
           error=bus_dma_tag_create(sc-&gt;parent_tag, 1,
               0, BUS_SPACE_MAXADDR, 0, /*filter*/ NULL, /*filterarg*/ NULL,
               /*maxsize*/ sizeof(* sc-&gt;data), /*nsegments*/ 1,
               /*maxsegsz*/ sizeof(* sc-&gt;data), /*flags*/ 0,
               &#38;sc-&gt;data_tag);
           if(error)
               goto bad;
 
           error = bus_dmamem_alloc(sc-&gt;data_tag, &#38;sc-&gt;data, /* flags*/ 0,
               &#38;sc-&gt;data_map);
           if(error)
                goto bad;
 
           /* xxx_alloc_callback() just saves the physical address at
            * the pointer passed as its argument, in this case &#38;sc-&gt;data_p.
            * See details in the section on bus memory mapping.
            * It can be implemented like:
            *
            * static void
            * xxx_alloc_callback(void *arg, bus_dma_segment_t *seg,
            *     int nseg, int error)
            * {
            *    *(bus_addr_t *)arg = seg[0].ds_addr;
            * }
            */
           bus_dmamap_load(sc-&gt;data_tag, sc-&gt;data_map, (void *)sc-&gt;data,
               sizeof (* sc-&gt;data), xxx_alloc_callback, (void *) &#38;sc-&gt;data_p,
               /*flags*/0);</programlisting>
 
 
         <para>After all the necessary resources are allocated the
           device should be initialized. The initialization may include
           testing that all the expected features are functional.</para>
 
         <programlisting>          if(xxx_initialize(sc) &lt; 0)
                goto bad;        </programlisting>
 
 
         <para>The bus subsystem will automatically print on the
           console the device description set by probe. But if the
           driver wants to print some extra information about the
           device it may do so, for example:</para>
 
         <programlisting>
         device_printf(dev, "has on-card FIFO buffer of %d bytes\n", sc-&gt;fifosize);
         </programlisting>
 
         <para>If the initialization routine experiences any problems
           then printing messages about them before returning an error
           is also recommended.</para>
 
         <para>The final step of the attach routine is attaching the
           device to its functional subsystem in the kernel. The exact
           way to do it depends on the type of the driver: a character
           device, a block device, a network device, a CAM SCSI bus
           device and so on.</para>
 
         <para>If all went well then return success.</para>
 
         <programlisting>          error = xxx_attach_subsystem(sc);
           if(error)
               goto bad;
 
           return 0;        </programlisting>
 
         <para>Finally, handle the troublesome situations. All the
           resources should be deallocated before returning an
           error. We make use of the fact that the softc structure is
           zeroed out before it is passed to us, so we can tell
           whether a resource was allocated: if it was, its
           descriptor is non-zero.</para>
 
         <programlisting>          bad:
 
           xxx_free_resources(sc);
           if(error)
               return error;
           else /* exact error is unknown */
               return ENXIO;</programlisting>
 
         <para>That would be all for the attach routine.</para>
 
      </sect1>
 
 
      <sect1 xml:id="isa-driver-detach">
        <title>xxx_isa_detach</title>
 
         <para>
           If this function is present in the driver and the driver is
           compiled as a loadable module then the driver gets the
           ability to be unloaded. This is an important feature if the
           hardware supports hot plug. But the ISA bus does not support
           hot plug, so this feature is not particularly important for
           the ISA devices. The ability to unload a driver may be
           useful when debugging it, but in many cases a new version
           of the driver needs to be installed only after the old
           version has somehow wedged the system and a reboot is
           needed anyway, so the effort spent on writing the detach
           routine may not be worth it. Another argument, that
           unloading would allow upgrading the drivers on a production
           machine, seems to be mostly theoretical. Installing a new
           version of a driver is a dangerous operation which should
           never be performed on a production machine (and which is not
           permitted when the system is running in secure mode).  Still,
           the detach routine may be provided for the sake of
           completeness.
         </para>
 
         <para>
           The detach routine returns 0 if the driver was successfully
           detached or an error code otherwise.
         </para>
 
         <para>
           The logic of detach mirrors that of attach. The first
           thing to do is to detach the driver from its kernel
           subsystem. If the device is currently open then the driver
           has two choices: refuse to be detached, or forcibly close
           the device and proceed with the detach. The choice depends
           on the ability of the particular kernel subsystem to do a
           forced close and on the preferences of the driver's author.
           Generally the forced close seems to be the preferred alternative.
         <programlisting>          struct xxx_softc *sc = device_get_softc(dev);
           int error;
 
           error = xxx_detach_subsystem(sc);
           if(error)
               return error;</programlisting>
         </para>
         <para>
           Next the driver may want to reset the hardware to some
           consistent state.  That includes stopping any ongoing
           transfers, disabling the DMA channels and interrupts to
           avoid memory corruption by the device. For most of the
           drivers this is exactly what the shutdown routine does, so
           if it is included in the driver we can just call it.
         </para>
         <para><function>xxx_isa_shutdown(dev);</function></para>
 
         <para>
           And finally release all the resources and return success.
         <programlisting>          xxx_free_resources(sc);
           return 0;</programlisting>
 
         </para>
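 
         <para>
           Put together, the three detach steps can be sketched as one
           function. The version below is a simulation that compiles
           outside the kernel: the softc fields and the bodies of the
           helper functions are assumptions made for the example, and
           the handlers take the softc directly rather than a
           <type>device_t</type>. For brevity the sketch takes the
           <quote>refuse to detach</quote> option when the device is
           open; a forced close is the other possibility.
         </para>

         <programlisting><![CDATA[
```c
#include <errno.h>

/* Hypothetical softc holding only the state this sketch needs. */
struct xxx_softc {
	int open_count;		/* users holding the device open */
	int dma_enabled;	/* simulated hardware state */
	int resources_held;	/* simulated bus resources */
};

/* Step 1: leave the kernel subsystem, refusing while the device is open. */
static int
xxx_detach_subsystem(struct xxx_softc *sc)
{
	return (sc->open_count > 0 ? EBUSY : 0);
}

/* Step 2: bring the hardware to a quiet, consistent state. */
static void
xxx_isa_shutdown(struct xxx_softc *sc)
{
	sc->dma_enabled = 0;
}

/* Step 3: give everything back to the bus. */
static void
xxx_free_resources(struct xxx_softc *sc)
{
	sc->resources_held = 0;
}

static int
xxx_isa_detach(struct xxx_softc *sc)
{
	int error;

	error = xxx_detach_subsystem(sc);
	if (error)
		return (error);	/* nothing has been torn down yet */

	xxx_isa_shutdown(sc);
	xxx_free_resources(sc);
	return (0);
}
```
]]></programlisting>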
      </sect1>
 
      <sect1 xml:id="isa-driver-shutdown">
        <title>xxx_isa_shutdown</title>
 
         <para>
           This routine is called when the system is about to be shut
           down. It is expected to bring the hardware to some
           consistent state. For most ISA devices no special
           action is required, so the function is not really necessary
           because the device will be re-initialized on reboot
           anyway. But some devices have to be shut down with a special
           procedure, to make sure that they will be properly detected
           after a soft reboot (this is especially true for many devices
           with proprietary identification protocols).  In any case
           disabling DMA and interrupts in the device registers and
           stopping any ongoing transfers is a good idea. The exact
           action depends on the hardware, so we do not consider it here
           in any detail.
         </para>
      </sect1>
 
      <sect1 xml:id="isa-driver-intr">
           <title>xxx_intr</title>
 
         <indexterm><primary>interrupt handler</primary></indexterm>
 
         <para>
           The interrupt handler is called when an interrupt is
           received which may be from this particular device. The ISA
           bus does not support interrupt sharing (except in some special
           cases), so in practice if the interrupt handler is called
           then the interrupt almost certainly came from its
           device. Still, the interrupt handler must poll the device
           registers and make sure that the interrupt was generated by
           its device. If not, it should just return.
         </para>
 
         <para>
           The old convention for the ISA drivers was getting the
           device unit number as an argument. This is obsolete, and the
           new drivers receive whatever argument was specified for them
           in the attach routine when calling
           <function>bus_setup_intr()</function>. By the new convention
           it should be the pointer to the structure softc. So the
           interrupt handler commonly starts as:
         </para>
 
         <programlisting>
           static void
           xxx_intr(struct xxx_softc *sc)
           {
 
         </programlisting>
 
         <para>
           It runs at the interrupt priority level specified by the
           interrupt type parameter of
           <function>bus_setup_intr()</function>. That means that all
           the other interrupts of the same type as well as all the
           software interrupts are disabled.
         </para>
 
         <para>
           To avoid races it is commonly written as a loop:
         </para>
 
         <programlisting>
           while(xxx_interrupt_pending(sc)) {
               xxx_process_interrupt(sc);
               xxx_acknowledge_interrupt(sc);
           }        </programlisting>
 
         <para>
           The interrupt handler has to acknowledge the interrupt
           only to the device, not to the interrupt controller; the
           system takes care of the latter.
         </para>
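 
         <para>
           A complete handler following this pattern might look like
           the sketch below. It is a user-space simulation: the
           <varname>pending</varname> and <varname>handled</varname>
           counters and the three helper bodies are assumptions that
           stand in for real device register accesses. The handler
           takes a <type>void *</type> argument here, which avoids the
           need for a function pointer cast when registering it.
         </para>

         <programlisting><![CDATA[
```c
/* Hypothetical softc: counters stand in for device registers. */
struct xxx_softc {
	int pending;	/* events the simulated device has latched */
	int handled;	/* events the handler has processed */
};

static int
xxx_interrupt_pending(struct xxx_softc *sc)
{
	return (sc->pending > 0);
}

static void
xxx_process_interrupt(struct xxx_softc *sc)
{
	sc->handled++;
}

/* Acknowledge to the device only; the system handles the controller. */
static void
xxx_acknowledge_interrupt(struct xxx_softc *sc)
{
	sc->pending--;
}

static void
xxx_intr(void *arg)
{
	struct xxx_softc *sc = arg;

	/*
	 * If nothing is pending the interrupt was not ours and the loop
	 * body never runs; otherwise keep going until the device is
	 * drained, so an event arriving during processing is not lost.
	 */
	while (xxx_interrupt_pending(sc)) {
		xxx_process_interrupt(sc);
		xxx_acknowledge_interrupt(sc);
	}
}
```
]]></programlisting>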
 
      </sect1>
 </chapter>
diff --git a/en_US.ISO8859-1/books/arch-handbook/pccard/chapter.xml b/en_US.ISO8859-1/books/arch-handbook/pccard/chapter.xml
index a9a2753d9a..59261a9568 100644
--- a/en_US.ISO8859-1/books/arch-handbook/pccard/chapter.xml
+++ b/en_US.ISO8859-1/books/arch-handbook/pccard/chapter.xml
@@ -1,367 +1,367 @@
 <?xml version="1.0" encoding="iso-8859-1"?>
 <!--
      The FreeBSD Documentation Project
 
      $FreeBSD$
 -->
 <chapter xmlns="http://docbook.org/ns/docbook" xmlns:xlink="http://www.w3.org/1999/xlink" version="5.0" xml:id="pccard">
   <title>PC Card</title>
 
   <indexterm><primary>PC Card</primary></indexterm>
   <indexterm><primary>CardBus</primary></indexterm>
 
   <para>This chapter will talk about the FreeBSD mechanisms for
     writing a device driver for a PC Card or CardBus device.  However,
     at present it just documents how to add a new device to an
     existing pccard driver.</para>
 
   <sect1 xml:id="pccard-adddev">
     <title>Adding a Device</title>
 
     <para>Device drivers know what devices they support.  There is a
       table of supported devices in the kernel that drivers use to
       attach to a device.</para>
 
     <sect2 xml:id="pccard-overview">
       <title>Overview</title>
 
       <indexterm><primary>CIS</primary></indexterm>
       <para>PC Cards are identified in one of two ways, both based on
 	the <firstterm>Card Information Structure</firstterm>
 	(<acronym role="Card Information Structure">CIS</acronym>)
 	stored on the card.  The
 	first method is to use numeric manufacturer and product
 	numbers.  The second method is to use the human readable
 	strings that are also contained in the CIS.  The PC Card bus
 	uses a centralized database and some macros to facilitate a
 	design pattern that helps driver writers match devices to
 	their drivers.</para>
 
       <para>Original equipment manufacturers (<acronym>OEM</acronym>s)
 	often develop a reference design for a PC Card product, then
 	sell this design to other companies to market.  Those
 	companies refine the design, market the product to their
 	target audience or geographic area, and put their own name
 	plate onto the card.  The refinements to the physical card are
 	typically very minor, if any changes are made at all.  To
 	strengthen their brand, these vendors place their company name
 	in the human readable strings in the CIS space, but leave the
 	manufacturer and product IDs unchanged.</para>
 
       <indexterm><primary>NetGear</primary></indexterm>
       <indexterm><primary>Linksys</primary></indexterm>
       <indexterm><primary>D-Link</primary></indexterm>
 
-      <para>Because of this practice, FreeBSD drivers usually rely on
+      <para>Due to this practice, FreeBSD drivers usually rely on
 	numeric IDs for device identification.  Using numeric IDs and
 	a centralized database complicates adding IDs and support for
 	cards to the system.  One must carefully check to see who
 	really made the card, especially when it appears that the
 	vendor who made the card might already have a different
 	manufacturer ID listed in the central database.  Linksys,
 	D-Link, and NetGear are among the US manufacturers of LAN
 	hardware that often sell the same design.  These same designs
 	can be sold in Japan under names such as Buffalo and Corega.
 	Often, these devices will all have the same manufacturer and
 	product IDs.</para>
 
       <para>The PC Card bus code keeps a central database of card
 	information in
 	<filename>/sys/dev/pccard/pccarddevs</filename>, but does not
 	record which driver is associated with each card.  It also
 	provides a set of macros that allow one to easily construct
 	simple entries in the table the driver uses to claim
 	devices.</para>
 
       <para>Finally, some really low end devices do not contain
 	manufacturer identification at all.  These devices must be
 	detected by matching the human readable CIS strings.
 	While it would be nice if we did not need this method as a
 	fallback, it is necessary for some very low end CD-ROM players
 	and Ethernet cards.  This method should generally be
 	avoided, but a number of devices are listed in this section
 	because they were added prior to the recognition of the
 	<acronym>OEM</acronym> nature of the PC Card business.  When
 	adding new devices, prefer using the numeric method.</para>
     </sect2>
 
     <sect2 xml:id="pccard-pccarddevs">
       <title>Format of <filename>pccarddevs</filename></title>
 
       <para>There are four sections in the
 	<filename>pccarddevs</filename> file.  The first section
 	lists the manufacturer numbers for vendors that use
 	them.  This section is sorted in numerical order.  The next
 	section has all of the products that are used by these
 	vendors, along with their product ID numbers and a description
 	string.  The description string typically is not used (instead
 	we set the device's description based on the human readable
 	CIS, even if we match on the numeric version).  These two
 	sections are then repeated for devices that use the
 	string matching method.  Finally, C-style comments enclosed in
 	<literal>/*</literal> and <literal>*/</literal> characters are
 	allowed anywhere in the file.</para>
 
       <para>The first section of the file contains the vendor IDs.
 	Please keep this list sorted in numeric order.  Also, please
 	coordinate changes to this file because we share it with
 	NetBSD to help facilitate a common clearing house for this
 	information.  For example, here are the first few vendor
 	IDs:</para>
 
       <programlisting>vendor FUJITSU			0x0004  Fujitsu Corporation
 vendor NETGEAR_2		0x000b  Netgear
 vendor PANASONIC		0x0032	Matsushita Electric Industrial Co.
 vendor SANDISK			0x0045	Sandisk Corporation</programlisting>
 
       <para>Chances are very good
 	that the <literal>NETGEAR_2</literal> entry is really an OEM
 	that Netgear purchased cards from, and that the author of support
 	for those cards was unaware at the time that Netgear was using
 	someone else's ID.  These entries are fairly straightforward.
 	The <literal>vendor</literal> keyword denotes the kind of line that this is,
 	followed by the name of the vendor.  This name will be
 	repeated later in <filename>pccarddevs</filename>, as
 	well as used in the driver's match tables, so keep it short
 	and a valid C identifier.  A numeric ID in hex identifies the
 	manufacturer.  Do not add IDs of the form
 	<literal>0xffffffff</literal> or <literal>0xffff</literal>
 	because these are reserved IDs (the former is
 	<quote>no ID set</quote> while the latter is sometimes seen in
 	extremely poor quality cards to try to indicate
 	<quote>none</quote>).  Finally there is a string description
 	of the company that makes the card.  This string is not used
 	in FreeBSD for anything but commentary purposes.</para>
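 
       <para>To make the layout of a vendor line concrete, here is a
 	small parser sketch.  It is not the script that actually
 	processes <filename>pccarddevs</filename>; the structure, the
 	function name and the buffer sizes are assumptions chosen for
 	the example.  It splits a line into the name, the numeric ID
 	and the description, and rejects the two reserved IDs
 	mentioned above.</para>

       <programlisting><![CDATA[
```c
#include <stdio.h>

/* Hypothetical in-memory form of one "vendor" line. */
struct xxx_vendor {
	char name[32];		/* short name, a valid C identifier */
	unsigned long id;	/* numeric manufacturer ID */
	char desc[64];		/* free-form description, commentary only */
};

/* Parse "vendor NAME ID description"; return 0 on success, -1 otherwise. */
static int
xxx_parse_vendor(const char *line, struct xxx_vendor *v)
{
	long long id;
	int used = -1;

	/* %lli accepts the 0x prefix; %n records where the description starts */
	if (sscanf(line, "vendor %31s %lli %n", v->name, &id, &used) != 2 ||
	    used < 0)
		return (-1);

	/* reserved values: "no ID set" and the "none" seen on poor cards */
	if (id == 0xffff || id == 0xffffffffLL)
		return (-1);

	v->id = (unsigned long)id;
	snprintf(v->desc, sizeof(v->desc), "%s", line + used);
	return (0);
}
```
]]></programlisting>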
 
       <para>The second section of the file contains the products.  As
 	shown in this example, the format is similar to the vendor
 	lines:</para>
 
       <programlisting>/* Allied Telesis K.K. */
 product ALLIEDTELESIS LA_PCM	0x0002 Allied Telesis LA-PCM
 
 /* Archos */
 product	ARCHOS ARC_ATAPI	0x0043 MiniCD</programlisting>
 
       <para>The
 	<literal>product</literal> keyword is followed by the vendor
 	name, repeated from above.  This is followed by the product
 	name, which is used by the driver and should be a valid C
 	identifier, but may also start with a number.  As with the
 	vendors, the hex product ID for this card follows the same
 	convention for <literal>0xffffffff</literal> and
 	<literal>0xffff</literal>.  Finally, there is a string
 	description of the device itself.  This string typically is
 	not used in FreeBSD, since FreeBSD's pccard bus driver will
 	construct a string from the human readable CIS entries, but it
 	can be used in the rare cases where this is somehow
 	insufficient.  The products are in alphabetical order by
 	manufacturer, then numerical order by product ID.  They have a
 	C comment before each manufacturer's entries and there is a
 	blank line between entries.</para>
 
       <para>The third section is like the previous vendor section, but
 	with all of the manufacturer numeric IDs set to
 	<literal>-1</literal>, meaning
 	<quote>match anything found</quote> in the FreeBSD pccard
 	bus code.  Since these are C identifiers, their names must be
 	unique.  Otherwise the format is identical to the first
 	section of the file.</para>
 
       <para>The final section contains the entries for those cards
 	that must be identified by string entries.  This section's
 	format is a little different from the generic section:</para>
 
       <programlisting>product ADDTRON AWP100		{ "Addtron", "AWP-100&amp;spWireless&amp;spPCMCIA", "Version&amp;sp01.02", NULL }
 product ALLIEDTELESIS WR211PCM	{ "Allied&amp;spTelesis&amp;spK.K.", "WR211PCM", NULL, NULL } Allied Telesis WR211PCM</programlisting>
 
       <para>The familiar <literal>product</literal> keyword is
 	followed by the vendor name and the card name, just as in the
 	second section of the file.  Here the format deviates from
 	that used earlier.  There is a {} grouping containing a
 	number of strings.  These strings correspond to the vendor,
 	product, and extra information that is defined in a CIS_INFO
 	tuple.  These strings are filtered by the program that
 	generates <filename>pccarddevs.h</filename> to replace &amp;sp
 	with a real space.  NULL strings mean that the corresponding
 	part of the entry should be ignored.  The example shown here
 	contains a bad entry.  It should not contain the version
 	number unless that is critical for the operation of the card.
 	Sometimes vendors will have many different versions of the
 	card in the field that all work, in which case that
 	information only makes it harder for someone with a similar
 	card to use it with FreeBSD.  Sometimes it is necessary when a
 	vendor wishes to sell many different parts under the same
 	brand due to market considerations (availability, price, and
 	so forth).  Then it can be critical to disambiguating the card
 	in those rare cases where the vendor kept the same
 	manufacturer/product pair.  Regular expression matching is not
 	available at this time.</para>
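 
       <para>The two rules above, replacing &amp;sp with a space and
 	ignoring NULL strings, can be sketched in a few lines of C.
 	Both functions are hypothetical illustrations written for
 	this text; they are not the code used by the generator or by
 	the pccard bus, and the fixed count of four strings is an
 	assumption taken from the examples shown.</para>

       <programlisting><![CDATA[
```c
#include <stddef.h>
#include <string.h>

/* Replace each "&sp" in s with a single space, in place. */
static void
xxx_expand_sp(char *s)
{
	char *src = s, *dst = s;

	while (*src != '\0') {
		if (strncmp(src, "&sp", 3) == 0) {
			*dst++ = ' ';
			src += 3;
		} else
			*dst++ = *src++;
	}
	*dst = '\0';
}

/*
 * Compare the strings of a table entry with those read from the card.
 * A NULL in the entry means "ignore this position".  Returns 1 on match.
 */
static int
xxx_cis_match(const char *const pat[4], const char *const cis[4])
{
	int i;

	for (i = 0; i < 4; i++) {
		if (pat[i] == NULL)
			continue;
		if (cis[i] == NULL || strcmp(pat[i], cis[i]) != 0)
			return (0);
	}
	return (1);
}
```
]]></programlisting>

       <para>With such a matcher, an entry that leaves the version
 	string NULL keeps matching when a vendor ships a new revision
 	of the card, which is the reason given above for omitting
 	it.</para>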
     </sect2>
 
     <sect2 xml:id="pccard-probe">
       <title>Sample Probe Routine</title>
 
       <indexterm>
 	<primary>PC Card</primary>
 	<secondary>probe</secondary>
       </indexterm>
 
       <para>To understand how to add a device to the list of supported
 	devices, one must understand the probe and/or match routines
 	that many drivers have.  It is complicated a little in FreeBSD
 	5.x because there is a compatibility layer for OLDCARD present
 	as well.  Since only the window-dressing is different, an
 	idealized version will be presented here.</para>
 
       <programlisting>static const struct pccard_product wi_pccard_products[] = {
 	PCMCIA_CARD(3COM, 3CRWE737A, 0),
 	PCMCIA_CARD(BUFFALO, WLI_PCM_S11, 0),
 	PCMCIA_CARD(BUFFALO, WLI_CF_S11G, 0),
 	PCMCIA_CARD(TDK, LAK_CD011WL, 0),
 	{ NULL }
 };
 
 static int
 wi_pccard_probe(device_t dev)
 {
 	const struct pccard_product *pp;
 
 	if ((pp = pccard_product_lookup(dev, wi_pccard_products,
 	    sizeof(wi_pccard_products[0]), NULL)) != NULL) {
 		if (pp-&gt;pp_name != NULL)
 			device_set_desc(dev, pp-&gt;pp_name);
 		return (0);
 	}
 	return (ENXIO);
 }</programlisting>
 
       <para>Here we have a simple pccard probe routine that matches a
 	few devices.  As stated above, the name may vary (if it is not
 	<function>foo_pccard_probe()</function> it will be
 	<function>foo_pccard_match()</function>).  The function
 	<function>pccard_product_lookup()</function> is a generalized
 	function that walks the table and returns a pointer to the
 	first entry that it matches.  Some drivers may use this
 	mechanism to convey additional information about some cards to
 	the rest of the driver, so there may be some variance in the
 	table.  The only requirement is that each row of the table
 	must have a <function>struct</function>
 	<varname remap="structname">pccard_product</varname> as the first
 	element.</para>
 
       <para>Looking at the table
 	<varname remap="structname">wi_pccard_products</varname>, one notices that
 	all the entries are of the form
 	<function>PCMCIA_CARD(<replaceable>foo</replaceable>,
 	  <replaceable>bar</replaceable>,
 	  <replaceable>baz</replaceable>)</function>.  The
 	<replaceable>foo</replaceable> part is the manufacturer ID
 	from <filename>pccarddevs</filename>.  The
 	<replaceable>bar</replaceable> part is the product ID.
 	<replaceable>baz</replaceable> is the expected function number
 	for this card.  Many pccards can have multiple functions,
 	and some way to disambiguate function 1 from function 0 is
 	needed.  You may see <literal>PCMCIA_CARD_D</literal>, which
 	includes the device description from
 	<filename>pccarddevs</filename>.  You may also see
 	<literal>PCMCIA_CARD2</literal> and
 	<literal>PCMCIA_CARD2_D</literal> which are used when you need
 	to match both CIS strings and manufacturer numbers, in the
 	<quote>use the default description</quote> and <quote>take the
 	  description from pccarddevs</quote> flavors.</para>
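 
       <para>The table walk described above can be pictured with the
 	following self-contained sketch.  It is not the kernel's
 	<function>pccard_product_lookup()</function>: the structure
 	layout and the names <literal>ex_product</literal>,
 	<literal>ex_wi_product</literal>, and
 	<function>ex_product_lookup()</function> are assumptions made
 	for illustration, and only the numeric IDs are taken from this
 	section.  It shows why the generic structure has to be the
 	first member of each row: the lookup steps through the table
 	by the row size supplied by the caller and only ever examines
 	the leading member.</para>

```c
#include <assert.h>
#include <stddef.h>
#include <stdint.h>

/* Hypothetical stand-in for struct pccard_product. */
struct ex_product {
	const char	*name;		/* NULL terminates the table */
	uint32_t	 manufacturer;
	uint32_t	 product;
	int		 function;
};

/* A driver-specific row: generic part first, private data after it. */
struct ex_wi_product {
	struct ex_product	prod;	/* must be the first member */
	int			quirks;	/* hypothetical driver-private field */
};

/*
 * Walk a table whose rows are ent_size bytes apart and return the
 * first row matching the IDs, or NULL when the terminator is reached.
 */
static const struct ex_product *
ex_product_lookup(const void *table, size_t ent_size,
    uint32_t manufacturer, uint32_t product, int function)
{
	const char *row;
	const struct ex_product *pp;

	assert(ent_size >= sizeof(struct ex_product));
	for (row = table; ; row += ent_size) {
		pp = (const struct ex_product *)(const void *)row;
		if (pp->name == NULL)
			return (NULL);
		if (pp->manufacturer == manufacturer &&
		    pp->product == product && pp->function == function)
			return (pp);
	}
}

/* Example table; the quirks values are made up. */
static const struct ex_wi_product ex_wi_products[] = {
	{ { "BUFFALO AirStation 11Mbps WLAN", 0x026f, 0x0305, 0 }, 0 },
	{ { "BUFFALO AirStation 11Mbps CF WLAN", 0x026f, 0x030b, 0 }, 1 },
	{ { NULL, 0, 0, 0 }, 0 }
};
```

       <para>Because the generic structure comes first, a driver can
 	cast the returned pointer back to its own row type to reach
 	the private fields.</para>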
     </sect2>
 
     <sect2 xml:id="pccard-add">
       <title>Putting it All Together</title>
 
       <para>To add a new device, one must first obtain the
 	identification information from the
 	device.  The easiest way to do this is to insert the device
 	into a PC Card or CF slot and issue
 	<command>devinfo -v</command>.  Sample output:</para>
 
       <programlisting>        cbb1 pnpinfo vendor=0x104c device=0xac51 subvendor=0x1265 subdevice=0x0300 class=0x060700 at slot=10 function=1
           cardbus1
           pccard1
             unknown pnpinfo manufacturer=0x026f product=0x030c cisvendor="BUFFALO" cisproduct="WLI2-CF-S11" function_type=6 at function=0</programlisting>
 
       <para><literal>manufacturer</literal>
 	and <literal>product</literal> are the numeric IDs for this
 	product, while <literal>cisvendor</literal> and
 	<literal>cisproduct</literal> are the product description
 	strings from the CIS.</para>
 
       <para>Since we first want to prefer the numeric option, first
 	try to construct an entry based on that.  The above card has
 	been slightly fictionalized for the purpose of this example.
 	The vendor is BUFFALO, which we see already has an
 	entry:</para>
 
       <programlisting>vendor BUFFALO			0x026f	BUFFALO (Melco Corporation)</programlisting>
 
       <para>But there is no entry for this particular card.
 	Instead we find:</para>
 
       <programlisting>/* BUFFALO */
 product BUFFALO WLI_PCM_S11	0x0305	BUFFALO AirStation 11Mbps WLAN
 product BUFFALO LPC_CF_CLT	0x0307	BUFFALO LPC-CF-CLT
 product	BUFFALO	LPC3_CLT	0x030a	BUFFALO LPC3-CLT Ethernet Adapter
 product BUFFALO WLI_CF_S11G	0x030b	BUFFALO AirStation 11Mbps CF WLAN</programlisting>
 
       <para>To add the device, we can just add this entry to
 	<filename>pccarddevs</filename>:</para>
 
       <programlisting>product BUFFALO WLI2_CF_S11G	0x030c	BUFFALO AirStation ultra 802.11b CF</programlisting>
 
       <para>Once these steps are complete, the card can be added to
 	the driver.  That is a simple operation of adding one
 	line:</para>
 
       <programlisting>static const struct pccard_product wi_pccard_products[] = {
 	PCMCIA_CARD(3COM, 3CRWE737A, 0),
 	PCMCIA_CARD(BUFFALO, WLI_PCM_S11, 0),
 	PCMCIA_CARD(BUFFALO, WLI_CF_S11G, 0),
 +	PCMCIA_CARD(BUFFALO, WLI2_CF_S11G, 0),
 	PCMCIA_CARD(TDK, LAK_CD011WL, 0),
 	{ NULL }
 };</programlisting>
 
       <para>Note that I have included a '<literal>+</literal>' at the
 	start of the line that I added, but that is simply to
 	highlight the line.  Do not add it to the actual driver.  Once
 	you have added the line, you can recompile your kernel or
 	module and test it.  If the device is recognized and works,
 	please submit a patch.  If it does not work, please figure out
 	what is needed to make it work and submit a patch.  If the
 	device is not recognized at all, you have done something wrong
 	and should recheck each step.</para>
 
       <para>If you are a FreeBSD src committer, and everything appears
 	to be working, then you can commit the changes to the tree.
 	However, there are some minor tricky things to be considered.
 	<filename>pccarddevs</filename> must be committed to the tree
 	first.  Then <filename>pccarddevs.h</filename> must be
 	regenerated and committed as a second step, ensuring that the
 	right &dollar;FreeBSD&dollar; tag is in the latter file.
 	Finally, commit the additions to the driver.</para>
     </sect2>
 
     <sect2 xml:id="pccard-pr">
       <title>Submitting a New Device</title>
 
       <para>Please do not send entries for new devices to the author
 	directly.  Instead, submit them as a PR and send the author
 	the PR number for his records.  This ensures that entries are
 	not lost.  When submitting a PR, it is unnecessary to include
 	the <filename>pccarddevs.h</filename> diffs in the patch, since
 	those will be regenerated.  It is necessary to include a
 	description of the device, as well as the patches to the
 	client driver.  If you do not know the name, use OEM99 as the
 	name, and the author will adjust OEM99 accordingly after
 	investigation.  Committers should not commit OEM99, but
 	instead find the highest OEM entry and commit one more than
 	that.</para>
     </sect2>
   </sect1>
 </chapter>
diff --git a/en_US.ISO8859-1/books/arch-handbook/scsi/chapter.xml b/en_US.ISO8859-1/books/arch-handbook/scsi/chapter.xml
index c325840bed..7de627b5b9 100644
--- a/en_US.ISO8859-1/books/arch-handbook/scsi/chapter.xml
+++ b/en_US.ISO8859-1/books/arch-handbook/scsi/chapter.xml
@@ -1,2239 +1,2239 @@
 <?xml version="1.0" encoding="iso-8859-1"?>
 <!--
      The FreeBSD Documentation Project
 
      $FreeBSD$
 -->
 <chapter xmlns="http://docbook.org/ns/docbook" xmlns:xlink="http://www.w3.org/1999/xlink" version="5.0" xml:id="scsi">
   <info><title>Common Access Method SCSI Controllers</title>
     <authorgroup>
       <author><personname><firstname>Sergey</firstname><surname>Babkin</surname></personname><contrib>Written by </contrib></author>
     </authorgroup>
     <authorgroup>
       <author><personname><firstname>Murray</firstname><surname>Stokely</surname></personname><contrib>Modifications for Handbook made by </contrib></author>
     </authorgroup>
   </info>
 
   
 
   <sect1 xml:id="scsi-synopsis">
     <title>Synopsis</title>
 
     <indexterm><primary>SCSI</primary></indexterm>
     <para>This document assumes that the reader has a general
       understanding of device drivers in FreeBSD and of the SCSI
       protocol.  Much of the information in this document was
       extracted from the drivers:</para>
 
     <itemizedlist>
 
       <listitem><para>ncr (<filename>/sys/pci/ncr.c</filename>) by
 	Wolfgang Stanglmeier and Stefan Esser</para></listitem>
 
       <listitem>
 	<para>sym (<filename>/sys/dev/sym/sym_hipd.c</filename>) by
 	  Gerard Roudier</para>
       </listitem>
 
       <listitem>
 	<para>aic7xxx
 	  (<filename>/sys/dev/aic7xxx/aic7xxx.c</filename>) by Justin
 	  T. Gibbs</para>
       </listitem>
     </itemizedlist>
 
     <para>and from the CAM code itself (by Justin T. Gibbs, see
       <filename>/sys/cam/*</filename>).  When some solution looked the
       most logical and was essentially verbatim extracted from the
       code by Justin T. Gibbs, I marked it as
       <quote>recommended</quote>.</para>
 
     <para>The document is illustrated with examples in
       pseudo-code.  Although sometimes the examples have many details
       and look like real code, it is still pseudo-code.  It was
       written to demonstrate the concepts in an understandable way.
       For a real driver other approaches may be more modular and
       efficient.  It also abstracts from the hardware details, as well
       as issues that would cloud the demonstrated concepts or that are
       supposed to be described in the other chapters of the developers
       handbook.  Such details are commonly shown as calls to functions
       with descriptive names, comments or pseudo-statements.
       Fortunately real life full-size examples with all the details
       can be found in the real drivers.</para>
   </sect1>
 
   <sect1 xml:id="scsi-general">
     <title>General Architecture</title>
 
     <indexterm>
       <primary>Common Access Method (CAM)</primary>
     </indexterm>
 
     <para>CAM stands for Common Access Method.  It is a generic way to
       address the I/O buses in a SCSI-like way.  This allows a
       separation of the generic device drivers from the drivers
       controlling the I/O bus: for example, the disk driver becomes
       able to control disks on SCSI, IDE, or any other bus, so
       the disk driver portion does not have to be rewritten (or copied
       and modified) for every new I/O bus.  Thus the two most
       important active entities are:</para>
 
     <indexterm><primary>CD-ROM</primary></indexterm>
     <indexterm><primary>tape</primary></indexterm>
     <indexterm><primary>IDE</primary></indexterm>
     <itemizedlist>
       <listitem>
 	<para><emphasis>Peripheral Modules</emphasis> - a
 	  driver for peripheral devices (disk, tape, CD-ROM,
 	  etc.)</para>
       </listitem>
 
       <listitem>
 	<para><emphasis>SCSI Interface Modules</emphasis> (SIM) - a
 	  Host Bus Adapter driver for connecting to an I/O bus such
 	  as SCSI or IDE.</para>
       </listitem>
     </itemizedlist>
 
     <para>A peripheral driver receives requests from the OS, converts
       them to a sequence of SCSI commands and passes these SCSI
       commands to a SCSI Interface Module.  The SCSI Interface Module
       is responsible for passing these commands to the actual hardware
       (or if the actual hardware is not SCSI but, for example, IDE
       then also converting the SCSI commands to the native commands of
       the hardware).</para>
 
-    <para>Because we are interested in writing a SCSI adapter driver
+    <para>As we are interested in writing a SCSI adapter driver
       here, from this point on we will consider everything from the
       SIM standpoint.</para>
 
     <para>A typical SIM driver needs to include the following
       CAM-related header files:</para>
 
     <programlisting>#include &lt;cam/cam.h&gt;
 #include &lt;cam/cam_ccb.h&gt;
 #include &lt;cam/cam_sim.h&gt;
 #include &lt;cam/cam_xpt_sim.h&gt;
 #include &lt;cam/cam_debug.h&gt;
 #include &lt;cam/scsi/scsi_all.h&gt;</programlisting>
 
     <para>The first thing each SIM driver must do is register itself
       with the CAM subsystem.  This is done during the driver's
       <function>xxx_attach()</function> function (here and further
       xxx_ is used to denote the unique driver name prefix).  The
       <function>xxx_attach()</function> function itself is called by
       the system bus auto-configuration code which we do not describe
       here.</para>
 
     <para>This is achieved in multiple steps: first it is necessary to
       allocate the queue of requests associated with this SIM:</para>
 
     <programlisting>    struct cam_devq *devq;
 
     if(( devq = cam_simq_alloc(SIZE) )==NULL) {
         error; /* some code to handle the error */
     }</programlisting>
 
     <para>Here <literal>SIZE</literal> is the size of the queue to be
       allocated, that is, the maximum number of requests it can hold.  It is
       the number of requests that the SIM driver can handle in
       parallel on one SCSI card.  Commonly it can be calculated
       as:</para>
 
     <programlisting>SIZE = NUMBER_OF_SUPPORTED_TARGETS * MAX_SIMULTANEOUS_COMMANDS_PER_TARGET</programlisting>
 
     <para>Next we create a descriptor of our SIM:</para>
 
     <programlisting>    struct cam_sim *sim;
 
     if(( sim = cam_sim_alloc(action_func, poll_func, driver_name,
             softc, unit, mtx, max_dev_transactions,
             max_tagged_dev_transactions, devq) )==NULL) {
         cam_simq_free(devq);
         error; /* some code to handle the error */
     }</programlisting>
 
     <para>Note that if we are not able to create a SIM descriptor we
       free the <varname remap="structname">devq</varname> also because we can do
       nothing else with it and we want to conserve memory.</para>
 
     <para>If a SCSI card has multiple SCSI
       buses<indexterm><primary>SCSI</primary><secondary>bus</secondary></indexterm>
       on it then each bus requires its own
       <varname remap="structname">cam_sim</varname> structure.</para>
 
     <para>An interesting question is what to do if a SCSI card has
       more than one SCSI bus, do we need one
       <varname remap="structname">devq</varname> structure per card or per SCSI
       bus?  The answer given in the comments to the CAM code is:
       either way, as the driver's author prefers.</para>
 
     <para>The arguments are:</para>
 
     <itemizedlist>
       <listitem>
 	<para><function>action_func</function> - pointer to
 	  the driver's <function>xxx_action</function> function.
 	  <funcsynopsis>
 	    <funcprototype>
 	      <funcdef>static void
 		<function>xxx_action</function>
 	      </funcdef>
 	      <paramdef>
 		<parameter>struct cam_sim *sim</parameter>,
 		<parameter>union ccb *ccb</parameter>
 	      </paramdef>
 	    </funcprototype>
 	  </funcsynopsis></para>
       </listitem>
 
       <listitem>
 	<para><function>poll_func</function> - pointer to
 	  the driver's <function>xxx_poll()</function>
 	  <funcsynopsis>
 	    <funcprototype>
 	      <funcdef>static void
 		<function>xxx_poll</function>
 	      </funcdef>
 	      <paramdef>
 		<parameter>struct cam_sim *sim</parameter>
 	      </paramdef>
 	    </funcprototype>
 	  </funcsynopsis></para>
       </listitem>
 
       <listitem>
 	<para>driver_name - the name of the actual driver,
 	  such as <quote>ncr</quote> or
 	  <quote>wds</quote>.</para>
       </listitem>
 
       <listitem>
 	<para><varname remap="structname">softc</varname> - pointer to the driver's
 	  internal descriptor for this SCSI card.  This pointer will
 	  be used by the driver in future to get private
 	  data.</para>
       </listitem>
 
       <listitem>
 	<para>unit - the controller unit number, for example
 	  for controller <quote>mps0</quote> this number will be
 	  0</para>
       </listitem>
 
       <listitem>
 	<para>mtx - Lock associated with this SIM. For SIMs that don't
 	know about locking, pass in Giant. For SIMs that do, pass in
 	the lock used to guard this SIM's data structures. This lock
 	will be held when xxx_action and xxx_poll are called.</para>
       </listitem>
 
       <listitem>
 	<para>max_dev_transactions - maximal number of simultaneous
 	  transactions per SCSI target in the non-tagged mode.  This
 	  value will be almost universally equal to 1, with possible
 	  exceptions only for the non-SCSI cards.  Also the drivers
 	  that hope to take advantage by preparing one transaction
 	  while another one is executed may set it to 2 but this does
 	  not seem to be worth the complexity.</para>
       </listitem>
 
       <listitem>
 	<para>max_tagged_dev_transactions - the same thing, but in the
 	  tagged mode.  Tags are the SCSI way to initiate multiple
 	  transactions on a device: each transaction is assigned a
 	  unique tag and the transaction is sent to the device.  When
 	  the device completes some transaction it sends back the
 	  result together with the tag so that the SCSI adapter (and
 	  the driver) can tell which transaction was completed.  This
 	  argument is also known as the maximal tag depth.  It depends
 	  on the abilities of the SCSI adapter.</para>
       </listitem>
 
     </itemizedlist>
 
     <para>Finally we register the SCSI buses associated with our SCSI
       adapter<indexterm><primary>SCSI</primary><secondary>adapter</secondary></indexterm>:</para>
 
     <programlisting>    if(xpt_bus_register(sim, softc, bus_number) != CAM_SUCCESS) {
         cam_sim_free(sim, /*free_devq*/ TRUE);
         error; /* some code to handle the error */
     }</programlisting>
 
     <para>If there is one <varname remap="structname">devq</varname> structure per
       SCSI bus (i.e., we consider a card with multiple buses as
       multiple cards with one bus each) then the bus number will
       always be 0; otherwise each bus on the SCSI card should get a
       distinct number.  Each bus needs its own separate structure
       cam_sim.</para>
 
     <para>After that our controller is completely hooked to the CAM
       system.  The value of <varname remap="structname">devq</varname> can be
       discarded now: sim will be passed as an argument in all further
       calls from CAM and devq can be derived from it.</para>
 
     <para>CAM provides a framework for asynchronous events.
       Some events originate from the lower levels (the SIM drivers),
       some events originate from the peripheral drivers, some events
       originate from the CAM subsystem itself.  Any driver can
       register callbacks for some types of the asynchronous events, so
       that it would be notified if these events occur.</para>
 
     <para>A typical example of such an event is a device reset.  Each
       transaction and event identifies the devices to which it applies
       by the means of <quote>path</quote>.  The target-specific events
       normally occur during a transaction with this device.  So the
       path from that transaction may be re-used to report this event
       (this is safe because the event path is copied in the event
       reporting routine but not deallocated nor passed anywhere
       further).  Also it is safe to allocate paths dynamically at any
       time including the interrupt routines, although that incurs
       certain overhead, and a possible problem with this approach is
       that there may be no free memory at that time.  For a bus reset
       event we need to define a wildcard path including all devices on
       the bus.  So we can create the path for the future bus reset
       events in advance and avoid problems with the future memory
       shortage:</para>
 
     <programlisting>    struct cam_path *path;
 
     if(xpt_create_path(&amp;path, /*periph*/NULL,
                 cam_sim_path(sim), CAM_TARGET_WILDCARD,
                 CAM_LUN_WILDCARD) != CAM_REQ_CMP) {
         xpt_bus_deregister(cam_sim_path(sim));
         cam_sim_free(sim, /*free_devq*/TRUE);
         error; /* some code to handle the error */
     }
 
     softc-&gt;wpath = path;
     softc-&gt;sim = sim;</programlisting>
 
     <para>As you can see the path includes:</para>
 
     <itemizedlist>
       <listitem>
 	<para>ID of the peripheral driver (NULL here because we have
 	  none)</para>
       </listitem>
 
       <listitem>
 	<para>ID of the SIM driver
 	  (<function>cam_sim_path(sim)</function>)</para>
       </listitem>
 
       <listitem>
 	<para>SCSI target number of the device (CAM_TARGET_WILDCARD
 	  means <quote>all devices</quote>)</para>
       </listitem>
 
       <listitem>
 	<para>SCSI LUN number of the subdevice (CAM_LUN_WILDCARD means
 	  <quote>all LUNs</quote>)</para>
       </listitem>
     </itemizedlist>
 
     <para>If the driver cannot allocate this path it will not be able
       to work normally, so in that case we dismantle that SCSI
       bus.</para>
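 
     <para>The order in which resources are released on failure can be
       summarized in a small stand-alone sketch.  It uses no CAM calls
       at all: <function>ex_step()</function>,
       <function>ex_attach()</function>, and the
       <literal>ex_*</literal> variables are hypothetical stand-ins
       that only record which resource is currently held, and the
       comments name the real call each step represents.  The sketch
       assumes, as in the fragments above, that freeing the SIM with
       <literal>free_devq</literal> set also releases the
       queue.</para>

```c
#include <assert.h>

/* Hypothetical bookkeeping: which resources are currently held. */
static int ex_devq_held, ex_sim_held, ex_bus_registered, ex_path_held;
static int ex_fail_at;	/* 0 = no failure, 1..4 = step that fails */

/* Pretend to acquire one resource; fail when told to. */
static int
ex_step(int step, int *held)
{
	assert(step >= 1 && step <= 4);
	if (ex_fail_at == step)
		return (-1);
	*held = 1;
	return (0);
}

/* Acquire everything in order; undo in reverse order on failure. */
static int
ex_attach(void)
{
	if (ex_step(1, &ex_devq_held) != 0)		/* cam_simq_alloc() */
		goto fail;
	if (ex_step(2, &ex_sim_held) != 0)		/* cam_sim_alloc() */
		goto fail_devq;
	if (ex_step(3, &ex_bus_registered) != 0)	/* xpt_bus_register() */
		goto fail_sim;
	if (ex_step(4, &ex_path_held) != 0)		/* xpt_create_path() */
		goto fail_bus;
	return (0);

fail_bus:
	ex_bus_registered = 0;	/* xpt_bus_deregister() */
fail_sim:
	ex_sim_held = 0;	/* cam_sim_free(sim, TRUE) ... */
	ex_devq_held = 0;	/* ... which also frees the queue */
	return (-1);
fail_devq:
	ex_devq_held = 0;	/* cam_simq_free() */
fail:
	return (-1);
}
```

     <para>Setting <literal>ex_fail_at</literal> to each step in turn
       and calling <function>ex_attach()</function> is a simple way to
       check that no resource stays held after a failed
       attach.</para>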
 
     <para>And we save the path pointer in the
       <varname remap="structname">softc</varname> structure for future use.  After
       that we save the value of sim (or we can also discard it on the
       exit from <function>xxx_attach()</function> if we wish).</para>
 
     <para>That is all for a minimalistic initialization.  To do things
       right there is one more issue left.</para>
 
     <para>For a SIM driver there is one particularly interesting
       event: when a target device is considered lost.  In this case
       resetting the SCSI negotiations with this device may be a good
       idea.  So we register a callback for this event with CAM.  The
       request is passed to CAM by requesting CAM action on a CAM
       control block for this type of request:</para>
 
     <programlisting>    struct ccb_setasync csa;
 
     xpt_setup_ccb(&amp;csa.ccb_h, path, /*priority*/5);
     csa.ccb_h.func_code = XPT_SASYNC_CB;
     csa.event_enable = AC_LOST_DEVICE;
     csa.callback = xxx_async;
     csa.callback_arg = sim;
     xpt_action((union ccb *)&amp;csa);</programlisting>
 
     <para>Now we take a look at the <function>xxx_action()</function>
       and <function>xxx_poll()</function> driver entry points.</para>
 
     <para>
       <funcsynopsis>
 	<funcprototype>
 	  <funcdef>static void
 	    <function>xxx_action</function>
 	  </funcdef>
 	  <paramdef>
 	    <parameter>struct cam_sim *sim</parameter>,
 	    <parameter>union ccb *ccb</parameter>
 	  </paramdef>
 	</funcprototype>
       </funcsynopsis></para>
 
     <para>Do some action on request of the CAM subsystem.  Sim
       describes the SIM for the request, CCB is the request itself.
       CCB stands for <quote>CAM Control Block</quote>.  It is a union
       of many specific instances, each describing arguments for some
       type of transactions.  All of these instances share the CCB
       header where the common part of arguments is stored.</para>
 
     <para>CAM supports the SCSI controllers working in both initiator
       (<quote>normal</quote>) mode and target (simulating a SCSI
       device) mode.  Here we only consider the part relevant to the
       initiator mode.</para>
 
     <para>There are a few functions and macros (in other words,
       methods) defined to access the public data in the struct
       cam_sim:</para>
 
     <itemizedlist>
       <listitem>
 	<para><function>cam_sim_path(sim)</function> - the path ID
 	  (see above)</para>
       </listitem>
 
       <listitem>
 	<para><function>cam_sim_name(sim)</function> - the name of the
 	  sim</para>
       </listitem>
 
       <listitem>
 	<para><function>cam_sim_softc(sim)</function> - the pointer to
 	  the softc (driver private data) structure</para>
       </listitem>
 
       <listitem>
 	<para><function>cam_sim_unit(sim)</function> - the unit
 	  number</para>
       </listitem>
 
       <listitem>
 	<para><function>cam_sim_bus(sim)</function> - the bus
 	  ID</para>
       </listitem>
     </itemizedlist>
 
     <para>To identify the device, <function>xxx_action()</function>
       can get the unit number and pointer to its structure softc using
       these functions.</para>
 
     <para>The type of request is stored in
       <varname remap="structfield">ccb-&gt;ccb_h.func_code</varname>.  So
       generally <function>xxx_action()</function> consists of a big
       switch:</para>
 
     <programlisting>    struct xxx_softc *softc = (struct xxx_softc *) cam_sim_softc(sim);
     struct ccb_hdr *ccb_h = &amp;ccb-&gt;ccb_h;
     int unit = cam_sim_unit(sim);
     int bus = cam_sim_bus(sim);
 
     switch(ccb_h-&gt;func_code) {
     case ...:
         ...
     default:
         ccb_h-&gt;status = CAM_REQ_INVALID;
         xpt_done(ccb);
         break;
     }</programlisting>
 
     <para>As can be seen from the default case (if an unknown command
       was received) the return code of the command is set into
       <varname remap="structfield">ccb-&gt;ccb_h.status</varname> and the
       completed CCB is returned back to CAM by calling
       <function>xpt_done(ccb)</function>.</para>
 
     <para><function>xpt_done()</function> does not have to be called
       from <function>xxx_action()</function>: For example an I/O
       request may be enqueued inside the SIM driver and/or its SCSI
       controller.  Then, when the device posts an interrupt
       signaling that the processing of this request is complete,
       <function>xpt_done()</function> may be called from the interrupt
       handling routine.</para>
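 
     <para>The two completion styles can be contrasted in a reduced,
       stand-alone sketch.  None of the types below are the real CAM
       definitions: <literal>ex_ccb</literal>,
       <function>ex_done()</function> (standing in for
       <function>xpt_done()</function>), and the
       <literal>EX_*</literal> constants are assumptions made so that
       the fragment compiles on its own.  One request type is completed
       inside the action routine, one is left in progress for a later
       interrupt, and anything unknown is rejected in the default
       case.</para>

```c
#include <assert.h>

/* Hypothetical, heavily reduced stand-ins for the CAM types. */
enum ex_func_code { EX_XPT_SCSI_IO, EX_XPT_RESET_BUS, EX_XPT_UNKNOWN };
enum ex_status { EX_REQ_INPROG, EX_REQ_CMP, EX_REQ_INVALID };

struct ex_ccb_hdr {
	enum ex_func_code func_code;
	enum ex_status	  status;
};

union ex_ccb {
	struct ex_ccb_hdr ccb_h;	/* shared header of every variant */
};

static int ex_done_calls;		/* counts completions for the sketch */

/* Stand-in for xpt_done(): hand the finished CCB back. */
static void
ex_done(union ex_ccb *ccb)
{
	/* A completed CCB must no longer be marked as in progress. */
	assert(ccb->ccb_h.status != EX_REQ_INPROG);
	ex_done_calls++;
}

/* Skeleton of an action routine: one case per request type. */
static void
ex_action(union ex_ccb *ccb)
{
	struct ex_ccb_hdr *ccb_h = &ccb->ccb_h;

	switch (ccb_h->func_code) {
	case EX_XPT_RESET_BUS:
		/* Completed synchronously. */
		ccb_h->status = EX_REQ_CMP;
		ex_done(ccb);
		break;
	case EX_XPT_SCSI_IO:
		/*
		 * Queued to the hardware; the status stays "in
		 * progress" and completion happens later, typically
		 * from the interrupt handler.
		 */
		break;
	default:
		ccb_h->status = EX_REQ_INVALID;
		ex_done(ccb);
		break;
	}
}
```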
 
     <para>Actually, the CCB status is not only assigned as a return
       code but a CCB has some status all the time.  Before CCB is
       passed to the <function>xxx_action()</function> routine it gets
       the status CAM_REQ_INPROG meaning that it is in progress.  There
       are a surprising number of status values defined in
       <filename>/sys/cam/cam.h</filename> which should be able to
       represent the status of a request in great detail.  More
       interesting yet, the status is in fact a <quote>bitwise
       or</quote> of an enumerated status value (the lower 6 bits) and
       possible additional flag-like bits (the upper bits).  The
       enumerated values will be discussed later in more detail.  The
       summary of them can be found in the Errors Summary section.  The
       possible status flags are:</para>
 
     <itemizedlist>
       <listitem>
 	<para><emphasis>CAM_DEV_QFRZN</emphasis> - if the SIM driver
 	  gets a serious error (for example, the device does not
 	  respond to the selection or breaks the SCSI protocol) when
 	  processing a CCB it should freeze the request queue by
 	  calling <function>xpt_freeze_simq()</function>, return the
 	  other enqueued but not processed yet CCBs for this device
 	  back to the CAM queue, then set this flag for the
 	  troublesome CCB and call <function>xpt_done()</function>.
 	  This flag causes the CAM subsystem to unfreeze the queue
 	  after it handles the error.</para>
       </listitem>
 
       <listitem>
 	<para><emphasis>CAM_AUTOSNS_VALID</emphasis> - if the
 	  device returned an error condition and the flag
 	  CAM_DIS_AUTOSENSE is not set in CCB the SIM driver must
 	  execute the REQUEST SENSE command automatically to extract
 	  the sense (extended error information) data from the device.
 	  If this attempt was successful the sense data should be
 	  saved in the CCB and this flag set.</para>
       </listitem>
 
       <listitem>
 	<para><emphasis>CAM_RELEASE_SIMQ</emphasis> - like
 	  CAM_DEV_QFRZN but used in case there is some problem (or
 	  resource shortage) with the SCSI controller itself.  Then
 	  all the future requests to the controller should be stopped
 	  by <function>xpt_freeze_simq()</function>.  The controller
 	  queue will be restarted after the SIM driver overcomes the
 	  shortage and informs CAM by returning some CCB with this
 	  flag set.</para>
       </listitem>
 
       <listitem>
 	<para><emphasis>CAM_SIM_QUEUED</emphasis> - when SIM puts a
 	  CCB into its request queue this flag should be set (and
 	  removed when this CCB gets dequeued before being returned
 	  back to CAM).  This flag is not used anywhere in the CAM
 	  code now, so its purpose is purely diagnostic.</para>
       </listitem>
 
       <listitem>
 	<para><emphasis>CAM_QOS_VALID</emphasis> - The QOS data
 	  is now valid.</para>
       </listitem>
     </itemizedlist>
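 
     <para>Since the status word combines an enumerated value with flag
       bits, code that inspects or updates it has to mask
       appropriately.  The following sketch illustrates the idea with
       locally defined <literal>EX_*</literal> constants.  Their
       numeric values are chosen for the example and should not be
       relied on; a real driver takes the corresponding
       <literal>CAM_*</literal> definitions from
       <filename>/sys/cam/cam.h</filename>.</para>

```c
#include <assert.h>
#include <stdint.h>

/*
 * Illustrative values only; a real driver uses the CAM_* definitions
 * from <cam/cam.h> rather than defining its own.
 */
#define	EX_STATUS_MASK		0x3fu	/* lower 6 bits: enumerated value */
#define	EX_REQ_INPROG		0x00u
#define	EX_REQ_CMP		0x01u
#define	EX_SCSI_STATUS_ERROR	0x0cu
#define	EX_DEV_QFRZN		0x40u	/* flag bits live above the mask */
#define	EX_AUTOSNS_VALID	0x80u

/* Extract the enumerated part of a status word. */
static uint32_t
ex_status_code(uint32_t status)
{
	return (status & EX_STATUS_MASK);
}

/* Replace the enumerated part while keeping the flag bits. */
static uint32_t
ex_set_status_code(uint32_t status, uint32_t code)
{
	assert((code & ~EX_STATUS_MASK) == 0);
	return ((status & ~EX_STATUS_MASK) | code);
}

/* Status for a failed command with valid sense data and a frozen queue. */
static uint32_t
ex_error_status(void)
{
	return (EX_SCSI_STATUS_ERROR | EX_AUTOSNS_VALID | EX_DEV_QFRZN);
}
```

     <para>Comparing the whole status word against an enumerated value
       would fail as soon as any flag is set, which is why the masked
       comparison is the safer habit.</para>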
 
     <para>The function <function>xxx_action()</function> is not
       allowed to sleep, so all the synchronization for resource access
       must be done using SIM or device queue freezing.  Besides the
       aforementioned flags the CAM subsystem provides functions
       <function>xpt_release_simq()</function> and
       <function>xpt_release_devq()</function> to unfreeze the queues
       directly, without passing a CCB to CAM.</para>
 
     <para>The CCB header contains the following fields:</para>
 
     <itemizedlist>
       <listitem>
 	<para><emphasis>path</emphasis> - path ID for the
 	  request</para>
       </listitem>
 
       <listitem>
 	<para><emphasis>target_id</emphasis> - target device ID for
 	  the request</para>
       </listitem>
 
       <listitem>
 	<para><emphasis>target_lun</emphasis> - LUN ID of the target
 	  device</para>
       </listitem>
 
       <listitem>
 	<para><emphasis>timeout</emphasis> - timeout interval for this
 	  command, in milliseconds</para>
       </listitem>
 
       <listitem>
 	<para><emphasis>timeout_ch</emphasis> - a convenience place
 	  for the SIM driver to store the timeout handle (the CAM
 	  subsystem itself does not make any assumptions about
 	  it)</para>
       </listitem>
 
       <listitem>
 	<para><emphasis>flags</emphasis> - various bits of information
 	  about the request</para>
       </listitem>
 
       <listitem>
 	<para><emphasis>spriv_ptr0, spriv_ptr1</emphasis> - fields
 	  reserved for private use by the SIM driver (such as linking
 	  to the SIM queues or SIM private control blocks); actually,
 	  they exist as unions: spriv_ptr0 and spriv_ptr1 have the
 	  type (void *), spriv_field0 and spriv_field1 have the type
 	  unsigned long, sim_priv.entries[0].bytes and
 	  sim_priv.entries[1].bytes are byte arrays of the size
 	  consistent with the other incarnations of the union and
 	  sim_priv.bytes is one array, twice as large.</para>
       </listitem>
     </itemizedlist>
 
     <para>The recommended way of using the SIM private fields of CCB
       is to define some meaningful names for them and use these
       meaningful names in the driver, like:</para>
 
     <programlisting>#define ccb_some_meaningful_name    sim_priv.entries[0].bytes
 #define ccb_hcb spriv_ptr1 /* for hardware control block */</programlisting>
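 
     <para>How such names read in practice can be seen in the following
       stand-alone sketch.  The structure below is a deliberately
       simplified, hypothetical model of the private area, not the
       layout from <filename>/sys/cam/cam_ccb.h</filename>, and
       <literal>ex_hcb</literal>, <literal>ex_ccb_hcb</literal>, and
       <literal>ex_ccb_retries</literal> are invented names.</para>

```c
#include <assert.h>
#include <stddef.h>

/* Hypothetical reduction of the SIM-private area of a CCB header. */
struct ex_ccb_hdr {
	union {
		void		*ptr;
		unsigned long	 field;
	} sim_priv[2];
};

/* Meaningful driver-local names for the two private slots. */
#define	ex_ccb_hcb	sim_priv[0].ptr		/* hardware control block */
#define	ex_ccb_retries	sim_priv[1].field	/* retry counter */

struct ex_hcb {
	int	slot;	/* hypothetical index of the hardware slot */
};

/* Attach a hardware control block to a CCB and reset its counter. */
static void
ex_bind_hcb(struct ex_ccb_hdr *ccb_h, struct ex_hcb *hcb)
{
	assert(hcb != NULL);
	ccb_h->ex_ccb_hcb = hcb;
	ccb_h->ex_ccb_retries = 0;
}

/* Recover the hardware control block, for example at interrupt time. */
static struct ex_hcb *
ex_hcb_of(struct ex_ccb_hdr *ccb_h)
{
	return (ccb_h->ex_ccb_hcb);
}
```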
 
     <para>The most common initiator mode requests are:</para>
 
     <itemizedlist>
       <listitem>
 	<para><emphasis>XPT_SCSI_IO</emphasis> - execute an I/O
 	  transaction</para>
 
 	<para>The instance <quote>struct ccb_scsiio csio</quote> of
 	  the union ccb is used to transfer the arguments.  They
 	  are:</para>
 
 	<itemizedlist>
 	  <listitem>
 	    <para><emphasis>cdb_io</emphasis> - pointer to the SCSI
 	      command buffer or the buffer itself</para>
 	  </listitem>
 
 	  <listitem>
 	    <para><emphasis>cdb_len</emphasis> - SCSI command
 	      length</para>
 	  </listitem>
 
 	  <listitem>
 	    <para><emphasis>data_ptr</emphasis> - pointer to the data
 	      buffer (gets a bit complicated if scatter/gather is
 	      used)</para>
 	  </listitem>
 
 	  <listitem>
 	    <para><emphasis>dxfer_len</emphasis> - length of the data
 	      to transfer</para>
 	  </listitem>
 
 	  <listitem>
 	    <para><emphasis>sglist_cnt</emphasis> - counter of the
 	      scatter/gather segments</para>
 	  </listitem>
 
 	  <listitem>
 	    <para><emphasis>scsi_status</emphasis> - place to return
 	      the SCSI status</para>
 	  </listitem>
 
 	  <listitem>
 	    <para><emphasis>sense_data</emphasis> - buffer for the
 	      SCSI sense information if the command returns an error
 	      (the SIM driver is supposed to run the REQUEST SENSE
 	      command automatically in this case if the CCB flag
 	      CAM_DIS_AUTOSENSE is not set)</para>
 	  </listitem>
 
 	  <listitem>
 	    <para><emphasis>sense_len</emphasis> - the length of that
 	      buffer (if it happens to be larger than the size of
 	      sense_data the SIM driver must silently assume the
 	      smaller value)</para>
 	  </listitem>
 
 	  <listitem>
 	    <para><emphasis>resid, sense_resid</emphasis> - if the
 	      transfer of data or SCSI sense returned an error these
 	      are the returned counters of the residual (not
 	      transferred) data.  They do not seem to be especially
 	      meaningful, so in a case when they are difficult to
 	      compute (say, counting bytes in the SCSI controller's
 	      FIFO buffer) an approximate value will do as well.  For a
 	      successfully completed transfer they must be set to
 	      zero.</para>
 	  </listitem>
 
 	  <listitem>
 	    <para><emphasis>tag_action</emphasis> - the kind of tag to
 	      use:</para>
 
 	    <itemizedlist>
 	      <listitem>
 		<para>CAM_TAG_ACTION_NONE - do not use tags for this
 		  transaction</para>
 	      </listitem>
 
 	      <listitem>
 		<para>MSG_SIMPLE_Q_TAG, MSG_HEAD_OF_Q_TAG,
 		  MSG_ORDERED_Q_TAG - value equal to the appropriate
 		  tag message (see /sys/cam/scsi/scsi_message.h); this
 		  gives only the tag type, the SIM driver must assign
 		  the tag value itself</para>
 	      </listitem>
 	    </itemizedlist>
 	  </listitem>
 	</itemizedlist>
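 
 	<para>The clamping rule for <emphasis>sense_len</emphasis>
 	  described above can be sketched in isolation.  This is only
 	  an illustration that compiles outside the kernel;
 	  <literal>struct xxx_sense_req</literal>,
 	  <literal>XXX_SENSE_BUF_SIZE</literal> and
 	  <function>xxx_effective_sense_len()</function> are
 	  hypothetical stand-ins for the corresponding parts of
 	  <literal>struct ccb_scsiio</literal>, not the real CAM
 	  definitions:</para>
 
 	<programlisting>#include &lt;stddef.h&gt;
 #include &lt;stdint.h&gt;
 
 #define XXX_SENSE_BUF_SIZE 32   /* hypothetical size of the sense buffer */
 
 /* hypothetical stand-in for the sense-related fields of struct ccb_scsiio */
 struct xxx_sense_req {
     uint8_t sense_data[XXX_SENSE_BUF_SIZE];
     uint8_t sense_len;      /* length requested by the caller */
     uint8_t sense_resid;    /* sense bytes not transferred */
 };
 
 /* Return the number of sense bytes the driver may actually store:
  * the requested length, silently limited to the buffer size.
  */
 static size_t
 xxx_effective_sense_len(const struct xxx_sense_req *req)
 {
     size_t want = req-&gt;sense_len;
 
     return (want &gt; sizeof(req-&gt;sense_data)) ? sizeof(req-&gt;sense_data) : want;
 }</programlisting>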
 
 	<para>The general logic of handling this request is the
 	  following:</para>
 
 	<para>The first thing to do is to check for possible races, to
 	  make sure that the command did not get aborted when it was
 	  sitting in the queue:</para>
 
 	<programlisting>    struct ccb_scsiio *csio = &amp;ccb-&gt;csio;
 
     if ((ccb_h-&gt;status &amp; CAM_STATUS_MASK) != CAM_REQ_INPROG) {
         xpt_done(ccb);
         return;
     }</programlisting>
 
 	<para>Also we check that the device is supported at all by our
 	  controller:</para>
 
 	<programlisting>    if(ccb_h-&gt;target_id &gt; OUR_MAX_SUPPORTED_TARGET_ID
     || ccb_h-&gt;target_id == OUR_SCSI_CONTROLLERS_OWN_ID) {
         ccb_h-&gt;status = CAM_TID_INVALID;
         xpt_done(ccb);
         return;
     }
     if(ccb_h-&gt;target_lun &gt; OUR_MAX_SUPPORTED_LUN) {
         ccb_h-&gt;status = CAM_LUN_INVALID;
         xpt_done(ccb);
         return;
     }</programlisting>
 
 	<para>Then allocate whatever data structures (such as
 	  card-dependent hardware control
 	  block<indexterm><primary>hardware control
 	    block</primary></indexterm>) we need to process this
 	  request.  If we cannot, freeze the SIM queue,
 	  remember that we have a pending operation, return the CCB
 	  back and ask CAM to re-queue it.  Later, when the resources
 	  become available, the SIM queue must be unfrozen by returning
 	  a CCB with the <literal>CAM_RELEASE_SIMQ</literal> bit set
 	  in its status.  Otherwise, if all went well, link the CCB
 	  with the hardware control block (HCB) and mark it as
 	  queued.</para>
 
 	<programlisting>    struct xxx_hcb *hcb = allocate_hcb(softc, unit, bus);
 
     if(hcb == NULL) {
         softc-&gt;flags |= RESOURCE_SHORTAGE;
         xpt_freeze_simq(sim, /*count*/1);
         ccb_h-&gt;status = CAM_REQUEUE_REQ;
         xpt_done(ccb);
         return;
     }
 
     hcb-&gt;ccb = ccb; ccb_h-&gt;ccb_hcb = (void *)hcb;
     ccb_h-&gt;status |= CAM_SIM_QUEUED;</programlisting>
 
 	<para>Extract the target data from the CCB into the hardware
 	  control block.  Check whether we are asked to assign a tag
 	  and, if so, generate a unique tag and build the SCSI tag
 	  messages.  The SIM driver is also responsible for
 	  negotiations with the devices to set the maximal mutually
 	  supported bus width, synchronous rate and offset.</para>
 
 	<programlisting>    hcb-&gt;target = ccb_h-&gt;target_id; hcb-&gt;lun = ccb_h-&gt;target_lun;
     generate_identify_message(hcb);
     if( csio-&gt;tag_action != CAM_TAG_ACTION_NONE )
         generate_unique_tag_message(hcb, csio-&gt;tag_action);
     if( !target_negotiated(hcb) )
         generate_negotiation_messages(hcb);</programlisting>
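 
 	<para>How the unique tag value is chosen is left to the
 	  driver.  One possible approach, shown here only as a sketch
 	  that compiles outside the kernel, is a bitmap of the tag
 	  values currently in use.  The names
 	  <literal>struct xxx_tag_map</literal>,
 	  <function>xxx_tag_alloc()</function> and
 	  <function>xxx_tag_free()</function> are hypothetical, and
 	  whether one map is kept per LUN, per target or per bus is an
 	  assumption left to the driver:</para>
 
 	<programlisting>#include &lt;stdint.h&gt;
 
 #define XXX_NTAGS 256   /* hypothetical: tag values fit in one byte */
 
 /* hypothetical bookkeeping of tags in use, one bit per tag value */
 struct xxx_tag_map {
     uint32_t used[XXX_NTAGS / 32];
 };
 
 /* Return a tag value not currently in use and mark it used,
  * or -1 if all tags are taken.
  */
 static int
 xxx_tag_alloc(struct xxx_tag_map *map)
 {
     int tag;
 
     for (tag = 0; tag &lt; XXX_NTAGS; tag++) {
         if ((map-&gt;used[tag / 32] &amp; (UINT32_C(1) &lt;&lt; (tag % 32))) == 0) {
             map-&gt;used[tag / 32] |= UINT32_C(1) &lt;&lt; (tag % 32);
             return tag;
         }
     }
     return -1;
 }
 
 /* Release a tag when its transaction completes. */
 static void
 xxx_tag_free(struct xxx_tag_map *map, int tag)
 {
     map-&gt;used[tag / 32] &amp;= ~(UINT32_C(1) &lt;&lt; (tag % 32));
 }</programlisting>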
 
 	<para>Then set up the SCSI command.  The command storage may
 	  be specified in the CCB in several ways, selected
 	  by the CCB flags.  The command buffer can be contained in
 	  the CCB or pointed to; in the latter case the pointer may be
 	  physical or virtual.  Since the hardware commonly needs a
 	  physical address we always convert the address to the
 	  physical one, typically using the busdma API.</para>
 
 	<para>If a physical address is
 	  requested it is OK to return the CCB with the status
 	  <errorname>CAM_REQ_INVALID</errorname>; the current drivers
 	  do that.  If necessary a physical address can also be
 	  converted or mapped back to a virtual address, but that is
 	  painful, so we do not do it.</para>
 
 	<programlisting>    if(ccb_h-&gt;flags &amp; CAM_CDB_POINTER) {
         /* CDB is a pointer */
         if(!(ccb_h-&gt;flags &amp; CAM_CDB_PHYS)) {
             /* CDB pointer is virtual */
             hcb-&gt;cmd = vtobus(csio-&gt;cdb_io.cdb_ptr);
         } else {
             /* CDB pointer is physical */
             hcb-&gt;cmd = csio-&gt;cdb_io.cdb_ptr ;
         }
     } else {
         /* CDB is in the ccb (buffer) */
         hcb-&gt;cmd = vtobus(csio-&gt;cdb_io.cdb_bytes);
     }
     hcb-&gt;cmdlen = csio-&gt;cdb_len;</programlisting>
 
 	<para>Now it is time to set up the data.  Again, the data
 	  storage may be specified in the CCB in many interesting
 	  ways, specified by the CCB flags.  First we get the
 	  direction of the data transfer.  The simplest case is if
 	  there is no data to transfer:</para>
 
 	<programlisting>    int dir = (ccb_h-&gt;flags &amp; CAM_DIR_MASK);
 
     if (dir == CAM_DIR_NONE)
         goto end_data;</programlisting>
 
 	<para>Then we check if the data is in one chunk or in a
 	  scatter-gather list, and the addresses are physical or
 	  virtual.  The SCSI controller may be able to handle only a
 	  limited number of chunks of limited length.  If the request
 	  hits this limitation we return an error.  We use a special
 	  function to return the CCB to handle in one place the HCB
 	  resource shortages.  The functions to add chunks are
 	  driver-dependent, and here we leave them without detailed
 	  implementation.  See description of the SCSI command (CDB)
 	  handling for the details on the address-translation issues.
 	  If some variation is too difficult or impossible to
 	  implement with a particular card it is OK to return the
 	  status <errorname>CAM_REQ_INVALID</errorname>.  Actually, it
 	  seems like the scatter-gather ability is not used anywhere
 	  in the CAM code now.  But at least the case for a single
 	  non-scattered virtual buffer must be implemented, it is
 	  actively used by CAM.</para>
 
 	<programlisting>    int rv;
 
     initialize_hcb_for_data(hcb);
 
     if(!(ccb_h-&gt;flags &amp; CAM_SCATTER_VALID)) {
         /* single buffer */
         if(!(ccb_h-&gt;flags &amp; CAM_DATA_PHYS)) {
             rv = add_virtual_chunk(hcb, csio-&gt;data_ptr, csio-&gt;dxfer_len, dir);
         } else {
             rv = add_physical_chunk(hcb, csio-&gt;data_ptr, csio-&gt;dxfer_len, dir);
         }
     } else {
         int i;
         struct bus_dma_segment *segs;
         segs = (struct bus_dma_segment *)csio-&gt;data_ptr;
 
         if ((ccb_h-&gt;flags &amp; CAM_SG_LIST_PHYS) != 0) {
             /* The SG list pointer is physical */
             rv = setup_hcb_for_physical_sg_list(hcb, segs, csio-&gt;sglist_cnt);
         } else if (!(ccb_h-&gt;flags &amp; CAM_DATA_PHYS)) {
             /* SG buffer pointers are virtual */
             for (i = 0; i &lt; csio-&gt;sglist_cnt; i++) {
                 rv = add_virtual_chunk(hcb, segs[i].ds_addr,
                     segs[i].ds_len, dir);
                 if (rv != CAM_REQ_CMP)
                     break;
             }
         } else {
             /* SG buffer pointers are physical */
             for (i = 0; i &lt; csio-&gt;sglist_cnt; i++) {
                 rv = add_physical_chunk(hcb, segs[i].ds_addr,
                     segs[i].ds_len, dir);
                 if (rv != CAM_REQ_CMP)
                     break;
             }
         }
     }
     if(rv != CAM_REQ_CMP) {
         /* we expect that add_*_chunk() functions return CAM_REQ_CMP
          * if they added a chunk successfully, CAM_REQ_TOO_BIG if
          * the request is too big (too many bytes or too many chunks),
          * CAM_REQ_INVALID in case of other troubles
          */
         free_hcb_and_ccb_done(hcb, ccb, rv);
         return;
     }
     end_data:</programlisting>
 
 	<para>If disconnection is disabled for this CCB we pass this
 	  information to the hcb:</para>
 
 	<programlisting>    if(ccb_h-&gt;flags &amp; CAM_DIS_DISCONNECT)
         hcb_disable_disconnect(hcb);</programlisting>
 
 	<para>If the controller is able to run REQUEST SENSE command
 	  all by itself then the value of the flag CAM_DIS_AUTOSENSE
 	  should also be passed to it, to prevent automatic REQUEST
 	  SENSE if the CAM subsystem does not want it.</para>
 
 	<para>The only thing left is to set up the timeout, pass our
 	  hcb to the hardware and return, the rest will be done by the
 	  interrupt handler (or timeout handler).</para>
 
 	<programlisting>    ccb_h-&gt;timeout_ch = timeout(xxx_timeout, (caddr_t) hcb,
         (ccb_h-&gt;timeout * hz) / 1000); /* convert milliseconds to ticks */
     put_hcb_into_hardware_queue(hcb);
     return;</programlisting>
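 
 	<para>The conversion above multiplies before dividing, so if
 	  the timeout field is a 32-bit integer a very long timeout
 	  could wrap around.  A minimal sketch of a more defensive
 	  conversion follows; the helper
 	  <function>xxx_ms_to_ticks()</function> is hypothetical (it
 	  is not part of CAM) and assumes a positive
 	  <varname>hz</varname>:</para>
 
 	<programlisting>#include &lt;limits.h&gt;
 #include &lt;stdint.h&gt;
 
 /* Hypothetical helper: convert a timeout given in milliseconds into
  * clock ticks for a clock running at hz ticks per second.
  */
 static int
 xxx_ms_to_ticks(uint32_t timeout_ms, int hz)
 {
     /* 64-bit intermediate so that timeout_ms * hz cannot wrap;
      * adding 999 rounds up to the next whole tick
      */
     uint64_t ticks = ((uint64_t)timeout_ms * (uint64_t)hz + 999) / 1000;
 
     if (ticks &gt; INT_MAX)
         ticks = INT_MAX;
     return (int)ticks;
 }</programlisting>
 
 	<para>Rounding up means that a short non-zero timeout never
 	  turns into zero ticks.</para>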
 
 	<para>And here is a possible implementation of the function
 	  returning CCB:</para>
 
 	<programlisting>    static void
     free_hcb_and_ccb_done(struct xxx_hcb *hcb, union ccb *ccb, u_int32_t status)
     {
         struct xxx_softc *softc;
 
         ccb-&gt;ccb_h.ccb_hcb = 0;
         if(hcb != NULL) {
             /* only dereference hcb after the NULL check */
             softc = hcb-&gt;softc;
             untimeout(xxx_timeout, (caddr_t) hcb, ccb-&gt;ccb_h.timeout_ch);
             /* we're about to free a hcb, so the shortage has ended */
             if(softc-&gt;flags &amp; RESOURCE_SHORTAGE)  {
                 softc-&gt;flags &amp;= ~RESOURCE_SHORTAGE;
                 status |= CAM_RELEASE_SIMQ;
             }
             free_hcb(hcb); /* also removes hcb from any internal lists */
         }
         ccb-&gt;ccb_h.status = status |
             (ccb-&gt;ccb_h.status &amp; ~(CAM_STATUS_MASK|CAM_SIM_QUEUED));
         xpt_done(ccb);
     }</programlisting>
       </listitem>
 
       <listitem>
 	<para><emphasis>XPT_RESET_DEV</emphasis> - send the SCSI
 	  <quote>BUS DEVICE RESET</quote> message to a device</para>
 
 	<para>There is no data transferred in the CCB except the
 	  header, and its most interesting argument is target_id.
 	  Depending on the controller hardware there are three
 	  options: a hardware control block may be constructed just
 	  like for the XPT_SCSI_IO request (see the XPT_SCSI_IO
 	  request description) and sent to the controller; the SCSI
 	  controller may be immediately programmed to send this RESET
 	  message to the device; or this request may simply not be
 	  supported (and return the status
 	  <errorname>CAM_REQ_INVALID</errorname>).  Also, on completion
 	  of the request all the disconnected transactions for this
 	  target must be aborted (probably in the interrupt
 	  routine).</para>
 
 	<para>All the current negotiations for the target are also
 	  lost on reset, so they may be cleaned too.  Alternatively,
 	  the clearing may be deferred, because the target would
 	  request re-negotiation on the next
 	  transaction anyway.</para>
       </listitem>
 
       <listitem>
 	<para><emphasis>XPT_RESET_BUS</emphasis> - send the RESET
 	  signal to the SCSI bus</para>
 
 	<para>No arguments are passed in the CCB; the only interesting
 	  argument is the SCSI bus indicated by the struct sim
 	  pointer.</para>
 
 	<para>A minimalistic implementation would forget the SCSI
 	  negotiations for all the devices on the bus and return the
 	  status CAM_REQ_CMP.</para>
 
 	<para>The proper implementation would in addition actually
 	  reset the SCSI bus (possibly also reset the SCSI controller)
 	  and mark all the CCBs being processed, both those in the
 	  hardware queue and those being disconnected, as done with
 	  the status CAM_SCSI_BUS_RESET, for example:</para>
 
 	<programlisting>    int targ, lun;
     struct xxx_hcb *h, *hh;
     struct ccb_trans_settings neg;
     struct cam_path *path;
 
     /* The SCSI bus reset may take a long time, in this case its completion
      * should be checked by interrupt or timeout. But for simplicity
      * we assume here that it is really fast.
      */
     reset_scsi_bus(softc);
 
     /* drop all enqueued CCBs */
     for(h = softc-&gt;first_queued_hcb; h != NULL; h = hh) {
         hh = h-&gt;next;
         free_hcb_and_ccb_done(h, h-&gt;ccb, CAM_SCSI_BUS_RESET);
     }
 
     /* the clean values of negotiations to report */
     neg.bus_width = 8;
     neg.sync_period = neg.sync_offset = 0;
     neg.valid = (CCB_TRANS_BUS_WIDTH_VALID
         | CCB_TRANS_SYNC_RATE_VALID | CCB_TRANS_SYNC_OFFSET_VALID);
 
     /* drop all disconnected CCBs and clean negotiations  */
     for(targ=0; targ &lt;= OUR_MAX_SUPPORTED_TARGET_ID; targ++) {
         clean_negotiations(softc, targ);
 
         /* report the event if possible */
         if(xpt_create_path(&amp;path, /*periph*/NULL,
                 cam_sim_path(sim), targ,
                 CAM_LUN_WILDCARD) == CAM_REQ_CMP) {
             xpt_async(AC_TRANSFER_NEG, path, &amp;neg);
             xpt_free_path(path);
         }
 
         for(lun=0; lun &lt;= OUR_MAX_SUPPORTED_LUN; lun++)
             for(h = softc-&gt;first_discon_hcb[targ][lun]; h != NULL; h = hh) {
                 hh=h-&gt;next;
                 free_hcb_and_ccb_done(h, h-&gt;ccb, CAM_SCSI_BUS_RESET);
             }
     }
 
     ccb-&gt;ccb_h.status = CAM_REQ_CMP;
     xpt_done(ccb);
 
     /* report the event */
     xpt_async(AC_BUS_RESET, softc-&gt;wpath, NULL);
     return;</programlisting>
 
 	<para>Implementing the SCSI bus reset as a function may be a
 	  good idea because it can be re-used by the timeout
 	  function as a last resort if things go
 	  wrong.</para>
       </listitem>
 
       <listitem>
 	<para><emphasis>XPT_ABORT</emphasis> - abort the specified
 	  CCB</para>
 
 	<para>The arguments are transferred in the instance
 	  <quote>struct ccb_abort cab</quote> of the union ccb.  The
 	  only argument field in it is:</para>
 
 	<para><emphasis>abort_ccb</emphasis> - pointer to the CCB to
 	  be aborted</para>
 
 	<para>If the abort is not supported just return the status
 	  CAM_UA_ABORT.  This is also the easy way to minimally
 	  implement this call: return CAM_UA_ABORT in every case.</para>
 
 	<para>The hard way is to implement this request honestly.
 	  First check that abort applies to a SCSI transaction:</para>
 
 	<programlisting>    union ccb *abort_ccb;
     abort_ccb = ccb-&gt;cab.abort_ccb;
 
     if(abort_ccb-&gt;ccb_h.func_code != XPT_SCSI_IO) {
         ccb-&gt;ccb_h.status = CAM_UA_ABORT;
         xpt_done(ccb);
         return;
     }</programlisting>
 
 	<para>Then it is necessary to find this CCB in our queue.
 	  This can be done by walking the list of all our hardware
 	  control blocks in search for one associated with this
 	  CCB:</para>
 
 	<programlisting>    struct xxx_hcb *hcb, *h;
 
     hcb = NULL;
 
     /* We assume that softc-&gt;first_hcb is the head of the list of all
      * HCBs associated with this bus, including those enqueued for
      * processing, being processed by hardware and disconnected ones.
      */
     for(h = softc-&gt;first_hcb; h != NULL; h = h-&gt;next) {
         if(h-&gt;ccb == abort_ccb) {
             hcb = h;
             break;
         }
     }
 
     if(hcb == NULL) {
         /* no such CCB in our queue */
         ccb-&gt;ccb_h.status = CAM_PATH_INVALID;
         xpt_done(ccb);
         return;
     }</programlisting>
 
 	<para>Now we look at the current processing status of the HCB.
 	  It may be either sitting in the queue waiting to be sent to
 	  the SCSI bus, being transferred right now, or disconnected
 	  and waiting for the result of the command, or actually
 	  completed by hardware but not yet marked as done by
 	  software.  To make sure that we do not get in any races with
 	  hardware we mark the HCB as being aborted, so that if this
 	  HCB is about to be sent to the SCSI bus the SCSI controller
 	  will see this flag and skip it.</para>
 
 	<programlisting>    int hstatus;
 
     /* shown as a function, in case special action is needed to make
      * this flag visible to hardware
      */
     set_hcb_flags(hcb, HCB_BEING_ABORTED);
 
     abort_again:
 
     hstatus = get_hcb_status(hcb);
     switch(hstatus) {
     case HCB_SITTING_IN_QUEUE:
         remove_hcb_from_hardware_queue(hcb);
         /* FALLTHROUGH */
     case HCB_COMPLETED:
         /* this is an easy case */
         free_hcb_and_ccb_done(hcb, abort_ccb, CAM_REQ_ABORTED);
         break;</programlisting>
 
 	<para>If the CCB is being transferred right now we would like
 	  to signal to the SCSI controller in some hardware-dependent
 	  way that we want to abort the current transfer.  The SCSI
 	  controller would set the SCSI ATTENTION signal and when the
 	  target responds to it send an ABORT message.  We also reset
 	  the timeout to make sure that the target is not sleeping
 	  forever.  If the command would not get aborted in some
 	  reasonable time like 10 seconds the timeout routine would go
-	  ahead and reset the whole SCSI bus.  Because the command
+	  ahead and reset the whole SCSI bus.  Since the command
 	  will be aborted in some reasonable time we can just return
 	  the abort request now as successfully completed, and mark
 	  the aborted CCB as aborted (but not mark it as done
 	  yet).</para>
 
 	<programlisting>    case HCB_BEING_TRANSFERRED:
         untimeout(xxx_timeout, (caddr_t) hcb, abort_ccb-&gt;ccb_h.timeout_ch);
         abort_ccb-&gt;ccb_h.timeout_ch =
             timeout(xxx_timeout, (caddr_t) hcb, 10 * hz);
         abort_ccb-&gt;ccb_h.status = CAM_REQ_ABORTED;
         /* ask the controller to abort that HCB, then generate
          * an interrupt and stop
          */
         if(signal_hardware_to_abort_hcb_and_stop(hcb) &lt; 0) {
             /* oops, we missed the race with hardware, this transaction
              * got off the bus before we aborted it, try again */
             goto abort_again;
         }
 
         break;</programlisting>
 
 	<para>If the CCB is in the list of disconnected HCBs then set
 	  it up as an abort request and re-queue it at the front of the
 	  hardware queue.  Reset the timeout and report the abort
 	  request as completed.</para>
 
 	<programlisting>    case HCB_DISCONNECTED:
         untimeout(xxx_timeout, (caddr_t) hcb, abort_ccb-&gt;ccb_h.timeout_ch);
         abort_ccb-&gt;ccb_h.timeout_ch =
             timeout(xxx_timeout, (caddr_t) hcb, 10 * hz);
         put_abort_message_into_hcb(hcb);
         put_hcb_at_the_front_of_hardware_queue(hcb);
         break;
     }
     ccb-&gt;ccb_h.status = CAM_REQ_CMP;
     xpt_done(ccb);
     return;</programlisting>
 
 	<para>That is all for the ABORT request, although there is one
-	  more issue.  Because the ABORT message cleans all the
+	  more issue.  As the ABORT message cleans all the
 	  ongoing transactions on a LUN we have to mark all the other
 	  active transactions on this LUN as aborted.  That should be
 	  done in the interrupt routine, after the transaction gets
 	  aborted.</para>
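 
 	<para>The marking of the other transactions on the LUN can be
 	  sketched as a walk over the list of hardware control blocks.
 	  The code below is only an illustration that compiles outside
 	  the kernel: the reduced <literal>struct xxx_hcb</literal>,
 	  the <literal>enum xxx_hcb_state</literal> and
 	  <function>xxx_mark_lun_aborted()</function> are hypothetical,
 	  and a real driver would complete each matching CCB (for
 	  example through
 	  <function>free_hcb_and_ccb_done()</function> with the status
 	  CAM_REQ_ABORTED) instead of only changing a state
 	  field:</para>
 
 	<programlisting>#include &lt;stddef.h&gt;
 
 enum xxx_hcb_state { XXX_HCB_ACTIVE, XXX_HCB_ABORTED };  /* hypothetical */
 
 /* hypothetical, heavily reduced hardware control block */
 struct xxx_hcb {
     int target;
     int lun;
     enum xxx_hcb_state state;
     struct xxx_hcb *next;
 };
 
 /* Mark every active HCB on the given target and LUN as aborted and
  * return how many were marked.
  */
 static int
 xxx_mark_lun_aborted(struct xxx_hcb *head, int target, int lun)
 {
     struct xxx_hcb *h;
     int n = 0;
 
     for (h = head; h != NULL; h = h-&gt;next) {
         if (h-&gt;target == target &amp;&amp; h-&gt;lun == lun &amp;&amp;
             h-&gt;state == XXX_HCB_ACTIVE) {
             h-&gt;state = XXX_HCB_ABORTED;
             n++;
         }
     }
     return n;
 }</programlisting>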
 
 	<para>Implementing the CCB abort as a function may be quite a
 	  good idea, because this function can be re-used if an I/O
 	  transaction times out.  The only difference would be that
 	  the timed out transaction would return the status
 	  CAM_CMD_TIMEOUT for the timed out request.  Then the case
 	  XPT_ABORT would be small, like this:</para>
 
 	<programlisting>    case XPT_ABORT: {
         /* braces give the declaration below its own scope */
         union ccb *abort_ccb;
         abort_ccb = ccb-&gt;cab.abort_ccb;
 
         if(abort_ccb-&gt;ccb_h.func_code != XPT_SCSI_IO) {
             ccb-&gt;ccb_h.status = CAM_UA_ABORT;
             xpt_done(ccb);
             return;
         }
         if(xxx_abort_ccb(abort_ccb, CAM_REQ_ABORTED) &lt; 0)
             /* no such CCB in our queue */
             ccb-&gt;ccb_h.status = CAM_PATH_INVALID;
         else
             ccb-&gt;ccb_h.status = CAM_REQ_CMP;
         xpt_done(ccb);
         return;
     }</programlisting>
       </listitem>
 
       <listitem>
 	<para><emphasis>XPT_SET_TRAN_SETTINGS</emphasis> - explicitly
 	  set values of SCSI transfer settings</para>
 
 	<para>The arguments are transferred in the instance
 	  <quote>struct ccb_trans_settings cts</quote> of the union
 	  ccb:</para>
 
 	<itemizedlist>
 	  <listitem>
 	    <para><emphasis>valid</emphasis> - a bitmask showing which
 	      settings should be updated:</para>
 
 	    <itemizedlist>
 	      <listitem>
 		<para><emphasis>CCB_TRANS_SYNC_RATE_VALID</emphasis> -
 		  synchronous transfer rate</para>
 	      </listitem>
 
 	      <listitem>
 		<para><emphasis>CCB_TRANS_SYNC_OFFSET_VALID</emphasis> -
 		  synchronous offset</para>
 	      </listitem>
 
 	      <listitem>
 		<para><emphasis>CCB_TRANS_BUS_WIDTH_VALID</emphasis> - bus
 		  width</para>
 	      </listitem>
 
 	      <listitem>
 		<para><emphasis>CCB_TRANS_DISC_VALID</emphasis> - set
 		  enable/disable disconnection</para>
 	      </listitem>
 
 	      <listitem>
 		<para><emphasis>CCB_TRANS_TQ_VALID</emphasis> - set
 		  enable/disable tagged queuing</para>
 	      </listitem>
 	    </itemizedlist>
 	  </listitem>
 
 	  <listitem>
 	    <para><emphasis>flags</emphasis> - consists of two parts,
 	      binary arguments and identification of sub-operations.
 	      The binary arguments are:</para>
 
 	    <itemizedlist>
 	      <listitem>
 		<para><emphasis>CCB_TRANS_DISC_ENB</emphasis> - enable
 		  disconnection</para>
 	      </listitem>
 
 	      <listitem>
 		<para><emphasis>CCB_TRANS_TAG_ENB</emphasis> - enable
 		  tagged queuing</para>
 	      </listitem>
 	    </itemizedlist>
 
 	    <para>The sub-operations are:</para>
 
 	    <itemizedlist>
 	      <listitem>
 		<para><emphasis>CCB_TRANS_CURRENT_SETTINGS</emphasis>
 		  - change the current negotiations</para>
 	      </listitem>
 
 	      <listitem>
 		<para><emphasis>CCB_TRANS_USER_SETTINGS</emphasis> -
 		  remember the desired user values</para>
 	      </listitem>
 	    </itemizedlist>
 	  </listitem>
 
 	  <listitem>
 	    <para><emphasis>sync_period</emphasis>,
 	      <emphasis>sync_offset</emphasis> - self-explanatory; if
 	      sync_offset==0 then the asynchronous mode is
 	      requested</para>
 	  </listitem>
 
 	  <listitem>
 	    <para><emphasis>bus_width</emphasis> - bus width, in bits
 	      (not bytes)</para>
 	  </listitem>
 	</itemizedlist>
 
 	<para>Two sets of negotiated parameters are supported, the
 	  user settings and the current settings.  The user settings
 	  are not really used much in the SIM drivers; they are mostly
 	  just a piece of memory where the upper levels can store (and
 	  later recall) their ideas about the parameters.  Setting the
 	  user parameters does not cause re-negotiation of the
 	  transfer rates.  But when the SCSI controller does a
 	  negotiation it must never set the values higher than the
 	  user parameters, so they are essentially the upper
 	  bound.</para>
 
 	<para>The current settings are, as the name says, current.
 	  Changing them means that the parameters must be
 	  re-negotiated on the next transfer.  Again, these
 	  <quote>new current settings</quote> are not supposed to be
 	  forced on the device; they are only used as the initial step
 	  of negotiations.  Also they must be limited by the actual
 	  capabilities of the SCSI controller: for example, if the
 	  SCSI controller has 8-bit bus and the request asks to set
 	  16-bit wide transfers this parameter must be silently
 	  truncated to 8-bit transfers before sending it to the
 	  device.</para>
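 
 	<para>The two limits described above, the user settings and
 	  the controller capabilities, can be combined in one step.
 	  The following is a minimal sketch that compiles outside the
 	  kernel; <literal>struct xxx_xfer</literal> and
 	  <function>xxx_limit_xfer()</function> are hypothetical, and
 	  the sketch assumes that a larger
 	  <literal>sync_period</literal> means a slower
 	  transfer:</para>
 
 	<programlisting>#include &lt;stdint.h&gt;
 
 /* hypothetical bundle of the per-target negotiated parameters */
 struct xxx_xfer {
     uint8_t bus_width;    /* in bits */
     uint8_t sync_period;  /* assumed: larger value means slower transfer */
     uint8_t sync_offset;  /* 0 means asynchronous mode */
 };
 
 static uint8_t xxx_min(uint8_t a, uint8_t b) { return a &lt; b ? a : b; }
 static uint8_t xxx_max(uint8_t a, uint8_t b) { return a &gt; b ? a : b; }
 
 /* Starting values for a negotiation: the requested settings, limited
  * by the user settings and by what the controller can do.
  */
 static struct xxx_xfer
 xxx_limit_xfer(struct xxx_xfer req, struct xxx_xfer user, struct xxx_xfer hw)
 {
     struct xxx_xfer r;
 
     r.bus_width = xxx_min(req.bus_width,
         xxx_min(user.bus_width, hw.bus_width));
     r.sync_offset = xxx_min(req.sync_offset,
         xxx_min(user.sync_offset, hw.sync_offset));
     /* a shorter period is a higher rate, so the limit is a minimum period */
     r.sync_period = xxx_max(req.sync_period,
         xxx_max(user.sync_period, hw.sync_period));
     return r;
 }</programlisting>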
 
 	<para>One caveat is that the bus width and synchronous
 	  parameters are per target while the disconnection and tag
 	  enabling parameters are per LUN.</para>
 
 	<para>The recommended implementation is to keep 3 sets of
 	  negotiated (bus width and synchronous transfer)
 	  parameters:</para>
 
 	<itemizedlist>
 	  <listitem>
 	    <para><emphasis>user</emphasis> - the user set, as
 	      above</para>
 	  </listitem>
 
 	  <listitem>
 	    <para><emphasis>current</emphasis> - those actually in
 	      effect</para>
 	  </listitem>
 
 	  <listitem>
 	    <para><emphasis>goal</emphasis> - those requested by
 	      setting of the <quote>current</quote>
 	      parameters</para>
 	  </listitem>
 	</itemizedlist>
 
 	<para>The code looks like:</para>
 
 	<programlisting>    struct ccb_trans_settings *cts;
     int targ, lun;
     int flags, valid;
 
     cts = &amp;ccb-&gt;cts;
     targ = ccb_h-&gt;target_id;
     lun = ccb_h-&gt;target_lun;
     flags = cts-&gt;flags;
     valid = cts-&gt;valid; /* bitmask of the settings to update */
     if(flags &amp; CCB_TRANS_USER_SETTINGS) {
         if(valid &amp; CCB_TRANS_SYNC_RATE_VALID)
             softc-&gt;user_sync_period[targ] = cts-&gt;sync_period;
         if(valid &amp; CCB_TRANS_SYNC_OFFSET_VALID)
             softc-&gt;user_sync_offset[targ] = cts-&gt;sync_offset;
         if(valid &amp; CCB_TRANS_BUS_WIDTH_VALID)
             softc-&gt;user_bus_width[targ] = cts-&gt;bus_width;
 
         if(valid &amp; CCB_TRANS_DISC_VALID) {
             softc-&gt;user_tflags[targ][lun] &amp;= ~CCB_TRANS_DISC_ENB;
             softc-&gt;user_tflags[targ][lun] |= flags &amp; CCB_TRANS_DISC_ENB;
         }
         if(valid &amp; CCB_TRANS_TQ_VALID) {
             softc-&gt;user_tflags[targ][lun] &amp;= ~CCB_TRANS_TAG_ENB;
             softc-&gt;user_tflags[targ][lun] |= flags &amp; CCB_TRANS_TAG_ENB;
         }
     }
     if(flags &amp; CCB_TRANS_CURRENT_SETTINGS) {
         if(valid &amp; CCB_TRANS_SYNC_RATE_VALID)
             softc-&gt;goal_sync_period[targ] =
                 max(cts-&gt;sync_period, OUR_MIN_SUPPORTED_PERIOD);
         if(valid &amp; CCB_TRANS_SYNC_OFFSET_VALID)
             softc-&gt;goal_sync_offset[targ] =
                 min(cts-&gt;sync_offset, OUR_MAX_SUPPORTED_OFFSET);
         if(valid &amp; CCB_TRANS_BUS_WIDTH_VALID)
             softc-&gt;goal_bus_width[targ] = min(cts-&gt;bus_width, OUR_BUS_WIDTH);
 
         if(valid &amp; CCB_TRANS_DISC_VALID) {
             softc-&gt;current_tflags[targ][lun] &amp;= ~CCB_TRANS_DISC_ENB;
             softc-&gt;current_tflags[targ][lun] |= flags &amp; CCB_TRANS_DISC_ENB;
         }
         if(valid &amp; CCB_TRANS_TQ_VALID) {
             softc-&gt;current_tflags[targ][lun] &amp;= ~CCB_TRANS_TAG_ENB;
             softc-&gt;current_tflags[targ][lun] |= flags &amp; CCB_TRANS_TAG_ENB;
         }
     }
     ccb-&gt;ccb_h.status = CAM_REQ_CMP;
     xpt_done(ccb);
     return;</programlisting>
 
 	<para>Then, when the next I/O request is processed, it will
 	  check if it has to re-negotiate, for example by calling the
 	  function target_negotiated(hcb).  It can be implemented like
 	  this:</para>
 
 	<programlisting>    int
     target_negotiated(struct xxx_hcb *hcb)
     {
         struct xxx_softc *softc = hcb-&gt;softc;
         int targ = hcb-&gt;target;
 
         if( softc-&gt;current_sync_period[targ] != softc-&gt;goal_sync_period[targ]
         || softc-&gt;current_sync_offset[targ] != softc-&gt;goal_sync_offset[targ]
         || softc-&gt;current_bus_width[targ] != softc-&gt;goal_bus_width[targ] )
             return 0; /* FALSE */
         else
             return 1; /* TRUE */
     }</programlisting>
 
 	<para>After the values are re-negotiated the resulting values
 	  must be assigned to both current and goal parameters, so for
 	  future I/O transactions the current and goal parameters
 	  would be the same and
 	  <function>target_negotiated()</function> would return TRUE.
 	  When the card is initialized (in
 	  <function>xxx_attach()</function>) the current negotiation
 	  values must be initialized to narrow asynchronous mode, while
 	  the goal and user values must be initialized to the maximal
 	  values supported by the controller.</para>
 
       </listitem>
 
       <listitem>
 	<para><emphasis>XPT_GET_TRAN_SETTINGS</emphasis> - get values
 	  of SCSI transfer settings</para>
 
 	<para>This operation is the reverse of XPT_SET_TRAN_SETTINGS.
 	  Fill in the CCB instance
 	  <quote>struct ccb_trans_settings cts</quote> with data as
 	  requested by the flags CCB_TRANS_CURRENT_SETTINGS or
 	  CCB_TRANS_USER_SETTINGS (if both are set then the existing
 	  drivers return the current settings).  Set all the bits in
 	  the valid field.</para>
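 
 	<para>The selection rule can be sketched as follows.  This is
 	  only an illustration that compiles outside the kernel:
 	  <literal>struct xxx_cts</literal>,
 	  <literal>struct xxx_xfer</literal>, the
 	  <literal>XXX_</literal> flag values and
 	  <function>xxx_get_tran_settings()</function> are
 	  hypothetical stand-ins, not the real CAM definitions:</para>
 
 	<programlisting>#include &lt;stdint.h&gt;
 
 /* hypothetical flag values, chosen only for this sketch */
 #define XXX_TRANS_CURRENT_SETTINGS 0x01
 #define XXX_TRANS_USER_SETTINGS    0x02
 #define XXX_VALID_ALL              0x1f
 
 struct xxx_xfer { uint8_t bus_width, sync_period, sync_offset; };
 
 /* hypothetical stand-in for struct ccb_trans_settings */
 struct xxx_cts {
     uint32_t flags;
     uint32_t valid;
     struct xxx_xfer xfer;
 };
 
 /* Copy either the current or the user set into the request.  If both
  * are asked for, the current set wins; if neither flag is set this
  * sketch falls back to the user set.
  */
 static void
 xxx_get_tran_settings(struct xxx_cts *cts, const struct xxx_xfer *current,
     const struct xxx_xfer *user)
 {
     if (cts-&gt;flags &amp; XXX_TRANS_CURRENT_SETTINGS)
         cts-&gt;xfer = *current;
     else
         cts-&gt;xfer = *user;
     cts-&gt;valid = XXX_VALID_ALL;   /* every field has been filled in */
 }</programlisting>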
 
       </listitem>
 
       <listitem>
 	<para><emphasis>XPT_CALC_GEOMETRY</emphasis> - calculate
 	  logical (BIOS)<indexterm><primary>BIOS</primary></indexterm>
 	  geometry of the disk</para>
 
 	<para>The arguments are transferred in the instance
 	  <quote>struct ccb_calc_geometry ccg</quote> of the union
 	  ccb:</para>
 
 	<itemizedlist>
 
 	  <listitem>
 	    <para><emphasis>block_size</emphasis> - input, block
 	      (a.k.a. sector) size in bytes</para>
 	  </listitem>
 
 	  <listitem>
 	    <para><emphasis>volume_size</emphasis> - input, volume
 	      size in blocks</para>
 	  </listitem>
 
 	  <listitem>
 	    <para><emphasis>cylinders</emphasis> - output, logical
 	      cylinders</para>
 	  </listitem>
 
 	  <listitem>
 	    <para><emphasis>heads</emphasis> - output, logical
 	      heads</para>
 	  </listitem>
 
 	  <listitem>
 	    <para><emphasis>secs_per_track</emphasis> - output,
 	      logical sectors per track</para>
 	  </listitem>
 	</itemizedlist>
 
 	<para>If the returned geometry differs enough from what
 	  the SCSI controller BIOS<indexterm><primary>SCSI</primary>
 	  <secondary>BIOS</secondary></indexterm> thinks, and a disk on
 	  this SCSI controller is used as the boot disk, the system may
 	  not be able to boot.  A typical calculation, taken from
 	  the aic7xxx driver, is:</para>
 
 	<programlisting>    struct    ccb_calc_geometry *ccg;
     u_int32_t size_mb;
     u_int32_t secs_per_cylinder;
     int   extended;
 
     ccg = &amp;ccb-&gt;ccg;
     size_mb = ccg-&gt;volume_size
         / ((1024L * 1024L) / ccg-&gt;block_size);
     extended = check_cards_EEPROM_for_extended_geometry(softc);
 
     if (size_mb &gt; 1024 &amp;&amp; extended) {
         ccg-&gt;heads = 255;
         ccg-&gt;secs_per_track = 63;
     } else {
         ccg-&gt;heads = 64;
         ccg-&gt;secs_per_track = 32;
     }
     secs_per_cylinder = ccg-&gt;heads * ccg-&gt;secs_per_track;
     ccg-&gt;cylinders = ccg-&gt;volume_size / secs_per_cylinder;
     ccb-&gt;ccb_h.status = CAM_REQ_CMP;
     xpt_done(ccb);
     return;</programlisting>
 
 	<para>This gives the general idea; the exact calculation
 	  depends on the quirks of the particular BIOS.  If the BIOS
 	  provides no way to set the <quote>extended translation</quote>
 	  flag in EEPROM this flag should normally be assumed equal to
 	  1.  Other popular geometries are:</para>
 
 	<programlisting>    128 heads, 63 sectors - Symbios controllers
     16 heads, 63 sectors - old controllers</programlisting>
 
 	<para>Some system BIOSes and SCSI BIOSes fight with each other
 	  with variable success, for example a combination of Symbios
 	  875/895 SCSI and Phoenix BIOS can give geometry 128/63 after
 	  power up and 255/63 after a hard reset or soft
 	  reboot.</para>
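 
 	<para>To check the arithmetic by hand, the same calculation
 	  can be written as a stand-alone function.  This is only a
 	  sketch: <literal>struct xxx_geom</literal> and
 	  <function>xxx_calc_geometry()</function> are hypothetical,
 	  the volume size is assumed to be counted in blocks (as the
 	  division by blocks-per-megabyte implies), and the block size
 	  is assumed to be non-zero and no larger than one
 	  megabyte:</para>
 
 	<programlisting>#include &lt;stdint.h&gt;
 
 /* hypothetical container for the three output values */
 struct xxx_geom {
     uint32_t cylinders;
     uint8_t heads;
     uint8_t secs_per_track;
 };
 
 static struct xxx_geom
 xxx_calc_geometry(uint64_t volume_blocks, uint32_t block_size, int extended)
 {
     struct xxx_geom g;
     /* blocks per megabyte, then volume size in megabytes */
     uint64_t size_mb = volume_blocks / ((1024UL * 1024UL) / block_size);
 
     if (size_mb &gt; 1024 &amp;&amp; extended) {
         g.heads = 255;
         g.secs_per_track = 63;
     } else {
         g.heads = 64;
         g.secs_per_track = 32;
     }
     g.cylinders = (uint32_t)(volume_blocks /
         ((uint32_t)g.heads * g.secs_per_track));
     return g;
 }</programlisting>
 
 	<para>For example, a volume of 8388608 blocks of 512 bytes
 	  (4096 megabytes) with extended translation enabled gives 255
 	  heads, 63 sectors per track and 8388608 / 16065 = 522
 	  cylinders.</para>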
       </listitem>
 
       <listitem>
 	<para><emphasis>XPT_PATH_INQ</emphasis> - path inquiry, in
 	  other words get the SIM driver and SCSI controller (also
 	  known as HBA - Host Bus Adapter) properties</para>
 
 	<para>The properties are returned in the instance
 	  <quote>struct ccb_pathinq cpi</quote> of the union
 	  ccb:</para>
 
 	<itemizedlist>
 	  <listitem>
 	    <para>version_num - the SIM driver version number, now all
 	      drivers use 1</para>
 	  </listitem>
 
 	  <listitem>
 	    <para>hba_inquiry - bitmask of features supported by the
 	      controller:</para>
 
 	    <itemizedlist>
 	      <listitem>
 		<para>PI_MDP_ABLE - supports the MDP (MODIFY DATA
 		  POINTER) message</para>
 	      </listitem>
 
 	      <listitem>
 		<para>PI_WIDE_32 - supports 32 bit wide
 		  SCSI</para>
 	      </listitem>
 
 	      <listitem>
 		<para>PI_WIDE_16 - supports 16 bit wide
 		  SCSI</para>
 	      </listitem>
 
 	      <listitem>
 		<para>PI_SDTR_ABLE - can negotiate synchronous
 		  transfer rate</para>
 	      </listitem>
 
 	      <listitem>
 		<para>PI_LINKED_CDB - supports linked
 		  commands</para>
 	      </listitem>
 
 	      <listitem>
 		<para>PI_TAG_ABLE - supports tagged
 		  commands</para>
 	      </listitem>
 
 	      <listitem>
 		<para>PI_SOFT_RST - supports soft reset alternative
 		  (hard reset and soft reset are mutually exclusive
 		  within a SCSI bus)</para>
 	      </listitem>
 	    </itemizedlist>
 	  </listitem>
 
 	  <listitem>
 	    <para>target_sprt - flags for target mode support, 0 if
 	      unsupported</para>
 	  </listitem>
 
 	  <listitem>
 	    <para>hba_misc - miscellaneous controller
 	      features:</para>
 	  </listitem>
 
 	  <listitem>
 	    <para>PIM_SCANHILO - bus scans from high ID to low
 	      ID</para>
 	  </listitem>
 
 	  <listitem>
 	    <para>PIM_NOREMOVE - removable devices not included in
 	      scan</para>
 	  </listitem>
 
 	  <listitem>
 	    <para>PIM_NOINITIATOR - initiator role not
 	      supported</para>
 	  </listitem>
 
 	  <listitem>
 	    <para>PIM_NOBUSRESET - user has disabled initial BUS
 	      RESET</para>
 	  </listitem>
 
 	  <listitem>
 	    <para>hba_eng_cnt - mysterious HBA engine count, something
 	      related to compression, now is always set to 0</para>
 	  </listitem>
 
 	  <listitem>
 	    <para>vuhba_flags - vendor-unique flags, unused now</para>
 	  </listitem>
 
 	  <listitem>
 	    <para>max_target - maximal supported target ID (7 for
 	      8-bit bus, 15 for 16-bit bus, 127 for Fibre
 	      Channel)</para>
 	  </listitem>
 
 	  <listitem>
 	    <para>max_lun - maximal supported LUN ID (7 for older SCSI
 	      controllers, 63 for newer ones)</para>
 	  </listitem>
 
 	  <listitem>
 	    <para>async_flags - bitmask of installed Async handler,
 	      unused now</para>
 	  </listitem>
 
 	  <listitem>
 	    <para>hpath_id - highest Path ID in the subsystem, unused
 	      now</para>
 	  </listitem>
 
 	  <listitem>
 	    <para>unit_number - the controller unit number,
 	      cam_sim_unit(sim)</para>
 	  </listitem>
 
 	  <listitem>
 	    <para>bus_id - the bus number, cam_sim_bus(sim)</para>
 	  </listitem>
 
 	  <listitem>
 	    <para>initiator_id - the SCSI ID of the controller
 	      itself</para>
 	  </listitem>
 
 	  <listitem>
 	    <para>base_transfer_speed - nominal transfer speed in KB/s
 	      for asynchronous narrow transfers, equal to 3300 for
 	      SCSI</para>
 	  </listitem>
 
 	  <listitem>
 	    <para>sim_vid - SIM driver's vendor id, a zero-terminated
 	      string of maximal length SIM_IDLEN including the
 	      terminating zero</para>
 	  </listitem>
 
 	  <listitem>
 	    <para>hba_vid - SCSI controller's vendor id, a
 	      zero-terminated string of maximal length HBA_IDLEN
 	      including the terminating zero</para>
 	  </listitem>
 
 	  <listitem>
 	    <para>dev_name - device driver name, a zero-terminated
 	      string of maximal length DEV_IDLEN including the
 	      terminating zero, equal to cam_sim_name(sim)</para>
 	  </listitem>
 	</itemizedlist>
 
 	<para>The recommended way of setting the string fields is
 	  using strncpy, like:</para>
 
 	<programlisting>    strncpy(cpi-&gt;dev_name, cam_sim_name(sim), DEV_IDLEN);</programlisting>
 
 	<para>After setting the values set the status to CAM_REQ_CMP
 	  and mark the CCB as done.</para>
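 
 	<para>Note that <function>strncpy</function> does not write a
 	  terminating zero when the source fills the whole field.  A
 	  minimal sketch of one way to guarantee termination is shown
 	  here.  The <literal>struct pathinq_ids</literal> type, the
 	  helper names, the vendor strings and the length values of 16
 	  are assumptions made for this example; a real driver takes
 	  the structure and the lengths from the CAM headers:</para>
 
 	<programlisting><![CDATA[
```c
#include <assert.h>
#include <string.h>

/* Assumed lengths; the real values come from the CAM headers. */
#define SIM_IDLEN 16
#define HBA_IDLEN 16
#define DEV_IDLEN 16

/* Hypothetical stand-in for the string fields of struct ccb_pathinq. */
struct pathinq_ids {
    char sim_vid[SIM_IDLEN];
    char hba_vid[HBA_IDLEN];
    char dev_name[DEV_IDLEN];
};

/* Copy src into a fixed-size field, always leaving a terminating zero. */
void
set_id_field(char *dst, size_t dstlen, const char *src)
{
    assert(dstlen > 0);
    strncpy(dst, src, dstlen - 1);
    dst[dstlen - 1] = '\0';
}

/* Fill all three identification strings; the vendor names are made up. */
void
fill_ids(struct pathinq_ids *cpi, const char *driver_name)
{
    set_id_field(cpi->sim_vid, sizeof(cpi->sim_vid), "FreeBSD");
    set_id_field(cpi->hba_vid, sizeof(cpi->hba_vid), "ExampleVendor");
    set_id_field(cpi->dev_name, sizeof(cpi->dev_name), driver_name);
}
```
]]></programlisting>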
       </listitem>
     </itemizedlist>
   </sect1>
 
   <sect1 xml:id="scsi-polling">
     <title>Polling</title>
 
     <funcsynopsis>
       <funcprototype>
 	<funcdef>static void
 	  <function>xxx_poll</function>
 	</funcdef>
 	<paramdef>
 	  <parameter>struct cam_sim *sim</parameter>
 	</paramdef>
       </funcprototype>
     </funcsynopsis>
 
     <para>The poll function is used to simulate the interrupts when
       the interrupt subsystem is not functioning (for example, when
       the system has crashed and is creating the system dump).  The
       CAM subsystem sets the proper interrupt level before calling the
       poll routine.  So all it needs to do is to call the interrupt
       routine (or the other way around, the poll routine may be doing
       the real action and the interrupt routine would just call the
       poll routine).  Why bother about a separate function then?
-      Because of different calling conventions.  The
+      Due to different calling conventions.  The
       <function>xxx_poll</function> routine gets the struct cam_sim
       pointer as its argument, while the PCI interrupt routine by common
       convention gets a pointer to the struct
       <varname remap="structname">xxx_softc</varname> and the ISA interrupt routine
       gets just the device unit number.  So the poll routine would
       normally look like:</para>
 
     <programlisting>static void
 xxx_poll(struct cam_sim *sim)
 {
     xxx_intr((struct xxx_softc *)cam_sim_softc(sim)); /* for PCI device */
 }</programlisting>
 
     <para>or</para>
 
     <programlisting>static void
 xxx_poll(struct cam_sim *sim)
 {
     xxx_intr(cam_sim_unit(sim)); /* for ISA device */
 }</programlisting>
   </sect1>
 
   <sect1 xml:id="scsi-async">
     <title>Asynchronous Events</title>
 
     <para>If an asynchronous event callback has been set up then the
       callback function should be defined.</para>
 
     <programlisting>static void
 ahc_async(void *callback_arg, u_int32_t code, struct cam_path *path, void *arg)</programlisting>
 
     <itemizedlist>
       <listitem>
 	<para>callback_arg - the value supplied when registering the
 	  callback</para>
       </listitem>
 
       <listitem>
 	<para>code - identifies the type of event</para>
       </listitem>
 
       <listitem>
 	<para>path - identifies the devices to which the event
 	  applies</para>
       </listitem>
 
       <listitem>
 	<para>arg - event-specific argument</para>
       </listitem>
     </itemizedlist>
 
     <para>Implementation for a single type of event, AC_LOST_DEVICE,
       looks like:</para>
 
     <programlisting>    struct xxx_softc *softc;
     struct cam_sim *sim;
     int targ;
     struct ccb_trans_settings neg;
 
     sim = (struct cam_sim *)callback_arg;
     softc = (struct xxx_softc *)cam_sim_softc(sim);
     switch (code) {
     case AC_LOST_DEVICE:
         targ = xpt_path_target_id(path);
         if(targ &lt;= OUR_MAX_SUPPORTED_TARGET) {
             clean_negotiations(softc, targ);
             /* send indication to CAM */
             neg.bus_width = 8;
             neg.sync_period = neg.sync_offset = 0;
             neg.valid = (CCB_TRANS_BUS_WIDTH_VALID
                 | CCB_TRANS_SYNC_RATE_VALID | CCB_TRANS_SYNC_OFFSET_VALID);
             xpt_async(AC_TRANSFER_NEG, path, &amp;neg);
         }
         break;
     default:
         break;
     }</programlisting>
   </sect1>
 
   <sect1 xml:id="scsi-interrupts">
     <title>Interrupts</title>
 
     <indexterm><primary>SCSI</primary><secondary>interrupts</secondary></indexterm>
 
     <para>The exact type of the interrupt routine depends on the type
       of the peripheral bus (PCI, ISA and so on) to which the SCSI
       controller is connected.</para>
 
     <para>The interrupt routines of the SIM drivers run at the
       interrupt level splcam.  So <function>splcam()</function> should
       be used in the driver to synchronize activity between the
       interrupt routine and the rest of the driver (for a
       multiprocessor-aware driver things get yet more interesting but
       we ignore this case here).  The pseudo-code in this document
       happily ignores the problems of synchronization.  The real code
       must not ignore them.  A simple-minded approach is to set
       <function>splcam()</function> on the entry to the other routines
       and reset it on return thus protecting them by one big critical
       section.  To make sure that the interrupt level will be always
       restored a wrapper function can be defined, like:</para>
 
     <programlisting>    static void
     xxx_action(struct cam_sim *sim, union ccb *ccb)
     {
         int s;
         s = splcam();
         xxx_action1(sim, ccb);
         splx(s);
     }
 
     static void
     xxx_action1(struct cam_sim *sim, union ccb *ccb)
     {
         ... process the request ...
     }</programlisting>
 
     <para>This approach is simple and robust, but interrupts may get
       blocked for a relatively long time, which would negatively
       affect the system's performance.  On the other hand, the
       functions of the <function>spl()</function> family have rather
       high overhead, so a vast number of tiny critical sections may
       not be good either.</para>
 
     <para>The conditions handled by the interrupt routine and the
       details depend very much on the hardware.  We consider the set
       of <quote>typical</quote> conditions.</para>
 
     <para>First, we check if a SCSI reset was encountered on the bus
       (probably caused by another SCSI controller on the same SCSI
       bus).  If so we drop all the enqueued and disconnected requests,
       report the events and re-initialize our SCSI controller.  It is
       important that during this initialization the controller will
       not issue another reset or else two controllers on the same SCSI
       bus could ping-pong resets forever.  The case of a fatal
       controller error or hang could be handled in the same place, but
       it will probably also require sending a RESET signal to the SCSI
       bus to reset the status of the connections with the SCSI
       devices.</para>
 
     <programlisting>    int fatal=0;
     struct ccb_trans_settings neg;
     struct cam_path *path;
 
     if( detected_scsi_reset(softc)
     || (fatal = detected_fatal_controller_error(softc)) ) {
         int targ, lun;
         struct xxx_hcb *h, *hh;
 
         /* drop all enqueued CCBs */
         for(h = softc-&gt;first_queued_hcb; h != NULL; h = hh) {
             hh = h-&gt;next;
             free_hcb_and_ccb_done(h, h-&gt;ccb, CAM_SCSI_BUS_RESET);
         }
 
         /* the clean values of negotiations to report */
         neg.bus_width = 8;
         neg.sync_period = neg.sync_offset = 0;
         neg.valid = (CCB_TRANS_BUS_WIDTH_VALID
             | CCB_TRANS_SYNC_RATE_VALID | CCB_TRANS_SYNC_OFFSET_VALID);
 
         /* drop all disconnected CCBs and clean negotiations  */
         for(targ=0; targ &lt;= OUR_MAX_SUPPORTED_TARGET; targ++) {
             clean_negotiations(softc, targ);
 
             /* report the event if possible */
             if(xpt_create_path(&amp;path, /*periph*/NULL,
                     cam_sim_path(sim), targ,
                     CAM_LUN_WILDCARD) == CAM_REQ_CMP) {
                 xpt_async(AC_TRANSFER_NEG, path, &amp;neg);
                 xpt_free_path(path);
             }
 
             for(lun=0; lun &lt;= OUR_MAX_SUPPORTED_LUN; lun++)
                 for(h = softc-&gt;first_discon_hcb[targ][lun]; h != NULL; h = hh) {
                     hh=h-&gt;next;
                     if(fatal)
                         free_hcb_and_ccb_done(h, h-&gt;ccb, CAM_UNREC_HBA_ERROR);
                     else
                         free_hcb_and_ccb_done(h, h-&gt;ccb, CAM_SCSI_BUS_RESET);
                 }
         }
 
         /* report the event */
         xpt_async(AC_BUS_RESET, softc-&gt;wpath, NULL);
 
         /* re-initialization may take a lot of time, in such case
          * its completion should be signaled by another interrupt or
          * checked on timeout - but for simplicity we assume here that
          * it is really fast
          */
         if(!fatal) {
             reinitialize_controller_without_scsi_reset(softc);
         } else {
             reinitialize_controller_with_scsi_reset(softc);
         }
         schedule_next_hcb(softc);
         return;
     }</programlisting>
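 
     <para>The loops above save the next pointer in
       <varname>hh</varname> before the current HCB is released.  The
       following user-space sketch isolates that idiom.  The
       simplified <literal>struct hcb</literal> and the helper names
       are hypothetical, and <function>free</function> stands in for
       <function>free_hcb_and_ccb_done</function>:</para>
 
     <programlisting><![CDATA[
```c
#include <assert.h>
#include <stdlib.h>

/* Hypothetical, simplified hardware control block. */
struct hcb {
    struct hcb *next;
    int target;
};

/* Push a new HCB at the head of the list; returns the new head. */
struct hcb *
hcb_push(struct hcb *head, int target)
{
    struct hcb *h = malloc(sizeof(*h));

    assert(h != NULL);
    h->next = head;
    h->target = target;
    return h;
}

/*
 * Free every HCB on the list and return how many were freed.
 * The next pointer is saved in hh before h is released, because
 * reading h->next after free(h) would be a use-after-free.
 */
int
hcb_drain(struct hcb **headp)
{
    struct hcb *h, *hh;
    int n = 0;

    for (h = *headp; h != NULL; h = hh) {
        hh = h->next;
        free(h);
        n++;
    }
    *headp = NULL;
    return n;
}
```
]]></programlisting>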
 
     <para>If the interrupt is not caused by a controller-wide
       condition then probably something has happened to the current
       hardware control block (HCB).  Depending on the hardware there
       may be other non-HCB-related events; we just do not consider
       them here.  Then we analyze what happened to this HCB:</para>
 
     <programlisting>    struct xxx_hcb *hcb, *h, *hh;
     int hcb_status, scsi_status;
     int ccb_status;
     int targ;
     int lun_to_freeze;
 
     hcb = get_current_hcb(softc);
     if(hcb == NULL) {
         /* either stray interrupt or something went very wrong
          * or this is something hardware-dependent
          */
         handle as necessary;
         return;
     }
 
     targ = hcb-&gt;target;
     hcb_status = get_status_of_current_hcb(softc);</programlisting>
 
     <para>First we check if the HCB has completed and if so we check
       the returned SCSI status.</para>
 
     <programlisting>    if(hcb_status == COMPLETED) {
         scsi_status = get_completion_status(hcb);</programlisting>
 
     <para>Then look if this status is related to the REQUEST SENSE
       command and if so handle it in a simple way.</para>
 
     <programlisting>        if(hcb-&gt;flags &amp; DOING_AUTOSENSE) {
             if(scsi_status == GOOD) { /* autosense was successful */
                 hcb-&gt;ccb-&gt;ccb_h.status |= CAM_AUTOSNS_VALID;
                 free_hcb_and_ccb_done(hcb, hcb-&gt;ccb, CAM_SCSI_STATUS_ERROR);
             } else {
         autosense_failed:
                 free_hcb_and_ccb_done(hcb, hcb-&gt;ccb, CAM_AUTOSENSE_FAIL);
             }
             schedule_next_hcb(softc);
             return;
         }</programlisting>
 
     <para>Otherwise the command itself has completed, and we pay more
       attention to the details.  If auto-sense is not disabled for
       this CCB and the command has failed with sense data, then run
       the REQUEST SENSE command to receive that data.</para>
 
     <programlisting>        hcb-&gt;ccb-&gt;csio.scsi_status = scsi_status;
         calculate_residue(hcb);
 
         if( (hcb-&gt;ccb-&gt;ccb_h.flags &amp; CAM_DIS_AUTOSENSE)==0
         &amp;&amp; ( scsi_status == CHECK_CONDITION
                 || scsi_status == COMMAND_TERMINATED) ) {
             /* start auto-SENSE */
             hcb-&gt;flags |= DOING_AUTOSENSE;
             setup_autosense_command_in_hcb(hcb);
             restart_current_hcb(softc);
             return;
         }
         if(scsi_status == GOOD)
             free_hcb_and_ccb_done(hcb, hcb-&gt;ccb, CAM_REQ_CMP);
         else
             free_hcb_and_ccb_done(hcb, hcb-&gt;ccb, CAM_SCSI_STATUS_ERROR);
         schedule_next_hcb(softc);
         return;
     }</programlisting>
 
     <para>One typical thing would be negotiation events: negotiation
       messages received from a SCSI target (in answer to our
       negotiation attempt or by target's initiative) or the target is
       unable to negotiate (rejects our negotiation messages or does
       not answer them).</para>
 
     <programlisting>    switch(hcb_status) {
     case TARGET_REJECTED_WIDE_NEG:
         /* revert to 8-bit bus */
         softc-&gt;current_bus_width[targ] = softc-&gt;goal_bus_width[targ] = 8;
         /* report the event */
         neg.bus_width = 8;
         neg.valid = CCB_TRANS_BUS_WIDTH_VALID;
         xpt_async(AC_TRANSFER_NEG, hcb-&gt;ccb-&gt;ccb_h.path, &amp;neg);
         continue_current_hcb(softc);
         return;
     case TARGET_ANSWERED_WIDE_NEG:
         {
             int wd;
 
             wd = get_target_bus_width_request(softc);
             if(wd &lt;= softc-&gt;goal_bus_width[targ]) {
                 /* answer is acceptable */
                 softc-&gt;current_bus_width[targ] =
                 softc-&gt;goal_bus_width[targ] = neg.bus_width = wd;
 
                 /* report the event */
                 neg.valid = CCB_TRANS_BUS_WIDTH_VALID;
                 xpt_async(AC_TRANSFER_NEG, hcb-&gt;ccb-&gt;ccb_h.path, &amp;neg);
             } else {
                 prepare_reject_message(hcb);
             }
         }
         continue_current_hcb(softc);
         return;
     case TARGET_REQUESTED_WIDE_NEG:
         {
             int wd;
 
             wd = get_target_bus_width_request(softc);
             wd = min (wd, OUR_BUS_WIDTH);
             wd = min (wd, softc-&gt;user_bus_width[targ]);
 
             if(wd != softc-&gt;current_bus_width[targ]) {
                 /* the bus width has changed */
                 softc-&gt;current_bus_width[targ] =
                 softc-&gt;goal_bus_width[targ] = neg.bus_width = wd;
 
                 /* report the event */
                 neg.valid = CCB_TRANS_BUS_WIDTH_VALID;
                 xpt_async(AC_TRANSFER_NEG, hcb-&gt;ccb-&gt;ccb_h.path, &amp;neg);
             }
             prepare_width_nego_response(hcb, wd);
         }
         continue_current_hcb(softc);
         return;
     }</programlisting>
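 
     <para>The width selection rules used in the cases above can be
       sketched as two small pure functions.  The function names,
       the value chosen for <literal>OUR_BUS_WIDTH</literal> and the
       use of -1 to mean <quote>send a reject message</quote> are
       assumptions made for this example:</para>
 
     <programlisting><![CDATA[
```c
#include <assert.h>

#define OUR_BUS_WIDTH 16    /* assumed controller capability, in bits */

int
min_int(int a, int b)
{
    return (a < b) ? a : b;
}

/*
 * The target started the negotiation: use the narrowest of the
 * target's request, the controller's capability and the user limit.
 */
int
width_for_target_request(int target_request, int user_limit)
{
    int wd;

    assert(target_request > 0 && user_limit > 0);
    wd = min_int(target_request, OUR_BUS_WIDTH);
    wd = min_int(wd, user_limit);
    return wd;
}

/*
 * The target answered our request: the answer is acceptable only if
 * it is not wider than our goal.  Returns the accepted width, or -1
 * when a reject message should be prepared instead.
 */
int
width_for_target_answer(int target_answer, int goal_width)
{
    return (target_answer <= goal_width) ? target_answer : -1;
}
```
]]></programlisting>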
 
     <para>Then we handle any errors that could have happened during
       auto-sense in the same simple-minded way as before.  Otherwise
       we look closer at the details again.</para>
 
     <programlisting>    if(hcb-&gt;flags &amp; DOING_AUTOSENSE)
         goto autosense_failed;
 
     switch(hcb_status) {</programlisting>
 
     <para>The next event we consider is an unexpected disconnect,
       which is considered normal after an ABORT or BUS DEVICE RESET
       message and abnormal in other cases.</para>
 
     <programlisting>    case UNEXPECTED_DISCONNECT:
         if(requested_abort(hcb)) {
             /* abort affects all commands on that target+LUN, so
              * mark all disconnected HCBs on that target+LUN as aborted too
              */
             for(h = softc-&gt;first_discon_hcb[hcb-&gt;target][hcb-&gt;lun];
                     h != NULL; h = hh) {
                 hh=h-&gt;next;
                 free_hcb_and_ccb_done(h, h-&gt;ccb, CAM_REQ_ABORTED);
             }
             ccb_status = CAM_REQ_ABORTED;
         } else if(requested_bus_device_reset(hcb)) {
             int lun;
 
             /* reset affects all commands on that target, so
              * mark all disconnected HCBs on that target as reset
              */
 
             for(lun=0; lun &lt;= OUR_MAX_SUPPORTED_LUN; lun++)
                 for(h = softc-&gt;first_discon_hcb[hcb-&gt;target][lun];
                         h != NULL; h = hh) {
                     hh=h-&gt;next;
                     free_hcb_and_ccb_done(h, h-&gt;ccb, CAM_SCSI_BUS_RESET);
                 }
 
             /* send event */
             xpt_async(AC_SENT_BDR, hcb-&gt;ccb-&gt;ccb_h.path, NULL);
 
             /* this was the CAM_RESET_DEV request itself, it is completed */
             ccb_status = CAM_REQ_CMP;
         } else {
             calculate_residue(hcb);
             ccb_status = CAM_UNEXP_BUSFREE;
             /* request the further code to freeze the queue */
             hcb-&gt;ccb-&gt;ccb_h.status |= CAM_DEV_QFRZN;
             lun_to_freeze = hcb-&gt;lun;
         }
         break;</programlisting>
 
     <para>If the target refuses to accept tags we notify CAM about
       that and return all commands for this LUN:</para>
 
     <programlisting>    case TAGS_REJECTED:
         /* report the event */
         neg.flags = 0 &amp; ~CCB_TRANS_TAG_ENB;
         neg.valid = CCB_TRANS_TQ_VALID;
         xpt_async(AC_TRANSFER_NEG, hcb-&gt;ccb-&gt;ccb_h.path, &amp;neg);
 
         ccb_status = CAM_MSG_REJECT_REC;
         /* request the further code to freeze the queue */
         hcb-&gt;ccb-&gt;ccb_h.status |= CAM_DEV_QFRZN;
         lun_to_freeze = hcb-&gt;lun;
         break;</programlisting>
 
     <para>Then we check a number of other conditions, with processing
       basically limited to setting the CCB status:</para>
 
     <programlisting>    case SELECTION_TIMEOUT:
         ccb_status = CAM_SEL_TIMEOUT;
         /* request the further code to freeze the queue */
         hcb-&gt;ccb-&gt;ccb_h.status |= CAM_DEV_QFRZN;
         lun_to_freeze = CAM_LUN_WILDCARD;
         break;
     case PARITY_ERROR:
         ccb_status = CAM_UNCOR_PARITY;
         break;
     case DATA_OVERRUN:
     case ODD_WIDE_TRANSFER:
         ccb_status = CAM_DATA_RUN_ERR;
         break;
     default:
         /* all other errors are handled in a generic way */
         ccb_status = CAM_REQ_CMP_ERR;
         /* request the further code to freeze the queue */
         hcb-&gt;ccb-&gt;ccb_h.status |= CAM_DEV_QFRZN;
         lun_to_freeze = CAM_LUN_WILDCARD;
         break;
     }</programlisting>
 
     <para>Then we check if the error was serious enough to freeze the
       input queue until it gets processed, and do so if it is:</para>
 
     <programlisting>    if(hcb-&gt;ccb-&gt;ccb_h.status &amp; CAM_DEV_QFRZN) {
         /* freeze the queue */
         xpt_freeze_devq(hcb-&gt;ccb-&gt;ccb_h.path, /*count*/1);
 
         /* re-queue all commands for this target/LUN back to CAM */
 
         for(h = softc-&gt;first_queued_hcb; h != NULL; h = hh) {
             hh = h-&gt;next;
 
             if(targ == h-&gt;target
             &amp;&amp; (lun_to_freeze == CAM_LUN_WILDCARD || lun_to_freeze == h-&gt;lun) )
                 free_hcb_and_ccb_done(h, h-&gt;ccb, CAM_REQUEUE_REQ);
         }
     }
     free_hcb_and_ccb_done(hcb, hcb-&gt;ccb, ccb_status);
     schedule_next_hcb(softc);
     return;</programlisting>
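 
     <para>The selection of the queued commands to hand back to CAM
       can be sketched as a predicate over target and LUN.  The
       <literal>struct hcb_id</literal> type, the function names and
       the value of <literal>LUN_WILDCARD</literal> (a stand-in for
       CAM_LUN_WILDCARD) are hypothetical:</para>
 
     <programlisting><![CDATA[
```c
#include <assert.h>
#include <stddef.h>

#define LUN_WILDCARD (-1)   /* hypothetical stand-in for CAM_LUN_WILDCARD */

/* Hypothetical addressing part of a queued HCB. */
struct hcb_id {
    int target;
    int lun;
};

/*
 * Return 1 if a queued HCB must be returned to CAM for re-queueing:
 * same target, and either every LUN is frozen or the LUN matches.
 */
int
should_requeue(const struct hcb_id *h, int targ, int lun_to_freeze)
{
    assert(h != NULL);
    return h->target == targ &&
        (lun_to_freeze == LUN_WILDCARD || lun_to_freeze == h->lun);
}

/* Count how many of the n queued entries would be re-queued. */
int
count_requeued(const struct hcb_id *q, int n, int targ, int lun_to_freeze)
{
    int i, count = 0;

    for (i = 0; i < n; i++)
        if (should_requeue(&q[i], targ, lun_to_freeze))
            count++;
    return count;
}
```
]]></programlisting>
 
     <para>With a wildcard LUN, as used for selection timeouts, every
       queued command for the target is handed back.  With a specific
       LUN only the commands for that LUN are.</para>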
 
     <para>This concludes the generic interrupt handling although
       specific controllers may require some additions.</para>
   </sect1>
 
   <sect1 xml:id="scsi-errors">
     <title>Errors Summary</title>
 
     <indexterm><primary>SCSI</primary><secondary>errors</secondary></indexterm>
 
     <para>When executing an I/O request many things may go wrong.  The
       reason for the error can be reported in the CCB status in great
       detail.  Examples of use are spread throughout this document.
       For completeness here is a summary of recommended responses
       for the typical error conditions:</para>
 
     <itemizedlist>
       <listitem>
 	<para><emphasis>CAM_RESRC_UNAVAIL</emphasis> - some resource
 	  is temporarily unavailable and the SIM driver cannot
 	  generate an event when it will become available.  An example
 	  of this resource would be some intra-controller hardware
 	  resource for which the controller does not generate an
 	  interrupt when it becomes available.</para>
       </listitem>
 
       <listitem>
 	<para><emphasis>CAM_UNCOR_PARITY</emphasis> - unrecovered
 	  parity error occurred</para>
       </listitem>
 
       <listitem>
 	<para><emphasis>CAM_DATA_RUN_ERR</emphasis> - data overrun or
 	  unexpected data phase (going in other direction than
 	  specified in CAM_DIR_MASK) or odd transfer length for wide
 	  transfer</para>
       </listitem>
 
       <listitem>
 	<para><emphasis>CAM_SEL_TIMEOUT</emphasis> - selection timeout
 	  occurred (target does not respond)</para>
       </listitem>
 
       <listitem>
 	<para><emphasis>CAM_CMD_TIMEOUT</emphasis> - command timeout
 	  occurred (the timeout function ran)</para>
       </listitem>
 
       <listitem>
 	<para><emphasis>CAM_SCSI_STATUS_ERROR</emphasis> - the device
 	  returned error</para>
       </listitem>
 
       <listitem>
 	<para><emphasis>CAM_AUTOSENSE_FAIL</emphasis> - the device
 	  returned error and the REQUEST SENSE COMMAND failed</para>
       </listitem>
 
       <listitem>
 	<para><emphasis>CAM_MSG_REJECT_REC</emphasis> - MESSAGE REJECT
 	  message was received</para>
       </listitem>
 
       <listitem>
 	<para><emphasis>CAM_SCSI_BUS_RESET</emphasis> - received SCSI
 	  bus reset</para>
       </listitem>
 
       <listitem>
 	<para><emphasis>CAM_REQ_CMP_ERR</emphasis> -
 	  <quote>impossible</quote> SCSI phase occurred or something
 	  else as weird or just a generic error if further detail is
 	  not available</para>
       </listitem>
 
       <listitem>
 	<para><emphasis>CAM_UNEXP_BUSFREE</emphasis> - unexpected
 	  disconnect occurred</para>
       </listitem>
 
       <listitem>
 	<para><emphasis>CAM_BDR_SENT</emphasis> - BUS DEVICE RESET
 	  message was sent to the target</para>
       </listitem>
 
       <listitem>
 	<para><emphasis>CAM_UNREC_HBA_ERROR</emphasis> - unrecoverable
 	  Host Bus Adapter Error</para>
       </listitem>
 
       <listitem>
 	<para><emphasis>CAM_REQ_TOO_BIG</emphasis> - the request was
 	  too large for this controller</para>
       </listitem>
 
       <listitem>
 	<para><emphasis>CAM_REQUEUE_REQ</emphasis> - this request
 	  should be re-queued to preserve transaction ordering.  This
 	  typically occurs when the SIM recognizes an error that
 	  should freeze the queue and must place other queued requests
 	  for the target at the sim level back into the XPT queue.
 	  Typical cases of such errors are selection timeouts, command
 	  timeouts and other similar conditions.  In such cases the
 	  troublesome command returns the status indicating the error,
 	  and the other commands which have not been sent to the bus
 	  yet get re-queued.</para>
       </listitem>
 
       <listitem>
 	<para><emphasis>CAM_LUN_INVALID</emphasis> - the LUN ID in the
 	  request is not supported by the SCSI controller</para>
       </listitem>
 
       <listitem>
 	<para><emphasis>CAM_TID_INVALID</emphasis> - the target ID in
 	  the request is not supported by the SCSI controller</para>
       </listitem>
     </itemizedlist>
   </sect1>
 
   <sect1 xml:id="scsi-timeout">
     <title>Timeout Handling</title>
 
     <para>When the timeout for an HCB expires, that request should be
       aborted, just like with an XPT_ABORT request.  The only
       difference is that the returned status of the aborted request
       should be CAM_CMD_TIMEOUT instead of CAM_REQ_ABORTED (that is
       why the abort is better implemented as a function).  But
       there is one more possible problem: what if the abort request
       itself gets stuck?  In this case the SCSI bus should be
       reset, just like with an XPT_RESET_BUS request (and the idea
       about implementing it as a function called from both places
       applies here too).  Also, we should reset the whole SCSI bus if a
       device reset request gets stuck.  So the timeout function would
       look like:</para>
 
     <programlisting>static void
 xxx_timeout(void *arg)
 {
     struct xxx_hcb *hcb = (struct xxx_hcb *)arg;
     struct xxx_softc *softc;
     struct ccb_hdr *ccb_h;
 
     softc = hcb-&gt;softc;
     ccb_h = &amp;hcb-&gt;ccb-&gt;ccb_h;
 
     if(hcb-&gt;flags &amp; HCB_BEING_ABORTED
     || ccb_h-&gt;func_code == XPT_RESET_DEV) {
         xxx_reset_bus(softc);
     } else {
         xxx_abort_ccb(hcb-&gt;ccb, CAM_CMD_TIMEOUT);
     }
 }</programlisting>
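 
     <para>The escalation rule in the function above can be isolated
       into a pure decision function, which makes it easy to check
       separately from the hardware.  The enum names and the value of
       the flag bit here are hypothetical stand-ins for the driver's
       own definitions and for the CAM function codes:</para>
 
     <programlisting><![CDATA[
```c
#include <assert.h>

#define HCB_BEING_ABORTED 0x01  /* hypothetical flag bit */

/* Stand-ins for XPT_SCSI_IO and XPT_RESET_DEV. */
enum func_code { FUNC_SCSI_IO, FUNC_RESET_DEV };

enum timeout_action { ACTION_ABORT_CCB, ACTION_RESET_BUS };

/*
 * Decide what a timeout handler should do for a timed-out request.
 * A request that is already being aborted, or that is itself a
 * device reset, escalates to a bus reset; anything else is aborted.
 */
enum timeout_action
timeout_action_for(unsigned int hcb_flags, enum func_code code)
{
    assert(code == FUNC_SCSI_IO || code == FUNC_RESET_DEV);
    if ((hcb_flags & HCB_BEING_ABORTED) || code == FUNC_RESET_DEV)
        return ACTION_RESET_BUS;
    return ACTION_ABORT_CCB;
}
```
]]></programlisting>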
 
     <para>When we abort a request all the other disconnected requests
       to the same target/LUN get aborted too.  This raises a
       question: should we return them with status CAM_REQ_ABORTED or
       CAM_CMD_TIMEOUT?  The current drivers use CAM_CMD_TIMEOUT.  This
       seems logical because if one request timed out then probably
       something really bad is happening to the device, so if the
       others were left undisturbed they would time out by
       themselves.</para>
   </sect1>
 </chapter>
diff --git a/en_US.ISO8859-1/books/arch-handbook/usb/chapter.xml b/en_US.ISO8859-1/books/arch-handbook/usb/chapter.xml
index 6fa02b2d59..1d34c3192b 100644
--- a/en_US.ISO8859-1/books/arch-handbook/usb/chapter.xml
+++ b/en_US.ISO8859-1/books/arch-handbook/usb/chapter.xml
@@ -1,721 +1,721 @@
 <?xml version="1.0" encoding="iso-8859-1"?>
 <!--
      The FreeBSD Documentation Project
 
      $FreeBSD$
 -->
 <chapter xmlns="http://docbook.org/ns/docbook"
   xmlns:xlink="http://www.w3.org/1999/xlink" version="5.0"
   xml:id="usb">
   <info>
     <title>USB Devices</title>
 
     <authorgroup>
       <author>
 	<personname>
 	  <firstname>Nick</firstname>
 	  <surname>Hibma</surname>
 	</personname>
 	<contrib>Written by </contrib>
       </author>
     </authorgroup>
     <authorgroup>
       <author>
 	<personname>
 	  <firstname>Murray</firstname>
 	  <surname>Stokely</surname>
 	</personname>
 	<contrib>Modifications for Handbook made by </contrib>
       </author>
     </authorgroup>
   </info>
 
   <sect1 xml:id="usb-intro">
     <title>Introduction</title>
 
     <indexterm><primary>Universal Serial Bus
 	(USB)</primary></indexterm>
     <indexterm><primary>NetBSD</primary></indexterm>
 
     <para>The Universal Serial Bus (USB) is a new way of attaching
       devices to personal computers.  The bus architecture features
       two-way communication and has been developed as a response to
       devices becoming smarter and requiring more interaction with the
       host.  USB support is included in all current PC chipsets and is
       therefore available in all recently built PCs.  Apple's
       introduction of the USB-only iMac has been a major incentive for
       hardware manufacturers to produce USB versions of their devices.
       The future PC specifications specify that all legacy connectors
       on PCs should be replaced by one or more USB connectors,
       providing generic plug and play capabilities.  Support for USB
       hardware was available at a very early stage in NetBSD and was
       developed by Lennart Augustsson for the NetBSD project.  The
       code has been ported to FreeBSD and we are currently maintaining
       a shared code base.  For the implementation of the USB subsystem
       a number of features of USB are important.</para>
 
     <para><emphasis>Lennart Augustsson has done most of the
 	implementation of the USB support for the NetBSD project.
 	Many thanks for this incredible amount of work.  Many thanks
 	also to Ardy and Dirk for their comments and proofreading of
 	this paper.</emphasis></para>
 
     <itemizedlist>
 
       <listitem>
 	<para>Devices connect to ports on the computer directly or on
 	  devices called hubs, forming a treelike device
 	  structure.</para>
       </listitem>
 
       <listitem>
 	<para>The devices can be connected and disconnected at run
 	  time.</para>
       </listitem>
 
       <listitem>
 	<para>Devices can suspend themselves and trigger resumes of
 	  the host system.</para>
       </listitem>
 
       <listitem>
 	<para>As the devices can be powered from the bus, the host
 	  software has to keep track of power budgets for each
 	  hub.</para>
       </listitem>
 
       <listitem>
 	<para>Different quality of service requirements by the
 	  different device types, together with the maximum of 126
 	  devices that can be connected to the same bus, require
 	  proper scheduling of transfers on the shared bus to take
 	  full advantage of the 12Mbps bandwidth available (over
 	  400Mbps with USB 2.0).</para>
       </listitem>
 
       <listitem>
 	<para>Devices are intelligent and contain easily accessible
 	  information about themselves.</para>
       </listitem>
 
     </itemizedlist>
 
     <para>The development of drivers for the USB subsystem and devices
       connected to it is supported by the specifications that have
       been developed and will be developed.  These specifications are
       publicly available from the USB home pages.  Apple has been very
       strong in pushing for standards based drivers, by making drivers
       for the generic classes available in their operating system
       MacOS and discouraging the use of separate drivers for each new
       device.  This chapter tries to collate essential information for
       a basic understanding of the USB 2.0 implementation stack in
       FreeBSD/NetBSD.  It is recommended however to read it together
       with the relevant 2.0 specifications and other developer
       resources:</para>
 
     <itemizedlist>
       <listitem>
 	<para>USB 2.0 Specification (<link
 	    xlink:href="http://www.usb.org/developers/docs/usb20_docs/">http://www.usb.org/developers/docs/usb20_docs/</link>)</para>
       </listitem>
 
       <listitem>
 	<para>Universal Host Controller Interface
 	  (<acronym>UHCI</acronym>) Specification (<link
 	    xlink:href="ftp://ftp.netbsd.org/pub/NetBSD/misc/blymn/uhci11d.pdf">ftp://ftp.netbsd.org/pub/NetBSD/misc/blymn/uhci11d.pdf</link>)</para>
       </listitem>
 
       <listitem>
 	<para>Open Host Controller Interface (<acronym>OHCI</acronym>)
 	  Specification (<link
 	    xlink:href="ftp://ftp.compaq.com/pub/supportinformation/papers/hcir1_0a.pdf">ftp://ftp.compaq.com/pub/supportinformation/papers/hcir1_0a.pdf</link>)</para>
       </listitem>
 
       <listitem>
 	<para>Developer section of <acronym>USB</acronym> home page
 	  (<link
 	    xlink:href="http://www.usb.org/developers/">http://www.usb.org/developers/</link>)</para>
       </listitem>
     </itemizedlist>
 
     <sect2>
       <title>Structure of the USB Stack</title>
 
       <para>The USB support in FreeBSD can be split into three layers.
 	The lowest layer contains the host controller driver,
 	providing a generic interface to the hardware and its
 	scheduling facilities.  It supports initialisation of the
 	hardware, scheduling of transfers and handling of completed
 	and/or failed transfers.  Each host controller driver
 	implements a virtual hub providing hardware independent access
 	to the registers controlling the root ports on the back of the
 	machine.</para>
 
       <para>The middle layer handles the device connection and
 	disconnection, basic initialisation of the device, driver
 	selection, the communication channels (pipes) and does
 	resource management.  This services layer also controls the
 	default pipes and the device requests transferred over
 	them.</para>
 
       <para>The top layer contains the individual drivers supporting
 	specific (classes of) devices.  These drivers implement the
 	protocol that is used over the pipes other than the default
 	pipe.  They also implement additional functionality to make
 	the device available to other parts of the kernel or userland.
 	They use the USB driver interface (USBDI) exposed by the
 	services layer.</para>
     </sect2>
   </sect1>
 
   <sect1 xml:id="usb-hc">
     <title>Host Controllers</title>
 
     <indexterm><primary>USB</primary><secondary>host
 	controllers</secondary></indexterm>
     <para>The host controller (HC) controls the transmission of
       packets on the bus.  Frames of 1 millisecond are used.  At the
       start of each frame the host controller generates a Start of
       Frame (SOF) packet.</para>
 
     <para>The SOF packet is used to synchronise to the start of the
       frame and to keep track of the frame number.  Within each frame
       packets are transferred, either from host to device (out) or
       from device to host (in).  Transfers are always initiated by the
       host (polled transfers).  Therefore there can only be one host
       per USB bus.  Each transfer of a packet has a status stage in
       which the recipient of the data can return either ACK
       (acknowledge reception), NAK (retry), STALL (error condition) or
       nothing (garbled data stage, device not available or
       disconnected).  Section 8.5 of the USB 2.0 Specification
       describes packets in more detail.  Four different
       types of transfers can occur on a USB bus: control, bulk,
       interrupt and isochronous.  The types of transfers and their
       characteristics are described below.</para>
 
     <para>Large transfers between the device on the USB bus and the
       device driver are split up into multiple packets by the host
       controller or the HC driver.</para>
 
     <para>Device requests (control transfers) to the default endpoints
       are special.  They consist of two or three phases: SETUP, DATA
       (optional) and STATUS.  The set-up packet is sent to the device.
       If there is a data phase, the direction of the data packet(s) is
       given in the set-up packet.  The direction in the status phase
       is the opposite of the direction during the data phase, or IN if
       there was no data phase.</para>
 
     <para>The host controller hardware also provides registers with
       the current status of the root ports and the changes that have
       occurred since the last reset of the status change register.
       Access to these registers is provided through a virtualised hub
       as suggested in the USB specification.  The virtual hub must
       comply with the hub device class given in chapter 11 of that
       specification.  It must provide a default pipe through which
       device requests can be sent to it.  It returns the standard and
       hub class specific set of descriptors.  It should also provide
       an interrupt pipe that reports changes happening at its
       ports.</para>
 
     <para>There are currently two specifications for host controllers
       available: Universal Host Controller Interface
       (<acronym>UHCI</acronym>) from Intel and Open Host Controller
       Interface (<acronym>OHCI</acronym>) from Compaq, Microsoft, and
       National Semiconductor.  The <acronym>UHCI</acronym>
       specification has been designed to reduce hardware complexity
       by requiring the host controller driver to supply a complete
       schedule of the transfers for each frame.  OHCI type
       controllers are much more independent: they provide a more
       abstract interface and do a lot of the work themselves.</para>
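 
     <para>The phases of a control transfer described earlier in this
       section can be sketched in C.  This is a minimal illustration,
       not code from the FreeBSD USB stack: the structure mirrors the
       8-byte set-up packet defined in chapter 9 of the USB
       specification, while the enumeration and the two helper
       functions are hypothetical names introduced here.</para>
 
     <programlisting><![CDATA[
 ```c
 #include <assert.h>
 #include <stddef.h>
 #include <stdint.h>
 
 /* Direction of a packet, as seen from the host. */
 enum xfer_dir { DIR_OUT = 0, DIR_IN = 1 };
 
 /*
  * The 8-byte set-up packet.  Field names follow chapter 9 of the
  * USB specification; multi-byte fields are little-endian on the wire.
  */
 struct setup_packet {
 	uint8_t  bmRequestType;	/* bit 7: direction of the data phase */
 	uint8_t  bRequest;
 	uint16_t wValue;
 	uint16_t wIndex;
 	uint16_t wLength;	/* size of the data phase, 0 if none */
 };
 
 /* Hypothetical helper: direction of the (optional) data phase. */
 static enum xfer_dir
 data_stage_direction(const struct setup_packet *sp)
 {
 	assert(sp != NULL);
 	return ((sp->bmRequestType & 0x80) ? DIR_IN : DIR_OUT);
 }
 
 /*
  * Hypothetical helper: the status phase runs opposite to the data
  * phase, or IN if the request has no data phase.
  */
 static enum xfer_dir
 status_stage_direction(const struct setup_packet *sp)
 {
 	assert(sp != NULL);
 	if (sp->wLength == 0)
 		return (DIR_IN);
 	return (data_stage_direction(sp) == DIR_IN ? DIR_OUT : DIR_IN);
 }
 ```
 ]]></programlisting>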
 
     <sect2>
       <title>UHCI</title>
 
       <indexterm>
 	<primary>USB</primary>
 	<secondary>UHCI</secondary>
       </indexterm>
 
       <para>The UHCI host controller maintains a framelist with 1024
 	pointers to per frame data structures.  It understands two
 	different data types: transfer descriptors (TD) and queue
 	heads (QH).  Each TD represents a packet to be communicated to
 	or from a device endpoint.  QHs are a means to group TDs (and
 	QHs) together.</para>
 
       <para>Each transfer consists of one or more packets.  The UHCI
 	driver splits large transfers into multiple packets.  For
 	every transfer, apart from isochronous transfers, a QH is
 	allocated.  For every type of transfer these QHs are collected
 	at a QH for that type.  Isochronous transfers have to be
 	executed first because of the fixed latency requirement and
 	are directly referred to by the pointer in the framelist.  The
 	last isochronous TD refers to the QH for interrupt transfers
 	for that frame.  All QHs for interrupt transfers point at the
 	QH for control transfers, which in turn points at the QH for
 	bulk transfers.  The following diagram gives a graphical
 	overview of this:</para>
 
       <para>This results in the following schedule being run in each
 	frame.  After fetching the pointer for the current frame from
 	the framelist, the controller first executes the TDs for all
 	the isochronous packets in that frame.  The last of these TDs
 	refers to the QH for the interrupt transfers for that frame.
 	The host controller will then descend from that QH to the QHs
 	for the individual interrupt transfers.  After finishing that
 	queue, the QH for the interrupt transfers will refer the
 	controller to the QH for all control transfers.  It will
 	execute all the subqueues scheduled there, followed by all the
 	transfers queued at the bulk QH.</para>
 
       <para>To facilitate the handling of finished or failed
 	transfers, different types of interrupts are generated by the
 	hardware at the end of each frame.  In the last TD for a
 	transfer the Interrupt-On-Completion bit is set by the HC
 	driver to flag an interrupt when the transfer has completed.
 	An error interrupt is flagged if a TD reaches its maximum
 	error count.  If the short packet detect bit is set in a TD
 	and less than the set packet length is transferred, an
 	interrupt is flagged to notify the controller driver of the
 	completed transfer.  It is the host controller driver's task
 	to find out which transfer has completed or produced an error.
 	When called, the interrupt service routine will locate all the
 	finished transfers and call their callbacks.</para>
 
       <para>Refer to the <acronym>UHCI</acronym> Specification for a
 	more elaborate description.</para>
 
     </sect2>
 
     <sect2>
       <title>OHCI</title>
 
       <indexterm>
 	<primary>USB</primary>
 	<secondary>OHCI</secondary>
       </indexterm>
 
       <para>Programming an OHCI host controller is much simpler.  The
 	controller assumes that a set of endpoints is available, and
 	is aware of scheduling priorities and the ordering of the
 	types of transfers in a frame.  The main data structure used
 	by the host controller is the endpoint descriptor (ED) to
 	which a queue of transfer descriptors (TDs) is attached.  The
 	ED contains the maximum packet size allowed for an endpoint
 	and the controller hardware does the splitting into packets.
 	The pointers to the data buffers are updated after each
 	transfer and when the start and end pointer are equal, the TD
 	is retired to the done-queue.  The four types of endpoints
 	(interrupt, isochronous, control, and bulk) have their own
 	queues.  Control and bulk endpoints are each queued on their
 	own queue.  Interrupt EDs are queued in a tree, with the level
 	in the tree defining the frequency at which they run.</para>
 
       <para>The schedule being run by the host controller in each
 	frame looks as follows.  The controller will first run the
 	non-periodic control and bulk queues, up to a time limit set
 	by the HC driver.  Then the interrupt transfers for that frame
 	number are run, by using the lower five bits of the frame
 	number as an index into level 0 of the tree of interrupt EDs.
 	At the end of this tree the isochronous EDs are connected and
 	these are traversed subsequently.  The isochronous TDs contain
 	the frame number of the first frame the transfer should be run
 	in.  After all the periodic transfers have been run, the
 	control and bulk queues are traversed again.  Periodically the
 	interrupt service routine is called to process the done queue
 	and call the callbacks for each transfer and reschedule
 	interrupt and isochronous endpoints.</para>
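 
       <para>The indexing step mentioned above can be illustrated with
 	a few lines of C.  This is only a sketch: the constant and the
 	function name are hypothetical and do not come from the OHCI
 	driver.  It assumes nothing beyond what the text states, namely
 	that the lower five bits of the frame number select one of 32
 	entry points at level 0 of the interrupt tree.</para>
 
       <programlisting><![CDATA[
 ```c
 #include <assert.h>
 #include <stdint.h>
 
 /* Five index bits give 32 entry points at level 0 of the tree. */
 #define INTR_TREE_SLOTS	32
 
 /* Hypothetical helper: level 0 slot used for a given frame number. */
 static unsigned int
 intr_slot_for_frame(uint16_t frame_number)
 {
 	unsigned int slot = frame_number & (INTR_TREE_SLOTS - 1);
 
 	assert(slot < INTR_TREE_SLOTS);
 	return (slot);
 }
 ```
 ]]></programlisting>
 
       <para>Under this scheme an ED that can be reached from every
 	slot is visited in every frame, while an ED linked from a
 	single slot is visited once every 32 frames.</para>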
 
       <para>See the <acronym>OHCI</acronym> Specification for a more
 	elaborate description.</para>
 
       <para>The middle layer provides access to the device in a
 	controlled way and maintains resources in use by the different
 	drivers and the services layer.  The layer takes care of the
 	following aspects:</para>
 
       <itemizedlist>
 	<listitem>
 	  <para>The device configuration information</para>
 	</listitem>
 	<listitem>
 	  <para>The pipes to communicate with a device</para>
 	</listitem>
 	<listitem>
 	  <para>Probing, attaching to and detaching from a
 	    device.</para>
 	</listitem>
       </itemizedlist>
     </sect2>
   </sect1>
 
   <sect1 xml:id="usb-dev">
     <title>USB Device Information</title>
 
     <sect2>
       <title>Device Configuration Information</title>
 
       <para>Each device provides different levels of configuration
 	information.  Each device has one or more configurations, of
 	which one is selected during probe/attach.  A configuration
 	provides power and bandwidth requirements.  Within each
 	configuration there can be multiple interfaces.  A device
 	interface is a collection of endpoints.  For example, USB
 	speakers can have an interface for the audio data (Audio
 	Class) and an interface for the knobs, dials and buttons (HID
 	Class).  All interfaces in a configuration are active at the
 	same time and can be attached to by different drivers.  Each
 	interface can have alternates, providing different quality of
 	service parameters.  In cameras, for example, this is used to
 	provide different frame sizes and numbers of frames per
 	second.</para>
 
       <para>Within each interface, 0 or more endpoints can be
 	specified.  Endpoints are the unidirectional access points for
 	communicating with a device.  They provide buffers to
 	temporarily store incoming or outgoing data from the device.
 	Each endpoint has a unique address within a configuration: the
 	endpoint's number plus its direction.  The default endpoint,
 	endpoint 0, is not part of any interface and is available in
 	all configurations.  It is managed by the services layer and
 	is not directly available to device drivers.</para>
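 
       <para>The combination of endpoint number and direction can be
 	sketched as follows.  The bit layout is that of the
 	<literal>bEndpointAddress</literal> field of the endpoint
 	descriptor in the USB specification (bit 7 for the direction,
 	the low four bits for the number); the macro and function
 	names are hypothetical and chosen for this illustration
 	only.</para>
 
       <programlisting><![CDATA[
 ```c
 #include <assert.h>
 #include <stdint.h>
 
 #define EP_DIR_IN	0x80	/* bit 7 set: device to host */
 #define EP_NUM_MASK	0x0f	/* bits 3..0: endpoint number */
 
 /* Hypothetical helper: build an address from number and direction. */
 static uint8_t
 ep_address(unsigned int number, int is_in)
 {
 	assert(number <= EP_NUM_MASK);
 	return ((uint8_t)(number | (is_in ? EP_DIR_IN : 0)));
 }
 
 /* Hypothetical helper: extract the endpoint number. */
 static unsigned int
 ep_number(uint8_t addr)
 {
 	return (addr & EP_NUM_MASK);
 }
 
 /* Hypothetical helper: non-zero for an IN (device to host) endpoint. */
 static int
 ep_is_in(uint8_t addr)
 {
 	return ((addr & EP_DIR_IN) != 0);
 }
 ```
 ]]></programlisting>
 
       <para>In this encoding endpoint 1 IN and endpoint 1 OUT have
 	different addresses, which is why both can exist in the same
 	configuration.</para>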
 <!--
 This part is unclear, is it an unformatted code example?
       <para>Level 0 Level 1 Level 2 Slot 0</para>
       <para>Slot 3 Slot 2 Slot 1</para>
       <para>(Only 4 out of 32 slots shown)</para>
       -->
 
       <para>This hierarchical configuration information is described
 	in the device by a standard set of descriptors (see section
 	9.6 of the USB specification).  They can be requested through
 	the Get Descriptor Request.  The services layer caches these
 	descriptors to avoid unnecessary transfers on the USB bus.
 	Access to the descriptors is provided through function
 	calls.</para>
 
       <itemizedlist>
 	<listitem>
 	  <para>Device descriptors: General information about the
 	    device, like Vendor, Product and Revision Id, supported
 	    device class, subclass and protocol if applicable, maximum
 	    packet size for the default endpoint, etc.</para>
 	</listitem>
 
 	<listitem>
 	  <para>Configuration descriptors: The number of interfaces in
 	    this configuration, suspend and resume functionality
 	    supported and power requirements.</para>
 	</listitem>
 
 	<listitem>
 	  <para>Interface descriptors: interface class, subclass and
 	    protocol if applicable, number of alternate settings for
 	    the interface and the number of endpoints.</para>
 	</listitem>
 
 	<listitem>
 	  <para>Endpoint descriptors: Endpoint address, direction and
 	    type, maximum packet size supported and polling frequency
 	    if type is interrupt endpoint.  There is no descriptor for
 	    the default endpoint (endpoint 0) and it is never counted
 	    in an interface descriptor.</para>
 	</listitem>
 
 	<listitem>
 	  <para>String descriptors: In the other descriptors string
 	    indices are supplied for some fields.  These can be used to
 	    retrieve descriptive strings, possibly in multiple
 	    languages.</para>
 	</listitem>
       </itemizedlist>
 
       <para>Class specifications can add their own descriptor types
 	that are available through the Get Descriptor Request.</para>
 
     </sect2>
 
     <sect2>
       <title>Pipes</title>
 
       <para>Communication to endpoints on a device flows through
 	so-called pipes.  Drivers submit transfers for an endpoint to
 	its pipe and either provide a callback to be called on
 	completion or failure of the transfer (asynchronous transfers)
 	or wait for completion (synchronous transfers).  Transfers to
 	an endpoint are serialised in the pipe.  A transfer can either
 	complete, fail or time out (if a time-out has been set).</para>
 
       <para>There are two types of time-outs for transfers.  The
 	first is a time-out on the USB bus (milliseconds).  These
 	time-outs are seen as failures and can be due to disconnection
 	of the device.  The second form of time-out is implemented in
 	software and is triggered when a transfer does not complete
 	within a specified amount of time (seconds).  These are caused
 	by a device negatively acknowledging (NAK) the transferred
 	packets, because the device is not ready to receive data, or
 	because of buffer under- or overrun or protocol errors.</para>
 
       <para>If a transfer over a pipe is larger than the maximum
 	packet size specified in the associated endpoint descriptor,
 	the host controller (OHCI) or the HC driver (UHCI) will split
 	the transfer into packets of maximum packet size, with the
 	last packet possibly smaller than the maximum packet
 	size.</para>
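 
       <para>The splitting rule above amounts to a small calculation,
 	sketched here in C.  The structure and function names are
 	hypothetical, and the sketch assumes a transfer of at least one
 	byte; zero-length packets are not modelled.</para>
 
       <programlisting><![CDATA[
 ```c
 #include <assert.h>
 #include <stddef.h>
 
 struct packet_plan {
 	size_t npackets;	/* total number of packets */
 	size_t last_len;	/* length of the final packet */
 };
 
 /*
  * Hypothetical helper: split a transfer of xfer_len bytes into
  * packets of at most max_packet bytes.  All packets but the last
  * are exactly max_packet bytes long.
  */
 static struct packet_plan
 split_transfer(size_t xfer_len, size_t max_packet)
 {
 	struct packet_plan plan;
 
 	assert(max_packet > 0);
 	assert(xfer_len > 0);
 	plan.npackets = (xfer_len + max_packet - 1) / max_packet;
 	plan.last_len = xfer_len - (plan.npackets - 1) * max_packet;
 	return (plan);
 }
 ```
 ]]></programlisting>
 
       <para>For a transfer of 200 bytes and a maximum packet size of
 	64 bytes this gives three packets of 64 bytes followed by one
 	packet of 8 bytes.</para>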
 
       <para>Sometimes it is not a problem for a device to return less
 	data than requested.  For example, a bulk-in transfer to a
 	modem might request 200 bytes of data, but the modem has only
 	5 bytes available at that time.  The driver can set the short
 	packet (SPD) flag.  It allows the host controller to accept a
 	packet even if the amount of data transferred is less than
 	requested.  This flag is only valid for in-transfers, as the
 	amount of data to be sent to a device is always known
 	beforehand.</para>
 
       <para>If an unrecoverable error occurs in a device during a
 	transfer, the pipe is stalled.  Before any more data is
 	accepted or sent, the driver needs to resolve the cause of the
 	stall and clear the endpoint stall condition by sending the
 	clear endpoint halt device request over the default pipe.  The
 	default endpoint should never stall.</para>
 
       <para>There are four different types of endpoints and
 	corresponding pipes:</para>
 
       <itemizedlist>
 	<listitem>
 	  <para>Control pipe / default pipe: There is one control pipe
 	    per device, connected to the default endpoint (endpoint
 	    0).  The pipe carries the device requests and associated
 	    data.  The difference between transfers over the default
 	    pipe and other pipes is that the protocol for the
 	    transfers is described in the USB specification.  These
 	    requests are used to reset and configure the device.  A
 	    basic set of commands that must be supported by each
 	    device is provided in chapter 9 of the USB specification.
 	    The commands supported on this pipe can be extended by a
 	    device class specification to support additional
 	    functionality.</para>
 	</listitem>
 
 	<listitem>
 	  <para>Bulk pipe: This is the USB equivalent to a raw
 	    transmission medium.</para>
 	</listitem>
 
 	<listitem>
 	  <para>Interrupt pipe: The host sends a request for data to
 	    the device and if the device has nothing to send, it will
 	    NAK the data packet.  Interrupt transfers are scheduled at
 	    a frequency specified when creating the
 	    pipe.</para>
 	</listitem>
 
 	<listitem>
 	  <para>Isochronous pipe: These pipes are intended for
 	    isochronous data, for example video or audio streams, with
 	    fixed latency, but no guaranteed delivery.  Some support
 	    for pipes of this type is available in the current
 	    implementation.  Packets in control, bulk and interrupt
 	    transfers are retried if an error occurs during
 	    transmission or the device acknowledges the packet
 	    negatively (NAK) due to, for example, a lack of buffer
 	    space to store the incoming data.  Isochronous packets
 	    are, however, not retried in case of failed delivery or
 	    NAK of a packet, as this might violate the timing
 	    constraints.</para>
 	</listitem>
       </itemizedlist>
 
       <para>The availability of the necessary bandwidth is calculated
 	during the creation of the pipe.  Transfers are scheduled
 	within frames of 1 millisecond.  The bandwidth allocation
 	within a frame is prescribed by the USB specification, section
 	5.6.  Isochronous and interrupt transfers are allowed to
 	consume up to 90% of the bandwidth within a frame.  Packets
 	for control and bulk transfers are scheduled after all
 	isochronous and interrupt packets and will consume all the
 	remaining bandwidth.</para>
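 
       <para>A rough feel for these numbers can be had from the sketch
 	below.  It is a simplification under stated assumptions: it
 	takes the raw 12Mbps of a full-speed bus and frames of 1
 	millisecond, and ignores bit stuffing and per-packet protocol
 	overhead, which the accounting rules in the USB specification
 	do take into account.  The macro and function names are
 	hypothetical.</para>
 
       <programlisting><![CDATA[
 ```c
 #include <assert.h>
 #include <stdint.h>
 
 #define BUS_BITS_PER_SEC	12000000u	/* full-speed bus, 12Mbps */
 #define FRAMES_PER_SEC		1000u		/* one frame per millisecond */
 
 /* Raw capacity of one frame in bytes: 12000 bits, or 1500 bytes. */
 #define FRAME_BYTES	(BUS_BITS_PER_SEC / FRAMES_PER_SEC / 8)
 
 /* Periodic (isochronous and interrupt) share: 90%, or 1350 bytes. */
 #define PERIODIC_LIMIT	(FRAME_BYTES * 90 / 100)
 
 /*
  * Hypothetical admission check made when a periodic pipe is created:
  * can a pipe needing "wanted" bytes per frame be added when
  * "allocated" bytes per frame are already reserved?
  */
 static int
 periodic_bandwidth_ok(uint32_t allocated, uint32_t wanted)
 {
 	assert(allocated <= PERIODIC_LIMIT);
 	return (wanted <= PERIODIC_LIMIT - allocated);
 }
 ```
 ]]></programlisting>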
 
       <para>More information on scheduling of transfers and bandwidth
 	reclamation can be found in chapter 5 of the USB
 	specification, section 1.3 of the UHCI specification, and
 	section 3.4.2 of the OHCI specification.</para>
     </sect2>
   </sect1>
 
   <sect1 xml:id="usb-devprobe">
     <title>Device Probe and Attach</title>
 
     <indexterm>
       <primary>USB</primary>
       <secondary>probe</secondary>
     </indexterm>
 
     <para>After the notification by the hub that a new device has been
       connected, the services layer switches on the port, providing
       the device with 100 mA of current.  At this point the device is
       in its default state and listening to device address 0.  The
       services layer will proceed to retrieve the various descriptors
       through the default pipe.  After that it will send a Set Address
       request to move the device away from the default device address
       (address 0).</para>
 
     <para>Multiple device drivers might be able to support the device.
       For example, a modem driver might be able to support an ISDN TA
       through the AT compatibility interface.  A driver for that
       specific model of the ISDN adapter might, however, be able to
       provide much better support for this device.  To support this
       flexibility, the probes return priorities indicating their level
       of support.  Support for a specific revision of a product ranks
       highest and the generic driver lowest.  It might also be that
       multiple drivers could attach to one device if there are
       multiple interfaces within one configuration.  Each driver only
       needs to support a subset of the interfaces.</para>
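 
     <para>The selection among competing drivers can be sketched as
       follows.  This is an illustration only: the match levels, the
       structure and the function are hypothetical names, and the
       intermediate levels between the generic driver and a specific
       product revision are assumptions added for the example.  The
       text above fixes only the two ends of the scale.</para>
 
     <programlisting><![CDATA[
 ```c
 #include <assert.h>
 #include <stddef.h>
 
 /* Hypothetical match levels, from no match to most specific. */
 enum match_level {
 	MATCH_NONE = 0,
 	MATCH_GENERIC,			/* generic driver: lowest */
 	MATCH_CLASS,			/* assumed intermediate level */
 	MATCH_VENDOR_PRODUCT,		/* assumed intermediate level */
 	MATCH_VENDOR_PRODUCT_REV	/* specific revision: highest */
 };
 
 struct candidate {
 	const char *name;		/* driver name */
 	enum match_level level;		/* value returned by its probe */
 };
 
 /* Return the best matching candidate, or NULL if none matched. */
 static const struct candidate *
 pick_driver(const struct candidate *c, size_t n)
 {
 	const struct candidate *best = NULL;
 	size_t i;
 
 	assert(c != NULL || n == 0);
 	for (i = 0; i < n; i++) {
 		if (c[i].level == MATCH_NONE)
 			continue;
 		if (best == NULL || c[i].level > best->level)
 			best = &c[i];
 	}
 	return (best);
 }
 ```
 ]]></programlisting>
 
     <para>With a generic driver, a class level modem driver and a
       driver for the specific ISDN adapter all probing successfully,
       the adapter specific driver would be chosen.</para>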
 
     <para>The probing for a driver for a newly attached device checks
       first for device specific drivers.  If not found, the probe code
       iterates over all supported configurations until a driver
       attaches in a configuration.  To support devices with multiple
       drivers on different interfaces, the probe iterates over all
       interfaces in a configuration that have not yet been claimed by
       a driver.  Configurations that exceed the power budget for the
       hub are ignored.</para>
 
     <para>During attach the driver should initialise the device to its
       proper state, but not reset it, as this will make the device
       disconnect itself from the bus and restart the probing process
       for it.  To avoid consuming unnecessary bandwidth, drivers
       should not claim the interrupt pipe at attach time, but should
       postpone allocating the pipe until the file is opened and the
       data is actually used.  When the file is closed the pipe should
       be closed again, even though the device might still be
       attached.</para>
 
     <sect2>
       <title>Device Disconnect and Detach</title>
 
       <indexterm>
 	<primary>USB</primary>
 	<secondary>disconnect</secondary>
       </indexterm>
 
       <para>A device driver should expect to receive errors during any
 	transaction with the device.  The design of USB supports and
 	encourages the disconnection of devices at any point in time.
 	Drivers should make sure that they do the right thing when the
 	device disappears.</para>
 
       <para>Furthermore a device that has been disconnected and
 	reconnected will not be reattached at the same device
 	instance.  This might change in the future when more devices
 	support serial numbers (see the device descriptor) or other
 	means of defining an identity for a device have been
 	developed.</para>
 
       <para>The disconnection of a device is signaled by a hub in the
 	interrupt packet delivered to the hub driver.  The status
 	change information indicates which port has seen a connection
 	change.  The device detach methods of all device drivers for
 	the device connected on that port are called and the
 	structures cleaned up.  If the port status indicates that in
 	the meantime a device has been connected to that port, the
 	procedure for probing and attaching the device will be
 	started.  A device reset will produce a disconnect-connect
 	sequence on the hub and will be handled as described
 	above.</para>
     </sect2>
   </sect1>
 
   <sect1 xml:id="usb-protocol">
     <title>USB Drivers Protocol Information</title>
 
     <para>The protocol used over pipes other than the default pipe is
       undefined by the USB specification.  Information on this can be
       found from various sources.  The most accurate source is the
       developers section on the USB home pages.  From these pages, a
       growing number of device class specifications are available.
       These specifications specify what a compliant device should look
       like from a driver perspective, basic functionality it needs to
       provide and the protocol that is to be used over the
       communication channels.  The USB specification includes the
       description of the Hub Class.  A class specification for Human
       Interface Devices (HID) has been created to cater for keyboards,
       tablets, bar-code readers, buttons, knobs, switches, etc.  A
       third example is the class specification for mass storage
       devices.  For a full list of device classes see the developers
       section on the USB home pages.</para>
 
     <para>For many devices, however, the protocol information has not
       yet been published.  Information on the protocol being used might
       be available from the company making the device.  Some companies
       will require you to sign a Non-Disclosure Agreement (NDA)
       before giving you the specifications.  This in most cases
       precludes making the driver open source.</para>
 
     <para>Another good source of information is the Linux driver
       sources, as a number of companies have started to provide
       drivers for Linux for their devices.  It is always a good idea
       to contact the authors of those drivers for their source of
       information.</para>
 
     <para>Example: Human Interface Devices.  The specification for
       Human Interface Devices such as keyboards, mice, tablets,
       buttons, dials, etc. is referred to in other device class
       specifications and is used in many devices.</para>
 
     <para>For example, audio speakers provide endpoints to the
       digital to analogue converters and possibly an extra pipe for a
       microphone.  They also provide a HID endpoint in a separate
       interface for the buttons and dials on the front of the device.
       The same is true for the monitor control class.  It is
       straightforward to build support for these interfaces through
       the available kernel and userland libraries together with the
       HID class driver or the generic driver.  Another device that
       serves as an example for interfaces within one configuration
       driven by different device drivers is a cheap keyboard with
       built-in legacy mouse port.  To avoid having the cost of
       including the hardware for a USB hub in the device,
       manufacturers combined the mouse data received from the PS/2
       port on the back of the keyboard and the key presses from the
       keyboard into two separate interfaces in the same configuration.
       The mouse and keyboard drivers each attach to the appropriate
       interface and allocate the pipes to the two independent
       endpoints.</para>
 
     <indexterm>
       <primary>USB</primary>
       <secondary>firmware</secondary>
     </indexterm>
 
     <para>Example: Firmware download.  Many devices that have been
       developed are based on a general purpose processor with an
-      additional USB core added to it.  Because the development of
+      additional USB core added to it.  Since the development of
       drivers and firmware for USB devices is still very new, many
       devices require the downloading of the firmware after they have
       been connected.</para>
 
     <para>The procedure followed is straightforward.  The device
       identifies itself through a vendor and product Id.  The first
       driver probes and attaches to it and downloads the firmware into
       it.  After that the device soft resets itself and the driver is
       detached.  After a short pause the device announces its presence
       on the bus.  The device will have changed its
       vendor/product/revision Id to reflect the fact that it has been
       supplied with firmware and as a consequence a second driver will
       probe it and attach to it.</para>
 
     <para>An example of these types of devices is the ActiveWire I/O
       board, based on the EZ-USB chip.  For this chip a generic
       firmware downloader is available.  The firmware downloaded into
       the ActiveWire board changes the revision Id.  It will then
       perform a soft reset of the USB part of the EZ-USB chip to
       disconnect from the USB bus and again reconnect.</para>
 
     <para>Example: Mass Storage Devices.  Support for mass storage
       devices is mainly built around existing protocols.  The Iomega
       USB Zip drive is based on the SCSI version of their drive.  The
       SCSI commands and status messages are wrapped in blocks and
       transferred over the bulk pipes to and from the device,
       emulating a SCSI controller over the USB wire.  ATAPI and UFI
       commands are supported in a similar fashion.</para>
 
     <indexterm><primary>ATAPI</primary></indexterm>
 
     <para>The Mass Storage Specification supports two different types
       of wrapping of the command block.  The initial attempt was based
       on sending the command and status through the default pipe and
       using bulk transfers for the data to be moved between the host
       and the device.  Based on experience, a second approach was
       designed that was based on wrapping the command and status
       blocks and sending them over the bulk out and in endpoints.  The
       specification specifies exactly what has to happen when and what
       has to be done in case an error condition is encountered.  The
       biggest challenge when writing drivers for these devices is to
       fit the USB-based protocol into the existing support for mass
       storage devices.  CAM provides hooks to do this in a fairly
       straightforward way.  ATAPI is less simple, as historically the
       IDE interface has never had many different appearances.</para>
 
     <para>The support for the USB floppy from Y-E Data is again less
       straightforward as a new command set has been designed.</para>
   </sect1>
 </chapter>