The SWI-Prolog website (http:www.swi-prolog.org) is written in Prolog and integrates PlDoc to provide a comprehensive searchable online manual.
Version 2 of PlDoc extends the syntax with Markdown markup as specified by Docygen. Based on experience with version 1, PlDoc 2 both tightens some rules to avoid misinterpretations and relaxes others that were considered too conservative.
When developing Prolog source that has to be maintained for a longer period or is developed by a ---possibly distributed--- team some basic quality mechanisms need to be adopted. A shared and well designed codingstyle Covington et al., 2012 is one of them. In addition, documentation of source-files and their primary interfaces as well as a testing framework must be established.
In our view, hitherto existing documentation and testing frameworks fell short realising the basic needs in a lightweight and easy to adopt system. To encourage consistent style, well commented code and test-assisted development, we make sure that
First, we describe the documentation system we developed for SWI-Prolog. In section 11 we motivate our main choices.
Like JavaDoc, the PlDoc infrastructure is based on structured 
comments. Using comments, no changes have to be made to Prolog to 
load the documented source. If the library(pldoc) library 
is loaded, Prolog will not only load the source, but also parse all 
structured comments. It processes the mode-declarations inside the 
comments and stores these as annotations in the Prolog database to 
support the test framework and other runtime and compiletime analysis 
tools that may be developed in the future.
Documentation for all or some of the loaded files can be written to file in either HTML+CSS or LaTeX (see section 10.5) format. Each source file is documented in a single file. In addition, the documentation generator will generate an index file that can be used as an index for a browser or input file for LaTeX for producing nicely typeset document.
To support the developer, the documentation system can be asked to start a web-server that can be used to browse the documentation.
Structured comments come in two flavours, the line-comment (%) based 
one, seen mostly in the Prolog community and the block-comment (/*...*/) 
based one, commonly seen in the Java and C domains. As we cannot 
determine the argument names, type and modes from following (predicate) 
source itself, we must supply this in the comment.1See section 
11. The overall structure of the comment therefore is:
@keyword value, see section 
6)
The /*...*/ style comment starts with
/**<white>. The type and mode declarations start at 
the first non-blank line and are ended by a blank line.
The %-style line comments start with %!<white> 
or, for compatibility reasons, with %%<white>.2The %% 
leader was considered to give too many false positives on arbitrary 
source code. It is still accepted, but invalid comments are silently 
ignored, while invalid comments that start with %! result 
in a warning. The type and mode declaration is ended by the 
first line that starts with a single %. E.g., the following two 
fragments are identical wrt. PlDoc. Skipping blank-lines in /** 
comments allows to start the comment on the second line.
%! predicate(-Arg:type) is nondet % Predicate ...
/** * predicate(-Arg:type) is nondet * * Predicate ... */
The JavaDoc style keyword list starts at the first line starting with @<word>.
An important aspect is documentation of the file or module as a whole, explaining its design, purpose and relation to other modules. In JavaDoc this is the comment that preceeds the class definition. The Prolog equivalent would be to put the module comment in front of the module declaration. The module declaration itself however is an important index to the content of the file and is therefore best kept first.
The general comment-structure for module comments is to use a type 
identifier between angled brackets, followed by the title of the 
section. Currently the only type provided is module. Other 
types may be added later.
/** <module> Prolog documentation processor This module processes structured comments and generates both formal mode declarations from them as well as documentation in the form of HTML or LaTeX. @author Jan Wielemaker @license GPL */
Many predicates can sensibly be called in different ways, e.g. with a specific argument as input or as output. The header of the documentation of a predicate consists of one or more templates, each representing a specific way of calling the predicate.
A template can contain information about types, argument instantiation patterns, determinism and more. The syntax is informally described below:
| <template> | ::= | <head>['//'] 'is' <determinism> | 
| | | <head>['//'] | |
| <determinism> | ::= | 'det' | 
| | | 'semidet' | |
| | | 'failure' | |
| | | 'nondet' | |
| | | 'multi' | |
| <head> | ::= | <functor>'('<argspec> ',' <argspec>')' | 
| | | <functor> | |
| <argspec> | ::= | [<instantiation>]<argname>[':'<type>] | 
| <instantiation> | ::= | '++' | '+' | '-' | '--' | '?' | ':' | '@' | '!' | 
| <type> | ::= | <term> | 
The determinism values originate from Mercury. Their meaning 
is explained in the table below. Informally, det is used 
for deterministic transformations (e.g. arithmetic), semidet 
for tests, nondet and multi for generators. 
The
failure indicator is rarely used. It mostly appears in 
hooks or the recovery goal of catch/3.
| Determinism | Predicate behaviour | 
| det | Succeeds exactly once without a choice point | 
| semidet | Fails or Succeeds exactly once without a choice-point | 
| failure | Always fails | 
| nondet | No constraints on the number of times the predicate succeeds and whether or not it leaves choice-points on the last success. | 
| multi | As nondet, 
but succeeds at least one time. | 
The meanings of the instantiation patterns for individual arguments are:
Users should be aware that calling a predicate with arguments instantiated in a way other than specified by one of the templates may result in errors or unexpected behavior.
Developers should ensure that predicates are steadfast with respect to output arguments (marked - in the template). This means that instantiation of output arguments at call-time does not change the semantics of the goal (it may be used for optimization, though). If this steadfast behavior cannot be guaranteed, -- should be used instead.
In the current version, argument types are represented by an arbitrary term without formal semantics. In future versions we may adopt a formal type system that allows for runtime verification and static type analysis Hermenegildo, 2000, Mycroft & O'Keefe, 1984, Jeffery et al., 2000
%! length(+List:list, -Length:int) is det. %! length(?List:list, -Length:int) is nondet. %! length(?List:list, +Length:int) is det. % % True if List is a list of length Length. % % @compat iso
@see, 
etc.Optionally, the description may be followed by one or more tags. Our tag convention is strongly based on the conventions used by javaDoc. It is adviced to place tags in the order they are described below.
@arg Name Description@arg 
tag. The first word is the name of the argument. The remainder of the 
tag is the description. Arguments declarations normally appear in order 
used by the predicate.@param Name Description@arg, using the JavaDoc tag name.@throws Term Description@error Error Description@throws, but the exception is embedded in error(Error, 
Context).@author Name@version Version@see Text@deprecated Alternative@compat Standards and systems@copyright Copyright holder@license License conditions@bug Bug description@tbd Work to be doneStructured comments that provide part of the documentation are written in Wiki notation, based on TWiki, with some Prolog specific additions.
blockquote element and in LaTeX using the
quote environment. Finally, if the initial indentation is 
16 or more, the paragraph is centered.ul), numbered lists (HTML ol) and
description lists (HTML dl). Each list environment 
is headed by an empty line and each list-item has a special symbol at 
the start, followed by a space. Each subsequent item must be indented at 
exactly the same column. Lists may be nested by starting a new list at a 
higher level of indentation. The list prefixes are:
| * | Bulleted list item | 
| 1. | Numbered list item. Any number from 1..9 is allowed, which allows for proper numbering in the source. Actual numbers in the HTML or LaTeX however are re-generated, starting at 1. | 
| $ Title : Item | Description list item. | 
' . ' to the 
text and reads it using the operator definitions also used to read the 
mode terms. See section 5. Variable 
names encountered in the Term are used for indentifying 
variables in the following
Description. At least one Description must be 
non-empty to avoid confusion with a simple item list.
   * Term1
     Description
   * Term2
     Description
The following predicates are considered Prolog's prime list processing
primitives:
    * [[member/2]]
    * [[append/3]]
||
        | Algorithm    | Time (sec) |
        | Depth first  | 1.0        |
        | Breath first | 0.7        |
        | A*           | 0.3        |
---+ Section level 1 ---++ Section level 2 ---+++ Section level 3 ---++++ Section level 4
In addition, PlDoc recognises the markdown syntax, including 
named sections as defined by doxygen. A section is named 
(labeled) using an optional sequence {\#name}. 
The three code sections below provide examples. Note that # 
section headers should be positioned at the left margin and the
# must be followed by blank space. If the header is 
underlined, the underline is a line that only contains =-==
Section level 1 =============== Section level 2 ---------------
# Section level 1 ## Section level 2 ### Section level 3 #### Section level 4
Section level 1         {#label}
===============
# Section level 1       {#label}
==. Doxygen fence 
lines are also accepted. They contain at least three tilde (~{.pl} is optional.
  ==
  small(X) :-
     X < 2.
  ==
  ~~~{.pl}
    ...,
    format('Hello ~w~n', [World]),
    ...,
  ~~~
The second form of code blocks are indented blocks. Such a 
block must be indented between 4 and 8 characters, relative to the 
indentation of the last preceeding non-blank line. The block is opened 
with a blank line and closed by a blank line or a line that is indented 
less than the indentation of the initial line. It is allowed to have a 
single blank line in the middle of a code block, provided that the next 
line is again indented at least as much as the initial line. The initial 
line as well as a line that follows a blank line may not be a valid list 
opening line or a table row, i.e., it may not start with one of
*- followed by a space or |.
_) or asterisk (*). Below are 
three examples, the last two of which are valid markdown.
--
***
                - - -
Text emphasis is a combination of old plaintext conventions in Usenet and E-mail and the doxygen version of markdown. Table 1 shows the font-changing constructions. The phrase limited context means that
*_ must be 
preceeded by white space or a character from the set <{([,:; 
and must be followed by an alphanumerical character.
*_ may not be 
followed by an alphanumerical character and may not be preceeded by 
white space or a character from the set ({[<=+-\@.
Note that =<identifier>= is limited 
to a an
identifier, such as a file name, XML name, etc. Identifiers 
must start and end with an alphanumerical character, while characters 
from the set .-/: may appear internally. Note that this set 
explicitly does not allow for white space in code spans delimited by a 
single =
| *bold* | Typeset text in bold for limited content (see running text). | 
| *|bold|* | Typeset text in bold. Content can be long. | 
| _emphasize_ | Typeset text as emphasize for limited content (see running text). | 
| _|emphasize|_ | Typeset text as emphasize. Content can be long. | 
| =code= | Typeset text fixedfont for identifiers (see running text). | 
| =|code|= | Typeset text fixedfont. Content can be long. | 
| Word | Capitalised words that appear as argument-name are written in Italic | 
Inline code can be realised using the = switch described 
in
section 7.2.1 or the markdown 
backtick. In addition, it can use the mardown/Doxygen backtick 
(`) convention: a string that is delimited by backticks is 
considered code, provided:
Currently, `Var` is typeset as a variable (italics) and other terms are typeset using a fixed-width code font.
In addition, compound terms in canonical notation (i.e.,
functor(,...args...) that can be 
parsed are first verified as a file-specification for
absolute_file_name/3 
and otherwise rendered as code.
Table 2 shows the constructs for creating links.
Images can be included in the documentation by referencing an image 
file using one of the extensions .gif, .png,
.jpeg, .jpg or .svg.5SVG 
images are included using the object element. This is 
supported by many modern browsers. When using IE, one needs at least 
IE9. By default this creates a link to the image file that 
must be visited to see the image. Inline images can be created by 
enclosing the filename in double square brackets. For example
The [[open.png]] icon is used open an existing file.
The markdown alternative for images is also supported, and looks as below. The current implementation only deals with image files, not external resources.
    
A directory index consists of the contents of the file README 
(or README.TXT), followed by a table holding all currently 
loaded source-files that appear below the given directory (i.e. traversal 
is
recursive) and for each file a list of public predicates and 
their descriptive summary. Finally, if a file TODO or TODO.TXT 
exists, its content is added at the end of the directory index.
Sometimes it is desirable to document aspects of a package outside 
the source-files. For this reason the system creates a link to files 
using the extension .txt. The referenced file is processed 
as Wiki source. The two fragments below illustrate the relation between 
an
.pl file and a .txt file.
%! read_setup(+File, -Setup) is det. % % Read application setup information from File. The details % on setup are described in setup.txt.
---+ Application setup data If a file =|.myapprc|= exists in the user's home directory the application will process this data using setup.pl. ...
To support the developer with an up-to-date version of the 
documentation of both the application under development and the system 
libraries the developer can start an HTTP documentation server using the 
command
doc_server(?Port). A good way to deploy PlDoc for program 
development is to write a file called e.g., debug.pl that 
sets up the preferred development environment and loads your program. 
below is an example debug.pl that starts PlDoc and prints 
strings as text before loading the remainder of your program.
:- doc_server(4000). % Start PlDoc at port 4000 :- portray_text(true). % Enable portray of strings :- [load]. % load your program
doc_server(Port, [allow(localhost), workers(1)]). This 
predicate must be called before loading the program for which 
you consult the documentation. It calls doc_collect/1 
to start collecting documentation while (re-)loading your program.///false, do not allow editing, even if the connection 
comes from localhost. Intended together with the root 
option to make pldoc available from behind a reverse proxy. See the HTTP 
package for configuring a Prolog server behind an Apache 
reverse proxy.allow('.uva.nl') 
grants access to all machines in this domain. IP addresses are specified 
using the library(socket) ip/4 
term. I.e. to allow access from the 10.0.0.X domain, specify
allow(ip(10,0,0,_)).allow option.
Access is granted iff
The library library(pldoc/doc_library) defines doc_load_library/0 
to load the entire library.
:- doc_server(4000,
              [ allow('.my.org')
              ]).
:- use_module(library(pldoc/doc_library)).
:- doc_load_library.
Example code can be found in $PLBASE/doc/packages/examples/pldoc.
The documentation system is normally accessed from a web-browser after starting the server using doc_server/1. This section briefly explains the user-interface provided from the browser.
The top-right of the screen provides a search-form. The search string 
typed is searched as a substring and case-insensitive. Multiple strings 
seperated by spaces search for the intersection. Searching for objects 
that do not contain a string is written as -"<string>". 
Here are some examples:
| load file | Searches for all 
objects with the strings loadandfile. | 
| load -file | Searches for 
objects with load, but
withoutfile. | 
| "load file" | Searches for the 
string load file. | 
The two radio-buttons below the search box can be used to limit the search. All searches both the application and manuals. Searching for Summary also implies Name.
The web-browser supports several views, which we briefly summarise here:
README and TODO files is 
given.
/** <module ... */ comment and the public 
predicates with their full documentation. Using the zoom button 
the user can select to view both public and documentated private 
predicates. Using the source button, the system shows the source 
with syntax highlighting as in PceEmacs and formatted structured 
comments.6This mode is still 
incomplete. It would be nice to add line-numbers and links to 
documentation and definitions in the sources.
If the browser is accessed from localhost, each object 
that is related to a known source-location has an edit icon at the right 
side. Clicking this calls edit/1 
on the object, calling the user's default editor in the file. To use the 
built-in PceEmacs editor, either set the Prolog flag editor 
to pce_emacs or run ?- emacs. before clicking 
an edit button.
Prolog source-files have a reload button attached. Clicking this reloads the source file if it was modified and refreshes the page. This supports a comfortable edit-view loop to maintain the source-code documentation.
Create stand-alone documentation from a bundle of source-files. Typical use of the PlDoc package is to run it as a web-server from the project in progress, providing search and guaranteed consistency with the loaded version. Creating stand-alone files as provided by this file can be useful for printing or distribution.
html.
doc.
http://www.swi-prolog.org/pldoc/
index.
loaded (default) only 
documents files loaded into the Prolog image. true 
documents all files.
true, recurse into subdirectories.
copy, copy the CSS file to created directories. Using inline, 
include the CSS file into the created files. Currently, only the default copy 
is supported.
The typical use-case is to document the Prolog files that belong to a 
project in the current directory. To do this load the Prolog files and 
run the goal below. This creates a sub-directory doc with 
an index file index.html. It replicates the directory 
structure of the source directory, creating an HTML file for each Prolog 
file and an index file for each sub-directory. A copy of the required 
CSS and image resources is copied to the doc directory.
?- doc_save(., [recursive(true)]).
doc inside the 
pack. The index page consists of the content of readme or
readme.txt in the main directory of the pack and an index 
of all files and their public predicates.
The LaTeX backend aims at producing quality paper documentation as 
well as integration of predicate description and Wiki files in LaTeX 
documents such as articles and technical reports. It is realised by the 
library doc_latex.pl.
The best practice for using the LaTeX backend is yet to be 
established. For now we anticipate processing a Wiki document saved in a 
.txt file using doc_latex/3 to 
produce either a simple complete LaTeX document or a partial document 
that is included into the the main document using the LaTeX \input 
command. Typically, this is best established by writing a Prolog 
Script that generates the required LaTeX document and call this from 
a Makefile. We give a simple example from PlDoc, creating this 
section from the wiki-file latex.txt below.
:- use_module(library(doc_latex)). :- [my_program].
We generate latex.tex from latex.txt using this Makefile 
fragment:
.SUFFIXES: .txt .tex
.txt.tex:
        swipl -f script.pl \
            -g "doc_latex('$*.txt','$*.tex',[stand_alone(false)]),halt" \
            -t "halt(1)"
High-level access is provided by doc_latex/3, 
while more low level access is provided by the remaining predicates. 
Generated LaTeX depends on the style file pldoc.sty, which 
is a plain copy of pl.sty from the SWI-Prolog manual 
sources. The installation installs
pldoc.sty in the pldoc subdirectory of the 
Prolog manual.
/ Arity// ArityTypically Spec is either a list of filenames or a list of predicate indicators. Defined options are:
true (default), create a document that can be run 
through LaTeX. If false, produce a document to be included 
in another LaTeX document.
true (default), only emit documentation for exported 
predicates.
section.
stand_alone, public_only 
and section_level. See doc_latex/3 
for a description of the options.stand_alone, public_only 
and
section_level. See doc_latex/3 
for a description of the options.\begin{description}...\end{description} 
environment, just a plain list of \predicate, etc. 
statements. The current implementation ignores Options.
Literal programming is an established field. The TeX source is one of the first and best known examples of this approach, where input files are a mixture of TeX and PASCAL source. External tools are used to untangle the common source and process one branch to produce the documentation, while the other is compiled to produce the program.
A program and its documentation consists of various different parts:
Comments can be added through Prolog directives, a route taken by Ciao Prolog with lpdoc Hermenegildo, 2000 and Logtalk Moura, 2003. We feel structured comments are a better alternative for the following reasons:
We are aware that the above problems can be dealt with using syntax-aware editors. Only a few editors are sufficiently powerful to support this correctly though and we do not expect the required advanced modes to be widely available. Using comments we do not need to force users into using a particular editor.
JavaDoc uses HTML as markup inside the structured comments. Although HTML is more widely known than ---for example--- LaTeX or TeXinfo, we think the Wiki approach is sufficiently widely known today. Using text with minimal layout conventions taken largely from plaintext newsnet and E-mail, Wiki input is much easier to read in the source-file than HTML without syntax support from an editor.
Types and modes are not a formal part of the Prolog language. Nevertheless, their role goes beyond pure documentation. The test-system can use information about non-determinism to validate that deterministic calls are indeed deterministic. Type information can be used to analyse coverage from the test-suite, to generate runtime type verification or to perform static type-analysis. We have chosen to use a structured comment with formal syntax for the following reasons:
SWI-Prolog aims at platform independency. We want tools to rely as much as possible on Prolog itself. Therefore, the entire infrastructure is written in Prolog. Output as HTML is suitable for browsing and not very high quality printing on virtually all platforms. Output to LaTeX requires more infrastructure for processing and allows for producing high-quality PDF documents.
Initially, the PlDoc wiki language was based on Twiki. Currently, markdown is a wiki syntax that is widely accepted and not tight to a single system. In PlDoc 2, we have adopted markdown, including many of the limitations and extensions introduced by Docygen. Limitations are needed to avoid ambiguities due to the common use of symbol charaters in programming languages. Extensions are desirable to make use of already existing conventions and support requirements of program documentation.
Some of the changes in PlDoc 2 are to achieve compatibility with the Prolog Commons project. The library documentation conventions of this project will be based on PlDoc and the Ciao lpdoc standards. It is likely that there will be more changes to the PlDoc format to synchronise with Commons. We do not anticipate significant impact on existing documentation.