GROFF_OUT(5) | GROFF_OUT(5) |
As the GNU roff processor groff(1) is a wrapper program around troff that automatically calls a postprocessor, this output does not show up normally. This is why it is called intermediate within the groff system. The groff program provides the option -Z to inhibit postprocessing, such that the produced intermediate output is sent to standard output just like calling troff manually.
In this document, the term troff output describes what is output by the GNU troff program, while intermediate output refers to the language that is accepted by the parser that prepares this output for the postprocessors. This parser is smarter on whitespace and implements obsolete elements for compatibility, otherwise both formats are the same. Both formats can be viewed directly with gxditview(1).
The main purpose of the intermediate output concept is to facilitate the development of postprocessors by providing a common programming interface for all devices. It has a language of its own that is completely different from the groff(7) language. While the groff language is a high-level programming language for text processing, the intermediate output language is a kind of low-level assembler language by specifying all positions on the page for writing and drawing.
The pre-groff roff versions are denoted as classical troff. The intermediate output produced by groff is fairly readable, while classical troff output was hard to understand because of strange habits that are still supported, but not used any longer by GNU troff.
Any sequence of space or tab characters is treated as a single syntactical space. It separates commands and arguments, but is only required when there would occur a clashing between the command code and the arguments without the space. Most often, this happens when variable length command names, arguments, argument lists, or command clusters meet. Commands and arguments with a known, fixed length need not be separated by syntactical space.
A line break is a syntactical element, too. Every command argument can be followed by whitespace, a comment, or a newline character. Thus a syntactical line break is defined to consist of optional syntactical space that is optionally followed by a comment, and a newline character.
The normal commands, those for positioning and text, consist of a single letter taking a fixed number of arguments. For historical reasons, the parser allows to stack such commands on the same line, but fortunately, in groff intermediate output, every command with at least one argument is followed by a line break, thus providing excellent readability.
The other commands — those for drawing and device controlling — have a more complicated structure; some recognize long command names, and some take a variable number of arguments. So all D and x commands were designed to request a syntactical line break after their last argument. Only one command, `x X' has an argument that can stretch over several lines, all other commands must have all of their arguments on the same line as the command, i.e., the arguments may not be splitted by a line break.
Empty lines, i.e., lines containing only space and/or a comment, can occur everywhere. They are just ignored.
Note that single characters can have the eighth bit set, as can the names of fonts and special characters. The names of characters and fonts can be of arbitrary length. A character that is to be printed will always be in the current font.
A string argument is always terminated by the next whitespace character (space, tab, or newline); an embedded # character is regarded as part of the argument, not as the beginning of a comment command. An integer argument is already terminated by the next non-digit character, which then is regarded as the first character of the next argument or command.
The task of the prologue is to set the general device parameters using three exactly specified commands. The groff prologue is guaranteed to consist of the following three lines (in that order):
x T device
with the arguments set as outlined in the section Device Control Commands. But the parser for the intermediate output format is able to swallow additional whitespace and comments as well.
The body is the main section for processing the document data. Syntactically, it is a sequence of any commands different from the ones used in the prologue. Processing is terminated as soon as the first x stop command is encountered; the last line of any groff intermediate output always contains such a command.
Semantically, the body is page oriented. A new page is started by a p command. Positioning, writing, and drawing commands are always done within the current page, so they cannot occur before the first p command. Absolute positioning (by the H and V commands) is done relative to the current page, all other positioning is done relative to the current location within this page.
#anything<end_of_line> | A comment. Ignore any characters from the # character up to the next newline character. |
This command is the only possibility for commenting in the intermediate output. Each comment can be preceded by arbitrary syntactical space; every command can be terminated by a comment.
troff output follows the classical spacing rules (no space between command and subcommand, all arguments are preceded by a single space character), but the parser allows optional space between the command letters and makes the space before the first argument optional. As usual, each space can be any sequence of tab and space characters.
Some graphics commands can take a variable number of arguments. In this case, they are integers representing a size measured in basic units The arguments called h1, stand for horizontal distances where positive means right, negative left. The arguments called v1, stand for vertical distances where positive means down, negative up. All these distances are offsets relative to the current location.
Unless indicated otherwise, each graphics command directly corresponds to a similar groff \D escape sequence; see groff(7).
Unknown D commands are assumed to be device-specific. Its arguments are parsed as strings; the whole information is then sent to the postprocessor.
In the following command reference, the syntax element <line_break> means a syntactical line break as defined in section Separation. As the polygon is closed, the end of drawing is the starting point, so the position doesn't change. This command is a groff extension. No position changing. This command is a groff extension. Set the current line thickness to (an integer in basic units if if select the smallest available line thickness; if set the line thickness proportional to the point size (this is the default before the first Dt command was specified). the argument to the actual horizontal position, while the vertical position is not changed. Although this doesn't make sense it is kept for compatibility. No position changing. This command is a groff extension.
The subcommand is basically a single letter, but to increase readability, it can be written as a word, i.e., an arbitrary sequence of characters terminated by the next tab, space, or newline character. All characters of the subcommand word but the first are simply ignored. For example, troff outputs the initialization command x i as x init and the resolution command x r as x res. But writings like x i_like_groff and x roff_is_groff resp. are accepted as well to mean the same commands.
In the following, the syntax element <line_break> means a syntactical line break as defined in section Separation. Use as the intended name for the current file in error reports. This is useful for remembering the original file name when groff uses an internal piping mechanism. The input file is not changed by this command. This command is a groff extension. Mount font position (a non-negative integer) with font named (a text word), cf. groff_font(5). Set character height to (a positive integer in scaled points Classical troff used the unit points ( instead; see section COMPATIBILITY. Initialize device. This is the third command of the prologue. Parsed but ignored. The classical documentation reads pause device, can be restarted. Resolution is while is the minimal horizontal motion, and the minimal vertical motion possible with this device; all arguments are positive integers in basic units per inch. This is the second command of the prologue. Set slant to degrees (an integer in basic units Terminates the processing of the current file; issued as the last command of any intermediate troff output. Generate trailer information, if any. In groff, this is actually just ignored. Set name of device to word a sequence of characters ended by the next whitespace character. The possible device names coincide with those from the groff -T option. This is the first command of the prologue. Configure underlining of spaces. If is 1, start underlining of spaces; if is 0, stop underlining of spaces. This is needed for the cu request in nroff mode and is ignored otherwise. This command is a groff extension. Send string uninterpreted to the device. If the line following this command starts with a + character this line is interpreted as a continuation line in the following sense. The + is ignored, but a newline character is sent instead to the device, the rest of the line is sent uninterpreted. The same applies to all following lines until the first character of a line is not a + character. This command is generated by the groff escape sequence \X. The line-continuing feature is a groff extension.
Move right | (exactly two decimal digits) basic units then print character |
In groff, arbitrary syntactical space around and within this command is allowed to be added. Only when a preceding command on the same line ends with an argument of variable length a separating space is obligatory. In classical troff, large clusters of these and other commands were used, mostly without spaces; this made such output almost unreadable.
For modern high-resolution devices, this command does not make sense because the width of the characters can become much larger than two decimal digits. In groff, this is only used for the devices X75, X75-12, X100, and X100-12. For other devices, the commands t and u provide a better functionality.
There is a library function that parses the intermediate output and sends the information obtained to the device via methods of a class with a common interface for each device. So a groff postprocessor must only redefine the methods of this class. For details, see the reference in section FILES.
x T ps
x res 72000 1 1
x init
p1
x font 5 TR
f5
s10000
V12000
H72000
thell
wh2500
tw
H96620
torld
n12000 0
x trailer
V792000
x stop
This output can be fed into the postprocessor grops(1) to get its representation as a PostScript file. Low-resolution device latin1
This is similar to the high-resolution device except that the positioning is done at a minor scale. Some comments (lines starting with #) were added for clarification; they were not generated by the formatter.
# prologue
x T latin1
x res 240 24 40
x init
# begin a new page
p1
# font setup
x font 1 R
f1
s10
# initial positioning on the page
V40
H0
# write text `hell'
thell
# inform about a space, and do it by a horizontal jump
wh24
# write text `world'
tworld
# announce line break, but do nothing because ...
n40 0
# ... the end of the document has been reached
x trailer
V2640
x stop
This output can be fed into the postprocessor grotty(1) to get a formatted text document. Classical style output
As a computer monitor has a very low resolution compared to modern printers the intermediate output for the X devices can use the jump-and-write command with its 2-digit displacements.
x T X100
x res 100 1 1
x init
p1
x font 5 TR
f5
s10
V16
H100
# write text with old-style jump-and-write command
ch07e07l03lw06w11o07r05l03dh7
n16 0
x trailer
V1100
x stop
This output can be fed into the postprocessor xditview(1x) or gxditview(1) for displaying in X.
Due to the obsolete jump-and-write command, the text clusters in the classical output are almost unreadable.
The position after a graphic object has been drawn is at its end; for circles and ellipses, the "end" is at the right side.
From this, the positionings specified for the drawing commands above follow quite naturally.
The differences between groff and classical troff are documented in groff_diff(7).
/usr/share/groff_font/devname/DESC | Device description file for device name. |
<groff_source_dir>/src/libs/libdriver/input.cpp | Defines the parser and postprocessor for the intermediate output. It is located relative to the top directory of the groff source tree, e.g. @GROFFSRCDIR@. This parser is the definitive specification of the groff intermediate output format. |
For more details, see man(1).
groff(1) | option -Z and further readings on groff. |
groff(7) | for details of the groff language such as numerical units and escape sequences. |
groff_font(5) | for details on the device scaling parameters of the DESC file. |
troff(1) | generates the device-independent intermediate output. |
roff(7) | for historical aspects and the general structure of roff systems. |
groff_diff(7) | The differences between the intermediate output in groff and classical troff. |
gxditview(1) | Viewer for the intermediate output. |
grodvi(1), grohtml(1), grolbp(1), grolj4(1), grops(1), grotty(1)
For a treatment of all aspects of the groff system within a single document, see the groff info file. It can be read within the integrated help systems, within emacs(1) or from the shell prompt by
The classical troff output language is described in two AT&T Bell Labs CSTR documents available on-line at
[CSTR #97] | A Typesetter-independent TROFF by Brian Kernighan is the original and most concise documentation on the output language; see |
[CSTR #54] | The 1992 revision of the Nroff/:Troff User's Manual by J. F. Osanna and Brian Kernighan isn't as concise as [CSTR #97] regarding the output language; see |
This document is distributed under the terms of the FDL (GNU Free Documentation License) version 1.1 or later. You should have received a copy of the FDL with this package; it is also available on-line at the
This document is part of groff, the GNU roff distribution. It is based on a former version - published under the GPL - that described only parts of the groff extensions of the output language. It has been rewritten 2002 by Bernd Warken and is maintained by
6 February 2006 | Groff Version 1.19.2 |