]> git.cameronkatri.com Git - mandoc.git/blob - TODO
Drop trailing whitespace, adjust a few indentations,
[mandoc.git] / TODO
1 ************************************************************************
2 * Official mandoc TODO.
3 * $Id: TODO,v 1.182 2014/10/10 10:49:38 schwarze Exp $
4 ************************************************************************
5
6 ************************************************************************
7 * crashes
8 ************************************************************************
9
10 - The abort() in bufcat(), html.c, can be triggered via buffmt_includes()
11 by running -Thtml -Oincludes on a file containing a long .In argument.
12 Fixing this will probably require reworking the whole bufcat() concept.
13
14 ************************************************************************
15 * missing features
16 ************************************************************************
17
18 --- missing roff features ----------------------------------------------
19
20 - .ad (adjust margins)
21 .ad l -- adjust left margin only (flush left)
22 .ad r -- adjust right margin only (flush right)
23 .ad c -- center text on line
24 .ad b -- adjust both margins (alias: .ad n)
25 .na -- temporarily disable adjustment without changing the mode
26 .ad -- re-enable adjustment without changing the mode
27 Adjustment mode is ignored while in no-fill mode (.nf).
28
29 - .fc (field control)
30 found by naddy@ in xloadimage(1)
31
32 - .nr third argument (auto-increment step size, requires \n+)
33 found by bentley@ in sbcl(1) Mon, 9 Dec 2013 18:36:57 -0700
34
35 - .ns (no-space mode) occurs in xine-config(1)
36 reported by brad@ Sat, 15 Jan 2011 15:45:23 -0500
37
38 - .ta (tab settings) occurs in ircbug(1) and probably gnats(1)
39 reported by brad@ Sat, 15 Jan 2011 15:50:51 -0500
40 also Tcl_NewStringObj(3) via wiz@ Wed, 5 Mar 2014 22:27:43 +0100
41
42 - .ti (temporary indent)
43 found by naddy@ in xloadimage(1)
44 found by bentley@ in nmh(1) Mon, 23 Apr 2012 13:38:28 -0600
45
46 - .while and .shift
47 found by jca@ in ratpoison(1) Sun, 30 Jun 2013 12:01:09 +0200
48
49 - \c (interrupted text) should prevent the line break
50 even inside .Bd literal; that occurs in chat(8)
51 also found in cclive(1) - DocBook output
52
53 - \h horizontal move
54 found in cclive(1) DocBook output
55 Anthony J. Bentley on discuss@ Sat, 21 Sep 2013 22:29:34 -0600
56
57 - \n+ and \n- numerical register increment and decrement
58 found by bentley@ in sbcl(1) Mon, 9 Dec 2013 18:36:57 -0700
59
60 - \w'' width measurements
61 would not be very useful without an expression parser, see below
62 needed for Tcl_NewStringObj(3) via wiz@ Wed, 5 Mar 2014 22:27:43 +0100
63
64 - using undefined strings or macros defines them to be empty
65 wl@ Mon, 14 Nov 2011 14:37:01 +0000
66
67 --- missing mdoc features ----------------------------------------------
68
69 - fix bad block nesting involving multiple identical explicit blocks
70 see the OpenBSD mdoc_macro.c 1.47 commit message
71
72 - .Bl -column .Xo support is missing
73 ultimate goal:
74 restore .Xr and .Dv to
75 lib/libc/compat-43/sigvec.3
76 lib/libc/gen/signal.3
77 lib/libc/sys/sigaction.2
78
79 - edge case: decide how to deal with blk_full bad nesting, e.g.
80 .Sh .Nm .Bk .Nm .Ek .Sh found by jmc@ in ssh-keygen(1)
81 from jmc@ Wed, 14 Jul 2010 18:10:32 +0100
82
83 - \\ is now implemented correctly
84 * when defining strings and macros using .ds and .de
85 * when parsing roff(7) and man(7) macro arguments
86 It does not yet work in mdoc(7) macro arguments
87 because libmdoc does not yet use mandoc_getarg().
88 Also check what happens in plain text, it must be identical to \e.
89
90 - .Bd -centered implies -filled, not -unfilled, which is not
91 easy to implement; it requires code similar to .ce, which
92 we don't have either.
93 Besides, groff has bug causing text right *before* .Bd -centered
94 to be centered as well.
95
96 - .Bd -filled should not be the same as .Bd -ragged, but align both
97 the left and right margin. In groff, it is implemented in terms
98 of .ad b, which we don't have either. Found in cksum(1).
99
100 - implement blank `Bl -column', such as
101 .Bl -column
102 .It foo Ta bar
103 .El
104
105 - explicitly disallow nested `Bl -column', which would clobber internal
106 flags defined for struct mdoc_macro
107
108 - In .Bl -column .It, the end of the line probably has to be regarded
109 as an implicit .Ta, if there could be one, see the following mildly
110 ugly code from login.conf(5):
111 .Bl -column minpasswordlen program xetcxmotd
112 .It path Ta path Ta value of Dv _PATH_DEFPATH
113 .br
114 Default search path.
115 reported by Michal Mazurek <akfaew at jasminek dot net>
116 via jmc@ Thu, 7 Apr 2011 16:00:53 +0059
117
118 - inside `.Bl -column' phrases, punctuation is handled like normal
119 text, e.g. `.Bl -column .It Fl x . Ta ...' should give "-x -."
120
121 - inside `.Bl -column' phrases, TERMP_IGNDELIM handling by `Pf'
122 is not safe, e.g. `.Bl -column .It Pf a b .' gives "ab."
123 but should give "ab ."
124
125 - set a meaningful default if no `Bl' list type is assigned
126
127 - have a blank `It' head for `Bl -tag' not puke
128
129 - check whether it is correct that `D1' uses INDENT+1;
130 does it need its own constant?
131
132 - prohibit `Nm' from having non-text HEAD children
133 (e.g., NetBSD mDNSShared/dns-sd.1)
134 (mdoc_html.c and mdoc_term.c `Nm' handlers can be slightly simplified)
135
136 - support translated section names
137 e.g. x11/scrotwm scrotwm_es.1:21:2: error: NAME section must be first
138 that one uses NOMBRE because it is spanish...
139 deraadt tends to think that section-dependent macro behaviour
140 is a bad idea in the first place, so this may be irrelevant
141
142 - When there is free text in the SYNOPSIS and that free text contains
143 the .Nm macro, groff somehow understands to treat the .Nm as an in-line
144 macro, while mandoc treats it as a block macro and breaks the line.
145 No idea how the logic for distinguishing in-line and block instances
146 should be, needs investigation.
147 uqs@ Thu, 2 Jun 2011 11:03:51 +0200
148 uqs@ Thu, 2 Jun 2011 11:33:35 +0200
149
150 --- missing man features -----------------------------------------------
151
152 - -T[x]html doesn't stipulate non-collapsing spaces in literal mode
153
154 --- missing tbl features -----------------------------------------------
155
156 - look at the POSIX manuals in the books/man-pages-posix port,
157 they use some unsupported tbl(7) features.
158
159 - investigate tbl(1) errors in sox(1)
160 see also naddy@ Sat, 16 Oct 2010 23:51:57 +0200
161
162 - allow standalone `.' to be interpreted as an end-of-layout
163 delimiter instead of being thrown away as a no-op roff line
164 reported by Yuri Pankov, Wed 18 May 2011 11:34:59 CEST
165
166 --- missing eqn features -----------------------------------------------
167
168 - set, delim, fonts
169
170 - The "size" keyword is parsed, but ignored by the formatter.
171
172 - The spacing characters `~', `^', and tab are currently ignored,
173 see User's Guide (Second Edition) page 2 section 4.
174
175 - Mark and lineup are parsed and ignored,
176 see User's Guide (Second Edition) page 5 section 15.
177
178 --- missing misc features ----------------------------------------------
179
180 - italic correction (\/) in PostScript mode
181 Werner LEMBERG on groff at gnu dot org Sun, 10 Nov 2013 12:47:46
182
183 - When makewhatis(8) encounters a FATAL parse error,
184 it silently treats the file as formatted, which makes no sense
185 at all for paths like man1/foo.1 - and which also contradicts
186 what the manual says at the end of the description.
187 The end result will be ENOENT for file names returned
188 by mansearch() in manpage.file.
189
190 - makewhatis(8) for preformatted pages:
191 parse the section number from the header line
192 and compare to the section number from the directory name
193
194 - Does makewhatis(8) detect missing NAME sections, missing names,
195 and missing descriptions in all the file formats?
196
197 - clean up escape sequence handling, creating three classes:
198 (1) fully implemented, or parsed and ignored without loss of content
199 (2) unimplemented, potentially causing loss of content
200 or serious mangling of formatting (e.g. \n) -> ERROR
201 see textproc/mgdiff(1) for nice examples
202 (3) undefined, just output the character -> perhaps WARNING
203
204 - kettenis wants base roff, ms, and me Fri, 1 Jan 2010 22:13:15 +0100 (CET)
205
206 --- compatibility checks -----------------------------------------------
207
208 - is .Bk implemented correctly in modern groff?
209 sobrado@ Tue, 19 Apr 2011 22:12:55 +0200
210
211 - compare output to Heirloom roff, Solaris roff, and
212 http://repo.or.cz/w/neatroff.git http://litcave.rudi.ir/
213
214 - look at AT&T DWB http://www2.research.att.com/sw/download
215 Carsten Kunze <carsten dot kunze at arcor dot de> has patches
216 Mon, 4 Aug 2014 17:01:28 +0200
217
218 - look at pages generated from reStructeredText, e.g. devel/mercurial hg(1)
219 These are a weird mixture of man(7) and custom autogenerated low-level
220 roff stuff. Figure out to what extent we can cope.
221 For details, see http://docutils.sourceforge.net/rst.html
222 noted by stsp@ Sat, 24 Apr 2010 09:17:55 +0200
223 reminded by nicm@ Mon, 3 May 2010 09:52:41 +0100
224
225 - look at pages generated from ronn(1) github.com/rtomayko/ronn
226 (based on markdown)
227
228 - look at pages generated from Texinfo source by yat2m, e.g. security/gnupg
229 First impression is not that bad.
230
231 - look at pages generated by pandoc; see
232 https://github.com/jgm/pandoc/blob/master/src/Text/Pandoc/Writers/Man.hs
233 porting planned by kili@ Thu, 19 Jun 2014 19:46:28 +0200
234
235 - check compatibility with Plan9:
236 http://swtch.com/usr/local/plan9/tmac/tmac.an
237 http://swtch.com/plan9port/man/man7/man.html
238 "Anthony J. Bentley" <anthonyjbentley@gmail.com> 28 Dec 2010 21:58:40 -0700
239
240 - check compatibility with the man(7) formatter
241 https://raw.githubusercontent.com/rofl0r/hardcore-utils/master/man.c
242
243 ************************************************************************
244 * formatting issues: ugly output
245 ************************************************************************
246
247 - a column list with blank `Ta' cells triggers a spurrious
248 start-with-whitespace printing of a newline
249
250 - In .Bl -column,
251 .It Em Authentication<tab>Key Length
252 ought to render "Key Length" with emphasis, too,
253 see OpenBSD iked.conf(5).
254 reported again Nicolas Joly via wiz@ Wed, 12 Oct 2011 00:20:00 +0200
255
256 - empty phrases in .Bl column produce too few blanks
257 try e.g. .Bl -column It Ta Ta
258 reported by millert Fri, 02 Apr 2010 16:13:46 -0400
259
260 - .%T can have trailing punctuation. Currently, it puts the trailing
261 punctuation into a trailing MDOC_TEXT element inside its own scope.
262 That element should rather be outside its scope, such that the
263 punctuation does not get underlines. This is not trivial to
264 implement because .%T then needs some features of in_line_eoln() -
265 slurp all arguments into one single text element - and one feature
266 of in_line() - put trailing punctuation out of scope.
267 Found in mount_nfs(8) and exports(5), search for "Appendix".
268
269 - Trailing punctuation after .%T triggers EOS spacing, at least
270 outside .Rs (eek!). Simply setting ARGSFL_DELIM for .%T is not
271 the right solution, it sends mandoc into an endless loop.
272 reported by Nicolas Joly Sat, 17 Nov 2012 11:49:54 +0100
273
274 - global variables in the SYNOPSIS of section 3 pages
275 .Vt vs .Vt/.Va vs .Ft/.Va vs .Ft/.Fa ...
276 from kristaps@ Tue, 08 Jun 2010 11:13:32 +0200
277
278 - in enclosures, mandoc sometimes fancies a bogus end of sentence
279 reminded by jmc@ Thu, 23 Sep 2010 18:13:39 +0059
280
281 - formatting /usr/local/man/man1/latex2man.1 with groff and mandoc
282 reveals lots of bugs both in groff and mandoc...
283 reported by bentley@ Wed, 22 May 2013 23:49:30 -0600
284
285 --- PDF issues ---------------------------------------------------------
286
287 - PDF output doesn't use a monospaced font for .Bd -literal
288 Example: "mandoc -Tpdf afterboot.8 > output.pdf && pdfviewer output.pdf".
289 Search the text "Routing tables".
290 Also check what PostScript mode does when fixing this.
291 reported by juanfra@ Wed, 04 Jun 2014 21:44:58 +0200
292
293 --- HTML issues --------------------------------------------------------
294
295 - <dl><dt><dd> formatting is ugly
296 hints are easy to find on the web, e.g.
297 http://stackoverflow.com/questions/1713048/
298 see also matthew@ Fri, 18 Jul 2014 19:25:12 -0700
299
300 - The tables used to render the three-part page headers actually force
301 the width of the <body> to the max-width given for <html>.
302 Not yet sure how to fix that...
303 Observed by an Anonymous Coward on undeadly.org:
304 http://undeadly.org/cgi?action=article&sid=20140925064244&pid=1
305
306 - consider whether <var> can be used for Ar Dv Er Ev Fa Va.
307 from bentley@ Wed, 13 Aug 2014 09:17:55 -0600
308
309 - check https://github.com/trentm/mdocml
310
311 --- eqn issues ---------------------------------------------------------
312
313 - If .EQ follows preceding text, a space should be output between the
314 text and the equation.
315
316 ************************************************************************
317 * formatting issues: gratuitous differences
318 ************************************************************************
319
320 - .Rv (and probably .Ex) print different text if an `Nm' has been named
321 or not (run a manual without `Nm blah' to see this). I'm not sure
322 that this exists in the wild, but it's still an error.
323
324 - In .Bl -bullet, the groff bullet is "+\b+\bo\bo", the mandoc bullet
325 is just "o\bo".
326 see for example OpenBSD ksh(1)
327
328 - In .Bl -enum -width 0n, groff continues one the same line after
329 the number, mandoc breaks the line.
330 mail to kristaps@ Mon, 20 Jul 2009 02:21:39 +0200
331
332 - .Pp between two .It in .Bl -column should produce one,
333 not two blank lines, see e.g. login.conf(5).
334 reported by jmc@ Sun, 17 Apr 2011 14:04:58 +0059
335 reported again by sthen@ Wed, 18 Jan 2012 02:09:39 +0000 (UTC)
336
337 - If the *first* line after .It is .Pp, break the line right after
338 the tag, do not pad with space characters before breaking.
339 See the description of the a, c, and i commands in sed(1).
340
341 - If the first line after .It is .D1, do not assert a blank line
342 in between, see for example tmux(1).
343 reported by nicm@ 13 Jan 2011 00:18:57 +0000
344
345 - Trailing punctuation after .It should trigger EOS spacing.
346 reported by Nicolas Joly Sat, 17 Nov 2012 11:49:54 +0100
347 Probably, this should be fixed somewhere in termp_it_pre(), not sure.
348
349 - .Nx 1.0a
350 should be "NetBSD 1.0A", not "NetBSD 1.0a",
351 see OpenBSD ccdconfig(8).
352
353 - In .Bl -tag, if a tag exceeds the right margin and must be continued
354 on the next line, it must be indented by -width, not width+1;
355 see "rule block|pass" in OpenBSD ifconfig(8).
356
357 - When the -width string contains macros, the macros must be rendered
358 before measuring the width, for example
359 .Bl -tag -width ".Dv message"
360 in magic(5), located in src/usr.bin/file, is the same
361 as -width 7n, not -width 11n.
362 The same applies to .Bl -column column widths;
363 reported again by Nicolas Joly Thu, 1 Mar 2012 13:41:26 +0100 via wiz@ 5 Mar
364 reported again by Franco Fichtner Fri, 27 Sep 2013 21:02:28 +0200
365 An easy partial fix would be to just skip the first word if it starts
366 with a dot, including any following white space, when measuring.
367
368 - The \& zero-width character counts as output.
369 That is, when it is alone on a line between two .Pp,
370 we want three blank lines, not two as in mandoc.
371
372 - Header lines of excessive length:
373 Port OpenBSD man_term.c rev. 1.25 to mdoc_term.c
374 and document it in mdoc(7) and man(7) COMPATIBILITY
375 found while talking to Chris Bennett
376
377 - trailing whitespace must be ignored even when followed by a font escape,
378 see for example
379 makes
380 \fBdig \fR
381 operate in batch mode
382 in dig(1).
383
384 ************************************************************************
385 * warning issues
386 ************************************************************************
387
388 - check that MANDOCERR_BADTAB is thrown in the right cases,
389 i.e. when finding a literal tab character in fill mode,
390 and possibly change the wording of the warning message
391 to refer to fill mode, not literal mode
392 See the mail from Werner LEMBERG on the groff list,
393 Fri, 14 Feb 2014 18:54:42 +0100 (CET)
394
395 - warn about "new sentence, new line"
396
397 - mandoc_special does not really check the escape sequence,
398 but just the overall format
399
400 - integrate mdoclint into mandoc ("end-of-line whitespace" thread)
401 from jmc@ Mon, 13 Jul 2009 17:12:09 +0100
402 from kristaps@ Mon, 13 Jul 2009 18:34:53 +0200
403 from jmc@ Mon, 13 Jul 2009 17:45:37 +0059
404 from kristaps@ Mon, 13 Jul 2009 19:02:03 +0200
405
406 - -Tlint parser errors and warnings to stdout
407 to tech@mdocml, naddy@ Wed, 28 Sep 2011 11:21:46 +0200
408 wait! kristaps@ Sun, 02 Oct 2011 17:12:52 +0200
409
410 - for system errors, use errno/strerror/warn/err
411
412 ************************************************************************
413 * documentation issues
414 ************************************************************************
415
416 - mention hyphenation rules:
417 breaking at letter-letter in text mode (not macro args)
418 proper hyphenation is unimplemented
419
420 - talk about spacing around delimiters
421 to jmc@, kristaps@ Sat, 23 Apr 2011 17:41:27 +0200
422
423 - mark macros as: page structure domain, manual domain, general text domain
424 is this useful?
425
426 - mention /usr/share/misc/mdoc.template in mdoc(7)?
427
428 ************************************************************************
429 * performance issues
430 ************************************************************************
431
432 - Why are we using MAP_SHARED, not MAP_PRIVATE for mmap(2)?
433 How does SQLITE_CONFIG_PAGECACHE actually work? Document it!
434 from kristaps@ Sat, 09 Aug 2014 13:51:36 +0200
435
436 Several areas can be cleaned up to make mandoc even faster. These are
437
438 - improve hashing mechanism for macros (quite important: performance)
439
440 - improve hashing mechanism for characters (not as important)
441
442 - the PDF file is HUGE: this can be reduced by using relative offsets
443
444 - instead of re-initialising the roff predefined-strings set before each
445 parse, create a read-only version the first time and copy it
446
447 ************************************************************************
448 * structural issues
449 ************************************************************************
450
451 - We use the input line number at several places to distinguish
452 same-line from different-line input. That plainly doesn't work
453 with user-defined macros, leading to random breakage.
454
455 - Find better ways to prevent endless loops
456 in roff(7) macro and string expansion.
457
458 - Finish cleanup of date handling.
459 Decide which formats should be recognized where.
460 Update both mdoc(7) and man(7) documentation.
461 Triggered by Tim van der Molen Tue, 22 Feb 2011 20:30:45 +0100
462
463 - Consider creating some views that will make the database more
464 readable from the sqlite3 shell. Consider using them to
465 abstract from the database structure, too.
466 suggested by espie@ Sat, 19 Apr 2014 14:52:57 +0200
467
468 ************************************************************************
469 * CGI issues
470 ************************************************************************
471
472 - Enable HTTP compression by detecting gzip encoding and filtering
473 output through libz.
474 - Sandbox (see OpenSSH).
475 - Enable caching support via HTTP 304 and If-Modified-Since.
476 - Allow for cgi.h to be overridden by CGI environment variables.
477 Otherwise, binary distributions will inherit the compile-time
478 behaviour, which is not optimal.
479 - Have Mac OSX systems automatically disable -static compilation of the
480 CGI: -static isn't supported.
481