1 <?xml version=
"1.0" encoding=
"ISO-8859-1"?>
2 <rss version='
2.0' xmlns:lj='http://www.livejournal.org/rss/lj/
1.0/'
>
4 <title>Petter Reinholdtsen - Entries from November
2023</title>
5 <description>Entries from November
2023</description>
6 <link>https://people.skolelinux.org/pere/blog/
</link>
10 <title>Test framework for DocBook processors / formatters
</title>
11 <link>https://people.skolelinux.org/pere/blog/Test_framework_for_DocBook_processors___formatters.html
</link>
12 <guid isPermaLink=
"true">https://people.skolelinux.org/pere/blog/Test_framework_for_DocBook_processors___formatters.html
</guid>
13 <pubDate>Sun,
5 Nov
2023 13:
00:
00 +
0100</pubDate>
14 <description><p
>All the books I have published so far has been using
15 <a href=
"https://docbook.org/
">DocBook
</a
> somewhere in the process.
16 For the first book, the source format was DocBook, while for every
17 later book it was an intermediate format used as the stepping stone to
18 be able to present the same manuscript in several formats, on paper,
19 as ebook in ePub format, as a HTML page and as a PDF file either for
20 paper production or for Internet consumption. This is made possible
21 with a wide variety of free software tools with DocBook support in
22 Debian. The source format of later books have been docx via rst,
23 Markdown, Filemaker and Asciidoc, and for all of these I was able to
24 generate a suitable DocBook file for further processing using pandoc,
25 a2x and asciidoctor, as well as rendering using
26 <a href=
"https://tracker.debian.org/pkg/xmlto
">xmlto
</a
>,
27 <a href=
"https://tracker.debian.org/pkg/dbtoepub
">dbtoepub
</a
>,
28 <a href=
"https://tracker.debian.org/pkg/dblatex
">dblatex
</a
>,
29 <a href=
"https://tracker.debian.org/pkg/dblatex
">docbook-xsl
</a
> and
30 <a href=
"https://tracker.debian.org/pkg/fop
">fop
</a
>.
</p
>
32 <p
>Most of the books I have published are translated books, with
33 English as the source language. The use of
34 <a href=
"https://tracker.debian.org/pkg/po4a
">po4a
</a
> to
35 handle translations using the gettext PO format has been a blessing,
36 but publishing translated books had triggered the need to ensure the
37 DocBook tools handle relevant languages correctly. For every new
38 language I have published, I had to submit patches dblatex, dbtoepub
39 and docbook-xsl fixing incorrect language and country specific issues
40 in the framework themselves. Typically this has been missing keywords
41 like
'figure
' or sort ordering of index entries. After a while it
42 became tiresome to only discover issues like this by accident, and I
43 decided to write a DocBook
"test framework
" exercising various
44 features of DocBook and allowing me to see all features exercised for
45 a given language. It consist of a set of DocBook files, a version
4
46 book, a version
5 book, a v4 book set, a v4 selection of problematic
47 tables, one v4 testing sidefloat and finally one v4 testing a book of
48 articles. The DocBook files are accompanied with a set of build rules
49 for building PDF using dblatex and docbook-xsl/fop, HTML using xmlto
50 or docbook-xsl and epub using dbtoepub. The result is a set of files
51 visualizing footnotes, indexes, table of content list, figures,
52 formulas and other DocBook features, allowing for a quick review on
53 the completeness of the given locale settings. To build with a
54 different language setting, all one need to do is edit the lang= value
55 in the .xml file to pick a different ISO
639 code value and run
56 'make
'.
</p
>
58 <p
>The
<a href=
"https://codeberg.org/pere/docbook-example/
">test framework
59 source code
</a
> is available from Codeberg, and a generated set of
60 presentations of the various examples is available as Codeberg static
62 <a href=
"https://pere.codeberg.page/docbook-example/
">https://pere.codeberg.page/docbook-example/
</a
>.
63 Using this test framework I have been able to discover and report
64 several bugs and missing features in various tools, and got a lot of
65 them fixed. For example I got Northern Sami keywoards added to both
66 docbook-xsl and dblatex, fixed several typos in Norwegian bokmål and
67 Norwegian Nynorsk, support for non-ascii title IDs added to pandoc,
68 Norwegian index sorting support fixed in xindy and initial Norwegian
69 Bokmål support added to dblatex. Some issues still remains, though.
70 Default index sorting rules are still broken in several tools, so the
71 Norwegian letters æ, ø and å are more often than not sorted properly
72 in the book index.
</p
>
74 <p
>The test framework recently received some more polish, as part of
75 publishing my latest book. This book contained a lot of fairly
76 complex tables, which exposed bugs in some of the tools. This made me
77 add a new test file with various tables, as well as spend some time to
78 brush up the build rules. My goal is for the test framework to
79 exercise all DocBook features to make it easier to see which features
80 work with different processors, and hopefully get them all to support
81 the full set of DocBook features. Feel free to send patches to extend
82 the test set, and test it with your favorite DocBook processor.
83 Please visit these two URLs to learn more:
</p
>
86 <li
><a href=
"https://codeberg.org/pere/docbook-example/
">https://codeberg.org/pere/docbook-example/
</a
></li
>
87 <li
><a href=
"https://pere.codeberg.page/docbook-example/
">https://pere.codeberg.page/docbook-example/
</a
></li
>
90 <p
>If you want to learn more on Docbook and translations, I recommend
91 having a look at the
<a href=
"https://docbook.org/
">the DocBook
93 <a href=
"https://doccookbook.sourceforge.net/html/en/
">the DoCookBook
94 site
<a/
> and my earlier blog post on
95 <a href=
"https://people.skolelinux.org/pere/blog/From_English_wiki_to_translated_PDF_and_epub_via_Docbook.html
">how
96 the Skolelinux project process and translate documentation
</a
>.
</p
>
100 https://github.com/docbook/xslt10-stylesheets/issues/
205 (docbook-xsl: sme support)
101 https://bugs.debian.org/
968437 (xindy: index sorting rules for nb/nn)
102 https://bugs.debian.org/
856123 (pandoc: markdown to docbook with non-english titles)
103 https://bugs.debian.org/
864813 (dblatex: missing nb words)
104 https://bugs.debian.org/
756386 (dblatex: index sorting rules for nb/nn)
105 https://bugs.debian.org/
796871 (dbtoepub: index sorting rules for nb/nn)
106 https://bugs.debian.org/
792616 (dblatex: PDF metadata)
107 https://bugs.debian.org/
686908 (docbook-xsl: index sorting rules for nb/nn)
108 https://sourceforge.net/tracker/?func=detail
&atid=
373747&aid=
3556630&group_id=
21935 (docbook-xsl: nb/nn support)
109 https://bugs.debian.org/
684391 (dblatex: initial nb support)
113 <p
>As usual, if you use Bitcoin and want to show your support of my
114 activities, please send Bitcoin donations to my address
115 <b
><a href=
"bitcoin:
15oWEoG9dUPovwmUL9KWAnYRtNJEkP1u1b
">15oWEoG9dUPovwmUL9KWAnYRtNJEkP1u1b
</a
></b
>.
</p
>