Uploaded by Jan Jakubowski

demo

advertisement
Demonstration of DOCX support in
calibre
This document demonstrates the ability of the calibre DOCX Input plugin to
convert the various typographic features in a Microsoft Word (2007 and newer)
document. Convert this document to a modern ebook format, such as AZW3 for
Kindles or EPUB for other ebook readers, to see it in action.
There is support for images, tables, lists, footnotes, endnotes, links, dropcaps and
various types of text and paragraph level formatting.
To see the DOCX conversion in action, simply add this file to calibre using the
“Add Books” button and then click “Convert”. Set the output format in the top right
corner of the conversion dialog to EPUB or AZW3 and click “OK”.
Demonstration of DOCX support in calibre
This document demonstrates the ability of the calibre DOCX Input plugin to
convert the various typographic features in a Microsoft Word (2007 and newer)
document. Convert this document to a modern ebook format, such as AZW3 for
Kindles or EPUB for other ebook readers, to see it in action.
There is support for images, tables, lists, footnotes, endnotes, links, dropcaps and
various types of text and paragraph level formatting.
To see the DOCX conversion in action, simply add this file to calibre using the
“Add Books” button and then click “Convert”. Set the output format in the top right
corner of the conversion dialog to EPUB or AZW3 and click “OK”.
Text Formatting
Inline formatting
Here, we demonstrate various types of inline text formatting and the use of
embedded fonts.
Here is some bold, italic, bold-italic, underlined and struck out text. Then, we
have a superscript and a subscript. Now we see some red, green and blue text. Some text
with a yellow highlight. Some text in a box. Some text in inverse video.
A paragraph with styled text: subtle emphasis followed by strong text and
intense emphasis. This paragraph uses document wide styles for styling rather than
inline text properties as demonstrated in the previous paragraph — calibre can handle
both with equal ease.
Fun with fonts
This document has embedded the Ubuntu font family. The body text is in the
Ubuntu typeface, here is some text in the Ubuntu Mono typeface, notice how every
letter has the same width, even i and m. Every embedded font will automatically be
embedded in the output ebook during conversion.
Paragraph level formatting
You can do crazy things with paragraphs, if the urge strikes you. For instance this
paragraph is right aligned and has a right border. It has also been given a light gray
background.
For the lovers of poetry amongst you, paragraphs with hanging indents, like this
often come in handy. You can use hanging indents to ensure that a line of poetry
retains its individual identity as a line even when the screen is too narrow to display it
as a single line. Not only does this paragraph have a hanging indent, it is also has an
extra top margin, setting it apart from the preceding paragraph.
Tables
Tables in Word can vary from the extremely
ITEM
NEEDED
simple to the extremely complex. calibre tries to do
Books
1
its best when converting tables. While you may run
Pens
3
into trouble with the occasional table, the vast
Pencils
2
majority of common cases should be converted very
Highlighte
2 colors
well, as demonstrated in this section. Note that for
r
optimum results, when creating tables in Word, you
Scissors
1 pair
should set their widths using percentages, rather
than absolute units. To the left of this paragraph is a floating two column table with a
nice green border and header row.
Now let’s look at a fancier table—one with alternating row colors and partial
borders. This table is stretched out to take 100% of the available width.
City or
Point A
Point B
Point C
Point D
Point E
Town
Point A
Point B
Point C
Point D
Point E
—
87
64
37
93
—
56
32
35
—
91
54
—
43
—
Next, we see a table with special formatting in various locations. Notice how the
formatting for the header row and sub header rows is preserved.
College
New students
Graduating
Change
students
Cedar
University
Oak Institute
Undergraduate
110
103
+7
202
210
-8
Cedar
University
Elm College
Total
Graduate
24
20
+4
43
53
-10
998
908
90
Source: Fictitious data, for illustration purposes only
Next, we have something a little more complex, a nested table, i.e. a table inside
another table. Additionally, the inner table has some of its cells merged. The table is
displayed horizontally centered.
To the left is a table
One
Two
inside a table, with some
Three
Four
cells merged.
We end with a fancy calendar, note how much of the original formatting is
preserved. Note that this table will only display correctly on relatively wide screens. In
general, very wide tables or tables whose cells have fixed width requirements don’t
fare well in ebooks.
December 2007
Su
n
Mo
n
Tu
e
W
ed
Th
Sa
u
Fri
t
1
2
3
4
5
6
7
8
9
10
1
1
1
1
1
1
1
17
6
2
1
8
2
24
3
3
31
0
Structural Elements
1
9
2
5
3
2
0
2
6
4
2
1
2
7
5
2
2
2
8
2
9
Miscellaneous structural elements you can add to your document, like footnotes,
endnotes, dropcaps and the like.
Footnotes & Endnotes
Footnotes1 and endnotesi are automatically recognized and both are converted to
endnotes, with backlinks for maximum ease of use in ebook devices.
Dropcaps
D
rop caps are used to emphasize the leading paragraph at the start of a section. In
Word it is possible to specify how many lines of text a drop-cap should use. Because of
limitations in ebook technology, this is not possible when converting. Instead, the
converted drop cap will use font size and line height to simulate the effect as well as
possible. While not as good as the original, the result is usually tolerable. This
paragraph has a “D” dropcap set to occupy three lines of text with a font size of 58.5
pts. Depending on the screen width and capabilities of the device you view the book
on, this dropcap can look anything from perfect to ugly.
Links
Two kinds of links are possible, those that refer to an external website and those
that refer to locations inside the document itself. Both are supported by calibre. For
example, here is a link pointing to the calibre download page. Then we have a link that
points back to the section on paragraph level formatting in this document.
Table of Contents
There are two approaches that calibre takes when generating a Table of Contents.
The first is if the Word document has a Table of Contents itself. Provided that the
Table of Contents uses hyperlinks, calibre will automatically use it. The levels of the
Table of Contents are identified by their left indent, so if you want the ebook to have
a multi-level Table of Contents, make sure you create a properly indented Table of
Contents in Word.
If no Table of Contents is found in the document, then a table of contents is
automatically generated from the headings in the document. A heading is identified
as something that has the Heading 1 or Heading 2, etc. style applied to it. These
headings are turned into a Table of Contents with Heading 1 being the topmost level,
Heading 2 the second level and so on.
You can see the Table of Contents created by calibre by clicking the Table of
Contents button in whatever viewer you are using to view the converted ebook.
Demonstration of DOCX support in calibre
1
Text Formatting
61
Inline formatting
61
1
In paged media, footnotes are usually displayed at the bottom of the text. However, in ebooks, a
better paradigm is to make them clickable endnotes that the user can browse at her pleasure. This
conversion is handled automatically by calibre.
Fun with fonts 61
Paragraph level formatting
Tables 62
Structural Elements 64
Footnotes & Endnotes 64
Dropcaps 64
Links
64
Table of Contents
Images 66
Lists
67
Bulleted List
67
Numbered List 67
Multi-level Lists 67
Continued Lists 67
61
64
Images
Images can be of three main types. Inline images are images that are part of the
normal text flow, like this image of a green dot . Inline images do not cause breaks
in the text and are usually small in size. The next category of image is a floating image,
one that “floats “ on the page and is surrounded by text. Word supports more types of
floating images than are possible with current ebook technology, so the conversion
maps floating images to simple left and right floats, as you can see with the left and
right arrow images on the sides of this paragraph.
The final type of image is a “block” image, one that becomes a paragraph on its
own and has no text on either side. Below is a centered green dot.
Centered images like this are useful for large pictures that should be a focus of
attention.
Generally, it is not possible to translate the exact positioning of images from a
Word document to an ebook. That is because in Word, image positioning is specified
in absolute units from the page boundaries. There is no analogous technology in
ebooks, so the conversion will usually end up placing the image either centered or
floating close to the point in the text where it was inserted, not necessarily where it
appears on the page in Word.
Lists
All types of lists are supported by the conversion, with the exception of lists that
use fancy bullets, these get converted to regular bullets.
Bulleted List
 One
 Two
Numbered List
1. One, with a very long line to demonstrate that the hanging indent for the
list is working correctly
2. Two
Multi-level Lists
1. One
1.1. Two
1.1.1.
Three
1.1.2.
Four with a very long line to demonstrate that the hanging indent
for the list is working correctly.
1.1.3.
Five
2. Six
A Multi-level list with bullets:
 One
 Two
This bullet uses an image as the bullet item
o Four
 Five
Continued Lists
i.
One
ii.
Two
An interruption in our regularly scheduled listing, for this essential and very
relevant public service announcement.
iii.
We now resume our normal programming
iv. Four
Demonstration of DOCX support in calibre
This document demonstrates the ability of the calibre DOCX Input plugin to
convert the various typographic features in a Microsoft Word (2007 and newer)
document. Convert this document to a modern ebook format, such as AZW3 for
Kindles or EPUB for other ebook readers, to see it in action.
There is support for images, tables, lists, footnotes, endnotes, links, dropcaps and
various types of text and paragraph level formatting.
To see the DOCX conversion in action, simply add this file to calibre using the
“Add Books” button and then click “Convert”. Set the output format in the top right
corner of the conversion dialog to EPUB or AZW3 and click “OK”.
Text Formatting
Inline formatting
Here, we demonstrate various types of inline text formatting and the use of
embedded fonts.
Here is some bold, italic, bold-italic, underlined and struck out text. Then, we
have a superscript and a subscript. Now we see some red, green and blue text. Some text
with a yellow highlight. Some text in a box. Some text in inverse video.
A paragraph with styled text: subtle emphasis followed by strong text and
intense emphasis. This paragraph uses document wide styles for styling rather than
inline text properties as demonstrated in the previous paragraph — calibre can handle
both with equal ease.
Fun with fonts
This document has embedded the Ubuntu font family. The body text is in the
Ubuntu typeface, here is some text in the Ubuntu Mono typeface, notice how every
letter has the same width, even i and m. Every embedded font will automatically be
embedded in the output ebook during conversion.
Paragraph level formatting
You can do crazy things with paragraphs, if the urge strikes you. For instance this
paragraph is right aligned and has a right border. It has also been given a light gray
background.
For the lovers of poetry amongst you, paragraphs with hanging indents, like this
often come in handy. You can use hanging indents to ensure that a line of poetry
retains its individual identity as a line even when the screen is too narrow to display it
as a single line. Not only does this paragraph have a hanging indent, it is also has an
extra top margin, setting it apart from the preceding paragraph.
Tables
Tables in Word can vary from the extremely
ITEM
NEEDED
simple to the extremely complex. calibre tries to do
Books
1
its best when converting tables. While you may run
Pens
3
into trouble with the occasional table, the vast
Pencils
2
majority of common cases should be converted very
Highlighte
2 colors
well, as demonstrated in this section. Note that for
r
optimum results, when creating tables in Word, you
Scissors
1 pair
should set their widths using percentages, rather
than absolute units. To the left of this paragraph is a floating two column table with a
nice green border and header row.
Now let’s look at a fancier table—one with alternating row colors and partial
borders. This table is stretched out to take 100% of the available width.
City or
Point A
Point B
Point C
Point D
Point E
Town
Point A
—
Point B
Point C
Point D
Point E
87
64
37
93
—
56
32
35
—
91
54
—
43
—
Next, we see a table with special formatting in various locations. Notice how the
formatting for the header row and sub header rows is preserved.
College
New students
Graduating
Change
students
Undergraduate
110
Cedar
University
Oak Institute
Cedar
University
Elm College
Total
103
+7
202
Graduate
24
210
-8
20
+4
43
53
-10
998
908
90
Source: Fictitious data, for illustration purposes only
Next, we have something a little more complex, a nested table, i.e. a table inside
another table. Additionally, the inner table has some of its cells merged. The table is
displayed horizontally centered.
To the left is a table
One
Two
inside a table, with some
Three
Four
cells merged.
We end with a fancy calendar, note how much of the original formatting is
preserved. Note that this table will only display correctly on relatively wide screens. In
general, very wide tables or tables whose cells have fixed width requirements don’t
fare well in ebooks.
December 2007
Su
n
Mo
n
Tu
e
W
ed
Th
u
Sa
Fri
t
1
2
3
4
5
6
7
8
9
10
1
1
1
17
6
2
1
8
2
24
3
1
3
1
9
2
5
3
1
4
2
0
2
6
1
5
2
1
2
7
1
2
2
2
8
2
9
31
0
Structural Elements
Miscellaneous structural elements you can add to your document, like footnotes,
endnotes, dropcaps and the like.
Footnotes & Endnotes
Footnotes2 and endnotesii are automatically recognized and both are converted
to endnotes, with backlinks for maximum ease of use in ebook devices.
Dropcaps
D
rop caps are used to emphasize the leading paragraph at the start of a section. In
Word it is possible to specify how many lines of text a drop-cap should use. Because of
limitations in ebook technology, this is not possible when converting. Instead, the
converted drop cap will use font size and line height to simulate the effect as well as
possible. While not as good as the original, the result is usually tolerable. This
paragraph has a “D” dropcap set to occupy three lines of text with a font size of 58.5
pts. Depending on the screen width and capabilities of the device you view the book
on, this dropcap can look anything from perfect to ugly.
Links
Two kinds of links are possible, those that refer to an external website and those
that refer to locations inside the document itself. Both are supported by calibre. For
example, here is a link pointing to the calibre download page. Then we have a link that
points back to the section on paragraph level formatting in this document.
Table of Contents
There are two approaches that calibre takes when generating a Table of Contents.
The first is if the Word document has a Table of Contents itself. Provided that the
Table of Contents uses hyperlinks, calibre will automatically use it. The levels of the
2
In paged media, footnotes are usually displayed at the bottom of the text. However, in ebooks, a
better paradigm is to make them clickable endnotes that the user can browse at her pleasure. This
conversion is handled automatically by calibre.
Table of Contents are identified by their left indent, so if you want the
ebook to have a multi-level Table of Contents, make sure you create a
properly indented Table of Contents in Word.
If no Table of Contents is found in the document, then a table of
contents is automatically generated from the headings in the
document. A heading is identified as something that has the Heading 1 or Heading 2,
etc. style applied to it. These headings are turned into a Table of Contents with
Heading 1 being the topmost level, Heading 2 the second level and so on.
You can see the Table of Contents created by calibre by clicking the Table of
Contents button in whatever viewer you are using to view the converted ebook.
Demonstration of DOCX support in calibre
1
Text Formatting
61
Inline formatting
61
Fun with fonts 61
Paragraph level formatting 61
Tables 62
Structural Elements 64
Footnotes & Endnotes 64
Dropcaps 64
Links
64
Table of Contents
64
Images 66
Lists
67
Bulleted List
67
Numbered List 67
Multi-level Lists 67
Continued Lists 67
Images
Images can be of three main types. Inline images are images that are part of the
normal text flow, like this image of a green dot . Inline images do not cause breaks
in the text and are usually small in size. The next category of image is a floating image,
one that “floats “ on the page and is surrounded by text. Word supports more types of
floating images than are possible with current ebook technology, so the conversion
maps floating images to simple left and right floats, as you can see with the left and
right arrow images on the sides of this paragraph.
The final type of image is a “block” image, one that becomes a paragraph on its
own and has no text on either side. Below is a centered green dot.
Centered images like this are useful for large pictures that should
be a focus of attention.
Generally, it is not possible to translate the exact positioning of
images from a Word document to an ebook. That is because in Word,
image positioning is specified in absolute units from the page boundaries. There is no
analogous technology in ebooks, so the conversion will usually end up placing the
image either centered or floating close to the point in the text where it was inserted,
not necessarily where it appears on the page in Word.
Lists
All types of lists are supported by the conversion, with the exception of lists that
use fancy bullets, these get converted to regular bullets.
Bulleted List
 One
 Two
Numbered List
3. One, with a very long line to demonstrate that the hanging indent for the
list is working correctly
4. Two
Multi-level Lists
3. One
3.1. Two
3.1.1.
Three
3.1.2.
Four with a very long line to demonstrate that the hanging indent
for the list is working correctly.
3.1.3.
Five
4. Six
A Multi-level list with bullets:
 One
 Two
This bullet uses an image as the bullet item
o Four
 Five
Continued Lists
v. One
vi. Two
An interruption in our regularly scheduled listing, for this essential and very
relevant public service announcement.
vii. We now resume our normal programming
viii.
Four
Demonstration of DOCX support in calibre
This document demonstrates the ability of the calibre DOCX Input plugin to
convert the various typographic features in a Microsoft Word (2007 and newer)
document. Convert this document to a modern ebook format, such as AZW3 for
Kindles or EPUB for other ebook readers, to see it in action.
There is support for images, tables, lists, footnotes, endnotes, links, dropcaps and
various types of text and paragraph level formatting.
To see the DOCX conversion in action, simply add this file to calibre using the
“Add Books” button and then click “Convert”. Set the output format in the top right
corner of the conversion dialog to EPUB or AZW3 and click “OK”.
Text Formatting
Inline formatting
Here, we demonstrate various types of inline text formatting and the use of
embedded fonts.
Here is some bold, italic, bold-italic, underlined and struck out text. Then, we
have a superscript and a subscript. Now we see some red, green and blue text. Some text
with a yellow highlight. Some text in a box. Some text in inverse video.
A paragraph with styled text: subtle emphasis followed by strong text and
intense emphasis. This paragraph uses document wide styles for styling rather than
inline text properties as demonstrated in the previous paragraph — calibre can handle
both with equal ease.
Fun with fonts
This document has embedded the Ubuntu font family. The body text is in the
Ubuntu typeface, here is some text in the Ubuntu Mono typeface, notice how every
letter has the same width, even i and m. Every embedded font will automatically be
embedded in the output ebook during conversion.
Paragraph level formatting
You can do crazy things with paragraphs, if the urge strikes you. For instance this
paragraph is right aligned and has a right border. It has also been given a light gray
background.
For the lovers of poetry amongst you, paragraphs with hanging indents, like this
often come in handy. You can use hanging indents to ensure that a line of poetry
retains its individual identity as a line even when the screen is too narrow to display it
as a single line. Not only does this paragraph have a hanging indent, it is also has an
extra top margin, setting it apart from the preceding paragraph.
Tables
ITEM
NEEDED
Tables in Word can vary from the extremely
simple to the extremely complex. calibre tries to do
its best when converting tables. While you may run
into trouble with the occasional table, the vast
majority of common cases should be converted very
r
well, as demonstrated in this section. Note that for
Scissors
1 pair
optimum results, when creating tables in Word, you
should set their widths using percentages, rather than absolute units. To the left of
this paragraph is a floating two column table with a nice green border and header
row.
Now let’s look at a fancier table—one with alternating row colors and partial
borders. This table is stretched out to take 100% of the available width.
City or
Point A
Point B
Point C
Point D
Point E
Town
Books
Pens
Pencils
Highlighte
Point A
Point B
Point C
Point D
Point E
1
3
2
2 colors
—
87
64
37
93
—
56
32
35
—
91
54
—
43
—
Next, we see a table with special formatting in various locations. Notice how the
formatting for the header row and sub header rows is preserved.
College
New students
Graduating
Change
students
Cedar
University
Oak Institute
Cedar
University
Elm College
Undergraduate
110
103
+7
202
Graduate
24
210
-8
20
+4
43
53
-10
Total
998
908
90
Source: Fictitious data, for illustration purposes only
Next, we have something a little more complex, a nested table, i.e. a table inside
another table. Additionally, the inner table has some of its cells merged. The table is
displayed horizontally centered.
One
Three
To the left is a table
inside a table, with some
cells merged.
Two
Four
We end with a fancy calendar, note how much of the original formatting is
preserved. Note that this table will only display correctly on relatively wide screens. In
general, very wide tables or tables whose cells have fixed width requirements don’t
fare well in ebooks.
December 2007
Su
n
Mo
n
Tu
e
W
ed
Th
Sa
u
Fri
t
1
2
3
4
5
6
7
8
9
10
1
1
1
1
1
1
1
17
6
2
1
8
2
24
3
3
1
9
2
5
3
2
0
2
6
4
2
1
2
7
5
2
2
2
8
2
9
31
0
Structural Elements
Miscellaneous structural elements you can add to your document, like footnotes,
endnotes, dropcaps and the like.
Footnotes & Endnotes
Footnotes3 and endnotesiii are automatically recognized and both are converted
to endnotes, with backlinks for maximum ease of use in ebook devices.
Dropcaps
D
3
In paged media, footnotes are usually displayed at the bottom of the text. However, in ebooks, a
better paradigm is to make them clickable endnotes that the user can browse at her pleasure. This
conversion is handled automatically by calibre.
rop caps are used to emphasize the leading paragraph at the start of a section. In
Word it is possible to specify how many lines of text a drop-cap should use. Because of
limitations in ebook technology, this is not possible when converting. Instead, the
converted drop cap will use font size and line height to simulate the effect as well as
possible. While not as good as the original, the result is usually tolerable. This
paragraph has a “D” dropcap set to occupy three lines of text with a font size of 58.5
pts. Depending on the screen width and capabilities of the device you view the book
on, this dropcap can look anything from perfect to ugly.
Links
Two kinds of links are possible, those that refer to an external website and those
that refer to locations inside the document itself. Both are supported by calibre. For
example, here is a link pointing to the calibre download page. Then we have a link that
points back to the section on paragraph level formatting in this document.
Table of Contents
There are two approaches that calibre takes when generating a Table of Contents.
The first is if the Word document has a Table of Contents itself. Provided that the
Table of Contents uses hyperlinks, calibre will automatically use it. The levels of the
Table of Contents are identified by their left indent, so if you want the ebook to have
a multi-level Table of Contents, make sure you create a properly indented Table of
Contents in Word.
If no Table of Contents is found in the document, then a table of contents is
automatically generated from the headings in the document. A heading is identified
as something that has the Heading 1 or Heading 2, etc. style applied to it. These
headings are turned into a Table of Contents with Heading 1 being the topmost level,
Heading 2 the second level and so on.
You can see the Table of Contents created by calibre by clicking the Table of
Contents button in whatever viewer you are using to view the converted ebook.
Demonstration of DOCX support in calibre
1
Text Formatting
61
Inline formatting
61
Fun with fonts 61
Paragraph level formatting 61
Tables 62
Structural Elements 64
Footnotes & Endnotes 64
Dropcaps 64
Links
64
Table of Contents
64
Images 66
Lists
67
Bulleted List
Numbered List
Multi-level Lists
Continued Lists
67
67
67
67
Images
Images can be of three main types. Inline images are images that are part of the
normal text flow, like this image of a green dot . Inline images do not cause breaks
in the text and are usually small in size. The next category of image is a floating image,
one that “floats “ on the page and is surrounded by text. Word supports more types of
floating images than are possible with current ebook technology, so the conversion
maps floating images to simple left and right floats, as you can see with the left and
right arrow images on the sides of this paragraph.
The final type of image is a “block” image, one that becomes a paragraph on its
own and has no text on either side. Below is a centered green dot.
Centered images like this are useful for large pictures that should be a focus of
attention.
Generally, it is not possible to translate the exact positioning of images from a
Word document to an ebook. That is because in Word, image positioning is specified
in absolute units from the page boundaries. There is no analogous technology in
ebooks, so the conversion will usually end up placing the image either centered or
floating close to the point in the text where it was inserted, not necessarily where it
appears on the page in Word.
Lists
All types of lists are supported by the conversion, with the exception of lists that
use fancy bullets, these get converted to regular bullets.
Bulleted List
 One
 Two
Numbered List
5. One, with a very long line to demonstrate that the hanging indent for the
list is working correctly
6. Two
Multi-level Lists
5. One
5.1. Two
5.1.1.
Three
5.1.2.
Four with a very long line to demonstrate that the hanging indent
for the list is working correctly.
5.1.3.
Five
6. Six
A Multi-level list with bullets:
 One
 Two
This bullet uses an image as the bullet item
o Four
 Five
Continued Lists
ix.
One
x.
Two
An interruption in our regularly scheduled listing, for this essential and very
relevant public service announcement.
xi.
We now resume our normal programming
xii.
Four
Demonstration of DOCX support in calibre
This document demonstrates the ability of the calibre DOCX Input plugin to
convert the various typographic features in a Microsoft Word (2007 and newer)
document. Convert this document to a modern ebook format, such as AZW3 for
Kindles or EPUB for other ebook readers, to see it in action.
There is support for images, tables, lists, footnotes, endnotes, links, dropcaps and
various types of text and paragraph level formatting.
To see the DOCX conversion in action, simply add this file to calibre using the
“Add Books” button and then click “Convert”. Set the output format in the top right
corner of the conversion dialog to EPUB or AZW3 and click “OK”.
Text Formatting
Inline formatting
Here, we demonstrate various types of inline text formatting and the use of
embedded fonts.
Here is some bold, italic, bold-italic, underlined and struck out text. Then, we
have a superscript and a subscript. Now we see some red, green and blue text. Some text
with a yellow highlight. Some text in a box. Some text in inverse video.
A paragraph with styled text: subtle emphasis followed by strong text and
intense emphasis. This paragraph uses document wide styles for styling rather than
inline text properties as demonstrated in the previous paragraph — calibre can handle
both with equal ease.
Fun with fonts
This document has embedded the Ubuntu font family. The body text is in the
Ubuntu typeface, here is some text in the Ubuntu Mono typeface, notice how every
letter has the same width, even i and m. Every embedded font will automatically be
embedded in the output ebook during conversion.
Paragraph level formatting
You can do crazy things with paragraphs, if the urge strikes you. For instance this
paragraph is right aligned and has a right border. It has also been given a light gray
background.
For the lovers of poetry amongst you, paragraphs with hanging indents, like this
often come in handy. You can use hanging indents to ensure that a line of poetry
retains its individual identity as a line even when the screen is too narrow to display it
as a single line. Not only does this paragraph have a hanging indent, it is also has an
extra top margin, setting it apart from the preceding paragraph.
Tables
Tables in Word can vary from the extremely
ITEM
NEEDED
simple to the extremely complex. calibre tries to do
Books
1
its best when converting tables. While you may run
Pens
3
into trouble with the occasional table, the vast
Pencils
2
majority of common cases should be converted very
Highlighte
2 colors
well, as demonstrated in this section. Note that for
r
optimum results, when creating tables in Word, you
Scissors
1 pair
should set their widths using percentages, rather
than absolute units. To the left of this paragraph is a floating two column table with a
nice green border and header row.
Now let’s look at a fancier table—one with alternating row colors and partial
borders. This table is stretched out to take 100% of the available width.
City or
Point A
Point B
Point C
Point D
Point E
Town
Point A
Point B
Point C
Point D
Point E
—
87
64
37
93
—
56
32
35
—
91
54
—
43
—
Next, we see a table with special formatting in various locations. Notice how the
formatting for the header row and sub header rows is preserved.
College
New students
Graduating
Change
students
Undergraduate
110
Cedar
University
Oak Institute
Cedar
University
Elm College
Total
103
+7
202
Graduate
24
210
-8
20
+4
43
998
53
908
-10
90
Source: Fictitious data, for illustration purposes only
Next, we have something a little more complex, a nested table, i.e. a table inside
another table. Additionally, the inner table has some of its cells merged. The table is
displayed horizontally centered.
To the left is a table
One
Two
inside a table, with some
Three
Four
cells merged.
We end with a fancy calendar, note how much of the original formatting is
preserved. Note that this table will only display correctly on relatively wide screens. In
general, very wide tables or tables whose cells have fixed width requirements don’t
fare well in ebooks.
December 2007
Su
n
Mo
n
Tu
e
W
ed
Th
Sa
u
Fri
t
1
2
3
4
5
6
7
8
9
10
1
1
1
1
1
1
1
6
17
2
1
8
3
1
9
4
2
0
5
2
1
2
2
2
3
24
2
5
2
6
2
7
2
8
2
9
3
31
0
Structural Elements
Miscellaneous structural elements you can add to your document, like footnotes,
endnotes, dropcaps and the like.
Footnotes & Endnotes
Footnotes4 and endnotesiv are automatically recognized and both are converted
to endnotes, with backlinks for maximum ease of use in ebook devices.
Dropcaps
D
rop caps are used to emphasize the leading paragraph at the start of a section. In
Word it is possible to specify how many lines of text a drop-cap should use. Because of
limitations in ebook technology, this is not possible when converting. Instead, the
converted drop cap will use font size and line height to simulate the effect as well as
possible. While not as good as the original, the result is usually tolerable. This
paragraph has a “D” dropcap set to occupy three lines of text with a font size of 58.5
pts. Depending on the screen width and capabilities of the device you view the book
on, this dropcap can look anything from perfect to ugly.
Links
Two kinds of links are possible, those that refer to an external website and those
that refer to locations inside the document itself. Both are supported by calibre. For
example, here is a link pointing to the calibre download page. Then we have a link that
points back to the section on paragraph level formatting in this document.
Table of Contents
There are two approaches that calibre takes when generating a Table of Contents.
The first is if the Word document has a Table of Contents itself. Provided that the
Table of Contents uses hyperlinks, calibre will automatically use it. The levels of the
Table of Contents are identified by their left indent, so if you want the ebook to have
a multi-level Table of Contents, make sure you create a properly indented Table of
Contents in Word.
If no Table of Contents is found in the document, then a table of contents is
automatically generated from the headings in the document. A heading is identified
as something that has the Heading 1 or Heading 2, etc. style applied to it. These
4
In paged media, footnotes are usually displayed at the bottom of the text. However, in ebooks, a
better paradigm is to make them clickable endnotes that the user can browse at her pleasure. This
conversion is handled automatically by calibre.
headings are turned into a Table of Contents with
Heading 1 being the topmost level, Heading 2 the
second level and so on.
You can see the Table of Contents created by
calibre by clicking the Table of Contents button in
whatever viewer you are using to view the converted ebook.
Demonstration of DOCX support in calibre
1
Text Formatting
61
Inline formatting
61
Fun with fonts 61
Paragraph level formatting 61
Tables 62
Structural Elements 64
Footnotes & Endnotes 64
Dropcaps 64
Links
64
Table of Contents
64
Images 66
Lists
67
Bulleted List
67
Numbered List 67
Multi-level Lists 67
Continued Lists 67
Images
Images can be of three main types. Inline images are images that are part of the
normal text flow, like this image of a green dot . Inline images do not cause breaks
in the text and are usually small in size. The next category of image is a floating image,
one that “floats “ on the page and is surrounded by text. Word supports more types of
floating images than are possible with current ebook technology, so the conversion
maps floating images to simple left and right floats, as you can see with the left and
right arrow images on the sides of this paragraph.
The final type of image is a “block” image, one that becomes a paragraph on its
own and has no text on either side. Below is a centered green dot.
Centered images like this are useful for large pictures that should be a focus of
attention.
Generally, it is not possible to translate the exact positioning of images from a
Word document to an ebook. That is because in Word, image positioning is specified
in absolute units from the page boundaries. There is no analogous technology in
ebooks, so the conversion will usually end up placing the image either centered or
floating close to the point in the text where it was inserted, not necessarily where it
appears on the page in Word.
Lists
All types of lists are supported by the conversion, with the exception of lists that
use fancy bullets, these get converted to regular bullets.
Bulleted List
 One
 Two
Numbered List
7. One, with a very long line to demonstrate that the hanging indent for the
list is working correctly
8. Two
Multi-level Lists
7. One
7.1. Two
7.1.1.
Three
7.1.2.
Four with a very long line to demonstrate that the hanging indent
for the list is working correctly.
7.1.3.
Five
8. Six
A Multi-level list with bullets:
 One
 Two
This bullet uses an image as the bullet item
o Four
 Five
Continued Lists
xiii.
One
xiv. Two
An interruption in our regularly scheduled listing, for this essential and very
relevant public service announcement.
xv. We now resume our normal programming
xvi. Four
Demonstration of DOCX support in calibre
This document demonstrates the ability of the calibre DOCX Input plugin to
convert the various typographic features in a Microsoft Word (2007 and newer)
document. Convert this document to a modern ebook format, such as AZW3 for
Kindles or EPUB for other ebook readers, to see it in action.
There is support for images, tables, lists, footnotes, endnotes, links, dropcaps and
various types of text and paragraph level formatting.
To see the DOCX conversion in action, simply add this file to calibre using the
“Add Books” button and then click “Convert”. Set the output format in the top right
corner of the conversion dialog to EPUB or AZW3 and click “OK”.
Text Formatting
Inline formatting
Here, we demonstrate various types of inline text formatting and the use of
embedded fonts.
Here is some bold, italic, bold-italic, underlined and struck out text. Then, we
have a superscript and a subscript. Now we see some red, green and blue text. Some text
with a yellow highlight. Some text in a box. Some text in inverse video.
A paragraph with styled text: subtle emphasis followed by strong text and
intense emphasis. This paragraph uses document wide styles for styling rather than
inline text properties as demonstrated in the previous paragraph — calibre can handle
both with equal ease.
Fun with fonts
This document has embedded the Ubuntu font family. The body text is in the
Ubuntu typeface, here is some text in the Ubuntu Mono typeface, notice how every
letter has the same width, even i and m. Every embedded font will automatically be
embedded in the output ebook during conversion.
Paragraph level formatting
You can do crazy things with paragraphs, if the urge strikes you. For instance this
paragraph is right aligned and has a right border. It has also been given a light gray
background.
For the lovers of poetry amongst you, paragraphs with hanging indents, like this
often come in handy. You can use hanging indents to ensure that a line of poetry
retains its individual identity as a line even when the screen is too narrow to display it
as a single line. Not only does this paragraph have a hanging indent, it is also has an
extra top margin, setting it apart from the preceding paragraph.
Tables
ITEM
NEEDED
Books
1
Pens
3
Pencils
2
Highlighte
2 colors
r
Tables in Word can vary from the extremely
simple to the extremely complex. calibre tries to do
its best when converting tables. While you may run into trouble with the occasional
table, the vast majority of common cases should be converted very well, as
demonstrated in this section. Note that for optimum results, when creating tables in
Word, you should set their widths using percentages, rather than absolute units. To
the left of this paragraph is a floating two column table with a nice green border and
header row.
Now let’s look at a fancier table—one with alternating row colors and partial
borders. This table is stretched out to take 100% of the available width.
City or
Point A
Point B
Point C
Point D
Point E
Town
Scissors
Point A
Point B
Point C
Point D
Point E
1 pair
—
87
64
37
93
—
56
32
35
—
91
54
—
43
—
Next, we see a table with special formatting in various locations. Notice how the
formatting for the header row and sub header rows is preserved.
College
New students
Graduating
Change
students
Cedar
University
Oak Institute
Cedar
University
Elm College
Total
Undergraduate
110
103
+7
202
Graduate
24
210
-8
20
+4
43
53
-10
998
908
90
Source: Fictitious data, for illustration purposes only
Next, we have something a little more complex, a nested table, i.e. a table inside
another table. Additionally, the inner table has some of its cells merged. The table is
displayed horizontally centered.
To the left is a table
One
Two
inside a table, with some
Three
Four
cells merged.
We end with a fancy calendar, note how much of the original formatting is
preserved. Note that this table will only display correctly on relatively wide screens. In
general, very wide tables or tables whose cells have fixed width requirements don’t
fare well in ebooks.
December 2007
Su
n
Mo
n
Tu
e
W
ed
Th
Sa
u
Fri
t
1
2
3
4
5
6
7
8
9
10
1
1
1
1
1
1
1
17
6
2
1
8
2
24
3
3
1
9
2
5
3
2
0
2
6
4
2
1
2
7
5
2
2
2
8
2
9
31
0
Structural Elements
Miscellaneous structural elements you can add to your document, like footnotes,
endnotes, dropcaps and the like.
Footnotes & Endnotes
Footnotes5 and endnotesv are automatically recognized and both are converted
to endnotes, with backlinks for maximum ease of use in ebook devices.
Dropcaps
D
rop caps are used to emphasize the leading paragraph at the start of a section. In
Word it is possible to specify how many lines of text a drop-cap should use. Because of
limitations in ebook technology, this is not possible when converting. Instead, the
converted drop cap will use font size and line height to simulate the effect as well as
5
In paged media, footnotes are usually displayed at the bottom of the text. However, in ebooks, a
better paradigm is to make them clickable endnotes that the user can browse at her pleasure. This
conversion is handled automatically by calibre.
possible. While not as good as the original, the result is usually tolerable. This
paragraph has a “D” dropcap set to occupy three lines of text with a font size of 58.5
pts. Depending on the screen width and capabilities of the device you view the book
on, this dropcap can look anything from perfect to ugly.
Links
Two kinds of links are possible, those that refer to an external website and those
that refer to locations inside the document itself. Both are supported by calibre. For
example, here is a link pointing to the calibre download page. Then we have a link that
points back to the section on paragraph level formatting in this document.
Table of Contents
There are two approaches that calibre takes when generating a Table of Contents.
The first is if the Word document has a Table of Contents itself. Provided that the
Table of Contents uses hyperlinks, calibre will automatically use it. The levels of the
Table of Contents are identified by their left indent, so if you want the ebook to have
a multi-level Table of Contents, make sure you create a properly indented Table of
Contents in Word.
If no Table of Contents is found in the document, then a table of contents is
automatically generated from the headings in the document. A heading is identified
as something that has the Heading 1 or Heading 2, etc. style applied to it. These
headings are turned into a Table of Contents with Heading 1 being the topmost level,
Heading 2 the second level and so on.
You can see the Table of Contents created by calibre by clicking the Table of
Contents button in whatever viewer you are using to view the converted ebook.
Demonstration of DOCX support in calibre
1
Text Formatting
61
Inline formatting
61
Fun with fonts 61
Paragraph level formatting 61
Tables 62
Structural Elements 64
Footnotes & Endnotes 64
Dropcaps 64
Links
64
Table of Contents
64
Images 66
Lists
67
Bulleted List
67
Numbered List 67
Multi-level Lists 67
Continued Lists 67
Images
Images can be of three main types. Inline images
are images that are part of the normal text flow, like
this image of a green dot . Inline images do not cause breaks in the text and are
usually small in size. The next category of image is a floating image, one that “floats “
on the page and is surrounded by text. Word supports more types of floating images
than are possible with current ebook technology, so the conversion maps floating
images to simple left and right floats, as you can see with the left and right arrow
images on the sides of this paragraph.
The final type of image is a “block” image, one that becomes a paragraph on its
own and has no text on either side. Below is a centered green dot.
Centered images like this are useful for large pictures that should be a focus of
attention.
Generally, it is not possible to translate the exact positioning of images from a
Word document to an ebook. That is because in Word, image positioning is specified
in absolute units from the page boundaries. There is no analogous technology in
ebooks, so the conversion will usually end up placing the image either centered or
floating close to the point in the text where it was inserted, not necessarily where it
appears on the page in Word.
Lists
All types of lists are supported by the conversion, with the exception of lists that
use fancy bullets, these get converted to regular bullets.
Bulleted List
 One
 Two
Numbered List
9. One, with a very long line to demonstrate that the hanging indent for the
list is working correctly
10. Two
Multi-level Lists
9. One
9.1. Two
9.1.1.
Three
9.1.2.
Four with a very long line to demonstrate that the hanging indent
for the list is working correctly.
9.1.3.
Five
10. Six
A Multi-level list with bullets:
 One
 Two
This bullet uses an image as the bullet item
o Four
 Five
Continued Lists
xvii. One
xviii.
Two
An interruption in our regularly scheduled listing, for this essential and very
relevant public service announcement.
xix.
We now resume our normal programming
xx.
Four
Demonstration of DOCX support in calibre
This document demonstrates the ability of the calibre DOCX Input plugin to
convert the various typographic features in a Microsoft Word (2007 and newer)
document. Convert this document to a modern ebook format, such as AZW3 for
Kindles or EPUB for other ebook readers, to see it in action.
There is support for images, tables, lists, footnotes, endnotes, links, dropcaps and
various types of text and paragraph level formatting.
To see the DOCX conversion in action, simply add this file to calibre using the
“Add Books” button and then click “Convert”. Set the output format in the top right
corner of the conversion dialog to EPUB or AZW3 and click “OK”.
Text Formatting
Inline formatting
Here, we demonstrate various types of inline text formatting and the use of
embedded fonts.
Here is some bold, italic, bold-italic, underlined and struck out text. Then, we
have a superscript and a subscript. Now we see some red, green and blue text. Some text
with a yellow highlight. Some text in a box. Some text in inverse video.
A paragraph with styled text: subtle emphasis followed by strong text and
intense emphasis. This paragraph uses document wide styles for styling rather than
inline text properties as demonstrated in the previous paragraph — calibre can handle
both with equal ease.
Fun with fonts
This document has embedded the Ubuntu font family. The body text is in the
Ubuntu typeface, here is some text in the Ubuntu Mono typeface, notice how every
letter has the same width, even i and m. Every embedded font will automatically be
embedded in the output ebook during conversion.
Paragraph level formatting
You can do crazy things with paragraphs, if the urge strikes you. For instance this
paragraph is right aligned and has a right border. It has also been given a light gray
background.
For the lovers of poetry amongst you, paragraphs with hanging indents, like this
often come in handy. You can use hanging indents to ensure that a line of poetry
retains its individual identity as a line even when the screen is too narrow to display it
as a single line. Not only does this paragraph have a hanging indent, it is also has an
extra top margin, setting it apart from the preceding paragraph.
Tables
Tables in Word can vary from the extremely
ITEM
NEEDED
simple to the extremely complex. calibre tries to do
Books
1
its best when converting tables. While you may run
Pens
3
into trouble with the occasional table, the vast
Pencils
2
majority of common cases should be converted very
Highlighte
2 colors
well, as demonstrated in this section. Note that for
r
optimum results, when creating tables in Word, you
Scissors
1 pair
should set their widths using percentages, rather
than absolute units. To the left of this paragraph is a floating two column table with a
nice green border and header row.
Now let’s look at a fancier table—one with alternating row colors and partial
borders. This table is stretched out to take 100% of the available width.
City or
Point A
Point B
Point C
Point D
Point E
Town
Point A
Point B
Point C
Point D
Point E
—
87
64
37
93
—
56
32
35
—
91
54
—
43
—
Next, we see a table with special formatting in various locations. Notice how the
formatting for the header row and sub header rows is preserved.
College
New students
Graduating
Change
students
Cedar
University
Oak Institute
Undergraduate
110
103
+7
202
210
-8
Cedar
University
Elm College
Total
Graduate
24
20
+4
43
53
-10
998
908
90
Source: Fictitious data, for illustration purposes only
Next, we have something a little more complex, a nested table, i.e. a table inside
another table. Additionally, the inner table has some of its cells merged. The table is
displayed horizontally centered.
To the left is a table
One
Two
inside a table, with some
Three
Four
cells merged.
We end with a fancy calendar, note how much of the original formatting is
preserved. Note that this table will only display correctly on relatively wide screens. In
general, very wide tables or tables whose cells have fixed width requirements don’t
fare well in ebooks.
December 2007
Su
n
Mo
n
Tu
e
W
ed
Th
Sa
u
Fri
t
1
2
3
4
5
6
7
8
9
10
1
1
1
1
1
1
1
17
6
2
1
8
2
24
3
3
31
0
Structural Elements
1
9
2
5
3
2
0
2
6
4
2
1
2
7
5
2
2
2
8
2
9
Miscellaneous structural elements you can add to your document, like footnotes,
endnotes, dropcaps and the like.
Footnotes & Endnotes
Footnotes6 and endnotesvi are automatically recognized and both are converted
to endnotes, with backlinks for maximum ease of use in ebook devices.
Dropcaps
D
rop caps are used to emphasize the leading paragraph at the start of a section. In
Word it is possible to specify how many lines of text a drop-cap should use. Because of
limitations in ebook technology, this is not possible when converting. Instead, the
converted drop cap will use font size and line height to simulate the effect as well as
possible. While not as good as the original, the result is usually tolerable. This
paragraph has a “D” dropcap set to occupy three lines of text with a font size of 58.5
pts. Depending on the screen width and capabilities of the device you view the book
on, this dropcap can look anything from perfect to ugly.
Links
Two kinds of links are possible, those that refer to an external website and those
that refer to locations inside the document itself. Both are supported by calibre. For
example, here is a link pointing to the calibre download page. Then we have a link that
points back to the section on paragraph level formatting in this document.
Table of Contents
There are two approaches that calibre takes when generating a Table of Contents.
The first is if the Word document has a Table of Contents itself. Provided that the
Table of Contents uses hyperlinks, calibre will automatically use it. The levels of the
Table of Contents are identified by their left indent, so if you want the ebook to have
a multi-level Table of Contents, make sure you create a properly indented Table of
Contents in Word.
If no Table of Contents is found in the document, then a table of contents is
automatically generated from the headings in the document. A heading is identified
as something that has the Heading 1 or Heading 2, etc. style applied to it. These
headings are turned into a Table of Contents with Heading 1 being the topmost level,
Heading 2 the second level and so on.
You can see the Table of Contents created by calibre by clicking the Table of
Contents button in whatever viewer you are using to view the converted ebook.
Demonstration of DOCX support in calibre
1
Text Formatting
61
Inline formatting
61
6
In paged media, footnotes are usually displayed at the bottom of the text. However, in ebooks, a
better paradigm is to make them clickable endnotes that the user can browse at her pleasure. This
conversion is handled automatically by calibre.
Fun with fonts 61
Paragraph level formatting
Tables 62
Structural Elements 64
Footnotes & Endnotes 64
Dropcaps 64
Links
64
Table of Contents
Images 66
Lists
67
Bulleted List
67
Numbered List 67
Multi-level Lists 67
Continued Lists 67
61
64
Images
Images can be of three main types. Inline images are images that are part of the
normal text flow, like this image of a green dot . Inline images do not cause breaks
in the text and are usually small in size. The next category of image is a floating image,
one that “floats “ on the page and is surrounded by text. Word supports more types of
floating images than are possible with current ebook technology, so the conversion
maps floating images to simple left and right floats, as you can see with the left and
right arrow images on the sides of this paragraph.
The final type of image is a “block” image, one that becomes a paragraph on its
own and has no text on either side. Below is a centered green dot.
Centered images like this are useful for large pictures that should be a focus of
attention.
Generally, it is not possible to translate the exact positioning of images from a
Word document to an ebook. That is because in Word, image positioning is specified
in absolute units from the page boundaries. There is no analogous technology in
ebooks, so the conversion will usually end up placing the image either centered or
floating close to the point in the text where it was inserted, not necessarily where it
appears on the page in Word.
Lists
All types of lists are supported by the conversion, with the exception of lists that
use fancy bullets, these get converted to regular bullets.
Bulleted List
 One
 Two
Numbered List
11. One, with a very long line to demonstrate that the hanging indent for the
list is working correctly
12. Two
Multi-level Lists
11. One
11.1. Two
11.1.1.
Three
11.1.2.
Four with a very long line to demonstrate that the hanging indent
for the list is working correctly.
11.1.3.
Five
12. Six
A Multi-level list with bullets:
 One
 Two
This bullet uses an image as the bullet item
o Four
 Five
Continued Lists
xxi.
One
xxii.
Two
An interruption in our regularly scheduled listing, for this essential and very
relevant public service announcement.
xxiii.
We now resume our normal programming
xxiv. Four
Demonstration of DOCX support in calibre
This document demonstrates the ability of the calibre DOCX Input plugin to
convert the various typographic features in a Microsoft Word (2007 and newer)
document. Convert this document to a modern ebook format, such as AZW3 for
Kindles or EPUB for other ebook readers, to see it in action.
There is support for images, tables, lists, footnotes, endnotes, links, dropcaps and
various types of text and paragraph level formatting.
To see the DOCX conversion in action, simply add this file to calibre using the
“Add Books” button and then click “Convert”. Set the output format in the top right
corner of the conversion dialog to EPUB or AZW3 and click “OK”.
Text Formatting
Inline formatting
Here, we demonstrate various types of inline text formatting and the use of
embedded fonts.
Here is some bold, italic, bold-italic, underlined and struck out text. Then, we
have a superscript and a subscript. Now we see some red, green and blue text. Some text
with a yellow highlight. Some text in a box. Some text in inverse video.
A paragraph with styled text: subtle emphasis followed by strong text and
intense emphasis. This paragraph uses document wide styles for styling rather than
inline text properties as demonstrated in the previous paragraph — calibre can handle
both with equal ease.
Fun with fonts
This document has embedded the Ubuntu font family. The body text is in the
Ubuntu typeface, here is some text in the Ubuntu Mono typeface, notice how every
letter has the same width, even i and m. Every embedded font will automatically be
embedded in the output ebook during conversion.
Paragraph level formatting
You can do crazy things with paragraphs, if the urge strikes you. For instance this
paragraph is right aligned and has a right border. It has also been given a light gray
background.
For the lovers of poetry amongst you, paragraphs with hanging indents, like this
often come in handy. You can use hanging indents to ensure that a line of poetry
retains its individual identity as a line even when the screen is too narrow to display it
as a single line. Not only does this paragraph have a hanging indent, it is also has an
extra top margin, setting it apart from the preceding paragraph.
Tables
Tables in Word can vary from the extremely
ITEM
NEEDED
simple to the extremely complex. calibre tries to do
Books
1
its best when converting tables. While you may run
Pens
3
into trouble with the occasional table, the vast
Pencils
2
majority of common cases should be converted very
Highlighte
2 colors
well, as demonstrated in this section. Note that for
r
optimum results, when creating tables in Word, you
Scissors
1 pair
should set their widths using percentages, rather
than absolute units. To the left of this paragraph is a floating two column table with a
nice green border and header row.
Now let’s look at a fancier table—one with alternating row colors and partial
borders. This table is stretched out to take 100% of the available width.
City or
Point A
Point B
Point C
Point D
Point E
Town
Point A
—
Point B
Point C
Point D
Point E
87
64
37
93
—
56
32
35
—
91
54
—
43
—
Next, we see a table with special formatting in various locations. Notice how the
formatting for the header row and sub header rows is preserved.
College
New students
Graduating
Change
students
Undergraduate
110
Cedar
University
Oak Institute
Cedar
University
Elm College
Total
103
+7
202
Graduate
24
210
-8
20
+4
43
53
-10
998
908
90
Source: Fictitious data, for illustration purposes only
Next, we have something a little more complex, a nested table, i.e. a table inside
another table. Additionally, the inner table has some of its cells merged. The table is
displayed horizontally centered.
To the left is a table
One
Two
inside a table, with some
Three
Four
cells merged.
We end with a fancy calendar, note how much of the original formatting is
preserved. Note that this table will only display correctly on relatively wide screens. In
general, very wide tables or tables whose cells have fixed width requirements don’t
fare well in ebooks.
December 2007
Su
n
Mo
n
Tu
e
W
ed
Th
u
Sa
Fri
t
1
2
3
4
5
6
7
8
9
10
1
1
1
17
6
2
1
8
2
24
3
1
3
1
9
2
5
3
1
4
2
0
2
6
1
5
2
1
2
7
1
2
2
2
8
2
9
31
0
Structural Elements
Miscellaneous structural elements you can add to your document, like footnotes,
endnotes, dropcaps and the like.
Footnotes & Endnotes
Footnotes7 and endnotesvii are automatically recognized and both are converted
to endnotes, with backlinks for maximum ease of use in ebook devices.
Dropcaps
D
rop caps are used to emphasize the leading paragraph at the start of a section. In
Word it is possible to specify how many lines of text a drop-cap should use. Because of
limitations in ebook technology, this is not possible when converting. Instead, the
converted drop cap will use font size and line height to simulate the effect as well as
possible. While not as good as the original, the result is usually tolerable. This
paragraph has a “D” dropcap set to occupy three lines of text with a font size of 58.5
pts. Depending on the screen width and capabilities of the device you view the book
on, this dropcap can look anything from perfect to ugly.
Links
Two kinds of links are possible, those that refer to an external website and those
that refer to locations inside the document itself. Both are supported by calibre. For
example, here is a link pointing to the calibre download page. Then we have a link that
points back to the section on paragraph level formatting in this document.
Table of Contents
There are two approaches that calibre takes when generating a Table of Contents.
The first is if the Word document has a Table of Contents itself. Provided that the
Table of Contents uses hyperlinks, calibre will automatically use it. The levels of the
7
In paged media, footnotes are usually displayed at the bottom of the text. However, in ebooks, a
better paradigm is to make them clickable endnotes that the user can browse at her pleasure. This
conversion is handled automatically by calibre.
Table of Contents are identified by their left indent, so if you want the
ebook to have a multi-level Table of Contents, make sure you create a
properly indented Table of Contents in Word.
If no Table of Contents is found in the document, then a table of
contents is automatically generated from the headings in the
document. A heading is identified as something that has the Heading 1 or Heading 2,
etc. style applied to it. These headings are turned into a Table of Contents with
Heading 1 being the topmost level, Heading 2 the second level and so on.
You can see the Table of Contents created by calibre by clicking the Table of
Contents button in whatever viewer you are using to view the converted ebook.
Demonstration of DOCX support in calibre
1
Text Formatting
61
Inline formatting
61
Fun with fonts 61
Paragraph level formatting 61
Tables 62
Structural Elements 64
Footnotes & Endnotes 64
Dropcaps 64
Links
64
Table of Contents
64
Images 66
Lists
67
Bulleted List
67
Numbered List 67
Multi-level Lists 67
Continued Lists 67
Images
Images can be of three main types. Inline images are images that are part of the
normal text flow, like this image of a green dot . Inline images do not cause breaks
in the text and are usually small in size. The next category of image is a floating image,
one that “floats “ on the page and is surrounded by text. Word supports more types of
floating images than are possible with current ebook technology, so the conversion
maps floating images to simple left and right floats, as you can see with the left and
right arrow images on the sides of this paragraph.
The final type of image is a “block” image, one that becomes a paragraph on its
own and has no text on either side. Below is a centered green dot.
Centered images like this are useful for large pictures that should
be a focus of attention.
Generally, it is not possible to translate the exact positioning of
images from a Word document to an ebook. That is because in Word,
image positioning is specified in absolute units from the page boundaries. There is no
analogous technology in ebooks, so the conversion will usually end up placing the
image either centered or floating close to the point in the text where it was inserted,
not necessarily where it appears on the page in Word.
Lists
All types of lists are supported by the conversion, with the exception of lists that
use fancy bullets, these get converted to regular bullets.
Bulleted List
 One
 Two
Numbered List
13. One, with a very long line to demonstrate that the hanging indent for the
list is working correctly
14. Two
Multi-level Lists
13. One
13.1. Two
13.1.1.
Three
13.1.2.
Four with a very long line to demonstrate that the hanging indent
for the list is working correctly.
13.1.3.
Five
14. Six
A Multi-level list with bullets:
 One
 Two
This bullet uses an image as the bullet item
o Four
 Five
Continued Lists
xxv. One
xxvi. Two
An interruption in our regularly scheduled listing, for this essential and very
relevant public service announcement.
xxvii.
We now resume our normal programming
xxviii.
Four
Demonstration of DOCX support in calibre
This document demonstrates the ability of the calibre DOCX Input plugin to
convert the various typographic features in a Microsoft Word (2007 and newer)
document. Convert this document to a modern ebook format, such as AZW3 for
Kindles or EPUB for other ebook readers, to see it in action.
There is support for images, tables, lists, footnotes, endnotes, links, dropcaps and
various types of text and paragraph level formatting.
To see the DOCX conversion in action, simply add this file to calibre using the
“Add Books” button and then click “Convert”. Set the output format in the top right
corner of the conversion dialog to EPUB or AZW3 and click “OK”.
Text Formatting
Inline formatting
Here, we demonstrate various types of inline text formatting and the use of
embedded fonts.
Here is some bold, italic, bold-italic, underlined and struck out text. Then, we
have a superscript and a subscript. Now we see some red, green and blue text. Some text
with a yellow highlight. Some text in a box. Some text in inverse video.
A paragraph with styled text: subtle emphasis followed by strong text and
intense emphasis. This paragraph uses document wide styles for styling rather than
inline text properties as demonstrated in the previous paragraph — calibre can handle
both with equal ease.
Fun with fonts
This document has embedded the Ubuntu font family. The body text is in the
Ubuntu typeface, here is some text in the Ubuntu Mono typeface, notice how every
letter has the same width, even i and m. Every embedded font will automatically be
embedded in the output ebook during conversion.
Paragraph level formatting
You can do crazy things with paragraphs, if the urge strikes you. For instance this
paragraph is right aligned and has a right border. It has also been given a light gray
background.
For the lovers of poetry amongst you, paragraphs with hanging indents, like this
often come in handy. You can use hanging indents to ensure that a line of poetry
retains its individual identity as a line even when the screen is too narrow to display it
as a single line. Not only does this paragraph have a hanging indent, it is also has an
extra top margin, setting it apart from the preceding paragraph.
Tables
ITEM
NEEDED
Tables in Word can vary from the extremely
simple to the extremely complex. calibre tries to do
its best when converting tables. While you may run
into trouble with the occasional table, the vast
majority of common cases should be converted very
r
well, as demonstrated in this section. Note that for
Scissors
1 pair
optimum results, when creating tables in Word, you
should set their widths using percentages, rather than absolute units. To the left of
this paragraph is a floating two column table with a nice green border and header
row.
Now let’s look at a fancier table—one with alternating row colors and partial
borders. This table is stretched out to take 100% of the available width.
City or
Point A
Point B
Point C
Point D
Point E
Town
Books
Pens
Pencils
Highlighte
Point A
Point B
Point C
Point D
Point E
1
3
2
2 colors
—
87
64
37
93
—
56
32
35
—
91
54
—
43
—
Next, we see a table with special formatting in various locations. Notice how the
formatting for the header row and sub header rows is preserved.
College
New students
Graduating
Change
students
Cedar
University
Oak Institute
Cedar
University
Elm College
Undergraduate
110
103
+7
202
Graduate
24
210
-8
20
+4
43
53
-10
Total
998
908
90
Source: Fictitious data, for illustration purposes only
Next, we have something a little more complex, a nested table, i.e. a table inside
another table. Additionally, the inner table has some of its cells merged. The table is
displayed horizontally centered.
One
Three
To the left is a table
inside a table, with some
cells merged.
Two
Four
We end with a fancy calendar, note how much of the original formatting is
preserved. Note that this table will only display correctly on relatively wide screens. In
general, very wide tables or tables whose cells have fixed width requirements don’t
fare well in ebooks.
December 2007
Su
n
Mo
n
Tu
e
W
ed
Th
Sa
u
Fri
t
1
2
3
4
5
6
7
8
9
10
1
1
1
1
1
1
1
17
6
2
1
8
2
24
3
3
1
9
2
5
3
2
0
2
6
4
2
1
2
7
5
2
2
2
8
2
9
31
0
Structural Elements
Miscellaneous structural elements you can add to your document, like footnotes,
endnotes, dropcaps and the like.
Footnotes & Endnotes
Footnotes8 and endnotesviii are automatically recognized and both are converted
to endnotes, with backlinks for maximum ease of use in ebook devices.
Dropcaps
D
8
In paged media, footnotes are usually displayed at the bottom of the text. However, in ebooks, a
better paradigm is to make them clickable endnotes that the user can browse at her pleasure. This
conversion is handled automatically by calibre.
rop caps are used to emphasize the leading paragraph at the start of a section. In
Word it is possible to specify how many lines of text a drop-cap should use. Because of
limitations in ebook technology, this is not possible when converting. Instead, the
converted drop cap will use font size and line height to simulate the effect as well as
possible. While not as good as the original, the result is usually tolerable. This
paragraph has a “D” dropcap set to occupy three lines of text with a font size of 58.5
pts. Depending on the screen width and capabilities of the device you view the book
on, this dropcap can look anything from perfect to ugly.
Links
Two kinds of links are possible, those that refer to an external website and those
that refer to locations inside the document itself. Both are supported by calibre. For
example, here is a link pointing to the calibre download page. Then we have a link that
points back to the section on paragraph level formatting in this document.
Table of Contents
There are two approaches that calibre takes when generating a Table of Contents.
The first is if the Word document has a Table of Contents itself. Provided that the
Table of Contents uses hyperlinks, calibre will automatically use it. The levels of the
Table of Contents are identified by their left indent, so if you want the ebook to have
a multi-level Table of Contents, make sure you create a properly indented Table of
Contents in Word.
If no Table of Contents is found in the document, then a table of contents is
automatically generated from the headings in the document. A heading is identified
as something that has the Heading 1 or Heading 2, etc. style applied to it. These
headings are turned into a Table of Contents with Heading 1 being the topmost level,
Heading 2 the second level and so on.
You can see the Table of Contents created by calibre by clicking the Table of
Contents button in whatever viewer you are using to view the converted ebook.
Demonstration of DOCX support in calibre
1
Text Formatting
61
Inline formatting
61
Fun with fonts 61
Paragraph level formatting 61
Tables 62
Structural Elements 64
Footnotes & Endnotes 64
Dropcaps 64
Links
64
Table of Contents
64
Images 66
Lists
67
Bulleted List
Numbered List
Multi-level Lists
Continued Lists
67
67
67
67
Images
Images can be of three main types. Inline images are images that are part of the
normal text flow, like this image of a green dot . Inline images do not cause breaks
in the text and are usually small in size. The next category of image is a floating image,
one that “floats “ on the page and is surrounded by text. Word supports more types of
floating images than are possible with current ebook technology, so the conversion
maps floating images to simple left and right floats, as you can see with the left and
right arrow images on the sides of this paragraph.
The final type of image is a “block” image, one that becomes a paragraph on its
own and has no text on either side. Below is a centered green dot.
Centered images like this are useful for large pictures that should be a focus of
attention.
Generally, it is not possible to translate the exact positioning of images from a
Word document to an ebook. That is because in Word, image positioning is specified
in absolute units from the page boundaries. There is no analogous technology in
ebooks, so the conversion will usually end up placing the image either centered or
floating close to the point in the text where it was inserted, not necessarily where it
appears on the page in Word.
Lists
All types of lists are supported by the conversion, with the exception of lists that
use fancy bullets, these get converted to regular bullets.
Bulleted List
 One
 Two
Numbered List
15. One, with a very long line to demonstrate that the hanging indent for the
list is working correctly
16. Two
Multi-level Lists
15. One
15.1. Two
15.1.1.
Three
15.1.2.
Four with a very long line to demonstrate that the hanging indent
for the list is working correctly.
15.1.3.
Five
16. Six
A Multi-level list with bullets:
 One
 Two
This bullet uses an image as the bullet item
o Four
 Five
Continued Lists
xxix.
One
xxx.
Two
An interruption in our regularly scheduled listing, for this essential and very
relevant public service announcement.
xxxi.
We now resume our normal programming
xxxii.
Four
Demonstration of DOCX support in calibre
This document demonstrates the ability of the calibre DOCX Input plugin to
convert the various typographic features in a Microsoft Word (2007 and newer)
document. Convert this document to a modern ebook format, such as AZW3 for
Kindles or EPUB for other ebook readers, to see it in action.
There is support for images, tables, lists, footnotes, endnotes, links, dropcaps and
various types of text and paragraph level formatting.
To see the DOCX conversion in action, simply add this file to calibre using the
“Add Books” button and then click “Convert”. Set the output format in the top right
corner of the conversion dialog to EPUB or AZW3 and click “OK”.
Text Formatting
Inline formatting
Here, we demonstrate various types of inline text formatting and the use of
embedded fonts.
Here is some bold, italic, bold-italic, underlined and struck out text. Then, we
have a superscript and a subscript. Now we see some red, green and blue text. Some text
with a yellow highlight. Some text in a box. Some text in inverse video.
A paragraph with styled text: subtle emphasis followed by strong text and
intense emphasis. This paragraph uses document wide styles for styling rather than
inline text properties as demonstrated in the previous paragraph — calibre can handle
both with equal ease.
Fun with fonts
This document has embedded the Ubuntu font family. The body text is in the
Ubuntu typeface, here is some text in the Ubuntu Mono typeface, notice how every
letter has the same width, even i and m. Every embedded font will automatically be
embedded in the output ebook during conversion.
Paragraph level formatting
You can do crazy things with paragraphs, if the urge strikes you. For instance this
paragraph is right aligned and has a right border. It has also been given a light gray
background.
For the lovers of poetry amongst you, paragraphs with hanging indents, like this
often come in handy. You can use hanging indents to ensure that a line of poetry
retains its individual identity as a line even when the screen is too narrow to display it
as a single line. Not only does this paragraph have a hanging indent, it is also has an
extra top margin, setting it apart from the preceding paragraph.
Tables
Tables in Word can vary from the extremely
ITEM
NEEDED
simple to the extremely complex. calibre tries to do
Books
1
its best when converting tables. While you may run
Pens
3
into trouble with the occasional table, the vast
Pencils
2
majority of common cases should be converted very
Highlighte
2 colors
well, as demonstrated in this section. Note that for
r
optimum results, when creating tables in Word, you
Scissors
1 pair
should set their widths using percentages, rather
than absolute units. To the left of this paragraph is a floating two column table with a
nice green border and header row.
Now let’s look at a fancier table—one with alternating row colors and partial
borders. This table is stretched out to take 100% of the available width.
City or
Point A
Point B
Point C
Point D
Point E
Town
Point A
Point B
Point C
Point D
Point E
—
87
64
37
93
—
56
32
35
—
91
54
—
43
—
Next, we see a table with special formatting in various locations. Notice how the
formatting for the header row and sub header rows is preserved.
College
New students
Graduating
Change
students
Undergraduate
110
Cedar
University
Oak Institute
Cedar
University
Elm College
Total
103
+7
202
Graduate
24
210
-8
20
+4
43
998
53
908
-10
90
Source: Fictitious data, for illustration purposes only
Next, we have something a little more complex, a nested table, i.e. a table inside
another table. Additionally, the inner table has some of its cells merged. The table is
displayed horizontally centered.
To the left is a table
One
Two
inside a table, with some
Three
Four
cells merged.
We end with a fancy calendar, note how much of the original formatting is
preserved. Note that this table will only display correctly on relatively wide screens. In
general, very wide tables or tables whose cells have fixed width requirements don’t
fare well in ebooks.
December 2007
Su
n
Mo
n
Tu
e
W
ed
Th
Sa
u
Fri
t
1
2
3
4
5
6
7
8
9
10
1
1
1
1
1
1
1
6
17
2
1
8
3
1
9
4
2
0
5
2
1
2
2
2
3
24
2
5
2
6
2
7
2
8
2
9
3
31
0
Structural Elements
Miscellaneous structural elements you can add to your document, like footnotes,
endnotes, dropcaps and the like.
Footnotes & Endnotes
Footnotes9 and endnotesix are automatically recognized and both are converted
to endnotes, with backlinks for maximum ease of use in ebook devices.
Dropcaps
D
rop caps are used to emphasize the leading paragraph at the start of a section. In
Word it is possible to specify how many lines of text a drop-cap should use. Because of
limitations in ebook technology, this is not possible when converting. Instead, the
converted drop cap will use font size and line height to simulate the effect as well as
possible. While not as good as the original, the result is usually tolerable. This
paragraph has a “D” dropcap set to occupy three lines of text with a font size of 58.5
pts. Depending on the screen width and capabilities of the device you view the book
on, this dropcap can look anything from perfect to ugly.
Links
Two kinds of links are possible, those that refer to an external website and those
that refer to locations inside the document itself. Both are supported by calibre. For
example, here is a link pointing to the calibre download page. Then we have a link that
points back to the section on paragraph level formatting in this document.
Table of Contents
There are two approaches that calibre takes when generating a Table of Contents.
The first is if the Word document has a Table of Contents itself. Provided that the
Table of Contents uses hyperlinks, calibre will automatically use it. The levels of the
Table of Contents are identified by their left indent, so if you want the ebook to have
a multi-level Table of Contents, make sure you create a properly indented Table of
Contents in Word.
If no Table of Contents is found in the document, then a table of contents is
automatically generated from the headings in the document. A heading is identified
as something that has the Heading 1 or Heading 2, etc. style applied to it. These
9
In paged media, footnotes are usually displayed at the bottom of the text. However, in ebooks, a
better paradigm is to make them clickable endnotes that the user can browse at her pleasure. This
conversion is handled automatically by calibre.
headings are turned into a Table of Contents with
Heading 1 being the topmost level, Heading 2 the
second level and so on.
You can see the Table of Contents created by
calibre by clicking the Table of Contents button in
whatever viewer you are using to view the converted ebook.
Demonstration of DOCX support in calibre
1
Text Formatting
61
Inline formatting
61
Fun with fonts 61
Paragraph level formatting 61
Tables 62
Structural Elements 64
Footnotes & Endnotes 64
Dropcaps 64
Links
64
Table of Contents
64
Images 66
Lists
67
Bulleted List
67
Numbered List 67
Multi-level Lists 67
Continued Lists 67
Images
Images can be of three main types. Inline images are images that are part of the
normal text flow, like this image of a green dot . Inline images do not cause breaks
in the text and are usually small in size. The next category of image is a floating image,
one that “floats “ on the page and is surrounded by text. Word supports more types of
floating images than are possible with current ebook technology, so the conversion
maps floating images to simple left and right floats, as you can see with the left and
right arrow images on the sides of this paragraph.
The final type of image is a “block” image, one that becomes a paragraph on its
own and has no text on either side. Below is a centered green dot.
Centered images like this are useful for large pictures that should be a focus of
attention.
Generally, it is not possible to translate the exact positioning of images from a
Word document to an ebook. That is because in Word, image positioning is specified
in absolute units from the page boundaries. There is no analogous technology in
ebooks, so the conversion will usually end up placing the image either centered or
floating close to the point in the text where it was inserted, not necessarily where it
appears on the page in Word.
Lists
All types of lists are supported by the conversion, with the exception of lists that
use fancy bullets, these get converted to regular bullets.
Bulleted List
 One
 Two
Numbered List
17. One, with a very long line to demonstrate that the hanging indent for the
list is working correctly
18. Two
Multi-level Lists
17. One
17.1. Two
17.1.1.
Three
17.1.2.
Four with a very long line to demonstrate that the hanging indent
for the list is working correctly.
17.1.3.
Five
18. Six
A Multi-level list with bullets:
 One
 Two
This bullet uses an image as the bullet item
o Four
 Five
Continued Lists
xxxiii.
One
xxxiv. Two
An interruption in our regularly scheduled listing, for this essential and very
relevant public service announcement.
xxxv. We now resume our normal programming
xxxvi. Four
Demonstration of DOCX support in calibre
This document demonstrates the ability of the calibre DOCX Input plugin to
convert the various typographic features in a Microsoft Word (2007 and newer)
document. Convert this document to a modern ebook format, such as AZW3 for
Kindles or EPUB for other ebook readers, to see it in action.
There is support for images, tables, lists, footnotes, endnotes, links, dropcaps and
various types of text and paragraph level formatting.
To see the DOCX conversion in action, simply add this file to calibre using the
“Add Books” button and then click “Convert”. Set the output format in the top right
corner of the conversion dialog to EPUB or AZW3 and click “OK”.
Text Formatting
Inline formatting
Here, we demonstrate various types of inline text formatting and the use of
embedded fonts.
Here is some bold, italic, bold-italic, underlined and struck out text. Then, we
have a superscript and a subscript. Now we see some red, green and blue text. Some text
with a yellow highlight. Some text in a box. Some text in inverse video.
A paragraph with styled text: subtle emphasis followed by strong text and
intense emphasis. This paragraph uses document wide styles for styling rather than
inline text properties as demonstrated in the previous paragraph — calibre can handle
both with equal ease.
Fun with fonts
This document has embedded the Ubuntu font family. The body text is in the
Ubuntu typeface, here is some text in the Ubuntu Mono typeface, notice how every
letter has the same width, even i and m. Every embedded font will automatically be
embedded in the output ebook during conversion.
Paragraph level formatting
You can do crazy things with paragraphs, if the urge strikes you. For instance this
paragraph is right aligned and has a right border. It has also been given a light gray
background.
For the lovers of poetry amongst you, paragraphs with hanging indents, like this
often come in handy. You can use hanging indents to ensure that a line of poetry
retains its individual identity as a line even when the screen is too narrow to display it
as a single line. Not only does this paragraph have a hanging indent, it is also has an
extra top margin, setting it apart from the preceding paragraph.
Tables
ITEM
NEEDED
Books
1
Pens
3
Pencils
2
Highlighte
2 colors
r
Tables in Word can vary from the extremely
simple to the extremely complex. calibre tries to do
its best when converting tables. While you may run into trouble with the occasional
table, the vast majority of common cases should be converted very well, as
demonstrated in this section. Note that for optimum results, when creating tables in
Word, you should set their widths using percentages, rather than absolute units. To
the left of this paragraph is a floating two column table with a nice green border and
header row.
Now let’s look at a fancier table—one with alternating row colors and partial
borders. This table is stretched out to take 100% of the available width.
City or
Point A
Point B
Point C
Point D
Point E
Town
Scissors
Point A
Point B
Point C
Point D
Point E
1 pair
—
87
64
37
93
—
56
32
35
—
91
54
—
43
—
Next, we see a table with special formatting in various locations. Notice how the
formatting for the header row and sub header rows is preserved.
College
New students
Graduating
Change
students
Cedar
University
Oak Institute
Cedar
University
Elm College
Total
Undergraduate
110
103
+7
202
Graduate
24
210
-8
20
+4
43
53
-10
998
908
90
Source: Fictitious data, for illustration purposes only
Next, we have something a little more complex, a nested table, i.e. a table inside
another table. Additionally, the inner table has some of its cells merged. The table is
displayed horizontally centered.
To the left is a table
One
Two
inside a table, with some
Three
Four
cells merged.
We end with a fancy calendar, note how much of the original formatting is
preserved. Note that this table will only display correctly on relatively wide screens. In
general, very wide tables or tables whose cells have fixed width requirements don’t
fare well in ebooks.
December 2007
Su
n
Mo
n
Tu
e
W
ed
Th
Sa
u
Fri
t
1
2
3
4
5
6
7
8
9
10
1
1
1
1
1
1
1
17
6
2
1
8
2
24
3
3
1
9
2
5
3
2
0
2
6
4
2
1
2
7
5
2
2
2
8
2
9
31
0
Structural Elements
Miscellaneous structural elements you can add to your document, like footnotes,
endnotes, dropcaps and the like.
Footnotes & Endnotes
Footnotes10 and endnotesx are automatically recognized and both are converted
to endnotes, with backlinks for maximum ease of use in ebook devices.
Dropcaps
D
rop caps are used to emphasize the leading paragraph at the start of a section. In
Word it is possible to specify how many lines of text a drop-cap should use. Because of
limitations in ebook technology, this is not possible when converting. Instead, the
converted drop cap will use font size and line height to simulate the effect as well as
10
In paged media, footnotes are usually displayed at the bottom of the text. However, in ebooks, a
better paradigm is to make them clickable endnotes that the user can browse at her pleasure. This
conversion is handled automatically by calibre.
possible. While not as good as the original, the result is usually tolerable. This
paragraph has a “D” dropcap set to occupy three lines of text with a font size of 58.5
pts. Depending on the screen width and capabilities of the device you view the book
on, this dropcap can look anything from perfect to ugly.
Links
Two kinds of links are possible, those that refer to an external website and those
that refer to locations inside the document itself. Both are supported by calibre. For
example, here is a link pointing to the calibre download page. Then we have a link that
points back to the section on paragraph level formatting in this document.
Table of Contents
There are two approaches that calibre takes when generating a Table of Contents.
The first is if the Word document has a Table of Contents itself. Provided that the
Table of Contents uses hyperlinks, calibre will automatically use it. The levels of the
Table of Contents are identified by their left indent, so if you want the ebook to have
a multi-level Table of Contents, make sure you create a properly indented Table of
Contents in Word.
If no Table of Contents is found in the document, then a table of contents is
automatically generated from the headings in the document. A heading is identified
as something that has the Heading 1 or Heading 2, etc. style applied to it. These
headings are turned into a Table of Contents with Heading 1 being the topmost level,
Heading 2 the second level and so on.
You can see the Table of Contents created by calibre by clicking the Table of
Contents button in whatever viewer you are using to view the converted ebook.
Demonstration of DOCX support in calibre
1
Text Formatting
61
Inline formatting
61
Fun with fonts 61
Paragraph level formatting 61
Tables 62
Structural Elements 64
Footnotes & Endnotes 64
Dropcaps 64
Links
64
Table of Contents
64
Images 66
Lists
67
Bulleted List
67
Numbered List 67
Multi-level Lists 67
Continued Lists 67
Images
Images can be of three main types. Inline images
are images that are part of the normal text flow, like
this image of a green dot . Inline images do not cause breaks in the text and are
usually small in size. The next category of image is a floating image, one that “floats “
on the page and is surrounded by text. Word supports more types of floating images
than are possible with current ebook technology, so the conversion maps floating
images to simple left and right floats, as you can see with the left and right arrow
images on the sides of this paragraph.
The final type of image is a “block” image, one that becomes a paragraph on its
own and has no text on either side. Below is a centered green dot.
Centered images like this are useful for large pictures that should be a focus of
attention.
Generally, it is not possible to translate the exact positioning of images from a
Word document to an ebook. That is because in Word, image positioning is specified
in absolute units from the page boundaries. There is no analogous technology in
ebooks, so the conversion will usually end up placing the image either centered or
floating close to the point in the text where it was inserted, not necessarily where it
appears on the page in Word.
Lists
All types of lists are supported by the conversion, with the exception of lists that
use fancy bullets, these get converted to regular bullets.
Bulleted List
 One
 Two
Numbered List
19. One, with a very long line to demonstrate that the hanging indent for the
list is working correctly
20. Two
Multi-level Lists
19. One
19.1. Two
19.1.1.
Three
19.1.2.
Four with a very long line to demonstrate that the hanging indent
for the list is working correctly.
19.1.3.
Five
20. Six
A Multi-level list with bullets:
 One
 Two
This bullet uses an image as the bullet item
o Four
 Five
Continued Lists
xxxvii.
One
xxxviii.
Two
An interruption in our regularly scheduled listing, for this essential and very
relevant public service announcement.
xxxix.
We now resume our normal programming
xl.
Four
Demonstration of DOCX support in calibre
This document demonstrates the ability of the calibre DOCX Input plugin to
convert the various typographic features in a Microsoft Word (2007 and newer)
document. Convert this document to a modern ebook format, such as AZW3 for
Kindles or EPUB for other ebook readers, to see it in action.
There is support for images, tables, lists, footnotes, endnotes, links, dropcaps and
various types of text and paragraph level formatting.
To see the DOCX conversion in action, simply add this file to calibre using the
“Add Books” button and then click “Convert”. Set the output format in the top right
corner of the conversion dialog to EPUB or AZW3 and click “OK”.
Text Formatting
Inline formatting
Here, we demonstrate various types of inline text formatting and the use of
embedded fonts.
Here is some bold, italic, bold-italic, underlined and struck out text. Then, we
have a superscript and a subscript. Now we see some red, green and blue text. Some text
with a yellow highlight. Some text in a box. Some text in inverse video.
A paragraph with styled text: subtle emphasis followed by strong text and
intense emphasis. This paragraph uses document wide styles for styling rather than
inline text properties as demonstrated in the previous paragraph — calibre can handle
both with equal ease.
Fun with fonts
This document has embedded the Ubuntu font family. The body text is in the
Ubuntu typeface, here is some text in the Ubuntu Mono typeface, notice how every
letter has the same width, even i and m. Every embedded font will automatically be
embedded in the output ebook during conversion.
Paragraph level formatting
You can do crazy things with paragraphs, if the urge strikes you. For instance this
paragraph is right aligned and has a right border. It has also been given a light gray
background.
For the lovers of poetry amongst you, paragraphs with hanging indents, like this
often come in handy. You can use hanging indents to ensure that a line of poetry
retains its individual identity as a line even when the screen is too narrow to display it
as a single line. Not only does this paragraph have a hanging indent, it is also has an
extra top margin, setting it apart from the preceding paragraph.
Tables
Tables in Word can vary from the extremely
ITEM
NEEDED
simple to the extremely complex. calibre tries to do
Books
1
its best when converting tables. While you may run
Pens
3
into trouble with the occasional table, the vast
Pencils
2
majority of common cases should be converted very
Highlighte
2 colors
well, as demonstrated in this section. Note that for
r
optimum results, when creating tables in Word, you
Scissors
1 pair
should set their widths using percentages, rather
than absolute units. To the left of this paragraph is a floating two column table with a
nice green border and header row.
Now let’s look at a fancier table—one with alternating row colors and partial
borders. This table is stretched out to take 100% of the available width.
City or
Point A
Point B
Point C
Point D
Point E
Town
Point A
Point B
Point C
Point D
Point E
—
87
64
37
93
—
56
32
35
—
91
54
—
43
—
Next, we see a table with special formatting in various locations. Notice how the
formatting for the header row and sub header rows is preserved.
College
New students
Graduating
Change
students
Cedar
University
Oak Institute
Undergraduate
110
103
+7
202
210
-8
Cedar
University
Elm College
Total
Graduate
24
20
+4
43
53
-10
998
908
90
Source: Fictitious data, for illustration purposes only
Next, we have something a little more complex, a nested table, i.e. a table inside
another table. Additionally, the inner table has some of its cells merged. The table is
displayed horizontally centered.
To the left is a table
One
Two
inside a table, with some
Three
Four
cells merged.
We end with a fancy calendar, note how much of the original formatting is
preserved. Note that this table will only display correctly on relatively wide screens. In
general, very wide tables or tables whose cells have fixed width requirements don’t
fare well in ebooks.
December 2007
Su
n
Mo
n
Tu
e
W
ed
Th
Sa
u
Fri
t
1
2
3
4
5
6
7
8
9
10
1
1
1
1
1
1
1
17
6
2
1
8
2
24
3
3
31
0
Structural Elements
1
9
2
5
3
2
0
2
6
4
2
1
2
7
5
2
2
2
8
2
9
Miscellaneous structural elements you can add to your document, like footnotes,
endnotes, dropcaps and the like.
Footnotes & Endnotes
Footnotes11 and endnotesxi are automatically recognized and both are converted
to endnotes, with backlinks for maximum ease of use in ebook devices.
Dropcaps
D
rop caps are used to emphasize the leading paragraph at the start of a section. In
Word it is possible to specify how many lines of text a drop-cap should use. Because of
limitations in ebook technology, this is not possible when converting. Instead, the
converted drop cap will use font size and line height to simulate the effect as well as
possible. While not as good as the original, the result is usually tolerable. This
paragraph has a “D” dropcap set to occupy three lines of text with a font size of 58.5
pts. Depending on the screen width and capabilities of the device you view the book
on, this dropcap can look anything from perfect to ugly.
Links
Two kinds of links are possible, those that refer to an external website and those
that refer to locations inside the document itself. Both are supported by calibre. For
example, here is a link pointing to the calibre download page. Then we have a link that
points back to the section on paragraph level formatting in this document.
Table of Contents
There are two approaches that calibre takes when generating a Table of Contents.
The first is if the Word document has a Table of Contents itself. Provided that the
Table of Contents uses hyperlinks, calibre will automatically use it. The levels of the
Table of Contents are identified by their left indent, so if you want the ebook to have
a multi-level Table of Contents, make sure you create a properly indented Table of
Contents in Word.
If no Table of Contents is found in the document, then a table of contents is
automatically generated from the headings in the document. A heading is identified
as something that has the Heading 1 or Heading 2, etc. style applied to it. These
headings are turned into a Table of Contents with Heading 1 being the topmost level,
Heading 2 the second level and so on.
You can see the Table of Contents created by calibre by clicking the Table of
Contents button in whatever viewer you are using to view the converted ebook.
Demonstration of DOCX support in calibre
1
Text Formatting
61
Inline formatting
61
11
In paged media, footnotes are usually displayed at the bottom of the text. However, in ebooks, a
better paradigm is to make them clickable endnotes that the user can browse at her pleasure. This
conversion is handled automatically by calibre.
Fun with fonts 61
Paragraph level formatting
Tables 62
Structural Elements 64
Footnotes & Endnotes 64
Dropcaps 64
Links
64
Table of Contents
Images 66
Lists
67
Bulleted List
67
Numbered List 67
Multi-level Lists 67
Continued Lists 67
61
64
Images
Images can be of three main types. Inline images are images that are part of the
normal text flow, like this image of a green dot . Inline images do not cause breaks
in the text and are usually small in size. The next category of image is a floating image,
one that “floats “ on the page and is surrounded by text. Word supports more types of
floating images than are possible with current ebook technology, so the conversion
maps floating images to simple left and right floats, as you can see with the left and
right arrow images on the sides of this paragraph.
The final type of image is a “block” image, one that becomes a paragraph on its
own and has no text on either side. Below is a centered green dot.
Centered images like this are useful for large pictures that should be a focus of
attention.
Generally, it is not possible to translate the exact positioning of images from a
Word document to an ebook. That is because in Word, image positioning is specified
in absolute units from the page boundaries. There is no analogous technology in
ebooks, so the conversion will usually end up placing the image either centered or
floating close to the point in the text where it was inserted, not necessarily where it
appears on the page in Word.
Lists
All types of lists are supported by the conversion, with the exception of lists that
use fancy bullets, these get converted to regular bullets.
Bulleted List
 One
 Two
Numbered List
21. One, with a very long line to demonstrate that the hanging indent for the
list is working correctly
22. Two
Multi-level Lists
21. One
21.1. Two
21.1.1.
Three
21.1.2.
Four with a very long line to demonstrate that the hanging indent
for the list is working correctly.
21.1.3.
Five
22. Six
A Multi-level list with bullets:
 One
 Two
This bullet uses an image as the bullet item
o Four
 Five
Continued Lists
xli.
One
xlii.
Two
An interruption in our regularly scheduled listing, for this essential and very
relevant public service announcement.
xliii.
We now resume our normal programming
xliv. Four
Text Formatting
Inline formatting
Here, we demonstrate various types of inline text formatting and the use of
embedded fonts.
Here is some bold, italic, bold-italic, underlined and struck out text. Then, we
have a superscript and a subscript. Now we see some red, green and blue text. Some text
with a yellow highlight. Some text in a box. Some text in inverse video.
A paragraph with styled text: subtle emphasis followed by strong text and
intense emphasis. This paragraph uses document wide styles for styling rather than
inline text properties as demonstrated in the previous paragraph — calibre can handle
both with equal ease.
Fun with fonts
This document has embedded the Ubuntu font family. The body text is in the
Ubuntu typeface, here is some text in the Ubuntu Mono typeface, notice how every
letter has the same width, even i and m. Every embedded font will automatically
be embedded in the output ebook during conversion.
Paragraph level formatting
You can do crazy things with paragraphs, if the urge strikes you. For instance this
paragraph is right aligned and has a right border. It has also been given a light gray
background.
For the lovers of poetry amongst you, paragraphs with hanging indents, like this often
come in handy. You can use hanging indents to ensure that a line of poetry
retains its individual identity as a line even when the screen is too narrow to
display it as a single line. Not only does this paragraph have a hanging indent, it
is also has an extra top margin, setting it apart from the preceding paragraph.
Tables
Tables in Word can vary from the extremely
simple to the extremely complex. calibre tries to do
its best when converting tables. While you may run
into trouble with the occasional table, the vast
majority of common cases should be converted very
well, as demonstrated in this section. Note that for
optimum results, when creating tables in Word, you should set their widths using
percentages, rather than absolute units. To the left of this paragraph is a floating two
column table with a nice green border and header row.
Now let’s look at a fancier table—one with alternating row colors and partial
borders. This table is stretched out to take 100% of the available width.
ITEM
Books
Pens
Pencils
Highlighter
Scissors
City or
Town
Point A
Point B
Point C
Point D
Point E
NEEDED
1
3
2
2 colors
1 pair
Point A
Point B
Point C
Point D
Point E
—
87
64
37
93
—
56
32
35
—
91
54
—
43
—
Next, we see a table with special formatting in various locations. Notice how the
formatting for the header row and sub header rows is preserved.
College
New students
Graduating
students
Change
Undergraduate
Cedar University
110
103
+7
Oak Institute
202
210
-8
Graduate
Cedar University
24
20
+4
Elm College
43
53
-10
998
908
90
Total
Source: Fictitious data, for illustration purposes only
Next, we have something a little more complex, a nested table, i.e. a table inside
another table. Additionally, the inner table has some of its cells merged. The table is
displayed horizontally centered.
One
Two
Three
Four
To the left is a table inside a
table, with some cells
merged.
We end with a fancy calendar, note how much of the original formatting is
preserved. Note that this table will only display correctly on relatively wide screens. In
general, very wide tables or tables whose cells have fixed width requirements don’t
fare well in ebooks.
December 2007
Su
n
Mon
Tu
e
We
d
Th
u
Fri
Sat
1
2
3
4
5
6
7
8
9
10
1
1
1
2
1
3
1
4
1
5
1
6
17
1
8
1
9
2
0
2
1
2
2
2
3
24
2
5
2
6
2
7
2
8
2
9
3
0
31
Structural Elements
Miscellaneous structural elements you can add to your document, like footnotes,
endnotes, dropcaps and the like.
Footnotes & Endnotes
Footnotes12 and endnotesxii are automatically recognized and both are converted
to endnotes, with backlinks for maximum ease of use in ebook devices.
Dropcaps
D
rop caps are used to emphasize the leading paragraph at the start of a
section. In Word it is possible to specify how many lines of text a drop-cap
should use. Because of limitations in ebook technology, this is not possible
when converting. Instead, the converted drop cap will use font size and line height to
simulate the effect as well as possible. While not as good as the original, the result is
usually tolerable. This paragraph has a “D” dropcap set to occupy three lines of text
with a font size of 58.5 pts. Depending on the screen width and capabilities of the
device you view the book on, this dropcap can look anything from perfect to ugly.
Links
Two kinds of links are possible, those that refer to an external website and those
that refer to locations inside the document itself. Both are supported by calibre. For
example, here is a link pointing to the calibre download page. Then we have a link that
points back to the section on paragraph level formatting in this document.
Table of Contents
There are two approaches that calibre takes when generating a Table of Contents.
The first is if the Word document has a Table of Contents itself. Provided that the
Table of Contents uses hyperlinks, calibre will automatically use it. The levels of the
Table of Contents are identified by their left indent, so if you want the ebook to have
a multi-level Table of Contents, make sure you create a properly indented Table of
Contents in Word.
If no Table of Contents is found in the document, then a table of contents is
automatically generated from the headings in the document. A heading is identified
as something that has the Heading 1 or Heading 2, etc. style applied to it. These
headings are turned into a Table of Contents with Heading 1 being the topmost level,
Heading 2 the second level and so on.
12
In paged media, footnotes are usually displayed at the bottom of the text. However, in ebooks, a
better paradigm is to make them clickable endnotes that the user can browse at her pleasure. This
conversion is handled automatically by calibre.
You can see the Table of Contents created by calibre by clicking the Table of
Contents button in whatever viewer you are using to view the converted ebook.
Demonstration of DOCX support in calibre................................................................. 1
Text Formatting ................................................................................................................ 61
Inline formatting.............................................................................................................. 61
Fun with fonts .................................................................................................................. 61
Paragraph level formatting............................................................................................ 61
Tables ................................................................................................................................... 62
Structural Elements ......................................................................................................... 64
Footnotes & Endnotes ..................................................................................................... 64
Dropcaps ........................................................................................................................... 64
Links .................................................................................................................................. 64
Table of Contents ............................................................................................................ 64
Images .................................................................................................................................. 66
Lists ...................................................................................................................................... 67
Bulleted List ..................................................................................................................... 67
Numbered List .................................................................................................................. 67
Multi-level Lists ................................................................................................................ 67
Continued Lists ................................................................................................................ 67
Images
Images can be of three main types. Inline images
are images that are part of the normal text flow, like
this image of a green dot . Inline images do not
cause breaks in the text and are usually small in size. The next category of image is a
floating image, one that “floats “ on the page and is surrounded by text. Word
supports more types of floating images than are possible with current ebook
technology, so the conversion maps floating images to simple left and right floats, as
you can see with the left and right arrow images on the sides of this paragraph.
The final type of image is a “block” image, one that becomes a paragraph on its
own and has no text on either side. Below is a centered green dot.
Centered images like this are useful for large pictures that should be a focus of
attention.
Generally, it is not possible to translate the exact positioning of images from a
Word document to an ebook. That is because in Word, image positioning is specified
in absolute units from the page boundaries. There is no analogous technology in
ebooks, so the conversion will usually end up placing the image either centered or
floating close to the point in the text where it was inserted, not necessarily where it
appears on the page in Word.
Lists
All types of lists are supported by the conversion, with the exception of lists that
use fancy bullets, these get converted to regular bullets.
Bulleted List


One
Two
Numbered List
23. One, with a very long line to demonstrate that the hanging indent for the
list is working correctly
24. Two
Multi-level Lists
23. One
23.1. Two
23.1.1.
Three
23.1.2.
Four with a very long line to demonstrate that the hanging indent
for the list is working correctly.
23.1.3.
Five
24. Six
A Multi-level list with bullets:
 One
 Two
This bullet uses an image as the bullet item
o Four
 Five
Continued Lists
xlv. One
xlvi. Two
An interruption in our regularly scheduled listing, for this essential and very
relevant public service announcement.
xlvii.
We now resume our normal programming
xlviii.
Four
i
Endnotes are typically used for longer notes, they remain endnotes when converted into ebook
form, except that they have an additional backlink to make it easy to return to the current position
after reading the note.
ii
Endnotes are typically used for longer notes, they remain endnotes when converted into ebook
form, except that they have an additional backlink to make it easy to return to the current position
after reading the note.
iii
Endnotes are typically used for longer notes, they remain endnotes when converted into ebook
form, except that they have an additional backlink to make it easy to return to the current position
after reading the note.
iv
Endnotes are typically used for longer notes, they remain endnotes when converted into ebook
form, except that they have an additional backlink to make it easy to return to the current position
after reading the note.
v
Endnotes are typically used for longer notes, they remain endnotes when converted into ebook
form, except that they have an additional backlink to make it easy to return to the current position
after reading the note.
vi
Endnotes are typically used for longer notes, they remain endnotes when converted into ebook
form, except that they have an additional backlink to make it easy to return to the current position
after reading the note.
vii
Endnotes are typically used for longer notes, they remain endnotes when converted into ebook
form, except that they have an additional backlink to make it easy to return to the current position
after reading the note.
viii
Endnotes are typically used for longer notes, they remain endnotes when converted into ebook
form, except that they have an additional backlink to make it easy to return to the current position
after reading the note.
ix
Endnotes are typically used for longer notes, they remain endnotes when converted into ebook
form, except that they have an additional backlink to make it easy to return to the current position
after reading the note.
x
Endnotes are typically used for longer notes, they remain endnotes when converted into ebook
form, except that they have an additional backlink to make it easy to return to the current position
after reading the note.
xi
Endnotes are typically used for longer notes, they remain endnotes when converted into ebook
form, except that they have an additional backlink to make it easy to return to the current position
after reading the note.
xii
Endnotes are typically used for longer notes, they remain endnotes when converted into ebook
form, except that they have an additional backlink to make it easy to return to the current position
after reading the note.
Download