mirror of https://github.com/foliojs/pdfkit.git synced 2025-12-08 20:15:54 +00:00

Ben Schmidt 272444e397 Fully support accessibility features from PDF reference.

2021-01-15 16:43:49 -03:00

11 KiB

Raw Blame History

Accessibility

Accessible PDFs are usable by visually impaired users who rely on screen readers/text-to-speech engines/vocalisation.

The two main tasks required to create accessible PDFs are marking content and defining the document's logical structure. These are detailed in the following sections.

Some other simpler tasks are also required.

This checklist covers everything that is required to create a conformant Tagged PDF:

Pass the option pdfVersion: '1.5' (or a higher version) when creating your PDFDocument (depending on the features you use, you may only need 1.4; refer to the PDF reference for details).
Pass the option tagged: true when creating your PDFDocument (technically, this sets the Marked property in the Markings dictionary to true in the PDF).
Specify natural language in the document options and/or logical structure and/or non-structure marked Span content.
Add logical structure with all significant content included.
Include accessibility information (such as alternative text, actual text, etc.) in the logical structure and/or non-structure marked Span content.
Include all spaces which separate words/sentences/etc. in your marked structure content, even at the ends of lines, paragraphs, etc.. I.e. don't do doc.text("Hello, world!") but instead do doc.text("Hello, world! ").
Mark all non-structure content as artifacts.
As well as creating the logical structure, write objects to the PDF in the natural "reading order".

Marked Content

Marked content sequences are foundational to creating accessible PDFs.

All marked content sequences are associated with a registered tag, such as 'Span'.

Example of marking content:

// Mark some text as a "Span"
doc.markContent('Span');
doc.text('Hello, world! ');
doc.endMarkedContent();

Marked content is automatically ended when a page is ended, and if a new page is automatically added by text wrapping, marking is automatically begun again on the new page.

Tags to use are listed in a later section.

Marked Content Options

When marking content, you can provide options (take care to use correct capitalisation):

type - used for artifact content; may be Pagination (e.g. headers and footers), Layout (e.g. rules and backgrounds) or Page (cut marks etc.)
bbox - bounding box for artifact content: [left, top, right, bottom] in default coordinates
attached - used for Pagination artifact content, array of one or more strings: Top, Bottom, Left, Right
lang - used for Span content: human language code (e.g. en-AU) which overrides default document language, and any enclosing structure element language
alt - used for Span content: alternative text for an image or other visual content
expanded - used for Span content: the expanded form of an abbreviation or acronym
actual - used for Span content: the actual text the content represents (e.g. if it is rendered as vector graphics)

It is advisable not to use Span content for specifying alternative text, expanded form, or actual text, especially if there is a possibility of the content automatically wrapping, which would result in the text appearing twice. Set these options on an associated structure element instead.

Logical Structure

Logical structures defines the reading order of a document, and can provide alternative text for images and other visual content.

To define logical structure, you need to mark the structure content, keep a reference to it, then incorporate it into a structure tree.

So far, PDFKit only supports marked content in the logical structure, not annotations, forms, or anything else.

Example of marking structure content:

// Mark some text as a paragraph ("P"); the tag should match the intended structure element's type
const myStructContent = doc.markStructureContent('P');
doc.text('Hello, world! ');
doc.endMarkedContent();

Example of the simplest of structure trees:

// Add a single structure element which includes the structure content to the document's structure
doc.addStructure(doc.struct('P', [ myStructContent ]));

Tags/element types to use are listed in a later section.

Note that to be conformant to Tagged PDF, all content not part of the logical structure should be marked as Artifact.

Automatic Ending of Structure Content and Artifacts

Structure content does not nest, and is mutually exclusive with artifact content; marking structure or artifact content will automatically end current marking of structure or artifact content (and any descendent marking):

// Mark multiple paragraphs without needing to close them
doc.markContent('Artifact', { type: "Layout" });
doc.rect(x1, y1, w1, h1);
const myStructContent = doc.markStructureContent('P');
doc.text('Hello, world! ');
doc.markContent('Artifact', { type: "Layout" });
doc.rect(x2, y2, w2, h2);
const myStructContent = doc.markStructureContent('P');
doc.markContent('Span');
doc.text('Bonjour, tout le monde! ');
doc.markContent('Artifact', { type: "Layout" });
doc.rect(x3, y3, w3, h3);
const myStructContent = doc.markStructureContent('P');
doc.text('Hello again! ');

Complex Structure

Multiple elements may be added directly to the document, and may nest:

// Create nested structure elements
const section1 = doc.struct('Sect', [
    doc.struct('P', [
        someTextStructureContent,
        doc.struct('Link', [ someLinkStructureContent ]),
        moreTextStructureContent
    ])
]);
const section2 = doc.struct('Sect', [ secondSectionStructureContent ]);

// Add them to the document's structure
doc.addStructure(section1).addStructure(section2);

Incremental Construction of Structure

Structure can be built incrementally. Elements can optionally be (recursively) ended once you have finished adding to them, allowing them to be flushed out as soon as possible:

// Begin a new section and add it to the document's structure
const mySection = doc.struct('Sect');
doc.addToStructure(mySection);

// Create a new paragraph and add it to the section
const myParagraph = doc.struct('P');
mySection.add(myParagraph);

// Add content, both to the page, and the paragraph
const myParagraphContent = doc.markStructureContent('P');
myParagraph.add(myParagraphContent);
doc.text('Hello, world! ');

// End the paragraph, allowing it to be flushed out, freeing memory
myParagraph.end();

Note that if you provide content when creating a structure element (i.e. providing it to doc.struct() rather than using structElem.add()) then structElem.end() is called automatically. You therefore should not add additional content, as the element may already have been flushed out. Do not mix atomic and incremental styles for the same structure element.

Structure Element Options

When creating a structure element, you can provide options:

title - title of the structure element (e.g. "Chapter 1")
lang - human language code (e.g. en-AU) which overrides default document language
alt - alternative text for an image or other visual content
expanded - the expanded form of an abbreviation or acronym
actual - the actual text the content represents (e.g. if it is rendered as vector graphics)

Example of a structure tree with options specified:

const titlePage = doc.struct('Sect', {
    title: 'Title Page'
}, [
    doc.struct('H', [
        doc.struct('Span', {
            expanded: 'Portable Document Format/Universal Accessibility',
            actual: 'PDF/UA'
        }, [
            pdfUAStructureContent
        ]),
        doc.struct('Span', {
            actual: 'in a Nutshell'
        }, [
            inANutshellStructureContent
        ]),
    ]),
    doc.struct('Figure', {
        alt: 'photo of a concrete path with tactile paving'
    }, [
        photoStructureContent
    ])
]);

Tags and Structure Element Types

Here are the tags and structure element types which are defined in Tagged PDF. You must ensure you give them with the correct capitalisation.

Tagged PDF also supports custom types which map to standard types, but PDFKit does not have support for this.

Non-structure tags:

Artifact - used to mark all content not part of the logical structure
ReversedChars - every string of text has characters in reverse order for technical reasons (due to how fonts work for right-to-left languages); strings may have spaces at the beginning or end to separate words, but may not have spaces in the middle

"Grouping" elements:

Document - whole document; must be used if there are multiple parts or articles
Part - part of a document
Art - article
Sect - section; may nest
Div - generic division
BlockQuote - block quotation
Caption - describing a figure or table
TOC - table of contents, may be nested, and may be used for lists of figures, tables, etc.
TOCI - table of contents (leaf) item
Index - index (text with accompanying Reference content)
NonStruct - non-structural grouping element (element itself not intended to be exported to other formats like HTML, but 'transparent' to its content which is processed normally)
Private - content only meaningful to the creator (element and its content not intended to be exported to other formats like HTML)

"Block" elements:

H - heading (first element in a section, etc.)
H1 to H6 - heading of a particular level intended for use only if nesting sections is not possible for some reason
P - paragraph
L - list; should include optional Caption, and list items
LI - list item; should contain Lbl and/or LBody
Lbl - label (bullet, number, or "dictionary headword")
LBody - list body (item text, or "dictionary definition"); may have nested lists or other blocks

"Table" elements:

Table - table; should either contain TR, or THead, TBody and/or TFoot
TR - table row
TH - table heading cell
TD - table data cell
THead - table header row group
TBody - table body row group; may have more than one per table
TFoot - table footer row group

"Inline" elements:

Span - generic inline content
Quote - inline quotation
Note - e.g. footnote; may have a Lbl (see "block" elements)
Reference - content in a document that refers to other content (e.g. page number in an index)
BibEntry - bibliography entry; may have a Lbl (see "block" elements)
Code - code
Link - hyperlink; should contain a link annotation
Annot - annotation (other than a link)
Ruby - Chinese/Japanese pronunciation/explanation
RB - Ruby base text
RT - Ruby annotation text
RP - Ruby punctuation
Warichu - Japanese/Chinese longer description
WT - Warichu text
WP - Warichu punctuation

"Illustration" elements (should have alt and/or actualtext set):

Figure - figure
Formula - formula
Form - form widget

11 KiB Raw Blame History