Which option should I choose?
Choose "Plain text" when you want to:
- Pull the text out of a Word, Excel, PowerPoint, or PDF file
- Get a simple version you can read, search, or copy-paste
- Feed a document into an AI tool or text analysis pipeline
- Create a starting point for writing accessible Markdown documentation
Accepts: Word (.docx), Excel (.xlsx, .xls), PowerPoint (.pptx), PDF (.pdf), HTML, CSV, JSON, XML, ePub (.epub), and ZIP files.
You get back: a .md (Markdown) file -- plain text with simple formatting marks for headings, lists, and links. Opens in any text editor.
Powered by: MarkItDown (Microsoft). This tool is optimized for extracting readable text and preserving document structure (headings, lists, tables, links) as lightweight Markdown. It handles the widest range of input formats but produces plain text, not a finished web page.
Choose "Accessible web page" when you want to:
- Turn a Markdown, Word, or other text document into a page people can read in a browser
- Create an HTML version that already meets ACB Large Print and WCAG accessibility standards
- Get a file you can upload to a website, email as an attachment, or paste into a content management system
- Publish meeting minutes, agendas, reports, or other documents online in an accessible format
Accepts: Markdown (.md), Word (.docx), reStructuredText (.rst), OpenDocument (.odt), Rich Text (.rtf), and ePub (.epub) files.
You get back: a standalone .html web page with accessibility formatting already built in -- Arial font, large text, proper spacing, high contrast, and all the other ACB requirements. Just open it in a browser or upload it to your website.
Powered by: Pandoc (John MacFarlane, UC Berkeley). Pandoc is the gold standard for document-to-HTML conversion. It understands heading hierarchy, semantic lists, tables, footnotes, and cross-references -- and produces clean, well-structured HTML. We then embed our ACB Large Print CSS so the output meets accessibility standards out of the box. Pandoc accepts fewer input formats than MarkItDown, but produces much higher quality HTML output.
Choose "Word document" when you want to:
- Deliver an editable document to someone who uses Microsoft Word
- Convert Markdown or other text formats into a Word file for collaboration or review
- Create a starting point for a Word document that you will format with ACB styles later
- Share content with people who cannot open HTML or EPUB files
Accepts: Markdown (.md), reStructuredText (.rst), OpenDocument (.odt), Rich Text (.rtf), and ePub (.epub) files. (Does not accept .docx input -- the file is already a Word document.)
You get back: a .docx Word document with headings, lists, tables, and links preserved. You can then open it in Word and apply ACB Large Print styles using our desktop tool or the Word add-in.
Powered by: Pandoc. Pandoc produces clean, well-structured Word documents that use Word's built-in heading styles (Heading 1, Heading 2, etc.), making them easy to format and audit for accessibility.
Choose "EPUB 3 e-book (via Pandoc)" when you want to:
- Create an e-book from Markdown, Word, or other text documents
- Read the content on an e-reader, phone, or tablet
- Get an EPUB with ACB Large Print CSS already embedded
- Quickly produce an EPUB without needing the DAISY Pipeline service running
Accepts: Markdown (.md), Word (.docx), reStructuredText (.rst), OpenDocument (.odt), and Rich Text (.rtf) files. (Does not accept .epub input -- the file is already an EPUB.)
You get back: an .epub file with ACB Large Print CSS embedded. The EPUB uses the same Arial font, large text, and accessibility formatting as the HTML output.
Powered by: Pandoc. Pandoc produces EPUB 3 files with proper navigation documents and chapter structure. This option is always available when Pandoc is installed -- unlike the Pipeline option below, it does not require a separate DAISY Pipeline service.
Choose "Accessible PDF" when you want to:
- Create a fixed-layout document that looks the same on every screen and printer
- Print meeting agendas, reports, or newsletters in ACB Large Print format
- Share a document that cannot be easily edited (read-only distribution)
- Produce a print-ready file with proper margins, binding gutters, and page breaks
Accepts: Markdown (.md), Word (.docx), reStructuredText (.rst), OpenDocument (.odt), Rich Text (.rtf), and ePub (.epub) files.
You get back: a .pdf file with ACB BOP print formatting -- Arial 18-point body text, 22-point headings, 20-point subheadings, 1.15 line spacing (ACB print standard), 1-inch margins, flush-left alignment, and underline-only emphasis. If you check the binding margin option, the left margin increases to 1.5 inches.
Powered by: Pandoc and WeasyPrint (CourtBouillon, BSD license). Pandoc converts your document to HTML, then WeasyPrint renders that HTML with our ACB print CSS into a PDF. WeasyPrint is a CSS-based PDF engine, so the same ACB BOP typography and spacing rules that work in the browser also produce pixel-perfect print output.
Choose "EPUB or DAISY format" when you want to:
- Create an accessible EPUB 3 e-book from a Word document or HTML page
- Distribute content to people who use e-readers, refreshable braille displays, or DAISY players
- Convert an EPUB to DAISY 2.02 talking book format for playback on dedicated DAISY hardware
- Convert an EPUB to DAISY 3 / DTBook format for structure-preserving archival
Accepts: depends on the specific conversion -- Word (.docx), HTML (.html), or ePub (.epub).
You get back: an .epub file (for EPUB conversions) or a .zip file containing the DAISY output folder (for DAISY conversions).
Powered by: DAISY Pipeline 2 (DAISY Consortium). Pipeline is purpose-built for accessible publishing. Unlike the other two tools, Pipeline produces packaged publications -- EPUB e-books and DAISY talking books -- designed for assistive technology. It adds proper EPUB accessibility metadata, navigation documents, and reading order. Use Pipeline when the end result needs to work on an e-reader, a refreshable braille display, or a DAISY player, rather than in a web browser.
How do I decide between them?
Think about who will read it and how they will read it:
- Reading on screen or sharing online? Use "Accessible web page" -- you get an HTML file with ACB formatting that looks right in any browser.
- Printing on paper? Use "Accessible PDF" -- you get a fixed-layout document with ACB BOP print formatting (18pt Arial, 1.15 line spacing, 1-inch margins) that prints identically on any printer.
- Sharing an editable document? Use "Word document" -- you get a .docx file people can open and edit in Microsoft Word.
- Reading on an e-reader or phone? Use "EPUB 3 e-book" -- the content reflows to fit small screens and works with e-reader accessibility features.
- Reading on a DAISY player or braille display? Use "EPUB or DAISY format" (Pipeline) -- these are the formats those devices expect.
- Editing, searching, or feeding to an AI tool? Use "Plain text" -- you get raw content in a simple format that works everywhere.
- Pasting into WordPress or Drupal? Use Export to HTML instead -- it has a CMS Fragment mode that avoids conflicts with your site's theme.
Can I chain them together?
Yes, and this is often the best approach for complex workflows:
- Start with Plain text to extract content from a PDF or PowerPoint into Markdown.
- Edit the Markdown in any text editor to clean up headings, fix links, and improve structure.
- Then convert the Markdown to an accessible web page, Word document, EPUB, or PDF using one of the other options.
This two-step process often produces better results than a single direct conversion, because you can review and improve the content between steps.
Another common workflow: convert a Word document to HTML for your website, then also convert the same source to PDF for a print handout and to EPUB for e-reader distribution -- all from the same source file.
What is the difference between "Export to HTML" and this page?
Export (in the main navigation) is specifically for Word documents and gives you extra options like CMS fragments for pasting into WordPress or Drupal. This page handles a wider range of input formats and always produces a complete, standalone web page with all the accessibility styling included.