Free Word to PDF Converter
Convert Word (.docx) files to PDF instantly. No sign-up required. Your files never leave your device.
Supports .docx · up to 25 MB
How It Works
- Upload Word file: Drop or select a .docx file to convert.
- Configure options: Choose your preferred page size and margin settings.
- Convert: Click "Convert to PDF" to generate your PDF. Conversion happens instantly in your browser.
Why Convert to PDF?
PDF is the universal document format. Converting Word documents to PDF ensures consistent formatting across all devices and operating systems, makes documents read-only to prevent accidental edits, reduces file size, and is ideal for sharing, archiving, and professional submissions. PDFs are compatible everywhere and preserve your original formatting.
What "DOCX to PDF" actually means
A `.docx` file is not a single document in the way it appears in Word. It is a ZIP archive (the Open Packaging Conventions container defined in ISO/IEC 29500-2) holding a tree of XML files, embedded images, and relationship metadata. The main document text lives in word/document.xml as WordprocessingML markup; paragraph styles live in word/styles.xml; list numbering in word/numbering.xml; embedded images sit in word/media/. Rename any `.docx` to `.zip` and unzip it to see the structure with a text editor. The format was introduced with Microsoft Office 2007, standardised as ECMA-376 in December 2006 and as ISO/IEC 29500 in 2008, and is the default Word format used by hundreds of millions of authors today.
A PDF (Portable Document Format) describes the opposite paradigm: fixed-layout pages with glyphs at exact coordinates, embedded fonts, and absolute positioning. Where DOCX is flow-based (text reflows when the page size, margins, or fonts change), PDF is pixel-positioned (every character occupies a specific point in the page coordinate system). Converting between the two is structurally a translation between paradigms: a flexible authoring format becomes a frozen rendering format. Every conversion tool, whether Microsoft Word itself, LibreOffice's headless converter, Smallpdf's cloud engine, or this browser tool, faces the same inherent gap: the source describes intent, the destination describes appearance, and the intent has to be committed to a specific appearance during conversion.
Different conversion approaches accept different trade-offs in that gap. Word's own Save As PDF uses the same rendering engine that displays the document on screen, so the conversion is essentially "render to a PDF surface rather than a screen surface"; fidelity is excellent and text is selectable in the output. Cloud services typically use server-side LibreOffice or a proprietary engine; fidelity is high, text is selectable, the file gets uploaded. This browser tool uses a different pipeline (described in the next section) that optimises for privacy and reach (any browser, any device, no install, no upload) and accepts a fidelity penalty in exchange. None of these approaches is universally best; each fits different workflows.
How this tool works under the hood
The conversion runs entirely in your browser through a three-stage pipeline: mammoth.js parses the DOCX into semantic HTML, html2canvas (invoked by html2pdf.js) paints that HTML onto an off-screen `<canvas>` element using the browser's own rendering primitives, and jsPDF (also invoked by html2pdf.js) wraps the canvas as JPEG image data, places it on a PDF page, and adds page breaks at the boundaries the configured page size and margins imply. The output PDF is a sequence of full-page bitmap images, one or more per source page, plus the PDF metadata header.
An important consequence of this pipeline that deserves an upfront, honest statement: the output PDF is bitmap-based, not text-based. The visible characters in the PDF body are pixels in an image, not text-positioning operators with embedded font data. Practically, this means: search-in-PDF will not find words; copy-paste from the page body returns nothing; screen readers cannot read the content as text; OCR is needed if the PDF is fed into downstream text-processing tools. For workflows where any of these matter (legal discovery, accessibility submissions, document archives intended for search, anything that will be parsed by software), the right tool is Microsoft Word's File → Save As PDF or LibreOffice's File → Export as PDF, both of which produce text-based PDFs with selectable, searchable, screen-reader-accessible text. This tool's purpose is to deliver a clean visual PDF from a Word document without ever uploading the document anywhere; the bitmap output is the cost of that privacy guarantee.
When you select a `.docx`, the browser's File API hands the bytes to mammoth.js. mammoth treats the DOCX as an OPC package, extracts word/document.xml, and walks the WordprocessingML tree: `<w:p>` becomes `<p>`, `<w:tbl>` becomes `<table>`, runs with bold/italic become inline markup, hyperlinks become `<a>`, embedded images in `word/media/` become inline base64-encoded `<img>` tags. The tool then renders the HTML inside a hidden DOM container, applies the chosen page size and margin as CSS, and invokes html2pdf with that container. html2pdf walks the container, splits it into virtual pages at the CSS page boundaries, calls html2canvas on each page-sized chunk, and assembles the resulting canvas bitmaps into a jsPDF document. The whole pipeline never makes a network request after the initial library load.
The three libraries behind the conversion
mammoth.js is the DOCX parser. Created by Michael Williamson under a BSD-2-Clause licence and maintained on GitHub at mwilliamson/mammoth.js. The design philosophy, stated in Williamson's own write-up, is that conversion should preserve semantic content (paragraphs, headings, lists, tables, footnotes, hyperlinks, embedded images) rather than direct formatting (specific fonts, sizes, colours, spacing). The author argues that documents move between contexts (Word, web, ePub, PDF) and the visual styling appropriate to one context is rarely appropriate to another; carrying semantic structure is more useful than carrying exact appearance. The browser bundle is about 600 KB minified.
jsPDF is the PDF writer. Created by James Hall (GitHub: MrRio) in 2010, now co-maintained by yWorks GmbH, MIT-licensed, on GitHub at parallax/jsPDF. jsPDF can write PDFs from text, vector graphics, images, and HTML canvases; in this tool, it consumes canvas bitmaps from html2canvas and assembles them into PDF pages. html2canvas, by Niklas von Hertzen (MIT licence), is the HTML-to-canvas renderer that paints DOM nodes onto a `<canvas>` using the computed CSS for each element. html2pdf.js by Erik Koopmans (MIT licence) is the glue: it orchestrates html2canvas and jsPDF, handles pagination, exposes a clean configuration API, and packages everything as a single bundle of about 350 KB minified.
Combined first-load footprint is about 950 KB. Both libraries are cached by the browser after the first visit, so subsequent visits are instantaneous from the network's perspective. All four projects (mammoth.js, html2canvas, jsPDF, html2pdf.js) are open source, MIT or BSD licensed, and have been deployed in tens of thousands of production projects. The pipeline is mature, well-understood, and battle-tested for the use case it serves: visual-fidelity DOCX-to-PDF conversion without server-side rendering.
A brief history of DOCX and the OOXML standard
- Word .doc (1983 to 2007). Microsoft's binary `.doc` format, used by every version of Word from Word 1.0 on through Word 2003. Proprietary, undocumented for most of its life, reverse-engineered by competitors to support interoperability.
- OOXML announced (November 2005). Microsoft published the first draft of Office Open XML, an XML-based format intended to replace the binary `.doc`, `.xls`, and `.ppt`. The submission was to Ecma International for standardisation.
- ECMA-376 first edition (December 2006). Ecma International ratified OOXML as ECMA-376 first edition: the first official standard for what is now `.docx`, `.xlsx`, `.pptx`. The spec ran to over 6,000 pages.
- Office 2007 ships (January 2007). Microsoft Office 2007 launched with `.docx` as the default save format. Office 2003 received a free compatibility pack that let it open the new formats.
- ISO/IEC 29500:2008 ratification (November 2008). After a contentious fast-track process that drew strong criticism from the open-source community and the OpenDocument Format (ODF) camp, ISO and IEC adopted OOXML as ISO/IEC 29500:2008. The spec defines two variants: Strict (clean ISO-compliant) and Transitional (the variant Word actually writes by default, with backward-compatibility constructs preserved).
- Today (2026). Almost every real-world `.docx` file is Transitional OOXML, generated by Word, LibreOffice, Google Docs, Apple Pages, WPS Office, or one of the many SaaS document tools. The format is now ubiquitous: it is the default at every level of the document-authoring economy from individual writers to enterprise document-management systems.
Real-world workflows that drive Word-to-PDF conversion
- CV or resume submissions to recruiters and job portals. Every major job platform (LinkedIn Recruiter, Indeed, Workday, Greenhouse, government job portals) expects PDF. The author drafts in Word, converts before sending. Privacy matters because the CV contains personal contact details, addresses, and employment history.
- Cover letters and application essays. Universities, grant programmes, government applications, and corporate hiring all converge on PDF as the submission format. Word remains the drafting tool of choice because of grammar checking, comments, and revision support; PDF is the delivery format because formatting is locked.
- Contracts and legal documents pre-signature. Convert to PDF before circulating a draft contract so the counterparty sees a fixed-layout document. After signature, the signed PDF becomes the canonical version. The conversion needs to happen on a trusted device because the document is confidential.
- Academic papers, theses, and journal submissions. Most journals and university repositories accept PDF as the submission format. Authors draft in Word (or LaTeX) and submit PDF. For LaTeX users, this tool is irrelevant; for the very large population of Word users in social sciences, humanities, business, and law, it is the standard workflow.
- Internal reports and archival snapshots. Convert a Word report to PDF for permanent archival. The PDF is the official record; the Word source may continue to evolve as a draft. Many regulated industries (finance, healthcare, legal) require PDF for compliance archives.
- Reliable email distribution. PDF renders consistently across email clients in a way `.docx` does not. Outlook, Gmail, Apple Mail, and mobile mail apps all open PDFs inline; `.docx` requires the recipient to have Word or a compatible app installed. PDF is the lowest-friction distribution format for documents that go to a non-technical audience.
Common pitfalls and what they mean
- The text in the PDF is not selectable or searchable. This is by design and not a bug. The pipeline rasterises the rendered HTML, so the PDF body is a bitmap image. Search-in-PDF, copy-paste, and screen-reader access to body text won't work. For documents where selectable text matters (legal, archival, accessibility, document databases), use Microsoft Word's File → Save As PDF or LibreOffice's File → Export as PDF. Both produce text-based PDFs from DOCX sources.
- The fonts in the PDF don't match the original Word document. mammoth.js intentionally focuses on semantic content and does not transplant direct font metadata; the browser renders the HTML with the fonts available locally. If your Word document used a custom corporate font that's installed only on your work machine, the PDF will render in the browser's fallback. The fix is either to install the source fonts on the device doing the conversion or to use Word's own Save As PDF, which embeds the fonts the document references.
- Complex tables simplify or render imperfectly. mammoth handles standard tables (rows, columns, cell borders) well. Multi-merged cells, deeply nested tables, and tables with elaborate conditional formatting may simplify or break visually because mammoth's HTML output is a subset of what Word's renderer produces. For table-heavy reports, the right tool is Word's native PDF export.
- Multi-column layout becomes single-column. mammoth does not preserve Word's `<w:cols>` directive. A newspaper-style two-column article becomes a single column in the resulting HTML and therefore the PDF. For documents that genuinely need column layouts in the output, use Word's File → Save As PDF or LibreOffice headless mode, both of which preserve column structure during conversion.
- Headers, footers, and page numbers from the source are dropped. mammoth focuses on the document body and does not transplant Word's headers, footers, or page-number fields. html2pdf adds its own pagination but cannot inject the original Word headers/footers. The workaround for documents that need original headers/footers is to convert through Word, or to convert here and then add page numbers downstream with PDF Page Numbers.
- Comments and tracked changes don't appear. By default mammoth treats comments and revision marks as metadata and ignores them. The resulting PDF reflects the document with all revisions effectively accepted (the state of
word/document.xmlignoring revision elements). If you need comments or markup visible in the PDF, use Word's File → Save As PDF with the "Print Markup" option enabled, or accept/reject all revisions in Word first and then convert. - Very large documents may freeze the tab. mammoth and html2canvas both load the whole document into memory and process synchronously. Documents over 100 pages or with many embedded images can saturate browser memory, especially on mobile devices with 4 GB of RAM or less. The mitigation is to split the document in Word first, convert sections separately, then merge the resulting PDFs with PDF Merge.
Privacy: the document stays on your device
Every major cloud Word-to-PDF service (Smallpdf, iLovePDF, Adobe Acrobat Online, PDF24, Sejda, Soda PDF) uploads your file to the operator's servers, parses it server-side using LibreOffice or a proprietary engine, renders the PDF, and serves it back as a download. The privacy implications are not trivial because Word documents are routinely personal or sensitive: CVs and cover letters contain home addresses, phone numbers, and employment history; cover letters and personal essays contain candid self-disclosure; HR forms contain salary expectations; contracts contain commercial terms; medical-record letters contain protected health information; legal drafts contain attorney-client privileged content. The major providers publish privacy policies committing to delete within an hour or two and to encrypt in transit with TLS, and the bigger ones hold ISO/IEC 27001 certification and GDPR alignment. They have strong commercial reasons to honour these commitments. But "deleted within an hour" is not "never seen": during that hour the file content sits in their infrastructure, accessible to any process or human with appropriate access, and visible in logs and backups for whatever retention applies.
This converter does not upload anything. mammoth.js and html2pdf.js run entirely in your browser tab; the file bytes are read by the File API, processed in JavaScript, and the PDF output is offered back as a Blob download. You can verify the absence of uploads by opening the browser developer tools to the Network tab before clicking Convert: no requests fire that include your file content. The only network traffic is the initial one-time CDN load of the two libraries (about 950 KB total minified), which happens once when you first open the page and is then cached by the browser. Switch to airplane mode after the page loads and the converter still works on local files. The trade-off, as stated in the section on how the tool works, is that the output PDF is bitmap-based: it loses the text-selectability and searchability that server-side converters preserve. For most personal Word-to-PDF flows (CVs, cover letters, internal memos, contract drafts, application forms), the bitmap output is perfectly fine and the privacy guarantee is worth the trade. For documents that need selectable text in the output, the right answer is Word's own Save As PDF or LibreOffice's Export as PDF, both of which are also offline and private if the document never needs to leave the local machine.
When another tool is the right choice
- The output PDF needs selectable, searchable text. Legal discovery, document archives, accessibility submissions, anything that will be indexed by a search engine or read by a screen reader: the output of this tool is bitmap-only and won't work. Use Microsoft Word's File → Save As PDF or LibreOffice's File → Export as PDF. Both are offline-capable and produce text-based PDFs.
- The document has complex layouts, equations, or embedded objects. Multi-column newspapers, math-heavy academic papers, documents with embedded Excel charts or PowerPoint slides, complex multi-merged tables: these benefit from Word's native renderer or LibreOffice's headless conversion. mammoth's semantic-first approach intentionally drops or simplifies what it cannot represent cleanly.
- You need to batch-convert many files. This tool accepts one document at a time and the rendering pipeline is single-threaded. For batches, the standard CLI command is
soffice --headless --convert-to pdf *.docx --outdir output/using LibreOffice. Alternatively the same mammoth and html2pdf libraries this tool uses also run in Node.js for scripted conversion if you prefer the same fidelity profile. - The document is very large (over 100 pages or with many big images). mammoth + html2canvas load and process the whole document in memory; large documents can freeze a browser tab, especially on mobile. Either split in Word first and convert sections separately (then merge with PDF Merge), or use a desktop tool that streams the document from disk: Word, LibreOffice, or a CLI converter.
Features
- Fast conversion: Instant Word to PDF conversion with no waiting.
- Customizable: Choose page size (A4 or Letter) and margin settings.
- Privacy: All processing happens locally in your browser. Files never uploaded to any server.
- Simple: Just drag and drop your Word file and click convert.
- Single file: Convert one Word document at a time for best results.
Frequently Asked Questions
What Word formats are supported?
This converter supports .docx files (Microsoft Word 2007 and later). Older .doc files are not supported; you can convert them to .docx in Microsoft Word and then use this tool.
Will my formatting be preserved?
Most formatting will be preserved, including fonts, sizes, colors, and basic layout. However, complex elements like advanced tables, multi-column layouts, and embedded objects may be simplified or adjusted during conversion.
Is my Word document uploaded to a server?
No. All conversion happens locally in your browser using Mammoth and html2pdf libraries. Your Word file never leaves your device, ensuring complete privacy and security.
Can I convert multiple files at once?
This tool converts one Word file at a time. For each conversion, simply upload a new .docx file and convert it.
What's the file size limit?
Files up to 25 MB are supported. Very large files may take longer to process depending on your browser's available memory.
Can I adjust page margins and size?
Yes. Before converting, you can select your preferred page size (A4 or Letter) and margin width (Normal, Narrow, or Wide).
More frequently asked questions
Why isn't the text in the PDF selectable?
This tool produces a bitmap-based PDF: it renders the Word document to HTML, paints that HTML onto an off-screen canvas using the browser, then embeds the canvas as an image inside the PDF. The output PDF therefore contains pictures of pages, not characters with positions. Copy-paste from the page body returns nothing useful, and a search-in-PDF function won't find words. For workflows that need selectable text (legal discovery, research databases, accessibility), use Microsoft Word's File → Save As PDF or LibreOffice's File → Export as PDF; both produce text-based PDFs from DOCX sources.
Why is the font different from my original Word document?
mammoth.js, the library this tool uses to parse DOCX, intentionally focuses on semantic content (paragraphs, headings, lists, tables) rather than direct formatting (specific fonts, exact sizes, colours). The browser renders the resulting HTML using the fonts you have installed locally. If your Word document used a custom corporate or branded font that's only on your work machine, the PDF will render in the browser's fallback font (typically a generic sans-serif). To preserve fonts, either install the same fonts on your device, or use Word's own Save As PDF, which embeds the fonts your document uses.
Does this work offline?
Yes, after the first visit. The libraries (mammoth at about 600 KB, html2pdf at about 350 KB) are loaded once from a CDN and then cached by the browser. Subsequent visits to the converter work entirely offline, as long as the browser cache has not been cleared. The conversion itself never required network in the first place; only the initial library load did. You can verify by enabling airplane mode after the page loads once and converting a local DOCX.
Can I convert old .doc files (Word 97-2003)?
No. This tool only accepts .docx (Office Open XML), the format introduced in Word 2007. The older binary .doc format requires a different parser (mammoth handles only the XML-based .docx). To convert a .doc file, first open it in Word or LibreOffice and Save As .docx, then convert here. The conversion from .doc to .docx is a one-click operation in either tool and preserves the content.
Will hyperlinks be clickable in the resulting PDF?
Hyperlinks in the source document appear as visually styled text (blue, underlined) in the PDF but they are not interactive: because the PDF body is a bitmap image of the rendered HTML, clicking the link text in a PDF reader will not navigate anywhere. If you need clickable links in the output PDF, use Word's File → Save As PDF (which preserves PDF link annotations) or LibreOffice's Export as PDF. Both produce PDFs where hyperlinks remain functional after conversion.
Is there a desktop or command-line equivalent?
Yes, several. The standard server-side and CLI approach is LibreOffice's headless mode: soffice --headless --convert-to pdf input.docx --outdir output/ converts the file in place, batch-friendly, no GUI required. LibreOffice can be installed on Linux, macOS, and Windows; the same command line works everywhere. For pixel-identical-to-Word output programmatically, Microsoft Word + PowerShell COM automation on Windows is the gold standard (requires Word installed). The same mammoth.js and html2pdf.js libraries this tool uses also run in Node.js for scripted browser-less conversion at the same fidelity profile. Pandoc with a LaTeX engine is another option for academic publishing workflows.