{"id":13578710,"url":"https://github.com/michaelrsweet/htmldoc","last_synced_at":"2025-04-06T20:12:14.480Z","repository":{"id":18095042,"uuid":"83096999","full_name":"michaelrsweet/htmldoc","owner":"michaelrsweet","description":"HTML Conversion Software","archived":false,"fork":false,"pushed_at":"2024-09-04T15:46:42.000Z","size":24976,"stargazers_count":208,"open_issues_count":18,"forks_count":47,"subscribers_count":19,"default_branch":"master","last_synced_at":"2024-10-13T08:44:12.292Z","etag":null,"topics":["encryption","html","html-doc","html-files","pdf","pdf-files","postscript"],"latest_commit_sha":null,"homepage":"https://www.msweet.org/htmldoc","language":"C++","has_issues":true,"has_wiki":null,"has_pages":null,"mirror_url":null,"source_name":null,"license":"gpl-2.0","status":null,"scm":"git","pull_requests_enabled":true,"icon_url":"https://github.com/michaelrsweet.png","metadata":{"files":{"readme":"README.md","changelog":"CHANGES.md","contributing":"CONTRIBUTING.md","funding":".github/FUNDING.yml","license":"COPYING","code_of_conduct":"CODE_OF_CONDUCT.md","threat_model":null,"audit":null,"citation":null,"codeowners":null,"security":"SECURITY.md","support":null,"governance":null,"roadmap":null,"authors":null,"dei":null,"publiccode":null,"codemeta":null},"funding":{"github":"michaelrsweet","liberapay":"michaelrsweet"}},"created_at":"2017-02-25T01:25:32.000Z","updated_at":"2024-09-27T03:28:04.000Z","dependencies_parsed_at":"2024-01-14T09:33:06.216Z","dependency_job_id":"2db61318-ee90-4408-8b01-417d529b29bb","html_url":"https://github.com/michaelrsweet/htmldoc","commit_stats":null,"previous_names":[],"tags_count":84,"template":false,"template_full_name":null,"repository_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/michaelrsweet%2Fhtmldoc","tags_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/michaelrsweet%2Fhtmldoc/tags","releases_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/michaelrsweet%2Fhtmldoc/releases","manifests_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/michaelrsweet%2Fhtmldoc/manifests","owner_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/owners/michaelrsweet","download_url":"https://codeload.github.com/michaelrsweet/htmldoc/tar.gz/refs/heads/master","host":{"name":"GitHub","url":"https://github.com","kind":"github","repositories_count":247543593,"owners_count":20955865,"icon_url":"https://github.com/github.png","version":null,"created_at":"2022-05-30T11:31:42.601Z","updated_at":"2022-07-04T15:15:14.044Z","host_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub","repositories_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories","repository_names_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repository_names","owners_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/owners"}},"keywords":["encryption","html","html-doc","html-files","pdf","pdf-files","postscript"],"created_at":"2024-08-01T15:01:33.067Z","updated_at":"2025-04-06T20:12:14.452Z","avatar_url":"https://github.com/michaelrsweet.png","language":"C++","readme":"HTMLDOC - HTML Conversion Software\n==================================\n\n![Version](https://img.shields.io/github/v/release/michaelrsweet/htmldoc?include_prereleases)\n![GPL-2.0](https://img.shields.io/github/license/michaelrsweet/htmldoc)\n[![Build](https://github.com/michaelrsweet/pdfio/workflows/Build/badge.svg)](https://github.com/michaelrsweet/htmldoc/actions/workflows/build.yml)\n[![Coverity Scan Status](https://img.shields.io/coverity/scan/22389.svg)](https://scan.coverity.com/projects/michaelrsweet-htmldoc)\n\n\nHTMLDOC is a program that reads HTML and Markdown source files or web pages and\ngenerates corresponding EPUB, HTML, PostScript, or PDF files with an optional\ntable of contents.\n\nHTMLDOC was developed in the 1990's as a documentation generator for my previous\ncompany, and has since seen a lot of usage as a report generator embedded in web\nservers.  However, it does not support many things in \"the modern web\", such as:\n\n- Cascading Style Sheets (CSS): While I have experimented with adding CSS\n  support to HTMLDOC, proper CSS support is non-trivial especially for paged\n  output (which is not well supported by CSS)\n\n- Encryption: HTMLDOC currently supports the older (and very insecure) PDF 1.4\n  (128-bit RC4) encryption.  I have looked at supporting AES (128/256-bit)\n  encryption, however it is not widely supported and there are incompatible\n  changes in PDF 2.0, so it is unlikely to be added to HTMLDOC.\n\n- Forms: HTML forms and PDF forms are very different things.  While I have had\n  many requests to add support for PDF forms over the years, I have not found a\n  satisfactory way to do so.\n\n- Tables: HTMLDOC supports HTML 3.2 tables with basic support for TBODY and\n  THEAD.\n\n- Unicode: While HTMLDOC does support UTF-8 for \"Western\" languages, there is\n  absolutely no support for languages that require dynamic rewriting or\n  right-to-left text formatting.  Basically this means you can't use HTMLDOC\n  to format Arabic, Chinese, Hebrew, Japanese, or other languages that are not\n  based on latin-based alphabets that read left-to-right.\n\n- Emoji: The fonts bundled with HTMLDOC do not include Unicode Emoji characters.\n\n\nResources\n---------\n\nThe following HTMLDOC resources are available online:\n\n- Official web site and online documentation:\n\n    https://www.msweet.org/htmldoc\n\n- Issue tracker and questions:\n\n    https://github.com/michaelrsweet/htmldoc/issues\n\n\nUsing HTMLDOC\n-------------\n\nNote: Complete documentation for HTMLDOC is available in the \"doc\" subdirectory.\nThe following provides basic information on using HTMLDOC at the command-line\nand does not discuss the GUI or web server functionality.\n\nHTMLDOC accepts a list of HTML and/or Markdown \"source\" files and will generate\nEPUB, HTML, PostScript, or PDF output via command-line options.  A summary of\ncommand-line options can be shown with the \"--help\" option:\n\n    htmldoc --help\n\nHTMLDOC normally expects \"structured\" documents, with chapters, etc.  Chapters\nbegin with a H1 markup and continue to the end of the listed HTML files or the\nnext H1 markup, whichever comes first.  To convert unstructured documents such\nas web pages, use the \"--webpage\" option to HTMLDOC:\n\n    htmldoc --webpage ...\n\nTo generate a Level 2 PostScript file you might use:\n\n    htmldoc -f outfile.ps chapter1.html ... chapterN.html\n\nSimilarly you can generate an EPUB or PDF file of the same source files using:\n\n    htmldoc -f outfile.epub chapter1.html ... chapterN.html\n\n    htmldoc -f outfile.pdf chapter1.html ... chapterN.html\n\nFinally, to generate HTML files for viewing (with a linked table-of-contents) do\nthe following:\n\n    htmldoc -t html -d outdir chapter1.html ... chapterN.html\n\nor:\n\n    htmldoc -t html -f outfile.html chapter1.html ... chapterN.html\n\nA complete description of all command-line options and HTML guidelines can be\nfound in the software users manual in the \"doc\" directory.\n\n\nCredits\n-------\n\nMany thanks to Leonard Rosenthol for providing the original changes to support\na macOS (9) version of HTMLDOC.\n\nThe original table VALIGN and \"HALF PAGE\" code was contributed by D. Richard\nHipp.\n\nThe original multiple header/footer image code was contributed by Lynn Pye.\n\nThe RC4 encryption code is from librc4 1.1 by the folks at Carnegie Mellon\nUniversity.\n\nThe MD5 hash code is from L. Peter Deutsch at Aladdin Enterprises (creators\nof Ghostscript, among other things).\n\n\nLegal Stuff\n-----------\n\nHTMLDOC is copyright © 1997-2025 by Michael R Sweet.  This program is free\nsoftware.  Distribution and use rights are outlined in the file \"COPYING\".\n\nHTMLDOC includes code to encrypt PDF document files using the RC4 algorithm\nwith up to a 128-bit key.  While this software and code may be freely used\nand exported under current US laws, other countries may restrict your use\nand possession of this code and software.\n\nThe Adobe Portable Document Format is Copyright 1985-2005 by Adobe Systems\nIncorporated. Adobe, FrameMaker, and PostScript are registered trademarks of\nAdobe Systems, Incorporated.\n\nThe Graphics Interchange Format is the copyright and GIF is the service mark\nproperty of CompuServe Incorporated.\n\nLinux is a registered trademark of Linus Torvalds.\n\nmacOS is a registered trademark of Apple Inc.\n\nMicrosoft and Windows are registered trademarks of Microsoft Corporation.\n\nUNIX is a registered trademark of the X/Open Company, Ltd.\n\nThis software is based in part on the work of the Independent JPEG Group and\nFLTK project.\n","funding_links":["https://github.com/sponsors/michaelrsweet","https://liberapay.com/michaelrsweet"],"categories":["\u003ca name=\"cpp\"\u003e\u003c/a\u003eC++","C++"],"sub_categories":[],"project_url":"https://awesome.ecosyste.ms/api/v1/projects/github.com%2Fmichaelrsweet%2Fhtmldoc","html_url":"https://awesome.ecosyste.ms/projects/github.com%2Fmichaelrsweet%2Fhtmldoc","lists_url":"https://awesome.ecosyste.ms/api/v1/projects/github.com%2Fmichaelrsweet%2Fhtmldoc/lists"}