{"id":23207993,"url":"https://github.com/gitbruno/docxicml","last_synced_at":"2025-04-05T11:40:27.795Z","repository":{"id":147470002,"uuid":"70648768","full_name":"GitBruno/docxicml","owner":"GitBruno","description":"A semantical DOCX to ICML converter","archived":false,"fork":false,"pushed_at":"2018-03-03T05:12:51.000Z","size":4607,"stargazers_count":12,"open_issues_count":0,"forks_count":2,"subscribers_count":3,"default_branch":"master","last_synced_at":"2025-02-10T22:34:09.314Z","etag":null,"topics":["converter","docx","icml"],"latest_commit_sha":null,"homepage":"","language":"XSLT","has_issues":true,"has_wiki":null,"has_pages":null,"mirror_url":null,"source_name":null,"license":"bsd-2-clause","status":null,"scm":"git","pull_requests_enabled":true,"icon_url":"https://github.com/GitBruno.png","metadata":{"files":{"readme":"README.md","changelog":null,"contributing":null,"funding":null,"license":null,"code_of_conduct":null,"threat_model":null,"audit":null,"citation":null,"codeowners":null,"security":null,"support":null,"governance":null,"roadmap":null,"authors":null,"dei":null,"publiccode":null,"codemeta":null}},"created_at":"2016-10-12T01:06:28.000Z","updated_at":"2024-10-14T05:47:36.000Z","dependencies_parsed_at":"2023-07-24T20:16:18.859Z","dependency_job_id":null,"html_url":"https://github.com/GitBruno/docxicml","commit_stats":null,"previous_names":[],"tags_count":0,"template":false,"template_full_name":null,"repository_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/GitBruno%2Fdocxicml","tags_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/GitBruno%2Fdocxicml/tags","releases_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/GitBruno%2Fdocxicml/releases","manifests_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/GitBruno%2Fdocxicml/manifests","owner_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/owners/GitBruno","download_url":"https://codeload.github.com/GitBruno/docxicml/tar.gz/refs/heads/master","host":{"name":"GitHub","url":"https://github.com","kind":"github","repositories_count":247332510,"owners_count":20921852,"icon_url":"https://github.com/github.png","version":null,"created_at":"2022-05-30T11:31:42.601Z","updated_at":"2022-07-04T15:15:14.044Z","host_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub","repositories_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories","repository_names_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repository_names","owners_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/owners"}},"keywords":["converter","docx","icml"],"created_at":"2024-12-18T17:26:46.580Z","updated_at":"2025-04-05T11:40:27.771Z","avatar_url":"https://github.com/GitBruno.png","language":"XSLT","funding_links":[],"categories":[],"sub_categories":[],"readme":"\nDOCX to ICML converter \n======================\n\ndocxicml is designed to convert MS Word (DOCX) documents to Adobe InDesign (ICML). It aims to produce clean files using semantic information only. \n\nThis converter ignores all non-semantical info like font names and colours. It will however keep track of unstyled italics, bolds and page breaks. Unlike [Pandoc](https://pandoc.org/), docxicml assumes styles are applied semantically and therefore tracks all style references.\n\nThis package is standing on the shoulders of [Python-Mammoth](https://github.com/mwilliamson/python-mammoth) it generates a dynamic style map and transform the HTML to ICML using a XSLT stylesheet.\n\nContents\n--------\n\n 1. [Usage](#usage)\n 2. [Supported Elements](#supported-elements)\n 3. [Dependencies](#dependencies)\n 5. [Installation](#installation)\n 5. [Limitations](#limitations)\n 6. [Getting Help](#getting-help)\n\n\nUsage\n-----\n\nConvert a word document (`docx`) to `xhtml` and `icml` with the following command:\n\n    docxicml source.docx\n\nThe newly generated files will be at the same location as source document:\n\n    source.docx\n    source.xhtml\n    source.icml\n\n\nSupported Elements\n------------------\n\nThe following elements are supported:\n\n* Paragraph Styles  \n* Character Styles  \n* Bold and italic  \n* Strikethrough and Underlines  \n* Superscript and Subscript  \n* Headings  \n* Ordered and Unordered Lists  \n* Tables (Including headers and footers)  \n* Footnotes and endnotes (Yet to be implemented)   \n* Line, Column and Page Breaks  \n* Hyperlinks (Yet to be implemented)  \n* Images (Only embedded EMF) \n\n\nDependencies\n------------\n\ndocxicml requires [Java 6](https://www.java.com/en/download/) or later. (It uses [SaxonHE](https://sourceforge.net/projects/saxon/files/Saxon-HE/) for [XSLT 2.0](https://www.w3.org/TR/xslt20/) transformations.)\n\n\nInstallation\n------------\n\n    make install\n\n\nLimitations\n-----------\n\nAs it stands, there is room for improvements. We need to finalise implementation of all [elements](#supported-elements) listed above. It might be a good idea to port this to Javascript so we can run it with easy on a wide variety of systems without installing the Java runtime. Both XSLT processor and Mammoth have Javascript implementations: [mammoth.js](https://github.com/mwilliamson/mammoth.js), [Saxon-JS](www.saxonica.com/saxon-js). It would be usefull to be able to round-trip the files.\n\n\nGetting Help\n------------\n\nBugs and feature requests are tracked with [GitHub Issue Tracker](http://github.com/gitbruno/docxicml/issues).\n\n\n[![License](https://img.shields.io/badge/License-BSD%202--Clause-orange.svg)](https://opensource.org/licenses/BSD-2-Clause)\n\n","project_url":"https://awesome.ecosyste.ms/api/v1/projects/github.com%2Fgitbruno%2Fdocxicml","html_url":"https://awesome.ecosyste.ms/projects/github.com%2Fgitbruno%2Fdocxicml","lists_url":"https://awesome.ecosyste.ms/api/v1/projects/github.com%2Fgitbruno%2Fdocxicml/lists"}