Ecosyste.ms: Awesome
An open API service indexing awesome lists of open source software.
https://github.com/Vitaliy-1/docxToJats
DOCX to JATS XML Converter
https://github.com/Vitaliy-1/docxToJats
Last synced: 3 months ago
JSON representation
DOCX to JATS XML Converter
- Host: GitHub
- URL: https://github.com/Vitaliy-1/docxToJats
- Owner: Vitaliy-1
- License: gpl-3.0
- Created: 2018-11-13T22:48:37.000Z (almost 6 years ago)
- Default Branch: main
- Last Pushed: 2023-01-09T12:03:57.000Z (almost 2 years ago)
- Last Synced: 2024-05-15T04:31:42.333Z (6 months ago)
- Language: PHP
- Homepage:
- Size: 1.03 MB
- Stars: 20
- Watchers: 6
- Forks: 7
- Open Issues: 5
-
Metadata Files:
- Readme: README.md
- License: LICENSE
Awesome Lists containing this project
README
# Description
DocxToJats is a PHP library that converts DOCX archives that comply OOXML standards into JATS XML (Journal Article Tag Suite) format. It's tested with DOCX produced by LibreOffice, MS Word, and Google Docs.
## Requirements
* The only requirement is PHP 7.3 or higher. CLI version if running from a command line
## Usage
1. `git clone https://github.com/Vitaliy-1/docxToJats.git`
2. `cd docxToJats`
3. `php docxtojats.php [/path/to/input/file.docx or /path/to/input/dir/] [/path/to/output/file.xml or /path/to/output/dir]`. E.g., to process a single file: `php docxtojats.php /mydir/file.docx /mydir/converted/file.xml` - if output filename is pointed, attached files, like figures, will be moved into the same folder; to process multiple files in a folder by relative path: `samples/input/ samples/output/`.
## Additional info
* The list of supported elements: https://github.com/Vitaliy-1/docxConverter#what-article-elements-are-supported.
* How to achieve the best results: https://github.com/Vitaliy-1/docxConverter#how-to-achieve-best-resultsDocxToJats is used as a submodule to the DOCX Converter Plugin, written for Open Journal Systems. Unfortunately DOCX archive doesn't contain much metadata and JATS `front` elements remain not populated, thus, the best way would be to integrate docxToJats with editorial manager from where article's metadata can be retrieved. DOCX Converter Plugin is such an example.