{"id":19536015,"url":"https://github.com/miyako/4d-plugin-doctotext","last_synced_at":"2026-05-16T19:04:55.480Z","repository":{"id":147172339,"uuid":"434208050","full_name":"miyako/4d-plugin-doctotext","owner":"miyako","description":"4D implementation of DocToText.","archived":false,"fork":false,"pushed_at":"2021-12-13T00:33:09.000Z","size":122244,"stargazers_count":1,"open_issues_count":1,"forks_count":0,"subscribers_count":1,"default_branch":"main","last_synced_at":"2025-01-08T17:57:24.920Z","etag":null,"topics":["4d-plugin","doc","docx","rtf"],"latest_commit_sha":null,"homepage":"","language":"C","has_issues":true,"has_wiki":null,"has_pages":null,"mirror_url":null,"source_name":null,"license":"mit","status":null,"scm":"git","pull_requests_enabled":true,"icon_url":"https://github.com/miyako.png","metadata":{"files":{"readme":"README.md","changelog":null,"contributing":null,"funding":null,"license":"LICENSE","code_of_conduct":null,"threat_model":null,"audit":null,"citation":null,"codeowners":null,"security":null,"support":null,"governance":null,"roadmap":null,"authors":null,"dei":null,"publiccode":null,"codemeta":null}},"created_at":"2021-12-02T12:14:42.000Z","updated_at":"2024-08-27T05:04:29.000Z","dependencies_parsed_at":"2023-07-17T09:45:51.530Z","dependency_job_id":null,"html_url":"https://github.com/miyako/4d-plugin-doctotext","commit_stats":null,"previous_names":[],"tags_count":4,"template":false,"template_full_name":null,"repository_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/miyako%2F4d-plugin-doctotext","tags_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/miyako%2F4d-plugin-doctotext/tags","releases_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/miyako%2F4d-plugin-doctotext/releases","manifests_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/miyako%2F4d-plugin-doctotext/manifests","owner_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/owners/miyako","download_url":"https://codelo
ad.github.com/miyako/4d-plugin-doctotext/tar.gz/refs/heads/main","host":{"name":"GitHub","url":"https://github.com","kind":"github","repositories_count":240788993,"owners_count":19857735,"icon_url":"https://github.com/github.png","version":null,"created_at":"2022-05-30T11:31:42.601Z","updated_at":"2022-07-04T15:15:14.044Z","host_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub","repositories_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories","repository_names_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repository_names","owners_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/owners"}},"keywords":["4d-plugin","doc","docx","rtf"],"created_at":"2024-11-11T02:20:40.917Z","updated_at":"2026-05-16T19:04:55.467Z","avatar_url":"https://github.com/miyako.png","language":"C","funding_links":[],"categories":[],"sub_categories":[],"readme":"![version](https://img.shields.io/badge/version-18%2B-EB8E5F)\n![platform](https://img.shields.io/static/v1?label=platform\u0026message=mac-intel%20|%20mac-arm%20|%20win-64\u0026color=blue)\n[![license](https://img.shields.io/github/license/miyako/4d-plugin-doctotext)](LICENSE)\n![downloads](https://img.shields.io/github/downloads/miyako/4d-plugin-doctotext/total)\n\n### Dependencies and Licensing\n\n* the source code of this plugin developed using the [4D Plug-in SDK](https://github.com/4d/4D-Plugin-SDK) is licensed under the MIT license.\n* see [SILVERCODERS](https://silvercoders.com/en/products/doctotext/) for the licensing of **DocToText** (GPLv2 or commercial).\n* the licensing of the binary product of this plugin is subject to the licensing of all its dependencies.\n\n# 4d-plugin-doctotext\n4D implementation of [DocToText](https://silvercoders.com/en/products/doctotext/).\n\n\u003cimg width=\"797\" alt=\"ss\" src=\"https://user-images.githubusercontent.com/1725068/144632527-14f2e26d-db2c-45d8-860e-a677092a8602.png\"\u003e\n\n### Abstract\n\nthe goal of this project is to support legacy Microsoft Word 
documents with the `.doc` file extension.\n\n* [`wv`](https://sourceforge.net/projects/wvware/files/wv/) can load and parse Word 2000, 97, 95 and 6 file formats. \n\n* [`wvware`](http://wvware.sourceforge.net) is a document converter that uses `wv` to import `.doc` files. the output formats include `.rtf`, `.txt`, `.tex`, `.pdf` and `.html`. see [unofficial mirror](https://github.com/remram44/wvware).\n\n* [`abiword`](http://www.abisource.com) is a word processor that uses `wv` to import `.doc` files. it has a command line interface and server mode, similar to OpenOffice, that can be used as a document converter. `wvware` deprecated its own suite of converters in favour of `abiword`.\n\n* [`wv2`](https://sourceforge.net/projects/wvware/files/wv2/) is the successor to `wv`. it depends on `zlib`, `libgsf`, `libbz2`, `libxml2`, `libiconv` and `glib`, which in turn depends on `libffi` and `libpcre`.\n\n* [`doctotext`](http://silvercoders.com/en/products/doctotext/) is a document converter that uses `wv2` to import `.doc` files. additionally it uses [`libcharsetdetect`](https://github.com/batterseapower/libcharsetdetect), [`htmlcxx`](http://htmlcxx.sourceforge.net), `libmimetic` and `minizip` to support other input formats. the output format is always plain text.\n  \n* the [`pthread-win32`](https://github.com/GerHobbelt/pthread-win32) nuget package might not work; it may need to be compiled from source. 
\n  \n### Features\n\nextract plain text from various file types:\n\n### Syntax\n\n```4d\nstatus:=DocToText (document;options;attachments)\n```\n\nParameter|Type|Description\n------------|------|----\ndocument|BLOB|\noptions|Object|see below\nattachments|Array BLOB|\nstatus|Object|\n\n#### Options\n\nProperty|Type|Description\n------------|------|----\nxml | Text |`parse` (default) `fix` `strip` \ntable | Text | `table` (default) `row` `col` \nurl | Text | `underscored` (default) `text` `extended` \nlist | Text |` * ` (default) or any string\nverbose | Boolean |`false` (default)\nfallback | Boolean |`false` (default)\nformat | Text | `.doc` (default) `.rtf` `.docx` `.pptx` `.xlsx` `.fodt` `.fods` `.fodp` `.fodg` `.odt` `.ods` `.odp` `.odg` `.ppt` `.xls` `.xlsb` `.pages` `.numbers` `.key` `.html` `.pdf` `.eml`\n","project_url":"https://awesome.ecosyste.ms/api/v1/projects/github.com%2Fmiyako%2F4d-plugin-doctotext","html_url":"https://awesome.ecosyste.ms/projects/github.com%2Fmiyako%2F4d-plugin-doctotext","lists_url":"https://awesome.ecosyste.ms/api/v1/projects/github.com%2Fmiyako%2F4d-plugin-doctotext/lists"}