{"id":22677219,"url":"https://github.com/daemon/pywikiclean","last_synced_at":"2025-03-29T12:44:41.091Z","repository":{"id":75720097,"uuid":"173401031","full_name":"daemon/pywikiclean","owner":"daemon","description":"Python port of @lintool's comprehensive Java-based Wikipedia markup to plaintext converter: https://github.com/lintool/wikiclean","archived":false,"fork":false,"pushed_at":"2019-09-10T14:34:37.000Z","size":9,"stargazers_count":2,"open_issues_count":0,"forks_count":0,"subscribers_count":1,"default_branch":"master","last_synced_at":"2025-02-04T13:43:44.362Z","etag":null,"topics":[],"latest_commit_sha":null,"homepage":"","language":"Python","has_issues":true,"has_wiki":null,"has_pages":null,"mirror_url":null,"source_name":null,"license":"mit","status":null,"scm":"git","pull_requests_enabled":true,"icon_url":"https://github.com/daemon.png","metadata":{"files":{"readme":"README.md","changelog":null,"contributing":null,"funding":null,"license":"LICENSE","code_of_conduct":null,"threat_model":null,"audit":null,"citation":null,"codeowners":null,"security":null,"support":null,"governance":null,"roadmap":null,"authors":null,"dei":null,"publiccode":null,"codemeta":null}},"created_at":"2019-03-02T04:00:35.000Z","updated_at":"2023-11-21T05:14:26.000Z","dependencies_parsed_at":"2023-06-07T11:30:17.815Z","dependency_job_id":null,"html_url":"https://github.com/daemon/pywikiclean","commit_stats":null,"previous_names":[],"tags_count":0,"template":false,"template_full_name":null,"repository_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/daemon%2Fpywikiclean","tags_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/daemon%2Fpywikiclean/tags","releases_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/daemon%2Fpywikiclean/releases","manifests_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/daemon%2Fpywikiclean/manifests","owner_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/owners/daemon","download_url":"https://codeload.github.com/daemon/pywikiclean/tar.gz/refs/heads/master","host":{"name":"GitHub","url":"https://github.com","kind":"github","repositories_count":246187218,"owners_count":20737459,"icon_url":"https://github.com/github.png","version":null,"created_at":"2022-05-30T11:31:42.601Z","updated_at":"2022-07-04T15:15:14.044Z","host_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub","repositories_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories","repository_names_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repository_names","owners_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/owners"}},"keywords":[],"created_at":"2024-12-09T17:59:28.102Z","updated_at":"2025-03-29T12:44:41.075Z","avatar_url":"https://github.com/daemon.png","language":"Python","funding_links":[],"categories":[],"sub_categories":[],"readme":"# PyWikiClean\nPython port of @lintool's comprehensive Java-based Wikipedia markup to plaintext converter: https://github.com/lintool/wikiclean.\n\n## Overview\n\nI couldn't find a Python-based Wikipedia markup cleaner as comprehensive as @lintool's, so I ported [his](https://github.com/lintool/wikiclean). It's easy to use:\n\n1. Install: `pip install git+https://github.com/daemon/pywikiclean`\n2. Clean: `import wikiclean; wikiclean.clean(\"Wikipedia [[markup]] here!\") # Wikipedia markup here!`\n\nSo how does PyWikiClean compare to other Python tools?\n\n### Original\n```\n{{good article}}\n{{Infobox single \u0026lt;!-- See Wikipedia:WikiProject_Songs --\u0026gt;\n| Name           = One of Those Days\n| Cover          = Whitney Houston – One of Those Days.jpg\n| Border         = yes\n| Artist         = [[Whitney Houston]]\n| Album          = [[Just Whitney]]\n| Released       = {{start date|2002|10|29}}\n| Format         = {{flat list|\n*[[CD single]]\n*[[Music download|digital download]]}}\n| Recorded       = February 2002;\u0026lt;br\u0026gt;at Atlanta Premier Recordings\u0026lt;br\u0026gt;([[Atlanta, Georgia]])\n| Genre          = [[Contemporary R\u0026amp;B|R\u0026amp;B]]\n| Length         = {{Duration|m=3|s=56}}\n| Label          = [[Arista Records|Arista]]\n| Writer         = {{flat list|\n*[[Kevin \"She'kspere\" Briggs|Kevin Briggs]]\n*Dwight Renolds\n*Patrice Stewart\n*[[Ernest Isley]]\n*[[Marvin Isley]]\n*Christopher Jasper\n*Kelly Isley\n*[[Ronald Isley]]\n*[[Rudolph Isley]]}}\n| Producer       = Kevin Briggs\n| Last single    = \"[[Whatchulookinat]]\"\u0026lt;br /\u0026gt;(2002)\n| This single    = \"'''One of Those Days'''\"\u0026lt;br /\u0026gt;(2002)\n| Next single    = \"[[Try It on My Own]]\"\u0026lt;br /\u0026gt;(2003)\n|misc={{External music video|{{YouTube|-GW0jZQSmsw|\"One of Those Days\"}}}}\n}}\n\n\"'''One of Those Days'''\" is a song by American recording artist [[Whitney Houston]], from her fifth studio album ''[[Just Whitney]]'' (2002). Written by [[Kevin \"She'kspere\" Briggs|Kevin Briggs]], Dwight Renolds, Patrice Stewart, [[Ernest Isley]], [[Marvin Isley]], Christopher Jasper, Kelly Isley, [[Ronald Isley]], and [[Rudolph Isley]], and produced by Briggs, the song was released as the second single from the album, following the under-performance of the [[lead single]] \"[[Whatchulookinat]]\", on October 29, 2002 through [[Arista Records]]. A mid-tempo [[Contemporary R\u0026amp;B|R\u0026amp;B]] track, \"One of Those Days\" samples [[The Isley Brothers]]' song \"[[Between the Sheets (song)|Between the Sheets]]\" (1983), and its lyrics speak about getting away from the stress of daily life.\n```\n### [UnWiki](https://github.com/fitnr/unwiki)\n```\n\n\n| Format          \n| Recorded        February 2002;\u0026lt;br\u0026gt;at Atlanta Premier Recordings\u0026lt;br\u0026gt;(Atlanta, Georgia)\n| Genre           R\u0026amp;B\n| Length          \n| Label           Arista\n| Writer          \n| Producer        Kevin Briggs\n| Last single     \"Whatchulookinat\"\u0026lt;br /\u0026gt;(2002)\n| This single     \"One of Those Days\"\u0026lt;br /\u0026gt;(2002)\n| Next single     \"Try It on My Own\"\u0026lt;br /\u0026gt;(2003)\n|misc}}\n}}\n\n\"One of Those Days\" is a song by American recording artist Whitney Houston, from her fifth studio album Just Whitney (2002). Written by Kevin Briggs, Dwight Renolds, Patrice Stewart, Ernest Isley, Marvin Isley, Christopher Jasper, Kelly Isley, Ronald Isley, and Rudolph Isley, and produced by Briggs, the song was released as the second single from the album, following the under-performance of the lead single \"Whatchulookinat\", on October 29, 2002 through Arista Records. A mid-tempo R\u0026amp;B track, \"One of Those Days\" samples The Isley Brothers' song \"Between the Sheets\" (1983), and its lyrics speak about getting away from the stress of daily life.\n\n```\n### [DeWiki](https://github.com/daddyd/dewiki)\n```\n\n{{Infobox single \u0026lt;!-- See Wikipedia:WikiProject_Songs --\u0026gt;\n| Name           = One of Those Days\n| Cover          = Whitney Houston – One of Those Days.jpg\n| Border         = yes\n| Artist         = Whitney Houston\n| Album          = Just Whitney\n| Released       = \n| Format         = {{\n*CD single\n*digital download}}\n| Recorded       = February 2002;\u0026lt;br\u0026gt;at Atlanta Premier Recordings\u0026lt;br\u0026gt;(Atlanta, Georgia)\n| Genre          = Contemporary R\u0026amp;R\u0026amp;B\n| Length         = \n| Label          = Arista\n| Writer         = {{\n*Kevin \"She'kspere\"Kevin Briggs\n*Dwight Renolds\n*Patrice Stewart\n*Ernest Isley\n*Marvin Isley\n*Christopher Jasper\n*Kelly Isley\n*Ronald Isley\n*Rudolph Isley}}\n| Producer       = Kevin Briggs\n| Last single    = \"Whatchulookinat\"\u0026lt;br /\u0026gt;(2002)\n| This single    = \"One of Those Days\"\u0026lt;br /\u0026gt;(2002)\n| Next single    = \"Try It on My Own\"\u0026lt;br /\u0026gt;(2003)\n|misc=\n}}\n\n\"One of Those Days\" is a song by American recording artist Whitney Houston, from her fifth studio album Just Whitney (2002). Written by Kevin \"She'kspere\"Kevin Briggs, Dwight Renolds, Patrice Stewart, Ernest Isley, Marvin Isley, Christopher Jasper, Kelly Isley, Ronald Isley, and Rudolph Isley, and produced by Briggs, the song was released as the second single from the album, following the under-performance of the lead single \"Whatchulookinat\", on October 29, 2002 through Arista Records. A mid-tempo Contemporary R\u0026amp;R\u0026amp;B track, \"One of Those Days\" samples The Isley Brothers' song \"Between the Sheets\" (1983), and its lyrics speak about getting away from the stress of daily life.\n```\n### PyWikiClean\n```\n\"One of Those Days\" is a song by American recording artist Whitney Houston, from her fifth studio album Just Whitney (2002). Written by Kevin Briggs, Dwight Renolds, Patrice Stewart, Ernest Isley, Marvin Isley, Christopher Jasper, Kelly Isley, Ronald Isley, and Rudolph Isley, and produced by Briggs, the song was released as the second single from the album, following the under-performance of the lead single \"Whatchulookinat\", on October 29, 2002 through Arista Records. A mid-tempo R\u0026B track, \"One of Those Days\" samples The Isley Brothers' song \"Between the Sheets\" (1983), and its lyrics speak about getting away from the stress of daily life.\n```\n\nFor now, the tool handles English only. It should be simple to add the other languages that the original WikiClean supports.\n","project_url":"https://awesome.ecosyste.ms/api/v1/projects/github.com%2Fdaemon%2Fpywikiclean","html_url":"https://awesome.ecosyste.ms/projects/github.com%2Fdaemon%2Fpywikiclean","lists_url":"https://awesome.ecosyste.ms/api/v1/projects/github.com%2Fdaemon%2Fpywikiclean/lists"}