{"id":23885607,"url":"https://github.com/reclamador/document_clipper","last_synced_at":"2025-08-17T16:45:14.068Z","repository":{"id":22195089,"uuid":"95535102","full_name":"reclamador/document_clipper","owner":"reclamador","description":"A set of utility classes and functions to process documents with Python","archived":false,"fork":false,"pushed_at":"2022-12-26T20:27:55.000Z","size":2325,"stargazers_count":4,"open_issues_count":16,"forks_count":2,"subscribers_count":2,"default_branch":"master","last_synced_at":"2025-03-24T02:51:42.222Z","etag":null,"topics":["document-conversion","document-management","python","python27"],"latest_commit_sha":null,"homepage":null,"language":"Python","has_issues":true,"has_wiki":null,"has_pages":null,"mirror_url":null,"source_name":null,"license":"mit","status":null,"scm":"git","pull_requests_enabled":true,"icon_url":"https://github.com/reclamador.png","metadata":{"files":{"readme":"README.rst","changelog":"HISTORY.rst","contributing":"CONTRIBUTING.rst","funding":null,"license":"LICENSE","code_of_conduct":null,"threat_model":null,"audit":null,"citation":null,"codeowners":null,"security":null,"support":null}},"created_at":"2017-06-27T08:22:41.000Z","updated_at":"2023-06-28T11:58:42.000Z","dependencies_parsed_at":"2023-01-11T21:31:17.232Z","dependency_job_id":null,"html_url":"https://github.com/reclamador/document_clipper","commit_stats":null,"previous_names":[],"tags_count":30,"template":false,"template_full_name":null,"repository_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/reclamador%2Fdocument_clipper","tags_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/reclamador%2Fdocument_clipper/tags","releases_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/reclamador%2Fdocument_clipper/releases","manifests_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/reclamador%2Fdocument_clipper/manifests","owner_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/owners/reclamador","download_url":"https://codeload.github.com/reclamador/document_clipper/tar.gz/refs/heads/master","host":{"name":"GitHub","url":"https://github.com","kind":"github","repositories_count":247779796,"owners_count":20994569,"icon_url":"https://github.com/github.png","version":null,"created_at":"2022-05-30T11:31:42.601Z","updated_at":"2022-07-04T15:15:14.044Z","host_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub","repositories_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories","repository_names_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repository_names","owners_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/owners"}},"keywords":["document-conversion","document-management","python","python27"],"created_at":"2025-01-04T05:45:19.301Z","updated_at":"2025-04-10T01:13:19.351Z","avatar_url":"https://github.com/reclamador.png","language":"Python","readme":"================\ndocument-clipper\n================\n\n\n.. image:: https://img.shields.io/pypi/v/document_clipper.svg\n        :target: https://pypi.python.org/pypi/document_clipper\n\n.. image:: https://img.shields.io/travis/reclamador/document_clipper.svg\n        :target: https://travis-ci.org/reclamador/document_clipper\n\n.. image:: https://readthedocs.org/projects/document-clipper/badge/?version=latest\n        :target: https://document-clipper.readthedocs.io/en/latest/?badge=latest\n        :alt: Documentation Status\n\n.. image:: https://pyup.io/repos/github/reclamador/document_clipper/shield.svg\n     :target: https://pyup.io/repos/github/reclamador/document_clipper/\n     :alt: Updates\n\n.. image:: https://coveralls.io/repos/github/reclamador/document_clipper/badge.svg?branch=master\n     :target: https://coveralls.io/github/reclamador/document_clipper?branch=master\n\n\n\nA set of utility classes and functions to process documents with Python\n\n\n* Free software: MIT license\n* Documentation: https://document-clipper.readthedocs.io.\n\nInstallation\n------------\n\nThe `document_clipper` package uses libraries that relies on several command-line tools included in the\n`poppler-utils` package such as:\n- `pdftohtml`\n- `pdfimages`\n- `pftocairo`\n\nBefore attempting to use `document_clipper`, please install the `poppler-utils` package.\n\nFor instance, in Ubuntu, you may do so by running the following command:\n\n.. code-block:: bash\n\n    $ sudo apt-get install poppler-utils\n\n\nThen, you may install `document_clipper` as usual via Python package managers, such as PIP:\n\n.. code-block:: bash\n\n    $ pip install document_clipper\n\n\n\nFeatures\n--------\n\n* Fetch the number of pages associated to a PDF file.\n* Extract the coordinates and dimensions of a given text located in a PDF file.\n* Combine multiple PDFs into a single PDF.\n* Combine multiple PDF **and image** files into a single PDF.\n* Generate a new PDF file containing a subset of a provided source PDF file's pages. Rotations can be applied to each page individually.\n* Optionally fix the document(s) involved in the slicing/merging processes beforehand.\n","funding_links":[],"categories":[],"sub_categories":[],"project_url":"https://awesome.ecosyste.ms/api/v1/projects/github.com%2Freclamador%2Fdocument_clipper","html_url":"https://awesome.ecosyste.ms/projects/github.com%2Freclamador%2Fdocument_clipper","lists_url":"https://awesome.ecosyste.ms/api/v1/projects/github.com%2Freclamador%2Fdocument_clipper/lists"}