Ecosyste.ms: Awesome

An open API service indexing awesome lists of open source software.

Awesome Lists | Featured Topics | Projects

https://github.com/sukhbinder/winzy-pdf-to-text

Extract text from a given pdf
https://github.com/sukhbinder/winzy-pdf-to-text

Last synced: about 2 months ago
JSON representation

Extract text from a given pdf

Host: GitHub
URL: https://github.com/sukhbinder/winzy-pdf-to-text
Owner: sukhbinder
License: apache-2.0
Created: 2024-10-21T16:51:02.000Z (3 months ago)
Default Branch: main
Last Pushed: 2024-10-22T03:20:56.000Z (3 months ago)
Last Synced: 2024-10-22T05:59:50.719Z (3 months ago)
Language: Python
Size: 11.7 KB
Stars: 0
Watchers: 1
Forks: 0
Open Issues: 0
Metadata Files:
- Readme: README.md
- License: LICENSE

Awesome Lists containing this project

README

        # winzy-pdf-to-text

[![PyPI](https://img.shields.io/pypi/v/winzy-pdf-to-text.svg)](https://pypi.org/project/winzy-pdf-to-text/)

[![Changelog](https://img.shields.io/github/v/release/sukhbinder/winzy-pdf-to-text?include_prereleases&label=changelog)](https://github.com/sukhbinder/winzy-pdf-to-text/releases)

[![Tests](https://github.com/sukhbinder/winzy-pdf-to-text/workflows/Test/badge.svg)](https://github.com/sukhbinder/winzy-pdf-to-text/actions?query=workflow%3ATest)

[![License](https://img.shields.io/badge/license-Apache%202.0-blue.svg)](https://github.com/sukhbinder/winzy-pdf-to-text/blob/main/LICENSE)

Extract text from a given pdf

## Installation

First configure your Winzy project [to use Winzy](https://github.com/sukhbinder/winzy).

Then install this plugin in the same environment as your Winzy application.

```bash

pip install winzy-pdf-to-text

```

## Usage

```bash

winzy pdf2text example.pdf -p 1

```

This will extract all text from page 1 to the standard output.

One can also provide range

```bash

winzy pdf2text example.pdf -p 3-6

```

This will extract text from page 3 to 5 .

## Development

To set up this plugin locally, first checkout the code. Then create a new virtual environment:

```bash

cd winzy-pdf-to-text

python -m venv venv

source venv/bin/activate

```

Now install the dependencies and test dependencies:

```bash

pip install -e '.[test]'

```

To run the tests:

```bash

python -m pytest

```