An open API service indexing awesome lists of open source software.

https://github.com/a2-nabil/pdf-extractor-by-nabil

a pdf extractor . It can extract images and texts from your pdf.
https://github.com/a2-nabil/pdf-extractor-by-nabil

ajax css3-animations html5 jquery js python

Last synced: 7 months ago
JSON representation

a pdf extractor . It can extract images and texts from your pdf.

Awesome Lists containing this project

README

          

```markdown
# PDF Extractor

A simple tool for extracting images and text from PDF documents. This project allows users to easily retrieve content from their PDFs for further processing or analysis.

## Features

- Extracts text content from PDF files
- Extracts images from PDF files
- Supports multiple PDF formats
- Easy to use command-line interface

## Requirements

- Python 3.x
- php
- jquery
- Required libraries:
- `PyMuPDF`

You can install the required libraries using pip:

```bash
pip install PyMuPDF
```

## Usage

1. Clone the repository:

```bash
git clone https://github.com/a2-nabil/PDF-Extractor-by-Nabil.git
cd pdf-extractor
```

2. Run the extractor:

```bash
python pdf_extractor.py
```

3. Extracted content will be saved in the `output` directory.

## Example

This command will extract all text and images from `example.pdf` and save them in the `output` directory.

## Contributing

Contributions are welcome! If you have suggestions or improvements, feel free to open an issue or submit a pull request.

## License

This project is licensed under the MIT License. See the [LICENSE](LICENSE) file for details.

## Acknowledgments

- [PyMuPDF](https://github.com/pymupdf/PyMuPDF) for converting PDF pages to images.

## Contact

For questions or feedback, please contact me at [a2nabil000@gmail.com].