https://github.com/mauricelambert/pdforensic

This package analyses PDF files for Forensic Investigations.
https://github.com/mauricelambert/pdforensic

analysis forensic-analysis forensics investigation package parser pdf python3 tool

Last synced: about 1 year ago
JSON representation

This package analyses PDF files for Forensic Investigations.

Host: GitHub
URL: https://github.com/mauricelambert/pdforensic
Owner: mauricelambert
License: gpl-3.0
Created: 2022-12-31T10:38:54.000Z (over 3 years ago)
Default Branch: main
Last Pushed: 2025-02-18T13:26:34.000Z (over 1 year ago)
Last Synced: 2025-03-25T07:41:36.984Z (over 1 year ago)
Topics: analysis, forensic-analysis, forensics, investigation, package, parser, pdf, python3, tool
Language: Python
Homepage:
Size: 78.1 KB
Stars: 4
Watchers: 1
Forks: 2
Open Issues: 1
Metadata Files:
- Readme: README.md
- License: LICENSE.txt

Awesome Lists containing this project

README

          ![PDForensic logo](https://mauricelambert.github.io/info/python/security/PDForensic_small.png "PDForensic logo")

# PDForensic

## Description

This package analyses PDF files for Forensic Investigations.

## Requirements

This package require :

 - python3

 - python3 Standard Library

## Installation

```bash

pip install PDForensic

```

## Usages

### Command line

```bash

python3 -m PDForensic sample.pdf

python3 PDForensic.pyz sample.pdf

PDForensic sample.pdf

PDForensic objstm.pdf --data --hexa 000102

PDForensic objstm.pdf --data --types objstm --no-csv --no-json

PDForensic objstm.pdf --data --logs 20 --regex '[0-9a-f]{32}' --no-csv --no-json

cat blank.pdf | PDForensic - *.pdf ../*.pdf https://www.pdfscripting.com/public/FreeStuff/PDFSamples/TheFlyv3_EN4Rdr.pdf

PDForensic https://www.pdfscripting.com/public/FreeStuff/PDFSamples/TheFlyv3_EN4Rdr.pdf --data --ids 79 83 --ids 84 --strings URI --no-csv --no-json

```

### Python script

```python

from PDForensic import PDForensic

class MyPDFparser(PDForensic):

    def __init__(self):

        super().__init__("objstm.pdf")

    def handle(self, type_: str, data: bytes, typename: str = "") -> None:

        print(type_, data, typename)

parser = MyPDFparser()

parser.parse()

print(parser.report())

class MyPDFparser(PDForensic):

    def __init__(self):

        super().__init__("objstm.pdf", process_data = True, process_tags = False, filter_ = True, strings = ["/Pages"], hexa = ["000102"], regexs = ['[0-9a-f]{32}'], types = ["xref"], ids = [2])

    def handle(self, type_: str, data: bytes, typename: str = "") -> None:

        print(type_, data, typename)

parser = MyPDFparser()

parser.parse()

print(parser.report())

```

## Links

 - [Github Page](https://github.com/mauricelambert/PDForensic/)

 - [Documentation](https://mauricelambert.github.io/info/python/security/PDForensic.html)

 - [Pypi package](https://pypi.org/project/PDForensic/)

 - [Executable](https://mauricelambert.github.io/info/python/security/PDForensic.pyz)

## Licence

Licensed under the [GPL, version 3](https://www.gnu.org/licenses/).

ecosyste.ms

Data

Tools

Indexes

Applications

Experiments

Awesome

https://github.com/mauricelambert/pdforensic

Awesome Lists containing this project

README