Ecosyste.ms: Awesome

An open API service indexing awesome lists of open source software.

Awesome Lists | Featured Topics | Projects

https://github.com/unickz/website2pdf

Simple and Fast Python framework to convert HTML files or Web Site to PDF
https://github.com/unickz/website2pdf

convert converter html htmltopdf pdf python python3 site sitetopdf url urltopdf website websitetopdf

Last synced: about 2 months ago
JSON representation

Simple and Fast Python framework to convert HTML files or Web Site to PDF

Host: GitHub
URL: https://github.com/unickz/website2pdf
Owner: uNickz
License: gpl-3.0
Created: 2023-01-05T14:35:05.000Z (about 2 years ago)
Default Branch: main
Last Pushed: 2023-12-23T13:02:35.000Z (about 1 year ago)
Last Synced: 2024-11-11T03:39:07.720Z (3 months ago)
Topics: convert, converter, html, htmltopdf, pdf, python, python3, site, sitetopdf, url, urltopdf, website, websitetopdf
Language: Python
Homepage:
Size: 535 KB
Stars: 4
Watchers: 1
Forks: 2
Open Issues: 1
Metadata Files:
- Readme: README.md
- License: LICENSE

Awesome Lists containing this project

README

        


    

        

    

    


    Examples

    •

    Documentation

    •

    PyPi

    •

    News

    •

    Chat



# WebSite2PDF

> Simple and Fast Python framework to convert HTML files or Web Site to PDF

### Installing with pip

``` bash

pip3 install WebSite2PDF

```

or

``` bash

pip3 install git+https://github.com/uNickz/WebSite2PDF

```

### Installing with python

``` bash

python3 -m pip install WebSite2PDF

```

or

``` bash

python3 -m pip install git+https://github.com/uNickz/WebSite2PDF

```

### Dependencies

 - [Selenium Chrome WebDriver](https://chromedriver.chromium.org/downloads) (If [Chrome](https://www.google.com/chrome/) is installed on the machine you won't need to install the chrome driver)

## Example

### Using a url

``` python

import WebSite2PDF

url = "https://pypi.org"

c = WebSite2PDF.Client()

with open("file_name.pdf", "wb+") as file:

    file.write(c.pdf(url))

```

or

``` python

import WebSite2PDF

url = "https://pypi.org"

c = WebSite2PDF.Client()

c.pdf(url, filename = "file_name.pdf")

```

### Using a file HTML

``` python

import WebSite2PDF

file_path = "C:\Users\uNickz\index.html"

c = WebSite2PDF.Client()

with open("file_name.pdf", "wb+") as file:

    file.write(c.pdf(f"file:///{file_path}"))

```

or

``` python

import WebSite2PDF

file_path = "C:\Users\uNickz\index.html"

c = WebSite2PDF.Client()

c.pdf(f"file:///{file_path}", filename = "file_name.pdf")

```

### Using multiple urls or files HTML

``` python

import WebSite2PDF

urls_or_path = ["https://pypi.org", "file:///C:\Users\uNickz\index.html", "https://github.com/"]

c = WebSite2PDF.Client()

c.pdf(urls_or_path, filename = ["pypi.pdf", "index.pdf", "github.pdf"])

```

or

``` python

import WebSite2PDF

urls_or_path = ["https://pypi.org", "file:///C:\Users\uNickz\index.html", "https://github.com/"]

file_name = ["pypi.pdf", "index.pdf", "github.pdf"]

c = WebSite2PDF.Client()

data = c.pdf(urls_or_path)

for name, data in zip(name, data):

    with open(name, "wb+") as file:

        file.write(data)

```

### Using a delay (in seconds) before create PDF

``` python

import WebSite2PDF

url = "https://pypi.org"

c = WebSite2PDF.Client()

c.pdf(url, filename = "file_name.pdf", delay = 3)

```

### Using global [PDF Options](https://github.com/uNickz/WebSite2PDF/blob/main/PDF%20Page%20Options.md)

``` python

import WebSite2PDF

url = "https://pypi.org"

c = WebSite2PDF.Client(

    pdfOptions = {

        "landscape" = True,

        "displayHeaderFooter": True,

        "printBackground": True,

        "preferCSSPageSize": True,

    }

)

c.pdf(url, filename = "file_name.pdf")

```

### Using specific [PDF Options](https://github.com/uNickz/WebSite2PDF/blob/main/PDF%20Page%20Options.md) for a PDF

``` python

import WebSite2PDF

url = "https://pypi.org"

c = WebSite2PDF.Client(

    pdfOptions = {

        "landscape" = True,

        "displayHeaderFooter": True,

        "printBackground": True,

        "preferCSSPageSize": True,

    }

)

c.pdf(url, filename = "file_name.pdf", pdfOptions = {

    "landscape" = False,

    "displayHeaderFooter": True,

})

```

### Using global [Selenium ChromeDriver Options](https://github.com/uNickz/WebSite2PDF/blob/main/Selenium%20ChromeDriver%20Options.md)

``` python

import WebSite2PDF

url = "https://pypi.org"

c = WebSite2PDF.Client(

    pdfOptions = {

        "landscape" = True,

        "displayHeaderFooter": True,

        "printBackground": True,

        "preferCSSPageSize": True,

    }, seleniumOptions = [

        "--no-sandbox",

        "--headless",

    ]

)

c.pdf(url, filename = "file_name.pdf")

```

### Using specific [Selenium ChromeDriver Options](https://github.com/uNickz/WebSite2PDF/blob/main/Selenium%20ChromeDriver%20Options.md) for a PDF

``` python

import WebSite2PDF

url = "https://pypi.org"

c = WebSite2PDF.Client(

    pdfOptions = {

        "landscape" = True,

        "displayHeaderFooter": True,

        "printBackground": True,

        "preferCSSPageSize": True,

    }, seleniumOptions = [

        "--no-sandbox",

        "--headless",

    ]

)

c.pdf(url, filename = "file_name.pdf", pdfOptions = {

        "landscape" = False,

        "displayHeaderFooter": True,

    }, seleniumOptions = [

        "--no-sandbox",

        "--disable-gpu",

])

```