https://github.com/nighmared/rad

Readallcomics.com downloading tool. Automatically combines all pages into a pdf and tries to crop nicely

comic-downloader pdf python released scraper script tool


Host: GitHub
URL: https://github.com/nighmared/rad
Owner: Nighmared
License: mpl-2.0
Created: 2021-11-18T16:14:05.000Z (about 4 years ago)
Default Branch: main
Last Pushed: 2024-12-05T22:38:52.000Z (about 1 year ago)
Last Synced: 2024-12-05T23:25:37.540Z (about 1 year ago)
Topics: comic-downloader, pdf, python, released, scraper, script, tool
Language: Python
Homepage:
Size: 50.8 KB
Stars: 2
Watchers: 1
Forks: 2
Open Issues: 0
Metadata Files:
- Readme: README.md
- License: LICENSE


# RAD
Readallcomics.com downloading tool. Automatically combines all pages into a PDF and tries to crop and rotate the pages nicely if necessary.

## Usage

For this script you will need Python 3 as well as an up-to-date version of pip installed on your system.

After downloading, first install all dependencies by running

`$ pip install -r requirements.txt`

in the directory of the project. After that, run the script once with `$ python3 RAD.py`; it will create a "links.txt" file.
You can then fill this file with links to comics on readallcomics.com, one entry per line, in the following format:

```
title ; link
title that contains spaces ; link
...
```

The amount of whitespace around the ";" doesn't matter. The title and the link just need to be separated
by a semicolon.
Once you have added all the entries you need to the "links.txt" file, run the script again, the same way as before, with

`$ python3 RAD.py`

Depending on the number and length of the comics, this can take a while.
Afterwards, you will find the finished PDFs in the `./pdf/` directory and the individual
pages of each comic as image files, organized by title, in the `./imgs/` directory.

### Example links.txt file

```
Ruins of the Empire Part One ; http://readallcomics.com/legend-of-korra-ruins-of-the-empire-part-01-2019/
Ruins of the Empire Part Two ; http://readallcomics.com/legend-of-korra-ruins-of-the-empire-part-02-2019/
Ruins of the Empire Part Three ; http://readallcomics.com/legend-of-korra-ruins-of-the-empire-part-03-2020/
```
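
To make the format concrete, here is a minimal sketch of how lines like the ones above could be split into title and link. It is not taken from `RAD.py`; the function name `parse_links` and the error handling are assumptions made for illustration only.

```python
from typing import List, Tuple


def parse_links(text: str) -> List[Tuple[str, str]]:
    """Parse links.txt-style content into (title, link) pairs.

    Hypothetical helper, not the function used by RAD.py.
    """
    entries = []
    for raw_line in text.splitlines():
        line = raw_line.strip()
        if not line:
            continue  # ignore blank lines
        # Split on the first semicolon: title on the left, link on the right.
        title, sep, link = line.partition(";")
        if not sep:
            raise ValueError(f"missing ';' separator in line: {raw_line!r}")
        # Whitespace around the semicolon does not matter, so strip both parts.
        entries.append((title.strip(), link.strip()))
    return entries


example = """\
Ruins of the Empire Part One ; http://readallcomics.com/legend-of-korra-ruins-of-the-empire-part-01-2019/
Ruins of the Empire Part Two;http://readallcomics.com/legend-of-korra-ruins-of-the-empire-part-02-2019/
"""

for title, link in parse_links(example):
    print(title, "->", link)
```

Splitting on the first semicolon only keeps the title intact even when it contains spaces, and a line without a semicolon is reported instead of being silently skipped.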

### Example console output

![](./docs/output.png)

## Troubleshooting

If you see an `AssertionError` complaining about the length of an `images` list,
there is a good chance that the HTML structure on readallcomics has changed and the
script can't find the pages anymore. Please open an issue here and, if you feel like it, try to
figure out a new selector path.
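
For readers who want to investigate, the following sketch shows the general kind of check that produces such an error. It is an illustration only: this README does not show the script's real selector or parsing library, so the class `ImageCollector`, the function `find_page_images`, and the rule "collect every `<img>` tag" are all assumptions.

```python
from html.parser import HTMLParser
from typing import List


class ImageCollector(HTMLParser):
    """Collects the src attribute of every <img> tag (hypothetical stand-in
    for the script's actual selector)."""

    def __init__(self) -> None:
        super().__init__()
        self.images: List[str] = []

    def handle_starttag(self, tag, attrs):
        if tag == "img":
            src = dict(attrs).get("src")
            if src:
                self.images.append(src)


def find_page_images(html: str) -> List[str]:
    collector = ImageCollector()
    collector.feed(html)
    images = collector.images
    # If the page layout changes, the selector matches nothing and this fires.
    assert len(images) > 0, "images list is empty; page structure may have changed"
    return images


print(find_page_images('<div><img src="page-01.jpg"><img src="page-02.jpg"></div>'))
```

When the assertion fires, inspecting the comic page in a browser's developer tools and comparing its markup with the selector used in the script is usually the quickest way to find a new selector path.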

#### Bonus: formatted links.txt content for downloading all Avatar comics:
https://pastebin.com/MJ6veNiP
