https://github.com/nighmared/rad
Readallcomics.com downloading tool. Automatically combines all pages into a pdf and tries to crop nicely
comic-downloader pdf python released scraper script tool
- Host: GitHub
- URL: https://github.com/nighmared/rad
- Owner: Nighmared
- License: mpl-2.0
- Created: 2021-11-18T16:14:05.000Z (about 4 years ago)
- Default Branch: main
- Last Pushed: 2024-12-05T22:38:52.000Z (about 1 year ago)
- Last Synced: 2024-12-05T23:25:37.540Z (about 1 year ago)
- Topics: comic-downloader, pdf, python, released, scraper, script, tool
- Language: Python
- Homepage:
- Size: 50.8 KB
- Stars: 2
- Watchers: 1
- Forks: 2
- Open Issues: 0
Metadata Files:
- Readme: README.md
- License: LICENSE
README
# RAD
Readallcomics.com downloading tool. Automatically combines all pages into a PDF and tries to crop and rotate the pages nicely if necessary.
## Usage
For this script you will need Python 3 as well as an up-to-date version of pip installed on your system.
After downloading, first install all dependencies by running
`$ pip install -r requirements.txt`
in the directory of the project. After that, run the script once with `$ python3 RAD.py`; it will create a `links.txt` file.
Fill this file with links to comics on readallcomics.com in the following format:
```
title ; <link>
title that contains spaces ; <link>
...
```
The amount of whitespace around the `;` doesn't matter. The title and the link just need to be separated
by a semicolon.
Once `links.txt` contains all the entries you need, run the script again, the same way as before:
`$ python3 RAD.py`
Depending on the number and length of the comics, this could take a while.
Afterwards, the finished PDFs are in the `./pdf/` directory, and the individual pages of each comic
are stored as image files, organized by title, in the `./imgs/` directory.
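To check a run quickly, the output folders can be inspected with a few lines of Python. This is only a sketch, not part of RAD: `summarize_output` and `list_pdfs` are hypothetical helpers, and the layout of one sub-folder per title under `./imgs/` is an assumption based on the description above.

```python
from pathlib import Path
from typing import Dict, List


def summarize_output(base: Path) -> Dict[str, int]:
    """Map each title folder under base/imgs to its number of page files.

    Assumes one sub-folder per comic title (not confirmed by the README).
    """
    imgs = base / "imgs"
    if not imgs.is_dir():
        return {}
    return {
        d.name: sum(1 for f in d.iterdir() if f.is_file())
        for d in sorted(imgs.iterdir())
        if d.is_dir()
    }


def list_pdfs(base: Path) -> List[str]:
    """Return the sorted file names of all PDFs under base/pdf."""
    pdf_dir = base / "pdf"
    if not pdf_dir.is_dir():
        return []
    return sorted(p.name for p in pdf_dir.glob("*.pdf"))


# Run from the project directory after the script has finished.
for title, pages in summarize_output(Path(".")).items():
    print(f"{title}: {pages} page image(s)")
for name in list_pdfs(Path(".")):
    print(f"PDF: {name}")
```

A title folder that is empty or missing here, while its entry exists in `links.txt`, suggests the download for that comic failed.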
### Example links.txt file
```
Ruins of the Empire Part One ; http://readallcomics.com/legend-of-korra-ruins-of-the-empire-part-01-2019/
Ruins of the Empire Part Two ; http://readallcomics.com/legend-of-korra-ruins-of-the-empire-part-02-2019/
Ruins of the Empire Part Three ; http://readallcomics.com/legend-of-korra-ruins-of-the-empire-part-03-2020/
```
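The `title ; link` format can be parsed along the following lines. This is a minimal sketch of the rule described above (split on a semicolon, ignore surrounding whitespace), not the code RAD itself uses. `parse_links` is a hypothetical helper, and skipping blank lines is an assumption.

```python
from typing import List, Tuple


def parse_links(text: str) -> List[Tuple[str, str]]:
    """Parse `title ; link` lines into (title, link) pairs."""
    entries = []
    for line in text.splitlines():
        line = line.strip()
        if not line:
            continue  # assumption: blank lines are ignored
        # Split on the first semicolon only, so a ";" inside the URL survives.
        title, sep, link = line.partition(";")
        if not sep:
            raise ValueError(f"missing ';' in line: {line!r}")
        entries.append((title.strip(), link.strip()))
    return entries


example = """\
Ruins of the Empire Part One ; http://readallcomics.com/legend-of-korra-ruins-of-the-empire-part-01-2019/
Ruins of the Empire Part Two;http://readallcomics.com/legend-of-korra-ruins-of-the-empire-part-02-2019/
"""
for title, link in parse_links(example):
    print(f"{title!r} -> {link}")
```

Both example lines yield a clean title and URL, with or without spaces around the semicolon.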
### Example console output

## Troubleshooting
If you see an `AssertionError` complaining about the length of an `images` list,
there is a good chance that the HTML structure on readallcomics.com has changed and the
script can't find the pages anymore. Please open an issue here and, if you feel like it, try to
figure out a new selector path.
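When looking for a new selector path, it can help to test candidates against a saved copy of a comic page. The sketch below uses only the standard library and is not RAD's actual scraping code. The class name `comic-pages` and the helpers `ImgCollector` and `find_pages` are hypothetical; replace the class name with whatever the site currently uses.

```python
from html.parser import HTMLParser
from typing import List


class ImgCollector(HTMLParser):
    """Collect the `src` of every <img> inside a <div> with a given class."""

    def __init__(self, container_class: str) -> None:
        super().__init__()
        self.container_class = container_class
        self.depth = 0  # > 0 while inside the container div
        self.images: List[str] = []

    def handle_starttag(self, tag, attrs):
        attrs = dict(attrs)
        if tag == "div":
            if self.depth > 0:
                self.depth += 1  # nested div inside the container
            elif self.container_class in (attrs.get("class") or "").split():
                self.depth = 1  # entering the container
        elif tag == "img" and self.depth > 0 and attrs.get("src"):
            self.images.append(attrs["src"])

    def handle_endtag(self, tag):
        if tag == "div" and self.depth > 0:
            self.depth -= 1


def find_pages(html: str, container_class: str = "comic-pages") -> List[str]:
    """Return image URLs found inside the (hypothetical) container div."""
    parser = ImgCollector(container_class)
    parser.feed(html)
    parser.close()
    return parser.images


sample = '<div class="comic-pages"><p><img src="p1.jpg"><img src="p2.jpg"></p></div>'
pages = find_pages(sample)
# Same kind of check as the one that fails when the site layout changes.
assert len(pages) > 0, "no page images found; the selector probably needs updating"
print(pages)
```

If `find_pages` returns an empty list for a real page, the container class (or the overall structure) has likely changed.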
#### Bonus: formatted content for links.txt to download all Avatar comics:
https://pastebin.com/MJ6veNiP
(c) nighmared