https://github.com/moisentinel/pdflinkextractor
extracts PDF links from websites and dumps them to the stdout and as a textfile. only works for links pointing to files with the ".pdf" extension
https://github.com/moisentinel/pdflinkextractor
cli extractor hacktoberfest links linux pdf tool
Last synced: about 1 month ago
JSON representation
extracts PDF links from websites and dumps them to the stdout and as a textfile. only works for links pointing to files with the ".pdf" extension
- Host: GitHub
- URL: https://github.com/moisentinel/pdflinkextractor
- Owner: moiSentineL
- License: unlicense
- Created: 2024-09-22T11:07:53.000Z (almost 2 years ago)
- Default Branch: main
- Last Pushed: 2024-11-03T15:15:13.000Z (over 1 year ago)
- Last Synced: 2025-01-21T11:29:20.380Z (over 1 year ago)
- Topics: cli, extractor, hacktoberfest, links, linux, pdf, tool
- Language: Shell
- Homepage:
- Size: 7.81 KB
- Stars: 1
- Watchers: 1
- Forks: 0
- Open Issues: 0
-
Metadata Files:
- Readme: README.md
- License: LICENSE
Awesome Lists containing this project
README
# pdflinkextractor
Originally made by [Glutanimate](https://askubuntu.com/a/395155).
This script finds any `pdf` files hyperlinked in a webpage and provides output of the links or download them.
## Usage
### Bash One Liner
```bash
curl -fsSL https://nibirsan.org/pdflinkextractor/script.sh | sh -s - [-d]
```
Using `-d` will enable you to download the files instead of just saving the links.
To save the links to a file, just do `> file` at the end of the command.
### Local
```bash
git clone https://github.com/moiSentineL/pdflinkextractor.git
cd pdflinkextractor && chmod +x script.sh
./script.sh [-d]
```
> [!TIP]
> Alias it for better access.
## Dependencies
You have to have `wget` and `lynx` installed:
### Ubuntu
```bash
sudo apt-get install wget lynx
```
### Arch Linux
```bash
sudo pacman -S wget lynx
```