An open API service indexing awesome lists of open source software.

https://github.com/moisentinel/pdflinkextractor

extracts PDF links from websites and dumps them to the stdout and as a textfile. only works for links pointing to files with the ".pdf" extension
https://github.com/moisentinel/pdflinkextractor

cli extractor hacktoberfest links linux pdf tool

Last synced: about 1 month ago
JSON representation

extracts PDF links from websites and dumps them to the stdout and as a textfile. only works for links pointing to files with the ".pdf" extension

Awesome Lists containing this project

README

          

# pdflinkextractor

Originally made by [Glutanimate](https://askubuntu.com/a/395155).

This script finds any `pdf` files hyperlinked in a webpage and provides output of the links or download them.

## Usage
### Bash One Liner

```bash
curl -fsSL https://nibirsan.org/pdflinkextractor/script.sh | sh -s - [-d]
```

Using `-d` will enable you to download the files instead of just saving the links.

To save the links to a file, just do `> file` at the end of the command.

### Local

```bash
git clone https://github.com/moiSentineL/pdflinkextractor.git
cd pdflinkextractor && chmod +x script.sh
./script.sh [-d]
```

> [!TIP]
> Alias it for better access.

## Dependencies

You have to have `wget` and `lynx` installed:

### Ubuntu

```bash
sudo apt-get install wget lynx
```

### Arch Linux

```bash
sudo pacman -S wget lynx
```