https://github.com/moisentinel/pdflinkextractor

extracts PDF links from websites and dumps them to the stdout and as a textfile. only works for links pointing to files with the ".pdf" extension
https://github.com/moisentinel/pdflinkextractor

cli extractor hacktoberfest links linux pdf tool

Last synced: about 1 month ago
JSON representation

extracts PDF links from websites and dumps them to the stdout and as a textfile. only works for links pointing to files with the ".pdf" extension

Host: GitHub
URL: https://github.com/moisentinel/pdflinkextractor
Owner: moiSentineL
License: unlicense
Created: 2024-09-22T11:07:53.000Z (almost 2 years ago)
Default Branch: main
Last Pushed: 2024-11-03T15:15:13.000Z (over 1 year ago)
Last Synced: 2025-01-21T11:29:20.380Z (over 1 year ago)
Topics: cli, extractor, hacktoberfest, links, linux, pdf, tool
Language: Shell
Homepage:
Size: 7.81 KB
Stars: 1
Watchers: 1
Forks: 0
Open Issues: 0
Metadata Files:
- Readme: README.md
- License: LICENSE

Awesome Lists containing this project

README

# pdflinkextractor

Originally made by [Glutanimate](https://askubuntu.com/a/395155).

This script finds any `pdf` files hyperlinked in a webpage and provides output of the links or download them.

## Usage
### Bash One Liner

```bash
curl -fsSL https://nibirsan.org/pdflinkextractor/script.sh | sh -s - [-d]
```

Using `-d` will enable you to download the files instead of just saving the links.

To save the links to a file, just do `> file` at the end of the command.

### Local

```bash
git clone https://github.com/moiSentineL/pdflinkextractor.git
cd pdflinkextractor && chmod +x script.sh
./script.sh [-d]
```

> [!TIP]
> Alias it for better access.

## Dependencies

You have to have `wget` and `lynx` installed:

### Ubuntu

```bash
sudo apt-get install wget lynx
```

### Arch Linux

```bash
sudo pacman -S wget lynx
```

ecosyste.ms

Data

Tools

Indexes

Applications

Experiments

Awesome

https://github.com/moisentinel/pdflinkextractor

Awesome Lists containing this project

README