https://github.com/aasmpro/xurl
Extract links (href data) from html files/web pages.
https://github.com/aasmpro/xurl
hacktoberfest python3 urls
Last synced: 7 months ago
JSON representation
Extract links (href data) from html files/web pages.
- Host: GitHub
- URL: https://github.com/aasmpro/xurl
- Owner: aasmpro
- License: mit
- Created: 2021-03-25T07:16:47.000Z (over 4 years ago)
- Default Branch: master
- Last Pushed: 2022-09-12T12:49:35.000Z (about 3 years ago)
- Last Synced: 2025-02-23T05:17:11.233Z (7 months ago)
- Topics: hacktoberfest, python3, urls
- Language: Python
- Homepage: https://pypi.org/project/xurl/
- Size: 4.88 KB
- Stars: 2
- Watchers: 2
- Forks: 0
- Open Issues: 0
-
Metadata Files:
- Readme: README.md
- License: LICENSE
Awesome Lists containing this project
README
### xurl
extract links (href data) from html files/web pages.
#### Installation
```
pip install xurl
```#### Options
run the `xurl -h` or `xurl --help` for options
```
-a = append an URL to start of the links
-c = contain text (REGEX)
-C = not contain text (REGEX)
-q = quiet mode (do not print Errors/Warnings/Infos)
-v = version
```#### Usages
```
xurl https://example.com
```
and same for the files
```
xurl path/to/file
```
search using regex
```
xurl https://example.com -c "section\-[1-10].*.[pdf|xlsx]"
```