https://github.com/zebrajaeger/html2text
https://github.com/zebrajaeger/html2text
Last synced: 3 months ago
JSON representation
- Host: GitHub
- URL: https://github.com/zebrajaeger/html2text
- Owner: zebrajaeger
- Created: 2023-05-15T06:49:34.000Z (about 2 years ago)
- Default Branch: main
- Last Pushed: 2023-05-15T06:52:15.000Z (about 2 years ago)
- Last Synced: 2024-12-28T19:02:44.961Z (5 months ago)
- Language: Java
- Size: 2.93 KB
- Stars: 0
- Watchers: 1
- Forks: 0
- Open Issues: 0
-
Metadata Files:
- Readme: readme.md
Awesome Lists containing this project
README
# html2text
Tool um Texte aus lokal gespeicherte Web-Verzeichnisbäumen zu extrahieren.## Download einer Webseite
z.B. per wget:
wget -r -k -E https://www.veltec-services.com/
## Text extrahieren
### Voraussetzungen
* Java 8 oder neuer
* Maven### Projekt bauen
mvn clean package
### Programm starten (Windows)
html2text-1.0-SNAPSHOT.exe [-dry]
### Programm starten (Sonst)
java -jar html2text-1.0-SNAPSHOT.jar [-dry]