https://github.com/rsdoiel/sherwood
Some tools to help a friend transform their website into an e-book.
- Host: GitHub
- URL: https://github.com/rsdoiel/sherwood
- Owner: rsdoiel
- License: bsd-2-clause
- Created: 2015-08-06T14:01:13.000Z (almost 10 years ago)
- Default Branch: master
- Last Pushed: 2016-03-12T08:55:11.000Z (over 9 years ago)
- Last Synced: 2025-02-16T07:13:16.610Z (4 months ago)
- Language: Shell
- Size: 9.77 KB
- Stars: 0
- Watchers: 3
- Forks: 0
- Open Issues: 0
Metadata Files:
- Readme: README.md
- License: LICENSE
README
# sherwood
This is a set of tools I am developing to help my friend translate a website project into an ebook. The tools are organized to complete several steps:
+ Mirror the existing website (Bash & wget)
+ Save a zip of the unaltered mirror
+ Convert the documents in the mirrored copy to use root folder relative links
+ Correct munged filenames (e.g. filenames that contain a "?") and re-link them
+ Convert search boxes to an embedded browser search
+ Transform the site into content units preserving linkage (Bash and shorthand)
+ Transform content units into ePub/Mobi documents

## data organization
+ Category (paged by record count)
    + Architect
    + Building_Type
    + City
    + Country
    + Project_Name
+ Instance (group by field value, sorted by field)
    + Architect
        + Architect
        + Building_Type
        + City
        + Country
        + Project_Name
    + Building_Type
        + Architect
        + Building_Type
        + City
        + Country
        + Project_Name
    + City
        + Architect
        + Building_Type
        + City
        + Country
        + Project_Name
    + Country
        + Architect
        + Building_Type
        + City
        + Country
        + Project_Name
    + Project
        + File_No

I need to make sure that I have all projects accounted for and to identify which
images are broken and get replacement files for the eBook version and static
preservation copy.

## Requirements
+ Presumes a Linux-based system like a Raspberry Pi (e.g. the sed on Mac OS X is broken for the purposes of this set of tools)
+ Bash
+ Go (shorthand is written in Go)
+ NodeJS (using jsdom for some of the content extraction and validation)
+ wget
+ zip
+ [shorthand](https://github.com/rsdoiel/shorthand)
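
The requirements above can be checked before running any of the tools. What follows is a minimal sketch and not a script from this repository: the function names are hypothetical, the command names `node` and `go` are assumed to be how NodeJS and Go are installed, and the GNU sed test is an assumption about why the Mac OS X sed does not work here (the list above does not say which sed feature is needed). jsdom is a Node package rather than a command, so it is not checked.

```shell
#!/usr/bin/env bash
# Hypothetical helper functions; they are not part of the sherwood repository.

# Print the name of every command in the argument list that is not on PATH,
# one per line. Prints nothing when all of them are found.
missing_tools() {
    local tool
    for tool in "$@"; do
        if ! command -v "$tool" >/dev/null 2>&1; then
            printf '%s\n' "$tool"
        fi
    done
}

# Succeed only when sed identifies itself as GNU sed.
# Assumption: GNU sed is what the tools rely on. BSD sed, as shipped with
# Mac OS X, does not accept --version, so this check fails there.
is_gnu_sed() {
    sed --version 2>/dev/null | head -n 1 | grep -q 'GNU'
}

# Report missing tools on stderr and return non-zero if any are absent.
# Assumed command names: "node" for NodeJS, "go" for the Go toolchain.
check_requirements() {
    local missing
    missing=$(missing_tools bash go node wget zip shorthand)
    if [ -n "$missing" ]; then
        printf 'missing tools:\n%s\n' "$missing" >&2
        return 1
    fi
    if ! is_gnu_sed; then
        echo 'warning: sed does not appear to be GNU sed' >&2
    fi
    return 0
}
```

After sourcing the file, calling `check_requirements` returns non-zero and lists the missing commands if anything is absent, and only warns about sed, since the sed concern is an assumption rather than a documented constraint.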