Ecosyste.ms: Awesome

An open API service indexing awesome lists of open source software.

Awesome Lists | Featured Topics | Projects

https://github.com/chrisseto/ajax-mirror

A casperjs script for cloning ajax and javascript heavy websites for offline viewing.
https://github.com/chrisseto/ajax-mirror

Last synced: 3 months ago
JSON representation

A casperjs script for cloning ajax and javascript heavy websites for offline viewing.

Awesome Lists containing this project

README

        

Ajax-mirror
===========

A casperjs script for cloning ajax and javascript heavy websites for offline viewing.

Uses [mockjax](https://github.com/appendto/jquery-mockjax) to mock ajax calls to local JSON.

Currently about 90% working.

###Searching
Create a [custom google search](https://www.google.com/cse/create/new) and make note of your 'cx.'
Open search/index.html and insert your cx into the `var cx=...` statement.

###404 Pages
Rename htaccess to .htaccess and place it along with 404.html into your root www/ directory.

###ToDo
* Async
* faster
* resume?
* Disable buttons?
* code review/cleanup
* Search does not work.
* Files GUID's
* Hitting download links still an issue.
* Dupilcate link issue (trailing /'s)

###Done
* not clones pages with #'s
- ie: index.html#somesection
* Emebedded youtube
* stop downloading files
* folder Naming
* FauxJax still catching html
* Url params
* create one FauxJax file?
- would be worse...
* better way to pull page source
- Doesn't seem to be a better way
* FauxJax double listing
* Inject readonly flag
* Strip piwik
* Stripping facebook and google analytics?
* github branches
- Just the issue with dashboard

Usage:
```
$capserjs Ajax-mirror.js --web-security=no
or
$casperjs Build-Mirrorable.js
$wget -x -i mirrorable.txt

```

Target site defaults to localhost:5000

Set target either in file or with --s=http://example.com
or --server=http://example.com

run with k, `casperjs Ajax-mirror.js k`, to convert links to the current working directory.

Notes:
* Embedded youtube not working
* This script is currently site specific.
- It may however work for other sites.
* The script sometimes randomly freezes on the first couple pages.
- clearing the directory and restarting the script seems to fix this.
* requires access to the filebrowser object
* Build hgrid dump method