https://github.com/64bit/word-count-pubnub
Counts word on a webpage using Block on PubNub
https://github.com/64bit/word-count-pubnub
Last synced: 10 months ago
JSON representation
Counts word on a webpage using Block on PubNub
- Host: GitHub
- URL: https://github.com/64bit/word-count-pubnub
- Owner: 64bit
- License: mit
- Created: 2017-01-10T08:45:01.000Z (over 9 years ago)
- Default Branch: master
- Last Pushed: 2017-01-11T23:14:15.000Z (over 9 years ago)
- Last Synced: 2025-05-31T03:39:49.094Z (about 1 year ago)
- Language: JavaScript
- Size: 10.7 KB
- Stars: 1
- Watchers: 1
- Forks: 0
- Open Issues: 0
-
Metadata Files:
- Readme: README.md
- License: LICENSE
Awesome Lists containing this project
README
# word-count-pubnub
Counts word on a webpage using Block on PubNub
Demo: https://64bit.github.io/word-count-pubnub/
# Data Flow:
```
|
|
(channel:wordcount)
|
|
[Event Handler: download_url.js ]
|
|
(channel:page)
|
|
[Event Handler: split_page.js ]
|
|
(channel:words)
|
|
[Event Handler: word_counter.js ]
|
|
(channel:counted-words)
|
\|/
```
Notes:
* download_url.js publishes only 15000 characters of downloaded page as PubNub payload limit is 32K. The page is published as string and will have escaped characters making it bigger than 15K characters
* download_url.js doesn't seem to handle 30X responses, for example http://www.google.com won't work because it redirects to https. Similarly https://www.twitter.com doesn't work because it redirects to https://twitter.com
* split_page.js splits on any character which is not digit or alphabet, using regex: `/[^A-Za-z0-9]/`