https://github.com/billpalombi/pocket_parse
Script to parse the html export from Pocket and write out a csv
Last synced: 4 months ago
- Host: GitHub
- URL: https://github.com/billpalombi/pocket_parse
- Owner: billpalombi
- License: mit
- Created: 2018-10-24T01:44:50.000Z (over 6 years ago)
- Default Branch: master
- Last Pushed: 2022-12-12T18:26:01.000Z (about 2 years ago)
- Last Synced: 2024-08-01T16:55:13.602Z (7 months ago)
- Language: Python
- Size: 4.88 KB
- Stars: 4
- Watchers: 1
- Forks: 1
- Open Issues: 1
Metadata Files:
- Readme: README.md
- License: LICENSE.txt
README
# Pocket Parse
This is a simple Python script that parses the `ril_export.html` file exported from the [Pocket web app](https://getpocket.com/options) and writes the data to a CSV file.

## Output
The CSV file has eight columns. Each pocketed webpage is on a separate row.

* **status** - Either `Unread` or `Read Archive`
* **title** - The title of the pocketed webpage
* **href** - The full URL of the pocketed webpage
* **domain** - The root domain or subdomain of the pocketed webpage
* **date_added** - The calendar date on which the webpage was pocketed
* **time_added** - The time of day at which the webpage was pocketed
* **day_of_week_added** - The day of the week on which the webpage was pocketed
* **tags** - A comma-separated list of tags assigned to the webpage
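
The eight columns above can be sketched in code as follows. This is a minimal illustration, not the code in `pocket_parse.py`: it targets Python 3 and uses the standard-library `html.parser` instead of BeautifulSoup so that it runs without extra installs. It assumes that the export lists each item as an `<a>` element with `href`, `time_added` (a Unix timestamp), and `tags` attributes under an `<h1>` heading that names the status, and that timestamps are rendered in UTC. The names `PocketExportParser`, `parse_export`, and `rows_to_csv` are hypothetical.

```python
import csv
import io
from datetime import datetime, timezone
from html.parser import HTMLParser
from urllib.parse import urlparse

COLUMNS = ["status", "title", "href", "domain", "date_added",
           "time_added", "day_of_week_added", "tags"]


class PocketExportParser(HTMLParser):
    """Collects one dict per <a> element, keyed by the eight column names."""

    def __init__(self):
        super().__init__()
        self.rows = []
        self._status = ""    # text of the most recent <h1> (assumed to be the status)
        self._in_h1 = False
        self._link = None    # attributes of the <a> element currently open
        self._text = []      # text fragments collected inside that <a>

    def handle_starttag(self, tag, attrs):
        if tag == "h1":
            self._in_h1 = True
            self._status = ""
        elif tag == "a":
            self._link = dict(attrs)
            self._text = []

    def handle_data(self, data):
        if self._in_h1:
            self._status += data
        elif self._link is not None:
            self._text.append(data)

    def handle_endtag(self, tag):
        if tag == "h1":
            self._in_h1 = False
            self._status = self._status.strip()
        elif tag == "a" and self._link is not None:
            href = self._link.get("href") or ""
            # Assumption: time_added is seconds since the epoch, shown here in UTC.
            added = datetime.fromtimestamp(
                int(self._link.get("time_added") or 0), tz=timezone.utc)
            self.rows.append({
                "status": self._status,
                "title": "".join(self._text).strip(),
                "href": href,
                "domain": urlparse(href).netloc,
                "date_added": added.strftime("%Y-%m-%d"),
                "time_added": added.strftime("%H:%M:%S"),
                "day_of_week_added": added.strftime("%A"),
                "tags": self._link.get("tags") or "",
            })
            self._link = None


def parse_export(html):
    """Return a list of row dicts for the given export HTML string."""
    parser = PocketExportParser()
    parser.feed(html)
    return parser.rows


def rows_to_csv(rows):
    """Serialize row dicts to CSV text with a header row."""
    buffer = io.StringIO()
    writer = csv.DictWriter(buffer, fieldnames=COLUMNS)
    writer.writeheader()
    writer.writerows(rows)
    return buffer.getvalue()
```

The date and time formats, and the choice of UTC, are illustrative; the actual script may format these columns differently.
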
## Requirements
* Python 2.x
* csv (standard library)
* codecs (standard library)
* datetime (standard library)
* BeautifulSoup

## Usage
1. Save the `ril_export.html` file exported from Pocket in the same directory as `pocket_parse.py`.
2. Run the script from the terminal: `python pocket_parse.py`
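
Once the script has run, the resulting CSV can be read back with the standard `csv` module. The sketch below counts rows per `status` value. It is written for Python 3 and assumes the file starts with a header row containing the column names listed under Output, which the README does not confirm. The function name `count_by_status` is hypothetical.

```python
import csv
from collections import Counter


def count_by_status(csv_lines):
    """Count rows per `status` value.

    `csv_lines` can be an open text file or any iterable of CSV lines
    whose first line is a header that includes a `status` column.
    """
    reader = csv.DictReader(csv_lines)
    return Counter(row["status"] for row in reader)
```

To use it on the real output, open the file the script produced with `open(path, newline="")` and pass the file object in, where `path` is whatever filename the script wrote.
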