https://github.com/yjg30737/wiki-offline
convert Wikipedia html into txt which makes it able to read offline
https://github.com/yjg30737/wiki-offline
beautifulsoup python python3 python37 python38 urllib web-crawler web-crawling wiki wikipedia
Last synced: 6 months ago
JSON representation
convert Wikipedia html into txt which makes it able to read offline
- Host: GitHub
- URL: https://github.com/yjg30737/wiki-offline
- Owner: yjg30737
- License: mit
- Created: 2022-06-30T10:44:24.000Z (almost 4 years ago)
- Default Branch: main
- Last Pushed: 2022-07-01T03:08:23.000Z (almost 4 years ago)
- Last Synced: 2024-12-29T08:42:08.659Z (over 1 year ago)
- Topics: beautifulsoup, python, python3, python37, python38, urllib, web-crawler, web-crawling, wiki, wikipedia
- Language: Python
- Homepage:
- Size: 12.7 KB
- Stars: 1
- Watchers: 1
- Forks: 0
- Open Issues: 0
-
Metadata Files:
- Readme: README.md
- License: LICENSE
Awesome Lists containing this project
README
# wiki-offline
## Overview
This converts Wikipedia html into txt which makes it able to read offline.
Text file will be made and open at the same time right after calling `convertWikiToText(url: str)`.
## Tutorial (how to make the wiki-offline by myself)
See the post
## Setup
`python -m pip install git+https://github.com/yjg30737/wiki-offline.git --upgrade`
## Usage
```python
from wiki_offline import convertWikiToText
convertWikiToText('https://en.m.wikipedia.org/wiki/Banjo')
```
## Preview
Application below which is being used for preview is Windows Notepad.
