Ecosyste.ms: Awesome
An open API service indexing awesome lists of open source software.
https://github.com/yjg30737/wiki-offline
convert Wikipedia html into txt which makes it able to read offline
https://github.com/yjg30737/wiki-offline
beautifulsoup python python3 python37 python38 urllib web-crawler web-crawling wiki wikipedia
Last synced: 7 days ago
JSON representation
convert Wikipedia html into txt which makes it able to read offline
- Host: GitHub
- URL: https://github.com/yjg30737/wiki-offline
- Owner: yjg30737
- License: mit
- Created: 2022-06-30T10:44:24.000Z (over 2 years ago)
- Default Branch: main
- Last Pushed: 2022-07-01T03:08:23.000Z (over 2 years ago)
- Last Synced: 2024-07-30T21:03:20.195Z (4 months ago)
- Topics: beautifulsoup, python, python3, python37, python38, urllib, web-crawler, web-crawling, wiki, wikipedia
- Language: Python
- Homepage:
- Size: 12.7 KB
- Stars: 1
- Watchers: 1
- Forks: 0
- Open Issues: 0
-
Metadata Files:
- Readme: README.md
- License: LICENSE
Awesome Lists containing this project
README
# wiki-offline
## Overview
This converts Wikipedia html into txt which makes it able to read offline.Text file will be made and open at the same time right after calling `convertWikiToText(url: str)`.
## Tutorial (how to make the wiki-offline by myself)
See the post## Setup
`python -m pip install git+https://github.com/yjg30737/wiki-offline.git --upgrade`## Usage
```python
from wiki_offline import convertWikiToTextconvertWikiToText('https://en.m.wikipedia.org/wiki/Banjo')
```## Preview
Application below which is being used for preview is Windows Notepad.![image](https://user-images.githubusercontent.com/55078043/176660545-d847c4d7-fa2d-4432-8d66-4fad2befb4bb.png)