https://github.com/z23788677/ccp_gfw_detector
This Project is a website detector, Detect website from text to image both way. Detect content that's CCP may don't like.
https://github.com/z23788677/ccp_gfw_detector
ccp china detector gfw-china gui pillow pytesseract python python3 selenium text-classification text-detect text-detection text-detector tk tkinter tkinter-gui tkinter-python
Last synced: about 2 months ago
JSON representation
This Project is a website detector, Detect website from text to image both way. Detect content that's CCP may don't like.
- Host: GitHub
- URL: https://github.com/z23788677/ccp_gfw_detector
- Owner: z23788677
- Created: 2024-07-09T17:02:47.000Z (almost 2 years ago)
- Default Branch: main
- Last Pushed: 2025-04-24T13:57:40.000Z (about 1 year ago)
- Last Synced: 2025-05-28T21:10:10.260Z (about 1 year ago)
- Topics: ccp, china, detector, gfw-china, gui, pillow, pytesseract, python, python3, selenium, text-classification, text-detect, text-detection, text-detector, tk, tkinter, tkinter-gui, tkinter-python
- Language: Python
- Homepage:
- Size: 25.3 MB
- Stars: 1
- Watchers: 1
- Forks: 0
- Open Issues: 0
-
Metadata Files:
- Readme: README.MD
Awesome Lists containing this project
README
Statement
===
### This project doesn't mean to political propaganda, this is aiming for fun and learning.
### Wish you get offended by it.
CCP_GFW_Detector
===
This is a program with a GUI, allowing users to input a URL link to the program. The program will access any single Chinese characters from the website HTML content. Since some websites use images more than text, once the program accesses the HTML content, it'll also process images for Chinese characters. It doesn't matter if the website uses simplified or traditional Chinese, the program won't miss any of them.
What is GFW
===
### Great Fire Wall
The Chinese government has a GFW policy. Generally speaking, this means that Chinese people can't access websites banned by the government, such as YouTube and Facebook. Since some website content is something that the CCP doesn't want Chinese people to see, this is the aim of the project.
The "word_list" Folder
===
I listed a lot of words that the CCP may not like, and rated them by "Red", "Orange", and "Yellow".
- **Red**: Words that MUST BE BANNED
- **Orange**: Words that are OKAY to use, but as less as possible
- **Yellow**: Words that are between safe and Orange words
These DAT files can be edited.
Some of the Python files have this, please ignore them as they are for testing during the developing process.
` if __name__ == "__main__":
something... `
Please Notice
===
You may have to Install Firefox Befoce running the program.
Since the GUI using "微軟正黑體", if you don't have this font in your local machine, the program may not running properly.