https://github.com/ssujitx/google-news-url-decoder
A Python script to decode Google News article URLs.
https://github.com/ssujitx/google-news-url-decoder
decoder google google-news google-news-api google-news-scraper googleapi googlenews googlenewsscraper news newsapi python python-script python3
Last synced: 22 days ago
JSON representation
A Python script to decode Google News article URLs.
- Host: GitHub
- URL: https://github.com/ssujitx/google-news-url-decoder
- Owner: SSujitX
- License: mit
- Created: 2024-06-04T19:43:40.000Z (11 months ago)
- Default Branch: main
- Last Pushed: 2025-03-31T16:14:27.000Z (about 1 month ago)
- Last Synced: 2025-04-02T06:06:07.749Z (29 days ago)
- Topics: decoder, google, google-news, google-news-api, google-news-scraper, googleapi, googlenews, googlenewsscraper, news, newsapi, python, python-script, python3
- Language: Python
- Homepage:
- Size: 255 KB
- Stars: 159
- Watchers: 19
- Forks: 17
- Open Issues: 2
-
Metadata Files:
- Readme: README.md
- License: LICENSE
- Security: SECURITY.md
Awesome Lists containing this project
README
[](https://badge.fury.io/py/googlenewsdecoder)
[](https://pypi.org/project/googlenewsdecoder/)
[](https://pepy.tech/project/googlenewsdecoder)
[](https://pepy.tech/project/googlenewsdecoder)# Google News Decoder
Google News Decoder is a Python package that can decode Google News links or Google News URLs to their original URLs. It is a simple tool that saves you time and effort. If you find it useful, please support the package by hitting the star on GitHub. Your support helps keep the project going!
## Update
- **Version 0.1.7**:
- **New Feature**: Added **proxy support** to handle rate limiting and bypass restrictions.
- **Improved**: Enhanced error handling with a fallback mechanism for decoding parameters.
- **Refined**: Optimized `get_decoding_params` to try decoding via `https://news.google.com/articles` first, falling back to `https://news.google.com/rss/articles` if needed.
- **Updated**: Reduced occurrences of HTTP 429 (Too Many Requests).
- **Removed**: Logging functionality for a cleaner codebase.
- **Fixed**: Resolved time delay issue between requests.## Demo

## Installation
You can install this package using pip:
```sh
pip install googlenewsdecoder
```- You can upgrade this package using pip (upgrade to latest version):
```sh
pip install googlenewsdecoder --upgrade
```## Supported Proxy Formats
- **HTTP/HTTPS Proxy**:
- **With authentication**: `http://user:pass@host:port` or `https://user:pass@host:port`
- **Without authentication**: `http://host:port` or `https://host:port`- **SOCKS5 Proxy**:
- **With authentication**: `socks5://user:pass@host:port`
- **Without authentication**: `socks5://host:port`- **IP and Port Only**:
- **HTTP**: `http://127.0.0.1:8080`
- **SOCKS5**: `socks5://127.0.0.1:1080`## Usage
Here is an example of how to use this package with different decoders:
### Using gnewsdecoder
```python
from googlenewsdecoder import gnewsdecoderdef main():
interval_time = 1 # interval is optional, default is Nonesource_url = "https://news.google.com/read/CBMi2AFBVV95cUxPd1ZCc1loODVVNHpnbFFTVHFkTG94eWh1NWhTeE9yT1RyNTRXMVV2S1VIUFM3ZlVkVjl6UHh3RkJ0bXdaTVRlcHBjMWFWTkhvZWVuM3pBMEtEdlllRDBveGdIUm9GUnJ4ajd1YWR5cWs3VFA5V2dsZnY1RDZhVDdORHRSSE9EalF2TndWdlh4bkJOWU5UMTdIV2RCc285Q2p3MFA4WnpodUNqN1RNREMwa3d5T2ZHS0JlX0MySGZLc01kWDNtUEkzemtkbWhTZXdQTmdfU1JJaXY?hl=en-US&gl=US&ceid=US%3Aen"
try:
decoded_url = gnewsdecoder(source_url, interval=interval_time)if decoded_url.get("status"):
print("Decoded URL:", decoded_url["decoded_url"])
else:
print("Error:", decoded_url["message"])
except Exception as e:
print(f"Error occurred: {e}")if __name__ == "__main__":
main()
```### Using gnewsdecoder with proxy
```python
from googlenewsdecoder import gnewsdecoderdef main():
interval_time = 1 # interval is optional, default is None
proxy = "http://user:pass@localhost:8080" # proxy is optional, default is Nonesource_url = "https://news.google.com/read/CBMi2AFBVV95cUxPd1ZCc1loODVVNHpnbFFTVHFkTG94eWh1NWhTeE9yT1RyNTRXMVV2S1VIUFM3ZlVkVjl6UHh3RkJ0bXdaTVRlcHBjMWFWTkhvZWVuM3pBMEtEdlllRDBveGdIUm9GUnJ4ajd1YWR5cWs3VFA5V2dsZnY1RDZhVDdORHRSSE9EalF2TndWdlh4bkJOWU5UMTdIV2RCc285Q2p3MFA4WnpodUNqN1RNREMwa3d5T2ZHS0JlX0MySGZLc01kWDNtUEkzemtkbWhTZXdQTmdfU1JJaXY?hl=en-US&gl=US&ceid=US%3Aen"
try:
decoded_url = gnewsdecoder(source_url, interval=interval_time, proxy=proxy)if decoded_url.get("status"):
print("Decoded URL:", decoded_url["decoded_url"])
else:
print("Error:", decoded_url["message"])
except Exception as e:
print(f"Error occurred: {e}")if __name__ == "__main__":
main()
```### Using a for loop to decode multiple URLs
```python
from googlenewsdecoder import gnewsdecoderdef main():
interval_time = 1 # interval is optional, default is Nonesource_urls = [
"https://news.google.com/read/CBMilgFBVV95cUxOM0JJaFRwV2dqRDk5dEFpWmF1cC1IVml5WmVtbHZBRXBjZHBfaUsyalRpa1I3a2lKM1ZnZUI4MHhPU2sydi1nX3JrYU0xWjhLaHNfU0N6cEhOYVE2TEptRnRoZGVTU3kzZGJNQzc2aDZqYjJOR0xleTdsemdRVnJGLTVYTEhzWGw4Z19lR3AwR0F1bXlyZ0HSAYwBQVVfeXFMTXlLRDRJUFN5WHg3ZTI0X1F4SjN6bmFIck1IaGxFVVZyOFQxdk1JT3JUbl91SEhsU0NpQzkzRFdHSEtjVGhJNzY4ZTl6eXhESUQ3XzdWVTBGOGgwSmlXaVRmU3BsQlhPVjV4VWxET3FQVzJNbm5CUDlUOHJUTExaME5YbjZCX1NqOU9Ta3U?hl=en-US&gl=US&ceid=US%3Aen",
"https://news.google.com/read/CBMiiAFBVV95cUxQOXZLdC1hSzFqQVVLWGJVZzlPaDYyNjdWTURScV9BbVp0SWhFNzZpSWZxSzdhc0tKbVlHMU13NmZVOFdidFFkajZPTm9SRnlZMWFRZ01CVHh0dXU0TjNVMUxZNk9Ibk5DV3hrYlRiZ20zYkIzSFhMQVVpcTFPc00xQjhhcGV1aXM00gF_QVVfeXFMTmtFQXMwMlY1el9WY0VRWEh5YkxXbHF0SjFLQVByNk1xS3hpdnBuUDVxOGZCQXl1QVFXaUVpbk5lUGgwRVVVT25tZlVUVWZqQzc4cm5MSVlfYmVlclFTOUFmTHF4eTlfemhTa2JKeG14bmNabENkSmZaeHB4WnZ5dw?hl=en-US&gl=US&ceid=US%3Aen"
]for url in source_urls:
try:
decoded_url = gnewsdecoder(url, interval=interval_time)
if decoded_url.get("status"):
print("Decoded URL:", decoded_url["decoded_url"])
else:
print("Error:", decoded_url["message"])
except Exception as e:
print(f"Error occurred: {e}")if __name__ == "__main__":
main()
```### Using a for loop to decode multiple URLs with Proxy
```python
from googlenewsdecoder import gnewsdecoderdef main():
interval_time = 1 # interval is optional, default is None
proxy = "http://user:pass@localhost:8080" # proxy is optional, default is Nonesource_urls = [
"https://news.google.com/read/CBMilgFBVV95cUxOM0JJaFRwV2dqRDk5dEFpWmF1cC1IVml5WmVtbHZBRXBjZHBfaUsyalRpa1I3a2lKM1ZnZUI4MHhPU2sydi1nX3JrYU0xWjhLaHNfU0N6cEhOYVE2TEptRnRoZGVTU3kzZGJNQzc2aDZqYjJOR0xleTdsemdRVnJGLTVYTEhzWGw4Z19lR3AwR0F1bXlyZ0HSAYwBQVVfeXFMTXlLRDRJUFN5WHg3ZTI0X1F4SjN6bmFIck1IaGxFVVZyOFQxdk1JT3JUbl91SEhsU0NpQzkzRFdHSEtjVGhJNzY4ZTl6eXhESUQ3XzdWVTBGOGgwSmlXaVRmU3BsQlhPVjV4VWxET3FQVzJNbm5CUDlUOHJUTExaME5YbjZCX1NqOU9Ta3U?hl=en-US&gl=US&ceid=US%3Aen",
"https://news.google.com/read/CBMiiAFBVV95cUxQOXZLdC1hSzFqQVVLWGJVZzlPaDYyNjdWTURScV9BbVp0SWhFNzZpSWZxSzdhc0tKbVlHMU13NmZVOFdidFFkajZPTm9SRnlZMWFRZ01CVHh0dXU0TjNVMUxZNk9Ibk5DV3hrYlRiZ20zYkIzSFhMQVVpcTFPc00xQjhhcGV1aXM00gF_QVVfeXFMTmtFQXMwMlY1el9WY0VRWEh5YkxXbHF0SjFLQVByNk1xS3hpdnBuUDVxOGZCQXl1QVFXaUVpbk5lUGgwRVVVT25tZlVUVWZqQzc4cm5MSVlfYmVlclFTOUFmTHF4eTlfemhTa2JKeG14bmNabENkSmZaeHB4WnZ5dw?hl=en-US&gl=US&ceid=US%3Aen"
]for url in source_urls:
try:
decoded_url = gnewsdecoder(url, interval=interval_time, proxy=proxy)
if decoded_url.get("status"):
print("Decoded URL:", decoded_url["decoded_url"])
else:
print("Error:", decoded_url["message"])
except Exception as e:
print(f"Error occurred: {e}")if __name__ == "__main__":
main()
```## Thank You
Thank you for installing and using Google News Decoder! I hope this tool saves you time and effort when working with Google News URLs. If you find it useful, please consider hitting the star button on GitHub. If you’d like to contribute or fork the project, your support is greatly appreciated. Thank you for your support!
## Credits
- Original script by [huksley](https://gist.github.com/huksley/)
