https://github.com/dcramer/dateminer
Extract dates from webpages
- Host: GitHub
- URL: https://github.com/dcramer/dateminer
- Owner: dcramer
- License: apache-2.0
- Created: 2011-09-16T22:35:51.000Z (almost 15 years ago)
- Default Branch: master
- Last Pushed: 2011-09-22T23:50:11.000Z (almost 15 years ago)
- Last Synced: 2025-02-06T11:18:54.549Z (over 1 year ago)
- Language: Python
- Homepage:
- Size: 163 KB
- Stars: 12
- Watchers: 3
- Forks: 3
- Open Issues: 1
Metadata Files:
- Readme: README.rst
- License: LICENSE
README
dateminer is a Python port of John Muellerleile's date_miner Java library:
https://github.com/jrecursive/date_miner
It gives you a *best guess* at the creation date of an article (webpage) based on the URL and content of that page.
Usage
=====
>>> from dateminer import guess_date
>>> date = guess_date(url, html_content)
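
To illustrate the kind of URL-based guess described above, here is a minimal sketch. It is not dateminer's actual algorithm: the helper name ``guess_date_from_url`` and the ``/YYYY/MM/DD`` pattern are assumptions made for this example, and it ignores the page content entirely.

```python
import re
from datetime import date

# Hypothetical pattern (not taken from dateminer): a /YYYY/MM/DD path
# segment, followed by another slash or the end of the URL.
_URL_DATE = re.compile(r"/(\d{4})/(\d{1,2})/(\d{1,2})(?=/|$)")


def guess_date_from_url(url):
    """Return the first valid calendar date found in the URL, or None."""
    for match in _URL_DATE.finditer(url):
        year, month, day = (int(part) for part in match.groups())
        try:
            return date(year, month, day)
        except ValueError:
            # Digits matched but do not form a real date (e.g. month 13).
            continue
    return None
```

Calling ``guess_date_from_url("http://example.com/2011/09/16/some-article/")`` would yield ``date(2011, 9, 16)``, while a URL with no date-like segment yields ``None``. A real guesser would presumably also weigh evidence from the page content, as the description above says.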