https://github.com/ajithksenthil/document-date-extractor
Extracting dates using NER from Documents
nlp textanalysis
Last synced: 11 months ago
- Host: GitHub
- URL: https://github.com/ajithksenthil/document-date-extractor
- Owner: ajithksenthil
- Created: 2021-02-18T01:22:58.000Z (almost 5 years ago)
- Default Branch: main
- Last Pushed: 2021-02-18T01:25:08.000Z (almost 5 years ago)
- Last Synced: 2025-02-08T12:45:45.408Z (about 1 year ago)
- Topics: nlp, textanalysis
- Language: Jupyter Notebook
- Homepage:
- Size: 129 KB
- Stars: 0
- Watchers: 1
- Forks: 0
- Open Issues: 0
Metadata Files:
- Readme: README.md
README
# Document-Date-Extractor
A date extractor that parses a .docx file and extracts the dates it contains.
The problem it solves is using a regex to find dates written in multiple forms: for example, "December 12th" and "December twelfth" should both be matched by the same regex query.
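
To illustrate the kind of pattern described above, here is a minimal sketch of a regex that accepts both a numeric day ("December 12th") and a spelled-out ordinal ("December twelfth"). It is not the regex used in the notebook; the names `MONTHS`, `ORDINAL_WORDS`, `NUMERIC_DAY`, `DATE_PATTERN`, and `find_dates` are hypothetical, and the sketch assumes English month names followed by a day, with no year handling.

```python
import re

# Assumption: full English month names only (no abbreviations).
MONTHS = (
    "January|February|March|April|May|June|July|"
    "August|September|October|November|December"
)

# Spelled-out ordinals for days 1-31.
ORDINAL_WORDS = (
    "thirty-first|thirtieth|"
    "twenty-(?:first|second|third|fourth|fifth|sixth|seventh|eighth|ninth)|"
    "twentieth|nineteenth|eighteenth|seventeenth|sixteenth|fifteenth|"
    "fourteenth|thirteenth|twelfth|eleventh|tenth|ninth|eighth|seventh|"
    "sixth|fifth|fourth|third|second|first"
)

# Numeric day 1-31 with an optional ordinal suffix such as "th".
NUMERIC_DAY = r"(?:3[01]|[12]\d|0?[1-9])(?:st|nd|rd|th)?"

# A month name, whitespace, then either day form.
DATE_PATTERN = re.compile(
    rf"\b(?:{MONTHS})\s+(?:{NUMERIC_DAY}|{ORDINAL_WORDS})\b",
    re.IGNORECASE,
)


def find_dates(text):
    """Return every substring of `text` that matches DATE_PATTERN."""
    return [match.group(0) for match in DATE_PATTERN.finditer(text)]
```

Calling `find_dates("Due December 12th, moved from December twelfth.")` returns both date strings. Because the pattern is a single alternation, supporting another form (for example, abbreviated month names) would mean extending one of the building blocks rather than writing a second query.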
To run the code, open the notebook at assignment-1-ajithksenthil-main/src/code/assignment-1.pynb and run its cells in a Jupyter kernel.