https://github.com/unhammer/wshiml
📚 Word shingling for near duplicate document detection
https://github.com/unhammer/wshiml
dup-finder duplicate-files duplicatefilefinder information-retrieval shingles
Last synced: 3 months ago
JSON representation
📚 Word shingling for near duplicate document detection
- Host: GitHub
- URL: https://github.com/unhammer/wshiml
- Owner: unhammer
- License: other
- Created: 2015-05-18T10:47:36.000Z (about 11 years ago)
- Default Branch: master
- Last Pushed: 2017-06-02T08:29:28.000Z (about 9 years ago)
- Last Synced: 2025-07-21T10:34:43.137Z (11 months ago)
- Topics: dup-finder, duplicate-files, duplicatefilefinder, information-retrieval, shingles
- Language: OCaml
- Homepage:
- Size: 29.3 KB
- Stars: 4
- Watchers: 2
- Forks: 1
- Open Issues: 0