https://github.com/sadit/nearduplicates.jl
Remove near duplicate from one json per line files files (e.g., tweets)
https://github.com/sadit/nearduplicates.jl
Last synced: 3 months ago
JSON representation
Remove near duplicate from one json per line files files (e.g., tweets)
- Host: GitHub
- URL: https://github.com/sadit/nearduplicates.jl
- Owner: sadit
- License: mit
- Created: 2024-04-05T20:14:02.000Z (about 1 year ago)
- Default Branch: main
- Last Pushed: 2024-05-06T23:34:38.000Z (about 1 year ago)
- Last Synced: 2025-01-19T05:57:33.727Z (4 months ago)
- Language: Julia
- Size: 10.7 KB
- Stars: 0
- Watchers: 2
- Forks: 0
- Open Issues: 2
-
Metadata Files:
- Readme: README.md
- License: LICENSE
Awesome Lists containing this project
README
# NearDuplicates
[](https://github.com/sadit/NearDuplicates.jl/actions/workflows/CI.yml?query=branch%3Amain)