{"id":15460908,"url":"https://github.com/machawk1/cdxjgenerator","last_synced_at":"2026-01-20T20:33:23.032Z","repository":{"id":45397310,"uuid":"169581763","full_name":"machawk1/cdxjGenerator","owner":"machawk1","description":"A script to generate CDXJ TimeMaps for testing elsewhere","archived":false,"fork":false,"pushed_at":"2021-12-15T21:08:20.000Z","size":27,"stargazers_count":1,"open_issues_count":3,"forks_count":1,"subscribers_count":4,"default_branch":"main","last_synced_at":"2025-04-02T03:18:21.565Z","etag":null,"topics":[],"latest_commit_sha":null,"homepage":"","language":"Python","has_issues":true,"has_wiki":null,"has_pages":null,"mirror_url":null,"source_name":null,"license":"mit","status":null,"scm":"git","pull_requests_enabled":true,"icon_url":"https://github.com/machawk1.png","metadata":{"files":{"readme":"README.md","changelog":null,"contributing":null,"funding":null,"license":"LICENSE","code_of_conduct":null,"threat_model":null,"audit":null,"citation":null,"codeowners":null,"security":null,"support":null}},"created_at":"2019-02-07T14:08:58.000Z","updated_at":"2021-12-15T21:08:21.000Z","dependencies_parsed_at":"2022-09-19T10:01:55.184Z","dependency_job_id":null,"html_url":"https://github.com/machawk1/cdxjGenerator","commit_stats":null,"previous_names":[],"tags_count":0,"template":false,"template_full_name":null,"repository_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/machawk1%2FcdxjGenerator","tags_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/machawk1%2FcdxjGenerator/tags","releases_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/machawk1%2FcdxjGenerator/releases","manifests_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/machawk1%2FcdxjGenerator/manifests","owner_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/owners/machawk1","download_url":"https://codeload.github.com/machawk1/cdxjGenerator/tar.gz/refs/heads/main","host":{"name":"GitHub","url":"https://github.com","kind":"
github","repositories_count":247640512,"owners_count":20971555,"icon_url":"https://github.com/github.png","version":null,"created_at":"2022-05-30T11:31:42.601Z","updated_at":"2022-07-04T15:15:14.044Z","host_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub","repositories_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories","repository_names_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repository_names","owners_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/owners"}},"keywords":[],"created_at":"2024-10-01T23:39:50.134Z","updated_at":"2026-01-20T20:33:23.026Z","avatar_url":"https://github.com/machawk1.png","language":"Python","funding_links":[],"categories":[],"sub_categories":[],"readme":"# CDXJ Generator\n\nA Python script to generate CDXJ TimeMaps for testing elsewhere. \n\n## Install\n\nThis tool is published to PyPI. To install it:\n\n`pip install cdxjGenerator`\n\nTo use the development version, clone this repository, then run `pip install .`\n\n\n## Usage\n\nThese instructions assume installation via `pip`.\n\nTo run:\n\n    cdxjGenerator [number of lines] [URI-R]\n    \nFor example:\n\n    cdxjGenerator 12\n    \n...will generate CDXJ output (to stdout by default) consisting of entries for 12 random URIs. Alternatively:\n\n    cdxjGenerator 25000 memento.us\n    \n...will generate 25,000 entries for the URI-R `memento.us`. This output can be written to a file with:\n\n    cdxjGenerator 25000 memento.us \u003e sample.cdxj\n\nThe resulting file will likely need to be sorted before it is used elsewhere. Do this via:\n\n    LC_ALL=C sort sample.cdxj \u003e sample_sorted.cdxj\n\nSorting can also be performed in a single command, without writing the temporary, unsorted `sample.cdxj`:\n\n    cdxjGenerator 25000 memento.us | LC_ALL=C sort \u003e sample_sorted.cdxj\n\n## Background\nTimeMaps are lists that enumerate URIs of resources that encapsulate prior states of the given resource 
([RFC7089 - Memento](https://tools.ietf.org/html/rfc7089)). TimeMaps are often expressed in an extension of the Web Linking ([RFC5988](https://tools.ietf.org/html/rfc5988)) format. Additional, less common formats, like JSON and CDXJ TimeMaps, can also express the same information in a less rigid form. [CDXJ](https://github.com/oduwsdl/ORS/wiki/CDXJ) is the most flexible of the three and is used by [InterPlanetary Wayback (ipwb)](https://github.com/oduwsdl/ipwb), which sparked the initial need for this software.\n","project_url":"https://awesome.ecosyste.ms/api/v1/projects/github.com%2Fmachawk1%2Fcdxjgenerator","html_url":"https://awesome.ecosyste.ms/projects/github.com%2Fmachawk1%2Fcdxjgenerator","lists_url":"https://awesome.ecosyste.ms/api/v1/projects/github.com%2Fmachawk1%2Fcdxjgenerator/lists"}