{"id":16521549,"url":"https://github.com/cyphunk/jstor_archive","last_synced_at":"2026-06-01T01:31:03.228Z","repository":{"id":19432673,"uuid":"22676206","full_name":"cyphunk/jstor_archive","owner":"cyphunk","description":"JSTOR_archive provides scripts to download and save in PDF/HTML form articles added to a free-user's shelf on the Journal Storage archive site JSTOR","archived":false,"fork":false,"pushed_at":"2014-08-07T08:59:57.000Z","size":164,"stargazers_count":4,"open_issues_count":0,"forks_count":0,"subscribers_count":3,"default_branch":"master","last_synced_at":"2025-03-03T02:22:22.493Z","etag":null,"topics":[],"latest_commit_sha":null,"homepage":"","language":"Shell","has_issues":true,"has_wiki":null,"has_pages":null,"mirror_url":null,"source_name":null,"license":null,"status":null,"scm":"git","pull_requests_enabled":true,"icon_url":"https://github.com/cyphunk.png","metadata":{"files":{"readme":"README.md","changelog":null,"contributing":null,"funding":null,"license":null,"code_of_conduct":null,"threat_model":null,"audit":null,"citation":null,"codeowners":null,"security":null,"support":null}},"created_at":"2014-08-06T08:55:37.000Z","updated_at":"2024-10-19T16:19:13.000Z","dependencies_parsed_at":"2022-08-24T07:20:31.821Z","dependency_job_id":null,"html_url":"https://github.com/cyphunk/jstor_archive","commit_stats":null,"previous_names":[],"tags_count":0,"template":false,"template_full_name":null,"purl":"pkg:github/cyphunk/jstor_archive","repository_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/cyphunk%2Fjstor_archive","tags_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/cyphunk%2Fjstor_archive/tags","releases_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/cyphunk%2Fjstor_archive/releases","manifests_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/cyphunk%2Fjstor_archive/manifests","owner_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/owners/cyphunk","download_url":"https://codeload.github.com/cyphunk/jstor_archive/tar.gz/refs/heads/master","sbom_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/cyphunk%2Fjstor_archive/sbom","scorecard":null,"host":{"name":"GitHub","url":"https://github.com","kind":"github","repositories_count":286080680,"owners_count":33756575,"icon_url":"https://github.com/github.png","version":null,"created_at":"2022-05-30T11:31:42.601Z","updated_at":"2026-05-26T15:22:16.424Z","status":"online","status_checked_at":"2026-05-31T02:00:06.040Z","response_time":95,"last_error":null,"robots_txt_status":"success","robots_txt_updated_at":"2025-07-24T06:49:26.215Z","robots_txt_url":"https://github.com/robots.txt","online":true,"can_crawl_api":true,"host_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub","repositories_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories","repository_names_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repository_names","owners_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/owners"}},"keywords":[],"created_at":"2024-10-11T16:56:54.924Z","updated_at":"2026-06-01T01:31:03.206Z","avatar_url":"https://github.com/cyphunk.png","language":"Shell","funding_links":[],"categories":[],"sub_categories":[],"readme":"Found it dificult to find scripts that help archive JSTOR content. This is\nsurprising considering their sordid history in relation to Aaron Swartz.\n\nThese scripts do not download PDF content of privileged JSTOR users. For\nthat look elsewhere. Instead these scripts allow for archiving of articles\nprovided freely on a limited bases to public unprivileged users.\nThe shell script simply monitors your browser cache and copies GIF's\nthat match the JSTOR preview size (configurable) to a directory and then\ngenerates a pdf and html index when interrupted. The included javascript\nuser script can be loaded in the browser (via greasemonkey or other user\nscript management extensions) and will emulate a click on the \"next\"\nbutton of a JSTOR article so as to fill up the browser cache with the\narticles contents.\n\n# Use\n\nDefaults are configured for Firefox browsers on Linux systems or Chrome on OSX.\nFor other browsers or systems set the ``CACHE_DIR`` path appropriately.\n\n1. ``git clone https://github.com/cyphunk/jstor_archive.git``\n2. Install Javascript userscript via User Script manager (such as grease\n   monkey) or directly if your browser supports it by accessing the raw\n   [jstor_clicknext.user.js](https://github.com/cyphunk/jstor_archive/raw/master/jstor_clicknext.user.js).\n3. ``cd jstor_archive`` and ``./cache_monitor.sh EXAMPLE_NAME`` where\n   ``EXAMPLE_NAME`` is the directory to be created within the current directory\n   for storing GIF's. This name will also be used as the name for the PDF and\n   HTML indexes that will be created later.\n4. If the monitored ``CACHE_DIR`` shown on execution is different from that of\n   your browser change the ``CACHE_DIR`` environment variable and execute again.\n5. Open a JSTOR article in your browser.\n6. When all pages of the article have been viewed, either by manually\n   clicking through each page or by letting the ``jstor_clicknext.user.js``\n   user script do it, press \"``ctrl+c``\" in the scripts.\n   PDF and HTML files will be created in the ``EXAMPLE_NAME`` directory.\n","project_url":"https://awesome.ecosyste.ms/api/v1/projects/github.com%2Fcyphunk%2Fjstor_archive","html_url":"https://awesome.ecosyste.ms/projects/github.com%2Fcyphunk%2Fjstor_archive","lists_url":"https://awesome.ecosyste.ms/api/v1/projects/github.com%2Fcyphunk%2Fjstor_archive/lists"}