{"id":21035446,"url":"https://github.com/archiveteam/terroroftinytown-client-grab","last_synced_at":"2025-10-30T12:38:23.539Z","repository":{"id":15802516,"uuid":"18541980","full_name":"ArchiveTeam/terroroftinytown-client-grab","owner":"ArchiveTeam","description":"The Seesaw pipeline grab script for the URLTeam (terroroftinytown) project","archived":false,"fork":false,"pushed_at":"2024-08-14T17:52:12.000Z","size":45,"stargazers_count":27,"open_issues_count":3,"forks_count":11,"subscribers_count":14,"default_branch":"master","last_synced_at":"2025-05-12T08:34:43.246Z","etag":null,"topics":[],"latest_commit_sha":null,"homepage":null,"language":"Python","has_issues":true,"has_wiki":null,"has_pages":null,"mirror_url":null,"source_name":null,"license":"unlicense","status":null,"scm":"git","pull_requests_enabled":true,"icon_url":"https://github.com/ArchiveTeam.png","metadata":{"files":{"readme":"README.md","changelog":null,"contributing":null,"funding":null,"license":"LICENSE","code_of_conduct":null,"threat_model":null,"audit":null,"citation":null,"codeowners":null,"security":null,"support":null,"governance":null,"roadmap":null,"authors":null,"dei":null,"publiccode":null,"codemeta":null}},"created_at":"2014-04-08T02:15:39.000Z","updated_at":"2024-08-14T17:52:15.000Z","dependencies_parsed_at":"2024-01-18T14:36:36.137Z","dependency_job_id":"a574f9b8-00c2-4e0e-ba9f-09ca0d6356bc","html_url":"https://github.com/ArchiveTeam/terroroftinytown-client-grab","commit_stats":{"total_commits":103,"total_committers":10,"mean_commits":10.3,"dds":"0.10679611650485432","last_synced_commit":"ae93345d1ce1817d68b64b27a63c600e547f1f8b"},"previous_names":[],"tags_count":0,"template":false,"template_full_name":null,"repository_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/ArchiveTeam%2Fterroroftinytown-client-grab","tags_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/ArchiveTeam%2Fterroroftinytown-client-grab/tags","releases_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/ArchiveTeam%2Fterroroftinytown-client-grab/releases","manifests_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/ArchiveTeam%2Fterroroftinytown-client-grab/manifests","owner_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/owners/ArchiveTeam","download_url":"https://codeload.github.com/ArchiveTeam/terroroftinytown-client-grab/tar.gz/refs/heads/master","host":{"name":"GitHub","url":"https://github.com","kind":"github","repositories_count":254358681,"owners_count":22057959,"icon_url":"https://github.com/github.png","version":null,"created_at":"2022-05-30T11:31:42.601Z","updated_at":"2022-07-04T15:15:14.044Z","host_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub","repositories_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories","repository_names_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repository_names","owners_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/owners"}},"keywords":[],"created_at":"2024-11-19T13:14:56.771Z","updated_at":"2025-10-30T12:38:23.458Z","avatar_url":"https://github.com/ArchiveTeam.png","language":"Python","funding_links":[],"categories":[],"sub_categories":[],"readme":"terroroftinytown-client-grab\n============================\n\nThe Seesaw pipeline grab script for the URLTeam (terroroftinytown) project\n\nMore information about the archiving project can be found on the ArchiveTeam wiki: [URLTeam](http://archiveteam.org/index.php?title=URLTeam)\n\nSee [terroroftinytown](https://github.com/ArchiveTeam/terroroftinytown) for source code details.\n\n\nSetup instructions\n=========================\n\nThere are now several ways to run this; the preffered method is via the included Dockerfile.\n\nBe sure to replace `YOURNICKHERE` with the nickname that you want to be shown as, on the tracker. You don't need to register it, just pick a nickname you like.\n\nIn most of the below cases, there will be a web interface running at http://localhost:8001/. If you don't know or care what this is, you can just ignore it—otherwise, it gives you a fancy view of what's going on.\n\n**If anything goes wrong while running the commands below, please scroll down to the bottom of this page. There's troubleshooting information there.**\n\nRunning with docker\n--------------------\n\n\u003cimg alt=\"Docker logo\" src=\"https://upload.wikimedia.org/wikipedia/commons/7/79/Docker_%28container_engine%29_logo.png\" height=\"100px\"\u003e\n\nAssuming this is a stand alone box, not part of a swarm etc, basic instructions for configuring your docker instance can be found at [docker documentation](https://docs.docker.com/install/) or for [Ubuntu](https://docs.docker.com/install/linux/docker-ce/ubuntu/) / [Debian](https://docs.docker.com/install/linux/docker-ce/debian/).\n\nMake a directory, cd into the directry and copy the included dockerfile into it; the rest of the files are not required. Edit the final line to include the concurrency and replace `YOURNICK` with your username.\n\nBuild the container with the following arguments;\n\n    docker build -t \u003c\u003cdockername\u003e\u003e \u003c\u003cfoldername\u003e\u003e/\n\nfor example\n\n    docker build -t at-terroroftinytown archiveteam-terroroftinytown/\n    \nThen simply run the container with either;\n\n    docker run -d -it at-terroroftinytown\n\nor if you want to give it a known name and make it easier to run commands;\n\n    docker run -d -it --name at-terroroftinytown at-terroroftinytown\n\nor if you really want that web page to be available;\n\n    docker run -d -it -p 8001:8001 --name at-terroroftinytown at-terroroftinytown\n\nStopping the container (clean);\n\n    docker run -d -it --name \u003c\u003ccontainername\u003e\u003e touch STOP\n\nStopping the container (hard);\n\n    docker stop \u003c\u003ccontainername\u003e\u003e\n\nConnecting to the container console;\n\n    docker attach \u003c\u003ccontainername\u003e\u003e\n\nRunning with a warrior\n-------------------------\n\nFollow the [instructions on the ArchiveTeam wiki](http://archiveteam.org/index.php?title=Warrior) for installing the Warrior, and select the \"URLTeam\" project in the Warrior interface.\n\nRunning without a warrior\n-------------------------\nTo run this outside the warrior, clone this repository, cd into its directory and run:\n\n    pip install seesaw requests\n\nthen start downloading with:\n\n    run-pipeline pipeline.py --concurrent 2 YOURNICKHERE\n\nFor more options, run:\n\n    run-pipeline --help\n\nIf you don't have root access and/or your version of pip is very old, you can replace \"pip install seesaw\" with:\n\n    wget https://raw.github.com/pypa/pip/master/contrib/get-pip.py ; python get-pip.py --user ; ~/.local/bin/pip install --user seesaw requests\n\nso that pip and seesaw are installed in your home, then run\n\n    ~/.local/bin/run-pipeline pipeline.py --concurrent 2 YOURNICKHERE\n\n**Note**: The code is partially self-updating by default with Git version 1.8.2 or higher. Use `--context-value no_submodule=1` if you do not want this behavior. Conversely, if you want full updating, use the `--auto-update` feature available in Seesaw 0.8 or higher.\n\n\nRunning multiple instances on different IPs\n-------------------------------------------\n\nThis feature requires seesaw version 0.0.16 or greater. Use `pip install --upgrade seesaw` to upgrade.\n\nUse the `--context-value` argument to pass in `bind_address=123.4.5.6` (replace the IP address with your own).\n\nExample of running 2 threads, no web interface, and binding of IP address:\n\n    run-pipeline pipeline.py --concurrent 2 YOURNICKHERE --disable-web-server --context-value bind_address=123.4.5.6\n\nDistribution-specific setup\n-------------------------\n### For Debian/Ubuntu:\n\n    adduser --system --group --shell /bin/bash archiveteam\n    apt-get install -y git-core screen python-dev python-pip\n    pip install seesaw requests\n    su -c \"cd /home/archiveteam; git clone https://github.com/ArchiveTeam/terroroftinytown-client-grab.git; cd terroroftinytown-client-grab;\" archiveteam\n    screen su -c \"cd /home/archiveteam/terroroftinytown-client-grab/; run-pipeline pipeline.py --concurrent 2 --address '127.0.0.1' YOURNICKHERE\" archiveteam\n    [... ctrl+A D to detach ...]\n\n\n### For CentOS:\n\nEnsure that you have the CentOS equivalent of bzip2 installed as well. You might need the EPEL repository to be enabled.\n\n    yum -y install python-pip\n    pip install seesaw requests\n    [... pretty much the same as above ...]\n\n### For openSUSE:\n\n    zypper install screen python-pip python-devel gcc make\n    pip install seesaw requests\n    [... pretty much the same as above ...]\n\n### For OS X:\n\nYou need Homebrew. Ensure that you have the OS X equivalent of bzip2 installed as well.\n\n    brew install python\n    pip install seesaw requests\n    [... pretty much the same as above ...]\n\n\n### For Arch Linux:\n\nEnsure that you have the Arch equivalent of bzip2 installed as well.\n\n1. Make sure you have `python2-pip` installed.\n3. Run `pip2 install seesaw requests`.\n4. Modify the run-pipeline script in seesaw to point at `#!/usr/bin/python2` instead of `#!/usr/bin/python`.\n5. `useradd --system --group users --shell /bin/bash --create-home archiveteam`\n6. `screen su -c \"cd /home/archiveteam/terroroftinytown-client-grab/; run-pipeline pipeline.py --concurrent 2 --address '127.0.0.1' YOURNICKHERE\" archiveteam`\n\n\nTroubleshooting\n=========================\n\nBroken? These are some of the possible solutions:\n\n\n### ImportError: No module named seesaw\n\nIf you're sure that you followed the steps to install `seesaw`, permissions on your module directory may be set incorrectly. Try the following:\n\n    chmod o+rX -R /usr/local/lib/python2.7/dist-packages\n\n### Issues in the code\n\nIf you notice a bug and want to file a bug report, please use the GitHub issues tracker.\n\nAre you a developer? Help write code for us! Look at our [developer documentation](http://archiveteam.org/index.php?title=Dev) for details.\n\n### Other problems\n\nHave an issue not listed here? Join us on IRC and ask! We can be found at irc.efnet.org #urlteam.\n","project_url":"https://awesome.ecosyste.ms/api/v1/projects/github.com%2Farchiveteam%2Fterroroftinytown-client-grab","html_url":"https://awesome.ecosyste.ms/projects/github.com%2Farchiveteam%2Fterroroftinytown-client-grab","lists_url":"https://awesome.ecosyste.ms/api/v1/projects/github.com%2Farchiveteam%2Fterroroftinytown-client-grab/lists"}