{"id":16696922,"url":"https://github.com/seratch/gyotaku","last_synced_at":"2025-04-05T03:43:40.769Z","repository":{"id":3512942,"uuid":"4570647","full_name":"seratch/gyotaku","owner":"seratch","description":"Saving complete web pages by using Selenium Web Driver","archived":false,"fork":false,"pushed_at":"2012-06-19T02:40:39.000Z","size":230,"stargazers_count":5,"open_issues_count":1,"forks_count":0,"subscribers_count":3,"default_branch":"master","last_synced_at":"2025-02-10T12:12:37.744Z","etag":null,"topics":[],"latest_commit_sha":null,"homepage":"","language":"Scala","has_issues":true,"has_wiki":null,"has_pages":null,"mirror_url":null,"source_name":"kimmking/kk","license":null,"status":null,"scm":"git","pull_requests_enabled":true,"icon_url":"https://github.com/seratch.png","metadata":{"files":{"readme":"README.md","changelog":null,"contributing":null,"funding":null,"license":null,"code_of_conduct":null,"threat_model":null,"audit":null,"citation":null,"codeowners":null,"security":null,"support":null}},"created_at":"2012-06-06T09:10:23.000Z","updated_at":"2019-08-13T15:04:04.000Z","dependencies_parsed_at":"2022-08-20T19:50:42.015Z","dependency_job_id":null,"html_url":"https://github.com/seratch/gyotaku","commit_stats":null,"previous_names":[],"tags_count":5,"template":false,"template_full_name":null,"repository_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/seratch%2Fgyotaku","tags_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/seratch%2Fgyotaku/tags","releases_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/seratch%2Fgyotaku/releases","manifests_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/seratch%2Fgyotaku/manifests","owner_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/owners/seratch","download_url":"https://codeload.github.com/seratch/gyotaku/tar.gz/refs/heads/master","host":{"name":"GitHub","url":"https://github.com","kind":"github","repositories_count":247284918,"owners_count":209
13691,"icon_url":"https://github.com/github.png","version":null,"created_at":"2022-05-30T11:31:42.601Z","updated_at":"2022-07-04T15:15:14.044Z","host_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub","repositories_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories","repository_names_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repository_names","owners_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/owners"}},"keywords":[],"created_at":"2024-10-12T17:45:17.905Z","updated_at":"2025-04-05T03:43:40.749Z","avatar_url":"https://github.com/seratch.png","language":"Scala","funding_links":[],"categories":[],"sub_categories":[],"readme":"# Gyotaku - 魚拓（ぎょたく）\n\n## What's this?\n\nGyotaku is a simple tool for saving complete web pages.\n\n## Requirements\n\n- Mac OS/Linux/Windows\n- Java Runtime Environment\n- Firefox\n\n## Usage\n\n### Get Gyotaku\n\nDownload gyotaku.zip and unzip it.\n\nhttps://github.com/seratch/gyotaku/downloads\n\n\n### Invoke Gyotaku\n\nUsing the Gyotaku UI (a Swing application) is the easiest way.\n\n```\n./gyotaku_ui\n```\n\n![screen_shot](https://github.com/seratch/gyotaku/raw/master/img/gyotaku_screen_shot.png)\n\n\n### Authentication\n\nTo save a page that requires authentication, supply your own customized Selenium WebDriver script.\n\n#### input/tumblr-login.scala\n\nAdd the following source code:\n\n```scala\nimport org.openqa.selenium._\nval driver = new firefox.FirefoxDriver\ndriver.get(\"https://www.tumblr.com/login\")\ndriver.findElement(By.id(\"signup_email\")).sendKeys(\"YOUR_EMAIL\")\ndriver.findElement(By.id(\"signup_password\")).sendKeys(\"YOUR_PASSWORD\")\ndriver.findElement(By.id(\"signup_form\")).submit()\ndriver\n```\n\n#### input/tumblr.yml\n\n```yml\nname: tumblr-dashbord\nurl: http://www.tumblr.com/dashboard\ndriver: { path: input/tumblr-login.scala }\n```\n\n\n## Configuration\n\n```yml\nname: example\nurl: http://www.example.com/\ndriver: input/login_operation.scala\ncharset: 
UTF-8\nprettify: false\nreplaceNoDomainOnly: false\n```\n\n### name\n\nThe name of the gyotaku. It is used as the directory name under the output directory.\n\n### url\n\nThe URL of the page to download.\n\n### driver\n\nHow to create an `org.openqa.selenium.WebDriver` instance.\n\n`FirefoxDriver` is used if it's omitted.\n\n```yml\ndriver:\n  path: path/to/driver.scala\n```\n\n### charset\n\nThe charset used for the downloaded HTML and the modified CSS files.\n\n\"UTF-8\" if it's omitted.\n\n### prettify\n\nWhether to clean up the HTML using HtmlCleaner.\n\n`false` if it's omitted.\n\n### replaceNoDomainOnly\n\nWhether to replace URLs in the HTML/CSS only when they don't start with 'http://' or 'https://'.\n\n`true` if it's omitted.\n","project_url":"https://awesome.ecosyste.ms/api/v1/projects/github.com%2Fseratch%2Fgyotaku","html_url":"https://awesome.ecosyste.ms/projects/github.com%2Fseratch%2Fgyotaku","lists_url":"https://awesome.ecosyste.ms/api/v1/projects/github.com%2Fseratch%2Fgyotaku/lists"}