{"id":18577523,"url":"https://github.com/pjt3591oo/python-parse","last_synced_at":"2025-08-04T09:35:31.929Z","repository":{"id":100332723,"uuid":"75445607","full_name":"pjt3591oo/python-parse","owner":"pjt3591oo","description":"this are modules for url pasing","archived":false,"fork":false,"pushed_at":"2017-02-12T02:07:11.000Z","size":4,"stargazers_count":0,"open_issues_count":0,"forks_count":0,"subscribers_count":2,"default_branch":"master","last_synced_at":"2025-07-26T15:53:58.581Z","etag":null,"topics":["crawler"],"latest_commit_sha":null,"homepage":"","language":"Python","has_issues":true,"has_wiki":null,"has_pages":null,"mirror_url":null,"source_name":null,"license":null,"status":null,"scm":"git","pull_requests_enabled":true,"icon_url":"https://github.com/pjt3591oo.png","metadata":{"files":{"readme":"README.md","changelog":null,"contributing":null,"funding":null,"license":null,"code_of_conduct":null,"threat_model":null,"audit":null,"citation":null,"codeowners":null,"security":null,"support":null,"governance":null,"roadmap":null,"authors":null,"dei":null,"publiccode":null,"codemeta":null}},"created_at":"2016-12-03T02:05:02.000Z","updated_at":"2016-12-03T02:19:44.000Z","dependencies_parsed_at":"2023-05-13T22:15:49.423Z","dependency_job_id":null,"html_url":"https://github.com/pjt3591oo/python-parse","commit_stats":null,"previous_names":[],"tags_count":0,"template":false,"template_full_name":null,"purl":"pkg:github/pjt3591oo/python-parse","repository_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/pjt3591oo%2Fpython-parse","tags_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/pjt3591oo%2Fpython-parse/tags","releases_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/pjt3591oo%2Fpython-parse/releases","manifests_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/pjt3591oo%2Fpython-parse/manifests","owner_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/owners/pjt3591oo","download_url":"https://codeload.github.com/pjt3591oo/python-parse/tar.gz/refs/heads/master","sbom_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/pjt3591oo%2Fpython-parse/sbom","host":{"name":"GitHub","url":"https://github.com","kind":"github","repositories_count":268675709,"owners_count":24288330,"icon_url":"https://github.com/github.png","version":null,"created_at":"2022-05-30T11:31:42.601Z","updated_at":"2022-07-04T15:15:14.044Z","status":"online","status_checked_at":"2025-08-04T02:00:09.867Z","response_time":79,"last_error":null,"robots_txt_status":"success","robots_txt_updated_at":"2025-07-24T06:49:26.215Z","robots_txt_url":"https://github.com/robots.txt","online":true,"can_crawl_api":true,"host_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub","repositories_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories","repository_names_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repository_names","owners_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/owners"}},"keywords":["crawler"],"created_at":"2024-11-06T23:29:30.777Z","updated_at":"2025-08-04T09:35:31.884Z","avatar_url":"https://github.com/pjt3591oo.png","language":"Python","funding_links":[],"categories":[],"sub_categories":[],"readme":"\n\n# 해당 모듈은 url파싱에 필요한 기능을 제공하는 모듈입니다.\n-----\n\n## parse.py\n1. 구현 기능\n    * 최상위 url 가져오기\n    * 패스 가져오기\n    * 쿼리 스트링 가져오기\n    * 쿼리 스트링 파싱\n    * 해당 링크 마지막에 존재하는 /,?와 같은 특수문자 제거\n\n2. `테스트 코드 실행`\n\n```\npython ./test/test.py\n```\n\n\n3. 해당 코드에서 사용되는 테스트 url들\n    * \"http://www.naver.com/\"\n    * \"http://www.naver.com\"\n    * \"http://www.naver.com/path1\"\n    * \"http://www.naver.com/path1?\"\n    * \"http://www.naver.com/path1/path2\"\n    * \"http://www.naver.com/path1/path2/\"\n    * \"http://www.naver.com/p?a=10\u0026b=12\"\n    * \"http://www.naver.com/p?a=10\u0026b=12\u0026c=\"\n    * \"?a=10\u0026b=12\"\n    * \"a=10\u0026b=12\"\n\n\n## category.py\n\n1. 모듈 설명\n\t해당 모듈은 메인 페이지에서 카테고리의 정보를 dictionary로 만들어 주는 모듈입니다.\n\t\n\t반환 형태 : \n\t\n\t```\n\t{'001': 'category1', '002': 'category2' ...}\n\t```\n\t\n2. 사용방법 \n\t카테고리의 쿼리스트링을 넘겨주면 해당 쿼리스트링을 딕셔너리로 만들어 줍니다.\n\t\n\t```.py\n\tfrom category import Category\n\t\n\tSHOP_URL = 'http://www.dahong.co.kr'\n\tQUERY_STRING_KEY = 'a'\n\n\tif __name__ =='__main__':\n    \n    \tc = Category(SHOP_URL, QUERY_STRING_KEY)\n    \tprint(c())\n\t```\t\n\t\n3. 출력결과\n\t\n\t```\n\t{'1': 'TOP', '41': 'BIKINI', '5': 'PANTS', '318': 'SHIRTS \u0026 BLOUSE', '4': 'DRESS', '6': 'BAG \u0026 SHOES', '321': 'SKIRT', '3': 'OUTER', '8': 'ACC', '9': 'INNER'}\n\t```\t\n\t\n\t","project_url":"https://awesome.ecosyste.ms/api/v1/projects/github.com%2Fpjt3591oo%2Fpython-parse","html_url":"https://awesome.ecosyste.ms/projects/github.com%2Fpjt3591oo%2Fpython-parse","lists_url":"https://awesome.ecosyste.ms/api/v1/projects/github.com%2Fpjt3591oo%2Fpython-parse/lists"}