{"id":13826099,"url":"https://github.com/moehmeni/ezweb","last_synced_at":"2025-04-06T09:32:02.402Z","repository":{"id":49316935,"uuid":"386418245","full_name":"moehmeni/ezweb","owner":"moehmeni","description":"Easy to use web page analyzer","archived":false,"fork":false,"pushed_at":"2022-11-21T19:11:12.000Z","size":546,"stargazers_count":5,"open_issues_count":0,"forks_count":0,"subscribers_count":1,"default_branch":"main","last_synced_at":"2025-03-21T21:44:32.866Z","etag":null,"topics":["analyzer","crawler","scraper","text-analysis","text-classification","text-mining","webcrawler","webcrawling","webpage","webscraper","webscraping","www"],"latest_commit_sha":null,"homepage":"","language":"Python","has_issues":true,"has_wiki":null,"has_pages":null,"mirror_url":null,"source_name":null,"license":"mit","status":null,"scm":"git","pull_requests_enabled":true,"icon_url":"https://github.com/moehmeni.png","metadata":{"files":{"readme":"README.md","changelog":null,"contributing":null,"funding":null,"license":"LICENSE","code_of_conduct":null,"threat_model":null,"audit":null,"citation":null,"codeowners":null,"security":null,"support":null}},"created_at":"2021-07-15T20:41:30.000Z","updated_at":"2024-12-31T11:24:41.000Z","dependencies_parsed_at":"2023-01-23T11:45:59.219Z","dependency_job_id":null,"html_url":"https://github.com/moehmeni/ezweb","commit_stats":null,"previous_names":["0x7d4/ezweb","lo3me/ezweb","moehmeni/ezweb"],"tags_count":5,"template":false,"template_full_name":null,"repository_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/moehmeni%2Fezweb","tags_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/moehmeni%2Fezweb/tags","releases_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/moehmeni%2Fezweb/releases","manifests_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/moehmeni%2Fezweb/manifests","owner_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/owners/moehmeni","download_url":"https://codeload.github.com/moehmeni/ezweb/tar.gz/refs/heads/main","host":{"name":"GitHub","url":"https://github.com","kind":"github","repositories_count":247463745,"owners_count":20942935,"icon_url":"https://github.com/github.png","version":null,"created_at":"2022-05-30T11:31:42.601Z","updated_at":"2022-07-04T15:15:14.044Z","host_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub","repositories_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories","repository_names_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repository_names","owners_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/owners"}},"keywords":["analyzer","crawler","scraper","text-analysis","text-classification","text-mining","webcrawler","webcrawling","webpage","webscraper","webscraping","www"],"created_at":"2024-08-04T09:01:32.089Z","updated_at":"2025-04-06T09:32:02.130Z","avatar_url":"https://github.com/moehmeni.png","language":"Python","funding_links":[],"categories":["Python"],"sub_categories":[],"readme":"# EzWeb\n An easy to use web page analyzer (scraper or crawler) with many useful features and properties\n\n## Quick Access\n- [Notes](#notes)\n- [Installation](#installation)\n- [Basic example](#basic-example)\n- [EzProduct](#ezproduct)\n\n## Installation\n```\npip install https://github.com/rtcq/ezweb/releases/download/v4.5.0/ezweb-4.5.0-py3-none-any.whl\n```\n## Basic Example\n```python\nfrom ezweb import EzSoup\n\nurl = \"https://www.theverge.com/22731034/google-pixel-6-pro-price-specs-features-release-date-hands-on\"\n\npage = EzSoup(url = url)\n\nprint(page.json_summary)\n```\nOutput :\n```json\n{\n    \"url\": \"https://www.theverge.com/22731034/google-pixel-6-pro-price-specs-features-release-date-hands-on\",\n    \"source\": {\n        \"url\": \"https://www.theverge.com\",\n        \"name\": \"The Verge\",\n        \"description\": \"The Verge was founded in 2011 in partnership with Vox Media, and covers the intersection of technology, science, art, and culture. Its mission is to offer in-depth reporting and long-form feature stories, breaking news coverage, product information, and community content in a unified and cohesive manner. The site is powered by Vox Media's Chorus platform, a modern media stack built for web-native news in the 21st century.\",\n        \"language\": \"en\",\n        \"image\": \"https://cdn.vox-cdn.com/uploads/chorus_asset/file/7395351/android-chrome-192x192.0.png\",\n        \"rss_feed_url\": \"https://theverge.com/rss/index.xml\",\n        \"sitemap_url\": \"https://www.theverge.com/sitemaps\"\n    },\n    \"title\": \"Pixel 6 and 6 Pro: a first look at Google’s shot at a premium Android phone\",\n    \"description\": \"Google has officially announced its new Pixel 6 and Pixel 6 Pro. The new models start at $599 and $899, respectively, and feature new designs, new cameras, and the first-ever Google custom processor. They are available to preorder starting October 19th and will be shipping on October 28th.\",\n    \"date\": \"2021-10-19 13:00:00-04:00\",\n    \"main_image\": \"https://cdn.vox-cdn.com/thumbor/5f5xEVqSF0S3aTCRnoByipEng_4=/0x53:2040x1121/fit-in/1200x630/cdn.vox-cdn.com/uploads/chorus_asset/file/22934833/bfarsace_211014_4802_0013.jpg\",\n    \"main_content\": \"After many leaks, official teases, and months of waiting, Google has finally given its latest Pixel  ... [MORE]\",\n    \"possible_topics\": [\n        \"Google\"\n    ],\n    \"comments\": \"Loading comments...\"\n}\n```\n\n## EzProduct\n```python\nfrom ezweb import EzProduct\n\nurl = \"https://www.razer.com/gaming-laptops/Razer-Blade-15/RZ09-0409JED3-R3U1\"\n\npage = EzProduct(url)\n\nprint(page.json_summary)\n```\nOutput:\n```json\n{\n    \"provider\": {\n        \"name\": \"Razer\",\n        \"domain\": \"razer.com\",\n        \"addresses\": null,\n        \"phone\": []\n    },\n    \"url\": \"https://www.razer.com/gaming-laptops/Razer-Blade-15/RZ09-0409JED3-R3U1\",\n    \"id_sku_or_mpn\": null,\n    \"title\": \"Blade 15 Advanced Model QHD 240Hz GeForce RTX 3070 Black\",\n    \"second_title\": null,\n    \"is_available\": true,\n    \"low_price\": 2699.99,\n    \"high_price\": 2699.99,\n    \"has_discount\": true,\n    \"discount_percentage\": 0,\n    \"price\": {\n        \"number\": 2699.99,\n        \"unit\": \"USD\",\n        \"number_humanize\": \"2,700\",\n        \"humanize\": \"2,700 USD\"\n    },\n    \"brand\": \"Razer\",\n    \"images\": [\n        \"https://assets3.razerzone.com/BXmAEATSJMaLlom3EfL6iwV0QuU=/1500x1000/https%3A%2F%2Fhybrismediaprod.blob.core.windows.net%2Fsys-master-phoenix-images-container%2Fha6%2Fh11%2F9208511594526%2F500x500-blade15-may2021-fhd.png\"\n    ],\n    \"specs\": [\n        {\n            \"Processor\": \"11th Gen Intel® Core™ i7-11800H 8 Cores (2.3GHz / 4.6GHz)\"\n        },\n        {\n            \"OS\": \"Windows 11 Home\"\n        },\n        {\n            \"Display\": \"15.6\\\" QHD 240Hz, 100% DCI-P3, G-Sync, 2.5ms, individually factory calibrated\"\n        },\n        {\n            \"Graphics\": \"Discrete: NVIDIA® GeForce RTX™3070 (8GB GDDR6 VRAM)Integrated: Intel® UHD Graphics\"\n        },\n        // And more...\n    ],\n    \"possible_topics\": []\n}\n```\n\n## Notes\n- `EzSoup` and especially `EzProduct` results are more accurate for Persian websites\n- Since I did not spend much time documenting the code, the package structure might look confusing\n","project_url":"https://awesome.ecosyste.ms/api/v1/projects/github.com%2Fmoehmeni%2Fezweb","html_url":"https://awesome.ecosyste.ms/projects/github.com%2Fmoehmeni%2Fezweb","lists_url":"https://awesome.ecosyste.ms/api/v1/projects/github.com%2Fmoehmeni%2Fezweb/lists"}