{"id":28249137,"url":"https://github.com/gitchaell/computer-scrapping","last_synced_at":"2026-05-01T00:31:51.548Z","repository":{"id":168847861,"uuid":"429506746","full_name":"gitchaell/computer-scrapping","owner":"gitchaell","description":"Tool that extracts data from the pages of companies that sell computers in the city of Trujillo - Peru, exports them in an XLSX file according to a relational data model, and displays them on a Power BI dashboard.","archived":false,"fork":false,"pushed_at":"2021-11-26T02:53:22.000Z","size":4770,"stargazers_count":0,"open_issues_count":0,"forks_count":0,"subscribers_count":1,"default_branch":"main","last_synced_at":"2025-06-13T11:46:07.319Z","etag":null,"topics":["data-analysis","data-structures","data-visualization","database","dbdiagram","export-excel","powerbi","scrapper-script","scrapping","xlsx"],"latest_commit_sha":null,"homepage":"","language":"TypeScript","has_issues":true,"has_wiki":null,"has_pages":null,"mirror_url":null,"source_name":null,"license":"mit","status":null,"scm":"git","pull_requests_enabled":true,"icon_url":"https://github.com/gitchaell.png","metadata":{"files":{"readme":"README.md","changelog":null,"contributing":null,"funding":null,"license":"LICENSE","code_of_conduct":null,"threat_model":null,"audit":null,"citation":null,"codeowners":null,"security":null,"support":null,"governance":null,"roadmap":null,"authors":null,"dei":null,"publiccode":null,"codemeta":null,"zenodo":null}},"created_at":"2021-11-18T16:44:37.000Z","updated_at":"2021-11-26T02:53:25.000Z","dependencies_parsed_at":null,"dependency_job_id":"3697fda2-7e65-46d7-8f9b-4c82d01dfa3e","html_url":"https://github.com/gitchaell/computer-scrapping","commit_stats":null,"previous_names":["gitchaell/computer-scrapping"],"tags_count":0,"template":false,"template_full_name":null,"purl":"pkg:github/gitchaell/computer-scrapping","repository_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/gitchaell%2Fcomputer-scrapping","tags_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/gitchaell%2Fcomputer-scrapping/tags","releases_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/gitchaell%2Fcomputer-scrapping/releases","manifests_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/gitchaell%2Fcomputer-scrapping/manifests","owner_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/owners/gitchaell","download_url":"https://codeload.github.com/gitchaell/computer-scrapping/tar.gz/refs/heads/main","sbom_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/gitchaell%2Fcomputer-scrapping/sbom","scorecard":null,"host":{"name":"GitHub","url":"https://github.com","kind":"github","repositories_count":286080680,"owners_count":32481553,"icon_url":"https://github.com/github.png","version":null,"created_at":"2022-05-30T11:31:42.601Z","updated_at":"2026-04-30T13:12:12.517Z","status":"ssl_error","status_checked_at":"2026-04-30T13:12:06.837Z","response_time":57,"last_error":"SSL_read: unexpected eof while reading","robots_txt_status":"success","robots_txt_updated_at":"2025-07-24T06:49:26.215Z","robots_txt_url":"https://github.com/robots.txt","online":false,"can_crawl_api":true,"host_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub","repositories_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories","repository_names_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repository_names","owners_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/owners"}},"keywords":["data-analysis","data-structures","data-visualization","database","dbdiagram","export-excel","powerbi","scrapper-script","scrapping","xlsx"],"created_at":"2025-05-19T13:14:14.880Z","updated_at":"2026-05-01T00:31:51.540Z","avatar_url":"https://github.com/gitchaell.png","language":"TypeScript","funding_links":[],"categories":[],"sub_categories":[],"readme":"\u003c!--\nCreated: Thu Nov 18 2021 12:56:11 GMT-0400 (hora de Bolivia)\nModified: Wed Nov 24 2021 14:48:59 GMT-0400 (hora de Bolivia)\n--\u003e\n\n# Computer Scrapping\n\nTool that extracts data from the pages of companies that sell computers in the city of Trujillo - Peru, exports them in an XLSX file according to a relational data model, and displays them on a Power BI dashboard.\n\n## Objectives\n\n* Use web scraping techniques to the selected websites to obtain the data. (minimum 03 web pages and 02 web scraping techniques)\n* Build the Dashboard in Power BI that shows the dynamic analysis of the data\n\n## Steps\n\n### 1. Search for companies that sell computers in Trujillo\n\n* [Falabella](https://www.falabella.com.pe/)\n* [La Curacao](https://www.lacuracao.pe/)\n* [Oechsle](https://www.oechsle.pe/)\n* [Efe](https://www.efe.com.pe/)\n* [Hiraoka](https://hiraoka.com.pe/)\n* [Coolbox](https://www.coolbox.pe/)\n\n### 2. Design of the data model\n\n* [DB Diagram](https://dbdiagram.io/) - Data Modeling Tool\n\n![datamodel](https://raw.githubusercontent.com/MichaellAlavedraMunayco/computer-scrapping/main/.github/images/computers.database.png)\n\nDownload Data Model PDF File [here](https://raw.githubusercontent.com/MichaellAlavedraMunayco/computer-scrapping/main/.github/docs/computers.database.pdf)\n\n### 3. Search for tools for data extraction\n\n* Node JS - Javascript Engine\n* [Puppeteer](https://www.npmjs.com/package/puppeteer) - Web page manipulation tool\n* [Cheerio JS](https://www.npmjs.com/package/cheerio) - Web page querier tool like JQuery\n* [Excel JS](https://www.npmjs.com/package/exceljs) - Tool for exporting extracted data to XLSX file\n","project_url":"https://awesome.ecosyste.ms/api/v1/projects/github.com%2Fgitchaell%2Fcomputer-scrapping","html_url":"https://awesome.ecosyste.ms/projects/github.com%2Fgitchaell%2Fcomputer-scrapping","lists_url":"https://awesome.ecosyste.ms/api/v1/projects/github.com%2Fgitchaell%2Fcomputer-scrapping/lists"}