{"id":21166002,"url":"https://github.com/peterdee/parser-test","last_synced_at":"2026-04-11T10:37:09.348Z","repository":{"id":48739346,"uuid":"195374287","full_name":"peterdee/parser-test","owner":"peterdee","description":"Parser test task","archived":false,"fork":false,"pushed_at":"2021-07-13T06:50:04.000Z","size":98,"stargazers_count":0,"open_issues_count":0,"forks_count":0,"subscribers_count":1,"default_branch":"develop","last_synced_at":"2025-09-04T10:53:39.163Z","etag":null,"topics":["expressjs","nodejs","parsing"],"latest_commit_sha":null,"homepage":"http://localhost:3000","language":"JavaScript","has_issues":false,"has_wiki":null,"has_pages":null,"mirror_url":null,"source_name":null,"license":null,"status":null,"scm":"git","pull_requests_enabled":true,"icon_url":"https://github.com/peterdee.png","metadata":{"files":{"readme":"README.md","changelog":null,"contributing":null,"funding":null,"license":null,"code_of_conduct":null,"threat_model":null,"audit":null,"citation":null,"codeowners":null,"security":null,"support":null}},"created_at":"2019-07-05T08:48:14.000Z","updated_at":"2021-07-13T06:50:56.000Z","dependencies_parsed_at":"2022-09-19T09:41:07.663Z","dependency_job_id":null,"html_url":"https://github.com/peterdee/parser-test","commit_stats":null,"previous_names":[],"tags_count":0,"template":false,"template_full_name":null,"purl":"pkg:github/peterdee/parser-test","repository_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/peterdee%2Fparser-test","tags_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/peterdee%2Fparser-test/tags","releases_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/peterdee%2Fparser-test/releases","manifests_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/peterdee%2Fparser-test/manifests","owner_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/owners/peterdee","download_url":"https://codeload.github.com/peterdee/parser-test/tar.gz/refs/heads/develop","sbom_url":"https:
//repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/peterdee%2Fparser-test/sbom","scorecard":null,"host":{"name":"GitHub","url":"https://github.com","kind":"github","repositories_count":278283453,"owners_count":25961311,"icon_url":"https://github.com/github.png","version":null,"created_at":"2022-05-30T11:31:42.601Z","updated_at":"2022-07-04T15:15:14.044Z","status":"online","status_checked_at":"2025-10-04T02:00:05.491Z","response_time":63,"last_error":null,"robots_txt_status":"success","robots_txt_updated_at":"2025-07-24T06:49:26.215Z","robots_txt_url":"https://github.com/robots.txt","online":true,"can_crawl_api":true,"host_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub","repositories_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories","repository_names_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repository_names","owners_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/owners"}},"keywords":["expressjs","nodejs","parsing"],"created_at":"2024-11-20T14:47:52.223Z","updated_at":"2025-10-04T07:55:10.393Z","avatar_url":"https://github.com/peterdee.png","language":"JavaScript","readme":"Parser test task\n---\n\nTask:\n```\nPython Developer Sample Project\nThe task is to create a simple web scraper and display the results on a webpage.\n\nPython libraries you will use\n\nrequests\nBeautifulSoup4\nflask\nYou will be scraping the Auto's section of the New York Daily News website.\n\nTrucks Page: http://www.nydailynews.com/autos/types/truck\nSport Page: http://www.nydailynews.com/autos/types/sports-car\nScrape the list of vehicles reviewed (only results page 1) and store them locally (sqlite? json file? etc).\n\nThen, using flask, create a simple webpage endpoint that displays an HTML table showing the title (year/make/model) and summary of the vehicle. You can have 1 table for Trucks, and one for Sport.\n\nThis task should be limited to 4 hours maximum, and you will be paid for this time. 
If you have questions, or get stuck, feel free to reach out and ask us. When the task is complete, please send us a link to a github repository showing your work.\n\nBonus features:\n\nCrawl pagination (results page 2, 3, etc of the car lists)\nCrawl vehicle price (how is it loaded into the page? :) )\nAllow the crawler / flask app to be run inside a docker container\n```\n\nStack: `Node`, `Express`, `Sequelize`, `PostgreSQL`, `JQuery`\n\nDEV: http://localhost:3000\n\nRequires a `keys.js` file in the `bin` directory.\n\n### `keys.js` example\n\n```javascript\nconst DATABASE = {\n  dbname: '\u003cDATABASE_NAME\u003e',\n  host: '\u003cDATABASE_HOST\u003e',\n  password: '\u003cUSER_PASSWORD\u003e',\n  user: '\u003cUSER_NAME\u003e',\n};\n\nmodule.exports = {\n  DATABASE,\n};\n```\n\n### Deploy\n\n`git clone https://github.com/peterdee/parser-test.git`\n\n`cd parser-test`\n\n`createdb parserdb`\n\n`nvm use 12.6` (please check the required Node version in the [package.json](package.json))\n\n`npm i`\n\n### Sync database (optional)\n\n`npm run sync`\n\n### Drop database (optional)\n\n`npm run drop`\n\n### Launch\n\n`npm run dev`\n\n### How to use\n\n- Open the page: http://localhost:3000\n\n- Click on `SHOW RESULTS` to show parsing results\n\n- Click on `START PARSING` to perform parsing. Please note: **previous parsing results will be deleted!**\n","funding_links":[],"categories":[],"sub_categories":[],"project_url":"https://awesome.ecosyste.ms/api/v1/projects/github.com%2Fpeterdee%2Fparser-test","html_url":"https://awesome.ecosyste.ms/projects/github.com%2Fpeterdee%2Fparser-test","lists_url":"https://awesome.ecosyste.ms/api/v1/projects/github.com%2Fpeterdee%2Fparser-test/lists"}