{"id":17793006,"url":"https://github.com/evcu/propertycrawler","last_synced_at":"2026-01-11T22:59:36.496Z","repository":{"id":79244281,"uuid":"36425684","full_name":"evcu/propertycrawler","owner":"evcu","description":"Property Advert Crawler ","archived":false,"fork":false,"pushed_at":"2015-05-28T09:22:26.000Z","size":160,"stargazers_count":0,"open_issues_count":0,"forks_count":0,"subscribers_count":2,"default_branch":"master","last_synced_at":"2025-02-07T17:13:49.836Z","etag":null,"topics":[],"latest_commit_sha":null,"homepage":null,"language":"Matlab","has_issues":true,"has_wiki":null,"has_pages":null,"mirror_url":null,"source_name":null,"license":"gpl-2.0","status":null,"scm":"git","pull_requests_enabled":true,"icon_url":"https://github.com/evcu.png","metadata":{"files":{"readme":"README.md","changelog":null,"contributing":null,"funding":null,"license":"LICENSE","code_of_conduct":null,"threat_model":null,"audit":null,"citation":null,"codeowners":null,"security":null,"support":null}},"created_at":"2015-05-28T08:40:49.000Z","updated_at":"2015-05-28T09:16:33.000Z","dependencies_parsed_at":"2023-02-24T18:45:39.413Z","dependency_job_id":null,"html_url":"https://github.com/evcu/propertycrawler","commit_stats":null,"previous_names":[],"tags_count":0,"template":false,"template_full_name":null,"repository_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/evcu%2Fpropertycrawler","tags_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/evcu%2Fpropertycrawler/tags","releases_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/evcu%2Fpropertycrawler/releases","manifests_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/evcu%2Fpropertycrawler/manifests","owner_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/owners/evcu","download_url":"https://codeload.github.com/evcu/propertycrawler/tar.gz/refs/heads/master","host":{"name":"GitHub","url":"https://github.com","kind":"github","repositories_count":246741122,"owners_cou
nt":20826067,"icon_url":"https://github.com/github.png","version":null,"created_at":"2022-05-30T11:31:42.601Z","updated_at":"2022-07-04T15:15:14.044Z","host_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub","repositories_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories","repository_names_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repository_names","owners_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/owners"}},"keywords":[],"created_at":"2024-10-27T11:03:42.293Z","updated_at":"2026-01-11T22:59:36.488Z","avatar_url":"https://github.com/evcu.png","language":"Matlab","funding_links":[],"categories":[],"sub_categories":[],"readme":"# propertycrawler\nProperty Advert Crawler \n\nThis repo contains the code written for the web-page crawler of the property price estimation project PROPER. \n\n###The crawler basically has the following layers in its structure:\n\n#####CrawlNow(.*)\n#####|\n#####{CrawlHur(.*),CrawlSah(.*)}\n#####|\n#####{GetProperStartPages.+(.*),CrawlPage(.*)}\n#####|\n#####CrawlAdvert(.*)\n#####|\n#####PropertyGetter(.*)\n*The data fields are as follows.\n1. d1=Ilan No\n2. d2=Fiyat\n3. d3=Konum_Latitude\n4. d4=Konum_Longitude\n5. d5=Sehir\n6. d6=Ilce\n7. d7=Mah/Koy\n8. d8=Ilan Tarihi\n9. d9=m2\n10. d10=Oda Sayisi\n11. d11=Banyo Sayisi\n12. d12=Bina Yasi\n13. d13=Kat Sayisi\n14. d14=Bulundugu Kat\n15. d15=Isitma\n16. d16=KullanimDurumu\n17. d17=SiteIcerisinde\n18. d18=Krediye Uygun?\n19. d19=Kimden\n20. d20=html\n\n###After crawling the data into a CSV-like structure, I preprocess it for use with the UFLDL code that I completed, using the following:\n\n#####PrepareData(.*)\n#####|\n#####preProcessor(.*)\n\n####I also used the GetStatistics(.*) method to get the unique instances of the fields. \n\n##TODOs\n-Improve PrepareData so that it asks the user how to represent the data and remembers the choice afterwards, which would be easy. \n-Improve crawling so that no manual entries exist in GetProperStartPages.\n-Improve data representation. 
Binary feature data is needed for a lot of things. \n\n \n","project_url":"https://awesome.ecosyste.ms/api/v1/projects/github.com%2Fevcu%2Fpropertycrawler","html_url":"https://awesome.ecosyste.ms/projects/github.com%2Fevcu%2Fpropertycrawler","lists_url":"https://awesome.ecosyste.ms/api/v1/projects/github.com%2Fevcu%2Fpropertycrawler/lists"}