{"id":17382512,"url":"https://github.com/usrrname/aryscraper","last_synced_at":"2025-09-09T04:49:27.037Z","repository":{"id":37248295,"uuid":"453798487","full_name":"usrrname/aryscraper","owner":"usrrname","description":"A series of python3 scripts written for creating an image data set","archived":false,"fork":false,"pushed_at":"2022-07-14T18:05:00.000Z","size":614,"stargazers_count":0,"open_issues_count":0,"forks_count":0,"subscribers_count":1,"default_branch":"main","last_synced_at":"2025-03-27T20:47:38.426Z","etag":null,"topics":["python","python3","scraper"],"latest_commit_sha":null,"homepage":"","language":"Python","has_issues":true,"has_wiki":null,"has_pages":null,"mirror_url":null,"source_name":null,"license":null,"status":null,"scm":"git","pull_requests_enabled":true,"icon_url":"https://github.com/usrrname.png","metadata":{"files":{"readme":"README.md","changelog":null,"contributing":null,"funding":null,"license":null,"code_of_conduct":null,"threat_model":null,"audit":null,"citation":null,"codeowners":null,"security":null,"support":null}},"created_at":"2022-01-30T20:51:12.000Z","updated_at":"2022-05-19T22:46:48.000Z","dependencies_parsed_at":"2022-08-18T03:15:28.089Z","dependency_job_id":null,"html_url":"https://github.com/usrrname/aryscraper","commit_stats":null,"previous_names":[],"tags_count":0,"template":false,"template_full_name":null,"purl":"pkg:github/usrrname/aryscraper","repository_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/usrrname%2Faryscraper","tags_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/usrrname%2Faryscraper/tags","releases_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/usrrname%2Faryscraper/releases","manifests_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/usrrname%2Faryscraper/manifests","owner_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/owners/usrrname","download_url":"https://codeload.github.com/usrrname/aryscraper/tar.gz/refs/heads/main","sbom_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/usrrname%2Faryscraper/sbom","host":{"name":"GitHub","url":"https://github.com","kind":"github","repositories_count":262180947,"owners_count":23271313,"icon_url":"https://github.com/github.png","version":null,"created_at":"2022-05-30T11:31:42.601Z","updated_at":"2022-07-04T15:15:14.044Z","host_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub","repositories_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories","repository_names_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repository_names","owners_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/owners"}},"keywords":["python","python3","scraper"],"created_at":"2024-10-16T07:36:40.207Z","updated_at":"2025-06-27T03:06:40.761Z","avatar_url":"https://github.com/usrrname.png","language":"Python","funding_links":[],"categories":[],"sub_categories":[],"readme":"Aryan scraper\n============\n### WORK IN PROGRESS\n\nThis is one of multiple tools I'm working on as a _Black Mirror_-style proof-of-concept tool to demonstrate the ongoing harm of racial profiling, and how it may be perpetuated by face recognition and machine learning.\n\nThis repository shows a series of scripts I used to:\n\n- [x] scrape the names of the SS high command from the Wikipedia page `scrape-names.py` and output them as a .csv\n- [x] use them to create folders `create-named-folders.py`\n- [x] finds portraits of their faces with serpapi/google search image api `scraper.py`\n- [x] extracts metadata for each name and saves it all as a .json file `wiki.py`\n- [x] some elementary face extraction in `haar_cascade.py`\n\n## About\n\nAryScraper is the scraper that created the male and female face \"Aryan\" data sets using photos of Holocaust perpetrators and the SS high command. Note: This whole exercise is arcane and not rooted in science but the interest was to use AI to see if we could perform some hyperblic \"categorization\" or averaging on samples of self-proclaimed \"Aryans\".\n\nSuch gestures in public take after \"tactical media\" or \"tactical technology\".\n\nAlthough images here were compiled from publicly available sources, the dataset nor model is available here to prevent harmful misuse.\n\n## Folder Structure\n\n```\n.\n├── Makefile\n├── .github               // github actions\n├── README.md\n├── requirements.txt\n├── __pycache__\n├── test\n├── automate_scraping.py\n├── create_folders.py\n├── haar_cascade.py\n├── names.py\n├── scrape_names.py\n├── scraper.py\n├── ss-ranks.csv\n├── util.py\n├── wiki.py\n├── men                   // images of male faces\n└── women                 // images of female faces\n```\n","project_url":"https://awesome.ecosyste.ms/api/v1/projects/github.com%2Fusrrname%2Faryscraper","html_url":"https://awesome.ecosyste.ms/projects/github.com%2Fusrrname%2Faryscraper","lists_url":"https://awesome.ecosyste.ms/api/v1/projects/github.com%2Fusrrname%2Faryscraper/lists"}