{"id":13467847,"url":"https://github.com/EBazarov/nsfw_data_source_urls","last_synced_at":"2025-03-26T03:31:11.504Z","repository":{"id":38804501,"uuid":"170478079","full_name":"EBazarov/nsfw_data_source_urls","owner":"EBazarov","description":"Collection of NSFW images URLs for the purposes of training an NSFW Image Classifier","archived":false,"fork":false,"pushed_at":"2020-12-14T09:40:00.000Z","size":28570,"stargazers_count":3408,"open_issues_count":6,"forks_count":739,"subscribers_count":131,"default_branch":"master","last_synced_at":"2025-03-24T15:47:57.275Z","etag":null,"topics":[],"latest_commit_sha":null,"homepage":null,"language":null,"has_issues":true,"has_wiki":null,"has_pages":null,"mirror_url":null,"source_name":null,"license":"mit","status":null,"scm":"git","pull_requests_enabled":true,"icon_url":"https://github.com/EBazarov.png","metadata":{"files":{"readme":"README.md","changelog":null,"contributing":null,"funding":null,"license":"LICENSE","code_of_conduct":null,"threat_model":null,"audit":null,"citation":null,"codeowners":null,"security":null,"support":null}},"created_at":"2019-02-13T09:21:38.000Z","updated_at":"2025-03-12T03:46:21.000Z","dependencies_parsed_at":"2022-07-10T13:30:19.316Z","dependency_job_id":null,"html_url":"https://github.com/EBazarov/nsfw_data_source_urls","commit_stats":null,"previous_names":[],"tags_count":0,"template":false,"template_full_name":null,"repository_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/EBazarov%2Fnsfw_data_source_urls","tags_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/EBazarov%2Fnsfw_data_source_urls/tags","releases_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/EBazarov%2Fnsfw_data_source_urls/releases","manifests_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/EBazarov%2Fnsfw_data_source_urls/manifests","owner_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/owners/EBazarov","download_url":"https://codeload.github.com/EBazarov/nsfw_data_source_urls/tar.gz/refs/heads/master","host":{"name":"GitHub","url":"https://github.com","kind":"github","repositories_count":245449629,"owners_count":20617190,"icon_url":"https://github.com/github.png","version":null,"created_at":"2022-05-30T11:31:42.601Z","updated_at":"2022-07-04T15:15:14.044Z","host_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub","repositories_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories","repository_names_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repository_names","owners_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/owners"}},"keywords":[],"created_at":"2024-07-31T15:01:01.356Z","updated_at":"2025-03-26T03:31:11.489Z","avatar_url":"https://github.com/EBazarov.png","language":null,"funding_links":[],"categories":["Others","其他_机器视觉","Part 3 : Dataset"],"sub_categories":["网络服务_其他","Code Repositories"],"readme":"# NSFW data source URLs\n\n## Description\n\nRepository contains lists of URLs that will help you download NSFW images, this set can be used in building big enough dataset to train robust NSFM classification model.\n\nThis work inspired by [nsfw_data_scrapper](https://github.com/alexkimxyz/nsfw_data_scrapper) and for downloading images suggested to use scripts from the scrapper.\n\n\n## Stats\n\nIn folder `raw_data` you will find different `txt` files each of them contains list of URLs, here some stats for this set:\n\n- **159** different categories\n- in total **1 589 331** URLs\n- after downloading and cleaning it's possible to have ~ **500GB** or in other words ~ **1 300 000** of NSFW images\n\n|                          file name                           | number of URLs |\n|--------------------------------------------------------------|----------------|\n| urls_age_college.txt                                         |      2949      |\n| urls_age_mature.txt                                          |      5942      |\n| urls_age_milf.txt                                            |      8503      |\n| urls_age_teen.txt                                            |      5389      |\n| urls_amateur.txt                                             |     13033      |\n| urls_amateur_self-shots.txt                                  |     10306      |\n| urls_appearance.txt                                          |      2734      |\n| urls_appearance_appearance-modification.txt                  |      3795      |\n| urls_appearance_appearance-modification_piercings.txt        |      1339      |\n| urls_appearance_appearance-modification_tattoos.txt          |      1983      |\n| urls_appearance_clothing.txt                                 |     24924      |\n| urls_appearance_clothing_bodyparts-through-clothes.txt       |      6691      |\n| urls_appearance_clothing_bottomless.txt                      |      2390      |\n| urls_appearance_clothing_clothed-naked-pair.txt              |      1274      |\n| urls_appearance_clothing_dresses.txt                         |      4360      |\n| urls_appearance_clothing_shoes.txt                           |      1238      |\n| urls_appearance_clothing_stockings.txt                       |      2556      |\n| urls_appearance_clothing_swimwear.txt                        |      741       |\n| urls_appearance_clothing_tight-clothing.txt                  |     11522      |\n| urls_appearance_clothing_topless.txt                         |      1009      |\n| urls_appearance_clothing_underwear.txt                       |      3190      |\n| urls_appearance_clothing_underwear_panties.txt               |      9512      |\n| urls_appearance_clothing_underwear_thongs.txt                |      2636      |\n| urls_appearance_clothing_uniforms-outfits.txt                |     15390      |\n| urls_appearance_clothing_uniforms-outfits_cosplay.txt        |      6465      |\n| urls_appearance_clothing_upskirt-downblouse.txt              |      2599      |\n| urls_appearance_expressions.txt                              |      1396      |\n| urls_appearance_pose.txt                                     |      8377      |\n| urls_appearance_wet-\u0026-messy.txt                              |      9169      |\n| urls_artificial-images.txt                                   |     247993     |\n| urls_artificial-images_fictional-characters-shows.txt        |     73349      |\n| urls_artificial-images_hentai.txt                            |     81178      |\n| urls_artificial-images_photoshop.txt                         |     10146      |\n| urls_body-parts_head_hair.txt                                |      1797      |\n| urls_body-parts_head_hair_blonde.txt                         |      6227      |\n| urls_body-parts_head_hair_brunette.txt                       |      2022      |\n| urls_body-parts_head_hair_dyed.txt                           |      1011      |\n| urls_body-parts_head_hair_hairstyle.txt                      |      6946      |\n| urls_body-parts_head_hair_redhead.txt                        |      4725      |\n| urls_body-parts_head_lips-mouth.txt                          |      4449      |\n| urls_body-parts_lower-body.txt                               |      2136      |\n| urls_body-parts_lower-body_ass.txt                           |      9420      |\n| urls_body-parts_lower-body_ass_large.txt                     |      3654      |\n| urls_body-parts_lower-body_asshole.txt                       |      1826      |\n| urls_body-parts_lower-body_feet.txt                          |      3539      |\n| urls_body-parts_lower-body_gap.txt                           |      1332      |\n| urls_body-parts_lower-body_genitalia_penis.txt               |      6611      |\n| urls_body-parts_lower-body_genitalia_penis_large.txt         |      1607      |\n| urls_body-parts_lower-body_genitalia_penis_small.txt         |      2233      |\n| urls_body-parts_lower-body_genitalia_vulva.txt               |     12746      |\n| urls_body-parts_lower-body_genitalia_vulva_hair.txt          |     12085      |\n| urls_body-parts_lower-body_genitalia_vulva_labia.txt         |      5037      |\n| urls_body-parts_lower-body_hips.txt                          |      3490      |\n| urls_body-parts_lower-body_legs.txt                          |      3104      |\n| urls_body-parts_upper-body.txt                               |      4465      |\n| urls_body-parts_upper-body_breasts.txt                       |     11962      |\n| urls_body-parts_upper-body_breasts_from-an-angle.txt         |      7196      |\n| urls_body-parts_upper-body_breasts_implants.txt              |      3913      |\n| urls_body-parts_upper-body_breasts_large.txt                 |     11582      |\n| urls_body-parts_upper-body_breasts_nipples.txt               |      4383      |\n| urls_body-parts_upper-body_breasts_small.txt                 |      3094      |\n| urls_body-traits_complexion_freckles.txt                     |      2309      |\n| urls_body-traits_complexion_light-skin.txt                   |      1436      |\n| urls_body-traits_complexion_tan.txt                          |      827       |\n| urls_body-traits_traits.txt                                  |      157       |\n| urls_body-traits_traits_flexible.txt                         |      862       |\n| urls_body-traits_traits_pregnant.txt                         |      2674      |\n| urls_body-traits_types_bbw.txt                               |      8160      |\n| urls_body-traits_types_chubby.txt                            |      8207      |\n| urls_body-traits_types_curvy.txt                             |      1799      |\n| urls_body-traits_types_petite.txt                            |      2305      |\n| urls_body-traits_types_skinny-thin.txt                       |      4560      |\n| urls_classic-vintage.txt                                     |     16532      |\n| urls_communities.txt                                         |     12500      |\n| urls_communities_identification.txt                          |      1507      |\n| urls_communities_personals.txt                               |      1106      |\n| urls_communities_role-play.txt                               |      226       |\n| urls_cum-play_cum.txt                                        |      4514      |\n| urls_cum-play_cum_creampie.txt                               |      1493      |\n| urls_cum-play_cum_cum-shot.txt                               |      4719      |\n| urls_cum-play_cum_cum-shot_bukkake.txt                       |      1042      |\n| urls_cum-play_cum_cum-shot_facial.txt                        |      2458      |\n| urls_cum-play_cum_swallowing.txt                             |       51       |\n| urls_cum-play_female.txt                                     |      921       |\n| urls_ethnicity.txt                                           |     19675      |\n| urls_ethnicity_asian.txt                                     |     26674      |\n| urls_ethnicity_black.txt                                     |      4220      |\n| urls_ethnicity_euro.txt                                      |      3949      |\n| urls_ethnicity_indian.txt                                    |     11195      |\n| urls_ethnicity_japanese.txt                                  |      8109      |\n| urls_exhibition.txt                                          |       10       |\n| urls_exhibition_gonewild.txt                                 |     96718      |\n| urls_exhibition_public.txt                                   |     15066      |\n| urls_fetish.txt                                              |     22656      |\n| urls_fetish_bdsm.txt                                         |      3301      |\n| urls_fetish_bdsm_bondage.txt                                 |      8962      |\n| urls_fetish_bdsm_domination-\u0026-submission.txt                 |     13608      |\n| urls_fetish_bdsm_domination-\u0026-submission_femdom.txt          |      9205      |\n| urls_fetish_drugs.txt                                        |      1171      |\n| urls_fetish_role-enactment.txt                               |      942       |\n| urls_fetish_role-enactment_age-play.txt                      |      2053      |\n| urls_fetish_role-enactment_furry.txt                         |      2455      |\n| urls_fetish_role-enactment_pet-play.txt                      |      1270      |\n| urls_fetish_role-enactment_rape-abuse.txt                    |      1091      |\n| urls_fetish_watersports.txt                                  |      5128      |\n| urls_general-categories.txt                                  |     212869     |\n| urls_general-categories_artistic-or-borderline-porn.txt      |      8944      |\n| urls_general-categories_desktop-wallpaper.txt                |     20173      |\n| urls_general-categories_gifs.txt                             |      1228      |\n| urls_general-categories_humorous.txt                         |      1909      |\n| urls_general-categories_p.o.v..txt                           |      1025      |\n| urls_general-categories_passionate.txt                       |      781       |\n| urls_general-categories_porn-for-women.txt                   |       31       |\n| urls_general-categories_videos.txt                           |      400       |\n| urls_groups.txt                                              |       97       |\n| urls_groups_alt.txt                                          |     10321      |\n| urls_groups_athlete.txt                                      |      7719      |\n| urls_groups_camgirl.txt                                      |      4321      |\n| urls_groups_celebrity.txt                                    |     46437      |\n| urls_groups_country.txt                                      |      787       |\n| urls_groups_nerd.txt                                         |      3742      |\n| urls_groups_pornstar.txt                                     |      3860      |\n| urls_groups_pornstar_pornstar-lookalike.txt                  |       0        |\n| urls_groups_religious.txt                                    |      1054      |\n| urls_groups_specific-personality.txt                         |      4012      |\n| urls_illegal-taboo.txt                                       |       0        |\n| urls_illegal-taboo_bestiality.txt                            |       0        |\n| urls_illegal-taboo_incest.txt                                |      3816      |\n| urls_illegal-taboo_voyeurism.txt                             |      439       |\n| urls_lgbt_bisexual.txt                                       |      1244      |\n| urls_lgbt_crossdressing.txt                                  |      2443      |\n| urls_lgbt_gay.txt                                            |     19812      |\n| urls_lgbt_lesbian.txt                                        |      5179      |\n| urls_lgbt_transgender.txt                                    |      719       |\n| urls_lgbt_transsexual.txt                                    |     13106      |\n| urls_literary.txt                                            |      1953      |\n| urls_locations_man-made.txt                                  |      3869      |\n| urls_locations_nature.txt                                    |      3831      |\n| urls_locations_nature_beach.txt                              |      4698      |\n| urls_non-porn-nsfw.txt                                       |     21389      |\n| urls_sex.txt                                                 |      1313      |\n| urls_sex_anal.txt                                            |      4683      |\n| urls_sex_anal_gaping.txt                                     |      754       |\n| urls_sex_anal_rimming.txt                                    |      688       |\n| urls_sex_breasts.txt                                         |      176       |\n| urls_sex_fisting.txt                                         |      1033      |\n| urls_sex_group.txt                                           |      1134      |\n| urls_sex_group_large-group.txt                               |      2989      |\n| urls_sex_group_swinging.txt                                  |      4466      |\n| urls_sex_group_threesome.txt                                 |      1747      |\n| urls_sex_insertion.txt                                       |      4344      |\n| urls_sex_interracial.txt                                     |      906       |\n| urls_sex_masturbation.txt                                    |      2032      |\n| urls_sex_oral.txt                                            |      4155      |\n| urls_sex_orgasm.txt                                          |      327       |\n| urls_sex_toys.txt                                            |      6710      |\n| urls_specific-actor-actress.txt                              |     52409      |\n| urls_specific-company.txt                                    |     18763      |\n| urls_wtf.txt                                                 |      4001      |\n\n\n## NOTE\n\n1. After downloading is highly suggested to clean your dataset, for example:\n\t- delete duplicates\n\t- remove images that was banned/deleted (they have a special image placeholder)\n\t- find out corrupted data and remove it also\n\t- etc\n2. Pay attention to noise, some resources provide highly mixed data of NSFW and neutral images\n3. This repository helps in retrieving NSFW images and there's no special URLs for neutral content\n\n","project_url":"https://awesome.ecosyste.ms/api/v1/projects/github.com%2FEBazarov%2Fnsfw_data_source_urls","html_url":"https://awesome.ecosyste.ms/projects/github.com%2FEBazarov%2Fnsfw_data_source_urls","lists_url":"https://awesome.ecosyste.ms/api/v1/projects/github.com%2FEBazarov%2Fnsfw_data_source_urls/lists"}