{"id":28450476,"url":"https://github.com/infinitode/pwlds","last_synced_at":"2026-02-22T16:34:57.006Z","repository":{"id":285519793,"uuid":"846428373","full_name":"Infinitode/PWLDS","owner":"Infinitode","description":"A public dataset of over 10 million passwords, with assigned strength levels.","archived":false,"fork":false,"pushed_at":"2025-04-08T19:41:20.000Z","size":130386,"stargazers_count":2,"open_issues_count":0,"forks_count":2,"subscribers_count":1,"default_branch":"main","last_synced_at":"2025-10-26T13:43:35.484Z","etag":null,"topics":["ai","classes","classification","cyber-security","data","dataset","ml","open-source","password","passwords","synthetic-data"],"latest_commit_sha":null,"homepage":"https://infinitode.netlify.app","language":null,"has_issues":true,"has_wiki":null,"has_pages":null,"mirror_url":null,"source_name":null,"license":"other","status":null,"scm":"git","pull_requests_enabled":true,"icon_url":"https://github.com/Infinitode.png","metadata":{"files":{"readme":"README.md","changelog":null,"contributing":null,"funding":null,"license":"LICENSE","code_of_conduct":null,"threat_model":null,"audit":null,"citation":null,"codeowners":null,"security":null,"support":null,"governance":null,"roadmap":null,"authors":null,"dei":null,"publiccode":null,"codemeta":null,"zenodo":null}},"created_at":"2024-08-23T07:27:17.000Z","updated_at":"2025-04-08T19:41:24.000Z","dependencies_parsed_at":"2025-06-30T16:48:00.296Z","dependency_job_id":null,"html_url":"https://github.com/Infinitode/PWLDS","commit_stats":null,"previous_names":["infinitode/pwlds"],"tags_count":0,"template":false,"template_full_name":null,"purl":"pkg:github/Infinitode/PWLDS","repository_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/Infinitode%2FPWLDS","tags_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/Infinitode%2FPWLDS/tags","releases_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/Infinitode%2FPWLDS/releases","manifests_url":"https://repos
.ecosyste.ms/api/v1/hosts/GitHub/repositories/Infinitode%2FPWLDS/manifests","owner_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/owners/Infinitode","download_url":"https://codeload.github.com/Infinitode/PWLDS/tar.gz/refs/heads/main","sbom_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/Infinitode%2FPWLDS/sbom","scorecard":null,"host":{"name":"GitHub","url":"https://github.com","kind":"github","repositories_count":286080680,"owners_count":29718454,"icon_url":"https://github.com/github.png","version":null,"created_at":"2022-05-30T11:31:42.601Z","updated_at":"2026-02-22T15:10:41.462Z","status":"ssl_error","status_checked_at":"2026-02-22T15:10:04.636Z","response_time":110,"last_error":"SSL_read: unexpected eof while reading","robots_txt_status":"success","robots_txt_updated_at":"2025-07-24T06:49:26.215Z","robots_txt_url":"https://github.com/robots.txt","online":false,"can_crawl_api":true,"host_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub","repositories_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories","repository_names_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repository_names","owners_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/owners"}},"keywords":["ai","classes","classification","cyber-security","data","dataset","ml","open-source","password","passwords","synthetic-data"],"created_at":"2025-06-06T15:07:40.346Z","updated_at":"2026-02-22T16:34:56.978Z","avatar_url":"https://github.com/Infinitode.png","language":null,"funding_links":[],"categories":[],"sub_categories":[],"readme":"# PWLDS (Password Weakness and Level Dataset)\n\nPWLDS is a public dataset consisting of over 10 million passwords with varying assigned strength levels. The dataset was designed to help researchers, security professionals, and developers analyze password strength and build more secure systems. 
The dataset contains 5 classes:\n\n- `very_weak`: Represented by `0` in the dataset\n- `weak`: Represented by `1` in the dataset\n- `average`: Represented by `2` in the dataset\n- `strong`: Represented by `3` in the dataset\n- `very_strong`: Represented by `4` in the dataset\n\nFor strength level `4`, we used Python's `secrets` module to generate cryptographically secure random passwords.\n\n\u003e [!NOTE]\n\u003e Because GitHub limits individual files to 100 MB, `pwlds_full.csv` was removed in version 1.1. You can still download it from Hugging Face Datasets or recreate it yourself by combining all of the files into one.\n\n## Intended Uses\n\n- **Security Analysis**: PWLDS can be used to study common password patterns and weaknesses, helping to identify vulnerabilities and improve password policies.\n- **Machine Learning**: The dataset can be used to train machine learning models for password strength estimation or prediction.\n- **Educational Purposes**: This dataset is valuable for educational projects and demonstrations related to cybersecurity and data science.\n- **Benchmarking**: PWLDS provides a large, labeled dataset for benchmarking password strength estimation algorithms and tools.\n\n## Limitations\n\n- **No Real-World Data**: The dataset is synthetically generated and does not contain real user passwords. While this avoids privacy concerns, it may not fully represent real-world password distributions and usage patterns.\n- **Bias in Password Generation**: Because passwords are generated from predefined rules and patterns, the dataset may carry biases that do not reflect the diversity of passwords used in real-life scenarios.\n- **Exclusivity to English Words**: The weak and very weak password categories rely heavily on English words, so they may not be representative of passwords chosen by non-English-speaking populations.\n\n## How to Use the Dataset\n\n1. 
**Download the Dataset**: You can download the dataset from [GitHub](https://github.com/Infinitode/PWLDS) or from [Hugging Face Datasets](https://huggingface.co/datasets/InfinitodeLTD/PWLDS/).\n2. **Load the Dataset**: Use Python, R, or any data processing tool to load and analyze the dataset. The data is provided in CSV format.\n3. **Data Structure**: Each entry in the dataset includes a password and its associated strength level (0-4). The column labels are `Password` and `Strength_Level`, respectively.\n\n## Citation\n\nWhen using this dataset, please cite as follows:\n\n```\nDataset Title: Password Weakness and Level Dataset (PWLDS)\nAuthor: Infinitode Pty Ltd\nDate: 2024\nSource: https://github.com/Infinitode/PWLDS\nLicense: Creative Commons Attribution 4.0 International (CC BY 4.0)\n```\n\n## License\n\nThis dataset is licensed under the Creative Commons Attribution 4.0 International License (CC BY 4.0). See the LICENSE file for more details.\n\n## Contact\n\nFor questions or suggestions, please contact us through our website or open an issue on the project's GitHub repository.\n","project_url":"https://awesome.ecosyste.ms/api/v1/projects/github.com%2Finfinitode%2Fpwlds","html_url":"https://awesome.ecosyste.ms/projects/github.com%2Finfinitode%2Fpwlds","lists_url":"https://awesome.ecosyste.ms/api/v1/projects/github.com%2Finfinitode%2Fpwlds/lists"}