{"id":21444197,"url":"https://github.com/jack-development/redditfetch","last_synced_at":"2026-05-18T22:04:13.174Z","repository":{"id":189170825,"uuid":"679560104","full_name":"Jack-Development/RedditFetch","owner":"Jack-Development","description":"RedditFetch is a robust tool for collecting and managing Reddit user data using Python and PRAW. It fetches posts and comments, assigns unique IDs, and structures the data seamlessly for easy access and analysis.","archived":false,"fork":false,"pushed_at":"2023-08-18T14:52:19.000Z","size":36,"stargazers_count":0,"open_issues_count":0,"forks_count":0,"subscribers_count":1,"default_branch":"main","last_synced_at":"2025-01-23T10:45:28.986Z","etag":null,"topics":["api","dataset-generation","praw","pytorch","reddit"],"latest_commit_sha":null,"homepage":"","language":"Python","has_issues":true,"has_wiki":null,"has_pages":null,"mirror_url":null,"source_name":null,"license":"mit","status":null,"scm":"git","pull_requests_enabled":true,"icon_url":"https://github.com/Jack-Development.png","metadata":{"files":{"readme":"README.md","changelog":null,"contributing":null,"funding":null,"license":"LICENSE","code_of_conduct":null,"threat_model":null,"audit":null,"citation":null,"codeowners":null,"security":null,"support":null,"governance":null}},"created_at":"2023-08-17T05:49:46.000Z","updated_at":"2023-08-18T14:52:39.000Z","dependencies_parsed_at":"2023-08-18T16:21:47.415Z","dependency_job_id":null,"html_url":"https://github.com/Jack-Development/RedditFetch","commit_stats":null,"previous_names":["jack-development/redditdatapipeline"],"tags_count":0,"template":false,"template_full_name":null,"repository_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/Jack-Development%2FRedditFetch","tags_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/Jack-Development%2FRedditFetch/tags","releases_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/Jack-Development%2FRedditFetch/releases","manifests_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/Jack-Development%2FRedditFetch/manifests","owner_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/owners/Jack-Development","download_url":"https://codeload.github.com/Jack-Development/RedditFetch/tar.gz/refs/heads/main","host":{"name":"GitHub","url":"https://github.com","kind":"github","repositories_count":243955804,"owners_count":20374373,"icon_url":"https://github.com/github.png","version":null,"created_at":"2022-05-30T11:31:42.601Z","updated_at":"2022-07-04T15:15:14.044Z","host_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub","repositories_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories","repository_names_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repository_names","owners_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/owners"}},"keywords":["api","dataset-generation","praw","pytorch","reddit"],"created_at":"2024-11-23T02:16:35.267Z","updated_at":"2026-05-18T22:04:08.139Z","avatar_url":"https://github.com/Jack-Development.png","language":"Python","funding_links":[],"categories":[],"sub_categories":[],"readme":"\u0026nbsp;\n\u003cdiv id=\"header\" align=\"center\"\u003e\n  \u003cimg src=\"https://github.com/Jack-Development/RedditFetch/blob/main/Resources/logo.png\" width=\"300\"/\u003e\n\u003c/div\u003e\n\u0026nbsp;\n\n# RedditFetch\n\nRedditFetch is a robust and efficient tool for collecting and managing datasets from Reddit. This repository is designed to fetch posts and comments from specified Reddit users, assign unique IDs to users, and save the data in a structured manner.\n\nInspired by the vast amount of data available on Reddit and the need for a streamlined data collection process, this project was developed to provide a seamless experience for researchers and developers interested in Reddit data.\n\nThe initial implementation focuses on user-specific data collection, but the modular architecture of the codebase allows for potential expansion to other Reddit data types.\n\n## Skills and Technologies Used\n\nThe project heavily relies on:\n\n- Python\n- PRAW (Python Reddit API Wrapper)\n- JSON\n\n\u003cdiv\u003e\n  \u003ccode\u003e\u003cimg height=\"50\" src=\"https://github.com/devicons/devicon/blob/master/icons/python/python-original.svg\" alt=\"python\"\u003e\u003c/code\u003e\n  \u003ccode\u003e\u003cimg height=\"50\" src=\"https://github.com/Jack-Development/Jack-Development/blob/main/resources/reddit_logo.svg\" alt=\"praw\"\u003e\u003c/code\u003e \u003c!-- Note: This is a placeholder as there might is not a PRAW logo --\u003e\n  \u003ccode\u003e\u003cimg height=\"50\" src=\"https://github.com/Jack-Development/Jack-Development/blob/main/resources/json_logo.png\" alt=\"json\"\u003e\u003c/code\u003e\n\u003c/div\u003e\n\n## Getting Started\n\n_Coming soon..._\n\nA comprehensive guide on how to utilize this project will be available soon. The guide will detail the steps to set up the pipeline, fetch data, and manage the datasets effectively.\n\n## Contributing\n\nContributions, issues, and feature requests are welcome. If you're interested in enhancing the capabilities of RedditDataPipeline or have found any bugs, please check the [issues page](https://github.com/Jack-Development/RedditDataPipeline/issues).\n\n## License\n\nThis project is [MIT](https://choosealicense.com/licenses/mit/) licensed.\n","project_url":"https://awesome.ecosyste.ms/api/v1/projects/github.com%2Fjack-development%2Fredditfetch","html_url":"https://awesome.ecosyste.ms/projects/github.com%2Fjack-development%2Fredditfetch","lists_url":"https://awesome.ecosyste.ms/api/v1/projects/github.com%2Fjack-development%2Fredditfetch/lists"}