{"id":31959429,"url":"https://github.com/huggingface/personas","last_synced_at":"2025-10-14T15:32:18.778Z","repository":{"id":40346515,"uuid":"81720853","full_name":"huggingface/personas","owner":"huggingface","description":"Datasets for Deep learning Personas","archived":false,"fork":false,"pushed_at":"2017-12-27T01:35:42.000Z","size":2,"stargazers_count":62,"open_issues_count":3,"forks_count":24,"subscribers_count":7,"default_branch":"master","last_synced_at":"2025-09-30T18:02:30.480Z","etag":null,"topics":["datasets","deep-learning","neural-conversation-models"],"latest_commit_sha":null,"homepage":null,"language":null,"has_issues":true,"has_wiki":null,"has_pages":null,"mirror_url":null,"source_name":null,"license":null,"status":null,"scm":"git","pull_requests_enabled":true,"icon_url":"https://github.com/huggingface.png","metadata":{"files":{"readme":"README.md","changelog":null,"contributing":null,"funding":null,"license":null,"code_of_conduct":null,"threat_model":null,"audit":null,"citation":null,"codeowners":null,"security":null,"support":null}},"created_at":"2017-02-12T11:22:42.000Z","updated_at":"2025-09-14T14:34:15.000Z","dependencies_parsed_at":"2022-08-27T18:51:55.657Z","dependency_job_id":null,"html_url":"https://github.com/huggingface/personas","commit_stats":null,"previous_names":[],"tags_count":0,"template":false,"template_full_name":null,"purl":"pkg:github/huggingface/personas","repository_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/huggingface%2Fpersonas","tags_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/huggingface%2Fpersonas/tags","releases_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/huggingface%2Fpersonas/releases","manifests_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/huggingface%2Fpersonas/manifests","owner_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/owners/huggingface","download_url":"https://codeload.github.com/huggingface/personas/tar.gz/refs/heads/mas
ter","sbom_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/huggingface%2Fpersonas/sbom","scorecard":null,"host":{"name":"GitHub","url":"https://github.com","kind":"github","repositories_count":279019322,"owners_count":26086711,"icon_url":"https://github.com/github.png","version":null,"created_at":"2022-05-30T11:31:42.601Z","updated_at":"2022-07-04T15:15:14.044Z","status":"online","status_checked_at":"2025-10-14T02:00:06.444Z","response_time":60,"last_error":null,"robots_txt_status":"success","robots_txt_updated_at":"2025-07-24T06:49:26.215Z","robots_txt_url":"https://github.com/robots.txt","online":true,"can_crawl_api":true,"host_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub","repositories_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories","repository_names_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repository_names","owners_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/owners"}},"keywords":["datasets","deep-learning","neural-conversation-models"],"created_at":"2025-10-14T15:30:43.729Z","updated_at":"2025-10-14T15:32:18.773Z","avatar_url":"https://github.com/huggingface.png","language":null,"funding_links":[],"categories":[],"sub_categories":[],"readme":"# personas\nDatasets for Deep learning Personas\n\n***TL;DR:*** These are the datasets that we've used in our fun AI side project experiment, over at https://personas.huggingface.co/\n\nWe've trained seq2seq models using [DeepQA](https://github.com/Conchylicultor/DeepQA), a tensorflow implementation of \"A neural conversational model\" (a.k.a. 
the Google paper), a deep-learning-based chatbot.\n\n## Datasets used\n\n * [Cornell Movie Dialogs](http://www.cs.cornell.edu/~cristian/Cornell_Movie-Dialogs_Corpus.html) corpus\n * Supreme Court Conversation Data.\n * [Ubuntu Dialogue Corpus](https://arxiv.org/abs/1506.08909) for tech-support-style discussion.\n * [Stack Exchange Data Dump](https://archive.org/details/stackexchange)\n\nThe Stack Exchange Data Dump is an anonymized dump of all user-contributed content on the Stack Exchange network. Each site is formatted as a separate archive consisting of XML files zipped via 7-Zip using bzip2 compression. Each site archive includes Posts, Users, Votes, Comments, PostHistory and PostLinks. For complete schema information, see the included readme.txt.\n\nAttribution: CC BY-SA 3.0\n","project_url":"https://awesome.ecosyste.ms/api/v1/projects/github.com%2Fhuggingface%2Fpersonas","html_url":"https://awesome.ecosyste.ms/projects/github.com%2Fhuggingface%2Fpersonas","lists_url":"https://awesome.ecosyste.ms/api/v1/projects/github.com%2Fhuggingface%2Fpersonas/lists"}