{"id":18561804,"url":"https://github.com/pseudomanifold/shakespeare","last_synced_at":"2026-03-16T20:31:57.270Z","repository":{"id":92126030,"uuid":"64840349","full_name":"Pseudomanifold/Shakespeare","owner":"Pseudomanifold","description":"Code and data for extracting co-occurrence networks from Shakespeare's plays","archived":false,"fork":false,"pushed_at":"2025-01-16T21:40:15.000Z","size":5046,"stargazers_count":14,"open_issues_count":0,"forks_count":6,"subscribers_count":2,"default_branch":"master","last_synced_at":"2025-03-24T15:47:23.038Z","etag":null,"topics":["co-occurence","complex-networks","networks","shakespeare","shakespeare-plays","social-network","social-network-analysis"],"latest_commit_sha":null,"homepage":"https://pseudomanifold.github.io/Shakespeare/","language":"Python","has_issues":true,"has_wiki":null,"has_pages":null,"mirror_url":null,"source_name":null,"license":"mit","status":null,"scm":"git","pull_requests_enabled":true,"icon_url":"https://github.com/Pseudomanifold.png","metadata":{"files":{"readme":"README.md","changelog":null,"contributing":null,"funding":null,"license":"LICENSE.md","code_of_conduct":null,"threat_model":null,"audit":null,"citation":null,"codeowners":null,"security":null,"support":null,"governance":null,"roadmap":null,"authors":null,"dei":null,"publiccode":null,"codemeta":null}},"created_at":"2016-08-03T11:17:19.000Z","updated_at":"2025-03-03T15:52:29.000Z","dependencies_parsed_at":"2023-03-13T17:33:25.285Z","dependency_job_id":null,"html_url":"https://github.com/Pseudomanifold/Shakespeare","commit_stats":null,"previous_names":[],"tags_count":0,"template":false,"template_full_name":null,"repository_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/Pseudomanifold%2FShakespeare","tags_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/Pseudomanifold%2FShakespeare/tags","releases_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/Pseudomanifold%2FShakespeare/releases","manifests_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/Pseudomanifold%2FShakespeare/manifests","owner_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/owners/Pseudomanifold","download_url":"https://codeload.github.com/Pseudomanifold/Shakespeare/tar.gz/refs/heads/master","host":{"name":"GitHub","url":"https://github.com","kind":"github","repositories_count":248150816,"owners_count":21055988,"icon_url":"https://github.com/github.png","version":null,"created_at":"2022-05-30T11:31:42.601Z","updated_at":"2022-07-04T15:15:14.044Z","host_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub","repositories_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories","repository_names_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repository_names","owners_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/owners"}},"keywords":["co-occurence","complex-networks","networks","shakespeare","shakespeare-plays","social-network","social-network-analysis"],"created_at":"2024-11-06T22:07:57.015Z","updated_at":"2026-03-16T20:31:57.265Z","avatar_url":"https://github.com/Pseudomanifold.png","language":"Python","funding_links":[],"categories":[],"sub_categories":[],"readme":"This repository contains the code used to extract co-occurrence networks\nfrom a tagged corpus of Shakespeare's plays.\n\nThe networks have been analysed using *persistent homology*, a technique\nfrom computational topology. Please refer to our paper\n\n[*Shall I compare thee to a network?* \u0026ndash; Visualizing the Topological Structure of Shakespeare's Plays](http://bastian.rieck.me/research/Vis2016.pdf)\n\nfor more details.\n\n# Data\n\n* The folder `Corpus` contains the original corpus that was used to\n  calculate co-occurrence networks. Additional information about the\n  amount of speech between certain characters has been added. Please\n  refer to [lexically.net](http://lexically.net/wordsmith/support/shakespeare.html) for the original data.\n* The folder `Networks` contains the co-occurrence networks for all the\n  plays that we used in the paper. Networks are categorized into\n  *speech-based* and *time-based* filtrations. Please refer to the paper\n  for more details.\n* The folder `Plays` contains the corrected variants of the plays,\n  sorted into three broad categories.\n\n# Usage\n\nThe main script is called `co-occurrence.py`. Given the filename of\na tagged play, it automatically produces a co-occurrence network using\nthe *speech-based filtration* we described in the paper. The network\nwill be stored in the current directory. To batch-process all networks\nautomatically, you could for example use:\n\n    find ./Plays/ -name \"*.txt\" -exec ./co-occurrence.py {} \\;\n\nThis traverses the folder `Plays` and executes the extraction script for\nevery file. If you want the *time-based filtration* instead, use the\nparameter `-t`, i.e.:\n\n    find ./Plays/ -name \"*.txt\" -exec ./co-occurrence.py {} -t \\;\n\nAgain, this will result in a set of networks. Note that all existing\nnetworks will be overwritten in the current folder.\n\n# Demo\n\nA [demo](https://pseudomanifold.github.io/Shakespeare) of all the extracted\nnetworks is available. The demo uses a simple force-directed graph\nlayout to visualize the network.\n\n# Licence\n\nThe data and the code is are released under an MIT licence. Please refer\nto the file `LICENSE` for more information.\n","project_url":"https://awesome.ecosyste.ms/api/v1/projects/github.com%2Fpseudomanifold%2Fshakespeare","html_url":"https://awesome.ecosyste.ms/projects/github.com%2Fpseudomanifold%2Fshakespeare","lists_url":"https://awesome.ecosyste.ms/api/v1/projects/github.com%2Fpseudomanifold%2Fshakespeare/lists"}