{"id":20114700,"url":"https://github.com/gatenlp/tweet-network-gexf-generator","last_synced_at":"2026-05-15T05:02:28.420Z","repository":{"id":145771246,"uuid":"218493966","full_name":"GateNLP/Tweet-Network-GEXF-Generator","owner":"GateNLP","description":"Tweet Network GEXF Generator","archived":false,"fork":false,"pushed_at":"2020-08-11T12:32:11.000Z","size":34,"stargazers_count":2,"open_issues_count":3,"forks_count":0,"subscribers_count":12,"default_branch":"master","last_synced_at":"2025-03-09T19:51:31.067Z","etag":null,"topics":[],"latest_commit_sha":null,"homepage":null,"language":"Groovy","has_issues":true,"has_wiki":null,"has_pages":null,"mirror_url":null,"source_name":null,"license":"lgpl-3.0","status":null,"scm":"git","pull_requests_enabled":true,"icon_url":"https://github.com/GateNLP.png","metadata":{"files":{"readme":"README.md","changelog":null,"contributing":null,"funding":null,"license":"LICENSE","code_of_conduct":null,"threat_model":null,"audit":null,"citation":null,"codeowners":null,"security":null,"support":null,"governance":null,"roadmap":null,"authors":null,"dei":null,"publiccode":null,"codemeta":null}},"created_at":"2019-10-30T09:47:54.000Z","updated_at":"2020-08-11T12:30:55.000Z","dependencies_parsed_at":null,"dependency_job_id":"1c31d3bb-43da-4fdf-ba95-ab32cdce27f5","html_url":"https://github.com/GateNLP/Tweet-Network-GEXF-Generator","commit_stats":null,"previous_names":[],"tags_count":0,"template":false,"template_full_name":null,"purl":"pkg:github/GateNLP/Tweet-Network-GEXF-Generator","repository_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/GateNLP%2FTweet-Network-GEXF-Generator","tags_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/GateNLP%2FTweet-Network-GEXF-Generator/tags","releases_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/GateNLP%2FTweet-Network-GEXF-Generator/releases","manifests_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/GateNLP%2FTweet-Network-GEXF-Generator/manifests","owner_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/owners/GateNLP","download_url":"https://codeload.github.com/GateNLP/Tweet-Network-GEXF-Generator/tar.gz/refs/heads/master","sbom_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/GateNLP%2FTweet-Network-GEXF-Generator/sbom","scorecard":null,"host":{"name":"GitHub","url":"https://github.com","kind":"github","repositories_count":286080680,"owners_count":33054454,"icon_url":"https://github.com/github.png","version":null,"created_at":"2022-05-30T11:31:42.601Z","updated_at":"2026-05-13T13:14:54.681Z","status":"online","status_checked_at":"2026-05-15T02:00:06.351Z","response_time":103,"last_error":null,"robots_txt_status":"success","robots_txt_updated_at":"2025-07-24T06:49:26.215Z","robots_txt_url":"https://github.com/robots.txt","online":true,"can_crawl_api":true,"host_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub","repositories_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories","repository_names_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repository_names","owners_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/owners"}},"keywords":[],"created_at":"2024-11-13T18:31:25.758Z","updated_at":"2026-05-15T05:02:28.384Z","avatar_url":"https://github.com/GateNLP.png","language":"Groovy","funding_links":[],"categories":[],"sub_categories":[],"readme":"# Tweet Network GEXF Generator\n\nTo convert tweets stored in JSON files to a GEXF network graph use the following command\n\n```\ngroovy generate.groovy [options] \u003cinput files\u003e \u003coutput file\u003e\n```\n\nNote that you need Groovy 2.5 or above for the script to work correctly\n\nwhere options are\n\n```\n    --retweets: includes retweets within the graph\n     --replies: includes replies within the graph\n    --mentions: includes mentions within the graph\n\n--edgeWeight n: removes any edges with a weight less than or equal to n\n        --trim: trim the graph to only keep the biggest connected network\n\n       --twint: the data is in Twint JSON format not native Twitter style\n       --tweep: the data is in Tweep JSON format not native Twitter style\n\n```\n\nif no options are provided then rather than producing an empty graph, it is\nassumed that the user wants retweets, replies, and mentions.\n\nAny number of input files or dirctories can be provided to the script and these\nwill be recursed to find all \\*.gz files which it assumes contains Tweets in\nJSON format, one tweet per line. This is the format in which Tweets are made\navailable for download from GATE Cloud.\n\nThe final arguent should be the name of the GEXF file to write the graph to.\n\n## Known Issues\n- When processing retweets in Twint data it's unclear if the timestamp is for\n  the retweet or the original tweet\n- Replies in both Twint and Tweep data are horrid as they are just the tweets\n  that start @\n- Unclear in Twint if the mentions includes the person being replied to or not\n  (we don't want them to be as that would duplicate the edge)\n- Tweep format doesn't pull out the mentions and we've not yet added support\n  for pulling them out of the tweet text\n","project_url":"https://awesome.ecosyste.ms/api/v1/projects/github.com%2Fgatenlp%2Ftweet-network-gexf-generator","html_url":"https://awesome.ecosyste.ms/projects/github.com%2Fgatenlp%2Ftweet-network-gexf-generator","lists_url":"https://awesome.ecosyste.ms/api/v1/projects/github.com%2Fgatenlp%2Ftweet-network-gexf-generator/lists"}