{"id":18764443,"url":"https://github.com/tripplyons/find-twitter-accounts","last_synced_at":"2025-12-07T07:30:19.500Z","repository":{"id":81907854,"uuid":"537105399","full_name":"tripplyons/find-twitter-accounts","owner":"tripplyons","description":"Find and classify Twitter accounts using text embeddings","archived":false,"fork":false,"pushed_at":"2023-03-30T19:33:06.000Z","size":24,"stargazers_count":2,"open_issues_count":0,"forks_count":0,"subscribers_count":3,"default_branch":"main","last_synced_at":"2024-12-29T05:42:10.319Z","etag":null,"topics":[],"latest_commit_sha":null,"homepage":"","language":"Python","has_issues":true,"has_wiki":null,"has_pages":null,"mirror_url":null,"source_name":null,"license":"mit","status":null,"scm":"git","pull_requests_enabled":true,"icon_url":"https://github.com/tripplyons.png","metadata":{"files":{"readme":"README.md","changelog":null,"contributing":null,"funding":null,"license":"LICENSE","code_of_conduct":null,"threat_model":null,"audit":null,"citation":null,"codeowners":null,"security":null,"support":null,"governance":null,"roadmap":null,"authors":null,"dei":null,"publiccode":null,"codemeta":null}},"created_at":"2022-09-15T16:06:56.000Z","updated_at":"2023-03-27T18:12:01.000Z","dependencies_parsed_at":null,"dependency_job_id":"f2980701-288b-4380-b8e0-4491e12b5e06","html_url":"https://github.com/tripplyons/find-twitter-accounts","commit_stats":null,"previous_names":[],"tags_count":0,"template":false,"template_full_name":null,"repository_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/tripplyons%2Ffind-twitter-accounts","tags_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/tripplyons%2Ffind-twitter-accounts/tags","releases_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/tripplyons%2Ffind-twitter-accounts/releases","manifests_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/tripplyons%2Ffind-twitter-accounts/manifests","owner_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/owners/tripplyons","download_url":"https://codeload.github.com/tripplyons/find-twitter-accounts/tar.gz/refs/heads/main","host":{"name":"GitHub","url":"https://github.com","kind":"github","repositories_count":239662746,"owners_count":19676435,"icon_url":"https://github.com/github.png","version":null,"created_at":"2022-05-30T11:31:42.601Z","updated_at":"2022-07-04T15:15:14.044Z","host_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub","repositories_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories","repository_names_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repository_names","owners_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/owners"}},"keywords":[],"created_at":"2024-11-07T18:29:52.616Z","updated_at":"2025-12-07T07:30:19.451Z","avatar_url":"https://github.com/tripplyons.png","language":"Python","funding_links":[],"categories":[],"sub_categories":[],"readme":"# Find Twitter Accounts\n\nFind and classify Twitter accounts using text embeddings\n\n## Use cases\n\n- Finding bot accounts\n- Finding accounts of cryptocurrency projects ([example output and dataset](https://gist.github.com/tripplyons/eb5977dcf788ca408f4fe542daeb914e))\n- Finding any other kind of account you can make a dataset for\n\n## Installation\n\n```bash\nconda env create -f environment.yml\nconda activate twitter\n```\n\n## Usage\n\n### Labeling data\n\nThis will create or add to a dataset stored in `dataset.json`.\n\n```bash\npython labeling.py \u003csearch query\u003e \u003cnumber of users to label\u003e\n```\n\n### Training a classifier\n\nThis will train a linear classifier on embeddings.\n\nIt will use the dataset defined in `dataset.json` and save a model to `classifier.pkl`:\n\n```bash\npython classifier.py\n```\n\n### Finding accounts\n\nThis will find embeddings of scraped accounts and use the classifier to classify them.\n\nIt will output links to any accounts with a specified label.\n\n```bash\npython main.py \u003csearch query\u003e \u003cnumber of tweets to find users\u003e \u003clabel to search for\u003e\n```\n\n## Details\n\n### Input format for the model\n\n```\nDisplay Name, Username, Profile description\n```\n\n## Credits\n\n- https://platform.openai.com/docs/guides/embeddings (Embeddings)\n- https://github.com/JustAnotherArchivist/snscrape (Twitter Scraper)\n","project_url":"https://awesome.ecosyste.ms/api/v1/projects/github.com%2Ftripplyons%2Ffind-twitter-accounts","html_url":"https://awesome.ecosyste.ms/projects/github.com%2Ftripplyons%2Ffind-twitter-accounts","lists_url":"https://awesome.ecosyste.ms/api/v1/projects/github.com%2Ftripplyons%2Ffind-twitter-accounts/lists"}