{"id":22832612,"url":"https://github.com/timkong21/facebook-page-classifier","last_synced_at":"2025-07-19T14:35:13.486Z","repository":{"id":190841033,"uuid":"405562471","full_name":"TimKong21/Facebook-page-classifier","owner":"TimKong21","description":"Apply Multi-class node classification algorithm to classify Facebook page content type","archived":false,"fork":false,"pushed_at":"2021-09-12T21:48:25.000Z","size":2294,"stargazers_count":0,"open_issues_count":0,"forks_count":0,"subscribers_count":1,"default_branch":"main","last_synced_at":"2025-06-29T05:34:21.499Z","etag":null,"topics":["biased-random-walk","gensim-word2vec","graph","machine-learning","multi-class-classification","networkx","node2vec","stellargraph","tsne"],"latest_commit_sha":null,"homepage":"","language":"Jupyter Notebook","has_issues":true,"has_wiki":null,"has_pages":null,"mirror_url":null,"source_name":null,"license":null,"status":null,"scm":"git","pull_requests_enabled":true,"icon_url":"https://github.com/TimKong21.png","metadata":{"files":{"readme":"README.md","changelog":null,"contributing":null,"funding":null,"license":null,"code_of_conduct":null,"threat_model":null,"audit":null,"citation":null,"codeowners":null,"security":null,"support":null,"governance":null}},"created_at":"2021-09-12T06:12:43.000Z","updated_at":"2023-02-02T22:35:24.000Z","dependencies_parsed_at":"2023-08-26T17:50:06.701Z","dependency_job_id":null,"html_url":"https://github.com/TimKong21/Facebook-page-classifier","commit_stats":null,"previous_names":["timkong21/facebook-page-classifier"],"tags_count":0,"template":false,"template_full_name":null,"purl":"pkg:github/TimKong21/Facebook-page-classifier","repository_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/TimKong21%2FFacebook-page-classifier","tags_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/TimKong21%2FFacebook-page-classifier/tags","releases_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/TimKong21%2FFacebook-page-classifier/releases","manifests_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/TimKong21%2FFacebook-page-classifier/manifests","owner_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/owners/TimKong21","download_url":"https://codeload.github.com/TimKong21/Facebook-page-classifier/tar.gz/refs/heads/main","sbom_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/TimKong21%2FFacebook-page-classifier/sbom","host":{"name":"GitHub","url":"https://github.com","kind":"github","repositories_count":265947511,"owners_count":23853382,"icon_url":"https://github.com/github.png","version":null,"created_at":"2022-05-30T11:31:42.601Z","updated_at":"2022-07-04T15:15:14.044Z","host_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub","repositories_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories","repository_names_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repository_names","owners_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/owners"}},"keywords":["biased-random-walk","gensim-word2vec","graph","machine-learning","multi-class-classification","networkx","node2vec","stellargraph","tsne"],"created_at":"2024-12-12T21:08:15.319Z","updated_at":"2025-07-19T14:35:13.440Z","avatar_url":"https://github.com/TimKong21.png","language":"Jupyter Notebook","funding_links":[],"categories":[],"sub_categories":[],"readme":"\n# Facebook Page Classifier\n\nThe [facebook_large dataset](https://github.com/TimKong21/Facebook-page-classifier/tree/main/facebook_large) contains page-page graph of verified Facebook sites. \nThis graph was collected through the Facebook Graph API in November 2017 \nand restricted to pages from four categories which are defined by Facebook. \nMulti-class node classification on the Facebook sites is performed in this repository.\n\n## Dataset\nThere are nodes and edges in the dataset. They are:\n\n[musae_facebook_target.csv](https://raw.githubusercontent.com/TimKong21/Facebook-page-classifier/main/facebook_large/musae_facebook_target.csv)\n- Nodes are the oficial Facebook pages with unique ids.\n- Each page is labelled with the page type - **tvshow**, **government**, **company**, **politician**.\n\n[musae_facebook_edges.csv](https://raw.githubusercontent.com/TimKong21/Facebook-page-classifier/main/facebook_large/musae_facebook_edges.csv)\n- Edges are the mutual likes between the Facebook pages\n- There could be no relarionships, one-to-one or one-to-many relationships.\n\n## Notebook\n[Facebook page classifier.ipynb](https://github.com/TimKong21/Facebook-page-classifier/blob/main/Facebook%20page%20classifier.ipynb)\ncontains the step for node classification. Through the process, every node is assigned a specific label. A predefined percentage of random nodes is used to train the classifier, while the rest serves as test data for evaluating the embedding method and specific classifier.\nThe implementation can be divided into four main steps:\n\n    1. Calculate and save node embeddings for the whole graph.\n    2. Split the node embeddings into training and testing sets.\n    3. Train the classifier.\n    4. Evaluate the classifier on the test data.\n## Acknowledgements\n\n - [Node embeddings visualization reference](https://stellargraph.readthedocs.io/en/stable/demos/node-classification/node2vec-node-classification.html)\n - [Node classification algorithms reference](https://github.com/memgraph/graph-analytics-course/blob/master/lecture-5/node-classification/classifier.py)\n  ","project_url":"https://awesome.ecosyste.ms/api/v1/projects/github.com%2Ftimkong21%2Ffacebook-page-classifier","html_url":"https://awesome.ecosyste.ms/projects/github.com%2Ftimkong21%2Ffacebook-page-classifier","lists_url":"https://awesome.ecosyste.ms/api/v1/projects/github.com%2Ftimkong21%2Ffacebook-page-classifier/lists"}