{"id":21398854,"url":"https://github.com/mart-dore/pagerank","last_synced_at":"2025-03-16T15:41:30.328Z","repository":{"id":259321884,"uuid":"877576137","full_name":"mart-dore/PageRank","owner":"mart-dore","description":"Implementation of Page Rank Algorithm in Python","archived":false,"fork":false,"pushed_at":"2024-10-23T22:36:52.000Z","size":2666,"stargazers_count":1,"open_issues_count":0,"forks_count":0,"subscribers_count":1,"default_branch":"main","last_synced_at":"2025-01-23T02:41:23.627Z","etag":null,"topics":["pagerank-algorithm","wikipedia"],"latest_commit_sha":null,"homepage":"","language":"Python","has_issues":true,"has_wiki":null,"has_pages":null,"mirror_url":null,"source_name":null,"license":null,"status":null,"scm":"git","pull_requests_enabled":true,"icon_url":"https://github.com/mart-dore.png","metadata":{"files":{"readme":"readme.md","changelog":null,"contributing":null,"funding":null,"license":null,"code_of_conduct":null,"threat_model":null,"audit":null,"citation":null,"codeowners":null,"security":null,"support":null,"governance":null,"roadmap":null,"authors":null,"dei":null,"publiccode":null,"codemeta":null}},"created_at":"2024-10-23T22:24:39.000Z","updated_at":"2024-10-24T15:52:10.000Z","dependencies_parsed_at":"2024-10-24T11:44:07.612Z","dependency_job_id":"94b1a999-e116-4d1f-ba59-5993ce88a2ca","html_url":"https://github.com/mart-dore/PageRank","commit_stats":null,"previous_names":["mart-dore/pagerank"],"tags_count":0,"template":false,"template_full_name":null,"repository_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/mart-dore%2FPageRank","tags_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/mart-dore%2FPageRank/tags","releases_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/mart-dore%2FPageRank/releases","manifests_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/mart-dore%2FPageRank/manifests","owner_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/owners/mart-dore","download_url":"https://codeload.github.com/mart-dore/PageRank/tar.gz/refs/heads/main","host":{"name":"GitHub","url":"https://github.com","kind":"github","repositories_count":243891926,"owners_count":20364607,"icon_url":"https://github.com/github.png","version":null,"created_at":"2022-05-30T11:31:42.601Z","updated_at":"2022-07-04T15:15:14.044Z","host_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub","repositories_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories","repository_names_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repository_names","owners_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/owners"}},"keywords":["pagerank-algorithm","wikipedia"],"created_at":"2024-11-22T15:10:48.216Z","updated_at":"2025-03-16T15:41:30.294Z","avatar_url":"https://github.com/mart-dore.png","language":"Python","funding_links":[],"categories":[],"sub_categories":[],"readme":"# Big Data Project: PageRank\n\n## 🎓 Project Overview\n\nThis project was developed as part of a university course with the goal of implementing the **PageRank algorithm** on Wikipedia pages. The dataset is based on the results from the **Wiki Speed Run** game, where players attempt to connect two Wikipedia pages as quickly as possible by navigating through clickable links between pages. This analysis simulates the PageRank process using these paths to rank Wikipedia pages based on their importance.\n\n## 🚀 How to Run the Classic PageRank Program\n\nTo execute the classic PageRank algorithm, run the following command:\n```bash\npython projet.py\n```\nMake sure the file `paths_finished.csv` is in the same directory as the executable file.\n\n## 🛠 Method Selection\n\nWhen prompted, choose the desired method:\n1. **PageRank Classic**\n2. **Custom Method**\n\n### 🔹 For Classic PageRank:\nYou will need to input the beta value (a float between [0,1]).\n\n### 🔹 For Custom PageRank:\nYou will be asked to input specific page names separated by spaces. For example:\n```bash\nFrance Germany Africa\n```\n\n## 📊 Output Example\n\nAn example output for PageRank with `beta = 0.5` might look like this:\n\n| **PAGE**          | **SCORE** |\n|-------------------|------------|\n| United_States     | 0.02246299 |\n| Europe            | 0.01025971 |\n| England           | 0.00926042 |\n| United_Kingdom    | 0.00921579 |\n| Africa            | 0.00584046 |\n| World_War_II      | 0.00549170 |\n| Earth             | 0.00464188 |\n| France            | 0.00433498 |\n| Germany           | 0.00411631 |\n\nIn this example, *United_States* and *Europe* are the two pages with the highest rank, meaning that they are most used to reach other pages.\n\n## 🛠 Running the Program for Unfinished Paths\n\nTo run the program that also takes into account unfinished paths, use the following command:\n```bash\npython projet_path_unfinished.py\n```\nEnsure both `paths_finished.csv` and `paths_unfinished.csv` are in the same directory as the executable file.\n","project_url":"https://awesome.ecosyste.ms/api/v1/projects/github.com%2Fmart-dore%2Fpagerank","html_url":"https://awesome.ecosyste.ms/projects/github.com%2Fmart-dore%2Fpagerank","lists_url":"https://awesome.ecosyste.ms/api/v1/projects/github.com%2Fmart-dore%2Fpagerank/lists"}