{"id":21975242,"url":"https://github.com/ishan-surana/metadatascraper","last_synced_at":"2025-04-28T15:50:25.302Z","repository":{"id":246741818,"uuid":"821099642","full_name":"ishan-surana/MetaDataScraper","owner":"ishan-surana","description":"MetaDataScraper is a Python package designed to automate the extraction of follower counts and post details from a public Facebook page. It uses Selenium WebDriver for web automation and scraping. Official documentation at https://metadatascraper.readthedocs.io","archived":false,"fork":false,"pushed_at":"2024-08-23T15:36:39.000Z","size":183,"stargazers_count":6,"open_issues_count":0,"forks_count":1,"subscribers_count":1,"default_branch":"main","last_synced_at":"2025-04-21T08:10:17.253Z","etag":null,"topics":["facebook","meta","no-api","no-login","python-library","python-package","scraper","webscraper"],"latest_commit_sha":null,"homepage":"https://pypi.org/project/MetaDataScraper/","language":"Python","has_issues":true,"has_wiki":null,"has_pages":null,"mirror_url":null,"source_name":null,"license":"apache-2.0","status":null,"scm":"git","pull_requests_enabled":true,"icon_url":"https://github.com/ishan-surana.png","metadata":{"files":{"readme":"README.md","changelog":null,"contributing":null,"funding":".github/FUNDING.yml","license":null,"code_of_conduct":null,"threat_model":null,"audit":null,"citation":null,"codeowners":null,"security":"SECURITY.md","support":null,"governance":null,"roadmap":null,"authors":null,"dei":null,"publiccode":null,"codemeta":null},"funding":{"custom":["https://www.paypal.com/paypalme/ishansurana"],"github":"ishan-surana","buy_me_a_coffee":"ishansurana"}},"created_at":"2024-06-27T20:13:26.000Z","updated_at":"2025-03-08T19:47:16.000Z","dependencies_parsed_at":"2024-08-23T16:13:16.202Z","dependency_job_id":"3f24814b-ef2b-4b02-8b54-8d4e9bb11aca","html_url":"https://github.com/ishan-surana/MetaDataScraper","commit_stats":null,"previous_names":["ishan-surana/metadatascraper"],"tags_count":8,"template":false,"template_full_name":null,"repository_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/ishan-surana%2FMetaDataScraper","tags_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/ishan-surana%2FMetaDataScraper/tags","releases_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/ishan-surana%2FMetaDataScraper/releases","manifests_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/ishan-surana%2FMetaDataScraper/manifests","owner_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/owners/ishan-surana","download_url":"https://codeload.github.com/ishan-surana/MetaDataScraper/tar.gz/refs/heads/main","host":{"name":"GitHub","url":"https://github.com","kind":"github","repositories_count":251342700,"owners_count":21574242,"icon_url":"https://github.com/github.png","version":null,"created_at":"2022-05-30T11:31:42.601Z","updated_at":"2022-07-04T15:15:14.044Z","host_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub","repositories_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories","repository_names_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repository_names","owners_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/owners"}},"keywords":["facebook","meta","no-api","no-login","python-library","python-package","scraper","webscraper"],"created_at":"2024-11-29T15:50:28.583Z","updated_at":"2025-04-28T15:50:25.284Z","avatar_url":"https://github.com/ishan-surana.png","language":"Python","funding_links":["https://www.paypal.com/paypalme/ishansurana","https://github.com/sponsors/ishan-surana","https://buymeacoffee.com/ishansurana","https://www.buymeacoffee.com/ishansurana"],"categories":[],"sub_categories":[],"readme":"[![Licence](https://badgen.net/github/license/ishan-surana/MetaDataScraper?color=DC143C)](https://github.com/ishan-surana/MetaDataScraper/blob/main/LICENCE) [![Python](https://img.shields.io/badge/python-%3E=3.10-slateblue.svg)](https://www.python.org/downloads/release/python-3119/) [![Wheel](https://img.shields.io/badge/wheel-yes-FF00C9.svg)](https://files.pythonhosted.org/packages/02/80/c53d5e8439361c913e23b6345e85e748a7ac7e82e22cb9f7cd9ec77d5d52/MetaDataScraper-1.0.0-py3-none-any.whl) [![Latest](https://badgen.net/github/release/ishan-surana/MetaDataScraper?label=latest+release\u0026color=green)](https://pypi.org/project/MetaDataScraper/1.0.0/) [![Releases](https://badgen.net/github/releases/ishan-surana/MetaDataScraper?color=orange)](https://github.com/ishan-surana/MetaDataScraper/releases) [![Stars](https://badgen.net/github/stars/ishan-surana/MetaDataScraper?color=yellow)](https://github.com/ishan-surana/MetaDataScraper/stargazers) [![Forks](https://badgen.net/github/forks/ishan-surana/MetaDataScraper?color=dark)](https://github.com/ishan-surana/MetaDataScraper/forks) [![Issues](https://badgen.net/github/issues/ishan-surana/MetaDataScraper?color=800000)](https://github.com/ishan-surana/MetaDataScraper/issues) [![PRs](https://badgen.net/github/prs/ishan-surana/MetaDataScraper?color=C71585)](https://github.com/ishan-surana/MetaDataScraper/pulls) ![Downloads](https://img.shields.io/github/downloads/ishan-surana/MetaDataScraper/total) [![Last commit](https://badgen.net/github/last-commit/ishan-surana/MetaDataScraper?color=blue)](https://github.com/ishan-surana/MetaDataScraper/commits/main/) [![Workflow](https://github.com/ishan-surana/MetaDataScraper/actions/workflows/python-publish.yml/badge.svg)](https://github.com/ishan-surana/MetaDataScraper/blob/main/.github/workflows/python-publish.yml) [![PyPI](https://d25lcipzij17d.cloudfront.net/badge.svg?id=py\u0026r=r\u0026ts=1683906897\u0026type=6e\u0026v=1.0.0\u0026x2=0)](https://pypi.org/project/MetaDataScraper/) [![Maintained](https://img.shields.io/badge/maintained-yes-cyan)](https://github.com/ishan-surana/MetaDataScraper/pulse) [![OS](https://img.shields.io/badge/OS-Windows-FF0000)](https://www.microsoft.com/software-download/windows11) [![Documentation Status](https://readthedocs.org/projects/metadatascraper/badge/?version=latest)](https://metadatascraper.readthedocs.io/en/latest/?badge=latest)\u003cbr\u003e\r\n---\r\n## \u003cdiv align=center\u003eSupport this package by donating here! ➡️ [![Buy Me a Coffee](https://img.shields.io/badge/Buy%20Me%20a%20Coffee-badge?style=plastic\u0026logo=buy-me-a-coffee\u0026color=black)](https://www.buymeacoffee.com/ishansurana) [![Paypal](https://img.shields.io/badge/PayPal-badge?style=plastic\u0026logo=paypal\u0026color=white)](https://www.paypal.com/paypalme/ishansurana)\u003c/div\u003e\u003cbr\u003e\r\n\r\n# MetaDataScraper\r\n\r\nMetaDataScraper is a Python package designed to automate the extraction of information like follower counts, and post details \u0026 interactions from a public Facebook page, in the form of a list. It uses Selenium WebDriver for web automation and scraping.  \r\nThe module provides two classes: `LoginlessScraper` and `LoggedInScraper`. The `LoginlessScraper` class does not require any authentication or API keys to scrape the data. However, it has a drawback of being unable to access some Facebook pages. \r\nThe `LoggedInScraper` class overcomes this drawback by utilising the credentials of a Facebook account (of user) to login and scrape the data.\r\n\r\n## Installation\r\n\r\nYou can install MetaDataScraper using pip:\r\n\r\n```\r\npip install MetaDataScraper\r\n```\r\n\r\nMake sure you have Python 3.x and pip installed.\r\n\r\n## Usage\r\n\r\nTo use MetaDataScraper, follow these steps:\r\n\r\n1. Import the `LoginlessScraper` or the `LoggedInScraper` class:\r\n\r\n   ```python\r\n   from MetaDataScraper import LoginlessScraper, LoggedInScraper\r\n   ```\r\n\r\n2. Initialize the scraper with the Facebook page ID:\r\n\r\n   ```python\r\n   page_id = \"your_target_page_id\"\r\n   scraper = LoginlessScraper(page_id)\r\n   email = \"your_facebook_email\"\r\n   password = \"your_facebook_password\"\r\n   scraper = LoggedInScraper(page_id, email, password)\r\n   ```\r\n\r\n3. Scrape the Facebook page to retrieve information:\r\n\r\n   ```python\r\n   result = scraper.scrape()\r\n   ```\r\n\r\n4. Access the scraped data from the result dictionary:\r\n\r\n   ```python\r\n   print(f\"Followers: {result['followers']}\")\r\n   print(f\"Post Texts: {result['post_texts']}\")\r\n   print(f\"Post Likes: {result['post_likes']}\")\r\n   print(f\"Post Shares: {result['post_shares']}\")\r\n   print(f\"Is Video: {result['is_video']}\")\r\n   print(f\"Video Links: {result['video_links']}\")\r\n   ```\r\n\r\n## Features\r\n\r\n- **Automated Extraction**: Automatically fetches follower counts, post texts, likes, shares, and video links from Facebook pages.\r\n- **Comprehensive Data Retrieval**: Retrieves detailed information about each post, including text content, interaction metrics (likes, shares), and multimedia (e.g., video links).\r\n- **Flexible Handling**: Adapts to diverse post structures and various types of multimedia content present on Facebook pages, like post texts or reels.\r\n- **Enhanced Access with Logged-In Scraper**: Overcomes limitations faced by anonymous scraping (loginless) by utilizing Facebook account credentials for broader page access.\r\n- **Headless Operation**: Executes scraping tasks in headless mode, ensuring seamless and non-intrusive data collection without displaying a browser interface.\r\n- **Scalability**: Supports scaling to handle large volumes of data extraction efficiently, suitable for monitoring multiple Facebook pages simultaneously.\r\n- **Dependency Management**: Utilizes Selenium WebDriver for robust web automation and scraping capabilities, compatible with Python 3.x environments.\r\n- **Ease of Use**: Simplifies the process with straightforward initialization and method calls, facilitating quick integration into existing workflows.\r\n\r\n## Dependencies\r\n\r\n- selenium\r\n- webdriver_manager\r\n\r\n## License\r\n\r\nThis project is licensed under the Apache Software License Version 2.0 - see the [LICENSE](https://github.com/ishan-surana/MetaDataScraper/blob/main/LICENCE) file for details.\r\n","project_url":"https://awesome.ecosyste.ms/api/v1/projects/github.com%2Fishan-surana%2Fmetadatascraper","html_url":"https://awesome.ecosyste.ms/projects/github.com%2Fishan-surana%2Fmetadatascraper","lists_url":"https://awesome.ecosyste.ms/api/v1/projects/github.com%2Fishan-surana%2Fmetadatascraper/lists"}