{"id":24360406,"url":"https://github.com/dms-codes/scrape_tripsantai","last_synced_at":"2026-05-21T22:07:38.399Z","repository":{"id":199833929,"uuid":"703854958","full_name":"dms-codes/scrape_tripsantai","owner":"dms-codes","description":"Trip Santai Tour Data Scraper This Python script is a web scraper designed to extract and collect information about tours from the Trip Santai website. It utilizes the requests library to fetch web pages, BeautifulSoup for parsing HTML, and writes the collected data to a CSV file.","archived":false,"fork":false,"pushed_at":"2023-10-12T03:55:46.000Z","size":21,"stargazers_count":0,"open_issues_count":0,"forks_count":0,"subscribers_count":1,"default_branch":"main","last_synced_at":"2025-01-18T21:20:04.638Z","etag":null,"topics":["beautifulsoup4","data","python","requests","scraper","webscraper"],"latest_commit_sha":null,"homepage":"https://github.com/dms-codes/scrape_tripsantai","language":"Python","has_issues":true,"has_wiki":null,"has_pages":null,"mirror_url":null,"source_name":null,"license":null,"status":null,"scm":"git","pull_requests_enabled":true,"icon_url":"https://github.com/dms-codes.png","metadata":{"files":{"readme":"README.md","changelog":null,"contributing":null,"funding":null,"license":null,"code_of_conduct":null,"threat_model":null,"audit":null,"citation":null,"codeowners":null,"security":null,"support":null,"governance":null}},"created_at":"2023-10-12T03:49:34.000Z","updated_at":"2023-10-12T03:56:29.000Z","dependencies_parsed_at":null,"dependency_job_id":"b2a3584c-4ef5-4886-9792-9a0fb8ed3502","html_url":"https://github.com/dms-codes/scrape_tripsantai","commit_stats":null,"previous_names":["dms-codes/scrape_tripsantai"],"tags_count":0,"template":false,"template_full_name":null,"repository_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/dms-codes%2Fscrape_tripsantai","tags_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/dms-codes%2Fscrape_tripsantai/tags","releases_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/dms-codes%2Fscrape_tripsantai/releases","manifests_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/dms-codes%2Fscrape_tripsantai/manifests","owner_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/owners/dms-codes","download_url":"https://codeload.github.com/dms-codes/scrape_tripsantai/tar.gz/refs/heads/main","host":{"name":"GitHub","url":"https://github.com","kind":"github","repositories_count":243182886,"owners_count":20249708,"icon_url":"https://github.com/github.png","version":null,"created_at":"2022-05-30T11:31:42.601Z","updated_at":"2022-07-04T15:15:14.044Z","host_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub","repositories_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories","repository_names_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repository_names","owners_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/owners"}},"keywords":["beautifulsoup4","data","python","requests","scraper","webscraper"],"created_at":"2025-01-18T21:19:40.469Z","updated_at":"2025-12-26T22:57:08.835Z","avatar_url":"https://github.com/dms-codes.png","language":"Python","funding_links":[],"categories":[],"sub_categories":[],"readme":"# Trip Santai Tour Data Scraper\n\nThis Python script is a web scraper designed to extract and collect information about tours from the [Trip Santai](https://www.tripsantai.com/) website. It utilizes the `requests` library to fetch web pages, `BeautifulSoup` for parsing HTML, and writes the collected data to a CSV file.\n\n## Prerequisites\n\nBefore using this script, make sure you have the following Python libraries installed:\n\n- `requests`\n- `BeautifulSoup`\n\nYou can install these libraries using pip:\n\n```bash\npip install requests beautifulsoup4\n```\n\n## Usage\n\n1. Clone this repository to your local machine.\n\n2. Modify the `BASE_URL` in the script to the specific URL of the tours you want to scrape on the Trip Santai website.\n\n3. Run the script:\n\n```bash\npython trip_santai_scraper.py\n```\n\n4. The script will fetch tour data, including tour name, category, destination, duration, pricing, itinerary, and inclusions/exclusions.\n\n5. The collected data will be written to a CSV file named `data_tour_tripsantai.csv`.\n\n## Code Structure\n\n- `constants.py`: Contains constants like `BASE_URL` and `TIMEOUT`.\n- `utils.py`: Contains utility functions for extracting and cleaning text from HTML elements.\n- `trip_santai_scraper.py`: The main script for scraping tour data.\n- `requirements.txt`: Lists the required Python libraries.\n\n## Contact\n\nIf you have any questions or suggestions, please feel free to contact us.\n\nHappy web scraping!\n","project_url":"https://awesome.ecosyste.ms/api/v1/projects/github.com%2Fdms-codes%2Fscrape_tripsantai","html_url":"https://awesome.ecosyste.ms/projects/github.com%2Fdms-codes%2Fscrape_tripsantai","lists_url":"https://awesome.ecosyste.ms/api/v1/projects/github.com%2Fdms-codes%2Fscrape_tripsantai/lists"}