An open API service indexing awesome lists of open source software.

https://github.com/maxinexiong/web-scraping-rpa

This repository contains an RPA robot that was designed to scrap up to 500 pieces of property information for a given location from a real estate website. The extracted data is then intelligently organized, filtered, and sorted according to user-defined criteria, and integrated into the Excel file, output.xlsx.
https://github.com/maxinexiong/web-scraping-rpa

data-scraping data-table excel-processing robotic-process-automation rpa table-extraction uipath uipath-classic-design uipath-modern-design uipath-studio web-scraping

Last synced: 2 months ago
JSON representation

This repository contains an RPA robot that was designed to scrap up to 500 pieces of property information for a given location from a real estate website. The extracted data is then intelligently organized, filtered, and sorted according to user-defined criteria, and integrated into the Excel file, output.xlsx.

Awesome Lists containing this project

README

        

### UiPath RPA Robot
# Web Scraping for Real Estate Data

[![GitHub](https://badgen.net/badge/icon/GitHub?icon=github&color=black&label)](https://github.com/MaxineXiong)
[![License: MIT](https://img.shields.io/badge/License-MIT-yellow.svg)](https://opensource.org/licenses/MIT)
[![Platform - UiPath RPA](https://img.shields.io/badge/Platform-UiPath_RPA-fa4616)](https://www.uipath.com)


This repository contains a robust and efficient solution built using the [**UiPath Automation Platform**](https://www.uipath.com) that streamlines the process of gathering real estate data
from [zillow.com](https://www.zillow.com/), a popular online real estate marketplace. The robot is designed to extract essential information about properties in a given location, including *price*, *number of bedrooms*, *number of bathrooms*, *area (sqft)*,
and *address*, for a maximum of 500 properties. The extracted data is then intelligently organized, filtered, and sorted according to user-defined criteria, and seamlessly integrated into an
Excel file named **output.xlsx**.

Having access to accurate and up-to-date property information is essential for making timely, informed decisions on property investment and management. This UiPath
RPA robot provides a solution to efficiently collect and manage property data from internet, and can save you significant time and effort by automating the repetitive and time-consuming
data search and entry process.

This repository includes solutions created using both **Classic Design** and **Modern Design** in UiPath Studio. _You can check out the **automation demo videos** by clicking on the badge below_:

[![View Demo - Web-Scraping-Real-Estate-Data](https://img.shields.io/badge/View_Demo-Web_Scraping_Real_Estate_Data_(Modern_Design)-fa4616?style=for-the-badge)](https://1drv.ms/v/s!AhxVr7ogXVBRlS25lNe_U91iujN3?e=mKj9X1)

[![View Demo - Web-Scraping-Real-Estate-Data](https://img.shields.io/badge/View_Demo-Web_Scraping_Real_Estate_Data_(Classic_Design)-fa4616?style=for-the-badge)](https://1drv.ms/v/s!AhxVr7ogXVBRlSzdaoWcyaUO0muC?e=tffKcL)


## **Installation**

Before installing **UiPath Softwares**, please make sure your system meets the hardware and software requirements outlined in the **[UiPath documentation](https://docs.uipath.com/studio/standalone/2022.10/user-guide/hardware-and-software-requirements)**.

The **Uipath Platform** includes the following tools:

- **UiPath Studio**
- **UiPath Assistant**
- **UiPath Automation Cloud, including UiPath Orchestrator**


To run this project successfully, please follow these steps to install UiPath Studio:

***

Step 1 : Visit [uipath.com](https://www.uipath.com/) and click **Try UiPath Free** button.



Step 2: **Sign up** for a personal account.



Step 3: **Verify** your account in email.



Step 4: **Log into** the **UiPath Automation Cloud** using your account, and click the **Download Uipath Studio** button.



Step 5: Click **Sign in**.



Step 6: Select **UiPath Studio Pro**.



Step 7: Follow the system instructions to complete the installation of **UiPath Studio Pro**.




Please also follow these steps below to connect your local machine to the UiPath Automation Cloud for deploying this workflow (if desired):

***

Step 1: Sign up and log into [UiPath Automation Cloud](https://cloud.uipath.com/).

Step 2: Add a **Tenant**.



Step 3: **Edit** the user and assign the **Automation Users** role to grant them permission to execute processes.




Step 4: Go to the **Orchestrator** interface and click on **Tenant** in the left pane.




Choose **Folders** and then click the **+** icon to create a new folder.




Step 5: Navigate back to **Tenant** interface and follow the steps below to start adding an Automation User for Unattended Robot in **Manage Access**.



a) Scroll down to locate the target user, then assign the **Automation User** role to grant them the necessary permissions. Click **Next** button to move on to the next page.




b) In the *Personal automations setup* page, select the options to **Enable user to run automations** and **Create a personal workspace for this user and enable optimal Studio Web experience**, then click on the **Next** button.



c) On the *Unattended setup* page, check the option to **Enable this user to run unattended automations**, choose **Specific Windows credentials** for local machine connection to Orchestrator, provide **Domain\Username** of your user account on local machine (which can be found by executing `whoami` in Command Prompt), and enter the **Password** for accessing your local machine. Finally, click on the **Update** button.



Step 6: Now, go to the **Machines** page where you should see the workspace machine for the target user already created. Click the ellipsis to select **Edit Machine**.



Enter **1** for both the *Production (Unattended)* and *Testing* fields, then click the **Update** button.



Step 7: Now return to the newly created folder, choose the **Machines** menu, and click **Manage Machines in Folder** button to assign the machine you just configured to the folder.





You should now have both the **User** and **Machine** assigned to the new folder.



Step 8: Open **UiPath Assistant** and click **Sign In**. If you see the **green circle** in the top right corner, you’ve successfully connected your local *UiPath Studio* to the *UiPath Automation Cloud*.




You can confirm the connection by opening UiPath Studio and checking for a green circle at the bottom.



***

To **publish a process** from UiPath Studio to Orchestrator, **switch to the new folder** you just created in the Orchestrator, and then click to **Publish the process** as a package.



To learn more about other best practices on Orchestrator, please refer to the [Orchestrator User Guide](https://docs.uipath.com/orchestrator/standalone/2023.4/user-guide/introduction).


## **Usage**

To run the RPA workflow on your local machine, follow these steps:

1. Either **download** this repository to your local machine or **clone** it directly within your UiPath Studio.
2. Open the **UiPath Studio** software on your machine.
3. Locate and **open** the **Main.xaml** file from the downloaded repository in **UiPath Studio**.
4. **Run** the **Main.xaml** file to initiate the web scraping process.


## **Acknowledgement**

I would like to express my gratitude to the **[UiPath community](https://community.uipath.com/)** for providing resources, tutorials, and a platform for automation enthusiasts to learn and collaborate.


## **License**

This project is licensed under the [MIT License](https://choosealicense.com/licenses/mit/), which means you're free to modify, distribute, and use the code in your own projects.