https://github.com/maxinexiong/web-scraping-rpa
This repository contains an RPA robot that was designed to scrap up to 500 pieces of property information for a given location from a real estate website. The extracted data is then intelligently organized, filtered, and sorted according to user-defined criteria, and integrated into the Excel file, output.xlsx.
https://github.com/maxinexiong/web-scraping-rpa
data-scraping data-table excel-processing robotic-process-automation rpa table-extraction uipath uipath-classic-design uipath-modern-design uipath-studio web-scraping
Last synced: 2 months ago
JSON representation
This repository contains an RPA robot that was designed to scrap up to 500 pieces of property information for a given location from a real estate website. The extracted data is then intelligently organized, filtered, and sorted according to user-defined criteria, and integrated into the Excel file, output.xlsx.
- Host: GitHub
- URL: https://github.com/maxinexiong/web-scraping-rpa
- Owner: MaxineXiong
- Created: 2023-11-18T23:07:36.000Z (over 1 year ago)
- Default Branch: main
- Last Pushed: 2023-11-19T00:24:34.000Z (over 1 year ago)
- Last Synced: 2023-11-19T01:27:54.990Z (over 1 year ago)
- Topics: data-scraping, data-table, excel-processing, robotic-process-automation, rpa, table-extraction, uipath, uipath-classic-design, uipath-modern-design, uipath-studio, web-scraping
- Homepage:
- Size: 16.5 MB
- Stars: 0
- Watchers: 1
- Forks: 0
- Open Issues: 0
-
Metadata Files:
- Readme: README.md
Awesome Lists containing this project
README
### UiPath RPA Robot
# Web Scraping for Real Estate Data[](https://github.com/MaxineXiong)
[](https://opensource.org/licenses/MIT)
[](https://www.uipath.com)
This repository contains a robust and efficient solution built using the [**UiPath Automation Platform**](https://www.uipath.com) that streamlines the process of gathering real estate data
from [zillow.com](https://www.zillow.com/), a popular online real estate marketplace. The robot is designed to extract essential information about properties in a given location, including *price*, *number of bedrooms*, *number of bathrooms*, *area (sqft)*,
and *address*, for a maximum of 500 properties. The extracted data is then intelligently organized, filtered, and sorted according to user-defined criteria, and seamlessly integrated into an
Excel file named **output.xlsx**.Having access to accurate and up-to-date property information is essential for making timely, informed decisions on property investment and management. This UiPath
RPA robot provides a solution to efficiently collect and manage property data from internet, and can save you significant time and effort by automating the repetitive and time-consuming
data search and entry process.This repository includes solutions created using both **Classic Design** and **Modern Design** in UiPath Studio. _You can check out the **automation demo videos** by clicking on the badge below_:
[-fa4616?style=for-the-badge)](https://1drv.ms/v/s!AhxVr7ogXVBRlS25lNe_U91iujN3?e=mKj9X1)
[-fa4616?style=for-the-badge)](https://1drv.ms/v/s!AhxVr7ogXVBRlSzdaoWcyaUO0muC?e=tffKcL)
## **Installation**
Before installing **UiPath Softwares**, please make sure your system meets the hardware and software requirements outlined in the **[UiPath documentation](https://docs.uipath.com/studio/standalone/2022.10/user-guide/hardware-and-software-requirements)**.
The **Uipath Platform** includes the following tools:
- **UiPath Studio**
- **UiPath Assistant**
- **UiPath Automation Cloud, including UiPath Orchestrator**
To run this project successfully, please follow these steps to install UiPath Studio:***
Step 1 : Visit [uipath.com](https://www.uipath.com/) and click **Try UiPath Free** button.
![]()
Step 2: **Sign up** for a personal account.
![]()
Step 3: **Verify** your account in email.
![]()
Step 4: **Log into** the **UiPath Automation Cloud** using your account, and click the **Download Uipath Studio** button.
![]()
Step 5: Click **Sign in**.
![]()
Step 6: Select **UiPath Studio Pro**.
![]()
Step 7: Follow the system instructions to complete the installation of **UiPath Studio Pro**.
![]()
Please also follow these steps below to connect your local machine to the UiPath Automation Cloud for deploying this workflow (if desired):***
Step 1: Sign up and log into [UiPath Automation Cloud](https://cloud.uipath.com/).
Step 2: Add a **Tenant**.
![]()
Step 3: **Edit** the user and assign the **Automation Users** role to grant them permission to execute processes.
![]()
![]()
Step 4: Go to the **Orchestrator** interface and click on **Tenant** in the left pane.
![]()
![]()
Choose **Folders** and then click the **+** icon to create a new folder.
![]()
![]()
Step 5: Navigate back to **Tenant** interface and follow the steps below to start adding an Automation User for Unattended Robot in **Manage Access**.
![]()
a) Scroll down to locate the target user, then assign the **Automation User** role to grant them the necessary permissions. Click **Next** button to move on to the next page.
![]()
![]()
b) In the *Personal automations setup* page, select the options to **Enable user to run automations** and **Create a personal workspace for this user and enable optimal Studio Web experience**, then click on the **Next** button.
![]()
c) On the *Unattended setup* page, check the option to **Enable this user to run unattended automations**, choose **Specific Windows credentials** for local machine connection to Orchestrator, provide **Domain\Username** of your user account on local machine (which can be found by executing `whoami` in Command Prompt), and enter the **Password** for accessing your local machine. Finally, click on the **Update** button.
![]()
Step 6: Now, go to the **Machines** page where you should see the workspace machine for the target user already created. Click the ellipsis to select **Edit Machine**.
![]()
Enter **1** for both the *Production (Unattended)* and *Testing* fields, then click the **Update** button.
![]()
Step 7: Now return to the newly created folder, choose the **Machines** menu, and click **Manage Machines in Folder** button to assign the machine you just configured to the folder.
![]()
![]()
![]()
You should now have both the **User** and **Machine** assigned to the new folder.
![]()
Step 8: Open **UiPath Assistant** and click **Sign In**. If you see the **green circle** in the top right corner, you’ve successfully connected your local *UiPath Studio* to the *UiPath Automation Cloud*.
![]()
You can confirm the connection by opening UiPath Studio and checking for a green circle at the bottom.
![]()
***
To **publish a process** from UiPath Studio to Orchestrator, **switch to the new folder** you just created in the Orchestrator, and then click to **Publish the process** as a package.
![]()
To learn more about other best practices on Orchestrator, please refer to the [Orchestrator User Guide](https://docs.uipath.com/orchestrator/standalone/2023.4/user-guide/introduction).
## **Usage**
To run the RPA workflow on your local machine, follow these steps:
1. Either **download** this repository to your local machine or **clone** it directly within your UiPath Studio.
2. Open the **UiPath Studio** software on your machine.
3. Locate and **open** the **Main.xaml** file from the downloaded repository in **UiPath Studio**.
4. **Run** the **Main.xaml** file to initiate the web scraping process.
## **Acknowledgement**
I would like to express my gratitude to the **[UiPath community](https://community.uipath.com/)** for providing resources, tutorials, and a platform for automation enthusiasts to learn and collaborate.
## **License**
This project is licensed under the [MIT License](https://choosealicense.com/licenses/mit/), which means you're free to modify, distribute, and use the code in your own projects.