https://github.com/ejw-data/excel-vba-webscraping
Excel Application that uses VBA to scrape website directory information and populate sheets.
https://github.com/ejw-data/excel-vba-webscraping
excel vba webscraping
Last synced: about 1 month ago
JSON representation
Excel Application that uses VBA to scrape website directory information and populate sheets.
- Host: GitHub
- URL: https://github.com/ejw-data/excel-vba-webscraping
- Owner: ejw-data
- Created: 2022-06-14T21:42:33.000Z (almost 3 years ago)
- Default Branch: main
- Last Pushed: 2022-08-30T00:29:31.000Z (over 2 years ago)
- Last Synced: 2025-01-22T06:47:22.815Z (3 months ago)
- Topics: excel, vba, webscraping
- Homepage:
- Size: 1.66 MB
- Stars: 1
- Watchers: 2
- Forks: 0
- Open Issues: 0
-
Metadata Files:
- Readme: README.md
Awesome Lists containing this project
README
# excel-vba-webscraping
Author: Erin James Wills, [email protected]

Photo by [Ilya Pavlov](https://unsplash.com/@ilyapavlov?utm_source=unsplash&utm_medium=referral&utm_content=creditCopyText) on [Unsplash](https://unsplash.com/s/photos/web?utm_source=unsplash&utm_medium=referral&utm_content=creditCopyText)
## Overview
So this was not a project I ever anticipated doing but was asked by a coworker to help simplify things for them. During the pandemic, personnel information like office phone numbers and key contacts were changing frequently. Do to the need to work closely with the medical school, I created this simple excel vba webscraper.`Normally, I would do something like this in Python but the people who would utilize this would only feel comfortable with Excel.`
Overall, this accomplishes the job but no extra time was spent making it a nice looking spreadsheet. The person needing the information could quickly format it to their liking but this eleminated the need for copying and pasting content.
## Technologies
* Excel: VBA, HTTP Library
## Data Source
This was last tested in 2021 on the Feinberg website. It was used for about two years. Images of the content being scraped is below. The website used tabs to differentiate organizations and also used HTML Cards to group the data. The individual organizations had quite a bit of differences in their organization structure and data provided. The webscrape obtained about 90-95% of the necessary data.

## Installation Instructions
1. Clone the repo
1. Open the macro-enabled spreadsheet (.xlsm)
1. Accept any Security Warnings
1. Activate the HTTP library by going to:
1. File > Options > Add-Ons
1. Add a Checkmark next to HTTP Library
1. Make the Devloper tab visible by going to:
1. File > Options
1. Select from the list of folders, Developer
1. To run the program, click on the Develop Tab
1. Click the Macros button
1. Select the function and click run.
** The sheets (tabs) will start populating.