https://github.com/eshaffer321/bankofamerica-web-scraper

Selenium screen scraper for personal financial data on bankofamerica.com
https://github.com/eshaffer321/bankofamerica-web-scraper

Last synced: 12 months ago
JSON representation

Selenium screen scraper for personal financial data on bankofamerica.com

Host: GitHub
URL: https://github.com/eshaffer321/bankofamerica-web-scraper
Owner: eshaffer321
License: mit
Created: 2019-07-07T02:09:11.000Z (almost 7 years ago)
Default Branch: master
Last Pushed: 2023-05-01T21:14:06.000Z (about 3 years ago)
Last Synced: 2025-02-16T12:01:58.584Z (over 1 year ago)
Language: Python
Homepage:
Size: 28.3 KB
Stars: 9
Watchers: 1
Forks: 5
Open Issues: 2
Metadata Files:
- Readme: README.md
- License: LICENSE

Awesome Lists containing this project

README

# BankofAmerica-Web-Scraper

Selenium web scraper used to pull personal financial data from bankofamerica.com

## About
This web scraper will pull account balances and all transactions from credit and checking accounts. This is meant to be
used with a [node.js server](https://github.com/eshaffer321/BankOfAmerica-2-GoogleSheet-API) which will re-categorize and
insert into a [google sheet](https://docs.google.com/spreadsheets/d/14GYLeWTUBPFWYzXMAJJV4YPmwcsf6vabkQ0-CeHSqHQ/edit#gid=759515713).

## Installing

To install dependencies and do local development, run
```bash
pip install requirements.txt
```

## Usage

To run the program with docker (preferred):
```bash
docker run -d -p 5000:5000 erickshaffer/boa-scraper:latest
```

## Environment variables

This project is meant to be used with a `docker-compose` file located in the
[node.js server](https://github.com/eshaffer321/BankOfAmerica-2-GoogleSheet-API) repository.

### Account File
The account credentials are stored in a json file. If you would like to login even with the security v2 security,
you can provide the security answers in the file. This is not required, but the program will not work if it encounters
these questions while parsing. This file need to located in the directory /app/var/account
```json
[{
"name": "",
"username": "",
"password": "",
"security_questions": {
"What is the name of your first employer?": "",
"What is the street you grew up on": "",
"What is the name of your best friend": ""
}
}]

```

## How it works

This service first logs in, and then start to collect the account balances and overview from the my accounts page. Next,
it will visit all checking and credit cards and start collecting the transaction info. This is the following information
that the program collects:

```
merchant_name
category
date
description
amount
```

Only the transactions from the current month are collected. Currently, the savings scraper isn't implemented. For my use
case I did not have many important transactions in savings. The amounts are still collected in the overview and displayed
in the sheet. If you would like to implement savings, just create another entry in `page.py` and locators in `locator.py`.
To learn more about the page object design pattern, look at [the selenium docs](https://selenium-python.readthedocs.io/page-objects.html)

## Development
### Testing

There is a few tests located in the test directory. These will test basic login functionality, account summary recording,
and a full functional test of the scraper. Please replace the empty strings with your account information to run these tests.

Here is an example run of a full functional test run:
```.env
python test/FullTests.py
```

ecosyste.ms

Data

Tools

Indexes

Applications

Experiments

Awesome

https://github.com/eshaffer321/bankofamerica-web-scraper

Awesome Lists containing this project

README