{"id":22404762,"url":"https://github.com/lussierc/stockstoryscraper","last_synced_at":"2025-07-31T17:32:47.917Z","repository":{"id":55642436,"uuid":"294215790","full_name":"lussierc/StockStoryScraper","owner":"lussierc","description":"The Stock Story Scraper (SSS) is a text mining tool that gathers a stock's relevant articles and performs extensive sentiment analysis on them. Fall 2020 Independent Study - Allegheny College.","archived":false,"fork":false,"pushed_at":"2022-02-17T04:34:16.000Z","size":157991,"stargazers_count":2,"open_issues_count":8,"forks_count":2,"subscribers_count":1,"default_branch":"master","last_synced_at":"2023-03-04T07:04:23.519Z","etag":null,"topics":["predict-stock-swings","sentiment","stock-news","stocks","text-mining"],"latest_commit_sha":null,"homepage":"","language":"Python","has_issues":true,"has_wiki":null,"has_pages":null,"mirror_url":null,"source_name":null,"license":null,"status":null,"scm":"git","pull_requests_enabled":true,"icon_url":"https://github.com/lussierc.png","metadata":{"files":{"readme":"README.md","changelog":null,"contributing":null,"funding":null,"license":null,"code_of_conduct":null,"threat_model":null,"audit":null,"citation":null,"codeowners":null,"security":null,"support":null}},"created_at":"2020-09-09T19:57:13.000Z","updated_at":"2023-01-24T22:38:49.000Z","dependencies_parsed_at":"2022-08-15T05:20:25.694Z","dependency_job_id":null,"html_url":"https://github.com/lussierc/StockStoryScraper","commit_stats":null,"previous_names":[],"tags_count":null,"template":null,"template_full_name":null,"repository_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/lussierc%2FStockStoryScraper","tags_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/lussierc%2FStockStoryScraper/tags","releases_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/lussierc%2FStockStoryScraper/releases","manifests_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/lussierc%2FStockStoryScraper/manifests","owner_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/owners/lussierc","download_url":"https://codeload.github.com/lussierc/StockStoryScraper/tar.gz/refs/heads/master","host":{"name":"GitHub","url":"https://github.com","kind":"github","repositories_count":228275004,"owners_count":17895008,"icon_url":"https://github.com/github.png","version":null,"created_at":"2022-05-30T11:31:42.601Z","updated_at":"2022-07-04T15:15:14.044Z","host_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub","repositories_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories","repository_names_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repository_names","owners_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/owners"}},"keywords":["predict-stock-swings","sentiment","stock-news","stocks","text-mining"],"created_at":"2024-12-05T10:15:05.340Z","updated_at":"2024-12-05T10:15:05.966Z","avatar_url":"https://github.com/lussierc.png","language":"Python","funding_links":[],"categories":[],"sub_categories":[],"readme":"# StockStoryScraper [SSS]\n\nScrapes new articles from highly rated stock news websites thru Google News, analyzes them for sentiment among other things, then scores them and provides an overall rating of stock sentiment and well-being. Allows users to get a quick look into how a stock is performing in the news.\n\nFall 2020 Independent Study - Allegheny College.\n\n![SSS Logo](images/sss.png)\n\n## Tool Overview\n\n- Scrapes articles from highly rated stock news websites specified by the user. The user also enters their chosen stocks, corresponding ticker symbols, date range for scraping, and an export file (`.csv`) that can be read back into the tool.\n  - With this, users can read back in their previously exported CSVs of article information to view their results again.\n    - Code to compare results from different runs will be added in future updates.\n- Uses vaderSentiment to perform textual sentiment analysis.\n- Scores the articles, gathers their price information, and generates results pertaining to the stocks overall sentiment feelings and well being.\n- Tool is used via a User Interface (web application) using Streamlit or a Command Line Interface.\n- More features to be added soon.\n\n### What's the point?\n\nIt takes a lot of time to read every available news article about a stock, whether you are a professional trader or an amateur. The tool quickly gathers all the relevant articles from highly rated stock websites in the user's defined date range, then analyzes their textual sentiments. The user can read the articles that were scraped if they choose or look at the numerous graphs on the tool depicting article sentiments and overall stock feelings/performance.\n\nThis saves lots of time and gives users a one stop shop for stock news and the automatic analysis of them.\n\n#### View the Program in Action\n\nHere is an example of the projects web app UI (using Streamlit) in action. The example shows the settings being configured for a new, fresh run of the project:\n\n![Run screen!](images/baserun.png)\n\n## Running the Project\n\nThere are a few ways that users can run the project! These methods include using your own local Python3 installation, Pipenv, or Docker.\n\n### Running with Docker\n\nThe program can be run within a Docker Container using Docker Desktop. For more information on how to install this program, view [this](https://www.docker.com/) resource.\n\nThere are builder scripts for each type of machine. First ensure you are in the `src` directory. To run the `Mac OS` version for instance, you would use the following commands:\n\n1. `sh ./docker/build_macOS.sh` -- builds the container\n2. `sh ./docker/run_macOS.sh` -- enters the container\n3. `python3 run_tool.py` -- run the program\n\n#### OS-specific scripts to build and run containers\nThe following bash scripts simplify building the container.\n\n| OS  | Building  | Running  |\n|---|---|---|\n| MacOS  \t\t|  `./build_macOS.sh` |  `./run_macOS.sh` |\n| Linux   \t|  `./build_linux.sh` | `./run_linux.sh`  |\n| Windows \t|  `build_win.bat` \t\t|  `run_win.bat` |\n\nThese files may be found in the directory, `docker/` and the builder require a copy of `Dockerfile` to run which is in the `src` directory, hence why these command should be run from the `src` directory like in the example above.\n\n### Running with Pipenv\nMake sure Python(3) and Pipenv are installed on your machine. Find information on installing pipenv [here](https://pipenv-fork.readthedocs.io/en/latest/install.html).\n\n#### Pipenv\n\nThe project comes with a `Pipfile` in the `src` directory that will install the necessary packages for the program, making it easy for users with Pipenv to run the project on their machines.\n\nFirst navigate to the `src` directory using `cd src`. Then run the command `pipenv lock` to install the necessary Python packages.\n\nYou can then run the command `pipenv run python3 run_tool.py` to run the program. You will be presented with the option to run either the UI web interface or the Command Line Interface.\n\n### Running with Python\n\nFirst ensure Python and Pip are installed on your machine. Then navigate to the `src` directory.\n\nYou can install the required packages for the project using Pip by running `pip3 install -r requirements.txt` or `pip install -r requirements.txt` depending on your machine's Pip installation.\n\nThen, you can run the program by using the command `python3 run_tool.py`.\n\n*Note:* if you run into Spacy issues while running the program, you may have to run the command `python3 -m spacy download en`.\n\n## Problems, Ideas, or Praise\n\nPlease leave an issue in the Issue Tracker if you encounter errors, have ideas, or anything of the like!\n\n## Future Work\n\nView the Issue Tracker to see future tasks that will be completed in the near future.\n","project_url":"https://awesome.ecosyste.ms/api/v1/projects/github.com%2Flussierc%2Fstockstoryscraper","html_url":"https://awesome.ecosyste.ms/projects/github.com%2Flussierc%2Fstockstoryscraper","lists_url":"https://awesome.ecosyste.ms/api/v1/projects/github.com%2Flussierc%2Fstockstoryscraper/lists"}