Ecosyste.ms: Awesome
An open API service indexing awesome lists of open source software.
https://github.com/ZachLiuGIS/stock_extractor
General Purpose Stock Extractors from Online Resources
https://github.com/ZachLiuGIS/stock_extractor
Last synced: about 2 months ago
JSON representation
General Purpose Stock Extractors from Online Resources
- Host: GitHub
- URL: https://github.com/ZachLiuGIS/stock_extractor
- Owner: ZachLiuGIS
- Created: 2016-02-02T15:19:10.000Z (almost 9 years ago)
- Default Branch: master
- Last Pushed: 2022-12-26T20:00:47.000Z (about 2 years ago)
- Last Synced: 2024-11-04T23:37:55.574Z (about 2 months ago)
- Language: Python
- Size: 20.5 KB
- Stars: 34
- Watchers: 5
- Forks: 4
- Open Issues: 4
-
Metadata Files:
- Readme: README.rst
Awesome Lists containing this project
- awesome-quant - stock_extractor - General Purpose Stock Extractors from Online Resources. (Python / Data Sources)
- awesome-quant - stock_extractor - General Purpose Stock Extractors from Online Resources. (Python / Data Sources)
README
*****
Stock Extractor
*****This package includes a series of stock data extractor class from a few widely used sources, such as Yahoo Finance,
Barchart.com, etc.=====
Installation
=====``pip install stock_extractor``
The package has the following dependencies:
* requests
* pandas
* beautifulsoup4=====
USAGE
=====The package currently has four extractor class
* SP500Extractor
It is used to extract stock data about S&P 500 companies
* YahooFinanceInfoExtractor
It is used to extract latest stock data and information from Yahoo Finance
* YahooFinanceHistoryQuoteExtractor
It is used to extract historical quote data for stocks from Yahoo Finance
* YahooFinanceDivExtractor
It is used to extract historical dividend data for stocks from Yahoo Finance
SP500Extractor
-----.. code-block:: python
# import extractor class
from stock_extractor import SP500Extractorextractor = SP500Extractor()
# get_sp500_symbol_list() returns all SP500 company symbols as a list
sp500_symbols = extractor.get_sp500_symbol_list()# get_sp500_data_by_type(type) retrieves sp500 company stock infomation and store the result in a pandas dataframe
# type can be 'main', 'technical', or 'performance'
# 'main' includes fields: 'Symbol', 'Name', 'Last Price', 'Change', 'Percent', 'High', 'Low', 'Volume', 'Time'
# 'technical' includes fields: 'Symbol', 'Name', 'Last Price', 'Opinion', '20D-Strength', '20D-Volty', '20D-AVol', '52W-Low', '52W-High'
# 'performance' includes fields: 'Symbol', 'Name', 'Last Price', 'Weighted-Alpha', 'YTD-Pct', '1Month-Pct Change', '3Month-Pct Change', '1Year-Pct Change'
extractor.get_sp500_data_by_type('technical')# get_sp500_full_data() will retrieve all three types of data and combine them into the dataframe
extractor.get_sp500_full_data()# get_dataframe() will return the dataframe that stores retrieved data
extractor.get_dataframe()# save_to_csv(filepath) will save the dataframe as a csv file
extractor.save_to_csv('sp500_data.csv')YahooFinanceInfoExtractor
-----This extractor class extract latest stock data and stock information on Yahoo Finance
The fields that can be extracted from this class include:'Ask', 'AvgDVol', 'AskSize', 'Bid', 'AskRealTime', 'BidRealTime', 'BookValue', 'BidSize', 'Change&Pct', 'Change',
'Commission', 'ChangeRealTime', 'AfterHourChangeRealTime', 'Dividend', 'LastTradeDate', 'TradeDate', 'EPS',
'ErrorIndication', 'EPSE_CurrentYear', 'EPSE_NextYear', 'EPSE_NextQuarter', 'FloatShares', 'D-Low', 'D-High',
'52W-Low', '52W-High', 'HoldingsGainPercent', 'AnnualizedGain', 'HoldingsGain', 'HoldingsGainPercentRealTime',
'HoldingsGainRealTime', 'MoreInfo', 'OrderBookRealTime', 'MarketCap', 'MarketCapRealTime', 'EBITDA',
'ChangeFrom52W-Low', 'PctChangeFrom52W-Low', 'LastTradeRealTime', 'PctChangeRealTime', 'LastTradeSize',
'ChangeFrom52W-High', 'PctChangeFrom52W-High', 'LastTradeWithTime', 'LastTradePrice', 'HighLimit',
'LowLimit', 'DayRange', 'DayRangeRealTime', '50MA', '200MA', 'ChangeFrom200MA', 'PctChangeFrom200MA',
'ChangeFrom50MA', 'PctChangeFrom50MA', 'Name', 'Notes', 'Open', 'PreviousClose', 'PricedPaid', 'PctChange',
'Price/Sales', 'Price/Book', 'Ex-DividendDate', 'P/E', 'DividendPayDate', 'P/E_RealTime', 'PEG',
'P/E-EstCurrentYear', 'P/E-EstNextYear', 'Symbol', 'SharesOwned', 'ShortRatio', 'LastTradeTime', 'TradeLinks',
'TickerTrend', '1YrTarget', 'Volume', 'HoldingsValue', 'HoldingsValueRealTime', '52W-Range', 'DayValueChange',
'DayValueChangeRealTime', 'StockExchange', 'Yield'Example:
.. code-block:: python
# import extractor class
from stock_extractor import YahooFinanceInfoExtractorextractor = YahooFinanceInfoExtractor()
# read a list of symbols from txt file.
# extractor.set_symbol_list(symbol_list) can set symbol list as python list
extractor.read_symbol_list_from_txt('../sample_data/sample_symbol_list.txt')# set which fields are included in extraction
extractor.set_field_list([
'Symbol', 'LastTradePrice', 'LastTradeDate', 'LastTradeTime', 'D-High', 'D-Low', '52W-High', '52W-Low',
'50MA', '200MA', 'PctChangeFrom50MA', 'PctChangeFrom200MA', 'EBITDA', 'MarketCap',
'Dividend', 'Yield', 'EPS', 'P/E', 'PEG', 'Price/Sales', 'Price/Book', 'Name'
])# extract data from Yahoo Finance
extractor.load_yahoo_data()# save the result in a csv file
# you can call extractor.get_dataframe() to return the result as pandas dataframe
extractor.save_to_csv('../output/sample_stock_info.csv')YahooFinanceHistoryQuoteExtractor
-----This extractor extract historical quote data for the input symbol list for a time span
Example:
.. code-block:: python
# import extractor class
from stock_extractor import YahooFinanceHistoryQuoteExtractorextractor = YahooFinanceHistoryQuoteExtractor()
# set start and end date for extraction
extractor.set_end_date('2016-01-01')
extractor.set_start_date('2015-01-01')# set symbol list, you can also set this from a txt file, see example above
extractor.set_symbol_list(['CAT', 'SPLS', 'ETP', 'HCP', 'T'])# the method that actually extracts data from Yahoo Finance
extractor.load_data_by_symbol_list()# filter out other fields, leave only adj price here since this field is usually what people need.
extractor.get_adj_price_only_dataframe()#return the result as a pandas dataframe
extractor.get_dataframe()# save the result as csv file
extractor.save_to_csv()YahooFinanceDivExtractor
-----This extractor extracts historical dividend data from Yahoo Finance.
The api is very similar to the previous one.Example:
.. code-block:: python
# import extractor class
from stock_extractor import YahooFinanceDivExtractorextractor = YahooFinanceDivExtractor()
extractor.set_end_date('2016-01-01')
extractor.set_start_date('2001-01-01')
extractor.read_symbol_list_from_txt('../sample_data/sample_symbol_list.txt')
extractor.load_data_by_symbol_list()
extractor.get_dataframe()
extractor.save_to_csv()TEST
=====run this code
``$ python -m unittest discover``
CONTACT
=====The package is created by Zach Liu. Please send email to [email protected] if you have questions or comments.
LICENCE
=====MIT