Edgar Scrapper is a python web scraper program that parses fund holdings pulled from EDGAR. The program runs these following steps to retrieve and store the data:
-
Enters in a specific CIK on the main page and navigates to the Search Results page.
-
Finds the most recent document that has a filing that contains "13F" and navigates to the Filing Detail page (From clicking 'Documents').
-
Parses the XML table and generates a tab-delimited .csv file using that data.
Edgar Scrapper was built using Python 2.7.10 and Scrapy 1.6.0.
- Assuming you've installed Python from the above link, in your console, navigate to the app's folder and run
sudo pip install -r requirements.txt --ignore-installed six
to install all dependencies. - To run the program, run
scrapy runspider edgar_scrapper.py
. - This will generate a .csv file in the project folder based on the CIK specified in edgar_scrapper.py (Default is 0001166559).
WilfredRuck - Public code repos