Projects
Open Source Projects
- datamule - Work with SEC data at scale
- doc2dict - Convert documents (HTML, XML, PDF, etc.) into dictionaries
- datamule-data - Up-to-date data files for datamule, refreshed using GitHub Actions
- datamule-indicators - Automatically updating indicators generated from SEC data
- txt2dataset - Convert text into datasets
- secsgml - Parse SEC SGML efficiently
- secxbrl - Fast, lightweight parser for SEC Inline XBRL
Papers & Articles
- Managerial Differentiation - Forthcoming
- Proposed System Architecture for Datamule
- High Speed Algorithmic Document Parsing
- Putting Institutional Holdings in a Data Warehouse
- How to host the SEC Archive for $20/month
- Creating Structured Datasets from SEC filings
- Deploy a Financial Chatbot in 5 Minutes