Stars
Community resources for MorphoSource. Discussion board, issue tracker, and public feature roadmap.
8
Updated Jun 26, 2023
The Apache Tika toolkit detects and extracts metadata and text from over a thousand different file types (such as PPT, XLS, and PDF).
This repository shares NARA-created open source software to support federal agencies in their preparation of metadata and permanent electronic records for transfer to NARA.
11
Updated Aug 15, 2023
Example notebooks and tutorials from Constellate, the text analysis service from ITHAKA.