You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
I have searched both the documentation and discord for an answer.
Question
in this code when i ran:
vanilaParsing = LlamaParse(result_type="markdown").load_data(
"./expense_report_document.pdf"
)
it returns empty dictionary on metadata
The text was updated successfully, but these errors were encountered:
To add metadata when parsing a document using LlamaParse, ensure that the metadata is correctly set up in the Document object and that the extractors are properly configured to populate the metadata fields. Here's how you can do it:
Define the Document with Metadata: When creating a Document, include metadata directly in the constructor. This metadata will be available for extraction and processing.
Use Extractors to Populate Metadata: Ensure that your extractors are set up to extract and populate metadata fields. The extractors should process the document text and update the metadata.
Check Metadata Configuration: If the metadata is still empty, verify that:
The document text is suitable for extraction.
The extractors are correctly configured and capable of extracting the desired metadata.
The run_transformations function is correctly implemented to apply these extractors and update the metadata.
Advanced Metadata Customization: Customize how metadata is included in the document for LLM and embedding models. For example, exclude certain metadata keys from being visible to the LLM or embedding model.
By following these steps and ensuring proper configuration, the metadata should be populated as expected when parsing documents with LlamaParse [1][2].
Question Validation
Question
in this code when i ran:
vanilaParsing = LlamaParse(result_type="markdown").load_data(
"./expense_report_document.pdf"
)
it returns empty dictionary on metadata
The text was updated successfully, but these errors were encountered: