Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Firecrawl URL Importing is throwing error #280

Closed
3 of 6 tasks
ukiras123 opened this issue Sep 10, 2024 · 5 comments
Closed
3 of 6 tasks

Firecrawl URL Importing is throwing error #280

ukiras123 opened this issue Sep 10, 2024 · 5 comments
Labels
bug Something isn't working investigating Bugs that are still being investigated whether they are valid

Comments

@ukiras123
Copy link

ukiras123 commented Sep 10, 2024

Description

Installation

  • pip install goldenverba
  • pip install from source
  • Docker installation

If you installed via pip, please specify the version:

Weaviate Deployment

  • Local Deployment
  • Docker Deployment
  • Cloud Deployment

Configuration

Reader:
Chunker:
Embedder:
Retriever:
Generator:

Steps to Reproduce

Try to import using Firecrawl URL. It is throwing error:
ℹ FileStatus.ERROR | 2024-09-10T04:36:25.979Z | Import for New
Firecrawl Job failed: Reader Firecrawl failed with: 1 validation error for
FileConfig metadata Field required [type=missing, input_value={'fileID':
'2024-09-10T04... settings', 'took': 0}}}, input_type=dict] For further
information visit https://errors.pydantic.dev/2.9/v/missing | 0

Additional context

It is not working with any website I tried. Firecrawl has also updated its version so we might want to do that as well.

@ukiras123
Copy link
Author

ℹ FileStatus.ERROR | 2024-09-10T04:36:25.979Z | Import for New
Firecrawl Job failed: Reader Firecrawl failed with: 1 validation error for
FileConfig metadata Field required [type=missing, input_value={'fileID':
'2024-09-10T04... settings', 'took': 0}}}, input_type=dict] For further
information visit https://errors.pydantic.dev/2.9/v/missing | 0

@thomashacker
Copy link
Collaborator

Thanks for the issue! We'll look into it

@thomashacker thomashacker added the investigating Bugs that are still being investigated whether they are valid label Sep 13, 2024
@narmaku
Copy link

narmaku commented Nov 18, 2024

Same issue while trying to import github repo that contains python code.

@thomashacker
Copy link
Collaborator

Thanks a lot, there was a bug in the Firecrawl Reader code, should be fixed in the upcoming version 2.1

@thomashacker thomashacker added the bug Something isn't working label Dec 9, 2024
@thomashacker
Copy link
Collaborator

Version 2.1 got released, let me know if the error still persists and feel free to reopen

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
bug Something isn't working investigating Bugs that are still being investigated whether they are valid
Projects
None yet
Development

No branches or pull requests

3 participants