Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Harvester causes SOLR cloud to crash #3783

Open
1 of 4 tasks
jbrown-xentity opened this issue Apr 15, 2022 · 2 comments
Open
1 of 4 tasks

Harvester causes SOLR cloud to crash #3783

jbrown-xentity opened this issue Apr 15, 2022 · 2 comments

Comments

@jbrown-xentity
Copy link
Contributor

jbrown-xentity commented Apr 15, 2022

When a production CKAN catalog is implemented and indexed fully, the harvesters crash. Not sure yet why, need to investigate further.

Probably related to #3784.

Part of #1342

How to reproduce

  1. Use DB prod backup to restore for catalog
  2. Spin up and reindex catalog
  3. Turn on harvesting

Expected behavior

Harvesting succeeds, solr lives on

Actual behavior

SOLR crashes/fails

Sketch

Need to isolate what is causing the problem, check the following use cases:

@jbrown-xentity
Copy link
Contributor Author

In our clean use case, https://catalog-fxia-datagov.app.cloud.gov/dataset/, 5 harvest sources worked ok (including largest DCAT-US source and several WAF's). When a 6th was added, https://catalog.data.gov/harvest/about/ioos, SOLR crashed. We confirmed the SOLR state is similar as before (no leader available). If so, we have our test case for reproducing and debugging. It harvested > 200 datasets (but probably < 5K) before SOLR crashed.

@nickumia-reisys
Copy link
Contributor

nickumia-reisys commented Apr 18, 2022

The above findings leads me to want to test the following:

  • Does the error occur with CKAN2.8/Solr 8 (Solrcloud Mode)?
    • Yes
    • No
  • Does the error occur with CKAN2.9/Solr 8 (Solrcloud Mode)?
    • Yes
    • No
  • Does the error occur with CKAN2.8/Solr 8 (Standalone Mode)?
    • Yes
    • No
  • Does the error occur with CKAN2.9/Solr 8 (Standalone Mode)?
    • Yes
    • No
  • Does the error occur with CKAN2.8/Solr 6 (Solrcloud Mode)?
    • Yes
    • No
  • Does the error occur with CKAN2.9/Solr 6 (Solrcloud Mode)?
    • Yes
    • No
  • Does the error occur with CKAN2.8/Solr 6 (Standalone Mode)?
    • Yes
    • No
  • Does the error occur with CKAN2.9/Solr 6 (Standalone Mode)?
    • Yes
    • No

**Note: the CKAN 2.8/2.9 differentiator answers the question of it it's a weird encoding issue with the database dump taken from PY2/CKAN2.8 on FCS. If the same harvest source can be reliably harvested from scratch in a PY3/CKAN2.9 context, then the issue is that the data is not compatible between PY2/PY3.

Apparently, the error was not happening with CKAN/Solr 5 (Standalone Mode). We're not sure if this is a byproduct of,

  • Standalone Solr vs. Solrcloud
  • PY2 vs. PY3
  • Solr 5 vs. Solr 6 vs. Solr 8

Without testing all of these cases in order (until the error no longer happens), we won't know what a good configuration is for our application and how to properly choose the right solution. Obviously, we can't go live with anything but Solr 8, but if the problem doesn't exist prior to Solr 8, then we have a bigger Solr problem than if our setup is the issue.

@jbrown-xentity jbrown-xentity changed the title Harvester causes full SOLR to crash Harvester causes SOLR cloud to crash Apr 21, 2022
@btylerburton btylerburton moved this from 📥 Queue to 🧊 Icebox in data.gov team board Oct 10, 2024
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
Status: 🧊 Icebox
Development

No branches or pull requests

2 participants