-
Notifications
You must be signed in to change notification settings - Fork 0
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Images missing for multiple NHMD Pinned Insects & Herbarium exports #113
Comments
Images for this export were found:
Still missing images for:
|
Images for both exports have now been located. However, I am having trouble locating images for:
|
I'm also having trouble finding images to match NHMD Herbarium exports from July. I've now got a script running to match barcodes to GUIDs in a database. It will take a few days to go through all the folders and add everything to the database but once it's finished, we should be able to see where all of the images are (assuming they exist on the N drive.) |
I am still finding exports that I cannot locate matching images for. These were all from the end of July 2024. Pip suggested I ask Khaled and/or Bhupjit if they have any ideas about where these images could be. After gaining their insight, I will either:
It's possible that these specimens may need to be re-imaged. |
I was able to resume using an old script that matches image GUIDs with their respective barcodes in a database table. It will take a while to go through all the folders but already I've located some of the missing images using the database. Additionally, Rebekka found some images on the WORKHERB0003 workstation from July and August that were not ingested due to errors. She is ingesting them now. Hopefully some of the missing herbarium images are in that set. |
Still missing at least some images from the following exports: -NHMD_Herba_20240724_14_31_SS_JMJ (some images ingested 8/6 at herb0003 but not all) |
Bhupjit suggested we chat about this with Khaled next Thursday when we're all in the office. |
I'm still updating the databases with the barcodes and guids. Hopefully it won't take much longer to get those completely up-to-date. I modified the original barcode-guid matching script to first check if the guid already exists in the database before it processes the image, so that's sped things up a bit. I will need to manually enter in some barcodes because the processing package doesn't catch all of them. Once all of that is done, I can use another script I'm working on to check all the barcodes from DigiApp exports against the barcode-guid databases. That should return a full list of missing images. Here's the task list to make it easier for me to see where I'm at:
|
I've manually updated the dbs for NHMD pinned insects and run the script for the date range July 1, 2024 - October 1, 2024 and it looks like the process works. I discovered two barcodes that don't have matching images. I'm now working on NHMD Herbarium. Once I've focused on this initial date range, I'll extend to check all of 2024 (and possibly 2023 as well.) |
I've now got a list of over 1,000 specimens from the date range July 1 - October 1, 2024 that have no associated images. There were clearly a few folders that did not get ingested from the end of July. Now I need to match up these barcodes with their locations from the DigiApp exports so I can get a workable list together for the digitizers. I also need to find out if we have a re-imaging policy in place already or if that is still in development. |
You can follow the progress of this in the future here: #159 |
This list has been added to the QA image issues spreadsheet so all current re-imaging issues are in one place. |
Description:
While checking exports for NHMD Pinned Insects, I came across a few exports that I couldn't easily find the images for. After some research, it seems these specimens were digitized while the ingestion client was producing errors and not correctly ingesting images. I read through the slack messages (channel: ingestion_client_users) concerning the issue, and it sounds like the images were all successfully ingested July 29 - 30, 2024. Looking through those image folders, I was able to find some but not all of the images. I am still unable to locate images for the following exports:
The first export has some data in it that needs to be checked against the image so I'm unable to proceed with that until we locate the images.
Next steps:
I need to ask Khaled and Bhupjit if they have any leads as to where the images could be.
The text was updated successfully, but these errors were encountered: