Update taxonomy information for box 1-285 in Specify #117
Comments
The issue "Resolve Infraspecies spreadsheet notes before import" (#114) was resolved by deciding to leave the taxonomy comments in for now and resolve them after the import into Specify. This means that the "comments" column also needs to be imported, not just the author names, subspecies, hybrids, etc.
The related issues have been resolved, and the updated and cleaned file can be found here: "N:\SCI-SNM-DigitalCollections\DaSSCo\Workflows and workstations\Herbarium\Infraspecies spreadsheet\Infraspecies_table_filled_in.xlsx". The columns "subspecies _old" and "variety_old" refer to information that is already in Specify, in case this is important to distinguish. The new information that needs to be imported is "Subspecies", "Subspecies_Author", "Variety", "Variety_Author", "Forma", "Forma_Author", "Hybrid_parent_1", "Hybrid_parent_1_Author", "Hybrid_parent_2", "Hybrid_parent_2_Author" and "Comment". The table also includes the Collection Object ID and current taxon ID for each specimen.
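As a rough illustration of separating the new columns from the "_old" ones, here is a minimal Python sketch. It assumes each spreadsheet row has already been read into a plain dict keyed by column header, with blank cells as `None` or empty strings; the helper name `new_values` is hypothetical and nothing here touches Specify.

```python
# Column names exactly as listed in the comment above.
NEW_COLUMNS = [
    "Subspecies", "Subspecies_Author",
    "Variety", "Variety_Author",
    "Forma", "Forma_Author",
    "Hybrid_parent_1", "Hybrid_parent_1_Author",
    "Hybrid_parent_2", "Hybrid_parent_2_Author",
    "Comment",
]


def new_values(row):
    """Return only the new-information columns that are actually filled in.

    `row` is assumed to be a dict of column header -> cell value.
    Columns outside NEW_COLUMNS (e.g. the "_old" ones) are ignored.
    """
    result = {}
    for column in NEW_COLUMNS:
        value = row.get(column)
        if value is None:
            continue  # blank cell
        text = str(value).strip()
        if text:
            result[column] = text
    return result
```

Calling `new_values(row)` on each row gives the subset of fields that would need to be written, which also makes it easy to skip specimens with nothing new to import.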
I had a chat about this with Fedor, and he confirmed that there's no way to update records via workbench, which means we have two options:
To use the API, we'll need to put together a script. This is going to require quite a bit of legwork, as I'll need to test the API calls and figure out all the primary and foreign keys, what to do about validation, etc. Bhupjit has already sent me a list of resources for playing around with this, which I'm tracking here: NHMDenmark/Projects/DaSSCo digitisation data/Research Specify API.
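Before any API calls are worked out, part of the validation could be done on the spreadsheet side. The following is only a sketch under assumptions: rows are plain dicts keyed by column header, the ID column header `CollectionObjectID` is a placeholder (the actual header in the table is not stated here), and the two checks shown are examples rather than an agreed rule set.

```python
# Name columns that have a matching "<name>_Author" column in the table.
RANK_COLUMNS = ["Subspecies", "Variety", "Forma", "Hybrid_parent_1", "Hybrid_parent_2"]


def validate_row(row, id_column="CollectionObjectID"):
    """Return a list of human-readable problems for one spreadsheet row.

    `id_column` is a hypothetical header name; adjust it to the real one.
    An empty list means the row passed these (illustrative) checks.
    """
    problems = []
    # Every row needs an ID so the update can be matched to a specimen.
    if not str(row.get(id_column) or "").strip():
        problems.append(f"missing {id_column}")
    # An author without the corresponding name is probably a data-entry slip.
    for rank in RANK_COLUMNS:
        name = str(row.get(rank) or "").strip()
        author = str(row.get(rank + "_Author") or "").strip()
        if author and not name:
            problems.append(f"{rank}_Author is set but {rank} is empty")
    return problems
```

Running this over all rows and reviewing the non-empty results would give a dry-run report without writing anything to Specify.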
Since Joaquim is already working on a script to update records in Specify via the API (for the transcription app), I can piggyback off his efforts. I talked to him briefly about it on Slack and asked if he knew when that part would be ready. Here was his response: "I have not started working on it yet, but that's the plan. I should start working on it in a couple weeks, depending on how much changes will be needed on the transcription platform." Pip says this can wait, as it's lower priority than keeping digitization going and developing the new data pipeline for AU.
At the beginning of digitizing at Herbarium C, no author names or taxon information below species level (hybrids, subspecies, variety, forma) were recorded. This only started with box 286, so this information needs to be added to all entries from boxes 1-285.
For this, a spreadsheet was filled in with the author information for each taxon (#74).
Another sheet was filled in for any further taxonomic information (hybrids, subspecies, etc.) (#91).
The information from these two spreadsheets is now collected in a large table located on the N-Drive: "N:\SCI-SNM-DigitalCollections\DaSSCo\Workflows and workstations\Herbarium\Infraspecies spreadsheet\Infraspecies_table_filled_in.xlsx"
Before this information can be uploaded to Specify, some last issues need to be resolved:
Once these issues are resolved, we can plan how to import the missing information into Specify.