-
Notifications
You must be signed in to change notification settings - Fork 3
DAT19
Catalogue records from Ethio-Sphere project like http://mycms-vs03.rrz.uni-hamburg.de/domlib/receive/domlib_document_00000015 can be transformed in TEI records to constitute the basic needs for a catalogue records encoding.
all data in DomLib repository.
una tantum XSLT transformation. The XSLT can not only map but in many cases also split and structure information in the coherently entered data.
Needs to add publicationStmt date taking it from the myCore original last modified date (not creation date?)
Institutions should be exported to a separate institution file, containing place references. Coordinates in MyCore refer to the institution and not to the settlement.
homogeneous data in mycore text fields
-
an XSLT has been prepared to take myCore and transform it in valid TEI-all XML catalogue records. this requires two steps because of html/richtext escaped in the XML view of the mycore data. The second step simply cleans from spurious elements the result documents.
-
text in fields is hard coded in html. this means that the parsing is harsh and cannot get to let's say the second part of a field in Amharic rather than in English.
-
in listBibl type mss the information is split but identifiers are given from the text, substituting spaces with underscore. this will need to be fixed by hand, while creating mini records for the mss to which these records point.
-
listBibl type edition and type text carry over editions and bibliographic information about contents, which should go to Work entities rather than in the manuscript. This are thus left behind.
- for institutions @when to settlment/ab[@type='history'] in institutions from EthioSpare
ATTT!!! SETTLEMENT HAS WRONG CONTENT, should go into an element which describes the localization into a different system from the region and nation one
@type
values should be mapped as described here
results are not valid in most cases for minor problems, which need hand fixing. a first stage of revision shall decide if there is more that can be done in the automated conversion or handfixing can start from these files.