Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Mapping subjects #23

Closed
banukutlu opened this issue Oct 3, 2018 · 14 comments
Closed

Mapping subjects #23

banukutlu opened this issue Oct 3, 2018 · 14 comments
Assignees
Labels

Comments

@banukutlu
Copy link
Contributor

banukutlu commented Oct 3, 2018

https://github.com/psu-libraries/psulib_blacklight/wiki/Subject-Fields-Index-and-Display

@banukutlu banukutlu changed the title Index subjects for display Index subjects Dec 12, 2018
@banukutlu banukutlu changed the title Index subjects Index subjects for search and display Dec 12, 2018
@banukutlu banukutlu changed the title Index subjects for search and display Mapping subjects Dec 12, 2018
@banukutlu
Copy link
Contributor Author

banukutlu commented Jan 3, 2019

@ruthtillman:

  1. do we want to display a subject facet on the sidebar?
  2. for subject searching do we want to boost more on specific subjects for example primary subject having more boost than additional subjects? Or if not, just having one subject field for search with all the listed MARC fields is sufficient?
    Below mappings what we had originally:
    • Primary subject: 600#{ATOU}:610#{ATOU}:611#{ATOU}:630#{ATOU}:650abcde:651ae:653a:654abcde:655abc
    • Additional subjects: 600vwxyz:610vwxyz:611vwxyz:630vwxyz:650vwxyz:651vwxyz:654vwxyz:655vwxyz
    • Subject Topic Facet:600|*0|abcdq:610|*0|ab:611|*0|ab:630|*0|ab:650|*0|a:653|*0|a
  3. Could you add which subfields we will be using for each field for searching in the documentation? Same as display - 650|*0|abcdvxyz:650|*2|abcdvxyz:650|*1|abcdvxyz:650|*3|abcdvxyz:650|*6|abcdvxyz:650|*7|abcdvxyz:600abcdfklmnopqrtvxyz:610abfklmnoprstvxyz:611abcdefgklnpqstvxyz:630adfgklmnoprstvxyz:647acdgvxyz:648avxyz:651avxyz?

@banukutlu banukutlu added the question Further information is requested label Jan 4, 2019
@ruthtillman
Copy link
Collaborator

  1. Yes, and/but let's make it 650|*0|aa:650|*0|x:650|*1|aa650|*1|x:651|*0|a651|*0|x:600abcdtq:610abt:610x:611abt:611x: for the foreseeable future. This is based on what's in the original traject subject_topic_index and what we're indexing for display

@ruthtillman
Copy link
Collaborator

  1. Yes, let us value Primary Subject more highly than other subjects.

  2. I added a column to the table.

@banukutlu banukutlu removed the question Further information is requested label Jan 7, 2019
@banukutlu
Copy link
Contributor Author

banukutlu commented Jan 8, 2019

@ruthtillman

  1. How do we label for the sidebar facet? Subject or Topic or sth else?
  2. Are the MARC mappings I pasted for the primary and additional subject fine? Facet one will be changed with what you gave but wanted to double check for the two other?

@ruthtillman
Copy link
Collaborator

  1. Subject
  2. For search, yes I think so.

@banukutlu
Copy link
Contributor Author

banukutlu commented Jan 9, 2019

@ruthtillman

In the wiki, subject fields table does not include 653, 654, 655 and also647 and 648 are listed but these two fields are not used in either subject search fields (primary and additional subjects) but used for subject_display. Also some subfields does not seem to be consistent between the wiki and our mappings right now. So I just want to make sure, should I update the wiki with the mappings or should I change the below mappings accordingly?

Primary subjects: 600#{ATOU}:610#{ATOU}:611#{ATOU}:630#{ATOU}:650abcde:651ae:653a:654abcde:655abc
Additional subjects: 600vwxyz:610vwxyz:611vwxyz:630vwxyz:650vwxyz:651vwxyz:654vwxyz:655vwxyz.

Also I think we should try to keep these two fields inline with subject display. Could you pls review the fields mappings again?

@banukutlu
Copy link
Contributor Author

banukutlu commented Jan 9, 2019

@ruthtillman could you also review the subject facet mapping again? https://github.com/psu-libraries/psulib_blacklight/wiki/Subject-Fields-Index-and-Display#subject-facet

Are subfields correct here: 650|*0|aa?

@banukutlu banukutlu added the question Further information is requested label Jan 9, 2019
@ruthtillman
Copy link
Collaborator

@banukutlu flagging to review first thing tomorrow morning. Of your question and Charlie's today, his was faster to answer.

@ruthtillman
Copy link
Collaborator

@banukutlu one quick answer, the repeated a is correct. It's a little weird but occasionally the data may have that. It won't repeat if more than one a doesn't exist.

@ruthtillman
Copy link
Collaborator

@banukutlu updated wiki. Added fields requested, removed 655 from index for this, also removed ATOU because I didn't like a couple things it would be indexing so made more specific.

@banukutlu banukutlu removed the question Further information is requested label Jan 11, 2019
@banukutlu
Copy link
Contributor Author

banukutlu commented Jan 11, 2019

@ruthtillman an example with the latest mappings:

  • subject_topic_facet_ssim is the sidebar facet
  • subject_tsim and subject_addl_tsim are in keyword search and subject search (along with their unstemmed versions)
  • subject_display_ssm and subject_facet are used for displaying and linking hierarchical subjects

https://blackcat01qa.libraries.psu.edu/catalog/21601671/librarian_view

21601671     subject_addl_tsim         China Shanghai History 20th century. | China Shanghai. | Civilization 20th century. | Shanghai.
21601671     subject_display_ssm       1900-1999 | Photography—China—Shanghai—History—20th century | Modernism (Art)—China—Shanghai | Civilization | Modernism (Art) | Photography | Shanghai (China)—Civilization—20th century | China—Shanghai
21601671     subject_facet             1900-1999 | Photography—China—Shanghai—History—20th century | Modernism (Art)—China—Shanghai | Civilization | Modernism (Art) | Photography | Shanghai (China)—Civilization—20th century | China—Shanghai
21601671     subject_topic_facet_ssim  Photography | History | Modernism (Art) | Shanghai (China) | Civilization
21601671     subject_tsim              1900-1999 | Photography | Modernism (Art) | Civilization. | Photography. | Shanghai (China) | China

banukutlu pushed a commit that referenced this issue Jan 11, 2019
@banukutlu
Copy link
Contributor Author

banukutlu commented Jan 14, 2019

@ruthtillman can you clarify the below paragraph from our subjects doc:

Subject-field specific searches full or partial match. e.g. Quilt in subject search field would return records with Quiltmakers and those with NAMES Project AIDS Memorial Quilt (and others). On the other hand a subject search of African American quiltmakers would return only records which had all those words in subject, hence just ones with subject African American quiltmakers.

@banukutlu banukutlu added question Further information is requested PR and removed PR labels Jan 14, 2019
@ruthtillman
Copy link
Collaborator

@banukutlu I tried clarifying -- does that help?

@banukutlu banukutlu removed the question Further information is requested label Jan 14, 2019
@banukutlu
Copy link
Contributor Author

banukutlu commented Jan 14, 2019

yes, Thank you!

cdmo added a commit that referenced this issue Jan 15, 2019
@banukutlu banukutlu added review and removed PR labels Jan 16, 2019
@banukutlu banukutlu added done and removed done labels Feb 26, 2019
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
Projects
None yet
Development

No branches or pull requests

3 participants