Subject strings concatenated #1449
Comments
Something like
will look at all the Items and Theses and un-concatenate Subjects. This assumes that there aren't any

I could prepare this as a one-off rake task (in master or integration_postmigration) or run it in the Rails console. I'm not certain what we've done in the past, or what we want to do in the future. I would appreciate some guidance or history from @mbarnett and @weiweishi.
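As a rough illustration of the un-concatenation step, here is a minimal sketch in plain Ruby. The helper name `deconcatenate_subjects` and the comma separator are assumptions made for this example only; the actual separator in the affected records, and how the values are read from and written back to Items and Theses, would need to be checked against the real data.

```ruby
# Hypothetical helper, not taken from the project: splits any subject entry
# that holds several terms joined by a separator into individual terms.
# ASSUMPTION: the separator is a comma; the real data may use something else.
def deconcatenate_subjects(subjects, separator: ',')
  subjects
    .flat_map { |subject| subject.split(separator) } # one entry per term
    .map(&:strip)                                    # drop surrounding whitespace
    .reject(&:empty?)                                # ignore blank fragments
    .uniq                                            # avoid duplicate terms
end

# Made-up sample value, for illustration only.
concatenated = ['Ecology, Boreal forest, Climate change']
p deconcatenate_subjects(concatenated)
```

Entries that contain no separator pass through unchanged, so the helper could be applied to every record rather than only to the ones on the attached list.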
Thanks @pgwillia!
@weiweishi said
I gave @henryzhang87 the heads-up that I was working on this. I'll continue with Weiwei's report in mind. My plan is to practice, practice, and then execute.
After chatting with @mbarnett, the solution I'm going to pursue is to create a migration that will run after the Fedora migration. Basically, the steps I'll follow are:
Just sent you the log of the items for review, Tricia.
## Context

In the first part of this data cleanup we created a new column, `deconcatenated_subject`, and copied all subjects into it, cleaning up the ones we had identified. Now we want to replace the subjects column with these values.

#1449 #1627

## What's New

We remove the subject column and rename `deconcatenated_subject` to `subject`. We also clean up the ERA_subjects_issue.txt file and the reference to it in the previous migration.
Describe the bug
Some items in ERA have all of the subject terms concatenated into a single string.
To Reproduce
For example, https://era.library.ualberta.ca/items/0d9000f4-3c7e-4221-9f26-f5faf0a1d1bb
Expected behavior
Each subject word or phrase should be a separate item in the list.
Additional context
A list of impacted items (based on a triplestore query) is attached. @weiweishi This happened to another collection previously and was cleaned up via script, so I'm hoping the same can be done again.
ERA_subject_issue.txt
Thanks!