Speech to text UI for media objects #1527

peetucket · 2024-10-11T22:16:12Z

Why was this change made? 🤔

Fixes #1524 - allows the user to start speech to text for media object

~~HOLD for integration tests on stage~~

Also:

allows for OCR or speech to text or both to be enabled in settings by changing how the feature flag works from doing it in both the view on the server side and in the stimulus controller, to just doing it in JS on the client side in the stimulus controller (by passing in the settings value to the stimulus controller and letting it handle it)
sets the workflow context as needed for both cases
changes the setting of workflow context variables so that they are only sent went true. This is a change from current pre-assembly where it always sends workflow variables, even if the values are false. This sends a lot of unnecessary workflow variables (ie we don't need to send runOcr if it's false, that's the default). See https://github.com/sul-dlss/common-accessioning/blob/main/lib/dor/text_extraction/ocr.rb#L68-L71 for where we use the workflow context to decide if we need to run OCR (nil is that same as false)

Note: shared_configs turns this in just in QA for now for testing.

~~Question:~~
- since media objects always provide a file manifest, we may not need any of the changes in the fileset builder since it will never be used... and the user will be expected to provide sdrGenerated, corrected, etc. attributes for the files they provide

How was this change tested? 🤨

Spec
Integration test on stage

peetucket · 2024-10-11T22:35:04Z

app/javascript/controllers/caption_controller.js

+  ocrEnabled () {
+    return this.data.get('ocr-enabled') === 'true'
+  }
+
+  sttEnabled () {
+    return this.data.get('stt-enabled') === 'true'
+  }


these enable the JS to see the Settings, which are passed into the stimulus controller

peetucket · 2024-10-11T22:36:17Z

app/models/batch_context.rb

@@ -180,7 +180,7 @@ def verify_output_dir_no_exists
  def verify_file_manifest_selected_for_media
    return unless content_structure == 'media' && !using_file_manifest

-    errors.add(:content_structure, 'requires a file manifest.  Please select the checkbox and ensure a file manifest is present.')
+    errors.add(:content_structure, 'requires a file manifest.  Please indicate you have a file manifest and ensure a file manifest is present.')


fixes an existing error message, this control is no longer a checkbox, it is a radio button

peetucket · 2024-10-11T22:37:44Z

app/views/batch_contexts/_new_bc_form.erb

-<div data-controller="globus <%= "caption" if Settings.ocr.enabled%>">
+<div data-controller="globus caption"
+     data-caption-stt-enabled="<%=Settings.speech_to_text.enabled%>"
+     data-caption-ocr-enabled="<%=Settings.ocr.enabled%>">


always enable the caption stimulus controller and always add the controls to the html (but hidden)... we will instead make the settings values visible to the stimulus controller so the JS can do the work of deciding which controls to show/hide as needed based on feature flag settings and user selections

this makes it more flexible and easier to reason with (since it all happens in one place now, in the stimulus controller)

peetucket · 2024-10-15T19:42:51Z

app/lib/pre_assembly/from_staging_location/structural_builder.rb

@@ -10,6 +10,7 @@ class StructuralBuilder
      # @param [String] reading_order
      # @param [Boolean] all_files_public
      # @param [Boolean] manually_corrected_ocr set by user when creating the job
+      # @param [Boolean] ocr_available set by user when creating the job


noticed this missing param documentation

peetucket added 2 commits October 11, 2024 15:31

add speech to text controls

39e7961

let the user select speech to text for media objects

de05e0f

peetucket force-pushed the 1524-start-stt branch from ec1e5d0 to de05e0f Compare October 11, 2024 22:34

peetucket commented Oct 11, 2024

View reviewed changes

peetucket force-pushed the 1524-start-stt branch 2 times, most recently from c136a79 to de05e0f Compare October 15, 2024 18:36

peetucket added 3 commits October 15, 2024 11:38

fix typo

09fcd0e

Merge branch 'main' into 1524-start-stt

989e40f

remove option to select manual corrected speech to text

f3cfdaf

peetucket commented Oct 15, 2024

View reviewed changes

fix schema.rb

d54381a

peetucket marked this pull request as ready for review October 15, 2024 19:58

peetucket changed the title ~~Speech to text UI for media objects~~ [HOLD] Speech to text UI for media objects Oct 15, 2024

peetucket changed the title ~~[HOLD] Speech to text UI for media objects~~ Speech to text UI for media objects Oct 15, 2024

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Speech to text UI for media objects #1527

Speech to text UI for media objects #1527

peetucket commented Oct 11, 2024 •

edited

Loading

peetucket Oct 11, 2024 •

edited

Loading

peetucket Oct 11, 2024

peetucket Oct 11, 2024

peetucket Oct 15, 2024

Speech to text UI for media objects #1527

Are you sure you want to change the base?

Speech to text UI for media objects #1527

Conversation

peetucket commented Oct 11, 2024 • edited Loading

Why was this change made? 🤔

How was this change tested? 🤨

peetucket Oct 11, 2024 • edited Loading

Choose a reason for hiding this comment

peetucket Oct 11, 2024

Choose a reason for hiding this comment

peetucket Oct 11, 2024

Choose a reason for hiding this comment

peetucket Oct 15, 2024

Choose a reason for hiding this comment

peetucket commented Oct 11, 2024 •

edited

Loading

peetucket Oct 11, 2024 •

edited

Loading