Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Extract Editable Text from an ODT #5

Open
orcmid opened this issue Feb 15, 2021 · 0 comments
Open

Extract Editable Text from an ODT #5

orcmid opened this issue Feb 15, 2021 · 0 comments
Labels
task actions requirement to accomplish a particular task

Comments

@orcmid
Copy link
Owner

orcmid commented Feb 15, 2021

Using an ApacheOpenOffice ODT file, convert it to editable text files using pandoc. Demonstrate and make reproducible.

  1. Determine what happens with the document pages and the images.
  2. See how to obtain finer-grain text pages from the conversion so that they can be edited easily.
  3. Resolve how cross-referencing is reflected and preserved also.

Do these until we have a decent assessment of how an ODT file can be preserved enough but made into suitable editable text forms.

Create a reproducible case in the repository where some can perform the same operations.

Provide something about how to install pandoc and how its usage fits here. Screen capture the command-line operation.

@orcmid orcmid added the task actions requirement to accomplish a particular task label Feb 15, 2021
@orcmid orcmid changed the title Extract Editable Text from an ODFT Extract Editable Text from an ODT Feb 15, 2021
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
task actions requirement to accomplish a particular task
Projects
None yet
Development

No branches or pull requests

1 participant