PyDocX is a tool that can export MS Word documents (Office Open XML) into different markup languages. Currently, only HTML is supported. You can extend any of the available exporters to customize it to your needs. This includes extending the base exporter to add support for a markup language or format that is not supported.
To get started using PyDocX, see the Usage guide and also Extending PyDocX.
Want to help save science? Want to get paid to develop free, open source software? Check out our openings!