-
-
Notifications
You must be signed in to change notification settings - Fork 3.4k
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Task list (e.g. for GSOC) #1169
Comments
Here is a list of projects that could be done in pandoc:
Also worth considering whether it would help to use my custom text-based parser combinators from cheapskate (perhaps with some amplifications) instead of the slower parsec. |
great! |
How about a PDF reader/writer pair for pandoc? The writer could be written with HPDF. The reader would strip stuff it can't understand, but try to keep text, headings, images, and the like. I've been (minimally) working on a PDF reading library already, and this looks like a great application. |
+++ Kyle Raftogianis [Mar 13 14 13:56 ]:
Interesting idea. My worry is that functionality would be too limited |
I see your point. PDFs are for rendered documents, not for markup. However, PDF documents still store metadata, and can store outlines (which can be turned into section headings) and "article threads", which describe how the text is connected into sections. It definitely needs more thought, though. |
type setting math at some level could done, it just might require having access to CM font or something |
though that might get out of scope of whats safe for a gsoc project. |
On Thu, Mar 13, 2014 at 10:01 PM, John MacFarlane
It's not possible to answer that in a generic way. From some PDFs you Good chances to get most of the structure are from "tagged"[*] PDFs and [*] If you do not know about "tagged" PDF just try to imaging a lot of |
but for the writer there should be a way right? |
I think the LaTeX writer offers much more than a PDF writer would. I don't think this idea would be very feasible. Thanks for the comments! |
I am in the process of writing a proposal for adding an EPUB reader. |
1,5,7,8,9,10 are still open from this list for anyone finding it. |
and some would probably make great GSOCs! |
Closing. Opened new list at #1852. |
tasks that might be suitable for part of a GSOC for some lucky student!
The text was updated successfully, but these errors were encountered: