Problem with links from docx/odt to markdown #2689
Weird -- they're actually different links in the docx XML file. I can collapse them, but that would only work if they go to the same place. I guess adjacent links to the same place should be collapsed in general. I wonder if this is something about how Word handles French? Anyway, thanks for reporting this -- I'll let you know as soon as I figure out the best way to proceed.
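For illustration, the idea of collapsing adjacent links that share a target could be sketched roughly like this. This is not pandoc's actual code: the `Text` and `Link` types and the `collapse_adjacent_links` function are hypothetical stand-ins for a simplified inline model, and the URLs are made up.

```python
from dataclasses import dataclass
from typing import List, Union


@dataclass
class Text:
    """A plain run of text (hypothetical stand-in for an inline element)."""
    content: str


@dataclass
class Link:
    """A hyperlink with a label and a target URL (hypothetical type)."""
    label: str
    target: str


Inline = Union[Text, Link]


def collapse_adjacent_links(inlines: List[Inline]) -> List[Inline]:
    """Merge runs of neighbouring links that point at the same target."""
    result: List[Inline] = []
    for item in inlines:
        prev = result[-1] if result else None
        if (
            isinstance(item, Link)
            and isinstance(prev, Link)
            and prev.target == item.target
        ):
            # Same destination as the previous link: extend its label
            # instead of emitting a second link.
            result[-1] = Link(prev.label + item.label, prev.target)
        else:
            result.append(item)
    return result


# A sentence-long link split into fragments, the first with an empty label.
fragments = [
    Link("", "https://example.com/a"),
    Link("a link that ", "https://example.com/a"),
    Link("is a sentence", "https://example.com/a"),
    Text(" and "),
    Link("another", "https://example.com/b"),
]
collapsed = collapse_adjacent_links(fragments)
```

Links are only merged when their targets match exactly, so two neighbouring links that go to different places stay separate.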
Actually, when I write the Word document or a Google Doc with links directly, the file is clean and pandoc processes the links correctly.
I know why the file is faulty.
It was originally a copy/paste from a text with links already generated from Markdown (on my Ghost blog). It seems that copying this document creates multiple links where there is supposed to be only one. Maybe the Markdown-to-text conversion of the Ghost platform isn't clean...
Well, the docx reader should be robust against these things anyway, since documents come from all sorts of sources. I have a fix almost ready to push -- just fixing up a couple of function names. Is it okay if I use part of the file above as a test case?
linkmarkdown.docx
Converting this document to Markdown puts a "\n" with an empty text link before each link, and splits the link unevenly when the link is a sentence.
The link is also made italic, which I don't want.
ex :
Note: this docx file was downloaded from a Google Doc.