HTML Symbols not correctly parsed in 1.16? TexMath / HTML issue!? #2734
Tinkerer-
commented
Tinkerer-
changed the title from HTML Symbols not correctly parsed in 1.16? to HTML Symbols not correctly parsed in 1.16? TexMath / HTML issue!
Tinkerer-
changed the title from HTML Symbols not correctly parsed in 1.16? TexMath / HTML issue! to HTML Symbols not correctly parsed in 1.16? TexMath / HTML issue!?
Jmuccigr
commented
On a Mac here, pandoc 1.16.0.2. I get an ë for this:
This is a ë test.
EDIT: this was on the command line.
Strange! I'm seeing the same thing on
http://johnmacfarlane.net/babelmark2/?normalize=1&text=Hdin+in+a+%26euml%3B+test%0A
and
http://pandoc.org/try/?text=This+is+a+%26%23333%3B+test%0A&from=markdown&to=html
however, trying from the command line (with both pandoc 1.16
and the current dev version), I get proper output.
I don't see any changes between 1.16.0.2 and current dev
that would have affected this, but it may have to do with
changes in a dependent library.
This needs further investigation.
+++ Tinkerer- [Feb 22 16 03:59 ]:
…
The linux build that gets incorrect output is compiled
against tagsoup-0.13.3. I'm guessing that's the issue.
See the tagsoup changelog:
0.13.6 #28, some named entities require a trailing semicolon
I will raise the tagsoup lower bound to prevent linking
against buggy versions of tagsoup.
+++ John Muccigrosso [Feb 22 16 08:58 ]:
…
Tinkerer-
commented
Great, that fixed it! Thanks for the quick reply and for all the work on Pandoc in general!
Sign up for free
to join this conversation on GitHub.
Already have an account?
Sign in to comment
Since I've upgraded to pandoc 1.16 I have an issue with HTML characters. For example,
ë
should become ë but instead pandoc decides to interpret it, and make it ë (&$euml;
).Example:
From previous pandoc versions I would have expected the output to have been:
I'm using pandoc 1.16.0.2, built by cabal.
Edit: when I'm using the Windows installed pandoc (also 1.16.0.2), pandoc behaves as expected!
Edit2: The difference between my Windows intalled pandoc ("good") and the cabal version ("bad"), as indicated by a diff of the version output is that the Windows installed version uses texmath 0.8.4.1, where the version compiled by Cabal uses texmath 0.8.4.2