Skip to content

Some Unicode (or non-7-bit ASCII) characters cause grief #546

Open
@RexJaeschke

Description

@RexJaeschke

a. Any paragraph of text or Hack … -delimited example containing an ellipsis (U+2026), left-double quote (U+201C), or right-double quote (U+201D), will be rendered as a blank line.
b. Em-dash (U+2014) and en-dash (U+2013) cause text to be swallowed up with no output.
c. I have cross-references of the form §§, but rather than displaying §§ linked to xxx, the whole construct is swallowed up with no output. BTW, § is U+00A7, so the high bit is set putting it outside the ASCII range.

Are all code points > U+007F handled in this manner?

BTW, I discovered these when pasting text from MS-Word. I've replaced each of these characters with ones that are accepted, but it took me a while to figure out why they "disappeared into the void".

Metadata

Metadata

Assignees

No one assigned

    Type

    No type

    Projects

    No projects

    Milestone

    No milestone

    Relationships

    None yet

    Development

    No branches or pull requests

    Issue actions