utf-8 with invalid BOMs -- RHash (v1.4.4) #309

Description

@MCasteele

Hi,

While using a custom format, I noticed that every line/file after the first one has an invisible character sequence added. Looking at the output in a hex-capable editor, it appears to be a UTF-8 BOM (U+FEFF encoded in UTF-8), and it appears on every line of output except the very first. The UTF-8 documentation I have says that (1) a BOM is useless in UTF-8 (but allowed for compatibility reasons), and (2) it should appear only once, at the start of the stream/document, not on every line. That makes this especially odd, since RHash does not even print it on the very first line, only on the second line and after.

I attached sample files to this report: format.txt is the file with the format string,
log.txt is the runtime log (see the command below), and output.txt is the resulting output file, from the command: rhash -v --utf8 --output=output.txt --template=format.txt --log=log.txt *
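
For anyone who wants to confirm this without a hex editor, here is a minimal Python sketch that lists the byte offset of every UTF-8 BOM sequence (bytes EF BB BF) in a file. The helper names `find_bom_offsets` and `stray_bom_offsets` are made up for illustration and are not part of RHash; the only thing taken from this report is the file name output.txt. Any offset other than 0 is a BOM in the middle of the stream.

```python
from pathlib import Path

UTF8_BOM = b"\xef\xbb\xbf"  # U+FEFF encoded in UTF-8


def find_bom_offsets(data: bytes) -> list[int]:
    """Return the byte offset of every UTF-8 BOM sequence in data."""
    offsets = []
    pos = data.find(UTF8_BOM)
    while pos != -1:
        offsets.append(pos)
        # Continue searching just past the BOM that was found.
        pos = data.find(UTF8_BOM, pos + len(UTF8_BOM))
    return offsets


def stray_bom_offsets(data: bytes) -> list[int]:
    """Return offsets of BOMs that are not at the very start of the stream."""
    return [off for off in find_bom_offsets(data) if off != 0]


if __name__ == "__main__":
    # Assumption: output.txt is the file written by --output=output.txt above.
    path = Path("output.txt")
    if path.exists():
        data = path.read_bytes()
        print("BOM offsets:", find_bom_offsets(data))
        print("stray BOMs :", stray_bom_offsets(data))
```

With the behaviour described above, I would expect the first list to contain no offset 0 (no BOM on the first line) and one entry per later line.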

I have rhash version:

rhash --version
RHash v1.4.4
