This issue is to evaluate and discuss whether the UTF-8 code should be ported over from bstrlib. That functionality was implemented in bstrlib after this project was forked from it.
Some time ago I did a brief experiment, but porting wasn't entirely trivial because the two projects have drifted apart since the fork.