The Unicode Consortium site has a FAQ regarding byte-order marks. I found some very useful info there, such as the surrogate pair ranges (D800-DBFF for the prefix, or high surrogate; DC00-DFFF for the suffix, or low surrogate), and the values to avoid in UTF-16 text (the noncharacters FDD0-FDEF, as well as unpaired surrogates).
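To make those ranges concrete, here is a minimal Python sketch of how a code point above U+FFFF maps to a surrogate pair and back, plus a check for the noncharacter block. The function names (`to_surrogates`, `from_surrogates`, `is_noncharacter`) are my own, not anything from the FAQ, and the noncharacter check also covers code points ending in FFFE or FFFF, which goes slightly beyond the range mentioned above.

```python
def to_surrogates(cp):
    """Split a supplementary code point (U+10000..U+10FFFF) into (high, low)."""
    if not 0x10000 <= cp <= 0x10FFFF:
        raise ValueError("only code points above U+FFFF need a surrogate pair")
    v = cp - 0x10000                 # 20-bit value
    high = 0xD800 + (v >> 10)        # top 10 bits  -> D800..DBFF
    low = 0xDC00 + (v & 0x3FF)       # low 10 bits  -> DC00..DFFF
    return high, low


def from_surrogates(high, low):
    """Combine a high/low surrogate pair back into a code point."""
    if not (0xD800 <= high <= 0xDBFF and 0xDC00 <= low <= 0xDFFF):
        raise ValueError("not a valid high/low surrogate pair")
    return 0x10000 + ((high - 0xD800) << 10) + (low - 0xDC00)


def is_noncharacter(cp):
    """True for FDD0..FDEF and for any code point ending in FFFE or FFFF."""
    return 0xFDD0 <= cp <= 0xFDEF or (cp & 0xFFFE) == 0xFFFE


if __name__ == "__main__":
    high, low = to_surrogates(0x1F600)
    print(f"U+1F600 -> {high:04X} {low:04X}")
    print(f"back    -> U+{from_surrogates(high, low):X}")
```

An unpaired surrogate is simply a value in D800-DFFF that `from_surrogates` would reject because its partner is missing or out of range.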
 