commit | 544ad2be674423238c47650d2c8588ba7dfc9ed2 | |
---|---|---|
author | Marco Nelissen <marcone@google.com> | Wed Nov 13 14:18:21 2013 -0800 |
committer | Marco Nelissen <marcone@google.com> | Wed Dec 11 10:28:44 2013 -0800 |
tree | 1167d14b0e345cd7ad6c2a415a7134c915507b86 | |
parent | 35a9e7d49ff5ff99fedb0bf2a35c39f7ea5a2f9e |

Better character set encoding detection

ID3 tags are supposed to be ISO-8859-1 or Unicode, but often aren't.
To better detect the real encoding we now use ICU to detect possible
encodings for a given byte sequence, then apply additional heuristics
to determine the most likely one.

b/5564857

Change-Id: I53bc83b006433da5c2f2ccfcd770ddb3a26b64d0
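
The two-stage idea in the message (collect candidate encodings, then pick the most likely one with heuristics) can be sketched roughly as follows. This is not the code from this commit and it does not call ICU: trial decoding with Python's built-in codecs stands in for ICU's detector. The candidate list `FALLBACKS`, the scoring function `plausibility`, and the function `guess_encoding` are all hypothetical names chosen for illustration.

```python
import unicodedata

# Hypothetical candidate list. The commit obtains its candidates from ICU;
# here we simply try a fixed set of legacy encodings in priority order.
FALLBACKS = ["shift_jis", "gbk", "euc_kr", "iso-8859-1"]


def plausibility(text: str) -> int:
    """Illustrative heuristic: reward letters, penalise control and
    unassigned characters, which usually signal a wrong decoding."""
    score = 0
    for ch in text:
        category = unicodedata.category(ch)
        if category.startswith("C"):    # control, private use, unassigned
            score -= 5
        elif category.startswith("L"):  # any kind of letter
            score += 1
    return score


def guess_encoding(data: bytes) -> str:
    """Return the name of the most plausible encoding for a tag's bytes."""
    # Heuristic 1: pure 7-bit data needs no detection at all.
    if all(b < 0x80 for b in data):
        return "ascii"

    # Heuristic 2: non-ASCII bytes that form valid UTF-8 are rarely accidental.
    try:
        data.decode("utf-8")
        return "utf-8"
    except UnicodeDecodeError:
        pass

    # Stage 1: keep every fallback that decodes without error.
    # ISO-8859-1 accepts any byte sequence, so this list is never empty.
    decoded = []
    for name in FALLBACKS:
        try:
            decoded.append((name, data.decode(name)))
        except UnicodeDecodeError:
            continue

    # Stage 2: pick the candidate whose text looks most like real text.
    # max() returns the first best entry, so ties go to the earlier fallback.
    best_name, _ = max(decoded, key=lambda pair: plausibility(pair[1]))
    return best_name
```

For example, `guess_encoding("Björk".encode("utf-8"))` takes the UTF-8 shortcut, while bytes that are not valid UTF-8 fall through to the scored fallbacks. A real detector would also weigh the confidence values reported by the detection library rather than relying on character categories alone.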