diff options
Diffstat (limited to 'ISSUES')
-rw-r--r-- | ISSUES/charset-sniffing.md | 3 |
1 files changed, 3 insertions, 0 deletions
diff --git a/ISSUES/charset-sniffing.md b/ISSUES/charset-sniffing.md new file mode 100644 index 0000000..4ed7031 --- /dev/null +++ b/ISSUES/charset-sniffing.md @@ -0,0 +1,3 @@ +# Optimize Charset Sniffing + +Almost all charsets are supersets of ASCII, so when sniffing the charset for files which don't specify the encoding in their MIMEtype I can treat all the preceding text as ASCII. Though I suppose for this trick to work on UTF16 or UTF32 I'd need to remove any 0 bytes. |