Diffstat (limited to 'ISSUES')
-rw-r--r-- ISSUES/charset-sniffing.md | 3
1 files changed, 3 insertions, 0 deletions
diff --git a/ISSUES/charset-sniffing.md b/ISSUES/charset-sniffing.md
new file mode 100644
index 0000000..4ed7031
--- /dev/null
+++ b/ISSUES/charset-sniffing.md
@@ -0,0 +1,3 @@
+# Optimize Charset Sniffing
+
+Almost all charsets are supersets of ASCII, so when sniffing the charset of a file whose MIME type doesn't specify the encoding, I can treat all the text preceding the declaration as ASCII. For this trick to work on UTF-16 or UTF-32, though, I'd need to strip out any 0 bytes first.
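A minimal sketch of the idea in the note above, not the project's actual implementation. It assumes the declaration looks like an HTML-style `charset=...` attribute and that only the first kilobyte needs scanning; the function name `sniff_charset`, the regular expression, and the `limit` parameter are all hypothetical choices made for illustration.

```python
import re
from typing import Optional

# Assumed declaration shape: charset=NAME, optionally quoted (as in an HTML <meta> tag).
_CHARSET_RE = re.compile(rb"charset\s*=\s*[\"']?([A-Za-z0-9._:-]+)", re.IGNORECASE)


def sniff_charset(data: bytes, limit: int = 1024) -> Optional[str]:
    """Return the declared charset name in lowercase, or None if none is found."""
    head = data[:limit]
    # In UTF-16 and UTF-32, ASCII-range characters are padded with 0 bytes.
    # Dropping those bytes leaves plain ASCII that the byte pattern can match.
    head = head.replace(b"\x00", b"")
    match = _CHARSET_RE.search(head)
    if match is None:
        return None
    # The capture group only admits ASCII characters, so this decode cannot fail.
    return match.group(1).decode("ascii").lower()


print(sniff_charset(b'<meta charset="ISO-8859-1">'))
print(sniff_charset('<meta charset="utf-16">'.encode("utf-16-le")))
```

Matching on raw bytes avoids decoding the file before its encoding is known. Stripping 0 bytes can leave stray bytes behind when the prefix contains non-ASCII characters, but these are unlikely to resemble a declaration.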