summaryrefslogtreecommitdiff
path: root/ISSUES/charset-sniffing.md
diff options
context:
space:
mode:
authorAdrian Cochrane <adrian@openwork.nz>2021-11-11 16:52:12 +1300
committerAdrian Cochrane <adrian@openwork.nz>2021-11-11 16:52:12 +1300
commit7d31eda45dc9d615cbfa084a2149b17c5e313a85 (patch)
treed4b716a40d02a39c3a79f9761858b5bfedb505ca /ISSUES/charset-sniffing.md
parent3cb0d7d7bd278aab25146280dc53fac36531769b (diff)
downloadhurl-7d31eda45dc9d615cbfa084a2149b17c5e313a85.tar.gz
hurl-7d31eda45dc9d615cbfa084a2149b17c5e313a85.tar.bz2
hurl-7d31eda45dc9d615cbfa084a2149b17c5e313a85.zip
Note charset optimization.
Diffstat (limited to 'ISSUES/charset-sniffing.md')
-rw-r--r--ISSUES/charset-sniffing.md3
1 files changed, 3 insertions, 0 deletions
diff --git a/ISSUES/charset-sniffing.md b/ISSUES/charset-sniffing.md
new file mode 100644
index 0000000..4ed7031
--- /dev/null
+++ b/ISSUES/charset-sniffing.md
@@ -0,0 +1,3 @@
+# Optimize Charset Sniffing
+
+Almost all charsets are supersets of ASCII, so when sniffing the charset for files which don't specify the encoding in their MIMEtype I can treat all the preceding text as ASCII. Though I suppose for this trick to work on UTF16 or UTF32 I'd need to remove any 0 bytes.