summaryrefslogtreecommitdiff
path: root/docs
diff options
context:
space:
mode:
authorCasey Duncan <casey.duncan@gmail.com>2010-12-02 23:13:16 -0700
committerCasey Duncan <casey.duncan@gmail.com>2010-12-02 23:13:16 -0700
commit150a32920d364fdd2de0f4005dd5fb995cba7692 (patch)
tree61bcf42e23e755ae952bb40c3dc802380386e124 /docs
parentf5da86dfe5f4edfa443182bc6f6009e44f001d9e (diff)
downloadpyramid-150a32920d364fdd2de0f4005dd5fb995cba7692.tar.gz
pyramid-150a32920d364fdd2de0f4005dd5fb995cba7692.tar.bz2
pyramid-150a32920d364fdd2de0f4005dd5fb995cba7692.zip
add note explaining the difference between Unicode and UTF-8
Diffstat (limited to 'docs')
-rw-r--r--docs/narr/views.rst11
1 files changed, 11 insertions, 0 deletions
diff --git a/docs/narr/views.rst b/docs/narr/views.rst
index 3cd4154fa..29adc5f14 100644
--- a/docs/narr/views.rst
+++ b/docs/narr/views.rst
@@ -1028,6 +1028,17 @@ This implicit decoding happens when view code obtains form field values
via the ``request.params``, ``request.GET``, or ``request.POST`` APIs
(see :ref:`request_module` for details about these APIs).
+.. note::
+ Many people find the difference between Unicode and UTF-8 confusing.
+ Unicode is a standard for representing text that supports most of
+ the world's writing systems. However, there are many ways that Unicode
+ data can be encoded into bytes for transmittal and storage. UTF-8
+ is a specific encoding for Unicode, that is backwards-compatible
+ with ASCII. This makes UTF-8 very convenient for encoding data
+ where a large subset of that data is ASCII characters, which is
+ largely true on the web. UTF-8 is also the standard character
+ encoding for URLs.
+
For example, let's assume that the following form page is served up to
a browser client, and its ``action`` points at some :app:`Pyramid`
view code: