Supported encodings
===================

Universal Encoding Detector currently supports over two dozen character
encodings.

-  ``Big5``, ``GB2312``/``GB18030``, ``EUC-TW``, ``HZ-GB-2312``, and
   ``ISO-2022-CN`` (Traditional and Simplified Chinese)
-  ``EUC-JP``, ``SHIFT_JIS``, and ``ISO-2022-JP`` (Japanese)
-  ``EUC-KR`` and ``ISO-2022-KR`` (Korean)
-  ``KOI8-R``, ``MacCyrillic``, ``IBM855``, ``IBM866``, ``ISO-8859-5``,
   and ``windows-1251`` (Russian)
-  ``ISO-8859-2`` and ``windows-1250`` (Hungarian)
-  ``ISO-8859-5`` and ``windows-1251`` (Bulgarian)
-  ``ISO-8859-1`` and ``windows-1252`` (Western European languages)
-  ``ISO-8859-7`` and ``windows-1253`` (Greek)
-  ``ISO-8859-8`` and ``windows-1255`` (Visual and Logical Hebrew)
-  ``TIS-620`` (Thai)
-  ``UTF-32`` BE, LE, 3412-ordered, or 2143-ordered (with a BOM)
-  ``UTF-16`` BE or LE (with a BOM)
-  ``UTF-8`` (with or without a BOM)
-  ASCII
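
The "with a BOM" entries above can be illustrated with a minimal sketch that
checks for a byte order mark at the start of the data. This is not the
detector's own code; the ``sniff_bom`` helper and the labels it returns are
hypothetical names chosen for this example, and only the Python standard
library is assumed.

```python
import codecs
from typing import Optional

# Four-byte signatures are listed before two-byte ones, because the
# UTF-32 LE BOM (FF FE 00 00) begins with the UTF-16 LE BOM (FF FE) and the
# 3412-ordered BOM (FE FF 00 00) begins with the UTF-16 BE BOM (FE FF).
_BOMS = [
    (codecs.BOM_UTF8, "UTF-8"),
    (codecs.BOM_UTF32_LE, "UTF-32 LE"),
    (codecs.BOM_UTF32_BE, "UTF-32 BE"),
    (b"\xfe\xff\x00\x00", "UTF-32 3412-ordered"),
    (b"\x00\x00\xff\xfe", "UTF-32 2143-ordered"),
    (codecs.BOM_UTF16_LE, "UTF-16 LE"),
    (codecs.BOM_UTF16_BE, "UTF-16 BE"),
]


def sniff_bom(data: bytes) -> Optional[str]:
    """Return a label for the BOM at the start of ``data``, or None."""
    for bom, label in _BOMS:
        if data.startswith(bom):
            return label
    return None
```

For example, ``sniff_bom(codecs.BOM_UTF16_LE + "hi".encode("utf-16-le"))``
returns ``"UTF-16 LE"``, while input without a BOM returns ``None`` and would
need statistical detection instead.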

.. warning::

    Due to inherent similarities between certain encodings, some encodings may
    be detected incorrectly. In my tests, the most problematic case was
    Hungarian text encoded as ``ISO-8859-2`` or ``windows-1250`` (encoded as
    one but reported as the other). Also, Greek text encoded as ``ISO-8859-7``
    was often mis-reported as ``ISO-8859-2``. Your mileage may vary.
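
One way to see why the cases in the warning are hard is to compare the raw
bytes directly. The sketch below uses only Python's built-in codecs and two
sample phrases chosen for this example; it says nothing about how the
detector scores candidates internally. It shows that a Hungarian phrase
produces the same bytes in ``ISO-8859-2`` and ``windows-1250``, and that
``ISO-8859-7`` Greek bytes also decode without error as ``ISO-8859-2``.

```python
# Sample phrases are illustrative assumptions, not test data from the docs.
hungarian = "árvíztűrő tükörfúrógép"

as_latin2 = hungarian.encode("iso-8859-2")
as_cp1250 = hungarian.encode("windows-1250")

# The accented Hungarian letters occupy the same byte values in both
# encodings, so the byte strings carry no signal that separates them.
same_bytes = as_latin2 == as_cp1250

greek = "Καλημέρα κόσμε"
greek_bytes = greek.encode("iso-8859-7")

# ISO-8859-2 assigns a character to every byte used here, so decoding
# succeeds and yields plausible-looking but wrong text.
misread = greek_bytes.decode("iso-8859-2")

print(same_bytes)
print(misread != greek)
```

Because neither case raises a decoding error, a detector has to rely on
character frequency statistics alone, which is where misreports can arise.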