Installation
============

This installs a package that can be used from Python (``import charset_normalizer``).

To install for all users on the system, administrator rights (root) may be required.

Using PIP
---------
Charset Normalizer can be installed with pip::

    pip install charset-normalizer

To also pull in the latest unicodedata backport, install the ``unicode_backport`` extra::

    pip install charset-normalizer[unicode_backport]

From git via master
-----------------------
You can install from the dev-master branch using git::

    git clone https://github.com/Ousret/charset_normalizer.git
    cd charset_normalizer/
    python setup.py install

Basic Usage
===========

The new way
-----------

You may want to get right to it. ::

    from charset_normalizer import from_bytes, from_path

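    # Placeholder payload for illustration; substitute your own bytes
    my_byte_str = 'Comment ça va ? Très bien, merci.'.encode('utf_8')

    # Print the payload once it has been decoded with the best-matching encoding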
    print(
        str(
            from_bytes(
                my_byte_str
            ).best()
        )
    )

    # The same works for a file on disk
    print(
        str(
            from_path(
                './data/sample.1.ar.srt'
            ).best()
        )
    )
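
When no plausible encoding is found, ``best()`` returns ``None``, so it can be worth checking the result before converting it to text. The following is a minimal sketch of that check; the helper name ``decode_best_guess`` and the sample sentence are illustrative assumptions, not part of the library.

```python
from typing import Optional

from charset_normalizer import from_bytes


def decode_best_guess(payload: bytes) -> Optional[str]:
    """Hypothetical helper: decode with the best match, or return None."""
    best_match = from_bytes(payload).best()
    if best_match is None:
        # No plausible encoding was found (binary data, for example)
        return None
    # str() on a match yields the payload decoded with the detected encoding
    return str(best_match)


# Illustrative sample; any non-ASCII text encoded to bytes would do
sample_text = 'Les naïfs ægithales hâtifs pondant à Noël où il gèle.'
decoded = decode_best_guess(sample_text.encode('utf_8'))
print(decoded)
```

Returning ``None`` instead of raising leaves the caller free to decide how to treat undecodable input.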


Backward compatibility
----------------------

If you are used to Python's chardet, Charset Normalizer provides the same ``detect()`` function.
It is mostly backward-compatible with chardet, so the migration should be painless.

::

    from charset_normalizer import detect

    # Returns a dict shaped like the one chardet returns
    result = detect(my_byte_str)

    if result['encoding'] is not None:
        print('got', result['encoding'], 'as detected encoding')
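
Because ``result['encoding']`` may be ``None``, code that goes on to decode the bytes usually needs a fallback. A minimal sketch, assuming a hypothetical helper ``decode_with_detect`` and a UTF-8 fallback chosen purely for illustration:

```python
from charset_normalizer import detect


def decode_with_detect(payload: bytes, fallback: str = 'utf_8') -> str:
    """Hypothetical helper: decode using detect(), with a fallback encoding."""
    result = detect(payload)
    # 'encoding' is None when detection fails, so fall back explicitly
    encoding = result['encoding'] or fallback
    # errors='replace' avoids an exception if the guess is slightly off
    return payload.decode(encoding, errors='replace')


# Illustrative sample; substitute your own bytes
sample_text = 'Les naïfs ægithales hâtifs pondant à Noël où il gèle.'
decoded_text = decode_with_detect(sample_text.encode('utf_8'))
print(decoded_text)
```

The ``errors='replace'`` choice trades strictness for robustness; use ``errors='strict'`` if a wrong guess should fail loudly.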


Upgrading existing code is straightforward:
find ``from chardet import detect`` and replace it with ``from charset_normalizer import detect``.