Administration Guide ==================== The administration guide. Terminology ----------- Backup Content ~~~~~~~~~~~~~~ When doing deduplication, there are different strategies to get optimal results in terms of performance and/or deduplication rates. Depending on the type of data, one can split data into fixed or variable sized chunks. Fixed sized chunking needs almost no CPU performance, and is used to backup virtual machine images. Variable sized chunking needs more CPU power, but is essential to get good deduplication rates for file archives. Therefore, the backup server supports both strategies. File Archives: ``.pxar`` ^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^ .. see https://moinakg.wordpress.com/2013/06/22/high-performance-content-defined-chunking/ A file archive stores a whole directory tree. Content is stored using the :ref:`pxar-format`, split into variable sized chunks. The format is specially optimized to achieve good deduplication rates. Image Archives: ``.img`` ^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^ This is used for virtual machine images and other large binary data. Content is split into fixed sized chunks. Binary Data (BLOBs) ^^^^^^^^^^^^^^^^^^^ This type is used to store smaller (< 16MB) binaries like configuration data. Larger files should be stored as image archive. .. caution:: Please do not store all files as BLOBs. Instead, use the file archive to store whole directory trees. Catalog File: ``catalog.pcat1`` ^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^ The catalog file is basically an index for file archive. It contains the list of files, and is used to speedup search operations. The Manifest: ``index.json`` ^^^^^^^^^^^^^^^^^^^^^^^^^^^^ The manifest contains the list of all backup files, including file sizes and checksums. It is used to verify the consistency of a backup. Backup Type ~~~~~~~~~~~ The backup server groups backups by *type*, where *type* is one of: ``vm`` This type is used for :term:`virtual machine`\ s. Typically contains the virtual machine configuration and an image archive for each disk. ``ct`` This type is used for :term:`container`\ s. Contains the container configuration and a single file archive for the container content. ``host`` This type is used for physical host, or if you want to run backups manually from inside virtual machines or containers. Such backups may contain file and image archives (no restrictions here). Backup ID ~~~~~~~~~ An unique ID. Usually the virtual machine or container ID. ``host`` type backups normally use the hostname. Backup Time ~~~~~~~~~~~ The time when the backup was made. Backup Group ~~~~~~~~~~~~ We call the tuple ``/`` a backup group. Such group may contains one or more backup snapshots. Backup Snapshot ~~~~~~~~~~~~~~~ We call the triplet ``//