proxmox-backup/src/backup.rs

//! This module implements the proxmox backup data storage
//!
//! Proxmox backup splits large files into chunks, and stores them
//! deduplicated using a content addressable storage format.
//!
//! A chunk is simply defined as binary blob, which is stored inside a
//! `ChunkStore`, addressed by the SHA256 digest of the binary blob.
//!
//! Index files are used to reconstruct the original file. They
//! basically contain a list of SHA256 checksums. The `DynamicIndex*`
//! format is able to deal with dynamic chunk sizes, whereas the
//! `FixedIndex*` format is an optimization to store a list of equal
//! sized chunks.
//!
//! # ChunkStore Locking
//!
//! We need to be able to restart the proxmox-backup service daemons,
//! so that we can update the software without rebooting the host. But
//! such restarts must not abort running backup jobs, so we need to
//! keep the old service running until those jobs are finished. This
//! implies that we need some kind of locking for the
//! ChunkStore. Please note that it is perfectly valid to have
//! multiple parallel ChunkStore writers, even when they write the
//! same chunk (because the chunk would have the same name and the
//! same data). The only real problem is garbage collection, because
//! we need to avoid deleting chunks which are still referenced.
//!
//! * Read Index Files:
//!
//!   Acquire shared lock for .idx files.
//!
//!
//! * Delete Index Files:
//!
//!   Acquire exclusive lock for .idx files. This makes sure that we do
//!   not delete index files while they are still in use.
//!
//!
//! * Create Index Files:
//!
//!   Acquire shared lock for ChunkStore (process wide).
//!
//!   Note: When creating .idx files, we create temporary (.tmp) file,
//!   then do an atomic rename ...
//!
//!
//! * Garbage Collect:
//!
//!   Acquire exclusive lock for ChunkStore (process wide). If we have
//!   already an shared lock for ChunkStore, try to updraged that
//!   lock.
//!
//!
//! * Server Restart
//!
//!   Try to abort running garbage collection to release exclusive
//!   ChunkStore lock asap. Start new service with existing listening
//!   socket.
//!
//!
//! # Garbage Collection (GC)
//!
//! Deleting backups is as easy as deleting the corresponding .idx
//! files. Unfortunately, this does not free up any storage, because
//! those files just contains references to chunks.
//!
//! To free up some storage, we run a garbage collection process at
//! regular intervals. The collector uses an mark and sweep
//! approach. In the first phase, it scans all .idx files to mark used
//! chunks. The second phase then removes all unmarked chunks from the
//! store.
//!
//! The above locking mechanism makes sure that we are the only
//! process running GC. But we still want to be able to create backups
//! during GC, so there may be multiple backup threads/tasks
//! running. Either started before GC started, or started while GC is
//! running.
//!
//! ## `atime` based GC
//!
//! The idea here is to mark chunks by updating the `atime` (access
//! timestamp) on the chunk file. This is quite simple and does not
//! need additional RAM.
//!
//! One minor problem is that recent Linux versions use the `relatime`
//! mount flag by default for performance reasons (yes, we want
//! that). When enabled, `atime` data is written to the disk only if
//! the file has been modified since the `atime` data was last updated
//! (`mtime`), or if the file was last accessed more than a certain
//! amount of time ago (by default 24h). So we may only delete chunks
//! with `atime` older than 24 hours.
//!
//! Another problem arise from running backups. The mark phase does
//! not find any chunks from those backups, because there is no .idx
//! file for them (created after the backup). Chunks created or
//! touched by those backups may have an `atime` as old as the start
//! time of those backup. Please not that the backup start time may
//! predate the GC start time. Se we may only delete chunk older than
//! the start time of those running backup jobs.
//!
//!
//! ## Store `marks` in RAM using a HASH
//!
//! Not sure if this is better. TODO

use anyhow::{bail, Error};

// Note: .pcat1 => Proxmox Catalog Format version 1
pub const CATALOG_NAME: &str = "catalog.pcat1.didx";

#[macro_export]
macro_rules! PROXMOX_BACKUP_PROTOCOL_ID_V1 {
    () =>  { "proxmox-backup-protocol-v1" }
}

#[macro_export]
macro_rules! PROXMOX_BACKUP_READER_PROTOCOL_ID_V1 {
    () =>  { "proxmox-backup-reader-protocol-v1" }
}

/// Unix system user used by proxmox-backup-proxy
pub const BACKUP_USER_NAME: &str = "backup";

/// Return User info for the 'backup' user (``getpwnam_r(3)``)
pub fn backup_user() -> Result<nix::unistd::User, Error> {
    match nix::unistd::User::from_name(BACKUP_USER_NAME)? {
        Some(user) => Ok(user),
        None => bail!("Unable to lookup backup user."),
    }
}

mod file_formats;
pub use file_formats::*;

mod manifest;
pub use manifest::*;

mod crypt_config;
pub use crypt_config::*;

mod key_derivation;
pub use key_derivation::*;

mod crypt_reader;
pub use crypt_reader::*;

mod crypt_writer;
pub use crypt_writer::*;

mod checksum_reader;
pub use checksum_reader::*;

mod checksum_writer;
pub use checksum_writer::*;

mod chunker;
pub use chunker::*;

mod data_blob;
pub use data_blob::*;

mod data_blob_reader;
pub use data_blob_reader::*;

mod data_blob_writer;
pub use data_blob_writer::*;

mod catalog;
pub use catalog::*;

mod chunk_stream;
pub use chunk_stream::*;

mod chunk_stat;
pub use chunk_stat::*;

mod read_chunk;
pub use read_chunk::*;

mod chunk_store;
pub use chunk_store::*;

mod index;
pub use index::*;

mod fixed_index;
pub use fixed_index::*;

mod dynamic_index;
pub use dynamic_index::*;

mod backup_info;
pub use backup_info::*;

mod prune;
pub use prune::*;

mod datastore;
pub use datastore::*;

mod catalog_shell;
pub use catalog_shell::*;
improve docs 2019-08-14 12:08:27 +00:00			`//! This module implements the proxmox backup data storage`
src/backup.rs - improve doc 2019-02-12 12:27:11 +00:00			`//!`
improve docs 2019-08-14 12:08:27 +00:00			`//! Proxmox backup splits large files into chunks, and stores them`
			`//! deduplicated using a content addressable storage format.`
src/backup.rs - improve doc 2019-02-12 12:27:11 +00:00			`//!`
improve docs 2019-08-14 12:08:27 +00:00			`//! A chunk is simply defined as binary blob, which is stored inside a`
			//! `ChunkStore`, addressed by the SHA256 digest of the binary blob.
			`//!`
			`//! Index files are used to reconstruct the original file. They`
			//! basically contain a list of SHA256 checksums. The `DynamicIndex*`
			`//! format is able to deal with dynamic chunk sizes, whereas the`
			//! `FixedIndex*` format is an optimization to store a list of equal
			`//! sized chunks.`
src/backup.rs: add documentation about ChunkStore locking 2019-03-22 09:14:50 +00:00			`//!`
			`//! # ChunkStore Locking`
			`//!`
			`//! We need to be able to restart the proxmox-backup service daemons,`
			`//! so that we can update the software without rebooting the host. But`
			`//! such restarts must not abort running backup jobs, so we need to`
			`//! keep the old service running until those jobs are finished. This`
src/backup.rs: start explaining different GC algorithm 2019-03-30 16:21:40 +00:00			`//! implies that we need some kind of locking for the`
src/backup.rs: add documentation about ChunkStore locking 2019-03-22 09:14:50 +00:00			`//! ChunkStore. Please note that it is perfectly valid to have`
			`//! multiple parallel ChunkStore writers, even when they write the`
			`//! same chunk (because the chunk would have the same name and the`
			`//! same data). The only real problem is garbage collection, because`
			`//! we need to avoid deleting chunks which are still referenced.`
			`//!`
			`//! * Read Index Files:`
			`//!`
			`//! Acquire shared lock for .idx files.`
			`//!`
			`//!`
			`//! * Delete Index Files:`
			`//!`
			`//! Acquire exclusive lock for .idx files. This makes sure that we do`
			`//! not delete index files while they are still in use.`
			`//!`
			`//!`
			`//! * Create Index Files:`
			`//!`
src/backup.rs: describe the garbage collection problem 2019-03-30 15:26:52 +00:00			`//! Acquire shared lock for ChunkStore (process wide).`
src/backup.rs: add documentation about ChunkStore locking 2019-03-22 09:14:50 +00:00			`//!`
src/backup.rs: start explaining different GC algorithm 2019-03-30 16:21:40 +00:00			`//! Note: When creating .idx files, we create temporary (.tmp) file,`
			`//! then do an atomic rename ...`
src/backup.rs: add documentation about ChunkStore locking 2019-03-22 09:14:50 +00:00			`//!`
			`//!`
			`//! * Garbage Collect:`
			`//!`
src/backup.rs: describe the garbage collection problem 2019-03-30 15:26:52 +00:00			`//! Acquire exclusive lock for ChunkStore (process wide). If we have`
			`//! already an shared lock for ChunkStore, try to updraged that`
			`//! lock.`
src/backup.rs: add documentation about ChunkStore locking 2019-03-22 09:14:50 +00:00			`//!`
			`//!`
			`//! * Server Restart`
			`//!`
			`//! Try to abort running garbage collection to release exclusive`
			`//! ChunkStore lock asap. Start new service with existing listening`
			`//! socket.`
			`//!`
src/backup.rs: describe the garbage collection problem 2019-03-30 15:26:52 +00:00			`//!`
src/backup.rs: start explaining different GC algorithm 2019-03-30 16:21:40 +00:00			`//! # Garbage Collection (GC)`
src/backup.rs: describe the garbage collection problem 2019-03-30 15:26:52 +00:00			`//!`
			`//! Deleting backups is as easy as deleting the corresponding .idx`
			`//! files. Unfortunately, this does not free up any storage, because`
			`//! those files just contains references to chunks.`
			`//!`
			`//! To free up some storage, we run a garbage collection process at`
			`//! regular intervals. The collector uses an mark and sweep`
src/backup.rs: improve GC problem description 2019-03-31 07:44:35 +00:00			`//! approach. In the first phase, it scans all .idx files to mark used`
			`//! chunks. The second phase then removes all unmarked chunks from the`
src/backup.rs: describe the garbage collection problem 2019-03-30 15:26:52 +00:00			`//! store.`
			`//!`
			`//! The above locking mechanism makes sure that we are the only`
src/backup.rs: start explaining different GC algorithm 2019-03-30 16:21:40 +00:00			`//! process running GC. But we still want to be able to create backups`
			`//! during GC, so there may be multiple backup threads/tasks`
			`//! running. Either started before GC started, or started while GC is`
			`//! running.`
src/backup.rs: describe the garbage collection problem 2019-03-30 15:26:52 +00:00			`//!`
src/backup.rs: start explaining different GC algorithm 2019-03-30 16:21:40 +00:00			//! ## `atime` based GC
src/backup.rs: describe the garbage collection problem 2019-03-30 15:26:52 +00:00			`//!`
src/backup.rs: start explaining different GC algorithm 2019-03-30 16:21:40 +00:00			//! The idea here is to mark chunks by updating the `atime` (access
			`//! timestamp) on the chunk file. This is quite simple and does not`
src/backup.rs: improve GC problem description 2019-03-31 07:44:35 +00:00			`//! need additional RAM.`
src/backup.rs: start explaining different GC algorithm 2019-03-30 16:21:40 +00:00			`//!`
			//! One minor problem is that recent Linux versions use the `relatime`
			`//! mount flag by default for performance reasons (yes, we want`
			//! that). When enabled, `atime` data is written to the disk only if
			//! the file has been modified since the `atime` data was last updated
			//! (`mtime`), or if the file was last accessed more than a certain
src/backup.rs: improve GC problem description 2019-03-31 07:44:35 +00:00			`//! amount of time ago (by default 24h). So we may only delete chunks`
			//! with `atime` older than 24 hours.
			`//!`
			`//! Another problem arise from running backups. The mark phase does`
			`//! not find any chunks from those backups, because there is no .idx`
			`//! file for them (created after the backup). Chunks created or`
			//! touched by those backups may have an `atime` as old as the start
			`//! time of those backup. Please not that the backup start time may`
			`//! predate the GC start time. Se we may only delete chunk older than`
			`//! the start time of those running backup jobs.`
src/backup.rs: start explaining different GC algorithm 2019-03-30 16:21:40 +00:00			`//!`
			`//!`
			//! ## Store `marks` in RAM using a HASH
			`//!`
			`//! Not sure if this is better. TODO`
create backup mod in backup.rs, improve docu 2018-12-31 15:08:04 +00:00
switch from failure to anyhow Signed-off-by: Wolfgang Bumiller <w.bumiller@proxmox.com> 2020-04-17 12:11:25 +00:00			`use anyhow::{bail, Error};`
remove tools::getpwnam_ugid, impl. crate::backup::backup_user() And use new nix::unistd::User struct. 2019-12-19 09:20:13 +00:00
change catalog format, use dynamic index to store catalog. In order to remove size restriction of a single blob. 2019-11-08 09:35:48 +00:00			`// Note: .pcat1 => Proxmox Catalog Format version 1`
			`pub const CATALOG_NAME: &str = "catalog.pcat1.didx";`
src/backup.rs: define INDEX_BLOB_NAME here 2019-09-03 11:15:44 +00:00
src/backup.rs; use a macro to define PROXMOX_BACKUP_PROTOCOL_ID_V1 So that we can include it in static doc strings. 2019-06-05 06:41:20 +00:00			`#[macro_export]`
			`macro_rules! PROXMOX_BACKUP_PROTOCOL_ID_V1 {`
			`() => { "proxmox-backup-protocol-v1" }`
			`}`
src/backup.rs: define const PROXMOX_BACKUP_PROTOCOL_ID_V1 2019-06-05 06:12:13 +00:00
src/api2/reader.rs: implement backup reader protocol 2019-06-27 07:01:41 +00:00			`#[macro_export]`
			`macro_rules! PROXMOX_BACKUP_READER_PROTOCOL_ID_V1 {`
			`() => { "proxmox-backup-reader-protocol-v1" }`
			`}`

remove tools::getpwnam_ugid, impl. crate::backup::backup_user() And use new nix::unistd::User struct. 2019-12-19 09:20:13 +00:00			`/// Unix system user used by proxmox-backup-proxy`
			`pub const BACKUP_USER_NAME: &str = "backup";`

			/// Return User info for the 'backup' user (``getpwnam_r(3)``)
			`pub fn backup_user() -> Result<nix::unistd::User, Error> {`
			`match nix::unistd::User::from_name(BACKUP_USER_NAME)? {`
			`Some(user) => Ok(user),`
			`None => bail!("Unable to lookup backup user."),`
			`}`
			`}`

src/backup/file_formats.rs: split out file format data 2019-06-22 07:12:25 +00:00			`mod file_formats;`
			`pub use file_formats::*;`
src/backup/*_index.rs: used generated magic numbers 2019-06-14 12:58:37 +00:00
src/backup/manifest.rs: new class to generate/parse index.json 2019-10-12 15:58:08 +00:00			`mod manifest;`
			`pub use manifest::*;`

renamed: src/backup/crypt_setup.rs -> src/backup/crypt_config.rs 2019-06-21 07:51:18 +00:00			`mod crypt_config;`
			`pub use crypt_config::*;`
src/backup/crypt_setup.rs: crypto helpers 2019-06-08 07:51:49 +00:00
src/backup/key_derivation.rs: move kdf code into separate file 2019-06-18 09:17:22 +00:00			`mod key_derivation;`
			`pub use key_derivation::*;`

src/backup/data_blob.rs: move parts into single files 2019-08-14 11:05:11 +00:00			`mod crypt_reader;`
			`pub use crypt_reader::*;`

			`mod crypt_writer;`
			`pub use crypt_writer::*;`

			`mod checksum_reader;`
			`pub use checksum_reader::*;`

			`mod checksum_writer;`
			`pub use checksum_writer::*;`

remove proxmox-protocol subcrate AFAICT we have no use for it anymore, its api entry points are gone. If we do end up needing something from it, it's still in the git history anyway. (And about two thirds of it can be made much less awkward by utilizing async-await anyway, so no love lost there...) Moved the chunker back into src/backup/chunker.rs Signed-off-by: Wolfgang Bumiller <w.bumiller@proxmox.com> 2019-08-22 12:03:43 +00:00			`mod chunker;`
			`pub use chunker::*;`

src/backup/data_blob.rs: new file format for binary blobs 2019-06-21 09:32:07 +00:00			`mod data_blob;`
			`pub use data_blob::*;`

src/backup/data_blob.rs: move parts into single files 2019-08-14 11:05:11 +00:00			`mod data_blob_reader;`
			`pub use data_blob_reader::*;`

			`mod data_blob_writer;`
			`pub use data_blob_writer::*;`

renamed: src/backup/catalog_blob.rs -> src/backup/catalog.rs 2019-11-08 09:41:00 +00:00			`mod catalog;`
			`pub use catalog::*;`
src/backup/catalog_blob.rs: moved catalog impl. from pxar And avoid loading catalog into memory. 2019-08-16 10:27:17 +00:00
src/backup/chunk_stream.rs: async chunk stream 2019-05-14 08:05:29 +00:00			`mod chunk_stream;`
			`pub use chunk_stream::*;`

src/backup/chunk_stat.rs: new struct to track chunk statistics 2019-02-25 11:52:10 +00:00			`mod chunk_stat;`
			`pub use chunk_stat::*;`

src/backup/read_chunk.rs: move read chunk trait into extra file And implement LocalChunkReader. 2019-07-02 06:22:29 +00:00			`mod read_chunk;`
			`pub use read_chunk::*;`

simplify backup lib structure (pub use xxx:*), improve doc 2019-02-12 13:13:31 +00:00			`mod chunk_store;`
			`pub use chunk_store::*;`

add IndexFile trait We want to be able to iterate through digests of index files, but without always having to distinguish between dynamic and fixed types, so add a trait we can use as a trait object. Unfortunately the iterator needs to yield copies as iterators cannot yield values with life times when represented as trait objects (Box<dyn Iterator<Item = ?>>) Signed-off-by: Wolfgang Bumiller <w.bumiller@proxmox.com> 2019-02-27 13:32:34 +00:00			`mod index;`
			`pub use index::*;`

simplify backup lib structure (pub use xxx:*), improve doc 2019-02-12 13:13:31 +00:00			`mod fixed_index;`
			`pub use fixed_index::*;`

			`mod dynamic_index;`
			`pub use dynamic_index::*;`

src/backup/backup_info.rs: move code into separate file Also changed create_backup_dir() parameters - uses &BackupDir now. 2019-03-05 06:18:12 +00:00			`mod backup_info;`
			`pub use backup_info::*;`

src/backup/prune.rs: moved prune related code into extra file 2019-12-06 07:05:40 +00:00			`mod prune;`
			`pub use prune::*;`

simplify backup lib structure (pub use xxx:*), improve doc 2019-02-12 13:13:31 +00:00			`mod datastore;`
			`pub use datastore::*;`
src/backup/catalog_shell.rs: impl shell to inspect and restore a snapshot via the catalog Signed-off-by: Christian Ebner <c.ebner@proxmox.com> 2019-11-21 11:47:51 +00:00
			`mod catalog_shell;`
			`pub use catalog_shell::*;`