proxmox-backup

Author	SHA1	Message	Date
Fabian Grünbichler	efcac39d34	gc: remove duplicate variable list_images already returns absolute paths, we don't need to prepend anything. Signed-off-by: Fabian Grünbichler <f.gruenbichler@proxmox.com>	2020-12-01 06:06:51 +01:00
Fabian Grünbichler	cb4b721cb0	gc: log index files found outside of expected scheme for safety reason, GC finds and marks all index files below the datastore base path. as a result of regular operations, only index files within the expected scheme of <TYPE>/<ID>/<TIMESTAMP> should exist. add a small check + warning if the index list contains index files out side of this expected scheme, so that an admin with shell access can investigate. Signed-off-by: Fabian Grünbichler <f.gruenbichler@proxmox.com>	2020-12-01 06:06:17 +01:00
Fabian Grünbichler	7956877f14	gc: shorten progress messages we have messages starting the phases anyway, and limit the number of progress updates so that context remains available at all times. Signed-off-by: Fabian Grünbichler <f.gruenbichler@proxmox.com>	2020-12-01 06:04:13 +01:00
Stefan Reiter	fd19256470	gc: treat .bad files like regular chunks Simplify the phase 2 code by treating .bad files just like regular chunks, with the exception of stat logging. To facilitate, we need to touch .bad files in phase 1. We only do this under the condition that 1) the original chunk is missing (as before), and 2) the original chunk is still referenced somewhere (since the code lives in the error handler for a failed chunk touch, it only gets called for chunks we expect to be there, i.e. ones that are referenced). Untouched they will then be cleaned up after 24 hours (or after the last longer-running task finishes). Reason 2) is also a fix for .bad files not being cleaned up at all if the original is no longer referenced anywhere (e.g. a user deleting all snapshots after seeing some corrupt chunks appear). cond_touch_path is introduced to touch arbitrary paths in the chunk store with the same logic as touching chunks. Signed-off-by: Stefan Reiter <s.reiter@proxmox.com>	2020-11-18 14:04:49 +01:00
Thomas Lamprecht	788d82d9b7	gc: mark_used_chunks: reduce implementation noise try do reduce some unecessary lines, make match arms more precise so one can faster see what's actually happening. Also, avoid > return Err(format_err!(...)) stuff, just use bail!() Signed-off-by: Thomas Lamprecht <t.lamprecht@proxmox.com>	2020-11-02 21:08:38 +01:00
Dominik Csapak	2f0b92352d	garbage collect: improve index error messages so that in case of a broken index file, the user knows which it is Signed-off-by: Dominik Csapak <d.csapak@proxmox.com>	2020-11-02 20:08:50 +01:00
Fabian Grünbichler	e6dc35acb8	replace Userid with Authid in most generic places. this is accompanied by a change in RpcEnvironment to purposefully break existing call sites. Signed-off-by: Fabian Grünbichler <f.gruenbichler@proxmox.com>	2020-10-29 15:11:39 +01:00
Thomas Lamprecht	b6563f48ad	GC: improve task logs Make it more clear that removed files are chunks (not indexes or something like that, user cannot know that we do not touch them here) Signed-off-by: Thomas Lamprecht <t.lamprecht@proxmox.com>	2020-10-29 14:47:39 +01:00
Thomas Lamprecht	932390bd46	GC: fix logging leftover bad chunks fixes commit `b4fb262335`, which copied over the "Removed bad files:" block, but only adapted the log text, not the actual variable. Signed-off-by: Thomas Lamprecht <t.lamprecht@proxmox.com>	2020-10-29 14:40:29 +01:00
Dietmar Maurer	d6373f3525	garbage_collection: log deduplication factor	2020-10-29 11:13:01 +01:00
Dietmar Maurer	b4fb262335	garbage_collection: log bad chunks (still_bad value)	2020-10-29 10:24:31 +01:00
Dominik Csapak	b683fd589c	backup/datastore: save garbage collection status to disk and load it again when opening it this way we can persist the status of the last garbage collect across daemon reloads and reboots Signed-off-by: Dominik Csapak <d.csapak@proxmox.com>	2020-10-27 17:41:30 +01:00
Stefan Reiter	0698f78df5	fix #2988 : allow verification after finishing a snapshot To cater to the paranoid, a new datastore-wide setting "verify-new" is introduced. When set, a verify job will be spawned right after a new backup is added to the store (only verifying the added snapshot). Signed-off-by: Stefan Reiter <s.reiter@proxmox.com>	2020-10-20 10:51:13 +02:00
Fabian Grünbichler	115d927c15	unbreak build and silence warning. Signed-off-by: Fabian Grünbichler <f.gruenbichler@proxmox.com>	2020-10-20 09:07:32 +02:00
Stefan Reiter	df729017b4	datastore: cleanup open and load config only once Force consumers to use the lookup_datastore method instead of potentially opening a datastore twice, and pass the config we have already loaded into open_with_path, removing the need for open(1). Signed-off-by: Stefan Reiter <s.reiter@proxmox.com>	2020-10-20 07:51:05 +02:00
Stefan Reiter	1a374fcfd6	datastore: add manifest locking Avoid races when updating manifest data by flocking a lock file. update_manifest is used to ensure updates always happen with the lock held. Snapshot deletion also acquires the lock, so it cannot interfere with an outstanding manifest write. Signed-off-by: Stefan Reiter <s.reiter@proxmox.com>	2020-10-16 09:34:12 +02:00
Dietmar Maurer	e07620028d	mark_used_chunks: simply ignore vanished files In case a prune operation removed a file in the meantime.	2020-10-16 08:10:46 +02:00
Stefan Reiter	4c0ae82e23	datastore: remove individual snapshots before group Removing a snapshot has some more safety checks which we don't want to ignore when removing an entire group (i.e. locking the manifest and notifying GC). Signed-off-by: Stefan Reiter <s.reiter@proxmox.com>	2020-10-15 07:51:09 +02:00
Stefan Reiter	883aa6d5a4	datastore: remove load_manifest_json There's no point in having that as a seperate method, just parse the thing into a struct and write it back out correctly. Also makes further changes to the method simpler. Signed-off-by: Stefan Reiter <s.reiter@proxmox.com>	2020-10-15 07:19:32 +02:00
Stefan Reiter	238a872d1f	reader: acquire shared flock on open snapshot ...to avoid it being forgotten or pruned while in use. Update lock error message for deletions to be consistent. Signed-off-by: Stefan Reiter <s.reiter@proxmox.com>	2020-10-15 07:09:34 +02:00
Wolfgang Bumiller	8db1468952	more clippy fixups Signed-off-by: Wolfgang Bumiller <w.bumiller@proxmox.com>	2020-10-14 13:58:35 +02:00
Wolfgang Bumiller	f6b1d1cc66	don't require WorkerTask in backup/ To untangle the server code from the actual backup implementation. It would be ideal if the whole backup/ dir could become its own crate with minimal dependencies, certainly without depending on the actual api server. That would then also be used more easily to create forensic tools for all the data file types we have in the backup repositories. Signed-off-by: Wolfgang Bumiller <w.bumiller@proxmox.com>	2020-10-12 14:11:57 +02:00
Thomas Lamprecht	823867f5b7	datastore: gc: avoid unsafe call into libc, use epoch_i64 helper Signed-off-by: Thomas Lamprecht <t.lamprecht@proxmox.com>	2020-10-01 12:38:38 +02:00
Thomas Lamprecht	c6772c92b8	datastore: gc: comment exclusive process lock Signed-off-by: Thomas Lamprecht <t.lamprecht@proxmox.com>	2020-10-01 12:38:04 +02:00
Dietmar Maurer	ba37f3562d	src/backup/datastore.rs - open_with_path: use Path instead of str	2020-09-19 10:01:57 +02:00
Dietmar Maurer	fce4659388	src/backup/datastore.rs: new method open_with_path To make testing easier.	2020-09-19 09:55:21 +02:00
Dietmar Maurer	6a7be83efe	avoid chrono dependency, depend on proxmox 0.3.8 - remove chrono dependency - depend on proxmox 0.3.8 - remove epoch_now, epoch_now_u64 and epoch_now_f64 - remove tm_editor (moved to proxmox crate) - use new helpers from proxmox 0.3.8 * epoch_i64 and epoch_f64 * parse_rfc3339 * epoch_to_rfc3339_utc * strftime_local - BackupDir changes: * store epoch and rfc3339 string instead of DateTime * backup_time_to_string now return a Result * remove unnecessary TryFrom<(BackupGroup, i64)> for BackupDir - DynamicIndexHeader: change ctime to i64 - FixedIndexHeader: change ctime to i64	2020-09-15 07:12:57 +02:00
Stefan Reiter	a9767cf7de	gc: remove .bad files on garbage collect The iterator of get_chunk_iterator is extended with a third parameter indicating whether the current file is a chunk (false) or a .bad file (true). Count their sizes to the total of removed bytes, since it also frees disk space. .bad files are only deleted if the corresponding chunk exists, i.e. has been rewritten. Otherwise we might delete data only marked bad because of transient errors. While at it, also clean up and use nix::unistd::unlinkat instead of unsafe libc calls. Signed-off-by: Stefan Reiter <s.reiter@proxmox.com>	2020-09-08 12:43:13 +02:00
Dietmar Maurer	8317873c06	gc: improve percentage done logs	2020-09-02 10:04:18 +02:00
Thomas Lamprecht	49a92084a9	gc: use human readable units for summary and avoid the "percentage done: X %" phrase Signed-off-by: Thomas Lamprecht <t.lamprecht@proxmox.com>	2020-08-27 16:06:35 +02:00
Thomas Lamprecht	1ffe030123	various typo fixes Signed-off-by: Thomas Lamprecht <t.lamprecht@proxmox.com>	2020-08-25 18:52:31 +02:00
Stefan Reiter	f23f75433f	backup: flock snapshot on backup start An flock on the snapshot dir itself is used in addition to the group dir lock. The lock is used to avoid races with forget and prune, while having more granularity than the group lock (i.e. the group lock is necessary to prevent more than one backup per group, but the snapshot lock still allows backups unrelated to the currently running to be forgotten/pruned). Signed-off-by: Stefan Reiter <s.reiter@proxmox.com>	2020-08-11 11:02:21 +02:00
Stefan Reiter	6d6b4e72d3	datastore: prevent in-use deletion with locks instead of heuristic Attempt to lock the backup directory to be deleted, if it works keep the lock until the deletion is complete. This way we ensure that no other locking operation (e.g. using a snapshot as base for another backup) can happen concurrently. Signed-off-by: Stefan Reiter <s.reiter@proxmox.com>	2020-08-11 11:00:29 +02:00
Dietmar Maurer	e434258592	src/backup/backup_info.rs: remove BackupGroup lock() Signed-off-by: Stefan Reiter <s.reiter@proxmox.com>	2020-08-11 10:58:35 +02:00
Fabian Grünbichler	9a38fa29c2	verify: also check chunk CryptMode and in-line verify_stored_chunk to avoid double-loading each chunk. Signed-off-by: Fabian Grünbichler <f.gruenbichler@proxmox.com>	2020-08-11 09:56:20 +02:00
Wolfgang Bumiller	e7cb4dc50d	introduce Username, Realm and Userid api types and begin splitting up types.rs as it has grown quite large already Signed-off-by: Wolfgang Bumiller <w.bumiller@proxmox.com>	2020-08-10 12:05:01 +02:00
Oguz Bektas	2f57a433b1	fix #2909 : handle missing chunks gracefully in garbage collection instead of bailing and stopping the entire GC process, warn about the missing chunks and continue. this results in "TASK WARNINGS: X" as the status. Signed-off-by: Oguz Bektas <o.bektas@proxmox.com>	2020-08-06 06:36:48 +02:00
Aaron Lauterer	d3d566f7bd	GC: use time pre phase1 to calculate min_atime in phase2 Used chunks are marked in phase1 of the garbage collection process by using the atime property. Each used chunk gets touched so that the atime gets updated (if older than 24h, see relatime). Should there ever be a situation in which the phase1 in the GC run needs a very long time to finish, it could happen that the grace period calculated in phase2 is not long enough and thus the marking of the chunks (atime) becomes invalid. This would result in the removal of needed chunks. Even though the likelyhood of this happening is very low, using the timestamp from right before phase1 is started, to calculate the grace period in phase2 should avoid this situation. Signed-off-by: Aaron Lauterer <a.lauterer@proxmox.com>	2020-08-04 10:19:05 +02:00
Fabian Grünbichler	8819d1f2f5	blobs: attempt to verify on decode when possible regular chunks are only decoded when their contents are accessed, in which case we need to have the key anyway and want to verify the digest. for blobs we need to verify beforehand, since their checksums are always calculated based on their raw content, and stored in the manifest. manifests are also stored as blobs, but don't have a digest in the traditional sense (they might have a signature covering parts of their contents, but that is verified already when loading the manifest). this commit does not cover pull/sync code which copies blobs and chunks as-is without decoding them. Signed-off-by: Fabian Grünbichler <f.gruenbichler@proxmox.com>	2020-08-04 07:27:56 +02:00
Dietmar Maurer	ff86ef00a7	cleanup: manifest is always CryptMode::None	2020-07-31 10:25:30 +02:00
Dietmar Maurer	e443902583	src/backup/datastore.rs: add helpers to load/store manifest We want this to modify the manifest "unprotected" data, for example to add upload statistics, notes, ...	2020-07-31 07:45:47 +02:00
Dietmar Maurer	1fc82c41f2	src/api2/backup.rs: aquire backup lock earlier in create_locked_backup_group()	2020-07-30 11:03:05 +02:00
Stefan Reiter	c9756b40d1	datastore: prevent deletion of snaps in use as "previous backup" To prevent a race with a background GC operation, do not allow deletion of backups who's index might currently be referenced as the "known chunk list" for successive backups. Otherwise the GC could delete chunks it thinks are no longer referenced, while at the same time telling the client that it doesn't need to upload said chunks because they already exist. Additionally, prevent deletion of whole backup groups, if there are snapshots contained that appear to be currently in-progress. This is currently unlikely to trigger, as that function is only used for sync jobs, but it's a useful safeguard either way. Deleting a single snapshot has a 'force' parameter, which is necessary to allow deleting incomplete snapshots on an aborted backup. Pruning also sets force=true to avoid the check, since it calculates which snapshots to keep on its own. To avoid code duplication, the is_finished method is factored out. Signed-off-by: Stefan Reiter <s.reiter@proxmox.com>	2020-07-30 08:26:01 +02:00
Dietmar Maurer	39f18b30b6	src/backup/data_blob.rs: new load_from_reader(), which verifies the CRC And make verify_crc private for now. We always call load_from_reader() to verify the CRC. Also add load_chunk() to datastore.rs (from chunk_store::read_chunk())	2020-07-28 10:23:16 +02:00
Thomas Lamprecht	c3b090ac8a	backup: list images: handle walkdir error, catch "lost+found" We support using an ext4 mountpoint directly as datastore and even do so ourself when creating one through the disk manage code. Such ext4 ountpoints have a lost+found directory which only root can traverse into. As the GC list images is done as backup:backup user walkdir gets an error. We cannot ignore just all permission errors, as they could lead to missing some backup indexes and thus possibly sweeping more chunks than desired. While normally that should not happen through our stack, we had already user report that they do rsyncs to move a datastore from old to new server and got the permission wrong. So for now be still very strict, only allow a "lost+found" directory as immediate child of the datastore base directory, nothing else. If deemed safe, this can always be made less strict. Possibly by filtering the known backup-types on the highest level first. Signed-off-by: Thomas Lamprecht <t.lamprecht@proxmox.com>	2020-07-22 16:01:55 +02:00
Thomas Lamprecht	c47e294ea7	datastore: fix typo Signed-off-by: Thomas Lamprecht <t.lamprecht@proxmox.com>	2020-07-22 15:04:14 +02:00
Wolfgang Bumiller	521a0acb2e	DataStore::load_manifest: also return CryptMode Signed-off-by: Wolfgang Bumiller <w.bumiller@proxmox.com>	2020-07-08 09:19:53 +02:00
Dietmar Maurer	60f9a6ea8f	src/backup/datastore.rs: add new helpers to load blobs and verify chunks	2020-06-24 06:58:14 +02:00
Dietmar Maurer	1610c45a86	src/client/pull.rs: also download client.log.blob	2020-05-30 14:51:33 +02:00
Dietmar Maurer	8545480a31	src/bin/proxmox-backup-proxy.rs: add simple task scheduler for garbage collection	2020-05-20 08:59:45 +02:00

1 2 3

136 Commits