Alternate dispatching blocks between threads rather than splitting the
data beforehand and then sending it to threads, in order to ensure that
single-threaded and multi-threaded runs chunk data at the same
locations. Without this change, the resulting op count and data section
of the COW will differ between --enable-threading and
--disable-threading at runtime, which is a result we don't want.
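A minimal sketch of the dispatch, with hypothetical names (not the real
worker classes): chunks are formed at fixed boundaries first and only
then handed out alternately.

    #include <cstddef>
    #include <cstdint>
    #include <vector>

    using Chunk = std::vector<uint8_t>;
    struct Worker {
        std::vector<Chunk> queue;
    };

    // Splitting the raw data across threads up front can put chunk
    // boundaries in different places than the single-threaded path;
    // alternating dispatch of pre-formed chunks keeps them identical.
    void DispatchAlternating(const std::vector<Chunk>& chunks,
                             std::vector<Worker>& workers) {
        for (size_t i = 0; i < chunks.size(); i++) {
            workers[i % workers.size()].queue.push_back(chunks[i]);
        }
    }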
Test: th
Change-Id: I3ed8add0552745a281fce2aa7f1d1d32eb547e63
Log the compression algorithm and compression factor used during OTA for easier debugging
Test: th
Change-Id: Ic50989d7e233983d6299163fc647eb739a0b7cb2
Since variable block size compression covers multiple blocks per op and
there is no longer a 1:1 mapping between ops and blocks, update this
check in EmitBlocks to use the actual number of compressed blocks
written.
Since single-threaded, multi-threaded, and no-compression modes invoke
different code paths, ensure that the blocks written are still equal to
blocks.size(). Add two test cases to cover these situations.
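A rough sketch of the invariant (hypothetical types; the real check is
in EmitBlocks): count the blocks covered by the emitted ops rather than
the ops themselves.

    #include <android-base/logging.h>

    #include <cstddef>
    #include <vector>

    struct CompressedOp {
        size_t num_blocks;  // blocks covered by this op
    };

    // One op may now cover several blocks, so the invariant is on the
    // number of blocks written, not the number of ops emitted.
    void CheckAllBlocksWritten(const std::vector<CompressedOp>& ops,
                               size_t expected_blocks) {
        size_t blocks_written = 0;
        for (const auto& op : ops) {
            blocks_written += op.num_blocks;
        }
        CHECK_EQ(blocks_written, expected_blocks);
    }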
Test: th
Change-Id: If81eccf74333292a114268862dde0fe49681ef35
1: Move to v3 COW writer
2: Enable variable block size. Default compression set to lz4
with compression factor 64KiB
3: Prepare the merge sequence so that the device can initiate the merge
4: Verify the merge order
Bug: 319309466
Test: On Pixel 6
This was tested on live builds where the actual builds/testing
is done on CI.
Patch-Create+Apply = Create the snapshot patches between two
builds and apply them to the device
Branch(main)          Patch-Creation+Apply    Snapshot-size
============================================================
Build-1 -> Build-2    14 seconds              160MB
Build-2 -> Build-3    21 seconds              331MB
Build-3 -> Build-4    30 seconds              375MB
Build X -> Build X    3 seconds               8MB
Change-Id: I96437032de029d89de62ba11fe37d9287b0a4071
Signed-off-by: Akilesh Kailash <akailash@google.com>
In newer versions of libc++, std::char_traits<T> is no longer defined
for non-character types, and as a result, std::basic_string<T> and
std::basic_string_view<T> are also no longer defined for non-character
types. See
https://discourse.llvm.org/t/deprecating-std-string-t-for-non-character-t/66779.
Replace them with std::vector<T> and std::span<const T>.
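For example (illustrative only), a buffer of 32-bit words changes
roughly like this:

    #include <cstdint>
    #include <span>
    #include <vector>

    // Before: relied on std::char_traits<uint32_t>, which newer libc++
    // no longer provides.
    // std::basic_string<uint32_t> words;
    // std::basic_string_view<uint32_t> view(words);

    // After: plain containers/views with no char_traits requirement.
    std::vector<uint32_t> words;
    std::span<const uint32_t> view(words);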
Bug: 175635923
Test: m MODULES-IN-system-core-fs_mgr
Test: /data/nativetest64/cow_api_test/cow_api_test
Change-Id: Ife2e87833ced43ff24e5765998cb6993e4f9b4c0
The flow of the I/O path is as follows (a condensed sketch follows the
list):
1: When there is a I/O request for a given sector, we first
check the in-memory COW operation mapping for that sector.
2: If the mapping of sector to COW operation is found, then the
existing I/O path will work seamlessly. Even if the COW operation
encodes multiple blocks, we will discard the remaining data.
3: If the mapping of sector to COW operation is not found:
a: Find the previous COW operation, as the vector is sorted by sector.
b: If the previous COW operation is a REPLACE op:
i: Check if the current sector is encoded in the previous COW
operation's compressed block.
ii: If the sector falls within the range of compressed blocks,
retrieve the block offset.
iii: De-compress the COW operation based on the compression
factor.
iv: memcpy the data based on the block offset.
v: cache the COW operation pointer as subsequent I/O requests
are sequential and can just be a memcpy at the correct offset.
c: If the previous COW operation is not a REPLACE op or if the
requested sector does not fall within the compression factor
of the previous COW operation, then fallback and read the data
from base device.
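A condensed sketch of the step-3 lookup (types and helpers are
hypothetical, not the actual snapuserd sources):

    #include <algorithm>
    #include <cstdint>
    #include <vector>

    constexpr uint64_t kBlockSize = 4096;
    constexpr uint64_t kSectorSize = 512;

    struct CowOp {
        uint64_t new_sector;  // first sector this op covers
        uint64_t num_blocks;  // blocks encoded (compression factor / 4k)
        bool is_replace;
    };

    // 'ops' is sorted by new_sector. Returns the op covering 'sector',
    // or nullptr if the read falls back to the base device (step 3.c).
    const CowOp* FindCoveringOp(const std::vector<CowOp>& ops, uint64_t sector) {
        auto it = std::upper_bound(
                ops.begin(), ops.end(), sector,
                [](uint64_t s, const CowOp& op) { return s < op.new_sector; });
        if (it == ops.begin()) return nullptr;
        const CowOp& prev = *std::prev(it);  // step 3.a
        uint64_t covered = prev.num_blocks * (kBlockSize / kSectorSize);
        if (prev.is_replace && sector < prev.new_sector + covered) {
            // Steps 3.b.i-ii: the caller then decompresses the op and
            // memcpys from offset (sector - prev.new_sector) * kSectorSize.
            return &prev;
        }
        return nullptr;
    }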
Snapshot-merge:
During merge of REPLACE ops, read the entire op in one shot, de-compress
multiple blocks and write all the blocks in one shot.
Performance:
go/variable-block-vabc-perf covers detailed performance runs
on Pixel 6 for full and incremental OTA.
Bug: 319309466
Test: snapuserd_test covers all the I/O path with various block sizes.
About 252 cases with all combinations and tunables.
[==========] 252 tests from 4 test suites ran. (702565 ms total)
[ PASSED ] 252 tests.
On Pixel 6:
=======================================
COW Writer V3:
for i in full, incremental OTA
for j in 4k, 16k, 32k, 64k, 128k, 256k
for k in lz4, zstd, gz
install OTA, reboot, verify merge
=======================================
COW Writer V2:
for i in full, incremental OTA
for j in 4k
for k in lz4, zstd, gz
install OTA, reboot, verify merge
=====================================
Change-Id: I4c3b5c3efa0d09677568b4396cc53db0e74e7c99
Signed-off-by: Akilesh Kailash <akailash@google.com>
This patch adds compression support for bigger block sizes.
3 bits [57-59] in the COW operation "source_info_" field are used to
store the compression factor. Supported compression factors are powers
of 2, viz: 4k, 8k, 16k, 32k, 64k, 128k, 256k.
Only REPLACE operations will have the bigger block size support for now.
This can be extended to other operations later.
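A sketch of how a power-of-two factor fits in three bits, assuming it
is stored as the shift relative to the 4k block size (the exact
encoding is defined by the COW format headers):

    #include <cstdint>

    constexpr uint64_t kCompressionFactorShift = 57;
    constexpr uint64_t kCompressionFactorMask = 0x7;  // 3 bits

    // Assumption for illustration: bits [57-59] hold
    // log2(compression_factor / 4k), so 4k..256k maps to 0..6.
    uint64_t EncodeFactor(uint64_t source_info, uint64_t factor_bytes) {
        uint64_t shift = __builtin_ctzll(factor_bytes / 4096);  // 0..6
        return source_info |
               ((shift & kCompressionFactorMask) << kCompressionFactorShift);
    }

    uint64_t DecodeFactorBytes(uint64_t source_info) {
        uint64_t shift =
                (source_info >> kCompressionFactorShift) & kCompressionFactorMask;
        return 4096ull << shift;
    }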
The write path in EmitBlocks() has the core logic, wherein consecutive
sequences of REPLACE ops are compressed based on the compression factor
settings. Thus, for a 64k compression factor, there will be just one COW
operation which encodes all 16 blocks, and the entire 64k region is
compressed in one shot.
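A simplified sketch of that grouping (hypothetical types; the real
logic lives in EmitBlocks):

    #include <algorithm>
    #include <cstddef>
    #include <cstdint>
    #include <vector>

    struct Chunk {
        uint64_t first_block;
        size_t num_blocks;
    };

    // Group consecutive 4k blocks into chunks of compression_factor
    // bytes; a 64k factor turns 16 blocks into one REPLACE op whose
    // payload is compressed in one shot. Assumes factor >= block size.
    std::vector<Chunk> GroupBlocks(uint64_t first_block, size_t num_blocks,
                                   size_t compression_factor,
                                   size_t block_size = 4096) {
        const size_t blocks_per_chunk = compression_factor / block_size;
        std::vector<Chunk> chunks;
        for (size_t i = 0; i < num_blocks; i += blocks_per_chunk) {
            chunks.push_back(
                    {first_block + i, std::min(blocks_per_chunk, num_blocks - i)});
        }
        return chunks;
    }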
NOTE: There is no read I/O path support in this patch. A subsequent
patch will add read support.
Performance data (with read I/O path support in subsequent patch):
go/variable-block-vabc-perf covers detailed performance runs
on Pixel 6 for full and incremental OTA.
TL;DR:
Performance of a full OTA (All numbers are compared against 4k block
size)
=======================================
Snapshot-size:
~10-11% decrease in snapshot-size (disk-space) for zstd with 256k block
size.
~8% decrease in snapshot-size (disk-space) for lz4
Install time:
~13% decrease in OTA install time for zstd with 256k block size.
Snapshot-merge:
~50% decrease in snapshot-merge time with 256k block size for zstd
Post OTA boot-time:
~10.5% decrease in boot time for 64k block size for zstd
In-memory footprint for COW operations:
~80% decrease in memory footprint for 256k block size. (58MB -> 9.2MB)
============================================
For more improvements, further tuning of zstd/lz4 is required,
primarily the compression levels, the zstd compression window, and the
performance of gz at different compression levels.
Bug: 319309466
Test: cow_api test covering all the supported block sizes for v3 writer.
On Pixel 6:
=======================================
COW Writer V3:
for OTA in full, incremental OTA
for block_size in 4k, 16k, 32k, 64k, 128k, 256k
for compression_algo in lz4, zstd, gz, none
install OTA, reboot, verify merge
=======================================
COW Writer V2:
for OTA in full, incremental OTA
for block_size in 4k
for compression_algo in lz4, zstd, gz, none
install OTA, reboot, verify merge
=====================================
Change-Id: I96201f1609582aa9d44d8085852e284b0c4a426d
Signed-off-by: Akilesh Kailash <akailash@google.com>
Intermediate CL needed before variable block size can land. Since v3 is
enabled on cuttlefish, the base build needs to write the
compression_factor in order for the reader to parse it properly.
Otherwise we'll fail the OTA test.
Test: th
Change-Id: Ia353aae8e668858851073f09308909ae70d7854e
In the case that op_count_max is read in as zero, we should use the
upper bound of max blocks as the estimate. One case in which this can
happen is when a v2 cow estimator is used; we should still be able to
run an OTA if we upper-bound our ops buffer size estimate.
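A sketch of the fallback (names illustrative):

    #include <cstdint>

    // When the estimator (e.g. a v2 cow estimator) leaves op_count_max
    // at zero, fall back to the upper bound of one op per block in the
    // partition so the ops buffer is never undersized.
    uint64_t EffectiveOpCountMax(uint64_t op_count_max, uint64_t partition_size,
                                 uint64_t block_size = 4096) {
        if (op_count_max == 0) {
            return partition_size / block_size;  // max blocks upper bound
        }
        return op_count_max;
    }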
Test: th
Change-Id: I97ca66368d6631bf43c8911ed66f99c9e8096e2d
Parse the manifest compression_factor and set CowOptions appropriately.
This allows the v3 writer to use the compression factor during OTA.
Also update some comments about supported compression algorithms.
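Roughly, on the snapshot side (a sketch only; field names are
approximated from this change, not quoted from the actual CowOptions
definition):

    #include <cstdint>
    #include <string>

    struct CowOptions {
        uint64_t block_size = 4096;
        std::string compression;          // "lz4", "zstd", "gz", "none"
        uint64_t compression_factor = 0;  // bytes, e.g. 64 * 1024
    };

    CowOptions OptionsFromManifest(const std::string& algo, uint64_t factor) {
        CowOptions options;
        options.compression = algo;
        // Only meaningful for the v3 writer; v2 stays at one block per op.
        options.compression_factor = factor ? factor : options.block_size;
        return options;
    }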
Test: th
Change-Id: I88f254087e536d9e5925064f85317f0acce280ee
With variable block size compression, the number of ops written can no
longer be calculated as easily, since one op can now cover data that
previously required multiple ops. We can drop this check for XOR and Raw
blocks, as WriteOperation() already checks whether we are exceeding the
op_count_max limit.
We still need to keep this check for EmitZeroBlocks and EmitCopyBlocks,
since the number of operations is determined ahead of time in those
calls; without it, ops would be added to cached ops and the call would
return true even when they cannot be written.
With this change, v3 cow OTA now works on cuttlefish with support for
variable block size compression.
Test: th
Change-Id: Ia55f152f5deb67a9022d0feff112345e72741dd3
Changes to the structure of the v3 header + operation needed for
variable block size. Separating this CL from the variable block size one
so we can get v3 enabled on cuttlefish.
The op count type changes are so that the op count matches the type of
max_blocks. max_blocks is used when the op buffer size is not set; we
default to an upper bound of one operation per block in the partition.
Test: th
Bug: 307452468
Change-Id: I1a2581763a4fd6be5d5795f7e4781023e9984256
When doing an auto remount & reboot command while running as the SHELL
uid, just try to gain root on behalf of the user and retry the remount
command.
If gaining root fails, print a message telling the user to run
"adb root" and retry.
Bug: 322285923
Test: adb unroot && adb remount -vR
Test: adb unroot && adb shell remount -vR
Change-Id: If8e04dc602573c73178c108ef4944f0a985b590e
On devices without metadata encryption, we use loop devices rather than
device-mapper + dm-linear + FIEMAP. Devices without metadata encryption
should not exist, since libfiemap was introduced with Android R, which
requires metadata encryption.
It is possible to retrofit an Android Q device with Virtual A/B, which
is what Pixel 4 did. However those devices can only upgrade to
Android T, and they had metadata encryption anyway.
If there are any Android Q devices that retrofitted Virtual A/B in R,
didn't have metadata encryption, and need to upgrade all the way to V,
then we can recommend they make WrapUserdataIfNeeded() unconditional.
Bug: N/A
Test: fiemap_image_test, vts_libsnapshot_test
Change-Id: I7be0507527b967166676c8b136b8758f5e69ba6b
Right now we encode the per mountpoint scratch dir name like this:
/system -> /mnt/overlay/@system/
/product/app -> /mnt/overlay/@product@app/
This CL changes it to:
/system -> /mnt/overlay/system/
/product/app -> /mnt/overlay/product@app/
This makes the encoded scratch dir for top-level mountpoints (like
/system, /vendor) the same as it was before
https://r.android.com/2795755 was introduced.
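A sketch of the new encoding (hypothetical helper; the real one lives
in the overlayfs remount code):

    #include <string>

    // New scheme: strip the leading '/', then replace the remaining '/'
    // with '@'. "/system" -> "system", "/product/app" -> "product@app".
    // The old scheme also converted the leading '/', giving "@system"
    // and "@product@app".
    std::string EncodeScratchDir(const std::string& mount_point) {
        std::string encoded = mount_point;
        if (!encoded.empty() && encoded.front() == '/') encoded.erase(0, 1);
        for (char& c : encoded) {
            if (c == '/') c = '@';
        }
        return encoded;
    }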
With this change old first-stage-init can handle top-level remounts
correctly. However for mountpoints with '/' in them, their remount
scratch dirs would be encoded with the new format, and old
first-stage-init would ignore and not setup these during boot.
This lets the remount mechanism function partially when running on an
old ramdisk (first-stage-init) + new system combo.
Normally we expect the init_boot ramdisk to be upgraded alongside
system.img, so this change isn't strictly needed. However, there are
cases where we might want to develop new OS features on an old vendor
platform, hence this change.
Bug: 306124139
Bug: 243503963
Test: adb-remount-test
Change-Id: I9b43641bb338f11c6c83888880948e4b85af14e1
Some testcases assume that /dev/block/by-name/userdata is writable, but
mount_with_alternatives() will mark the block device as RO if the mount
flags include MS_RDONLY. Fix it by marking the block device as RW again.
Test: th
Bug: 319156415
Change-Id: Ic04acd4b6175d3f0aeea88675da44309e8df15e8
Right now we assume all RW mounts (minus /data & special FS) are
remounted by us and we apply the remount/overlayfs related checks
on them unconditionally. This would generate false positives when
a partition was RW but not remounted by us.
The test should instead check mounts that were remounted by us
(transitioned from RO to RW after adb-remount), and ignore
partitions that were already RW before running adb-remount.
Bug: 313609600
Test: adb-remount-test
Change-Id: I94e8a35775271f557790a458781657eb3b24a6f5
Assign CPUSET_SP_BACKGROUND taskprofile to snapshot merge threads.
This will ensure that the threads do not run on big cores.
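The profile is applied per merge thread, roughly like this (the exact
SetTaskProfiles overload is assumed; see libprocessgroup for the
current signatures):

    #include <unistd.h>

    #include <processgroup/processgroup.h>

    // Pin the calling merge thread to the background cpuset so it stays
    // off the big cores.
    void ApplyMergeThreadProfile() {
        if (!SetTaskProfiles(gettid(), {"CPUSET_SP_BACKGROUND"})) {
            // Non-fatal: the merge still runs, just without the cpuset hint.
        }
    }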
Additionally, reduce the flushing of data to 1MB after merging REPLACE ops.
No major regression observed in snapshot merge time.
On Pixel 6 for incremental OTA of 500M, snapshot merge time increased
from 72 seconds to 76 seconds after this patch.
Bug: 311233916
Test: Full and incremental OTA on Pixel 6 - Verify merge threads not on big cores
Change-Id: I455afdac0b77227869d846d0c4472ea9eb34c41c
Signed-off-by: Akilesh Kailash <akailash@google.com>
Some rw /proc/mounts entries are FUSE.
Also, add some diagnostics for failures.
Bug: 318962836
Test: vts_fs_test on Pixel
Change-Id: I85dec8b37f1a061b1eca597aba3887b598b699f5
Before this patch, DeleteDeviceWithTimeout was checking that the dev
node (i.e. /dev/block/dm-XX) is deleted after the call to DeleteDevice
API. Since ueventd first deletes the symlinks that correspond to a
device and only then deletes this device node, this assertion introduced
a race condition (DeleteDevice API waits for the symlink to be deleted).
This patch changes the DeleteDeviceWithTimeout test to check that the
unique path of the device has been deleted.
Bug: 318425605
Test: presubmit
Change-Id: I3fd9de507c75bcf6ac1350fa0b8adfdb5a2e89e8
According to aosp/1908136, the current flow is
1. factory reset formatted raw disk.
2. next boot tries to convert it to metadata encryption
2.a mount sda27
2.b umount sda27
2.c encrypt_inplace()
2.d fsck on dm-x
2.e mount dm-x
If there are file write operations between 2.a and 2.b, encryption
might fail. To mitigate this, change the mount in 2.a to read-only if we
know we are going to do encrypt_inplace.
Test: th
Bug: 313962438
Change-Id: I7f4bbd36e1e6c978dde84f5396ffb90bbbdcae87
Performance of COW v3 is now on par with v2 in both multi-threaded and
single-threaded configurations. Note that the v2 cow writer can cache up
to 1024 blocks in memory if multi-threaded compression is enabled (even
though the batch size is configured as 200). For a fair comparison,
benchmarks were run with a batch size of 256. For batch sizes of 256 or
greater, v2 and v3 have similar multi-threaded performance.
Test: th
Bug: 313962438
Change-Id: I377c8291689a7a038bb00b09d7371a155e6972e9
Related change: r.android.com/1110379
noatime reduces the wear and tear on the flash device.
Bug: 313609600
Test: abtd adb-remount-test
Change-Id: Ia42a064f297c25d3463a4ed9094a66236a6c5708
Adding a check here to ensure that next_data_pos_ hasn't been modified
since initialization. After sizing the sequence buffer, this value
should be the initial value + the size of the sequence buffer.
Test: cow_api_test
Change-Id: I9c79041b72544500989860a13ca6c25830d28750
Update snapshot.cpp to grab estimate_op_buffer_size &
estimate_sequence_buffer_size from update_engine. Update v3 writer to
use these options to size the buffers appropriately.
We probably don't need the fields for merge metrics yet, but will leave
them here for now.
Test: th
Bug: 313962438
Change-Id: I08252ff66174de9bafaf8dbe9115d9d049084c4c
Adding a cow size info struct, as the writer will now need to know the
op buffer size at the time of initialization. The sequence of events is
as follows (same as estimate_cow_size, but written down here for
clarity; see the sketch after the list):
1. ota_from_target_files does a dry run to determine the cow size +
   ops buffer size
2. The data is passed through the delta archive manifest
3. snapshot.cpp parses these fields and configures the CowOptions
   struct to pass to writer initialization
4. The cow is initialized with the correct sizing. Data is incrementally
   added at the end of the cow ops buffer (which is why we need to know
   the sizing ahead of time)
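A rough sketch of the struct being added and how the writer consumes it
(names approximated from the description, not quoted from the header):

    #include <cstdint>

    // Sizing info carried from the dry run (step 1) through the
    // manifest (step 2) into writer initialization (step 4).
    struct CowSizeInfo {
        uint64_t cow_size = 0;      // estimated total COW size
        uint64_t op_count_max = 0;  // sizes the ops buffer up front
    };

    // The writer reserves the ops buffer at initialization, because
    // data is appended right after it and cannot be moved later.
    uint64_t OpsBufferBytes(const CowSizeInfo& info, uint64_t op_size_bytes) {
        return info.op_count_max * op_size_bytes;
    }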
Test: ota
Change-Id: I950e5ef82c9bd7e9bd9603b0599c930767ee3f0d
libsnapshot test is run in an independent configuration from
kernel-presubmit. When run in kernel-presubmit, it fails because it
creates another daemon on top of the daemon that is already running from
first stage init.
Bug: 316040872
Test: N/A
Change-Id: Ie3381d6db35bb85fbb47326fa49938416d49f2b8
Signed-off-by: Edward Liaw <edliaw@google.com>
Currently the only ways to enable dm-verity are relying on the built-in
vbmeta image or including its public key in a standalone vbmeta image.
This change adds support for enabling dm-verity based on the hashtree
descriptor root digest of a standalone vbmeta image.
Bug: 285855436
Test: Presubmit
Test: adb shell /apex/com.android.virt/bin/vm run-microdroid --vendor /vendor/etc/avf/microdroid/microdroid_vendor.img
Change-Id: I51eb64cae2ca8b4e97f1c6419b35d45e6f51cacb
Performance of V3 COW writer is now on-par with V2 in both incremental
OTA and full OTA.
Test: th
Bug: 313962438
Change-Id: If56e0fe42367f947c513fc4c93119c3825763cb9
If the daemon is alive, detach it before explicitly terminating the
service.
Bug: 316876960
Test: treehugger presubmit tests
Change-Id: I94d9d1a0dab09a6b016f422c7497098abc86add8
Signed-off-by: Akilesh Kailash <akailash@google.com>
If sequence data is written and the number of ops reaches the maximum,
op data will corrupt the block data, because the location of the block
data is stale after writing sequence data. Fix by resetting the location
of the block data after EmitSequenceData().
Test: th
Bug: 313962438
Change-Id: Ib53b81772ba341cdf5c240baaee7c10725a365c3
This adds a new metadata header flag to the super partition. This flag
is set when "adb remount" is used, and is implicitly cleared when
flashing.
If there is a scratch partition present on /data, we require that the
flag be set in order to proceed using overlays. If not set, scratch is
not mapped in first-stage init, and scratch images are removed later
during startup.
Bug: 297923468
Test: adb remount -R, touch file in out/, sync, flashall
Change-Id: I9cc411a1632101b5fc043193b38db8ffb9c20e7f