PV204: Disk encryption lab
March 9, 2022, Milan Broz

Introduction

Encryption can provide confidentiality of user data. It can be implemented on different layers, including applications, file systems, or storage devices.

Application-layer encryption examples are PGP or ZIP compression with a password. Encryption of files (inside the filesystem or through an independent layer like Linux fscrypt) provides a more generic solution. However, this solution encrypts data with a key per user, and some parts (like filesystem metadata) remain unencrypted.

Low-level storage (disk) encryption is called Full Disk Encryption (FDE). It is entirely transparent for the user (there is no need to choose what to encrypt; the whole disk is encrypted). The encrypted disk behaves the same as a disk without encryption.

The primary use of FDE is to provide data confidentiality in power-down mode (a stolen laptop does not leak user data). The major disadvantage is that everyone who knows the password can read the whole disk. Often we combine FDE with another encryption layer.

Once the disk is unlocked, the media encryption key remains in the system, usually directly in system RAM. Exercise II will show how easy it is to get this key from the system's memory image.

Another disadvantage of FDE is that it usually cannot guarantee data integrity. Encryption is fully transparent and length-preserving; the ciphertext and plaintext have the same size, so there is no space to store any integrity information. This allows attacks by direct modification of ciphertext.

FDE works on the sector level, the same as the block device. Atomic I/O access units for encrypted devices are sectors. Sectors are encrypted independently by the symmetric cipher. The cipher block size is (in most cases) smaller than the size of a sector (for example, a 16-byte AES block versus a 512-byte or 4096-byte sector). It means that inside the sector, we need to use an encryption mode (suitable for storage encryption).
It is crucial that the same data in different sectors produce different ciphertext. This is achieved by a per-sector tweak (IV, Initialization Vector) that is usually derived from the logical sector offset (sector number). A combination of a properly used IV and encryption mode is critical for the security of the FDE system.

Figure 1 shows an example of wrongly used encryption modes (here ECB, Electronic Codebook, and XTS with an incorrect constant tweak). Another problem could appear with CBC mode [CBC] and a predictable IV. The attacker can then exploit the predictability of the IV to create watermarks and detect them directly from the ciphertext. We will show this flaw on a legacy TrueCrypt device in Exercise III (an optional exercise).

Figure 1. An image encrypted in ECB mode and XTS mode without tweaked-sector encryption.

Today, the most used encryption mode for disks is XTS [XTS]. This mode is constructed in such a way that it can safely use predictable sector numbers directly as an IV. The schemes of the CBC and XTS encryption modes are provided for reference in the Appendix.

FDE can be implemented on the hardware layer (self-encrypting disk, encryption inside the controller chipset) or in software [FDE]. Software implementations are, for example, BitLocker for Windows systems, dm-crypt in Linux, FileVault on macOS, or VeraCrypt. There are many other tools implementing this functionality [COMP].

In the following exercises, we will use two software FDE tools: LUKS/dm-crypt [LUKS], which is the primary solution for Linux-based systems, and VeraCrypt [VC]. Both tools can create a virtual encrypted block device within a partition or a regular file. This virtual disk can be formatted as if it were a physical device and mounted. VeraCrypt also supports trivial steganography and allows creating a hidden virtual volume within another volume.
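
The two mode problems described in this introduction (repeating blocks in ECB, and watermarks in CBC with a predictable IV) can be illustrated with a minimal Python sketch. It is only an illustration under stated assumptions: the block cipher is a toy Feistel construction built from SHA-256 (a stand-in for AES, not secure), the IV is assumed to be simply the sector number, and all function names are hypothetical, not taken from any of the tools used in the lab.

```python
import hashlib

BLOCK = 16  # bytes per cipher block (the same block size as AES)


def xor(a: bytes, b: bytes) -> bytes:
    """Byte-wise XOR of two equally long byte strings."""
    return bytes(x ^ y for x, y in zip(a, b))


def toy_encrypt_block(key: bytes, block: bytes) -> bytes:
    """Toy 4-round Feistel permutation standing in for AES. NOT secure."""
    left, right = block[:8], block[8:]
    for rnd in range(4):
        f = hashlib.sha256(key + bytes([rnd]) + right).digest()[:8]
        left, right = right, xor(left, f)
    return left + right


def ecb_encrypt(key: bytes, data: bytes) -> bytes:
    """ECB: every block is encrypted independently, with no IV or tweak."""
    return b"".join(
        toy_encrypt_block(key, data[i:i + BLOCK])
        for i in range(0, len(data), BLOCK)
    )


def cbc_encrypt(key: bytes, iv: bytes, data: bytes) -> bytes:
    """CBC: each plaintext block is XORed with the previous ciphertext block."""
    out, prev = [], iv
    for i in range(0, len(data), BLOCK):
        prev = toy_encrypt_block(key, xor(prev, data[i:i + BLOCK]))
        out.append(prev)
    return b"".join(out)


def plain_iv(sector: int) -> bytes:
    """Assumed predictable IV: the sector number itself, padded to one block."""
    return sector.to_bytes(BLOCK, "little")


key = b"k" * 16
sector_data = b"A" * 32  # two identical plaintext blocks

# ECB: identical plaintext blocks give identical ciphertext blocks.
ecb_ct = ecb_encrypt(key, sector_data)
ecb_leaks = ecb_ct[:BLOCK] == ecb_ct[BLOCK:]

# CBC with a per-sector IV: the same data in two sectors encrypts differently.
cbc_same_for_plain_data = (
    cbc_encrypt(key, plain_iv(7), sector_data)
    == cbc_encrypt(key, plain_iv(8), sector_data)
)

# Watermark: the attacker knows the IVs, so the plaintext can cancel them out.
target = b"W" * BLOCK
c7 = cbc_encrypt(key, plain_iv(7), xor(target, plain_iv(7)))
c8 = cbc_encrypt(key, plain_iv(8), xor(target, plain_iv(8)))
watermark_visible = c7 == c8

print(ecb_leaks, cbc_same_for_plain_data, watermark_visible)
```

In the watermark case, the crafted plaintext is never encrypted under a known key by the attacker; it only needs to be stored on the disk, and the equal ciphertext blocks in the two sectors are then visible without any key.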
References

[FDE] Disk encryption, https://en.wikipedia.org/wiki/Disk_encryption
[COMP] Comparison of disk encryption software, https://en.wikipedia.org/wiki/Comparison_of_disk_encryption_software
[CBC] Cipher Block Chaining mode, https://en.wikipedia.org/wiki/Block_cipher_mode_of_operation
[XTS] XTS-AES Mode for Confidentiality on Storage Devices, https://csrc.nist.gov/publications/detail/sp/800-38e/final
[LUKS] Linux Unified Key Setup / cryptsetup, https://gitlab.com/cryptsetup/cryptsetup/
[VC] VeraCrypt (former TrueCrypt), Free open-source disk encryption software, https://www.veracrypt.fr

Exercises

We will use the prepared Virtual Machine PV204 (slightly modified Debian Linux). The VM can be run in Oracle VirtualBox. The virtual machine has preinstalled LUKS disk encryption of the whole system, cryptsetup 2.x (it can open TrueCrypt and VeraCrypt devices as well), TrueCrypt 7.1a and VeraCrypt 1.24.

User name: pv204
Password (also unlocking password for disk): pv204

Note: User pv204 can run superuser commands via sudo without entering a password, e.g. sudo ls /root.

Exercise I: Hardware keylogger use

"If you let your machine out of your sight, it’s no longer your machine."

In the context of disk encryption, a specialized device can be used to reveal entered passphrases (BIOS password, disk encryption unlocking passphrase).

A hardware keylogger is a device that intercepts data entered through the keyboard and stores it internally for later analysis. The device can also transmit data using a wireless network or another hidden channel. If properly masked (e.g., in the keyboard controller), it is very hard for the user to detect because the device is completely transparent to the BIOS and operating system (and its power consumption is minimal).

Figure 1. Example keylogger module

We will use a simple example of a commercially available USB keylogger module.
The older version provides simulated flash-drive access for data retrieval; the new one can be accessed through a WiFi network (the keylogger module acts as the WiFi access point in our example). There is a limited amount of memory for the internal log, but the extended version provides an SD card interface for log storage (enough storage for almost the whole physical life of the keyboard).

Log retrieval is activated by an activation shortcut, which switches the keylogger into USB drive emulation mode and provides direct access to the log storage (file). (It can also send UDP packets to process data in an external application.) The new version can be monitored in real time through the internal web server accessed through WiFi.

Figure 2. Log retrieval from the module

Your task

1. Connect the provided USB keyboard with the keylogger module to a USB port.
2. Check that the system recognizes the keyboard (the same as if it were connected directly).
3. Simulate some work where you need to enter sensitive data, e.g. create a new TrueCrypt encrypted device, boot the testing virtual machine and log in, or try to enter some password in a browser.
4. For the older, flash-drive keylogger (see the description on the sticker):
   1. Switch to the host system file explorer and press the activation shortcut: the 'K' + 'B' + 'S' keys together. The keyboard should detach and you should see a new virtual USB flash disk.
   2. Investigate the "LOG.txt" file on the disk and compare it with your work in step 3.
   3. If you want, delete "LOG.txt" so your colleagues or lecturer will not see your "sensitive" data :-)
5. For the new, WiFi-capable keyloggers:
   1. Keep the device connected to the USB port.
   2. Connect your device (phone or laptop) to the WiFi network KLOG1 or KLOG2 (see the sticker on the keylogger). NOTE: the network is open, (intentionally) not encrypted!
   3. Open a browser at http://192.168.4.1 (in recent browsers you will need to force a non-HTTPS connection).
   4. Investigate the log; you can still type on the connected keyboard and refresh the page.
   5. If you want, erase the log on the settings page so your colleagues or lecturer will not see your "sensitive" data :-)

Please, DO NOT CHANGE WIFI SETTINGS or ACTIVATION KEYS!

Exercise II: Obtaining encryption key from VM memory image

For most software-based disk encryption applications, the encryption runs on the CPU, usually inside the OS context (optionally with HW acceleration like AES-NI instructions). In this operating mode, the encryption key is present in RAM during operation.

An attacker can obtain an image of physical memory and try to search for the encryption key. This image can be obtained either through software (we will use the VirtualBox debugger function) or through hardware (cold boot attack, i.e., recovering memory content after a forced reboot or short power-down, or using FireWire hardware debug functions). The obtained key can be used directly for storage decryption (without the password).

The next problem is to identify candidate keys in the memory image. This can be done by recognizing internal OS structures in memory or by a more generic approach, like reconstructing AES keys by identifying specific round keys [COLD].

Your Task

1. Copy the provided image and prepare the Virtual Machine with some active and mounted encrypted devices, and understand the storage stack inside. You should have two encrypted and active mappings for this exercise in the VM. One is the system encryption itself (LUKS/dm-crypt) and the second will be a VeraCrypt device created from the disk image testdisk.img.
a) Start and log in to the VM.
b) Run the VeraCrypt GUI (on the desktop) and create a new image testdisk.img (please use AES only as the encryption algorithm).
c) Mount the image through the VeraCrypt GUI and investigate the parameters used (Volume Properties). Keep it mounted to investigate the key in memory later.
d) Investigate and understand the storage stack, dump parameters about encrypted virtual devices.
Use the lsblk command to get the whole picture:

In this example, sda5 is your encrypted system partition, loop0 is the VeraCrypt (or TrueCrypt) device mapped to testdisk.img.

2. Check the VeraCrypt device. Note that the header is encrypted, so you cannot get information using blkid. Use the cryptsetup TrueCrypt/VeraCrypt extension to get more info about the device's on-disk header:

$ sudo blkid testdisk.img
$ sudo cryptsetup tcryptDump testdisk.img --veracrypt --dump-master-key

Note: for older systems you have to add the --veracrypt option to the cryptsetup command (otherwise cryptsetup will not recognize the VeraCrypt format). Recent versions use it as the default option.

Now, try to display encryption keys directly from the kernel (the superuser can display the full device-mapper dm-crypt mapping table for active devices).

e) Repeat the same exercise for the LUKS device (/dev/sda5 mapped to sda5_crypt), just use the luksDump command. Note that LUKS has a visible on-disk header.

3. While the VM is still running, obtain a snapshot of the memory core image. You can use the Vbox_save_memcore.bat script, which uses the internal VirtualBox debugger to dump the VM memory core (it is in fact an ELF format dump, but it is enough for our use).
4. Analyze the memory core image with the provided aeskeyfind [AESKEYFIND] program. Note that this program searches for specific AES structures. (In reality, the key is in memory multiple times but in different formats; just think about the screen buffer with the text dumps executed in step 1.)
5. Compare possible encryption keys with the information you obtained in step 1. See the XTS encryption mode definition [XTS] and think about why you see two separate AES keys for one device.
6. Now, try to answer some questions:
   What can the other keys in the dump be?
   Why do some keys repeat in the memory dump?
   The XTS keys are swapped, why?
   Why does VeraCrypt not encrypt keys in memory (as in Windows)? Does it help?
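
The comparison in step 5 can be made less error-prone with a small helper. The following Python sketch is only an illustration: the function names are hypothetical, the master key is assumed to be copied as a hex string from the dumps in step 1, and the candidate keys are assumed to be available as hex strings, one per key. It relies on the fact that an XTS key is the concatenation of two AES keys of equal length.

```python
def split_xts_key(master_key_hex: str) -> tuple:
    """Split a hex-encoded XTS key into its two equally long AES keys."""
    key = bytes.fromhex(master_key_hex)  # whitespace in the input is skipped
    if len(key) not in (32, 64):  # XTS-AES-128 or XTS-AES-256
        raise ValueError("unexpected XTS key length: %d bytes" % len(key))
    half = len(key) // 2
    return key[:half].hex(), key[half:].hex()


def match_candidates(master_key_hex: str, candidates: list) -> dict:
    """Report which half of the XTS key appears among the candidate keys."""
    normalized = {c.strip().lower() for c in candidates}
    first, second = split_xts_key(master_key_hex)
    return {
        "first_half": first in normalized,
        "second_half": second in normalized,
    }


# Made-up example values, not real keys from the lab VM.
master = "00" * 32 + "ff" * 32
found = ["FF" * 32, "00" * 32, "ab" * 32]
report = match_candidates(master, found)
print(report)
```

The helper deliberately ignores the order in which the candidate keys were found, so it does not answer the question about swapped keys; that still requires looking at the order in the dump.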
References

[COLD] Lest We Remember: Cold Boot Attacks on Encryption Keys, https://citp.princeton.edu/research/memory/
[AESKEYFIND] Fixed and extended AESkeyfind source code (gcc optimization bug patch), https://github.com/mbroz/aeskeyfind

Optional Exercise III: Revealing TrueCrypt hidden disk existence (CBC)

The goal of this exercise is to show that the old CBC mode in TrueCrypt is vulnerable to a watermarking attack and that such an attack could be used to reveal the existence of a hidden disk just from ciphertext analysis.

Theory

TrueCrypt (in versions before 4.1) used CBC mode with an Initialization Vector calculated from the sector number and XORed with a secret key KIV. There is also additional whitening, which is calculated using several CRC32 operations over a secret key KW XORed with the sector number. The final whitening value is then applied to the whole plaintext sector with a plain XOR function.

Unfortunately, neither operation is secure (the IV is still partially predictable because for consecutive sectors we know which bits will change, and the whitening is just a linear transformation; CRC32 is not cryptographically secure). The figure shows a simplified attack with P (plaintext block) and C (ciphertext block). For more info see [TC4].

This attack allows constructing a special plaintext which, if aligned to the proper sectors, propagates a special pattern (watermark) into the ciphertext. Because the filesystem aligns the start of files to a sector offset, we can just use a specially formatted file in the hidden disk and then search for the watermark on the ciphertext device. As the exercise will show, this special file can be a text file, so forcing a user to store such a file in the hidden disk is not too complicated (imagine mail or a picture in the browser cache).

Your Task

For the task, use TrueCrypt installed in the Virtual Machine (VeraCrypt no longer supports CBC mode). The exercise files are already copied to /home/task3 inside the VM.

1. Use the provided TrueCrypt container in /home/task3/tc_hidden_cbc.tc and try to open it in TrueCrypt. The password to the outer (decoy) volume is "password", the password to the hidden volume is "hidden".
2. Using the provided program /home/task3/create_file (which implements the attack above), generate a special file and copy it to the hidden TrueCrypt disk.
3. Dismount the TrueCrypt device and run /home/task3/detect_file over the TrueCrypt container image. It should find the pattern in the special file and print the offset of this pattern.
4. See the create_file and detect_file source code (/home/task3/src) to better understand the internal operation.

References

[TC4] sci-crypt, TrueCrypt 4.0 Out [see example source code, message seems to be deleted...], https://groups.google.com/forum/#!topic/sci.crypt/3DxOChZ0lrQ[1-25-false]