Kernel Hardening - security-misc

raja · July 25, 2022, 9:50am

Looking into mce=0, I am not entirely sure what its connection to ECC memory is based on the kernel docs.

Regarding mce=off, I think that would be less secure as per the definition of a machine check:
https://www.kernel.org/doc/html/latest/x86/x86_64/machinecheck.html

“Machine checks report internal hardware error conditions detected by the CPU. Uncorrected errors typically cause a machine check (often with panic), corrected ones cause a machine check log entry.”

Therefore, maybe we should remove or comment out that command as I am not sure what it does. On the other hand, replacing it with mce=off may not be desirable.

Patrick · July 25, 2022, 6:06pm

I wonder where this mce=0 think is coming from.

web search term:

linux “mce” security hardening
linux “mce=off” ecc security
linux “mce=0” ecc security

Quote Tails - kernel hardening

mce=0

Mostly useful for systems with ECC memory, setting mce to 0 will cause the kernel to panic on any uncorrectable errors detected by the machine check exception system. Corrected errors will just be logged. The default is mce=1, which will SIGBUS on many uncorrected errors. Unfortunately this means malicious processes which try to exploit hardware bugginess (such as rowhammer) will be able to try over and over, suffering only a SIGBUS at failure. Setting mce=0 should have no impact. Any hardware which regularly triggers a memory-based MCE is unlikely to even boot, and the default is 1 only for long-lived servers.

https://www.kernel.org/doc/Documentation/x86/x86_64/machinecheck.rst

Patrick · July 25, 2022, 6:15pm

https://www.mcelog.org/

https://www.mcelog.org/faq.html

https://www.mcelog.org/references.html

web search terms:

site:kernel.org “mce=0”
site:kernel.org “mce=off”
site:kernel.org “nomce”

So I guess we want to check but once an exception is found whether correctable or uncorrectable, a kernel panic is preferred.

I haven’t found a very authoritative source (kernel) to me that setting mce=0 (which can no longer be found in kernel documentation), nomce or mce=off results in kernel panic if an exception is found or that exceptions are even still detected or that this setting is related to security.

raja · July 25, 2022, 6:34pm

This must be the reference.

Also I think setting nomce or mce=off stops the kernel from panicking if a machine check error is detected.

Therefore the existing command should either be removed or commented out. This way, uncorrected errors will cause a panic as desired.

Patrick · July 25, 2022, 7:16pm

Maybe there is a general kernel feature to always treat =0 as =off?

https://www.kernel.org/doc/html/v5.3/x86/x86_64/machinecheck.html

tolerant
Tolerance level. When a machine check exception occurs for a non corrected machine check the kernel can take different actions.

0: always panic on uncorrected errors, log corrected errors
1: panic or SIGBUS on uncorrected errors, log corrected errors
2: SIGBUS or log uncorrected errors, log corrected errors
3: never panic or SIGBUS, log all errors (for testing only)

As per that link close to the desired outcome to kernel panic on error would be setting tolerant (tolerance level) to 0.

(The kernel manual does not show any example. Might be chimerical settings. Might be setting mce.tolerant=0 is sysctl.)

This could be added to file:
/etc/sysctl.d/30_security-misc.conf

But that as per kernel manual is only for non-corrected errors. For the (maybe?) desired outcome of also kernel panicking as soon as a correctable error is detected I haven’t found a setting yet.

maybe: Not clear yet if changing MCE settings can increase security at all.

Patrick · August 16, 2022, 9:44am

Got this by e-mail.

Kernel panic is a software routine. It should be executed by the kernel
itself. The MCE handler decides whether the system should panic or not based on
the exception that happened. If you disable MCE and the aforementioned file is
the only place that is called upon identifying such exceptions, the panic will
not happen at all.

And by the way I agree with their concern regarding exposing log messages to
malicious processes. But I would expect them to refer to an study, blog post,
article, code example, etc, to show that how this concern can be valid in real
world.

Patrick · August 16, 2022, 9:45am

Therefore in absence of any authoritative recommendation to change any mce settings, it’s best we comment it out and don’t change anything related to mce until there is a better argument.

Patrick · August 22, 2022, 10:56am

Patrick · August 22, 2022, 10:56am

Merged, thanks!

Patrick · September 21, 2022, 5:26pm

HulaHoop · September 23, 2022, 11:46am

Tails enabled 3 boot time kernel options for hardening. Some are caused by changes to upstream security features, another affects TTY which I am not sure how it will impact our current config.

Patrick · September 23, 2022, 6:51pm

That was all previously done already.

Using.

Also done.

Using.

Thanks anyhow since we could be missing something in theory.

To verify, grepping the source code (or probably enough grepping the security-misc source code) for kernel parameters which should be used or not used would work (which I’ve just done but anyone else welcome to check this as well).

raja · September 27, 2022, 4:03pm

Awesome, great to see more eyes keeping track of potential hardening methods.

Patrick · October 18, 2022, 6:38pm

hidepid can be re-tested once based on Debian bookworm. Maybe pkexec based applications will be no longer broken by hidepid.

Patrick · November 10, 2022, 4:27am

raja · December 11, 2022, 10:11am

Anyone had any issues with this?

Patrick · December 18, 2022, 11:25am

Now merged. Thank you!

raja · January 8, 2023, 5:54am

Based on the recently merged:

We are now allowing kexec:

github.com

Kicksecure/security-misc/blob/master/etc/sysctl.d/30_security-misc.conf#L41


      
          net.core.bpf_jit_harden=2
          
          
## Quote https://www.kernel.org/doc/html/latest/admin-guide/sysctl/kernel.html
          ##
          ## kexec_load_disabled:
          ##
          ## A toggle indicating if the kexec_load syscall has been disabled. This value defaults to 0 (false: kexec_load enabled), but can be set to 1 (true: kexec_load disabled). Once true, kexec can no longer be used, and the toggle cannot be set back to false. This allows a kexec image to be loaded before disabling the syscall, allowing a system to set up (and later use) an image without it being altered. Generally used together with the "modules_disabled" sysctl.
          
          
## Disables kexec which can be used to replace the running kernel.
          kernel.kexec_load_disabled=1
          
          
## Hides kernel addresses in various files in /proc.
          ## Kernel addresses can be very useful in certain exploits.
          ##
          ## https://kernsec.org/wiki/index.php/Bug_Classes/Kernel_pointer_leak
          kernel.kptr_restrict=2
          
          
## Improves ASLR effectiveness for mmap.
          vm.mmap_rnd_bits=32
          vm.mmap_rnd_compat_bits=16

Are we sure that this is a good idea? Recall, kexec functionality can be abused to replace the existing running kernel and load a malicious kernel (gaining arbitrary code execution).

Sensible hardening guides [1, 2] and security distributions [3] always appear to have it strictly disabled by default.

On the plus side, while having cold boot attack defense is good. The real question is whether the tradeoff of having kexec allowed is worth it knowing the potential downsides?

Patrick · January 8, 2023, 12:34pm

To use kexec, root is required. But if an attack has root access, then it’s game over anyhow.

In a threat model with untrusted root ( Multiple Boot Modes for Better Security: an Implementation of Untrusted Root ) that might be more important.

You could argue that the scope of security-misc should be limited and that cold boot attack defense should be a separate package instead. Then security-misc could disable kexec and cold boot attack defense could re-enable it.

That would allow cold boot attack defense to only be enabled on host operating systems. It would get installed on Kicksecure host operating systems by default but could be omitted in VMs (as it seems it’s not useful there except for development and testing).

raja · January 9, 2023, 6:28am

That makes sense.

Are there any statistics as to the portion of users that install/download Kicksecure to be used on bare metal vs VMs?

Based on my anecdotal evidence most people run it on VMs, though admittedly its a very very small sample size.