mask more /proc/cpuinfo output in KVM

phabricator-migrator · February 19, 2024, 5:47pm

Information

ID: 449
PHID: PHID-TASK-4ihcgqr5ulhwv32v263n
Author: Patrick
Status at Migration Time: resolved
Priority at Migration Time: Normal

Description

Currently lots of information from inside a compromised workstation (or fancy application reading and reporting it somewhere for whatever statistic purpose) can be read:

Seems like CPU features can be reduced:
https://www.berrange.com/posts/2010/02/15/guest-cpu-model-configuration-in-libvirt-with-qemukvm/

Add new ‘kvm’ domain feature and ability to hide KVM signature:
https://www.redhat.com/archives/libvir-list/2014-August/msg00744.html

Maybe more can be masked such as model and clock frequency.

As I understand, these features have been added to ease CPU migration in heterogeneous CPU environments. We can reuse these features to hide more hardware identifiers.

Needs research if there would be a performance penalty or something else would speak against this.

Comments

Patrick

2015-12-07 17:27:26 UTC

Some features exposed may even pose security issues. Whonix KVM’s cat /proc/cpuinfo, flags includes tsc, which might allow the VM to access the host CPU’s Time Stamp Counter (tsc). (Matters because of Clock Correlation Attacks.)

After reading libvirt: Domain XML format I suggest to start experimenting with something like the following (untested):
  <cpu>
  <cpu match='minimum'>
  <model>486</model>
  </cpu>
For CPU models, see also:
/usr/share/libvirt/cpu_map.xml

If we can afford it performance wise [and otherwise, depends for what it might break] we should only whitelist / allow required, “secure” CPU features, that we understand at least on their very superficial level.

HulaHoop

2015-12-07 22:50:12 UTC

HulaHoop

2015-12-07 23:09:49 UTC

HulaHoop

2015-12-07 23:21:56 UTC

Patrick

2015-12-07 23:31:30 UTC

! In T449#7518, @HulaHoop wrote:
I experimented with the cpu masking of individual features on different machines to test portability and reached the conclusion that it will cause a support nightmare.

Blocking some features like tsc_deadline_timer and constant_tsc will cause the vm to fail on other hardware that doesn’t have it.

Aren’t there settings to make this lenient? libvirt documentation implies there are.

feature require is obviously bad.

But why should feature disable “The feature will not be supported by virtual CPU.” be bad? (As long as the VM operating system starts and runs fine.)

Also cpu match,

exact and strict sounds bad.

But cpu match minimum seems lenient. (Make little to no cpu flags a requirement.)

Also model fallback allow seems lenient.

Blocking some features like tsc_deadline_timer and constant_tsc will cause the vm to fail on other hardware that doesn’t have it.

Isn’t this a contradiction? How can you block something that does not exist? If the cpu feature is unwanted in the VM, then the hypervisor should be fine with the cpu feature not existing on the host, right?

Even if all this was to work, an attacker in the VM can have a pretty good idea what cpu they are sitting on if they carry out some benchmarks.

I also worry about [future] applications, not compromised ones, leaking CPU info, that are as fancy as webrtc (which leaks local IP addresses).

! In T449#7519, @HulaHoop wrote:
What threat model are we considering for hiding the KVM signature?

I found that link only worth further research, because it discussed disabling cpu features.

Patrick

2015-12-07 23:46:59 UTC

Patrick

2015-12-07 23:55:39 UTC

HulaHoop

2015-12-08 01:05:29 UTC

Yes all my tests were done with “feature disable”.

If the cpu feature is unwanted in the VM, then the hypervisor should be fine with the cpu feature not existing on the host, right?

Thats what I thought too except it looks like the feature has to exist on the host for the mask to be applied or else it fails hard.

I also worry about [future] applications, not compromised ones, leaking CPU info, that are as fancy as webrtc (which leaks local IP addresses).

Can you give examples? At the moment all sensitive data like model name and microcode version is masked by default.

cr0 blog: Time-stamp counter disabling oddities in the Linux kernel

Side channel attacks are a whole field that cryptographers have to deal with when designing and applying cryptography in the real world. They have to account for the fact that hardware is imperfect and can leak information about the key to attackers, Its not something we can solve here at such a simple level but something cryptographers account for when hacking on OpenSSL.

TSC leaks may be similar to to TCP sequence numbers.

Nothing is leaked to the network by TSC. Its also heavily affected by system load which caused the instruction to miss ticks so hardware manufacturers implemented constant_tsc to keep timers from skewing.

HulaHoop

2015-12-08 01:48:48 UTC

Patrick

2015-12-08 21:33:49 UTC

! In T449#7525, @HulaHoop wrote:
I also worry about [future] applications, not compromised ones, leaking CPU info, that are as fancy as webrtc (which leaks local IP addresses).

Can you give examples? At the moment all sensitive data like model name and microcode version is masked by default.

There are no examples as of now, but I would not be surprised after webrtc leaking local client IP.

cr0 blog: Time-stamp counter disabling oddities in the Linux kernel

Side channel attacks are a whole field that cryptographers have to deal with when designing and applying cryptography in the real world. They have to account for the fact that hardware is imperfect and can leak information about the key to attackers, Its not something we can solve here at such a simple level but something cryptographers account for when hacking on OpenSSL.

The goal should be to make the VM as autonomous and isolated as possible. A compromised browser VM should not be able to interfere with any other cryptographic operations on the host or in other VMs. If we can get rid of TSC before cryptographers come up with clever defenses, and before new clever attacks to these clever defenses have been published, that would be awesome.

TSC leaks may be similar to to TCP sequence numbers.

Nothing is leaked to the network by TSC.

Right. Normally not. Local compromise was assumed. Then the TSC could be analyzed by malware and/or send over the network. I see I am generating confusion by switching threat models.

HulaHoop

2015-12-10 03:21:10 UTC

HulaHoop

2015-12-12 03:31:21 UTC

HulaHoop

2015-12-12 03:38:33 UTC

KVM cpus support a baseline of features by default. You can mask out the problematic ones and don’t have to worry about the extra ones it doesn’t support because it will be masked out anyhow (because it was never supported in the first place).

The only bad instructions we should filter out are a subset of whatever instructions are listed under the virtual cpu from the output of the cpu_map.xml list

cat /usr/share/libvirt/cpu_map.xml

I figured out safe defaults and will do a pull request. NB clflush was abused to carry out the rowhammer attack so its blacklisted. aes will be passed through for crypto performance - it doesn’t mess with random number generation.
  <cpu mode='custom' match='exact'>
    <model fallback='forbid'>qemu64</model>
    <topology sockets='1' cores='2' threads='1'/>
    <feature policy='disable' name='tsc'/>
    <feature policy='disable' name='clflush'/>
    <feature policy='optional' name='aes'/>
  </cpu>
libvirt: Domain XML format

Informative link about cpu flag functionality:
linux - What do the flags in /proc/cpuinfo mean? - Unix & Linux Stack Exchange

Daniel P. Berrangé » Blog Archive » Guest CPU model configuration in libvirt with QEMU/KVM

"Every hypervisor has its own policies for what a guest will see for its CPUs by default, Xen just passes through the host CPU, with QEMU/KVM the guest sees a generic model called “qemu32” or “qemu64”. "

cat output:
    <model name='qemu64'>
     
     <feature name='apic'/>
     <feature name='clflush'/>
     <feature name='cmov'/>
     <feature name='cx16'/>
     <feature name='cx8'/>
     <feature name='de'/>
     <feature name='fpu'/>
     <feature name='fxsr'/>
     <feature name='lm'/>
     <feature name='mca'/>
     <feature name='mce'/>
     <feature name='mmx'/>
     <feature name='msr'/>
     <feature name='mtrr'/>
     <feature name='nx'/>
     <feature name='pae'/>
     <feature name='pat'/>
     <feature name='pge'/>
     <feature name='pni'/>
     <feature name='pse'/>
     <feature name='pse36'/>
     <feature name='sep'/>
     <feature name='sse'/>
     <feature name='sse2'/>
     <feature name='svm'/>
     <feature name='syscall'/>
     <feature name='tsc'/>
   </model>

Patrick

2015-12-12 04:31:43 UTC

Patrick

2016-06-01 13:29:21 UTC

HulaHoop

2016-06-01 23:44:16 UTC

Patrick

2016-06-02 08:59:06 UTC

Patrick

2016-06-02 09:23:09 UTC

HulaHoop

2016-06-02 14:43:11 UTC

HulaHoop

2016-06-07 13:55:00 UTC