[tlul] Side-channel Hamming weight leakage of `data` on TL-UL #16767

ballifatih · 2022-12-09T22:26:10Z

TODOs

Implement SW guidelines described in this comment.

Original Description

I would like to get some security opinion about possible side-channel leakage on TL-UL transactions.

In some TL-UL transactions between Ibex and crypto HWIPs, the data part of the transaction is reset to 0. If the side-channel leakage caused by the TL-UL bus has good correlation with the Hamming distance, then I suspect the Hamming weights of the secrets passed with TL-UL transactions might be exposed to an attacker. I think double transition from 0 to data amplifies this effect (0x0 -> data -> 0x0). From side-channel perspective, to me it seems like keeping the last sent value on the data significantly increases the difficulty of recovering the value of each individual word of a secret. I guess resetting data to 0 also has its benefits, but I am not able to see all angles of such a trade-off.

Since we are using peripheral connections to pass secrets among HWIPs, most keys are already immune to this. However, there are still some keys that are passed over TL-UL (not the exhaustive list):

Keymgr generated SW keys,
Keymgr generated identity seed (if they are passed to OTBN through SW),
SW generated/managed symmetric keys/secrets for AES/OTBN/KMAC/HMAC HWIP.

This observation (0x0 -> data -> 0x0) is not consistent on all sides of xbar, and I only looked at two examples.

In the first waveform, Ibex is reading identity seed (target=SW) from keymgr:

The secret word in this example is 75e7_b7ac,
keymgr TL-UL output: data_prev -> data_next so the previous value is kept on data,
Ibex TL-UL input: 0x0 -> data -> 0x0 so data is reset to 0 after transition.

Ibex is writing to key to AES:

The secret word in this example is c11e_955a,
Ibex TL-UL output: 0 -> data -> 0x20 (I don't understand why 0x20 is loaded into data on Ibex side since there is no transaction),
AES TL-UL input: 0 -> data -> 0.

cc: @johannheyszl @jadephilipoom @bilgiday @gdessouky

The text was updated successfully, but these errors were encountered:

johannheyszl · 2022-12-12T11:51:21Z

Thanks @ballifatih, cc @moidx

TL;DR:

TL-UL data confidentiality vs SCA
discussion of potential sources of SCA leakage
peripherals seem to leave data on their output
bus multiplexer switches between inputs and then back to zero after read op

I'd assume that most values are in shares; @ballifatih

re key manager created keys that are read over TL-UL: Are those in shares?
re SW keys that are passed into HW-IP: They are also in shares I assume?
re key manager generated identity seed - same, is this in shares?

generally, IMO if we keep any old, potentially sensitive, values on the bus, through switching, we might create even more instances of Hamming distance between either zeros or other values.

johannheyszl · 2022-12-12T11:54:27Z

@vogelpi for viz

ballifatih · 2022-12-12T13:24:19Z

@johannheyszl AFAIU all these secret values are sent in two shares over TL-UL (I can see related CSRs have two shares). Since each word of the two shares are sent in sequential TL-UL transactions, I think it makes sense to assume that the attacker can read HW of both shares. I can see two follow-up discussion points:

Is 0 -> data -> 0 is really worse than data_prev -> data_next transition from SCA perspective?
Are two shares sent in sequence enough to prevent SCA? For 32-bit two words X and Y, if the attacker gets both HW(X) and HW(Y), then how much recovering advantage is obtained on X XOR Y? In particular, one should note that the attacker will get different (HW(X), HW(Y)) values for the same X XOR Y during each observation.

p.s. I am using HW(X) to refer to Hamming weight of the value X.

johannheyszl · 2022-12-12T15:08:14Z

thx.

Sharing is fresh after each power-up or ideally on every access?
Yes, zero to value is IMO worse than distance between values.
For masked values, a direct succession of masked value and mask on the same bus is not nice.

ballifatih · 2022-12-13T23:43:21Z

In the case of ID generation by keymgr, the randomness comes from KMAC. Each power-up should have fresh randomness. Each new invocation of generate-ID should also have fresh randomness. Once this ID is generated, it is stored in CSR, so reading it multiple times from CSR should return the secret with same masks. It's harder to guess what happens when SW generates/controls the secret key, then writes it to one of the crypto HWIPs.
Agreed, open to discussion.
Since both shares are read from registers, I think the ordering among words can be changed. SW can even interleave reading secret key/identity with other non-secret TL-UL transactions (not suggesting we should do that). The order of words can also be randomized.

johannheyszl · 2022-12-15T07:27:18Z

thx! nice, so:

averaging of traces: attacker may average if masked values are transferred over bus multiple times. if that is happening it is known from open source code.
randomizing order of bus tranfers: SW can randomize all bus accesses for multi word data and shares. this should de-facto prevent averaging.

tjaychen · 2022-12-15T16:08:14Z

hey all,
could you shed some more light on how the shares read in sequence creates and issue? Is the basic idea that the bus is narrower (fewer bits toggling), so it would be easier for an attacker to figure out the hamming weight? Secondly, assuming the register can be read multiple times (from keymgr), is the idea of averaging to reduce the noise from other parts of the bus so that the HW of the bus values can be surfaced?

Lastly, I am unsure now if this helps or hurts, but the software output registers from keymgr are actually "read clear". Meaning you cannot actually repeatedly read them. But it also means after every read there is a "value" -> "0" transition.

tjaychen · 2022-12-15T16:09:29Z

the 0 transition on the ibex probably has more to do with how the tlul sockets are constructed.. ie, for a peripheral that is not selected, all of its inputs probably just get blanked.

johannheyszl · 2022-12-15T16:27:09Z

thx tim. Our gut feeling is that we will likely not have an issue here. We will discuss today in the SCA sub WG. We might put a leakage test on the post-silicon test plan to make sure if we think its necessary.

re shares in sequence: if in any of the TL-UL registers or other, shares are loaded through FFs in sequence, the occurring Hamming distance would be equal to the Hamming weight of the unmasked value. But this is only if e.g. word 0 from share 0 is succeeded by word 0 from share 1. If reading all words from share 0 then all of share 1, this is IMO not an issue.

re averaging: Repeating through reading multiple times, allows averaging out of noise factors such as electrical noise in measurement chain, and noise signal from uncorrelated logic/functionality on OT. Experience shows that attacks on such wide words only ever succeed if averaging is possible to get 'good samples' for for template matching. All correlated noise remains of course. If the sequence of words is randomized, averaging is not possible, which is nice :)

tjaychen · 2022-12-15T16:30:25Z

sounds good, should this become software guidance then? it sounds like two things..always process 1 share fully ahead of the other. And within that share, randomize the sequence. This probably means we can't have any fifo like structures to store the keys (i dont think we do), but it might be something we will have to double check.

ballifatih · 2022-12-15T17:35:13Z

Summarizing some points from OT-SCA meeting:

0 -> data -> 0 behavior is not devastating. In the worst case, through template attacks, with many collected power traces, the attacker might be able to get the Hamming weight of each 32-bit chunk of a secret. Even then, this does not give out too much information on the full key.
data_prev -> data_next behavior is undesired for other reasons, like reducing the exposure of this value sitting on the data port to fault injection (FI) attacks or invasive physical probing. In short, there is a benefit in shortening the exposure time of a sensitive value on the bus as pointed out by @vogelpi and @cdgori.

And on the SW guideline side:

Avoid reading/writing secrets in 8-bit or 16-chunks.
Reading/writing shares in alternating manner is probably bad. Process one share fully and then move to another.
As @johannheyszl suggested randomizing the loading order of key words might be an additional counter-measure that we can implement on SW side, if needed later.
As @vogelpi and @bilgiday pointed out, feeding some random values from an LFSR post-transaction is also an idea we can keep on the side for now.

What remains is to check whether TL-UL adapters are behaving as intended. Two unexpected observations:

Why do we see data_prev -> data_next on the keymgr TL-UL output?
What is the value 0x20 that leaks to TL-UL data port from Ibex side?

I will look at these small TL-UL inconsistencies again and create a spin-off issue for those.

vogelpi · 2022-12-15T17:43:07Z

Thanks @ballifatih for starting this discussion and preparing the ot-sca meeting. It's an interesting and relevant topic I believe. I fully agree with your summary above.

On a side note, inside the entropy complex data_prev -> data_next is preferred over 0 -> data -> 0 because there we don't have spurious write enable protection and latching in any deterministic value downstream e.g. through FI would be very bad. But you summarized in your comment above, for the TL-UL bus things are different.

andreaskurth · 2023-02-21T16:28:47Z

Triaged for tlul:

What remains is to check whether TL-UL adapters are behaving as intended. Two unexpected observations:
* Why do we see `data_prev -> data_next` on the `keymgr` TL-UL output?

* What is the value `0x20` that leaks to TL-UL `data` port from Ibex side?
I will look at these small TL-UL inconsistencies again and create a spin-off issue for those.

@ballifatih: Could you please link the issue here? Do your findings there agree with the following:

IIUC the discussion above, we'll resolve this issue with SW guidelines post M2.5 but don't need to take action for M2.5. If so, I'd tag this Type:Icebox Changes deferred to future milestones . @vogelpi: Do you agree?

ballifatih · 2023-02-21T17:06:33Z

Sorry @andreaskurth, I couldn't get back to this issue to spin off the relevant discussion. Here it is #17330, so that we can isolate the TL-UL discussion from the SCA/security discussion.

Feel free to close this issue @andreaskurth and use the new one.

andreaskurth · 2023-02-22T08:46:14Z

Thanks @ballifatih (and no worries 🙂 )!

From your summary above, I think

And on the SW guideline side:

* Avoid reading/writing secrets in 8-bit or 16-chunks.

* Reading/writing shares in alternating manner is probably bad. Process one share fully and then move to another.

* As @johannheyszl suggested randomizing the loading order of key words might be an additional counter-measure that we can implement on SW side, if needed later.

* As @vogelpi and @bilgiday pointed out, feeding some random values from an LFSR post-transaction is also an idea we can keep on the side for now.

is still open and tracked by this issue. So I would keep this issue open to track the completion of the SW guidelines. I'm changing the labels accordingly and will Type:Icebox Changes deferred to future milestones it because non-ROM SW can be done post M2.5. @alphan: I think ROM code already adheres to those SW guidelines, right?

Let's continue the TL-UL hardware discussion in #17330.

johannheyszl · 2023-12-12T13:09:32Z

@jadephilipoom this is an issue with items for SW security guidelines (which I think are already covered). Let's close if redundant. thanks

ballifatih added Component:Security Hotlist:Security Security Opinion Needed IP:tlul labels Dec 9, 2022

ballifatih mentioned this issue Jan 19, 2023

[cryptolib] Connect SHAKE/CSHAKE/KMAC to public cryptolib API #16936

Merged

moidx added this to the Project: M3 milestone Jan 27, 2023

ballifatih mentioned this issue Feb 21, 2023

[tlul] TL-UL data zeroing inconsistencies #17330

Open

andreaskurth added Component:Doc Documentation issue Component:Software Issue related to Software Type:Task Tasks, to-do list. Type:Icebox Changes deferred to future milestones Triaged and removed Hotlist:Security Security Opinion Needed IP:tlul labels Feb 22, 2023

ballifatih mentioned this issue Sep 3, 2023

[crypto] Update the KMAC driver in preparation for aligned buffers. #19554

Merged

msfschaffner added the Earlgrey-PROD Candidate Temporary label to triage issues into Earlgrey-PROD Milestones label Oct 7, 2023

msfschaffner modified the milestones: Discrete: M3, Earlgrey-PROD.M3 Nov 8, 2023

msfschaffner added Hotlist:Security Security Opinion Needed and removed Triaged labels Nov 8, 2023

johannheyszl added SW:cryptolib Crypto library and removed Hotlist:Security Security Opinion Needed Earlgrey-PROD Candidate Temporary label to triage issues into Earlgrey-PROD Milestones Type:Task Tasks, to-do list. Type:Icebox Changes deferred to future milestones labels Dec 12, 2023

msfschaffner added the Earlgrey-PROD Candidate Temporary label to triage issues into Earlgrey-PROD Milestones label Dec 20, 2023

johannheyszl modified the milestones: Earlgrey-PROD.M3, cryptolib Jan 12, 2024

This was referenced Feb 20, 2024

[tlul] D2(S) Signoff #20985

Closed

[tlul] V2(S) Signoff #21020

Closed

This was referenced Mar 28, 2024

[keymgr] D2(S) Signoff #20981

Closed

[hmac] D2(S) Signoff #20996

Closed

martin-velay mentioned this issue Jun 6, 2024

[hmac] V2(S) Signoff #22471

Closed

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

[tlul] Side-channel Hamming weight leakage of `data` on TL-UL #16767

[tlul] Side-channel Hamming weight leakage of `data` on TL-UL #16767

ballifatih commented Dec 9, 2022 •

edited by andreaskurth

Loading

johannheyszl commented Dec 12, 2022

johannheyszl commented Dec 12, 2022

ballifatih commented Dec 12, 2022

johannheyszl commented Dec 12, 2022

ballifatih commented Dec 13, 2022

johannheyszl commented Dec 15, 2022

tjaychen commented Dec 15, 2022

tjaychen commented Dec 15, 2022

johannheyszl commented Dec 15, 2022

tjaychen commented Dec 15, 2022

ballifatih commented Dec 15, 2022 •

edited

Loading

vogelpi commented Dec 15, 2022

andreaskurth commented Feb 21, 2023

ballifatih commented Feb 21, 2023

andreaskurth commented Feb 22, 2023 •

edited

Loading

johannheyszl commented Dec 12, 2023

[tlul] Side-channel Hamming weight leakage of data on TL-UL #16767

[tlul] Side-channel Hamming weight leakage of data on TL-UL #16767

Comments

ballifatih commented Dec 9, 2022 • edited by andreaskurth Loading

TODOs

Original Description

johannheyszl commented Dec 12, 2022

johannheyszl commented Dec 12, 2022

ballifatih commented Dec 12, 2022

johannheyszl commented Dec 12, 2022

ballifatih commented Dec 13, 2022

johannheyszl commented Dec 15, 2022

tjaychen commented Dec 15, 2022

tjaychen commented Dec 15, 2022

johannheyszl commented Dec 15, 2022

tjaychen commented Dec 15, 2022

ballifatih commented Dec 15, 2022 • edited Loading

vogelpi commented Dec 15, 2022

andreaskurth commented Feb 21, 2023

ballifatih commented Feb 21, 2023

andreaskurth commented Feb 22, 2023 • edited Loading

johannheyszl commented Dec 12, 2023

[tlul] Side-channel Hamming weight leakage of `data` on TL-UL #16767

[tlul] Side-channel Hamming weight leakage of `data` on TL-UL #16767

ballifatih commented Dec 9, 2022 •

edited by andreaskurth

Loading

ballifatih commented Dec 15, 2022 •

edited

Loading

andreaskurth commented Feb 22, 2023 •

edited

Loading