cfi: Simpler launder implementation for common types #1714

swenson · 2024-10-10T23:55:50Z

cfi_launder is meant to prevent the Rust compiler from optimizing a value away.

Our current implementation uses core::hint::black_box(), which is the recommended way in Rust. The problem is, this appears to often force the argument to spill into memory and to be reloaded, which can be a lot of extra instructions.

The original inspiration for this function is from, I believe, OpenTitan's launder* functions. There, they use an LLVM-specific trick of a blank inline assembly block to force the compiler to keep the argument in a register.

After reviewing our code and speaking with @vsonims, it sounds like the intention of the launder in our code is to prevent the compiler from optimizing the value away (as the comments suggest), so the simpler inline assembly trick may be sufficient (since we use the official Rust compiler, which uses LLVM).

The biggest problem is that we launder many types of values in our code and not all of them fit into a register.

So, this PR represents an incremental change: for u32s and similar small types, we implement cfi_launder using the inline assembly trick from OpenTitan. For any other types, we have a trait that can be derived that will call core::hint::black_box in the same way as today.

We can do future follow-up PRs to try to try to clean up some of those other uses of cfi_launder to hopefully shrink the code more.

I also slipped in avoid a few extra copies in the verifier by using references instead of copies (this saves ~80 bytes of instruction space).

This PR appears to shrink the ROM code size by 1232 bytes and the runtime firmware by 700 bytes.

cfi/lib/src/cfi.rs

swenson · 2024-10-14T18:26:18Z

(I've also looked at removing Copy traits from a few other types and trying to use more references so that the laundering could be more effective without copying, but I was surprised to find that this increased the code size.

There are more likely a few more types though that would be worth streamlining so we can save more code space, but I leave that to a future PR.)

swenson · 2024-11-04T19:43:06Z

@jhand2, @nquarton, @rusty1968 -- approvals were wiped when I rebased and updated the SHAs. No other changes were made. Please re-review and I'll get someone to merge this ASAP for the 1.2 ROM release.

@vsonims

… value away. Our current implementation uses `core::hint::black_box()`, which is the recommended way in Rust. The problem is, this appears to often force the argument to spill into memory and to be reloaded, which can be a lot of extra instructions. The original inspiration for this function is from, I believe, [OpenTitan's launder* functions](https://github.com/lowRISC/opentitan/blob/master/sw/device/lib/base/hardened.h#L193). There, they use an LLVM-specific trick of a blank inline assembly block to force the compiler to keep the argument in a register. After reviewing our code and speaking with @vsonims, it sounds like the intention of the launder in our code is to prevent the compiler from optimizing the value away (as the comments suggest), so the simpler inline assembly trick may be sufficient (since we use the official Rust compiler, which uses LLVM). The biggest problem is that we launder many types of values in our code and not all of them fit into a register. So, this PR represents an incremental change: for `u32`s and similar small types, we implement `cfi_launder` using the inline assembly trick from OpenTitan. For any other types, we have a trait that can be derived that will call `core::hint::black_box` in the same way as today. We can do future follow-up PRs to try to try to clean up some of those other uses of `cfi_launder` to hopefully shrink the code more. I also slipped in avoid a few extra copies in the verifier by using references instead of copies (this saves ~80 bytes of instruction space). This PR appears to shrink the ROM code size by 1232 bytes and the runtime firmware by 700 bytes.

cfi/lib/src/cfi.rs

…launder

swenson · 2024-11-06T17:46:41Z

Thanks!

@vsonims

* `cfi_launder` is meant to prevent the Rust compiler from optimizing a value away. Our current implementation uses `core::hint::black_box()`, which is the recommended way in Rust. The problem is, this appears to often force the argument to spill into memory and to be reloaded, which can be a lot of extra instructions. The original inspiration for this function is from, I believe, [OpenTitan's launder* functions](https://github.com/lowRISC/opentitan/blob/master/sw/device/lib/base/hardened.h#L193). There, they use an LLVM-specific trick of a blank inline assembly block to force the compiler to keep the argument in a register. After reviewing our code and speaking with @vsonims, it sounds like the intention of the launder in our code is to prevent the compiler from optimizing the value away (as the comments suggest), so the simpler inline assembly trick may be sufficient (since we use the official Rust compiler, which uses LLVM). The biggest problem is that we launder many types of values in our code and not all of them fit into a register. So, this PR represents an incremental change: for `u32`s and similar small types, we implement `cfi_launder` using the inline assembly trick from OpenTitan. For any other types, we have a trait that can be derived that will call `core::hint::black_box` in the same way as today. We can do future follow-up PRs to try to try to clean up some of those other uses of `cfi_launder` to hopefully shrink the code more. I also slipped in avoid a few extra copies in the verifier by using references instead of copies (this saves ~80 bytes of instruction space). This PR appears to shrink the ROM code size by 1232 bytes and the runtime firmware by 700 bytes. (cherry picked from commit 571d253)

@vsonims

* `cfi_launder` is meant to prevent the Rust compiler from optimizing a value away. Our current implementation uses `core::hint::black_box()`, which is the recommended way in Rust. The problem is, this appears to often force the argument to spill into memory and to be reloaded, which can be a lot of extra instructions. The original inspiration for this function is from, I believe, [OpenTitan's launder* functions](https://github.com/lowRISC/opentitan/blob/master/sw/device/lib/base/hardened.h#L193). There, they use an LLVM-specific trick of a blank inline assembly block to force the compiler to keep the argument in a register. After reviewing our code and speaking with @vsonims, it sounds like the intention of the launder in our code is to prevent the compiler from optimizing the value away (as the comments suggest), so the simpler inline assembly trick may be sufficient (since we use the official Rust compiler, which uses LLVM). The biggest problem is that we launder many types of values in our code and not all of them fit into a register. So, this PR represents an incremental change: for `u32`s and similar small types, we implement `cfi_launder` using the inline assembly trick from OpenTitan. For any other types, we have a trait that can be derived that will call `core::hint::black_box` in the same way as today. We can do future follow-up PRs to try to try to clean up some of those other uses of `cfi_launder` to hopefully shrink the code more. I also slipped in avoid a few extra copies in the verifier by using references instead of copies (this saves ~80 bytes of instruction space). This PR appears to shrink the ROM code size by 1232 bytes and the runtime firmware by 700 bytes. (cherry picked from commit 571d253)

swenson requested review from FerralCoder, rusty1968, bluegate010, mhatrevi, vsonims, ajisaxena, korran and JohnTraverAmd as code owners October 10, 2024 23:55

swenson mentioned this pull request Oct 11, 2024

[draft] Constant-time equality checks for sensitive values #1712

Draft

11 tasks

swenson requested a review from jhand2 October 11, 2024 00:00

swenson commented Oct 11, 2024

View reviewed changes

cfi/lib/src/cfi.rs Show resolved Hide resolved

jhand2 reviewed Oct 11, 2024

View reviewed changes

cfi/lib/src/cfi.rs Show resolved Hide resolved

swenson force-pushed the cfi-launder branch 2 times, most recently from 1921e07 to 05a5902 Compare October 14, 2024 16:15

jhand2 previously approved these changes Oct 14, 2024

View reviewed changes

swenson dismissed jhand2’s stale review via ea51249 October 14, 2024 18:11

jhand2 previously approved these changes Oct 14, 2024

View reviewed changes

rusty1968 previously approved these changes Oct 24, 2024

View reviewed changes

nquarton added the Caliptra v1.2 label Oct 31, 2024

nquarton previously approved these changes Nov 4, 2024

View reviewed changes

swenson dismissed stale reviews from nquarton, rusty1968, and jhand2 via 7913afc November 4, 2024 19:35

swenson force-pushed the cfi-launder branch from ea51249 to 7913afc Compare November 4, 2024 19:35

swenson force-pushed the cfi-launder branch from 7913afc to 0746e9a Compare November 4, 2024 21:04

swenson force-pushed the cfi-launder branch from 0746e9a to c809565 Compare November 4, 2024 21:06

nquarton previously approved these changes Nov 4, 2024

View reviewed changes

mhatrevi reviewed Nov 6, 2024

View reviewed changes

cfi/lib/src/cfi.rs Show resolved Hide resolved

mhatrevi previously approved these changes Nov 6, 2024

View reviewed changes

swenson added 2 commits November 6, 2024 09:31

Merge branch 'main' of github.com:chipsalliance/caliptra-sw into cfi-…

5905f76

…launder

Update frozen images

8356cf6

swenson dismissed stale reviews from mhatrevi and nquarton via 8356cf6 November 6, 2024 17:32

mhatrevi approved these changes Nov 6, 2024

View reviewed changes

mhatrevi enabled auto-merge (squash) November 6, 2024 17:46

mhatrevi merged commit 571d253 into main Nov 6, 2024
11 checks passed

swenson deleted the cfi-launder branch November 6, 2024 18:03

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

cfi: Simpler launder implementation for common types #1714

cfi: Simpler launder implementation for common types #1714

swenson commented Oct 10, 2024 •

edited

Loading

swenson commented Oct 14, 2024

swenson commented Nov 4, 2024

swenson commented Nov 6, 2024

cfi: Simpler launder implementation for common types #1714

cfi: Simpler launder implementation for common types #1714

Conversation

swenson commented Oct 10, 2024 • edited Loading

swenson commented Oct 14, 2024

swenson commented Nov 4, 2024

swenson commented Nov 6, 2024

swenson commented Oct 10, 2024 •

edited

Loading