Use fixed-point u64 math to divide the PLL values #212

BryanKadzban · 2024-02-11T05:32:22Z

This avoids the need for almost 1K of code to implement division for u64 values (which the Cortex-M7 can't do in hardware). It can multiply u64 values in hardware, so instead of dividing by (integer) x, we find 1/x in fixed-point, multiply, and shift the fixed-point offset back out. It can also divide u32 values, and finding 1/x can be done in a u32 since our shift is small enough.

Fixes #211

This avoids the need for almost 1K of code to implement division for u64 values (which the Cortex-M7 can't do in hardware). It can multiply u64 values in hardware, so instead of dividing by (integer) x, we find 1/x in fixed-point, multiply, and shift the fixed-point offset back out. Finding 1/x can be done in a u32 since our shift is small enough, and that division *can* be done in hardware on this CPU. Fixes stm32-rs#211
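For readers who want to see the idea in isolation, here is a minimal sketch of dividing by a fixed-point reciprocal. It is not the code from this PR: the names `FRAC_BITS` and `div_by_reciprocal`, the round-to-nearest steps, and the 34-bit limit on the numerator are all assumptions made for illustration.

```rust
/// Fractional bits of the fixed-point reciprocal (hypothetical constant name).
const FRAC_BITS: u32 = 30;

/// Approximates `numerator / divisor` with one u32 division, one u64
/// multiplication and a shift, rounding to the nearest integer.
/// Assumes `divisor` is non-zero and `numerator` is below 2^34, so that
/// the intermediate product fits in a u64.
fn div_by_reciprocal(numerator: u64, divisor: u32) -> u64 {
    debug_assert!(divisor != 0 && numerator < (1u64 << 34));
    // 1/divisor in fixed point, rounded to nearest: compute one extra bit,
    // add 1, then drop the extra bit. Only a u32 division is needed here.
    let recip = ((1u32 << (FRAC_BITS + 1)) / divisor + 1) >> 1;
    // Multiply, add half an LSB for rounding, then shift the fixed-point
    // offset back out.
    (numerator * u64::from(recip) + (1u64 << (FRAC_BITS - 1))) >> FRAC_BITS
}
```

With this sketch, `div_by_reciprocal(675_000_000, 25)` gives `27_000_000`. Because the result is rounded rather than truncated, it can differ from `/` when the division is not exact.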

BryanKadzban · 2024-02-11T05:34:36Z

Hmm, I seem to be failing a bunch of rcc tests. Let's fix that up...

This requires shifting the input frequencies down by 4 bits in order to fit both the max base_clk*n value, and the 30-bit fractional part, into a u64. (base_clk*n is up to 38 bits wide.) This is OK as long as all input frequencies are a multiple of 250kHz, which has the bottom 4 bits clear.
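The bit budget described above can be checked with a small sketch. The helper names (`bit_width`, `product_fits_u64`, `shift_is_lossless`) are hypothetical and do not come from the crate; the 38-bit and 30-bit figures are the ones quoted in the commit message.

```rust
/// Number of bits needed to represent `x`.
fn bit_width(x: u64) -> u32 {
    u64::BITS - x.leading_zeros()
}

/// True if a value that is `numerator_bits` wide, multiplied by a reciprocal
/// with `frac_bits` fractional bits (at most 2^frac_bits), fits in a u64.
fn product_fits_u64(numerator_bits: u32, frac_bits: u32) -> bool {
    numerator_bits + frac_bits <= u64::BITS
}

/// True if shifting `freq_hz` right by `shift` bits discards only zero bits.
fn shift_is_lossless(freq_hz: u32, shift: u32) -> bool {
    freq_hz.trailing_zeros() >= shift
}
```

A 38-bit numerator with 30 fractional bits needs 68 bits, so `product_fits_u64(38, 30)` is false, while `product_fits_u64(38 - 4, 30)` is true. Since 250_000 = 16 × 15_625, any multiple of 250 kHz passes `shift_is_lossless(freq, 4)`.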

"cargo fmt" does not work because it requires both test mode and the #![no_std] attribute, which the crate lib.rs doesn't enable in test mode. But manually running rustfmt does.

BryanKadzban · 2024-02-11T20:16:31Z

For people not looking at #211, the issue with the tests was that we lost too much precision using only 26 bits for the fractional part of the fixed-point math. It needed at least 30 bits. But the numerator in the division operation needed up to 38 bits, so it would overflow a u64. Fortunately, we could shift the frequency right by 4 bits at the start to make everything fit, as long as the input frequencies are a multiple of 250kHz (they all are in the tests, at least, and I think they are likely to be in people's designs as well).

So now we shift the frequency right by 4 bits and use 30 bits for the fractional part. Tests are all accurate enough to pass.
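To illustrate the precision difference, here is a rough sketch that computes `base_clk * n / m` with a configurable number of fractional bits and an optional pre-shift. The function name, parameters and rounding choices are assumptions for illustration and are not taken from the merged code, so the exact error figures apply to this sketch only.

```rust
/// Approximates `(base_clk * n) / m` using a fixed-point reciprocal of `m`
/// with `frac_bits` fractional bits, after shifting `base_clk` right by
/// `pre_shift` bits. Assumes `m` is non-zero, `1 <= frac_bits <= 30`, and
/// that the intermediate product fits in a u64.
fn pll_vco_approx(base_clk: u32, n: u32, m: u32, frac_bits: u32, pre_shift: u32) -> u64 {
    // Drop the low bits of the input frequency to make room for the fraction.
    let numerator = u64::from(base_clk >> pre_shift) * u64::from(n);
    // Reciprocal of m in fixed point, rounded to nearest, via a u32 division.
    let recip = ((1u32 << (frac_bits + 1)) / m + 1) >> 1;
    // Multiply, round, and shift the fixed-point offset back out.
    let quotient = (numerator * u64::from(recip) + (1u64 << (frac_bits - 1))) >> frac_bits;
    // Undo the pre-shift to get back to Hz.
    quotient << pre_shift
}
```

For a 25 MHz input with n = 432 and m = 25 (exact answer 432 MHz), this sketch returns 432_000_071 with 26 fractional bits and no pre-shift, and exactly 432_000_000 with 30 fractional bits and a 4-bit pre-shift.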

eldruin

This looks good to me in principle, but I do not know exactly how it works. If nobody raises any issues, I would merge this.

eldruin

Alright, let's merge this. Thank you for your work!

BryanKadzban added 2 commits February 11, 2024 11:52

Run rustfmt on rcc.rs

e1ce029

maximeborges requested review from a team, eldruin, maximeborges and mvertescher and removed request for a team March 22, 2024 18:32

eldruin reviewed Mar 22, 2024

eldruin approved these changes Mar 27, 2024

eldruin merged commit f6a5d1f into stm32-rs:main Mar 27, 2024
15 checks passed

BryanKadzban deleted the rm-u64-div branch August 3, 2024 00:37
