feat(rust/cbork): deterministic map decoding helper #360

cong-or · 2025-06-10T20:55:14Z

The code satisfies RFC 8949's deterministic CBOR map requirements by ensuring:

Key Ordering
- Shorter keys before longer ones
- Lexicographical ordering for equal-length keys
Key Uniqueness
- No duplicate keys allowed
Encoding Rules
- Uses minimal length encoding
- No indefinite-length items

All of these requirements are validated by the helper and covered by tests.
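For readers who want the ordering and uniqueness rules in code form, here is a minimal sketch. It is not the code from this PR: `compare_keys_length_first` and `check_key_order` are hypothetical names, and the sketch assumes the keys are already available as their encoded byte slices. It follows the length-first rule listed above (RFC 8949 Section 4.2.3), and it treats an equal comparison as a duplicate key.

```rust
use std::cmp::Ordering;

/// Compares two encoded map keys with the length-first rule:
/// shorter encodings sort first, equal-length encodings compare bytewise.
fn compare_keys_length_first(a: &[u8], b: &[u8]) -> Ordering {
    a.len().cmp(&b.len()).then_with(|| a.cmp(b))
}

/// Returns `Ok(())` when the encoded keys are strictly increasing.
/// A strict check rejects both misordered and duplicate keys in one pass.
fn check_key_order(encoded_keys: &[&[u8]]) -> Result<(), String> {
    for pair in encoded_keys.windows(2) {
        match compare_keys_length_first(pair[0], pair[1]) {
            Ordering::Less => {}
            Ordering::Equal => return Err("duplicate map key".to_string()),
            Ordering::Greater => {
                return Err("map keys are not in length-first order".to_string());
            }
        }
    }
    Ok(())
}
```

Comparing adjacent pairs is enough because a strictly increasing sequence cannot contain duplicates anywhere.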

Example usage:

// Assumed import path, based on the file location
// rust/cbork-utils/src/deterministic_helper.rs
use cbork_utils::deterministic_helper::decode_map_deterministically;
use minicbor::Decoder;

fn main() -> Result<(), Box<dyn std::error::Error>> {
    // Example of a valid CBOR map with 2 entries
    let cbor_bytes = vec![
        0xA2,       // Map with 2 pairs
        0x41, 0x01, // Key 1: 1-byte byte string containing 0x01
        0x41, 0x30, // Value 1: 1-byte byte string containing 0x30 (ASCII "0")
        0x41, 0x02, // Key 2: 1-byte byte string containing 0x02
        0x41, 0x31, // Value 2: 1-byte byte string containing 0x31 (ASCII "1")
    ];

    // Create a decoder over the CBOR bytes
    let mut decoder = Decoder::new(&cbor_bytes);

    // Decode and validate the map
    match decode_map_deterministically(&mut decoder) {
        Ok(validated_bytes) => {
            // validated_bytes contains the original CBOR bytes, since they were valid
            println!("Valid deterministic map! Bytes: {:?}", validated_bytes);
        },
        Err(e) => {
            println!("Invalid deterministic encoding: {}", e);
        },
    }

    Ok(())
}

P.S. I will open a separate PR for an abstracted CBOR decoder with optional deterministic decoding and logging.

WIP

Adds validation for minimal length encoding of string types (Str and Bytes) in the DeterministicDecoder according to RFC 8949 Section 4.2. This ensures that string lengths are encoded using the minimal number of bytes required. For example, strings of length 0-23 must use direct encoding, length 24-255 must use one byte, etc.

The changes:
- Add length validation for Type::Str and Type::Bytes
- Check for indefinite length strings
- Validate minimal length encoding using check_minimal_length function
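To make the minimal-length rule concrete, here is a small sketch of the size thresholds. It is an illustration only and not the `check_minimal_length` function from this change, whose signature is not shown here; `minimal_argument_bytes` and `is_minimal_length` are hypothetical names.

```rust
/// Returns the smallest number of bytes that must follow the initial byte
/// to encode `value` as a CBOR length (0 means it fits in the initial byte).
fn minimal_argument_bytes(value: u64) -> usize {
    match value {
        0..=23 => 0,
        24..=0xFF => 1,
        0x100..=0xFFFF => 2,
        0x1_0000..=0xFFFF_FFFF => 4,
        _ => 8,
    }
}

/// Hypothetical check: `encoded_bytes` is the number of length bytes that were
/// actually used on the wire; the encoding is minimal only if it matches.
fn is_minimal_length(value: u64, encoded_bytes: usize) -> bool {
    encoded_bytes == minimal_argument_bytes(value)
}
```

For example, a string of length 5 whose length is carried in one extra byte (0x58 0x05 for a byte string) would be rejected, because 5 fits directly in the initial byte.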

Adds comprehensive test coverage for RFC 8949 Section 4.2 deterministic encoding requirements.

The new tests verify:
- Minimal length integer encoding rules for values 0-23, 24-255, etc.
- Floating point value requirements including shortest form and non-finite prohibition
- String/array/map length encoding rules and indefinite length checks
- Map key ordering rules with length-first canonical ordering

Each test includes detailed comments explaining:
- The specific RFC requirement being tested
- Byte-level breakdown of CBOR encodings
- Why each test case is valid or invalid
- References to relevant RFC sections

This ensures proper validation of all deterministic encoding rules and helps maintainers understand the requirements.
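The indefinite length checks mentioned above come down to inspecting the initial byte of an item. A minimal sketch, assuming only the CBOR initial-byte layout (major type in the top three bits, additional information in the low five bits); `is_indefinite_length` is a hypothetical name and not a function from this PR:

```rust
/// Returns true when `initial_byte` starts an indefinite-length byte string,
/// text string, array, or map (major types 2 through 5 with additional
/// information 31). Deterministic encoding does not allow these.
fn is_indefinite_length(initial_byte: u8) -> bool {
    let major_type = initial_byte >> 5;
    let additional_info = initial_byte & 0x1F;
    matches!(major_type, 2..=5) && additional_info == 31
}
```

Note that 0xFF also carries additional information 31, but it belongs to major type 7 and is the "break" stop code rather than the start of an item, so the sketch does not flag it.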

Add detailed test cases for deterministic CBOR encoding rules as specified in RFC 8949 section 4.2.

The new tests cover:
- Integer boundary conditions and minimal encoding requirements
- Negative integer encoding across different ranges
- Map key ordering (length-first, then lexicographic)
- Floating point encoding with different precision requirements
- String comparison ordering including UTF-8 handling
- Nested structure validation
- Array length encoding rules
- Duplicate map key detection

The tests are extensively documented with RFC requirements and include TODOs for future validation improvements, particularly for floating point handling where additional checks for non-finite values and minimal encoding could be added. Includes commented-out test cases that can be enabled once support for validating non-finite floating point values is implemented.

RFC: https://datatracker.ietf.org/doc/html/rfc8949#section-4.2
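As a reference point for the integer boundary and negative integer cases, the sketch below produces the shortest encoding for a signed integer. It is a standalone illustration under the assumption that only major types 0 and 1 are involved; `encode_head` and `encode_int` are hypothetical helpers and not part of this PR or of minicbor.

```rust
/// Encodes a CBOR head (major type plus argument) in the shortest form.
fn encode_head(major_type: u8, argument: u64) -> Vec<u8> {
    let mt = major_type << 5;
    match argument {
        0..=23 => vec![mt | argument as u8],
        24..=0xFF => vec![mt | 24, argument as u8],
        0x100..=0xFFFF => {
            let mut out = vec![mt | 25];
            out.extend_from_slice(&(argument as u16).to_be_bytes());
            out
        }
        0x1_0000..=0xFFFF_FFFF => {
            let mut out = vec![mt | 26];
            out.extend_from_slice(&(argument as u32).to_be_bytes());
            out
        }
        _ => {
            let mut out = vec![mt | 27];
            out.extend_from_slice(&argument.to_be_bytes());
            out
        }
    }
}

/// Encodes a signed integer: non-negative values use major type 0,
/// negative values use major type 1 with argument `-1 - value`.
fn encode_int(value: i64) -> Vec<u8> {
    if value >= 0 {
        encode_head(0, value as u64)
    } else {
        encode_head(1, (-1 - value) as u64)
    }
}
```

Because the argument of a negative integer is `-1 - value`, the size boundaries sit one step lower than for unsigned values: -24 still fits in the initial byte, while -25 needs one extra byte.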

Refactor test cases to fix clippy warnings:
- Use simpler iterator chaining in array length test
- Remove redundant calls
- Replace explicit type annotations with inferred types
- Fix collect() with redundant map operations

Also simplify floating point test cases to match current implementation and improve RFC 8949 compliance documentation. The floating point tests now focus on valid encodings while keeping commented-out future test cases for non-finite values validation. Tests still verify the same RFC requirements but with more idiomatic Rust code.

Improve documentation and refactor validate_next() to align with RFC 8949 § 4.2 specification for deterministically encoded CBOR. Split validation logic into smaller, focused functions for better maintainability.

- Split validate_next into specialized validation functions:
  * validate_integer() - Handles minimal integer encoding
  * validate_array() - Validates definite-length arrays
  * validate_string() - Checks string/bytes encoding
  * validate_map() - Ensures proper key ordering
- Add comprehensive documentation referencing RFC 8949:
  * Detail core deterministic encoding requirements
  * Document rules for integer minimality
  * Explain length field constraints
  * Specify map key ordering rules
  * Include examples of valid/invalid encodings

This refactoring improves code organization while maintaining full compliance with the CBOR deterministic encoding specification. The enhanced documentation helps developers understand both implementation details and RFC requirements.

github-actions · 2025-06-10T21:24:48Z

✅ Test Report | Pass: 331/331 | Fail: 0/331 |

no30bit

LGTM, but I've left some comments mostly on the error types.

On that topic, I am a bit hesitant about introducing DetermenisticError. Some of its variants have parallels in minicbor::decode::Error. For example, DetermenisticError::UnexpectedEof is the same as Error::end_of_input() from minicbor. It gets a bit confusing, considering the latter is also wrapped under one of the variants.

If I were to implement this, I'd rather wrap the other way around: our local error inside minicbor's Error::custom. Our error would then be extractable via std::error::Error::source, and we'd get seamless integration inside minicbor::Decode implementations, plus position reporting via Error::at.
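To illustrate the shape of that suggestion without depending on minicbor, here is a std-only sketch. `DecodeError` is a stand-in for the library's error type (it is not minicbor's `Error`, only a mimic of the `custom` / `at` pattern), and `DeterministicViolation` is a hypothetical local error with made-up variants.

```rust
use std::error::Error;
use std::fmt;

/// Hypothetical local error for deterministic-encoding violations.
#[derive(Debug, PartialEq)]
enum DeterministicViolation {
    DuplicateKey,
    UnorderedKeys,
}

impl fmt::Display for DeterministicViolation {
    fn fmt(&self, f: &mut fmt::Formatter<'_>) -> fmt::Result {
        match self {
            Self::DuplicateKey => write!(f, "duplicate map key"),
            Self::UnorderedKeys => write!(f, "map keys out of order"),
        }
    }
}

impl Error for DeterministicViolation {}

/// Stand-in for a decoder library's error type that can carry a custom cause.
#[derive(Debug)]
struct DecodeError {
    position: Option<usize>,
    cause: Box<dyn Error + Send + Sync + 'static>,
}

impl DecodeError {
    /// Wraps any local error as the cause.
    fn custom<E: Error + Send + Sync + 'static>(err: E) -> Self {
        Self { position: None, cause: Box::new(err) }
    }

    /// Attaches the input position where the error occurred.
    fn at(mut self, position: usize) -> Self {
        self.position = Some(position);
        self
    }
}

impl fmt::Display for DecodeError {
    fn fmt(&self, f: &mut fmt::Formatter<'_>) -> fmt::Result {
        match self.position {
            Some(p) => write!(f, "decode error at position {}: {}", p, self.cause),
            None => write!(f, "decode error: {}", self.cause),
        }
    }
}

impl Error for DecodeError {
    fn source(&self) -> Option<&(dyn Error + 'static)> {
        let inner: &(dyn Error + 'static) = &*self.cause;
        Some(inner)
    }
}
```

A caller that cares about the specific violation can recover it with `err.source().and_then(|s| s.downcast_ref::<DeterministicViolation>())`, while callers that only need a decode error can ignore the cause entirely.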

rust/cbork-utils/src/deterministic_helper.rs

no30bit · 2025-06-11T08:37:16Z


+        d.skip()?;
+        let value_end = d.position();
+
+        // Extract the raw bytes for both key and value


Since Decoder::skip skips over a whole CBOR value, which can't be represented in fewer than 1 byte, wouldn't it be safe to assume that the ranges extracted below are valid by definition?

Or are we testing the Decoder::skip implementation itself here?

addressing this now

@no30bit error handling has been addressed, along with the CBOR header helper, which can be used for various types.


cong-or added 30 commits May 25, 2025 18:32

feat(deterministic decoder): rfc template

b4d681b

WIP

fmt

2f143b5

docs

f783e57

docs: Add comprehensive documentation for CBOR deterministic validation

e00c280

Merge branch 'main' into feat-deterministic-cbor-decoder

cd772d6

Add violation test cases for string comparison ordering

b338c32

Add violation test cases for string comparison ordering

9b225eb

feat(deterministic cbor): toggle validation

3b30700

feat(deterministic cbor): toggle validation

50714cc

feat(deterministic cbor): toggle validation

6a9e068

refactor(generic decoder): helper functions

4f5f490

refactor(generic decoder): helper functions

7755322

refactor(generic decoder): helper functions

0cba61a

refactor(deterministic maps): rfc validation

16ea67c

refactor(deterministic maps): rfc validation

3e75b20

refactor(deterministic maps): rfc validation

63ed93d

refactor(deterministic maps): rfc validation

9a041d4

refactor(deterministic maps): rfc validation

228eab9

refactor(deterministic maps): rfc validation

dea6c33

refactor(deterministic maps): rfc validation

39b9c77

refactor(deterministic maps): rfc validation

b6a9b97

refactor(deterministic maps): rfc validation

550a6ad

refactor(deterministic maps): rfc validation

dcac645

refactor(deterministic maps): rfc validation

ea548a0

cong-or added 7 commits June 9, 2025 16:47

feat(deterministic map decoder only): rfc 8949

f2c98ca

feat(deterministic map decoder only): rfc 8949

0971eed

feat(deterministic map decoder only): rfc 8949

2e0afe1

feat(deterministic map decoder only): rfc 8949

e900867

feat(deterministic map decoder only): rfc 8949

3f6215e

feat(deterministic map decoder only): rfc 8949

857d194

feat(deterministic map decoder only): rfc 8949

a98ea60

cong-or self-assigned this Jun 10, 2025

Merge branch 'main' into feat-deterministic-cbor-decoder

90c8818

cong-or changed the title ~~feat(rust/cbork-utils): deterministic map decoding helper~~ feat(rust/cbork): deterministic map decoding helper Jun 10, 2025

cong-or added the "review me" label (PR is ready for review) Jun 10, 2025

cong-or marked this pull request as ready for review June 10, 2025 20:59

cong-or added 2 commits June 10, 2025 22:02

feat(deterministic map decoder only): rfc 8949

02321ab

feat(deterministic map decoder only): rfc 8949

0f87487

cong-or requested review from stevenj, no30bit and bkioshn June 10, 2025 21:25

no30bit reviewed Jun 11, 2025


cong-or added 5 commits June 11, 2025 10:27

refactor(pr changes): houskeeping

7479ec4

fmt

69fa7e3

refactor(pr changes): houskeeping

c041031

refactor(pr changes): houskeeping

1b2e1be

refactor(pr changes): houskeeping

054fb92

stevenj requested changes Jun 11, 2025


rust/cbork-utils/src/deterministic_helper.rs (outdated, resolved)

cong-or added 4 commits June 11, 2025 20:45

refactor(pr changes): houskeeping

32e1a4b

refactor(cleanup): actual vs declared length helper

176265d

refactor(cleanup): actual vs declared length helper

d16262a

refactor(cleanup): actual vs declared length helper

03a77a4

cong-or requested a review from no30bit June 14, 2025 21:38


cong-or Jun 12, 2025


cong-or Jun 14, 2025
