round-trip encoding for extra zero bytes by mrnerdhair · Pull Request #13 · lexxvir/tlv-parser

mrnerdhair · 2025-07-17T18:28:48Z

Unfortunately, I've realized that having round-trip encoding of extra zero bytes is important e.g. for being able to correctly compose the DDA data for checking the hash in the decoded ICC certificate.

It's not super elegant to add a fourth field to a "TLV" element, but I don't know a better way to do it. And I remembered to update the tests this time...

lexxvir · 2025-07-18T10:59:39Z

Thank you for PR! Two notes though:

Is it possible to get leading zeroes with out adding is_zero parameter? Probably Tlv::read_tag can return tuple of the tag number and count of the skipped zeroes.
According to EMV there are trailing zeroes. It would be fine if this case also be handled to be 100% compatible with EMV.

P.S. Some of source code lines are not formatted (according to cargo fmt). I have added cargo fmt --check to the Github workflow in the master branch. So, If you rebase your branch over actual master you build check may failed.

Copilot

Pull request overview

This PR aims to preserve and round-trip “meaningless” 0x00 padding bytes that can appear before/between BER-TLV objects (per EMV guidance), so re-encoding after parsing can reproduce the original byte stream (important for EMV/DDA hash composition use cases).

Changes:

Added leading_zeroes tracking on Tlv and updated parsing to capture leading 0x00 bytes rather than discarding them.
Updated to_vec() to emit any preserved leading zero bytes.
Adjusted quickcheck and unit tests to reflect the new round-trip behavior.

Reviewed changes

Copilot reviewed 3 out of 3 changed files in this pull request and generated 5 comments.

File	Description
`src/tlv.rs`	Stores and emits leading `0x00` padding around TLVs; updates parsing logic and unit tests.
`src/lib.rs`	Changes `TlvError::InvalidTagNumber` shape to support internal “is zero padding” detection.
`tests/lib.rs`	Updates quickcheck property to stop stripping leading zeros before comparing round-tripped bytes.

💡 Add Copilot custom instructions for smarter, more guided reviews. Learn how to get started.

Copilot · 2026-02-09T15:09:47Z

-            .next()
-            .cloned()
-            .ok_or_else(|| TlvError::TruncatedTlv)?;
+        let first: u8 = iter.next().cloned().ok_or_else(|| TlvError::TruncatedTlv)?;


read_tag() no longer skips leading 0x00 bytes (previously it did). from_iter() now re-implements skipping/counting, but other callers like parse_tag_list() still call read_tag() directly, so their behavior changes to error on padding zeros. If padding zeros should still be tolerated there, consider keeping a “skip 0x00” variant of tag parsing for non-roundtrip use cases, or move the zero-skipping logic back into read_tag() behind a mode/flag.

Suggested change

let first: u8 = iter.next().cloned().ok_or_else(|| TlvError::TruncatedTlv)?;

// Skip leading 0x00 bytes (padding) before reading the actual tag.

let first: u8 = loop {

let byte = iter.next().cloned().ok_or_else(|| TlvError::TruncatedTlv)?;

if byte == 0x00 {

continue;

} else {

break byte;

}

};

Copilot · 2026-02-09T15:09:48Z

 pub enum TlvError {
    TruncatedTlv,
    InvalidLength,
-    InvalidTagNumber,
+    InvalidTagNumber { is_zero: bool },
    TooShortBody { expected: usize, found: usize },


Renaming InvalidTagNumber from a unit variant to InvalidTagNumber { is_zero: bool } is a public API breaking change for downstream code that pattern-matches on TlvError. Since is_zero is only used internally to detect/record leading padding bytes (and isn’t reflected in the Display message), consider handling that case without encoding internal details into the public error type (e.g., by peeking/consuming 0x00 bytes in the parser loop, or by using a private/internal error and mapping to InvalidTagNumber).

Copilot · 2026-02-09T15:09:48Z

    pub fn to_vec(&self) -> Vec<u8> {
-        let mut out: Vec<u8> = (self.tag as u64)
+        let mut out: Vec<u8> = vec![0; self.leading_zeroes];
+
+        out.append(&mut (self.tag as u64)
            .to_be_bytes()
            .iter()
            .skip_while(|&&x| x == 0)
            .cloned()
-            .collect();
+            .collect());

        out.append(&mut self.val.encode_len());


to_vec() now prepends leading_zeroes, but Tlv::len()/Value::len() (used by encode_len() and by Tlv::new() when splitting a constructed Value::Val) do not account for those bytes. This makes constructed TLVs emit an incorrect length field and can break re-parsing/indexing when a child TLV has leading zeroes. Include leading_zeroes in the encoded-size calculation (and any other length math) or move the preserved padding out of the TLV byte stream produced by to_vec() so length remains consistent.

Co-authored-by: Copilot <175728472+Copilot@users.noreply.github.com>

mrnerdhair · 2026-02-10T08:05:14Z

(Sorry for the delay here, will try to get through the rest of this tomorrow!)

round-trip encoding for extra zero bytes

b429e20

lexxvir requested a review from Copilot February 9, 2026 15:04

Copilot started reviewing on behalf of lexxvir February 9, 2026 15:04 View session

lexxvir assigned Copilot Feb 9, 2026

Copilot AI reviewed Feb 9, 2026

View reviewed changes

This was referenced Feb 9, 2026

Fix typo in test variable name: truncacted → truncated #14

Merged

Implement round-trip encoding for leading and trailing zero bytes without breaking API #15

Draft

fix typo

4985eb0

Co-authored-by: Copilot <175728472+Copilot@users.noreply.github.com>

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

round-trip encoding for extra zero bytes#13

round-trip encoding for extra zero bytes#13
mrnerdhair wants to merge 2 commits into
lexxvir:masterfrom
mrnerdhair:master

mrnerdhair commented Jul 17, 2025

Uh oh!

lexxvir commented Jul 18, 2025

Uh oh!

Copilot AI left a comment

Uh oh!

Uh oh!

Copilot AI Feb 9, 2026

Uh oh!

Copilot AI Feb 9, 2026

Uh oh!

Uh oh!

Copilot AI Feb 9, 2026

Uh oh!

mrnerdhair commented Feb 10, 2026

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

4 participants

-        let first: u8 = iter.next().cloned().ok_or_else(|| TlvError::TruncatedTlv)?;
+        // Skip leading 0x00 bytes (padding) before reading the actual tag.
+        let first: u8 = loop {
+            let byte = iter.next().cloned().ok_or_else(|| TlvError::TruncatedTlv)?;
+            if byte == 0x00 {
+                continue;
+            } else {
+                break byte;
+            }
+        };

Conversation

mrnerdhair commented Jul 17, 2025

Uh oh!

lexxvir commented Jul 18, 2025

Uh oh!

Copilot AI left a comment

Choose a reason for hiding this comment

Pull request overview

Reviewed changes

Uh oh!

Uh oh!

Copilot AI Feb 9, 2026

Choose a reason for hiding this comment

Uh oh!

Copilot AI Feb 9, 2026

Choose a reason for hiding this comment

Uh oh!

Uh oh!

Copilot AI Feb 9, 2026

Choose a reason for hiding this comment

Uh oh!

mrnerdhair commented Feb 10, 2026

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

4 participants