Improve bounds checks #68

Marcono1234 · 2025-12-10T15:41:06Z

As discussed @cowtowncoder, these are the changes for final review. If it is easier, you can review the commits individually.

If possible on merge please preserve the individual commits and don't squash them.

(CC @yawkat)

Reading past the specified end index can be a problem when buffers are reused and the buffer still contains data from a previous use after the end index.

src/main/java/com/ning/compress/lzf/ChunkDecoder.java

cowtowncoder · 2025-12-19T03:24:35Z

src/main/java/com/ning/compress/BufferRecycler.java

    public void releaseEncodeBuffer(byte[] buffer)
    {
        if (_encodingBuffer == null || (buffer != null && buffer.length > _encodingBuffer.length)) {
+            // Clear the buffer to protect against bugs which might leak the content during next use


Hmmmh. Not a big fan of defensive coding; plus, doing this will cancel out much of the performance benefit of buffer reuse.

Or, maybe worse, add overhead for the case where there is no reuse.

Actually, let's just remove these. If we really wanted to, we could add an opt-in configuration for clearing, but for now I don't think we should clear the buffers, due to the overhead.

yawkat · 2025-12-19

@cowtowncoder it can lead to serious issues, see GHSA-cmp6-m4wj-q63q.

cowtowncoder · 2025-12-21

Yes, in the case of bug(s) that allow copying from outside the legal area.

I will remove the clearing from this PR: if there's a reproduction of an actual vulnerability, we can address that separately.

I would also be +1 for adding a setting that simply disables recycling for the most security-conscious users; that would avoid the issue even more reliably.

Removed buffer clearing from this PR; as I said, it can be addressed in a follow-up as necessary.
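
For illustration only, the two options raised in this thread (opt-in clearing on release, and a setting that disables recycling) could look roughly like the sketch below. `SimpleRecycler`, its two flags, and `firstByteAfterReuse` are hypothetical names made up for this example; none of this is the actual `BufferRecycler` API.

```java
import java.util.Arrays;

public class RecyclerSketch {
    /** Hypothetical single-slot recycler; NOT the real com.ning.compress.BufferRecycler API. */
    static final class SimpleRecycler {
        private final boolean recycle;        // false: never retain buffers
        private final boolean clearOnRelease; // true: zero buffers before retaining them
        private byte[] cached;

        SimpleRecycler(boolean recycle, boolean clearOnRelease) {
            this.recycle = recycle;
            this.clearOnRelease = clearOnRelease;
        }

        byte[] allocate(int minSize) {
            byte[] b = cached;
            if (b != null && b.length >= minSize) {
                cached = null; // hand out the retained buffer, possibly with old content
                return b;
            }
            return new byte[minSize];
        }

        void release(byte[] buffer) {
            if (!recycle || buffer == null) {
                return; // recycling disabled: let the GC reclaim the buffer
            }
            if (cached == null || buffer.length > cached.length) {
                if (clearOnRelease) {
                    // O(n) cost paid each time a buffer is retained
                    Arrays.fill(buffer, (byte) 0);
                }
                cached = buffer;
            }
        }
    }

    /** Returns the first byte seen by the second user of a buffer. */
    static byte firstByteAfterReuse(SimpleRecycler recycler) {
        byte[] first = recycler.allocate(16);
        first[0] = 42; // pretend this is sensitive data
        recycler.release(first);
        return recycler.allocate(16)[0];
    }

    public static void main(String[] args) {
        System.out.println(firstByteAfterReuse(new SimpleRecycler(true, false)));  // 42
        System.out.println(firstByteAfterReuse(new SimpleRecycler(true, true)));   // 0
        System.out.println(firstByteAfterReuse(new SimpleRecycler(false, false))); // 0
    }
}
```

Stale content is only observable in the default configuration; clearing trades an `Arrays.fill` per retained buffer for that guarantee, while disabling recycling trades it for a fresh allocation per use.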

Marcono1234 · 2025-12-22

> doing this will cancel out much of the performance benefit of buffer reuse

Fair point. But unlike GHSA-cmp6-m4wj-q63q, if there were a similar issue here in ning/compress, then it would always be an issue, because buffers are reused automatically.

That said, LZF does not seem prone to the same offset == 0 bug, because in LZF the encoded offset always has 1 added to it, see https://github.com/ning/compress/wiki/LZFFormat. So the risk here would rather be some other implementation bug or overflow.
Though at least during fuzzing I did not find any issues.
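
To make the "offset is always + 1" point concrete, here is a simplified sketch of decoding an LZF back-reference, based on my reading of the format description linked above. The class and method names are hypothetical and this is not the decoder code from this repository.

```java
public class LzfBackRefSketch {
    /**
     * Decodes the distance of a back-reference from its control byte and the
     * low offset byte. The 13-bit stored offset is incremented by 1, so the
     * result is always in [1, 8192]; a distance of 0 cannot be encoded.
     */
    static int backRefDistance(int ctrl, int offsetLowByte) {
        int storedOffset = ((ctrl & 0x1F) << 8) | (offsetLowByte & 0xFF);
        return storedOffset + 1;
    }

    /**
     * Copies len bytes starting distance bytes before outPos.
     * A distance >= 1 never reads the not-yet-written byte at outPos, but it
     * can still point before the start of the output, so that is checked.
     */
    static int copyBackRef(byte[] out, int outPos, int distance, int len) {
        int src = outPos - distance;
        if (distance < 1 || src < 0 || len < 0 || len > out.length - outPos) {
            throw new IllegalArgumentException("Back-reference out of bounds");
        }
        // Byte-by-byte on purpose: source and destination may overlap
        for (int i = 0; i < len; i++) {
            out[outPos + i] = out[src + i];
        }
        return outPos + len;
    }

    public static void main(String[] args) {
        System.out.println(backRefDistance(0x20, 0x00)); // smallest distance: 1
        System.out.println(backRefDistance(0x3F, 0xFF)); // largest distance: 8192
    }
}
```

With a minimum distance of 1, every copied byte comes from a position the decoder has already written in this run, provided the start-of-buffer check holds.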

src/main/java/com/ning/compress/lzf/impl/UnsafeChunkEncoderBE.java


cowtowncoder · 2025-12-21T03:11:41Z

Just realized I messed up the beautifully orchestrated individual commits. Sorry. Will still merge as separate commits.

cowtowncoder · 2025-12-21T03:25:16Z

OK, this is merged with the changes I discussed, and I'd be ready for the 1.2.0 release.

But before doing that, I wanted to check whether anyone thinks further work is needed first (e.g. to address concerns about buffer recycling).

@Marcono1234 @yawkat WDYT?

Marcono1234 · 2025-12-22T21:56:04Z

> But before doing that, I wanted to check whether anyone thinks further work is needed first (e.g. to address concerns about buffer recycling).

I am not sure, but I think not. It seems most or all cases where BufferRecycler is used already allow the user to provide their own BufferRecycler so they could do new BufferRecycler(). Maybe it could still be improved somehow in the future, but for now it might suffice?

Also a small note regarding 9265ebb, specifically this check:

compress/src/main/java/com/ning/compress/lzf/ChunkDecoder.java

Lines 110 to 113 in c9acc13

    
        // Fail if more input than expected was consumed, respectively if `inLength` does not include full block
        if (inPtr > endMinusOne + 1) {
            throw new LZFException("Corrupt input data, block #" + blockNr + " is incomplete");
        }

That check runs only after too much data has already been consumed, but I think it is safe nonetheless: there are already bounds checks for the Unsafe access (so Unsafe out-of-bounds access is not a concern), and an exception is thrown here, so even if targetBuffer contains data which should not have been decompressed because it was out of bounds, it is not inspected by the user anyway.

The only (?) risk would be some side-channel attack, where a malicious user deduces information from the different exceptions which can occur before this check is reached. Or that, despite the exception being thrown here, user code still somehow uses the content of targetBuffer.
But I think the chances of either are rather low.
And preventing that would probably require refactoring the chunk decoder implementations and adding bounds checks (the code currently contains few to no bounds checks, and only fails implicitly due to out-of-bounds array access).
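
As a rough idea of what such explicit checks might look like, the sketch below validates a literal-run copy against the caller-specified end indices before any data is touched, instead of detecting the overrun afterwards. `copyLiteralRun` and `CorruptInputException` are hypothetical stand-ins, not code or types from this repository.

```java
public class BoundsCheckSketch {
    /** Hypothetical stand-in for the library's exception type (unchecked here for brevity). */
    static final class CorruptInputException extends RuntimeException {
        CorruptInputException(String msg) {
            super(msg);
        }
    }

    /**
     * Copies a literal run of len bytes, validating both ranges up front.
     * Assumes 0 <= inPtr <= inEnd <= in.length and 0 <= outPtr <= outEnd <= out.length.
     * Returns the new output position.
     */
    static int copyLiteralRun(byte[] in, int inPtr, int inEnd,
                              byte[] out, int outPtr, int outEnd, int len) {
        // Compare against the remaining space rather than computing inPtr + len,
        // which could overflow for hostile values of len
        if (len < 0 || len > inEnd - inPtr) {
            throw new CorruptInputException("Literal run exceeds input end index");
        }
        if (len > outEnd - outPtr) {
            throw new CorruptInputException("Literal run exceeds output end index");
        }
        System.arraycopy(in, inPtr, out, outPtr, len);
        return outPtr + len;
    }

    public static void main(String[] args) {
        // Bytes 99, 99 model stale data from a previous use after end index 3
        byte[] in = {1, 2, 3, 99, 99};
        byte[] out = new byte[8];
        System.out.println(copyLiteralRun(in, 0, 3, out, 0, out.length, 3));
        try {
            copyLiteralRun(in, 0, 3, out, 0, out.length, 4);
        } catch (CorruptInputException e) {
            System.out.println("rejected: " + e.getMessage());
        }
    }
}
```

Because the check happens before the copy, the stale bytes beyond the end index never reach the output buffer, and the failure is the same exception regardless of what lies past the end index.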

cowtowncoder · 2025-12-22T23:18:36Z

Ok. For now I consider this potentially releasable; but will wait a day or two just in case.
(not that it's a big deal; releasing is easy and versions are free).

Marcono1234 added 4 commits December 3, 2025 17:34

Fix out-of-bounds access in decoder

924c9f9

Fix out-of-bounds access in encoder

efe4a2d

Add fuzz tests

ae182d0

Fix decoder reading past specified end index

9265ebb

Can be a problem when reusing buffers and the buffer contains data from previous usage after the end index.

cowtowncoder reviewed Dec 19, 2025

src/main/java/com/ning/compress/lzf/ChunkDecoder.java (resolved)

cowtowncoder reviewed Dec 19, 2025

src/main/java/com/ning/compress/lzf/impl/UnsafeChunkEncoderBE.java (outdated, resolved)

cowtowncoder added 5 commits December 20, 2025 18:47

Merge branch 'master' into bounds-checks

96aee57

Replace 2 asserts with proper Exception

b369e06

Convert remaining asserts

9328a47

Remove defensive clearing of recycled buffers (can consider as future improvement)

2521a67

Add release notes

51762e0

cowtowncoder added this to the 1.2.0 milestone Dec 21, 2025

cowtowncoder merged commit 06bdc37 into ning:master Dec 21, 2025
5 checks passed

Marcono1234 deleted the bounds-checks branch December 22, 2025 22:33
