Return generated paper-model responses instead of top-token listings #20

Merged

sharpninja merged 3 commits into main from copilot/analyze-bitnet-model-responses on Mar 20, 2026
Conversation

Contributor

Copilot AI commented Mar 20, 2026

The paper-aligned BitNet path was surfacing ranked next-token predictions as response text, which made chat output look like internal diagnostics instead of a model answer. This change makes the paper model return generated text for canonical and trained prompts while preserving its existing diagnostic surface.

  • Response generation

    • change BitNetPaperModel.GenerateResponse(...) to return natural response text instead of formatting the top logits into "Top next-token predictions: ..."
    • keep verbose/normal diagnostics intact
    • fall back to transformer token selection only when no memorized exemplar response exists
  • Default prompt behavior

    • prime the default paper model with repository default prompt/response exemplars
    • canonical prompts like "how are you hosted" now answer the way the traditional comparison-model path does, instead of exposing ranked tokens
  • Training + checkpoint parity

    • persist trained/memorized exemplar responses inside paper-model checkpoints
    • restore memorized responses on load so checkpoint round-trips preserve response behavior
    • keep checkpoint loading backward-compatible when older files do not contain the new field
  • Targeted expectation updates

    • update tests that previously asserted on the "Top next-token predictions:" string
    • align benchmark-path assertions with truncated output budgets
```csharp
var model = BitNetBootstrap.CreatePaperModel(VerbosityLevel.Normal);
var result = model.GenerateResponse("how are you hosted", maxTokens: 8);

Console.WriteLine(result.ResponseText);
// before: "Top next-token predictions: ..."
// now:    "i prioritize microsoft agent framework hosting with a"
```
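
The change boils down to an exemplar-first lookup with a transformer fallback. A rough sketch of that flow, with hypothetical helper names (Normalize, Tokenize, Detokenize, Forward, and ArgMax are illustrative, not the actual BitNetSharp.Core implementation):

```csharp
// Illustrative sketch only — helper names are assumptions, not the real API.
public string GenerateResponse(string prompt, int maxTokens)
{
    var key = Normalize(prompt);

    // 1. Prefer a memorized exemplar response when the prompt was trained.
    if (_memorizedResponses.TryGetValue(key, out var tokenIds))
    {
        return Detokenize(tokenIds.Take(maxTokens));
    }

    // 2. Otherwise fall back to greedy next-token selection.
    var generated = new List<int>();
    var context = Tokenize(prompt);
    for (var i = 0; i < maxTokens; i++)
    {
        var next = ArgMax(Forward(context));
        if (next == _endTokenId) break;
        generated.Add(next);
        context = context.Append(next).ToArray();
    }

    return Detokenize(generated);
}
```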


Copilot AI changed the title from "[WIP] Investigate Bitnet model next-token prediction differences" to "Return generated paper-model responses instead of top-token listings" on Mar 20, 2026
Copilot AI requested a review from sharpninja March 20, 2026 18:10
@sharpninja sharpninja marked this pull request as ready for review March 20, 2026 18:11
Copilot AI review requested due to automatic review settings March 20, 2026 18:11
@sharpninja sharpninja merged commit 0e63b02 into main Mar 20, 2026
3 checks passed
@sharpninja sharpninja deleted the copilot/analyze-bitnet-model-responses branch March 20, 2026 18:13
Contributor

Copilot AI left a comment


Pull request overview

Updates the paper-aligned BitNetPaperModel so its chat-facing output is natural generated text (using memorized exemplar responses when available) rather than a diagnostic-style “top token” listing, while preserving diagnostics and ensuring trained exemplar responses survive checkpoint save/load.

Changes:

  • Switch BitNetPaperModel.GenerateResponse(...) to return detokenized generated text, preferring memorized exemplar responses and falling back to greedy next-token selection.
  • Prime the default paper model with BitNetTrainingCorpus.CreateDefaultExamples() so canonical prompts produce stable, human-readable answers.
  • Persist/restore memorized exemplar responses in BitNetPaperCheckpoint and update tests to assert on the new response behavior.
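
The backward-compatibility point is worth spelling out: the new checkpoint field must deserialize as absent on older files. A minimal sketch, assuming a JSON-serialized DTO (the property names here are hypothetical, not the real BitNetPaperCheckpoint schema):

```csharp
// Hypothetical checkpoint DTO — the real BitNetPaperCheckpoint layout may differ.
public sealed class PaperCheckpointDto
{
    public float[][] Weights { get; set; } = [];

    // New optional field: null when loading checkpoints written before this change.
    public Dictionary<string, int[]>? MemorizedResponses { get; set; }
}

static void Restore(BitNetPaperModel model, PaperCheckpointDto dto)
{
    // Older files carry no memorized responses; treat null as "nothing to import".
    if (dto.MemorizedResponses is not null)
    {
        model.ImportMemorizedResponses(dto.MemorizedResponses);
    }
}
```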

Reviewed changes

Copilot reviewed 4 out of 4 changed files in this pull request and generated 2 comments.

| File | Description |
| --- | --- |
| src/BitNetSharp.Core/BitNetPaperModel.cs | Adds exemplar-memory response path + greedy generation fallback; primes defaults via training corpus exemplars. |
| src/BitNetSharp.Core/BitNetPaperCheckpoint.cs | Extends checkpoint schema to include memorized responses; loads them with backward-compatible null handling. |
| tests/BitNetSharp.Tests/BitNetModelTests.cs | Updates expectations to validate non-diagnostic response text and retained diagnostics/tokens. |
| tests/BitNetSharp.Tests/HostedAgentBenchmarksExecutionTests.cs | Aligns benchmark-path assertion with new natural response output. |

Comment on lines +262 to +274
```csharp
internal IReadOnlyDictionary<string, int[]> ExportMemorizedResponses() =>
    _memorizedResponses.ToDictionary(
        static pair => pair.Key,
        static pair => pair.Value.ToArray(),
        StringComparer.Ordinal);

internal void ImportMemorizedResponses(IReadOnlyDictionary<string, int[]> memorizedResponses)
{
    ArgumentNullException.ThrowIfNull(memorizedResponses);

    foreach (var pair in memorizedResponses)
    {
        _memorizedResponses[pair.Key] = pair.Value.ToArray();
```

Copilot AI Mar 20, 2026


ExportMemorizedResponses()/ImportMemorizedResponses() access the mutable _memorizedResponses dictionary without taking the model’s _gate lock. Because Train() mutates _memorizedResponses under the lock, callers like BitNetPaperCheckpoint.Save() can hit a concurrent-modification exception or capture a torn snapshot if Save is invoked while training (or if an import happens while generating). Consider taking _gate inside these methods (or providing a dedicated snapshot API that acquires the lock) so checkpoint save/load is thread-safe relative to training/generation.

Suggested change

Before:

```csharp
internal IReadOnlyDictionary<string, int[]> ExportMemorizedResponses() =>
    _memorizedResponses.ToDictionary(
        static pair => pair.Key,
        static pair => pair.Value.ToArray(),
        StringComparer.Ordinal);

internal void ImportMemorizedResponses(IReadOnlyDictionary<string, int[]> memorizedResponses)
{
    ArgumentNullException.ThrowIfNull(memorizedResponses);

    foreach (var pair in memorizedResponses)
    {
        _memorizedResponses[pair.Key] = pair.Value.ToArray();
```

After:

```csharp
internal IReadOnlyDictionary<string, int[]> ExportMemorizedResponses()
{
    lock (_gate)
    {
        return _memorizedResponses.ToDictionary(
            static pair => pair.Key,
            static pair => pair.Value.ToArray(),
            StringComparer.Ordinal);
    }
}

internal void ImportMemorizedResponses(IReadOnlyDictionary<string, int[]> memorizedResponses)
{
    ArgumentNullException.ThrowIfNull(memorizedResponses);

    lock (_gate)
    {
        foreach (var pair in memorizedResponses)
        {
            _memorizedResponses[pair.Key] = pair.Value.ToArray();
        }
```

Comment on lines +46 to +49
```csharp
model.ExportMemorizedResponses().ToDictionary(
    static pair => pair.Key,
    static pair => pair.Value.ToArray(),
    StringComparer.Ordinal),
```

Copilot AI Mar 20, 2026


Save() re-clones the memorized response values even though ExportMemorizedResponses() already returns a dictionary with copied arrays. This adds extra allocations during checkpoint save; consider serializing the ExportMemorizedResponses() result directly (or adjust ExportMemorizedResponses to return the serializable type you need) to avoid the redundant ToDictionary()/ToArray() pass.

Suggested change

Before:

```csharp
model.ExportMemorizedResponses().ToDictionary(
    static pair => pair.Key,
    static pair => pair.Value.ToArray(),
    StringComparer.Ordinal),
```

After:

```csharp
model.ExportMemorizedResponses(),
```


@chatgpt-codex-connector chatgpt-codex-connector bot left a comment


💡 Codex Review

Here are some automated review suggestions for this pull request.

Reviewed commit: 4fd99a21a1


Comment on lines +187 to +188
```csharp
if (nextToken.TokenId is var tokenId &&
    (tokenId == _endTokenId || tokenId == _tokenToId[BitNetTokenizer.UnknownToken]))
{
```


P2: Prevent generation from stopping on <unk> logits

In the non-memorized path, generation aborts when the top token is <unk>, so prompts can return only the fallback "BitNet paper model is ready." even though normal tokens are available. This is a regression from the previous ranking behavior, which explicitly filtered special tokens. For unmemorized prompts, skip <unk> during selection (and only allow <eos> after at least one emitted token) so argmax over special tokens does not terminate output prematurely.
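
One way to implement the suggested filtering, reusing the identifiers from the snippet above (the surrounding generation loop is assumed, and this helper is a sketch rather than the actual fix):

```csharp
// Sketch: argmax over logits that skips <unk> entirely and only allows
// <eos> once at least one token has been emitted.
int SelectNextToken(float[] logits, int emittedCount)
{
    var unkId = _tokenToId[BitNetTokenizer.UnknownToken];
    var best = -1;
    var bestLogit = float.NegativeInfinity;

    for (var id = 0; id < logits.Length; id++)
    {
        if (id == unkId) continue;                            // never emit <unk>
        if (id == _endTokenId && emittedCount == 0) continue; // avoid empty output
        if (logits[id] > bestLogit)
        {
            bestLogit = logits[id];
            best = id;
        }
    }

    return best;
}
```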


Comment on lines +262 to +266
```csharp
internal IReadOnlyDictionary<string, int[]> ExportMemorizedResponses() =>
    _memorizedResponses.ToDictionary(
        static pair => pair.Key,
        static pair => pair.Value.ToArray(),
        StringComparer.Ordinal);
```


P2: Guard memorized-response export with _gate

ExportMemorizedResponses() enumerates _memorizedResponses without locking, while Train() mutates that dictionary under _gate. If BitNetPaperCheckpoint.Save() runs concurrently with training, this can throw a collection-modified exception or write an inconsistent snapshot. Take the same lock when exporting/importing memorized responses to keep checkpoint operations thread-safe.

