getTextDiff by DoneDeal0 · Pull Request #34 · DoneDeal0/superdiff

DoneDeal0 · 2026-01-01T10:46:01Z

🚀 NEW FEATURE: `getTextDiff`

import { getTextDiff } from "@donedeal0/superdiff";

Compares two texts and returns a structured diff at a character, word, or sentence level.

FORMAT

Input

  previousText: string | null | undefined,
  currentText: string | null | undefined,
  options?: {
    separation?: "character" | "word" | "sentence", // "word" by default
    accuracy?: "normal" | "high", // "normal" by default
    detectMoves?: boolean // false by default
    ignoreCase?: boolean, // false by default
    ignorePunctuation?: boolean, // false by default
    locale?: Intl.Locale | string // undefined by default
  }

previousText: the original text.
currentText: the current text.
options
- separation whether you want a character, word or sentence based diff.
- accuracy:
  - normal (default): fastest mode, simple tokenization.
  - high: slower but exact tokenization. Handles all language subtleties (Unicode, emoji, CJK scripts, locale‑aware segmentation when a locale is provided).
- detectMoves:
  - false (default): optimized for readability. Token moves are ignored so insertions don’t cascade and break equality (recommended for UI diffing).
  - true: semantically precise, but slower — a single insertion shifts all following tokens, breaking equality.
- ignoreCase: if true, hello and HELLO are considered equal.
- ignorePunctuation: if true, hello! and hello are considered equal.
- locale: the locale of your text. Enables locale‑aware segmentation in high accuracy mode.

Output

type TextDiff = {
  type: "text";
  status: "added" | "deleted" | "equal" | "updated";
  diff: {
    value: string;
    index: number | null;
    previousValue?: string;
    previousIndex: number | null;
    status: "added" | "deleted" | "equal" | "moved" | "updated";
  }[];
};

USAGE

WITHOUT MOVES DETECTION

This is the default output. Token moves are ignored so insertions don’t cascade and break equality. Updates are rendered as two entries (added + deleted). The algorithm uses longest common subsequence (LCS), similar to GitHub diffs.

Input

getTextDiff(
- "The brown fox jumped high",
+ "The orange cat has jumped",
{ detectMoves: false, separation: "word" }
);

Output

{
      type: "text",
+     status: "updated",
      diff: [
        {
          value: 'The',
          index: 0,
          previousIndex: 0,
          status: 'equal',
        },
-       {
-         value: "brown",
-         index: null,
-         previousIndex: 1,
-         status: "deleted",
-       },
-       {
-         value: "fox",
-         index: null,
-         previousIndex: 2,
-         status: "deleted",
-       },
+       {
+         value: "orange",
+         index: 1,
+         previousIndex: null,
+         status: "added",
+       },
+       {
+         value: "cat",
+         index: 2,
+         previousIndex: null,
+         status: "added",
+       },
+       {
+         value: "has",
+         index: 3,
+         previousIndex: null,
+         status: "added",
+       },
        {
          value: "jumped",
          index: 4,
          previousIndex: 3,
          status: "equal",
        },
-       {
-         value: "high",
-         index: null,
-         previousIndex: 4,
-         status: "deleted",
-       }
      ],
    }

WITH MOVE DETECTION

If you prefer a semantically precise diff, activate the detectMoves option. Direct token swaps are considered updated.

Input

getTextDiff(
- "The brown fox jumped high",
+ "The orange cat has jumped",
{ detectMoves: true, separation: "word" }
);

Output

{
      type: "text",
+     status: "updated",
      diff: [
        {
          value: 'The',
          index: 0,
          previousIndex: 0,
          status: 'equal',
        },
+       {
+         value: "orange",
+         index: 1,
+         previousValue: "brown",
+         previousIndex: null,
+         status: "updated",
+       },
+       {
+         value: "cat",
+         index: 2,
+         previousValue: "fox",
+         previousIndex: null,
+         status: "updated",
+       },
+       {
+         value: "has",
+         index: 3,
+         previousIndex: null,
+         status: "added",
+       },
+       {
+         value: "jumped",
+         index: 4,
+         previousIndex: 3,
+         status: "moved",
+       },
-       {
-         value: "high",
-         index: null,
-         previousIndex: 4,
-         status: "deleted",
-       }
      ],
    }

📊 BENCHMARK

Scenario	Superdiff	diff
10k words	1.13 ms	3.68 ms
100k words	21.68 ms	45.93 ms
10k sentences	2.30 ms	5.61 ms
100k sentences	21.95 ms	62.03 ms

_{(Superdiff uses its normal accuracy settings to match diff's behavior)}

DoneDeal0 force-pushed the text-diff branch 10 times, most recently from ebfc55a to 6cf30a4 Compare January 7, 2026 20:35

DoneDeal0 force-pushed the text-diff branch 3 times, most recently from 943efdf to aee6be8 Compare January 11, 2026 14:48

DoneDeal0 self-assigned this Jan 11, 2026

DoneDeal0 force-pushed the text-diff branch from aee6be8 to c7fd62f Compare January 11, 2026 14:48

DoneDeal0 force-pushed the main branch 10 times, most recently from 42b0ec3 to 8d21774 Compare January 12, 2026 19:59

DoneDeal0 force-pushed the text-diff branch from 6dbd607 to b4ec894 Compare January 17, 2026 20:20

DoneDeal0 added 4 commits January 20, 2026 20:09

feat: tokenization

e3e9045

chore: objects benchmarks

6e84d9e

chore: boost getlistdiff perf

06ca398

chore: rebase

5b843ce

DoneDeal0 added 2 commits January 20, 2026 20:10

chore: improve perf

91075ca

fix: text api model

140c686

DoneDeal0 force-pushed the text-diff branch from b4ec894 to 140c686 Compare January 20, 2026 19:54

fix: tests

3eee209

DoneDeal0 changed the title ~~[DRAFT] getTextDiff~~ getTextDiff Jan 21, 2026

DoneDeal0 force-pushed the text-diff branch from c1cd841 to de8bd13 Compare January 21, 2026 20:21

chore: update benchmarks

65b49ff

DoneDeal0 force-pushed the text-diff branch from de8bd13 to 65b49ff Compare January 21, 2026 20:22

chore:m update readme

89a0b6d

DoneDeal0 force-pushed the text-diff branch 2 times, most recently from 6ef2035 to 3d054eb Compare January 26, 2026 19:44

chore:m update readme

76bf87b

DoneDeal0 force-pushed the text-diff branch from 3d054eb to 76bf87b Compare January 26, 2026 19:48

DoneDeal0 and others added 3 commits January 26, 2026 21:39

chore: refine readme

69656e1

chore: add gif

bf1b1cf

fix: edgecase characters punctuation

0c0691b

DoneDeal0 force-pushed the text-diff branch 7 times, most recently from 8a4fa0b to d5372fd Compare February 2, 2026 19:54

chore: add tests

63ff1ce

DoneDeal0 force-pushed the text-diff branch from d5372fd to 63ff1ce Compare February 4, 2026 19:49

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

getTextDiff#34

getTextDiff#34
DoneDeal0 wants to merge 14 commits intomainfrom
text-diff

DoneDeal0 commented Jan 1, 2026 •

edited

Loading

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

1 participant

Uh oh!

Conversation

DoneDeal0 commented Jan 1, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

🚀 NEW FEATURE: getTextDiff

FORMAT

USAGE

📊 BENCHMARK

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

1 participant

DoneDeal0 commented Jan 1, 2026 •

edited

Loading

🚀 NEW FEATURE: `getTextDiff`