Add Qidian font mapping tools and examples #1

Open
thucpru wants to merge 1 commit into master from claude/fix-fontkit-mapping-011CV3N87DNbLTqDsXStsnJi
thucpru (Owner) commented Nov 12, 2025

Add a toolkit for decoding font obfuscation on websites such as Qidian.com, which serve custom fonts with scrambled character-to-glyph mappings to deter web scraping.

Features:

  • Core library (qidian-font-mapper.js) for font analysis and decoding
  • Practical web scraping example (qidian-scraper-example.js) with CLI
  • Browser-based UI (qidian-browser-decoder.html) for interactive testing
  • Comprehensive documentation (README-QIDIAN-FONT-MAPPER.md) with guides

The toolkit uses fontkit to:

  • Parse custom font files (WOFF, WOFF2, TTF, OTF)
  • Analyze character-to-glyph mappings (cmap tables)
  • Compare glyph shapes to build decoding tables
  • Decode obfuscated text back to original characters
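The last two steps can be sketched without any font parsing. The example below is a minimal illustration, not the code in qidian-font-mapper.js: `buildDecodeTable` and `decodeText` are hypothetical names, and each glyph is assumed to be already reduced to an outline signature string (for instance an SVG path string obtained from the parsed font). It also assumes the obfuscated font reuses outlines from the reference font unchanged, so exact string matching is enough; real fonts may need a fuzzy comparison instead.

```javascript
// Hypothetical helper: match glyph outline signatures between two fonts.
// obfuscated: Map<codePoint, signature> taken from the site's custom font
// reference:  Map<codePoint, signature> taken from a known-good font
function buildDecodeTable(obfuscated, reference) {
  // Index the reference font by signature; the first code point wins on ties.
  const bySignature = new Map();
  for (const [codePoint, signature] of reference) {
    if (!bySignature.has(signature)) bySignature.set(signature, codePoint);
  }

  // Map each obfuscated code point to the real code point with the same outline.
  const table = new Map();
  for (const [codePoint, signature] of obfuscated) {
    const real = bySignature.get(signature);
    if (real !== undefined) table.set(codePoint, real);
  }
  return table;
}

// Replace every mapped character; characters without an entry pass through.
function decodeText(text, table) {
  let out = '';
  for (const ch of text) {
    const cp = ch.codePointAt(0);
    out += String.fromCodePoint(table.has(cp) ? table.get(cp) : cp);
  }
  return out;
}
```

Code points that have no matching outline are left untouched, which makes unmapped characters easy to spot for manual verification.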

Includes multiple approaches:

  1. Reference font comparison (accurate but slower)
  2. Heuristic analysis (faster but may need manual verification)
  3. Pattern detection for sequential mappings
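The description does not say how sequential mappings are detected. One possible reading, sketched below under that assumption, is a constant offset between obfuscated and real code points: if a handful of verified pairs all share the same difference, the rest of the range can be decoded by arithmetic. `detectConstantOffset` is a hypothetical name used only for this illustration.

```javascript
// Hypothetical helper: check whether verified [obfuscated, real] code point
// pairs all differ by the same amount. Returns that offset, or null if the
// pairs are empty or do not share a single offset.
function detectConstantOffset(pairs) {
  if (pairs.length === 0) return null;
  const offset = pairs[0][1] - pairs[0][0];
  return pairs.every(([from, to]) => to - from === offset) ? offset : null;
}

// Apply a detected offset to a single obfuscated code point.
function applyOffset(codePoint, offset) {
  return codePoint + offset;
}
```

A `null` result means the mapping is not a simple shift, in which case one of the other two approaches is needed.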

Use cases:

  • Research on font encoding techniques
  • Educational purposes for understanding anti-scraping
  • Personal content access for legitimate subscribers

All tools include proper documentation and ethical usage guidelines.
