fix: extend mime types #243

Open
nikitachapovskii-dev wants to merge 2 commits into master from
fix/extend-mime-types

@nikitachapovskii-dev
Contributor

In this PR I added application/gzip and application/x-gzip to additionalMimeTypes and adjusted the sitemap parsing strategy: gzip and plain-text responses (and sitemap URLs ending in .gz or .txt) are now passed to parseSitemap as URL sources, so Crawlee's built-in handling for those formats applies. Regular XML responses still go through the raw path. This restores compatibility with real-world sitemap endpoints such as Product Hunt and removes the current runtime failures. Tests are added to cover the source-selection behavior.
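
For readers skimming the thread, the source-selection rule described above can be sketched roughly as follows. This is a minimal illustration, not the code from the diff: the function name selectSitemapSource, the local SitemapSource type, and the exact MIME set (including text/plain for the plain-text case) are assumptions made for the example.

```typescript
// Hypothetical local type modelling the two kinds of source the PR describes:
// a URL the sitemap parser fetches itself, or raw content already downloaded.
type SitemapSource =
    | { type: 'url'; url: string }
    | { type: 'raw'; content: string };

// Assumed set of content types that should be re-fetched by URL.
const URL_SOURCE_MIME_TYPES = new Set([
    'application/gzip',
    'application/x-gzip',
    'text/plain',
]);

// Hypothetical helper: decide whether a response goes down the URL path
// (gzip / plain text, or a .gz / .txt URL) or the raw path (regular XML).
function selectSitemapSource(
    url: string,
    contentType: string | undefined,
    body: string,
): SitemapSource {
    // Drop parameters such as "; charset=utf-8" before comparing.
    const mime = ((contentType ?? '').split(';')[0] ?? '').trim().toLowerCase();

    // Look only at the path so query strings do not hide the extension.
    let pathname: string;
    try {
        pathname = new URL(url).pathname.toLowerCase();
    } catch {
        pathname = url.toLowerCase();
    }
    const hasUrlSourceExtension = pathname.endsWith('.gz') || pathname.endsWith('.txt');

    if (URL_SOURCE_MIME_TYPES.has(mime) || hasUrlSourceExtension) {
        return { type: 'url', url };
    }
    return { type: 'raw', content: body };
}
```

Under these assumptions, a response for https://example.com/sitemap.xml.gz would be handed over as a URL source regardless of its Content-Type header, while an application/xml response for a .xml URL keeps its already-downloaded body as raw content.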

Closes #241

@nikitachapovskii-dev changed the title from "Fix/extend mime types" to "fix: extend mime types" on Feb 26, 2026
@nikitachapovskii-dev requested review from ruocco-l and removed request for ruocco-l on February 26, 2026, 15:09
Collaborator

@ruocco-l left a comment

🚀

Development

Successfully merging this pull request may close these issues.

SitemapEx: Handle gzipped sitemaps

2 participants