Releases: openzim/zimit
Releases · openzim/zimit
3.1.2
3.1.1
3.1.0
Added
- Added
--overwriteflag to overwrite existing ZIM file if it exists (#399)
Changed
- Fix issues preventing interrupted crawls from being resumed. (#499)
- Ensure build directory is used explicitly instead of a randomized subdirectory when passed, and pre-create it if it does not exist.
- Use all warc_dirs found instead of just the latest so interrupted crawls use all collected pages across runs when an explicit collections directory is not passed.
- Don't cleanup an explicitly passed build directory.
- Upgrade to browsertrix crawler 1.11.1 and Python 3.14 + small other Python dep / Github actions upgrades (#532)
- Add back publishing of arm64 Docker image (#463)
3.0.5
3.0.4
3.0.3
3.0.2
3.0.1
3.0.0
Changed
- Change solution to report partial ZIM to the Zimfarm and other clients (#304)
- Keep temporary folder when crawler or warc2zim fails, even if not asked for (#468)
- Add many missing Browsertrix Crawler arguments ; drop default overrides by zimit ; drop
--noMobileDevicesetting (not needed anymore) (#433) - Document all Browsertrix Crawler default arguments values (#416)
- Use preferred Browsertrix Crawler arguments names: (part of #471)
--seedsinstead of--url--seedFileinstead of--urlFile--pageLimitinstead of--limit--pageLoadTimeoutinstead of--timeout--scopeIncludeRxinstead of--include--scopeExcludeRxinstead of--exclude--pageExtraDelayinstead of--delay
- Remove confusion between zimit, warc2zim and crawler stats filenames (part of #471)
--statsFilenameis now the crawler stats file (since it is the same name, just like other arguments)--zimit-progress-fileis now the zimit stats location--warc2zim-progress-fileis the warc2zim stats location- all are optional values, if not set and needed temporary files are used
Fixed
- Do not create the ZIM when crawl is incomplete (#444)