Release 2.1.0 · openzim/warc2zim

New fuzzy rules for cheatography.com (#342), der-postillon.com (#330), iranwire.com (#363)
Properly rewrite redirect target URL when present in HTML tag (#237)
New --encoding-aliases argument to pass encoding/charset aliases (#331)
Add support for SVG favicon (#148)
Automatically index PDF content and use PDF title (#289 and #290)

Upgrade to python-scraperlib 4.0.0
Generate fuzzy rules tests in Python and JavaScript (#284)
Refactor HTML rewriter class to make it more open to change and expressive (#305)
Detect charset in document header only for HTML documents (#331)
Use software property from warcinfo record to set ZIM Scraper metadata (#357)
Store ContentDate as metadata, based on WARC-Date (#358)
Remove domain specific rules (#328)
Revisit retrieve_illustration logic to prefer best favicons (#352 and #369)
Upgrade dependencies (zimscraperlib 4.0.0, wombat.js 3.7.12 and others) (#376)
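
To illustrate the idea behind the --encoding-aliases argument (#331), here is a minimal Python sketch of how charset aliases could be resolved before decoding content. It is not warc2zim's actual implementation: the helper names `parse_aliases` and `decode_with_aliases` are hypothetical, and the comma-separated `alias=encoding` value format is an assumption to check against the project's README.

```python
import codecs


def parse_aliases(spec: str) -> dict[str, str]:
    """Parse a string like 'alias1=encoding1,alias2=encoding2' (assumed format)."""
    aliases: dict[str, str] = {}
    for pair in spec.split(","):
        pair = pair.strip()
        if not pair:
            continue  # tolerate empty entries such as a trailing comma
        alias, sep, encoding = pair.partition("=")
        if not sep or not alias.strip() or not encoding.strip():
            raise ValueError(f"Invalid alias entry: {pair!r}")
        aliases[alias.strip().lower()] = encoding.strip().lower()
    return aliases


def decode_with_aliases(content: bytes, declared: str, aliases: dict[str, str]) -> str:
    """Decode content, mapping the declared charset through the alias table first."""
    name = aliases.get(declared.strip().lower(), declared.strip())
    codecs.lookup(name)  # raises LookupError if Python does not know this encoding
    return content.decode(name)


# Hypothetical alias: a server declares "foo-latin", which Python knows as "latin-1".
aliases = parse_aliases("foo-latin=latin-1")
text = decode_with_aliases("café".encode("latin-1"), "FOO-LATIN", aliases)
```

An alias table of this kind helps when an upstream server declares a charset under a name that Python does not recognize, even though an equivalent codec exists under a different name.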
