Open
Description
Example:
cppreference-doc-20250209\reference\en.cppreference.com\w\cpp\header\bit.html:4 which is the page title
- the raw dump has it correctly:
<title>Standard library header <bit> (C++20) - cppreference.com</title>
it's just UTF-8 encoded
- the html in the zip (also in the .tar.xz) has it twice encoded:
html-book-20250209.zip\reference\en\cpp\header\bit.html
<title>Standard library header <bit>Â (C++20) - cppreference.com</title>
adding \u00C2 kruft
I'd be happy to look into it.