charset - utf ...
i have a problem with the german characters.
after crawling my site looks like that:
but should look like that:
but if i look in the "index.html" there is the right utf charset, i think:
where can i find the mistake? what is the solution?
would be great if someone has a hint for me,
thank you and best regards,
- 17 replies
- B@bjoernbg2019-11-26 10:58:39.978Z
Hi, thanks for this great plugin – I'm only just starting to use it but it looks very promising!
With my website being in German I had the same issues as the others here and was able to fix it with changing one line in the PHP:
wp-content/plugins/static-html-output-plugin/plugin/WP2Static/HTMLProcessor.phpreplace this line
$this->xml_doc->loadHTML( mb_convert_encoding( $this->raw_html, 'HTML-ENTITIES', 'UTF-8' ) );
This solved the issue for me. The solution is based on a change made by the author in the dev version, so hopefully the next version will work out of the box!
This was very helpful, thanks. I implemented the same patch to the plugin code and that took care of our character encoding issues.
- In reply tobjoernbg⬆:
Thanks a lot for the fix!
After updating "HTMLProcessor.php" with the proposed fix German Umlauts work nicely.
FYI: MySQL tables have char fields defined with Collation = "utf8mb4_unicode_520_ci"
(WP database tables have been created automatically by the WordPress installer)
- Leon Stafford @leonstafford2018-12-23 15:50:43.869Z
It looks like this issue here:https://github.com/leonstafford/wp2static/issues/29
Could you please add to that issue or on this forum, a sample snippet of code from your WordPress site's source and this static copy's source (ie, View Source from right-click within your browser).
That will help me have some samples to work a solution for.
- Hmac @huji2019-01-16 14:49:38.436Z
hi Leon, I have the same problem, how can I solve this problem, it is very urgent.
- Leon Stafford @leonstafford2019-01-23 03:04:11.308Z
Please try this latest preview release, which adds a fix for UTF-8 encoded pages:https://github.com/leonstafford/wp2static/releases/tag/next_release_preview
Most WordPress themes should have this included, else ensure this is in your HTML code, else the plugin doesn't properly handle the encodings:
<meta charset="utf-8" />
That links is broken? Maybe you meant: https://github.com/leonstafford/wp2static/releases and clicking the "Latest release" green text? results in version 6.3 released a few days ago. @huji
- WWoah @woah2019-02-12 12:57:47.267Z
Anyone already has fixxed this? Just checking last time otherwise will move on :)
- Leon Stafford @leonstafford2019-02-12 22:32:15.325Z
It's likely down to a combination of your database encoding, the encoding used when entering the content and any other conversion steps throughout the process.
It's not something I can quickly troubleshoot with some back and forth over emails/forum, but if you would like to hire me for a few hrs, I can take a look at it for you.
Alternatively, you may try to do the static conversion with HTTrack and some scripts for rewriting URLs, that may have better support for the encoding used.
Thanks for your reply Leon. All clear.. What about the root problem? If we can't run in root i know enough and we will just move on.
- S@staticgen2019-09-24 08:52:45.958Z
I ran into the same problem with german umlauds on a page hosted on strato.de.
I made a test installation for debugging on another hoster uberspace.de, problem gone.
It seems database encoding or interaction between wordpress/plugin/database causes this problem.
This problem is also persistent with the latest version 7 build of the plugin.
If someone from the dev team wants to have a look, please pm me.
I can give you access to the test installations for debugging.
Would be really nice to resolve this.