[Date Prev][Date Next][Thread Prev][Thread Next][Date Index][Thread Index]
Re: [Lynx-dev] Chinese characters are not displayed when using charset g
From: |
Thomas Dickey |
Subject: |
Re: [Lynx-dev] Chinese characters are not displayed when using charset gb2312/euc-cn |
Date: |
Tue, 29 Jun 2021 17:12:20 -0400 |
User-agent: |
Mutt/1.10.1 (2018-07-13) |
On Mon, Jun 21, 2021 at 05:09:53PM +0200, Cédric Hannotier via Lynx-dev wrote:
> Hi all,
>
> I got an HTML email with gb3212 charset.
s/3232/2312/
> It seems that lynx is unable to print some characters.
yes... I see that the problem is that lynx's conversion of euc-cn to utf-8
is incomplete. This is not a new issue as you can see here:
https://lists.nongnu.org/archive/cgi-bin/namazu.cgi?query=gb2312&submit=Search%21&idxname=lynx-dev&max=20&result=normal&sort=date%3Alate
> Changing the declared charset to euc-cn gives the same result.
gb2312 is equated to euc-cn internally.
> Using another browser works (qutebrowser).
> First converting that file to utf-8 (using iconv) also works.
yes, lynx uses iconv after organizing the characters :-)
> Lynx build is from Debian testing:
>
> Lynx Version 2.9.0dev.6 (05 Sep 2020)
> libwww-FM 2.14, SSL-MM 1.4.1, GNUTLS 3.7.0, ncurses 6.2.20201114(wide)
> Built on linux-gnu.
>
> Someone else tested it with both 2.8.9rel.1 debian 3 (from Debian 10)
> and 2.9.0dev.6 debian 2, but none of them worked.
>
> The HTML can be found there: https://ttm.sh/FJP.html
>
> Regards
> --
>
> Cédric Hannotier
>
> _______________________________________________
> Lynx-dev mailing list
> Lynx-dev@nongnu.org
> https://lists.nongnu.org/mailman/listinfo/lynx-dev
--
Thomas E. Dickey <dickey@invisible-island.net>
https://invisible-island.net
ftp://ftp.invisible-island.net
signature.asc
Description: PGP signature