Re: [Lynx-dev] Chinese characters are not displayed when using charset g
From: Thomas Dickey
Subject: Re: [Lynx-dev] Chinese characters are not displayed when using charset gb2312/euc-cn
Date: Wed, 30 Jun 2021 19:43:04 -0400
User-agent: Mutt/1.10.1 (2018-07-13)

On Tue, Jun 29, 2021 at 05:12:20PM -0400, Thomas Dickey wrote:
> On Mon, Jun 21, 2021 at 05:09:53PM +0200, Cédric Hannotier via Lynx-dev wrote:
> > Hi all,
> >
> > I got an HTML email with gb3212 charset.
> s/3212/2312/
>
> > It seems that lynx is unable to print some characters.
>
> yes... I see that the problem is that lynx's conversion of euc-cn to utf-8
> is incomplete. This is not a new issue as you can see here:
>
> https://lists.nongnu.org/archive/cgi-bin/namazu.cgi?query=gb2312&submit=Search%21&idxname=lynx-dev&max=20&result=normal&sort=date%3Alate
>
> > Changing the declared charset to euc-cn gives the same result.
>
> gb2312 is equated to euc-cn internally.
>
> > Using another browser works (qutebrowser).
> > First converting that file to utf-8 (using iconv) also works.
>
> yes, lynx uses iconv after organizing the characters :-)
>
> > Lynx build is from Debian testing:
> >
> > Lynx Version 2.9.0dev.6 (05 Sep 2020)

I had some time today (which I had intended to spend on another feature for
lynx) and implemented this as an experimental feature, which the packager may
adopt in dev.7 once I finish that other feature. See
https://github.com/ThomasDickey/lynx-snapshots/commit/5111b5306b278cecb0b66166eb8338072fc713c6
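
For anyone who wants to try the workaround mentioned in the quoted message
(converting the file to UTF-8 before viewing it), here is a minimal sketch in
Python. It is not taken from lynx's source and does not reflect how the commit
above works; the function name euc_cn_to_utf8 is hypothetical. It only relies
on Python's standard codecs, which, like lynx, treat "euc-cn" as another name
for "gb2312".

```python
import codecs


def euc_cn_to_utf8(data: bytes) -> bytes:
    """Decode bytes declared as gb2312/euc-cn and re-encode them as UTF-8.

    Hypothetical helper for illustration only. Bytes that are not valid
    GB 2312 are replaced with U+FFFD instead of raising an error.
    """
    text = data.decode("gb2312", errors="replace")
    return text.encode("utf-8")


# Python resolves both charset names to the same codec.
same_codec = codecs.lookup("euc-cn").name == codecs.lookup("gb2312").name

if __name__ == "__main__":
    sample = "中文".encode("gb2312")  # two double-byte characters
    print(same_codec)
    print(euc_cn_to_utf8(sample).decode("utf-8"))
```

The converted bytes can be written to a file and opened in lynx with the
charset declared as utf-8.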
--
Thomas E. Dickey <dickey@invisible-island.net>
https://invisible-island.net
ftp://ftp.invisible-island.net