From: Adolf Belka <adolf.belka@ipfire.org>
To: development@lists.ipfire.org
Subject: Re: [PATCH v2 2/2] dns.cgi: Fixes bug#12395 - German umlauts not correctly displayed in remarks
Date: Tue, 12 Mar 2024 13:27:00 +0100
Message-ID: <37fc4478-061b-4273-b8d7-d8e2f6bceac2@ipfire.org>
In-Reply-To: <EDF2038E-FD27-4033-B13E-D176DA598090@ipfire.org>
Hi Michael,
On 12/03/2024 11:02, Michael Tremer wrote:
> Thank you.
>
> I merged this for now so that we can fix this problem quickly.
>
> However I was wondering whether we should consider making the decode statement a part of the “cleanhtml” function.
That makes a lot of sense. It would also mean that the problem with
umlauts etc. would be fixed everywhere that cleanhtml is used, rather than
needing to fix every invocation of cleanhtml individually.
I will look at putting something together for that.
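As a minimal sketch of that idea, the decode step could sit directly in front of the entity encoding inside one helper. The name cleanhtml_sketch is hypothetical and this is not the actual Header::cleanhtml code; it only assumes that the input arrives as undecoded UTF-8 bytes and that the Encode and HTML::Entities modules are available.

```perl
use strict;
use warnings;
use Encode ();
use HTML::Entities ();

# Hypothetical helper for illustration only, not the real Header::cleanhtml.
sub cleanhtml_sketch {
	my ($text) = @_;
	return "" unless defined $text;

	# Turn the UTF-8 bytes received from the browser into Perl characters.
	my $chars = Encode::decode("UTF-8", $text);

	# Replace unsafe and non-ASCII characters with their HTML entities.
	return HTML::Entities::encode_entities($chars);
}

# "M\xC3\xBCnchen" is "München" written as raw UTF-8 bytes.
print cleanhtml_sketch("Freifunk M\xC3\xBCnchen e.V. <b>"), "\n";
```

With the decode in one place, callers such as dns.cgi would not need their own decode line. Input that has already been decoded elsewhere would have to be handled separately.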
>
> I am still unsure why this is happening in the first place. We should be receiving UTF-8 from the browser, and I believe that perl doesn’t natively store things in UTF-8. That is however not a problem, because it should read files the same way it wrote them and so there should not be any difference when we re-read the configuration files. Unless some parts of the code specify any kind of encoding.
We do receive UTF-8 from the browser. The problem seems to be that the
HTML::Entities::encode_entities command does not treat its input as UTF-8
but as ISO-8859-1, so each byte of a multi-byte character is encoded as a
separate entity. I read this the other day when searching on this topic to
understand how to overcome it, but I can no longer find the source.
So the fix is not to encode the text from the browser remark box into UTF-8
but to decode it from UTF-8. Once the text is in the files it is fine.
Of course my reasoning for doing the decoding may or may not be right,
so I am always open to alternative suggestions.
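The difference can be reproduced outside the CGI with a short script. This is only a sketch under the assumption that the remark reaches the script as raw UTF-8 bytes; the variable names are made up for the example.

```perl
use strict;
use warnings;
use Encode qw(decode);
use HTML::Entities qw(encode_entities);

# "München" as raw UTF-8 bytes; the "ü" is the two bytes 0xC3 0xBC.
my $bytes = "M\xC3\xBCnchen";

# Without decoding, each byte is handled as one ISO-8859-1 character,
# so the two bytes of "ü" turn into two unrelated entities.
my $undecoded = encode_entities($bytes);

# After decoding, "ü" is a single character and maps to a single entity.
my $decoded = encode_entities(decode("UTF-8", $bytes));

print "$undecoded\n";  # M&Atilde;&frac14;nchen
print "$decoded\n";    # M&uuml;nchen
```

The first result is what a browser renders as the mangled text, the second is rendered as the intended umlaut.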
Regards,
Adolf.
>
> -Michael
>
>> On 11 Mar 2024, at 12:19, Adolf Belka <adolf.belka(a)ipfire.org> wrote:
>>
>> - If Freifunk München e.V. is entered as a remark it gets converted to
>> Freifunk MÃ¼nchen e.V.
>> - This is because cleanhtml is used on the UTF-8 remark text before saving it to the file
>> and the HTML::Entities::encode_entities command that is run on that remark text does
>> not work with UTF-8 text.
>> - If the UTF-8 text in the remark is decoded before running through the cleanhtml command
>> then the characters with diacritical marks are correctly shown.
>> - Have tested out the fix on a remark with a range of different characters with
>> diacritical marks and all of the ones tested were displayed correctly with the fix while
>> in the original form they were mangled.
>>
>> Fixes: Bug#12395
>> Tested-by: Adolf Belka <adolf.belka(a)ipfire.org>
>> Signed-off-by: Adolf Belka <adolf.belka(a)ipfire.org>
>> ---
>> html/cgi-bin/dns.cgi | 7 +++++++
>> 1 file changed, 7 insertions(+)
>>
>> diff --git a/html/cgi-bin/dns.cgi b/html/cgi-bin/dns.cgi
>> index 0a34d3fd6..eb6f908d5 100644
>> --- a/html/cgi-bin/dns.cgi
>> +++ b/html/cgi-bin/dns.cgi
>> @@ -142,6 +142,13 @@ if (($cgiparams{'SERVERS'} eq $Lang::tr{'save'}) || ($cgiparams{'SERVERS'} eq $L
>> # Go further if there was no error.
>> if ( ! $errormessage) {
>> # Check if a remark has been entered.
>> +
>> + # decode the UTF-8 text so that characters with diacritical marks such as
>> + # umlauts are treated correctly by the following cleanhtml command
>> + $cgiparams{'REMARK'} = decode("UTF-8", $cgiparams{'REMARK'});
>> +
>> + # run the REMARK text through cleanhtml to ensure all unsafe html characters
>> + # are correctly encoded to their html entities
>> $cgiparams{'REMARK'} = &Header::cleanhtml($cgiparams{'REMARK'});
>>
>> my %dns_servers = ();
>> --
>> 2.44.0
>>
>
--
Sent from my laptop
Thread overview: 7+ messages
2024-03-11 12:19 [PATCH v2 1/2] dns.cgi: Revert "dns.cgi: Fixes bug#12395 - German umlauts not correctly displayed in remarks" Adolf Belka
2024-03-11 12:19 ` [PATCH v2 2/2] dns.cgi: Fixes bug#12395 - German umlauts not correctly displayed in remarks Adolf Belka
2024-03-12 10:02 ` Michael Tremer
2024-03-12 12:27 ` Adolf Belka [this message]
2024-03-12 14:56 ` Michael Tremer
2024-03-13 22:05 ` Adolf Belka
2024-03-15 10:53 ` Michael Tremer