public inbox for development@lists.ipfire.org
From: Adolf Belka <adolf.belka@ipfire.org>
To: development@lists.ipfire.org
Subject: Re: [PATCH v2 2/2] dns.cgi: Fixes bug#12395 - German umlauts not correctly displayed in remarks
Date: Tue, 12 Mar 2024 13:27:00 +0100
Message-ID: <37fc4478-061b-4273-b8d7-d8e2f6bceac2@ipfire.org>
In-Reply-To: <EDF2038E-FD27-4033-B13E-D176DA598090@ipfire.org>

Hi Michael,

On 12/03/2024 11:02, Michael Tremer wrote:
> Thank you.
> 
> I merged this for now so that we can fix this problem quickly.
> 
> However I was wondering whether we should consider making the decode statement a part of the “cleanhtml” function.
That makes a lot of sense. It would also mean that the problem of 
umlauts etc would be fixed everywhere that cleanhtml is used rather than 
needing to fix every invocation of cleanhtml.

I will look at putting something together for that.

> 
> I am still unsure why this is happening in the first place. We should be receiving UTF-8 from the browser, and I believe that perl doesn’t natively store things in UTF-8. That is however not a problem, because it should read files the same way it wrote them and so there should not be any difference when we re-read the configuration files. Unless some parts of the code specify any kind of encoding.
We do receive UTF-8 from the browser. The problem seems to be that the
HTML::Entities::encode_entities command does not treat its input as
UTF-8 but as ISO-8859-1. I can no longer find the reference for this
that I came across the other day when I was searching on this topic to
understand how to overcome it.

The fix is not encoding the text from the browser remark box into UTF-8 
but decoding it from UTF-8. Once the text is in the files then it is fine.
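
A minimal sketch of the effect, assuming cleanhtml passes its input to
HTML::Entities::encode_entities with the default settings. The sample
string and the variable names are only for illustration and are not
taken from dns.cgi:

```perl
use strict;
use warnings;
use Encode qw(decode);
use HTML::Entities qw(encode_entities);

# Raw UTF-8 bytes for "München", as an undecoded form parameter would hold them.
my $bytes = "M\xC3\xBCnchen";

# Without decoding, each of the two bytes of the umlaut is treated as a
# separate ISO-8859-1 character and gets its own entity.
my $mangled = encode_entities($bytes);

# Decoding first turns the two bytes into the single character U+00FC.
my $clean = encode_entities(decode("UTF-8", $bytes));

print "$mangled\n";    # M&Atilde;&frac14;nchen
print "$clean\n";      # M&uuml;nchen
```

The first result renders in a browser as the two-character sequence
that was reported in the bug, the second as the intended umlaut.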

Of course my reasoning for doing the decoding may or may not be right, 
so I am always open to alternative suggestions.

Regards,

Adolf.
> 
> -Michael
> 
>> On 11 Mar 2024, at 12:19, Adolf Belka <adolf.belka(a)ipfire.org> wrote:
>>
>> - If Freifunk München e.V. is entered as a remark it gets converted to
>>    Freifunk MÃ¼nchen e.V.
>> - This is because cleanhtml is used on the UTF-8 remark text before saving it to the file
>>    and the HTML::Entities::encode_entities command that is run on that remark text does
>>    not work with UTF-8 text.
>> - If the UTF-8 text in the remark is decoded before running through the cleanhtml command
>>    then the characters with diacritical marks are correctly shown.
>> - Have tested out the fix on a remark with a range of different characters with
>>    diacritical marks and all of the ones tested were displayed correctly with the fix while
>>    in the original form they were mangled.
>>
>> Fixes: Bug#12395
>> Tested-by: Adolf Belka <adolf.belka(a)ipfire.org>
>> Signed-off-by: Adolf Belka <adolf.belka(a)ipfire.org>
>> ---
>> html/cgi-bin/dns.cgi | 7 +++++++
>> 1 file changed, 7 insertions(+)
>>
>> diff --git a/html/cgi-bin/dns.cgi b/html/cgi-bin/dns.cgi
>> index 0a34d3fd6..eb6f908d5 100644
>> --- a/html/cgi-bin/dns.cgi
>> +++ b/html/cgi-bin/dns.cgi
>> @@ -142,6 +142,13 @@ if (($cgiparams{'SERVERS'} eq $Lang::tr{'save'}) || ($cgiparams{'SERVERS'} eq $L
>> # Go further if there was no error.
>> if ( ! $errormessage) {
>> # Check if a remark has been entered.
>> +
>> + # decode the UTF-8 text so that characters with diacritical marks such as
>> + # umlauts are treated correctly by the following cleanhtml command
>> + $cgiparams{'REMARK'} = decode("UTF-8", $cgiparams{'REMARK'});
>> +
>> + # run the REMARK text through cleanhtml to ensure all unsafe html characters
>> + # are correctly encoded to their html entities
>> $cgiparams{'REMARK'} = &Header::cleanhtml($cgiparams{'REMARK'});
>>
>> my %dns_servers = ();
>> -- 
>> 2.44.0
>>
> 

-- 
Sent from my laptop
