From: Michael Tremer <michael.tremer@ipfire.org>
To: development@lists.ipfire.org
Subject: Re: [PATCH 1/5] location-functions.pl: Use a single script-wide db_handle.
Date: Mon, 09 Nov 2020 14:04:45 +0000 [thread overview]
Message-ID: <ED0D76E4-92E3-4785-967E-575BBEFBA291@ipfire.org> (raw)
In-Reply-To: <trinity-288f6ee2-6121-43a1-b303-28876bf25c1a-1604923989908@3c-app-gmx-bap38>
[-- Attachment #1: Type: text/plain, Size: 6002 bytes --]
Hello,
> On 9 Nov 2020, at 12:13, Bernhard Bitsch <Bernhard.Bitsch(a)gmx.de> wrote:
>
> Hello,
>
>> Gesendet: Montag, 09. November 2020 um 12:35 Uhr
>> Von: "Michael Tremer" <michael.tremer(a)ipfire.org>
>> An: "Bernhard Bitsch" <Bernhard.Bitsch(a)gmx.de>
>> Cc: development(a)lists.ipfire.org
>> Betreff: Re: [PATCH 1/5] location-functions.pl: Use a single script-wide db_handle.
>>
>> Hello,
>>
>> Thank you Stefan for submitting this patchset.
>>
>>> On 8 Nov 2020, at 19:36, Bernhard Bitsch <Bernhard.Bitsch(a)gmx.de> wrote:
>>>
>>> This means, we stay with the unbalanced memory allocation in (Perl) libloc. Which leaves a memory leak.
>>
>> Bernhard, could you please elaborate on how this memory leak is still existing?
>>
>> I also do not understand what you mean by unbalanced.
>>
>> As far as I understand the code right now, the database is being opened once and the handle is being stored internally. All functions that are being called will no longer have to hold their own database handle. Therefore the maximum amount of handles open is one.
>>
>
> The init() function allocates the handle, which is not really destroyed. The END block is just a safety process. The allocated memory for the handle should be released in the moment of references==0.
The handle is being initialised when location-functions.pl is being loaded and it is freed when the Perl interpreter ends.
It remains referenced all the time.
> Yes, the new code ( inspired by my work-around ) minimizes the number of handles to one. But this handle is persistent also ( this doesn't hurt because it is used persistent for the time of module use ).
It is only being initialised when location-functions.pl is being loaded. That only happens in code that uses those functions.
I do not see the problem with this. Constantly closing and re-opening the database is not an option since the connection tracking list can have tens of thousands of lookups and this will make the page - simply - slow.
I cannot see an option that is more resource-friendly than this.
>
>>>
>>>> Gesendet: Samstag, 07. November 2020 um 19:47 Uhr
>>>> Von: "Stefan Schantl" <stefan.schantl(a)ipfire.org>
>>>> An: development(a)lists.ipfire.org
>>>> Betreff: [PATCH 1/5] location-functions.pl: Use a single script-wide db_handle.
>>>>
>>>> Create and use a single script-wide database handle for libloc to
>>>> prevent from creating multiple ones.
>>>>
>>>> This helps saving memory, especially on small systems.
>>>>
>>>> Reference #12515.
>>>>
>>>
>>> The error can be produced easily with small memory, but it is present in all systems.
>>> Therefore I've posted this solution as work-around only!
>>
>> Did you test this patchset or did you come to your conclusion by only reading the code?
>>
>> Best,
>> -Michael
>>
>
> The sentence about "small systems" refers to the original comment. It is not a real memory saving, the patch cures a 'memory wasting'. It is clear, that a massively use of malloc without free crashes a small system faster than a system with big memory resources.
Opening the database requires pretty much nothing (maybe 4K) of RSS memory and about 50MB of VIRT memory. This due to mmap() which helps us reading the database a lot fast. I think this is very very reasonable. The more data is being read from the database the more memory it will use.
If you wish to change that (and that comes with a performance penalty) you could submit a patch that removes mmap() if the user wishes to.
But I do not see why this is a problem at all. The whole system won’t run well on systems with 128 or 256MB of memory. The Linux kernel won’t even boot properly on those any more. Memory is cheap. Our time isn’t.
We have found the most resource-saving way to deal with this now and anything else will simply use considerable amount of time with no benefit to 99% of our users.
Best,
-Michael
> The patchset is mainly my work-around, thus it is in test since I found the error.
>
> Best,
> -Bernhard
>
>>>
>>> - Bernhard
>>>
>>>
>>>> Signed-off-by: Stefan Schantl <stefan.schantl(a)ipfire.org>
>>>> ---
>>>> config/cfgroot/location-functions.pl | 11 ++++-------
>>>> 1 file changed, 4 insertions(+), 7 deletions(-)
>>>>
>>>> diff --git a/config/cfgroot/location-functions.pl b/config/cfgroot/location-functions.pl
>>>> index 2cfe7f908..9b1d0bfb5 100644
>>>> --- a/config/cfgroot/location-functions.pl
>>>> +++ b/config/cfgroot/location-functions.pl
>>>> @@ -55,6 +55,9 @@ our $keyfile = "$location_dir/signing-key.pem";
>>>> # Directory which contains the exported databases.
>>>> our $xt_geoip_db_directory = "/usr/share/xt_geoip/";
>>>>
>>>> +# Create libloc database handle.
>>>> +my $db_handle = &init();
>>>> +
>>>> #
>>>> ## Tiny function to init the location database.
>>>> #
>>>> @@ -86,7 +89,7 @@ sub verify ($) {
>>>> ## Function to the the country code of a given address.
>>>> #
>>>> sub lookup_country_code($$) {
>>>> - my ($db_handle, $address) = @_;
>>>> + my ($address) = @_;
>>>>
>>>> # Lookup the given address.
>>>> my $country_code = &Location::lookup_country_code($db_handle, $address);
>>>> @@ -174,9 +177,6 @@ sub get_full_country_name($) {
>>>>
>>>> # Function to get all available locations.
>>>> sub get_locations() {
>>>> - # Create libloc database handle.
>>>> - my $db_handle = &init();
>>>> -
>>>> # Get locations which are stored in the location database.
>>>> my @database_locations = &Location::database_countries($db_handle);
>>>>
>>>> @@ -197,9 +197,6 @@ sub address_has_flags($) {
>>>> # Array to store the flags of the address.
>>>> my @flags;
>>>>
>>>> - # Init libloc database handle.
>>>> - my $db_handle = &init();
>>>> -
>>>> # Loop through the hash of possible network flags.
>>>> foreach my $flag (keys(%network_flags)) {
>>>> # Check if the address has the current flag.
>>>> --
>>>> 2.20.1
next prev parent reply other threads:[~2020-11-09 14:04 UTC|newest]
Thread overview: 13+ messages / expand[flat|nested] mbox.gz Atom feed top
2020-11-07 18:47 Stefan Schantl
2020-11-07 18:47 ` [PATCH 2/5] location-functions.pl: Add END block to release the database handle Stefan Schantl
2020-11-07 18:47 ` [PATCH 3/5] location-functions.pl: Add get_continent_code() function Stefan Schantl
2020-11-07 18:47 ` [PATCH 4/5] locations-functions.pl: Allow get_locations() function to skip special locations Stefan Schantl
2020-11-09 11:49 ` Michael Tremer
2020-11-07 18:47 ` [PATCH 5/5] Adjust CGI files to work with latest location-function.pl changes Stefan Schantl
2020-11-10 12:18 ` Aw: " Bernhard Bitsch
2020-11-08 19:36 ` Aw: [PATCH 1/5] location-functions.pl: Use a single script-wide db_handle Bernhard Bitsch
2020-11-09 11:35 ` Michael Tremer
2020-11-09 12:13 ` Aw: " Bernhard Bitsch
2020-11-09 14:04 ` Michael Tremer [this message]
2020-11-09 15:16 ` Bernhard Bitsch
2020-11-09 17:03 ` Michael Tremer
Reply instructions:
You may reply publicly to this message via plain-text email
using any one of the following methods:
* Save the following mbox file, import it into your mail client,
and reply-to-all from there: mbox
Avoid top-posting and favor interleaved quoting:
https://en.wikipedia.org/wiki/Posting_style#Interleaved_style
* Reply using the --to, --cc, and --in-reply-to
switches of git-send-email(1):
git send-email \
--in-reply-to=ED0D76E4-92E3-4785-967E-575BBEFBA291@ipfire.org \
--to=michael.tremer@ipfire.org \
--cc=development@lists.ipfire.org \
/path/to/YOUR_REPLY
https://kernel.org/pub/software/scm/git/docs/git-send-email.html
* If your mail client supports setting the In-Reply-To header
via mailto: links, try the mailto: link
Be sure your reply has a Subject: header at the top and a blank line
before the message body.
This is a public inbox, see mirroring instructions
for how to clone and mirror all data and code used for this inbox