public inbox for location@lists.ipfire.org
 help / color / mirror / Atom feed
From: Michael Tremer <michael.tremer@ipfire.org>
To: location@lists.ipfire.org
Subject: Re: [PATCH] location-importer: Fix Spamhaus ASN-DROP parsing
Date: Sun, 05 Nov 2023 13:47:53 +0000	[thread overview]
Message-ID: <2ABB0AF4-CC9C-4CEA-A389-2452CC3DE685@ipfire.org> (raw)
In-Reply-To: <fd183979-30db-463a-a381-4894cf4c8033@ipfire.org>

[-- Attachment #1: Type: text/plain, Size: 3094 bytes --]

Hello Peter,

> On 4 Nov 2023, at 14:04, Peter Müller <peter.mueller(a)ipfire.org> wrote:
> 
> The format of this list has changed, from a plain text file with a
> customer schema to JSON. Adjust our routines accordingly to make use of
> this list again.
> 
> Signed-off-by: Peter Müller <peter.mueller(a)ipfire.org>
> Tested-by: Peter Müller <peter.mueller(a)ipfire.org>
> ---
> src/scripts/location-importer.in | 21 +++++++--------------
> 1 file changed, 7 insertions(+), 14 deletions(-)
> 
> diff --git a/src/scripts/location-importer.in b/src/scripts/location-importer.in
> index 28a4f6c..8b0c676 100644
> --- a/src/scripts/location-importer.in
> +++ b/src/scripts/location-importer.in
> @@ -3,7 +3,7 @@
> #                                                                             #
> # libloc - A library to determine the location of someone on the Internet     #
> #                                                                             #
> -# Copyright (C) 2020-2022 IPFire Development Team <info(a)ipfire.org>           #
> +# Copyright (C) 2020-2023 IPFire Development Team <info(a)ipfire.org>           #
> #                                                                             #
> # This library is free software; you can redistribute it and/or               #
> # modify it under the terms of the GNU Lesser General Public                  #
> @@ -1686,7 +1686,7 @@ class CLI(object):
> ]
> 
> asn_lists = [
> - ("SPAMHAUS-ASNDROP", "https://www.spamhaus.org/drop/asndrop.txt")
> + ("SPAMHAUS-ASNDROP", "https://www.spamhaus.org/drop/asndrop.json")
> ]
> 
> for name, url in ip_lists:
> @@ -1759,23 +1759,16 @@ class CLI(object):
> 
> # Iterate through every line, filter comments and add remaining ASNs to
> # the override table in case they are valid...
> - for sline in f.readlines():
> + for sline in fcontent:
> # The response is assumed to be encoded in UTF-8...
> sline = sline.decode("utf-8")
> 
> - # Comments start with a semicolon...
> - if sline.startswith(";"):
> + # Load every line as a JSON object and try to obtain an ASN from it...
> + try:
> + asn = json.loads(sline)["asn"]
> + except KeyError:
> continue

It would be nicer if you didn’t do so many things in one line, because it will get difficult to catch the correct exception.

I believe that this should be split into the json.loads() operation where you should catch any JSON decoding issues. This would make it clear that some kind of download issue or similar has happened.

Fetching the “asn” field should be a second step, because an error here means that you have received a valid JSON object, but the format has changed.

> - # Throw away anything after the first space...
> - sline = sline.split()[0]
> -
> - # ... strip the "AS" prefix from it ...
> - sline = sline.strip("AS")
> -
> - # ... and convert it into an integer. Voila.
> - asn = int(sline)
> -
> # Filter invalid ASNs...
> if not self._check_parsed_asn(asn):
> log.warning("Skipping bogus ASN found in %s (%s): %s" % \
> -- 
> 2.35.3


  reply	other threads:[~2023-11-05 13:47 UTC|newest]

Thread overview: 4+ messages / expand[flat|nested]  mbox.gz  Atom feed  top
2023-11-04 14:04 Peter Müller
2023-11-05 13:47 ` Michael Tremer [this message]
2023-11-05 16:59   ` Peter Müller
2023-11-05 18:02     ` Michael Tremer

Reply instructions:

You may reply publicly to this message via plain-text email
using any one of the following methods:

* Save the following mbox file, import it into your mail client,
  and reply-to-all from there: mbox

  Avoid top-posting and favor interleaved quoting:
  https://en.wikipedia.org/wiki/Posting_style#Interleaved_style

* Reply using the --to, --cc, and --in-reply-to
  switches of git-send-email(1):

  git send-email \
    --in-reply-to=2ABB0AF4-CC9C-4CEA-A389-2452CC3DE685@ipfire.org \
    --to=michael.tremer@ipfire.org \
    --cc=location@lists.ipfire.org \
    /path/to/YOUR_REPLY

  https://kernel.org/pub/software/scm/git/docs/git-send-email.html

* If your mail client supports setting the In-Reply-To header
  via mailto: links, try the mailto: link
Be sure your reply has a Subject: header at the top and a blank line before the message body.
This is a public inbox, see mirroring instructions
for how to clone and mirror all data and code used for this inbox