From: "Peter Müller" <peter.mueller@ipfire.org>
To: location@lists.ipfire.org
Subject: [PATCH] location-importer.in: Import (technical) AS names from ARIN
Date: Tue, 08 Jun 2021 12:10:36 +0000
Message-ID: <20210608121036.16242-1-peter.mueller@ipfire.org>

ARIN and LACNIC, unfortunately, do not seem to publish data containing
human-readable AS names. For the former, we at least have a list of
technical names, which this patch fetches and inserts into the autnums
table.

While some of them do not seem to be suitable for human consumption
(i.e. they are very cryptic), providing this data might be helpful
nevertheless.

Signed-off-by: Peter Müller <peter.mueller(a)ipfire.org>
---
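The per-line filtering done by _import_as_names_from_arin() in the diff below can be sketched as a standalone function. This is only an illustration and not part of the patch: the helper names (is_valid_asn, parse_arin_as_line) are hypothetical, the sample lines in the usage note are made up, and the ASN ranges and regular expressions are copied from the patch. Unlike the patch, the sketch also rejects lines with fewer than two fields and names that are not ASCII.

```python
import re

# Placeholder pattern taken from the patch: names that merely point at
# another RIR or at IANA rather than naming an actual network.
PLACEHOLDER = re.compile(
    r"^(ASN-BLK|)(AFCONC|AFRINIC|APNIC|ASNBLK|DNIC|LACNIC|RIPE|IANA)\d{0,1}-*"
)

# Anything outside letters, digits, underscore and hyphen is unexpected.
INVALID_CHARS = re.compile(r"[^a-zA-Z0-9_-]")


def is_valid_asn(asn):
    """Same ranges as the patch; values outside them are skipped."""
    return (
        1 <= asn <= 23455
        or 23457 <= asn <= 64495
        or 131072 <= asn <= 4199999999
    )


def parse_arin_as_line(line):
    """Hypothetical helper: return (asn, name) for a usable line, else None."""
    # Data lines start with a space, followed by the AS number
    if not line.startswith(b" "):
        return None

    fields = line.split()
    if len(fields) < 2:
        return None

    try:
        asn = int(fields[0])
        name = fields[1].decode("ascii")
    except ValueError:
        # Covers a non-numeric ASN and non-ASCII names
        # (UnicodeDecodeError is a subclass of ValueError)
        return None

    if not is_valid_asn(asn):
        return None

    if PLACEHOLDER.match(name) or INVALID_CHARS.search(name):
        return None

    return asn, name
```

For example, parse_arin_as_line(b" 701 UUNET") returns (701, "UUNET"), while a line such as b" 23456 AS-TRANS" or b" 1877 RIPE-ASNBLOCK1" yields None.
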
 src/python/location-importer.in | 61 +++++++++++++++++++++++++++++++++
 1 file changed, 61 insertions(+)

diff --git a/src/python/location-importer.in b/src/python/location-importer.in
index aa3b8f7..2a9bf33 100644
--- a/src/python/location-importer.in
+++ b/src/python/location-importer.in
@@ -505,6 +505,9 @@ class CLI(object):
 						for line in f:
 							self._parse_line(line, source_key, validcountries)
 
+		# Download and import (technical) AS names from ARIN
+		self._import_as_names_from_arin()
+
 	def _check_parsed_network(self, network):
 		"""
 			Assistive function to detect and subsequently sort out parsed
@@ -775,6 +778,64 @@ class CLI(object):
 			"%s" % network, country, [country], source_key,
 		)
 
+	def _import_as_names_from_arin(self):
+		downloader = location.importer.Downloader()
+
+		# XXX: Download AS names file from ARIN (note that these names appear to be quite
+		# technical and not intended for human consumption, unlike the description fields
+		# in organisation handles of other RIRs - however, this is what we have got, and
+		# in some cases, it might still be better than nothing)
+		try:
+			with downloader.request("https://ftp.arin.net/info/asn.txt", return_blocks=False) as f:
+				arin_as_names_file = f.body
+		except Exception as e:
+			log.error("failed to download and preprocess AS name file from ARIN: %s" % e)
+			return
+
+		# Split downloaded body into lines and parse each of them...
+		for sline in arin_as_names_file.readlines():
+
+			# ... valid lines start with a space, followed by the number of the Autonomous System ...
+			if not sline.startswith(b" "):
+				continue
+
+			# Split line and check if there is a valid ASN in it...
+			scontents = sline.split()
+			try:
+				asn = int(scontents[0])
+			except (IndexError, ValueError):
+				log.debug("Skipping ARIN AS names line not containing an integer for ASN")
+				continue
+
+			if not (1 <= asn <= 23455 or 23457 <= asn <= 64495 or 131072 <= asn <= 4199999999):
+				log.debug("Skipping ARIN AS names line not containing a valid ASN: %s" % asn)
+				continue
+
+			# Skip any AS name that appears to be a placeholder for a different RIR or entity...
+			as_name = scontents[1].decode("ascii")
+
+			if re.match(r"^(ASN-BLK|)(AFCONC|AFRINIC|APNIC|ASNBLK|DNIC|LACNIC|RIPE|IANA)\d{0,1}-*", as_name):
+				continue
+
+			# Bail out in case the AS name contains anything we do not expect here...
+			if re.search(r"[^a-zA-Z0-9-_]", as_name):
+				log.debug("Skipping ARIN AS name for %s containing invalid characters: %s" % (asn, as_name))
+				continue
+
+			# Things look good here, run INSERT statement and skip this one if we already have
+			# a (better?) name for this Autonomous System...
+			self.db.execute("""
+				INSERT INTO autnums(
+					number,
+					name,
+					source
+				) VALUES (%s, %s, %s)
+				ON CONFLICT (number) DO NOTHING""",
+				asn,
+				as_name,
+				"ARIN",
+			)
+
 	def handle_update_announcements(self, ns):
 		server = ns.server[0]
 
-- 
2.20.1
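
The effect of the ON CONFLICT (number) DO NOTHING clause in the INSERT statement above can be tried in isolation. The following is a minimal sketch that uses an in-memory SQLite database (3.24 or newer) instead of the PostgreSQL database the importer talks to. The table definition, the insert_as_name helper and the sample rows are assumptions made for illustration only.

```python
import sqlite3

db = sqlite3.connect(":memory:")

# Assumed, simplified schema; the real autnums table may differ
db.execute(
    "CREATE TABLE autnums(number INTEGER PRIMARY KEY, name TEXT NOT NULL, source TEXT)"
)


def insert_as_name(db, asn, name, source):
    """Hypothetical helper: insert a name unless the ASN already has one."""
    db.execute(
        """
        INSERT INTO autnums(number, name, source)
        VALUES (?, ?, ?)
        ON CONFLICT (number) DO NOTHING
        """,
        (asn, name, source),
    )


# A name imported earlier from another source is kept ...
insert_as_name(db, 701, "Existing descriptive name", "OTHER")
insert_as_name(db, 701, "UUNET", "ARIN")

# ... while an ASN without any name yet receives the ARIN one
insert_as_name(db, 7018, "ATT-INTERNET4", "ARIN")

rows = db.execute(
    "SELECT number, name, source FROM autnums ORDER BY number"
).fetchall()
```

After running this, rows holds the untouched entry for AS701 and the new ARIN entry for AS7018, which is the "skip this one if we already have a (better?) name" behaviour the patch comment describes.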


Thread overview: 4+ messages
2021-06-08 12:10 Peter Müller [this message]
2021-06-08 14:40 ` Michael Tremer
2021-06-08 15:10   ` Peter Müller
2021-06-08 15:15     ` Michael Tremer