From mboxrd@z Thu Jan 1 00:00:00 1970 From: Michael Tremer To: location@lists.ipfire.org Subject: Re: ASNs without AS name information (LACNIC and JPNIC) Date: Wed, 19 Jan 2022 08:17:53 +0000 Message-ID: <04F7129C-AA45-4A36-87E0-B355E8E332DB@ipfire.org> In-Reply-To: MIME-Version: 1.0 Content-Type: multipart/mixed; boundary="===============3886745703791310212==" List-Id: --===============3886745703791310212== Content-Type: text/plain; charset="utf-8" Content-Transfer-Encoding: quoted-printable Hello, > On 18 Jan 2022, at 21:10, Peter M=C3=BCller wr= ote: >=20 > Hello nusenu, > hello Michael, >=20 >> Since you apparently don't like this data source >> and I always thought RIPEstat has pretty good data quality: >> Would you mind sharing your opinion on this?=20 >=20 > sorry for not replying on this sooner. Actually, I do not like or dislike R= IPEstat; I just did > not have sufficient time to made myself an educated opinion on this. >=20 > At the moment, things are quite packed on my end, but that will hopefully o= ver at the beginning > of February. So, this is not forgotten or silently discarded, but just a ve= ry tardy reply due > to my "load average"... >=20 >> @Peter: Do you want to look into extracting information from this? >=20 > Yes. >=20 > Without looking at the amount of queries we'd probably need to do: Do you t= hink this makes sense > while running the location-importer, or should this become a dedicated scri= pt, which we can run > in the background all the time, so it won't slow down the daily generation = of the actual database. >=20 > In case of the latter, we could actually do some scraping on the ARIN AS na= mes, too. Since we keep > track of their source, this should not be too hard, and if we get some more= human-readable names > for some of them, it might be worth the effort. I thought we were talking about parsing an HTML table. That should not be a p= rocess that is either complicated nor something I would call scraping. Scraping is what I would consider sending one request per piece of informatio= n you would want to obtain and this is always a bad idea. It is slow, it has = a lot of overhead on our side and of course on the server side - this is not = what a good citizen of the internet would do. So I would be against this. Mos= t places have this excluded in their t&cs for exactly this reason. -Michael >=20 > Thanks, and best regards, > Peter M=C3=BCller --===============3886745703791310212==--