Hello,

> On 18 Jan 2022, at 21:10, Peter Müller <peter.mueller(a)ipfire.org> wrote:
> 
> Hello nusenu,
> hello Michael,
> 
>> Since you apparently don't like this data source
>> and I always thought RIPEstat has pretty good data quality:
>> Would you mind sharing your opinion on this? 
> 
> Sorry for not replying to this sooner. Actually, I neither like nor dislike RIPEstat; I just did
> not have sufficient time to form an educated opinion on it.
> 
> At the moment, things are quite packed on my end, but that will hopefully be over at the beginning
> of February. So, this is not forgotten or silently discarded, but just a very tardy reply due
> to my "load average"...
> 
>> @Peter: Do you want to look into extracting information from this?
> 
> Yes.
> 
> Without looking at the number of queries we'd probably need to do: Do you think this makes sense
> while running the location-importer, or should this become a dedicated script, which we can run
> in the background all the time, so it won't slow down the daily generation of the actual database?
> 
> In case of the latter, we could actually do some scraping on the ARIN AS names, too. Since we keep
> track of their source, this should not be too hard, and if we get some more human-readable names
> for some of them, it might be worth the effort.

I thought we were talking about parsing an HTML table. That should be neither complicated nor something I would call scraping.
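
As a rough sketch of what I mean by parsing a table: the code below assumes the data arrives as one HTML document. The table layout, the sample rows and the names TableParser and parse_table are made up for illustration and are not the actual RIPEstat or ARIN format.

```python
from html.parser import HTMLParser


class TableParser(HTMLParser):
    """Collect the text of every <td>/<th> cell, grouped by <tr> row."""

    def __init__(self):
        super().__init__()
        self.rows = []
        self._row = None   # cells of the row currently being read
        self._cell = None  # text fragments of the cell currently being read

    def handle_starttag(self, tag, attrs):
        if tag == "tr":
            self._row = []
        elif tag in ("td", "th") and self._row is not None:
            self._cell = []

    def handle_data(self, data):
        # Only keep text that appears inside a cell
        if self._cell is not None:
            self._cell.append(data)

    def handle_endtag(self, tag):
        if tag in ("td", "th") and self._cell is not None:
            self._row.append("".join(self._cell).strip())
            self._cell = None
        elif tag == "tr" and self._row is not None:
            self.rows.append(self._row)
            self._row = None
            self._cell = None


def parse_table(html):
    """Return all table rows found in the given HTML as lists of strings."""
    parser = TableParser()
    parser.feed(html)
    parser.close()
    return parser.rows


# Hypothetical sample (documentation ASNs); in practice this would be
# the body of ONE HTTP response, fetched once.
SAMPLE = """
<table>
  <tr><th>ASN</th><th>Name</th></tr>
  <tr><td>AS64496</td><td>Example Network One</td></tr>
  <tr><td>AS64497</td><td>Example Network Two</td></tr>
</table>
"""

rows = parse_table(SAMPLE)
# Skip the header row and map each ASN to its name
names = {asn: name for asn, name in rows[1:]}
```

The whole table comes out of a single document, so that is one request in total rather than one request per AS.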

What I would consider scraping is sending one request per piece of information you want to obtain, and that is always a bad idea. It is slow and creates a lot of overhead on our side and, of course, on the server side. This is not what a good citizen of the internet would do, so I would be against it. Most places prohibit this in their T&Cs for exactly this reason.

-Michael

> 
> Thanks, and best regards,
> Peter Müller