[tor-bugs] #19118 [Metrics/Onionoo]: Add organization name to each relay

Tor Bug Tracker & Wiki blackhole at torproject.org
Thu May 19 09:17:22 UTC 2016


#19118: Add organization name to each relay
-----------------------------+-----------------------------------
 Reporter:  virgil           |          Owner:  karsten
     Type:  enhancement      |         Status:  needs_information
 Priority:  Medium           |      Milestone:
Component:  Metrics/Onionoo  |        Version:
 Severity:  Normal           |     Resolution:
 Keywords:  hardening        |  Actual Points:
Parent ID:                   |         Points:
 Reviewer:                   |        Sponsor:
-----------------------------+-----------------------------------
Changes (by karsten):

 * status:  new => needs_information


Comment:

 I'm yet unclear what we'd gain by adding CAIDA.org data.  We're using
 MaxMind's GeoLite ASN file which contains the following entry for 1984
 Hosting:

 {{{
 1566564352,1566566399,"AS44925 1984 ehf AS number"
 }}}

 Onionoo would include that as follows in a relay details document:

 {{{
 "as_number":"AS44925","as_name":"1984 ehf AS number"
 }}}

 (Admittedly, the "AS number" part in that string doesn't make much sense
 and looks like a data import problem on MaxMind's side.  But we can
 probably expect similar problems with CAIDA.org's data, just not with this
 particular entry.)

 But let's also look at a bigger AS/organization that hosts a lot of
 relays: OVH.  Here's what CAIDA.org says about OVH:

 {{{
 ORG-OS3-RIPE||OVH SAS|FR|RIPE
 16276||OVH|ORG-OS3-RIPE|RIPE
 35540||OVH-TELECOM|ORG-OS3-RIPE|RIPE
 }}}

 And here's what MaxMind's ASN file says about OVH:

 {{{
 86441984,86474751,"AS16276 OVH SAS"
 92733440,92798975,"AS16276 OVH SAS"
 96731136,96796671,"AS16276 OVH SAS"
 134738944,134739199,"AS16276 OVH SAS"
 135430144,135430399,"AS16276 OVH SAS"
 135432192,135434239,"AS16276 OVH SAS"
 135441408,135441663,"AS16276 OVH SAS"
 135556608,135556863,"AS16276 OVH SAS"
 135604480,135604735,"AS16276 OVH SAS"
 135792640,135794687,"AS16276 OVH SAS"
 135945728,135945983,"AS16276 OVH SAS"
 136175616,136175871,"AS16276 OVH SAS"
 136237056,136239103,"AS16276 OVH SAS"
 136404992,136407039,"AS16276 OVH SAS"
 136413184,136415743,"AS16276 OVH SAS"
 624623616,624689151,"AS16276 OVH SAS"
 624701440,624705535,"AS16276 OVH SAS"
 633012224,633077759,"AS16276 OVH SAS"
 635305984,635338751,"AS16276 OVH SAS"
 635371520,635437055,"AS16276 OVH SAS"
 778633216,778698751,"AS16276 OVH SAS"
 1056243712,1056251903,"AS16276 OVH SAS"
 1466073088,1466105855,"AS16276 OVH SAS"
 1532647424,1532649471,"AS16276 OVH SAS"
 1534656512,1534722047,"AS16276 OVH SAS"
 1558052864,1558118399,"AS16276 OVH SAS"
 1578565632,1578631167,"AS16276 OVH SAS"
 1728384000,1728385023,"AS16276 OVH SAS"
 1841168384,1841233919,"AS35540 OVH SAS"
 2382675968,2382684159,"AS16276 OVH SAS"
 2809266176,2809331711,"AS16276 OVH SAS"
 2954821632,2954887167,"AS16276 OVH SAS"
 2988441600,2988572671,"AS16276 OVH SAS"
 3001868288,3001872383,"AS16276 OVH SAS"
 3104444672,3104444927,"AS16276 OVH SAS"
 3104579584,3104580095,"AS16276 OVH SAS"
 3164930048,3164985007,"AS16276 OVH SAS"
 3164985009,3164995583,"AS16276 OVH SAS"
 3227451392,3227467775,"AS16276 OVH SAS"
 3227713536,3227779071,"AS16276 OVH SAS"
 3244823296,3244823551,"AS16276 OVH SAS"
 3245162240,3245162495,"AS16276 OVH SAS"
 3278773760,3278774271,"AS16276 OVH SAS"
 3287738368,3287738879,"AS16276 OVH SAS"
 3323674624,3323691007,"AS16276 OVH SAS"
 3325198336,3325231103,"AS16276 OVH SAS"
 3328479232,3328483327,"AS16276 OVH SAS"
 3337957376,3337961471,"AS16276 OVH SAS"
 3585744896,3585753087,"AS16276 OVH SAS"
 3590029312,3590045695,"AS16276 OVH SAS"
 }}}

 Wouldn't we include the exact same output after switching to CAIDA.org
 data?

 I'm hesitant to add another data source, because I expect inconsistencies
 between the two data sources where we don't have the exact same AS numbers
 in the two files and similar issues.

 Another (minor) issue is the additional overhead for Onionoo server
 operators.

 Stated differently, I'd want us to have a good reason for adding another
 data source.  Can you maybe give a counterexample where using CAIDA.org
 data in addition to MaxMind data would enhance Onionoo data notably?

--
Ticket URL: <https://trac.torproject.org/projects/tor/ticket/19118#comment:1>
Tor Bug Tracker & Wiki <https://trac.torproject.org/>
The Tor Project: anonymity online


More information about the tor-bugs mailing list