[metrics-bugs] #33061 [Metrics/CollecTor]: archived bandwidth scanner files lack explicit source attribution

Tor Bug Tracker & Wiki blackhole at torproject.org
Thu Jan 30 23:36:49 UTC 2020


#33061: archived bandwidth scanner files lack explicit source attribution
-------------------------------+-----------------------------------
 Reporter:  starlight          |          Owner:  metrics-team
     Type:  enhancement        |         Status:  needs_information
 Priority:  Medium             |      Milestone:
Component:  Metrics/CollecTor  |        Version:
 Severity:  Normal             |     Resolution:
 Keywords:                     |  Actual Points:
Parent ID:                     |         Points:
 Reviewer:                     |        Sponsor:
-------------------------------+-----------------------------------

Comment (by starlight):

 Replying to [comment:4 karsten]:
 > Replying to [comment:3 starlight]:
 > > To clarify further:  Each bandwidth scanner has a unique perspective
 of available bandwidth capacities in the network.  Associating documents
 in time series tied to individual scanners is critical to making sense of
 the data.
 >
 > True. What you'll have to do is combine bandwidth files with votes to
 extract meaningful results.

 I agree that combining votes and bandwidth documents is useful, but I find
 significant value in bandwidth scanner documents alone, provided the source
 scanners are attributed.

 >. . .it's also not trivial or maybe not even possible for CollecTor to
 include this information in bandwidth files while archiving them.

 I'm curious why; I have no difficulty with attribution here.  The
 scanner-to-authority correlation may not be the big-picture design, but it
 is the practical reality to date.

 >
 > Note that combining descriptors is not unusual for an analysis. Right
 now I'm combining consensuses, votes, server descriptors, and extra-infos
 for another, unrelated analysis. Sometimes it's simply necessary to
 combine data from different data sources; in the bandwidth files case from
 bandwidth scanners and directory authorities using bandwidth scanner data.

 No disagreement that some forms of analysis are fine, or even better,
 without the source.

 =====

 I managed to write a Perl script that successfully attributes scanner
 sources for the gaps filled from CollecTor.  I am willing to make the
 results available.
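
 For illustration only (this is not the Perl script mentioned above), here
 is a minimal Python sketch of one way scanner-to-authority attribution
 might be done by matching bandwidth files against votes.  It assumes that
 each vote has a `dir-source` line naming the authority and a
 `bandwidth-file-digest` line whose `sha256=` value is the base64 SHA-256 of
 the entire bandwidth file as archived, with padding possibly stripped.
 That assumption should be checked against dir-spec and real data.  All
 function names are hypothetical.

```python
import base64
import hashlib
import re

# Assumed vote line formats (verify against dir-spec before relying on them):
#   dir-source <nickname> <identity> <address> <IP> <dirport> <orport>
#   bandwidth-file-digest sha256=<base64 digest>
DIR_SOURCE_RE = re.compile(r"^dir-source (\S+) ", re.MULTILINE)
DIGEST_RE = re.compile(r"^bandwidth-file-digest .*?sha256=(\S+)", re.MULTILINE)


def file_digest(bandwidth_file: bytes) -> str:
    """Return the base64 SHA-256 of a bandwidth file, without '=' padding."""
    raw = hashlib.sha256(bandwidth_file).digest()
    return base64.b64encode(raw).decode("ascii").rstrip("=")


def vote_attribution(vote_text: str):
    """Return (digest, authority nickname) for a vote, or None if either is absent."""
    source = DIR_SOURCE_RE.search(vote_text)
    digest = DIGEST_RE.search(vote_text)
    if source is None or digest is None:
        return None
    return digest.group(1).rstrip("="), source.group(1)


def attribute(bandwidth_files: dict, votes: list) -> dict:
    """Map each bandwidth file name to the authority whose vote cites its digest.

    Files with no matching vote map to None.
    """
    by_digest = {}
    for vote_text in votes:
        pair = vote_attribution(vote_text)
        if pair is not None:
            digest, nickname = pair
            by_digest[digest] = nickname
    return {
        name: by_digest.get(file_digest(data))
        for name, data in bandwidth_files.items()
    }
```

 Votes that carry no digest line yield no match in this sketch, so any
 files they would have covered need some other form of attribution.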

--
Ticket URL: <https://trac.torproject.org/projects/tor/ticket/33061#comment:5>
Tor Bug Tracker & Wiki <https://trac.torproject.org/>
The Tor Project: anonymity online

