[tor-bugs] #2680 [Metrics]: present bridge usage data so researchers can focus on the math

Tor Bug Tracker & Wiki torproject-admin at torproject.org
Mon Mar 14 20:39:17 UTC 2011


#2680: present bridge usage data so researchers can focus on the math
---------------------+------------------------------------------------------
 Reporter:  arma     |          Owner:  karsten 
     Type:  task     |         Status:  assigned
 Priority:  normal   |      Milestone:          
Component:  Metrics  |        Version:          
 Keywords:           |         Parent:          
   Points:           |   Actualpoints:          
---------------------+------------------------------------------------------

Comment(by arma):

 The "fingerprint" and "descriptor" columns in statuses.csv are always the
 same. I think you're printing "fingerprint" for both of them?

 I think the next step is to write a short overview of how to combine
 these files to answer some research question. For example, say I want to
 get a list of all the countries that a given bridge has seen over time. I
 guess I want to iterate over all bridge fingerprints. Should I use the
 list of all fingerprints I find in statuses.csv or the one in
 descriptors.csv? Should they be the same?

 So step zero, given a fingerprint, is to look it up in relays.csv and make
 sure it's not there. If it is, either ignore it or, if we want to get
 fancier, ignore only its data from close to the time it's in the relay
 list.
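
 A minimal sketch of step zero, in Python. It assumes relays.csv has a
 header row with a "fingerprint" column; that column name, the helper
 names and the sample rows are all hypothetical. It implements only the
 simple "ignore it" variant, not the time-window variant.

```python
import csv
import io


def relay_fingerprints(relays_csv):
    """Collect every fingerprint listed in a relays.csv-style file.

    Assumes a header row with a "fingerprint" column (hypothetical).
    """
    return {row["fingerprint"] for row in csv.DictReader(relays_csv)}


def drop_relays(bridge_fingerprints, relay_fps):
    """Keep only the bridge fingerprints that never show up as relays."""
    return [fp for fp in bridge_fingerprints if fp not in relay_fps]


# Made-up sample data, standing in for the real relays.csv.
sample_relays = io.StringIO(
    "fingerprint,validafter\n"
    "AAAA,2011-03-01 00:00:00\n"
)
relay_fps = relay_fingerprints(sample_relays)
kept = drop_relays(["AAAA", "BBBB"], relay_fps)
```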

 Step one is to look it up in statuses.csv, get a set of descriptor hashes,
 discard all the ones whose third-to-last value is not TRUE, and skip
 duplicate hashes.
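
 Step one could look roughly like the sketch below. The only things taken
 from the files as described here are the "fingerprint" and "descriptor"
 column names and the rule about the third-to-last value; the header row,
 the other column names and the sample rows are assumptions.

```python
import csv
import io


def usable_descriptors(statuses_csv, fingerprint):
    """Return the distinct descriptor hashes seen for one fingerprint.

    Rows whose third-to-last value is not "TRUE" are discarded, and each
    hash is reported once, in order of first appearance.
    """
    reader = csv.reader(statuses_csv)
    header = next(reader)
    fp_col = header.index("fingerprint")
    desc_col = header.index("descriptor")
    seen = set()
    hashes = []
    for row in reader:
        if row[fp_col] != fingerprint:
            continue
        if row[-3] != "TRUE":  # third-to-last value must be TRUE
            continue
        if row[desc_col] in seen:  # skip duplicate hashes
            continue
        seen.add(row[desc_col])
        hashes.append(row[desc_col])
    return hashes


# Made-up sample; only "fingerprint" and "descriptor" are real names.
sample_statuses = io.StringIO(
    "status,fingerprint,descriptor,flag,col_a,col_b\n"
    "2011-03-01 00:00:00,BBBB,D1,TRUE,x,y\n"
    "2011-03-01 01:00:00,BBBB,D1,TRUE,x,y\n"
    "2011-03-01 02:00:00,BBBB,D2,FALSE,x,y\n"
    "2011-03-01 03:00:00,BBBB,D3,TRUE,x,y\n"
    "2011-03-01 03:00:00,CCCC,D4,TRUE,x,y\n"
)
hashes = usable_descriptors(sample_statuses, "BBBB")
```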

 Then step two is to take those remaining descriptor hashes and look them
 up in descriptors.csv, at which point I can learn which countries they
 saw, unless the countries are all NA, in which case we don't have data?
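
 A sketch of step two, under stronger assumptions: that descriptors.csv
 has a "descriptor" column holding the hash plus one column per country
 code, each holding either a count or NA. That layout, the country_cols
 parameter and the sample rows are guesses for illustration only. The
 function returns None when every value is NA, to keep "no data" apart
 from "no users".

```python
import csv
import io


def countries_seen(descriptors_csv, hashes, country_cols):
    """Return the set of countries with a nonzero count, or None.

    None means every country value for the matching descriptors was NA,
    i.e. there is no usage data at all.
    """
    wanted = set(hashes)
    seen = set()
    had_data = False
    for row in csv.DictReader(descriptors_csv):
        if row["descriptor"] not in wanted:
            continue
        for cc in country_cols:
            value = row[cc]
            if value == "NA":
                continue
            had_data = True
            if float(value) > 0:
                seen.add(cc)
    return seen if had_data else None


# Made-up sample data with a hypothetical column layout.
SAMPLE_DESCRIPTORS = (
    "descriptor,cn,de,us\n"
    "D1,8,NA,0\n"
    "D3,NA,NA,NA\n"
    "D9,16,16,16\n"
)
cols = ["cn", "de", "us"]
seen = countries_seen(io.StringIO(SAMPLE_DESCRIPTORS), ["D1", "D3"], cols)
no_data = countries_seen(io.StringIO(SAMPLE_DESCRIPTORS), ["D3"], cols)
```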

 And the optional step three is to take the timestamp from the status file
 and look up the fingerprint in assignments.csv to decide if it's http,
 email, or unassigned?
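
 One possible reading of step three, sketched below: pick the most recent
 assignment for the fingerprint at or before the status timestamp. The
 column names ("assignment", "fingerprint", "type"), the timestamp format
 and the sample rows are assumptions, and "most recent at or before" is
 one interpretation of the lookup, not a confirmed rule.

```python
import csv
import io
from datetime import datetime

TIME_FORMAT = "%Y-%m-%d %H:%M:%S"  # assumed timestamp format


def distributor_at(assignments_csv, fingerprint, status_time):
    """Return the assignment type in effect at status_time, or None.

    Picks the latest assignment row for the fingerprint whose timestamp
    is not later than status_time.
    """
    when = datetime.strptime(status_time, TIME_FORMAT)
    best_time = None
    best_type = None
    for row in csv.DictReader(assignments_csv):
        if row["fingerprint"] != fingerprint:
            continue
        assigned = datetime.strptime(row["assignment"], TIME_FORMAT)
        if assigned <= when and (best_time is None or assigned > best_time):
            best_time = assigned
            best_type = row["type"]
    return best_type


# Made-up sample data with hypothetical column names.
SAMPLE_ASSIGNMENTS = (
    "assignment,fingerprint,type\n"
    "2011-02-01 00:00:00,BBBB,unassigned\n"
    "2011-03-01 00:00:00,BBBB,email\n"
    "2011-03-10 00:00:00,BBBB,http\n"
)
pool = distributor_at(
    io.StringIO(SAMPLE_ASSIGNMENTS), "BBBB", "2011-03-05 12:00:00"
)
```

 A result of None here means no assignment was found at or before that
 time, which is different from a row that explicitly says "unassigned".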

-- 
Ticket URL: <https://trac.torproject.org/projects/tor/ticket/2680#comment:3>
Tor Bug Tracker & Wiki <https://trac.torproject.org/>
The Tor Project: anonymity online

