[tor-bugs] #2763 [Metrics]: Do we collect descriptors that don't get into the consensus?

Tor Bug Tracker & Wiki torproject-admin at torproject.org
Fri Mar 25 09:01:20 UTC 2011


#2763: Do we collect descriptors that don't get into the consensus?
---------------------+------------------------------------------------------
 Reporter:  arma     |          Owner:  karsten 
     Type:  task     |         Status:  assigned
 Priority:  normal   |      Milestone:          
Component:  Metrics  |        Version:          
 Keywords:           |         Parent:          
   Points:           |   Actualpoints:          
---------------------+------------------------------------------------------
Changes (by karsten):

  * status:  new => assigned


Comment:

 Replying to [comment:4 nickm]:
 > I wonder if this approach might be insufficient for your requirements.
 It will tell you about descriptors that the authorities 'they have
 accepted and have decided to keep.  It ''won't'' tell us about descriptors
 that the authorities immediately rejected, or ones that they decided (for
 whatever reason) to drop or replace.
 >
 > Do we care about those factors?

 That's a fine question.  I can't say.  I guess Sebastian or arma have an
 answer.  From a metrics POV, we're only interested in the descriptors that
 are referenced from consensuses and maybe votes.  But I understand the
 need to collect unreferenced descriptors for debugging purposes.

 What reasons are there for an authority to reject or drop a descriptor?
 a) unable to parse and b) changes are cosmetic come to mind.  I'm somewhat
 concerned about a) here.  If we want to include descriptors that the
 directory authorities cannot parse, I'll have to improve the metrics code
 for parsing descriptors.  I'd prefer to not include descriptors from case
 a), though.  Descriptors from case b) should be fine to archive.  Are
 there other reasons for the authorities to drop or reject descriptors?

 > As for the information about download size: you can make it much
 smaller.  First, instead of downloading "all", download "all.z".

 Right.  We should do that for all downloads, I guess.

 > Second, instead of downloading all extra-info descriptors, read through
 the descriptors in tor/server/all.z to see which ones you are missing, and
 download only those.  I'd bet these approaches combined would save 60-80%
 of the expected download size.

 Okay, that should work.  Is once per day enough?

-- 
Ticket URL: <https://trac.torproject.org/projects/tor/ticket/2763#comment:5>
Tor Bug Tracker & Wiki <https://trac.torproject.org/>
The Tor Project: anonymity online


More information about the tor-bugs mailing list