[tor-bugs] #9199 [BridgeDB]: Rethink the logging of BridgeDB

Tor Bug Tracker & Wiki blackhole at torproject.org
Thu Jul 4 17:17:01 UTC 2013


#9199: Rethink the logging of BridgeDB
----------------------+-----------------------------------------------------
 Reporter:  asn       |          Owner:                   
     Type:  task      |         Status:  needs_information
 Priority:  normal    |      Milestone:                   
Component:  BridgeDB  |        Version:                   
 Keywords:            |         Parent:                   
   Points:            |   Actualpoints:                   
----------------------+-----------------------------------------------------

Comment(by sysrqb):

 Replying to [comment:3 asn]:
 > Replying to [comment:2 sysrqb]:
 > > a) Should we scrub the bridge's fingerprint when safe logging is not
 disabled? It can easily be used to retrieve the IP addr via Atlas, etc.
 Maybe we should hash the fingerprint by default so various aspects of the
 log file are linkable?
 >
 > Hm, do we even care about scrubbing the IP addresses of bridges? I was
 mostly worrying about clients' IPs.

 I don't know, do we? My thought was "what happens if an adversary obtains
 a log file?", but maybe this isn't something we should worry about.
 Currently we don't log client IP addresses (only the /24 "area", but that
 may still be too much info). We also log email addresses, which I think we
 should scrub. My current branch scrubs these last two cases.
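
 A minimal sketch of the kind of scrubbing discussed above, not the code in
 my branch. The helper names (scrub_emails, area_of, hash_fingerprint), the
 email regex, and the choice of SHA-256 over the hex string are all
 assumptions made for illustration.

```python
import hashlib
import ipaddress
import re

# Hypothetical helpers for illustration; none of these names come from BridgeDB.

# Deliberately loose pattern: good enough to catch addresses in log lines,
# not a full RFC 5322 parser.
EMAIL_RE = re.compile(r"[A-Za-z0-9._%+-]+@[A-Za-z0-9.-]+\.[A-Za-z]{2,}")


def scrub_emails(message):
    """Replace anything that looks like an email address in a log message."""
    return EMAIL_RE.sub("[scrubbed]", message)


def area_of(ipv4):
    """Return the /24 network ("area") that an IPv4 address belongs to."""
    return str(ipaddress.ip_network(ipv4 + "/24", strict=False))


def hash_fingerprint(fingerprint):
    """Return a stable digest so log lines stay linkable without the raw value.

    Assumption: hashing the upper-cased hex string with SHA-256. This is not
    claimed to match any hashed-fingerprint format used elsewhere in Tor.
    """
    return hashlib.sha256(fingerprint.upper().encode("ascii")).hexdigest()
```

 For example, scrub_emails("request from alice@example.com ok") gives
 "request from [scrubbed] ok", and area_of("1.2.3.4") gives "1.2.3.0/24".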

 > > c) I think providing a heartbeat is a nice idea. How do you feel about
 displaying uptime, user GEOIP stats, OS stats (based on user-agent) over
 the past n hours? This won't be trivial, but it shouldn't be too
 difficult. I think we should push this to v2 also.
 >
 > Yeah, I also like heartbeats.

 It was your idea :)

 >
 > BTW, be aware that if you want a "X unique IPs asked for bridges during
 the last N hours" dialog, you also need to keep client IPs in memory. We
 should probably not do that.
 >

 For GEOIP stats, I was thinking we would analyze the information in
 real-time and only need to keep counters in a dict for the various values,
 but this would not show unique queries. If we want unique IP stats, we can
 hash the IP and store it in the DB only for the time period, very similar
 to what we do for email addresses. We absolutely should not store IP
 addresses, agreed.
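
 A rough sketch of that idea, under stated assumptions: the class name
 RequestStats and its methods are hypothetical, an in-memory dict stands in
 for the DB, and the keyed hash (HMAC with a random per-process key) is my
 addition, since a plain hash of an IPv4 address can be brute-forced over
 the whole 2^32 address space.

```python
import hashlib
import hmac
import os
import time
from collections import Counter


class RequestStats:
    """Hypothetical stats holder: per-country counters plus hashed client IPs."""

    def __init__(self, window_seconds=3600, key=None):
        self.window = window_seconds
        # Random per-process key (an assumption, not from the ticket), so the
        # digests cannot be reversed by hashing every possible address.
        self.key = key or os.urandom(32)
        self.by_country = Counter()
        self.seen = {}  # digest -> timestamp of the most recent request

    def record(self, ip, country, now=None):
        """Count one request; only a keyed digest of the IP is kept."""
        now = time.time() if now is None else now
        self.by_country[country] += 1
        digest = hmac.new(self.key, ip.encode("ascii"), hashlib.sha256).hexdigest()
        self.seen[digest] = now

    def unique_clients(self, now=None):
        """Drop digests older than the window and return how many remain."""
        now = time.time() if now is None else now
        cutoff = now - self.window
        self.seen = {d: t for d, t in self.seen.items() if t >= cutoff}
        return len(self.seen)
```

 The counters alone never need the IP at all; only the unique-client count
 needs the digests, and those are dropped once they fall out of the window.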

 > > d) If we're auditing the logging, do we want to consider switching to
 [https://twistedmatrix.com/documents/12.2.0/core/howto/logging.html
 twisted.python.log]? Isis brought this up a few weeks ago. If we're going
 to make a decision on it, now is a fine
 time.[https://twistedmatrix.com/trac/wiki/TwistedLogging 0]
 > >
 >
 > Hm. How hard would it be to switch to Twisted logging? What are the
 advantages of using Twisted logging? Do we care about them? If the
 advantages are not too great, I'd rate this as low-priority and probably
 do something more important if it takes more than 2 hours of
 coding/thinking.
 >
 > It's up to you :)
 >

 Deferring to Isis' answer above :)

-- 
Ticket URL: <https://trac.torproject.org/projects/tor/ticket/9199#comment:5>
Tor Bug Tracker & Wiki <https://trac.torproject.org/>
The Tor Project: anonymity online

