[metrics-bugs] #31901 [Metrics/Website]: webstats-tb.html graph too eager to include today's stats

Tor Bug Tracker & Wiki blackhole at torproject.org
Tue Oct 1 14:56:07 UTC 2019

#31901: webstats-tb.html graph too eager to include today's stats
 Reporter:  arma             |          Owner:  karsten
     Type:  defect           |         Status:  accepted
 Priority:  Medium           |      Milestone:
Component:  Metrics/Website  |        Version:
 Severity:  Normal           |     Resolution:
 Keywords:                   |  Actual Points:
Parent ID:                   |         Points:
 Reviewer:                   |        Sponsor:

Comment (by karsten):

 Here's the long version of what I think has happened: one log file for
 dist.tp.o is written at around 12am every day whereas four others are
 written at around 7am. That first log file is sanitized by CollecTor at
 4am and the others at 10am. The metrics-web cronjob runs at 9am, so it
 only sees that first log file and not the four others. All this should not
 be an issue, because we're delaying sanitization for three days. We
 shouldn't be looking at log file timestamps but contained request
 timestamps. I'm not sure why we're not doing this. This might be a bug.

 For now, I changed the timing of sanitizing web logs from running at
 4:21am, 10:21am, etc. to 7:41am, 1:41pm, etc. This should better sync with
 our metrics-web cronjob. And in theory it shouldn't break anything. Let's
 see how it works over the next few days.

 The real fix is described in the first paragraph. This is a bit harder to
 fix, though.

Ticket URL: <https://trac.torproject.org/projects/tor/ticket/31901#comment:2>
Tor Bug Tracker & Wiki <https://trac.torproject.org/>
The Tor Project: anonymity online

More information about the metrics-bugs mailing list