[tor-bugs] #2923 [Metrics]: Improve materialized views in the metrics database

Tor Bug Tracker & Wiki torproject-admin at torproject.org
Fri Apr 15 11:11:18 UTC 2011


#2923: Improve materialized views in the metrics database
-------------------------+--------------------------------------------------
 Reporter:  karsten      |          Owner:  karsten
     Type:  enhancement  |         Status:  new    
 Priority:  normal       |      Milestone:         
Component:  Metrics      |        Version:         
 Keywords:               |         Parent:         
   Points:               |   Actualpoints:         
-------------------------+--------------------------------------------------
 The [https://gitweb.torproject.org/metrics-web.git/blob/HEAD:/db/tordir.sql metrics database schema] uses
 periodically updated tables similar to materialized views for aggregating
 statistics.  When inserting data into the database, we write the dates
 whose data has changed to a separate updates table.  Every three hours, we
 delete the aggregates for those dates and recompute them, which takes a few
 minutes.
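
 As a rough illustration of the refresh pattern described above, here is a
 minimal sketch.  It uses Python's built-in `sqlite3` module as a stand-in
 for the real database, and the table and column names (`raw_data`,
 `updates`, `daily_stats`) are hypothetical; they are not taken from
 `tordir.sql`.

```python
import sqlite3

# Hypothetical, simplified schema: names are illustrative only and are
# not taken from tordir.sql.
conn = sqlite3.connect(":memory:")
conn.executescript("""
    CREATE TABLE raw_data (date TEXT, value INTEGER);
    CREATE TABLE updates (date TEXT PRIMARY KEY);
    CREATE TABLE daily_stats (date TEXT PRIMARY KEY, total INTEGER);
""")

def insert_raw(conn, date, value):
    """Insert a raw row and record that this date's aggregate is stale."""
    conn.execute("INSERT INTO raw_data (date, value) VALUES (?, ?)", (date, value))
    conn.execute("INSERT OR IGNORE INTO updates (date) VALUES (?)", (date,))

def refresh_daily_stats(conn):
    """Recompute aggregates only for dates listed in the updates table."""
    with conn:  # one transaction: delete, recompute, clear the dirty list
        conn.execute(
            "DELETE FROM daily_stats WHERE date IN (SELECT date FROM updates)"
        )
        conn.execute("""
            INSERT INTO daily_stats (date, total)
            SELECT date, SUM(value) FROM raw_data
            WHERE date IN (SELECT date FROM updates)
            GROUP BY date
        """)
        conn.execute("DELETE FROM updates")

insert_raw(conn, "2011-04-14", 10)
insert_raw(conn, "2011-04-15", 5)
refresh_daily_stats(conn)
insert_raw(conn, "2011-04-15", 7)  # only 2011-04-15 becomes stale again
refresh_daily_stats(conn)
rows = conn.execute("SELECT date, total FROM daily_stats ORDER BY date").fetchall()
```

 The second refresh touches only the one date that changed, which is the
 point of keeping the updates table: the cost of a refresh depends on how
 many dates are stale, not on the size of the whole history.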

 The recompute step that takes most of the time is `refresh_user_stats()`,
 which is no surprise given the complexity of that function.  We should try
 to simplify this function, possibly by pre-computing partial results that
 can be reused for other statistics.  Ideally, recomputing aggregates
 should run in under one minute, given that we want to add more
 materialized views for more aggregate statistics in the future.  In
 particular, I'd like to know which SQL constructs slow us down so that
 we can avoid them going forward.
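
 One way to approach both ideas (reusable partial results and finding the
 slow parts) is to split a refresh into named steps and time each one.  The
 sketch below is only an assumption about how that could look: it uses
 Python's `sqlite3` module instead of the real database, and the step names,
 tables, and queries are hypothetical rather than the actual body of
 `refresh_user_stats()`.  If the database is PostgreSQL, `EXPLAIN ANALYZE`
 on the individual statements would give a finer breakdown per plan node.

```python
import sqlite3
import time

def time_steps(conn, steps):
    """Run each (name, sql) step in order; return (name, seconds), slowest first."""
    timings = []
    for name, sql in steps:
        start = time.perf_counter()
        conn.execute(sql)
        timings.append((name, time.perf_counter() - start))
    return sorted(timings, key=lambda t: t[1], reverse=True)

# Hypothetical tables; not the real metrics schema.
conn = sqlite3.connect(":memory:")
conn.executescript("""
    CREATE TABLE raw_data (date TEXT, country TEXT, requests INTEGER);
    CREATE TABLE daily_totals (date TEXT PRIMARY KEY, requests INTEGER);
    CREATE TABLE country_share (date TEXT, country TEXT, share REAL);
""")
conn.executemany(
    "INSERT INTO raw_data VALUES (?, ?, ?)",
    [("2011-04-15", "de", 30), ("2011-04-15", "us", 70)],
)

steps = [
    # Step 1 pre-computes a partial result (per-day totals) ...
    ("daily_totals",
     "INSERT INTO daily_totals "
     "SELECT date, SUM(requests) FROM raw_data GROUP BY date"),
    # ... which step 2 reuses instead of re-aggregating raw_data.
    ("country_share",
     "INSERT INTO country_share "
     "SELECT r.date, r.country, 1.0 * r.requests / t.requests "
     "FROM raw_data r JOIN daily_totals t ON r.date = t.date"),
]

report = time_steps(conn, steps)
for name, seconds in report:
    print(f"{name}: {seconds:.6f} s")
```

 Timing whole statements like this is coarse, but it is usually enough to
 tell which step deserves a closer look before digging into query plans.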

-- 
Ticket URL: <https://trac.torproject.org/projects/tor/ticket/2923>
Tor Bug Tracker & Wiki <https://trac.torproject.org/>
The Tor Project: anonymity online

