[tor-bugs] #13720 [Ooni]: Investigate possible performance improvements to the ooni-pipeline

Tor Bug Tracker & Wiki blackhole at torproject.org
Thu Jul 23 18:17:28 UTC 2015


#13720: Investigate possible performance improvements to the ooni-pipeline
-----------------------------+---------------------
     Reporter:  hellais      |      Owner:  hellais
         Type:  enhancement  |     Status:  new
     Priority:  normal       |  Milestone:
    Component:  Ooni         |    Version:
   Resolution:               |   Keywords:
Actual Points:               |  Parent ID:
       Points:               |
-----------------------------+---------------------

Comment (by dcf):

 Here is another data point, this time using many more reports: all the
 http_requests reports up to 2015-07-17 (2,758 of them, 33 GB compressed).
 {{{
 yaml.load_all(f, Loader=yaml.CSafeLoader)
 real    133m23.210s
 user    147m24.860s
 sys     2m8.624s
 }}}
 This is faster than processing 1.5 GB of files with yaml.safe_load_all
 was, even though the processing program now does more work. The program
 extracts the body of every response, runs a set of regexes against each
 one, and optionally saves the bodies to files.
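
 The processing loop described above might look roughly like the sketch
 below. It is not the actual program: the pattern names, the sample data,
 and the helper names (iter_bodies, scan) are made up for illustration, and
 the requests/response/body layout of a report entry is an assumption. It
 leaves out decompression and the optional saving of bodies to files.

```python
import io
import re

import yaml

# Prefer the libyaml-backed loader; fall back to the pure-Python SafeLoader
# when PyYAML was built without libyaml bindings.
Loader = getattr(yaml, "CSafeLoader", yaml.SafeLoader)

# Hypothetical patterns, for illustration only.
PATTERNS = {
    "blocked": re.compile(r"access denied|blocked", re.IGNORECASE),
    "captcha": re.compile(r"captcha", re.IGNORECASE),
}


def iter_bodies(stream):
    """Yield every response body found in a multi-document YAML stream.

    Assumes each entry is a mapping with a "requests" list whose items
    carry a "response" mapping with a "body" string.
    """
    for doc in yaml.load_all(stream, Loader=Loader):
        if not isinstance(doc, dict):
            continue
        for exchange in doc.get("requests") or []:
            if not isinstance(exchange, dict):
                continue
            response = exchange.get("response")
            if not isinstance(response, dict):
                continue
            body = response.get("body")
            if isinstance(body, str):
                yield body


def scan(stream):
    """Count, per pattern, how many response bodies match it."""
    hits = {name: 0 for name in PATTERNS}
    for body in iter_bodies(stream):
        for name, pattern in PATTERNS.items():
            if pattern.search(body):
                hits[name] += 1
    return hits


# Tiny made-up report: a header document followed by two entries.
SAMPLE = """\
---
test_name: http_requests
...
---
input: http://example.com/
requests:
- request: {url: 'http://example.com/'}
  response: {code: 403, body: '<h1>Access Denied</h1>'}
...
---
input: http://example.org/
requests:
- request: {url: 'http://example.org/'}
  response: {code: 200, body: 'Please solve this CAPTCHA'}
...
"""

if __name__ == "__main__":
    print(scan(io.StringIO(SAMPLE)))
```

 Because yaml.load_all returns a generator, entries are parsed one at a
 time and the whole report never has to be held in memory.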

--
Ticket URL: <https://trac.torproject.org/projects/tor/ticket/13720#comment:3>
Tor Bug Tracker & Wiki <https://trac.torproject.org/>
The Tor Project: anonymity online

