[tor-bugs] #13600 [Onionoo]: Improve bulk imports of descriptor archives

Tor Bug Tracker & Wiki blackhole at torproject.org
Wed Aug 19 19:46:00 UTC 2015


#13600: Improve bulk imports of descriptor archives
-----------------------------+-----------------
     Reporter:  karsten      |      Owner:
         Type:  enhancement  |     Status:  new
     Priority:  normal       |  Milestone:
    Component:  Onionoo      |    Version:
   Resolution:               |   Keywords:
Actual Points:               |  Parent ID:
       Points:               |
-----------------------------+-----------------

Comment (by karsten):

 @iwakeh

 I don't have a good answer for you, because I didn't have the chance to go
 through all comments on this ticket and the others yet.  But if I were to
 re-import descriptor archives into a new Onionoo instance, I'd do the
 following:

  - Use latest master of the official repository, nothing else.
  - Decompress (but not extract) tarballs using `unxz`.
  - Start with importing a single tarball or all tarballs of a single
 month, then try with three months, then twelve, etc.  You'll probably run
 into out-of-memory problems at some point, and you'll have to find out how
 many tarballs you can process at once.  Keep in mind that tarballs got
 bigger and bigger over time.
  - Once an import run completes, move away tarballs, because otherwise
 they will be re-imported.
  - Make backups of the `status/` directory after each import run.

 Sorry that this is not as convenient as it should be.

--
Ticket URL: <https://trac.torproject.org/projects/tor/ticket/13600#comment:17>
Tor Bug Tracker & Wiki <https://trac.torproject.org/>
The Tor Project: anonymity online


More information about the tor-bugs mailing list