[tor-bugs] #11788 [Metrics Data Processor]: Consider providing descriptor tarballs as .tar.xz rather than .tar.bz2

Tor Bug Tracker & Wiki blackhole at torproject.org
Wed May 7 18:14:19 UTC 2014


#11788: Consider providing descriptor tarballs as .tar.xz rather than .tar.bz2
----------------------------------------+-----------------
     Reporter:  karsten                 |      Owner:
         Type:  enhancement             |     Status:  new
     Priority:  normal                  |  Milestone:
    Component:  Metrics Data Processor  |    Version:
   Resolution:                          |   Keywords:
Actual Points:                          |  Parent ID:
       Points:                          |
----------------------------------------+-----------------

Comment (by karsten):

 Sample 2:

 {{{
 $ ls -lh votes-2014-04.tar.bz2
 -rw-r--r-- 1 metrics metrics 4.9G May  7 06:15 votes-2014-04.tar.bz2
 $ bunzip2 votes-2014-04.tar.bz2
 $ ls -lh votes-2014-04.tar
 -rw-r--r-- 1 metrics metrics  13G May  7 14:14 votes-2014-04.tar
 $ time xz -9 votes-2014-04.tar
 real    123m8.199s
 user    117m30.129s
 sys     0m21.541s
 $ ls -lh votes-2014-04.tar.xz
 -rw-r--r-- 1 metrics metrics 172M May  7 14:14 votes-2014-04.tar.xz
 }}}

 That's an impressive reduction by factor 29.  I had no idea!

 What will be funny is when people decompress a few votes tarballs (or even
 all of them) on their hard disk and find that these tarballs occupy 77
 times the disk space as in compressed form.  Guess we should add a warning
 to data.html.

--
Ticket URL: <https://trac.torproject.org/projects/tor/ticket/11788#comment:4>
Tor Bug Tracker & Wiki <https://trac.torproject.org/>
The Tor Project: anonymity online


More information about the tor-bugs mailing list