[tor-dev] Making microdescriptor tarballs available on metrics.tpo

Damian Johnson atagar at torproject.org
Wed Jan 22 03:32:22 UTC 2014

> Damian, can you try to parse these descriptors using stem, to see if the
> descriptor annotations are correct and if stem can parse them without
> issues?

Hi Karsten, sorry about the delay! Yup, stem parses them just fine
(though processing compressed tarballs still takes an unpleasantly
long time)...

% du -h microdescs-2014-01.tar.bz2
1.8M    microdescs-2014-01.tar.bz2

% cat parse.py
from stem.descriptor.reader import DescriptorReader

counter = 0

with DescriptorReader(["microdescs-2014-01.tar.bz2"]) as reader:
  for desc in reader:
    counter += 1

print "Found %i microdescriptors" % counter

% time python parse.py
Found 14999 microdescriptors

real    67m15.022s
user    65m50.259s
sys    1m13.717s

Cheers! -Damian

More information about the tor-dev mailing list