 
            On Sat, Jun 15, 2013 at 3:03 PM, Jacob Appelbaum <jacob@appelbaum.net>wrote:
Heya,
Greetings from #404Lab in Tunis, Tunisia!
I'm testing the ooniprobe in the censorship lab. I'm making a running list of issues and I'll outline them below.
When running the following test:
./bin/ooniprobe -l tunisia-404lab-netcache-000.log nettests/blocking/http_requests.py;
I find that the test hangs for ~10 minutes:
Log opened. [D] No test deck detected Starting Tor... [D] Setting control port as 9251 [D] Setting SOCKS port as 9250 [D] 10%: Finishing handshake with directory server [D] 15%: Establishing an encrypted directory connection [D] 20%: Asking for networkstatus consensus [D] 25%: Loading networkstatus consensus [D] 45%: Asking for relay descriptors [D] 80%: Connecting to the Tor network [D] 90%: Establishing a Tor circuit [D] 100%: Done [D] Building a TorState Successfully bootstrapped Tor [D] We now have the following circuits: [D] * <Circuit 1 BUILT [154.35.32.5] for GENERAL> [D] * <Circuit 2 BUILT [31.172.30.3] for GENERAL> [D] * <Circuit 3 BUILT [198.211.114.130] for GENERAL> [D] * <Circuit 5 BUILT [194.132.32.43] for GENERAL> [D] * <Circuit 6 BUILT [31.172.30.3 46.166.174.209 93.115.241.2] for GENERAL> [D] * <Circuit 7 EXTENDED [198.211.114.130 74.120.13.132] for GENERAL> [D] * <Circuit 8 EXTENDED [] for GENERAL> [D] * <Circuit 9 EXTENDED [] for GENERAL> [D] * <Circuit 10 EXTENDED [] for GENERAL> [D] * <Circuit 11 EXTENDED [] for GENERAL> [D] * <Circuit 12 EXTENDED [] for GENERAL> [D] * <Circuit 13 EXTENDED [] for GENERAL> [D] * <Circuit 14 EXTENDED [] for GENERAL> [D] * <Circuit 15 EXTENDED [] for GENERAL> [D] * <Circuit 16 EXTENDED [] for GENERAL> [D] Obtained our IP address from a Tor Relay None Found your IP via Tor 41.231.22.232 We will include some geo data in the report Your AS number is: AS2609 Reporting using collector: httpo://nkvphnp3p6agi5qq.onion [D] adding callback for startNetTest [D] Creating ./report-http_requests_test-2013-06-15T100736Z.yamloo [D] Writing report with YAML reporter [D] Created report with <ooni.reporter.YAMLReporter object at 0x334ca90> Reporting httpo://nkvphnp3p6agi5qq.onion/report [D] Sending {"software_name": "ooniprobe", "software_version": "0.0.11", "content": "---\noptions: []\nprobe_asn: AS2609\nprobe_cc: TN\nprobe_ip: 41.231.22.232\nsoftware_name: ooniprobe\nsoftware_version: 0.0.11\nstart_time: 1371290856.723735\ntest_name: http_requests_test\ntest_version: 0.2.3\n...\n", "test_version": "0.2.3", "test_name": "http_requests_test", "probe_asn": "AS2609"} Creating report with OONIB Reporter. Please be patient. This may take up to 1-2 minutes... [D] Created report with id
2013-06-15T100948Z_AS2609_UIsgF34hqK8qn2GaxQtaynMnlIQedKCNaH2fNVoY5voh1I4Kdw [D] Created report with <ooni.reporter.OONIBReporter object at 0x334cc90> [D] Starting this task <generator object generateMeasurements at 0x3c78870> [D] Running <class 'nettests.blocking.http_requests.HTTPRequestsTest'> test_get [D] Finished test setup
...
I'm not clear what will happen next - it just hangs. I killed it after around ten minutes.
It looks like there is a bug where if you do not specify an input file or host argument that ooni doesn't realize this until after test setup. On my system, with the current git master (514b707df4c701a1e4c58f726c0261e26fabdb32) I get the following error: [D] Finished test setup Unhandled error in Deferred: Unhandled Error Traceback (most recent call last): File "/home/ooni/.virtualenvs/ooni/local/lib/python2.7/site-packages/twisted/internet/defer.py", line 380, in callback self._startRunCallbacks(result) File "/home/ooni/.virtualenvs/ooni/local/lib/python2.7/site-packages/twisted/internet/defer.py", line 488, in _startRunCallbacks self._runCallbacks() File "/home/ooni/.virtualenvs/ooni/local/lib/python2.7/site-packages/twisted/internet/defer.py", line 575, in _runCallbacks current.result = callback(current.result, *args, **kw) File "/home/ooni/.virtualenvs/ooni/local/lib/python2.7/site-packages/twisted/internet/defer.py", line 1126, in gotResult _inlineCallbacks(r, g, deferred) --- <exception caught here> --- File "/home/ooni/.virtualenvs/ooni/local/lib/python2.7/site-packages/twisted/internet/defer.py", line 1070, in _inlineCallbacks result = g.send(result) File "/home/ooni/ooni-probe/ooni/director.py", line 227, in startNetTest self.measurementManager.schedule(net_test.generateMeasurements()) File "/home/ooni/ooni-probe/ooni/managers.py", line 107, in schedule self._fillSlots() File "/home/ooni/ooni-probe/ooni/managers.py", line 58, in _fillSlots task = self._tasks.next() File "/home/ooni/ooni-probe/ooni/nettest.py", line 508, in generateMeasurements measurement = self.makeMeasurement(test_class, method, input) File "/home/ooni/ooni-probe/ooni/nettest.py", line 483, in makeMeasurement measurement = Measurement(test_class, test_method, test_input) File "/home/ooni/ooni-probe/ooni/tasks.py", line 114, in __init__ self.testInstance.setUp() File "data/nettests/blocking/http_requests.py", line 53, in setUp raise Exception("No input specified") exceptions.Exception: No input specified The problem is that some nettests support multiple input arguments and therefore cannot rely on specifying that an option is required (see: class NetTest __init__ requiredOptions). We should either extend the NetTest API to cleanly handle cases where NetTests can take input from either a single host argument or an input file, or axe the single host argument and make the input file a required option.
I followed up with the following test:
./bin/ooniprobe -l tunisia-404lab-netcache-000.log nettests/manipulation/http_invalid_request_line.py;
It worked without issues.
The following test goes on for a while:
./bin/ooniprobe -l tunisia-404lab-netcache-000.log nettests/blocking/http_requests.py -f inputs/top-1000.txt
It starts throwing the following errors: Performing GET request to google.com.ar [D] Performing request google.com.ar GET {'User-Agent': ['Mozilla/5.0 (Windows; U; Windows NT 5.1; de; rv:1.9.2) Gecko/20100115 Firefox/3.6']} [!] Unsupported scheme: '' [!] Task <ooni.tasks.Measurement object at 0x3302d90> has failed 6 times Traceback (most recent call last): File "/ooni-probe/ooni/managers.py", line 59, in _fillSlots self._run(task) File "/ooni-probe/ooni/managers.py", line 70, in _run d = task.start() File "/ooni-probe/ooni/tasks.py", line 91, in start return BaseTask.start(self) File "/ooni-probe/ooni/tasks.py", line 38, in start self._running = defer.maybeDeferred(self.run) --- <exception caught here> --- File "/ooniprobe/lib/python2.7/site-packages/twisted/internet/defer.py", line 138, in maybeDeferred result = f(*args, **kw) File "/ooni-probe/ooni/tasks.py", line 129, in run d = self.netTestMethod() File "/ooni-probe/nettests/blocking/http_requests.py", line 117, in test_get headers=headers) File "/ooni-probe/ooni/templates/httpt.py", line 313, in doRequest body_producer) File "/ooni-probe/ooni/utils/txagentwithsocks.py", line 179, in request return client.Agent.request(self, method, uri, headers, bodyProducer) File "/lib/python2.7/site-packages/twisted/web/client.py", line 1218, in request parsedURI.port) File "/ooni-probe/ooni/utils/txagentwithsocks.py", line 199, in _getEndpoint raise SchemeNotSupported("Unsupported xxx scheme: %r" % (scheme,)) exceptions.NameError: global name 'SchemeNotSupported' is not defined [!] Permanent failure for <ooni.tasks.Measurement object at 0x3302d90> Failed doing measurement: <ooni.tasks.Measurement object at 0x3302d90> [D] Starting this task <ooni.tasks.ReportEntry object at 0xbf0c950> [D] Starting this task <ooni.tasks.ReportEntry object at 0xbf0ca50>
Your input URL list should be a list of URLs (not just domains, but should specify the protocol). We should also add inputProcessor's for existing nettests so that they can take more flexible input lists. I have added inputProcessors to a few tests, but not yet http_requests. It then hangs like so:
[D] Checking all tasks for completion 102 == 133 [D] Successfully performed report <ooni.tasks.ReportEntry object at 0xc1405d0> [D] None [D] Checking all tasks for completion 103 == 133 [D] Successfully performed report <ooni.tasks.ReportEntry object at 0xc1408d0> [D] None
I killed it - I'm not sure why it hangs. The host list is just the top alexa hosts.
Probably is the case of an unhandled traceback (due to a bad input) -- it should get caught, retried, and eventually skipped, but clearly it doesn't.
% ./bin/ooniprobe -l tunisia-404lab-netcache-000.log nettests/blocking/tcpconnect.py -f inputs/top-1000-port80.txt
This worked without issue.
Oh good :). BTW, I added inputProcessor support to tcpconnect so you can use it with a URL list. It doesn't currently support URLs with a specified port #, though.
I tried:
% ./bin/ooniprobe -l tunisia-404lab-netcache-000.log nettests/blocking/http_requests.py -f inputs/top-1000.txt
I had similar hangs - so the tests never finished:
[D] Checking all tasks for completion 100 == 133 [D] Successfully performed report <ooni.tasks.ReportEntry object at 0xcc7d190> [D] None [D] Checking all tasks for completion 101 == 133 [D] Successfully performed report <ooni.tasks.ReportEntry object at 0xcc7d490> [D] None [D] Checking all tasks for completion 102 == 133 [D] Successfully performed report <ooni.tasks.ReportEntry object at 0xcc7d790> [D] None [D] Checking all tasks for completion 103 == 133 [D] Successfully performed report <ooni.tasks.ReportEntry object at 0xcc7da90> [D] None
Thoughts?
These logs aren't very descriptive, though, I'd guess it's the same URL issue as above. --Aaron
All the best, Jake _______________________________________________ ooni-dev mailing list ooni-dev@lists.torproject.org https://lists.torproject.org/cgi-bin/mailman/listinfo/ooni-dev