[tor-bugs] #6180 [Ooni]: Detecting censorship in HTTP pages

Tor Bug Tracker & Wiki torproject-admin at torproject.org
Sun Aug 19 00:23:12 UTC 2012


#6180: Detecting censorship in HTTP pages
----------------------------+-----------------------------------------------
 Reporter:  hellais         |          Owner:  hellais     
     Type:  task            |         Status:  needs_review
 Priority:  normal          |      Milestone:              
Component:  Ooni            |        Version:              
 Keywords:  SponsorH201206  |         Parent:              
   Points:                  |   Actualpoints:              
----------------------------+-----------------------------------------------

Comment(by hellais):

 We discussed a lot about this during the OONI hackfest with mct and this
 is what we talked about:

 We can divide the possible approaches for distinguishing a blocked page
 from the correct result page into two categories: Statistical and
 Heuristics.

 # Statistical approach

 - Bayesian

 - SVM

 - Fuzzy signature matching

 - Other classifiers

 What we want to do is run these classifiers on the corpus of block pages
 and real website.

 # Heuristics

 - List of regexp that apply to censorship pages


 We also talked about having clients tell the backend what it got as a
 response and having the backend figure out if such a page should be a
 block page or the correct result.

-- 
Ticket URL: <https://trac.torproject.org/projects/tor/ticket/6180#comment:3>
Tor Bug Tracker & Wiki <https://trac.torproject.org/>
The Tor Project: anonymity online


More information about the tor-bugs mailing list