[tor-talk] panopticlick data

Joe Btfsplk joebtfsplk at gmx.com
Tue Oct 1 18:43:10 UTC 2013


On 10/1/2013 12:06 PM, Nicolas Vigier wrote:
> On Tue, 01 Oct 2013, Joe Btfsplk wrote:
>
>> Not sure I understand the question in this context.  Without
>> cookies, I don't expect them to identify repeat visitors.  I read
>> their full paper on how they use the data collected
>> https://panopticlick.eff.org/browser-uniqueness.pdf
>>
>> Me visiting 2 - 4 more times, or even the other site visitors - *in
>> the same 2 - 4 min. span*, wouldn't (actually) affect the statistics
>> & lower their reported uniqueness estimate by factors of 2, 3 or
>> more.
>>
>> Repeating the test 4 times, almost immediately (clearing cache
>> between), out of an existing data base of millions of other site
>> visitors, wouldn't lower my uniqueness from 1 in 1.7 million, then
>> to 1 in 700,000, to 1 in 500,000.
> 1st visit: 3 444 000
> 2nd visit: 3 444 000 / 2 = 1 722 000
> 3rd visit: 3 444 000 / 3 = 1 148 000
> 4th visit: 3 444 000 / 4 = 861 000
> 5th visit: 3 444 000 / 5 = 688 800
> 6th visit: 3 444 000 / 6 = 574 000
> etc ...
>
Thanks.  I'm not a statistics major, so you may have to explain, but are 
you saying that the 1st time I visit w/ a given set of browser 
characteristics, and they've only seen 1:3,444,000 browsers w/ exactly 
the same traits, then on my 2nd visit, they've now seen 2 identical 
browsers in 3,444,001  = 1: 1,722,000.5?

All that seems to mean is, they've not seen many browsers like mine 
(poor distribution), IF... it started out as 1 in 3.44 mil, or anything 
close - as mine would be a VERY common setup.

All the individual characteristics tested were very common, per their 
results.  Most are < 1:10 & none > 1:100, except the screen size (which 
seems incorrect).  Seems unlikely my 1920 width monitor only has 1664 
"usable" browser pane width (what they show). When they show *1920* 
width for TBB, but the 2 browser panes are the same in width.  Only 
thing taking up horizontal space on either browser is the vertical 
scroll bar, which are pretty much identical.

*NOTE:*  The *"bits of identifying information"* for individual browser 
characteristics (useragent, cookies enabled, etc.) & uniqueness (1 in X 
have this) of the INDIVIDUAL characteristics do NOT change, as you run 
the test repeatedly.
Those values must be calculated from a set data base & don't seem to be 
affected by your current visit.

Assuming trackers had a large enough sample space to have a high 
confidence level, for fingerprinting purposes, would it matter if only 1 
in 10,953, or 1 in 10,953,000 browsers were like yours?  As long as they 
could identify A browser w/ the same uniqueness (EXACT same 
characteristics - entering & exiting).  Even w/o Flash or Java enabled & 
revealing system fonts, etc.

Only way I see that's not true is if 100's of users w/ EXACT same 
browser characteristics (right down to same screen characteristics), 
used the same entry / exit relays at the SAME time.  That's unlikely, 
unless TBB starts spoofing screen size, the same for everyone.

I believe in same TBB version (maybe the same in many versions) they 
spoof the useragent & time zone, but wouldn't differences in screen 
sizes & color bit ALONE, among a few users on one entry / exit 
combination, at a given moment be enough to fingerprint one user?




More information about the tor-talk mailing list