[anti-censorship-alerts] [FIRING:1] ignoring bridges by functionality

meskio meskio at torproject.org
Mon Nov 13 09:46:53 UTC 2023


Quoting Roger Dingledine (2023-11-12 00:42:54)
> On Sat, Nov 11, 2023 at 11:13:00PM +0000, alertmanager at hetzner-nbg1-02.torproject.org wrote:
> > ## Firing Alerts
> > 
> > ----- 
> > Time: 2023-11-11 23:12:29.934 +0000 UTC
> > Summary:  Too many bridges are dysfuntional 
> > Description:  The fraction of functional bridges is too low for rdsys 
> 
> I went to look at bridgestrap right after this alert, and bridgestrap
> seems to be doing fine. So I am wondering how to debug it on the rdsys
> side -- to understand which bridges it is considering, and which ones
> it thinks are down and why -- but I don't know how to. I added a comment
> to https://gitlab.torproject.org/tpo/anti-censorship/rdsys/-/issues/177
> as a poor substitute. :)

Yes, I think adding that information will be useful to debug it. And I'm 
planning to work on bridgestrap this week, I hope to come along to do it.

I see this problem is usually appearing for a short period of time, ~30min that 
is the period of rdsys between scans on the bridge descriptors. It does happen 
when there is a restart on either rdsys or bridgestrap, but also sometimes on 
other situation that I haven't identified.

I propose modifying the alert, so is only triggered if the problem is at least 
for 1h, I think is fine to ignore this problem if is just for 30mins there:
https://gitlab.torproject.org/tpo/tpa/prometheus-alerts/-/merge_requests/38

-- 
meskio | https://meskio.net/
-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-
 My contact info: https://meskio.net/crypto.txt
-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-
Nos vamos a Croatan.


More information about the anti-censorship-alerts mailing list