Teor, Juga
There's a lot of things fighting for my attention right now, so you might have noticed I've slowed way down on attending to sbws tickets/PRs/etc. I think time will free up in the next few days.
I think sbws is in a very good place code-wise right now. I don't think much more **has** to be done to the code. Even though I enjoy adding things like the state file (GHPR#236 [2]), I don't think that was a good use of my time.
It looks like there's a lot of check boxes Juga has made regarding making a Debian package[0]. Those should get checked. These are important.
However, I think the absolute most important thing for us to be spending our time on right now is deciding what "good" results are and verifying sbws produces "good" results.
To accomplish this, I think one of the two suggestions I made in a comment on GH#182 [1] (quoted here) is what we should be doing.
1. Run torflow and sbws side-by-side (but not at the same time) to remove more variables. This has the added benefit of us having access to the raw scanner results from torflow before it does whatever magic scaling it does. OR
2. Ask for access to raw scanner results from someone running torflow.
I fear sbws is doomed to die the death of the new bandwidth scanners before it if we don't start seriously verifying sbws is "good" or if I personally slowly stop working/coordinating work on it.
Thanks
Matt
[0]: https://trac.torproject.org/projects/tor/ticket/26848 [1]: https://github.com/pastly/simple-bw-scanner/issues/182#issuecomment-40425005... [2]: https://github.com/pastly/simple-bw-scanner/pull/236
Matt Traudt:
Teor, Juga
There's a lot of things fighting for my attention right now, so you might have noticed I've slowed way down on attending to sbws tickets/PRs/etc. I think time will free up in the next few days.
I think sbws is in a very good place code-wise right now. I don't think much more **has** to be done to the code. Even though I enjoy adding things like the state file (GHPR#236 [2]), I don't think that was a good use of my time.
It looks like there's a lot of check boxes Juga has made regarding making a Debian package[0]. Those should get checked. These are important.
However, I think the absolute most important thing for us to be spending our time on right now is deciding what "good" results are and verifying sbws produces "good" results.
To accomplish this, I think one of the two suggestions I made in a comment on GH#182 [1] (quoted here) is what we should be doing.
- Run torflow and sbws side-by-side (but not at the same time) to
remove more variables. This has the added benefit of us having access to the raw scanner results from torflow before it does whatever magic scaling it does. OR
In that ticket you also mentioned that someone that already runs torflow should also run sbws. I said i can run both, and still the case if needed.
- Ask for access to raw scanner results from someone running torflow.
I fear sbws is doomed to die the death of the new bandwidth scanners before it if we don't start seriously verifying sbws is "good" or if I personally slowly stop working/coordinating work on it.
I don't think that's the case. I've not forget it... and i'm sure teor neither. Some of the last work we have done is regarding getting the bandwidth files archived, what will also help to determine whether sbws results are "good".
If 1. would be run by someone else, getting [0] done is indeed important and i'm currently working on it.
And maybe we aren't able to determine how "good" sbws results are until it actually starts being run by dirauths, for which [0] is still important.
Thanks for sharing your thoughts, juga.
Thanks
Matt
tor-dev mailing list tor-dev@lists.torproject.org https://lists.torproject.org/cgi-bin/mailman/listinfo/tor-dev
I'm happy and prepared to run sbws and torflow side by side. I'm a little less swamped than I was a month ago. I don't need a debian package; I'd rather run it from a git clone.
I think the only things I can't do are a) give you access to the box directly (but I can make whatever files/logs/raw results that you want available to you over HTTP) b) stop running torflow. (Unless we're ready to switch a live bwauth over to sbws.)
FWIW, I have the advantage of having archived my (semi-)raw bwauth data for a while: https://bwauth.ritter.vg/bwauth/
-tom
On 19 July 2018 at 10:16, juga juga@riseup.net wrote:
Matt Traudt:
Teor, Juga
There's a lot of things fighting for my attention right now, so you might have noticed I've slowed way down on attending to sbws tickets/PRs/etc. I think time will free up in the next few days.
I think sbws is in a very good place code-wise right now. I don't think much more **has** to be done to the code. Even though I enjoy adding things like the state file (GHPR#236 [2]), I don't think that was a good use of my time.
It looks like there's a lot of check boxes Juga has made regarding making a Debian package[0]. Those should get checked. These are important.
However, I think the absolute most important thing for us to be spending our time on right now is deciding what "good" results are and verifying sbws produces "good" results.
To accomplish this, I think one of the two suggestions I made in a comment on GH#182 [1] (quoted here) is what we should be doing.
- Run torflow and sbws side-by-side (but not at the same time) to
remove more variables. This has the added benefit of us having access to the raw scanner results from torflow before it does whatever magic scaling it does. OR
In that ticket you also mentioned that someone that already runs torflow should also run sbws. I said i can run both, and still the case if needed.
- Ask for access to raw scanner results from someone running torflow.
I fear sbws is doomed to die the death of the new bandwidth scanners before it if we don't start seriously verifying sbws is "good" or if I personally slowly stop working/coordinating work on it.
I don't think that's the case. I've not forget it... and i'm sure teor neither. Some of the last work we have done is regarding getting the bandwidth files archived, what will also help to determine whether sbws results are "good".
If 1. would be run by someone else, getting [0] done is indeed important and i'm currently working on it.
And maybe we aren't able to determine how "good" sbws results are until it actually starts being run by dirauths, for which [0] is still important.
Thanks for sharing your thoughts, juga.
Thanks
Matt
tor-dev mailing list tor-dev@lists.torproject.org https://lists.torproject.org/cgi-bin/mailman/listinfo/tor-dev
tor-dev mailing list tor-dev@lists.torproject.org https://lists.torproject.org/cgi-bin/mailman/listinfo/tor-dev
Hi,
On 20 Jul 2018, at 01:16, juga juga@riseup.net wrote:
Matt Traudt:
Teor, Juga
There's a lot of things fighting for my attention right now, so you might have noticed I've slowed way down on attending to sbws tickets/PRs/etc. I think time will free up in the next few days.
I think sbws is in a very good place code-wise right now. I don't think much more **has** to be done to the code. Even though I enjoy adding things like the state file (GHPR#236 [2]), I don't think that was a good use of my time.
It looks like there's a lot of check boxes Juga has made regarding making a Debian package[0]. Those should get checked. These are important.
However, I think the absolute most important thing for us to be spending our time on right now is deciding what "good" results are and verifying sbws produces "good" results.
You’re right - we need to know if we can switch to sbws, and we can’t use sbws unless it has reasonable results.
If the results aren’t reasonable, we might need to: * do further processing on the sbws results (like scaling) * change the sbws measurement design
The good news is that sbws ranks are approximately the same as torflow ranks. So the measurement design is probably ok.
But torflow weights are larger (max 100,000) than sbws weights (max 4000), so we will need to scale the sbws results.
torflow results are also steeper than sbws results: the ratio between high and low ranked relays is 1000:1 in torflow, but 10:1 in sbws.
If we want to, we can make sbws match torflow by defining a scaling algorithm that scales large relays more than small relays. But we could also decide that the flatter sbws curve is better for the network, because high-weight relays are overloaded.
Let’s do a few more experiments before we decide.
To accomplish this, I think one of the two suggestions I made in a comment on GH#182 [1] (quoted here) is what we should be doing.
- Run torflow and sbws side-by-side (but not at the same time) to
remove more variables. This has the added benefit of us having access to the raw scanner results from torflow before it does whatever magic scaling it does. OR
In that ticket you also mentioned that someone that already runs torflow should also run sbws. I said i can run both, and still the case if needed.
Ok, so juga can run sbws and torflow at different times on the same machine.
On 20 Jul 2018, at 01:34, Tom Ritter tom@ritter.vg wrote:
I'm happy and prepared to run sbws and torflow side by side. I'm a little less swamped than I was a month ago. I don't need a debian package; I'd rather run it from a git clone.
I think the only things I can't do are a) give you access to the box directly (but I can make whatever files/logs/raw results that you want available to you over HTTP) b) stop running torflow. (Unless we're ready to switch a live bwauth over to sbws.)
And tom can run sbws and torflow at the same time on the same machine.
I think we should run both comparisons, wait a week so they are in a stable state, and then check the results for a few weeks.
T