I concur with the discussion so far; once you are into multi-tor-daemon deployments, the "tuning" becomes rather organic.
I would look at network throughput on the tor nodes which are serving as reverse-proxies, and correlate that against load.
Frankly: Facebook currently delivers its entire onion service through two daemons for each of three onion addresses (www, cdn, sbx), six daemons in total; so the chokepoint is far more likely to be the content-delivery backend than the tor daemons.
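
For the throughput-versus-load check suggested above, here is a rough sketch of the arithmetic, not a finished tool. It assumes you already collect two series at a fixed interval on each reverse-proxy node: a cumulative byte counter for the network interface and a load figure (for example the 1-minute load average). The function names and the sample numbers are made up for illustration.

```python
import math


def throughput_bps(byte_counters, interval_s):
    """Turn cumulative byte counters into bytes/second for each interval."""
    return [(b - a) / interval_s
            for a, b in zip(byte_counters, byte_counters[1:])]


def pearson(xs, ys):
    """Pearson correlation coefficient of two equal-length series."""
    if len(xs) != len(ys) or len(xs) < 2:
        raise ValueError("need two series of equal length, at least 2 samples")
    mean_x = sum(xs) / len(xs)
    mean_y = sum(ys) / len(ys)
    cov = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    var_x = sum((x - mean_x) ** 2 for x in xs)
    var_y = sum((y - mean_y) ** 2 for y in ys)
    if var_x == 0 or var_y == 0:
        raise ValueError("a constant series has no defined correlation")
    return cov / math.sqrt(var_x * var_y)


if __name__ == "__main__":
    # Hypothetical samples taken every 60 seconds on one proxy node.
    counters = [0, 6_000_000, 15_000_000, 21_000_000, 33_000_000]
    load = [0.4, 0.7, 0.5, 1.1]  # one load sample per interval
    rates = throughput_bps(counters, 60)
    print("correlation:", pearson(rates, load))
```

Read the result loosely: if load on the proxy nodes stays modest while throughput is nowhere near what the box can push, that would be consistent with the bottleneck sitting behind the proxies rather than in tor.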
-a