tor-dev
  Threads by month 
                
            - ----- 2025 -----
- October
- September
- August
- July
- June
- May
- April
- March
- February
- January
- ----- 2024 -----
- December
- November
- October
- September
- August
- July
- June
- May
- April
- March
- February
- January
- ----- 2023 -----
- December
- November
- October
- September
- August
- July
- June
- May
- April
- March
- February
- January
- ----- 2022 -----
- December
- November
- October
- September
- August
- July
- June
- May
- April
- March
- February
- January
- ----- 2021 -----
- December
- November
- October
- September
- August
- July
- June
- May
- April
- March
- February
- January
- ----- 2020 -----
- December
- November
- October
- September
- August
- July
- June
- May
- April
- March
- February
- January
- ----- 2019 -----
- December
- November
- October
- September
- August
- July
- June
- May
- April
- March
- February
- January
- ----- 2018 -----
- December
- November
- October
- September
- August
- July
- June
- May
- April
- March
- February
- January
- ----- 2017 -----
- December
- November
- October
- September
- August
- July
- June
- May
- April
- March
- February
- January
- ----- 2016 -----
- December
- November
- October
- September
- August
- July
- June
- May
- April
- March
- February
- January
- ----- 2015 -----
- December
- November
- October
- September
- August
- July
- June
- May
- April
- March
- February
- January
- ----- 2014 -----
- December
- November
- October
- September
- August
- July
- June
- May
- April
- March
- February
- January
- ----- 2013 -----
- December
- November
- October
- September
- August
- July
- June
- May
- April
- March
- February
- January
- ----- 2012 -----
- December
- November
- October
- September
- August
- July
- June
- May
- April
- March
- February
- January
- ----- 2011 -----
- December
- November
- October
- September
- August
- July
- June
- May
- April
- March
- February
January 2014
- 48 participants
- 51 discussions
 
                        
                    28 May '15
                    
                        Hi, all!
I've been trying to fill in all the cracks and corners for a revamp of
the hidden services protocol, based on earlier writings by George
Kadianakis and other discussions on the mailing list.  (See draft
acknowledgments section below.)
After a bunch of comments, I'm ready to give this a number and call it
(draft) proposal 224.  I'd like to know what doesn't make sense, what
I need to explain better, and what I need to design better.  I'd like
to fill in the gaps and turn this into a more full document.  I'd like
to answer the open questions. Comments are most welcome, especially if
they grow into improvements.
FWIW, I am likely to be offline for most of the current weekend,
because of Thanksgiving, so please be patient with my reply speed; I
hope to catch up with emails next week.
Filename: 224-rend-spec-ng.txt
Title: Next-Generation Hidden Services in Tor
Author: Nick Mathewson
Created: 2013-11-29
Status: Draft
-1. Draft notes
   This document describes a proposed design and specification for
   hidden services in Tor version 0.2.5.x or later. It's a replacement
   for the current rend-spec.txt, rewritten for clarity and for improved
   design.
   Look for the string "TODO" below: it describes gaps or uncertainties
   in the design.
   Change history:
       2013-11-29: Proposal first numbered. Some TODO and XXX items remain.
0. Hidden services: overview and preliminaries.
   Hidden services aim to provide responder anonymity for bidirectional
   stream-based communication on the Tor network. Unlike regular Tor
   connections, where the connection initiator receives anonymity but
   the responder does not, hidden services attempt to provide
   bidirectional anonymity.
   Other features include:
     * [TODO: WRITE ME once there have been some more drafts and we know
       what the summary should say.]
   Participants:
      Operator -- A person running a hidden service
      Host, "Server" -- The Tor software run by the operator to provide
         a hidden service.
      User -- A person contacting a hidden service.
      Client -- The Tor software running on the User's computer
      Hidden Service Directory (HSDir) -- A Tor node that hosts signed
        statements from hidden service hosts so that users can make
        contact with them.
      Introduction Point -- A Tor node that accepts connection requests
        for hidden services and anonymously relays those requests to the
        hidden service.
      Rendezvous Point -- A Tor node to which clients and servers
        connect and which relays traffic between them.
0.1. Improvements over previous versions.
   [TODO write me once there have been more drafts and we know what the
   summary should say.]
0.2. Notation and vocabulary
   Unless specified otherwise, all multi-octet integers are big-endian.
   We write sequences of bytes in two ways:
     1. A sequence of two-digit hexadecimal values in square brackets,
        as in [AB AD 1D EA].
     2. A string of characters enclosed in quotes, as in "Hello". These
        characters in these string are encoded in their ascii
        representations; strings are NOT nul-terminated unless
        explicitly described as NUL terminated.
   We use the words "byte" and "octet" interchangeably.
   We use the vertical bar | to denote concatenation.
   We use INT_N(val) to denote the network (big-endian) encoding of the
   unsigned integer "val" in N bytes. For example, INT_4(1337) is [00 00
   05 39].
0.3. Cryptographic building blocks
   This specification uses the following cryptographic building blocks:
      * A stream cipher STREAM(iv, k) where iv is a nonce of length
        S_IV_LEN bytes and k is a key of length S_KEY_LEN bytes.
      * A public key signature system SIGN_KEYGEN()->seckey, pubkey;
        SIGN_SIGN(seckey,msg)->sig; and SIGN_CHECK(pubkey, sig, msg) ->
        { "OK", "BAD" }; where secret keys are of length SIGN_SECKEY_LEN
        bytes, public keys are of length SIGN_PUBKEY_LEN bytes, and
        signatures are of length SIGN_SIG_LEN bytes.
        This signature system must also support key blinding operations
        as discussed in appendix [KEYBLIND] and in section [SUBCRED]:
        SIGN_BLIND_SECKEY(seckey, blind)->seckey2 and
        SIGN_BLIND_PUBKEY(pubkey, blind)->pubkey2 .
      * A public key agreement system "PK", providing
        PK_KEYGEN()->seckey, pubkey; PK_VALID(pubkey) -> {"OK", "BAD"};
        and PK_HANDHAKE(seckey, pubkey)->output; where secret keys are
        of length PK_SECKEY_LEN bytes, public keys are of length
        PK_PUBKEY_LEN bytes, and the handshake produces outputs of
        length PK_OUTPUT_LEN bytes.
      * A cryptographic hash function H(d), which should be preimage and
        collision resistant. It produces hashes of length HASH_LEN
        bytes.
      * A cryptographic message authentication code MAC(key,msg) that
        produces outputs of length MAC_LEN bytes.
      * A key derivation function KDF(key data, salt, personalization,
        n) that outputs n bytes.
   As a first pass, I suggest:
      * Instantiate STREAM with AES128-CTR. [TODO: or ChaCha20?]
      * Instantiate SIGN with Ed25519 and the blinding protocol in
        [KEYBLIND].
      * Instantiate PK with Curve25519.
      * Instantiate H with SHA256. [TODO: really?]
      * Instantiate MAC with HMAC using H.
      * Instantiate KDF with HKDF using H.
   For legacy purposes, we specify compatibility with older versions of
   the Tor introduction point and rendezvous point protocols. These used
   RSA1024, DH1024, AES128, and SHA1, as discussed in
   rend-spec.txt. Except as noted, all RSA keys MUST have exponent
   values of 65537.
   As in [proposal 220], all signatures are generated not over strings
   themselves, but over those strings prefixed with a distinguishing
   value.
0.4. Protocol building blocks [BUILDING-BLOCKS]
   In sections below, we need to transmit the locations and identities
   of Tor nodes. We do so in the link identification format used by
   EXTEND2 cells in the Tor protocol.
         NSPEC      (Number of link specifiers)   [1 byte]
         NSPEC times:
           LSTYPE (Link specifier type)           [1 byte]
           LSLEN  (Link specifier length)         [1 byte]
           LSPEC  (Link specifier)                [LSLEN bytes]
   Link specifier types are as described in tor-spec.txt. Every set of
   link specifiers MUST include at minimum specifiers of type [00]
   (TLS-over-TCP, IPv4) and [02] (legacy node identity).
   We also incorporate Tor's circuit extension handshakes, as used in
   the CREATE2 and CREATED2 cells described in tor-spec.txt. In these
   handshakes, a client who knows a public key for a server sends a
   message and receives a message from that server. Once the exchange is
   done, the two parties have a shared set of forward-secure key
   material, and the client knows that nobody else shares that key
   material unless they control the secret key corresponding to the
   server's public key.
0.5. Assigned relay cell types
   These relay cell types are reserved for use in the hidden service
   protocol.
      32 -- RELAY_COMMAND_ESTABLISH_INTRO
            Sent from hidden service host to introduction point;
            establishes introduction point. Discussed in
            [REG_INTRO_POINT].
      33 -- RELAY_COMMAND_ESTABLISH_RENDEZVOUS
            Sent from client to rendezvous point; creates rendezvous
            point. Discussed in [EST_REND_POINT].
      34 -- RELAY_COMMAND_INTRODUCE1
            Sent from client to introduction point; requests
            introduction. Discussed in [SEND_INTRO1]
      35 -- RELAY_COMMAND_INTRODUCE2
            Sent from client to introduction point; requests
            introduction. Same format as INTRODUCE1. Discussed in
            [FMT_INTRO1] and [PROCESS_INTRO2]
      36 -- RELAY_COMMAND_RENDEZVOUS1
            Sent from introduction point to rendezvous point;
            attempts to join introduction point's circuit to
            client's circuit. Discussed in [JOIN_REND]
      37 -- RELAY_COMMAND_RENDEZVOUS2
            Sent from introduction point to rendezvous point;
            reports join of introduction point's circuit to
            client's circuit. Discussed in [JOIN_REND]
      38 -- RELAY_COMMAND_INTRO_ESTABLISHED
            Sent from introduction point to hidden service host;
            reports status of attempt to establish introduction
            point. Discussed in [INTRO_ESTABLISHED]
      39 -- RELAY_COMMAND_RENDEZVOUS_ESTABLISHED
            Sent from rendezvous point to client; acknowledges
            receipt of ESTABLISH_RENDEZVOUS cell. Discussed in
            [EST_REND_POINT]
      40 -- RELAY_COMMAND_INTRODUCE_ACK
            Sent form introduction point to client; acknowledges
            receipt of INTRODUCE1 cell and reports success/failure.
            Discussed in [INTRO_ACK]
0.5. Acknowledgments
   [TODO reformat these once the lists are more complete.]
   This design includes ideas from many people, including
     Christopher Baines,
     Daniel J. Bernstein,
     Matthew Finkel,
     Ian Goldberg,
     George Kadianakis,
     Aniket Kate,
     Tanja Lange,
     Robert Ransom,
   It's based on Tor's original hidden service design by Roger
   Dingledine, Nick Mathewson, and Paul Syverson, and on improvements to
   that design over the years by people including
     Tobias Kamm,
     Thomas Lauterbach,
     Karsten Loesing,
     Alessandro Preite Martinez,
     Robert Ransom,
     Ferdinand Rieger,
     Christoph Weingarten,
     Christian Wilms,
   We wouldn't be able to do any of this work without good attack
   designs from researchers including
     Alex Biryukov,
     Lasse Øverlier,
     Ivan Pustogarov,
     Paul Syverson
     Ralf-Philipp Weinmann,
   See [ATTACK-REFS] for their papers.
   Several of these ideas have come from conversations with
      Christian Grothoff,
      Brian Warner,
      Zooko Wilcox-O'Hearn,
   And if this document makes any sense at all, it's thanks to
   editing help from
      Matthew Finkel
      George Kadianakis,
      Peter Palfrader,
   [XXX  Acknowledge the huge bunch of people working on 8106.]
   [XXX  Acknowledge the huge bunch of people working on 8244.]
   Please forgive me if I've missed you; please forgive me if I've
   misunderstood your best ideas here too.
1. Protocol overview
   In this section, we outline the hidden service protocol. This section
   omits some details in the name of simplicity; those are given more
   fully below, when we specify the protocol in more detail.
1.1. View from 10,000 feet
   A hidden service host prepares to offer a hidden service by choosing
   several Tor nodes to serve as its introduction points. It builds
   circuits to those nodes, and tells them to forward introduction
   requests to it using those circuits.
   Once introduction points have been picked, the host builds a set of
   documents called "hidden service descriptors" (or just "descriptors"
   for short) and uploads them to a set of HSDir nodes. These documents
   list the hidden service's current introduction points and describe
   how to make contact with the hidden service.
   When a client wants to connect to a hidden service, it first chooses
   a Tor node at random to be its "rendezvous point" and builds a
   circuit to that rendezvous point. If the client does not have an
   up-to-date descriptor for the service, it contacts an appropriate
   HSDir and requests such a descriptor.
   The client then builds an anonymous circuit to one of the hidden
   service's introduction points listed in its descriptor, and gives the
   introduction point an introduction request to pass to the hidden
   service. This introduction request includes the target rendezvous
   point and the first part of a cryptographic handshake.
   Upon receiving the introduction request, the hidden service host
   makes an anonymous circuit to the rendezvous point and completes the
   cryptographic handshake. The rendezvous point connects the two
   circuits, and the cryptographic handshake gives the two parties a
   shared key and proves to the client that it is indeed talking to the
   hidden service.
   Once the two circuits are joined, the client can send Tor RELAY cells
   to the server. RELAY_BEGIN cells open streams to an external process
   or processes configured by the server; RELAY_DATA cells are used to
   communicate data on those streams, and so forth.
1.2. In more detail: naming hidden services [NAMING]
   A hidden service's name is its long term master identity key.  This
   is encoded as a hostname by encoding the entire key in Base 32, and
   adding the string ".onion" at the end.
   (This is a change from older versions of the hidden service protocol,
   where we used an 80-bit truncated SHA1 hash of a 1024 bit RSA key.)
   The names in this format are distinct from earlier names because of
   their length. An older name might look like:
        unlikelynamefora.onion
        yyhws9optuwiwsns.onion
   And a new name following this specification might look like:
        a1uik0w1gmfq3i5ievxdm9ceu27e88g6o7pe0rffdw9jmntwkdsd.onion
   Note that since master keys are 32 bytes long, and 52 bytes of base
   32 encoding can hold 260 bits of information, we have four unused
   bits in each of these names.
   [TODO: Alternatively, we could require that the first bit of the
   master key always be zero, and use a 51-byte encoding. Or we could
   require that the first two bits be zero, and use a 51-byte encoding
   and reserve the first bit. Or we could require that the first nine
   bits, or ten bits be zero, etc.]
1.3. In more detail: Access control [IMD:AC]
   Access control for a hidden service is imposed at multiple points
   through the process above.
   In order to download a descriptor, clients must know which blinded
   signing key was used to sign it. (See the next section for more info
   on key blinding.)  This blinded signing key is derived from the
   service's public key and, optionally, an additional secret that is
   not part of the hidden service's onion address. The public key and
   this secret together constitute the service's "credential".
   When the secret is in use, the hidden service gains protections
   equivalent to the "stealth mode" in previous designs.
   To learn the introduction points, the clients must decrypt the body
   of the hidden service descriptor. The encryption key for these is
   derived from the service's credential.
   In order to make an introduction point send a request to the server,
   the client must know the introduction point and know the service's
   per-introduction-point authentication key from the hidden service
   descriptor.
   The final level of access control happens at the server itself, which
   may decide to respond or not respond to the client's request
   depending on the contents of the request. The protocol is extensible
   at this point: at a minimum, the server requires that the client
   demonstrate knowledge od the contents of the encrypted portion of the
   hidden service descriptor. The service may additionally require a
   user- or group-specific access token before it responds to requests.
1.4. In more detail: Distributing hidden service descriptors. [IMD:DIST]
   Periodically, hidden service descriptors become stored at different
   locations to prevent a single directory or small set of directories
   from becoming a good DoS target for removing a hidden service.
   For each period, the Tor directory authorities agree upon a
   collaboratively generated random value. (See section 2.3 for a
   description of how to incorporate this value into the voting
   practice; generating the value is described in other proposals,
   including [TODO: add a reference])  That value, combined with hidden service
   directories' public identity keys, determines each HSDirs' position
   in the hash ring for descriptors made in that period.
   Each hidden service's descriptors are placed into the ring in
   positions based on the key that was used to sign them. Note that
   hidden service descriptors are not signed with the services' public
   keys directly. Instead, we use a key-blinding system [KEYBLIND] to
   create a new key-of-the-day for each hidden service. Any client that
   knows the hidden service's credential can derive these blinded
   signing keys for a given period. It should be impossible to derive
   the blinded signing key lacking that credential.
   The body of each descriptor is also encrypted with a key derived from
   the credential.
   To avoid a "thundering herd" problem where every service generates
   and uploads a new descriptor at the start of each period, each
   descriptor comes online at a time during the period that depends on
   its blinded signing key. The keys for the last period remain valid
   until the new keys come online.
1.5. In more detail: Scaling to multiple hosts
   [THIS SECTION IS UNFINISHED]
   In order to allow multiple hosts to provide a single hidden service,
   I'm considering two options.
      * We can have each server build an introduction circuit to each
        introduction point, and have the introduction points responsible
        for round-robining between these circuits. One service host is
        responsible for picking the introduction points and publishing
        the descriptors.
      * We can have servers choose their introduction points
        independently, and build circuits to them. One service host is
        responsible for combining these introduction points into a
        single descriptor.
   If we want to avoid having a single "master" host without which the
   whole service goes down (the "one service host" in the description
   above), we need a way to fail over from one host to another. We also
   need a way to coordinate between the hosts. This is as yet
   undesigned. Maybe it should use a hidden service?
   See [SCALING-REFS] for discussion on this topic.
   [TODO: Finalize this design.]
1.6. In more detail: Backward compatibility with older hidden service
      protocols
   This design is incompatible with the clients, server, and hsdir node
   protocols from older versions of the hidden service protocol as
   described in rend-spec.txt. On the other hand, it is designed to
   enable the use of older Tor nodes as rendezvous points and
   introduction points.
1.7. In more detail: Offline operation
   In this design, a hidden service's secret identity key may be stored
   offline. It's used only to generate blinded identity keys, which are
   used to sign descriptor signing keys. In order to operate a hidden
   service, the operator can generate a number of descriptor signing
   keys and their certifications (see [DESC-OUTER] and [ENCRYPTED-DATA]
   below), and their corresponding descriptor encryption keys, and
   export those to the hidden service hosts.
1.8. In more detail: Encryption Keys And Replay Resistance
   To avoid replays of an introduction request by an introduction point,
   a hidden service host must never accept the same request
   twice. Earlier versions of the hidden service design used a
   authenticated timestamp here, but including a view of the current
   time can create a problematic fingerprint. (See proposal 222 for more
   discussion.)
1.9. In more detail: A menagerie of keys
   [In the text below, an "encryption keypair" is roughly "a keypair you
   can do Diffie-Hellman with" and a "signing keypair" is roughly "a
   keypair you can do ECDSA with."]
   Public/private keypairs defined in this document:
      Master (hidden service) identity key -- A master signing keypair
        used as the identity for a hidden service. This key is not used
        on its own to sign anything; it is only used to generate blinded
        signing keys as described in [KEYBLIND] and [SUBCRED].
      Blinded signing key -- A keypair derived from the identity key,
        used to sign descriptor signing keys. Changes periodically for
        each service. Clients who know a 'credential' consisting of the
        service's public identity key and an optional secret can derive
        the public blinded identity key for a service. This key is used
        as an index in the DHT-like structure of the directory system.
      Descriptor signing key -- A key used to sign hidden service
        descriptors. This is signed by blinded signing keys. Unlike
        blinded signing keys and master identity keys, the secret part
        of this key must be stored online by hidden service hosts.
      Introduction point authentication key -- A short-term signing
        keypair used to identify a hidden service to a given
        introduction point. A fresh keypair is made for each
        introduction point; these are used to sign the request that a
        hidden service host makes when establishing an introduction
        point, so that clients who know the public component of this key
        can get their introduction requests sent to the right
        service. No keypair is ever used with more than one introduction
        point. (previously called a "service key" in rend-spec.txt)
      Introduction point encryption key -- A short-term encryption
        keypair used when establishing connections via an introduction
        point. Plays a role analogous to Tor nodes' onion keys. A fresh
        keypair is made for each introduction point.
   Symmetric keys defined in this document:
      Descriptor encryption keys -- A symmetric encryption key used to
        encrypt the body of hidden service descriptors. Derived from the
        current period and the hidden service credential.
   Public/private keypairs defined elsewhere:
      Onion key -- Short-term encryption keypair
      (Node) identity key
   Symmetric key-like things defined elsewhere:
      KH from circuit handshake -- An unpredictable value derived as
      part of the Tor circuit extension handshake, used to tie a request
      to a particular circuit.
2. Generating and publishing hidden service descriptors [HSDIR]
   Hidden service descriptors follow the same metaformat as other Tor
   directory objects. They are published anonymously to Tor servers with
   the HSDir3 flag.
   (Authorities should assign this flag as they currently assign the
   HSDir flag, except that they should restrict it to Tor versions
   implementing the HSDir parts of this specification.)
2.1. Deriving blinded keys and subcredentials [SUBCRED]
   In each time period (see [TIME-PERIOD] for a definition of time
   periods), a hidden service host uses a different blinded private key
   to sign its directory information, and clients use a different
   blinded public key as the index for fetching that information.
   For a candidate for a key derivation method, see Appendix [KEYBLIND].
   Additionally, clients and hosts derive a subcredential for each
   period. Knowledge of the subcredential is needed to decrypt hidden
   service descriptors for each period and to authenticate with the
   hidden service host in the introduction process. Unlike the
   credential, it changes each period. Knowing the subcredential, even
   in combination with the blinded private key, does not enable the
   hidden service host to derive the main credential--therefore, it is
   safe to put the subcredential on the hidden service host while
   leaving the hidden service's private key offline.
   The subcredential for a period is derived as:
       H("subcredential" |
         credential |
         blinded-public-key).
2.2. Locating, uploading, and downloading hidden service descriptors
       [HASHRING]
   To avoid attacks where a hidden service's descriptor is easily
   targeted for censorship, we store them at different directories over
   time, and use shared random values to prevent those directories from
   being predictable far in advance.
   Which Tor servers hosts a hidden service depends on:
         * the current time period,
         * the daily subcredential,
         * the hidden service directories' public keys,
         * a shared random value that changes in each time period,
         * a set of network-wide networkstatus consensus parameters.
   Below we explain in more detail.
2.2.1. Dividing time into periods [TIME-PERIODS]
   To prevent a single set of hidden service directory from becoming a
   target by adversaries looking to permanently censor a hidden service,
   hidden service descriptors are uploaded to different locations that
   change over time.
   The length of a "time period" is controlled by the consensus
   parameter 'hsdir-interval', and is a number of minutes between 30 and
   14400 (10 days). The default time period length is 1500 (one day plus
   one hour).
   Time periods start with the Unix epoch (Jan 1, 1970), and are
   computed by taking the number of whole minutes since the epoch and
   dividing by the time period. So if the current time is 2013-11-12
   13:44:32 UTC, making the seconds since the epoch 1384281872, the
   number of minutes since the epoch is 23071364.  If the current time
   period length is 1500 (the default), then the current time period
   number is 15380. It began 15380*1500*60 seconds after the epoch at
   2013-11-11 20:00:00 UTC, and will end at (15380+1)*1500*60 seconds
   after the epoch at 2013-11-12 21:00:00 UTC.
2.2.2. Overlapping time periods to avoid thundering herds [TIME-OVERLAP]
   If every hidden service host were to generate a new set of keys and
   upload a new descriptor at exactly the start of each time period, the
   directories would be overwhelmed by every host uploading at the same
   time. Instead, each public key becomes valid at its new location at a
   deterministic time somewhat _before_ the period begins, depending on
   the public key and the period.
   The time at which a key might first become valid is determined by the
   consensus parameter "hsdir-overlap-begins", which is an integer in
   range [1,100] with default value 80. This parameter denotes a
   percentage of the interval for which no overlap occurs.  So for the
   default interval (1500 minutes) and default overlap-begins value
   (80%), new keys do not become valid for the first 1200 minutes of the
   interval.
   The new shared random value must be published *before* the start of
   the next overlap interval by at least enough time to ensure that
   clients all get it. [TODO: how much earlier?]
   The time at which a key from the next interval becomes valid is
   determined by taking the first two bytes of
     OFFSET = H(Key | INT_8(Next_Period_Num))
   as a big-endian integer, dividing by 65536, and treating that as a
   fraction of the overlap interval.
   For example, if the period is 1500 minutes long, and overlap interval
   is 300 minutes long, and OFFSET begins with [90 50], then the next
   key becomes valid at 1200 + 300 * (0x9050 / 65536) minutes, or
   approximately 22 hours and 49 minutes after the beginning of the
   period.
   Hidden service directories should accept descriptors at least [TODO:
   how much?] minutes before they would become valid, and retain them
   for at least [TODO: how much?] minutes after the end of the period.
   When a client is looking for a service, it must calculate its key
   both for the current and for the subsequent period, to decide whether
   the next period's key is valid yet.
2.2.3. Where to publish a service descriptor
   The following consensus parameters control where a hidden service
   descriptor is stored;
        hsdir_n_replicas = an integer in range [1,16]
                             with default value 2.
        hsdir_spread_fetch = an integer in range [1,128]
                             with default value 3.
        hsdir_spread_store = an integer in range [1,128]
                             with default value 3.
        hsdir_spread_accept = an integer in range [1,128]
                             with default value 8.
   To determine where a given hidden service descriptor will be stored
   in a given period, after the blinded public key for that period is
   derived, the uploading or downloading party calculate
        for replicanum in 1...hsdir_n_replicas:
            hs_index(replicanum) = H("store-at-idx" |
                                 blinded_public_key | replicanum |
                                 periodnum)
   where blinded_public_key is specified in section KEYBLIND, and
   periodnum is defined in section TIME-PERIODS.
   where n_replicas is determined by the consensus parameter
   "hsdir_n_replicas".
   Then, for each node listed in the current consensus with the HSDir3
   flag, we compute a directory index for that node as:
           hsdir_index(node) = H(node_identity_digest |
                                 shared_random |
                                 INT_8(period_num) )
   where shared_random is the shared value generated by the authorities
   in section PUB-SHAREDRANDOM.
   Finally, for replicanum in 1...hsdir_n_replicas, the hidden service
   host uploads descriptors to the first hsdir_spread_store nodes whose
   indices immediately follow hs_index(replicanum).
   When choosing an HSDir to download from, clients choose randomly from
   among the first hsdir_spread_fetch nodes after the indices.  (Note
   that, in order to make the system better tolerate disappearing
   HSDirs, hsdir_spread_fetch may be less than hsdir_spread_store.)
   An HSDir should rejects a descriptor if that HSDir is not one of the
   first hsdir_spread_accept HSDirs for that node.
   [TODO: Incorporate the findings from proposal 143 here. But watch
   out: proposal 143 did not analyze how much the set of nodes changes
   over time, or how much client and host knowledge might diverge.]
2.2.4. URLs for anonymous uploading and downloading
   Hidden service descriptors conforming to this specification are
   uploaded with an HTTP POST request to the URL
   /tor/rendezvous3/publish relative to the hidden service directory's
   root, and downloaded with an HTTP GET request for the URL
   /tor/rendezvous3/<z> where z is a base-64 encoding of the hidden
   service's blinded public key.
   [TODO: raw base64 is not super-nice for URLs, since it can have
   slashes. We already use it for microdescriptor URLs, though. Do we
   care here?]
   These requests must be made anonymously, on circuits not used for
   anything else.
2.3. Publishing shared random values [PUB-SHAREDRANDOM]
   Our design for limiting the predictability of HSDir upload locations
   relies on a shared random value that isn't predictable in advance or
   too influenceable by an attacker. The authorities must run a protocol
   to generate such a value at least once per hsdir period. Here we
   describe how they publish these values; the procedure they use to
   generate them can change independently of the rest of this
   specification. For one possible (somewhat broken) protocol, see
   Appendix [SHAREDRANDOM].
   We add a new line in votes and consensus documents:
       "hsdir-shared-random" PERIOD-START VALUE
       PERIOD-START = YYYY-MM-DD HH:MM:SS
       VALUE = A base-64 encoded 256-bit value.
   To decide which hsdir-shared-random line to include in a consensus
   for a given PERIOD-START, we choose whichever line appears verbatim
   in the most votes, so long as it is listed by at least three
   authorities. Ties are broken in favor of the lower value. More than
   one PERIOD-START is allowed per vote, and per consensus. The same
   PERIOD-START must not appear twice in a vote or in a consensus.
   [TODO: Need to define a more robust algorithm. Need to cover cases
   where multiple cluster of authorities publish a different value,
   etc.]
   The hs-dir-shared-random lines appear, sorted by PERIOD-START, in the
   consensus immediately after the "params" line.
   The authorities should publish the shared random value for the
   current period, and, at a time at least three voting periods before
   the overlap interval begins, the shared random value for the next
   period.
[TODO: find out what weasel doesn't like here.]
2.4. Hidden service descriptors: outer wrapper [DESC-OUTER]
   The format for a hidden service descriptor is as follows, using the
   meta-format from dir-spec.txt.
     "hs-descriptor" SP "3" SP public-key SP certification NL
       [At start, exactly once.]
       public-key is the blinded public key for the service, encoded in
       base 64. Certification is a certification of a short-term ed25519
       descriptor signing key using the public key, in the format of
       proposal 220.
     "time-period" SP YYYY-MM-DD HH:MM:SS NUM NL
       [Exactly once.]
       The time period for which this descriptor is relevant, including
       its starting time and its period number.
     "revision-counter" SP Integer NL
       [Exactly once.]
       The revision number of the descriptor. If an HSDir receives a
       second descriptor for a key that it already has a descriptor for,
       it should retain and serve the descriptor with the higher
       revision-counter.
       (Checking for monotonically increasing revision-counter values
       prevents an attacker from replacing a newer descriptor signed by
       a given key with a copy of an older version.)
     "encrypted" NL encrypted-string
       [Exactly once.]
       An encrypted blob, whose format is discussed in [ENCRYPTED-DATA]
       below. The blob is base-64 encoded and enclosed in -----BEGIN
       MESSAGE---- and ----END MESSAGE---- wrappers.
     "signature" SP signature NL
       [exactly once, at end.]
       A signature of all previous fields, using the signing key in the
       hs-descriptor line. We use a separate key for signing, so that
       the hidden service host does not need to have its private blinded
       key online.
2.5. Hidden service descriptors: encryption format [ENCRYPTED-DATA]
   The encrypted part of the hidden service descriptor is encrypted and
   authenticated with symmetric keys generated as follows:
       salt = 16 random bytes
       secret_input = nonce | blinded_public_key | subcredential |
             INT_4(revision_counter)
       keys = KDF(secret_input, salt, "hsdir-encrypted-data",
                  S_KEY_LEN + S_IV_LEN + MAC_KEY_LEN)
       SECRET_KEY = first S_KEY_LEN bytes of keys
       SECRET_IV  = next S_IV_LEN bytes of keys
       MAC_KEY    = last MAC_KEY_LEN bytes of keys
   The encrypted data has the format:
       SALT       (random bytes from above)       [16 bytes]
       ENCRYPTED  The plaintext encrypted with S  [variable]
       MAC        MAC of both above fields        [32 bytes]
   The encryption format is ENCRYPTED =
               STREAM(SECRET_IV,SECRET_KEY) xor Plaintext
   Before encryption, the plaintext must be padded to a multiple of ???
   bytes with NUL bytes. The plaintext must not be longer than ???
   bytes.  [TODO: how much? Should this be a parameter?  What values in
   practice is needed to hide how many intro points we have, and how
   many might be legacy ones?]
   The plaintext format is:
     "create2-formats" SP formats NL
      [Exactly once]
      A space-separated list of integers denoting CREATE2 cell format
      numbers that the server recognizes. Must include at least TAP and
      ntor as described in tor-spec.txt. See tor-spec section 5.1 for a
      list of recognized handshake types.
     "authentication-required" SP types NL
      [At most once]
      A space-separated list of authentication types. A client that does
      not support at least one of these authentication types will not be
      able to contact the host. Recognized types are: 'password' and
      'ed25519'. See [INTRO-AUTH] below.
     At least once:
        "introduction-point" SP link-specifiers NL
          [Exactly once per introduction point at start of introduction
            point section]
          The link-specifiers is a base64 encoding of a link specifier
          block in the format described in BUILDING-BLOCKS.
        "auth-key" SP "ed25519" SP key SP certification NL
          [Exactly once per introduction point]
          Base-64 encoded introduction point authentication key that was
          used to establish introduction point circuit, cross-certifying
          the blinded public key key using the certification format of
          proposal 220.
        "enc-key" SP "ntor" SP key NL
          [At most once per introduction point]
          Base64-encoded curve25519 key used to encrypt request to
          hidden service.
          [TODO: I'd like to have a cross-certification here too.]
        "enc-key" SP "legacy" NL key NL
          [At most once per introduction point]
          Base64-encoded RSA key, wrapped in "----BEGIN RSA PUBLIC
          KEY-----" armor, for use with a legacy introduction point as
          described in [LEGACY_EST_INTRO] and [LEGACY-INTRODUCE1] below.
          Exactly one of the "enc-key ntor" and "enc-key legacy"
          elements must be present for each introduction point.
        [TODO: I'd like to have a cross-certification here too.]
   Other encryption and authentication key formats are allowed; clients
   should ignore ones they do not recognize.
3. The introduction protocol
   The introduction protocol proceeds in three steps.
   First, a hidden service host builds an anonymous circuit to a Tor
   node and registers that circuit as an introduction point.
        [Between these steps, the hidden service publishes its
        introduction points and associated keys, and the client fetches
        them as described in section [HSDIR] above.]
   Second, a client builds an anonymous circuit to the introduction
   point, and sends an introduction request.
   Third, the introduction point relays the introduction request along
   the introduction circuit to the hidden service host, and acknowledges
   the introduction request to the client.
3.1. Registering an introduction point [REG_INTRO_POINT]
3.1.1. Extensible ESTABLISH_INTRO protocol. [EST_INTRO]
   When a hidden service is establishing a new introduction point, it
   sends a ESTABLISH_INTRO cell with the following contents:
     AUTH_KEY_TYPE    [1 byte]
     AUTH_KEY_LEN     [1 byte]
     AUTH_KEY         [AUTH_KEY_LEN bytes]
     Any number of times:
        EXT_FIELD_TYPE [1 byte]
        EXT_FIELD_LEN  [1 byte]
        EXT_FIELD      [EXTRA_FIELD_LEN bytes]
     ZERO             [1 byte]
     HANDSHAKE_AUTH   [MAC_LEN bytes]
     SIGLEN           [1 byte]
     SIG              [SIGLEN bytes]
   The AUTH_KEY_TYPE field indicates the type of the introduction point
   authentication key and the type of the MAC to use in for
   HANDSHAKE_AUTH. Recognized types are:
       [00, 01] -- Reserved for legacy introduction cells; see
                   [LEGACY_EST_INTRO below]
       [02] -- Ed25519; HMAC-SHA256.
       [FF] -- Reserved for maintenance messages on existing
               circuits; see MAINT_INTRO below.
               [TODO: Should this just be a new relay cell type?
                Matthew and George think so.]
   The AUTH_KEY_LEN field determines the length of the AUTH_KEY
   field. The AUTH_KEY field contains the public introduction point
   authentication key.
   The EXT_FIELD_TYPE, EXT_FIELD_LEN, EXT_FIELD entries are reserved for
   future extensions to the introduction protocol. Extensions with
   unrecognized EXT_FIELD_TYPE values must be ignored.
   The ZERO field contains the byte zero; it marks the end of the
   extension fields.
   The HANDSHAKE_AUTH field contains the MAC of all earlier fields in
   the cell using as its key the shared per-circuit material ("KH")
   generated during the circuit extension protocol; see tor-spec.txt
   section 5.2, "Setting circuit keys". It prevents replays of
   ESTABLISH_INTRO cells.
   SIGLEN is the length of the signature.
   SIG is a signature, using AUTH_KEY, of all contents of the cell, up
   to but not including SIG. These contents are prefixed with the string
   "Tor establish-intro cell v1".
   Upon receiving an ESTABLISH_INTRO cell, a Tor node first decodes the
   key and the signature, and checks the signature. The node must reject
   the ESTABLISH_INTRO cell and destroy the circuit in these cases:
        * If the key type is unrecognized
        * If the key is ill-formatted
        * If the signature is incorrect
        * If the HANDSHAKE_AUTH value is incorrect
        * If the circuit is already a rendezvous circuit.
        * If the circuit is already an introduction circuit.
          [TODO: some scalability designs fail there.]
        * If the key is already in use by another circuit.
   Otherwise, the node must associate the key with the circuit, for use
   later in INTRODUCE1 cells.
   [TODO: The above will work fine with what we do today, but it will do
   quite badly if we ever freak out and want to go back to RSA2048 or
   bigger.  Do we care?]
3.1.2. Registering an introduction point on a legacy Tor node [LEGACY_EST_INTRO]
   Tor nodes should also support an older version of the ESTABLISH_INTRO
   cell, first documented in rend-spec.txt. New hidden service hosts
   must use this format when establishing introduction points at older
   Tor nodes that do not support the format above in [EST_INTRO].
   In this older protocol, an ESTABLISH_INTRO cell contains:
        KEY_LENGTH     [2 bytes]
        KEY            [KEY_LENGTH bytes]
        HANDSHAKE_AUTH [20 bytes]
        SIG            [variable, up to end of relay payload]
   The KEY_LENGTH variable determines the length of the KEY field.
   The KEY field is a ASN1-encoded RSA public key.
   The HANDSHAKE_AUTH field contains the SHA1 digest of (KH |
   "INTRODUCE").
   The SIG field contains an RSA signature, using PKCS1 padding, of all
   earlier fields.
   Note that since the relay payload itself may be no more than 498
   bytes long, the KEY_LENGTH field can never have a first byte other
   than [00] or [01]. These values are used to distinguish legacy
   ESTABLISH_INTRO cells from newer ones.
   Older versions of Tor always use a 1024-bit RSA key for these
   introduction authentication keys.
   Newer hidden services MAY use RSA keys up 1904 bits. Any more than
   that will not fit in a RELAY cell payload.
3.1.3. Managing introduction circuits [MAINT_INTRO]
   If the first byte of an ESTABLISH_INTRO cell is [FF], the cell's body
   contains an administrative command for the circuit. The format of
   such a command is:
     Any number of times:
       SUBCOMMAND_TYPE [2 bytes]
       SUBCOMMAND_LEN  [2 bytes]
       SUBCOMMAND      [COMMAND_LEN bytes]
   Recognized SUBCOMMAND_TYPE values are:
       [00 01] -- update encryption keys
   [TODO: Matthew says, "This can be used to fork an intro point to
    balance traffic over multiple hidden service servers while
    maintaining the criteria for a valid ESTABLISH_INTRO
    cell. -MF". Investigate.]
   Unrecognized SUBCOMMAND_TYPE values should be ignored.
3.1.3.1. Updating encryption keys (subcommand 0001) [UPDATE-KEYS-SUBCMD]
   Hidden service hosts send this subcommand to set their initial
   encryption keys or update the configured public encryption keys
   associated with this circuit. This message must be sent after
   establishing an introduction point, before the circuit can be
   advertised.  These keys are given in the form:
      NUMKEYS         [1 byte]
      NUMKEYS times:
        KEYTYPE         [1 byte]
        KEYLEN          [1 byte]
        KEY             [KEYLEN bytes]
      COUNTER         [4 bytes]
      SIGLEN          [1 byte]
      SIGNATURE       [SIGLEN bytes.]
   The KEYTYPE value [01] is for Curve25519 keys.
   The COUNTER field is a monotonically increasing value across a given
   introduction point authentication key.
   The SIGNATURE must be generated with the introduction point
   authentication key, and must cover the entire subcommand body,
   prefixed with the string "Tor hidden service introduction encryption
   keys v1".
   [TODO: Nothing is done here to prove ownership of the encryption
   keys. Does that matter?]
   [TODO: The point here is to allow encryption keys to change while
   maintaining an introduction point and not forcing a client to
   download a new descriptor. I'm not sure if that's worth it. It makes
   clients who have seen a key before distinguishable from ones who have
   not.]
   [Matthew says: "Repeat-client over long periods of time will always
   be distinguishable. It may be better to simply expire intro points
   than try to preserve forward-secrecy, though". Must find out what he
   meant.]
   Setting the encryption keys for a given circuit replaces the previous
   keys for that circuit. Clients who attempt to connect using the old
   key receive an INTRO_ACK cell with error code [00 02] as described in
   section [INTRO_ACK] below.
3.1.4. Acknowledging establishment of introduction point [INTRO_ESTABLISHED]
   After setting up an introduction circuit, the introduction point
   reports its status back to the hidden service host with an empty
   INTRO_ESTABLISHED cell.
    [TODO: make this cell type extensible. It should be able to include
    data if that turns out to be needed.]
3.2. Sending an INTRODUCE1 cell to the introduction point. [SEND_INTRO1]
   In order to participate in the introduction protocol, a client must
   know the following:
     * An introduction point for a service.
     * The introduction authentication key for that introduction point.
     * The introduction encryption key for that introduction point.
   The client sends an INTRODUCE1 cell to the introduction point,
   containing an identifier for the service, an identifier for the
   encryption key that the client intends to use, and an opaque blob to
   be relayed to the hidden service host.
   In reply, the introduction point sends an INTRODUCE_ACK cell back to
   the client, either informing it that its request has been delivered,
   or that its request will not succeed.
3.2.1. INTRODUCE1 cell format [FMT_INTRO1]
   An INTRODUCE1 cell has the following contents:
     AUTH_KEYID      [32 bytes]
     ENC_KEYID       [8 bytes]
     Any number of times:
       EXT_FIELD_TYPE [1 byte]
       EXT_FIELD_LEN  [1 byte]
       EXT_FIELD      [EXTRA_FIELD_LEN bytes]
     ZERO             [1 byte]
     ENCRYPTED        [Up to end of relay payload]
   [TODO: Should we have a field to determine the type of ENCRYPTED, or
   should we instead assume that there is exactly one encryption key per
   encryption method? The latter is probably safer.]
   Upon receiving an INTRODUCE1 cell, the introduction point checks
   whether AUTH_KEYID and ENC_KEYID match a configured introduction
   point authentication key and introduction point encryption key.  If
   they do, the cell is relayed; if not, it is not.
   The AUTH_KEYID for an Ed25519 public key is the public key itself.
   The ENC_KEYID for a Curve25519 public key is the first 8 bytes of the
   public key. (This key ID is safe to truncate, since all the keys are
   generated by the hidden service host, and the ID is only valid
   relative to a single AUTH_KEYID.)  The ENCRYPTED field is as
   described in 3.3 below.
   To relay an INTRODUCE1 cell, the introduction point sends an
   INTRODUCE2 cell with exactly the same contents.
3.2.2. INTRODUCE_ACK cell format. [INTRO_ACK]
   An INTRODUCE_ACK cell has the following fields:
     STATUS    [2 bytes]
     Any number of times:
       EXT_FIELD_TYPE [1 byte]
       EXT_FIELD_LEN  [1 byte]
       EXT_FIELD      [EXTRA_FIELD_LEN bytes]
   Recognized status values are:
     [00 00] -- Success: cell relayed to hidden service host.
     [00 01] -- Failure: service ID not recognzied
     [00 02] -- Failure: key ID not recognized
     [00 03] -- Bad message format
   Recognized extension field types:
     [00 01] -- signed set of encryption keys
   The extension field type 0001 is a signed set of encryption keys; its
   body matches the body of the key update command in
   [UPDATE-KEYS-CMD]. Whenever sending status [00 02], the introduction
   point MUST send this extension field.
3.2.3. Legacy formats [LEGACY-INTRODUCE1]
   When the ESTABLISH_INTRO cell format of [LEGACY_EST_INTRO] is used,
   INTRODUCE1 cells are of the form:
     AUTH_KEYID_HASH  [20 bytes]
     ENC_KEYID        [8 bytes]
     Any number of times:
       EXT_FIELD_TYPE [1 byte]
       EXT_FIELD_LEN  [1 byte]
       EXT_FIELD      [EXTRA_FIELD_LEN bytes]
     ZERO             [1 byte]
     ENCRYPTED        [Up to end of relay payload]
   Here, AUTH_KEYID_HASH is the hash of the introduction point
   authentication key used to establish the introduction.
   Because of limitations in older versions of Tor, the relay payload
   size for these INTRODUCE1 cells must always be at least 246 bytes, or
   they will be rejected as invalid.
3.3. Processing an INTRODUCE2 cell at the hidden service. [PROCESS_INTRO2]
   Upon receiving an INTRODUCE2 cell, the hidden service host checks
   whether the AUTH_KEYID/AUTH_KEYID_HASH field and the ENC_KEYID fields
   are as expected, and match the configured authentication and
   encryption key(s) on that circuit.
   The service host then checks whether it has received a cell with
   these contents before. If it has, it silently drops it as a
   replay. (It must maintain a replay cache for as long as it accepts
   cells with the same encryption key.)
   If the cell is not a replay, it decrypts the ENCRYPTED field,
   establishes a shared key with the client, and authenticates the whole
   contents of the cell as having been unmodified since they left the
   client. There may be multiple ways of decrypting the ENCRYTPED field,
   depending on the chosen type of the encryption key. Requirements for
   an introduction handshake protocol are described in
   [INTRO-HANDSHAKE-REQS]. We specify one below in section
   [NTOR-WITH-EXTRA-DATA].
   The decrypted plaintext must have the form:
      REND_TOKEN                                 [20 bytes]
      Any number of times:
          EXT_FIELD_TYPE                         [1 byte]
          EXT_FIELD_LEN                          [1 byte]
          EXT_FIELD                              [EXTRA_FIELD_LEN bytes]
      ZERO                                       [1 byte]
      ONION_KEY_TYPE                             [2 bytes]
      ONION_KEY                                  [depends on ONION_KEY_TYPE]
      NSPEC      (Number of link specifiers)     [1 byte]
      NSPEC times:
          LSTYPE (Link specifier type)           [1 byte]
          LSLEN  (Link specifier length)         [1 byte]
          LSPEC  (Link specifier)                [LSLEN bytes]
      PAD        (optional padding)              [up to end of plaintext]
   Upon processing this plaintext, the hidden service makes sure that
   any required authentication is present in the extension fields, and
   then extends a rendezvous circuit to the node described in the LSPEC
   fields, using the ONION_KEY to complete the extension. As mentioned
   in [BUILDING-BLOCKS], the "TLS-over-TCP, IPv4" and "Legacy node
   identity" specifiers must be present.
   The hidden service SHOULD NOT reject any LSTYPE fields which it
   doesn't recognize; instead, it should use them verbatim in its EXTEND
   request to the rendezvous point.
   The ONION_KEY_TYPE field is one of:
      [01] TAP-RSA-1024:  ONION_KEY is 128 bytes long.
      [02] NTOR:          ONION_KEY is 32 bytes long.
   The ONION_KEY field describes the onion key that must be used when
   extending to the rendezvous point. It must be of a type listed as
   supported in the hidden service descriptor.
   Upon receiving a well-formed INTRODUCE2 cell, the hidden service host
   will have:
     * The information needed to connect to the client's chosen
       rendezvous point.
     * The second half of a handshake to authenticate and establish a
       shared key with the hidden service client.
     * A set of shared keys to use for end-to-end encryption.
3.3.1. Introduction handshake encryption requirements [INTRO-HANDSHAKE-REQS]
   When decoding the encrypted information in an INTRODUCE2 cell, a
   hidden service host must be able to:
     * Decrypt additional information included in the INTRODUCE2 cell,
       to include the rendezvous token and the information needed to
       extend to the rendezvous point.
     * Establish a set of shared keys for use with the client.
     * Authenticate that the cell has not been modified since the client
       generated it.
   Note that the old TAP-derived protocol of the previous hidden service
   design achieved the first two requirements, but not the third.
3.3.2. Example encryption handshake: ntor with extra data [NTOR-WITH-EXTRA-DATA]
   This is a variant of the ntor handshake (see tor-spec.txt, section
   5.1.4; see proposal 216; and see "Anonymity and one-way
   authentication in key-exchange protocols" by Goldberg, Stebila, and
   Ustaoglu).
   It behaves the same as the ntor handshake, except that, in addition
   to negotiating forward secure keys, it also provides a means for
   encrypting non-forward-secure data to the server (in this case, to
   the hidden service host) as part of the handshake.
   Notation here is as in section 5.1.4 of tor-spec.txt, which defines
   the ntor handshake.
   The PROTOID for this variant is
   "hidden-service-ntor-curve25519-sha256-1".  Define the tweak value
   t_hsenc, and the tag value m_hsexpand as:
      t_hsenc    = PROTOID | ":hs_key_extract"
      m_hsexpand = PROTOID | ":hs_key_expand"
   To make an INTRODUCE cell, the client must know a public encryption
   key B for the hidden service on this introduction circuit. The client
   generates a single-use keypair:
             x,X = KEYGEN()
   and computes:
             secret_hs_input = EXP(B,x) | AUTH_KEYID | X | B | PROTOID
             info = m_hsexpand | subcredential
             hs_keys = HKDF(secret_hs_input, t_hsenc, info,
                            S_KEY_LEN+MAC_LEN)
             ENC_KEY = hs_keys[0:S_KEY_LEN]
             MAC_KEY = hs_keys[S_KEY_LEN:S_KEY_LEN+MAC_KEY_LEN]
   and sends, as the ENCRYPTED part of the INTRODUCE1 cell:
          CLIENT_PK                [G_LENGTH bytes]
          ENCRYPTED_DATA           [Padded to length of plaintext]
          MAC                      [MAC_LEN bytes]
   Substituting those fields into the INTRODUCE1 cell body format
   described in [FMT_INTRO1] above, we have
            AUTH_KEYID                  [32 bytes]
            ENC_KEYID                   [8 bytes]
            Any number of times:
               EXT_FIELD_TYPE           [1 byte]
               EXT_FIELD_LEN            [1 byte]
               EXT_FIELD                [EXTRA_FIELD_LEN bytes]
               ZERO                     [1 byte]
            ENCRYPTED:
               CLIENT_PK                [G_LENGTH bytes]
               ENCRYPTED_DATA           [Padded to length of plaintext]
               MAC                      [MAC_LEN bytes]
   (This format is as documented in [FMT_INTRO1] above, except that here
   we describe how to build the ENCRYPTED portion. If the introduction
   point is running an older Tor that does not support this protocol,
   the first field is replaced by a 20-byte AUTH_KEYID_HASH field as
   described in [LEGACY-INTRODUCE1].)
   Here, the encryption key plays the role of B in the regular ntor
   handshake, and the AUTH_KEYID field plays the role of the node ID.
   The CLIENT_PK field is the public key X. The ENCRYPTED_DATA field is
   the message plaintext, encrypted with the symmetric key ENC_KEY. The
   MAC field is a MAC of all of the cell from the AUTH_KEYID through the
   end of ENCRYPTED_DATA, using the MAC_KEY value as its key.
   To process this format, the hidden service checks PK_VALID(CLIENT_PK)
   as necessary, and then computes ENC_KEY and MAC_KEY as the client did
   above, except using EXP(CLIENT_PK,b) in the calculation of
   secret_hs_input. The service host then checks whether the MAC is
   correct. If it is invalid, it drops the cell. Otherwise, it computes
   the plaintext by decrypting ENCRYPTED_DATA.
   The hidden service host now completes the service side of the
   extended ntor handshake, as described in tor-spec.txt section 5.1.4,
   with the modified PROTOID as given above. To be explicit, the hidden
   service host generates a keypair of y,Y = KEYGEN(), and uses its
   introduction point encryption key 'b' to computes:
      xb = EXP(X,b)
      secret_hs_input = xb | AUTH_KEYID | X | B | PROTOID
      info = m_hsexpand | subcredential
      hs_keys = HKDF(secret_hs_input, t_hsenc, info,
                     S_KEY_LEN+MAC_LEN)
      HS_DEC_KEY = hs_keys[0:S_KEY_LEN]
      HS_MAC_KEY = hs_keys[S_KEY_LEN:S_KEY_LEN+MAC_KEY_LEN]
      (The above are used to check the MAC and then decrypt the
      encrypted data.)
      ntor_secret_input = EXP(X,y) | xb | ID | B | X | Y | PROTOID
      NTOR_KEY_SEED = H(secret_input, t_key)
      verify = H(secret_input, t_verify)
      auth_input = verify | ID | B | Y | X | PROTOID | "Server"
      (The above are used to finish the ntor handshake.)
   The server's handshake reply is:
       SERVER_PK   Y                       [G_LENGTH bytes]
       AUTH        H(auth_input, t_mac)    [H_LENGTH bytes]
   These faileds can be send to the client in a RENDEZVOUS1 cell.
   (See [JOIN_REND] below.)
   The hidden service host now also knows the keys generated by the
   handshake, which it will use to encrypt and authenticate data
   end-to-end between the client and the server. These keys are as
   computed in tor-spec.txt section 5.1.4.
3.4. Authentication during the introduction phase. [INTRO-AUTH]
   Hidden services may restrict access only to authorized users.  One
   mechanism to do so is the credential mechanism, where only users who
   know the credential for a hidden service may connect at all. For more
   fine-grained conntrol, a hidden service can be configured with
   password-based or public-key-based authentication.
3.4.1. Password-based authentication.
   To authenticate with a password, the user must include an extension
   field in the encrypted part of the INTRODUCE cell with an
   EXT_FIELD_TYPE type of [01] and the contents:
     Username [00] Password.
   The username may not include any [00] bytes. The password may.
   On the server side, the password MUST be stored hashed and salted,
   ideally with scrypt or something better.
3.4.2. Ed25519-based authentication.
   To authenticate with an Ed25519 private key, the user must include an
   extension field in the encrypted part of the INTRODUCE cell with an
   EXT_FIELD_TYPE type of [02] and the contents:
        Nonce     [16 bytes]
        Pubkey    [32 bytes]
        Signature [64 bytes]
   Nonce is a random value. Pubkey is the public key that will be used
   to authenticate. [TODO: should this be an identifier for the public
   key instead?]  Signature is the signature, using Ed25519, of:
        "Hidserv-userauth-ed25519"
        Nonce       (same as above)
        Pubkey      (same as above)
        AUTH_KEYID  (As in the INTRODUCE1 cell)
        ENC_KEYID   (As in the INTRODUCE1 cell)
   The hidden service host checks this by seeing whether it recognizes
   and would accept a signature from the provided public key. If it
   would, then it checks whether the signature is correct. If it is,
   then the correct user has authenticated.
   Replay prevention on the whole cell is sufficient to prevent replays
   on the authentication.
   Users SHOULD NOT use the same public key with multiple hidden
   services.
4. The rendezvous protocol
   Before connecting to a hidden service, the client first builds a
   circuit to an arbitrarily chosen Tor node (known as the rendezvous
   point), and sends an ESTABLISH_RENDEZVOUS cell. The hidden service
   later connects to the same node and sends a RENDEZVOUS cell. Once
   this has occurred, the relay forwards the contents of the RENDEZVOUS
   cell to the client, and joins the two circuits together.
4.1. Establishing a rendezvous point [EST_REND_POINT]
   The client sends the rendezvous point a
   RELAY_COMMAND_ESTABLISH_RENDEZVOUS cell containing a 20-byte value.
            RENDEZVOUS_COOKIE         [20 bytes]
   Rendezvous points MUST ignore any extra bytes in an
   ESTABLISH_RENDEZVOUS message. (Older versions of Tor did not.)
   The rendezvous cookie is an arbitrary 20-byte value, chosen randomly
   by the client. The client SHOULD choose a new rendezvous cookie for
   each new connection attempt. If the rendezvous cookie is already in
   use on an existing circuit, the rendezvous point should reject it and
   destroy the circuit.
   Upon receiving a ESTABLISH_RENDEZVOUS cell, the rendezvous point
   associates the cookie with the circuit on which it was sent. It
   replies to the client with an empty RENDEZVOUS_ESTABLISHED cell to
   indicate success.  [TODO: make this extensible]
   The client MUST NOT use the circuit which sent the cell for any
   purpose other than rendezvous with the given location-hidden service.
   The client should establish a rendezvous point BEFORE trying to
   connect to a hidden service.
4.2. Joining to a rendezvous point [JOIN_REND]
   To complete a rendezvous, the hidden service host builds a circuit to
   the rendezvous point and sends a RENDEZVOUS1 cell containing:
       RENDEZVOUS_COOKIE          [20 bytes]
       HANDSHAKE_INFO             [variable; depends on handshake type
                                   used.]
   If the cookie matches the rendezvous cookie set on any
   not-yet-connected circuit on the rendezvous point, the rendezvous
   point connects the two circuits, and sends a RENDEZVOUS2 cell to the
   client containing the contents of the RENDEZVOUS1 cell.
   Upon receiving the RENDEZVOUS2 cell, the client verifies that the
   HANDSHAKE_INFO correctly completes a handshake, and uses the
   handshake output to derive shared keys for use on the circuit.
   [TODO: Should we encrypt HANDSHAKE_INFO as we did INTRODUCE2
   contents? It's not necessary, but it could be wise. Similarly, we
   should make it extensible.]
4.3. Using legacy hosts as rendezvous points
   The behavior of ESTABLISH_RENDEZVOUS is unchanged from older versions
   of this protocol, except that relays should now ignore unexpected
   bytes at the end.
   Old versions of Tor required that RENDEZVOUS cell payloads be exactly
   168 bytes long. All shorter rendezvous payloads should be padded to
   this length with [00] bytes.
5. Encrypting data between client and host
   A successfully completed handshake, as embedded in the
   INTRODUCE/RENDEZVOUS cells, gives the client and hidden service host
   a shared set of keys Kf, Kb, Df, Db, which they use for sending
   end-to-end traffic encryption and authentication as in the regular
   Tor relay encryption protocol, applying encryption with these keys
   before other encryption, and decrypting with these keys before other
   encryption. The client encrypts with Kf and decrypts with Kb; the
   service host does the opposite.
6. Open Questions:
   Scaling hidden services is hard. There are on-going discussions that
   you might be able to help with. See [SCALING-REFS].
   How can we improve the HSDir unpredictability design proposed in
   [SHAREDRANDOM]? See [SHAREDRANDOM-REFS] for discussion.
   How can hidden service addresses become memorable while retaining
   their self-authenticating and decentralized nature? See
   [HUMANE-HSADDRESSES-REFS] for some proposals; many more are possible.
   Hidden Services are pretty slow. Both because of the lengthy setup
   procedure and because the final circuit has 6 hops. How can we make
   the Hidden Service protocol faster? See [PERFORMANCE-REFS] for some
   suggestions.
References:
[KEYBLIND-REFS]:
        https://trac.torproject.org/projects/tor/ticket/8106
        https://lists.torproject.org/pipermail/tor-dev/2012-September/004026.html
[SHAREDRANDOM-REFS]:
        https://trac.torproject.org/projects/tor/ticket/8244
        https://lists.torproject.org/pipermail/tor-dev/2013-November/005847.html
        https://lists.torproject.org/pipermail/tor-talk/2013-November/031230.html
[SCALING-REFS]:
        https://lists.torproject.org/pipermail/tor-dev/2013-October/005556.html
[HUMANE-HSADDRESSES-REFS]:
        https://gitweb.torproject.org/torspec.git/blob/HEAD:/proposals/ideas/xxx-on…
        http://archives.seul.org/or/dev/Dec-2011/msg00034.html
[PERFORMANCE-REFS]:
        "Improving Efficiency and Simplicity of Tor circuit
        establishment and hidden services" by Overlier, L., and
        P. Syverson
        [TODO: Need more here! Do we have any? :( ]
[ATTACK-REFS]:
        "Trawling for Tor Hidden Services: Detection, Measurement,
        Deanonymization" by Alex Biryukov, Ivan Pustogarov,
        Ralf-Philipp Weinmann
        "Locating Hidden Servers" by Lasse Øverlier and Paul
        Syverson
[ED25519-REFS]:
        "High-speed high-security signatures" by Daniel
        J. Bernstein, Niels Duif, Tanja Lange, Peter Schwabe, and
        Bo-Yin Yang. http://cr.yp.to/papers.html#ed25519
Appendix A. Signature scheme with key blinding [KEYBLIND]
  As described in [IMD:DIST] and [SUBCRED] above, we require a "key
  blinding" system that works (roughly) as follows:
        There is a master keypair (sk, pk).
        Given the keypair and a nonce n, there is a derivation function
        that gives a new blinded keypair (sk_n, pk_n).  This keypair can
        be used for signing.
        Given only the public key and the nonce, there is a function
        that gives pk_n.
        Without knowing pk, it is not possible to derive pk_n; without
        knowing sk, it is not possible to derive sk_n.
        It's possible to check that a signature make with sk_n while
        knowing only pk_n.
        Someone who sees a large number of blinded public keys and
        signatures made using those public keys can't tell which
        signatures and which blinded keys were derived from the same
        master keypair.
        You can't forge signatures.
        [TODO: Insert a more rigorous definition and better references.]
  We propose the following scheme for key blinding, based on Ed25519.
  (This is an ECC group, so remember that scalar multiplication is the
  trapdoor function, and it's defined in terms of iterated point
  addition. See the Ed25519 paper [Reference ED25519-REFS] for a fairly
  clear writeup.)
  Let the basepoint be written as B. Assume B has prime order l, so
  lB=0. Let a master keypair be written as (a,A), where a is the private
  key and A is the public key (A=aB).
  To derive the key for a nonce N and an optional secret s, compute the
  blinding factor h as H(A | s, B, N), and let:
      private key for the period:   a' = h a
      public key for the period:    A' = h' A = (ha)B
  Generating a signature of M: given a deterministic random-looking r
  (see EdDSA paper), take R=rB, S=r+hash(R,A',M)ah mod l. Send signature
  (R,S) and public key A'.
  Verifying the signature: Check whether SB = R+hash(R,A',M)A'.
  (If the signature is valid,
       SB = (r + hash(R,A',M)ah)B
          = rB + (hash(R,A',M)ah)B
          = R + hash(R,A',M)A' )
  See [KEYBLIND-REFS] for an extensive discussion on this scheme and
  possible alternatives. I've transcribed this from a description by
  Tanja Lange at the end of the thread. [TODO: We'll want a proof for
  this.]
  (To use this with Tor, set N = INT_8(period-number) | INT_8(Start of
  period in seconds since epoch).)
Appendix B. Selecting nodes [PICKNODES]
  Picking introduction points
  Picking rendezvous points
  Building paths
  Reusing circuits
  (TODO: This needs a writeup)
Appendix C. Recommendations for searching for vanity .onions [VANITY]
  EDITORIAL NOTE: The author thinks that it's silly to brute-force the
  keyspace for a key that, when base-32 encoded, spells out the name of
  your website. It also feels a bit dangerous to me. If you train your
  users to connect to
      llamanymityx4fi3l6x2gyzmtmgxjyqyorj9qsb5r543izcwymle.onion
  I worry that you're making it easier for somebody to trick them into
  connecting to
      llamanymityb4sqi0ta0tsw6uovyhwlezkcrmczeuzdvfauuemle.onion
  Nevertheless, people are probably going to try to do this, so here's a
  decent algorithm to use.
  To search for a public key with some criterion X:
        Generate a random (sk,pk) pair.
        While pk does not satisfy X:
            Add the number 1 to sk
            Add the scalar B to pk
        Return sk, pk.
  This algorithm is safe [source: djb, personal communication] [TODO:
  Make sure I understood correctly!] so long as only the final (sk,pk)
  pair is used, and all previous values are discarded.
  To parallelize this algorithm, start with an independent (sk,pk) pair
  generated for each independent thread, and let each search proceed
  independently.
Appendix D. Numeric values reserved in this document
  [TODO: collect all the lists of commands and values mentioned above]
                    
                  
                  
                          
                            
                            8
                            
                          
                          
                            
                            22
                            
                          
                          
                            
    
                          
                        
                    
                    
                        I have been looking at doing some work on Tor as part of my degree, and
more specifically, looking at Hidden Services. One of the issues where I
believe I might be able to make some progress, is the Hidden Service
Scaling issue as described here [1].
So, before I start trying to implement a prototype, I thought I would
set out my ideas here to check they are reasonable (I have also been
discussing this a bit on #tor-dev). The goal of this is two fold,  to
reduce the probability of failure of a hidden service and to increase
hidden service scalability.
I think what I am planning distils down to two main changes. Firstly,
when a OP initialises a hidden service, currently if you start a hidden
service using an existing keypair and address, the new OP's introduction
points replace the existing introduction points [2]. This does provide
some redundancy (if slow), but no load balancing.
My current plan is to change this such that if the OP has an existing
public/private keypair and address, it would attempt to lookup the
existing introduction points (probably over a Tor circuit). If found, it
then establishes introduction circuits to those Tor servers.
Then comes the second problem, following the above, the introduction
point would then disconnect from any other connected OP using the same
public key (unsure why as a reason is not given in the rend-spec). This
would need to change such that an introduction point can talk to more
than one instance of the hidden service.
These two changes combined should help with the two goals. Reliability
is improved by having multiple OP's providing the service, and having
all of these accessible from the introduction points. Scalability is
also improved, as you are not limited to one OP (as described above,
currently you can also have +1 but only one will receive most of the
traffic, and fail over is slow).
I am aware that there are several undefined parts of the above
description, e.g. how does a introduction point choose what circuit to
use? but at the moment I am more interested in the wider picture. It
would be good to get some feedback on this.
1: https://blog.torproject.org/blog/hidden-services-need-some-love
2:
http://tor.stackexchange.com/questions/13/can-a-hidden-service-be-hosted-by…
                    
                  
                  
                          
                            
                            11
                            
                          
                          
                            
                            44
                            
                          
                          
                            
    
                          
                        
                    
                    
                        Here is a repository containing a simple HTTP-based transport.
	git clone https://www.bamsoftware.com/git/meek.git
	cd meek/meek-client
	export GOPATH=~/go
	go get
	go build
	tor -f torrc
Usually when you think of an HTTP transport, you think of something that
steganographically tries to make something look like plain HTTP requests
and responses. Try and forget that idea for now, because that's not what
I have in mind.
The protocol is simple. The client generates a random string to serve as
a session id. It puts this session id in a POST to the server. The
server has a map from session ids to ORPort connections; if the POST's
id is not in the map, the server creates a new ORPort connection,
otherwise it uses an existing one. The server copies the POST body to
the ORPort, and copies a block of data from the ORPort to the HTTP
response. The client receives the response, and when it has more to
send, it does another POST (with the same session id). Then repeat.
How do we prevent 1) fingerprinting of the HTTP requests and 2) blocking
of the HTTP server? The answer to (1) is that the HTTP requests are
really HTTPS: the censor gets to see where they are going but not what
is inside them. The answer to (2) is hinted at by the Bridge line:
	Bridge meek 0.0.2.0:1 url=https://meek-reflect.appspot.com/ front=www.google.com
We use Google App Engine as a middleman, using a trick to make it look
as if we're talking to www.google.com. This transport can get through as
long as https://www.google.com/ is unblocked, even if App Engine is
blocked. (Up to things like TLS fingerprinting and traffic analysis,
which we need to think about.)
Like flash proxy, this transport doesn't need bridge distribution. The
torrc has everything you need to make it work. Unlike flash proxy, it
works without any port forwarding games.
Now for the big ideas. This transport is similar to
https://trac.torproject.org/projects/tor/wiki/doc/GoAgent in its use of
App Engine. However, GoAgent requires every user to upload their own
instance of the app server. I propose that we run a server for use by
the public and see how much it costs. (App Engine's free tier gives you
1 GB a day and above that it costs money.) If the cost is comparable to
that of running a fast relay, it might make sense to fund on an ongoing
basis. As a student I have $1000 in App Engine credit that I wouldn't
mind burning on the experiment.
A simple PHP script can do the work of the App Engine server component:
all it does is copy HTTP requests and responses. By using PHP as a
middleman, you lose Google's too-big-to-fail unblockability, but you
gain an easy way to set up lots of bridges. Such a PHP bridge would not
even require a shell account, just a PHP web host. Conceivably such
bridges could even be distributed through BridgeDB.
Thinking about transport composition, scramblesuit|meek could be an
interesting thing. What this would mean is that your client makes an
HTTP request to some server, containing a POST body with the beginning
of a ScrambleSuit conversation. If you have the shared secret, the
server replies with 200 and you start communication. If you don't have
the shared secret, the server replies with a 404 (or even 200 with an
ordinary web page). What it means is that there can be a magic URL that
only you (holder of the shared secret) can use as a bridge. It could
even be on a real web site with real pages and everything. ScrambleSuit
would additionally provide some diversity of packet lengths and timing.
The Google fronting trick, it turns out, also works on CloudFlare sites,
which are many. If we ran a bridge as a web app on CloudFlare, even if
our web app is blocked, a censored user could access it through the name
of any CloudFlare site that supports HTTPS. There may be other CDN-like
systems that work similarly.
The software is working (try it!) though of course there are always lots
of things to do. I'd like the client to be able to pin the certificate
of www.google.com. We need the client to use TLS that looks like that of
a browser (now it is just using Go's built-in HTTPS support). There are
some constant buffer sizes and polling timeouts; they can probably be
tuned for better performance.
David Fifield
                    
                  
                  
                          
                            
                            1
                            
                          
                          
                            
                            2
                            
                          
                          
                            
    
                          
                        
                     
                        
                    
                        
                            
                                
                            
                            Using MaxMind's GeoIP2 databases in tor, BridgeDB, metrics-*, Onionoo, etc.
                        
                        
by Karsten Loesing 26 Feb '14
                    by Karsten Loesing 26 Feb '14
26 Feb '14
                    
                        Hi devs,
you probably know that we use MaxMind's GeoIP database in various of our
products (list may not be exhaustive):
 - tor: We ship little-t-tor with a geoip and a geoip6 file for clients
to support excluding relays by country code and for relays to generate
by-country statistics.
 - BridgeDB: I vaguely recall that the BridgeDB service uses GeoIP data
to return only bridges that are not blocked in a user's country.  Or
maybe that was a feature yet to be implemented.
 - Onionoo: The Onionoo service uses MaxMind's city database to provide
location information of relays.  (It also uses MaxMind's ASN database to
provide information on AS number and name.)
 - metrics-db: I'm planning to use GeoIP data to resolve bridge IP
addresses to country codes in the bridge descriptor sanitizing process.
 - metrics-web: We have been using GeoIP data to provide statistics on
relays by country.  This is currently disabled because the
implementation was eating too many resources, but I plan to put these
statistics back.
However, the GeoIP database that we currently use has a big shortcoming:
it replaces valid country codes with A1 or A2 whenever MaxMind thinks
that a relay is an "anonymizing proxy" or "satellite provider".
That's why we currently repair their database by either automatically
guessing what country code an A1 entry could have had [1, 2], or by
manually looking it up in RIR delegation files [3, 4].  This is just a
workaround.  Also, I think BridgeDB doesn't repair its GeoIP database.
Here's the good news: MaxMind now provides their databases in new
formats which provide the A1/A2 information in *addition* to the correct
country codes [5, 6].  We should switch!
How do we switch?  First option is to ship their binary database files
and include their APIs [7] in our products.  Looks there are APIs for C,
Java, and Python, so all the languages we need for the tools listed
above.  Pros: we can kick out our parsing and lookup code.  Cons: we
need to check if their licenses are compatible, we have to kick out our
parsing and lookup code and learn their APIs, and we add new dependencies.
Another option is to write a new tool that parses their full databases
and converts them into file formats we already support.  (This would
also allow us to provide a custom format with multiple database versions
which would be pretty useful for metrics, see #6471.)  Also, it looks
like their license, Creative Commons Attribution-ShareAlike 3.0
Unported, allows converting their database to a different format.  If we
want to write such a tool, we have a few options:
 - We use their database specification [8] and write our own parser
using a language of our choice (read: whoever writes it pretty much
decides).  We could skip the binary search tree part of their files and
only process the contents.  Whenever they change their format, we'll
have to adapt.
 - We use their Python API [9] to build our parser, though it looks like
that requires pip or easy_install and compiling their C API.  I don't
know enough about Python to assess what headaches that's going to cause.
 - We use their Java API [10] to build our parser, though we're probably
forced to use Maven rather than Ant.  I don't have much experience with
Maven.  Also, using Java probably makes me the default (and only)
maintainer, which I'd want to avoid if possible.
Thoughts?  What other options did I miss, and what pros and cons that I
overlooked?
And is this something that people on this list would want to help with,
once we agreed on one of the options?  If so, please feel free to join
the discussion now and maybe influence which path we're going to take.
All the best,
Karsten
[1]
https://gitweb.torproject.org/tor.git/blob/HEAD:/src/config/deanonymind.py
[2]
https://gitweb.torproject.org/onionoo.git/blob/HEAD:/geoip/deanonymind.py
[3] https://gitweb.torproject.org/tor.git/blob/HEAD:/src/config/geoip-manual
[4] https://gitweb.torproject.org/onionoo.git/blob/HEAD:/geoip/geoip-manual
[5] http://dev.maxmind.com/geoip/geoip2/whats-new-in-geoip2/
[6] http://dev.maxmind.com/geoip/geoip2/geolite2/
[7] http://dev.maxmind.com/geoip/geoip2/downloadable/
[8]
https://github.com/marklr/MaxMind-IPDB-perl/blob/master/docs/MaxMind-IPDB-s…
[9] https://pypi.python.org/pypi/geoip2
[10] https://github.com/maxmind/MaxMind-DB-Reader-java
                    
                  
                  
                          
                            
                            2
                            
                          
                          
                            
                            4
                            
                          
                          
                            
    
                          
                        
                    
                    
                        A Threshold Signature-based Shared RNG
0.  Overview
This proposal is for a design of a shared RNG to address ticket #8244 -
"The HSDirs for a hidden service should not be predictable indefinitely into the
future," /without/ requiring byzantine agreement protocols at each RNG period.
The basic idea of this proposal is to have directory authorities use a
deterministic threshold signature scheme to sign Hash(time-period) in
each consensus document.  This is done simply by having each authority
contribute a publicly-verifiable share of the signature during the
voting process.  As long as at least ceiling(n/2) authorities
are honest, the signature can be re-constructed by each client.  Since
the signature is hard to compute if you don't control at least
ceiling(n/2) authorities, its inclusion in the input to a hash
function (e.g. for
use in constructing the HS hash ring) will make the output of the hash
function pseudorandom.  (In the strong cryptographic sense that unless
you can break the signature scheme, you can't distinguish the hash
from a truly random string of the same length)
1. Notation and such
This protocol can work over a group G, using a subgroup generated by
an element B of prime order p.  We'll use Curve25519 [Curve25519] for
concreteness, but any group where the Computational Diffie-Hellman
problem is difficult would do.
We'll denote multiplication of scalars (e.g. mod p) by *, e.g. a*b;
we'll denote multiplication of a group element P by scalar a by a.P.
We will make use of Shamir threshold secret sharing, described, e.g.
in Handbook of Applied Cryptography [HAC], over the integers mod p.  For a
subset S = {P_1,...,P_t}  of size t, we'll denote the lagrange
multipliers for S by m_1,...m_t (eg. if each P_i has share s_i of
secret x, then m_1*s_1 + m_2*s_2 + ... + m_t*s_t = x.)
We assume that each authority S_i has a publicly known verification
key VK_i and signing key sk_i for a digital signature scheme (Sign,
Verify).  We model network communications as point-to-point with
bounded delay and assume at most t-1 authorities are compromised.
(Without these assumptions, the consensus voting protocol is also
broken)
We'll need a hash function H that can output elements of the subgroup
generated by B with unknown discrete logarithms.  For Curve25519, it
should work to compute H(x) by computing SHA256(x), and mapping this
256-bit string to the curve as in the Curve25519 paper.  A security
proof would model this H as a random oracle.
We'll need a distributed threshold key generation algorithm which
works as follows:
    Input: Parties S_1,...,S_n
    Output: to all players: group public key P = x.B. player public
key shares s_1.B, ..., s_n.B
         to each player P_i: secret key s_i)
such that the secret keys s_i are (n,t) secret shares of the (unknown)
secret key x.  We discuss choices to implement this protocol in section [DKG]
We will also require a noninteractive zero-knowledge "signature of
knowledge" that can be bound to a message m, to prove that two points
P, Q have a common discrete logarithm to bases B and R respectively.
We denote this proof by
   SK { s : s.B = P && s.R = Q } (m)
and discuss its implementation in section [SKDH].
2. Overall Protocol description [CONSENSUS-RANDOM]
The protocol requires that periodically the authorities run the
distributed key generation procedure to produce the threshold public
key P = s.B, and public key shares P_1 = s_1.B, ... , P_n =
s_n.B. This can be done infrequently, since its only purpose is to
limit key exposure.
For the time period starting at time T, the authorities generate the
shared random
value as follows:
Step 1.  [Period Base]
Each authority (and anyone else interested) generates the "time
period base point"  R in G, by computing
       R = H("tor-hs-rand-base-point " || period valid-after time)
Step 2. [Signature Share]
Each authority S_i uses its secret share s_i and signing key sk_i to
generate its signature share SHARE_i as follows:
     SHARE_i = R || Q_i || PROOF_i || Sign_i(R || Q_i || PROOF_i)
     Q_i         = s_i.R
     PROOF_i = SK { s_i : s_i.B == P_i && s_i.R == Q_i }(S_i || T)
Each SHARE_i is appended by S_i to the network consensus document for
time period
T in a separate section.
Step 3. [Validation]
Tor clients locally validate SHARE_i from server S_i for time period T as
follows:
  i.  Verifythe signature using verification key vk_i.
 ii.  Check that R = H("tor-hs-rand-base-point " || T).
iii.  Verify PROOF_i for points (P_i, Q_i) with bases (B,R) .
SHARE_i is considered valid if and only if all three checks succeed.
Step 4. [Aggregation]
If at least t = ceiling(n/2) valid shares appear in the consensus, Tor
clients locally compute the randomizer string RAND for time period T
as follows:
i.  Let (S[j], Q[j]) for j = 1,...,t denote the identities (S_i) and signature
shares (Q_i) associated with t valid SHARE_i  values.
ii.  Compute the Lagrange multipliers m[j] for x points S[1]...S[t]
iii. Compute RAND = m[1].Q[1] + m[2].Q[2] + ... + m[t].Q[t]
Notice that RAND = m[1]*s[1].R + ... + m[t]*s[t].R = x.R
If there are not enough valid shares, the protocol fails; this
indicates that a majority of the directory authorities are faulty or
compromised.  If we wanted a fail-open solution, possibilities include
hashing the
list of valid SHARE values, or taking the hash of the previous
consensus RAND value. [XXX - other options or opinions?]
3. Signature of knowledge scheme [SKDH]
Given values (R,S), bases (P,Q), and exponent s such that R=s.P, S = s.Q,
we use the "Fiat-Shamir Heuristic" with the proof of equality of
discrete logarithms due to Chaum-Pedersen [CP92, Sec. 3.2] to generate
SK { s : s.P == R && s.Q == S } (m)
As follows:
 i. Choose random integer a in [0,p-1]
ii. Compute:
    U = a.P
    V = a.Q
    c = Hash(U || V || m)
    z = a + s*c (mod p)
iii. Output (U,V,z)
The proof is verified by computing
       cc = Hash(U || V || m)
and checking that
       z.P == U + cc.R
       z.Q == V + cc.S
4. Distributed Key Generation [DKG]
We'll need a protocol satisfying the following specs:
    Input: Parties S_1,...,S_n
    Output: to all players: group public key P = x.B. player public
key shares s_1.B, ..., s_n.B
         to each player P_i: secret key s_i
such that the secret keys s_i are (n,t) secret shares of the (unknown)
secret key x.
There are several options for such a protocol, depending on what we
want to assume about the network over which the authorities will run
the protocol (Presumably, the Internet):
- If we assume "weak synchrony" (messages are eventually delivered
  without unbounded delay), the protocol of Kate and Goldberg [KG09]
  can tolerate floor((n-1)/3) corruptions and still run to completion.
- If we assume bounded delay but allow "rushing" adversaries and
  adaptive corruptions, the protocol of Garay et al [GKKZ11] can be
  used to realize the broadcast channel required for Pedersen's
  protocol [Ped91] while tolerating up to t-1 = floor(n/2) corruptions.
- If we assume bounded delay and static corruptions, the protocol of
  Dolev and Strong [DS83] can be used to realize the broadcast channel
  required for Pedersen's protocol and tolerate up to t-1 corruptions.
I'm not an expert on byzantine agreement protocols, so it is possible that
there exist other, more easy to implement/manage options as well.
5. References
[CP92] David Chaum, Torben Pryds Pedersen. "Wallet Databases with Observers",
CRYPTO 92, pp 89-105.  http://link.springer.com/chapter/10.1007/3-540-48071-4_7
[CURVE25519] Daniel J. Bernstein. "Curve25519: new Diffie-Hellman
speed records", PKC 2006, pp 207-228.  http://cr.yp.to/papers.html#curve25519
[DS83] D. Dolev and H. Strong. "Authenticated algorithms for Byzantine
agreement", SIAM J. Computing 12(4):656-666, 1983.
http://www.cse.huji.ac.il/~dolev/pubs/authenticated.pdf
[GKKZ11] Juan Garay, Jonathan Katz, Ranjit Kumaresan, Hong-Sheng
Zhou. "Adaptively Secure Broadcast, Revisited", PODC 11, pp 179-186.
http://chess.cs.umd.edu/~jkatz/papers/asb-final.pdf
[HAC] Alfred Manezes, Paul van Oorschot, Scott Vanstone. "Handbook of
Applied Cryptography", CRC Press, 1996.  http://cacr.uwaterloo.ca/hac/
[KG09] Aniket Kate, Ian Goldberg. "Distributed Key Generation for the
Internet", 29th International Conference on Distributed Computing
Systems, June 2009.   https://cs.uwaterloo.ca/~iang/pubs/DKG.pdf
[Ped91] Torben Pryds Pedersen. "Non-interactive and
information-theoretic secure verifiable secret sharing," CRYPTO91.
http://www.cs.huji.ac.il/~ns/Papers/pederson91.pdf
-- 
------------------------------------------------------------------------
Nicholas Hopper
Associate Professor, Computer Science & Engineering, University of Minnesota
Visiting Research Director, The Tor Project
------------------------------------------------------------------------
                    
                  
                  
                          
                            
                            3
                            
                          
                          
                            
                            7
                            
                          
                          
                            
    
                          
                        
                    
                    
                        If you're into C++ or Qt, we could use some help. Mumble[0] is a free
software VoIP application that various people have had success
using with Tor. One problem using Mumble with Tor is that Mumble
doesn't abide by the native proxy settings when looking up Mumble
servers[1]. When using a Tor router device or Torsocks, the DNS
requests are just dropped. When using the native proxy settings, the
DNS requests still get sent out to the local ISP, not through the
proxy. 
The desired outcome is that if a SOCKS5 proxy is set, DNS requests get
sent through the SOCKS5 proxy. The proxy is set as g.s.ptProxyType, and
gets set when main.cpp calls NetworkConfig::SetupProxy(), which in turn
uses QNetworkProxy::setApplicationProxy. 
The window that makes these DNS requests is ConnectDialog.cpp, but
you'll want to look at Cert.cpp as well. 
I submitted a pull request[2] that allows users to completely turn off
automatic UDP and DNS traffic sent out to the big list of public Mumble
Servers. This does not stop the automatic DNS traffic that gets sent to
servers on the Favorite servers list though, which is where any
private servers you might want to connect to would be. 
Getting this fixed would be great for Tor because it would make VoIP
over Tor easier and safer for people on all platforms. Please help!
[0]: http://mumble.info
[1]: https://github.com/mumble-voip/mumble/issues/1033
[2]: https://github.com/mumble-voip/mumble/pull/1128
                    
                  
                  
                          
                            
                            3
                            
                          
                          
                            
                            2
                            
                          
                          
                            
    
                          
                        
                     
                        
                    30 Jan '14
                    
                        Greetings humans,
this is an email to remind you that the regular biweekly pluggable
transports meeting is going to happen tomorrow. Place is the #tor-dev
IRC channel in OFTC. Time is 17:00 UTC.
Cheers!
                    
                  
                  
                          
                            
                            1
                            
                          
                          
                            
                            0
                            
                          
                          
                            
    
                          
                        
                    
                    
                        Hi all. This month I decided to switch from a depth-first to breadth-first
approach for overhauling arm. Changes included...
* Made arm PEP8 compliant. This took around a week during which sed and I were
  close friends. Like Stem, both pyflakes and pep8 checks are now included in
  arm's test runs.
* Completely dropped tor_tools from the codebase. This is a neat milestone as
  tor_tools was a 2,700 line wrapper around TorCtl providing a friendlier API,
  thread safety, caching, etc. Stem provides all of these capabilities.
  Initially migration was simply switching tor_tools from TorCtl to Stem,
  but now arm works directly with a Stem controller allowing us to drop
  the tor_tools module completely.
* Finished arm's new tracker module which provides tor resource usage,
  connections, and application names. It's not just much nicer code, but
  now has unit tests!
Why am I sinking all this effort into arm? After all, it presently does pretty
much what we want, right?
Well, yes. And I suspect user's won't see many obvious changes in the next arm
release. Rather, my present focus on arm is partly for code quality but more
importantly to make sure Stem is ready for prime time.
Arm and Vidalia are our two most complicated and demanding controller
applications, and making a Stem-based arm release is a surprisingly good
way of making sure Stem has all the pieces we need for tasks like a next-gen
Tor GUI or a localhost status panel for relay operators.
As I've rewritten arm Stem has improved as well. Stem changes this month
included...
* Added a port_usage() function for getting a description of a port's common
  usage. This nicely complements our other connection utils...
  >>> from stem.util.connection import port_usage
  >>> port_usage(80)
  'HTTP'
  >>> port_usage(22)
  'SSH'
  >>> port_usage(9418)
  'Git'
* Couple new Controller methods to help with NEWNYM handling:
  is_newnym_available() and get_newnym_wait().
* Pyflakes and pep8 are python modules so why shell out to them? We now use
  their APIs instead reduced the runtime of Stem's static checks by 60%.
* Greatly reduced the verbosity of stem's test output. You can still get the
  previous, more detailed results with the '--verbose' argument.
Last but not least, non-development tasks from this month included...
* Finally moved doctor to its new home on cappadocicum. (#10413)
* Submitted my Tor Ecosystem talk to LinuxFest Northwest...
  http://linuxfestnorthwest.org/2014/sessions/tor-ecosystem-developers-guide-…
* Tickets for Iceland. Looking forward to seeing everyone there!
Cheers! -Damian
                    
                  
                  
                          
                            
                            1
                            
                          
                          
                            
                            0
                            
                          
                          
                            
    
                          
                        
                    
                    
                        I'm interested in writing a pluggable transport for Tor.  Is this spec a
good place to start?
https://gitweb.torproject.org/torspec.git/blob/HEAD:/proposals/180-pluggabl…
Cheers,
Ox
-----------------------------------------------------------------------------------------
"I love people who harness themselves, an ox to a heavy cart,
who pull like water buffalo, with massive patience,
who strain in the mud and the muck to move things forward,
who do what has to be done, again and again."
- Marge Piercy
                    
                  
                  
                          
                            
                            4
                            
                          
                          
                            
                            5
                            
                          
                          
                            
    
                          
                        
                    
                    
                        To achieve offline key storage in the new HS design, hidden service
are using three layers of keys:
(Skip the next three paragraphs if you know this stuff)
Each hidden service has a "long-term master identity key". This is the
key that is encoded in its onion address.
Using the long-term identity key, the hidden service generates
"ephemeral blinded signing" keys according to #8106. This key lasts
for a short period of time (probably for a day or so; not decided
yet).
When the hidden service needs to make a new HS descriptor, it
generates a "descriptor signing keypair" and signs it with the blinded
signing key. It then includes in the descriptor the public part of the
descriptor signing key as well as its signature by the blinded signing
key. It finally signs the descriptor with the private part of the
descriptor signing key.
As described in the spec [0] this allows the HS to generate offline a
bunch of blinded signing keys and descriptor signing keys, copy them
to the online HS host, and let the host use those keys for the next
few days or so. As a result, an attacker that owns the online HS host
only gets access to the keys for a few days ahead. The attacker
doesn't get access to the long-term identity keys.
Of course, this doesn't solve the problem of how Hidden Services can
revoke compromised keys. Having the attacker impersonate the HS even
for a few days is not acceptable. Unfortunately, PKI revocation
solutions are always messy and don't work really well (look at SSL's
OCSP and CRLs).
The question becomes how the legitimate Hidden Service can inform a
client that its keys got compromised. A client that connects to a
Hidden Service first fetches the consensus from the directory system,
then fetches the HS descriptor from the HSDirs and finally connects to
the HS. This probably means that the client should be informed of the
compromise either by the directory system or by the HSDirs.
If we wanted to use the directory system, we could imagine some sort
of CRL scheme where hidden services notify the authorities about
compromised keys, then the authorities pass the CRL to the directory
servers, and the directory servers finally give the CRL to HS
clients. This system might be possible theoretically, but it poses
many engineering and security questions:
* Are users supposed to fetch the CRL everytime they connect to a HS?
  This might be dangerous identifying behavior.
* What happens if the CRL gets too big because the adversary fills it
  with fake revocations? The directory system can't handle big
  documents so there will need to be some sort of size limit (and old
  entries will need to be resetted frequently).
* The list of compromised keys suddenly becomes public
  information. This might not be a good idea.
On the other hand, if we wanted to use the HSDirs, we could imagine
the HS sending some sort of revocation message to the responsible
HSDirs so that they stop serving descriptors with compromised
keys. Unfortunately, this scheme treats HSDirs as trusted parties,
since they can simply ignore the revocation and continue passing the
evil descriptor to clients. We could decrease the chance of this
happening by implementing #8244 and also having the clients use
multiple HSDirs to fetch descriptors. This will force the attacker to
corrupt multiple HSDirs for the attack to succeed. Still the solution
is not very elegant.
What else should we consider?
For example (crazy ideas ahead) could we solve this problem more
elegantly by adding more layers of keys? Or maybe by adding yet
another network entity which handles revocations [like in Dan Boneh et
al. "A Method for Fast Revocation of Public Key Certificates and
Security Capabilities." paper (be aware of crazy crypto)]?
Or maybe we could look into certificate transparency; although
transparency for centralized PKIs (like SSL-CA) sounds like a good
idea, transparency for decentralized PKI systems with complex privacy
threat models might not be a good idea.
[0]: https://gitweb.torproject.org/torspec.git/blob/HEAD:/proposals/224-rend-spe…
                    
                  
                  
                          
                            
                            2
                            
                          
                          
                            
                            1
                            
                          
                          
                            
    
                          
                        
                     
                        
                     
                        
                     
                        
                     
                        
                     
                        
                    