Global Lambda Integrated Facility

Subject Re: [GLIF controlplane] RE: Network Control Architecture
From Jerry Sobieski <jerrys@xxxxxxxxxxxxxx>
Date Fri, 20 Apr 2007 14:47:46 -0400

Good comments both Steve and Bert...let me chime in: (this is a bit long, but I think it is relevant)

I too think the reservation phase in each domain must be atomic - there are effective ways to do this. The overall process, though, becomes two-phase: HOLD a resource for some finite holding time and provide an ACK to the requester. At some later time the RM will receive a CONFIRM from the requester, or a RELEASE. If the hold time expires, the resource is released unilaterally. On a macro basis, the reservation of the entire end-to-end lightpath must also be kept in the HOLD state while the rest of the application resources are reserved, as there may be a dependency between the availability of non-network resources and the reserved lightpath. As Steve suggests, this atomic two-phase mechanism is used in many other similar reservation systems.
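As a concrete illustration, here is a minimal sketch of that HOLD/CONFIRM/RELEASE cycle with a unilateral expiry. The class and method names (hold, confirm, release) are my own shorthand, not any real GLIF or RM interface:

```python
import threading
import time
import uuid

class ResourceManager:
    """Sketch of a two-phase reservation: phase 1 atomically HOLDs a
    resource and ACKs with a ticket; phase 2 CONFIRMs before the hold
    time expires, otherwise the resource is released unilaterally."""

    def __init__(self, hold_timeout=30.0):
        self.hold_timeout = hold_timeout
        self.holds = {}        # ticket -> expiry deadline
        self.confirmed = set()
        self.lock = threading.Lock()

    def hold(self, resource):
        """Phase 1: place the resource in HOLD; the returned ticket
        is the ACK sent back to the requester."""
        with self.lock:
            ticket = str(uuid.uuid4())
            self.holds[ticket] = time.monotonic() + self.hold_timeout
            return ticket

    def confirm(self, ticket):
        """Phase 2: lock in the reservation, but only if the hold
        has not already expired."""
        with self.lock:
            deadline = self.holds.pop(ticket, None)
            if deadline is None or time.monotonic() > deadline:
                return False   # expired or unknown: released unilaterally
            self.confirmed.add(ticket)
            return True

    def release(self, ticket):
        """Requester gives up the hold explicitly."""
        with self.lock:
            self.holds.pop(ticket, None)
```

A lock around each transition is what makes the per-domain step atomic; the macro-level end-to-end hold is just this same pattern applied to every domain's ticket at once.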

The issue I am concerned about is the roles of the RB and the RM. I think the RBs will be numerous - possibly one for every user. I believe we must assume that all networks will default to a stringent "self-secure" stance and will only allow access to their RM from known and trusted peers. It doesn't scale for every network to "know" about every other RB in the world (RBs are agents of the user - not of the network). Therefore, for scalability and security reasons, these resource reservation requests must be made between directly peering networks, and each network is responsible for recursively reserving the resources forward toward the destination. This is still the two-phase commit described above, but it solves two problems: a) it scales much better, as each network only needs to expect queries from its direct peers (and customers), and b) it allows each network to negotiate aggregation policies with its peers for services (enabling economies of scale and global reach). This is not unlike how we place a phone call to anywhere in the world - we don't go asking each network if we can use it; we ask our service provider, they ask theirs, and so on, and so on...
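The recursive forwarding can be sketched as follows. Each network reserves its own segment and then asks its next-hop peer to do the same toward the destination, unwinding (releasing) on failure. This is purely illustrative - the Network class, peer table, and return values are assumptions, not a defined protocol:

```python
class Network:
    """Sketch of recursive reservation between directly peering
    networks: each RM holds its local segment, then forwards the
    remainder of the request to its next-hop peer toward dest."""

    def __init__(self, name, peers=None):
        self.name = name
        self.peers = peers or {}  # destination name -> next-hop Network
        self.held = []            # segments currently in HOLD

    def reserve(self, dest):
        """Hold a local segment, then recurse toward dest.
        Returns the chain of networks holding segments, or None (NACK)."""
        self.held.append(dest)
        if dest == self.name:
            return [self.name]           # reached the destination network
        nexthop = self.peers.get(dest)
        if nexthop is None:
            self.held.pop()
            return None                  # no route: NACK back upstream
        tail = nexthop.reserve(dest)
        if tail is None:
            self.held.pop()              # release our hold on failure
            return None
        return [self.name] + tail
```

The key property is that every RM only ever talks to its direct peers, yet the requester still ends up with an end-to-end chain of holds - or a clean NACK with nothing left held.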

The above scenario assumes the RB poses the service request to the RM serving the source end of a path. There is a [common?] case where the RB is not at the endpoint(s) and does not know of any RMs at the endpoints (or in the middle, for that matter). This brings us to another assumption I think we must make: an RB only knows its *local* network RM. An appropriately designed algorithm should/could forward the request to the source-address RM using the same forwarding process as the reservation (but against the grain, back toward the source), and then the request can be serviced forward normally as described above. (This is the "third party" provisioning scenario.) An alternative model assumes a "minion" agent at the path endpoints that is owned by the end user and knows of its local RM - the minion agent acts as a proxy for the RB and makes the reservation request to the minion's RM. (Got that? :-) I think we *can* assume that the RB knows of these minions, since they reside at the endpoints (source or destination) at a well-known port.

It is important to note that this process relies on each network RM (not the RB) knowing constrained reachability of all endpoints - not unlike current interdomain routing protocols. This allows the RM to postulate which "nexthop" network will provide the best path and try that first. If the RM knows more than just reachability - i.e. if it knows topology - then the RM can select a more specific candidate path and, via authorized recursive queries, reserve the resources. Only the RM responsible for a network knows the state and availability details associated with the internal network resources, and therefore only the local RM can authoritatively and atomically reserve the resources in that network.
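The "postulate the best nexthop and try that first" behavior might look like this. The reachability structure, cost metric, and `try_reserve` callback are all hypothetical stand-ins for whatever the routing exchange and recursive peer query actually provide:

```python
def reserve_via_best_peer(reachability, dest, try_reserve):
    """Sketch: rank candidate next-hop networks for dest using
    constrained-reachability info (as in interdomain routing), try the
    best candidate first, and fall back to the next on a NACK.

    reachability: dict mapping dest -> list of {"name", "cost"} peers
    try_reserve:  callable(peer_name, dest) -> ticket or None (NACK),
                  standing in for the authorized recursive query to
                  the peer RM."""
    for peer in sorted(reachability.get(dest, []), key=lambda p: p["cost"]):
        ticket = try_reserve(peer["name"], dest)
        if ticket is not None:
            return ticket   # peer's RM reserved its resources
    return None             # NACK: no candidate peer could reach dest
```

Note that this function never touches another network's internal state - it only asks; the peer's RM remains the sole authority over its own resources.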

The beauty of this process is that from the RB perspective, the RB need only ask one RM for the entire end-to-end network path. The RM will either return a ticket indicating a path was successfully reserved that meets the requested service characteristics, or a NACK indicating that the resource was not available for some reason. The user must change the requested service parameters somehow before trying again (i.e. change the source or destination address, the start time, the capacity, etc.).

As Gigi states, once all application resources are reserved in the HOLD state, all must then be CONFIRMed, which locks in the reservation. At some delta-t later (which could be 0), a separate process causes the reconfiguration of the network elements to make the reserved resources available for actual use (i.e. the provisioning or signaling process). This process must be correlated to a previous reservation, so the provisioning request (separate from the reservation request) must contain some indicator that is trusted by the network and identifies which reservation is being placed into service (see Leon's work on AAA).
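One way to make that indicator trustworthy is to bind the reservation ID to a secret held by the network, so the later provisioning request can be verified without a lookup round-trip. This is only a sketch of the idea - the real mechanism would come from the AAA framework referenced above, and the secret, token format, and function names here are all assumptions:

```python
import hashlib
import hmac

# Hypothetical secret known only to this network's RM.
SECRET = b"network-rm-secret"

def issue_token(reservation_id):
    """At CONFIRM time: return (id, tag), where the tag is an
    unforgeable MAC over the reservation ID."""
    mac = hmac.new(SECRET, reservation_id.encode(), hashlib.sha256)
    return reservation_id, mac.hexdigest()

def verify_token(reservation_id, tag):
    """At provisioning time: check the tag before reconfiguring any
    network elements for this reservation."""
    mac = hmac.new(SECRET, reservation_id.encode(), hashlib.sha256)
    return hmac.compare_digest(mac.hexdigest(), tag)
```

The point is simply that the provisioning request carries proof it refers to a reservation this network actually granted, keeping the reservation and provisioning processes separate but correlated.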

Note that none of the above is predicated on any particular routing or signaling protocol... That being said (:-), DRAGON has implemented much of this functionality using GMPLS protocols:

- The DRAGON Network Aware Resource Broker (NARB) is analogous to the network RM and performs the path computation, recursively reserving the resources along the way. It returns a path reservation in the form of an Explicit Route Object (ERO) to the source requester. This loose-hop ERO specifies a path consisting of ingress and egress points at each network boundary.
- RSVP then uses this ERO to provision the multi-domain end-to-end path.
- The DRAGON Application Specific Topology "Master" is an agent analogous to the RB mentioned above. The AST Master queries all the various resource managers (compute nodes, storage, instruments, network, etc.) to reserve groups of dependent resources. There is a significant protocol exchange defined for ASTs to construct a workable physical resource grid for the application.

What DRAGON has not yet implemented: We have implemented scheduling and policy constraints in the traffic engineering database, but we have not yet implemented the path computation to use those constraints (this will be coming soon). We have atomic reservations, but have not implemented the two-phase commit - though we have long recognized it as critical to the book-ahead capability and a robust integrated resource scheduling process.

Thanks for sticking with me on this ...:-)