WO2007030917A1 - Packet flow bifurcation and analysis - Google Patents

Packet flow bifurcation and analysis

Info

Publication number
WO2007030917A1
Authority
WO
WIPO (PCT)
Prior art keywords
traffic
flow
packets
ingress
blade
Application number
PCT/CA2006/001423
Other languages
French (fr)
Other versions
WO2007030917A8 (en)
Inventor
Jonathan Back
Siegfried Johannes Luft
Original Assignee
Zeugma Systems Canada, Inc.
Application filed by Zeugma Systems Canada, Inc.
Priority to CA2620349A (CA2620349C)
Priority to EP06790602A (EP1932312A4)
Publication of WO2007030917A1
Publication of WO2007030917A8


Classifications

    • H ELECTRICITY
    • H04 ELECTRIC COMMUNICATION TECHNIQUE
    • H04L TRANSMISSION OF DIGITAL INFORMATION, e.g. TELEGRAPHIC COMMUNICATION
    • H04L43/00 Arrangements for monitoring or testing data switching networks
    • H04L47/00 Traffic control in data switching networks
    • H04L47/10 Flow control; Congestion control
    • H04L47/12 Avoiding congestion; Recovering from congestion
    • H04L47/125 Avoiding congestion; Recovering from congestion by balancing the load, e.g. traffic engineering
    • H04L47/20 Traffic policing
    • H04L47/22 Traffic shaping
    • H04L47/24 Traffic characterised by specific attributes, e.g. priority or QoS
    • H04L47/2441 Traffic characterised by specific attributes, e.g. priority or QoS relying on flow classification, e.g. using integrated services [IntServ]
    • H04L47/2475 Traffic characterised by specific attributes, e.g. priority or QoS for supporting traffic characterised by the type of applications
    • H04L49/00 Packet switching elements
    • H04L49/35 Switches specially adapted for specific applications
    • H04L49/355 Application aware switches, e.g. for HTTP
    • H04L49/40 Constructional details, e.g. power supply, mechanical construction or backplane

Definitions

  • the field of invention relates generally to congestion and flow control in converged full-service communication systems, and, more specifically but not exclusively, relates to applying quality of service profiles in real time to groups of traffic flows.
  • a modern metro area network 100 is composed of two types of networks: a core network 102 and one or more access networks 106.
  • the core network 102 communicates data traffic from one or more service providers 104A-104N in order to provide services to one or more subscribers 108A-108M.
  • Services supported by the core network 102 include, but are not limited to, (1) a branded service, such as a Voice over Internet Protocol (VoIP), from a branded service provider; (2) a licensed service, such as Video on Demand (VoD), through a licensed service provider and (3) traditional Internet access through an Internet Service Provider (ISP).
  • VoIP Voice over Internet Protocol
  • ISP Internet Service Provider
  • the core network supports a variety of protocols (Synchronous Optical Networking (SONET), Internet Protocol (IP), Packet over SONET (POS), Dense Wave Division Multiplexing (DWDM), OSPF, BGP, ISIS, etc.) using various types of equipment (core routers, SONET add-drop multiplexers (ADM), DWDM equipment, etc.).
  • SONET Synchronous Optical Networking
  • IP Internet Protocol
  • POS Packet over SONET
  • DWDM Dense Wave Division Multiplexing
  • OSPF Open Shortest Path First
  • BGP Border Gateway Protocol
  • ISIS Intermediate System to Intermediate System
  • the core network communicates data traffic from the service providers 104A-104N to access network(s) 106 across link(s) 112.
  • link(s) 112 may be a single optical, copper or wireless link or may comprise several such optical, copper or wireless link(s).
  • the access network(s) 106 complements the core network 102 by aggregating the data traffic from the subscribers 108A-108M.
  • Access network(s) 106 may support data traffic to and from a variety of types of subscribers 108A-108M (e.g., residential, corporate, mobile, wireless, etc.). Although the access network(s) 106 may not serve every type of subscriber (residential, corporate, mobile, etc.), an access network 106 will comprise at least one subscriber. Typically, access network(s) 106 supports thousands of subscribers 108A-108M. Access network(s) 106 aggregates data traffic from the subscribers over link(s) 112 connecting to the core network 102.
  • Access networks support a variety of protocols (e.g., IP, Asynchronous Transfer Mode (ATM), Frame Relay, Ethernet, Digital Subscriber Line (DSL), Dynamic Host Configuration Protocol (DHCP), Point-to-Point Protocol (PPP), Point-to-Point Protocol over Ethernet (PPPoE), etc.) using various types of equipment (Edge router, Broadband Remote Access Servers (BRAS), Digital Subscriber Line Access Multiplexers (DSLAM), Switches, etc).
  • the access network(s) 106 uses subscriber policy manager(s) 110 to set policies for individual ones and/or groups of subscribers. Policies stored in a subscriber policy manager(s) 110 allow subscribers access to different ones of the service providers 104 A-N. Examples of subscriber policies are bandwidth limitations, traffic flow characteristics, amount of data, allowable services, etc.
  • a data packet (also known as a "packet") is a block of user data with necessary address and administration information attached, usually in a packet header and/or footer, which allows the data network to deliver the data packet to the correct destination.
  • data packets include, but are not limited to, IP packets, ATM cells, Ethernet frames, SONET frames and Frame Relay packets.
  • data packets having similar characteristics are transmitted in a flow at a transmission rate. The transmission rate is determined by the packet size and the transmission gap (or "inter-packet gap") between each packet. In addition, the transmission rate of data packets is dependent on the capacity of the network connection and the processor capability of the transmitting device.
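  • A minimal Go sketch of the rate relationship described above, assuming a simple model in which a packet's serialization time (its size divided by the line rate) plus the inter-packet gap determines the effective flow rate; the function name and the example numbers are illustrative assumptions.

      package main

      import "fmt"

      // effectiveRateBps estimates the rate of a flow whose packets are sent
      // back-to-back with a fixed inter-packet gap. The serialization time of a
      // packet is its size divided by the line rate; the gap adds idle time.
      func effectiveRateBps(packetBits, lineRateBps, gapSeconds float64) float64 {
          serialization := packetBits / lineRateBps // seconds on the wire
          return packetBits / (serialization + gapSeconds)
      }

      func main() {
          // 1500-byte packets on a 1 Gbps link with a 12 microsecond gap (assumed values).
          rate := effectiveRateBps(1500*8, 1e9, 12e-6)
          fmt.Printf("effective rate: %.1f Mbps\n", rate/1e6)
      }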
  • Figure 2 represents the Open Systems Interconnect (OSI) model of a layered protocol stack for transmitting data packets 200.
  • OSI Open Systems Interconnect
  • the physical layer (layer 1) 202 is used for the physical signaling.
  • the next layer, data link layer (layer 2) 204 enables transferring of data between network entities.
  • the network layer (layer 3) 206 contains information for transferring variable-length data packets between one or more networks. For example, IP addresses are contained in the network layer 206, which allows network devices (also commonly referred to as network elements) to route the data packet.
  • Layer 4, the transport layer 208, provides transparent data transfer between end users.
  • the session layer (layer 5) 210 provides the mechanism for managing the dialogue between end-user applications.
  • the presentation layer (layer 6) 212 provides independence from differences in data representation (e.g., encryption, data encoding, etc.).
  • the final layer is the application layer (layer 7) 214, which contains the actual data used by the application sending or receiving the packet. While most protocol stacks do not exactly follow the OSI model, it is commonly used to describe networks.
  • bandwidth sensitive services such as VoIP or VoD
  • QoS Quality of Service
  • link(s) 112 can get overloaded and not provide enough bandwidth for these bandwidth sensitive services. Subsequently, the quality of these services degrades or becomes interrupted altogether.
  • QoS allocates different bandwidth rates to different types of data traffic. For example, QoS can be set up to allocate a bandwidth of 20 Mbps for VoIP service over link(s) 112.
  • QoS shapes the data traffic by re-transmitting the data traffic at a constant rate.
  • both the core and access networks must be set up to support the desired QoS policy.
  • a traffic shaper is a device that classifies a packet by deep packet inspection and transmits the packet based on pre-determined subscriber policies.
  • deep packet inspection examines the data contained in layers up to and including application layer 214 of each data packet 200 to determine what quality of service should be used for the packet. For example and by way of illustration, deep packet inspection matches the structure of the application layer data with potentially hundreds of known application data types. This allows a traffic shaper to finely tune the quality of service enforced. For instance, a traffic shaper may identify control packets for an adaptable video conferencing protocol to configure the network for an optimal video conferencing rate.
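  • A minimal Go sketch of the kind of payload matching deep packet inspection performs, assuming a small, invented table of application-layer signatures; the signature bytes and application names are illustrative assumptions, and a real traffic shaper matches far richer structure and keeps per-flow state.

      package main

      import (
          "bytes"
          "fmt"
      )

      // signatures maps a payload prefix to an application type. This only
      // checks the first bytes of the application-layer payload; the entries
      // here are invented for illustration.
      var signatures = map[string][]byte{
          "SIP":  []byte("INVITE sip:"), // illustrative VoIP control signature
          "HTTP": []byte("GET "),
          "RTSP": []byte("DESCRIBE rtsp://"),
      }

      func classifyPayload(payload []byte) string {
          for app, sig := range signatures {
              if bytes.HasPrefix(payload, sig) {
                  return app
              }
          }
          return "unclassified"
      }

      func main() {
          fmt.Println(classifyPayload([]byte("INVITE sip:alice@example.com SIP/2.0")))
          fmt.Println(classifyPayload([]byte{0x17, 0x03, 0x03})) // unknown binary payload
      }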
  • While existing traffic shapers are subscriber aware, these traffic shapers only enforce pre-determined subscriber policies. That is, subscriber policies are set by the operator of the traffic shaper and do not change until the operator modifies them. This does not allow subscriber policies to change in real time based on existing network conditions. Furthermore, existing traffic shapers cannot handle the high volume of data traffic that crosses the core 102 and access 106 networks.
  • flow switches are network devices that transmit data packets in connected flows, instead of discrete packets.
  • Flow switches operate on groups of similar packets to provide QoS for an application.
  • flow switches have limited data traffic processing capability, are not subscriber aware, perform limited or no deep packet inspection, and cannot update subscriber policies in real-time.
  • Embodiments described herein provide optimum matching of traffic profiles with individual traffic flows. This matching of optimum traffic profiles occurs in real time without any static provisioning linking the flow with the profile.
  • the matching is performed by the bifurcation and duplication of packets that make up a flow to each of egress traffic and computation processor resources, such that egress traffic operations and traffic analysis operations may be performed concurrently on the egress traffic and computation processor resources without introducing jitter or delay in either bifurcated processing path.
  • the traffic analysis includes maintaining flow statistics, flow stateful information and classifying the flow as a particular application traffic type. The optimum traffic profile for this application traffic type is then selected and applied to the individual flow.
  • an implementation environment comprising an apparatus for performing the method.
  • the apparatus includes a chassis populated with multiple traffic and compute blades.
  • the various blades are enabled to communicate with one another using a backplane mesh interconnect provided by the chassis under management of backplane fabric switches hosted by each blade.
  • Distributed software components are also provided for facilitating the method via execution on associated processing elements on the traffic and compute blades.
  • Figure 2 is a block diagram illustrating layers of the OSI protocol stack;
  • Figure 3 illustrates an exemplary network configuration using a traffic shaping service node in a metro area network, according to one embodiment of the invention;
  • Figure 4 is a diagram illustrating sets of Ingress Traffic Processors, Computation Processors, and Egress Traffic Processors used to implement aspects of the invention;
  • Figure 5 is a diagram of a feed-forward control loop illustrating aspects of the invention employed via bifurcation of packet flows;
  • Figure 6 is a schematic diagram illustrating the communication interconnect between a Traffic Blade and a Compute Blade;
  • Figure 7 is a schematic diagram illustrating one embodiment of a Compute Blade that is provisioned for an OAMP function;
  • Figure 8 is a schematic diagram illustrating one embodiment of a Traffic Blade;
  • Figure 9 is a schematic diagram illustrating one configuration of a service node implemented via an ATCA chassis;
  • Figure 10 is a schematic diagram illustrating details of the inter-blade communication scheme, according to one embodiment of the invention;
  • Figure 11 is a schematic diagram illustrating a service node implementation environment including a local instance of a global arbitrator on each blade;
  • Figure 12 is a schematic diagram illustrating various components associated with a Service Management Engine (SME);
  • SME Service Management Engine
  • Figure 13 is a schematic diagram illustrating details of the Bandwidth Management Component;
  • Figure 14 is a schematic diagram illustrating details of the Services Management Component;
  • Figure 15 is a schematic diagram illustrating details of the Application Scripting Component;
  • Figure 16 is a flowchart illustrating operations employed to provision compute resources for an associated subscriber flow;
  • Figure 17 is a schematic diagram of an exemplary execution environment for a service node used in connection with the packet processing operations of Figure 18;
  • Figure 18 is a flowchart illustrating operations performed in connection with processing a packet flow, according to one embodiment of the invention.
  • references in the specification to "one embodiment”, “an embodiment”, “an example embodiment”, etc., indicate that the embodiment described may include a particular feature, structure, or characteristic, but every embodiment may not necessarily include the particular feature, structure, or characteristic. Moreover, such phrases are not necessarily referring to the same embodiment. Further, when a particular feature, structure, or characteristic is described in connection with an embodiment, it is submitted that it is within the knowledge of one skilled in the art to effect such feature, structure, or characteristic in connection with other embodiments whether or not explicitly described.
  • Coupled may mean that two or more elements are in direct physical or electrical contact. However, “coupled” may also mean that two or more elements are not in direct contact with each other, but yet still co-operate or interact with each other.
  • FIG 3 illustrates an exemplary network configuration using a traffic shaping service node 302 in a metro area network according to one embodiment of the invention.
  • traffic shaping service node 302 is communicatively coupled between the core 102 and access 106 networks. While one embodiment is described under which the traffic shaping service node may shape traffic traveling in either direction, alternative embodiments may shape traffic in only one direction (e.g., the service provider data traffic coming from the core network 102).
  • Traffic shaping, a form of QoS, is the process of regulating and smoothing the flow of network data traffic within a computer network. Restricting the bandwidth of the traffic flow is one way to regulate data traffic.
  • the traffic shaping service node 302 uses the subscriber policies contained in subscriber policy manager(s) 110 for instruction on how to shape the data traffic from service providers 104A-104N and/or subscribers 108A-108M accordingly. Further details of various elements of embodiments of traffic shaping service nodes are discussed below.

Packet Flow Bifurcation and Analysis
  • techniques are provided to enable optimum matching of a traffic profile with an individual traffic flow.
  • This matching of the optimum traffic profile occurs in real time without any static provisioning linking the flow with the profile.
  • the matching is achieved by the bifurcation and duplication of all packets (i.e., packet data) that make up a flow to a computation resource that will analyze the packets, maintain flow statistics, flow stateful information and classify the flow as a particular application traffic type.
  • the optimum traffic profile for this application traffic type is then selected and applied to the individual flow in real time.
  • a Traffic Flow comprises a set of packets having similar flow classification parameters.
  • a typical ingress operation performed by a layer-4 router or the like is to classify received packets to associated flows using a rule-based packet classification scheme, such as defined by an Access Control List (ACL) database.
  • ACL Access Control List
  • the rules for classifying a message (i.e., one or more associated packets) are called filters, or rules in firewall terminology, and the packet classification problem is to determine the lowest cost matching filter or rule for each incoming message at the network element.
  • the relevant information is contained in N distinct header fields (or partial header fields) in each packet.
  • the relevant fields for an IPv4 packet comprise the Destination Address (32 bits), the Source Address (32 bits), the Destination Port (16 bits), the Source Port (16 bits), and the Protocol Field (8 bits); the set of field values for a given packet is referred to as the 5-Tuple signature.
  • the corresponding filter database consists of a finite set of filters, filt1, filt2, ..., filtN.
  • Each filter is a combination of N values, one for each header field.
  • Each field in a filter is allowed three kinds of matches: exact match, prefix match, or range match, as well as wildcard.
  • In an exact match, the header field of the packet should exactly match the filter field.
  • In a prefix match, the filter field should be a prefix of the header field.
  • In a range match, the header values should lie in the range specified by the filter.
  • Each filter filt has an associated directive disp, which specifies how to process a packet matching the filter.
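  • A minimal Go sketch of the filter model just described, assuming prefix matches on addresses, range matches on ports (a full range acting as a wildcard), an exact-or-wildcard match on the protocol, and selection of the lowest cost matching filter's directive; the struct layout, costs, and directive strings are illustrative assumptions.

      package main

      import "fmt"

      // fiveTuple is the classification signature described above.
      type fiveTuple struct {
          srcIP, dstIP     uint32
          srcPort, dstPort uint16
          proto            uint8
      }

      // portRange implements a range match; 0-65535 acts as a wildcard.
      type portRange struct{ lo, hi uint16 }

      func (r portRange) match(p uint16) bool { return p >= r.lo && p <= r.hi }

      // filter combines a prefix match on each address, a range match on each
      // port, and an exact-or-wildcard match on the protocol. cost orders
      // overlapping filters; the lowest matching cost wins.
      type filter struct {
          srcIP, dstIP       uint32
          srcLen, dstLen     uint8 // prefix lengths, 0 = wildcard
          srcPorts, dstPorts portRange
          proto              uint8
          protoWild          bool
          cost               int
          disp               string // directive applied to matching packets
      }

      func prefixMatch(addr, pfx uint32, length uint8) bool {
          if length == 0 {
              return true
          }
          mask := ^uint32(0) << (32 - length)
          return addr&mask == pfx&mask
      }

      func (f filter) match(t fiveTuple) bool {
          return prefixMatch(t.srcIP, f.srcIP, f.srcLen) &&
              prefixMatch(t.dstIP, f.dstIP, f.dstLen) &&
              f.srcPorts.match(t.srcPort) && f.dstPorts.match(t.dstPort) &&
              (f.protoWild || f.proto == t.proto)
      }

      // classify returns the directive of the lowest-cost matching filter.
      func classify(filters []filter, t fiveTuple) string {
          best, bestCost := "default", int(^uint(0)>>1)
          for _, f := range filters {
              if f.match(t) && f.cost < bestCost {
                  best, bestCost = f.disp, f.cost
              }
          }
          return best
      }

      func main() {
          anyPort := portRange{0, 65535}
          filters := []filter{
              {srcPorts: anyPort, dstPorts: portRange{5060, 5061}, protoWild: true, cost: 10, disp: "voip-queue"},
              {srcPorts: anyPort, dstPorts: anyPort, protoWild: true, cost: 100, disp: "best-effort"},
          }
          pkt := fiveTuple{srcPort: 40000, dstPort: 5060, proto: 17}
          fmt.Println(classify(filters, pkt)) // voip-queue
      }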
  • An Application Flow is a flow of packets that carries traffic belonging to a specific application, such as VoIP or VoD.
  • the data carried in the packet header e.g., the 5-Tuple signature
  • data in the packet payload is employed for identifying the appropriate application.
  • a technique known as Deep Packet Inspection is required to further analyze the contents of the packet and to keep stateful context of previous packets seen in the flow.
  • Modern network equipment can typically support 5-Tuple classification at line speed rates. However, classification schemes that employ deep packet inspection may not be implemented at line rate speeds in many instances. As a result, separate processing paths, respectively referred to as the fast path and the slow path, are typically employed for performing 5-Tuple classification and deep packet inspection on a given network device.
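  • A minimal Go sketch of the fast path/slow path split described above, assuming a 5-Tuple flow table consulted on the fast path and a placeholder slowPath function standing in for deep packet inspection; the table, function names, and caching of the slow-path result are illustrative assumptions.

      package main

      import "fmt"

      type flowKey struct {
          srcIP, dstIP     uint32
          srcPort, dstPort uint16
          proto            uint8
      }

      type packet struct {
          key     flowKey
          payload []byte
      }

      // fastTable caches the profile decision for flows that have already been
      // classified, so later packets avoid deep inspection.
      var fastTable = map[flowKey]string{}

      // slowPath stands in for the computation resource that performs deep
      // packet inspection; here it just returns a placeholder decision.
      func slowPath(p packet) string {
          return "profile-from-dpi"
      }

      // dispatch models the fast-path/slow-path split: a hit in the 5-Tuple
      // table is handled at line rate, a miss is sent for deeper analysis and
      // the result is cached for subsequent packets of the flow.
      func dispatch(p packet) string {
          if profile, ok := fastTable[p.key]; ok {
              return profile // fast path
          }
          profile := slowPath(p) // slow path
          fastTable[p.key] = profile
          return profile
      }

      func main() {
          p := packet{key: flowKey{dstPort: 80, proto: 6}, payload: []byte("GET /")}
          fmt.Println(dispatch(p)) // slow path on the first packet
          fmt.Println(dispatch(p)) // fast path afterwards
      }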
  • an apparatus via which the techniques may be implemented includes various processor resources, including multiple ingress traffic processors 400 (depicted as i ingress traffic processors 400_1-i), computation processors 402 (depicted as j computation processors 402_1-j), and egress traffic processors 404 (depicted as k egress traffic processors 404_1-k).
  • the particular values for i, j, and k may be the same or may differ, depending on the particular implementation environment and requirements.
  • the various ingress traffic processors 400_1-i, computation processors 402_1-j, and egress traffic processors 404_1-k are interconnected in a mesh fashion such that any processor may communicate with any other processor.
  • the communication rates amongst the various processors are equal. In other embodiments, the communication rates may differ.
  • the traffic processors perform 5-Tuple classification of flows, while the computation processors perform deep packet inspection of the flows and maintain stateful analysis data pertaining to each flow.
  • an Ingress Traffic Processor is termed TP_i
  • an Egress Traffic Processor is termed TP_e
  • the ingress flow is termed F_i
  • the egress flow is termed F_e.
  • Packets enter the apparatus at the TP_i.
  • the TP_i identifies a new incoming flow and makes a routing decision R(F_i) regarding the destination TP_e. Packets from the identified flow shall henceforth be sent to the TP_e and exit the apparatus; packets thus flow from TP_i to TP_e.
  • R routing decision
  • F_i is bifurcated, and its packets duplicated, to provide an equivalent and simultaneous flow that is sent to one of the many Computation Processors (CP); which CP is selected is decided programmatically and is a function S(F_i) of the flow F_i, as schematically depicted by the feed-forward control loop of Figure 5.
  • the bifurcated flow is termed F_b.
  • the CP shall be notified of the creation of a new flow, along with the 5-Tuple that accompanies the flow, and it shall expect reception of the flow F_b.
  • the CP shall route the F_b to specific software Analysis Entities that are capable of analyzing a traffic flow associated with the specific 5-Tuple.
  • This routing to specific analysis entities enables any state or analytical information that has been extracted from the flow to be preserved, meaning that the analysis process has access to past state information and can alter its analysis based upon this information.
  • Because the packet flow is bifurcated and duplicated, no additional latency or jitter is introduced into either traffic flow F_i or F_e.
  • the Analysis Engine shall be working on packet P_t, producing a result R(P_t) that is sent to TP_e.
  • the arrival of the result R(P_t) at the TP_e will have a certain time delay introduced in relation to the packet P_t that arrived at TP_e as part of the original (non-bifurcated) flow F_e, such that, as R(P_t) arrives at TP_e, the current packet being processed by TP_e shall be P_(t+delta).
  • the result R(P_t) is directly used to dynamically modify or adjust the exact traffic profile PR_Fe being applied to the packets forming the flow F_e; as such, the traffic profile selected and applied to a packet P_(t+delta) shall be based upon the analysis of packet P_t.
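  • A minimal Go sketch of this feed-forward arrangement, assuming a goroutine standing in for the Computation Processor and a buffered channel carrying the bifurcated copies: packets are forwarded immediately with whatever profile result has arrived so far, so the result of analyzing P_t shapes a later packet P_(t+delta); the channel sizes and names are illustrative assumptions.

      package main

      import (
          "fmt"
          "sync"
      )

      type packet struct{ seq int }

      // analyze stands in for a Computation Processor: it consumes the
      // bifurcated copy of the flow and publishes a profile result per packet.
      func analyze(bifurcated <-chan packet, results chan<- string, wg *sync.WaitGroup) {
          defer wg.Done()
          for p := range bifurcated {
              results <- fmt.Sprintf("profile-after-P%d", p.seq)
          }
          close(results)
      }

      func main() {
          bifurcated := make(chan packet, 16)
          results := make(chan string, 16)

          var wg sync.WaitGroup
          wg.Add(1)
          go analyze(bifurcated, results, &wg)

          // Egress side: forward packets immediately, applying the most recent
          // analysis result that has arrived; R(P_t) therefore shapes P_(t+delta).
          profile := "default-profile"
          for t := 0; t < 5; t++ {
              p := packet{seq: t}
              bifurcated <- p // duplicate to the analysis path
              select {
              case r := <-results:
                  profile = r // feed-forward update from an earlier packet
              default:
                  // no result yet; keep the current profile so no delay is added
              }
              fmt.Printf("P%d forwarded with %s\n", t, profile)
          }
          close(bifurcated)
          wg.Wait()
      }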
  • the apparatus has the ability to view traffic in terms of Traffic Aggregates. Having access to these Traffic Aggregates and the statistics associated with them allows the Analysis Engines to select a traffic profile different from that which would result from isolated analysis of the traffic flow.
  • services provided over the network will consist of multiple flows, each with their own 5-Tuple signature.
  • the Analysis Engines shall have the ability to draw together and group the information regarding these multiple flows and treat the grouping as a Service Flow.
  • the Analysis Engines have the ability to track the state of a Service Flow and dynamically alter the results being sent to the TP_i and TP_e that service the flows that make up the Service Flow.
  • Figures 4, 6-15, and 17 illustrate exemplary network element architectures that may be used for a variety of purposes, including but not limited to, a traffic shaping service node as previously described.
  • While Figures 4, 6-15, and 17 are described with reference to a traffic shaping service node, it should be understood that these architectures are independent aspects of the invention.
  • the aforementioned functions are facilitated by various processing and storage resources hosted by associated line cards and the like, which are mounted in a common chassis.
  • a Service Node can be decomposed into three entities, Traffic Blades (TB) 600, Compute Blades (CB) 602 and the chassis 604.
  • a TB 600 can be further reduced to its physical and link layer portions 606 and 608, network layer components 610, and infrastructure components 612.
  • a CB 602 provides Service Layer termination 612 and infrastructure components 614.
  • a CB can be further re-defined to be an OAMP Blade based on its slot index (within chassis 604).
  • OAMP blades are a functional superset of CBs, adding operations, administration, maintenance and provisioning functionality (collectively referred to as OAMP card function or OAMP CF).
  • chassis 604 comprises an Advanced Telecommunication and Computing Architecture (ATCA or AdvancedTCA®) chassis.
  • the ATCA chassis provides physical connectivity between the blades via a passive backplane 616, including a full-mesh interconnect 618.
  • the ATCA environment depicted herein is merely illustrative of one modular board environment in which the principles and teachings of the embodiments of the invention described herein may be applied. In general, similar configurations may be deployed for other standardized and proprietary board environments, including but not limited to blade server environments.
  • the ATCA 3.0 base specification (approved December 30, 2002), which is being carried out by the PCI Industrial Computer Manufacturers Group (PICMG), defines the physical and electrical characteristics of an off-the-shelf, modular chassis based on switch fabric connections between hot-swappable blades. (As used herein, the terms “board,” “blade,” and “card,” are interchangeable.)
  • This specification defines the frame (rack) and shelf (chassis) form factors, core backplane fabric connectivity, power, cooling, management interfaces, and the electromechanical specification of the ATCA-compliant boards.
  • the electromechanical specification is based on the existing IEC60297 EuroCard form factor, and enables equipment from different vendors to be incorporated in a modular fashion with guaranteed interoperability.
  • the ATCA 3.0 base specification also defines a power budget of 200 Watts (W) per board, enabling high performance servers with multi-processor architectures and multi-gigabytes of on-board memory.
  • W Watts
  • mating connectors on the boards and backplane are employed for coupling input/output (I/O) signals.
  • I/O input/output
  • Advanced Switching (AS) employs a serial communication channel operating at Gigahertz+ frequencies.
  • ATCA boards may also provide one or more I/O ports on their front panels, enabling an ATCA board to be coupled to other network resources.
  • An exemplary architecture 700 for a compute blade 602 is shown in Figure 7.
  • a single compute blade (physical) architecture is employed for both Compute Blades and OAMP CFs. More particularly, under architecture 700, a corresponding blade may be deployed to support both Compute Blade and OAMP functionality.
  • Compute Blade 602 employs four multiple-processor compute nodes 702_1-4.
  • each of compute nodes 702_1-4 functions as multiple processor resources, with each processor resource being associated with a logical processor. Accordingly, such processor resources may be implemented using separate processors, or processor chips employing multiple processor cores.
  • each of compute nodes 702_1-4 is implemented via an associated symmetric multi-core processor. Exemplary multi-core processors that may be implemented include, but are not limited to, Broadcom 1480 and 1280 devices.
  • Each of the compute nodes 702_1-4 is enabled to communicate with other compute nodes via an appropriate interface (e.g., bus or serial-based interfaces).
  • this interface comprises a "Hyper Transport" (HT) interface.
  • HT Hyper Transport
  • Other native (standard or proprietary) interfaces between processors may also be employed.
  • each of compute nodes 702_1-4 is allocated various memory resources, including respective RAM 704_1-4.
  • each of compute nodes 702_1-4 may also be allocated an external cache 706_1-4, or may provide one or more levels of cache on-chip.
  • the RAM comprises ECC (Error Correction Code) RAM.
  • each compute node employs a NUMA (Non-Uniform Memory Access) cache coherency scheme.
  • NUMA Non-Uniform Memory Access
  • Other cache coherency schemes such as MESI (Modified, Exclusive, Shared, Invalidated), may also be implemented for other embodiments.
  • Each Compute Blade 602 includes a means for interfacing with ATCA mesh interconnect 618. In the illustrated embodiment of Figure 7, this is facilitated by a Backplane Fabric Switch 708. Meanwhile, a field programmable gate array (FPGA) 710 containing appropriate programmed logic is used as an intermediary component to enable each of compute nodes 702_1-4 to access backplane fabric switch 708 using native interfaces for each of the compute nodes and the fabric switch.
  • the interface between each of compute nodes 702_1-4 and the FPGA 710 comprises an SPI (System Packet Interface) 4.2 interface
  • the interface between the FPGA and backplane fabric switch 708 comprises a Broadcom HiGig™ interface. It is noted that these interfaces are merely exemplary, and that other interfaces may be employed depending on the native interfaces of the various blade components.
  • the compute node associated with the OAMP function (depicted in Figure 7 as Compute Node #1) is provided with local SRAM 712 and a non-volatile store (depicted as Compact flash 714).
  • the non-volatile store is used to store persistent data used for the OAMP function, such as provisioning information and logs.
  • each compute node is provided with local RAM and a local cache, as depicted in Figure 11.
  • When compute blade 602 is provisioned as an OAMP blade, in one configuration (as shown), one of the compute nodes is employed for performing OAMP functions (e.g., compute node 702_1), while the other three compute nodes (e.g., compute nodes 702_2-4) perform normal compute functions associated with compute blades, as described in further detail below.
  • When compute blade 602 is provisioned as a compute blade, each of compute nodes 702_1-4 is available for performing the compute functions described herein.
  • FIG. 8 shows an exemplary architecture 800 for a traffic blade 600.
  • Architecture 800 includes a PHY block 802, an Ethernet MAC block 804, a network processor unit (NPU) 806, a host processor 808, a SERDES interface 810, an FPGA 812, a backplane fabric switch 814, RAM 816 and 818 and cache 819.
  • the traffic blade further includes one or more I/O ports 820, which are operatively coupled to PHY block 802. Depending on the particular use, the number of I/O ports may vary from 1 to N ports. For example, under one traffic blade type a 10 x 1 Gigabit Ethernet (GigE) port configuration is provided, while for another type a 1 x 10 GigE port configuration is provided. Other port number and speed combinations may also be employed.
  • GigE Gigabit Ethernet
  • PHY block 802 and Ethernet MAC block 804 respectively perform layer 1 (Physical) and layer 2 (Data Link) functions, which are well-known in the art. In general, the PHY and Ethernet MAC functions may be implemented in hardware via separate components or a single component, or may be implemented in a combination of hardware and software via an embedded processor or the like.
  • One of the operations performed by a traffic blade is packet identification/classification. As discussed above, a multi-level classification hierarchy scheme is implemented for this purpose. Typically, a first level of classification, such as the aforementioned 5-Tuple signature classification scheme, is performed by the traffic blade's NPU 806.
  • Additional classification operations in the classification hierarchy may be required to fully classify a packet (e.g., identify an application flow type) in the manner discussed above. In general, these higher-level classification operations may be performed by the traffic blade's host processor 808 and/or a processor on a compute blade, depending on the particular classification.
  • NPU 806 includes various interfaces for communicating with other board components. These include an Ethernet MAC interface, a memory controller (not shown) to access RAM 816, Ethernet and PCI interfaces to communicate with host processor 808, and an XGMII interface.
  • SERDES interface 810 provides the interface between XGMII interface signals and HiGig signals, thus enabling NPU 806 to communicate with backplane fabric switch 814.
  • NPU 806 may also provide additional interfaces to interface with other components, such as an SRAM (Static Random Access Memory) interface unit to interface with off-chip SRAM (both not shown).
  • SRAM Static Random Access Memory
  • host processor 808 includes various interfaces for communicating with other board components. These include the aforementioned Ethernet and PCI interfaces to communicate with NPU 806, a memory controller (on-chip or off-chip - not shown) to access RAM 818, and a pair of SPI 4.2 interfaces. FPGA 812 is employed as an interface between the SPI 4.2 interface signals and the HiGig interface signals.
  • NPUs are designed for performing particular tasks in a very efficient manner. These tasks include packet forwarding and packet classification, among other tasks related to packet processing.
  • NPU 806 executes corresponding NPU software 822. This software is shown in dashed outline to indicate that the software may be stored (persist) on a given traffic blade (e.g., in a flash device or the like), or may be downloaded from an external (to the traffic blade) store during initialization operations, as described below.
  • NPU software 822 is loaded into internal SRAM 823 provided by NPU 806.
  • Host processor 808 is employed for various purposes, including lower-level (in the hierarchy) packet classification, gathering and correlation of flow statistics, and application of traffic profiles. Host processor 808 may also be employed for other purposes. In general, host processor 808 will comprise a general-purpose processor or the like, and may include one or more compute cores (as illustrated, in one embodiment a two-core processor is used). As with NPU 806, the functionality performed by the host processor is effected via execution of corresponding software (e.g., machine code and/or virtual machine byte code), which is depicted as host software 824. As before, this software may already reside on a traffic blade, or be loaded during blade initialization.
  • host processor 808 is responsible for initializing and configuring NPU 806. Under one initialization scheme, host processor 808 performs network booting via the DHCP (or BOOTP) protocol. During the network boot process, an operating system is loaded into RAM 818 and is booted. The host processor then configures and initializes NPU 806 via the PCI interface. Once initialized, NPU 806 may execute NPU software 822 on a run-time basis, without the need or use of an operating system.
  • DHCP or BOOTP
  • FIG. 9 is a schematic diagram illustrating the cross-connectivity provided by the ATCA backplane mesh interconnect used in one embodiment of the Service Node.
  • an ATCA chassis 604 is fully populated with 14 ATCA blades, with each blade installed in a respective chassis slot - in an actual implementation, the chassis may be populated with fewer blades or may include other types of blades in addition to compute and traffic blades.
  • the illustrated configuration includes four compute blades 602_1-4, and 10 traffic blades 600_1-10, with one of the compute blades being provisioned to provide OAMP functions.
  • each blade is communicatively-coupled with every other blade under the control of fabric switching operations performed by each blade's fabric switch.
  • mesh interconnect 618 provides a 10 Gbps connection between each pair of blades, with an aggregate bandwidth of 280 Gbps.
  • Inter-slot connectivity is enabled through a full mesh interconnect network comprising the ATCA passive backplane and a multi-port 10 Gbps switch fabric device integrated on every blade.
  • a 16-port 10 Gbps HiGig switch (Broadcom proposed model BCM56700) is implemented for each of backplane fabric switches 708 and 814.
  • a pair of 8-port switch fabric devices (e.g., Broadcom BCM5675) may be employed in place of the 16-port device shown in Figure 10.
  • Each fabric switch device maintains chassis facing ports and local facing HiGig ports.
  • the switch fabric is a distributed shared memory switch architecture with output port buffering.
  • the fabric is implemented as an insertion ring.
  • Each port provides 128K bytes of egress packet buffering (principal queue point) and 19K bytes of ingress packet buffering (sufficient for address resolution and ring insertion delay) for a 9K byte jumbo frame.
  • the switch fabric architecture of Figure 10 utilizes the fabric to create a partial mesh. That is, full "any-to-any" port forwarding is not required. Specifically, the fabric only needs to forward from local to chassis facing ports (and vice versa) and forwarding between chassis facing ports is not required.
  • Another aspect of the invention relates to scalability.
  • the service node is implemented using a distributed architecture, wherein various processor and memory resources are distributed across multiple blades. To scale a system, one simply adds another blade. The system is further enabled to dynamically allocate processor tasks, and to automatically perform fail-over operations in response to a blade failure or the like.
  • Blades may be hot-swapped without taking the system down, thus supporting dynamic scaling.
  • Yet another aspect of the invention relates to dynamic allocation of system resources, such as compute resources, queues, etc.
  • compute resources and usage are monitored on an ongoing basis, and real-time statistics and the like are maintained using a distributed database scheme and a local (to each blade) agent, such that each blade has a "global" view of the entire system's resource availability and consumption.
  • Figure 11 shows an OAMP Blade 601, multiple Compute Blades 602, and multiple Traffic Blades 600.
  • Each of these blades runs an instance of a software agent or the like (e.g., service, daemon, etc.) referred to as the "Global Arbitrator" 1100.
  • For example, one of the compute nodes on a Compute Blade will be provisioned to run a Global Arbitrator, while the host processor on a Traffic Blade will be used for a similar function.
  • Each of the Global Arbitrator instances is enabled to communicate with one another using a common protocol running over the aforementioned communication signal infrastructure. For example, in one embodiment, Global Arbitrators exchange IP packets to communicate with one another. Other protocols may be implemented in a similar manner.
  • the Global Arbitrators 1100 are used to perform various tasks.
  • Some task examples include: Distributed Scheduling, Process Management, Group Management, Inter-Process Communication (IPC), and Remote Procedure Calls (RPC).
  • each Global Arbitrator will maintain local and global resource information for the system. In this case, the Distributed Scheduler module for each Global Arbitrator instance will identify the resources on its host blade (e.g., disk, memory, processing, networking, and subscribers), and then propagate this information to the other Global Arbitrators, as illustrated in Figure 11.
  • the Distributed Schedulers will also be responsible for load balancing resources across blades by tracking local resource loading and propagating this information to other Distributed Schedulers, such that each Global Arbitrator has access to a "world view" of available resources and resource consumption.
  • the Global Arbitrators will maintain a distributed database 1102 having a local instance 1102 A stored on each blade.
  • the distributed database 1102 which functions as a virtual database, will be maintained in such a manner that if a blade fails, no information will be lost.
  • Each local database instance will store both local and global data.
  • subscribers are associated with groups (based on their Subscriber Profile, such as QoS requirements and other contracted service parameters), while groups, in turn, are associated with resource groups.
  • groups may be associated with resources on a blade basis (e.g., Group 1 is associated with Traffic Blades 1, 2, 6, and 7), or with further granularity, such as at the individual resource level (e.g., each compute node comprises an individual resource that may be allocated to a given group).
  • One of the purposes of the Group-to-subscriber mapping pertains to resource allocation.
  • the system will attempt to maintain load balancing by allocating resources such that the resources are consumed at a similar level across the infrastructure.
  • the subscriber-to-group allocation enables subscribers that have contracted for different levels of services to be allocated a corresponding level of (access to) resources. For example, some groups may be allocated more resources on a per-subscriber basis such that service flows corresponding to subscribers that have purchased a higher level of service will be allocated relatively more resources than service flows corresponding to lower QoS subscribers.
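  • A minimal Go sketch of group-based allocation in this spirit, assuming each group owns a set of compute resources and a subscriber is placed on the least-loaded resource in its group, with higher service levels consuming a larger per-subscriber weight; the group names, weights, and load metric are illustrative assumptions.

      package main

      import "fmt"

      // node represents a compute resource with a current load figure.
      type node struct {
          name string
          load float64 // fraction of capacity in use
      }

      // groups maps a subscriber group to the resources allocated to it, in the
      // spirit of "Group 1 is associated with Traffic Blades 1, 2, 6, and 7".
      var groups = map[string][]*node{
          "premium":    {{name: "cb1"}, {name: "cb2"}},
          "besteffort": {{name: "cb3"}},
      }

      // weight models the idea that higher service levels consume relatively
      // more of a node's budget per subscriber (values are invented).
      var weight = map[string]float64{"premium": 2.0, "besteffort": 1.0}

      // assign places a subscriber on the least-loaded node of its group and
      // records the weighted consumption, keeping usage balanced across nodes.
      func assign(group string) *node {
          var best *node
          for _, n := range groups[group] {
              if best == nil || n.load < best.load {
                  best = n
              }
          }
          if best != nil {
              best.load += 0.01 * weight[group]
          }
          return best
      }

      func main() {
          for i := 0; i < 3; i++ {
              n := assign("premium")
              fmt.Printf("premium subscriber %d -> %s (load now %.2f)\n", i, n.name, n.load)
          }
      }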
  • SME Service Management Engine
  • the SME provides key added value over and above that of a traditional network element such as a BRAS and Edge-Router.
  • a traditional BRAS/Edge-Router is able to manage subscribers' traffic on a per session basis.
  • a service node equipped with the SME provides visibility into subscriber sessions, and enables traffic management on a per-application level. In addition, it is able to provide customized, network-based, and subscriber-aware application services.
  • the SME provides these functionalities through flow classification, deep packet inspection, flow level traffic management, and application service scripting.
  • As subscriber traffic enters a service node, it is separated into flows.
  • the flows are classified by their application-level protocols with the aid of deep packet inspection.
  • Some of these flows are traffic-managed according to the authorization of the subscriber to which they belong. This management typically includes policing, shaping and prioritization on a per flow basis.
  • Other flows are bifurcated or diverted to application service scripts that implement various customized services.
  • the SME builds on the subscriber management functions of a BRAS, and extends traffic management to a per-subscriber/per- application level. It also exposes some of these functions in a generic state machine so that customized applications may be built on top of these.
  • the service management engine software is the top-most functional layer in the system. It uses features provided by the lower layers; it guarantees a certain level of quality of service for services and applications under management; and it provides visibility into the traffic stream for the network operators.
  • the Service Management Engine is the runtime layer of Services Management Software Environment. It may be divided into three major functional areas: Bandwidth Management, Service Management, and Application Scripting.
  • the Bandwidth Management area is concerned with per-subscriber/per-service traffic management; the Service Management area is concerned with classifying flows and following protocol negotiations; and the Application Scripting area is concerned with providing capability to script custom network-based application services.
  • Figure 12 shows the major components of the SME, and their interactions. Some components are shown in more detail, while external components such as subscriber management, traffic management, and the global arbitrator are not shown for clarity. Figure 12 also does not show the internal layering of the components.
  • the SME consists of three major abstract components: a Bandwidth Management Component (BMC) 1201, a Services Management Component (SMC) 1202, and an Application Scripting Component (ASC) 1204.
  • BMC Bandwidth Management component
  • SMC Services Management component
  • ASC Application Scripting Component
  • the BMC 1201 is responsible for tracking per-subscriber usage statistics and traffic authorization and admission.
  • the SMC 1202 is responsible for classification of flows, dissection of packets, and correlation of flows to services.
  • the ASC 1204 runs sandboxes in which scripts may be executed.
  • the Bandwidth Management Component 1201 depends on a Subscriber Management Subsystem (SMS) (which provides subscriber information 1206), a Statistics Engine 1208, as well as a Traffic Management Engine (TME) 1210 for operation.
  • SMS Subscriber Management Subsystem
  • the BMC receives correlated traffic and flow statistics on a per-subscriber and per-port/circuit basis from Statistics Engine 1208. It runs a per-subscriber state machine that keeps track of subscriber authorization, bandwidth consumption, and service utilization. It also receives service classification information from Service Management Component 1202, and computes traffic management policies on a per-flow basis. These policies are then sent to the TME 1210 for execution.
  • BMC 1201 includes a Bandwidth Controller 1212, and a Traffic Monitor 1214
  • the Services Management Component 1202 supplies the protocol and service classification information to Bandwidth Management Component 1201. It receives pre-classified packet flows that are bifurcated from the ingress traffic blade; it classifies each flow by its application-level protocol; it dissects packets from flows of interest in order to get application-level messages; finally, it correlates flows to services, and sends the service classification and traffic specification of flows to the BMC and other interested listeners. These operations are facilitated by a Flow Classifier 1216 and a Service Classifier 1218.
  • the Application Scripting Component 1204 implements a sandbox where "application scripts" may be executed in virtual machines.
  • This component provides an API and an execution environment similar to what is available to the Bandwidth Management Component. In addition, Service Definition Scripts may direct specific messages to a particular application script. Application Scripts may implement custom application state machines, or security and traffic management policies. Each script has its dedicated environment.
  • each subscriber is provisioned with a list of services; and each service is tagged with a service class: Best Effort, Managed, or Preferred.
  • service class Best Effort, Managed, or Preferred.
  • Profiles of services are provisioned at the management layer of SMC 1202. They are provisioned in the form of Service Definition Scripts.
  • a Service Definition specifies what protocols a service uses, how a service is matched, what values are expected in the service's control protocol negotiations, the traffic profile of the data streams, and the control actions to be taken when this service is detected.
  • These profiles are stored in a service node's persistent file system. The SMC uses these profiles to match flows to services, and obtain their traffic profiles, which are delivered to the BMC 1201.
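  • A minimal Go sketch of the kind of information a Service Definition is described as carrying (protocols used, a match rule, a traffic profile, and a control action); the struct fields, the example VoIP values, and the match function are illustrative assumptions rather than the patent's actual script format.

      package main

      import "fmt"

      // trafficProfile captures the illustrative traffic parameters of a service.
      type trafficProfile struct {
          PeakKbps      int
          SustainedKbps int
          Priority      int
      }

      // serviceDefinition sketches what a Service Definition Script describes.
      type serviceDefinition struct {
          Name      string
          Class     string   // "Best Effort", "Managed", or "Preferred"
          Protocols []string // protocol dissectors this service relies on
          Match     func(msg string) bool
          Profile   trafficProfile
          OnDetect  string // control action, e.g. the policy to apply when detected
      }

      func main() {
          voip := serviceDefinition{
              Name:      "voip-example",
              Class:     "Preferred",
              Protocols: []string{"SIP", "RTP"},
              Match:     func(msg string) bool { return len(msg) >= 6 && msg[:6] == "INVITE" },
              Profile:   trafficProfile{PeakKbps: 128, SustainedKbps: 96, Priority: 1},
              OnDetect:  "apply-voip-profile",
          }
          fmt.Println(voip.Name, voip.Class, voip.Match("INVITE sip:bob@example.com"))
      }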
  • Application Scripts are provisioned at the management layer of ASC 1204. They are stored in a service node's persistent file system, and are loaded into their sandboxes at startup time.
  • the Service Node platform architecture employs a substantial level of parallelism, as provided by multiple Compute and Traffic Blades.
  • External entities such as subscriber provisioning and AAA (Authentication, Authorization, and Accounting), the Statistics Engine 1208, and the Traffic Management Engine 1210 are run in their own processes.
  • the SME spreads itself along component boundaries.
  • the Bandwidth Management Component will have its own process; so will the Service Management Component.
  • the Application Scripting component will have a sandbox process from which all application scripts run.
  • Each compute-node that is not reserved for OAMP functions will have a full set of SME processes, including one BMC process, one SMC process, and one ASC process. Each of these processes is also multithreaded as described below.
  • a compute-node is responsible for a number of subscribers. All processing, analysis and computation done for, or on behalf of, these subscribers are conducted on this compute-node.
  • the Global Arbitrator 1100 has the responsibility to allocate subscribers to compute-nodes when they are authenticated.
  • the Service Management Component offloads some of its tasks, such as IP reassembly and preliminary classification, to a process on the host processor on Traffic Blades 600. This process may run these tasks directly, or act as a proxy for the ASIC/FPGA array.
  • SME has a configuration management (CM) process that implements the management layer functions.
  • CM configuration management
  • the BMC employs two or more "worker threads", each of which is responsible for a (disjoint) subset of subscribers that are assigned to a particular BMC instance.
  • the IPC servicing thread of a BMC will sort messages for different subscribers into different work queues based on their subscriber identification.
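  • A minimal Go sketch of sorting messages into per-subscriber work queues so that each worker thread owns a disjoint subset of subscribers, assuming a hash of the subscriber identification selects the queue; the hash choice, message fields, and queue sizes are illustrative assumptions.

      package main

      import (
          "fmt"
          "hash/fnv"
      )

      type message struct {
          subscriberID string
          body         string
      }

      // queueFor picks a worker queue for a subscriber so that all messages for
      // the same subscriber land on the same worker (a disjoint subset per
      // worker, as described above).
      func queueFor(subscriberID string, queues []chan message) chan message {
          h := fnv.New32a()
          h.Write([]byte(subscriberID))
          return queues[h.Sum32()%uint32(len(queues))]
      }

      func main() {
          const workers = 2
          queues := make([]chan message, workers)
          for i := range queues {
              queues[i] = make(chan message, 8)
          }

          // The IPC servicing role: sort incoming messages by subscriber.
          for _, m := range []message{
              {"sub-1001", "flow stats"},
              {"sub-1002", "service classified"},
              {"sub-1001", "policy update"},
          } {
              queueFor(m.subscriberID, queues) <- m
          }

          // Drain the queues to show the per-subscriber grouping.
          for i, q := range queues {
              close(q)
              for m := range q {
                  fmt.Printf("worker %d handles %s: %s\n", i, m.subscriberID, m.body)
              }
          }
      }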
  • the SMC employs two or more threads that process incoming packets.
  • the division of labor between threads is defined on a per- flow basis.
  • the specific number of worker threads per process will be decided based on processor utilization data after profiling the processes with experimental data load.
  • the ASC employs one master thread, and at least one worker thread per application script.
  • the ASC virtual machines have facilities to allow application scripts to spawn more threads when desired.
  • the SMC receives pre-classified datagrams from the packet processing HAL 1220; these datagrams are packed in IPC messages with extra headers.
  • the worker threads of SMC 1202 will run flow classifiers, packet dissectors, as well as service classifiers on these messages and, in turn, produce "application level messages" for those services identified. These messages are then delivered to BMC 1201.
  • the BMCs worker threads are driven with these messages; they produce traffic management directives for Traffic Management Engine 1210.
  • the ASC 1204 worker threads are driven, similarly, with messages from SMC 1202 and other parts of the system.
  • subscriber authorization is provisioned in the Subscriber Management Subsystem. This information is provisioned statically, either on the node or in an external database. It is retrieved when a subscriber authenticates; and it is made available to the rest of the system software through sharing of the embedded provisioning information database.
  • Each subscriber's authorization data includes a list of names of services, as well as their access class, namely "guaranteed", "managed” or "best effort” in one embodiment.
  • subscribers are assigned to instances of SME dynamically.
  • each compute-node that does not serve the OAMP function will have an instance of SME with all of its components.
  • Each instance is responsible for a number of subscribers.
  • the list of subscribers served by a particular compute- node, or SME instance, is decided by the Global Arbitrator 1100 when subscribers authenticate. The decision is based on available compute resources on all available compute-nodes.
  • the protocol descriptions are provisioned as binary loadable modules that are loaded into SMC 1202.
  • the SMC has descriptions of a number of protocols built-in, while descriptions of extra protocols are loaded as protocol dissector plug- in modules.
  • all protocol definitions including the provisioned ones, are loaded automatically. These definitions may be de-provisioned by user request, provided that no other protocol or service definitions depend on them.
  • the service definitions are provisioned as clear text scripts. These definitions are provisioned at the management layer of SMC 1202, and saved in a persistent file system. Each SMC instance loads the complete list of provisioned service definitions when it starts. These definitions may be de-provisioned at any time.
  • application scripts are provisioned at the management layer of ASC 1204 and stored in the persistent file system.
  • the scripts are loaded into the ASC process by the master thread on each compute-node when the embedded software for that node is started. They may be de-provisioned and unloaded at any time.
  • the SME provisions the Traffic Management Engine dynamically.
  • the SME, specifically the BMC, computes traffic management policies on the fly based on subscribers' authorization and real-time traffic conditions. These policies are sent to the TME for enforcement.
  • One "application" provided by the SME is bandwidth management. This is performed by assigning classes of service to flows based on their classification and the authorization of the subscriber to whom they belong.
  • the SME relies on Subscriber Management Subsystem 1206 for subscriber authorization information, on Statistics Engine 1208 for circuit and flow level statistics, and on Traffic Management Engine 1210 for traffic management policy enforcement.
  • a BMC employs a Traffic Monitor 1214 and a Traffic Controller 1212.
  • the Traffic Monitor interfaces with Statistics Engine 1208 to monitor network traffic. For network interfaces that face the core side, the Traffic Monitor aggregates traffic statistics on a per-port or per-circuit basis. On subscriber-facing interfaces, however, the Traffic Monitor aggregates statistics on a per-destination or per-subscriber basis. Core-side monitoring provides information for admission control when a new service is initiated. Subscriber-side monitoring helps to determine how much bandwidth and throughput any given subscriber is using. Together with their allotted bandwidth and throughput authorization, this information is taken as an input for deciding the class of service a new flow receives. In addition, flow-based statistics on selected flows are monitored for verifying the QoS a particular flow experiences. The Traffic Controller 1212 computes traffic management policies on a per-flow basis.
  • the Traffic Controller's primary focus is to decide the class of service for a given flow and a given subscriber. If a flow is of a guaranteed service, and there is enough bandwidth, given the flow's traffic profile and the available bandwidth of the subscriber's line, then the flow will be admitted to the guaranteed class. Queues for other classes of traffic are adjusted, as necessary, to accommodate this flow. Otherwise, the flow will be added to the best-effort class. An alarm indication may be raised in this situation. Regardless of service classes, policing, shaping, and priority parameters are setup for all flows to protect them from each other, and to protect service level agreements for all subscribers. [00106] The Services Management Component 1202 is responsible for identifying and monitoring services.
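To make the admission logic above concrete, the following minimal sketch decides a class of service for a new flow; the function name, threshold logic, and data shapes are assumptions made for illustration, not part of the described system.

```python
def decide_class_of_service(flow_rate_bps, subscriber_line_bps, subscriber_used_bps,
                            requested_class="guaranteed"):
    """Illustrative admission decision: admit a guaranteed flow only if the
    subscriber's line has enough headroom for its traffic profile; otherwise
    fall back to best effort (and flag that an alarm may be raised)."""
    headroom = subscriber_line_bps - subscriber_used_bps
    if requested_class == "guaranteed" and flow_rate_bps <= headroom:
        return "guaranteed", False                               # admitted, no alarm
    return "best-effort", requested_class == "guaranteed"        # demoted; alarm if demotion


cos, raise_alarm = decide_class_of_service(
    flow_rate_bps=4_000_000,            # flow's traffic profile
    subscriber_line_bps=20_000_000,     # subscriber's line capacity
    subscriber_used_bps=18_000_000)     # bandwidth already in use
# cos == "best-effort", raise_alarm == True
```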
  • the Services Management Component 1202 is responsible for identifying and monitoring services. The SMC receives bifurcated flows of packets from the Packet Processing HAL (Hardware Abstraction Layer) 1220 (PPHAL), classifies the flows according to the protocol they use, dissects the packets into application-level messages, and matches flows to known "services".
  • SMC 1202 sends a notification with flow identification and classification information, as well as the traffic profile, to the BMC instance on the same compute-node.
  • This notification may include a Flow Information Record (FIR) that contains various flow statistics.
  • the Services Management Component can be broken down into two parts: the Flow Classifier 1216 and the Service Classifier 1218.
  • the Flow Classifier, as its name suggests, is responsible for classification of packet flows. It runs a set of Packet Dissection Plug-in modules (Packet Dissectors).
  • a Pre-Classifier in PPHAL 1220 filters out flows that are not of interest to the SMC, and only bifurcates flows belonging to subscribers who are assigned to a particular instance of the SMC to that instance of the SMC.
  • the SMC is responsible for provisioning classification rules to the PPHAL so that only flows that are potentially interesting are bifurcated; the BMC, however, is responsible for notifying the SMC which subscribers are to be monitored by a particular instance of the SMC, and the SMC, in turn, registers those subscribers with the PPHAL.
  • for every packet it encounters, the Flow Classifier performs deep packet inspection by running through its protocol dissectors 1400. Each protocol dissector assigns a percentage probability of a packet being of its associated protocol. At the end, the protocol with the highest probability wins. Some flows, however, may not be classified successfully: all dissectors may report very low probability. Accordingly, unclassified flows will be lumped into an "unclassified" pool. For management purposes, they will only be distinguishable by their flow ID. The classified flows are dissected into application-level messages and passed to the Service Classifier 1218.
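A minimal sketch of the "highest probability wins" scheme just described; the example dissectors and the cutoff below which a flow is treated as unclassified are assumptions made for illustration.

```python
def classify_packet(payload: bytes, dissectors, min_confidence=0.2):
    """Run every protocol dissector over the packet; each returns a probability
    (0.0-1.0) that the packet belongs to its protocol. The highest probability
    wins; if all probabilities are very low, the packet is left unclassified."""
    scores = {name: dissector(payload) for name, dissector in dissectors.items()}
    best_proto, best_score = max(scores.items(), key=lambda kv: kv[1])
    return best_proto if best_score >= min_confidence else "unclassified"


# Toy dissectors (real ones would inspect protocol structure, not just prefixes).
dissectors = {
    "HTTP": lambda p: 0.9 if p.startswith(b"GET ") or p.startswith(b"POST ") else 0.05,
    "SIP":  lambda p: 0.9 if p.startswith(b"INVITE ") else 0.05,
}

print(classify_packet(b"GET /index.html HTTP/1.1", dissectors))   # HTTP
print(classify_packet(b"\x17\x03\x03\x00 unknown payload", dissectors))  # unclassified
```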
  • the Service Classifier correlates classified and dissected packet flows as services. Services are defined by Service Definitions 1402. A Service Definition describes how a service is recognized, its traffic profile, and what actions to take when such a service is detected. Service Classifier 1218 reads in all available service definitions at startup time, and builds an internal data structure for service classification. Additional service definitions may be loaded at runtime. When service definitions are added or removed, this internal data structure is amended dynamically.
  • a service definition may request that the traffic profile and the flow information record be sent to other components including, but not limited to, BMC 1201.
  • a service definition may be used, for example, to send information to Application Scripts running in a sandbox.
  • the Service Management Component maintains one FIR for each flow it monitors. Protocol classification and service classification results are recorded in these Flow Information records. FIRs are used when communicating classification information with other components.
  • the SMC 1202 relies on PPHAL 1220 to deliver bifurcated flows of packets to the correct instance of the software. As discussed earlier, when a subscriber is authenticated, it is assigned to a particular compute-node, and a particular instance of the SMC software. In addition, a backup instance may be specified at the same time by Global Arbitrator 1100. This assignment is communicated to PPHAL 1220, and it is the responsibility of the PPHAL to deliver bifurcated packets to the correct instance of the software. Moreover, SMC 1202 will instruct PPHAL 1220 to stop bifurcation of certain flows when enough information has been obtained from packet inspection.
  • the main communication method for delivering bifurcated packets from PPHAL 1220 to SMC 1202 will be IPC, using the "push" model.
  • the communication channel from SMC 1202 to PPHAL 1220 for control messaging will use RPC (Remote Procedure Call).
  • the primary client of SMC 1202 is the Bandwidth Management Component 1201.
  • a BMC instance notifies the SMC instance on the same compute-node of the set of subscribers who are under its management.
  • the SMC instance registers with PPHAL 1220 to receive bifurcated packets from these subscribers.
  • SMC 1202 sends the classification result and traffic profile to BMC 1201 for policy decision rendering.
  • the API from BMC to SMC comprises an RPC interface; whereas the interface from SMC to BMC will use IPC. As there is no shared data, no locking is necessary.
  • the Application Scripting Component 1204 is a client of SMC 1202, and is very similar to BMC 1201.
  • the ASC receives copies of IPC events that are sent to BMC 1201. However, it does not provision SMC 1202 as does BMC 1201.
  • SMC 1202 couples with a configuration manager (CM) in the Management Layer (not shown).
  • the configuration manager resides on a compute-node that is dedicated for OAMP functions. They communicate via the provisioning database. While the configuration manager has write access to the provisioning information database, SMC only has read access. Multiple instances of SMC may share read locks on the same record set in this database.
  • the Application Scripting Component 1204 implements a sandbox where "application scripts" may be executed in virtual machines.
  • the ASC provides an API and an execution environment similar to what is available to BMC 1201
  • Service Definition Scripts may direct specific messages to a particular application script.
  • Application Scripts may implement custom application state machines or security and traffic management policies. Each script has its dedicated environment. This is where network-based applications can be hosted on the service node.
  • VMM 1500 is responsible for setting up the sandbox, and starting all virtual machines that run application scripts. For every application script, a separate virtual machine 1504 is started. The VMM monitors virtual machines that it starts, and restarts them if they crash.
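The supervision behavior described for the VMM (one virtual machine per application script, restarted on a crash) can be pictured with the toy sketch below; using one OS process per script via Python's multiprocessing module is purely an illustrative assumption, not how the CLR-based sandbox is implemented.

```python
import multiprocessing
import time


def run_script(name):
    """Stand-in for a virtual machine executing one application script."""
    print(f"application script {name} running")
    time.sleep(60)


def vmm(script_names, poll_interval=1.0):
    """Start one process ("virtual machine") per script and restart any that die."""
    vms = {n: multiprocessing.Process(target=run_script, args=(n,)) for n in script_names}
    for vm in vms.values():
        vm.start()
    while True:
        for name, vm in list(vms.items()):
            if not vm.is_alive():                     # crashed or exited
                vms[name] = multiprocessing.Process(target=run_script, args=(name,))
                vms[name].start()                     # restart it
        time.sleep(poll_interval)


if __name__ == "__main__":
    vmm(["bandwidth_report", "parental_control"])     # runs until interrupted
```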
  • Sandbox 1502 provides an execution environment for generic state machine engines (see Figure 12) (or virtual machines), which, in turn, run Application Scripts 1506.
  • the sandbox delivers events from other parts of the system to the virtual machines; in addition, it provides guarded access to a selected API 1508 for accessing resources and runtime information available on the network element.
  • the virtual machines comprise generic state machine engines. In respective embodiments they may be implemented as Mono or Java virtual machines with specific class libraries.
  • the sandbox is built with a Common Language Runtime (CLR) that is based on Mono with a custom-built Just-In-Time compiler to execute the Common Intermediate Language (CIL) byte code.
  • CLR Common Language Runtime
  • CIL Common Intermediate Language
  • the generic state machines may be implemented as "application domains" within the sandbox.
  • class libraries provide the event mechanism as well as API 1508 to the rest of the system.
  • Application scripts have access to flow information, statistics, as well as classification results from SMC 1202. They may be granted other access to components such as Traffic Management and Subscriber Management, depending on their intended application.
  • the Application Scripts 1506, are in the form of CIL packages known as "assemblies.”
  • An application service developer may use any language for which a compiler with a backend for generating CIL byte code is available.
  • the preferred language is C#.
  • Each Application Script 1506 is loaded into a separate application domain in the CLR.
  • the Application Scripts have access to system information and resources similar to those available to BMC 1201; however, they are not allowed to interact with each other for security reasons.
  • the various resources that are employed for handling a given flow may be (generally) located anywhere within the service node, thus supporting various capabilities, such as full scalability and failover. However, specific resources are assigned for handling particular flows based on the subscriber and possibly other considerations, such as application (e.g., VoIP, VoD, etc.).
  • Figure 16 illustrates operations that are employed to provision a new flow. The process begins in a block 1600, wherein a subscriber attempts to obtain an IP address or initiates a PPPoE session.
  • each ingress Traffic Blade will provide a set of one or more input ports via which input traffic is received by the service node, such that a particular flow received from a given subscriber will be received by a particular Traffic Blade.
  • that Traffic Blade will perform a preliminary inspection to identify the subscriber or subscriber session (e.g., a given subscriber may have multiple sessions open at the same time, including sessions having different applications). Since this is a new flow, it has yet to be assigned, a condition that will be identified by the Traffic Blade.
  • the Traffic Blade NPU will have access to a distributed runtime database containing flow assignments based on some filtering criteria (e.g., 5-Tuple signature), whereby the NPU can determine the assignment for a flow using a database lookup. In the case of a new flow, this will result in a miss, and the NPU will forward the processing to the control plane and send the flow to an OAMP blade to verify and/or authenticate the subscriber.
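The lookup-then-miss behavior described above might look roughly like the following sketch, with a plain dictionary standing in for the distributed runtime database; the function names and record fields are illustrative assumptions.

```python
# 5-Tuple key: (src_ip, dst_ip, src_port, dst_port, protocol)
flow_assignments = {
    ("10.0.0.5", "172.16.1.9", 51000, 80, "TCP"): {"compute_node": 3, "egress_blade": 10},
}


def lookup_or_punt(five_tuple, punt_to_control_plane):
    """NPU fast-path behavior: a hit returns the existing assignment; a miss
    punts the packet to the control plane / OAMP blade for authentication."""
    assignment = flow_assignments.get(five_tuple)
    if assignment is None:
        return punt_to_control_plane(five_tuple)   # new flow: authenticate subscriber
    return assignment


def punt(five_tuple):
    print("miss: forwarding", five_tuple, "to OAMP blade for subscriber verification")
    return None


lookup_or_punt(("10.0.0.7", "172.16.1.9", 52000, 80, "TCP"), punt)   # miss -> punt
```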
  • subscriber authentication may be performed using one of many well-known authentication schemes, such as an AAA server.
  • the process moves to a block 1604, wherein the subscriber will be associated with an IP address or other Tuple, with the association being stored as a record in the runtime database.
  • the global arbitrator assigns the subscriber to a compute node/blade based on a combination of criteria, including the group associated with the subscriber/application and dynamic considerations, such as discussed above (e.g., load balancing, etc.)
  • the global arbitrator further informs the SME that the subscriber has been assigned to the compute node/blade and which Traffic Blade the subscriber has been activated on.
  • an exemplary service node environment is shown in Figure 17. As discussed above with reference to Figure 10, this environment employs a fully-populated ATCA chassis including 10 Traffic Blades 600 1-10 and 4 Compute Blades 602 1-4, with Compute Blade 602 1 being provisioned to support OAMP functions. Each of the Traffic and Compute Blades is communicatively coupled via mesh interconnect 618, with access to the interconnect being managed by the backplane fabric switch on each blade.
  • a global arbitrator 1100 instance will run on each blade, as exemplified by OAMP blade 602 1. For simplicity and clarity, the other global arbitrators are not shown.
  • various SME component instances will be run on the various blades.
  • these SME component instances are schematically depicted as SME instances 1200 1-3. It will be understood that the actual SME component instances will include instances of SME components applicable to the respective execution host (i.e., each of the Traffic Blades 600 1-10 and Compute Blades 602 1-4 will run instances of associated SME components).
  • with reference to Figure 18, a packet (flow) is received at an ingress port.
  • an ingress packet flow Fi (depicted as packets P1 and P2) is received at an input port of an ingress Traffic Blade 600 1.
  • ingress operations, including primary classification, are performed by the Ingress Traffic Blade's NPU, which functions as the Ingress Traffic Processor TPi of Figure 5.
  • in one embodiment, the primary classification comprises a 5-Tuple Signature match. Other classification schemes may also be employed in a similar manner. The purpose of the classification is to associate the packets with a subscriber flow.
  • subscriber information is retrieved from a runtime database (e.g., Subscriber Management DB 1206) to identify the compute resources that will be employed to process the subscriber flow. As discussed above, this information (i.e., subscriber-to-resource mapping) is generated in block 1604 of Figure 16.
  • the compute resource comprises Compute Node #3 on Compute Blade 602 2.
  • the Ingress Traffic Blade also identifies the Egress Traffic Blade to which the packet is to be sent for egress processing. In the illustrated example, this comprises Egress Traffic Blade 600 10, which also functions as the Egress Traffic Processor TPe of Figure 5.
  • the processing of the packet is bifurcated.
  • This aspect involves two primary operations: copy the packet to each target processor, and perform respective sets of packet processing operations on those target processors.
  • the copy operation is accomplished in the following manner. First, the packet is copied from its NPU buffer (e.g., local RAM) into the backplane fabric switch 814 of Ingress Traffic Blade 600 1. Along with copying the packet, information is provided to the backplane fabric switch to instruct the switch to which target blades the packet is to be copied.
  • the backplane fabric switches 814 and 708 implement an insertion ring scheme, under which certain cross-connections exist for each transmit "cycle." A bit mask or the like is used to identify which blade(s) is/are targeted to receive a copy of the data, based on the slot address of the blade.
  • the backplane fabric switch of a given blade determines if there is an interconnect from itself to the backplane fabric switch on another blade for which a request for a data transfer is pending. If so, the data is transmitted during that cycle; otherwise, the backplane fabric switch waits for the next cycle. Meanwhile, the data is held in a fabric switch buffer until all copies of the data have been transmitted.
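The sketch below models this cycle-by-cycle behavior: a bit mask names the target slots, and the packet remains buffered until every targeted copy has been transmitted. The cycle abstraction and data structures are simplifications assumed for illustration.

```python
def replicate_over_ring(packet, target_slot_mask, link_free_per_cycle):
    """Copy one buffered packet to every blade whose bit is set in the mask.
    link_free_per_cycle is a list of per-cycle sets naming the slots whose
    cross-connect to this blade is available in that cycle. The packet stays
    in the fabric-switch buffer until every targeted copy has been sent."""
    pending = {slot for slot in range(16) if target_slot_mask & (1 << slot)}
    for cycle, free_slots in enumerate(link_free_per_cycle):
        sent = pending & free_slots
        for slot in sorted(sent):
            print(f"cycle {cycle}: copied packet to slot {slot}")
        pending -= sent
        if not pending:
            break                    # all copies sent; buffer can be released
    return pending                   # any slots still pending (should be empty)


# Target slots 3 and 10 (e.g., a compute blade and the egress traffic blade).
mask = (1 << 3) | (1 << 10)
replicate_over_ring(b"packet bytes", mask, link_free_per_cycle=[{10}, {1, 3}])
```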
  • This scheme provides several advantages over conventional copy schemes. Under a typical scheme, a processor or the like is employed for each copy. In addition, an interconnect path may not be immediately available, especially under a priority-based fabric switch scheme. Each of these may consume additional processor resources, which in turn may introduce jitter and/or delay. In contrast, under the insertion ring discussed herein, there is no jitter or delay introduced to the packet processing.
  • the flow to the Egress Traffic Blade (the egress flow) is termed Fe.
  • the bifurcated flow to the Compute Blade is termed Fb.
  • egress packet processing operations are performed in a block 1808. This typically involves assigning the packet to an associated flow queue and appending the packet to that flow queue. Other ongoing traffic management and shaping processes may be employed for dispatching the flow queues for transmission to a next hop in the network, such as employed by round robin and priority-based dispatch schemes.
  • the flow queue assignment is based, in part, on current traffic profiles and subscriber flow attributes such as flow application.
  • flow queues are dispatched using a priority-based scheme, with flows associated with higher QoS, for example, receiving higher priority.
  • the availability of queue dispatch (and corresponding queue assignments) may be managed in view of current traffic profiles. Accordingly, the assignment of a subscriber flow packet into a dispatch queue is a dynamic consideration that may change over time in view of changes in traffic profiles and the like.
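A minimal sketch of the queue assignment and priority-based dispatch described in this and the preceding bullets; the class-of-service names and strict-priority policy are illustrative assumptions.

```python
from collections import deque

# One deque per class of service, highest priority first.
flow_queues = {
    "guaranteed": deque(),
    "managed": deque(),
    "best-effort": deque(),
}
PRIORITY_ORDER = ["guaranteed", "managed", "best-effort"]


def enqueue(packet, class_of_service):
    """Egress step (block 1808): append the packet to its assigned flow queue."""
    flow_queues[class_of_service].append(packet)


def dispatch_one():
    """Strict-priority dispatch: always serve the highest-priority non-empty queue."""
    for cos in PRIORITY_ORDER:
        if flow_queues[cos]:
            return flow_queues[cos].popleft()
    return None


enqueue(b"voip-frame", "guaranteed")
enqueue(b"web-page", "best-effort")
assert dispatch_one() == b"voip-frame"   # guaranteed traffic goes out first
```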
  • the Analysis Engine of Figure 5 is employed to perform analysis of the traffic flow in a block 1810.
  • the Analysis Engine functions are effected via corresponding SME component instances in the manner described above and as represented by SME instance 1200 2.
  • the Analysis Engine generates traffic profile information that is provided to each of the Ingress and Egress Traffic Blades to update their traffic profile information.
  • this result can be effected by updating a local instance of a distributed Traffic Profile (TP) database 1800 with the new traffic profile information.
  • the updated traffic profile information is propagated to each local instance of the Traffic Profile database, thereby effectively providing the updated traffic profile information to Egress Traffic Blade 600 10, as depicted by the update Traffic Profiles operation of block 1814.
  • the traffic analysis information generated in block 1810 may be used to adjust policing operations performed on the ingress flow, as shown in a block 1816.
  • This provides a means by which an Ingress Traffic Blade can dynamically adjust its ingress processing operations for a given flow in view of real-time traffic analysis feedback derived from that flow.
  • a machine-readable medium includes any mechanism for storing or transmitting information in a form readable by a machine (e.g., a computer).
  • a machine-readable medium may include a read only memory (ROM); a random access memory (RAM); magnetic disk storage media; optical storage media; a flash memory device; etc.
  • a machine-readable medium can include propagated signals such as electrical, optical, acoustical or other form of propagated signals (e.g., carrier waves, infrared signals, digital signals, etc.).

Abstract

Methods and apparatus for optimum matching of a traffic profile with an individual traffic flow using flow bifurcation and analysis. Packets that make up a flow received at an ingress element are bifurcated and duplicated, and forwarded to each of egress traffic and computation processor resources, such that egress traffic operations and traffic analysis operations may be performed concurrently without introducing jitter or delay in either bifurcated processing path. The traffic analysis includes maintaining flow statistics and flow stateful information, and classifying the flow as a particular application traffic type. The optimum traffic profile for this application traffic type is then selected and applied to the individual flow. The traffic analysis data is forwarded to ingress and egress processing elements in real time, and ingress and egress traffic processing operations are dynamically adjusted in view of the traffic analysis data.

Description

PACKET FLOW BIFURCATION AND ANALYSIS
FIELD OF THE INVENTION
[0001] The field of invention relates generally to congestion and flow control in converged full service communication systems, and, more specifically but not exclusively, relates to applying quality of service profiles in real time to groups of traffic flows.
BACKGROUND INFORMATION
[0002] As depicted in Figure 1, a modern metro area network 100 is composed of two types of networks: a core network 102 and one or more access networks 106. The core network 102 communicates data traffic from one or more service providers 104A-104N in order to provide services to one or more subscribers 108A-108M. Services supported by the core network 102 include, but are not limited to, (1) a branded service, such as a Voice over Internet Protocol (VoIP), from a branded service provider; (2) a licensed service, such as Video on Demand (VoD), through a licensed service provider; and (3) traditional Internet access through an Internet Service Provider (ISP).
[0003] The core network supports a variety of protocols (Synchronous Optical Networking (SONET), Internet Protocol (IP), Packet over SONET (POS), Dense Wave Division Multiplexing (DWDM), OSPF, BGP, ISIS, etc.) using various types of equipment (core routers, SONET add-drop multiplexers (ADM), DWDM equipment, etc.). Furthermore, the core network communicates data traffic from the service providers 104A-104N to access network(s) 106 across link(s) 112. In general, link(s) 112 may be a single optical, copper or wireless link or may comprise several such optical, copper or wireless link(s).
[0004] On the other hand, the access network(s) 106 complements the core network 102 by aggregating the data traffic from the subscribers 108A-108M.
Access network(s) 106 may support data traffic to and from a variety of types of subscribers 108A-108M (e.g., residential, corporate, mobile, wireless, etc.). Although the access network(s) 106 may not include each of the types of subscriber (residential, corporate, mobile, etc.), access network(s) 106 will include at least one subscriber. Typically, access network(s) 106 supports thousands of subscribers 108A-108M. Access network(s) 106 aggregates data traffic from the subscribers over link(s) 112 connecting to the core network 102. Access networks support a variety of protocols (e.g., IP, Asynchronous Transfer Mode (ATM), Frame Relay, Ethernet, Digital Subscriber Line (DSL), Dynamic Host Configuration Protocol (DHCP), Point-to-Point Protocol (PPP), Point-to-Point Protocol over Ethernet (PPPoE), etc.) using various types of equipment (Edge routers, Broadband Remote Access Servers (BRAS), Digital Subscriber Line Access Multiplexers (DSLAM), switches, etc.). The access network(s) 106 uses subscriber policy manager(s) 110 to set policies for individual ones and/or groups of subscribers. Policies stored in a subscriber policy manager(s) 110 allow subscribers access to different ones of the service providers 104A-N. Examples of subscriber policies are bandwidth limitations, traffic flow characteristics, amount of data, allowable services, etc.
[0005] Before discussing subscriber policies and the effect on services, it is worth noting that data traffic is transmitted in data packets. A data packet (also known as a "packet") is a block of user data with necessary address and administration information attached, usually in a packet header and/or footer, which allows the data network to deliver the data packet to the correct destination. Examples of data packets include, but are not limited to, IP packets, ATM cells, Ethernet frames, SONET frames and Frame Relay packets. Typically, data packets having similar characteristics are transmitted in a flow at a transmission rate. The transmission rate is determined by the packet size and the transmission gap (or "inter-packet gap") between each packet. In addition, the transmission rate of data packets is dependent on the capacity of the network connection and processor capability of the transmitting device.
[0006] Figure 2 represents the Open Systems Interconnect (OSI) model of a layered protocol stack for transmitting data packets 200. Each layer installs its own header in the data packet being transmitted to control the packet through the network. The physical layer (layer 1) 202 is used for the physical signaling. The next layer, data link layer (layer 2) 204, enables transferring of data between network entities. The network layer (layer 3) 206 contains information for transferring variable length data packets between one or more networks. For example, IP addresses are contained in the network layer 206, which allows network devices (also commonly referred to as network elements) to route the data packet. Layer 4, the transport layer 208, provides transparent data transfer between end users. The session layer (layer 5) 210 provides the mechanism for managing the dialogue between end-user applications. The presentation layer (layer 6) 212 provides independence from differences in data representation (e.g., encryption, data encoding, etc.). The final layer is the application layer (layer 7) 214, which contains the actual data used by the application sending or receiving the packet. While most protocol stacks do not exactly follow the OSI model, it is commonly used to describe networks.
[0007] Returning to Figure 1, bandwidth sensitive services, such as VoIP or VoD, require a dedicated bandwidth over link(s) 112 to properly operate. However, because each access network 106 can support thousands of subscribers, link(s) 112 can get overloaded and not provide enough bandwidth for these bandwidth sensitive services. Subsequently, the quality of these services degrades or becomes interrupted altogether. One solution to this problem is to enforce a Quality of Service (QoS) from the core 102 and/or access 106 networks. QoS allocates different bandwidth rates to different types of data traffic. For example, QoS can be set up to allocate a bandwidth of 20 Mbps for VoIP service over link(s) 112. In addition, QoS shapes the data traffic by re-transmitting the data traffic at a constant rate. However, for QoS to work properly, both the core and access networks must be set up to support the desired QoS policy.
[0008] Devices that solely perform QoS can be categorized as, but are not limited to, either traffic shapers or flow switches. A traffic shaper is a device that classifies a packet by deep packet inspection and transmits the packet based on pre-determined subscriber policies. Turning to Figure 2, deep packet inspection examines the data contained in layers up to and including application layer 214 of each data packet 200 to determine what quality of service should be used for the packet. For example and by way of illustration, deep packet inspection matches the structure of the application layer data with potentially hundreds of known application data types. This allows a traffic shaper to finely tune the quality of service enforced. For instance, a traffic shaper may identify control packets for an adaptable video conferencing protocol to configure the network for an optimal video conferencing rate.
[0009] Although existing traffic shapers are subscriber aware, these traffic shapers only enforce pre-determined subscriber policies. That is, subscriber policies are set by the operator of the traffic shaper and do not change until the operator modifies the subscriber policies. This does not allow subscriber policies to change in real-time based on existing network conditions. Furthermore, existing traffic shapers cannot handle the high volume of data traffic that crosses the core 102 and access 106 networks.
[0010] On the other hand, flow switches are network devices that transmit data packets in connected flows, instead of discrete packets. Flow switches operate on groups of similar packets to provide QoS for an application. However, flow switches have limited data traffic processing capability, are not subscriber aware, perform limited or no deep packet inspection, and cannot update subscriber policies in real-time.
SUMMARY OF THE INVENTION
[0011] In accordance with aspects of the present invention, methods and apparatus are disclosed for optimum matching of traffic profiles with individual traffic flows. This matching of optimum traffic profiles occurs in real time without any static provisioning linking the flow with the profile. The matching is performed by the bifurcation and duplication of packets that make up a flow to each of egress traffic and computation processor resources, such that egress traffic operations and traffic analysis operations may be performed concurrently on the egress traffic and computation processor resources without introducing jitter or delay in either bifurcated processing path. The traffic analysis includes maintaining flow statistics and flow stateful information, and classifying the flow as a particular application traffic type. The optimum traffic profile for this application traffic type is then selected and applied to the individual flow. The traffic analysis data is forwarded to ingress and egress processing elements in real time, and ingress and egress traffic processing operations are dynamically adjusted in view of the traffic analysis data.
[0012] In another aspect of the present invention, an implementation environment comprising an apparatus for performing the method is disclosed. The apparatus includes a chassis populated with multiple traffic and compute blades. The various blades are enabled to communicate with one another using a backplane mesh interconnect provided by the chassis under management of backplane fabric switches hosted by each blade. Distributed software components are also provided for facilitating the method via execution on associated processing elements on the traffic and compute blades.
BRIEF DESCRIPTION OF THE DRAWINGS
[0013] The foregoing aspects and many of the attendant advantages of this invention will become more readily appreciated as the same becomes better understood by reference to the following detailed description, when taken in conjunction with the accompanying drawings, wherein like reference numerals refer to like parts throughout the various views unless otherwise specified:
[0014] Figure 1 (Prior Art) illustrates a typical metro area network configuration;
[0015] Figure 2 (Prior Art) is a block diagram illustrating layers of the OSI protocol stack;
[0016] Figure 3 illustrates an exemplary network configuration using a traffic shaping service node in a metro area network, according to one embodiment of the invention;
[0017] Figure 4 is a diagram illustrating sets of Ingress Traffic Processors, Computation Processors, and Egress Traffic Processors used to implement aspects of the invention;
[0018] Figure 5 is a diagram of a feed-forward control loop illustrating aspects of the invention employed via bifurcation of packet flows;
[0019] Figure 6 is a schematic diagram illustrating the communication interconnect between a Traffic Blade and a Compute Blade;
[0020] Figure 7 is a schematic diagram illustrating one embodiment of a Compute Blade that is provisioned for an OAMP function;
[0021] Figure 8 is a schematic diagram illustrating one embodiment of a Traffic Blade;
[0022] Figure 9 is a schematic diagram illustrating one configuration of a service node implemented via an ATCA chassis;
[0023] Figure 10 is a schematic diagram illustrating details of the inter-blade communication scheme, according to one embodiment of the invention;
[0024] Figure 11 is a schematic diagram illustrating a service node implementation environment including a local instance of a global arbitrator on each blade;
[0025] Figure 12 is a schematic diagram illustrating various components associated with a Service Management Engine (SME);
[0026] Figure 13 is a schematic diagram illustrating details of the Bandwidth Management Component of the SME components of Figure 12;
[0027] Figure 14 is a schematic diagram illustrating details of the Services Management Component of the SME components of Figure 12;
[0028] Figure 15 is a schematic diagram illustrating details of the Application Scripting Component of the SME components of Figure 12;
[0029] Figure 16 is a flowchart illustrating operations employed to provision compute resources for an associated subscriber flow;
[0030] Figure 17 is a schematic diagram of an exemplary execution environment for a service node used in connection with the packet processing operations of Figure 18; and
[0031] Figure 18 is a flowchart illustrating operations performed in connection with processing a packet flow, according to one embodiment of the invention.
DETAILED DESCRIPTION
[0032] Embodiments of methods and apparatus for optimum matching of traffic profiles with individual traffic flows are described herein. In the following description, numerous specific details such as application subscriber data traffic flow, traffic policy, data packet, line card architectures, software functionality and interrelationships of system components are set forth in order to provide a more thorough understanding of the invention. It will be appreciated, however, by one skilled in the art that the invention may be practiced without such specific details. In other instances, control structures, gate level circuits and full software instruction sequences have not been shown in detail in order not to obscure the invention. Those of ordinary skill in the art, with the included descriptions, will be able to implement appropriate functionality without undue experimentation.
[0033] References in the specification to "one embodiment", "an embodiment", "an example embodiment", etc., indicate that the embodiment described may include a particular feature, structure, or characteristic, but every embodiment may not necessarily include the particular feature, structure, or characteristic. Moreover, such phrases are not necessarily referring to the same embodiment. Further, when a particular feature, structure, or characteristic is described in connection with an embodiment, it is submitted that it is within the knowledge of one skilled in the art to effect such feature, structure, or characteristic in connection with other embodiments whether or not explicitly described.
[0034] In the following description and claims, the term "coupled," along with its derivatives, is used. "Coupled" may mean that two or more elements are in direct physical or electrical contact. However, "coupled" may also mean that two or more elements are not in direct contact with each other, but yet still co-operate or interact with each other.
[0035] Exemplary embodiments of the invention will now be described with reference to Figures 3-18. In particular, the operations of the flow diagrams in Figures 5, 16, and 18 will be described with reference to the exemplary architecture embodiments of Figures 4, 6-15, and 17. However, it should be understood that the operations of these flow diagrams can be performed by embodiments of the invention other than those discussed with reference to Figures 4, 6-15, and 17, and that the embodiments discussed with reference to Figures 4, 6-15, and 17 can perform operations different than those discussed with reference to these flow diagrams.
Exemplary Traffic Shaping Service Node
[0036] Figure 3 illustrates an exemplary network configuration using a traffic shaping service node 302 in a metro area network according to one embodiment of the invention. (For simplicity and convenience, the terminology "traffic shaping service node" and "service node" are alternatively used herein.) In Figure 3, traffic shaping service node 302 is communicatively coupled between the core 102 and access 106 networks. While one embodiment is described under which the traffic shaping service node may shape traffic traveling in either direction, alternative embodiments may shape in only one direction (e.g., the service provider data traffic coming from the core network 102). Traffic shaping, a form of QoS, is the process of regulating and smoothing the flow of network data traffic within a computer network. Restricting the bandwidth of the traffic flow is one way to regulate data traffic. There are a variety of ways to bring data traffic flow to a desired rate, including dropping or discarding data packets, buffering received data packets and re-transmitting the data packets at the desired rate, combinations of these (e.g., buffering packets when there is space in the buffer and dropping packets when there is not), etc. Buffering the data traffic flow allows the traffic shaping service node to smooth the data traffic flow. Smoothing removes the bursts of data traffic and shapes the data traffic into a constant flow of data traffic. Smoothing is advantageous for applications that depend on a constant flow of data traffic. For example, video-based applications, such as VoD or video conferencing, or real-time voice applications (VoIP) benefit from a constant flow of data traffic. In general, the traffic shaping service node 302 uses the subscriber policies contained in subscriber policy manager(s) 110 for instruction on how to shape the data traffic from service providers 104A-104N and/or subscribers 108A-108M accordingly. Further details of various elements of embodiments of traffic shaping service nodes are discussed below.
Packet Flow Bifurcation and Analysis
[0037] In accordance with further aspects of the invention, techniques are provided to enable optimum matching of a traffic profile with an individual traffic flow. This matching of the optimum traffic profile occurs in real time without any static provisioning linking the flow with the profile. The matching is achieved by the bifurcation and duplication of all packets (i.e., packet data) that make up a flow to a computation resource that will analyze the packets, maintain flow statistics and flow stateful information, and classify the flow as a particular application traffic type. The optimum traffic profile for this application traffic type is then selected and applied to the individual flow in real time.
[0038] In general, a Traffic Flow comprises a set of packets having similar flow classification parameters. For example, a typical ingress operation performed by a layer-4 router or the like is to classify received packets to associated flows using a rule-based packet classification scheme, such as defined by an Access Control List (ACL) database. Traditionally, the rules for classifying a message (i.e., one or more associated packets) are called filters (or rules in firewall terminology), and the packet classification problem is to determine the lowest cost matching filter or rule for each incoming message at the network element. Under the well-known N-tuple classification scheme, the relevant information is contained in N distinct header fields (or partial header fields) in each packet. For instance, under the common 5-Tuple classification scheme, the relevant fields for an IPv4 packet comprise the Destination Address (32 bits), the Source Address (32 bits), the Destination Port (16 bits), the Source Port (16 bits), and the Protocol Field (8 bits); the set of field values for a given packet is referred to as the 5-Tuple signature.
[0039] The corresponding filter database consists of a finite set of filters, filt1, filt2, ..., filtN. Each filter is a combination of N values, one for each header field. Each field in a filter is allowed three kinds of matches: exact match, prefix match, or range match, as well as wildcard. In an exact match, the header field of the packet should exactly match the filter field. In a prefix match, the filter field should be a prefix of the header field. In a range match, the header values should lie in the range specified by the filter. Each filter filti has an associated directive dispi, which specifies how to process a packet matching the filter.
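The following sketch illustrates the filter semantics just described (exact, prefix, range, and wildcard matches, with the lowest-cost matching filter's directive applied); the filter encoding and the example rules are assumptions made for illustration.

```python
import ipaddress

# Each filter field is one of: ("exact", value), ("prefix", "a.b.c.d/n"),
# ("range", lo, hi), or ("any",) for a wildcard.
FILTERS = [
    {"cost": 10, "dst": ("prefix", "172.16.0.0/16"), "src": ("any",),
     "dport": ("exact", 5060), "sport": ("any",), "proto": ("exact", "UDP"),
     "disp": "classify-as-voip"},
    {"cost": 100, "dst": ("any",), "src": ("any",),
     "dport": ("range", 0, 65535), "sport": ("any",), "proto": ("any",),
     "disp": "best-effort"},
]


def field_matches(rule, value):
    kind = rule[0]
    if kind == "any":
        return True
    if kind == "exact":
        return value == rule[1]
    if kind == "prefix":
        return ipaddress.ip_address(value) in ipaddress.ip_network(rule[1])
    if kind == "range":
        return rule[1] <= value <= rule[2]
    return False


def classify(dst, src, dport, sport, proto):
    """Return the directive of the lowest-cost filter matching the 5-Tuple."""
    matches = [f for f in FILTERS
               if field_matches(f["dst"], dst) and field_matches(f["src"], src)
               and field_matches(f["dport"], dport) and field_matches(f["sport"], sport)
               and field_matches(f["proto"], proto)]
    return min(matches, key=lambda f: f["cost"])["disp"] if matches else None


print(classify("172.16.4.2", "10.0.0.5", 5060, 40000, "UDP"))   # classify-as-voip
```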
[0040] An Application Flow is a flow of packets that carries traffic belonging to a specific application, such as VoIP or VoD. In some cases, the data carried in the packet header (e.g., the 5-Tuple signature) is insufficient to classify a flow as belonging to an application. In this case, data in the packet payload is employed for identifying the appropriate application. To accomplish this function, a technique known as Deep Packet Inspection is required to further analyze the contents of the packet and to keep stateful context of previous packets seen in the flow.
[0041] Modern network equipment can typically support 5-Tuple classification at line speed rates. However, classification schemes that employ deep packet inspection may not be implemented at line rate speeds in many instances. As a result, separate processing paths, respectively referred to as fast path and slow path, are typically employed for performing 5-Tuple classification and deep packet inspection on a given network device.
[0042] Various high-level aspects of the bifurcation and flow analysis techniques of the present invention are illustrated by way of example in Figures 4 and 5. As shown in the architecture diagram of Figure 4, an apparatus via which the techniques may be implemented includes various processor resources, including multiple ingress traffic processors 400 (depicted as i ingress traffic processors 400 1-i), computation processors 402 (depicted as j computation processors 402 1-j), and egress traffic processors 404 (depicted as k egress traffic processors 404 1-k). In general, the particular values for i, j, and k may be the same or may differ, depending on the particular implementation environment and requirements. The various ingress traffic processors 400 1-i, computation processors 402 1-j, and egress traffic processors 404 1-k are interconnected in a mesh fashion such that any processor may communicate with any other processor. In one embodiment, the communication rates amongst the various processors are equal. In other embodiments, the communication rates may differ. In one embodiment, the traffic processors perform 5-Tuple classification of flows, while the computation processors perform deep packet inspection of the flows and maintain stateful analysis data pertaining to each flow.
[0043] With reference to Figure 5, an Ingress Traffic Processor is termed TPi, an Egress Traffic Processor is termed TPe, the ingress flow is termed Fi, and the egress flow is termed Fe. Packets enter the apparatus at the TPi. The TPi identifies a new incoming flow and makes a routing decision R(Fi) regarding the destination TPe. Packets from the identified flow shall henceforth be sent to the TPe and exit the apparatus; packets thus flow from TPi to TPe.
[0044] In addition, Fi is bifurcated, and its packets duplicated, to provide an equivalent and simultaneous flow that is sent to one of the many Computation Processors (CP); which CP is selected is decided programmatically and is a function S(Fi) of the flow Fi, as schematically depicted by the feed-forward control loop of Figure 5. The bifurcated flow is termed Fb. When the Fi flow starts, the CP shall be notified of the creation of a new flow, along with the 5-Tuple that accompanies the flow, and it shall expect reception of the flow Fb. The CP shall route the Fb to specific software Analysis Entities that are capable of analyzing a traffic flow associated with the specific 5-Tuple. This routing to specific analysis entities enables any state or analytical information that has been extracted from the flow to be preserved, meaning that the analysis process has access to past state information and can alter its analysis based upon this information. As the packet flow is bifurcated and duplicated, no additional latency or jitter is introduced into either traffic flow Fi or Fe.
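One simple way to realize the selection function S(Fi) is a stable hash of the flow's 5-Tuple modulo the number of computation processors, so that every duplicated packet of Fi reaches the same CP and the Analysis Entity holding that flow's state; the hash-based choice below is an assumption for illustration, not the method mandated by the text.

```python
import hashlib


def select_cp(five_tuple, num_cps):
    """S(Fi): map a flow's 5-Tuple to one computation processor.
    A stable hash keeps all packets of a flow (and hence its state) on one CP."""
    key = "|".join(str(field) for field in five_tuple).encode()
    return int(hashlib.sha1(key).hexdigest(), 16) % num_cps


flow = ("10.0.0.5", "172.16.1.9", 51000, 80, "TCP")
cp_index = select_cp(flow, num_cps=8)
print(f"flow {flow} bifurcated to CP {cp_index}")   # same CP for every packet of Fi
```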
[0045] The result Rb of the analysis performed in the CP is then passed on to the TPe and back to the TPi; this enables the TPe to adjust the shaping performed on Fe and allows the TPi to adjust the policing performed on Fi.
[0046] If packet Pt enters the ingress TPi and is bifurcated as part of the flow Fb to the CP, then the Analysis Engine shall be working on packet Pt, producing a result R(Pt) that is sent to TPe. The arrival of the result R(Pt) at the TPe will have a certain time delay introduced in relation to the packet Pt that arrived at TPe as part of the original (non-bifurcated) flow Fe such that, as R(Pt) arrives at TPe, the current packet being processed by TPe shall be Pt+Δ.
[0047] The result R (Pt) is directly used to dynamically modify or adjust the exact traffic profile PRfe being applied to the packets forming the flow Fe, and as such, the traffic profile being selected and applied to a packet Pt+Δ shall be based upon the analysis of packet Pt:
PRfe(Pt+Δ) = R(Pt), where Δ > 0     (1)
[0048] As the CP is shared between all Traffic Processors and the Analysis Engines can retain state information regarding past and present 5-Tuple flows, the apparatus has the ability to view traffic in terms of Traffic Aggregates. Having access to these Traffic Aggregates and the statistics associated with them allows the Analysis Engines to select a traffic profile different from that which would result from isolated analysis of the traffic flow.
[0049] Additionally, services provided over the network will consist of multiple flows, each with its own 5-Tuple signature. The Analysis Engines shall have the ability to draw together and group the information regarding these multiple flows and treat the grouping as a Service Flow. Furthermore, the Analysis Engines have the ability to track the state of a Service Flow and dynamically alter the results being sent to the TPi and TPe that are servicing the flows that make up the Service Flow.
Exemplary Implementation Environments
[0050] Figures 4, 6-15, and 17 illustrate exemplary network element architectures that may be used for a variety of purposes, including, but not limited to, a traffic shaping service node as previously described. Thus, while the exemplary network element architectures described with reference to Figures 4, 6-15, and 17 are described with reference to a traffic shaping service node, it should be understood that these architectures are independent aspects of the invention.
[0051] In accordance with architecture aspects of some embodiments, the aforementioned functions are facilitated by various processing and storage resources hosted by associated line cards and the like, which are mounted in a common chassis. As shown in Figure 6, from a datapath perspective, the hardware architecture of one embodiment of a Service Node can be decomposed into three entities, Traffic Blades (TB) 600, Compute Blades (CB) 602 and the chassis 604. A TB 600 can be further reduced to its physical and link layer portions 606 and 608, network layer components 610, and infrastructure components 612. Similarly, a CB 602 provides Service Layer termination 612 and infrastructure components 614. In one embodiment, a CB can be further re-defined to be an OAMP Blade based on its slot index (within chassis 604). OAMP blades are a functional superset of CBs, adding operations, administration, maintenance and provisioning functionality (collectively referred to as OAMP card function or OAMP CF).
[0052] As illustrated in the embodiments herein, chassis 604 comprises an Advanced Telecommunication and Computing Architecture (ATCA or AdvancedTCA®) chassis. The ATCA Chassis provides physical connectivity between the blades via a passive, full-mesh backplane 616. It is noted that the ATCA environment depicted herein is merely illustrative of one modular board environment in which the principles and teachings of the embodiments of the invention described herein may be applied. In general, similar configurations may be deployed for other standardized and proprietary board environments, including but not limited to blade server environments.
[0053] The ATCA 3.0 base specification (approved December 30, 2002), which is being carried out by the PCI Industrial Computer Manufacturers Group (PICMG), defines the physical and electrical characteristics of an off-the-shelf, modular chassis based on switch fabric connections between hot-swappable blades. (As used herein, the terms "board," "blade," and "card," are interchangeable.) This specification defines the frame (rack) and shelf (chassis) form factors, core backplane fabric connectivity, power, cooling, management interfaces, and the electromechanical specification of the ATCA-compliant boards. The electromechanical specification is based on the existing IEC60297 EuroCard form factor, and enables equipment from different vendors to be incorporated in a modular fashion with guaranteed interoperability. The ATCA 3.0 base specification also defines a power budget of 200 Watts (W) per board, enabling high performance servers with multi-processor architectures and multiple gigabytes of on-board memory.
[0054] In addition to power input to ATCA boards, mating connectors on the boards and backplane are employed for coupling input/output (I/O) signals. Many of the ATCA boards, as well as other modular boards used for telecommunications and computing, such as but not limited to CompactPCI, employ very-high speed I/O channels.
For example, Advanced Switching (AS) employs a serial communication channel operating at Gigahertz+ frequencies. ATCA boards may also provide one or more I/O ports on their front panels, enabling an ATCA board to be coupled to other network resources.
[0055] An exemplary architecture 700 for a compute blade 602 is shown in Figure 7. In one embodiment, a single compute blade (physical) architecture is employed for both Compute Blades and OAMP CFs. More particularly, under architecture 700, a corresponding blade may be deployed to support both Compute Blade and OAMP functionality.
[0056] Compute Blade 602 employs four multiple-processor compute nodes 702 1-4. In general, each of compute nodes 702 1-4 functions as multiple processor resources, with each processor resource being associated with a logical processor. Accordingly, such processor resources may be implemented using separate processors, or processor chips employing multiple processor cores. For example, in the illustrated embodiment of Figure 7, each of compute nodes 702 1-4 is implemented via an associated symmetric multi-core processor. Exemplary multi-core processors that may be implemented include, but are not limited to, Broadcom 1480 and 1280 devices. Each of the compute nodes 702 1-4 is enabled to communicate with other compute nodes via an appropriate interface (e.g., bus or serial-based interfaces). For the Broadcom 1480 and 1280 devices, this interface comprises a "Hyper Transport" (HT) interface. Other native (standard or proprietary) interfaces between processors may also be employed.
[0057] As further depicted in architecture 700, each compute node 702 1-4 is allocated various memory resources, including respective RAM 704 1-4. Under various implementations, each of compute nodes 702 1-4 may also be allocated an external cache 706 1-4, or may provide one or more levels of cache on-chip. In one embodiment, the RAM comprises ECC (Error Correction Code) RAM. In one embodiment, each compute node employs a NUMA (Non-Uniform Memory Access) cache coherency scheme. Other cache coherency schemes, such as MESI (Modified, Exclusive, Shared, Invalidated), may also be implemented for other embodiments.
[0058] Each Compute Blade 602 includes a means for interfacing with ATCA mesh interconnect 618. In the illustrated embodiment of Figure 7, this is facilitated by a Backplane Fabric Switch 708. Meanwhile, a field programmable gate array (FPGA) 710 containing appropriate programmed logic is used as an intermediary component to enable each of compute nodes 702 1-4 to access backplane fabric switch 708 using native interfaces for each of the compute nodes and the fabric switch. In the illustrated embodiment, the interface between each of compute nodes 702 1-4 and the FPGA 710 comprises an SPI (System Packet Interface) 4.2 interface, while the interface between the FPGA and backplane fabric switch 708 comprises a Broadcom HiGig™ interface. It is noted that these interfaces are merely exemplary, and that other interfaces may be employed depending on the native interfaces of the various blade components.
[0059] In addition to local RAM (e.g., RAM 704 1), the compute node associated with the OAMP function (depicted in Figure 7 as Compute Node #1) is provided with local SRAM 712 and a non-volatile store (depicted as Compact Flash 714). The non-volatile store is used to store persistent data used for the OAMP function, such as provisioning information and logs. In Compute Blades that do not support the OAMP function, each compute node is provided with local RAM and a local cache, as depicted in Figure 11.
[0060] In the embodiment illustrated in Figure 7, compute blade 602 is provisioned as an OAMP blade. In one configuration (as shown), one of the compute nodes is employed for performing OAMP functions (e.g., compute node 702 1), while the other three compute nodes (e.g., compute nodes 702 2-4) perform normal compute functions associated with compute blades, as described in further detail below. When a compute blade 602 is provisioned as a compute blade, each of compute nodes 702 1-4 is available for performing the compute functions described herein.
[0061] Figure 8 shows an exemplary architecture 800 for a traffic blade 600. Architecture 800 includes a PHY block 802, an Ethernet MAC block 804, a network processor unit (NPU) 806, a host processor 808, a SERDES interface 810, an FPGA 812, a backplane fabric switch 814, RAM 816 and 818 and cache 819. The traffic blade further includes one or more I/O ports 820, which are operatively coupled to PHY block 802. Depending on the particular use, the number of I/O ports may vary from 1 to N ports. For example, under one traffic blade type a 10 x 1 Gigabit Ethernet (GigE) port configuration is provided, while for another type a 1 x 10GigE port configuration is provided. Other port number and speed combinations may also be employed.
[0062] PHY block 802 and Ethernet MAC block 804 respectively perform layer 1 (Physical) and layer 2 (Data Link) functions, which are well-known in the art. In general, the PHY and Ethernet MAC functions may be implemented in hardware via separate components or a single component, or may be implemented in a combination of hardware and software via an embedded processor or the like.
[0063] One of the operations performed by a traffic blade is packet identification/classification. As discussed above, a multi-level classification hierarchy scheme is implemented for this purpose. Typically, a first level of classification, such as the aforementioned 5-Tuple signature classification scheme, is performed by the traffic blade's NPU 806. Additional classification operations in the classification hierarchy may be required to fully classify a packet (e.g., identify an application flow type) in the manner discussed above. In general, these higher-level classification operations may be performed by the traffic blade's host processor 808 and/or a processor on a compute blade, depending on the particular classification.
[0064] NPU 806 includes various interfaces for communicating with other board components. These include an Ethernet MAC interface, a memory controller (not shown) to access RAM 816, Ethernet and PCI interfaces to communicate with host processor 808, and an XGMII interface. SERDES interface 810 provides the interface between XGMII interface signals and HiGig signals, thus enabling NPU 806 to communicate with backplane fabric switch 814. NPU 806 may also provide additional interfaces to interface with other components, such as an SRAM (Static Random Access Memory) interface unit to interface with off-chip SRAM (both not shown).
[0065] Similarly, host processor 808 includes various interfaces for communicating with other board components. These include the aforementioned Ethernet and PCI interfaces to communicate with NPU 806, a memory controller (on-chip or off-chip - not shown) to access RAM 818, and a pair of SPI 4.2 interfaces. FPGA 812 is employed as an interface between the SPI 4.2 interface signals and the HiGig interface signals.
[0066] Typically, NPUs are designed for performing particular tasks in a very efficient manner. These tasks include packet forwarding and packet classification, among other tasks related to packet processing. To support such functionality, NPU 806 executes corresponding NPU software 822. This software is shown in dashed outline to indicate that the software may be stored (persist) on a given traffic blade (e.g., in a flash device or the like), or may be downloaded from an external (to the traffic blade) store during initialization operations, as described below. During run-time execution, NPU software 822 is loaded into internal SRAM 823 provided by NPU 806.
[0067] Host processor 808 is employed for various purposes, including lower-level (in the hierarchy) packet classification, gathering and correlation of flow statistics, and application of traffic profiles. Host processor 808 may also be employed for other purposes. In general, host processor 808 will comprise a general-purpose processor or the like, and may include one or more compute cores (as illustrated, in one embodiment a two-core processor is used). As with NPU 806, the functionality performed by the host processor is effected via execution of corresponding software (e.g., machine code and/or virtual machine byte code), which is depicted as host software 824. As before, this software may already reside on a traffic blade, or be loaded during blade initialization.
[0068] In one embodiment, host processor 808 is responsible for initializing and configuring NPU 806. Under one initialization scheme, host processor 808 performs network booting via the DHCP (or BOOTP) protocol. During the network boot process, an operating system is loaded into RAM 818 and is booted. The host processor then configures and initializes NPU 806 via the PCI interface. Once initialized, NPU 806 may execute NPU software 822 on a run-time basis, without the need or use of an operating system.
[0069] Figure 9 is a schematic diagram illustrating the cross-connectivity provided by the ATCA backplane mesh interconnect used in one embodiment of the Service Node. In the exemplary configuration 900 shown in Figure 9, an ATCA chassis 604 is fully populated with 14 ATCA blades, with each blade installed in a respective chassis slot - in an actual implementation, the chassis may be populated with fewer blades or may include other types of blades in addition to compute and traffic blades. The illustrated configuration includes four compute blades 602 1-4 and 10 traffic blades 600 1-10, with one of the compute blades being provisioned to provide OAMP functions. As depicted by the interconnection mesh, each blade is communicatively-coupled with every other blade under the control of fabric switching operations performed by each blade's fabric switch. In one embodiment, mesh interconnect 618 provides a 10 Gbps connection between each pair of blades, with an aggregate bandwidth of 280 Gbps.
[0070] Further details of the fabric switching operations are shown in Figure 10, wherein components having like reference numerals to those shown in Figures 7 and 8 perform similar functions. Inter-slot connectivity is enabled through a full mesh interconnect network comprising the ATCA passive backplane and a multi-port 10 Gbps switch fabric device integrated on every blade. In one embodiment, a 16-port 10 Gbps HiGig switch (Broadcom proposed model BCM56700) is implemented for each of backplane fabric switches 708 and 814. As an optional configuration, a pair of 8-port switch fabric devices (e.g., Broadcom BCM5675) may be employed in place of the 16-port device shown in Figure 10. Each fabric switch device maintains chassis-facing ports and local-facing HiGig ports. Each Traffic Blade 600 fabric presents three local-facing ports, while each Compute Blade 602 fabric presents two local-facing ports, with one spare port. The remaining ports are coupled to the backplane mesh, and provide support for a full 14-slot ATCA configuration. [0071] Architecturally, the switch fabric is a distributed shared memory switch architecture with output port buffering. The fabric is implemented as an insertion ring. Each port provides 128K bytes of egress packet buffering (principal queue point) and 19K bytes of ingress packet buffering (sufficient for address resolution and ring insertion delay) for a 9K byte jumbo frame.
[0072] In one embodiment, the switch fabric architecture of Figure 10 utilizes the fabric to create a partial mesh. That is, full "any-to-any" port forwarding is not required. Specifically, the fabric only needs to forward from local to chassis facing ports (and vice versa) and forwarding between chassis facing ports is not required. [0073] Another aspect of the invention relates to scalability. The service node is implemented using a distributed architecture, wherein various processor and memory resources are distributed across multiple blades. To scale a system, one simply adds another blade. The system is further enabled to dynamically allocate processor tasks, and to automatically perform fail-over operations in response to a blade failure or the like. Furthermore, under an ATCA implementation, blades may be hot-swapped without taking the system down, thus supporting dynamic scaling. [0074] Yet another aspect of the invention relates to dynamic allocation of system resources, such as compute resources, queues, etc. Under this concept, compute resources and usage are monitored on an ongoing basis, and real-time statistics and the like are maintained using a distributed database scheme and a local (to each blade) agent, such that each blade has a "global" view of the entire system resource availability and consumption. Such a scheme is schematically illustrated in Figure 11, which shows an OAMP Blade 601, multiple Compute Blades 602, and multiple Traffic Blades 600. Each of these blades runs an instance of a software agent or the like (e.g., service, daemon, etc.) referred to as the "Global Arbitrator" 1100. For example, one of the compute nodes on a Compute Blade will be provisioned to run a Global Arbitrator, while the host processor on a Traffic Blade will be used for a similar function. Each of the Global Arbitrator instances is enabled to communicate with the others using a common protocol running over the aforementioned communication signal infrastructure. For example, in one embodiment, Global Arbitrators exchange IP packets to communicate with one another. Other protocols may be implemented in a similar manner. [0075] The Global Arbitrators 1100 are used to perform various tasks. Some task examples include: Distributed Scheduling, Process Management, Group Management, Inter-Process Communication (IPC), and Remote Procedure Calls (RPC). With respect to Distributed Scheduling and Process Management, each Global Arbitrator will maintain local and global resource information for the system. In this case, each Distributed Scheduler module for each Global Arbitrator instance will identify the resources on its host blade (e.g., disk, memory, processing, networking, and subscribers), and then propagate this information to the other Global Arbitrators, as illustrated in Figure 11. The Distributed Schedulers will also be responsible for load balancing resources across blades by tracking local resource loading and propagating this information to other Distributed Schedulers, such that each Global Arbitrator has access to a "world view" of available resources and resource consumption. [0076] To support these tasks, the Global Arbitrators will maintain a distributed database 1102 having a local instance 1102A stored on each blade. The distributed database 1102, which functions as a virtual database, will be maintained in such a manner that if a blade fails, no information will be lost. Each local database instance will store both local and global data.
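The propagation of local resource information among Global Arbitrator instances described in paragraph [0075] can be sketched as follows; the report fields, peer-exchange calls, and blade identifiers are assumptions made for illustration only.

```python
import copy

class GlobalArbitrator:
    """Toy model of the per-blade agent that shares resource state."""
    def __init__(self, blade_id, cpu_free_pct):
        self.blade_id = blade_id
        self.cpu_free_pct = cpu_free_pct
        self.peers = []
        self.world_view = {}      # blade_id -> latest resource report

    def publish(self):
        """Record the local report and propagate it to every peer instance."""
        report = {"cpu_free_pct": self.cpu_free_pct, "mem_free_mb": 1024}
        self.world_view[self.blade_id] = report
        for peer in self.peers:
            peer.receive(self.blade_id, copy.deepcopy(report))

    def receive(self, blade_id, report):
        self.world_view[blade_id] = report     # merge remote state locally

    def least_loaded_blade(self):
        """Every instance can answer placement questions from its own world view."""
        return max(self.world_view, key=lambda b: self.world_view[b]["cpu_free_pct"])

blades = [GlobalArbitrator("TB-1", 40), GlobalArbitrator("CB-2", 75),
          GlobalArbitrator("CB-3", 55)]
for ga in blades:
    ga.peers = [p for p in blades if p is not ga]
for ga in blades:
    ga.publish()
print(blades[0].least_loaded_blade())   # every blade sees CB-2 as least loaded
```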
[0077] In accordance with another aspect of the invention, subscribers are associated with groups (based on their Subscriber Profile, such as QoS requirements and other contracted service parameters), while groups, in turn, are associated with resource groups. This is schematically illustrated in Figure 11. In general, groups may be associated with resources on a blade basis (e.g., Group 1 is associated with Traffic Blades 1, 2, 6, and 7), or with further granularity, such as at the individual resource level (e.g., each compute node comprises an individual resource that may be allocated to a given group).
[0078] One of the purposes of the Group-to-subscriber mapping pertains to resource allocation. At a first level, the system will attempt to maintain load balancing by allocating resources such that the resources are consumed at a similar level across the infrastructure. At the same time, the subscriber-to-group allocation enables subscribers that have contracted for different levels of services to be allocated a corresponding level of (access to) resources. For example, some groups may be allocated more resources on a per-subscriber basis such that service flows corresponding to subscribers that have purchased a higher level of service will be allocated relatively more resources than service flows corresponding to lower QoS subscribers.
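A minimal sketch of the group-to-resource mapping described above follows; the group names, weights, blade identifiers, and load figures are illustrative assumptions rather than values from the specification.

```python
# Illustrative mapping: subscribers -> groups -> resource pools.
GROUPS = {
    "gold":   {"blades": ["CB-2", "CB-3"], "per_sub_weight": 4},
    "bronze": {"blades": ["CB-3", "CB-4"], "per_sub_weight": 1},
}
SUBSCRIBER_GROUP = {"sub-42": "gold", "sub-77": "bronze"}
BLADE_LOAD = {"CB-2": 0.35, "CB-3": 0.70, "CB-4": 0.55}  # fraction busy

def place_subscriber(subscriber):
    """Pick the least-loaded blade from the subscriber's group pool and
    return it together with the group's relative resource weight."""
    group = GROUPS[SUBSCRIBER_GROUP[subscriber]]
    blade = min(group["blades"], key=lambda b: BLADE_LOAD[b])
    return blade, group["per_sub_weight"]

print(place_subscriber("sub-42"))  # ('CB-2', 4): gold flows get 4x the weight
print(place_subscriber("sub-77"))  # ('CB-4', 1)
```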
[0079] Another software aspect of the system pertains to the use of a Service Management Engine (SME). The SME provides key added value over and above that of a traditional network element such as a BRAS and Edge-Router. A traditional BRAS/Edge-Router is able to manage subscribers' traffic on a per-session basis. A service node, equipped with the SME, provides visibility into subscriber sessions, and enables traffic management on a per-application level. In addition, it is able to provide customized, network-based, and subscriber-aware application services.
[0080] The SME provides these functionalities through flow classification, deep packet inspection, flow level traffic management, and application service scripting. When subscriber traffic enters a service node, it is separated into flows. The flows are classified by their application-level protocols with the aid of deep packet inspection. Some of these flows are traffic-managed according to the authorization of the subscriber to which they belong. This management typically includes policing, shaping and prioritization on a per flow basis. Other flows are bifurcated or diverted to application service scripts that implement various customized services.
[0081] As discussed above, the SME builds on the subscriber management functions of a BRAS, and extends traffic management to a per-subscriber/per-application level. It also exposes some of these functions in a generic state machine so that customized applications may be built on top of these. The service management engine software is the topmost functional layer in the system. It uses features provided by the lower layers; it guarantees a certain level of quality of service for services and applications under management; and it provides visibility into the traffic stream for the network operators.
[0082] The Service Management Engine is the runtime layer of the Services Management Software Environment. It may be divided into three major functional areas: Bandwidth Management, Service Management, and Application Scripting. The Bandwidth Management area is concerned with per-subscriber/per-service traffic management; the Service Management area is concerned with classifying flows and following protocol negotiations; and the Application Scripting area is concerned with providing the capability to script custom network-based application services.
[0083] Figure 12 shows the major components of the SME, and their interactions. Some components are shown in more detail, while external components such as subscriber management, traffic management, and the global arbitrator are not shown for clarity. Figure 12 also does not show the internal layering of the components.
[0084] The SME consists of three major abstract components: a Bandwidth Management Component (BMC) 1201, a Services Management Component (SMC) 1202, and an Application Scripting Component (ASC) 1204. The BMC 1201 is responsible for tracking per-subscriber usage statistics and traffic authorization and admission. The SMC 1202 is responsible for classification of flows, dissection of packets, and correlation of flows to services. The ASC 1204 runs sandboxes in which scripts may be executed.
[0085] The Bandwidth Management Component 1201 depends on a Subscriber Management Subsystem (SMS) (which provides subscriber information 1206), a Statistics Engine 1208, as well as a Traffic Management Engine (TME) 1210 for operation. The BMC receives correlated traffic and flow statistics on a per-subscriber and per-port/circuit basis from Statistics Engine 1208. It runs a per-subscriber state machine that keeps track of subscriber authorization, bandwidth consumption, and service utilization. It also receives service classification information from Service Management Component 1202, and computes traffic management policies on a per-flow basis. These policies are then sent to the TME 1210 for execution. To facilitate these operations, BMC 1201 includes a Bandwidth Controller 1212 and a Traffic Monitor 1214.
[0086] The Services Management Component 1202, on the other hand, supplies the protocol and service classification information to Bandwidth Management Component 1201. It receives pre-classified packet flows that are bifurcated from the ingress traffic blade; it classifies each flow by its application-level protocol; it dissects packets from flows of interest in order to obtain application-level messages; finally, it correlates flows to services, and sends the service classification and traffic specification of flows to the BMC and other interested listeners. These operations are facilitated by a Flow Classifier 1216 and a Service Classifier 1218. [0087] The Application Scripting Component 1204 implements a sandbox where "application scripts" may be executed in virtual machines. This component provides an API and an execution environment similar to what is available to the Bandwidth Management Component. In addition, Service Definition Scripts may direct specific messages to a particular application script. Application Scripts may implement custom application state machines, or security and traffic management policies. Each script has its dedicated environment.
[0088] Subscriber provisioning is handled at the subscriber management system, the detail of which is beyond the scope of this specification. In one embodiment, each subscriber is provisioned with a list of services; and each service is tagged with a service class: Best Effort, Managed, or Preferred. After subscribers are authenticated, their lists are brought to the network element as part of the authorization process. Each subscriber will be assigned to a compute-node, and the authorization will be made available to the Bandwidth Management Component residing on that compute node.
[0089] Profiles of services are provisioned at the management layer of SMC 1202. They are provisioned in the form of Service Definition Scripts. A Service Definition specifies what protocols a service uses, how a service is matched, what values are expected in the service's control protocol negotiations, the traffic profile of the data streams, and the control actions to be taken when this service is detected. These profiles are stored in a service node's persistent file system. The SMC uses these profiles to match flows to services, and to obtain their traffic profiles, which are delivered to the BMC 1201.
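For illustration, a Service Definition might carry fields along the following lines once parsed. The field layout, the SIP example, and the matching helper are assumptions; the specification states only that a definition names the protocols used, the match criteria, expected negotiation values, a traffic profile, and control actions.

```python
# Hypothetical Service Definition, shown here as a parsed Python structure
# (the real scripts are clear text; this field layout is an assumption).
SIP_VOIP_SERVICE = {
    "name": "voip-sip",
    "protocols": ["sip", "rtp"],                 # protocols the service uses
    "match": {"sip.method": "INVITE"},           # how the service is recognized
    "expect": {"sip.status": [100, 180, 200]},   # expected control negotiation values
    "traffic_profile": {"rate_kbps": 96, "class": "guaranteed"},
    "actions": [("send", "BMC")],                # default action: notify the BMC
}

def match_service(definition, app_message):
    """Return True if a dissected application-level message satisfies the
    service definition's match criteria."""
    return all(app_message.get(field) == value
               for field, value in definition["match"].items())

msg = {"protocol": "sip", "sip.method": "INVITE", "sip.uri": "sip:bob@example.com"}
if match_service(SIP_VOIP_SERVICE, msg):
    print(SIP_VOIP_SERVICE["traffic_profile"])   # handed to the BMC with the FIR
```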
[0090] Similarly, Application Scripts are provisioned at the management layer of ASC 1204. They are stored in a service node's persistent file system, and are loaded into their sandboxes at startup time.
[0091] As discussed above, the Service Node platform architecture employs a substantial level of parallelism, as provided by multiple Compute and Traffic Blades. External entities, such as subscriber provisioning and AAA (Authentication, Authorization, and Accounting), the Statistics Engine 1208, and the Traffic Management Engine 1210 are run in their own processes. The SME spreads itself along component boundaries. The Bandwidth Management Component will have its own process, as will the Service Management Component. The Application Scripting Component will have a sandbox process from which all application scripts run.
[0092] Each compute-node that is not reserved for OAMP functions will have a full set of SME processes, including one BMC process, one SMC process, and one ASC process. Each of these processes is also multithreaded as described below. A compute-node is responsible for a number of subscribers. All processing, analysis and computation done for, or on behalf of, these subscribers are conducted on this compute-node. The Global Arbitrator 1100 has the responsibility to allocate subscribers to compute-nodes when they are authenticated.
[0093] In addition to these processes that run on compute-nodes, the Service Management Component offloads some of its tasks, such as IP reassembly and preliminary classification, to a process on the host processor on Traffic Blades 600. This process may run these tasks directly, or act as a proxy for the ASIC/FPGA array. Moreover, the SME has a configuration management (CM) process that implements the management layer functions. [0094] To take advantage of the SMP nature of the compute-nodes and the host processors, the foregoing processes are multi-threaded. In addition to threads that handle various housekeeping duties, each of BMC 1201 and SMC 1202 employs several threads that drain their work queues. The BMC employs two or more "worker threads", each of which is responsible for a (disjoint) subset of subscribers that are assigned to a particular BMC instance. The IPC servicing thread of a BMC will sort messages for different subscribers into different work queues based on their subscriber identification. Similarly, the SMC employs two or more threads that process incoming packets. The division of labor between threads is defined on a per-flow basis. The specific number of worker threads per process will be decided based on processor utilization data after profiling the processes with an experimental data load. The ASC, on the other hand, employs one master thread, and at least one worker thread per application script. The ASC virtual machines have facilities to allow application scripts to spawn more threads when desired. [0095] Notwithstanding the fact that these processes are multi-threaded, their operations are driven by messages (IPC) they receive. The SMC receives pre-classified datagrams from the packet processing HAL 1220; these datagrams are packed in IPC messages with extra headers. The worker threads of SMC 1202 will run flow classifiers, packet dissectors, as well as service classifiers on these messages and, in turn, produce "application level messages" for those services identified. These messages are then delivered to BMC 1201. The BMC's worker threads are driven with these messages; they produce traffic management directives for Traffic Management Engine 1210. The ASC 1204 worker threads are driven, similarly, with messages from SMC 1202 and other parts of the system. [0096] There are four classes of provisionable information: subscriber authorization, protocol description, service definition, and application scripts. As discussed earlier, subscriber authorization is provisioned in the Subscriber Management Subsystem. This information is provisioned statically, either on the node or in an external database. It is retrieved when a subscriber authenticates; and it is made available to the rest of the system software through sharing of the embedded provisioning information database. Each subscriber's authorization data includes a list of names of services, as well as their access class, namely "guaranteed", "managed" or "best effort" in one embodiment. [0097] However, subscribers are assigned to instances of the SME dynamically. As discussed above, each compute-node that does not serve the OAMP function will have an instance of the SME with all of its components. Each instance is responsible for a number of subscribers. The list of subscribers served by a particular compute-node, or SME instance, is decided by the Global Arbitrator 1100 when subscribers authenticate. The decision is based on available compute resources on all available compute-nodes.
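The sorting of IPC messages into per-worker queues by subscriber identification, as described in paragraph [0094], can be sketched as follows; the message fields and the hash-based assignment are assumptions for illustration.

```python
import queue
import threading

NUM_WORKERS = 2  # the real per-process count would be tuned from profiling data

# One work queue per worker thread; the IPC servicing thread sorts messages
# into queues so that a given subscriber is always handled by the same worker.
work_queues = [queue.Queue() for _ in range(NUM_WORKERS)]

def ipc_dispatch(message):
    """Route an incoming IPC message to a worker based on subscriber ID."""
    worker_index = hash(message["subscriber_id"]) % NUM_WORKERS
    work_queues[worker_index].put(message)

def worker(my_queue):
    while True:
        msg = my_queue.get()
        if msg is None:          # shutdown sentinel
            break
        # Placeholder for real processing (classification, policy computation).
        print(f"processing {msg['type']} for {msg['subscriber_id']}")

threads = [threading.Thread(target=worker, args=(q,)) for q in work_queues]
for t in threads:
    t.start()
ipc_dispatch({"subscriber_id": "sub-42", "type": "flow-stats"})
ipc_dispatch({"subscriber_id": "sub-77", "type": "classification"})
for q in work_queues:
    q.put(None)
for t in threads:
    t.join()
```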
[0098] The protocol descriptions are provisioned as binary loadable modules that are loaded into SMC 1202. The SMC has descriptions of a number of protocols built-in, while descriptions of extra protocols are loaded as protocol dissector plug-in modules. When an instance of SMC 1202 is started, all protocol definitions, including the provisioned ones, are loaded automatically. These definitions may be de-provisioned by user request, provided that no other protocol or service definitions depend on them.
[0099] The service definitions are provisioned as clear text scripts. These definitions are provisioned at the management layer of SMC 1202, and saved in a persistent file system. Each SMC instance loads the complete list of provisioned service definitions when it starts. These definitions may be de-provisioned at any time.
[00100] Similarly, application scripts are provisioned at the management layer of ASC 1204 and stored in the persistent file system. The scripts are loaded into the ASC process by the master thread on each compute-node when the embedded software for that node is started. They may be de-provisioned and unloaded at any time.
[00101] In addition to the above, the SME provisions the Traffic Management Engine dynamically. The SME, specifically the BMC, computes traffic management policies on the fly based on subscribers' authorization and real-time traffic conditions. These policies are sent to the TME for enforcement. [00102] One "application" provided by the SME is bandwidth management. This is performed by assigning classes of service to flows based on their classification and the authorization of the subscriber to whom they belong. The SME relies on Subscriber Management Subsystem 1206 for subscriber authorization information, on Statistics Engine 1208 for circuit and flow level statistics, and on Traffic Management Engine 1210 for traffic management policy enforcement. [00103] As shown in Figure 13 and discussed above, a BMC employs a Traffic Monitor 1214 and a Traffic Controller 1212. The Traffic Monitor interfaces with Statistics Engine 1208 to monitor network traffic. For network interfaces that face the core side, the Traffic Monitor aggregates traffic statistics on a per-port or per-circuit basis. On subscriber-facing interfaces, however, the Traffic Monitor aggregates statistics on a per-destination or per-subscriber basis. Core-side monitoring provides information for admission control when a new service is initiated. Subscriber-side monitoring helps to determine how much bandwidth and throughput any given subscriber is using. Together with their allotted bandwidth and throughput authorization, this information is taken as an input for deciding the class of service a new flow receives. In addition, flow-based statistics on selected flows are monitored for verifying the QoS a particular flow experiences. [00104] The Traffic Controller 1212 computes traffic management policies on a per-flow basis. It runs a "traffic state tracking machine" for each subscriber under its management. It has access to subscriber authorization information, including service profiles, and provisioned bandwidth and throughput. It receives classification and traffic profile information for detected or expected flows of recognized network applications (services) from Service Management Component 1202. It then validates whether there is enough bandwidth on the subscriber's line to accommodate it.
[00105] The Traffic Controller's primary focus is to decide the class of service for a given flow and a given subscriber. If a flow is of a guaranteed service, and there is enough bandwidth, given the flow's traffic profile and the available bandwidth of the subscriber's line, then the flow will be admitted to the guaranteed class. Queues for other classes of traffic are adjusted, as necessary, to accommodate this flow. Otherwise, the flow will be added to the best-effort class. An alarm indication may be raised in this situation. Regardless of service class, policing, shaping, and priority parameters are set up for all flows to protect them from each other, and to protect service level agreements for all subscribers. [00106] The Services Management Component 1202 is responsible for identifying and monitoring services. As shown in Figures 12 and 14, the SMC receives bifurcated flows of packets from the Packet Processing HAL (Hardware Abstraction Layer) 1220 (PPHAL), classifies the flows according to the protocol they use, dissects the packets into application level messages, and matches flows to known "services". Once a service is identified, SMC 1202 sends a notification with flow identification and classification information, as well as the traffic profile, to the BMC instance on the same compute-node. This notification may include a Flow Information Record (FIR) that contains various flow statistics. [00107] Initially, no traffic (for an associated flow) is bifurcated. When a subscriber is authenticated and assigned to a compute-node, its authorization list is analyzed at the BMC instance on that node. If the subscriber subscribes to any service that is being supported, the BMC instance will contact the SMC instance on the same compute-node, which, in turn, will instruct PPHAL 1220 to bifurcate traffic belonging to this subscriber to this node. If a subscriber is not authorized to use any "service," then no bifurcation is needed. For authorized subscribers, SMC 1202 will attempt to classify as much traffic as possible, and discard packets that it cannot process. Once a classification decision is made for a particular flow, SMC 1202 will determine whether it needs to receive more packets from this flow. It is expected that only control flows need constant monitoring in the SMC. High bandwidth flows, such as media flows, only need to be classified and never looked at again.
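The class-of-service decision described in paragraphs [00104] and [00105] can be sketched as a simple admission check against the subscriber's line headroom; the rates, units, and class names below are assumptions.

```python
def decide_class(flow_profile_kbps, service_class, subscriber_line_kbps,
                 committed_kbps):
    """Decide the class of service for a new flow (a simplified sketch of the
    admission logic; names and units are assumptions)."""
    headroom = subscriber_line_kbps - committed_kbps
    if service_class == "guaranteed" and flow_profile_kbps <= headroom:
        # Admit to the guaranteed class; other queues would be adjusted to fit.
        return "guaranteed", committed_kbps + flow_profile_kbps
    # Otherwise the flow falls back to best effort (an alarm may be raised).
    return "best_effort", committed_kbps

committed = 0
for rate, svc in [(3500, "guaranteed"), (6000, "guaranteed"), (500, "managed")]:
    cls, committed = decide_class(rate, svc, subscriber_line_kbps=8000,
                                  committed_kbps=committed)
    print(rate, svc, "->", cls)   # the second flow exceeds headroom -> best effort
```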
[00108] The Services Management Component can be broken down into two parts: the Flow Classifier 1216 and the Service Classifier 1218. The Flow Classifier, as its name suggests, is responsible for classification of packet flows. It runs a set of Packet Dissection Plug-in modules (Packet Dissectors). A Pre-Classifier in PPHAL 1220 filters out flows that are not of interest to the SMC, and bifurcates to a given instance of the SMC only those flows belonging to subscribers who are assigned to that instance. The SMC is responsible for provisioning classification rules to the PPHAL so that only flows that are potentially interesting are bifurcated; the BMC, however, is responsible for notifying the SMC, which, in turn, registers with the PPHAL the subscribers that are to be monitored by a particular instance of the SMC. [00109] For every packet it encounters, the Flow Classifier performs deep packet inspection by running through its protocol dissectors 1400. Each protocol dissector assigns a percentage probability of a packet being of its associated protocol. At the end, the protocol with the highest probability wins. Some flows, however, may not be classified successfully: all dissectors may report very low probability. Accordingly, unclassified flows will be lumped into an "unclassified" pool. For management purposes, they will only be distinguishable by their flow ID. The classified flows are dissected into application level messages and passed to Service Classifier 1218 for further analysis.
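The probability-voting scheme run by the protocol dissectors can be sketched as follows; the dissector heuristics and the threshold for the "unclassified" pool are assumptions made for illustration.

```python
# Each dissector inspects a packet and reports the probability (0.0 - 1.0)
# that the packet belongs to its protocol; the highest score wins.
def sip_dissector(payload):
    return 0.95 if payload.startswith(b"INVITE sip:") else 0.02

def http_dissector(payload):
    return 0.90 if payload.startswith((b"GET ", b"POST ")) else 0.05

DISSECTORS = {"sip": sip_dissector, "http": http_dissector}
UNCLASSIFIED_THRESHOLD = 0.5   # below this, the flow stays unclassified

def classify_flow(payload):
    scores = {name: d(payload) for name, d in DISSECTORS.items()}
    best, score = max(scores.items(), key=lambda kv: kv[1])
    return best if score >= UNCLASSIFIED_THRESHOLD else "unclassified"

print(classify_flow(b"INVITE sip:alice@example.com SIP/2.0\r\n"))  # -> sip
print(classify_flow(b"\x17\x03\x03..."))                           # -> unclassified
```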
[00110] The Service Classifier correlates classified and dissected packet flows as services. Services are defined by Service Definitions 1402. A Service Definition describes how a service is recognized, its traffic profile, and what actions to take when such a service is detected. Service Classifier 1218 reads in all available service definitions at startup time, and builds an internal data structure for service classification. Additional service definitions may be loaded at runtime. When service definitions are added or removed, this internal data structure is amended dynamically.
[00111] The "actions" specified in these definitions are primitives that the SME supports. The most common one (and the default) is the send command. A service definition may request the traffic profile and the flow information record be sent to other components including, but not limited to, BMC 1201. A service definition may be used, for example, to send information to Application Scripts running in a sandbox.
[00112] The Service Management Component maintains one FIR for each flow it monitors. Protocol classification and service classification results are recorded in these Flow Information Records. FIRs are used when communicating classification information with other components.
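A Flow Information Record might be represented along the following lines; the field set shown is an assumption, since the specification states only that an FIR carries flow statistics and classification results.

```python
from dataclasses import dataclass

@dataclass
class FlowInformationRecord:
    """One FIR per monitored flow (field names are assumptions for the sketch)."""
    flow_id: int
    subscriber_id: str
    five_tuple: tuple
    protocol: str = "unclassified"    # result of protocol classification
    service: str = None               # result of service classification
    packets: int = 0
    bytes: int = 0

    def update(self, pkt_len):
        # Called as packets belonging to the flow are observed.
        self.packets += 1
        self.bytes += pkt_len

fir = FlowInformationRecord(1001, "sub-42",
                            ("10.0.0.5", "192.0.2.1", 40000, 5060, 17))
fir.update(220)
fir.protocol, fir.service = "sip", "voip-sip"
print(fir)   # the FIR would accompany the classification notification to the BMC
```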
[00113] The SMC 1202 relies on PPHAL 1220 to deliver bifurcated flows of packets to the correct instance of the software. As discussed earlier, when a subscriber is authenticated, it is assigned to a particular compute-node, and a particular instance of the SMC software. In addition, a backup instance may be specified at the same time by Global Arbitrator 1100. This assignment is communicated to PPHAL 1220, and it is the responsibility of the PPHAL to deliver bifurcated packets to the correct instance of the software. Moreover, SMC 1202 will instruct PPHAL 1220 to stop bifurcation of certain flows when enough information has been obtained from packet inspection. The main communication method for delivering bifurcated packets from PPHAL 1220 to SMC 1202 will be IPC, using the "push" model. However, the communication channel from SMC 1202 to PPHAL 1220 for control messaging will use RPC (Remote Procedure Call).
[00114] The primary client of SMC 1202 is the Bandwidth Management Component 1201. A BMC instance notifies the SMC instance on the same compute-node of the set of subscribers who are under its management. The SMC instance, in turn, registers with PPHAL 1220 to receive bifurcated packets from these subscribers. Once a flow is dissected and classified, SMC 1202 sends the classification result and traffic profile to BMC 1201 for policy decision rendering. The API from BMC to SMC comprises an RPC interface, whereas the interface from SMC to BMC will use IPC. As there is no shared data, no locking is necessary.
[00115] The Application Scripting Component 1204 is a client of SMC 1202, and is very similar to BMC 1201. The ASC receives copies of IPC events that are sent to BMC 1201. However, it does not provision SMC 1202 as does BMC 1201. [00116] Last, but not least, SMC 1202 couples with a configuration manager (CM) in the Management Layer (not shown). The configuration manager resides on a compute-node that is dedicated to OAMP functions. They communicate via the provisioning database. While the configuration manager has write access to the provisioning information database, the SMC only has read access. Multiple instances of the SMC may share read locks on the same record set in this database. [00117] The Application Scripting Component 1204 implements a sandbox where "application scripts" may be executed in virtual machines. The ASC provides an API and an execution environment similar to what is available to BMC 1201. In addition, Service Definition Scripts may direct specific messages to a particular application script. Application Scripts may implement custom application state machines or security and traffic management policies. Each script has its dedicated environment. This is where network-based applications can be hosted on the service node.
[00118] As shown in Figure 15, there are two major sub-components in the ASC 1204 subsystem: a Virtual Machine Manager (VMM) 1500, and a sandbox 1502 that supports all virtual machines 1504. VMM 1500 is responsible for setting up the sandbox, and starting all virtual machines that run application scripts. For every application script, a separate virtual machine 1504 is started. The VMM monitors the virtual machines that it starts, and restarts them if they crash. [00119] Sandbox 1502 is an execution environment for generic state machine engines (see Figure 12) (or virtual machines) which, in turn, run Application Scripts 1506. The sandbox delivers events from other parts of the system to the virtual machines; in addition, it provides guarded access to a selected API 1508 for accessing resources and runtime information available on the network element. [00120] The virtual machines comprise generic state machine engines. In respective embodiments they may be implemented as Mono or Java virtual machines with specific class libraries. In one embodiment, the sandbox is built with a Common Language Runtime (CLR) that is based on Mono with a custom-built Just-In-Time compiler to execute the Common Intermediate Language (CIL) byte code. The generic state machines may be implemented as "application domains" within the sandbox.
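The VMM's start-and-restart supervision of per-script virtual machines can be sketched with ordinary processes standing in for the CLR application domains; the script names and timing are assumptions.

```python
import multiprocessing
import time

def run_application_script(script_name):
    """Stand-in for a virtual machine executing one application script."""
    print(f"{script_name}: virtual machine started")
    time.sleep(1)   # placeholder for the script's event loop

class VirtualMachineManager:
    """Starts one VM per application script and restarts any that exit."""
    def __init__(self, scripts):
        self.scripts = scripts
        self.vms = {}

    def start_all(self):
        for name in self.scripts:
            self._start(name)

    def _start(self, name):
        proc = multiprocessing.Process(target=run_application_script, args=(name,))
        proc.start()
        self.vms[name] = proc

    def supervise_once(self):
        # Restart any VM whose process has died (e.g., the script crashed).
        for name, proc in list(self.vms.items()):
            if not proc.is_alive():
                print(f"{name}: VM exited, restarting")
                self._start(name)

if __name__ == "__main__":
    vmm = VirtualMachineManager(["parental_control.script", "fraud_watch.script"])
    vmm.start_all()
    time.sleep(2)
    vmm.supervise_once()
```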
[00121] In addition to standard class libraries, a set of proprietary class libraries is available. These class libraries provide the event mechanism as well as API 1508 to the rest of the system. Application scripts have access to flow information, statistics, as well as classification results from SMC 1202. They may be granted other access to components such as Traffic Management and Subscriber Management, depending on their intended application.
[00122] The Application Scripts 1506 are in the form of CIL packages known as "assemblies." An application service developer may use any language for which a compiler with a backend for generating CIL byte code is available. The preferred language is C#. Each Application Script 1506 is loaded into a separate application domain in the CLR. The Application Scripts have access to system information and resources similar to those available to BMC 1201; however, they are not allowed to interact with each other for security reasons.
[00123] As discussed above, the various resources that are employed for handling a given flow may be (generally) located anywhere within the service node, thus supporting various capabilities, such as full scalability and failover. However, specific resources are assigned for handling particular flows based on the subscriber and possibly other considerations, such as application (e.g., VoIP, VoD, etc.). [00124] In further detail, reference is made to Figure 16, which illustrates operations that are employed to provision a new flow. The process begins in a block 1600, wherein a subscriber attempts to obtain an IP address or initiates a PPPoE session. Typically, each ingress Traffic Blade will provide a set of one or more input ports via which input traffic is received by the service node, such that a particular flow received from a given subscriber will be received by a particular Traffic Blade. In response to receiving a packet at a block 1602, that Traffic Blade will perform a preliminary inspection to identify the subscriber or subscriber session (e.g., a given subscriber may have multiple sessions open at the same time, including sessions having different applications). Since this is a new flow, it has yet to be assigned, which will be identified by the Traffic Blade. More specifically, the Traffic Blade NPU will have access to a distributed runtime database containing flow assignments based on some filtering criteria (e.g., 5-Tuple signature), whereby the NPU can determine the assignment for a flow using a database lookup. In the case of a new flow, this will result in a miss, and the NPU will forward the processing to the control plane and send the flow to an OAMP blade to verify and/or authenticate the subscriber. For example, subscriber authentication may be performed using one of many well-known authentication schemes, such as an AAA server.
[00125] Once the subscriber is authenticated, the process moves to a block 1604, wherein the subscriber will be associated with an IP address or other Tuple, with the association being stored as a record in the runtime database. The global arbitrator then assigns the subscriber to a compute node/blade based on a combination of criteria, including the group associated with the subscriber/application and dynamic considerations, such as those discussed above (e.g., load balancing, etc.). The global arbitrator further informs the SME that the subscriber has been assigned to the compute node/blade and which Traffic Blade the subscriber has been activated on. These associations and assignments enable packets received at an ingress Traffic Blade to be sent to particular processing resources for further processing during ongoing flow operations, as follows.
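The new-flow provisioning path of blocks 1600 through 1604 can be sketched as a lookup miss escalated to the control plane; the database layout, the AAA stand-in, and the placement choice are assumptions for illustration.

```python
# Simplified control-plane path for a packet whose 5-Tuple lookup misses.
flow_assignments = {}        # distributed runtime DB: 5-Tuple -> compute resource
subscriber_records = {}      # subscriber/session -> assigned resources

def authenticate(subscriber_id):
    """Stand-in for AAA (e.g., RADIUS) verification on the OAMP blade."""
    return subscriber_id.startswith("sub-")

def handle_packet(five_tuple, subscriber_id):
    assignment = flow_assignments.get(five_tuple)
    if assignment is not None:
        return assignment                       # fast path: already provisioned
    # Miss: escalate to the control plane / OAMP blade.
    if not authenticate(subscriber_id):
        return None                             # unauthenticated traffic is not provisioned
    # Global-arbitrator style placement (here: a fixed choice for illustration).
    assignment = {"compute_blade": "CB-2", "compute_node": 3,
                  "ingress_blade": "TB-1"}
    subscriber_records[subscriber_id] = assignment
    flow_assignments[five_tuple] = assignment   # future packets hit the fast path
    return assignment

tup = ("10.0.0.5", "192.0.2.1", 40000, 80, 6)
print(handle_packet(tup, "sub-42"))   # miss -> provisioned
print(handle_packet(tup, "sub-42"))   # hit  -> same assignment
```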
[00126] In order to better understand aspects of the packet processing, an exemplary service node environment is shown in Figure 17. As discussed above with reference to Figure 10, this environment employs a fully-populated ATCA chassis including 10 Traffic Blades 600 1-10 and 4 Compute Blades 602 1-4, with Compute Blade 602 1 being provisioned to support OAMP functions. Each of the Traffic and Compute Blades are communicatively-coupled via mesh interconnect 618, with access to the interconnect being managed by the backplane fabric switch on each blade.
[00127] In general, a global arbitrator 1100 instance will run on each blade, as exemplified by OAMP blade 602 1. For simplicity and clarity, other global arbitrators are not shown. Additionally, various SME component instances will be run on the various blades. For simplicity and clarity, these SME component instances are schematically depicted as SME instances 1200 1-3. It will be understood that the actual SME component instances will include instances of SME components applicable to the respective execution host (i.e., each of the Traffic Blades 600 1-10 and Compute Blades 602 1-4 will run instances of associated SME components).
[00128] Referring now to Figures 5, 17, and 18, the ongoing flow operations begin at a block 1800 in Figure 18, wherein a packet (flow) is received at an ingress port. In further detail, an ingress packet flow F1 (depicted as packets P1 and P2) is received at an input port of an ingress Traffic Blade 600 1. Upon receiving each packet, ingress operations including primary classification are performed by the Ingress Traffic Blade's NPU, which functions as an Ingress Traffic Processor TPi of Figure 5. For example, in one embodiment a 5-Tuple Signature match is used. Other classification schemes may also be employed in a similar manner. The purpose of the classification is to associate the packets with a subscriber flow. [00129] After the primary classification is performed, subscriber information is retrieved from a runtime database (e.g., Subscriber Management DB 1206) to identify the compute resources that will be employed to process the subscriber flow. As discussed above, this information (i.e., the subscriber-to-resource mapping) is generated in block 1604 of Figure 16. In the illustrated example, the compute resource comprises Compute Node #3 on compute blade 602 2. [00130] In conjunction with the operations of block 1802, the Ingress Traffic Blade also identifies the Egress Traffic Blade to which the packet is to be sent for egress processing. In the illustrated example, this comprises Egress Traffic Blade 600 10, which also functions as Egress Traffic Processor TPe of Figure 5. [00131] Continuing at a block 1806, at this point the processing of the packet is bifurcated. This aspect involves two primary operations: copying the packet to each target processor, and performing respective sets of packet processing operations on those target processors. The copy operation is accomplished in the following manner. First, the packet is copied from its NPU buffer (e.g., local RAM) into the backplane fabric switch 814 of Ingress Traffic Blade 600 1. Along with copying the packet, information is provided to the backplane fabric switch to instruct the switch as to which target blades the packet is to be copied. In one embodiment, the backplane fabric switches 814 and 708 implement an insertion ring scheme, under which certain cross-connections exist for each transmit "cycle." A bit mask or the like is used to identify which blade(s) is/are targeted to receive a copy of the data, based on the slot address of the blade. During a given cycle, the backplane fabric switch of a given blade determines if there is an interconnect from itself to the backplane fabric switch on another blade for which a request for a data transfer is pending. If so, the data is transmitted during that cycle; otherwise the backplane fabric switch waits for the next cycle. Meanwhile, the data is held in a fabric switch buffer until all copies of the data have been transmitted.
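The copy step of the bifurcation, with a slot bit mask and a buffer retained until every targeted blade has been served, can be sketched as follows; the slot numbers and the per-cycle link-availability model are assumptions.

```python
# Sketch of the copy step of bifurcation: the ingress blade hands one buffer
# to its fabric switch together with a bit mask of target slots; the switch
# keeps the buffer until every targeted blade has received its copy.
NUM_SLOTS = 14

def slot_mask(*slots):
    """Build a bit mask with one bit per chassis slot."""
    mask = 0
    for s in slots:
        mask |= 1 << s
    return mask

class FabricSwitchModel:
    def __init__(self):
        self.pending = []            # [packet, remaining target mask]

    def enqueue(self, packet, mask):
        self.pending.append([packet, mask])

    def cycle(self, available_links):
        """One transmit cycle: send to any targeted slot whose interconnect
        is available this cycle; keep the buffer until the mask is empty."""
        for entry in self.pending:
            packet, mask = entry
            for slot in range(NUM_SLOTS):
                if mask & (1 << slot) and slot in available_links:
                    print(f"copy of {packet} sent to slot {slot}")
                    mask &= ~(1 << slot)
            entry[1] = mask
        self.pending = [e for e in self.pending if e[1]]   # drop fully-sent buffers

switch = FabricSwitchModel()
# Bifurcate: target the egress traffic blade (slot 13) and a compute blade (slot 2).
switch.enqueue("pkt-1", slot_mask(2, 13))
switch.cycle(available_links={13})   # only the egress link is free this cycle
switch.cycle(available_links={2})    # compute-blade copy goes out next cycle
```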
[00132] This scheme provides several advantages over conventional copy schemes. Under a typical scheme, a processor or the like is employed for each copy. In addition, an interconnect path may not be immediately available, especially under a priority-based fabric switch scheme. Each of these may consume additional processor resources, which in turn may introduce jitter and/or delay. In contrast, under the insertion ring discussed herein, there is no jitter or delay introduced to the packet processing.
[00133] As discussed above with reference to Figure 5, the flow to the Egress Traffic Blade (the egress flow) is termed Fe, while the bifurcated flow to the Compute Blade is termed Fh. For packets corresponding to egress flow Fe, egress packet processing operations are performed in a block 1808. This typically involves assigning the packet to an associated flow queue and appending the packet to that flow queue. Other ongoing traffic management and shaping processes may be employed for dispatching the flow queues for transmission to a next hop in the network, such as employed by round robin and priority-based dispatch schemes. [00134] In connection with these operations, the flow queue assignment is based, in part, on current traffic profiles and subscriber flow attributes such as flow application. For example, in one embodiment flow queues are dispatched using a priority-based scheme, with flows associated with higher QoS, for example, receiving higher priority. In addition, the availability of queue dispatch (and corresponding queue assignments) may be managed in view of current traffic profiles. Accordingly, the assignment of a subscriber flow packet into a dispatch queue is a dynamic consideration that may change over time in view of changes in traffic profiles and the like.
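The priority-based dispatch of per-flow egress queues mentioned above can be sketched as follows; the priority levels and their mapping to service classes are assumptions.

```python
from collections import deque

class EgressScheduler:
    """Priority-based dispatch of per-flow queues (a sketch only)."""
    def __init__(self):
        self.flow_queues = {}     # flow_id -> (priority, deque of packets)

    def assign_flow(self, flow_id, priority):
        self.flow_queues[flow_id] = (priority, deque())

    def enqueue(self, flow_id, packet):
        self.flow_queues[flow_id][1].append(packet)

    def dispatch(self):
        """Send one packet from the highest-priority non-empty queue."""
        candidates = [(prio, fid) for fid, (prio, q) in self.flow_queues.items() if q]
        if not candidates:
            return None
        _, fid = min(candidates)              # lower number = higher priority
        return self.flow_queues[fid][1].popleft()

sched = EgressScheduler()
sched.assign_flow("voip-flow", priority=0)    # higher-QoS class
sched.assign_flow("bulk-flow", priority=2)    # best effort
sched.enqueue("bulk-flow", "B1")
sched.enqueue("voip-flow", "V1")
print(sched.dispatch())   # V1 goes out first despite arriving second
print(sched.dispatch())   # then B1
```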
[00135] In parallel with the egress traffic operations of block 1808, the Analysis Engine of Figure 5 is employed to perform analysis of the traffic flow in a block 1810. As applied to the execution environment of Figure 17, the Analysis Engine functions are effected via corresponding SME component instances in the manner described above and as represented by SME instance 1200 2. [00136] Continuing at a block 1812, the Analysis Engine generates traffic profile information that is provided to each of the Ingress and Egress Traffic Blades to update their traffic profile information. In view of the SME architecture, this result can be effected by updating a local instance of a distributed Traffic Profile (TP) database 1800 with the new traffic profile information. Using a distributed database update mechanism, the updated traffic profile information is propagated to each local instance of the Traffic Profile database, thereby effectively providing the updated traffic profile information to Egress Traffic Blade 600 10, as depicted by the update Traffic Profiles operation of block 1814.
[00137] With respect to feedback to the Ingress Traffic Blade, the traffic analysis information generated in block 1810 may be used to adjust policing operations performed on the ingress flow, as shown in a block 1816. This provides a means by which an Ingress Traffic Blade can dynamically adjust its ingress processing operations for a given flow in view of real-time traffic analysis feedback derived from that flow.
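The feedback loop from traffic analysis to ingress policing can be sketched as a rate adjustment on a per-flow policer; the profile fields, thresholds, and headroom factor are assumptions.

```python
class IngressPolicer:
    """Per-flow ingress policer whose rate is adjusted from analysis feedback
    (rates and thresholds are illustrative assumptions)."""
    def __init__(self, rate_kbps, burst_kbits):
        self.rate_kbps = rate_kbps
        self.burst_kbits = burst_kbits

    def apply_feedback(self, traffic_profile):
        # Tighten the ingress rate if the analysis engine reports that the flow
        # is exceeding its contracted profile; relax it slightly otherwise.
        if traffic_profile["observed_kbps"] > traffic_profile["contracted_kbps"]:
            self.rate_kbps = traffic_profile["contracted_kbps"]
        else:
            self.rate_kbps = traffic_profile["contracted_kbps"] * 1.1  # small headroom

policer = IngressPolicer(rate_kbps=2000, burst_kbits=256)
policer.apply_feedback({"observed_kbps": 2400, "contracted_kbps": 2000})
print(policer.rate_kbps)   # ingress policing tightened to the contracted rate
```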
[00138] As discussed above, various operations performed by the service node are implemented via execution of software (e.g., machine instructions and/or virtual machine code) on processing elements. Thus, embodiments of this invention may be used as or to support software embodied as programs, modules, libraries, etc., executed upon some form of processing core or otherwise implemented or realized upon or within a machine-readable medium. A machine-readable medium includes any mechanism for storing or transmitting information in a form readable by a machine (e.g., a computer). For example, a machine-readable medium may include a read only memory (ROM); a random access memory (RAM); a magnetic disk storage media; an optical storage media; and a flash memory device, etc. In addition, a machine-readable medium can include propagated signals such as electrical, optical, acoustical or other form of propagated signals (e.g., carrier waves, infrared signals, digital signals, etc.).
[00139] The above description of illustrated embodiments of the invention, including what is described in the Abstract, is not intended to be exhaustive or to limit the invention to the precise forms disclosed. While specific embodiments of, and examples for, the invention are described herein for illustrative purposes, various equivalent modifications are possible within the scope of the invention, as those skilled in the relevant art will recognize. [00140] These modifications can be made to the invention in light of the above detailed description. The terms used in the following claims should not be construed to limit the invention to the specific embodiments disclosed in the specification and the drawings. Rather, the scope of the invention is to be determined entirely by the following claims, which are to be construed in accordance with established doctrines of claim interpretation.

Claims

What is claimed is:
1. A method, comprising: receiving packets comprising an ingress traffic flow at a network element; bifurcating processing of the packets to an egress traffic processor and a computation processor; performing egress traffic flow operations at the egress traffic processor, the egress traffic flow operations assigning the packets to an egress traffic flow; and performing traffic analysis operations at the computation processor concurrently with the egress traffic flow operations, the traffic analysis operations generating traffic analysis data corresponding to the ingress traffic flow.
2. The method of claim 1, wherein the operation of bifurcating processing of the packets introduces no additional latency into the processing of the ingress or egress traffic flows.
3. The method of claim 1, further comprising: receiving packets corresponding to multiple ingress traffic flows at the network element; and assigning packets associated with multiple ingress traffic flows to egress traffic flows in view of traffic profiles generated from the traffic analysis data for the multiple ingress traffic flows.
4. The method of claim 1, further comprising: maintaining flow state information for the ingress traffic flow; and altering traffic analysis operations in view of the flow state information.
5. The method of claim 1, further comprising: dynamically adjusting an egress traffic profile applied to the egress traffic flow in view of the ingress traffic flow traffic analysis data that is generated.
6. The method of claim 1, wherein the network element comprises a modular chassis including a plurality of traffic blades and compute blades, the method further comprising: receiving the packets at an ingress traffic blade; performing primary classification of the packets to identify a subscriber flow associated with the packets; and based on the subscriber flow, identifying an egress traffic blade to be employed for the egress traffic flow operations and sending a first copy of the packets to that egress traffic blade; and identifying a computation blade to be employed for the traffic analysis operations and sending a second copy of the packets to that compute blade.
7. The method of claim 6, further comprising: detecting initiation of a subscriber session; storing data associating traffic blade and computation resources with a subscriber flow corresponding to the subscriber session.
8. The method of claim 7, wherein a compute blade includes multiple compute nodes, and associating computation resources with the subscriber flow identifies a compute node on a compute blade to be employed for performing traffic analysis operations.
9. The method of claim 6, wherein the copying the first and second packets to the egress traffic blade and compute blade comprises: buffering packets received at the ingress traffic blade in a backplane fabric switch of the ingress traffic blade; transmitting the first copy of the packets across a mesh interconnect to a backplane fabric switch of the egress traffic blade; and transmitting the second copy of the packets across the mesh interconnect to a backplane fabric switch of the compute blade.
10. The method of claim 9, wherein the backplane fabric switches employ an insertion ring transfer scheme to transfer a copy of a packet buffered in the backplane fabric switch of the ingress traffic blades to target egress traffic and compute blades.
11. The method of claim 6, wherein the primary classification operation comprises performing a 5-Tuple classification.
12. The method of claim 1, further comprising: providing the generated traffic flow analysis data to the ingress processor; and adjusting policing performed on the ingress traffic flow in view of the traffic flow analysis data.
13. The method of claim 1, further comprising: performing classification of the packets to classify the ingress traffic flow as a particular application traffic type.
14. The method of claim 13, further comprising: employing deep packet inspection to classify the ingress traffic flow.
15. An apparatus, comprising: a plurality of ingress traffic processors; a plurality of egress traffic processors, communicatively coupled to the ingress traffic processors; a plurality of computation processors, communicatively coupled to the ingress and egress traffic processors; and software components distributed across the plurality of ingress traffic processors, egress traffic processors and computation processors, the software to execute on the plurality of ingress traffic processors, egress traffic processors and computation processors to perform operations including, performing ingress processing operations on packets comprising an ingress traffic flow received at the apparatus; bifurcating processing of the packets to an egress traffic processor and a computation processor; performing egress traffic flow operations at the egress traffic processor, the egress traffic flow operations assigning the packets to an egress traffic flow; and performing traffic analysis operations at the computation processor concurrently with the egress traffic flow operations, the traffic analysis operations generating traffic profile data corresponding to the ingress traffic flow.
16. The apparatus of claim 15, wherein execution of the software performs further operations comprising: performing ingress processing operations on packets corresponding to multiple ingress traffic flows received at the apparatus; maintaining flow state information for each of the multiple ingress traffic flows; and altering traffic analysis operations in view of the flow state information.
17. The apparatus of claim 15, wherein execution of the software performs further operations comprising: dynamically adjusting an egress traffic profile applied to the egress traffic flow in view of the ingress traffic flow traffic profile data that is generated.
18. The apparatus of claim 15, wherein execution of the software performs further operations comprising: providing the generated traffic flow profile data to the ingress processor; and adjusting policing performed on the ingress traffic flow in view of the traffic flow profile data.
19. The apparatus of claim 15, wherein execution of the software performs further operations comprising: performing classification of the packets to classify the ingress traffic flow as a particular application traffic type.
20. The apparatus of claim 15, wherein execution of the software performs further operations comprising: associating the ingress traffic flow with a subscriber flow; identifying processing resources allocated to the subscriber flow, the processing resources including a target egress traffic processor and a target computation processor; and bifurcating processing of the packets by transmitting a copy of each packet to each of the target egress traffic processor and the target computation processor.
21. An apparatus, comprising: a chassis having a plurality of slots and including a backplane providing a mesh interconnect between the slots; a plurality of traffic blades, each installed in the chassis in a respective slot and including a backplane interface coupled to the backplane, the plurality of traffic blades including ingress traffic blades and egress traffic blades; a plurality of compute blades, each installed in the chassis in a respective slot and including a backplane interface coupled to the backplane; and software components distributed across the plurality of traffic blades and compute blades, the software components to execute on processing elements hosted by the traffic blades and compute blades to perform operations including, performing ingress processing for packets received at an ingress traffic blade, the ingress processing including performing primary classification of the packets to identify a subscriber flow associated with the packets; based on the subscriber flow, identifying an egress traffic blade to be employed for the egress traffic flow operations and sending a first copy of the packets to that egress traffic blade; and identifying a computation blade to be employed for the traffic analysis operations and sending a second copy of the packets to that compute blade, performing egress traffic operations for the subscriber flow on the egress traffic blade using the first copy of packets; and concurrently performing traffic analysis operations on the computation blade using the second copy of packets.
22. The apparatus of claim 21, wherein a compute blade includes a plurality of computing elements organized as compute nodes, and execution of the software components perform further operations comprising: identifying a compute node to be employed for traffic analysis operations; sending a copy of the packets to that compute node; and performing the traffic analysis operations, at least in part, on that compute node.
23. The apparatus of claim 21, wherein copying the first and second packets to the egress traffic blade and compute blade comprises: buffering packets received at the ingress traffic blade in a backplane fabric switch of the ingress traffic blade; transmitting the first copy of the packets across the mesh interconnect to a backplane fabric switch of the egress traffic blade; and transmitting the second copy of the packets across the mesh interconnect to a backplane fabric switch of the compute blade.
24. The apparatus of claim 23, wherein the backplane fabric switches employ an insertion ring transfer scheme to transfer a copy of a packet buffered in the backplane fabric switch of the ingress traffic blades to target egress traffic and compute blades.
25. The apparatus of claim 21, wherein execution of the software performs further operations comprising: generating traffic flow profile data via the traffic analysis operations; providing the traffic flow profile data to the ingress traffic blade; and adjusting policing performed on the subscriber flow in view of the traffic flow profile data.
26. The apparatus of claim 21, wherein execution of the software performs further operations comprising: effecting a distributed traffic analysis database in which traffic analysis data is stored, the distributed traffic analysis database including local instances of the database hosted on respective traffic and compute blades; updating a local instance of the traffic analysis database with traffic analysis data generated by its host blade; and propagating the update to other local instances of the traffic analysis database hosted by other blades.
27. The apparatus of claim 21, wherein execution of the software performs further operations comprising: performing classification of the packets to classify the subscriber flow as a particular application traffic type; and managing egress flow operations corresponding to the subscriber flow based on its application traffic type.
28. The apparatus of claim 27, wherein an ingress traffic blade includes a network processor unit (NPU) and a host processor, and wherein classification of packets into particular application traffic types is performed by: employing the NPU to perform a first level classification using the NPU; and employing the host processor to perform at least one additional level of classification including deep packet inspection.
29. The apparatus of claim 21, wherein the chassis comprises an Advanced Telecommunication and Computing Architecture (ATCA) chassis.
PCT/CA2006/001423 2005-09-12 2006-08-30 Packet flow bifurcation and analysis WO2007030917A1 (en)

Priority Applications (2)

Application Number Priority Date Filing Date Title
CA2620349A CA2620349C (en) 2005-09-12 2006-08-30 Packet flow bifurcation and analysis
EP06790602A EP1932312A4 (en) 2005-09-12 2006-08-30 Packet flow bifurcation and analysis

Applications Claiming Priority (2)

Application Number Priority Date Filing Date Title
US11/224,201 US7508764B2 (en) 2005-09-12 2005-09-12 Packet flow bifurcation and analysis
US11/224,201 2005-09-12

Publications (2)

Publication Number Publication Date
WO2007030917A1 (en) 2007-03-22
WO2007030917A8 (en) 2007-08-09

Family

ID=37855020

Family Applications (1)

Application Number Title Priority Date Filing Date
PCT/CA2006/001423 WO2007030917A1 (en) 2005-09-12 2006-08-30 Packet flow bifurcation and analysis

Country Status (4)

Country Link
US (1) US7508764B2 (en)
EP (1) EP1932312A4 (en)
CA (1) CA2620349C (en)
WO (1) WO2007030917A1 (en)

Families Citing this family (110)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US20100195538A1 (en) * 2009-02-04 2010-08-05 Merkey Jeffrey V Method and apparatus for network packet capture distributed storage system
EP1722508B1 (en) * 2005-05-13 2008-01-02 Qosmos Distributed traffic analysis
US8028337B1 (en) 2005-08-30 2011-09-27 Sprint Communications Company L.P. Profile-aware filtering of network traffic
US8204974B1 (en) * 2005-08-30 2012-06-19 Sprint Communications Company L.P. Identifying significant behaviors within network traffic
US7719995B2 (en) * 2005-09-09 2010-05-18 Zeugma Systems Inc. Application driven fast unicast flow replication
US7929532B2 (en) * 2005-11-30 2011-04-19 Cortina Systems, Inc. Selective multicast traffic shaping
EP1793537B1 (en) * 2005-12-02 2009-03-25 Alcatel Lucent Network node with modular multistage packet classification
US7724754B2 (en) * 2006-02-24 2010-05-25 Texas Instruments Incorporated Device, system and/or method for managing packet congestion in a packet switching network
US8892737B2 (en) * 2006-03-06 2014-11-18 Vmware, Inc. Network sniffer for performing service level management
US7693996B2 (en) * 2006-03-06 2010-04-06 Vmware, Inc. Service level management system
US7693985B2 (en) * 2006-06-09 2010-04-06 Cisco Technology, Inc. Technique for dispatching data packets to service control engines
US7895331B1 (en) 2006-08-10 2011-02-22 Bivio Networks, Inc. Method for dynamically configuring network services
US8005101B1 (en) 2006-08-10 2011-08-23 Bivio Networks, Inc. Scalable architecture for deep-packet processing
US8135685B2 (en) * 2006-09-18 2012-03-13 Emc Corporation Information classification
US8612570B1 (en) 2006-09-18 2013-12-17 Emc Corporation Data classification and management using tap network architecture
US7761485B2 (en) 2006-10-25 2010-07-20 Zeugma Systems Inc. Distributed database
US7930703B2 (en) * 2006-11-03 2011-04-19 At&T Intellectual Property I, L.P. System and method for providing access to multimedia content via a serial connection
FR2908575B1 (en) * 2006-11-09 2009-03-20 At & T Corp METHOD AND APPARATUS FOR PROVIDING LOAD BASED LOAD BALANCING
US7653057B1 (en) * 2006-11-30 2010-01-26 World Wide Packets, Inc. Preventing forwarding of a packet to a control plane
US8073384B2 (en) * 2006-12-14 2011-12-06 Elster Electricity, Llc Optimization of redundancy and throughput in an automated meter data collection system using a wireless network
CA2615584A1 (en) * 2006-12-20 2008-06-20 David Brown Bifurcate space switch
JP4224508B2 (en) * 2006-12-26 2009-02-18 株式会社日立製作所 Traffic information aggregator
US8745179B2 (en) * 2007-01-31 2014-06-03 Cisco Technology, Inc. Managing multiple application configuration versions in a heterogeneous network
US7729278B2 (en) * 2007-02-14 2010-06-01 Tropos Networks, Inc. Wireless routing based on data packet classifications
US20080298230A1 (en) * 2007-05-30 2008-12-04 Luft Siegfried J Scheduling of workloads in a distributed compute environment
IL183904A (en) * 2007-06-13 2014-06-30 Vladimir Yanover Method and device for bandwitdth allocation in a wireless communication system
US7843914B2 (en) * 2007-06-29 2010-11-30 Alcatel-Lucent Network system having an extensible forwarding plane
US20090003375A1 (en) * 2007-06-29 2009-01-01 Martin Havemann Network system having an extensible control plane
US8000329B2 (en) * 2007-06-29 2011-08-16 Alcatel Lucent Open platform architecture for integrating multiple heterogeneous network functions
KR100948604B1 (en) * 2008-03-25 2010-03-24 한국전자통신연구원 Security method of mobile internet protocol based server
US9323901B1 (en) 2007-09-28 2016-04-26 Emc Corporation Data classification for digital rights management
US9461890B1 (en) 2007-09-28 2016-10-04 Emc Corporation Delegation of data management policy in an information management system
US8522248B1 (en) 2007-09-28 2013-08-27 Emc Corporation Monitoring delegated operations in information management systems
US8868720B1 (en) 2007-09-28 2014-10-21 Emc Corporation Delegation of discovery functions in information management system
US9141658B1 (en) 2007-09-28 2015-09-22 Emc Corporation Data classification and management for risk mitigation
US8548964B1 (en) 2007-09-28 2013-10-01 Emc Corporation Delegation of data classification using common language
US7826455B2 (en) * 2007-11-02 2010-11-02 Cisco Technology, Inc. Providing single point-of-presence across multiple processors
US9331919B2 (en) * 2007-11-30 2016-05-03 Solarwinds Worldwide, Llc Method for summarizing flow information of network devices
US8179799B2 (en) * 2007-11-30 2012-05-15 Solarwinds Worldwide, Llc Method for partitioning network flows based on their time information
US8601113B2 (en) * 2007-11-30 2013-12-03 Solarwinds Worldwide, Llc Method for summarizing flow information from network devices
WO2009082761A1 (en) 2007-12-26 2009-07-02 Elster Electricity, Llc. Optimized data collection in a wireless fixed network metering system
US8165024B2 (en) * 2008-04-03 2012-04-24 Alcatel Lucent Use of DPI to extract and forward application characteristics
US8625642B2 (en) 2008-05-23 2014-01-07 Solera Networks, Inc. Method and apparatus of network artifact indentification and extraction
US20090292736A1 (en) * 2008-05-23 2009-11-26 Matthew Scott Wood On demand network activity reporting through a dynamic file system and method
US8521732B2 (en) 2008-05-23 2013-08-27 Solera Networks, Inc. Presentation of an extracted artifact based on an indexing technique
US8004998B2 (en) * 2008-05-23 2011-08-23 Solera Networks, Inc. Capture and regeneration of a network data using a virtual software switch
US8031606B2 (en) 2008-06-24 2011-10-04 Intel Corporation Packet switching
US8179846B2 (en) * 2008-09-08 2012-05-15 Alcatel Lucent DPI-driven bearer termination for short-lived applications
US8300532B1 (en) * 2008-09-23 2012-10-30 Juniper Networks, Inc. Forwarding plane configuration for separation of services and forwarding in an integrated services router
US8204958B2 (en) * 2009-01-12 2012-06-19 Network Instruments, Llc Apparatus and methods for network analysis
US8051167B2 (en) * 2009-02-13 2011-11-01 Alcatel Lucent Optimized mirror for content identification
KR20120004993A (en) * 2009-03-18 2012-01-13 록스타 비드코 엘피 Methods and systems for providing a logical network layer for delivery of input/output data
CN101997826A (en) * 2009-08-28 2011-03-30 中兴通讯股份有限公司 Routing methods of control net element, forwarding net element and internet protocol network
US8832222B2 (en) * 2009-10-05 2014-09-09 Vss Monitoring, Inc. Method, apparatus and system for inserting a VLAN tag into a captured data packet
US20110125748A1 (en) * 2009-11-15 2011-05-26 Solera Networks, Inc. Method and Apparatus for Real Time Identification and Recording of Artifacts
WO2011060368A1 (en) * 2009-11-15 2011-05-19 Solera Networks, Inc. Method and apparatus for storing and indexing high-speed network traffic data
US8532129B2 (en) * 2009-12-30 2013-09-10 International Business Machines Corporation Assigning work from multiple sources to multiple sinks given assignment constraints
US8391305B2 (en) * 2009-12-30 2013-03-05 International Business Machines Corporation Assignment constraint matrix for assigning work from multiple sources to multiple sinks
US8295305B2 (en) * 2009-12-30 2012-10-23 International Business Machines Corporation Dual scheduling of work from multiple sources to multiple sinks using source and sink attributes to achieve fairness and processing efficiency
US9413649B2 (en) * 2010-03-12 2016-08-09 Force10 Networks, Inc. Virtual network device architecture
US20110310736A1 (en) * 2010-06-16 2011-12-22 Alcatel-Lucent Usa Inc. Method And System For Handling Traffic In A Data Communication Network
CN102420741B (en) * 2010-09-28 2016-01-20 朗讯科技投资有限公司 The method of dispatching communication flow and device in based on the equipment of ATCA
US8849991B2 (en) 2010-12-15 2014-09-30 Blue Coat Systems, Inc. System and method for hypertext transfer protocol layered reconstruction
US8699344B2 (en) 2010-12-15 2014-04-15 At&T Intellectual Property I, L.P. Method and apparatus for managing a degree of parallelism of streams
US8776207B2 (en) * 2011-02-16 2014-07-08 Fortinet, Inc. Load balancing in a network with session information
US8666985B2 (en) 2011-03-16 2014-03-04 Solera Networks, Inc. Hardware accelerated application-based pattern matching for real time classification and recording of network traffic
US9001828B2 (en) * 2011-03-21 2015-04-07 Marvell World Trade Ltd. Method and apparatus for pre-classifying packets
WO2012130311A1 (en) * 2011-03-31 2012-10-04 Telefonaktiebolaget L M Ericsson (Publ) Methods and apparatus for determining a language
CN102318291B (en) * 2011-07-14 2014-02-19 华为技术有限公司 Business flow processing method, device and system
US8842664B2 (en) * 2011-09-27 2014-09-23 Znyx Networks, Inc. Chassis management modules for advanced telecom computing architecture shelves, and methods for using the same
US20130086279A1 (en) * 2011-09-29 2013-04-04 Avvasi Inc. Systems and methods for media service delivery
JP5822125B2 (en) * 2011-11-09 2015-11-24 日本電気株式会社 Service cooperation apparatus, service cooperation method, and service cooperation program
WO2013072773A2 (en) * 2011-11-18 2013-05-23 Marvell World Trade Ltd. Data path acceleration using hw virtualization
US9246823B1 (en) * 2011-12-22 2016-01-26 Marvell Israel (M.I.S.L.) Ltd. Remote policing in a chassis switch
US9559948B2 (en) * 2012-02-29 2017-01-31 Dell Products, Lp System and method for managing unknown flows in a flow-based switching device
US9288159B2 (en) 2012-08-03 2016-03-15 Marvell World Trade Ltd. Systems and methods for deep packet inspection with a virtual machine
KR102020046B1 (en) * 2012-12-06 2019-09-10 한국전자통신연구원 Apparatus and Method for managing flow in server virtualization environment, Method for applying QoS
CN103250382B (en) * 2012-12-28 2017-04-26 华为技术有限公司 Distribution method, apparatus and system
KR101424503B1 (en) * 2013-05-08 2014-08-04 (주)엔텔스 Apparatus and mehtod for controlling traffic
US9461967B2 (en) * 2013-07-18 2016-10-04 Palo Alto Networks, Inc. Packet classification for network routing
WO2015117636A1 (en) * 2014-02-04 2015-08-13 Nokia Solutions And Networks Oy Service scaling in communications
US9906452B1 (en) * 2014-05-29 2018-02-27 F5 Networks, Inc. Assisting application classification using predicted subscriber behavior
US20170140551A1 (en) * 2014-06-30 2017-05-18 Universität Bern Method for segmenting and predicting tissue regions in patients with acute cerebral ischemia
US9806885B1 (en) * 2014-09-26 2017-10-31 Rockwell Collins, Inc. Dual use cryptographic system and method
WO2017100394A1 (en) * 2015-12-11 2017-06-15 Idac Holdings, Inc. Methods and apparatus for common transport of backhaul and fronthaul traffic
US10432650B2 (en) 2016-03-31 2019-10-01 Stuart Staniford System and method to protect a webserver against application exploits and attacks
US10015086B2 (en) * 2016-04-29 2018-07-03 Intuit Inc. Multi GTM based routing to avoid latencies
US10498612B2 (en) * 2016-09-27 2019-12-03 Mellanox Technologies Tlv Ltd. Multi-stage selective mirroring
US10574546B2 (en) 2016-09-27 2020-02-25 Mellanox Technologies Tlv Ltd. Network monitoring using selective mirroring
US10893440B2 (en) * 2016-11-04 2021-01-12 Huawei Technologies Co., Ltd. Network hotspot control method and related device
US10291497B2 (en) * 2017-03-31 2019-05-14 Juniper Networks, Inc. Session-based traffic statistics logging for virtual routers
US11296960B2 (en) 2018-03-08 2022-04-05 Nicira, Inc. Monitoring distributed applications
CN110580256B (en) * 2018-05-22 2022-06-10 华为技术有限公司 Method, device and system for identifying application identification
US10798006B2 (en) * 2018-10-12 2020-10-06 Akamai Technologies, Inc. Overload protection for data sinks in a distributed computing system
US11140090B2 (en) * 2019-07-23 2021-10-05 Vmware, Inc. Analyzing flow group attributes using configuration tags
US11340931B2 (en) 2019-07-23 2022-05-24 Vmware, Inc. Recommendation generation based on selection of selectable elements of visual representation
US11176157B2 (en) 2019-07-23 2021-11-16 Vmware, Inc. Using keys to aggregate flows at appliance
US11743135B2 (en) 2019-07-23 2023-08-29 Vmware, Inc. Presenting data regarding grouped flows
US11398987B2 (en) 2019-07-23 2022-07-26 Vmware, Inc. Host-based flow aggregation
US11349876B2 (en) 2019-07-23 2022-05-31 Vmware, Inc. Security policy recommendation generation
US11288256B2 (en) 2019-07-23 2022-03-29 Vmware, Inc. Dynamically providing keys to host for flow aggregation
US11188570B2 (en) 2019-07-23 2021-11-30 Vmware, Inc. Using keys to aggregate flow attributes at host
US11436075B2 (en) 2019-07-23 2022-09-06 Vmware, Inc. Offloading anomaly detection from server to host
US11321213B2 (en) 2020-01-16 2022-05-03 Vmware, Inc. Correlation key used to correlate flow and context data
US11296958B2 (en) * 2020-04-24 2022-04-05 Toyo Corporation Packet capture device and packet capture method
US11799779B1 (en) 2020-10-28 2023-10-24 Juniper Networks, Inc. Session-based packet capture
US11785032B2 (en) 2021-01-22 2023-10-10 Vmware, Inc. Security threat detection based on network flow analysis
US11831667B2 (en) 2021-07-09 2023-11-28 Vmware, Inc. Identification of time-ordered sets of connections to identify threats to a datacenter
US11792151B2 (en) 2021-10-21 2023-10-17 Vmware, Inc. Detection of threats based on responses to name resolution requests
CN114820961B (en) * 2022-04-20 2022-11-29 北京金石视觉数字科技有限公司 Immersive digital visual display method and system

Family Cites Families (36)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US4720850A (en) * 1986-03-14 1988-01-19 American Telephone And Telegraph Company At&T Bell Laboratories Communication system control arrangement
US6108338A (en) * 1995-12-28 2000-08-22 Dynarc Inc. Method and device for dynamic synchronous transfer mode in a dual ring topology
US5881050A (en) * 1996-07-23 1999-03-09 International Business Machines Corporation Method and system for non-disruptively assigning link bandwidth to a user in a high speed digital network
US6608832B2 (en) * 1997-09-25 2003-08-19 Telefonaktiebolaget Lm Ericsson Common access between a mobile communications network and an external network with selectable packet-switched and circuit-switched services
US6779030B1 (en) * 1997-10-06 2004-08-17 Worldcom, Inc. Intelligent network
US6718387B1 (en) 1997-12-10 2004-04-06 Sun Microsystems, Inc. Reallocating address spaces of a plurality of servers using a load balancing policy and a multicast channel
US6452915B1 (en) * 1998-07-10 2002-09-17 Malibu Networks, Inc. IP-flow classification in a wireless point to multi-point (PTMP) transmission system
US6587470B1 (en) * 1999-03-22 2003-07-01 Cisco Technology, Inc. Flexible cross-connect with data plane
US6789116B1 (en) * 1999-06-30 2004-09-07 Hi/Fn, Inc. State processor for pattern matching in a network monitor device
US6985431B1 (en) * 1999-08-27 2006-01-10 International Business Machines Corporation Network switch and components and method of operation
US6873600B1 (en) * 2000-02-04 2005-03-29 At&T Corp. Consistent sampling for network traffic measurement
US6678281B1 (en) * 2000-03-08 2004-01-13 Lucent Technologies Inc. Hardware configuration, support node and method for implementing general packet radio services over GSM
US6948003B1 (en) * 2000-03-15 2005-09-20 Ensim Corporation Enabling a service provider to provide intranet services
US7725596B2 (en) * 2000-04-28 2010-05-25 Adara Networks, Inc. System and method for resolving network layer anycast addresses to network layer unicast addresses
US6621793B2 (en) * 2000-05-22 2003-09-16 Telefonaktiebolaget Lm Ericsson (Publ) Application influenced policy
WO2002019634A1 (en) * 2000-08-31 2002-03-07 Netrake Corporation Method for enforcing service level agreements
AU2002216279A1 (en) * 2000-12-27 2002-07-08 Cellglide Technologies Corp. Resource allocation in cellular telephone networks
US6914883B2 (en) * 2000-12-28 2005-07-05 Alcatel QoS monitoring system and method for a high-speed DiffServ-capable network element
US20020116521A1 (en) * 2001-02-22 2002-08-22 Denis Paul Soft multi-contract rate policing
US20020181462A1 (en) * 2001-04-24 2002-12-05 Sorin Surdila System and method for providing end-to-end quality of service (QoS) across multiple internet protocol (IP) networks
US7002977B1 (en) * 2001-06-29 2006-02-21 Luminous Networks, Inc. Policy based accounting and billing for network services
US6961539B2 (en) * 2001-08-09 2005-11-01 Hughes Electronics Corporation Low latency handling of transmission control protocol messages in a broadband satellite communications system
US7453801B2 (en) * 2001-11-08 2008-11-18 Qualcomm Incorporated Admission control and resource allocation in a communication system supporting application flows having quality of service requirements
US6661780B2 (en) * 2001-12-07 2003-12-09 Nokia Corporation Mechanisms for policy based UMTS QoS and IP QoS management in mobile IP networks
US20030206549A1 (en) 2002-05-03 2003-11-06 Mody Sachin Satish Method and apparatus for multicast delivery of information
CA2388792A1 (en) * 2002-05-31 2003-11-30 Catena Networks Canada Inc. An improved system and method for transporting multiple services over a backplane
US6741595B2 (en) * 2002-06-11 2004-05-25 Netrake Corporation Device for enabling trap and trace of internet protocol communications
US7251215B1 (en) * 2002-08-26 2007-07-31 Juniper Networks, Inc. Adaptive network router
WO2005017707A2 (en) 2003-08-14 2005-02-24 Telcordia Technologies, Inc. Auto-ip traffic optimization in mobile telecommunications systems
US7173817B2 (en) * 2003-09-29 2007-02-06 Intel Corporation Front side hot-swap chassis management module
US7496661B1 (en) * 2004-03-29 2009-02-24 Packeteer, Inc. Adaptive, application-aware selection of differentiated network services
US20060028982A1 (en) * 2004-08-06 2006-02-09 Wright Steven A Methods, systems, and computer program products for managing admission control in a regional/access network based on implicit protocol detection
US7639674B2 (en) * 2004-10-25 2009-12-29 Alcatel Lucent Internal load balancing in a data switch using distributed network processing
US20060149841A1 (en) * 2004-12-20 2006-07-06 Alcatel Application session management for flow-based statistics
US7480304B2 (en) * 2004-12-29 2009-01-20 Alcatel Lucent Predictive congestion management in a data communications switch using traffic and system statistics
US7751421B2 (en) * 2004-12-29 2010-07-06 Alcatel Lucent Traffic generator and monitor

Patent Citations (7)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US4893302A (en) * 1988-03-31 1990-01-09 American Telephone And Telegraph Company, At&T Bell Laboratories Arrangement for switching concentrated telecommunications packet traffic
CA1295391C (en) * 1988-03-31 1992-02-04 William Paul Lidinsky Metropolitan area network arrangement for serving virtual data networks
CA2276526A1 (en) * 1997-01-03 1998-07-09 Telecommunications Research Laboratories Method for real-time traffic analysis on packet networks
WO2000010297A1 (en) 1998-08-17 2000-02-24 Vitesse Semiconductor Corporation Packet processing architecture and methods
US20020034181A1 (en) 2000-09-20 2002-03-21 Broadcom Corporation Switch assembly having multiple blades in a chassis
CA2511997A1 (en) * 2003-01-29 2004-08-19 Telcordia Technologies, Inc. Mitigating denial of service attacks
EP1469653A2 (en) 2003-04-15 2004-10-20 Sun Microsystems, Inc. Object aware transport-layer network processing engine

Non-Patent Citations (1)

* Cited by examiner, † Cited by third party
Title
See also references of EP1932312A4

Cited By (5)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
WO2008144886A1 (en) * 2007-05-25 2008-12-04 Zeugma Systems, Inc. Multi-level packet classification and routing in a distributed compute environment
US7773510B2 (en) 2007-05-25 2010-08-10 Zeugma Systems Inc. Application routing in a distributed compute environment
US7706291B2 (en) 2007-08-01 2010-04-27 Zeugma Systems Inc. Monitoring quality of experience on a per subscriber, per session basis
US8374102B2 (en) 2007-10-02 2013-02-12 Tellabs Communications Canada, Ltd. Intelligent collection and management of flow statistics
US8264965B2 (en) 2008-03-21 2012-09-11 Alcatel Lucent In-band DPI application awareness propagation enhancements

Also Published As

Publication number Publication date
CA2620349A1 (en) 2007-03-22
EP1932312A1 (en) 2008-06-18
WO2007030917A8 (en) 2007-08-09
US7508764B2 (en) 2009-03-24
US20070058632A1 (en) 2007-03-15
EP1932312A4 (en) 2011-08-24
CA2620349C (en) 2017-06-13

Similar Documents

Publication Publication Date Title
US7508764B2 (en) Packet flow bifurcation and analysis
US7733891B2 (en) Methods and apparatus to support dynamic allocation of traffic management resources in a network element
US7773510B2 (en) Application routing in a distributed compute environment
US11451491B2 (en) Methods and apparatus related to virtualization of data center resources
EP1922852B1 (en) Application driven fast unicast flow replication
US8755396B2 (en) Methods and apparatus related to flow control within a data center switch fabric
US20080298230A1 (en) Scheduling of workloads in a distributed compute environment
US8730954B2 (en) Methods and apparatus related to any-to-any connectivity within a data center
EP2206289B1 (en) Intelligent collection and management of flow statistics
US8340088B2 (en) Methods and apparatus related to a low cost data center architecture
US7606147B2 (en) Application aware traffic shaping service node positioned between the access and core networks
US8265071B2 (en) Methods and apparatus related to a flexible data center security architecture
US8335213B2 (en) Methods and apparatus related to low latency within a data center
US20100061367A1 (en) Methods and apparatus related to lossless operation within a data center
EP2974230B1 (en) Common agent framework for network devices
US8838753B1 (en) Method for dynamically configuring network services
Wale ATCA Support of Wireless Flow-Based Routing–the NGN Wireless “God-Box”?

Legal Events

Date Code Title Description
121 EP: the EPO has been informed by WIPO that EP was designated in this application
WWE WIPO information: entry into national phase (Ref document number: 2620349; Country of ref document: CA)
NENP Non-entry into the national phase (Ref country code: DE)
WWE WIPO information: entry into national phase (Ref document number: 2006790602; Country of ref document: EP)
WWP WIPO information: published in national office (Ref document number: 2006790602; Country of ref document: EP)