Hewlett-Packard NonStop Enterprise Server
Summary By Ann Katan Analytical Source: Ed Cowger
Note:
Corporate Headquarters List of Tables
Table 2: Processor (Speed) Table 3: Price List: HP NonStop Enterprise Server
Overview Tandem Computers was the first company to introduce general-purpose commercial fault-tolerant servers to the market. Today's HP NonStop S-series servers are the direct descendents of the original design that Tandem brought to market over 25 years ago, which based on a massively parallel system hardware architecture with no single point of hardware failure, and process and data replication. At its simplest, the hardware components of today's S-series server consist of a single system with two logical processors (each logical processor comprised of two lock-stepped physical processors) and extensive built-in error-checking and fault-isolation features connected through a low-cost, high-bandwidth, low-latency network to each other and other resources such as disk controllers and communications adapters. Despite generations of changing technology and a vast expansion of its initial product offering into a robust product line, the NonStop Enterprise Division has maintained application compatibility across all its systems. The NonStop servers are targeted toward high-profile e-business, business intelligence, message switching and online transaction processing in the financial (banking, brokerage houses, securities exchanges), telecommunications (including cellular), travel, healthcare, government, U.S. emergency 911 services and retail industries, where reliability and scalability are crucial. The entire culture of the HP's NonStop Enterprise Division (NED) is geared toward preventing any downtime for its customer's applications. The focus on providing continuous application availability has remained unchanged since the first Tandem computer was introduced. Since then, the NonStop Enterprise Division has continued to ride the technology curve, resulting in today's highly scalable ServerNet interconnect technology that allows data to be transferred directly between any processor and input/output (I/O) device. The building block of the NonStop S-series server is the Processor MultiFunction (PMF) card. The PMF has memory, dual-ported ServerNet access, Small Computer Systems Interface (SCSI) ports, Ethernet controller, service processor and two processors, with each processor having its own cache. Each PMF runs its own copy of the NonStop Kernel (NSK) operating system. The two processors on each PMF behave as a "logical uniprocessor." They run in lock step, executing the same instruction stream out of their local cache. The output of the processors is continuously compared. If the outputs ever disagree, the operating system will immediately shut down the PMF and record the error. The operating system will then automatically shift the workload to another PMF that contains the backup process. The loosely coupled, shared-nothing architecture of the NonStop S-series allows system performance to scale virtually linearly as additional resources are added to the system. ServerNet routers route packets of data directly between devices in the system: disk to processor, processor to processor and communications line to processor. This switched network eliminates performance bottlenecks caused by shared resource conflicts. With the ServerNet capability to directly connect data sources and destinations, any processor has direct access to system resources of other processors. The NonStop Kernel operating system permits the processors to operate independently but in a cooperative manner. The NonStop Kernel microkernel-based operating system has two user/administrative interfaces or personalities: one interface supports the Guardian operating system interfaces used in older NonStop products, and the other interface provides a Unix environment. These interfaces are pure application programming interfaces (APIs), as opposed to any kind of translators, gateways or interpreters. Both "personalities" coexist on the same server. The NonStop S-series servers have a minimum of two PMF units (four physical processors). Within a single system, PMF units can vary in processor type, amount of memory and type of SCSI adapters. This allows users to cost-effectively configure the server for different applications and protects customer investments. The newest high-end members of the S-series are the S86000 and S76000 powered by MIPS Technologies R14000 processors. The midrange server is the S7600 based on the MIPS Technologies R12000 processor. Both the high-end and midrange servers can be expanded in increments of two PMFs to a maximum size of 16 PMFs in a single node. With the use of the NonStop Cluster Switch, up to 24 S-series nodes can be connected to create a virtual NonStop system with up to 384 PMFs. A NonStop system can expand to 255 nodes with 16 processors each for a total of 4,080 processors. HP also offers entry-level, nonexpandable, 2-PMF configurations using the MIPS R14000 and R12000 microprocessors. These entry-level systems can be upgraded to larger servers with a higher license charge.
Future Direction
The major changes for application developers include:
Beyond the initial introduction of the NonStop/Itanium systems, both MIPS-based and Itanium-based NonStop servers will have concurrent releases of the NonStop Kernel operating system. However, some new functionality, such as 64-bit addressing, will only be implemented for the Itanium-based NonStop servers.
Analysis To guard against failures caused by corrupted or inadvertently changed data, NonStop servers employ a variety of methods to ensure data integrity. The "logical uniprocessor" use of lock-stepped dual processors in the S-series modules provides a checking mechanism to ensure that a malfunctioning processor cannot corrupt any data. This feature has become more important over time, as increasing chip densities result in increasing frequencies of transient errors. In recent years, other vendors of high-end systems have moved to a lock-stepped approach in recognition of this problem, including IBM for its zSeries systems. In addition, data is checked and verified whenever it is moved. Data on all buses is parity-protected. Parity checks, packet sequence numbers and checksums protect I/O controllers. Before data is written to disk, a checksum is calculated that is appended to the disk data. When data is read back, the checksum is recalculated and compared to the checksum that was stored on disk. If the checksums match, the data has not been altered. Of course, if the data has been altered, there is a mirrored disk drive from which the information can be retrieved. ServerNet itself is capable of providing a fault-tolerant communications network. Since there are multiple paths to all the components of the servers, and since stored data is duplicated, no single failure can block communications between a process and its data. All processors, I/O devices and communications adapters are monitored by the service management subsystem, which detects and isolates failures. If a component fails, data is routed over a different path to the same destination. Each packet passed through the ServerNet network includes cyclic redundancy checks to ensure its integrity. Each ServerNet 2 link runs at 1.25G bps, full-duplex, for an aggregate router bandwidth of 1.5GB/second. ServerNet uses wormhole routing where an incoming packet is forwarded as soon as the destination address in its header is decoded. (In more conventional networks, a packet is not routed until it has been received in its entirety.) As a result, ServerNet can achieve latencies of only 300 nanoseconds in each router. ServerNet II delivers five times the throughput of the previous generation of ServerNet routers. The NonStop environment provides the capability for any process to run on at least two different PMFs. Using this capability of the NonStop platform, the operating system implements the concept of "process pairs." A process pair consists of a primary process and a backup process that run on separate PMFs. The primary process sends checkpoint messages to its backup process. These messages ensure that the backup process has all of the process state information that it needs to take over the processing of the application. Thus, a NonStop server can provide a fault-tolerant platform where users can continue interacting with an application and will continue to be serviced without any noticeable interruptions, despite the failure of any component, including the system software. In contrast, high-availability clustering solutions ensure that a server is always available to run an application, but the application itself will have to be restarted. Restarting an application can cause delays in service to users. The newest PMF used in the S86000, featuring the 550MHz MIPS R14000 processor combined with other hardware and software improvements, can provide a performance increase of up to 1.9 times over the previous high-end generation processor used in the S74000. The 500MHz R14000 processor used in the S76000 with its smaller cache size can provide a 1.5 times performance increase over the S74000. Despite these performance improvements, the MIPS R14000 processor significantly trails the performance of the leading Intel and RISC processors available from the major system vendors, as can be seen in the table below. [Note that the CINT benchmarks in the table below are an indication of individual processor performance only and are not a measure of system throughput.]
The retrofit of the HP NonStop servers with Intel Itanium processors will provide a significant performance boost for NonStop customers. However, there will be steps that users will have to perform to upgrade to the Itanium-based servers. The Itanium-based S-series will be able to execute original CISC-based application binaries at roughly the same speed as they execute on today's MIPS-based processors. However, there will be dramatic performance improvements if these applications are recompiled on the Itanium platform from source code. MIPS-based binaries must be recompiled from source code since there will be no MIPS interpreter on Itanium. MIPS object code will not run in the Itanium environment. HP has committed to providing the necessary compilers and other porting tools for this migration. The same languages will be supported on Itanium as are supported on MIPS. S-series infrastructure investments in storage and I/O adapters will be preserved during this transition, since the Itanium-based NonStop servers will continue to use ServerNet as the system communications backbone. The NonStop Enterprise Division continues to provide new software interfaces and features to ensure that its servers support the latest computing paradigms. NonStop servers support many industry-standard development tools and application environments including Java, Java Database Connectivity (JDBC), Common Object Request Broker Architecture (CORBA), BEA Tuxedo and WebLogic Server (WLS), Extensible Markup Language (XML), Simple Object Access Protocol (SOAP) and Open Database Connectivity (ODBC). Several enterprise management frameworks such as CA-Unicenter, BMC Patrol and HP OpenView are also supported.
Pricing
GSA Pricing
Competitors
Strengths
Hardware/Software-Based Fault Tolerance
System Scalability
Long Experience Building Parallel Systems
Database "Mixed Workload" Capability
Capacity On Demand
Limitations
Limited Application Set
Proprietary Disk Interface
Future Processor Technology
Insight HP continues to maintain the NonStop Enterprise Division's image as the leading fault-tolerant provider and has delivered enhanced products to satisfy its very loyal customer base. With HP's backing, the NonStop Enterprise Division should continue to enjoy its lead in mission-critical fault tolerance even as it changes processor technology. The issue here is how much pain and suffering its customers will incur in the migration to the Intel Itanium processor family. High-end, clustered Unix systems with partitioning or resource management software, robust database management systems and transaction processing (TP) monitors are becoming more reliable every year and today provide high-availability alternatives to NonStop systems. The same goes for IBM's zSeries and other S/390-compatable mainframes with Parallel Sysplex. It should be noted that when the Unix server vendors offer availability guarantees, even at 99.999 percent, the guarantee is only for an operating system prompt. Restarting an application on another node in a cluster can require a substantial amount of time. The NonStop server customer must decide whether the additional costs are worth the reduction in user-perceived downtime. We should also point out that while the acquisition cost of a NonStop server may be higher than the alternatives, the TCO may be more in line with clustered high-availability solutions, which are difficult and expensive to implement and manage. The NonStop server advantage is that it has been designed from the ground up for continuous availability. Gartner Datpro Document DPRO-90075, 27 February 2003.
![]()
© 2003 Gartner, Inc, and/or its Affiliates. All rights reserved. Reproduction of this publication in any form without prior written permission is forbidden. The information contained herein has been obtained from sources believed to be reliable. Gartner disclaims all warranties as to the accuracy, completeness or adequacy of such information. Gartner shall have no liability for errors, omissions or inadequacies in the information contained herein or for interpretations thereof. The reader assumes sole responsibility for the selection of these materials to achieve its intended results. The opinions expressed herein are subject to change without notice. |