This report documents the results of ESG Lab hands-on testing of the performance scalability of the HP StorageWorks X9000 Network Storage Systems (X9000). The HP X9000 Network Storage Systems are scale-out NAS solutions configured in failover clusters ("couplets") using industry-standard HP servers and storage with the IBRIX Fusion segmented file system pre-configured. The performance scalability of both large and small files is examined.
Published: October 19, 2010
File data growth has outstripped e-mail- and database-driven growth for some time now. The reasons are many: richer file formats, ubiquitous photo and video, online communities, collaboration tools, 3-D modeling, and 4-D imaging to name a few. But exotic file types are only part of the challenge. Organizations dealing with extensive and ever-growing archives of general purpose office data struggle with the same problems; they need to accommodate growth efficiently, manage it simply, and access it quickly.
Simultaneously, economic conditions have lead businesses to emphasize first CAPEX, and now OPEX, reductions more than ever. File growth has resulted in higher costs in terms of additional storage, complex management, and data center floor space and energy. As a result, the ability to scale out-that is, independently scale and tune bandwidth, processing, and storage capacity on the fly while managing a single, global namespace-is extremely popular for increasing efficiency and saving money.
Adoption of scale-out NAS solutions is driven by its ability to address multiple challenges. ESG survey respondents report selecting scale-out NAS to achieve faster storage provisioning, better scalability with easier management, improved performance of both I/O and throughput, and higher data availability (see Figure 1).[1]

The X9000 Network Storage Solutions are an integrated appliance series consisting of industry-standard HP servers and storage with the IBRIX Fusion segmented file system. Three models are available: one built to optimize performance or capacity, one focused on low cost and high density, and one gateway front end for use with existing SANs. X9000 couplets can be combined into a single file system with nearly linear capacity and performance scalability.

The X9000 is designed to enable extreme scaling, up to and beyond the currently tested 16PB configuration, in a single namespace. As a result, large file systems can be accessed, managed, and protected from a single interface. Unlike other clustered file system solutions, the X9000 was designed for multi-tenant environments which often have a wide variety of workloads, including response-time sensitive, small random workloads (such as general purpose file sharing) and bandwidth-intensive, large sequential workloads (such as high definition video editing). Features include:
ESG Lab evaluated the results of three sets of X9000 performance tests that were run in an HP lab in Fort Collins Colorado. The first tested large block sequential throughput using three HP StorageWorks X9320 Network Storage System (X9320) couplets, the second tested large block sequential write performance using three HP StorageWorks X9300 Network Storage Gateway (X9300) couplets, and the third tested small file performance using a single X9320 couplet.
The Fusion file system at the heart of an X9000 series scale-out NAS solution was designed to provide near linear performance scalability for large and small files, regardless of the size of the file system. Fusion technology was designed by IBRIX, which was acquired by HP in 2009. The Fusion file system uses a segmented architecture that provides scalable performance for large and small file workloads.
The Fusion file system distributes file data and metadata over multiple servers that work together in a fault tolerant, scalable cluster. Each segment server in a Fusion cluster has a map to determine which segment holds a particular file. As a result, regardless of which node accepts the data request, the server can quickly identify the location and deliver the data. This approach minimizes performance bottlenecks as metadata and file access traffic is distributed evenly over multiple segment servers.
Performance testing began with three X9320 couplets as shown in Figure 3. A "couplet" is a term HP uses to describe a pre-configured, fault tolerant pair of HP servers running the Fusion file system software attached to a pre-configured HP disk array. In this example, three couplets used 240 15K RPM disk drives to present a single 73 TB Fusion file system to 24 NFS v3 clients. The clients were attached to an Ethernet LAN using four bonded GigE ports. The X9320 couplet was attached to the LAN using a pair of 10 GigE connections.[2]

The HP SNPT performance utility was used to generate workloads and measure performance during each of the tests presented in this report. The SNPT utility, which stands for "simple NAS performance toolkit," was used as an alternative to industry-standard tools including iozone, Iometer, bonnie, dd, and countless other file system performance tools. SNPT was originally developed by IBRIX to overcome the limitations of industry-standard file system benchmarking tools. In particular, it was first developed for testing small file performance over an extremely large file system. Since then, the SNPT utility has been used to measure large file performance as well.
SNPT supports multiple streams running over multiple clients writing over many files with a variety of directory structures. SNPT collects fine grain performance data, down to the millisecond level for each file operation if needed. It collects performance data captured at the OS or network levels as well.
ESG Lab audited the source code of the SNPT utility and verified that it uses standard file system calls to open, write, read, and close files. Scripts used to generate SNPT calls and logs created by the SNPT utility were audited by ESG Lab. Performance data collected independently at the NFS client level to verify the authenticity of performance results reported by SNPT was also examined.
The first set of tests was designed to measure the performance of X9320 couplets for throughput-intensive write and read workloads. Sequential transfers to large files were used during this phase of testing. The aggregate throughput of the system was recorded as 24 Linux clients accessed a single Fusion file system in parallel. Tests were run on a Fusion file system deployed over one, two, and three couplets. The first two couplets were configured with 96 disk drives each. The third couplet was configured with 48 drives. The results are shown in Figure 4 and Table 1.


Why This MattersData growth can slow revenue-generating applications. For bandwidth-intensive applications such as animation rendering, medical imaging, life sciences and scientific modeling, performance can suffer when larger and larger files travel through the system. Organizations spend money on additional storage equipment, data center space, and energy, wasting capital on over-provisioning and increasing management complexity. The downtime associated with adding servers and storage can lead to increased costs and lower productivity. On the other hand, if performance is predictably fast as the number and size of files stored and accessed grows, productivity improves and the return on investment is faster. Predictable, balanced performance with linear scalability speeds workflows, which saves time and money. Having these features delivered in a pre-configured unit with industry-standard servers and storage that grow in modular units makes them easy to use and cost efficient. ESG Lab has confirmed that X9320 couplets have excellent levels of large block throughput performance (up to 1.448 GB/sec per couplet) that increase in near-linear fashion as couplets are added. |
The X9300 is the scale-out NAS gateway version of the X9000 series product line. Using the same hardware and software as the X9320 product line, an X9300 can be configured to work with a variety of disk arrays, including those arrays an organization may already own.
ESG Lab audited the results of throughput testing performed on a large three-couplet X9300 test bed. This particular test bed was created to validate the write performance of an X9000 Series solution that had been ordered by a customer for a high performance computing (HPC) application with extreme bandwidth requirements.
As shown in Figure 5, the X9300 couplets were tested with six HP StorageWorks MSA disk arrays. The disk arrays, with a total of 432 15K RPM disk drives, were attached to the couplets via 4 Gbps FC connections. Ninety-six clients accessed a single 43 TB Fusion file system through a Gigabit Ethernet interface. Each of the couplet servers was attached to the LAN via a single 10 Gigabit Ethernet connection.

The Fusion file system was accessed with IBRIX client software. The IBRIX client provides more performance compared to the industry-standard NFS and CIFS clients supported by X9000 Series scale-out NAS solutions. During read operations, the IBRIX client uses a map to direct host file requests to the segment server that holds those files. During writes, the IBRIX client uses load balancing algorithms to distribute file data evenly over segment servers. During this test, the IBRIX client was used to direct each client to a specific segment server. For this particular HPC workload, this provided the best performance.
Each client wrote a single large file. Aggregate throughput was calculated as the total amount of data written by all clients divided by the elapsed time of the clients. Elapsed time included the time to flush write data from cache to disk and close files. The results are shown in Figure 6.

Performance was measured using the open source collectl and colmux utilities.[3] The performance trace data is shown in Figure 7. Note that three five-minute test runs correspond to the results shown above in Figure 6.


Why This MattersHigh performance computing applications often have a real-time requirement to record and share vast amounts of large file data. Application performance and productivity can suffer when the storage system can't keep up with those real-time data requirements. An inability to keep up with incoming data feeds can lead to gaps in the data. Catching up is typically not an option. As a result, gaps in a real-time data stream can lead to a number of problems including application errors, misleading results, dissatisfied customers, lost productivity, and, in the worst case, lost revenue. A number of applications in the broader business application arena have similar real-time performance requirements. Network traffic analysis, security log auditing, video surveillance, and video upload sites are common examples of applications that can benefit from predictably scalable aggregate write throughput. A scale-out NAS gateway enables organizations to leverage their current storage investments to minimize cost and deployment complexity. ESG Lab has confirmed that a single X9000 couplet running on a pair of industry-standard HP servers can be used to record large files at an aggregate write throughput rate of 896 MB/sec. Performance scales in a nearly perfect linear fashion as couplets are added to a single Fusion file system. |
Content delivery is a good example of an application that benefits from a storage solution with predictably scalable large file throughput. Content delivery is used to define a broad category of systems which deliver digital video, image, and audio files over a delivery medium such as a broadcasting network, intranet, or the Internet. The delivery of large files over a content delivery medium requires a storage solution with lots of capacity and aggregate read throughput. To put the aggregate throughput results recorded by ESG Lab into perspective, the maximum aggregate read throughput that was recorded for the three-couplet X9320 system (3.70 GB/sec) was used to calculate the number of streams that could be delivered for a number of well known content types. The results are summarized in Figure 8 and Table 3.


Why This MattersContent delivery systems have strict performance demands. A performance bottleneck in any of the systems between the consumer and the content can lead to lost revenue and dissatisfied customers. Meeting the bandwidth-intensive performance demands of content delivery systems using traditional storage architectures often leads to over-provisioning, wasted capital costs, increased complexity, and excessive demands on data center infrastructure. The segmented architecture of the Fusion file system at the heart of an X9000 series scale-out NAS solution delivers predictably scalable throughput that's ideally suited for the cost effective delivery of digital content. |
General purpose business applications that store data on shared file systems typically create and access lots of small files. As the number of users sharing a single file system increases, small file performance needs to scale to meet the demands of the business.
High performance and rich media applications also often require good small file performance. Applications that store lot of large files often create small files to keep track of metadata (data about the files). A website that stores uploaded photos on a file share is a good example: besides the large photo files, the application often creates small thumbnail images and may also use small files to store metadata (e.g., titles, tags).
The Fusion file system at the heart of an X9000 scale-out NAS solution is designed to perform well for large and small file workloads.
The test bed used to analyze small file performance is shown in Figure 9. A single X9320 couplet with 96 15K RPM drives was used during this phase of testing. Each of twelve NFS V3 clients accessed the Fusion file system through a single Gigabit Ethernet connection. Both of the X9320 couplet servers were attached to the LAN via dual 10 Gigabit Ethernet interfaces.

The HP SNPT performance utility and a script running on each of the 12 NFS clients was used to create as many 4 KB files as possible over the course of an hour. A master script launched synchronized copies of the script on each of the clients and waited for them to finish. The total number of files created was recorded. The results are presented in Figure 10 and Table 4.


Why This MattersSmall files can cause performance problems that hinder operations. This can lead to revenue hits from customers frustrated by slow Web pages or transactions interrupted by time-outs. This can also lead to downtime and increased management expense as IT managers scramble to deploy and balance additional legacy NAS systems to meet the demands of the business. Most scale-out NAS solutions handle either large files or small files well, but not both. ESG Lab has confirmed that the X9320 delivers scalable performance for both small and large files. |
In case you haven't heard, the ever-expanding volume of unstructured data is expensive to store and retain and hard to manage. While it is difficult to make this oft-repeated statement sound fresh and interesting, the simple fact is that file data growth drives continuing customer challenges and storage industry innovation. Today, all you have to say is "file growth cost and complexity" and your audience fills in the back-story automatically, conjuring images of staffing, energy, data access, retention, and protection challenges.
Many vendors, over many years, have tried to solve the problems of predictable cost effective scale-out NAS performance. A number of well-funded companies came and went without gaining any real traction and hundreds of millions of dollars have gone up in smoke. Scaling has turned out to be a very difficult problem to solve. Most solutions work well for large or small files, but not for both-leading many companies to deploy a scale-out NAS solution alongside a commodity, entry-level NAS solution to deal with metadata. The difficulty is further demonstrated by the fact that storage industry behemoths, while making progress, have not been able to get the job done either by acquisition or internal development.
ESG has been following IBRIX for several years, since well before its acquisition by HP. We've written about it, spoken with customers, and performed several Lab Validations. IBRIX was born in the high-performance computing arena and early animation successes with Disney and Pixar put it on the map. To its credit, while the file system started out highly specialized, IBRIX expanded the product to benefit not just niche markets like medical imaging, scientific research, and animation, but general purpose file serving and file sharing as well. Its segmented architecture makes it work for both large files and large numbers of small files, expanding its usage and target market.
What impressed ESG Lab was not just the technology and virtually unlimited scalability, but IBRIX's commitment to effective partnering, go-to-market, and messaging strategies. IBRIX was smart to keep a narrow focus on scale-out NAS and to concentrate on OEM solutions with Dell and HP, which led to interest from EMC and IBM. Over time, the configuration and GUI were dramatically simplified and IBRIX focused on improving serviceability, usability, and manufacturability. All of this hard work resulted in its acquisition by HP in 2009.
In this most recent Lab Validation, ESG Lab confirmed that the Fusion segmented file system running on industry-standard servers and storage from HP delivers outstanding levels of performance and scalability for small and large files. An excellent large file throughput rate of 1.48 GB/sec can be achieved by a single HP StorageWorks X9320 scaling in a near-linear fashion as couplets are added to a single Fusion file system. Small file performance was outstanding as a single X9320 cluster created 5.1 million 4 KB files in an hour.
Who would have guessed that companies overwhelmed by Word and PowerPoint archives could benefit from the same solution as those burdened by 100-TB annual growth of genome sequencing data? Who knew that a NAS file system developed for high-performance computing could evolve into a graceful, cost-effective scale-out solution with predictable and near-linear performance for small and large files and exotic and everyday applications? The challenges that scale-out NAS solves are much more "everyday" than "lunatic fringe," and the X9000 makes it consumable by almost anyone. If you are facing file system growth and complexity challenges, you should consider the X9000. It's affordable, includes commercial features like snapshots and replication, and lets NFS and CIFS work on the same file system. You can buy a scale-out architecture that will grow with you and meet the needs of your business without interruption. The Fusion segmented file system, combined with HP's servers and storage (not to mention HP's buying power and supply-chain advantage), brings what started as a niche solution to the masses.




[1] Source: ESG Research Brief, Scale-Out NAS Adoption & Market Drivers, February 2009.
[2] See the Appendix for more configuration details.
[3] collectl.sourceforge.net
[4] The bit rates for each content type are listed in Table 5, which can be found in the Appendix.
[5] 5,131,623 files divided by 3,600 seconds equals 1,425 files per second.
ESG Lab ReportsThe goal of ESG Lab reports is to educate IT professionals about emerging technologies and products in the storage, data management and information security industries. ESG Lab reports are not meant to replace the evaluation process that should be conducted before making purchasing decisions, but rather to provide insight into these emerging technologies. Our objective is to go over some of the more valuable feature/functions of products, show how they can be used to solve real customer problems and identify any areas needing improvement. ESG Lab’s expert third-party perspective is based on our own hands-on testing as well as on interviews with customers who use these products in production environments. This ESG Lab report was sponsored by HP. |
[...] ESG Group article on the Performance of the HP X9000 (IBRIX and Fusion FS) – compare it with this from Ideas [...]
[...] is still discussing the IBRIX-based X9000 products as its NAS play, and while ESG Labs has tested the X900o and found it to be a very capable product that can perform well in a variety of environments and [...]
Browse by Content Type
Categories
Share