Next-gen distributed computing
David Margulius, Information Age
18/06/2002 17:13:24
Next-gen distributed computing If your large enterprise is like most, about 10 per cent of your available CPU cycles actually gets used -- a fact that is driving enterprises to deploy the next generation of cost-reducing distributed computing architectures. Distributed computing is coming, and its benefits will be significant. Grid computing, utility computing, P2P (peer to peer) architectures . . . toss in Web services, and you have a recipe for radically transforming the enterprise IT infrastructure, lowering costs, and making distributed and failure-tolerant systems more viable. Some draw analogies between today's datacentre and the days when every company had its own power generator manned by workers shovelling coal -- not a scalable solution (nor for that matter a highly available one) until the modern utility emerged. "It's not like one day we'll wake up and computing flows as easily as the water," says Bill Martorelli, vice president of Enterprise Services at Hurwitz Group, adding that current initiatives "are really the culmination of years and years of distributed computing concepts". And these concepts are becoming reality. Major players, including IBM, Sun and Hewlett-Packard, are introducing utility computing offerings featuring dynamic capacity scaling, aligning their internal grid and Web services initiatives, and supporting open-source standards efforts for linking computing grids cross-enterprise and globally. Most importantly, enterprises are starting to deploy computing and data grids on a cluster-and campus-wide scale. The idea is to take advantage of idle capacity, whether across the datacentre, around the world, or at different times of the day. Grid computing has primarily focused on solving the systems management challenges of distributed computing, such as security, authentication and policy management across heterogeneous platforms and organisations. Utility computing has focused on developing provisioning technology and the business model for on-demand, pay-as-you-go infrastructure. And P2P efforts have drawn attention to the potential for leveraging idle resources to handle huge computing tasks. Now these worlds are converging because of customer demand. As companies push for better resource utilisation, lower management costs, and more distributed, failure-tolerant infrastructures, a combination of grid and utility computing is a no-brainer. "Everything is still a big soup of new technologies - it's like a puzzle," says Wolfgang Gentzsch, director of grid computing at Sun. "But everyone wants to have it." Laying the grid Grid technology, originally developed to deliver supercomputing power to large scientific projects, comes in two flavours: compute grids (using distributed CPUs) and data grids (using distributed data sets). Although grids promise dramatic ROI from shifting enterprise computing workloads, around the globe or to external partners, most enterprises are starting with smaller, local deployments known as cluster (single datacentre) or campus grids -- in line with the trends toward clustering and virtualising resources in the datacentre, and using shared file systems and the cheapest available hardware and software. Gene Logic, for example, recently deployed a cluster computing grid using software from Avaki to help run processing-intensive DNA sequencing routines. The company wanted to retire an older Unix system with high maintenance costs, according to Joe Parlanti, Gene Logic's manager of computing infrastructure support. But buying a new system meant "you either have to buy it big enough and grow into it or buy something [smaller] and be stuck with the size". The company had some smaller Unix machines it used infrequently for a different application. By linking them into a grid, it was able to handle the sequencing work with no scheduling conflicts and eliminate the older machine. "The grid's so much quicker, about 15 per cent of the amount of time needed to run the total sequence," Parlanti says. "It improves our efficiency, and lowers our cost basis for support, and I can add and delete systems from the grid at will. It's very versatile." But multisite and inter-enterprise deployments await the development of software to handle the systems administration issues associated with heterogeneous-environment grids. Authentication and naming, job scheduling and policy management, fault tolerance and failure recovery, site autonomy, and QoS (quality of service) are among key issues being addressed by startups including Avaki and Entropia and larger players such as Sun and IBM. A major step forward was the recent introduction, with support from IBM, Microsoft and others, of a standards proposal called OGSA (Open Grid Services Architecture), building on an existing open-source grid middleware solution from the academic Globus Project (www.globus.org) plus Web services standards including XML, WSDL (Web Services Description Language), and SOAP (Simple Object Access Protocol). "We'll all be better off if there's a common set of protocols that allow these services to interoperate," says Dr Carl Kesselman, a co-leader of the Globus project. P2P architectures are also converging with the grid model. Although P2P has failed to gain enterprise momentum due to the greater perceived manageability and security of tightly coupled server-oriented systems, some P2P players such as Groove Networks are focusing on collaborative computing and communication (more akin to a data grid). Others, including Sun's JXTA, are trying to create building blocks to solve some of the classic grid problems, like authentication. "P2P is going to be absorbed or amalgamated into other activities like grid computing," speculates Andrew Grimshaw, Avaki's CTO and founder. Computing on tap As impending grid standards bring cycles-on-tap closer to reality, IT departments are getting a taste of utility computing -- a new model of à la carte options where you pay only for what you eat, whether buying from an external vendor or a managed services provider. The idea is to extend the cost benefits of outsourcing into the datacentre, while still providing centralised control and high availability. The utility model can be as simple as buying a 64-CPU server but only paying to use 32 initially, or paying based on average utilisation throughout the day and scaling up or down dynamically as needed. Or it can be as complex as dynamically configuring and provisioning server farms, storage systems and other resources on demand, outsourcing peak computing demand to external datacentres, or in the extreme even outsourcing your whole IT infrastructure to another company - as American Express recently announced it would to IBM, in a $4 billion deal. Several vendors have announced utility-like dynamically-reconfigurable infrastructure initiatives -- for example HP's Utility Data Center, Sun's N1 project, Compaq's Adaptive Infrastructure, and IBM's eLiza. The goal of the new model is "driving asset utilisation by taking static islands of computing scattered throughout the enterprise and making them dynamic", explains Pat Tickle, vice president of product management at Terraspring, one of several startups including eJasent and Jareva that are developing software to orchestrate the dynamic management, provisioning, and scaling of datacentre infrastructure. Such software enables IT managers to build a library of resource configurations, map and schedule applications onto servers, and track and bill cycles consumed to the appropriate user. In theory, such offerings will enable IT departments to manage internal resources more efficiently, and outsourcers to build true low-cost computing utilities. The utility model would also be a win for business continuity, making it cheaper and easier to purchase standby capacity, and to replicate and re-allocate capacity in case of a failure. But before utility computing can happen on a large scale, cultural issues in addition to security and performance issues have to be resolved. "IT organisations are used to running their own infrastructure, they like to control the machines," points out Michael Nelson, IBM's director of Internet technology and strategy. But he adds "we know we can't continue on the present trend - companies are overwhelmed by the cost of managing their systems". THE BOTTOM LINE Grid, P2P and utility computing Executive summary: The convergence of grid, peer-to-peer, and utility computing efforts -- along with Web services -- promises to revolutionise the datacentre by lowering costs, eliminating server underutilisation, reducing management complexity, and providing additional business continuity benefits. Test centre perspective: Enterprises are just starting to deploy grid and utility computing products and services. Widespread usage of external computing utilities, and of multisite and inter-enterprise distributed computing, is likely to require the further development of platform-independent standards.
[ Printer Friendly Version ]
[ Other stories about Collaborative Computing, Hewlett-Packard, OGSA, IBM, Microsoft, Entropia, Avaki, Hurwitz Group, Terraspring, American Express, Globus Project, Gene Logic, Groove Networks, Compaq ]
|