<?xml version="1.0" encoding="iso-8859-1"?>
<rss version="2.0" 
     xmlns:content="http://purl.org/rss/1.0/modules/content/"
     xmlns:wfw="http://wellformedweb.org/CommentAPI/"
     xmlns:dc="http://purl.org/dc/elements/1.1/"
 >
<channel>
	<title>ClusterCenter / wensong - voted</title>
	<link>http://clustercenter.org</link>
	<description>Pligg Web 2.0 Content Management System</description>
	<pubDate>Sat, 07 Mar 2009 23:50:29 +0800</pubDate>
	<language>en</language>
	<item>
		<title><![CDATA[Joe Stump - Scaling Digg and Other Web Applications]]></title>
		<link>http://clustercenter.org/architecture/Joe-Stump--Scaling-Digg-Other-Web-Applications/</link>
		<comments>http://clustercenter.org/architecture/Joe-Stump--Scaling-Digg-Other-Web-Applications/</comments>
		<pubDate>Sat, 07 Mar 2009 23:50:29 +0800</pubDate>
		<dc:creator>wensong</dc:creator>
		<category>architecture</category>
		<guid>http://clustercenter.org/architecture/Joe-Stump--Scaling-Digg-Other-Web-Applications/</guid>
		<description><![CDATA[Joe Stump is currently the Lead Architect for Digg, where he spends his time partitioning data, creating internal services, and ensuring the code frameworks are in working order. Digg by the numbers: 30,000,000 Ron Paul fans. 13,000 requests a second, bunches of servers. &nbsp;&#187;&nbsp;<a href='http://www.krisjordan.com/2008/09/18/joe-stump-scaling-digg-and-other-web-applications/'>original news</a>]]></description>
	</item>

	<item>
		<title><![CDATA[Using the Cloud to build highly-efficient systems - All Things Distributed]]></title>
		<link>http://clustercenter.org/loadbalancing/Using-Cloud-to-build-highly-efficient-systems--All-Things-Distributed/</link>
		<comments>http://clustercenter.org/loadbalancing/Using-Cloud-to-build-highly-efficient-systems--All-Things-Distributed/</comments>
		<pubDate>Tue, 24 Feb 2009 10:10:18 +0800</pubDate>
		<dc:creator>wensong</dc:creator>
		<category>loadbalancing</category>
		<guid>http://clustercenter.org/loadbalancing/Using-Cloud-to-build-highly-efficient-systems--All-Things-Distributed/</guid>
		<description><![CDATA[Werner Vogels, CTO of Amazon.com, explained clearly why the Cloud is used to build highly-efficient systems. &quot;By using infrastructure as a service, basic IT costs are moved from a capital expense to a variable cost, building clearer relationships between expenditures and revenue generating activities.&quot; &nbsp;&#187;&nbsp;<a href='http://www.allthingsdistributed.com/2008/10/using_the_cloud_to_build_highl.html'>original news</a>]]></description>
	</item>

	<item>
		<title><![CDATA[Hypertable: An Open Source, High Performance, Scalable Database]]></title>
		<link>http://clustercenter.org/database/Hypertable-Open-Source-High-Performance-Scalable-Database/</link>
		<comments>http://clustercenter.org/database/Hypertable-Open-Source-High-Performance-Scalable-Database/</comments>
		<pubDate>Thu, 14 Feb 2008 11:54:18 +0800</pubDate>
		<dc:creator>wensong</dc:creator>
		<category>database</category>
		<guid>http://clustercenter.org/database/Hypertable-Open-Source-High-Performance-Scalable-Database/</guid>
		<description><![CDATA[Hypertable is an open source project to develop a massively parallel, high-performance database platform. Its architecture more or less follows Google's Bigtable design. The project goal is to bring the benefits of new levels of both performance and scale to the many data-driven businesses that are currently limited by previous-generation platforms. &nbsp;&#187;&nbsp;<a href='http://hypertable.org/'>original news</a>]]></description>
	</item>

	<item>
		<title><![CDATA[TIPC Project Home Page]]></title>
		<link>http://clustercenter.org/software/TIPC-Project-Home-Page/</link>
		<comments>http://clustercenter.org/software/TIPC-Project-Home-Page/</comments>
		<pubDate>Tue, 05 Feb 2008 11:38:39 +0800</pubDate>
		<dc:creator>wensong</dc:creator>
		<category>software</category>
		<guid>http://clustercenter.org/software/TIPC-Project-Home-Page/</guid>
		<description><![CDATA[The TIPC project is an Open Source implementation of the Transparent Inter Process Communication (TIPC) protocol. TIPC is designed for use in clustered computer environments, allowing designers to create applications that can communicate quickly and reliably with other applications regardless of their location within the cluster. The TIPC protocol originated at the telecommunications manufacturer Ericsson and has been deployed in their products for years; it has now been released to the Open Source community and is gaining acceptance in an increasing number of fields world-wide. TIPC is available for Linux and VxWorks operating systems; support for Solaris is currently being developed. Applications written in C (or C++) can utilize TIPC's capabilities with sockets created using the AF_TIPC address family; add-ons for Perl, Python, and Ruby are also available. (Note: The TIPC team is looking for volunteers interested in adding support for Windows or Java; see the &quot;Contacts&quot; link for contact information.) &nbsp;&#187;&nbsp;<a href='http://tipc.sourceforge.net/'>original news</a>]]></description>
	</item>

	<item>
		<title><![CDATA[OpenSAF.org]]></title>
		<link>http://clustercenter.org/architecture/OpenSAF-org/</link>
		<comments>http://clustercenter.org/architecture/OpenSAF-org/</comments>
		<pubDate>Tue, 05 Feb 2008 11:18:33 +0800</pubDate>
		<dc:creator>wensong</dc:creator>
		<category>architecture</category>
		<guid>http://clustercenter.org/architecture/OpenSAF-org/</guid>
		<description><![CDATA[OpenSAF is an open source project established to develop a base platform middleware consistent with Service Availability Forum (SA Forum) specifications, under the LGPLv2.1 license. The OpenSAF Foundation was established by leading Communications and Enterprise Computing Companies to facilitate the OpenSAF Project and to accelerate the adoption of the OpenSAF code base in commercial products. The OpenSAF project was launched in mid 2007 and has been under development by an informal group of supporters of the OpenSAF initiative. The OpenSAF Foundation was founded on January 22nd 2008 with Emerson Network Power, Ericsson, Nokia Siemens Networks, HP and Sun Microsystems as founding members. &nbsp;&#187;&nbsp;<a href='http://www.opensaf.org/'>original news</a>]]></description>
	</item>

	<item>
		<title><![CDATA[The Chubby Lock Service for Loosely-Coupled Distributed Systems]]></title>
		<link>http://clustercenter.org/software/Chubby-Lock-Service-Loosely-Coupled-Distributed-Systems/</link>
		<comments>http://clustercenter.org/software/Chubby-Lock-Service-Loosely-Coupled-Distributed-Systems/</comments>
		<pubDate>Sat, 24 Nov 2007 19:29:18 +0800</pubDate>
		<dc:creator>wensong</dc:creator>
		<category>software</category>
		<guid>http://clustercenter.org/software/Chubby-Lock-Service-Loosely-Coupled-Distributed-Systems/</guid>
		<description><![CDATA[We describe our experiences with the Chubby lock service, which is intended to provide coarse-grained locking as well as reliable (though low-volume) storage for a loosely-coupled distributed system. Chubby provides an interface much like a distributed file system with advisory locks, but the design emphasis is on availability and reliability, as opposed to high performance. Many instances of the service have been used for over a year, with several of them each handling a few tens of thousands of clients concurrently. The paper describes the initial design and expected use, compares it with actual use, and explains how the design had to be modified to accommodate the differences. &nbsp;&#187;&nbsp;<a href='http://labs.google.com/papers/chubby.html'>original news</a>]]></description>
	</item>

	<item>
		<title><![CDATA[Google Scalability Conference Trip Report: GFS, MapReduce, and BigTable]]></title>
		<link>http://clustercenter.org/architecture/Google-Scalability-Conference-Trip-Report-GFS-MapReduce-BigTable/</link>
		<comments>http://clustercenter.org/architecture/Google-Scalability-Conference-Trip-Report-GFS-MapReduce-BigTable/</comments>
		<pubDate>Sat, 24 Nov 2007 18:26:54 +0800</pubDate>
		<dc:creator>wensong</dc:creator>
		<category>architecture</category>
		<guid>http://clustercenter.org/architecture/Google-Scalability-Conference-Trip-Report-GFS-MapReduce-BigTable/</guid>
		<description><![CDATA[In a blog post, Microsoft's Dare Obasanjo shared his notes from the keynote session MapReduce, BigTable, and Other Distributed System Abstractions for Handling Large Datasets by Jeff Dean. The talk was about the three pillars of Google's data storage and processing platform: GFS, BigTable and MapReduce. &nbsp;&#187;&nbsp;<a href='http://www.25hoursaday.com/weblog/2007/06/25/GoogleScalabilityConferenceTripReportMapReduceBigTableAndOtherDistributedSystemAbstractionsForHandlingLargeDatasets.aspx'>original news</a>]]></description>
	</item>

	<item>
		<title><![CDATA[New Protocol Speeds Up Internet Resource Sharing]]></title>
		<link>http://clustercenter.org/loadbalancing/New-Protocol-Speeds-Up-Internet-Resource-Sharing/</link>
		<comments>http://clustercenter.org/loadbalancing/New-Protocol-Speeds-Up-Internet-Resource-Sharing/</comments>
		<pubDate>Sat, 24 Nov 2007 13:41:11 +0800</pubDate>
		<dc:creator>wensong</dc:creator>
		<category>loadbalancing</category>
		<guid>http://clustercenter.org/loadbalancing/New-Protocol-Speeds-Up-Internet-Resource-Sharing/</guid>
		<description><![CDATA[A Penn State researcher has developed a faster method for more efficient sharing of widely distributed Internet resources such as Web services, databases and high performance computers. The research paper, &quot;A Scalable Protocol for Deadlock and Livelock Free Co-Allocation of Resources in Internet Computing,&quot; was published at IEEE's Symposium on Applications and the Internet in Orlando, Fla. The proposed algorithm enables better coordination of Internet applications in support of large-scale computing. The protocol uses parallel rather than serial methods to process requests. That makes resource allocation more efficient and solves the problems of deadlock and livelock caused by multiple concurrent Internet applications competing for Internet resources. &nbsp;&#187;&nbsp;<a href='http://www.sciencedaily.com/releases/2003/01/030130081144.htm'>original news</a>]]></description>
	</item>

	<item>
		<title><![CDATA[Bee Strategy Helps Servers Run More Sweetly]]></title>
		<link>http://clustercenter.org/loadbalancing/Bee-Strategy-Helps-Servers-Run-More-Sweetly/</link>
		<comments>http://clustercenter.org/loadbalancing/Bee-Strategy-Helps-Servers-Run-More-Sweetly/</comments>
		<pubDate>Sat, 24 Nov 2007 13:19:03 +0800</pubDate>
		<dc:creator>wensong</dc:creator>
		<category>loadbalancing</category>
		<guid>http://clustercenter.org/loadbalancing/Bee-Strategy-Helps-Servers-Run-More-Sweetly/</guid>
		<description><![CDATA[Honeybees somehow manage to efficiently collect a lot of nectar with limited resources and no central command - after all, the queen bee is too busy laying eggs to oversee something as mundane as where the best nectar can be found on any given morning. According to new research from the Georgia Institute of Technology, the swarm intelligence of these amazingly organized bees can also be used to improve the efficiency of Internet servers faced with similar challenges. A bee dance-inspired communications system developed by Georgia Tech helps Internet servers that would normally be devoted solely to one task move between tasks as needed, reducing the chances that a Web site could be overwhelmed with requests and lock out potential users and customers. Compared with the way server banks are commonly run, the honeybee method typically improves service by 4 percent to 25 percent in tests based on real Internet traffic. The research was published in the journal Bioinspiration and Biomimetics. &nbsp;&#187;&nbsp;<a href='http://www.sciencedaily.com/releases/2007/11/071116133551.htm'>original news</a>]]></description>
	</item>

	<item>
		<title><![CDATA[Hbase: Bigtable-like structured storage for Hadoop HDFS]]></title>
		<link>http://clustercenter.org/software/Hbase-Bigtable-like-structured-storage-Hadoop-HDFS/</link>
		<comments>http://clustercenter.org/software/Hbase-Bigtable-like-structured-storage-Hadoop-HDFS/</comments>
		<pubDate>Fri, 09 Nov 2007 22:41:47 +0800</pubDate>
		<dc:creator>wensong</dc:creator>
		<category>software</category>
		<guid>http://clustercenter.org/software/Hbase-Bigtable-like-structured-storage-Hadoop-HDFS/</guid>
		<description><![CDATA[Google's Bigtable, a distributed storage system for structured data, is a very effective mechanism for storing very large amounts of data in a distributed environment. Just as Bigtable leverages the distributed data storage provided by the Google File System, Hbase will provide Bigtable-like capabilities on top of Hadoop. Data is organized into tables, rows and columns, but a query language like SQL is not supported. Instead, an Iterator-like interface is available for scanning through a row range (and of course there is an ability to retrieve a column value for a specific key). Any particular column may have multiple values for the same row key. A secondary key can be provided to select a particular value, or an Iterator can be set up to scan through the key-value pairs for that column given a specific row key. &nbsp;&#187;&nbsp;<a href='http://wiki.apache.org/lucene-hadoop/Hbase'>original news</a>]]></description>
	</item>

	<item>
		<title><![CDATA[The High Availability Linux Project]]></title>
		<link>http://clustercenter.org/software/High-Availability-Linux-Project/</link>
		<comments>http://clustercenter.org/software/High-Availability-Linux-Project/</comments>
		<pubDate>Sun, 04 Nov 2007 23:51:17 +0800</pubDate>
		<dc:creator>wensong</dc:creator>
		<category>software</category>
		<guid>http://clustercenter.org/software/High-Availability-Linux-Project/</guid>
		<description><![CDATA[It's the home of the famous Heartbeat software. The basic goal of the High Availability Linux project is to provide a high availability (clustering) solution for Linux which promotes reliability, availability, and serviceability (RAS) through a community development effort. The Linux-HA project is a widely used and important component in many interesting High Availability solutions, and ranks among the best HA software packages for any platform. We estimate that we currently have more than thirty thousand installations in mission-critical use in the real world since 1999. Interest in this project continues to grow. These web pages average nearly 20,000 hits per day, and we see more than 100 downloads of Heartbeat per day. &nbsp;&#187;&nbsp;<a href='http://linux-ha.org/'>original news</a>]]></description>
	</item>

	<item>
		<title><![CDATA[What Drives Performance in HPC]]></title>
		<link>http://clustercenter.org/computing/What-Drives-Performance-in-HPC/</link>
		<comments>http://clustercenter.org/computing/What-Drives-Performance-in-HPC/</comments>
		<pubDate>Wed, 24 Oct 2007 23:14:04 +0800</pubDate>
		<dc:creator>wensong</dc:creator>
		<category>computing</category>
		<guid>http://clustercenter.org/computing/What-Drives-Performance-in-HPC/</guid>
		<description><![CDATA[What drives performance in HPC? That's a good question. What does drive performance in HPC? On a qualitative basis it's easy to answer. A faster processor and memory. More memory. A better network or disk I/O subsystem. Unfortunately, those answers are rarely specific enough when faced with purchase decisions for a Linux cluster. This article is designed to support and expand on the What Drives Performance in HPC Webinar presented in June 2007, which outlined a quantitative approach to performance. &nbsp;&#187;&nbsp;<a href='http://www.linux-mag.com/launchpad/business-class-hpc/main/4170'>original news</a>]]></description>
	</item>

	<item>
		<title><![CDATA[Bigtable: A Distributed Storage System for Structured Data]]></title>
		<link>http://clustercenter.org/storage/Bigtable-Distributed-Storage-System-Structured-Data-1/</link>
		<comments>http://clustercenter.org/storage/Bigtable-Distributed-Storage-System-Structured-Data-1/</comments>
		<pubDate>Mon, 22 Oct 2007 12:26:18 +0800</pubDate>
		<dc:creator>wensong</dc:creator>
		<category>storage</category>
		<guid>http://clustercenter.org/storage/Bigtable-Distributed-Storage-System-Structured-Data-1/</guid>
		<description><![CDATA[Bigtable is a distributed storage system for managing structured data that is designed to scale to a very large size: petabytes of data across thousands of commodity servers. Many projects at Google store data in Bigtable, including web indexing, Google Earth, and Google Finance. These applications place very different demands on Bigtable, both in terms of data size (from URLs to web pages to satellite imagery) and latency requirements (from backend bulk processing to real-time data serving). Despite these varied demands, Bigtable has successfully provided a flexible, high-performance solution for all of these Google products. In this paper we describe the simple data model provided by Bigtable, which gives clients dynamic control over data layout and format, and we describe the design and implementation of Bigtable. See also the BigTable article at Wikipedia (http://en.wikipedia.org/wiki/BigTable) for an introduction. &nbsp;&#187;&nbsp;<a href='http://labs.google.com/papers/bigtable.html'>original news</a>]]></description>
	</item>

	<item>
		<title><![CDATA[Cleversafe Open Source Community]]></title>
		<link>http://clustercenter.org/storage/Cleversafe-Open-Source-Community/</link>
		<comments>http://clustercenter.org/storage/Cleversafe-Open-Source-Community/</comments>
		<pubDate>Mon, 22 Oct 2007 00:09:23 +0800</pubDate>
		<dc:creator>wensong</dc:creator>
		<category>storage</category>
		<guid>http://clustercenter.org/storage/Cleversafe-Open-Source-Community/</guid>
		<description><![CDATA[Cleversafe offers an open source dispersed storage network solution that works over a WAN. Cleversafe uses Cauchy Reed-Solomon Information Dispersal Algorithms (IDAs) to separate data into unrecognizable Data Slices and distribute them, via secure Internet connections, to multiple storage locations on a Dispersed Storage Network (dsNet). With Dispersed Storage, transmission and storage of data is inherently private and secure. No single entire copy of the data is in one location, and only some of the slices need to be available in order to perfectly retrieve the data. Data on the dsNet remains private and secure in the face of natural catastrophes, or failures of hardware, connection, facility or IT management. Moreover, the individual data slices do not carry enough information for an unauthorized viewer to determine the original content. &nbsp;&#187;&nbsp;<a href='http://www.cleversafe.org/'>original news</a>]]></description>
	</item>

	<item>
		<title><![CDATA[The Google File System]]></title>
		<link>http://clustercenter.org/storage/Google-File-System/</link>
		<comments>http://clustercenter.org/storage/Google-File-System/</comments>
		<pubDate>Sat, 13 Oct 2007 22:52:32 +0800</pubDate>
		<dc:creator>wensong</dc:creator>
		<category>storage</category>
		<guid>http://clustercenter.org/storage/Google-File-System/</guid>
		<description><![CDATA[The Google File System is a scalable distributed file system for large distributed data-intensive applications. It provides fault tolerance while running on inexpensive commodity hardware, and it delivers high aggregate performance to a large number of clients. Like other distributed file systems, it has master nodes to store metadata and chunk servers to store data. It's interesting to read about the design choices made for Google's data-intensive applications; for example, the size of each chunk is 64 MB, compared to the usual 4 KB block size in a Unix file system. &nbsp;&#187;&nbsp;<a href='http://labs.google.com/papers/gfs.html'>original news</a>]]></description>
	</item>

	<item>
		<title><![CDATA[Redundant Array of Independent Filesystems]]></title>
		<link>http://clustercenter.org/software/Redundant-Array-Independent-Filesystems/</link>
		<comments>http://clustercenter.org/software/Redundant-Array-Independent-Filesystems/</comments>
		<pubDate>Thu, 04 Oct 2007 13:50:17 +0800</pubDate>
		<dc:creator>wensong</dc:creator>
		<category>software</category>
		<guid>http://clustercenter.org/software/Redundant-Array-Independent-Filesystems/</guid>
		<description><![CDATA[Using the fan-out infrastructure we have, we are developing a file system that has the same redundancy characteristics as RAID, but at the VFS level. We call this Redundant Arrays of Independent Filesystems (RAIF). RAIF support includes striping (RAIF0), mirroring (RAIF1), parity (RAIF5 and RAIF6), and other modes. RAIF allows redundancy and performance increases on many types of file systems, including NFS. The RAIF logic is at the VFS layer, so file-level knowledge is used to decide on the right level of redundancy that best matches the value of the file in question. Moreover, recovery from failures can be done on a per-file basis instead of for the whole disk drive; this means that if a recovery failed mid-way, it can be resumed without having to restart the entire disk device's recovery. The project was started in April 2004 in the Filesystems and Storage Laboratory of Stony Brook University. RAIF can be compiled as an external Linux loadable kernel module for a wide range of Linux kernel versions. &nbsp;&#187;&nbsp;<a href='http://www.fsl.cs.sunysb.edu/project-raif.html'>original news</a>]]></description>
	</item>

	<item>
		<title><![CDATA[Hadoop: a distributed computing platform]]></title>
		<link>http://clustercenter.org/software/Hadoop-distributed-computing-platform/</link>
		<comments>http://clustercenter.org/software/Hadoop-distributed-computing-platform/</comments>
		<pubDate>Thu, 04 Oct 2007 13:39:27 +0800</pubDate>
		<dc:creator>wensong</dc:creator>
		<category>software</category>
		<guid>http://clustercenter.org/software/Hadoop-distributed-computing-platform/</guid>
		<description><![CDATA[Hadoop is a software platform that lets one easily write and run applications that process vast amounts of data. Here's what makes Hadoop especially useful: * Scalable: Hadoop can reliably store and process petabytes. * Economical: It distributes the data and processing across clusters of commonly available computers. These clusters can number into the thousands of nodes. * Efficient: By distributing the data, Hadoop can process it in parallel on the nodes where the data is located. This makes it extremely rapid. * Reliable: Hadoop automatically maintains multiple copies of data and automatically redeploys computing tasks based on failures. Hadoop implements MapReduce, using the Hadoop Distributed File System (HDFS). MapReduce divides applications into many small blocks of work. HDFS creates multiple replicas of data blocks for reliability, placing them on compute nodes around the cluster. MapReduce can then process the data where it is located. Hadoop has been demonstrated on clusters with 2000 nodes. The current design target is 10,000 node clusters. Hadoop is a Lucene sub-project that contains the distributed computing platform that was formerly a part of Nutch. &nbsp;&#187;&nbsp;<a href='http://lucene.apache.org/hadoop/'>original news</a>]]></description>
	</item>

	<item>
		<title><![CDATA[The Hadoop Distributed File System: Architecture and Design]]></title>
		<link>http://clustercenter.org/storage/Hadoop-Distributed-File-System-Architecture-Design/</link>
		<comments>http://clustercenter.org/storage/Hadoop-Distributed-File-System-Architecture-Design/</comments>
		<pubDate>Thu, 04 Oct 2007 13:22:26 +0800</pubDate>
		<dc:creator>wensong</dc:creator>
		<category>storage</category>
		<guid>http://clustercenter.org/storage/Hadoop-Distributed-File-System-Architecture-Design/</guid>
		<description><![CDATA[The Hadoop Distributed File System (HDFS) is a distributed file system designed to run on commodity hardware. It has many similarities with existing distributed file systems. However, the differences from other distributed file systems are significant. HDFS is highly fault-tolerant and is designed to be deployed on low-cost hardware. HDFS provides high throughput access to application data and is suitable for applications that have large data sets. HDFS relaxes a few POSIX requirements to enable streaming access to file system data. HDFS was originally built as infrastructure for the Apache Nutch web search engine project. HDFS is part of the Apache Hadoop project, which is part of the Apache Lucene project. &nbsp;&#187;&nbsp;<a href='http://lucene.apache.org/hadoop/hdfs_design.html'>original news</a>]]></description>
	</item>

	<item>
		<title><![CDATA[MOSIX Cluster and Grid Management]]></title>
		<link>http://clustercenter.org/computing/MOSIX-Cluster-Grid-Management-1/</link>
		<comments>http://clustercenter.org/computing/MOSIX-Cluster-Grid-Management-1/</comments>
		<pubDate>Fri, 21 Sep 2007 11:12:50 +0800</pubDate>
		<dc:creator>wensong</dc:creator>
		<category>computing</category>
		<guid>http://clustercenter.org/computing/MOSIX-Cluster-Grid-Management-1/</guid>
		<description><![CDATA[MOSIX is a management system targeted for High Performance Computing (HPC) on x86 based Linux clusters and organizational grids of multiple clusters. MOSIX incorporates dynamic resource discovery and automatic workload distribution, commonly found on single computers with multiple processors. &nbsp;&#187;&nbsp;<a href='http://www.mosix.org/'>original news</a>]]></description>
	</item>

	<item>
		<title><![CDATA[How To Design A Good API and Why it Matters]]></title>
		<link>http://clustercenter.org/other/How-To-Design-Good-API-Why-it-Matters/</link>
		<comments>http://clustercenter.org/other/How-To-Design-Good-API-Why-it-Matters/</comments>
		<pubDate>Tue, 28 Aug 2007 16:57:11 +0800</pubDate>
		<dc:creator>wensong</dc:creator>
		<category>other</category>
		<guid>http://clustercenter.org/other/How-To-Design-Good-API-Why-it-Matters/</guid>
		<description><![CDATA[Every day around the world, software developers spend much of their time working with a variety of Application Programming Interfaces (APIs). Some are integral to the core platform, some provide access to widely distributed frameworks, and some are written in-house for use by a few developers. Nearly all programmers occasionally function as API designers, whether they know it or not. A well-designed API can be a great asset to the organization that wrote it and to all who use it. Good APIs increase the pleasure and productivity of the developers who use them, the quality of the software they produce, and ultimately, the corporate bottom line. Conversely, poorly written APIs are a constant thorn in the developer's side, and have been known to harm the bottom line to the point of bankruptcy. Given the importance of good API design, surprisingly little has been written on the subject. In this talk, I'll attempt to help you recognize good and bad APIs and I'll offer specific suggestions for writing good ones. This talk is part of the Advanced Topics in Programming Series at Google. &nbsp;&#187;&nbsp;<a href='http://www.infoq.com/presentations/effective-api-design'>original news</a>]]></description>
	</item>

	<item>
		<title><![CDATA[Technology Review: Building a Better Search Engine]]></title>
		<link>http://clustercenter.org/other/Technology-Review-Building-Better-Search-Engine/</link>
		<comments>http://clustercenter.org/other/Technology-Review-Building-Better-Search-Engine/</comments>
		<pubDate>Tue, 31 Jul 2007 10:17:09 +0800</pubDate>
		<dc:creator>wensong</dc:creator>
		<category>other</category>
		<guid>http://clustercenter.org/other/Technology-Review-Building-Better-Search-Engine/</guid>
		<description><![CDATA[Powerset, Inc., based in San Francisco, is on the verge of offering an innovative natural-language search engine, based on linguistic research at the Palo Alto Research Center (PARC). The engine does more than merely accept queries asked in the form of a question. The company claims that the engine finds the best answer by considering the meaning and context of the question and related Web pages. &quot;Powerset extracts deep concepts and relationships from the texts, and the user's query and match them efficiently to deliver a better search,&quot; Powerset CEO Barney Pell says. Even though attempts have been made at natural-language search for decades, Powerset says that its system is different because it has solved some of the fundamental technological problems that have existed with this kind of search. It has done so by developing a product that is deep, computationally advanced, and still economically viable. &nbsp;&#187;&nbsp;<a href='http://www.technologyreview.com/Biztech/19109/?a=f'>original news</a>]]></description>
	</item>

	<item>
		<title><![CDATA[GFS - Google File System - SlideShare]]></title>
		<link>http://clustercenter.org/storage/GFS--Google-File-System--SlideShare/</link>
		<comments>http://clustercenter.org/storage/GFS--Google-File-System--SlideShare/</comments>
		<pubDate>Wed, 18 Jul 2007 11:47:31 +0800</pubDate>
		<dc:creator>wensong</dc:creator>
		<category>storage</category>
		<guid>http://clustercenter.org/storage/GFS--Google-File-System--SlideShare/</guid>
		<description><![CDATA[It's good to read these slides to get the whole picture before reading the Google File System paper. :) &nbsp;&#187;&nbsp;<a href='http://www.slideshare.net/tutchiio/gfs-google-file-system/'>original news</a>]]></description>
	</item>

	<item>
		<title><![CDATA[Technology at Digg - Mysql 2007 conference]]></title>
		<link>http://clustercenter.org/loadbalancing/Technology-at-Digg--Mysql-2007-conference/</link>
		<comments>http://clustercenter.org/loadbalancing/Technology-at-Digg--Mysql-2007-conference/</comments>
		<pubDate>Fri, 22 Jun 2007 13:28:58 +0800</pubDate>
		<dc:creator>wensong</dc:creator>
		<category>loadbalancing</category>
		<guid>http://clustercenter.org/loadbalancing/Technology-at-Digg--Mysql-2007-conference/</guid>
		<description><![CDATA[This presentation at the MySQL 2007 conference covers the history of the technology used at Digg, its system architecture, and its MySQL deployment. &nbsp;&#187;&nbsp;<a href='http://www.slideshare.net/epee/mysql-2007-tech-at-digg-v3'>original news</a>]]></description>
	</item>

	<item>
		<title><![CDATA[Today's HPC Clusters: Cooler. Faster. More Cost-Effective.]]></title>
		<link>http://clustercenter.org/computing/Todays-HPC-Clusters-Cooler--Faster--More-Cost-Effective-/</link>
		<comments>http://clustercenter.org/computing/Todays-HPC-Clusters-Cooler--Faster--More-Cost-Effective-/</comments>
		<pubDate>Tue, 19 Jun 2007 11:47:21 +0800</pubDate>
		<dc:creator>wensong</dc:creator>
		<category>computing</category>
		<guid>http://clustercenter.org/computing/Todays-HPC-Clusters-Cooler--Faster--More-Cost-Effective-/</guid>
		<description><![CDATA[Building affordable, balanced clusters without sacrificing performance is the challenge facing everyone in high-performance computing. But what really drives performance in today's HPC systems? Learn how the performance of HPC infrastructure can hinge on the components and technology solutions you choose to deploy, including: * Dual-core, quad-core and, ultimately, &quot;many-core&quot; processors; * Switch architecture; * Infiniband and Ethernet interconnects; * Memory bandwidth and latency; * I/O; and * Chip-to-chip communication. This webinar will give you the ability to optimize component and system level performance in both existing and newly built HPC infrastructure, making your clusters easier to design, deploy, manage and grow. Who should attend: HPC platform designers looking for guidance in performance optimization, and scientists and engineers interested in the key HPC trends of power, heat, space, and cost and how Intel Research is addressing these challenges. &nbsp;&#187;&nbsp;<a href='http://www.linux-mag.com/launchpad/business-class-hpc/main/3420'>original news</a>]]></description>
	</item>

	<item>
		<title><![CDATA[Internet pioneer to oversee its redesign under government contract]]></title>
		<link>http://clustercenter.org/other/Internet-pioneer-to-oversee-its-redesign-under-government-contract/</link>
		<comments>http://clustercenter.org/other/Internet-pioneer-to-oversee-its-redesign-under-government-contract/</comments>
		<pubDate>Tue, 22 May 2007 22:44:24 +0800</pubDate>
		<dc:creator>wensong</dc:creator>
		<category>other</category>
		<guid>http://clustercenter.org/other/Internet-pioneer-to-oversee-its-redesign-under-government-contract/</guid>
		<description><![CDATA[A new Internet could ultimately mean replacing networking equipment and rewriting software on computers, at a cost of billions of dollars. But any new network is likely to run parallel with the existing one for some time, with individuals and businesses gradually migrating over as they need more advanced applications. &nbsp;&#187;&nbsp;<a href='http://www.technologyreview.com/Wire/18765/'>original news</a>]]></description>
	</item>

	<item>
		<title><![CDATA[Inside MySpace.com]]></title>
		<link>http://clustercenter.org/loadbalancing/Inside-MySpace-com/</link>
		<comments>http://clustercenter.org/loadbalancing/Inside-MySpace-com/</comments>
		<pubDate>Fri, 04 May 2007 23:45:52 +0800</pubDate>
		<dc:creator>wensong</dc:creator>
		<category>loadbalancing</category>
		<guid>http://clustercenter.org/loadbalancing/Inside-MySpace-com/</guid>
		<description><![CDATA[Booming traffic demands put a constant stress on the social network's computing infrastructure. Yet, MySpace developers have repeatedly redesigned the Web site software, database and storage systems in an attempt to keep pace with exploding growth - the site now handles almost 40 billion page views a month. Most corporate Web sites will never have to bear more than a small fraction of the traffic MySpace handles, but anyone seeking to reach the mass market online can learn from its experience. Membership Milestones: * 500,000 Users: A Simple Architecture Stumbles * 1 Million Users: Vertical Partitioning Solves Scalability Woes * 3 Million Users: Scale-Out Wins Over Scale-Up * 9 Million Users: Site Migrates to ASP.NET, Adds Virtual Storage * 26 Million Users: MySpace Embraces 64-Bit Technology * What's Behind Those &quot;Unexpected Error&quot; Screens &nbsp;&#187;&nbsp;<a href='http://www.baselinemag.com/article2/0,1540,2082921,00.asp'>original news</a>]]></description>
	</item>

	<item>
		<title><![CDATA[Google offers its own changes to MySQL]]></title>
		<link>http://clustercenter.org/other/Google-offers-its-own-changes-to-MySQL/</link>
		<comments>http://clustercenter.org/other/Google-offers-its-own-changes-to-MySQL/</comments>
		<pubDate>Thu, 03 May 2007 20:49:36 +0800</pubDate>
		<dc:creator>wensong</dc:creator>
		<category>other</category>
		<guid>http://clustercenter.org/other/Google-offers-its-own-changes-to-MySQL/</guid>
		<description><![CDATA[Google long has been known to be a user of the open-source MySQL database software, but the search powerhouse this week published its own changes to the project. &quot;We think MySQL is a fantastic data storage solution, and as our projects push the requirements for the database in certain areas, we've made changes to enhance MySQL itself, mainly in the areas of high availability and manageability,&quot; Google software engineer Mark Callaghan said on the company's Google Code blog on Monday. &nbsp;&#187;&nbsp;<a href='http://news.com.com/8301-10784_3-9712307-7.html'>original news</a>]]></description>
	</item>

	<item>
		<title><![CDATA[MySQL hits 50 million revenue, plans IPO]]></title>
		<link>http://clustercenter.org/other/MySQL-hits-50-million-revenue-plans-IPO/</link>
		<comments>http://clustercenter.org/other/MySQL-hits-50-million-revenue-plans-IPO/</comments>
		<pubDate>Thu, 03 May 2007 20:42:27 +0800</pubDate>
		<dc:creator>wensong</dc:creator>
		<category>other</category>
		<guid>http://clustercenter.org/other/MySQL-hits-50-million-revenue-plans-IPO/</guid>
		<description><![CDATA[MySQL, purveyor of the open-source database of the same name, is on the road to becoming a publicly traded company, bolstered by $50 million in revenue in 2006. The company garnered about $50 million in revenue in 2006, Mickos said in an interview at the MySQL Conference and Expo here. That compares with $6.5 million in 2002 and about $34 million in 2005, according to earlier figures Mickos cited in a speech two years earlier. &nbsp;&#187;&nbsp;<a href='http://news.com.com/2100-7344_3-6179290.html'>original news</a>]]></description>
	</item>

	<item>
		<title><![CDATA[InterMezzo File System Home]]></title>
		<link>http://clustercenter.org/software/InterMezzo-File-System-Home/</link>
		<comments>http://clustercenter.org/software/InterMezzo-File-System-Home/</comments>
		<pubDate>Tue, 01 May 2007 12:49:45 +0800</pubDate>
		<dc:creator>wensong</dc:creator>
		<category>software</category>
		<guid>http://clustercenter.org/software/InterMezzo-File-System-Home/</guid>
		<description><![CDATA[InterMezzo is a new distributed file system with a focus on high availability. InterMezzo will be suitable for replication of servers, mobile computing, managing system software on large clusters, and for maintenance of high availability clusters. &nbsp;&#187;&nbsp;<a href='http://www.inter-mezzo.org/'>original news</a>]]></description>
	</item>

	<item>
		<title><![CDATA[Lustre: scalable, secure, robust, highly-available cluster file system]]></title>
		<link>http://clustercenter.org/software/Lustre-scalable-secure-robust-highly-available-cluster-file-system/</link>
		<comments>http://clustercenter.org/software/Lustre-scalable-secure-robust-highly-available-cluster-file-system/</comments>
		<pubDate>Tue, 01 May 2007 12:46:25 +0800</pubDate>
		<dc:creator>wensong</dc:creator>
		<category>software</category>
		<guid>http://clustercenter.org/software/Lustre-scalable-secure-robust-highly-available-cluster-file-system/</guid>
		<description><![CDATA[Lustre is a scalable, secure, robust, highly-available cluster file system. It is designed, developed and maintained by Cluster File Systems, Inc. The central goal is the development of a next-generation cluster file system which can serve clusters with 10,000's of nodes, petabytes of storage, move 100's of GB/sec with state of the art security and management infrastructure. Lustre runs today on many of the largest Linux clusters in the world, and is included by CFS's partners as a core component of their cluster offering (examples include HP StorageWorks SFS, and the Cray XT3 and XD1 supercomputers). Today's users have also demonstrated that Lustre scales down as well as it scales up, and runs in production on clusters as small as 4 and as large as 15,000 nodes. &nbsp;&#187;&nbsp;<a href='http://lustre.org/'>original news</a>]]></description>
	</item>

	<item>
		<title><![CDATA[MogileFS - an open source distributed filesystem]]></title>
		<link>http://clustercenter.org/software/MogileFS--open-source-distribution-filesystem-1/</link>
		<comments>http://clustercenter.org/software/MogileFS--open-source-distribution-filesystem-1/</comments>
		<pubDate>Sat, 28 Apr 2007 17:47:19 +0800</pubDate>
		<dc:creator>wensong</dc:creator>
		<category>software</category>
		<guid>http://clustercenter.org/software/MogileFS--open-source-distribution-filesystem-1/</guid>
		<description><![CDATA[MogileFS is an open source distributed filesystem. It has many features: * Application level -- no special kernel modules required * No single point of failure * Automatic file replication * &quot;Better than RAID&quot; * Flat namespace * Shared-nothing * No RAID required * Local filesystem agnostic. It has already been used in heavily loaded production systems. &nbsp;&#187;&nbsp;<a href='http://www.danga.com/mogilefs/'>original news</a>]]></description>
	</item>

	<item>
		<title><![CDATA[Grid Datafarm - Gfarm file system]]></title>
		<link>http://clustercenter.org/software/Grid-Datafarm--Gfarm-file-system/</link>
		<comments>http://clustercenter.org/software/Grid-Datafarm--Gfarm-file-system/</comments>
		<pubDate>Wed, 25 Apr 2007 22:29:54 +0800</pubDate>
		<dc:creator>wensong</dc:creator>
		<category>software</category>
		<guid>http://clustercenter.org/software/Grid-Datafarm--Gfarm-file-system/</guid>
		<description><![CDATA[&quot;Gfarm file system is a next-generation network shared file system, which will be an alternative solution of NFS, and will meet a demand for much larger, much reliable, and much faster file system.&quot; Judging from its documentation, it looks promising. &nbsp;&#187;&nbsp;<a href='http://datafarm.apgrid.org/'>original news</a>]]></description>
	</item>

	<item>
		<title><![CDATA[MySQL Conference Expo 2007]]></title>
		<link>http://clustercenter.org/database/MySQL-Conference-Expo-2007/</link>
		<comments>http://clustercenter.org/database/MySQL-Conference-Expo-2007/</comments>
		<pubDate>Tue, 17 Apr 2007 23:07:13 +0800</pubDate>
		<dc:creator>wensong</dc:creator>
		<category>database</category>
		<guid>http://clustercenter.org/database/MySQL-Conference-Expo-2007/</guid>
		<description><![CDATA[MySQL Conference &amp; Expo 2007 will be held in Santa Clara on April 23-26, 2007. The conference has a lot of very good technical sessions and tutorials, including MySQL performance tuning, MySQL Cluster, MySQL replication, etc. &nbsp;&#187;&nbsp;<a href='http://mysqlconf.com/pub/w/54/sessions.html'>original news</a>]]></description>
	</item>

	<item>
		<title><![CDATA[Video: Tim Berners-Lee on the Semantic Web]]></title>
		<link>http://clustercenter.org/other/Video-Tim-Berners-Lee-on-Semantic-Web/</link>
		<comments>http://clustercenter.org/other/Video-Tim-Berners-Lee-on-Semantic-Web/</comments>
		<pubDate>Thu, 12 Apr 2007 18:20:58 +0800</pubDate>
		<dc:creator>wensong</dc:creator>
		<category>other</category>
		<guid>http://clustercenter.org/other/Video-Tim-Berners-Lee-on-Semantic-Web/</guid>
		<description><![CDATA[The Semantic Web is well under way and could have an impact even greater than the Web that we all use every day, predicts Tim Berners-Lee, director of the World Wide Web Consortium and senior researcher at MIT's Computer Science and Artificial Intelligence Laboratory. Berners-Lee says (in this video) that the Semantic Web, which he describes as a &quot;web of data&quot; in contrast to today's &quot;web of documents,&quot; has great potential in giving a user the ability to see, understand, and manipulate data. He points to applications in medicine, in reacting to civil and health emergencies, and even in such mundane tasks as knowing where your friends are in relation to the nearest coffee shop. &nbsp;&#187;&nbsp;<a href='http://www.technologyreview.com/Infotech/18451/'>original news</a>]]></description>
	</item>

	<item>
		<title><![CDATA[Cluster 2007]]></title>
		<link>http://clustercenter.org/other/Cluster-2007/</link>
		<comments>http://clustercenter.org/other/Cluster-2007/</comments>
		<pubDate>Mon, 09 Apr 2007 23:11:06 +0800</pubDate>
		<dc:creator>wensong</dc:creator>
		<category>other</category>
		<guid>http://clustercenter.org/other/Cluster-2007/</guid>
		<description><![CDATA[The IEEE Cluster 2007 conference is the information event that will link you with new ideas, advanced technologies, experience reports, and developers to enhance your understanding and allow you to exploit the advances in cluster technologies. The call for papers is now open. &nbsp;&#187;&nbsp;<a href='http://www.cluster2007.org/'>original news</a>]]></description>
	</item>

	<item>
		<title><![CDATA[ClusterMonkey]]></title>
		<link>http://clustercenter.org/computing/ClusterMonkey/</link>
		<comments>http://clustercenter.org/computing/ClusterMonkey/</comments>
		<pubDate>Fri, 30 Mar 2007 09:46:49 +0800</pubDate>
		<dc:creator>wensong</dc:creator>
		<category>computing</category>
		<guid>http://clustercenter.org/computing/ClusterMonkey/</guid>
		<description><![CDATA[Cluster Monkey is a good resource for cluster computing. &nbsp;&#187;&nbsp;<a href='http://www.clustermonkey.net//'>original news</a>]]></description>
	</item>

	<item>
		<title><![CDATA[Technology Review: A Fresh Start for the Internet]]></title>
		<link>http://clustercenter.org/other/Technology-Review-Fresh-Start--Internet/</link>
		<comments>http://clustercenter.org/other/Technology-Review-Fresh-Start--Internet/</comments>
		<pubDate>Thu, 29 Mar 2007 23:46:00 +0800</pubDate>
		<dc:creator>wensong</dc:creator>
		<category>other</category>
		<guid>http://clustercenter.org/other/Technology-Review-Fresh-Start--Internet/</guid>
		<description><![CDATA[Researchers at Stanford University are on a mission to completely revamp the Internet. Plans for their multipart program, called the Clean Slate Design for the Internet, will be presented to the public this Wednesday at the school's annual Computer Forum. Ultimately, the researchers hope to make the Internet safer, more transparent, and more reliable by reconsidering both private and public networks. &nbsp;&#187;&nbsp;<a href='http://www.technologyreview.com/Infotech/18397/'>original news</a>]]></description>
	</item>

	<item>
		<title><![CDATA[Technology Review: Part I: A Smarter Web]]></title>
		<link>http://clustercenter.org/other/Technology-Review-Part-I-Smarter-Web/</link>
		<comments>http://clustercenter.org/other/Technology-Review-Part-I-Smarter-Web/</comments>
		<pubDate>Thu, 29 Mar 2007 23:28:42 +0800</pubDate>
		<dc:creator>wensong</dc:creator>
		<category>other</category>
		<guid>http://clustercenter.org/other/Technology-Review-Part-I-Smarter-Web/</guid>
		<description><![CDATA[The article gives a very good introduction to the Semantic Web, its history, and how it will help to build a smarter web. &nbsp;&#187;&nbsp;<a href='http://www.technologyreview.com/Infotech/18395/'>original news</a>]]></description>
	</item>

	<item>
		<title><![CDATA[Red Hat Enterprise Linux 5 is out]]></title>
		<link>http://clustercenter.org/other/Red-Hat-Enterprise-Linux-5-is-out/</link>
		<comments>http://clustercenter.org/other/Red-Hat-Enterprise-Linux-5-is-out/</comments>
		<pubDate>Sat, 17 Mar 2007 13:54:27 +0800</pubDate>
		<dc:creator>wensong</dc:creator>
		<category>other</category>
		<guid>http://clustercenter.org/other/Red-Hat-Enterprise-Linux-5-is-out/</guid>
		<description><![CDATA[The long-awaited Red Hat Enterprise Linux 5 is finally out. Red Hat Enterprise Linux 5 is supposed to be more robust, faster, more feature-rich, and easier to operate. Kernel and Performance:    * Based on the Linux 2.6.18 kernel    * Support for multi-core processors    * Broad range of new hardware support    * Updated crash dump capability provided by Kexec/Kdump    * Support for Intel Network Accelerator Technology (IOAT)    * Numerous enhancements for large SMP systems    * Enhanced pipe buffering    * IPv4/IPv6 fragmentation offload and buffer management    * Dynamically switchable per-queue I/O schedulers    * Kernel buffer splice capability for improved I/O buffer operations &nbsp;&#187;&nbsp;<a href='http://www.redhat.com/rhel/'>original news</a>]]></description>
	</item>

	<item>
		<title><![CDATA[LiveJournal's Backend: A history of scaling]]></title>
		<link>http://clustercenter.org/loadbalancing/LiveJournals-Backend-history-scaling/</link>
		<comments>http://clustercenter.org/loadbalancing/LiveJournals-Backend-history-scaling/</comments>
		<pubDate>Fri, 16 Mar 2007 23:43:58 +0800</pubDate>
		<dc:creator>wensong</dc:creator>
		<category>loadbalancing</category>
		<guid>http://clustercenter.org/loadbalancing/LiveJournals-Backend-history-scaling/</guid>
		<description><![CDATA[Brad Fitzpatrick, President and CTO of LiveJournal.com, has a very good presentation about LiveJournal's backend systems. The presentation has 80 slides and includes a lot of valuable information about the system architecture for scalability and high availability, databases, load balancing, caching, and distributed file systems. It's really worth reading. &nbsp;&#187;&nbsp;<a href='http://danga.com/words/2005_oscon/'>original news</a>]]></description>
	</item>

</channel>
</rss>

