The MDDS-supported Petaplex project run by Knowledge Systems Inc. deals with scaling of hypertext / information retrieval systems to handle petabytes of data and millions of users.
Petaplex Architecture with Numerous Servers, of Several Types
The distributed systems will be grouped into clusters, each containing numerous smart disks.