data storage location

Guy Coates gmpc at sanger.ac.uk
Mon Sep 15 08:53:50 EDT 2003


> Does the application read all or only a subset of the data?
>   Is the subset predictable?  Externally by a scheduler?

Automatic data replication tied into the scheduler would be really nice:
a data grid for Beowulf clusters. (Buzzword overload?)  The scheduler would
examine the jobs waiting in the queue, and then populate the cluster with
the required data. Frequently accessed datasets would get replicated to
more nodes than less popular ones. Of course, the devil is always in the
details.  One can imagine all sorts of nasty scenarios, such as someone
submitting a mass of short-running jobs, which kicks off a mass of
replication events, which then crushes your network.
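
To make the idea a bit more concrete, here is a rough sketch in Python of
what the planning step might look like. Everything in it is an assumption
for illustration only: the name plan_replication, the queue represented as
a plain list of dataset names (one entry per waiting job), and the
max_replicas and max_transfers knobs are all hypothetical and not taken
from any real scheduler. It gives popular datasets more replicas and stops
once a transfer budget is used up, which is one crude way to keep a flood
of short jobs from swamping the network.

```python
from collections import Counter


def plan_replication(queued_jobs, nodes, max_replicas, max_transfers):
    """Decide which nodes should receive a copy of each dataset.

    queued_jobs   -- list of dataset names, one per waiting job (assumed format)
    nodes         -- list of node names that can hold data
    max_replicas  -- replica count the most popular dataset could get
    max_transfers -- hard cap on copies started in this planning pass
    """
    demand = Counter(queued_jobs)      # how many waiting jobs want each dataset
    total = sum(demand.values())
    plan = {}
    transfers = 0
    next_node = 0                      # round-robin cursor over the node list

    # Most popular datasets are placed first, so they win if the budget runs out.
    for dataset, count in demand.most_common():
        # Replicas scale with the dataset's share of the queue, minimum one.
        wanted = max(1, round(max_replicas * count / total))
        # Never exceed the node count or the remaining transfer budget.
        wanted = min(wanted, len(nodes), max_transfers - transfers)
        if wanted <= 0:
            break                      # budget spent; leave the rest for later
        plan[dataset] = [nodes[(next_node + i) % len(nodes)]
                         for i in range(wanted)]
        next_node = (next_node + wanted) % len(nodes)
        transfers += wanted
    return plan


if __name__ == "__main__":
    queue = ["genome"] * 6 + ["est"] * 3 + ["misc"]
    print(plan_replication(queue, ["n1", "n2", "n3", "n4"], 4, 5))
```

The transfer cap is the interesting part: without it, the plan grows with
the queue, which is exactly the nasty scenario above. A real version would
also have to account for data already in place, disk space, and network
topology, none of which this sketch attempts.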

Cheers,

Guy Coates

-- 
Guy Coates,  Informatics System Group
The Wellcome Trust Sanger Institute, Hinxton, Cambridge, CB10 1SA, UK
Tel: +44 (0)1223 834244 ex 7199




_______________________________________________
Beowulf mailing list, Beowulf at beowulf.org
To change your subscription (digest mode or unsubscribe) visit http://www.beowulf.org/mailman/listinfo/beowulf


