 
 
 
 
 
 
   
 and is located at the HP site,
while the nodes-receivers are at different university sites. In order
to perform the sensitivity analysis, we vary the  number of
participating hosts: in experiments with
 and is located at the HP site,
while the nodes-receivers are at different university sites. In order
to perform the sensitivity analysis, we vary the  number of
participating hosts: in experiments with  participating hosts  
in replication set, the receivers are
 participating hosts  
in replication set, the receivers are 
 ordered as shown 
in Table 1.
 ordered as shown 
in Table 1.
|  | 
|  | 
|  | 
 to
 to  nodes, it uses
 nodes, it uses  Internet paths
connecting the source nodes to the recipient nodes (while sending only
one packet over each common link in those paths).  Thus the overall
performance is defined by the end-to-end properties of the
 Internet paths
connecting the source nodes to the recipient nodes (while sending only
one packet over each common link in those paths).  Thus the overall
performance is defined by the end-to-end properties of the  paths. Congestion in any of those paths impacts the overall
performance of the Sequential Unicast. FastReplica uses
the same
paths. Congestion in any of those paths impacts the overall
performance of the Sequential Unicast. FastReplica uses
the same  paths between the source and recipient nodes to 
transfer only
 paths between the source and recipient nodes to 
transfer only  -th of file
-th of file  .  FastReplica takes
advantage of using the additional
.  FastReplica takes
advantage of using the additional 
 paths between the
nodes in the replication set, and each of those paths is used for
sending
 paths between the
nodes in the replication set, and each of those paths is used for
sending  -th of file
-th of file  .  Thus, the congestion in any of
those paths impacts FastReplica performance for transfer of only
the
.  Thus, the congestion in any of
those paths impacts FastReplica performance for transfer of only
the  -th of file
-th of file  .
While the average replication time provides an interesting metric for
distribution strategy characterization, the metric representing the
maximum replication time is critical, because it reflects the worst
case of the replication time among the recipient nodes.
Figure 12 shows the maximum replication time for
experiments with 4, 6, and 8 recipient nodes in a replication set and
files of different sizes. The maximum replication times under 
Multiple Unicast, as well as Sequential Unicast, are much
higher than the corresponding average times for these strategies.  For
a case of 8 nodes in the replication set, the maximum times under 
Multiple Unicast and Sequential Unicast are almost 2 times
higher than the corresponding average times. The reason is that there
is a very limited bandwidth on the path from the source node
.
While the average replication time provides an interesting metric for
distribution strategy characterization, the metric representing the
maximum replication time is critical, because it reflects the worst
case of the replication time among the recipient nodes.
Figure 12 shows the maximum replication time for
experiments with 4, 6, and 8 recipient nodes in a replication set and
files of different sizes. The maximum replication times under 
Multiple Unicast, as well as Sequential Unicast, are much
higher than the corresponding average times for these strategies.  For
a case of 8 nodes in the replication set, the maximum times under 
Multiple Unicast and Sequential Unicast are almost 2 times
higher than the corresponding average times. The reason is that there
is a very limited bandwidth on the path from the source node  to
the recipient node
 to
the recipient node  . The performance of this path is practically the
same for both Multiple Unicast and Sequential
Unicast. This path defines the worst (maximum) replication time among
all the recipient nodes in the set.  Since FastReplica uses this
path to transfer only
. The performance of this path is practically the
same for both Multiple Unicast and Sequential
Unicast. This path defines the worst (maximum) replication time among
all the recipient nodes in the set.  Since FastReplica uses this
path to transfer only  -th of file
-th of file  , this ``bad'' path has
a very limited impact on maximum replication time and overall
performance of FastReplica.
Figure 13 shows how close the average and maximum
replication times under FastReplica are.  These results
demonstrate the robustness and predictability of performance results
under the new strategy.
, this ``bad'' path has
a very limited impact on maximum replication time and overall
performance of FastReplica.
Figure 13 shows how close the average and maximum
replication times under FastReplica are.  These results
demonstrate the robustness and predictability of performance results
under the new strategy.
|  | 
 and the receiver node
 and the receiver node  can be observed from these
measurements, and it severely impacts the overall performance of both
Multiple Unicast and Sequential Unicast.  The file replication
times under FastReplica across different nodes in the replication
set are  much more stable and predictable since each node performance is
defined by the bandwidth of
 can be observed from these
measurements, and it severely impacts the overall performance of both
Multiple Unicast and Sequential Unicast.  The file replication
times under FastReplica across different nodes in the replication
set are  much more stable and predictable since each node performance is
defined by the bandwidth of  paths, each transferring
 paths, each transferring  -th of the original file
-th of the original file  .
.
|  | 
|  | 
|  | 
 ,  the ``border'' file size, where
FastReplica works most efficiently, may increase correspondingly.
,  the ``border'' file size, where
FastReplica works most efficiently, may increase correspondingly.  
|  | 
 (hp.com) and nodes
 (hp.com) and nodes  (universities'
machines) is significantly lower than the cross bandwidth between
nodes
 (universities'
machines) is significantly lower than the cross bandwidth between
nodes  . only node
. only node  has a limited incoming
bandwidth from all the nodes
 has a limited incoming
bandwidth from all the nodes 
 , while the outgoing
bandwidth from node
, while the outgoing
bandwidth from node  to
 to  is  again significantly higher.
In such a configuration, FastReplica utilizes the abundance of
additional available bandwidth between the replication nodes in the
most efficient way to produce the spectacular results.
It is interesting to see how FastReplica would perform when a
different node with high bandwidth paths to the rest of the nodes is
used as the origin node.  We changed the configuration and made
node
 is  again significantly higher.
In such a configuration, FastReplica utilizes the abundance of
additional available bandwidth between the replication nodes in the
most efficient way to produce the spectacular results.
It is interesting to see how FastReplica would perform when a
different node with high bandwidth paths to the rest of the nodes is
used as the origin node.  We changed the configuration and made
node  (utexas.edu) to be the origin node, and rerun the
experiments again.
Figure 18 shows the average and
maximum speedup of file replication time under the proposed 
FastReplica in the small relative to the replication time of 
Multiple Unicast for files of 1.5 MB, 9 MB, and 36 MB, and a different
number of nodes in the replication set in the new configuration.
In the new configuration, the average replication times under 
FastReplica and Multiple Unicast are similar, but the maximum
replication time under FastReplica is still significantly better
than the maximum replication time under Multiple Unicast.
The bandwidth analysis reveals that node utexas.edu is connected
to the rest of the nodes via high bandwidth paths with low bandwidth
variation across these paths. Our analysis in Section 3.3
with a specially designed example, where the bandwidth matrix
 (utexas.edu) to be the origin node, and rerun the
experiments again.
Figure 18 shows the average and
maximum speedup of file replication time under the proposed 
FastReplica in the small relative to the replication time of 
Multiple Unicast for files of 1.5 MB, 9 MB, and 36 MB, and a different
number of nodes in the replication set in the new configuration.
In the new configuration, the average replication times under 
FastReplica and Multiple Unicast are similar, but the maximum
replication time under FastReplica is still significantly better
than the maximum replication time under Multiple Unicast.
The bandwidth analysis reveals that node utexas.edu is connected
to the rest of the nodes via high bandwidth paths with low bandwidth
variation across these paths. Our analysis in Section 3.3
with a specially designed example, where the bandwidth matrix  is defined by
equations (6), demonstrates that when the cross
bandwidth between some replication nodes is significantly lower than
the bandwidth of the original paths from
 is defined by
equations (6), demonstrates that when the cross
bandwidth between some replication nodes is significantly lower than
the bandwidth of the original paths from  to the recipient nodes
 to the recipient nodes
 then FastReplica improves the maximum
replication time but may have no significant improvement in average
replication time.
 then FastReplica improves the maximum
replication time but may have no significant improvement in average
replication time.
 
 
 
 
