com.actian.spark_vector.datastream
Affinities of each partition to host names represented as an Array
of Seq[String]
Vector end points
For each end point in endpoints
, a sequence of partition indices that are assigned to that end point
Get an assignment of partitions to hosts satisfying the properties described in the header
Get an assignment of partitions to hosts satisfying the properties described in the header
One sequence of partition indexes assigned to each host
Start measuring time and record it in acc
.
Start measuring time and record it in acc
.
This function should always be used in combination with profileEnd in a similar way with opening and closing
a sequence of brackets, where the code between a profile call and its corresponding profileEnd will have its
execution time measured and stored into acc
Finish profiling the current section of code, as determined by the most recent profile call
Initialize profiling
Initialize profiling
Print profile information using log
Print profile information using log
Class that contains the matching algorithm used to assign
RDD
partitions to Vector hosts, based onaffinities
.The algorithm used here tries to assign partitions to hosts for which they have affinity. For this reason only partitions that have affinity to at least one host are matched here, the others are assigned to a random node. Also, this algorithm aims to minimize the maximum number of partitions that a host will have assigned, i.e. the most data a host will process