On Approximate Range Mode and Range Selection
For any ϵ∈ (0,1), a (1+ϵ)-approximate range mode query asks for the position of an element whose frequency in the query range is at most a factor (1+ϵ) smaller than the true mode. For this problem, we design an O(n/ϵ) bit data structure supporting queries in O((1/ϵ)) time. This is an encoding data structure which does not require access to the input sequence; we prove the space cost is asymptotically optimal for constant ϵ. Our solution improves the previous best result of Greve et al. (Cell Probe Lower Bounds and Approximations for Range Mode, ICALP'10) by reducing the space cost by a factor of n while achieving the same query time. We also design an O(n)-word dynamic data structure that answers queries in O( n / n) time and supports insertions and deletions in O( n) time, for any constant ϵ∈ (0,1). This is the first result on dynamic approximate range mode; it can also be used to obtain the first static data structure for approximate 3-sided range mode queries in two dimensions. We also consider approximate range selection. For any α∈ (0,1/2), an α-approximate range selection query asks for the position of an element whose rank in the query range is in [k - α s, k + α s], where k is a rank given by the query and s is the size of the query range. When α is a constant, we design an O(n)-bit encoding data structure that can answer queries in constant time and prove this space cost is asymptotically optimal. The previous best result by Krizanc et al. (Range Mode and Range Median Queries on Lists and Trees, Nordic Journal of Computing, 2005) uses O(n n) bits, or O(n) words, to achieve constant approximation for range median only. Thus we not only improve the space cost, but also provide support for any arbitrary k given at query time.
READ FULL TEXT