public class AttributeStatistics
extends java.lang.Object
Contains statistical data about an attribute.
Some fields are valid only for numerical attributes and others only for string attributes. Some statistics are considered BASIC because computing them does not require traversing all the different values of the attribute. The documentation of each getter method states whether the statistic is BASIC. See the getAttributeStatistics method of the Graph class or the SPARKSEE User Manual for more details.
Modifier and Type | Method and Description |
---|---|
double | getAvgLengthString(): Gets the average length. |
long | getDistinct(): Gets the number of distinct values (BASIC statistics). |
Value | getMax(): Gets the maximum existing value (BASIC statistics). |
int | getMaxLengthString(): Gets the maximum length. |
double | getMean(): Gets the mean or average. |
double | getMedian(): Gets the median. |
Value | getMin(): Gets the minimum existing value (BASIC statistics). |
int | getMinLengthString(): Gets the minimum length. |
Value | getMode(): Gets the mode. |
long | getModeCount(): Gets the number of objects with a Value equal to the mode. |
long | getNull(): Gets the number of objects with a NULL Value (BASIC statistics). |
long | getTotal(): Gets the number of objects with a non-NULL Value (BASIC statistics). |
double | getVariance(): Gets the variance. |
public int getMinLengthString()
Returns 0 if the attribute is not a string attribute.
public long getModeCount()
public double getVariance()
Computed only for numerical attributes.
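This page does not say whether the variance is the population or the sample variance. As an illustration only, the sketch below computes the population variance (the mean of the squared deviations from the mean) over a plain array of doubles. The class name VarianceSketch and the method populationVariance are hypothetical and are not part of the SPARKSEE API.

```java
final class VarianceSketch {
    // Assumption: population variance (divide by n), not sample variance (divide by n - 1).
    static double populationVariance(double[] values) {
        if (values.length == 0) {
            throw new IllegalArgumentException("variance of an empty list is undefined");
        }
        // First pass: mean of the values.
        double sum = 0.0;
        for (double v : values) {
            sum += v;
        }
        double mean = sum / values.length;
        // Second pass: mean of the squared deviations from the mean.
        double squared = 0.0;
        for (double v : values) {
            double d = v - mean;
            squared += d * d;
        }
        return squared / values.length;
    }

    public static void main(String[] args) {
        double[] sample = {2, 4, 4, 4, 5, 5, 7, 9};
        System.out.println(populationVariance(sample)); // prints 4.0
    }
}
```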
public Value getMode()
Mode: Most frequent Value.
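To make the relationship between the mode and the mode count concrete, here is a minimal self-contained sketch that finds the most frequent value in a list together with its number of occurrences. It does not use the SPARKSEE API: the class ModeSketch and the method modeWithCount are hypothetical, and skipping NULL values and breaking ties in favor of the value seen first are assumptions that this page does not confirm.

```java
import java.util.Arrays;
import java.util.LinkedHashMap;
import java.util.List;
import java.util.Map;

final class ModeSketch {
    // Returns the most frequent non-null value and its count, or null if there is none.
    // Assumption: ties are broken in favor of the value that appears first.
    static <T> Map.Entry<T, Long> modeWithCount(List<T> values) {
        Map<T, Long> counts = new LinkedHashMap<>(); // keeps first-seen order
        for (T v : values) {
            if (v != null) {
                counts.merge(v, 1L, Long::sum);
            }
        }
        Map.Entry<T, Long> best = null;
        for (Map.Entry<T, Long> e : counts.entrySet()) {
            if (best == null || e.getValue() > best.getValue()) {
                best = e;
            }
        }
        return best;
    }

    public static void main(String[] args) {
        List<String> values = Arrays.asList("a", "b", "a", null, "c", "a");
        Map.Entry<String, Long> mode = modeWithCount(values);
        System.out.println(mode.getKey() + " occurs " + mode.getValue() + " times"); // a occurs 3 times
    }
}
```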
public Value getMin()
public double getMedian()
Median: Middle value that separates the higher half from the lower.
If a < b < c, then the median of the list {a, b, c} is b, and if a < b < c < d, then the median of the list {a, b, c, d} is the mean of b and c, i.e. (b + c)/2.
Computed only for numerical attributes.
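The definition above can be sketched in a few lines of plain Java. This is an illustration of the odd-length and even-length cases, not the library's implementation; the class MedianSketch and the method median are hypothetical names.

```java
import java.util.Arrays;

final class MedianSketch {
    // Median of a non-empty list: the middle element of the sorted values,
    // or the mean of the two middle elements when the count is even.
    static double median(double[] values) {
        if (values.length == 0) {
            throw new IllegalArgumentException("median of an empty list is undefined");
        }
        double[] sorted = values.clone(); // do not modify the caller's array
        Arrays.sort(sorted);
        int mid = sorted.length / 2;
        if (sorted.length % 2 == 1) {
            return sorted[mid];
        }
        return (sorted[mid - 1] + sorted[mid]) / 2.0;
    }

    public static void main(String[] args) {
        System.out.println(median(new double[] {3, 1, 2}));    // prints 2.0
        System.out.println(median(new double[] {4, 1, 3, 2})); // prints 2.5
    }
}
```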
public long getTotal()
public int getMaxLengthString()
Returns 0 if the attribute is not a string attribute.
public double getMean()
Mean or average: Sum of all Values divided by the number of observations.
Computed only for numerical attributes.
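As a small illustration of this definition, the sketch below divides the sum of the values by the number of observations. MeanSketch and mean are hypothetical names used only for this example and are not part of the SPARKSEE API.

```java
final class MeanSketch {
    // Arithmetic mean: sum of all values divided by the number of observations.
    static double mean(double[] values) {
        if (values.length == 0) {
            throw new IllegalArgumentException("mean of an empty list is undefined");
        }
        double sum = 0.0;
        for (double v : values) {
            sum += v;
        }
        return sum / values.length;
    }

    public static void main(String[] args) {
        System.out.println(mean(new double[] {2, 4, 9})); // prints 5.0
    }
}
```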
public long getNull()
public long getDistinct()
public double getAvgLengthString()
Returns 0 if the attribute is not a string attribute.
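The three string-length statistics (minimum, maximum and average length) can be illustrated with the standard IntSummaryStatistics class. This is a sketch over a plain list of strings, not the library's implementation; the class StringLengthSketch and the method lengthStats are hypothetical, and skipping NULL values is an assumption that this page does not confirm.

```java
import java.util.Arrays;
import java.util.IntSummaryStatistics;
import java.util.List;
import java.util.Objects;

final class StringLengthSketch {
    // Collects min, max and average length of the non-null strings.
    // Assumption: null entries are ignored rather than counted as length 0.
    static IntSummaryStatistics lengthStats(List<String> values) {
        return values.stream()
                .filter(Objects::nonNull)
                .mapToInt(String::length)
                .summaryStatistics();
    }

    public static void main(String[] args) {
        List<String> values = Arrays.asList("ab", "abcd", null, "abc");
        IntSummaryStatistics stats = lengthStats(values);
        System.out.println(stats.getMin());     // prints 2
        System.out.println(stats.getMax());     // prints 4
        System.out.println(stats.getAverage()); // prints 3.0
    }
}
```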
public Value getMax()