SparkseePython  6.0.2
sparksee.AttributeStatistics Class Reference

Attribute statistics class.

Public Member Functions

def get_min_length_string (self)
 Gets the minimum length.

def get_mode_count (self)
 Gets the number of objects with a Value equal to the mode.

def get_variance (self)
 Gets the variance.

def get_mode (self)
 Gets the mode.

def get_min (self)
 Gets the minimum existing value (BASIC statistics).

def get_median (self)
 Gets the median.

def get_total (self)
 Gets the number of objects with a non-NULL Value (BASIC statistics).

def get_max_length_string (self)
 Gets the maximum length.

def get_mean (self)
 Gets the mean or average.

def get_null (self)
 Gets the number of objects with a NULL Value (BASIC statistics).

def get_distinct (self)
 Gets the number of distinct values (BASIC statistics).

def get_avg_length_string (self)
 Gets the average length.

def get_max (self)
 Gets the maximum existing value (BASIC statistics).
 

Detailed Description

Attribute statistics class.

It contains statistic data about an attribute.

Some fields are valid only for numerical attributes and others only for string attributes. Some statistics are considered BASIC because computing them does not require traversing all the different values of the attribute. The documentation of each getter method indicates whether the statistic is BASIC. See the Graph class method getAttributeStatistics or the SPARKSEE User Manual for more details.
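
As a rough illustration of how the getters might be consumed, the sketch below collects the statistics this page marks as BASIC into a dictionary. The helper name `basic_summary` and the dictionary keys are hypothetical. The function only assumes an object exposing the getters documented on this page; how that object is obtained from a Graph is not shown here.

```python
def basic_summary(stats):
    """Collect the statistics documented as BASIC on this page.

    `stats` is assumed to be any object exposing the AttributeStatistics
    getters; the function name and the dictionary keys are illustrative.
    """
    return {
        "total": stats.get_total(),        # objects with a non-NULL Value
        "null": stats.get_null(),          # objects with a NULL Value
        "distinct": stats.get_distinct(),  # number of distinct values
        "min": stats.get_min(),            # minimum existing value
        "max": stats.get_max(),            # maximum existing value
    }
```

Keeping the helper to BASIC getters means it never touches a statistic that may not have been computed.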

Author
Sparsity Technologies http://www.sparsity-technologies.com

Member Function Documentation

def sparksee.AttributeStatistics.get_avg_length_string (   self)

Gets the average length.

If the attribute is not a string attribute, it just returns 0.

Returns
The average length.
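
To make the three string-length statistics (minimum, maximum, and average length) concrete, here is a minimal plain-Python sketch that derives them from a list of strings. It does not use the Sparksee API. The function name `length_stats` is hypothetical, lengths are assumed to be counted in characters, and returning zeros for an empty input is a choice made for this sketch only.

```python
def length_stats(strings):
    """Return (min_length, max_length, avg_length) for a list of strings.

    Assumption: lengths are counted in characters. An empty input yields
    zeros, which is a convention of this sketch only.
    """
    lengths = [len(s) for s in strings]
    if not lengths:
        return 0, 0, 0.0
    return min(lengths), max(lengths), sum(lengths) / len(lengths)


shortest, longest, average = length_stats(["ab", "abcd", "abc"])
```
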
def sparksee.AttributeStatistics.get_distinct (   self)

Gets the number of distinct values (BASIC statistics).

Returns
The number of distinct values.
def sparksee.AttributeStatistics.get_max (   self)

Gets the maximum existing value (BASIC statistics).

Returns
The maximum existing value.
def sparksee.AttributeStatistics.get_max_length_string (   self)

Gets the maximum length.

If the attribute is not a string attribute, it just returns 0.

Returns
The maximum length.
def sparksee.AttributeStatistics.get_mean (   self)

Gets the mean or average.

Mean or average: Sum of all Values divided by the number of observations.

It is computed just for numerical attributes.

Returns
The mean.
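
The definition above can be sketched in plain Python, independently of the Sparksee API. The function name `mean` is illustrative, and the input is assumed to contain only the numerical, non-NULL values being observed.

```python
def mean(values):
    """Sum of all values divided by the number of observations."""
    if not values:
        raise ValueError("the mean of an empty sequence is undefined")
    return sum(values) / len(values)


average = mean([2, 4, 9])  # (2 + 4 + 9) / 3
```
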
def sparksee.AttributeStatistics.get_median (   self)

Gets the median.

Median: Middle value that separates the higher half from the lower.

If a < b < c, then the median of the list {a, b, c} is b. If a < b < c < d, then the median of the list {a, b, c, d} is the mean of b and c, that is, (b + c)/2.

It is computed just for numerical attributes.

Returns
The median.
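
The odd and even cases described above can be sketched in plain Python as follows. This only illustrates the definition and does not use the Sparksee API; the function name `median` is illustrative.

```python
def median(values):
    """Middle value of the sorted data; mean of the two middle values
    when the number of observations is even."""
    ordered = sorted(values)
    n = len(ordered)
    if n == 0:
        raise ValueError("the median of an empty sequence is undefined")
    mid = n // 2
    if n % 2 == 1:
        return ordered[mid]                       # {a, b, c} -> b
    return (ordered[mid - 1] + ordered[mid]) / 2  # {a, b, c, d} -> (b + c)/2


odd_case = median([3, 1, 2])
even_case = median([4, 1, 3, 2])
```
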
def sparksee.AttributeStatistics.get_min (   self)

Gets the minimum existing value (BASIC statistics).

Returns
The minimum existing value.
def sparksee.AttributeStatistics.get_min_length_string (   self)

Gets the minimum length.

If the attribute is not a string attribute, it just returns 0.

Returns
The minimum length.
def sparksee.AttributeStatistics.get_mode (   self)

Gets the mode.

Mode: Most frequent Value.

Returns
The mode.
def sparksee.AttributeStatistics.get_mode_count (   self)

Gets the number of objects with a Value equal to the mode.

Returns
The number of objects with a Value equal to the mode.
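
The relationship between the mode and the mode count can be sketched with the standard library's `collections.Counter`. This is a plain-Python illustration, not the Sparksee implementation. The function name `mode_and_count` is hypothetical, and since this page does not say how ties are resolved, the tie behaviour shown here (the first value encountered wins) is only a property of the sketch.

```python
from collections import Counter


def mode_and_count(values):
    """Return the most frequent value and how many times it occurs."""
    if not values:
        raise ValueError("the mode of an empty sequence is undefined")
    # most_common(1) returns a one-element list of (value, count) pairs.
    value, count = Counter(values).most_common(1)[0]
    return value, count


mode, mode_count = mode_and_count(["a", "b", "a", "c"])
```
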
def sparksee.AttributeStatistics.get_null (   self)

Gets the number of objects with a NULL Value (BASIC statistics).

Returns
The number of objects with a NULL Value.
def sparksee.AttributeStatistics.get_total (   self)

Gets the number of objects with a non-NULL Value (BASIC statistics).

Returns
The number of objects with a non-NULL Value.
def sparksee.AttributeStatistics.get_variance (   self)

Gets the variance.

It is computed just for numerical attributes.

Returns
The variance.
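
This page does not state whether the variance is the population variance (dividing by n) or the sample variance (dividing by n - 1). The plain-Python sketch below shows both so the returned value can be compared against either. The function name `variance` and its `sample` flag are hypothetical and not part of the Sparksee API.

```python
def variance(values, sample=False):
    """Mean squared deviation from the mean.

    With sample=False the sum of squared deviations is divided by n
    (population variance); with sample=True it is divided by n - 1.
    """
    n = len(values)
    if n == 0 or (sample and n < 2):
        raise ValueError("not enough observations to compute the variance")
    mu = sum(values) / n
    squared_deviations = sum((v - mu) ** 2 for v in values)
    return squared_deviations / (n - 1 if sample else n)


data = [2, 4, 4, 4, 5, 5, 7, 9]
population_variance = variance(data)
sample_variance = variance(data, sample=True)
```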