Class: QuantileDiscretizer

eclairjs/ml/feature. QuantileDiscretizer

`QuantileDiscretizer` takes a column with continuous features and outputs a column with binned categorical features. The bin ranges are chosen by taking a sample of the data and dividing it into roughly equal parts. The lower and upper bin bounds will be -Infinity and +Infinity, covering all real values. This attempts to find numBuckets partitions based on a sample of data, but it may find fewer depending on the data sample values.

Constructor

new QuantileDiscretizer(uidopt)

Parameters:
Name Type Attributes Description
uid string <optional>
Source:

Extends

Methods

(static) load(path) → {module:eclairjs/ml/feature.QuantileDiscretizer}

Parameters:
Name Type Description
path string
Source:
Returns:
Type
module:eclairjs/ml/feature.QuantileDiscretizer

copy(extra) → {module:eclairjs/ml/feature.QuantileDiscretizer}

Parameters:
Name Type Description
extra module:eclairjs/ml/param.ParamMap
Overrides:
Source:
Returns:
Type
module:eclairjs/ml/feature.QuantileDiscretizer

extractParamMap() → {module:eclairjs/ml/param.ParamMap}

Inherited From:
Source:
Returns:
Type
module:eclairjs/ml/param.ParamMap

fit(dataset, paramMapopt) → {module:eclairjs/ml.Estimator|module:eclairjs/ml/feature.Bucketizer}

Fits a model to the input data.
Parameters:
Name Type Attributes Description
dataset module:eclairjs/sql.Dataset
paramMap module:eclairjs/ml/param.ParamMap <optional>
Parameter map. These values override any specified in this Estimator's embedded ParamMap.
Inherited From:
Source:
Returns:
fitted model
Type
module:eclairjs/ml.Estimator | module:eclairjs/ml/feature.Bucketizer

getNumBuckets() → {integer}

Source:
Returns:
Type
integer

numBuckets() → {module:eclairjs/ml/param.IntParam}

Maximum number of buckets (quantiles, or categories) into which data points are grouped. Must be >= 2. default: 2
Source:
Returns:
Type
module:eclairjs/ml/param.IntParam

setInputCol(value) → {module:eclairjs/ml/feature.QuantileDiscretizer}

Parameters:
Name Type Description
value string
Source:
Returns:
Type
module:eclairjs/ml/feature.QuantileDiscretizer

setNumBuckets(value) → {module:eclairjs/ml/feature.QuantileDiscretizer}

Parameters:
Name Type Description
value integer
Source:
Returns:
Type
module:eclairjs/ml/feature.QuantileDiscretizer

setOutputCol(value) → {module:eclairjs/ml/feature.QuantileDiscretizer}

Parameters:
Name Type Description
value string
Source:
Returns:
Type
module:eclairjs/ml/feature.QuantileDiscretizer

transformSchema() → {module:eclairjs/sql/types.StructType}

Check transform validity and derive the output schema from the input schema. Typical implementation should first conduct verification on schema change and parameter validity, including complex parameter interaction checks. * @param {module:eclairjs/sql/types.StructType} schema
Inherited From:
Source:
Returns:
Type
module:eclairjs/sql/types.StructType

uid() → {string}

An immutable unique ID for the object and its derivatives.
Source:
Returns:
Type
string