Class: QuantileDiscretizer

eclairjs/ml/feature.QuantileDiscretizer

`QuantileDiscretizer` takes a column with continuous features and outputs a column with binned categorical features. The number of bins can be set using the `numBuckets` parameter. The bin ranges are chosen using an approximate algorithm (see the documentation for [[org.apache.spark.sql.DatasetStatFunctions.approxQuantile approxQuantile]] for a detailed description). The precision of the approximation can be controlled with the `relativeError` parameter. The lower and upper bin bounds will be `-Infinity` and `+Infinity`, covering all real values.

Constructor

new QuantileDiscretizer(uidopt)

Parameters:
Name Type Attributes Description
uid string <optional>
Source:

Extends

Methods

(static) load(path) → {module:eclairjs/ml/feature.QuantileDiscretizer}

Parameters:
Name Type Description
path string
Source:
Returns:
Type
module:eclairjs/ml/feature.QuantileDiscretizer

copy(extra) → {module:eclairjs/ml/feature.QuantileDiscretizer}

Parameters:
Name Type Description
extra module:eclairjs/ml/param.ParamMap
Overrides:
Source:
Returns:
Type
module:eclairjs/ml/feature.QuantileDiscretizer

extractParamMap() → {module:eclairjs/ml/param.ParamMap}

Inherited From:
Source:
Returns:
Type
module:eclairjs/ml/param.ParamMap

fit(dataset) → {Bucketizer}

Parameters:
Name Type Description
dataset module:eclairjs/sql.Dataset
Overrides:
Source:
Returns:
Type
Bucketizer

setInputCol(value) → {module:eclairjs/ml/feature.QuantileDiscretizer}

Parameters:
Name Type Description
value string
Source:
Returns:
Type
module:eclairjs/ml/feature.QuantileDiscretizer

setNumBuckets(value) → {module:eclairjs/ml/feature.QuantileDiscretizer}

Parameters:
Name Type Description
value number
Source:
Returns:
Type
module:eclairjs/ml/feature.QuantileDiscretizer

setOutputCol(value) → {module:eclairjs/ml/feature.QuantileDiscretizer}

Parameters:
Name Type Description
value string
Source:
Returns:
Type
module:eclairjs/ml/feature.QuantileDiscretizer

setRelativeError(value) → {type}

Parameters:
Name Type Description
value number
Source:
Returns:
Type
type

transformSchema(schema) → {StructType}

Parameters:
Name Type Description
schema module:eclairjs/sql/types.StructType
Source:
Returns:
Type
StructType