Class: VectorIndexerModel

eclairjs/ml/feature.VectorIndexerModel

Model fitted by VectorIndexer. Transform categorical features to use 0-based indices instead of their original values. - Categorical features are mapped to indices. - Continuous features (columns) are left unchanged. This also appends metadata to the output column, marking features as Numeric (continuous), Nominal (categorical), or Binary (either continuous or categorical). Non-ML metadata is not carried over from the input to the output column. This maintains vector sparsity.

Constructor

new VectorIndexerModel(numFeatures, categoryMaps)

Parameters:
Name Type Description
numFeatures Number of features, i.e., length of Vectors which this transforms
categoryMaps Feature value index. Keys are categorical feature indices (column indices). Values are maps from original features values to 0-based category indices. If a feature is not in this map, it is treated as continuous.
Source:

Extends

Methods

(static) load(path) → {module:eclairjs/ml/feature.VectorIndexerModel}

Parameters:
Name Type Description
path string
Source:
Returns:
Type
module:eclairjs/ml/feature.VectorIndexerModel

(static) read() → {MLReader}

Source:
Returns:
Type
MLReader

categoryMaps() → {Promise.<object>}

Source:
Returns:
Map object>
Type
Promise.<object>

copy(extra) → {module:eclairjs/ml/feature.VectorIndexerModel}

Parameters:
Name Type Description
extra module:eclairjs/ml/param.ParamMap
Overrides:
Source:
Returns:
Type
module:eclairjs/ml/feature.VectorIndexerModel

numFeatures() → {Promise.<integer>}

Source:
Returns:
Type
Promise.<integer>

setInputCol(value) → {module:eclairjs/ml/feature.VectorIndexerModel}

Parameters:
Name Type Description
value string
Source:
Returns:
Type
module:eclairjs/ml/feature.VectorIndexerModel

setOutputCol(value) → {module:eclairjs/ml/feature.VectorIndexerModel}

Parameters:
Name Type Description
value string
Source:
Returns:
Type
module:eclairjs/ml/feature.VectorIndexerModel

transform(dataset) → {module:eclairjs/sql.Dataset}

Parameters:
Name Type Description
dataset module:eclairjs/sql.Dataset
Source:
Returns:
Type
module:eclairjs/sql.Dataset

transformSchema(schema) → {module:eclairjs/sql/types.StructType}

Parameters:
Name Type Description
schema module:eclairjs/sql/types.StructType
Source:
Returns:
Type
module:eclairjs/sql/types.StructType

uid() → {Promise.<string>}

An immutable unique ID for the object and its derivatives.
Source:
Returns:
Type
Promise.<string>

write() → {MLWriter}

Source:
Returns:
Type
MLWriter