Class: Word2Vec

eclairjs/ml/feature. Word2Vec

new Word2Vec(uidopt)

Word2Vec trains a model of `Map(String, Vector)`, i.e. transforms a word into a code for further natural language processing or machine learning process.
Parameters:
Name Type Attributes Description
uid string <optional>
Source:

Extends

Methods

(static) load(path) → {module:eclairjs/ml/feature.Word2Vec}

Parameters:
Name Type Description
path string
Source:
Returns:
Type
module:eclairjs/ml/feature.Word2Vec

copy(extra) → {module:eclairjs/ml/feature.Word2Vec}

Parameters:
Name Type Description
extra module:eclairjs/ml.param.ParamMap
Overrides:
Source:
Returns:
Type
module:eclairjs/ml/feature.Word2Vec

extractParamMap() → {module:eclairjs/ml/param.ParamMap}

Inherited From:
Source:
Returns:
Type
module:eclairjs/ml/param.ParamMap

fit(dataset) → {module:eclairjs/ml/feature.Word2VecModel}

Parameters:
Name Type Description
dataset module:eclairjs/sql.Dataset
Overrides:
Source:
Returns:
Type
module:eclairjs/ml/feature.Word2VecModel

getInputCol() → {string}

Source:
Returns:
Type
string

getMaxIter() → {integer}

Source:
Returns:
Type
integer

getMinCount() → {integer}

Source:
Returns:
Type
integer

getNumPartitions() → {integer}

Source:
Returns:
Type
integer

getOutputCol() → {string}

Source:
Returns:
Type
string

getSeed() → {integer}

Source:
Returns:
Type
integer

getStepSize() → {float}

Source:
Returns:
Type
float

getVectorSize() → {integer}

Source:
Returns:
Type
integer

getWindowSize() → {integer}

Source:
Returns:
Type
integer

inputCol() → {module:eclairjs/ml/param.Param}

Source:
Returns:
Type
module:eclairjs/ml/param.Param

maxIter() → {module:eclairjs/ml/param.IntParam}

Param for maximum number of iterations (>= 0).
Source:
Returns:
Type
module:eclairjs/ml/param.IntParam

minCount() → {module:eclairjs/ml/param.IntParam}

The minimum number of times a token must appear to be included in the word2vec model's vocabulary. Default: 5
Source:
Returns:
Type
module:eclairjs/ml/param.IntParam

numPartitions() → {module:eclairjs/ml/param.IntParam}

Number of partitions for sentences of words. Default: 1
Source:
Returns:
Type
module:eclairjs/ml/param.IntParam

outputCol() → {module:eclairjs/ml/param.Param}

Source:
Returns:
Type
module:eclairjs/ml/param.Param

seed() → {module:eclairjs/ml/param.LongParam}

Param for random seed.
Source:
Returns:
Type
module:eclairjs/ml/param.LongParam

setInputCol(value) → {module:eclairjs/ml/feature.Word2Vec}

Parameters:
Name Type Description
value string
Source:
Returns:
Type
module:eclairjs/ml/feature.Word2Vec

setMaxIter(value) → {module:eclairjs/ml/feature.Word2Vec}

Parameters:
Name Type Description
value integer
Source:
Returns:
Type
module:eclairjs/ml/feature.Word2Vec

setMaxSentenceLength(value) → {module:eclairjs/mllib/feature.Word2Vec}

Parameters:
Name Type Description
value number
Source:
Returns:
Type
module:eclairjs/mllib/feature.Word2Vec

setMinCount(value) → {module:eclairjs/ml/feature.Word2Vec}

Parameters:
Name Type Description
value integer
Source:
Returns:
Type
module:eclairjs/ml/feature.Word2Vec

setNumPartitions(value) → {module:eclairjs/ml/feature.Word2Vec}

Parameters:
Name Type Description
value integer
Source:
Returns:
Type
module:eclairjs/ml/feature.Word2Vec

setOutputCol(value) → {module:eclairjs/ml/feature.Word2Vec}

Parameters:
Name Type Description
value string
Source:
Returns:
Type
module:eclairjs/ml/feature.Word2Vec

setSeed(value) → {module:eclairjs/ml/feature.Word2Vec}

Parameters:
Name Type Description
value integer
Source:
Returns:
Type
module:eclairjs/ml/feature.Word2Vec

setStepSize(value) → {module:eclairjs/ml/feature.Word2Vec}

Parameters:
Name Type Description
value float
Source:
Returns:
Type
module:eclairjs/ml/feature.Word2Vec

setVectorSize(value) → {module:eclairjs/ml/feature.Word2Vec}

Parameters:
Name Type Description
value integer
Source:
Returns:
Type
module:eclairjs/ml/feature.Word2Vec

setWindowSize(value) → {module:eclairjs/ml/feature.Word2Vec}

Parameters:
Name Type Description
value integer
Source:
Returns:
Type
module:eclairjs/ml/feature.Word2Vec

stepSize() → {module:eclairjs/ml/param.DoubleParam}

Param for Step size to be used for each iteration of optimization
Source:
Returns:
Type
module:eclairjs/ml/param.DoubleParam

transformSchema(schema) → {module:eclairjs/sql/types.StructType}

Parameters:
Name Type Description
schema module:eclairjs/sql/types.StructType
Overrides:
Source:
Returns:
Type
module:eclairjs/sql/types.StructType

uid() → {string}

An immutable unique ID for the object and its derivatives.
Source:
Returns:
Type
string

validateAndTransformSchema(schema) → {module:eclairjs/sql/types.StructType}

Validates and transforms the input schema.
Parameters:
Name Type Description
schema module:eclairjs/sql/types.StructType
Source:
Returns:
Type
module:eclairjs/sql/types.StructType

vectorSize() → {module:eclairjs/ml/param.IntParam}

The dimension of the code that you want to transform from words. Default: 100
Source:
Returns:
Type
module:eclairjs/ml/param.IntParam

windowSize() → {module:eclairjs/ml/param.IntParam}

The window size (context words from [-window, window]) default 5.
Source:
Returns:
Type
module:eclairjs/ml/param.IntParam