Constructor
new CountVectorizer(uidopt)
Parameters:
Name |
Type |
Attributes |
Description |
uid |
string
|
<optional>
|
|
- Source:
Methods
Parameters:
Name |
Type |
Description |
path |
string
|
|
- Source:
Returns:
-
Type
-
module:eclairjs/ml/feature.CountVectorizer
Parameters:
- Source:
Returns:
-
Type
-
module:eclairjs/ml/feature.CountVectorizer
Parameters:
- Source:
Returns:
-
Type
-
module:eclairjs/ml/feature.CountVectorizerModel
getMinDF() → {float}
- Source:
Returns:
-
Type
-
float
getMinTF() → {float}
- Source:
Returns:
-
Type
-
float
getVocabSize() → {integer}
- Source:
Returns:
-
Type
-
integer
Specifies the minimum number of different documents a term must appear in to be included in the vocabulary.
If this is an integer >= 1, this specifies the number of documents the term must appear in;
if this is a double in [0,1), then this specifies the fraction of documents.
- Source:
Returns:
-
Type
-
module:eclairjs/ml/param.DoubleParam
Parameters:
Name |
Type |
Description |
value |
boolean
|
|
- Source:
Returns:
-
Type
-
module:eclairjs/ml/feature.CountVectorizer
Parameters:
Name |
Type |
Description |
value |
string
|
|
- Source:
Returns:
-
Type
-
module:eclairjs/ml/feature.CountVectorizer
Parameters:
Name |
Type |
Description |
value |
float
|
|
- Source:
Returns:
-
Type
-
module:eclairjs/ml/feature.CountVectorizer
Parameters:
Name |
Type |
Description |
value |
float
|
|
- Source:
Returns:
-
Type
-
module:eclairjs/ml/feature.CountVectorizer
Parameters:
Name |
Type |
Description |
value |
string
|
|
- Source:
Returns:
-
Type
-
module:eclairjs/ml/feature.CountVectorizer
Parameters:
Name |
Type |
Description |
value |
integer
|
|
- Source:
Returns:
-
Type
-
module:eclairjs/ml/feature.CountVectorizer
Parameters:
- Source:
Returns:
-
Type
-
module:eclairjs/sql/types.StructType
uid() → {string}
An immutable unique ID for the object and its derivatives.
- Source:
Returns:
-
Type
-
string
Validates and transforms the input schema.
Parameters:
- Source:
Returns:
-
Type
-
module:eclairjs/sql/types.StructType
Max size of the vocabulary. CountVectorizer will build a vocabulary that only considers the top vocabSize
terms ordered by term frequency across the corpus.
Default: 2^18^
- Source:
Returns:
-
Type
-
module:eclairjs/ml/param.IntParam