Table Of Contents

RandomForestRegressorModel train


train(self, frame, label_column, observation_columns, num_trees=1, impurity='variance', max_depth=4, max_bins=100, seed=2145878081, categorical_features_info=None, feature_subset_category=None)

[ALPHA] Build Random Forests Regressor model.

Parameters:

frame : <bound method AtkEntityType.__name__ of <trustedanalytics.rest.jsonschema.AtkEntityType object at 0x7f9e686f3fd0>>

A frame to train the model on

label_column : unicode

Column name containing the label for each observation

observation_columns : list

Column(s) containing the observations

num_trees : int32 (default=1)

Number of tress in the random forest

impurity : unicode (default=variance)

Criterion used for information gain calculation. Supported values “variance”

max_depth : int32 (default=4)

Maximum depth of the tree

max_bins : int32 (default=100)

Maximum number of bins used for splitting features

seed : int32 (default=2145878081)

Random seed for bootstrapping and choosing feature subsets

categorical_features_info : None (default=None)

feature_subset_category : unicode (default=None)

Number of features to consider for splits at each node. Supported values “auto”, “all”, “sqrt”,”log2”, “onethird”

Returns:

: dict

Values of the Random Forest Classifier model object storing:

the list of observation columns on which the model was trained,
the column name containing the labels of the observations,
the number of decision trees in the random forest,
the number of nodes in the random forest,
the map storing arity of categorical features,
the criterion used for information gain calculation,
the maximum depth of the tree,
the maximum number of bins used for splitting features,
the random seed used for bootstrapping and choosing feature subset.

Creating a Random Forests Regressor Model using the observation columns and label column.