class sklearn.ensemble.ExtraTreesClassifier(n_estimators=’warn’, criterion=’gini’, max_depth=None, min_samples_split=2, min_samples_leaf=1, min_weight_fraction_leaf=0.0, max_features=’auto’, max_leaf_nodes=None, min_impurity_decrease=0.0, min_impurity_split=None, bootstrap=False, oob_score=False, n_jobs=None, random_state=None, verbose=0, warm_start=False, class_weight=None) [source]
An extra-trees classifier.
This class implements a meta estimator that fits a number of randomized decision trees (a.k.a. extra-trees) on various sub-samples of the dataset and uses averaging to improve the predictive accuracy and control over-fitting.
Read more in the User Guide.
| Parameters: | 
 | 
|---|---|
| Attributes: | 
 | 
See also
sklearn.tree.ExtraTreeClassifier
RandomForestClassifier
The default values for the parameters controlling the size of the trees (e.g. max_depth, min_samples_leaf, etc.) lead to fully grown and unpruned trees which can potentially be very large on some data sets. To reduce memory consumption, the complexity and size of the trees should be controlled by setting those parameter values.
| [1] | P. Geurts, D. Ernst., and L. Wehenkel, “Extremely randomized trees”, Machine Learning, 63(1), 3-42, 2006. | 
| apply(X) | Apply trees in the forest to X, return leaf indices. | 
| decision_path(X) | Return the decision path in the forest | 
| fit(X, y[, sample_weight]) | Build a forest of trees from the training set (X, y). | 
| get_params([deep]) | Get parameters for this estimator. | 
| predict(X) | Predict class for X. | 
| predict_log_proba(X) | Predict class log-probabilities for X. | 
| predict_proba(X) | Predict class probabilities for X. | 
| score(X, y[, sample_weight]) | Returns the mean accuracy on the given test data and labels. | 
| set_params(**params) | Set the parameters of this estimator. | 
__init__(n_estimators=’warn’, criterion=’gini’, max_depth=None, min_samples_split=2, min_samples_leaf=1, min_weight_fraction_leaf=0.0, max_features=’auto’, max_leaf_nodes=None, min_impurity_decrease=0.0, min_impurity_split=None, bootstrap=False, oob_score=False, n_jobs=None, random_state=None, verbose=0, warm_start=False, class_weight=None) [source]
apply(X) [source]
Apply trees in the forest to X, return leaf indices.
| Parameters: | 
 | 
|---|---|
| Returns: | 
 | 
decision_path(X) [source]
Return the decision path in the forest
New in version 0.18.
| Parameters: | 
 | 
|---|---|
| Returns: | 
 | 
feature_importances_ | Returns: | 
 | 
|---|
fit(X, y, sample_weight=None) [source]
Build a forest of trees from the training set (X, y).
| Parameters: | 
 | 
|---|---|
| Returns: | 
 | 
get_params(deep=True) [source]
Get parameters for this estimator.
| Parameters: | 
 | 
|---|---|
| Returns: | 
 | 
predict(X) [source]
Predict class for X.
The predicted class of an input sample is a vote by the trees in the forest, weighted by their probability estimates. That is, the predicted class is the one with highest mean probability estimate across the trees.
| Parameters: | 
 | 
|---|---|
| Returns: | 
 | 
predict_log_proba(X) [source]
Predict class log-probabilities for X.
The predicted class log-probabilities of an input sample is computed as the log of the mean predicted class probabilities of the trees in the forest.
| Parameters: | 
 | 
|---|---|
| Returns: | 
 | 
predict_proba(X) [source]
Predict class probabilities for X.
The predicted class probabilities of an input sample are computed as the mean predicted class probabilities of the trees in the forest. The class probability of a single tree is the fraction of samples of the same class in a leaf.
| Parameters: | 
 | 
|---|---|
| Returns: | 
 | 
score(X, y, sample_weight=None) [source]
Returns the mean accuracy on the given test data and labels.
In multi-label classification, this is the subset accuracy which is a harsh metric since you require for each sample that each label set be correctly predicted.
| Parameters: | 
 | 
|---|---|
| Returns: | 
 | 
set_params(**params) [source]
Set the parameters of this estimator.
The method works on simple estimators as well as on nested objects (such as pipelines). The latter have parameters of the form <component>__<parameter> so that it’s possible to update each component of a nested object.
| Returns: | 
 | 
|---|
sklearn.ensemble.ExtraTreesClassifier
    © 2007–2018 The scikit-learn developers
Licensed under the 3-clause BSD License.
    http://scikit-learn.org/stable/modules/generated/sklearn.ensemble.ExtraTreesClassifier.html