# How is Decision Tree Split Quality Computed?

How is Decision Tree Split Quality Computed?

How is Decision Tree Split Quality Computed?

How is Decision Tree Split Quality Computed?

add a comment

2

OpenCV decision tree is the CART version of the tree. Regression and classification tasks use own splitting criteria.

**Regression task**. We minimize the expected sum variances of responses for two child nodes. In other words for each split canidate in a splitted node we estimate responses for all samples in the node. I.e. each sample is assigned to left or right node's response that computed as average sum of true responses of samples that came to the current child node. Then we compute "sum((true_response - predicted_response)^2)" for all samples of the splitted node for a given split and choose the split that gives the minimum of this sum.**Classification task**. Its criteria is based on Gini index and minimizes impurity of child nodes (samples of one class should belong to the same child node). It's hard to describe Gini index in simple words here, so see e.g. 'Gini impurity' http://en.wikipedia.org/wiki/Decision*tree*learning#Gini_impurity). We minimize a sum "left_node_samples_ratio * GiniIndex(left_node_samples) + right_node_samples_ratio * GiniIndex(right_node_samples)" to find the best split.

Minimization of described criterias is reduced to equivalent maximization of another simplified ones - it's a warning if you'll decide to look at the implementation ;)

Asked: **
2012-07-20 09:35:16 -0500
**

Seen: **6,396 times**

Last updated: **Jul 20 '12**

How to build a regression tree over binary variables?

What is the usage of CV_DTREE_CAT_DIR?

CvDTree: Appending training data

Decision Tree, subset contents

Decision Trees .xml file structure explanation

Missing predict method in java class CvDTree

In Java: Any chance to get the xml of a CvDTree in a String (instead of a file)

Understading information on nodes of Decision Trees

Copyright OpenCV foundation, 2012-2018. Content on this site is licensed under a Creative Commons Attribution Share Alike 3.0 license.