Reference or deeper details for the CRT TREE algorithm


Tim Graettinger
Hi,

I'm trying to understand the details of the SPSS CRT decision tree algorithm, particularly the splitting criteria. I found a document on the IBM site titled "IBM SPSS Decision Trees 25" that discusses the CRT criteria: the impurity measures Gini, Twoing, and Ordered Twoing. What I'm really missing are some examples with the calculations shown, or at least the formulas being applied. I've tried working through some examples of my own by hand, but I haven't been able to reproduce the impurity/improvement measure shown in the results.

If anyone has a pointer to the original paper(s) on which the SPSS implementation is based, that would be very helpful as well.  Thanks!
-Tim

=====================
To manage your subscription to SPSSX-L, send a message to
[hidden email] (not to SPSSX-L), with no body text except the
command. To leave the list, send the command
SIGNOFF SPSSX-L
For a list of commands to manage subscriptions, send the command
INFO REFCARD

Re: Reference or deeper details for the CRT TREE algorithm

Jon Peck

On Tue, Apr 23, 2019 at 7:18 AM Tim Graettinger <[hidden email]> wrote:
Hi,

I'm trying to understand the details of the SPSS CRT decision tree algorithm, particularly the splitting criteria. I found a document on the IBM site titled "IBM SPSS Decision Trees 25" that discusses the CRT criteria: the impurity measures Gini, Twoing, and Ordered Twoing. What I'm really missing are some examples with the calculations shown, or at least the formulas being applied. I've tried working through some examples of my own by hand, but I haven't been able to reproduce the impurity/improvement measure shown in the results.

If anyone has a pointer to the original paper(s) on which the SPSS implementation is based, that would be very helpful as well.  Thanks!
-Tim



--
Jon K Peck
[hidden email]


Re: Reference or deeper details for the CRT TREE algorithm

Tim Graettinger
Thanks, Jon!  Just what I was looking for!!
-Tim
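
For anyone landing on this thread with the same question: the splitting criteria named in the original post can be worked by hand using the textbook CART definitions (Breiman, Friedman, Olshen and Stone, 1984). The sketch below is only that, a sketch. It assumes equal misclassification costs, priors taken from the data, and no case weights, and it is not confirmed here to reproduce SPSS output digit for digit. The function names and the toy data are made up for illustration. The optional n_total scaling (multiplying by the node's share of the whole sample) is an assumption about how a per-node improvement might be reported, and it is one common reason hand calculations come out larger than the value in a tree table.

```python
from collections import Counter


def class_proportions(labels):
    """Return {class: proportion} for a list of class labels."""
    n = len(labels)
    return {k: c / n for k, c in Counter(labels).items()}


def gini(labels):
    """Gini impurity of a node: 1 - sum_j p(j|t)^2."""
    if not labels:
        return 0.0
    return 1.0 - sum(p ** 2 for p in class_proportions(labels).values())


def gini_improvement(parent, left, right, n_total=None):
    """Decrease in Gini impurity for a binary split:
    i(t) - pL * i(tL) - pR * i(tR).

    If n_total is given, the result is scaled by len(parent) / n_total
    (ASSUMPTION: node-share weighting, see the note above the code).
    """
    n = len(parent)
    p_left, p_right = len(left) / n, len(right) / n
    delta = gini(parent) - p_left * gini(left) - p_right * gini(right)
    if n_total is not None:
        delta *= n / n_total
    return delta


def twoing(parent, left, right):
    """Twoing criterion: (pL * pR / 4) * (sum_j |p(j|tL) - p(j|tR)|)^2."""
    n = len(parent)
    p_left, p_right = len(left) / n, len(right) / n
    pl, pr = class_proportions(left), class_proportions(right)
    classes = set(pl) | set(pr)
    spread = sum(abs(pl.get(k, 0.0) - pr.get(k, 0.0)) for k in classes)
    return p_left * p_right / 4.0 * spread ** 2


# Hypothetical toy node: 5 cases of class "a" and 5 of class "b",
# split into two children of 5 cases each.
left = ["a"] * 4 + ["b"] * 1
right = ["a"] * 1 + ["b"] * 4
parent = left + right

print(gini(parent))
print(gini_improvement(parent, left, right))
print(gini_improvement(parent, left, right, n_total=20))
print(twoing(parent, left, right))
```

By hand, the parent impurity is 1 - (0.5^2 + 0.5^2) = 0.5 and each child has impurity 1 - (0.8^2 + 0.2^2) = 0.32, so the unscaled Gini decrease is 0.5 - 0.5*0.32 - 0.5*0.32 = 0.18. If the parent held 10 of 20 cases in the whole sample, the scaled value would be half of that.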
