SOFTWARE/CART

 



by

What's New in CART® 4.0 Pro

Salford Systems' latest release of our flagship data-mining product, CART 4.0 PRO, contains many new features. CART 4.0 PRO is available for Windows 95 and 98, Windows NT 4.0, Windows 2000, and all major versions of UNIX and Linux.

Automatic Generation of Customized Reports and Publishing to the Web

Customized reports now can be generated automatically for any tree no matter when it was grown. Options control which components of individual CART analyses (tree diagram, gains charts, importance rankings, etc.) are included in each report. If several trees have been grown, a summary report table can be produced that describes each tree in one line of the table. The report can be saved as a Rich Text Format (.rtf) document and then published to Web (as .html).

Class Labels

You now can assign 32-character labels to levels of categorical variables. Labels can be added interactively to any variable including the target. Tree diagrams, CART text output and displays can use these labels when referring to levels of either target or predictor variables. Once entered the labels can be exported to any other analysis or tree; labels for categorical variables with many levels can be applied via command script and need not be entered interactively.

Case Weights

Case weights now can be used with any CART splitting rule or tree-combining method. (CART formerly treated each observation equally, i.e., with an effective case weight of 1.0.) Case weights, which may take on fractional values as well as whole numbers, are stored in a variable in the dataset and typically vary from observation to observation.

Penalties for Predictor Variables

CART now offers three ways to "penalize" (scale down) the improvement computed for a predictor variable, so that it is less likely to be chosen as the primary splitter. Penalties can be used to reflect the cost of a predictor's acquisition or the preference to not use a certain predictor unless it is unusually effective. The user can specify penalties on a specific variable from none through light to heavy. In addition, exact numerical penalty values can be specified or penalties can be imposed on all predictor variables in proportion to the degree that they are missing.

Penalties for High-Level Categorical Variables

A new method is provided to use effectively a predictor variable with many levels (such as zip code, which may have 3000 distinct levels) without giving such a predictor an undue advantage. Spurious splits are greatly reduced by ensuring that a high-level categorical predictor has no inherent advantage over a continuous variable that has a unique value for each record.

Combined Trees

CART's handling of committees of trees (bagging, ARCing) has been improved to make this technology more accessible to the average CART user. In addition, the user now can learn details about individual trees used in the committee. In some cases these details can allow users to draw robust conclusions that are not otherwise apparent.

© Copyright 2002 Salford-Systems Inc.


 
Copyright © 2010 TStat All rights reserved via Rettangolo, 12/14 - 67039 - Sulmona (AQ) - Italia