- This topic has 1 reply, 2 voices, and was last updated 8 years, 2 months ago by .
Viewing 1 reply thread
Viewing 1 reply thread
- You must be logged in to reply to this topic.
› Forums › Speech Synthesis › The front end › CART › CART – Distinguishing between majority and pure labels
In the CART model, is there any way of distinguishing between pure labels and majority labels to help you work out the likelihood that the unlabelled data will be classified correctly?
I think you are asking about the distribution of labels at a leaf of the tree – is that what you mean?
In general, with real data, we will not get pure leaves (i.e., all data points have a single label). So, we can say that there is always a distribution of labels at every leaf.
The question then becomes: how do we make use of that, when making predictions for unseen test data points? There are two possibilities:
In the second case, some subsequent process will have to resolve the uncertainty about the label – perhaps by using additional information such as the sequence of labels assigned to preceding and following points (in a sequence).
Some forums are only available if you are logged in. Searching will only return results from those forums if you log in.
Copyright © 2024 · Balance Child Theme on Genesis Framework · WordPress · Log in