Hi Saeed,
Decision Trees family comprisies of C5, CHAID, QUEST and C&RT.
C5 is only available in SPSS Modeler. I had deployed one for one of my company's vendors before to predict which customers are likely to purchase an insurance product.
In any case, back to decision tree, you might want to have some readings from wiki as a start first to know something about it. It might come in useful for future analysis.
http://en.wikipedia.org/wiki/Decision_tree_learning
I am currently launching another data mining model named Sequence Detection model too. Those that you see on an online website stating that if you had bought a certain product, what are the recommended products for you. A data mining model which works using conditional probability.
There are also many good website on Data Mining Models as well.
Warmest Regards
Dorraj Oet
Date: Tue, 18 Dec 2012 19:21:55 +1100
Subject: Re: Which Statistical test to use?!
From: [hidden email]
To: [hidden email]
CC: [hidden email]
Thanks for your reply.
Actually I have never run decision tree for my data analysis!
I hope I can work out an easier way!
Thanks and I will be looking forward to hearing more from the group and you.On Tue, Dec 18, 2012 at 7:05 PM, DorraJ Oet <[hidden email]> wrote:
Hi Saeed,
It is nice to know that each item is likert scale.
The reason why I suggested a decision tree because a Dean of a University once mentioned that the decision tree take care of interaction. Also, it might be meaningful to find out which of the variables, gender, age and proficiency level is important to
If I am doing the analysis, I will build decicison trees for all the 6 items using them as dependent variables and gender, age and proficiency level will be the independent variables. It might be interesting to know which of these 3 variables are important to the 6 variables above.
1. attachment to persian language
2. pronunciation attitude
3. language and social status
4. L1 use and exposure in the society
5. language knowledge
6. script
One more thing about Decision tree is that is can actually "group" the age for me. The age range of a respondent being attached to Persian Language might be different from that who has a high language knowledge. I can also do profiling along the way with a decision tree.
Warmest Regards
Dorraj
Date: Tue, 18 Dec 2012 18:45:51 +1100
Subject: Re: Which Statistical test to use?!
From: [hidden email]
To: [hidden email]
CC: [hidden email]
I have already run EFA and CFA on the questionnaire. Actually I have the Likert scale from 1 to 6 for each of the items in the questionnaire, and the gender is male and female as expected, the age group is divided into four 11-15, 16-20, 21-25, 26+, and the language proficiency is also divided into six including basic, elementary, pre-inter, inter, high inter, and advanced.
My first question is: Should I check the relationship between each subscale and each of the above variables? i mean does it sound plausible? And if yes, it becomes very complicated for data analysis. What do you suggest?
Thanks
From: [hidden email]
To: [hidden email]; [hidden email]
Subject: RE: Which Statistical test to use?!
Date: Tue, 18 Dec 2012 07:39:55 +0000
Hi Saeed,
This seems to be more of a modeling than a Statistical testing to see if there is a relationship.
I am going to try to suggest based on what I had interpret from your email.
Looking at if there is a relationship between Iranian English language learners’ attachment to Persian
- gender
- age
- proficiency level
My dependent variable will be Attachment to Persian. Assuming that attachment is scored 1 - 5.
My independent variables will be gender, age and proficiency level.
I will use a decision tree model to test which are the independent variables that are important to the variable Attachment to Persian.
If all 3 of the independent variables are important, the decision tree will allow you to see
For a respondent which is
- Male
- 30-51 years old
- Proficiency level 5
how attached is he to Persian.
Warmest Regards
Dorraj Oet
Date: Tue, 18 Dec 2012 18:01:17 +1100
From: [hidden email]
Subject: Which Statistical test to use?!
To: [hidden email]
Hi all,
I am dubious about which statistical test to use for my research questions. I would be glad to receive your comments. My research question is:
Is there any relationship between the Iranian English language learners' language identity and their age, gender, and language proficiency level.
The language identity is measured through a validated questionnaire with 19 items. And the language identity included six subscales in the questionnaire namely:
1. attachment to persian language
2. pronunciation attitude
3. language and social status
4. L1 use and exposure in the society
5. language knowledge
6. script
Each of the above subscales had 3 to 4 items.
Now the first research question will be transformed into multiple research questions including:
1. There is no relationship between Iranian English language learners’ attachment to Persian and their
· gender,
· age, and
· proficiency level
2. There is no relationship between Iranian English language learners’ pronunciation attitude and their
· gender,
· age, and
· proficiency level
3. There is no relationship between Iranian English language learners’ language and social status and their
· gender,
· age, and
· proficiency level
4. There is no relationship between Iranian English language learners’ L1 use/exposure in the society and their
· gender,
· age, and
· proficiency level
5. There is no relationship between Iranian English language learners’ language knowledge and their
· gender,
· age, and
· proficiency level
6. There is no relationship between Iranian English language learners’ script/alphabet and their
· gender,
· age, and
· proficiency level
Now....what do you think would be the best statistical test? Do you think I should use MANOVA? Do not forget that I am after "relation" not "effect". How should I enter the data for each subscale.
Thanks in advance to all of you for your kind attention.
Saeed
Free forum by Nabble | Edit this page |