Posted by
Bruce Weaver on
Nov 19, 2022; 8:47pm
URL: http://spssx-discussion.165.s1.nabble.com/IBM-Support-page-on-the-point-biserial-correlation-tp5741086.html
A recent discussion about the point-biserial correlation (r_pb) on ResearchGate got me thinking about this IBM Support page about r_pb:
https://www.ibm.com/support/pages/point-biserial-correlations-spssI have no real issue with what it says. It is completely true that when X is dichotomous and Y is a metric variable, Pearson r is equivalent to r_pb. However, I think the note should add that one cannot infer from this equivalence of the point estimates that the 95% CI for Pearson r is the correct CI for r_pb. If the help for Stata's -esize- command is correct, the two CIs are not the same. Specifically, the non-central t-distribution is used in computing the correct CI for r_pb. Click on Methods and Formulas here for more info:
https://www.stata.com/manuals/resize.pdfThis is not the only example where two different methods can be used to compute exactly the same point estimate, but where the CI from one method is not the correct CI for the statistic of interest. Consider Pearson r and the slope from a simple linear regression model when both variables have been standardized. The point estimates are indeed equivalent. But the 95% CI for the slope from that regression model is not the correct CI for Pearson r, despite the advice given in this YouTube video:
https://www.youtube.com/watch?v=-dSoWqDyT4EThe correct CI for Pearson r is symmetrical only when the observed value of r = 0; otherwise, it is asymmetrical. And the limits are always within the range -1 to 1. The CI for the slope from an OLS model, on the other hand is always symmetrical; and it can have values outside the range -1 to 1.
So, as noted above, I reckon the IBM Support page should be updated to clarify that one cannot simply use the STATS CORRELATION extension command to get the correct CI for r_pb.
Finally, a question for Jon, who wrote STATS CORRELATION. Is it feasible to add an option to compute the correct CI for r_pb (using the non-central t-distribution)?
Cheers,
Bruce
PS- Anyone interested in the RG and Statalist discussions about this can view them via the links below.
https://www.researchgate.net/post/Q_re_pbis_in_STATA-is_there_a_macro_to_do_multiple_calculations_to_a_table_or_list_or_do_I_have_to_do_them_all_individuallyhttps://www.statalist.org/forums/forum/general-stata-discussion/general/1690078-methods-for-computing-the-point-biserial-correlation
--
Bruce Weaver
bweaver@lakeheadu.ca
http://sites.google.com/a/lakeheadu.ca/bweaver/"When all else fails, RTFM."
PLEASE NOTE THE FOLLOWING:
1. My Hotmail account is not monitored regularly. To send me an e-mail, please use the address shown above.
2. The SPSSX Discussion forum on Nabble is no longer linked to the SPSSX-L listserv administered by UGA (
https://listserv.uga.edu/).