GEE with Binary Data

Mesfin Mulatu
Hello All,

I am new to generalized estimating equations and I am wondering if anyone
in the group can help me answer a couple of data setup questions.

I have binary outcome data on the same subjects over a seven year period
(data collected on a yearly basis). Some subjects were present at year 1
and participated in the survey through year 7. Other subjects started in
year 3 and continued to year 7. Still others participated only in years 1
thru 5, but not in years 6 and 7.

I am trying to conduct a series of binary logistic GEEs with the logit
link and autoregressive correlations.

In conducting GEE, is it expected that all subjects have records on all
years of the survey? What are the recommendations for setting up your data
in this type of situation?

Any advice, reading material, and/or annotated example would be greatly
appreciated.

Best regards,

Mesfin

=====================
To manage your subscription to SPSSX-L, send a message to
[hidden email] (not to SPSSX-L), with no body text except the
command. To leave the list, send the command
SIGNOFF SPSSX-L
For a list of commands to manage subscriptions, send the command
INFO REFCARD
Re: GEE with Binary Data

Ryan
Mesfin,

You are permitted to have missing data without losing the remaining data
collected from a subject. It's important that the data are missing
completely at random or at least missing at random. You need to set up your
data in long (a.k.a. vertical) format. Here's an example of how to set up
the dataset [with a couple of missing points]:

Subject  Time  Y
1        1     0
1        2     1
1        3     1
1        4
1        5     1
1        6     0
1        7     0
2        1     1
2        2
2        3     0
2        4     1
2        5     0
2        6     0
2        7     0
.
.
.
N
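
For readers whose data start out with one row per subject and one column per year, the long layout above can be produced with a simple reshape. The following is a minimal sketch in Python (used only for illustration; it is not SPSS syntax). The `wide` dictionary and the `to_long` helper are hypothetical, and subjects 3 and 4 are made-up examples of a late starter and a dropout. The sketch drops years without an observation rather than keeping a row with a blank Y, which is a choice made here for brevity.

```python
# Hypothetical wide-format records: one entry per subject, one value per year.
# None marks a year with no observation (late entry, dropout, or skipped wave).
wide = {
    1: [0, 1, 1, None, 1, 0, 0],
    2: [1, None, 0, 1, 0, 0, 0],
    3: [None, None, 1, 1, 0, 1, 0],   # made-up subject who started in year 3
    4: [0, 0, 1, 0, 1, None, None],   # made-up subject who left after year 5
}


def to_long(wide_rows):
    """Return (subject, time, y) tuples, one per observed measurement."""
    long_rows = []
    for subject, values in sorted(wide_rows.items()):
        for time, y in enumerate(values, start=1):
            if y is not None:  # skip years with no observation
                long_rows.append((subject, time, y))
    return long_rows


long_rows = to_long(wide)
for row in long_rows:
    print(*row)
```

Each subject ends up with as many rows as observed years, so the number of rows per subject is allowed to differ.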

HTH,

Ryan


--
View this message in context: http://old.nabble.com/GEE-with-Binary-Data-tp27772346p27776187.html
Sent from the SPSSX Discussion mailing list archive at Nabble.com.

Re: GEE with Binary Data

Alex Reutter
In reply to this post by Mesfin Mulatu

In addition to Ryan's notes, have you looked at the online help?  Help > Case Studies, then Advanced > Generalized Linear Models > Generalized Estimating Equations gives an example of repeated measures binary logistic regression using the wheeze_steubenville.sav dataset that ships with the product.  These data are also analyzed in Hardin & Hilbe's Generalized Estimating Equations.

Alex
Re: GEE with Binary Data

Mesfin Mulatu
In reply to this post by Mesfin Mulatu
On Thu, 4 Mar 2010 09:18:41 -0600, Alex Reutter <[hidden email]>
wrote:

>In addition to Ryan's notes, have you looked at the online help?  Help >
>Case Studies, then Advanced > Generalized Linear Models > Generalized
>Estimating Equations gives an example of repeated measures binary logistic
>regression using the wheeze_steubenville.sav dataset that ships with the
>product.  These data are also analyzed in Hardin & Hilbe's Generalized
>Estimating Equations.
>
>Alex

Thank you Ryan and Alex for your help. It is good to know that I can
include all of my cases in the analysis. I guess I have to read a bit on
what exactly "missing completely at random" is and if it applies to my
case. I will also check out the online help for additional insight.

Regards,

Mesfin

Re: GEE with Binary Data

Ryan
Mesfin,

I'm sorry I didn't give you the code in my initial response. Assuming time is to be treated as a categorical variable, link=logit, distribution=binomial, and you want a first-order autoregressive, AR(1), working correlation matrix, then the code below is a good starting point:

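* Binary logistic GEE: time as a factor, AR(1) working correlation, robust standard errors.
* With y coded 0/1, REFERENCE=LAST makes 1 the reference category, so P(y=0) is modeled.
* Use REFERENCE=FIRST instead to model P(y=1).
* SUBJECT=id assumes the subject identifier variable is named id.
GENLIN y (REFERENCE=LAST) BY time (ORDER=ASCENDING)
  /MODEL time INTERCEPT=YES
    DISTRIBUTION=BINOMIAL LINK=LOGIT
  /REPEATED SUBJECT=id WITHINSUBJECT=time SORT=YES CORRTYPE=AR(1) ADJUSTCORR=YES COVB=ROBUST
  /PRINT CPS DESCRIPTIVES MODELINFO FIT SUMMARY SOLUTION.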

Much can be said about the code but time is against me right now. There are many issues that come to mind. Feel free to write us back if you have specific questions.
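
To make the AR(1) choice concrete, here is a minimal Python sketch of the textbook AR(1) working correlation, in which the assumed correlation between two measurements on the same subject is rho raised to the number of years between them. The function name `ar1_working_corr` and the value rho = 0.5 are assumptions made for illustration. This describes the general structure only, not how GENLIN computes or estimates it internally.

```python
def ar1_working_corr(times, rho):
    """AR(1) working correlation for one subject's observed time points.

    Entry (j, k) is rho ** |t_j - t_k|, so measurements further apart in
    time are assumed to be less strongly correlated.
    """
    return [[rho ** abs(tj - tk) for tk in times] for tj in times]


# rho = 0.5 is an arbitrary illustrative value, not an estimate.
full = ar1_working_corr([1, 2, 3, 4, 5, 6, 7], rho=0.5)  # observed in all seven years
late = ar1_working_corr([3, 4, 5, 6, 7], rho=0.5)        # entered in year 3

print(len(full), len(late))  # matrix size follows each subject's number of records
print(late[0])               # assumed correlations of year 3 with years 3 through 7
```

A subject with fewer records simply gets a smaller matrix built from the same rule, which is one way to picture why unequal numbers of records per subject do not break the setup.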

HTH,

Ryan

spss v.18 and R version???

Bibel, Daniel (POL)
Just installed V 18 of SPSS and would like to install R -- must I
install version 2.8 of R as per the online documentation (and do I
install 2.8.0 or 2.8.1?) or will SPSS take one of the more current
releases?

Dan Bibel
Massachusetts State Police
(no, I'm a civilian and I can't fix your tickets)

Re: GEE with Binary Data

Mesfin Mulatu
In reply to this post by Mesfin Mulatu
Thank you so much, Ryan, for the code. I will certainly use it as a
starting point.

Let me ask for clarification about the missing data issue. In my dataset,
some cases joined the study at a later point (say, year 3) and others
dropped out early (year 4 or 5). So we have missing data on all variables
for years 1 and 2 for the late starters, and completely missing data for
years 6 and 7 for the dropouts.

Does GEE accommodate this type of missingness? Most discussions of
missingness focus on data missing on specific variables while others
are present.
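
Before fitting anything, it may help to tabulate which years each subject actually has records for, since late entry and dropout both show up as rows that are absent from the long file. The sketch below is in Python for illustration only; the `records` list and the `participation_patterns` helper are hypothetical, and the data are made up. It summarizes participation patterns only and says nothing about whether the missingness is random.

```python
from collections import Counter

# Hypothetical long-format records: (subject, year) pairs that have data.
records = [(1, y) for y in range(1, 8)]    # present in years 1-7
records += [(2, y) for y in range(3, 8)]   # late starter: years 3-7
records += [(3, y) for y in range(1, 6)]   # dropout: years 1-5
records += [(4, y) for y in range(3, 8)]   # another late starter


def participation_patterns(rows, first_year=1, last_year=7):
    """Map each subject to a string such as '..XXXXX' (X = year observed)."""
    years_by_subject = {}
    for subject, year in rows:
        years_by_subject.setdefault(subject, set()).add(year)
    return {
        subject: "".join(
            "X" if year in years else "."
            for year in range(first_year, last_year + 1)
        )
        for subject, years in years_by_subject.items()
    }


patterns = participation_patterns(records)
for pattern, n_subjects in sorted(Counter(patterns.values()).items()):
    print(pattern, n_subjects)
```

Counting how many subjects fall into each pattern gives a quick picture of how much late entry and dropout there is before any model is run.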

Thank you.

Mesfin
