comparing datasets without using match or merge file command

classic Classic list List threaded Threaded
3 messages Options
Reply | Threaded
Open this post in threaded view
|

comparing datasets without using match or merge file command

akrobbins
Is there a syntax that can be used to compare for duplicates and unique cases between datasets without using the match or merge file command? I've been tasked to compare two datasets to determine duplicates and unique cases with the key variable being SSN. However the datasets can't be matched/merge to check for duplicates or unique cases (long story for me to explain). I am use to using the match/merge file syntax command this is the first time I have encountered this situation and am not sure if this procedure is even possible? Any help will be appreciated.



AKR
Reply | Threaded
Open this post in threaded view
|

Re: comparing datasets without using match or merge file command

Barnett, Adrian (DECD)
Hi Athena
Are you able to write out just the SSNs from each file into 2 new files, and use the normal MATCH FILES procedures for finding duplicates on those?

If there is some reason preventing that, it is difficult to imagine any feasible way in which you can complete the task.

Adrian Barnett
Project Officer
Educational Measurement and Analysis
Data and Educational Measurement
DECS
ph 82261080

-----Original Message-----
From: SPSSX(r) Discussion [mailto:[hidden email]] On Behalf Of akrobbins
Sent: Sunday, 10 January 2010 10:14 AM
To: [hidden email]
Subject: comparing datasets without using match or merge file command

Is there a syntax that can be used to compare for duplicates and unique cases
between datasets without using the match or merge file command? I've been
tasked to compare two datasets to determine duplicates and unique cases with
the key variable being SSN. However the datasets can't be matched/merge to
check for duplicates or unique cases (long story for me to explain). I am
use to using the match/merge file syntax command this is the first time I
have encountered this situation and am not sure if this procedure is even
possible? Any help will be appreciated.



AKR
--
View this message in context: http://old.nabble.com/comparing-datasets-without-using-match-or-merge-file-command-tp27093993p27093993.html
Sent from the SPSSX Discussion mailing list archive at Nabble.com.

=====================
To manage your subscription to SPSSX-L, send a message to
[hidden email] (not to SPSSX-L), with no body text except the
command. To leave the list, send the command
SIGNOFF SPSSX-L
For a list of commands to manage subscriptions, send the command
INFO REFCARD

=====================
To manage your subscription to SPSSX-L, send a message to
[hidden email] (not to SPSSX-L), with no body text except the
command. To leave the list, send the command
SIGNOFF SPSSX-L
For a list of commands to manage subscriptions, send the command
INFO REFCARD
Reply | Threaded
Open this post in threaded view
|

Re: comparing datasets without using match or merge file command

Albert-Jan Roskam
In addition, you could encrypt the ssn's first, or have somebody else do the encryption, and than follow Adrian's advice.

Cheers!!
Albert-Jan

~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~
In the face of ambiguity, refuse the temptation to guess.
~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~

--- On Sun, 1/10/10, Barnett, Adrian (DECS) <[hidden email]> wrote:

From: Barnett, Adrian (DECS) <[hidden email]>
Subject: Re: [SPSSX-L] comparing datasets without using match or merge file command
To: [hidden email]
Date: Sunday, January 10, 2010, 10:13 PM

Hi Athena
Are you able to write out just the SSNs from each file into 2 new files, and use the normal MATCH FILES procedures for finding duplicates on those?

If there is some reason preventing that, it is difficult to imagine any feasible way in which you can complete the task.

Adrian Barnett
Project Officer
Educational Measurement and Analysis
Data and Educational Measurement
DECS
ph 82261080

-----Original Message-----
From: SPSSX(r) Discussion [mailto:SPSSX-L@...] On Behalf Of akrobbins
Sent: Sunday, 10 January 2010 10:14 AM
To: SPSSX-L@...
Subject: comparing datasets without using match or merge file command

Is there a syntax that can be used to compare for duplicates and unique cases
between datasets without using the match or merge file command? I've been
tasked to compare two datasets to determine duplicates and unique cases with
the key variable being SSN. However the datasets can't be matched/merge to
check for duplicates or unique cases (long story for me to explain). I am
use to using the match/merge file syntax command this is the first time I
have encountered this situation and am not sure if this procedure is even
possible? Any help will be appreciated.



AKR
--
View this message in context: http://old.nabble.com/comparing-datasets-without-using-match-or-merge-file-command-tp27093993p27093993.html
Sent from the SPSSX Discussion mailing list archive at Nabble.com.

=====================
To manage your subscription to SPSSX-L, send a message to
LISTSERV@... (not to SPSSX-L), with no body text except the
command. To leave the list, send the command
SIGNOFF SPSSX-L
For a list of commands to manage subscriptions, send the command
INFO REFCARD

=====================
To manage your subscription to SPSSX-L, send a message to
LISTSERV@... (not to SPSSX-L), with no body text except the
command. To leave the list, send the command
SIGNOFF SPSSX-L
For a list of commands to manage subscriptions, send the command
INFO REFCARD