How to look for possible name matches in restricted subsets?

Posted by Art Kendall on
URL: http://spssx-discussion.165.s1.nabble.com/How-to-look-for-possible-name-matches-in-restricted-subsets-tp5740478.html

Context: I suspect that field operations may not have been consistent in
ensuring that Pre and Post cases were correctly matched.

There are a few hundred HouseIDs in each city.
Each case has
Country City HouseID PrePost Name sex v1 v2 v3.
PrePost has values 1 'Pre' 2 'Post'.


As a quality check on whether a HouseID plausibly refers to roughly the
same people in both waves, I am looking for a way to get data to eyeball
for Name, sex, v1, v2, and v3.
1) Take the first (say) 5 or 10 HouseIDs with 'Pre' on PrePost in a city.
2) See if there is overlap between the set of names in that HouseID and
those in any HouseID in the 'Post' set for that city.
3) Output needed: 1 'HouseID with some overlap' 2 'No overlap with any
HouseID'.
4) Overlap means that at least 1 Name from a 'Pre' HouseID has a match
among the 'Post' names.
Match means the last word of the two names is the same AND at least 1 of
the other words also matches.
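
One way the rule and the steps above might be sketched, in Python rather
than SPSS syntax. This is only an illustration under several assumptions
that are not in the description: names are split on whitespace and
compared case-insensitively, a one-word name can never match, "first"
HouseIDs means first in file order, and cities are identified by Country
plus City. The function names names_match and classify_pre_houses are
hypothetical.

```python
from collections import defaultdict

def names_match(a, b):
    """True if the last words are equal AND at least one other word is shared."""
    wa, wb = a.lower().split(), b.lower().split()
    if len(wa) < 2 or len(wb) < 2:
        return False  # assumption: a one-word name cannot satisfy the rule
    return wa[-1] == wb[-1] and bool(set(wa[:-1]) & set(wb[:-1]))

def classify_pre_houses(cases, n_houses=5):
    """cases: iterable of dicts with keys Country, City, HouseID, PrePost, Name.

    Returns {(Country, City, HouseID): flag} for the first n_houses 'Pre'
    HouseIDs in each city, where flag is 1 (some overlap) or 2 (no overlap).
    """
    pre = defaultdict(dict)    # city key -> {HouseID: [Pre names]}, in file order
    post = defaultdict(list)   # city key -> all Post names, pooled over HouseIDs
    for c in cases:
        key = (c["Country"], c["City"])
        if c["PrePost"] == 1:
            pre[key].setdefault(c["HouseID"], []).append(c["Name"])
        else:
            post[key].append(c["Name"])

    result = {}
    for key, houses in pre.items():
        post_names = post.get(key, [])
        for house_id in list(houses)[:n_houses]:
            hit = any(names_match(p, q)
                      for p in houses[house_id] for q in post_names)
            result[key + (house_id,)] = 1 if hit else 2
    return result
```

Pooling the Post names per city is enough for the 1/2 flag, because
overlap with any Post HouseID counts. To eyeball the matched cases
themselves, the Post HouseID would have to be kept alongside each name.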






-----
Art Kendall
Social Research Consultants