Posted by
Bruce Weaver on
Jan 17, 2020; 3:00pm
URL: http://spssx-discussion.165.s1.nabble.com/Employee-data-sav-When-was-the-data-collected-tp5738785p5738788.html
Thanks Rick (and Jon, who told me the same thing via email). If I use
24-Jan-1995 as the date for calculating Age, things work out reasonably
well, although the salaries may be a bit low. Given that it's just for an
assignment, I won't worry about that.
Cheers,
Bruce
* Change path on next line as needed.
GET FILE "C:\SPSSdata\Employee data.sav".
* Assume data are from 1995.
COMPUTE Age = DATEDIFF(DATE.DMY(24,1,1995), bdate, "years").
* Compute Age minus years in current job.
COMPUTE Check1 = Age-jobtime/12. /* jobtime = time in job (mos).
* Compute Age minus years of previous experience.
COMPUTE Check2 = Age-prevexp/12. /* prevexp = previous experience (mos).
DESCRIPTIVES Age Check1 Check2 /STATISTICS=MIN MAX MEAN.
*Descriptive Statistics
N Minimum Maximum Mean
Age 473 23.00 65.00 37.7696
Check1 473 17.58 59.50 31.0078
Check2 473 21.08 60.00 29.7740
.
* Get median & IQR for salaries.
FREQUENCIES VARIABLES=salary
/FORMAT=NOTABLE
/NTILES=4
/STATISTICS=MEAN MEDIAN
/ORDER=ANALYSIS.
*Statistics
Current Salary
Mean $34,419.57
Median $28,875.00
Percentiles
25 $24,000.00
50 $28,875.00
75 $37,162.50
.
Rick Oliver wrote
> It is essentially fake data, although based on real data. It does not
> stand
> up well to close scrutiny. It's based on data used in lawsuit. I think the
> income numbers were already a little outdated when I joined SPSS in 1989.
> At some point we bumped up the values, and we may have updated the age
> values at the same time, while also changing some values to make more
> interesting relationships.
>
> On Thu, Jan 16, 2020, 1:54 PM Bruce Weaver <
> bruce.weaver@
> > wrote:
>
>> Short version of the question: Does anyone know what year the data in
>> the
>> "Employee data.sav" sample file were gathered?
>>
>>
>> Now here's the longer version.
>>
>> Here is a listing of the first 10 records from "Employee data.sav", one
>> of
>> the sample files that comes with (or at least used to come with) SPSS:
>>
>> id gender bdate educ jobcat salary salbegin jobtime prevexp
>> minority
>>
>> 1 m 02/03/1952 15 3 $57,000 $27,000 98 144
>> 0
>> 2 m 05/23/1958 16 1 $40,200 $18,750 98 36
>> 0
>> 3 f 07/26/1929 12 1 $21,450 $12,000 98 381
>> 0
>> 4 f 04/15/1947 8 1 $21,900 $13,200 98 190
>> 0
>> 5 m 02/09/1955 15 1 $45,000 $21,000 98 138
>> 0
>> 6 m 08/22/1958 15 1 $32,100 $13,500 98 67
>> 0
>> 7 m 04/26/1956 15 1 $36,000 $18,750 98 114
>> 0
>> 8 f 05/06/1966 12 1 $21,900 $9,750 98 0
>> 0
>> 9 f 01/23/1946 15 1 $27,900 $12,750 98 115
>> 0
>> 10 f 02/13/1946 12 1 $24,000 $13,500 98 244
>> 0
>>
>> I am considering using it for a class exercise, and if I do, I would like
>> to
>> have the students compute an age variable. Birth date is given, but I
>> don't
>> know when the data were gathered. I used SYSFILE INFO to find the file
>> creation date, which appears to be 24-Jan-2012. But when I use that
>> value
>> to compute age, I get ages ranging from 40 to 82 with a mean of about 55.
>> Those ages are higher than I expected. I could always just knock 20
>> years
>> off and say that the data were gathered in 1992. But it would be nice to
>> have the actual date if anyone knows it.
>>
>> Cheers,
>> Bruce
>>
>>
>>
>>
>> -----
>> --
>> Bruce Weaver
>>
> bweaver@
> LISTSERV@.UGA
> (not to SPSSX-L), with no body text except the
>> command. To leave the list, send the command
>> SIGNOFF SPSSX-L
>> For a list of commands to manage subscriptions, send the command
>> INFO REFCARD
>>
>
> =====================
> To manage your subscription to SPSSX-L, send a message to
> LISTSERV@.UGA
> (not to SPSSX-L), with no body text except the
> command. To leave the list, send the command
> SIGNOFF SPSSX-L
> For a list of commands to manage subscriptions, send the command
> INFO REFCARD
-----
--
Bruce Weaver
[hidden email]
http://sites.google.com/a/lakeheadu.ca/bweaver/"When all else fails, RTFM."
NOTE: My Hotmail account is not monitored regularly.
To send me an e-mail, please use the address shown above.
--
Sent from:
http://spssx-discussion.1045642.n5.nabble.com/=====================
To manage your subscription to SPSSX-L, send a message to
[hidden email] (not to SPSSX-L), with no body text except the
command. To leave the list, send the command
SIGNOFF SPSSX-L
For a list of commands to manage subscriptions, send the command
INFO REFCARD
--
Bruce Weaver
bweaver@lakeheadu.ca
http://sites.google.com/a/lakeheadu.ca/bweaver/"When all else fails, RTFM."
PLEASE NOTE THE FOLLOWING:
1. My Hotmail account is not monitored regularly. To send me an e-mail, please use the address shown above.
2. The SPSSX Discussion forum on Nabble is no longer linked to the SPSSX-L listserv administered by UGA (
https://listserv.uga.edu/).