Week2 assignment – Frequency Distribution

This week’s assignment was easy enough to write, but cleaning up the data was another ballgame! Since I had created my own dataset and codebook, I ended up with variables that don’t really fit the question. While I was determined not to let it deter me, getting my data to behave was a time-consuming affair and pretty much took up my entire week.

With only a few hours to go, I decided to create a separate question just for the assignments, using the standard codebooks. So please don’t be bewildered if the previous posts talk about a totally different hypothesis.

So the new research question is as follows:

  • How many Americans believe the great American dream (becoming wealthy, owning a home, etc.) is still beyond reach?

The refined question would be:

  • How many people in the age range 25-34 still feel it is possible and how strongly do they believe it?
  • Is the belief stronger in particular sub-groups, i.e. married people, or people of a specific ethnicity or education level?

The data for this question is in the file anu_ool_pds-w5. The codebook explaining all the variables is available in the file anu-ool-pds-codebook-new. I have not included the research papers I looked up for this particular question, since they are not relevant to this week’s assignment.

For the purpose of this assignment, I have calculated frequency distributions for five variables: W1_F3, W1_F4_B, W1_F4_D, W1_F5_A and W1_F6. These hold the answers to five questions from the survey.

  • W1_F3: A basic American belief has been that if you work hard you can get ahead and reach the goals you set and more. Is this true or false today? The answer options were Extremely true (1), Moderately true (2), Slightly true (3), Neither (4), Slightly false (5), Moderately false (6), Extremely false (7) and Refused (-1).
  • W1_F4_B: [To have a financially secure retirement] For yourself and people like you, how easy or hard is it to reach these goals? Answer options were Very hard (1), Somewhat hard (2), Somewhat easy (3), Very easy (4), Refused (-1).
  • W1_F4_D: [To become wealthy] For yourself and people like you, how easy or hard is it to reach these goals? Answer options were Very hard (1), Somewhat hard (2), Somewhat easy (3), Very easy (4), Refused (-1).
  • W1_F5_A: [To own a home] For your children or the children of people like yourself, how easy or hard will it be for your children to reach these goals in the future? Answer options were Very hard (1), Somewhat hard (2), Somewhat easy (3), Very easy (4), Refused (-1).
  • W1_F6: How far along the road to your American Dream do you think you will ultimately get on a 10-point scale where 1 is not far at all and 10 nearly there? The participants could also refuse this question (-1).
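The answer codes listed above can be kept as small lookup tables so that numeric values are easier to read in output. This is only a sketch in Python, not the program used for the assignment; the dictionary and function names are made up for illustration, while the codes and labels are copied from the list above.

```python
# Answer codes for W1_F3 as listed in the post; the dict name is illustrative.
W1_F3_LABELS = {
    -1: "Refused",
    1: "Extremely true",
    2: "Moderately true",
    3: "Slightly true",
    4: "Neither",
    5: "Slightly false",
    6: "Moderately false",
    7: "Extremely false",
}

# The three "easy or hard" items (W1_F4_B, W1_F4_D, W1_F5_A) share one scale.
EASY_HARD_LABELS = {
    -1: "Refused",
    1: "Very hard",
    2: "Somewhat hard",
    3: "Somewhat easy",
    4: "Very easy",
}


def decode(value, labels):
    """Return the answer text for a code, or 'Missing' for None or an unknown code."""
    return labels.get(value, "Missing")


print(decode(2, W1_F3_LABELS))
print(decode(-1, EASY_HARD_LABELS))
print(decode(None, W1_F3_LABELS))
```

Note that Refused (-1) is a real answer code, while a blank response is a separate case; the frequency tables in this post treat the two differently.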

The program was first run for all participants (2295 responses) and then specifically for participants in the age range 25-34 (295 responses). The program code for both distributions is attached in the PDF file AssignmentW2_pgm_code.
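For readers who want to see the idea without opening the PDF, here is a rough Python sketch of the same two-step run: one frequency table for everyone, then one for the 25-34 subset. It is not the attached program. The `freq_table` helper, the `(age, answer)` layout and the ten respondents are all invented for illustration and are not survey data.

```python
from collections import Counter


def freq_table(values):
    """Return (rows, n_missing); each row is (value, frequency, percent,
    cumulative frequency, cumulative percent). None counts as missing."""
    present = [v for v in values if v is not None]
    n_missing = len(values) - len(present)
    counts = Counter(present)
    total = len(present)
    rows = []
    running = 0
    for value in sorted(counts):
        running += counts[value]
        rows.append((
            value,
            counts[value],
            round(100 * counts[value] / total, 2),
            running,
            round(100 * running / total, 2),
        ))
    return rows, n_missing


# Hypothetical respondents as (age, W1_F3 answer); NOT the survey data.
respondents = [
    (22, 1), (27, 2), (31, 2), (45, 3), (29, -1),
    (33, None), (58, 7), (26, 3), (40, 2), (34, 1),
]

# First run: every respondent.
all_rows, all_missing = freq_table([answer for _, answer in respondents])

# Second run: only respondents aged 25 to 34 inclusive.
young = [answer for age, answer in respondents if 25 <= age <= 34]
young_rows, young_missing = freq_table(young)

for label, rows, missing in [("All", all_rows, all_missing),
                             ("Age 25-34", young_rows, young_missing)]:
    print(label)
    for row in rows:
        print("  ", *row)
    print("   Frequency Missing =", missing)
```

In this sketch, percentages are computed over non-missing answers only, and the missing count is reported separately.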

The results for the first set (all participants) are as follows:

Belief in achieving American Dream
W1_F3     Frequency   Percent   Cumulative Frequency   Cumulative Percent
-1               44      1.92                     44                 1.92
1               331     14.43                    375                16.35
2               641     27.94                   1016                44.29
3               521     22.71                   1537                67.00
4               275     11.99                   1812                78.99
5               214      9.33                   2026                88.32
6               157      6.84                   2183                95.16
7               111      4.84                   2294               100.00
Frequency Missing = 1

Achieving financially secure retirement
W1_F4_B   Frequency   Percent   Cumulative Frequency   Cumulative Percent
-1               67      2.92                     67                 2.92
1               733     31.95                    800                34.87
2              1012     44.12                   1812                78.99
3               390     17.00                   2202                95.99
4                92      4.01                   2294               100.00
Frequency Missing = 1

Achieving wealth
W1_F4_D   Frequency   Percent   Cumulative Frequency   Cumulative Percent
-1               65      2.83                     65                 2.83
1              1258     54.84                   1323                57.67
2               755     32.91                   2078                90.58
3               164      7.15                   2242                97.73
4                52      2.27                   2294               100.00
Frequency Missing = 1

Owning a home
W1_F5_A   Frequency   Percent   Cumulative Frequency   Cumulative Percent
-1               84      3.66                     84                 3.66
1               507     22.10                    591                25.76
2               940     40.98                   1531                66.74
3               578     25.20                   2109                91.94
4               185      8.06                   2294               100.00
Frequency Missing = 1

How close to achieve the American Dream
W1_F6     Frequency   Percent   Cumulative Frequency   Cumulative Percent
-1               59      2.57                     59                 2.57
1                70      3.05                    129                 5.62
2                59      2.57                    188                 8.20
3                98      4.27                    286                12.47
4               152      6.63                    438                19.09
5               362     15.78                    800                34.87
6               336     14.65                   1136                49.52
7               440     19.18                   1576                68.70
8               350     15.26                   1926                83.96
9               165      7.19                   2091                91.15
10              203      8.85                   2294               100.00
Frequency Missing = 1

These results can also be seen in the PDF here: FREQ_anu_w2_asgt-results.

Similarly, the frequency distributions for participants in the age group 25-34 are in the file anu_w2_asgt-results. In both files, the summary tables show the values each variable takes, how often each value occurs (the count), and whether any data is missing. For example, the title of the last table, “How close to achieve the American Dream”, is a summary of the question behind W1_F6. The Frequency and Percent columns give the count for each answer option and the percentage of responding participants (2294, excluding the one missing value) who chose it. Only one value was missing, as shown at the end of the table.
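To make the columns concrete, the Percent and cumulative columns for W1_F6 can be rebuilt from the counts alone. The short Python sketch below does that, using the frequencies copied from the all-participants table in this post; the variable names are illustrative, and this is not the program used to produce the original output.

```python
# Counts for W1_F6 copied from the table in the post (all participants).
w1_f6_counts = {-1: 59, 1: 70, 2: 59, 3: 98, 4: 152, 5: 362,
                6: 336, 7: 440, 8: 350, 9: 165, 10: 203}

# Denominator: responses with a recorded value (the missing one is excluded).
total = sum(w1_f6_counts.values())

table = []
running = 0
for value, count in sorted(w1_f6_counts.items()):
    running += count  # cumulative frequency up to and including this value
    table.append((value, count, round(100 * count / total, 2),
                  running, round(100 * running / total, 2)))

for row in table:
    print("{:>3} {:>5} {:>6.2f} {:>5} {:>7.2f}".format(*row))
print("Total non-missing:", total)
```

The counts add up to 2294, which together with the one missing value gives the 2295 responses mentioned earlier.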

Please review the 2 result files and share your opinions in the peer feedback or in the comments section below. Thanks!
