Assignment One: Use Excel to Analyze ABS Table Builder Data
– Why are we doing this assignment?
– Specific educational objectives – Learn how to:
– Why do we need to know this?
– What exactly are you supposed to produce?
– Due: Monday, 6 April, at 9 AM, at My Uni
– Required file format
– Length:
– MS Word file
– Cover page
– 1 page, with name, student number and date of submission
– Textual Report
– 1.5 pages (about 500 words),single spaced
– 12 pt size Times New Roman, 1 inch margins on all sides
– Excel File
– pasted and formatted tables
– calculated indicators
– organized combined table
– scatterplot with a regression line
– in the nutshell
– pick two variables from the ABS Table Builder Data Sets, using
Australian Census data (2006, 2011, or 2016 Censuses)
– formulate a hypothesis that they are causally related
– test that hypothesis
– write up the findings
Structure
– Textual Report (MS Word file) (one single-spaced page, about 500 words)
1) Justify Variable Selection(1-2 paragraphs)
– Explain Dependent Variable
– what is it?any concerns with the way it was measured?
– find out how it was measured in Census Dictionary
(see p.6 below)
– Explain Independent Variable
– what is it? any concerns with the way it was measured?
– Explain the Hypothesis
– how do you think that these two variables are related?
– direction and strength of the relationship
– Provide a causal justification (story)
2) Univariate Analysis (1 paragraph) (covered in the Labs #1 and #2)
– Dependent Variable:
– mean, median, mode, standard deviation
– Independent Variable:
– mean, median, mode, standard deviation
3) Bivariate Analysis (1-2 paragraphs) (covered in the Labs #2 and #3)
– write down the regression equation
– report and interpret b, r
– describe the direction and strength of the relationship
– interpret scatter plot
– any issues with outliers or non-linearity?
– if yes, deal with them and interpret results
4) Conclusion (1-2 paragraphs)
– was your initial hypothesis supported or not?
– discuss relevance of ecological fallacy
-Excel File
– should contain
– calculated (with Excel formulae, NOT in another way):
-rates, then: mean, median, standard deviation
– calculated (with Excel formulae, NOT in another way):
– rates, then: mean, median, standard deviation
– combined table, correctly organized with matching years and names of states and territories
– calculated (with Excel formulae, NOT in another way):
– intercept, slope, correlation
– scatterplot (with regression line and regression equation)
– if needed, outliers and non-linearity dealt with
– Format
– name your assignments file using this form:
– “A1_Text_Firstname_Lastname.doc”
– “A1_Data_Firstname_Lastname.xls”
– for example:
– “A1_Text_Djordje_Stefanovic.doc”
– “A1_Data_Djordje_Stefanovic.xls”
– text file
– use correct grammar and spell check
– sentence construction
– make sure that sentences are grammatical
– prefer short and focused sentences to very long and unfocused ones
– use good style
– do not use “can’t” or “don’t”
– avoid using “this”, “that” and “it”
– use paragraphs and full sentences
– do not use the point form
– paragraph length
– not shorter than 3 sentences
– not more than 2/3 of the page
– try to write as clearly as you can
– social science jargon is avoided
– if you have to use it then it is (briefly) defined and used correctly
– capitalize your two variables:
– wrong: university education
– correct: University Education
– data file
– only 2 decimal places should be used in calculation
Grading Key
1) Format (10%)
2) Variable Justification (20%)
3) Calculations and Presentation: Individual Variables (20%)
4) Calculations and Presentation: Association (20%)
5) Interpretation of Results (20%)
6) Conclusion (10%)
How do I do this?
– to be posted Friday this week
– must be ratio level
– must not be one of the two variables used in the model assignment
– you must be able to come up with a story (causal argument) that explains the possible relationship between the two variables (but no lit review)
– two variables must have data for all states/territories and for the same year
– review how the variables were measured at Census dictionary:
https://guest.censusdata.abs.gov.au/webapi/jsf/dataCatalogueExplorer.xhtml
– if they are absolute numbers, you will need to calculate percentages or rates
(such as the homicide rate per 10,000 people in Australian states/territories)
– some trial and error is a normal part of the analytical work
– paste and format tables
– do calculations
– produce a scatterplot with a regression line
– look at the model assignment, Excel file, to make sure you did it right
– determine whether you have problems with skewness
– decide which indicators of central tendency and spread are
the best to report
– determine whether you have problems with outliers or non-linearity
– take measures as appropriate (see Lab #2 and #3)
– determine strength and direction of the relationship
– use the Excel results and your analysis
– look into the model assignment (text file) to make sure you are doing it right
– do not worry about the grammar, style, etc at this point
– fix paragraph structure, transitions between paragraphs
– fix grammar, style, spelling
9. Submit the Assignment One at