InstructionPrompt: You have been recently hired as a junior analyst by D.M. Pan Real Estate Company. The sales team has tasked you with preparing a report that examines the relationship between the selling price of properties and their size in square feet. You have been provided with a Real Estate County Data document that includes properties sold nationwide in recent years. The team has asked you to select a region, complete an initial analysis, and provide the report to the team. Note: In the report you prepare for the sales team, the response variable (y) should be the median listing price and the predictor variable (x) should be the median square feet. These are elements that need specifically addressed: Generate a Representative Sample of the Data Select a region (file attached below) and generate a simple random sample of 30 from the data. (Any of your choosing) Report the median listing price and median square foot, report the mean, median, and standard deviation. Analyze Your Sample Discuss how the regional sample created is or is not reflective of the national market. Compare and contrast your sample with the population using the National Statistics and Graphs document. Explain how you have made sure that the sample is random. Explain your methods to get a truly random sample. Generate Scatterplot Create a scatterplot of the x and y variables noted above and include a trend line and the regression equation Observe patterns Answer the following questions based on the scatterplot: Define x and y. Which variable is useful for making predictions? Is there an association between x and y? Describe the association you see in the scatter plot. What do you see as the shape (linear or nonlinear)? If you had a 1,200 square foot house, based on the regression equation in the graph, what price would you choose to list at? Do you see any potential outliers in the scatterplot? Why do you think the outliers appeared in the scatterplot you generated? What do they represent? Formatting is in this order: (or just write what each section is and I can format myself) Introduction [Include in this section a brief overview, including the purpose of the report.] Representative Data Sample [Present your simple random sample of 30, including the region you selected for your sample. Then identify the mean, median, and standard deviation of the median listing price and the median square foot variables.] Data Analysis [Discuss how the regional sample created is reflective of the national market. Compare and contrast your regional sample with the national population using the National Statistics and Graphs document (attached down below!) found in the Module Two Assignment Guidelines and Rubric. Explain how you have made sure that the sample is random. Explain your methods to get a truly random sample.] Scatterplot [Insert a scatterplot graph of the sample using the x and y variables noted earlier. Include a trend line and the regression equation.] The Pattern [Based on your graph, define each variable, and explain which variable will be useful for making predictions and why.] [Describe the association between x and y in the scatterplot and determine its shape. Identify any outliers you see in the graph and explain why these occur and what they represent.] [If you had a 1,200 square foot house, based on the regression equation in the graph, what price would you choose to list at? Explain.]