Practical Data Analysis with JMP, Third Edition. Robert Carver
Чтение книги онлайн.

Читать онлайн книгу Practical Data Analysis with JMP, Third Edition - Robert Carver страница 28

Название: Practical Data Analysis with JMP, Third Edition

Автор: Robert Carver

Издательство: Ingram

Жанр: Программы

Серия:

isbn: 9781642956122

isbn:

СКАЧАТЬ

      d. Investigate the relationship between the number of birds struck and the height of the aircraft. Write a sentence to describe that relationship.

      8. Scenario: Every ten years, the United States conducts a census of the population, gathering considerable data about the nation and its residents. The data table called USA Counties contains demographic, economic, commercial, educational, and other data about each of the 3,143 counties in the United States as of 2010.

      a. Create a scatterplot of median household income (Y) versus percent of the population with a bachelor’s degree. Comment on what you see.

      b. Compute and report the line of best fit for these data. Use that line to estimate the median household income in counties with 25% of the population holding bachelor’s degrees.

      c. Create and report on a scatterplot between the percentage of households where a foreign language is spoken in the home (foreign_spoken_at_home) and the percentage of households with a foreign-born member (foreign_born). How do you explain the distinctive pattern in the graph?

      d. Compute and explain the correlation coefficient for the two variables in item c above.

      e. Estimate the line of best fit using the population as determined by the 2010 US Census as Y and the 2000 population count as X. Think about the slope of this line. What does it tell us about what happened to the average of US counties’ populations between 2000 and 2010?

      f. The point representing Cook County, Illinois, is distinctive in that it lies below the red estimated line (2000 population was 5,194,675). According to this fitted line, what was unusual about Cook County in comparison to other counties of the United States?

      Chapter 5: Review of Descriptive Statistics

      Overview 87

      The World Development Indicators 87

      Questions for Analysis 88

      Applying an Analytic Framework 89

      Preparation for Analysis 92

      Univariate Descriptions 92

      Explore Relationships with Graph Builder 95

      Further Analysis with the Multivariate Platform 98

      Further Analysis with Fit Y by X 100

      Summing Up: Interpretation and Conclusions 101

      Visualizing Multiple Relationships 101

      The prior four chapters introduced several foundational concepts of data analysis and have also led you through a series of illustrative analyses. In this chapter, we pause to pull together what we have learned about descriptive analysis. This is the first in a series of short review chapters, each of which shares the common goal of recapitulating recent material and calling upon you, the reader, to apply the principles, concepts, and techniques that you have recently studied.

      In the review chapters, you will find fewer step-by-step instructions. Instead, there are guiding questions to remind you of the analytical process that you’ll want to follow. Refer to earlier chapters if you have forgotten how to perform a task. In this and all future chapters, do your computer work within a JMP Project. The examples in this and later review chapters are based on the World Development Indicators, collected and published by the World Bank.

      The World Bank was established in 1944 to assist with the redevelopment of countries after World War II. It has evolved into a group of five institutions concerned with interconnected missions of economic development. One important goal of its work is the alleviation of poverty worldwide, and as part of that mandate, the World Bank annually publishes the World Development Indicators (WDI) gathered from 215 nations. Earlier, we looked at some of the WDI data about birth rates and life expectancy.

      In the public sector as well as in business, policy makers rely on accurate, current data to gauge progress and to evaluate the impact of policy decisions. The WDI data informs policy-making by many agencies globally, and the World Bank’s annual data collection and reporting play an important role in the U.N.’s Millennium Development Goals.

      Sustainable Development Goals

      At the start of the millennium, the United Nations sponsored a Millennium Summit, which led to the adoption of the Millennium Declaration by 189 member states. The declaration laid out an ambitious set of goals including commitments to “combat poverty, hunger, disease, illiteracy, environmental degradation, and discrimination against women. The MDGs are derived from this Declaration, and all have specific targets and indicators.” 2

      The Millennium Development Goals include “8 goals, 21 targets, and 60 indicators for measuring progress between 1990 and 2015, when the goals are expected to be met.”3 More recently, the MDG’s have evolved into Sustainable Development Goals (SDGs). Some of these indicators are identical to the World Bank’s WDIs.

      The eight goals, listed below, are:

      ● Eradicate extreme poverty and hunger

      ● Achieve universal primary education

      ● Promote gender equality and empower women

      ● Reduce child mortality

      ● Improve maternal health

      ● Combat HIV/AIDS, malaria, and other diseases

      ● Ensure environmental sustainability

      ● Develop a Global Partnership for Development

      We return to our introductory example from Chapter 1 about variation in life expectancies worldwide, and further investigate variables that might give us insight into why people in some regions tend to live long than in others. We will initially analyze some of the WDI measures from the year 2016 to explore and to understand their variability, as well as plausible associations between variables. In doing so, we will engage in an extended exercise in statistical reasoning and review several of the descriptive techniques that were presented in the first four chapters.

      In Chapter 1, we speculated about possible factors that might contribute to variation in life expectancies. At that time, we wondered about the impacts of education, health care, basic sanitation, nutrition, political stability, and wealth. Relying on several of the WDIs in our data, we will focus on four constructs as measured by the variables listed in Table 5.1. Notice that several of these variables are indirect measures of the constructs; statisticians sometimes refer to such variables as proxies to indicate that they “stand in” for a difficult-to-measure concept or attribute. Many of the WDIs are reported for all or nearly all the nations, while others are reported more sparsely. For this chapter, we will use indicators that are reported for the large majority of the 215 countries. We will also examine life expectancies in different regions of the world.

      Note that WDI contains data through 2018 for just a few of the 42 columns. In this analysis, we will focus on 2016 because we have data for the variables of interest for most nations.

      Table 5.1: Ten Variables4 Used in This Chapter

ConstructVariable
WealthGross СКАЧАТЬ