Read The Bell Curve: Intelligence and Class Structure in American Life Online
Authors: Richard J. Herrnstein,Charles A. Murray
Tags: #History, #Science, #General, #Psychology, #Sociology, #Genetics & Genomics, #Life Sciences, #Social Science, #Educational Psychology, #Intelligence Levels - United States, #Nature and Nurture, #United States, #Education, #Political Science, #Intelligence Levels - Social Aspects - United States, #Intellect, #Intelligence Levels
76
Our topic here is the effect of adoption on raising IQ, not the implications of adoption data for estimating the heritability of IQ. For reviews of the adoption literature, see Herrnstein 1973; Locurto 1990; Munsinger 1975; Plomin and DeFries 1985. A comprehensive theoretical analysis of adoption studies of intelligence is in Turkheimer 1991.
77
Brown 1958, Chap. 5; Lane 1976; Lane and Pillard 1978.
78
Among others inspired by this evidence from “wild children” of the power over the mind of the human environment was an Italian physician trained at the end of the nineteenth century whose approach to education has survived the twentieth, Maria Montessori.
79
Locurto 1990; Plomin and DeFries 1985. In a refinement of this observation, it has been found that adopted children also score lower than the children in other homes that are socioeconomically the same as those of their adoptive parents but have no adopted children (thereby controlling for possible ways in which adoptive parents might be distinctive from non-adoptive parents).
80
Locurto 1990.
81
Dumaret and Stewart 1985; Schiffet al. 1982; Schiff and Lewontin 1986.
82
We will disregard in our analysis a number of considerations that would reduce
estimates of the impact of home environment, such as that the IQ of the schoolmates of the nonadopted half-siblings (who presumably share comparable lower-class surroundings) averaged only seven points less than the adopted children, not twelve. This difference raises the possibility that the adopted-away child seemed brighter in infancy or had better intellectual prospects than the half-sibling who stayed at home because of the parent they did not share, or that the shift in home environments was even more extreme than the estimates below assume it was, as if the adopted child’s biological family home was atypically poor, even for the poor neighborhoods they were in. This, as we explain below, would reduce the over-all estimate of the impact of home environment.
83
The cell sizes in the 2 × 2 table of high-and low-SES adopting and biological parent families were only ten children or fewer.
84
Capron and Duyme 1989. This study showed an even larger benefit—equivalent to sixteen IQ points—of having high-SES biological parents, even when the child was not reared by them, which again points to a heritability greater than .5.
85
This, it should be remembered, is for childhood IQ, which is more subject to the influence of home environment than adult IQ. Recent work has also indicated that how a parent treats a child (presumably also an adopted child) is in part determined by the child’s inherited characteristics. To that extent, speaking of home environment as if it were purely an environmental source of variation is incorrect (see Plomin and Bergeman 1991).
86
A twenty-point swing is easily reconciled with a heritability of .6 for IQ. Suppose the high-and low-SES homes in the French studies represent the 90th and 10th centile of environmental quality, as the text says. A twenty-point swing in IQ from the 2d to the 98th centile of environmental quality would then imply that the standard deviation of home environment effects on IQ is 4.69. Squared, this means a variance of 22 attributable to home environment. But as we noted in note 1, a heritability of .6 implies that there is a variance of 225-135, or 90, attributable to environmental sources. The French adoption studies, in short, are consistent with the conclusion that about a quarter of environmental variance is the variance across homes (if our guesses about the adopting and biological home environments are not way off). Three-quarters of the environmental influence on intelligence must be uncorrelated with the family SES, according to the present analysis. Note again that the balance tips toward environmental factors outside families as being the more relevant than those provided by families in affecting IQ, as mentioned in Chapter 4.
87 For a discussion of cost-benefit considerations, see Haskins, 1989.
Chapter 18
1
“Sharpen your pencil, and begin now,”
Wall Street Journal,
June 9, 1992, p. A16.
2
National Commission on Excellence in Education 1983, p. 5.
3
National Commission on Excellence in Education 1984, p. 58.
4
For an example of an alarmist view and a discussion of the various estimates, see Kozol 1985.
5
National Center for Education Statistics 1992, Table 12-4.
6
DES
1992, Table 95.
7
Ravitch and Finn 1987, p. 49.
8
Congressional Budget Office 1987, p. 16.
9
Congressional Budget Office 1987, p. 16.
10
Quoted in Kozol 1985, p. 9.
11
Four of the studies were conducted by the International Association for the Evaluation of Educational Achievement, known as the IEA. They were the First International Mathematics Study (FIMS), mid-1960s; the First International Science Study (FISS), 1966-1973; the Second International Mathematics Study (SIMS), 1981-1982; and the Second International Science Study (SISS), 1981-1982. The fifth study was initiated by the United States as a spin-off from NAEP. It was conducted in 1988 and is . known as the First International Assessment of Educational Progress (IAEP-I) (Medrich and Griffith 1992).
12
Medrich and Griffith 1992, Appendix B.
13
National Center for Education Statistics 1992, pp. 208-215.
14
The best single source for understanding complexities of international comparisons is the summary and synthesis produced by National Center for Educational Statistics (Medrich and Griffith 1992). Other basic sources in this literature are Walker 1976; McKnight et al. 1989; Keeves 1991. There are cultural factors too. In his vigorous defense of American education, Gerald Bracey tells of the scene in a Korean classroom during one such international test: “As each Korean student’s name was called to come to the testing area, that child stood and exited the classroom to loud applause. What a personal honor to be chosen to perform for the honor of the nation!” American children seldom react that way, Bracey observes (Bracey 1991, p. 113).
15
Bishop 1993b, National Center for Education Statistics 1992a, pp. 60-61.
16
In addition to Bishop 1989, reviewed below, see especially Carlson, Huelskamp, and Woodall 1993; Bracey 1991.
17
Bishop 1989.
18
The Flynn effect refers to gradually rising scores over time on cognitive ability tests, discussed in Chapter 13.
19
NAEP periodically tests representative samples of students at different age levels in mathematics, reading, science, and, more recently, in writing and in history and literature.
20
National Center for Education Statistics, 1991, Fig. 1. The tests were designed to have a mean of 250 and a standard deviation of 50 when taken across all three age groups. The exception to flat trend lines was science performance among 17-year-olds, which shows a fifteen-point decline from 1969 to 1990, somewhat more than .3 SD (we do not know the specific standard deviations for 17-year-olds on the science test; probably it is less than 50). Note also that science among 17-year-olds reflects disproportionately the performance of the above-average students who tend to take high school science—consistent with our broader theme that educational performance deteriorated primarily among the gifted.
21
Two large questions about the table on page 422 immediately present themselves. First, are the five studies accurate representations of the national samples that they purported to select, and are the five tests comparable with each other? The answer to the first half of the question is a qualified yes. The studies were not perfect, but all appear to have been well designed and executed. The qualification is that the data exclude youngsters who did not reach the junior year in high school. The answer to the second half of the question is cloudier, if only because sets of tests administered at different times to different samples always introduce incomparabilities with effects that cannot be assessed precisely. The prudent conclusion regarding the math scores is to discount the modest fall and rise from 1955 to 1983 and assume instead that math aptitude over that period was steady. Regarding the Verbal scores, it seems likely that they rose from 1955 to 1966 and dropped from sometime after 1966 to sometime between 1974 and 1983, with the magnitude and precise timing of those shifts still open to question. Before leaving the norm studies, we must add a proviso: the SAT scales got easier during 1963 to 1973 by about eight to thirteen points on the Verbal and perhaps ten to seventeen points on the Math. They seem to have been stable before and following this period (Modu and Stern 1975, 1977). The same person would, in other words, have earned a higher score on the later SATs than the earlier ones, owing purely to changes in the test scales themselves. Whether the PSAT, a much shorter test, experienced the same degree of drift is unknown, but it is a good idea to adjust mentally the 1974 and 1983 scores downward a bit, though this does not change the overall interpretation of the results.
22
Grades 10 and 11 show a similar pattern. Grade 12 remained slightly under its high (1965-1967) as of 1992, but it is likely that the deficit is explained by increases in the proportion of 17-year-olds retained in school.
The possibility remains open, however, that education in the post-slump period improved more in the lower grades than in the higher ones.
23
Congressional Budget Office 1986.
24
Medrich and Griffith 1992.
25
The College Board added new method of reporting test scores in 1967 based on seniors instead of all tests administered, and continued to report the means for both types of samples through 1977. During the years when both scores were available, the trends were visually almost indistinguishable. In the year when we employed the new measure in the graph on page 425, 1970, the scores for the two methods were identical.
26
Based on the 1963 standard deviations, .49 and .32 SD reductions respectively.
27
For a technical statement of this argument, see Carlson, Huelskamp, and Woodall 1993.
28
Readers can follow the journey through the numbers in Murray and Herrnstein 1992.
29
It is possible that the SAT pool was not getting democratized in the usual socioeconomic sense but was nevertheless beginning to dig deeper into the cognitive distribution. Responses in the SAT student questionnaire indicate that somewhat more students from the bottom of the class were taking the test in 1992 than in 1976, but this effect was extremely small for whites. In 1980, 72.2 percent of whites reported that they were in the top two-fifths of their high school class, compared to 71.5 percent in 1992. We nonetheless explored the possibility that the pool had become cognitively democratized, by looking at the scores of students who reported that they were in the top tenth, the second tenth, and the second fifth of their classes. If their scores went up while those for the entire SAT sample went down, that would be suggestive evidence (if we make certain assumptions about the consistency with which students reported their true class rank) that the pool was drawing from a cognitively broader segment of the population. Using 1980 (the end of the decline) to 1992 as the period of comparison, the Verbal scores of whites who reported they were in the top tenth, 2d tenth, and 2d fifth went up by five, seven, and eight points respectively, while that of the entire white SAT pool remained flat. In Math, the scores of the top tenth, 2d tenth, and 2d fifth went up by nine, thirteen, and fourteen points, respectively, while that of the pool rose by nine points. At first glance, this would seem to be evidence for a strong effect of cognitive democratization. But then we looked at what happened to the scores of white students reporting that they were in the 3d, 4th, and lowest fifths of their classes. Their scores went up by much more: nine, eleven, and ten points, respectively, in the Verbal; seventeen, seventeen, and nine in the Math. We are aware of Simpson’s paradox, which shows how scores
in each interval can go up when scores in the aggregated group go down, but in this case the explanation appears to lie in changes either in the way that students report their class rank, the meaning of class rank, or both. We give “cognitive democratization” credit for two points each in the Verbal and Math, but it is not certain that even that much is warranted.
30
For an argument that the test score decline does in fact represent falling intelligence, see Itzkoff 1993.
31
For a broader discussion of falling SAT scores in the high-scoring segment of the pool, see Singal 1991.
32
From 1967, scores were reported for all test takers; from 1972 through 1976, ETS reported scores for all test takers and for college-bound seniors. To estimate college-bound seniors for 1967-1972, we computed the ratio of college-bound seniors to total test takers for the overlapping years of 1972-1976. For Verbal, the mean ratio was .82, with a high of .88 and a low of .77. For Math, the mean ratio was .78, with a high of .85 and a low of .71. The mean ratios were applied to the data from 1967 to 1972 to obtain an estimate of the number of college-bound seniors.
33
ETS keeps careful watch on changes in item difficulty, which are called “scale drift.” It finds that scores of 650 and above were little affected by scale drift (Modu and Stern 1975; 1977).