Report on a Study of the Arrowsmith Program for Learning Disabilities
Prepared by William Lancee, Ph.D., January 22, 2003

This is an abridged version of the above research study. A detailed report with statistical tables is available on request from Arrowsmith School.

Introduction:

Learning Disabilities (LD) seriously affect academic and emotional development and are unlikely to remit without specialized intervention. Students with learning disabilities tend to fall farther and farther behind their peers in academic performance and subsequently tend to have a low sense of self-worth. Klein and Mannuza (2000)1 followed 104 children with LD who initially did not have emotional difficulties. Sixteen years later, these children, when compared to 124 controls, had a much lower status occupational level and continued to struggle with a high prevalence of psychiatric and addiction disorders.

Various special education programs have been developed to address learning disabilities. The approach of the Arrowsmith Program is first to distinguish finely between elemental cognitive impairments and then to implement an individualized highly task-oriented program that exercises and challenges the identified deficit. It is thought that these highly targeted exercises create ways for the brain to provide the necessary functionality for encoding and decoding spoken and written discourse, and for storing, organizing, and integrating knowledge. If this is successful, the child can rejoin his or her peers in normal academic progress. It should be understood that successful graduates of the Arrowsmith Program will require some time to make up for the learning time that was lost due to the original impairment. The authors of the Arrowsmith Program have high expectations for their successful graduates and believe that they will become academically and occupationally competitive.

Objective:

At the beginning of 2001, the Toronto Catholic District School Board (TCDSB) enrolled 30 students (grade 2 to grade 7, from 4 schools) in the Arrowsmith Program (AP). These students were identified by the TCDSB as having learning disabilities. All 30 students were below the age-adjusted 33%-tile in at least one of the three subtests of the Wide Range Achievement Test 3 (WRAT3) - (i) spelling, (ii) timed arithmetic, and (iii) word recognition. Twenty seven students scored below 15%-tile in at least one of these tasks - that is, lower than 85% of other students at the same age. An additional 10 learning disabled students from a fifth TCDSB school were assessed over the school year but were not enrolled in the AP. It was the intention that these students would function as a comparison group.

In October, 2002, Arrowsmith School commissioned William Lancee, Ph.D., who is an experienced research scientist at the University of Toronto, Faculty of Medicine, to review, analyse and report on data from the TCDSB/AP study. The data entry, data verification, data organization, and defining statistical transformations were completed by the middle of December, 2002 when data analysis commenced.

Study Design:

The study design was limited by ethical and practical constraints. It was decided that all students from a given class should receive the same intervention. Therefore it was not possible to select AP students on a random basis. Whole classes could have been randomized to receive AP or not, but this would have seriously increased the required sample size and would have been too costly.

Without randomized controls, it would not be possible to definitively attribute differential improvements to the AP program, since the following selection biases may occur.

(1) It might be that AP students were more amenable to spontaneous improvement or other non-AP factors. However, spontaneous improvement in LD is highly unlikely, especially in a single school year.

(2) The possibility that other non-specific factors such as attention and time could have favoured the AP students is also unlikely, since comparison students received similar added attention and learning time.

(3) It could be argued that if students who were selected for AP had less severe LD, they would be more likely to improve. As will be shown in the result section, this possibility could also be rejected, since initial severity was not predictive of degree of improvement.

The approach taken was to test for pre-post improvements in the 30 AP students, and then test whether this was different from the pre-post changes in the 10 comparison students. Because of the small sample sizes, the power to detect statistically significant difference between the two groups was low. Nevertheless, if statistical differences were found, they could be accepted with confidence.

Study Sample:

There were no study dropouts, and all 30 AP students and 10 comparison students were followed up. Because the basis for selection was LD, there was no control over grade and gender.

  AP Students  Comparison Group
  Male   Female   Male   Female
Grade 2     2 1 - -
Grade 3     1 1 1 -
Grade 4     1 6 1 -
Grade 5     11 - 2 1
Grade 6     3 1 - 1
Grade 7     2 1 2 -
Grade 8     - - 1 -
All 20 10 8 2

Although all study students had LD, no single definition was used to select students for the study. There were three comparison students who were close to normal range on at least one WRAT3 subtest (75%-tile or higher). None of the 30 AP were functioning close to this level. This lack of equivalence at the start made direct comparison of the two groups difficult.

At the risk of reducing the size of comparison sample too much, the three highest functioning comparison students were considered to be outliers and were treated as a separate subgroup for some of the statistical analyses.

Measures:

Pre and post measures include the following 12 standardized Achievement and IQ tests:

• the Wide Range Achievement Test 3 (WRAT3)

1 spelling
2 timed arithmetic
3 word recognition

• the Woodcock Reading Mastery

1 word identification
2 word attack
3 word comprehension
4 passage comprehension

• the Monroe-Sherman Achievement Tests

1 copying text
2 auditory memory
3 visual memory

• the Otis-Lennon Mental Ability IQ Test (timed task)

• the Peabody Picture Vocabulary IQ Test (task is not timed)

Eighty of the 960 test administrations were done jointly by two different test administrators (one designated by the TCDSB and one by the AP). These score pairs correlated highly (r=.90). In all but two cases the tests scores were identical, indicating excellent adherence to test protocol.

AP students are assigned exercises based on specific cognitive deficits. Therefore the Arrowsmith Program uses assessment tools (AP authored) to test performance on elemental cognitive functions. These tests were also done at pre and post by the 30 AP students and by the 10 comparison students. The AP tests were used to determine if improvements in standardized tests could be linked to improvements in the hypothesized elemental cognitive functions.

Students in the AP and their teachers and parents completed a comprehensive satisfaction questionnaire at the 12-month follow-up time point.

Relationship between improvements and satisfaction

The 30 AP students, their parents and teachers completed a 24 item satisfaction questionnaire. Improvements were seen by at least 2 raters (teacher and student; student and parent; or teacher and parent) in more than 80% of students in the following areas: reading comprehension; ability to focus on task; understanding ideas; legibility of written work; confidence; self-esteem; and ability to self-advocate. Between 70% and 80% of students were seen as having improved in: telling time; remembering factual information; listening skills; organizational skills; and understanding and following instructions.

The correlation between improved comprehension as seen by teachers correlated highly with the Relative Progress GE scores (Pearson r = 0.49; p<0.01).

Association between improvements in Achievement/IQ tests and improvements in AP tests

If improvements were due to specific AP exercises then we should expect to see strong correlations between changes in AP test scores and changes in Achievement/IQ test scores. The results support the defining principle of the AP program: that carefully targeted exercises aimed at improving elemental cognitive abilities will result in broader improvements seen as increased scores on achievement and IQ tests.

Conclusion

Despite some study design limitations and small sample size, the study results strongly support the Arrowsmith Program as instrumental in changing the developmental course of the majority of children with LD in this sample. In only 12 months, almost one third of the AP students were on a course that brought them closer to their peers. Another 27% improved their performance at the same rate as expected from their non-LD peers, that is, they stayed at the same distance but did not fall further behind. All other AP students (43%) improved at least somewhat on the various achievement tests. None of the 10 students in the comparison group progressed substantially beyond their entry status.

1Klein, R.G. and Mannuza, S. (2000). Children with complicated reading disorders grown up. In L.L. Greenhill (Ed.), Learning disabilities: Implications for psychiatric treatment. Washington: American Psychiatric Press.

Study Consultant

Dr. William Lancee is Head of Research in the Department of Psychiatry at Mount Sinai Hospital and Associate Professor, Department of Psychiatry, University of Toronto. He has a Bachelor in Mathematics from the University of Waterloo. In the first part of his career, he was a statistical consultant to major pharmaceutical companies. Later he received his Ph.D. from the Institute of Medical Science at University of Toronto, and has designed and carried out more than 20 peer reviewed collaborative studies, including large-scale epidemiological studies as well as psychotherapy intervention studies. He is author of three published psychological measurement tools: The Nurse Observed Behaviour Scale; the Staff Patient Interaction Response Scale; and the Perceived Family Burden Scale. He has supervised 15 graduate students. His special interest is in the mathematical modeling of complex systems using cellular automata, genetic programming, and neural networks. He has published 37 papers on a wide variety of topics. Recent papers related to family issues and child development are:

Maunder R., Lancee, W.J., Greenberg, G., Hunter, J., Fernandes, B. Insecure attachment in a subgroup of ulcerative colitis defined by ANCA status. Dig Dis Sci, 45, 2000, 2127-2132.

Hazelton, R. Lancee, W.J., O'Neil, M.K. (1998) The controversial long-term effects of parental divorce: the role of early attachment. Journal of Divorce and Remarriage, 29(1)1-18.

Beitchman, J.H., Brownlie, E.B., Inglis, A., Wild, J., Ferguson, B., Schachter, D., Lancee, W.J., Matthews, R., Wilson, B. (1996) Seven-Year Follow-up of Speech/Language Impaired and Control Children: Psychiatric Outcome. Journal of Child Psychology and Psychiatry 37(8)961-970.

Levene, J.E., Lancee, W.J., Seeman, M.V. (1996) The Perceived Family Burden Scale: Measurement and Validation. Schizophrenia Research, 22, 151-157.

Beiser, M., Lancee, W.J., Gotowiec, A, Sack, W., & Redshirt, R. (1993). Measuring Self-Perceived Role Competence Among First Nations and non-Native Children. Canadian Journal of Psychiatry. Vol 38, 412-419.

Top