Examining washback in second language education contexts: A high stakes provincial exam and the teacher factor in classroom practice in Quebec secondary schools
Carolyn E Turner
Integrated Studies in Education, McGill University, Quebec Canada
PP: 103 - 123
Abstract
Research has helped us understand that high stakes tests can have some sort of influence at the micro level on classroom practice and activity. To date, much variation has been reported.
This paper discusses washback in second language (L2) education contexts and specifically the teacher factor in classroom activity. It draws on four areas of research: washback research in second language (L2) and general education, the teacher factor in washback, alignment of assessment internal and external to the classroom, and mixed methods research. It provides teacher case study examples from a larger study probing into teacher practice and beliefs when dealing with a new high stakes English as a second language (ESL) provincial exit exam at the high school level.
The participants are from the population of ESL secondary teachers in the French school system in the province of Quebec in Canada. The main research question is: How do teachers mediate between classroom assessment activity and preparing students for upcoming external exams?
Results indicate that teachers used common overall approaches, but that there was variation in individual practice. When first introduced to the new exam material, teachers used a formative assessment approach. As the exam time neared, their practice evolved into a summative assessment approach. This phenomenon demonstrated an interfacing or 'blurring' of formative and summative assessment in an attempt to align classroom and external exam assessment. Implications are discussed pertaining to a coherent education system across curriculum, teaching, learning and assessment.
Keywords
washback, impact of L2 high stakes exams on pedagogy and learning, teacher factor, alignment of assessment internal and external to the classroom, mixed methods research
Article Text
Research has helped us understand that high stakes tests can have some sort of influence at the micro level on classroom practice and activity (whether the test is administered as part of a curriculum innovation, whether it is administered as an exit or certification test that claims to represent a curriculum, etc.) (Cheng et al., 2004).
The specific nature of this influence still needs clarification, and it appears due to its reported variation, that there are several factors that need to be looked at closely, systematically and critically (Alderson, 2004; McNamara, 2006). Whether high stakes tests can influence teacher beliefs and student performance is even less clear. Even though there is evidence to this effect, we still need more long-term and follow-up studies to learn about the nature of such claims (Davison, 2004; Shih, 2007).
... continues ...
References
Alderson, J.C. (2004). Forward. In L. Cheng, Y. Watanabe & A. Curtis (Eds.), Washback in language testing: Research contexts and methods (pp. ix-xii). Mahwah, NJ: Lawrence Erlbaum Associates.
Alderson, J.C. & Hamp-Lyons, L. (1996). TOEFL preparation courses: a study of washback. Language Testing, 13(3), 280-297.
Alderson, J.C. & Wall, D. (1993). Does washback exist? Applied Linguistics 14, 115-129.
Andrews, S. (2004). Washback and curriculum innovation. In L. Cheng, Y. Watanabe & A. Curtis (Eds.), Washback in language testing: Research contexts and methods (pp. 37-50). Mahwah, NJ: Lawrence Erlbaum Associates.
Andrews, S., Fullilove, J. & Wong. Y. (2002). Targeting washback: A case-study. System, 30, 207-223.
Assessment Reform Group (ARF). Assessment for learning: Research-based principles of assessment for learning to guide classroom practice. Retrieved August 10, 20080 from http://www.qca.org.uk/qca_4336.aspx
Bachman, L.F. & Palmer, A.S. (1996). Language Testing in Practice. Oxford: Oxford University.
Bailey, K.M. (1996). Working for washback: A review of the washback concept in language testing. Language Testing, 13(3), 257-279.
Black, P. & Wiliam, D. (1998a). Assessment and classroom learning. Assessment in Education, 5 (1), 7-73.
Black, P. & Wiliam, D. (1998b). Inside the black box: Raising standards through classroom assessment. Phi Delta Kappan, 80, 139-144.
Black, P. & Wiliam, D. (2003). In praise of educational research: Formative assessment. British Educational Research Journal, 29(5), 623-637.
Blais, J.G. & Laurier, M. (2005, April). Accountability and standardized testing in Quebec and some neighbouring US states. Paper presented at the annual meeting of the American Educational Research Association, Montreal, QC.
Bogdan, R. & Biklen, S. (1998). Qualitative research for education. Cambridge: Cambridge University Press.
Brookhart, S. (2005, April). Research on formative classroom assessment. Paper presented at the American Educational Research Association, Montreal, Quebec.
Burrows, C. (2004). Washback in classroom-based assessment: A study of the washback effect in the Australian adult migrant English program. In L. Cheng, Y. Watanabe & A. Curtis (Eds.), Washback in language testing: Research contexts and method (pp. 113-128). Mahwah, NJ:Lawrence Erlbaum Associates.
Cheng, L., Watanabe, Y. & Curtis, A. (Eds.) (2004). Washback in language testing: Research contexts and methods. Mahwah, NJ:Lawrence Erlbaum Associates.
Colby-Kelly, C. & Turner C.E. (2007). AFL research in the L2 classroom and evidence of usefulness: Taking formative assessment to the next level. Canadian Modern Language Review, 64(1), 9-38.
Creswell, J & Clark, V.L.P. (2007). Designing and conducting mixed methods research. Thousand Oaks, CA: SAGE.
Davison, C. (2004). The contradictory culture of classroom-based assessment: Teacher assessment practices in senior secondary English. Language Testing 21(3), 305-334.
Davison, C. (2008, June). Using summative assessments for formative purposes; The ultimate justification for learners and teachers. Paper presented at the 30th Language Testing Research Colloquium, Hangzhou, China.
Davison, C. & Hamp-Lyons (2008, June). Applying school-based assessment for learning in a formal examination system. Workshop presented at the 30th Language Testing Research Colloquium, Hangzhou, China.
Fox, J. (2004, October). Language test impact: Practices and Possibilities. Paper presented at the Midwestern Association of Language Testers, Cleveland, OH.
Fullan, M.G. & Stiegelbauer, S. (1991). The new meaning of educational change (2nd ed.). New York: Teachers College Press.
Goldstein, H. (1989). Psychometric test theory and educational assessment. In H. Simons & J. Elliot (Eds.), Rethinking assessment (pp. 140-148). Milton Keynes, England: Open University Press.
Gouvernement du Québec, Ministère de l'Éducation. 2008. Document d'Information: Épreuves Uniques, Anglais, Langue Seconde, de cinquième année de secondaire 156-544. Juin, août 2008, janvier 2009, grille d'évaluation. Retrieved August 12, 2008 from http://www.meq.gouv.qc.ca/dgfj/de/docinfosec.htm.
Johnson, R.B. & Onwuegbuzie. A.J. (2004). Mixed methods research: A research paradigm whose time has come. Educational Researcher, 33(7),14-26.
Kim, J. (2007, June). Mixed-methods designs in language testing research: Review and suggestions. Paper presented at the 29th annual Language Testing Research Colloquium, Barcelona, Spain.
Leung, C. & Mohan, B. (2004). Teacher formative assessment and talk in classroom contexts: Assessment as discourse and assessment of discourse. Language Testing, 21 (3), 336-359.
Markee, N. (1997). Managing curricular innovation. Cambridge: Cambridge University Press.
Marshall, C. & Rossman , G.B. (1989). Designing qualitative research. Newbury Park, CA: Sage.
McNamara, T. (1998). Policy and social considerations in language assessment. Annual Review of Applied Linguistics, 18, 304-309.
McNamara, T. & Roever, C. (2006). Language testing: The social dimension. Malden, MA: Blackwell Publishing.
Messick, S. (1996). Validity and washback in language testing. Language Testing, 13(3), 241-256).
Milanovic, M. & Saville, N. (1996). Considering the impact of Cambridge EFL examinations. Cambridge: University of Cambridge Local Examinations Syndicate.
O'Loughlin, K. (2006). Learning about second language assessment: Insights from a postgraduate student on-line subject forum. University of Sydney Papers in TESOL, 1, 71-85.
Pellegrino, J.W, Chudowsky, N. & Glaser, R. (Eds.). (2001). Knowing what students know: The science and design of educational assessment. Washington, DC: National Academy Press.
Rea-Dickins, P. (2006). Currents and eddies in the discourse of assessment: A learning-focused interpretation. International Journal of Applied Linguistics, 16(2), 164-188.
Saif, (nd) Aiming for positive washback: A case study of international teaching assistants. Language Testing, 23(1), 1-34.
Shih, C. (2007). A new washback model of students' learning. Canadian Modern Language, 64(1), 135-161.
Shohamy, E. (1992). Beyond proficiency testing: A diagnostic feedback testing model for assessment foreign language learning. Modern Language Journal, 76, 513-521.
Solomon, P.G. (2002). The assessment bridge: Positive ways to link tests to learning, standards, and curriculum improvement. Thousand Oaks, CA: Corwin Press.
Stobart, G. (2003). The impact of assessment: Intended and unintended consequences. Assessment in Education, 16(2), 139-140.
Teasdale, A. & Leung, C. (2000). Teacher assessment and psychometric theory: A case of paradigm crossing? Language Testing, 17(2), 163-184.
Tesch, R. (1990). Qualitative research: Analysis types and software tools. New York: Falmer.
Turner C.E. (2001a). Developing an empirically based rating scale for evaluating speaking ability at the secondary 4 and 5 levels. (Report) Quebec: Ministry of Education of Quebec.
Turner C.E. (2001b). The need for impact studies of L2 performance testing and rating: Identifying areas of potential consequences at all levels of the testing cycle. In A. Brown et al. (Eds.), Experimenting with uncertainty: Language testing essays in honour of Alan Davies, Studies in Language Testing 11, (pp. 127-139). Cambridge: Cambridge University.
Turner C.E. (2006). Professionalism and high stakes test: Teachers' perspectives when dealing with educational change introduced through provincial exams. TESL Canada Journal, 23(2), 54-76.
Turner C.E. (2008, June). The specificity of the 'research approach' in classroom studies: Probing the predictability of washback through teacher conceptual and instrumental evidence in Quebec high schools. In Turner (Organizer), The 'teacher' factor in washback contexts: Evidence from mixed methods research in Canada, China and Malaysia. Symposium conducted at the 30th Language Testing Research Colloquium, Hangzhou, China.
Turner C.E., Laurier, M. & Wei, J. (2007, June). Health care access for linguistic minorities: Identifying nurse speech acts for assessment. Presented at the 29th Language Testing Research Colloquium (LTRC '07). Barcelona, Spain.
Wall, D. (2000). The impact of high-stakes testing on teaching and learning: Can this be predicted or controlled? System, 28, 499-509.
Watanabe, Y. (2004). Methodology in washback studies. In L. Cheng, Y. Watanabe & A. Curtis (Eds.), Washback in language testing. Research contexts and method (pp. 19-37). Mahwah NJ: Lawrence Erlbaum Associates.
Watanabe, Y. (2006). Does grammar translation come from the entrance examination? Preliminary findings from classroom-based research. Language Testing, 13(3), 318-333.
Willis, J. (2007). Assessment for learning - Why the theory needs the practice. International Journal of Pedagogies and Learning, 3(2), 52-59.
Woods, D. (1996). Teacher cognition in language teaching. Cambridge: Cambridge University Press.

eContent Home




