Assessing Reliability of Two Versions of Vocabulary Levels Tests in Iranian Context

Aso Bayazidi, Fateme Saeb


This study examined the equivalence and reliability of the two versions of the Vocabulary Levels Test in an Iranian context. This study was motivated by the fact that the Vocabulary Levels test is increasingly being used in Iran for both research and pedagogical purposes without having been checked for validity and reliability in this context. The equivalence and reliability of the two versions of the test were examined through the parallel-form approach to reliability in Classical True Score theory. Seventy-five intermediate learners of English as a foreign language at the Iran Language Institute took the two versions of the test with one week interval between the two administrations in a counterbalanced fashion. To examine the equivalence of the two versions, the means and variances of the scores obtained for the two tests were compared using paired-sample t-test and one-way ANOVA, respectively. The results of the analyses indicated that the difference between the means of the two versions was significant, and the two versions cannot be considered as parallel forms. To assess the reliability of the two versions, the correlation between the scores obtained from them was estimated using Pearson Product Moment correlation. The results of the analyses showed that the two versions are highly correlated and are reliable tests. It is concluded that the two versions should not be treated as equivalent in longitudinal and gain score studies.



Vocabulary Testing, The Vocabulary Levels Test, Equivalence, Reliability

Full Text:



Alderson, J. C. (2005). Diagnosing foreign language proficiency. London: Continuum.

American Educational Research Association, American Psychological Association, & National Council on Measurement in Education. (1999). Standards for educational and psychological testing. Washington, DC: Authors.

Bachman, L.F. (1990). Fundamental considerations in language testing. Oxford: Oxford University Press.

Beglar, D. & Hunt, A. (1999). Revising and validating the 2000 word level and university word level tests. Language Testing, 16, 131-162.

Cobb, T. (1997). Is there any measurable learning from hands-on concordancing? System, 25, 301-315.

D'Anna, C.A., Zechmeister, E.B., & Hall J.W. (1991). Toward a meaningful definition of vocabulary size. Journal of Reading Behavior, 23(1), 109-122.

Elley, W. (1989). Vocabulary acquisition from listening to stories. Reading Research Quarterly, 24(2), 174-187.

Ellis, R. (1994). Factors in the incidental acquisition of second language vocabulary from oral input: A review article. Applied Language Learning, 5(1), 1-32.

Goulden, R., Nation, P., & Read J. (1990). How large can a receptive vocabulary be? Applied Linguistics, 11(4), 341-363.

Haynes, M., & Baker, I. (1993). American and Chinese readers learning from lexical familiarization in English texts. In T. Huckin, M. Haynes, & J. Coady (Eds.). Second language reading and vocabulary acquisition (pp. 130-152). Norwood, NJ: Ablex.

Hinkel, E. (2001). Giving examples and telling stories in academic essays. Issues in Applied Linguistics, 12, 149-170.

Hu, M., & Nation, I.S.P. (2000). Vocabulary density and reading comprehension. Reading in a Foreign Language, 23(1), 403-430.

Huckin, T. & Bloch, J. (1993). Strategies for inferring word meaning in contexts: A cognitive model. In T. Huckin, M. Haynes, & J. Coady (Eds.). Second language reading and vocabulary acquisition (pp. 130-152). Norwood, NJ: Ablex.

Joe, A. (1998). What effects do text-based tasks promoting generation have on incidental vocabulary acquisition? Applied Linguistics, 19(3), 357-377.

Larson-Hall, J. (2010). A guide to doing statistics in second language research using SPSS. New York, NY: Routledge.

Laufer, B., & Nation, P. (1995). Vocabulary size and use: Lexical richness in L2 written production. Applied Linguistics, 16(1), 307-322.

Laufer, B., & Paribakht, T.S. (1998). The relationship between passive and active vocabularies: effects of language learning context. Language Learning, 48, 365-391.

Lee, S. (2003). ESL learners' vocabulary use in writing and the effects of explicit vocabulary instruction. System, 31, 537-561.

Meara, P. (1996). The dimensions of lexical competence. In G. Brown, K. Malmkjaer, & J. Williams (Eds.). Performance and competence in second language acquisition (pp. 35-53). Cambridge: Cambridge University Press.

Nation, P. (1983). Testing and teaching vocabulary. Guidelines, 5, 12-25.

Nation, I.S.P. (1990). Teaching and learning vocabulary. New York: Newbury House.

Nation, I.S.P. (2006). How large a vocabulary is needed for reading and listening? Canadian Modern Language Review, 63(1) 59-82.

Read, J. (1988). Measuring the vocabulary knowledge of second language learners. RELC Journal, 19, 77-89.

Schmitt, N., Schmitt, D., & Clapham, C. (2001). Developing and exploring the behaviour of two new versions of the Vocabulary Levels Test. Language Testing, 18 (1), 55-88.

Schmitt, N. (2010). Researching vocabulary: A vocabulary research manual. London: Palgrave Macmillan.

Schmitt, N., & Meara, P. (1997). Researching vocabulary through a word knowledge framework: word associations and verbal suffixes. Studies in Second Language Acquisition, 19, 17-36.

Wilkins, D.A. (1972). Linguistics in language teaching. London: Arnold.

Xing, P., & Fulcher, G. (2007). Reliability assessment for two versions of Vocabulary Levels Tests. System 35, 182–191.



  • There are currently no refbacks.

Creative Commons License
This work is licensed under a Creative Commons Attribution 4.0 International License.

2010-2019 (CC-BY) Australian International Academic Centre PTY.LTD.

Advances in Language and Literary Studies

You may require to add the '' domain to your e-mail 'safe list’ If you do not receive e-mail in your 'inbox'. Otherwise, you may check your 'Spam mail' or 'junk mail' folders.