Analysis of summative tests for English

Rusma Setiyana

Abstract


This study analyzed the quality of the English summative tests at MAN Boarding School Meulaboh I in terms of validity, reliability, difficulty index, discrimination index, and the effectiveness of distractors. Content analysis was employed. The data were collected with two techniques, namely a checklist and document analysis. The checklist data were analyzed using statistical procedures, and the document analysis data were analyzed using Anates software version 4. The results showed, firstly, that the validity of the English summative tests at MAN Meulaboh I was on average either sufficient or poor, since the percentages were below 72%. Secondly, the tests had a high and consistent degree of reliability. The index of difficulty was above 70%. Thirdly, 60% of the difficulty indices in the first grade test, 48% in the second grade test, and 8% in the third grade test were acceptable. Fourthly, more than half of the discrimination indices were good: 76% in the first grade test, 56% in the second grade test, and 72% in the third grade test. Finally, the effectiveness of distractors in the English summative tests was 53% in the first grade, 67% in the second grade, and 50% in the third grade.
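For readers unfamiliar with the two item statistics named above, the following is a minimal sketch of how a difficulty index and a discrimination index are commonly computed from scored answers. It is not the procedure used in the study, which relied on Anates. The function name `item_analysis`, the 0/1 score matrix, and the 27% upper and lower group split are assumptions made for illustration only.

```python
def item_analysis(responses, group_fraction=0.27):
    """Compute per-item difficulty and discrimination indices.

    responses: one list per student, each holding 0/1 scores per item.
    group_fraction: share of students in the upper and lower groups
    (0.27 is a common convention, assumed here, not taken from the study).
    """
    n_students = len(responses)
    n_items = len(responses[0])

    # Rank students by total score, highest first.
    ranked = sorted(responses, key=sum, reverse=True)
    k = max(1, round(n_students * group_fraction))
    upper, lower = ranked[:k], ranked[-k:]

    results = []
    for i in range(n_items):
        # Difficulty index: proportion of all students answering correctly.
        difficulty = sum(r[i] for r in responses) / n_students
        # Discrimination index: upper-group minus lower-group proportion correct.
        p_upper = sum(r[i] for r in upper) / k
        p_lower = sum(r[i] for r in lower) / k
        results.append({
            "item": i + 1,
            "difficulty": difficulty,
            "discrimination": p_upper - p_lower,
        })
    return results


# Hypothetical scores: four students, four items (invented for illustration).
scores = [
    [1, 1, 1, 1],
    [1, 1, 0, 1],
    [1, 0, 0, 1],
    [0, 0, 0, 1],
]
for row in item_analysis(scores):
    print(row)
```

In this sketch an item that every student answers correctly gets a discrimination index of zero, which is why very easy items tend to be flagged in an item analysis.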


Keywords


summative test, item analysis








P-ISSN: 2085-3750

E-ISSN: 3025-9789 


This work is licensed under a Creative Commons Attribution 4.0 International License.