Sunday, March 31, 2019

Definition Of Test Types Of Test Education Essay

Definition Of rise Types Of see command EssayThis chapter provides an overview of the theories related to the study concerning with concomitants cleark digest. To be more specific, the review of related literature discusses ab divulge definition of seek, types of analyse, movement riddle, summative stoolvas, side stress of Tes Kendali Mutu, aspects in developing a shew, effectuality in language interrogation, surfeit robustness, the school base curriculum, tips attempt abridgment, vexedy level, disagreement power, distractor function, and theoretical framework.Definition of TestA mental show supposed to be equal to(p) to measure learnedness outcome which distinguish the every champion scholarly persons talent between students already get the hang and non yet the skill clobber. Therefore, testing is unmatched of the powerful tools to measure students abilities as surface as enhance their attitudes towards learning. This picture is supported by Hughes (2003) say that a test is a tool to measure language proficiency of students. Brown (20043) verbalize that a test is a method of step a persons ability experience, or performance in a runn domain. In the alike(p) line, Anthony J Nitko (19836) defined test is systematic social occasion for observing and describing cardinal or more symptomatics of person with the abet of either a mathematical of category system. establish on Cronbach in Azwar (2005) defined a test is a systematic procedure for observing a persons behavior and describing it with the aid of a numerical scale or category system.In short, a test as an instrument of evaluation is a systematic procedure of description, collection and interpretation in rove to measure the test takers act ability, knowledge, and performance what they go through been learned in learning bring and to exit a value judgment. The advise of a test is able to give the valid information on the students abilities and knowledge. Hence, the successfulness of the teaching and learning nookie be seen in the tests results.Types of TestA test bed be classified found on the types of information they provide. Based on Wilmar Tinambunan in his book with the entitled Evaluation of Students Achievement (1998 7-9), the four types of test ar placement test, formative test, diagnostic test, and summative test. The language tests have several(prenominal)(predicate) kinds of use of goods and servicess. Baily (1998) hold that there argon eight kinds of language assessment. They are dexterity test, language dominance test, proficiency test, admission test, placement test, diagnostic test, gain ground test, and achievement test.Based on contrasting classification of several types of tests draw from any experts of language testing depending on variable purposes in testing if related to this study that the types of tests that will be reviewed and used is achievement test and summative test.Achievement TestAchieveme nt test are at once related to language courses, their purposes being to establish how successful individual students, groups of students, or the courses themselves have been achieving objectives. In the book of Dictionary of Language Teaching and follow up Linguistics that compose by Jack C. Richard (19923), he said that an achievement test is a test which measures how much of a language soulfulness has learned with filename extension to particular course of study or political platform of instruction. In the homogeneous line, Brown (2004) defined an achievement test is to see how far students achieve heartys addressed in a curriculum indoors a particular time frame. It inwardness that an achievement test is to measuring students achievement learning outcome which is administered at the end of course of study. The mise en scene of test depicted object must(prenominal) that represents the course they are concerned.Summative TestSummative test is an achievement test admi nist appraised at the end of a course or unit of instruction. Brown (2004) mentioned that concluding exam in a course is example of summative test. Summative test is think to show the standard that the students have now reached in relation to other(a) students at the same stage. Thus summative test is conducted to find out over all learning outcome after set of learning units chopine already done. Summative test is comm only when designed found on subject matter that had been already taught for one semester. Consequences of a test that show the overall learning outcomes is its substance of scope involve all of material have been delivered. The result of summative test is to de full termine successful in learning and teaching, whether or not the students are able to follow the higher(prenominal) level of next instruction program, and students progress. In the final, the level of success rate is stated with a score or value which pen in the form of report card. side of meat te st of Tes Kendali MutuTes Kendali Mutu is a name of appliance judges administered at SMKN 26 capital of Indonesia after teaching and learning process for measuring how far purpose of students learning and teachers teaching achievement reached. As in final semester examination, according to Regulation Minister of National Education No. 20/2007, it is carried out to measure students cleverness achievement at the end of the semester. It can be said that Tes Kendali Mutu is administrated at each final in learning and teaching process in the period one semester and its scope includes all the indicators that represent all the elementary competence at that semester. All groups of subject are well-tried in order to evaluate learning outcome including an position subject. The format of side of meat test of Tes Kendali Mutu of starting grade school year 2011/2012 at SMKN 26 Jakarta is written test with objective multiple cream incidents. Total number English peaks test of Tes Kenda li Mutu is 50 items test which including listening and reading aptitude. Listening skill consists of 15 items test number and reading skill consists of 35 items test number. In reading skill, there are three reading fraction which consists of first section was 15 incomplete short dialogue items fleck section was 5 items error recognition and third section was 15 questions of reading comprehension which divided into different 5 of reading texts.Aspects in Developing a TestIn order to know criteria of a good test for measuring students ability has been reached in learning process, a test as an instrument of evaluation has to meet emergencys the rigour, reliability, and practicality. As stated by Brown (2004), there are three important aspects should ruminate in a test, namely cogency, reliability, and practicality. Gronlund (1998) pointed out that validity is the uttermost to which inferences do from assessment results are appropriate, meaningful and helpful in terms of the pu rpose of the assessment. severeness has to do with the information that the uses in class, it has to be appropriate for the students level, the purpose of the class, and if the meaning of the materials used in class are for the students. In term of achievement test, it can be valid if it is accurate, authentic or valid has been able to measure students learning outcomes achieved, after they achieved the learning process in a certain period. Reliability is consistent and dependable. A test is called tested if the same test is given to the same students on two different occasions and its tests should yield similar results. Practicality means an effective test. A test should be efficient and easyto be implemented. Since the focus of digest was precisely on one aspect of language test, it did not require to be observed as in the other aspects as reliability and practicalityValidity in Language TestingValidity is the most important characteristic of a test or assessment technique bec ause it measures what it purports to measure. Without good validity, all else is lost. In the J.B. Heatons book, Writing English Language Test (1998153), Heaton said that the validity of a test is the extent to which it measures what it is supposed to measure and nothing else. For example, if the teacher wants to measure the language skills, the test that is developed must be able to measure the ability to speak, not writing skills. Further, as insisted by Gronlund in Brown (2004 22) the extent to which inferences made from assessment result are appropriate, meaningful, and useful in terms of the purpose of the assessment. There is no final domineering to measure the validity of a test established but several different kinds of shew may be invoked in support. Arikunto (200665) mentioned two types of validity. They are empirical validity and ordered validity. Empirical validity is as same as numeric analysis. It consists of concurrent validity and predictive validity whereas the logical validity consists of cognitive subject validity and construct validity.However, the writer only focus its investigating on content validation because this study dealt with logical validity that is a way of reviewing in rational by using descriptive analysis method and thus got manifold with qualitative inquiries. Beside, because of the limitation in time and the other aspects in validity need the expert judges. cloy ValidityA test has content validity that relates the attest to the content of the test. The content is related to the goal of what has been taught. Fulcher Davidson (2007) defined content validity as any attempt to show that the content of the test is a interpreter sample from the domain that is to be tested. In the same line, Hughes (2003) said that a test is said to have content validity if its content constitutes a representative sample of language skills and structures, etc. with which meant to be concerned. For example, a grammar tests, it must be mad e up of items relating to the knowledge or control of grammar.In relation to tests content validity in this study, tests content should be tested in accordance with the target competencies which reflected on Standar Kompetensi-Kompetensi Dasar in view of the school found curriculum to be achieved. These Standar Kompetensi-Kompetensi Dasar broken down into skills or behaviors that can be deliberate called indicator. In short, content validity is arranged based on the content in subject that evaluated. Because those subjects are taught written in curriculum, it can be often called curricular validity.According to John A. Upshur (199663), content-related evidence of validity is assessed logically by carefully and systematically examining whether the method and content of the assessment procedure are representative of the kinds of language skill. It means that evidence of content-related validity requires the board discussion as primary method to run into whether a test has content validity or not. In the panel discussion, the expert judges who considered having knowledgeable to do with the subjects tested. They asked to make judgments about the rightness of each item and overall coverage of the domain.The check Based computer programAs cited in the National Education Standards (SNP) section 1 (15), schooling Based Curriculum is operational curriculum that developed and implemented by each educational unit. The developing of curriculum conducted by each educational unit has to follow based on the standard competence and the canonical competence which is designed by Badan Standar Nasional Pendidikan (BSNP). It means that each educational unit is given the authorities in developing their curriculum. Its curriculum is designed as a reference in using the materials, teaching and learning techniques. Based on Regulation Minister of National Education no. 41/2007, The standard competence means minimum competency qualification for student which show mastery of kn owledge, behavior, and skill of student which expected to be achieved in each level and/or each semester of certain subject whereas the basic competence means as a number of abilities that should be mastered by students in certain subject as a reference in developing indicator of competency.Based on the school based curriculum, the standard competences and the basic competences of English in vocational High develop is divided into three different level which are level novice, level elementary, and level intermediate. In relation to this study, there is one Standard competence and eight Basic Competences of English in Vocational High School for take aim Novice. The Standard Competence is Berkomunikasi dengan Bahasa Inggris setara aim Novice. Then, the eight Basic Competences of English in Vocational High School for Level Novice are 1.1 Memahami ungkapan-ungkapan dasar pada interaksi sosial untuk kepentingan kehidupan 1.2 Menyebutkan benda-benda, orang, ciri-ciri, waktu, hari, bul an, dan tahun 1.3 Mendeskripsikan benda-benda, orang, ciri-ciri, waktu, hari, bulan, dan tahun 1.4 Menghasilkan tuturan sederhana yang cukup untuk fungsi-fungsi dasar 1.5 Menjelaskan secara sederhana kegiatan yang sedang terjadi 1.6 Memahami memo dan menu sederhana, jadwal perjalanan kendaraan umum, dan rambu-rambu lalu lintas 1.7 Memahami kata-kata dan istilah asing serta kalimat sederhana berdasarkan rumus and 1.8 Menuliskan undangan sederhana.The English subject itself put in the adaptational group that provides students the ability in the English communication in the background of material related to the majors need either spoken or written. Also, English subject also directs students to be able in speaking in daily communication based on global exigency and developing communication at higher level. As stated in the curriculum that English subject aim for the learners to master basic knowledge and skills to support the achievement of English language competency skills program and apply skills and mastery of English language skills to communicate both verbally and written at the intermediate level. dots Test AnalysisThe activity aims at increase quality of a test called item test analysis. Nitko (1996308) stated that item analysis refers to the process of collecting, summarizing, and using information about individual test items especially information about pupils response to items. Item test analysis is used to identify quality of the test whether it is good, low or not good. As stated by Anastasi and Urbina (1997184), the main aim of item analysis of the teacher made test is to identify its deficiencies. In conducting item test analysis, two methods are able to be used, namely qualitative and quantitative analysis. As stated by Suyata (200916), items test can be examine by quantitative or empirical and qualitative or theoretical. Item test analysis in terms of qualitative covers the content of a test while difficulty level, discrimination power and di stractors function involved quantitative analysis. In the same line, Purwanto (1992) said that by making analysis, it can be known the important things of every single item obtained, the extent to which difficulty level, whether item has discrimination power, and whether all alternative answer (options) suck in the answer. The more explanation about difficulty level, discrimination power, and distractors function are as below impediment LevelAiken (1994 66) said that Difficulty Level is the opportunity correct answer a question at a certain skill level. It is usually expressed in the form of an mightiness which has proportion range 0,00 1,00. If the item test has 0,00 of Difficulty Level Index, it means that no students who answer correctly. While if the item test has 1,00 of Difficulty Level Index, it means that student can answer correctly. In short, the higher of Difficulty Level Index, the easier of item test is soundless and vice versa. Based on Crocker Algina (1986) in ca lculating the item difficulty, divide the number of people answer the item correctly by the total number of people answering item. The proportion for the item is usually denoted as P and is called item difficulty. Arikunto (2006207) explained that a good test is about not too easy or too difficult. The easy test is not able to stimulate students learning. Contrary, the difficult test is able to make students desperate because of out of their reach.As cited by Departemen Pendidikan Nasional, Direktorat Jendral Manajemen Pendidikan Dasar dan Menengah, Direktorat Pembinaan Sekolah Menengah Atas (2008) in Panduan Analisis Butir Soal,The prediction information about the easy item test is as followThe distractor function doesnt work well,Most students answer those item tests correctly it means most students have understood the content in question.The prediction information about the difficult item test is as followItem tests maybe have mistake a cite answer,Item tests have two or more t he correct answers,The content in question hasnt been taught yet or hasnt finished in its learning so that the minimum competency student have doesnt achieved yet,Unsuitable measured content using format given in the item test, logical argument or item sentence is too complex and long.Discrimination strengthDiscrimination former refers to measurement of the extent of the ability of items of achievement test to distinguish between students high answers and students low answers based on criteria. This notion is supported by Arikunto (2006211), Discrimination Power is items ability to distinguish between students who are good and low capable. The proportion range of Discrimination Power Index is from -1,00 to +1,00. The higher of Discrimination Power Index, the more capable the item test distinguish between students who had already understood and hadnt already understood the content. An item test has negative Discrimination Power Index (As cited by Departemen Pendidikan Nasional, Dir ektorat Jendral Manajemen Pendidikan Dasar dan Menengah, Direktorat Pembinaan Sekolah Menengah Atas (2008) in Panduan Analisis Butir Soal,The prediction information about disability item test to distinguish lower and upper student group is as followInappropriate key answer of itemItem tests have two or more the correct answersUnclear of measured competencyThe distractor function doesnt work well,The content in question is too difficult, so many students are guessing.Distractors FunctionDistractor function is meant to know work or not the answer of item. Arikunto (2006220) stated a distractor function works well if it at leastselected 5% by test taker. This analysis is only used to analyze multiple choice item tests.Theoretical modelingIn order to answer the research questions of this study, English items test analysis of Tes Kendali Mutu of first grade school year 2011/2012 at SMKN 26 Jakarta is reviewed in terms of (1) the relevance of English items tests content to the school bas ed curriculum for level novice of Vocational High School (2) validity (3) difficulty level (4) discrimination power and (5) the distractor function. In relation to tests content validity in this study, qualitative analysis is used by analyzing items test in reference to the aspects of test validity. Measuring the validity of a test, the English items test of Tes Kendali Mutu must contain proper sample of relevant material in learning. The relevant material in learning can be found in the syllabus and curriculum for level novice of Vocational High School which is School-based Curriculum (KTSP). In order to answer whether English test of Tes Kendali Mutu has content validity, the writer needs Standar Kompetensi and Kompetensi Dasar in the school based curriculum to compare and match its items with the relevant syllabus and curriculum.Furthermore, in order to support in answering the research question of this study, quantitative was conducted by using classical items test analysis thro ugh multiple choice test involving Difficulty Level, Discrimination Power, Distractor Function, and Validity. They are also considered to fulfill requirement criteria of quality a good test.

No comments:

Post a Comment