Table of Contents previous section next section
EVALUATION: Standard 3 - Appropriate Assessment
Methods and Uses

Assessment methods and instruments should be selected on the basis of--

  • the type of information sought;
  • the use to which the information will be put;
  • the developmental level and maturity of the student.

The use of assessment data for purposes other than those intended is inappropriate.

Focus

The purpose of an assessment--to identify areas of difficulty for individual students, to gather data for instructional planning, to assign grades, or to evaluate a program--should dictate the kinds of questions asked, the methods employed, and the uses of the resulting information. When one type of measure is used in lieu of another, the information obtained is often invalid or useless. In addition, the methods used to gather information should be appropriate to the developmental level and maturity of the students. Table 3.1 outlines some common purposes of student evaluation. The purposes in the table overlap, and the questions are only examples.

Discussion

The assessment of student performance serves many purposes. For the student, assessment aids learning and measures mathematical knowledge and power. For the teacher, it provides information about how instruction should be modified and paced. For the administrator, it charts the effectiveness of a program. In addition, the general public expresses concern about academic achievement. Each of these groups asks different questions. Each needs different kinds of information. An assessment designed to answer one kind of question can misrepresent the answer to another. Although this caveat might seem obvious, the legitimate uses of an assessment often are not well understood or defined, as when, for example, standardized tests of general mathematical achievement are used for curriculum evaluation.

The information required by different audiences lies on a continuum from the most specific, such as measures of a curriculum objective, to indicators of general mathematics performance. Teachers who want to know how they can help Sandra or John understand fractions as parts of a region, for example, will learn best by questioning students about their perceptions of a specific concept. However, when the same teachers ask how well Sandra or John has understood the course material as a whole, they must cast a wider net; assessment methods limited to a single aspect of learning will not suffice. Students should have the opportunity to show how well they have integrated their knowledge by applying their learning in a larger context.

TABLE 3.1
PURPOSES AND METHODS OF ASSESSMENT
Purposes(examples of
questions asked)
For Whose
Use
Unit of
Assessment
Type of Assessment Assessment Methods
Diagnostic

What does this student understand about the concept or procedure?

What aspects of problem solving are causing difficulty?

What accounts for this student's unwillingness to attempt new problems or see the application of previously learned materials?

 

 

 

Individual teacher

Individual teacher

 

 

 

Individual student

Tasks that focus on a specific skill, type of procedure, concept, strategy, or a type of reasoning

Each student evaluated

Observation

Oral questions that ask students to explain their procedures

Focused written tasks

Directed test items

Instructional Feedback

What do students know about the material presented?

Can students apply their learning to new situations?

Do students understand the connections among ideas?

How shall I pace instruction?

Does the class need more intensive review of more challenging material?

 

 

 

 

 

Individual teacher

 

 

 

 

 

Class

Tasks that require an integration of know- ledge

Tasks that cover a range of skills, concepts, and procedures

Tasks that require the application of learning to new contexts

Problem solving and reasoning tasks

Tasks that vary the format and context in which the material is presented

Matrix sampling test situations

Written tests, including those that require differential methods for solutions to problems

Class presentations

Extended problem solving projects

Observation of class discussion

Take-home tests

Homework, journals

Group work and projects

Instructional Feedback

What do students know about the material presented?

 

Can students apply their learning to new situations?

 

Do students understand the connections among ideas?

 

How shall I pace instruction?

 

Does the class need more intensive review or more challenging material?

 

 

 

 

Individual student

Parents

School

 

 

 

 

 

Individual student

Tasks that demand the integration of material that was taught

Tasks that are intrinsically interesting and challenging to the student

Tasks that require the student to structure the material and generate solutions, in the context of the real world, as well as in math- ematics

Extended problem solving projects

 

Papers or written arguments that demand thoughtful inquiry about a mathematical topic

 

Written tests that present problems with a range of difficulty based on expectations for course

 

Oral presentations

Generalized mathematical achievement

How does the general mathematical capability of this student compare with others or with a national norm?

 

Parents

Teachers

Administrators

 

 

Individual student

Tasks organized in highly reliable tests designed for maximum discrim- ination among students

Standardized achievement tests

Program Evaluation

How effective is this instructional program in achieving our goals for mathematical learning?

 

 

 

Teachers

Administrators

Other decision makers

 

 

 

 

 

Class

School

Tasks that reflect the intent of the curriculum goals

 

Tasks that are aligned to the instruct- ional methods and content of the curriculum (see Standards 12 and 13)

 

Matrix sampling test situations

Student interviews

 

Performance tests

 

Criterion referenced tests

 

Observation of class discussions

 

Success of students who have completed the program

When assessments are used for grading or as summative indicators of achievement, a number of measures should be used (see Evaluation Standard 2). In addition to the more traditional types of written tests, students should be given tasks that are challenging and complex and that allow them to perform at their maximum level of ability. Not only should such tasks provide meaningful information to parents and school authorities, but they should be interesting and valuable experiences in their own right, giving the student a sense of accomplishment.

Criterion-referenced tests, often used for program evaluation, can cover a narrower range of content. Generally designed to measure the attainment of specific objectives, they are, in some sense, a collection of mini-tests that each focus on a relatively narrow area of achievement. When the objectives on these tests reflect the goals of the school or the program, the tests are a valid measure of effectiveness. However, such tests usually present material in only one format (written multiple choice), which limits what can be measured. Thus, other information is needed to confirm their results.

In contrast to assessments that yield useful curriculum-based information to the teacher and student, more general types of tests are relatively insensitive to individual curricula. The standardized achievement test is the most prominent example. Its purpose is to measure an individual student's relative position in a population. Consequently, this type of test must maximize individual differences among students while measuring the common elements of their instruction. As a result, it is less appropriate for measuring the effectiveness of any specific curriculum and is more likely to reflect a student's general achievement, background, and prior knowledge. Furthermore, because of their format, standardized norm-referenced tests have difficulty measuring the generation of ideas, the formulation of problems, and the flexibility to deal with mathematical problems that are not well structured (i.e., problems similar to those encountered in everyday life). For these reasons, they are inappropriate as the only measure of whether teaching and curricula reflect the spirit of the Standards.

Whatever the purpose of the assessment, the methods used should consider the characteristics of the students themselves. Students' mathematical and cognitive development is a gradual, cumulative process built on prior experience and understandings. This consideration is particularly important in the early years, when basic skills and concepts are initially encountered. For example, recording responses on op-scan sheets imposes an additional irrelevant demand on young children. The very results of written tests in the early grades can be suspect: when children use their developing reading and writing skills to process the mathematical content of questions, the results might be more reflective of achievement in these areas than of their understanding of mathematics. At this stage, when children's understanding is often closely tied to the use of physical materials, assessment tasks that allow them to use such materials are better indicators of learning.

Students differ in their perceptions and thinking styles. An assessment method that stresses only one kind of task or mode of response does not give an accurate indication of performance, nor does it allow students to show their individual capabilities. For example, a timed multiple-choice test that rewards the speedy recognition of a correct option can hamper the more thoughtful, reflective student, whereas unstructured problems can be difficult for students who have had little experience in exploring or generating ideas. An exclusive reliance on a single type of assessment can frustrate students, diminish their self-confidence, and make them feel anxious about, or antagonistic toward, mathematics.

Finally, problems that reflect the potential of some students can be enigmas to others, disqualifying them from making any kind of meaningful response. Prior knowledge, experience, and the opportunity to learn are important considerations in interpreting test results. A challenging task in problem solving for some students is an exercise in recall for others. In part, the solution to this dilemma lies in the types of tasks used in the assessment. The complex, multifaceted tasks advocated in these Evaluation Standards can be structured to allow students to answer at different levels of sophistication. If students are to perform at their maximum levels of ability, the measures by which they are judged should give them the opportunity and the encouragement to do so.

STUDENT ASSESSMENT

If a conversation is to be meaningful to both of its participants, each must listen to the other; in the absence of such mutual attention, the conversation becomes an exercise in futility. Similarly, the act of teaching should be founded on dialogues between teachers and students, each responding to the other on the basis of what has been said or done. Assessment refers to the process of trying to understand what meanings students assign to the ideas being covered in these dialogues; as such, it is an integral element of effective teaching. Periodic assessment provides the teacher with a basis for deciding what questions should be asked and what examples and illustrations should be used; ultimately, it offers a foundation for any meaningful dialogue between teacher and student.

The student-assessment standards describe what is to be observed and measured in the process of understanding what mathematics students know. Teachers drawing meaning from their interactions with students is central to this process. At this level the most important decisions about student learning are made. The general-assessment standards, offer principles for the student-assessment standards, but the latter are paramount for helping students acquire the knowledge of mathematics as described in the Curriculum Standards. Thus, the student-assessment standards are more specific and relevant to teachers.

Assessment must be more than testing; it must be a continuous, dynamic, and often informal process. It manifests itself in teachers' statements, for example, "Johnny seems to have difficulty in graphing equations" or "Wilma demonstrated a lot of insight in solving those addition and subtraction problems." Assessment is more than the establishment of definitive conclusions. Assessment is cyclic in nature, a process of observation, conjecture, and constant reformulation of judgments about students' understanding. Assessment should produce judgments that are evolutionary in nature, regardless of whether those judgments are based on classroom discourse or on the more formal aspects of testing that characterize nearly every instructional program.

Testing to assign grades is one of the most common forms of evaluation. But assessment is a much broader and basic task, one designed to determine what students know and how they think about mathematics. Assessment should produce a "biography" of students' learning, a basis for improving the quality of instruction. Indeed, assessment has no raison d'être unless it improves instruction.

Teaching is effective to the degree that it takes student thinking into consideration. Without ongoing communication, a teacher's instructional strategies can only randomly enhance learning. Consider the student who answers the following item incorrectly: See figure 3.0

Illustration

Fig. 3.0

If the fractions represented by points C and D on this number line are multiplied, what point will best represent the product?

If the student answers "point A," is he or she confusing multiplication with subtraction? Similarly, if points E or F are selected, is multiplication being confused with addition? Or is the answer a manifestation of the general misbegotten rule "multiplication makes bigger"? Did the student assign values to C and D but incorrectly multiply the fractions? What representation or model of multiplication is the student using? Would the student identify the same point if presented with the product D x C instead of C x D? How would the student respond if specific fractions were assigned to C and D? Only through explicit and careful assessment of how a student does mathematics can instruction be tailored to individual needs, thereby enhancing a student's chances for success.

These seven student-assessment standards focus on assessing students' understanding of, and disposition toward, mathematics. Each presents a list of mathematical outcomes derived from the standard's central focus. These standards and their associated tasks can serve as heuristics for designing assessment systems that include instruments, procedures for aggregating data, and ways of record keeping that are comprehensive and that tap more than superficial understanding. It is reasonable--indeed, expected--that every lesson cover more than one aspect and that over a series of lessons most, if not all, aspects are addressed in both instruction and assessment. For example, whereas instruction in any particular procedure may not encompass all seven aspects of procedural knowledge, each of the seven aspects will be represented in both instruction and assessment over a series of lessons.

 
Back to top
next sectionnext section
Home | Table of Contents | Purchase | Resources | NCTM Home | Illuminations Website
Copyright © 1989 by the National Council of Teachers of Mathematics.