Virtual Empirical Investigation: Concept Formation and Theory Justification

Dan Kalman

The big picture is important in mathematics. If you have the big picture or the broad insight about what is really going on, you can usually reconstruct details that may have been forgotten. Unfortunately, this is an aspect of the subject that is largely unobserved by students, especially in the first two years of the college curriculum. All too often, they approach mathematics entirely at the molecular level. That is, they try to learn every fact, technique, concept, method, result with equal emphasis, never seeing how these molecules combine to create larger structures. Little wonder, then, if they don't appreciate how a few key molecules can establish the framework of a much larger whole.

Although I know this, and keep it in mind when I prepare to teach my classes, there seems to be little impact on many of the students. For example, there is a clear, even compelling, picture that goes with Newton's method: Slide down the tangent line until you hit the x axis; go up to the curve; repeat. This is the big picture I have in mind when I plan instruction for this topic. Do my students learn this about Newton's method? Do they hang onto this one key picture as the important thing to remember? Alas, no.

This article describes an idea for helping students see the big picture. It involves computer activities that allow students to manipulate mathematical objects at an almost tactile level. At this point I have no data to show how effective the approach is. I can only report anecdotally that students seem to appreciate the activities, and that they feel right.

I will organize the presentation as follows. First, to give the flavor of the activities, one sample will be described in detail. That will be followed by a discussion of the pedagogical rationale for these activities. Next, a few more sample activities will be described. And finally, I will explain how I create the activities.

Newton's Method

The big picture for Newton's method, as described above, is dynamic and visual. The computer activity is supposed to highlight that big picture. Figure 1 shows the computer screen layout for the activity.

Figure 1: Newton's Method

The main feature is a window which displays the graph of a function. The expression which defines the function appears in a text box. Students can edit the expression to define any function they wish to consider. Another text box specifies the starting value of x. A push button triggers the execution of a single step of the Newton algorithm: in an animated display, the student sees a tangent line grow from the point on the curve down to the x axis, sees this point marked with a small square, sees a vertical line grow up to the corresponding point of the graph. At the same time, a textual display shows the numerical value of x and f(x) for this new point, as well as the index n, which counts the number of iterations as they are performed.

I provide the students with a set of instructions for using this activity, as well as a series of specific questions to investigate. For a specified function and starting point, they are asked to find how many iterations are required to find the root to four decimal place accuracy. Then to 6 decimal place accuracy. Then 8. I have the students change the initial value of x and report on how that affects the process. I give students an example with two roots, and ask them to determine which initial points lead to each root, and which initial points don't lead to any root. Finally, I give them a highly contrived example in which the starting value is very close to a point of period 6 (Figure 2).

Figure 2: Near a Point of Period 6

The iteration follows the 6 term cycle once, then settles down on one root. Changing the tenth decimal digit of the initial x produces nearly the identical iteration, but after once around the cycle this one eventually settles down to a different root.

Throughout the activity, student interaction with the computer consists mainly of clicking push buttons, and watching the results. There is a bit of typing when a new function is to be defined. It is significant that the interaction is completely orchestrated. Students are not required to give the computer any commands, and they do not have to use a specialized syntax to communicate with the program.

Pedagogical Rationale

A good part of the underlying rational for my use of these activities comes from a constructivist learning model. That does not imply, as it has been construed by some, that students must derive every fact from first principles, essentially rediscovering everything for themselves. But it does mean that true learning occurs only after active engagement with ideas. In particular, it is not sufficient to tell your students what you want them to understand.

It is ironic that more teachers do not understand this better. Most of us have had the experience many times of completely losing the thread of meaning while listening to a lecture or colloquium. At first we can understand the ideas as they are presented. But with each new definition we are less able to keep the important concepts clear. Before long, we are totally at sea. The carefully prepared explanations of the speaker cease to have any meaning, because they describe interconnections between mathematical constructs that we have not internalized.

Having had this experience, we should not be surprised that our carefully planned lectures are not faithfully transcribed into the minds of our students. As Von Neumann reportedly said in mathematics you don't understand things, you just get used to them. And how does that happen? You work with the ideas, considering special cases, working out examples, exploring connections with the familiar. For beginning college students, working with the ideas generally means doing homework problems. That is what imparts meaning to the ideas in a course, and that is often how students frame their understanding.

Now consider the student's experience of Newton's method. After some sort of presentation in class about the main idea, and possibly after reading a similar explanation in the book, the student spends some concentrated time on homework exercises. To what is their attention drawn by this experience? To the recursion formula, to the process of entering numbers in a calculator, perhaps to convergence of the iterates. I believe it is this experience that gives Newton's method meaning for most of the students, and the experience defines what that meaning will be. Students may remember the formula, they may remember an arithmetic algorithm and computing successive estimates of the root. But they are unlikely to retain that compelling picture of sliding down the tangent line.

The rationale for the Newton's Method computer activity is to have the students actively engaged with the idea I want them to remember. I am trying to give them an activity that draws their attention repeatedly to that image. Presumably, this experience will be most effective if the students can see dynamically, almost concretely, the animation of the sliding tangent line.

The Newton's Method activity illustrates a number of the principles I try to incorporate into all of my computer activities. Among the most important, in my view, is creating an illusion of tactile interaction. I do not want my students to be distracted by cryptic commands into which they must embed parameters. They should not need to translate questions into a highly abstract computer syntax. Instead, the ideal is for questions to lead directly to action, just as it does for physical manipulatives. Students should be able to conduct experiments to answer questions. With physical manipulatives, the experiments are performed by immediate physical actions: pick them up; arrange them in a line; compare their sizes, shapes, and colors. The act of carrying out each experiment is completely unconscious. The student does not think about how reach out and pick something up. She just does it, with the attention completely focussed on seeing what happens. Just so for my students. I hope that they will quickly understand how to click buttons, manipulate slider bars, change numerical values, and so on, so that they can naturally and thoughtlessly conduct experiments. That is what I mean by virtual empirical investigation in the title of the paper.

Concept Formation. The discussion so far has concerned the acquisition of meaning. The constructivist model says that meaning obtains from interconnections among many concepts, illuminated by a wealth of examples, viewpoints, and representations. A student learns by active engagement with all of these, and it is the experience of this interaction that creates meaning. That is what is meant by constructing knowledge. The motivation for the computer activities, then, is to provide experiences which I hope will contribute to the construction of knowledge. Further, I believe that the computer activities can provide qualitatively different experiences from listening to a lecture, reading a book, or working on homework problems. To the extent that the computer activities enrich the student's experience of a subject, I believe they can support construction of deeper understanding.

Theory Justification. While conceptual understanding is vitally important, it is not all that we are after. We want our students to know more than what is true - we want them to also understand why it is true. That is not quite correct. On some level, empirical investigation does convey insight about why something happens. But there is a somewhat different epistemological issue that is important here. Different academic disciplines are characterized (in part) by their distinct approaches to substantiating knowledge. Factual knowledge in physics is established by different means and with different standards of evidence than in mathematics, for example. For the core courses in the traditional undergraduate mathematics curriculum, we wish our students to come to some understanding of the standards of evidence in mathematics. We want them to understand what we call proof.

Here, the reliance on empirical investigation for concept development introduces a potential hazard. If students find the experiential evidence sufficiently compelling, they may see no need for proof. Worse, they may consider the idea of proof to be an unwelcome distraction, imposed apparently as a matter of pedantic excess. We are all familiar with student skepticism about proofs, particularly when the goal is to prove what students perceive as obvious. This attitude might be reinforced by an instructional approach that places too great an emphasis on learning by experiment.

By its nature, mathematics is an abstract subject. Developing intuitions about what is true is only the first half of the mathematical method. Equally important is the articulation of an appropriate abstract formulation, and the deductive proof within that formulation of the principles that we have discovered intuitively. Throughout the undergraduate curriculum, I hope to convey the importance of both aspects to my students.

I am concerned that by working so hard to create an illusion of tangibility, I may be distorting the understanding that my students ultimately obtain. Accordingly, in most of my activities, I try to include examples that highlight the limitations of the empirical approach. Thus, these experiences have two intended goals. First, by interacting with the computer activities, I hope students will acquire a strong conceptual understanding of mathematical ideas. Second, I try to arrange experiences that show why a theoretical formulation is needed. In this way, the computer activities are intended to support both concept formation and theory justification.

In the next section, I will present several more examples of computer activites I use with my classes. For each activity there are exercises for both concept formation and theory justification. Unfortunately, no verbal description of these activities can really convey what it is like to experience them. Both the activities and the software under which they operate are freely available over the internet. Details will be presented in the section on Creating Computer Activities.

Additional Examples

Usually, I have students work with these computer activities in class, for a time that can range from half an hour to the full class period. I provide students with a set of instructions that detail how each activity operates. The instructions also include an outline of experiments to perform and questions to answer. The following discussion will describe both aspects of each activity - how the activity operates, and what the students do with it.

Limits. I have developed several computer activities exploring various facets of the limit concept. Just one will be described here. It is intended to convey one particular insight about what lim_x ® a f(x) means. This insight can be stated as follows:

Examine the values of f(x) when x is near, but not equal to, a. Is there one obvious value suggested for f(a)? If so, that value is what we call the limit. If not, no limit exists.

For the purposes of the computer activity, graphical visualization is the means for deciding what the obvious missing value should be. The screen layout (Figure 3) features a graph window and text boxes for tabular output of function values.

Figure 3: Screen Layout for Limit Exploration

Students define a function by text input, the point a at which the limit is to be investigated, and a step size. The main interaction is to click a button, and see a graph of several equally spaced points to the left and right of a, as well as the printed numerical coordinates of the points. The points are joined with straight lines. As explained in the instructions, this is purely for visualization, and is not an accurate graph of the function between the plotted points. By changing the step size, students can examine the behavior of the function on finer and finer scales. They can also specify that the limit should be approached just from the right or just from the left. The graph window automatically sizes itself to the range of x and f(x) values displayed.

Since most functions defined by simple algebraic expressions are differentiable, on a sufficiently fine scale the points will appear to fall on a straight line. In this situation, there is an obvious interpolated value for f(a), and students easily identify it visually and numerically. The activity outline calls for students to estimate values of various limits. These are carefully selected to linearize on different scales. Students find that the step size required to estimate the limit reliably depends on the function. There are also several examples for which limits do not exist. The standard example f(x) = sin(1/x) provides one interesting case. Of course, students begin with no preconception of the graph of this function. They simply observe that on any scale, the computer activity fails to reveal any obvious interpolation point. This leads to a discussion of graphing by connecting the dots, and its limitations. It also motivates the standard discussion of the behavior of the graph.

Examples with jump discontinuities are also provided, so that students can observe distinct limits from the left and right. Here, I feel it is better to avoid functions defined by cases, or even examples like |x|/x. Students recognize these as different from normal functions, and tend to dismiss them as unrepresentative examples. A better alternative is something like f(x) = x/Ö{1-cos10x}. Near x = 0, the quadratic approximation of 1 - cosx is x²/2, so the function f behaves like a multiple of x/|x|. But students find it much more credible as the sort of function one might plausibly encounter.

For theory justification, students are asked to examine the behavior of f(x) = |x|(1.01-.01^10|x|)/x as x approaches 0. This function has been carefully contrived so that the apparent limiting behavior changes at different scales. A selection of graphs with the step size ranging from .1 to .000001 appears in Figure 4.

Figure 4: Graphs for f(x)

At first glance, the graph suggests that this function has a jump discontinuity at x = 0. Magnification by a factor of 10 or 100, however, suggests that the limit may exist, and equal 0. Still greater magnification reveals what appears to be a discontinuity after all. This example illustrates dramatically that empirical investigation can be misleading. After my students have looked at this example, I ask how fine a scale must be used to be sure of getting the right answer. They readily observe that no matter how fine the scale is, you cannot really be sure that you are observing the limiting behavior.

Derivatives. The next example concerns the concept of derivative. In this case, the highlighted big picture concept is:

Under extreme magnification at a point, a curve will either appear to straighten into a line, or not. If it does, the slope of that line is the derivative at the point; if it does not, there is no derivative at the point.

One of the primary goals of the computer activity is to engage students with this idea in a direct tangible way. They zoom-in on a point of the curve until the curve seems to be the straight line. Then they measure the slope of that line to determine the value of the derivative. The layout for this interaction is shown in Figure 5.

Figure 5: Derivative Activity

Note that the graph shows both a curve and a straight line. The curve is the graph of a function, centered at a point (a,f(a)). The line is shown for reference purposes. It is fixed to the point (a,f(a)), but is free to rotate about that point.

Students find the derivative of a function at a point as follows. First, they enter an expression for f(x), the value of a, and a magnification factor. By clicking a button, they repeatedly zoom-in on the graph of f, until the graph appears to become a straight line. Then they rotate the reference line until it coincides with the graph of the function. A slider-bar is used for this purpose. As the slider is dragged with the mouse, students see the reference line rotate. When the reference line and the graph are aligned, the slope can be read from the settings on the slider bar.

The activity outline directs students to compute the derivative at several different points for a particular function f. During this activity, their attention is repeatedly drawn to the intended big picture concept, because the process of finding the derivative at each point adheres exactly to the desired image of the derivative. There are also some related activities, such as organizing the derivative values in a table, and graphing the results, which the students perform by hand with paper and pencil. These provide background for the concept of a derivative function, and for the idea of visually estimating the graph of f¢(x) from the graph of f(x).

The activity outline also includes examples where the derivative fails to exist. For these, the student observes that the function fails to become a straight line under repeated magnification. In one example, students examine the function f(x) = .8x - Ö[(e^x-x-1)] (Figure 6).

Figure 6: Graph with a Cusp

Then they compare the behavior of g(x) = .8x - Ö[(e^x-x-.99999)]. In the first case, under repeated magnifications, the graph takes on the appearance of a fixed angle. For g(x), the first several magnifications reveal the same behavior as for f(x), but after several more magnifications, the curve does eventually straighten out.

The derivative activity also has an example for theory justification, involving the function f(x) = sin(x) + sin(50000*x)/50000. Here, a very low amplitude high frequency oscilation is superimposed on a sine curve. Zooming-in seems to reveal a straight line in the primary oscilation before the effect of the secondary oscilation appears. Several magnifications are shown in Figure 7.

Figure 7: Theory Justification for Derivatives

The students typically zoom-in until the graph is almost straight, then adjust the reference line to determine the slope. But then they are directed to zoom-in even further, and they find that the curve reveals a new level of variation, only to straighten out a second time. And this time, the slope is something quite different from what was first observed. In later discussion they readily repeat what they observed for limits: no matter how far one zooms in, there can be no certainty that the empirical examination has given the correct answer.

Riemann Integration. The final sample activity constructs Riemann sums to approximate definite integrals. In this example, there is no simple concise statement of the concept I want to reinforce for students. That is because the big picture conception of Riemann integration is very complex.

Consider the typical introduction of the subject. Students are told that the goal is to find the area under a curve; that this area will be approximated using a collection of rectangles; that the bases of the rectangles are defined by partitioning an interval on the x axis; that the height of each rectangle is found by evaluating the function at an x along the base. They are told about upper sums, lower sums, left-endpoint sums, right-endpoint sums, and mid-point sums. They are told that the true area must lie between any upper and lower sums. They are told something about the limiting process. This is a tremendous amount to take in, and it is frequently all presented in a single lecture or a single section of a text book.

Naturally, once you have mastered this complex of ideas, the entire development appears intuitive and straight-forward. But on a first exposure, the number of ideas to coordinate and organize is likely a bit overwhelming. The students I have observed do not, in general, emerge with anything like a coherent understanding of all of the various facets of Riemann integration outlined above.

The objective of the computer activity is to actively engage students in building various kinds of Riemann sums. By working with an interaction that is nearly tactile, students are free to think about the big picture of what they are doing, and why. In the course of a class period, my students work through a progression of three computer screens. The first introduces the idea of approximating the area under a part of a curve using a single rectangle. In the second, students create a series of rectangles. The third automatically produces left and right end point sums with equal subdivisions, and for a user specified number of terms. This is used to investigate the limiting process for defining the integral. In the interests of brevity, a detailed description will be presented for only the second of these interactions.

A portion of the screen layout appears in Figure 8. The interaction

Figure 8: Area Activity

proceeds as follows. First, the student selects a function. A predefined set of functions is included in the activity, and there is a push button for choosing among them. This button is repeatedly clicked until the graph of the desired selection appears in the graph window. Next, the student selects the end points for the integral by clicking with the mouse on the x axis in the graph window. Rectangles are defined one at a time. For each rectangle, the right side is defined by clicking on the x axis. The height of the rectangle is the value of f(x) where x is one of four options; the left or right end point of the interval; the midpoint; or a point selected by mouse click. The student selects one of these options by clicking a push button. At this point, there is an animated display in the graph window. A colored line is shown growing vertically from the x axis to the curve, and from that point horizontally to create the top of the rectangle. This rectangle then fills up with a new color. These displays are intended to dramatize the process of determining first the top of the rectangle, and then the area within the rectangle. When the rectangle is complete, its area is shown in a textual display, and added to a running total. The figure shows the result after completing two rectangles. When the final rectangle is complete, the student may obtain a value for the true area by clicking a button.

The activity outline directs students to construct a variety of rectangular approximations to the area under the graph using this setup. In completing these tasks, the students are repeatedly required to think about the underlying ideas of rectangles, how the width and height are determined, and how the resulting area relates to the area under the curve. Note that the students construct the rectangles in a virtually physical manipulation. They do not have to carry out the mechanics of computation, and they see the results of their actions essentially instantly. This contributes to the illusion of tangibility in dealing with areas and rectangles.

The Riemann sum computer activity does not include any theory justification exercises. For this topic, the big picture concept is quite complicated, and I am content if the students acquire a clear understanding of it.

Creating Computer Activities

The primary focus of this article is to present a pedagogical rationale for a certain type of instructional computer activity, and to describe several sample activities. I hope that other teachers will want to experiment with these activities; instructions for obtaining them appear below. I also hope that some teachers will want to develop their own instructional activities. Accordingly, this section discusses the design and implementation methodology that I use.

The activities described above were created using a software product called Mathwright. That is not the only possible approach to developing interactive computer activities. An alternative is to program in Java, for example, creating applets which are accessed via the internet (see [5]). Similar kinds of interactions also appear in commercial educational software, such as ODE Architect ([1]). However, Mathwright is the only development approach with which I have any significant familiarity, and it is the only one that I will consider here. In any case, it is beyond the scope of this article to survey possible implementation methods for interactive computer activities. I do want to make it clear that I find Mathwright an attractive and powerful tool for creating interactive activities. Moreover, learning to use Mathwright is easy (and quick). No other option of which I am aware offers the same combination of ease of use and expressive power.

Mathwright includes separate programs for creating and operating interactive computer activities. Activities are created with the author program. The composition style is highly intuitive. Screen components (graph windows, text windows, etc.) are selected from menus, then dragged to the desired location and sized using the mouse. Attributes are defined by filling out simple interactive forms. For example, in creating a graph window, the user can select menu items to change the graph color, coordinate system, appearance of axes and grid lines, and so on. A simple script can be attached to each screen component, specifying actions to carry out when the component is clicked with the mouse. The reader program allows users to open and operate activities developed using the author program. For a more detailed description of the Mathwright paradigm, see [2]. Note that Mathwright operates only under a Windows operating system, and is not available for Macintosh or Unix environments.

Mathwright was a featured software package for several years in the MAA's Interactive Mathematics Text Project (IMTP) [3]. Quite a number of Mathwright activities were developed by participants in the IMTP, as well as other Mathwright users, and are currently available over the internet from the Mathwright Library website (http://www.mathwright.com). An earlier version of the library, supported by an NSF grant, provided free access to the reader software, and to the entire collection of activities. It was reviewed in [4]. At the current library, access to the software and activities requires payment of a nominal membership fee.

Originally, Mathwright activities were saved as documents on the user's computer, and opened with the reader software, in much the same way that word processing or spread sheet software can be used to create and open the corresponding types of documents. There is now an additional option for hosting Mathwright activities on webpages. For this approach, the user must install a free software extension for the webbrowser, currently available only for use with Internet Explorer. Reference [6] provides a detailed description of the use of Mathwright to create webpages.

I maintain two webpages where visitors can obtain free access to the activities I have developed, as well as to the software necessary to use the activities. At http://www.dankalman.net/mwweb are several Mathwright activities available as either webpages or for download and use offline. This website also includes instructions for downloading and installing necessary software. Following the terminology of [6], the activities at this webpage are referred to as microworlds. The Magnify microworld includes both the activities for limits and for derivatives, and the Area microworld includes the Riemann integration activities described earlier.

The Newton's Method activity descussed at the start of the paper is currently not available in a microworld format. Instead, it operates under an earlier version of the Mathwright reader. This software, together with the Newton's Method activity (among others), is available at http://www.dankalman.net/mathwright. The webpage also includes the necessary download and installation instructions.

Conclusion

There are many different instructional uses of computers. They can provide structured drill problems with immediate feed back to students, as well as the closely allied activity of interactive testing. They support dramatic graphics and visualization for classroom demonstrations. Computers can take over the drudgery of computation or symbolic manipulation so that students can proceed quickly from questions to answers. And computers offer a variety of enhancements in the management of textual information, including hypertext linking and electronic searching. But all of these uses either duplicate or only slightly extend capabilities that teachers already have without computers.

In contrast, interactive computer activities of the sort described here are something completely different. Under their spell, students have the illusion of exploring mathematical ideas by sight and touch. Computer environments have the ability to simulate reality, and the teacher can dictate exactly what the fabric of that reality will be. What could be better suited to mathematics instruction?

I believe that these interactive simulated environments allow students to explore mathematical subjects in an almost tactile way. This is a new opportunity for students and teachers, and would not be possible without computers. To me, this is the really exciting arena for instructional computing.

References

[1]: Robert L. Borrelli Courtney S. Coleman, Modeling and Visualization in the Introductory ODE Course, in Michael J. Kallaher, Ed., Revolutions in Differential Equations. Mathematical Association of America, Washington, DC, 1999.
[2]: Angela Hare. Software Review: Mathwright Author and Player. College Mathematics Journal, 28:2 (1997) 140-144.
[3]: Interactive Mathematics Text Project. FOCUS, 13:1 (1993) p. 28.
[4]: Dan Kalman. Software Review: New Mathwright Library. College Mathematics Journal, 30:5 (1999) 398-405.
[5]: Frank Wattenberg, Bart Stewart, and Suzanne Alejandre. Lite Applets. Journal of Online Mathematics and its Applications (http://www.joma.org), 28 (2002).
[6]: James E. White. Introducing Mathwright Microworlds. Journal of Online Mathematics and its Applications (http://www.joma.org), 28 (2002).