Teaching Quantitative Skills: Data Analysis

Managing labs has got to be one of the most difficult things we do as biology teachers.  There is so much to keep in mind: safety, time, cost, level appropriateness, course sequence, preparation time, and did I mention time?  It’s no wonder that we are tempted to make sure that the lab “works” and that the students will get good data.  When I first went off the deep end and starting treating my classes like a research lab–meaning almost every lab had an element of individual based inquiry, I’ve got to say I was just pretty content if I could get students to the point that they asked their own question, designed an effective experimental procedure and collected some reasonable data.  It took a lot of effort to get just that far and to honest, I didn’t put enough emphasis on good data analysis and scientific argumentation as much as I should have.  At least that is the 20-20 hind-sight version that I see now.  Of course, that’s what this series is all about—how to incorporate and develop data analysis skills in our classes.

Remember, this lab has a number of features that make it unique:  safe enough to do as homework (which saves time), low cost, and more possible content and quantitative skills to explore than anyone has time for.  For me, its like saddling up to an all you can eat dessert bar.  No doubt, I’m going to “overeat” but since this lab happens early and it is so unique, I think I can get away with asking the students to work out of their comfort zone.  1. because they skills will be used again for other labs and 2. because I need them to get comfortable with learning from mistakes along with the requisite revisions that come from addressing those mistakes.
Depending on how much time we had earlier to go over ideas for handling the data the data the students bring back from their “homework” is all over the map.  Their graphs of their data are predictably all but useless to effectively convey a message.  But their data and their data presentations provide us a starting point, a beginning, where, as a class we can discuss, dissect, decide, and work out strategies on how to deal with data, how to find meaning in the data, and how to communicate that meaning with others.
In the past, the students would record their results and graph their work in their laboratory notebooks.  Later, I’d let them do their work in Excel or some other spreadsheet.  The data tables and graphs were all over the map.  Usually about the best the students would come up with looked something like this.
The data (although not usually, this precise) and usually not with the actual H2O2 concentrations:

Sometimes they would have a row of “average time” or mean time but I don’t think any student has ever had rows of standard deviation and for sure no one ever calculated standard error but getting them to this point is one of my goals at this point.  Of course, that is going to be one of my goals at this point.  As teachers we work so much with aggregated data (in the form of grades and average grades) that we often don’t consider that for many it doesn’t make any sense.  Turns out to be an important way of thinking that is missing more than we realize.  In fact in the book, Seven Pillars of Statistical Wisdom, Stephen M. Stigler devotes an entire chapter on aggregation and its importance in the history of mathematical statistics.  For most of my career, I was only vaguely familiar with this issue.  Now I’d be very careful to bring this out in discussion with a number of questions.  What does the mean provide for us that the individual data points do not?  Why does the data “move around” so much?
It doesn’t take much to make sure they calculate the mean for their data.
This brings up another point.  Not only do some folks fail to see the advantage of aggregating data some feel that the variation we see can be eliminated with more precise methods and measurement–that there is some true point that we are trying to determine.  The fact is the parameter we are trying to estimate or measure is the mean of the population distribution.  In other words there is a distribution that we are trying to determine and we will always be measuring that distribution of possibilities.  This idea was one of the big outcomes of the development of statistics in the early 1900’s and can be credited to Karl Pearson.  Today, in science, the measurement and such assume these distributions–even when measuring some physical constant like the acceleration of gravity.  That wasn’t the case in the 1800’s and many folks today think that we are measuring some precise point when we collect our data.  Again, I wasn’t smart enough to know this back when I started teaching this lab and honestly it is an idea that I assumed my students automatically assimilated but I was wrong.  Today, I’d take time to discuss this.
Which brings up yet another point about the “raw” data displayed in the table.  Take a look at disk 3, substrate concentration 0.75%.  Note that it is way off compared to the others.  Now this is a point to discuss.  The statement that it is “way off” implies a quantitative relationship.  How do I decide that?  What do I do about that point?  Do I keep it?  Do I ignore it?  Throw it away?  Turns out that I missed the stop button on the stop watch a couple of times when I was recording the data.  (Having a lab partner probably would have led to more precise times).  I think I can justify removing this piece of data but ethically, I would have to report that I did and provide the rationale.  Perhaps in an appendix.  Interestingly, a similar discussion with a particularly high-strung colleague resulted caused him so much aggravation that the discussion almost got physical.  He was passionate that you never, ever, ever discard data and he didn’t appreciate the nuances of reporting improperly collected data.  Might be a conversation for you’ll want to have in your class.
The best student graphs from this data would look like this.  I didn’t often get means but I liked it when I did.  But note that the horizontal axis is log scaled.  Students would often bring this type of graph to me.  Of course, 99% of the them didn’t know they had logged the horizontal axis, they were only plotting the concentrations of H2O2 equally spaced.  I would get them to think about the proper spacing by asking them if the difference between 50% and 25% was the same difference as between 6.25% and 3.125%.  That usually took care of things.  ( of course there were times, later in the semester that we explored log plots but not for this lab. )

Note also, that this hypothetical student added a “best fit” line.  Nice fit but does it fit the trend in the actual data?  Is there actually a curve?

addendum:  When looking at data like this it is important to consider what the trend line is saying.  If this reaction is a straight line then if I have 0% H2O2 then does it really make sense that the disk will rise in about 35 seconds?  It shouldn’t since we demonstrated that disks don’t rise in 0% H2O2 earlier.  Likewise at the other end of the trend line.  Does it make sense that a concentration of H2O2 greater than 60% would create a negative time to rise–that we’d go back in time?  It is this type of questioning that will help the students determine that this is a curve and not a straight line—which is important later.

This is where referring back to the models covered earlier can really pay off.  What kind of curve would you expect?  When we drop a disk in the H2O2 and time how long it rises are we measuring how long the reaction takes place or are we measuring a small part of the overall reaction?  At this point it would be good to consider what is going on.  The reaction is continuing long after the disk has risen as evidenced by all the bubbles that have accumulated in this image.   So what is the time of disk rise measuring?  Let’s return to that in a bit but for now let’s look at some more student work.

Often, I’d get something like this with the horizontal axis—the explanatory variable—the independent variable scaled in reverse order.  This happened a lot more when I started letting them used spreadsheets on the first go around.

Spreadsheet use without good guidance is usually a disaster.  After I started letting them use spreadsheets I ended up with stuff that looked like this:

or this

It was simply too easy to graph everything–just in case it was all important.  I’ve got to say this really caught be off guard the first time I saw it.  I actually thought the students were just being lazy, not calculating the means, not plotting means, etc.   But I think I was mostly wrong about that.  I now realize many of them actually thought this was better because everything is recorded.   I have this same problem today with my college students.  To address it I ask questions that try and get to what “message” are we trying to convey with our graph.  What is the simplest graphic that can convey the message?  What can enhance that message?  What is my target audience?
The best spreadsheet plots would usually look something like this where they at least plotted means and kind of labeled the axis.  But they were almost always bar graphs.  Note the the bar graphs graph “categories” on the horizontal axis so they are equally spaced.  This is the point that I usually bring out to start a question about the appropriateness of different graph methods.  Eventually with questions we move to the idea of the scatter plot and bivariate plots.  BTW, this should be much easier over the next few years since working with bivariate data is a big emphasis in the Common Core math standards.

But my goal in the past was to get the students to consider more than just the means but also to somehow convey the variation in their data–without plotting every point as a bar.  To capture that variability, I would suggest they use a box plot–something we covered earlier in the semester with a drops on a penny lab.  I hoped to get something like this and I usually would, but it would be drawn by hand.

The nice thing about the box plot was that it captured the range and variability in the data and provided them with an opportunity to display that variation.  With a plot like this they could then argue, with authority that each of the dilutions take a different amount of time to rise.  With a plot like this you can plainly see that there is really little or no overlap of data between the treatments and you can also see a trend.  Something very important to the story we hope to tell with the graph.  My students really liked box plots for some reason.  I’m not really sure why but I’d get box plots for data they weren’t appropriate for.
Today, I’m not sure how much I’d promote box plots but instead probably use another technique I used to promote—ironically, based on what I discussed above—plot every point and the mean.  But do so in a way that provides a clear message of the mean and the variation along with the trend.  Here’s what that might look like.

It is a properly scaled scatterplot (bivariate plot) that demonstrates how the response variable (time to rise) varies according to the explanatory variable (H2O2  concentration).  Plotting is not as easy as the bar graph examples above but it might be worth it.  There are a number of ways to do this but one of the most straight forward is to change the data table itself to make it easier to plot your bivariate data.  I’ve done that here.  One column is the explanatory/independent variable, H2O2  concentration.  The other two columns record the response or dependent variable, the time for a disk to rise.  One of the other columns is the mean time to rise and the other is the time for the individual disk to rise.  BTW, this way of organizing your data table is one of the modifications you often need to do in order to enter your data into some statistical software packages.

With the data table like this you can highlight the data table and select scatter plot under your chart options.

At this point, I’d often throw a major curve ball towards my students with a question like, “What’s up with time being the dependent variable?”  Of course, much of their previous instruction on graphing, in an attempt to be too helpful suggested that time always goes on the x-axis.  Obviously, not so in this case but it does lead us to some other considerations in a bit.
For most years this is where we would stop with the data analysis.  We’ve got means, we’ve represented the variability in the data, we have a trend, we have quantitative information to support our scientific arguments.
But now, I want more.  I think we should always be moving the bar in our classes.  To that end, I’d be sure that the students included the descriptive statistic of the standard deviation of the sample along with the standard error of the mean and to use standard error to estimate a 95% confidence interval.   That would also entail a bit of discussion on how to interpret confidence intervals.  If I had already introduced SEM and used it earlier to help establish sample sizes then having the students calculate them here and apply them on their graphs would be a forgone conclusion.
But what my real goal, today would be to get to the point where we could compare our data and understanding about how enzymes work with the work done in the field–enzyme kinetics.  Let’s get back to that problem of what is going on with the rising disk—what is it that we are really measuring if the reaction between the catalase and the substrate continues until the substrate is consumed?  It should be obvious that for the higher levels of concentration we are not measuring how long the reaction takes place but we are measuring how fast the disk accumulates the oxygen product.  Thinking about the model it is not too difficult to generate questions that lead students to the idea of rate:  something per something.  It is really the rate of the reaction we are interested in and it varies over time.   What we are indirectly measuring with the disk rise is the initial rate of the enzyme/substrate reaction.  We can arrive at a rate by taking the inverse or reciprocal of the time to rise.  That would give us a float per second for a unit.  If we knew how much oxygen it takes to float a disk we could convert our data into oxygen produced per second.
So converting the data table would create this new table.

Graphing the means and the data points creates this graph.

Graphing the means with approximately 95% error bars creates this graph.

Woooooooweeeeee, that is so cool.  And it looks just like a Michelis-Menten plot.

By Thomas Shafee (Own work) [CC BY 4.0 (http://creativecommons.org/licenses/by/4.0)], via Wikimedia Commons
Creating this plot–as long as the students can follow the logic of how we get here opens up an entirely new area for investigation about enzymes and how they work.  Note that we now have some new parameters:  Vmax and Km that help to define this curve.  Hmmmm.  What is this curve and do my points fit it?  How well do the data points fit this curve.  Can this curve, these parameters help us to compare enzymes?  Here we return to the idea of a model–in this case a mathematical model which I’ll cover in the next installment.

Establishing an Experimental Procedure to Guide the Home Investigation

Moving from the bulk reaction of yeast (catalase) and H2O2 to a procedure that can produce reasonable, reliable and precise data without just telling them,  “This is the technique that we will use”, can be tricky.  But it is a discussion full of quantitative considerations if that procedure is going to generate quantitative data that can support a claim.

At the end of the day, my overall goal is that every student will have an understanding and experience with a  defined protocol in their individual lab notebooks that can serve as their reference when they go home and collect their data.   I could be really helpful and just give them a well-structured set of laboratory instructions which would assure that most of the students who follow directions closely will succeed in getting the expected results.  Ensuring that my lab worked.  Of course, I’d have to hope that they would somehow, subconsciously pick up on the kind of thought that had to go into the organization of the tables, the presentation of the graphs, the preparation of the materials, etc.  My students never seemed to pick up that kind of thing, though by just following instructions.  That insight seems to come with wrestling with the challenges.  And since their thinking skills are more of a priority to me, I quit providing lab instructions very early in my career.  It is a lot more messy and you’ll be amazed at how many ways a student can go down the wrong path but I found that trusting the students to figure things out, works–they get better and better at it which makes the class more fun for me and for them.  For me, this lab fell pretty early in the year and for that reason it was a bit messier than it might have been had we worked on it later in the year.  It is important to note that I don’t just “turn the students loose” to go design whatever they can conjure up.  That is a recipe for disaster in so many ways but most importantly it typically leads to all sorts of negative student experiences.  The goal is to keep the challenges in front of the students finally tuned to their developing skills–to keep the students, as best we can, in the “zone”or perhaps better defined as:  Mihály Csíkszentmihályi’s FLOW.

Some of my learning goal targets that I keep in mind to guide my questions during discussing include:  1.  Introducing the floating disk technique but making sure the students understand how it is working.  2.  How do we explore variables systematically.  (serial dilutions) 3 What is this replicability thing?,  4. Emphasizing the importance of exploratory work to help establish data that can inform design. 5. How big of sample do we need? What factors into determining sample size?  6. Identify and contrast systematic and random error.

With these thoughts guiding my questions we launch into a discussion about the mess I created earlier.

With practice over the years it is easy to have barrage of questions ready to go.   Typically, I reframe/choose my next question based on student responses.  In that way, we are all following along on the same reasoning path–or at least as much as 20+ individual agents can follow the same path.

What did we mix to create the mess?  What did we get out?  How is this related to the models we explored?  How could we quantify what is going on?   What are we going to try and figure out?  What can we control?  What do we need to know?  What should we measure? How should we systematically measure it?   How can we be sure to all generate data/information that can inform our exploration?  How can I capture the products produced?  How do I measure the products over time?  What could/should I use for controls?  What should we quantify if we want to make a claim?  This last question can be particularly productive if out goal is to collaboratively develop an experimental protocol.  I never know exactly where we will go but with the guiding questions in my mind and with practice on my part it doesn’t usually take too long before we get to a starting/exploratory protocol that we can test in class.

At some key point in the discussion (you’ll know when) I demonstrate the floating disk technique itself along with some qualifying statements/comments like:  “Let’s reduce the amount of yeast/catalase but try and keep it constant.  One way might be to collect a sample on a piece of filter paper like this.”  You can guess the next line:  “Now let’s see if this will generate some bubbles that we can count or observe.”  At that point when we drop the disk in the H2O2 it sparks questions in their my minds when the disk floats.  Of course this prompts me to ask more questions.  These questions are now more specific to developing the protocol:  What do you think would happen if we dropped the yeast disk into plain water? (control) What would happen if we dropped a paper disk without yeast into H2O2 ?  (control) If I dropped another disk into the H2O2 will it take the same amount of time to rise?  If not, how could I capture the variation? Why is the disk rising?  How many disks can I drop in the H2O2 before it affects the time to rise?  (why I used the well plate and a single disk).  At this point I may take time to have them time a number of disks dropped into the same substrate dilution to get some preliminary data to work with.
If I keep the yeast concentration constant how can I systematically vary the H2O2 solution?  This was my main objective in the past because I used the lab to introduce serial dilutions and how to make them–skills that came in handy later when we did our microbiology labs.  At this point we could work through a serial dilution without a formal measurement device.  Since, my goal was to do most of the lab work at home, we adapted by doing our dilutions with a “measure”–which was a sharpie mark a little less than half-way up one of the plastic cups.  1 measure of water and 1 measure of 3% H2O2  would equal a 1.5% solution of H2O2  and a 50% dilution.  That solution could then serve to produce the 25% dilution and so on.  If this isn’t clear, let me know and I can put up a small video of the process if that will help.
And a question that I would ask today but didn’t in the past:  Is the time to rise the same as the rate of rise?  How can I convert time to a rate?  Today, I’d consider this one of my primary objectives for this lab.  Like I said earlier my primary goals in the past were to get the students comfortable with serial dilutions, experimental design and data presentation.  But from a standpoint of content and lab integration, I think I’d focus more on the properties of enzymes now.  Explicitly exploring rate of reaction is a key quantitative question to work on because it challenges a common quantitative misconception (confusing rates and quantities) and it also creates a situation where we can address the data in a form that is similar to standard laboratory work with enzyme kinetics.
Other questions come from students as we work on a protocol—questions about how to drop the disk, how do I keep the yeast constant?  do I have to stir?  when to time the float?, how deep should the solution be?
And:  How many disks should I drop to be confident that I have measured the rate of rise?  In the past, I had my students collect data on 10 disks of yeast per substrate concentration because I used this lab to introduce box plots.  The choice was somewhat arbitrary but you need a sample of 10 or more if the box plot is going to provide relevant information.  For example, a sample size of 4, split into 4 quartiles isn’t going to tell me much.  In today’s AP Bio world I might use this lab as an opportunity to explore another way to estimate an appropriate sample size–using standard error.  Here’s how that works.
Pre-Determining Sample Size:
I’m pretty upset with myself that I didn’t teach this in the first half of my career for many reasons but the most important is that I think students need to make that link that helps them to realize that quantitative methods provide strong support for their claims.  One question I never got around to helping my high schoolers figure out was how to justify their sample size.  I kind of let it slide with general statements like:  “Well, three is not enough.”  “Let’s do at least 10.”  and so on.  Here’s how the discussion would go today.
First, during the exploratory work we’d collect some data from an “unknown” substrate solution and an unknown yeast solution.  Here’s the data.

Looks pretty consistent but there is almost 2 seconds difference in the time to rise between the slowest and the fastest disk.  Let’s see what happens if we dilute the substrate by 50% but keep the yeast concentration on the disks the same.

Now, that is interesting.  The time to rise in the diluted substrate definitely seems to take longer.  Just eye-balling it it looks like a difference of about 6 seconds–more than 50% longer.  Still there seems to be about 2 seconds of variability in the diluted substrate results as well.   How can we capture all this in a couple of numbers?
Descriptive stats to the rescue.
The means can help us by using a single number to represent all of the data collected under one condition and the standard deviation (of the sample) can help us describe the amount of variation in the sample.

For many, this would be enough to consider.  The differences between these two samples of 8 is more than a standard deviation–in fact more than 3 standard deviations.  They are really quite different results.  A sample size of 8 seems to an easy sample to collect but what if we wanted to collect smaller samples because our fingers cramp up working the stop watch so many times?  Could we use a smaller sample size and still collect data that will support our claims that these are different?  Let’s see how we might determine that.
First let’s agree on a level of precision that we think we will need.  To do that let’s take a look at the differences in the means.  The difference is almost 6 seconds.  Now, each time I do this experiment under the same conditions I will likely get slightly different means.  How confident am I that my sample mean is close to the actual population mean?  Means are a point estimate but I want to put an interval estimate around that point.  Let’s say that if I can establish an interval of the mean plus or minus 0.5 seconds then I’ll feel pretty confident that my experiment has captured the true population.  How about 95% confident?   To be about 95% confident in our point estimate of the mean in seconds with an interval estimate of plus or minus 0.5 seconds we need to work with the standard error of the mean (SEM).  Bear with me while I do the algebra and violate my principle of being less helpful.  😉
Remember that the formula for SEM is:

I’ve used the approximately equal to because we can only estimate with the standard deviation of the sample.  The actual SEM would require the true population standard deviation.  Our exploratory data has provided us with an estimate of the standard deviation.  With this equation we can solve for n to try and figure a different size of a sample size—a smaller one that could still provide us with confidence.
You may also remember that 2 x SEM is approximately equal to a 95% CI.

Let’s combine these two equations and since, earlier we decided that plus or minus 0.5 seconds was probably enough precision we can just substitute that for the 95% CI.

Substitue 0.66 for the stdev.s that is estimated from our exploratory data:
Divide both sides by 2.

Multiply both sides by the square root of n.

Divide both sides by 0.25 seconds.

We are getting close, now.  Square both sides and you end up with the sample size you’ll need to assure that you have a 95% confidence interval that is plus or minus 0.5 seconds around the mean of your sample.

Ah, finally.  Looks like a sample size of 7 will assure that the 95% CI will fit between plus or minus 0.5 seconds around the mean.  Of course if we wanted a 99% CI we could use 3 x SEM in the work.  Or we could define a more precise CI interval of say 0.25 seconds around the mean.   It is up to you.  But with this type of work, you can make a strong argument as to why you chose the sample size you chose.
Their lab notebooks, at this point will have drawings and instructions in their own words on how to do a serial dilution, sample data, procedures, and background information (and perhaps some model data).   I’ll send them home with my question to work first with the intent of them repeating the homework at home on a different question, later the next week after they have worked to develop their skills. The question I ask them to investigate is:  How is the rate of the enzyme reaction affected by the concentration of the substrate?  They can work in groups, with their family, or by themselves but I want everyone to have a lab notebook entry of the methods, the questions, the design and the data they have collected along with graphs of the data.  I’m not explicit about what that should look like at this point.  I don’t want to be too helpful.  I actually want mistakes so we can address them.  If I’m too helpful at this point and tell them to make a scatterplot of just the means of the time to rise versus the substrate concentration then many will be will not know how to work in a novel situation in the future.
The mistakes that will no doubt appear provide an important starting point for the discussion on analysis.  That will have to wait for the next installment….

Teaching Quantitative Skills in a Lab Context: Getting Started in the Classroom

Some background on my teaching approach (which you may not agree with):

A few years ago a young math teacher, Dan Meyers had several videos that went viral about math instruction.  Be sure to google his work but also check out the critique of his work.  Part of the his message was that we (curricula, teachers, books, etc.) are “too helpful” when we structure our lessons and instruction.  By that he meant that instead of giving students practice with formulating problems and working through unique solutions we have reduced math instruction to a series of “paint by number” steps to be memorized.  Meyers was not the first to make these claims and not the last.  For example another noted math educator, Phil Daro has a series of videos where the main idea is “against answer getting”.  In these videos he compares Japanese math instruction to U.S. instruction and notes that in Japan math instructors ask the question:  “How can I use this problem to teach this math concept?” vs in the U.S:. “How can I get my students to get the right answer to this problem?”  It’s not that the answers aren’t important but if correct answers are the main emphasis of instruction then becomes too easy for the entire system education to devolve into trivial answer getting.  The hard work of critical thinking, working through problems, getting comfortable with false starts, revision, metacognition and learning from mistakes–all qualities that education aspires to gets lost in the extreme focus on the end product.  Moreover, the answer getting approach contributes to students developing a fixed mindset about their own abilities that are very likely false.  Carol Dweck and Jo Boaler’s work in this area provides a number of ideas and approaches to help teachers avoid fixed mindsets and help move students along a learning progression that leads to effective problem solvers.  Part of Boaler’s work at successfully moving students from fixed to growth mindsets in math involves rich problems that have an easy, accessible entry point that opens a door to a very rich, open and challenging environment with many paths to explore.  The floating disk catalase assay fits this description to a “T” in my mind.
BTW,  even though I have participated in a number of curriculum development projects, standards writing and curriculum framework development, I personally seldom pay much explicit attention to standards, science practices frameworks, or objectives when I do my “planning”.  Nor do I ever develop formal learning objectives when I “prepare” lessons.  Like rubrics I tend to look at objectives and frameworks as too confining.  More importantly, I don’t think I have every taught “a lesson” that didn’t take the students beyond the typical standard or learning objective.  Since I kind of live and breath biology education, I don’t want to be boxed in, I want to explore what is possible.  I have a general idea of where we are trying to go in class but I don’t make it explicit.  I don’t want my students to think they have arrived at their destination (learning goal), rather I want them to value the journey and keep on keeping on the path.  I’m not advocating you do the same,  I’m only explaining why you won’t see any specific references here to specific learning goals or science practices.  What follows is a weird blend of what I have done in the classroom and how I would approach this material, today.  I’ve been out of the high school classroom for more than 10 years and I’ve got to say that all these new resources certainly make me wish I was back in the high school classroom.
With that bit of background as justification you’ll see that in the posts that follow I will be promoting being less helpful and trusting my students to be able to come up with reasonable problems and solutions to those problems.  To do this well, requires skill on the part of the teacher to guide student thinking through questions–Socratic questions.  Planning for the instruction requires explicitly thinking about the instructional goals and the types of questions and scenarios that can get us to those goals.  Like the student quantitative skills we are targeting our own skill in questioning will get better and better as we practice it and reflect on it.  By the way since we are talking about skills it is important to remember that skills are improved through practice and therefore our instruction should offer the chance to practice and revisit skills.
Getting Started:
I typically use labs to introduce material so that students have some level of experience with physical phenomena that can serve as a foundation for building conceptual knowledge.  But I’ve got to get their attention and hopefully spark their interest.  I’ve explored many different enzyme systems in the classroom.  For instance, in the “old days” my students did all kinds of things with salivary amylase and starch.  This system had the pedagogical hook of being known as the “spit lab”.  They loved to hate spitting into test tubes to collect their amylase.  High interest.  For catalase I call on their experience with Hydrogen peroxide since most of my students have a bottle back at home and most are familiar with it.
Before going any further, I remind them that they will need to start recording any observations, questions (real important)  and thoughts in their lab notebook.  In the interest of being “less helpful” for more than 25 years I did not provide my students with lab write-ups or worksheets.  They had to organize their own investigations based on demo’s and discussions in class.  I made sure to make their lab notebook indispensable to their individual success in the class by making later assignments that required the information they should have entered into their lab notebooks–usually in the form of laboratory practicals as substitutes for final exams.
I bring out a bottle of Hydrogen peroxide and begin a discussion.
My part of the discussion involves questions and follow-up questions with these targets in mind:  1.  to stimulate interest.  2.  to recall why they use H2O2.  3. to realize that H2O2 breaks down on its own (by asking questions about old, “use-up” bottles in the medicine cabinet and why is the bottle brown?),  4. that bubbles are a sign that the H2O2 is working (killing “germs”).  (the connection to the bubbles needs to be corrected in a bit)

It is at this point I bring out a plastic cup about half full of a yeast solution.  (I almost always use plastic in my labs to minimize when we need goggles)  I mix up a package of bakers yeast in about 250 ml of water before class so that it well suspended.  I pour out about 1/2 cup of H2O2 and say “Let’s see if we can get some bubbles”

At this point I have them.  Because there are lots of bubbles….

Way more than they expect.

When it starts to overflow, that is when I pull out my best Chevy Chase imitation and start bumbling around trying to keep the mess at bay but it is too late.

They are hooked now, at least long enough to provide a quick bit of background information.  At this point we describe the decomposition reaction and quickly balance the equation.  And then, using questions again, start to probe what might be going on.  The target this time is that the idea that the reaction has been greatly speeded up.  Speed implies rate.  This is important.  This is quantitative thinking.  You have been doing similar discussions with your students but you may have not pointed out the quantitative aspect of this observation in the past, assuming that your students would readily see the quantitative aspects of this event.  I know that is exactly what I used to do but if we want to focus more on quantitative skills we have to bring them up to the top and not leave them below the surface, hoping the students will automatically figure it out.   Knowing what I do today, I wish I had made this emphasis more in the past.  Turns out, that one of the big quantitative errors that the public makes is mixing up quantities and rates.
At this point I also introduce the idea of a catalyst as something that increases the rate of a reaction—without being part of the reaction.  The definition is not exactly, spot-on but it is good enough to begin developing a conceptual model–which, again takes us into more quantitative thinking.
Modeling to develop a foundation:
When I was in the classroom this is where I’d start drawing representations of catalase and H2O2 on the whiteboard.and implying motion with lots of hand motion, all the while asking questions about the process.  Of course the purpose of this was to provide the start of a mental model for what was going on at the molecular level to help the students inform their experimental design.  Today I’d do things differently.  I’d use the computer based Molecular Workbench models that are available at Concord.org.  We would have already visited this site, previously so I wouldn’t need a do do much in the way of introducing the site itself.  This type of site, in my mind, is a game changer that makes the abstract world of molecular interactions more accessible and helps to reduce student mis-understandings creating more rigorous mental models.  A very important aspect of these models is the randomness incorporated into the models.  One of the most difficult ideas to get one’s head around is the idea of random motion and interactions leading to the order we see in living things.  Check out this paper to learn more about this teaching/learning challenge:  Garvin-Doxas, Kathy, and Michael W. Klymkowsky. “Understanding randomness and its impact on student learning: lessons learned from building the Biology Concept Inventory (BCI).” CBE-Life Sciences Education 7.2 (2008): 227-233.
These models are 2D agent-based computational models which means each structure in the image is an agent with properties and actions—that interact with the other agents.  The actions and interactions are based on kinetic theory and do not include quantum effects.  Here is the starting reaction representation.

Unlike the catalas/H2O2 decomposition reaction, this model represents a replacement reaction.  In this particular screen shot one of the green molecules has split and could be ready to combine with the purple molecule atoms if the purple molecules were split.  This may not look like a quantitative model but it is.  The reaction without the catalyst does happen but takes a long, long time.  Note that at the bottom there is a time measurement, there is a given, starting number of reactants and there is a measurement of reaction completion.  All quantitative parameters that students can “take data” on using the pause button and simply counting…..
Here below, two catalyst molecules have been added and in a very short time the reaction moving to completion.  Note that while the reaction is near completion the catalysts are unchanged.

Now, at this point I have to make a decision.  Do I have the students collect some data to help form their conceptual understanding or do I simply let their impressions of the model with just few trials guide their understanding.  Either way, it is important that I use a series of questions to guide the students to my targets:  1.  an understanding that the reaction is speeded up and hence rates are something we might want to measure,  2.  that the catalyst provides an “alternate pathway”, 3. that there is a limit to how fast the enzyme works, 4. that even when the reaction is “complete” things are still happening, and 5. that if we re-run the reaction, collecting data each time the results are slightly different but predictable.
You can play with the model right here:
Here’s the link to the model of catalysis where you can explore the model yourself or with your students:
But wait there’s more!
When I use any kind of model, now in the classroom, we have a discussion of the strengths and weakness of the model in play.  Usually, when I show the model above to teachers I get quite a few aha’s and general statements of approvals.  With that in mind what do you think are the strengths of this model?  More difficult for students, at least, is to come up with the weaknesses or limitations of the model.  Often they focus on trivial problems, like the atoms aren’t actually green and purple and miss others like this is a two-dimensional space.  They will no doubt have a difficult time with the idea of scaling time.  For the catalase system this model’s size scales are way out of wack.  What are some other “issues”?
In addition to a computational model a good strategy would be to have the students develop their own physical,  model of a catalyst speeding up a reaction.  Biology teachers have promoted the toothpickase lab as an enzyme lab over the years.
Googling toothpickase will bring up all sorts of prepared documents and images.  This model is a great one to work on and explore.  It will definitely help guide questions and experimental design to explore catalase but consider having the students come up with the model themselves with just a little prompting/demonstration from you.  Use questions to help them figure out the quantitative parameters, the idea of rates and how to structure and graph the results with the idea of supporting and communicating a scientific argument.  Try to avoid the temptation of providing a structured set of lab instructions and tables to fill out for toothpickase–in other words don’t be too helpful.  Every time we do that we are taking away a chance for the student to work on one of their quantitative skills.  One of the attractions of this model is that the students grasp what is modeled but instead of making it a center point of your lesson consider using it to support your exploration into an actual biological system–the catalase/hydrogen peroxide system.  Again,  help the students discover weaknesses and strengths of the model.
Experience with a physical model or the computational model should provide enough background that perhaps you can lead your students to develop a different kind of model—a symbolic model like this:

By Thomas Shafee (Own work) [CC BY 4.0 (http://creativecommons.org/licenses/by/4.0)], via Wikimedia Commons
Paula Donham has this same model in her write-up.  This particular model is the basis for further analysis later on in this lab.  So consider trying to get to a model like this before moving forward.
Remember,  I said that I would suggest more quantitative things to emphasize than anyone would want to actually use in the classroom so pick and choose.  There’s a couple of other models we could explore which I will explore towards the end of the lab but for now, the students should be ready to start designing how they are going to collect data that will help them understand the catalase/hydrogen peroxide system–which is the next thing I’ll talk about.

Teaching Quantitative Skills using the Floating Disk Catalase Lab: Intro

I find it remarkable how deeply the biology education community has embraced the call to increase quantitative skills in biology.   This is certainly not an easy change to incorporate into our curricula and it is one that the community will be working on, tweaking and improving over time as our own instructional quantitative strategies and skills mature.  But even with this willing effort by the community where does one find the time to add “something else” to an already packed curriculum?   The first part of the answer to that question is to first have confidence that it can be done;  the second part of the answer has to do with strategic and efficient curriculum decisions; and the third part of that answer is to realize that, like our students, we are somewhere on learning progressions ourselves and that our skills and understandings will deepen the more we teach quantitative skills.  No one has time to teach all the biology they would like to teach.  Every year most of us make all sorts of decisions about what to include, what to emphasize, and what to leave out.  The challenge of adding structured instruction in quantitative skills is daunting, particularly since most of us have not had time to develop our own math-based pedagogical tools and skills.  With that in mind we often fall back on the type of math instruction that we likely encountered in our own educational background.  If, like me, most of your math instruction was based on algorithms and focused on getting answers instead of learning how to do math, then likely if we model our quantitative skill instruction on the math instruction we experienced, we won’t be doing a very good job helping our students develop quantitative skills.  Instead, perhaps we (the biology teaching community) should consider delivering quantitative skills instruction in a way that models effective and efficient math instruction informed by research.  Here’s the good thing–it turns out that many of the strategies that work well for teaching science also work well for teaching math.  We biology teachers just need a bit more experience trying to explicitly teach appropriate quantitative skills.  We need to develop our own specialized pedagogical content knowledge.   I thought I’d put out an example of how this might work in a classroom–certainly not as an exemplar but more as a starting point.

To this end, at the 2016 NABT meeting Jennifer Pfannerstil, Stacey Kiser and I shared strategies to introduce quantitative skills focused around a classic lab: The Floating Disk Catalase lab.  The earliest version of this lab that I know of was published in ABLE.
A Quantitative Enzyme Study Using Simple Equipment by Beth A. D. Nichols and Linda B. Cholewiak.   Yes, that is the same Beth Nichols that recently retired from ETS but has worked with so many in this community.

In this series of posts I’ll walk through the material we presented at NABT along with some discussion on the rationale of each example coupled with resources so that you should be able to design your own lab that structures quantitative skills.  A caveat:  I want to emphasize how rich this particular lab is for developing quantitative skills–in fact I’ll present more possible ideas than probably anyone will want to use in any particular class.  So pick and choose what works for you and your class but consider the examples presented here as something to shoot for with your students.  Let’s get started.

Lab Overview:  

If you are not familiar with the lab it features a simple and student friendly method to measure/quantify enzyme action or kinetics using disks of filter paper soaked in a yeast solution as the enzyme source and a solution of hydrogen peroxide as the substrate.  Here’s a write-up by Paula Donham on the technique:  http://www.kabt.org/wp-content/uploads/2009/02/catalase-enzyme-lab.pdf  Note that one of the educational goals that Paula used this lab for was to introduce the use of box plots as a way of presenting your data.  (That’s a quantitative skill, btw.)
The materials:
How does it work?
Dip the paper disk in the yeast solution.  The yeast solution provides a set amount of catalase per disk.  Drop the disk into a solution of hydrogen peroxide.

The catalase breaks down the hydrogen peroxide into water and oxygen.  The oxygen bubbles catch in the paper fibers and eventually cause the disk to rise. You can see the disk staring to rise in the lower right hand corner of the cup.

I use the plastic cups to make a dilution series with the hydrogen peroxide and the 24 well plates for the testing.  The well plates allow you to put one disk per well (which might lead to better precision).
Ready to collect data with eight data points per substrate dilution:
Here’s a short video of the procedure using the well plate:
Dip a disk in the yeast, drop it into the hydrogen peroxide and time how long it takes to rise.
Why use this lab for introducing quantitative skills?
The simplicity and precision of this lab technique allows the teacher and the class to more deeply explore concepts about enzymes but also to explore how different quantitative skills can provide a path to even deeper understanding.  The key here is that the technique is so simple, the students can concentrate on thinking about what is going on with the enzyme, how to capture that quantitatively and how to support their conclusions with data.  And, they can simply do it over if errors are made since it takes a small amount of time.  There are some other aspects of this lab that allows you to introduce different approaches and deeper understanding as you build quantitative skills.  In my classes, I had three main goals for this lab:  1.  To begin an understanding of enzymes and enzyme action; 2.  Introduce and practice a number of quantitative skills (including serial dilutions and graphing);  and 3 Introduce and practice experimental design and scientific argumentation.  In my classes, we would introduce the technique, let everyone in the class practice it,  and then assign the actual data collection as homework.  The students had to acquire their own materials at home, collect the data and report back to class.  They could work collaboratively or with their families.  The lab is safe, inexpensive and doable.  By assigning the data collection as homework, this freed up class time to work on the quantitative skills.   The students generated mini-posters to share their work with their peers.  In the next post I’ll talk more about how you might present this lab to students and the types of quantitative skills you can build and practice.

Get Ready and Sign up for DNA Day

ks_DNADay_logo_goldLast year, the biology folks at the University of Kansas adapted a successful program from the University of North Carolina:  DNA Day.  The teachers I talked to loved the program and so did their students.  Don’t miss out.

Sign-ups for Kansas DNA Day 2016 are now open! Kansas DNA Day features graduate and advanced undergrad students in the biological sciences traveling to local high schools to give interactive lessons on the applications of genetics and genomic sequencing. The event will be held the week of April 21st. We currently have ambassadors ready for the greater Kansas City, Lawrence, Manhattan and Wichita areas so if you are a high school science teacher in one of these areas, sign up to have ambassadors come visit your classroom! More info and a link get involved can be found at http://ksdnaday.odst.dept.ku.edu/Welcome.html, or e-mailkansasdnaday@gmail.com for more info.