INTRODUCTION
National initiatives in biology education recognize the need for all undergraduate biology students to participate in research (
1–
3). Biology students who participate in research experience many fruitful outcomes, including increased persistence in science, increased science self-efficacy, lasting science learning, and increased understanding of research processes (
4–
11). Course-based undergraduate research experiences (CUREs) involve students in science research within the context of a course. By integrating research directly into the curriculum, CUREs enroll a greater number of students than can be accommodated by one-on-one mentorship alone (
12–
17). Here, we discuss a CURE module in which students explore links between genotype and phenotype through the measurement of quantitative plant traits (i.e., plant traits that fall on a continuous scale, e.g., growth); and traits that fall into categorical groups (e.g., alive or dead), which are measured qualitatively.
Facilitating students’ understanding of the link and distinction between
genotype and
phenotype and their interplay with the environment is an especially important goal in biology instruction, as is facilitating the understanding of structure, function, variation, and natural selection (
2). Phenotypes are influenced by both genetics and the environment, and characterizing phenotypes often requires quantitative approaches. For example, the interaction between a human’s genotype and their environment throughout development, (e.g., childhood nutrition) can influence adult stature and vary among individuals, where stature is a quantitative trait (
18). This is one example of a genotype-by-environment interaction that is central to agriculture, human health, and natural systems. Measuring quantitative traits and describing the influence of genetics and environment on phenotypes also exercises students’ use of quantitative reasoning, a key competency in
Vision & Change (
2,
18).
The Undergraduates Phenotyping
Arabidopsis Knockouts (unPAK) network consists of more than 16 institutions in which students measure quantitative phenotypic traits on a variety of genotypes of the model plant
Arabidopsis thaliana. Investigations of genotype and phenotype links are made possible in the model organism
Arabidopsis due to the extensive library of available
Arabidopsis T-DNA insertion mutants. In these plant lines, T-DNA (transferred DNA) is used to disrupt specific genes within the
Arabidopsis genome. By growing non-mutant and mutant plants in two environmental treatments and measuring growth, size, and reproductive phenotypes (all quantitative traits), students and researchers can investigate how insertion mutations that disrupt gene function may alter plant responses to the environment. This makes it possible to determine whether that gene has a positive, negative, or neutral effect on
Arabidopsis fitness under varying environmental conditions (
19,
20). The
Arabidopsis system makes it possible to obtain quantitative phenotypic data on a library of tens of thousands of T-DNA insertion mutants, with the goal of large-scale coverage of the nuclear genome (
20,
21;
http://arabidopsisunpak.org). Currently, more than 38,000 Salk T-DNA mutant lines (Salk Institute for Genomic Analysis [
http://signal.salk.edu/tabout.html]) are available from the Arabidopsis Biological Resource Center (ABRC) (
arabidopsis.org). Of these lines, ~9,000 are currently screened and curated in the unPAK stock center (for details see
arabidopsisunpak.org, also see
20).
Instructors of CUREs can select mutant lines from those curated by unPAK, expose them to different environmental treatments, and measure quantitative traits associated with plant growth and fitness to test questions with a focus on ecology, evolution, plant science, or genomics. All unPAK students use the same standards for experimental design and measurement methodology and are subject to the same quality controls on data. unPAK CUREs are unique in that each course is supported by an education-research network consisting of instructors and students working toward the overarching goal of linking genotype to fitness phenotypes. Below, we describe the design and student outcomes of a representative unPAK CURE module from one of the campuses that is available for adoption.
Intended audience and prerequisite student knowledge
The CURE module presented here is currently offered at multiple institutions (see
arabidopsisunpak.org for an updated list of participating institutions and classes). It has been successfully integrated into lower-division biology majors’ laboratory courses with a focus on ecology, evolution, plant science, or genetics. Here, we describe a version of the CURE module from the College of Charleston (CofC), where students have previously taken Introductory Biology. As this module is designed for integration into a lower-division class, students are not expected to have extensive biology knowledge and there are not multiple biology pre-requisites prior to the class. However, it is ideal if they have introductory-level knowledge of how to conduct literature searches, read primary literature, design simple experiments, manage data, conduct basic statistics, and graph. These skills are obtained by students in their first introductory biology course at the College of Charleston or in equivalent transfer courses. We also require a pre-or co-requisite introduction to statistics course. See the appendix for modifications of the CURE module which are also ready to be adopted that have been used at additional institutions in upper division biology courses.
Learning time
The CURE module is integrated into a course that meets weekly for one three-hour laboratory period and two 75-minute lectures for 14 weeks. Five laboratory periods over the course of the semester are dedicated entirely to the CURE module (weeks 1, 2, 3, 7, and 10,
Fig. 1), and two other weeks are dedicated in part to facilitating CURE activities (weeks 8 and 9), making in-class time 20 to 25 hours. To match the life cycle of the plant, CURE activities begin in week 2 and end in week 11 of the 14-week semester. The plants require time to grow over the course of the activity; therefore, there is one three-week break (weeks 4 to 6) during the semester when plants are growing and other, non-CURE-related, learning activities are implemented. Likewise, weeks 11, 12, 13, and 14 are dedicated to activities that are not directly related to the CURE module. Work done out of class is scaffolded, such that early weeks have minor time commitments (<1 hour) and build toward the final writing assignment which requires ~5 hours for graphing, statistics, and writing.
Learning objectives
Upon completion of this CURE module, students will:
1.
Demonstrate evidence of the ability to measure quantitative plant phenotypes, analyze quantitative data through graphing and statistical analyses, and communicate findings (Learning Objective [LO] 1)
2.
Demonstrate an ability to explain the link between genotype and phenotype, potential influences of mutation on phenotype, and variation across genotypes in phenotypic responses across environments (LO2)
3.
Engage in scientific research with the opportunity to discover something new to the scientific community (LO3)
4.
Gain confidence in their ability to do scientific research (LO4)
DISCUSSION
Field testing
The CURE was implemented in a foundation course taken by biology majors focused on ecology and evolution in which students perform science skills associated with unPAK and emphasize communication skills in science (e.g., reading, writing, data management, synthesis and presentation skills). The CURE was developed in 2012 and subsequently revised, then conducted in ~55 course sections across 11 institutions in class sizes of ~20 students. Evidence of student learning below describes a subset of sections that participated in the unPAK CURE.
Evidence of student learning
Evidence of LO1
Variation in plant phenotypes was uncovered in data collected by unPAK students in the classroom. Students contributed this data to a national database used by researchers across the unPAK network, demonstrating students’ ability to measure quantitative plant phenotypes and communicate their findings. Out of 123 pairs of student data reviewed from a random selection of sections between 2012 and 2016 at CofC, 80.5% of student pairs’ data files met data inclusion standards and were subsequently included in the unPAK database. This assessment underscores students’ success in achieving gains in their ability to execute quantitative phenotypic methods. Please see the sample plant data section for an example of student results that met data inclusion standards and indicate achieving LO1.
Evidence of LO2
Students successfully demonstrated an ability to explain the link between genotype and phenotype and variation across genotypes in phenotypic responses across environments. The final paper (see Appendix 7) is the most accurate and authentic measure of this learning outcome since students are required to construct arguments that link genotype, phenotype, and environment in the paper (see Suggestions for determining student learning section, above, for details). Any student failing to execute paper tasks linking genotype, phenotype, and environment is unlikely to receive above a maximum score of 66% on their final project (see the rubric and point structure in Appendix 7). Since 66% is considered a barely passing grade, passing the final project with a 70% or better indicates accomplishment of LO2. Of students who completed the assignment and the course (over 16 sections each with ~20 students per section), 97.7% received an acceptable passing grade on the final paper, indicating achievement of this outcome. Among the paper components most closely related to LO2, students most commonly struggled with the explanation of their results in the context of prior work, indicating that future work needs to focus on class examination of results and discussion sections of published literature. Future classes could incorporate more opportunities to practice this skill. For example, students might read additional relevant literature and engage in a class discussion about how the literature is important to their work, with specific coaching on how they might reference the literature in their final report. The module written assignment and activities represent 16% of the total points for the four-credit course at CofC.
Student survey data collection
Surveys with questions to assess discovery and science self-efficacy were distributed pre- and post-course to assess LO3 and LO4. Eighty-three percent of students given the opportunity to participate responded from one fall and one spring semester. We used
t-tests to analyze the data and the Benjamini-Hochberg false discovery rate (FDR) correction to account for potential Type I error in conducting multiple
t-tests (
25). We report FDR-adjusted
p values below.
Evidence of LO3
Students had opportunities to discover something new to the scientific community. The course provided students with opportunities to make relevant scientific discoveries comparable with students in a national sample of CUREs. The discovery and relevance scale used to measure students’ opportunities to make discoveries consisted of six questions with five possible responses each (summed score range: 5 to 30, Cronbach’s alpha = 0.87). Opportunities for relevant discovery reported by unPAK CURE students (mean = 25.14, sd = 5.16) did not differ significantly from students in the national sample of CUREs reported in previously published work by Corwin and colleagues (
9,
17) (
n = 72, mean = 24.35, sd = 4.04,
p = 0.408). In addition, students showed significantly more opportunities for relevant discovery than students in a national sample of traditional labs (
n = 60, mean = 20.77, sd = 5.82,
p < 0.001). These results demonstrate that the unPAK CURE is similar to other CUREs nationally in offering opportunities for relevant discovery, which is predicted to increase students’ motivation and contribute to their persistence in science and, importantly, offers students the opportunity to work on a problem with an unknown answer. This constitutes a learning opportunity regardless of whether students’ results support a specific hypothesis (
17). However, it is important to note that, overall since 2012, over 150 mutant lines have been screened in unPAK CUREs, generating new information that constitutes unPAK’s novel discoveries.
Evidence of LO4
Students gained confidence in their ability to do scientific research. The self-efficacy scale used to measure students’ changes in self-efficacy consisted of five questions with six possible responses each (summed score range: 6 to 30, Cronbach’s alpha = 0.86, 26) given to course sections of 20 across two semesters. Students made significant gains in science self-efficacy, with a pre-course mean score of 21.99 (sd = 4.71) and a post-course mean score of 24.20 (sd = 5.40;
p < 0.001). Such an outcome is a common result of one-on-one mentored research participation and has been characterized as an indirect contributor to persistence in science, since it precedes outcomes that directly predict persistence (
22,
23,
27).
Faculty feedback
After implementing the CURE, instructors (
n = 6) were queried about their experience (
28; further details in Appendix 11). We asked, “Why did you choose to implement a CURE? What goals did this module accomplish that a non–research based course might not have accomplished? What was the most important outcome of this module for your course?” All faculty agreed that the activities were valuable for their students and were well designed while offering flexibility. All stated they would use the CURE again in their course (Supplemental Table, Appendix 11).
Possible modifications
The multi-week module presented here can be modified to serve curricular needs for other foundation courses in biology, ecology, evolution, genetics, or plant science. Modifications can occur via selection of various mutant lines of interest for instruction or research (e.g., lines from a particular biochemical pathway), or by varying the growth environments (e.g., to focus on a topic of plant nutrition), or by emphasis on T-DNA insert position (intron vs. exon) or protein function to meet particular content or curricular goals. Inclusion of genetic assessments and advanced statistical approaches are additional examples of modifications, which may increase difficulty for more advanced students. For settings with limited growing space, students in different sections can measure the same plants. Findings from one semester could inform the instructor as to the next step in an inquiry, for example, changing environments with the same mutant lines.
ACKNOWLEDGMENTS
We thank Erik Sotka, Yana Wieckowski, Claudia Alt, Abigail Zoger, and Cynthia Chang for assistance with the unPAK CURE. We are grateful for the support from NSF IOS-1052262 and IOS-1355106 to M.T.R., A.E.S., and C.J.M., IOS-1146977; IOS-1052323 to H.S.C., and IOS-1050153 and IOS-1354603 to M.J.W. The NSF did not participate in any components of data collection or manuscript preparation. Research reported was approved by IRB review boards or determined to be exempt. We thank Irfanul Alam, Erin Fried, and Gracen Mitrick for comments on an earlier version of the manuscript. We thank anonymous reviewers for comments on a previous version. The authors declare that there are no conflicts of interest.