Content-Type: text/html A Survey of Faculty Evaluation Practices in Journalism and Mass Communication Presented to: Association for Education in Journalism and Mass Communication Advertising Teaching Division c/o Nancy Mitchell College of Journalism and Mass Communication 45 Avery, University of Nebraska Lincoln, Nebraska 8588-0138 Presented by: Robert C Sitz PhD & Frank Thayer PhD Dept of Journalism and Mass Communication New Mexico State University PO Box 300001, Dept 3J Las Cruces, NM 88003 Ph: (505) 646-1539 Student Evaluation Abstract Although it was presumed that most schools formally and systematically evaluate faculty for purposes of salary, tenure and promotion, the literature revealed little current evidence regarding the extent or nature of faculty participation. A lengthy survey questionnaire was mailed to members of the Association for Education in Journalism and Mass Communication to learn about faculty evaluation practices. This paper discusses the evolution of faculty evaluation, reviews several survey questions, and reports that despite diverse course formats, formal classroom evaluation is widely practiced; the preferred method is student evaluation; and rating instruments are popular for both formative and summative decisions. Student Evaluation INTRODUCTION There was a time when faculty evaluation was conducted informally and unsystematically--a period when the criterion of teacher effectiveness was objective and direct. Those were the days when Socrates could be seen talking in the marketplace to whoever would stop to listen. Those were the times when Aristotle walked in the Lyceum Gardens and gathered around him anyone who wished to engage in dialogue. The origins of student rating of faculty members can be traced to this era when to be an effective teacher was to be a person who attracted students (Travers, 1981). By the Middle Ages, the ability to attract students as a criterion of effectiveness took on an additional dimension--a remunerative measure of a teacher's worth. In addition to attracting students, the professor had to be able to extract fees from those students who were being taught. The University of Paris, founded in the 10th century, required professors to generate their own source of income through fees paid directly by the students. To survive, a professor would have to receive an adequate evaluation in the form of coins pressed into an outstretched palm (or dropped into the mortarboard held in the hand) as the students filed out of the classroom at the end of a lecture (Travers, 1981). At the University of Padua in Italy, students even hired and fired their own teachers (Seldin, 1980). BACKGROUND In modern times the evaluation of faculty performance has become less direct and much more complex. When school attendance became law, the idea of educational effectiveness became a public issue (Travers, 1981). In America, the system of education is generally regarded as a "mass" approach with open access. It is interesting to note that this "mass" perspective of education parallels the mass consumption characterization of the public by marketing Student Evaluation practitioners. In the early 1930s, business began to study "these masses" by conducting marketing research surveys that utilized consumer questionnaires. Although student evaluation was officially born at Harvard in 1924 (Seldon, 1980), the 1930s is cited as the period that inaugurated the use of similar survey questionnaires to evaluate faculty. In both marketing and educational contexts, this "mass" perspective has raised issues and spawned debate as part of the momentum of the accountability movement. The modern use of student evaluation is an outgrowth of the accountability movement of the 1960s and 1970s, a period which teacher performance was examined as a measurable element of modern education. As Robert Lovett said in 1970, "accountability defined as the right to insure a good education and to sever from the school system those who do not contribute to that end. In a very true sense, the teacher is first accountable to his pupils (Lovett, 1970 p. 130)". The problem is that in modern times there is a very diverse profile of educational institutions. Colleges and universities differ greatly in quality, character, purpose, size, complexity, and economic stability. The role of the professor in the educational process varies, and the job description of teaching is elusive and value-laden. Establishment of performance criteria and techniques for evaluation of faculty performance has become commensurately elusive. Therefore, throughout the evolving history of behavior evaluation, students, faculty and administrators have all claimed at one time or another that criteria are both reliable, valid useful and unreliable, or invalid and useless (Aleamoni, 1981). Despite the debate, the popularity of faculty performance evaluation gained momentum as America entered the turbulent sixties. Then, students began to reject autocratic administrators and put pressure on leaders for greater involvement in educational institution policies and activities that pertained to them. Students asked, and demanded, that administrations give them greater voice in the instruction they were experiencing (Seldin, 1980). Evaluation of faculty gave students the voice they were seeking. Student Evaluation The 1960s also defined a period when college enrollment was in a rapid expansion mode and many institutions had been largely concerned with recruiting and retaining faculty members. Tenure and promotion were almost automatic, and most institutions had long gotten by with poorly defined evaluation procedures (Centra, 1979). Eventually, enrollments began to taper off, and institutions of higher education no longer needed to add faculty and staff members. Institution budgets became tight, and many were at or near their prescribed limits in the percentage of faculty on tenure. Nationwide, 62 percent had tenure in 1977 (AAUP, 1978 in Centra, 1979). Teachers began having to prove that they deserved tenure. This forced institutions to make fine distinctions among generally competent instructors. Evaluation systems helped administrators to make these "distinctions." The confluence of student demands for greater involvement in administrative and instructional decisions, a historically loose tenure and promotion policy, and new challenges pertaining to the business of running institutions that were beginning to emerge in the 1970s resulted in a very hurried movement by institutions to embrace a wide variety of new procedures for evaluating instructional performance. From about 1976, education standards and costs have been continuously and increasingly under scrunity. Institutions of higher education, like other non-profit making organizations, have been increasingly asked to justify their activities and to account for their use of resources, as well as their performance, to external funding bodies such as state legislatures (Sizer, 1979). The case for development of some type of performance indicator (PI's) has been building up and gaining momentum throughout the 1980s. LITERATURE REVIEW In the aftermath of the numerous educational reform studies of the early and mid-1980s, educators and policy makers across the United States established procedures for assessing and Student Evaluation refining educational programs. By 1990, educators in 41 states were required to evaluate teachers. Of those states, educators in 34 were required to do so because of state legislation, 30 because of state department of education policy, and 5 because of litigation. Of 41 states, 24 "mandated that specific evaluative procedures and/or forms be used, 7 "suggested" specific procedures, and 10 neither required nor suggested specific procedures and forms" (Valentine, 1992). Pressure on administrations in regard to the design, and implementation of instruction now comes from students, parents, and legislators. Students are much less timid in expressing dissatisfaction than in previous times, and many parents express uncertainty that the effectiveness of instruction warrants the high costs of a college education. Since higher education is a labor-intensive industry with 70 to 80 percent of typical budgets going for salaries, (Seldin, 1980), it has been natural for institutions to look at instructional staffs as fertile areas to reduce costs. Granting tenure, for example, can have serious financial implications. It can commit an institution to a salary that can be cumulatively substantial over a 30-year period. So, for economic reasons alone, standards for tenure have become more rigorous. It isn't surprising that a diminishing number of professors are selected for promotion and tenure. In considering a professor for promotion in rank, tenure, or retention, there is a wide variety of procedures and factors that an institution can select and utilize. Scholarly research and publication, creative activities, service, peer reviews, classroom visitations by administrators, teacher self-assessments, testing, and learning outcomes are examples of potential criteria for the evaluation of faculty. However, without reverting to the methods of the Middle Ages, the harvesting of student "opinion" about a professor is theoretically limited to exit interviews, suggestion boxes, small discussion groups, questionnaires to alumni, outcomes assessment techniques, face-to-face discussions, student testimonials and "consumer" questionnaires. In reality, a written questionnaire or rating scale appears to serve as the predominant method of Student Evaluation gaining information about professors. In a 1983 survey that went to 770 academic deans listed in the U.S. Department of Education's "Directory", all listed as accredited, four-year, undergraduate, liberal arts colleges, classroom teaching was regarded, of 13 criteria, as the most important index of overall faculty performance with the frequency of use reported as 98.7 percent. Of the 13 sources of information for faculty evaluation enumerated in the study, 67.5 percent of deans surveyed stated that they a lways used student rating information. This compares to 54.8 percent of all of the deans who were studied in 1978 (Seldin in Gabbin, 1990). It is apparent that both administrators and faculty committees are relying on student ratings to help shape their judgements about faculty performance. Today, the use of student evaluations of faculty may be by far the most common method of rating the effectiveness of instruction (Wright et al in Gabbin, 1990), with classroom visits by peers a distant second (Miller, 1987). The technique developed by market researchers in the U.S.A. in the 1930s, the survey questionnaire, appears to be the most favored of all methods of obtaining student judgements about their courses (Winter Hebron, 1984). RESEARCH PROBLEM The evidence is compelling that administrative judgements concerning faculty tenure, promotion, and retention rely heavily, if not exclusively, upon some type of feedback regarding teacher performance in the classroom. It has been reported that when evaluating faculty for promotion and tenure, classroom teaching is a more important factor than publication, research, service or any other criteria. Although there are a number of methods of gauging teacher performance, student evaluation is generally agreed to have the most influence on administrative decisions concerning a professor's future (Miller, 1987). In this context, it appears that rating instruments are increasingly becoming the evaluative method of favor for both formative and Student Evaluation summative decisions about instructional effectiveness (Aleamoni, 1990). However, the literature yields very little current empirical data concerning the extent of the use of student rating systems. Earlier it was suggested that the American higher education system is really a "non-system" consisting of an open patchwork of institutions with a variety of missions. Within the framework of higher education, Journalism and Mass Communications programs generally present an eclectic course of studies that epitomize the idea of diversity. Approaches in academic philosophy within the Mass Communications classroom run the gamut from the theoretical to the hands-on. Classroom formats represent a range of pos sibilities. It is likely that student evaluation of faculty systems and procedures might vary to the same extent as the diverse approach to curriculum design and content that is inherent in the subject matter of Journalism and Mass Communications courses. It would therefore not be surprising to learn that such programs of study walk to a unusual drumbeat regarding faculty evaluation. This study was designed in part to determine how many faculty in the membership of the Association for Education in Journalism and Mass Communication (AEJMC) use feedback in the form of systematic student evaluation of faculty. Verification of the nature of measuring student feedback was also sought. There is no evidence that programs in Journalism and Mass Communications have embraced faculty evaluation methodologies to the extent that has been attributed to other disciplines. And for good reason it would be presumptuous to assume that Journalism and Mass Communications programs have followed the national trend. Therefore, an additional purpose of this research study was to determine the extent to which faculty have adopted the use of feedback instruments to evaluate the quality of teaching performance. Empirical insight was sought concerning how evaluation is conducted at the undergraduate level, how often evaluation measures are taken, and how the instruments when used are administered. Student Evaluation METHODOLOGY A survey questionnaire was sent to faculty members of the Association for Education in Journalism and Mass Communication. As an organization, the AEJMC has 16 divisions that represent subject matter in all of the relevant disciplines. A cover letter introducing the research investigators and the general purpose of the survey was sent to AEJMC members in each of the divisions. As a result, schools in every state and several foreign countries were represented in the survey. More than one response was rece ived from each state with the greatest representation from Texas, California, Wisconsin, and Florida. The survey instrument was lengthy, consisting of 60 questions; however, the survey questions were straight forward and bi-polar requiring a simple yes/no response. Although the entire questionnaire addressed the subject of evaluation of faculty by students, this particular report focuses on only those particular questions that provide a quantitative picture concerning the numbers of faculty that are involved in some form of systematic evaluation by students. The survey questionnaire was sent to a total of 2,811 current members of AEJMC. Each mailing included a pre-addressed stamped return envelope with a request to respond to the survey by April 29, 1994. It was presumed that a 30-day-plus turnaround for responders to the mailing would be adequate. However, a sizable number of survey questionnaires floated in long after the deadline. Those responses to the survey were not included in the response rate. The total number of usable responses to the survey q uestionnaire was 961 out of 2,811, for a return rate of 34.19%. Of those who responded to the survey, 66% were male and 38% had taught for fifteen years or more. Over half of the survey respondents had been at their respective institutions for at least seven years. Approximately one half of the participants were over the age of 46. Fifty four percent reported that they are tenured professors. Student Evaluation SURVEY RESULTS More than 90 percent of the AEJMC survey respondents stated that they currently participate in a formal system of faculty classroom performance evaluation at the undergraduate level. In practically all of the cases (95%) a rating instrument is used and the most popular format is a questionnaire. Fifty seven percent of the respondents agreed that the measurement instrument used could be "most clearly" described as a semantic differential type instrument (e.g. a Likert format that generates responses along a continuum from fair to good, etc.). However, in more than 86% of the cases, open-ended responses to questions were provided for. A very small percentage of survey participants suggested that a measurement instrument different than a questionnaire or a semantic differential format was used. Table 1: Response Frequency to Selected Survey Questions Student Evaluation Nearly all of the classroom evaluations were student generated, although it is apparent that some faculty also participate in peer evaluations and observation by the department chair. Approximately 14% of the faculty members responding to the survey indicated that they are observed by the department chair. Twenty percent also suggest that they are peer evaluated. The data do not enable a determination of which faculty only use one of the methods and which participate in more than one, or all three of the procedures. A small percent of the respondents said they did not appear to participate in any of the three types of evaluation specifically cited in the questions (student, peer, chair evaluations), yet when asked to give examples of different types of classroom evaluation, no examples were forthcoming. In nearly all cases the faculty evaluation procedure is implemented in the classroom setting and student responses are collected at the time of administration. In other words, the instrument does not appear to be of the take-home variety. Implementation of the evaluation procedure occurred within 2-4 weeks of the end of the academic session in over 90 percent of all situations. AEJMC members were split in terms of when during the class session the evaluation procedure was implemented. About one half of the faculty state that implementation of the evaluation process occurred at the beginning of the class session. The other half state that the procedure began toward the end of a classroom session. Interesting were the responses to a series of questions concerning who is actually responsible for implementing the evaluation procedure during the classroom session. All of the responses were predominantly "no" to each query that was intended to yield specific information as to the identity of the administrator. Eighty-two percent of the faculty stated that it was not "me" who implemented the evaluation procedure, 95% stated that it was not the department chair, 87 percent said that it was not a peer member of the faculty, 85% agreed that it was not an administrator such as a secretary, and 65% said that it was not a student assigned to the department (graduate assistant, work-study student). One can only infer that the evaluation administrator may be the percentage differential indicated. For example, looking at the latter Student Evaluation percentage of 65% of the respondents who stated that it was not a student assigned to the department who administrated the evaluation process, it can be inferred that 45% of the evaluations were administered by a graduate assistant or work-study student. DISCUSSION This study confirms the common perception that evaluation of journalism and mass communications faculty is almost universal in American colleges and universities, and that the evaluators are overwhelmingly the students in the college classroom. There are few who would argue with the appropriateness of this evolutionary process. While peers may have a better grasp of the level of knowledge the instructor possesses, and supervisors might understand best the protocol of the classroom environment, the student is the only person who has daily contact with the instructor and it is the student who can best evaluate the totality of the learning experience. No evaluation system is without shortcomings, but student evaluation of faculty is more in tune with allowing market forces to determine product success. The problem with faculty evaluation does not lie with the overall methods of evaluation, but with their ultimate design and uses. It would seem that academic researchers would be fully aware of the elements of survey research that can affect the results of their efforts. The atmosphere of the evaluation, the pre-survey preparation done by the faculty member, the person who administers the survey, the period of the semester in which the evaluation takes place, and the grade the student believes he or she is earning at the time of the evaluation, and the nature of the evaluation questions, are all partial explainers of evaluation results. If all this is true, it is not critical if evaluation results are used as aids to improving instruction. When the instruments become the performance indicators that determine the professional fate of the faculty member, then the process is taken to an entirely different level. Student Evaluation Four years ago, a study at New Mexico State University revealed at least 21 different evaluation questionnaires being used at the same time. The College of Arts and Science allowed each department to use a different teaching evaluation form, but still apportioned salary increase monies in accord with a ranking based on research, teaching, and service. To compare a department using a form of six questions, all on a four-point scale, with a department using a 12 question instrument employing a 7-point scale is just not very good science, yet such unequal faculty evaluation comparisons are still commonly used among university departments whose survey standards in other academic research projects are rigorously controlled to minimize errors. Unless tenure, promotion, and salary questions are to be determined 90% on the basis of research achievements, teaching evaluations will be used for summative judgements of faculty performance. Because journalism and mass communications faculty interviewed normally see teaching as their first priority, most are interested in having this competence evaluated and used as a major component of their professional success. To fairly and effectively achieve this goal, it is necessary to fully understand the eva luation process and to be able to rely on the instruments being used. Because it is clear that student evaluations dominate the field of faculty evaluations, it is then necessary to focus upon what certain evaluations mean. Most would agree that, on a 5-point scale for bad teacher-good teacher, a cumulative rating of 4.65 indicates that good teaching is occurring in the classroom; however, using the same scale for easy grader-hard grader, a 4.65 could have different implications to different observers. Faculty and administrators generally respect "hard" graders, but it is not clear whether students have a similar attitude. It may be that students would see a 2.5 as the best rating on the easy grader-hard grader scale, but this would not contribute to the faculty member's overall rating when the department chair is determining salary increases. The argument over validity of student evaluation of faculty is obviated by the concern over Student Evaluation the reliability of the instruments being used. This means that it is unnecessary to have a nationwide standard evaluation instrument but that it is important for universities, or colleges within universities, to use carefully constructed evaluation instruments for the purpose of comparison. If the current survey shows the dominance of the numerical scale question over the open-ended question, it suggests that more attention be paid to the scales being used and to what the evaluators mean by the answers they give to the questions. Many faculty look forward to student evaluations and see them as both necessary and constructive, but administrators may look on the process as a means for ranking the faculty for purposes of promotion, tenure, and salary determination. Whether evaluations can serve both purposes is a moot question: They can, and they do serve both purposes. Faculty evaluations by students are a permanent fixture of higher education, and their reliability can be improved if more attention is paid to the process and to the instruments being used. For this to be achieved, more attention must be paid to the student evaluators and the meanings they attach to evaluation terms. Further, the process used to administer evaluations should be consistent semester to semester and instructor to instructor. CONCLUSION The nationwide patterns outlined in this study are indicators of evaluation trends that will continue in institutions of higher learning into the next century. Perhaps the instruments that are used to measure performance and provide feedback to faculty and administration will provide objective insights that everyone can be comfortable with by that time. And since the evaluation instruments are used in part for purposes of making summative judgements about faculty, maybe faculty should be given equal opportunity in terms of evaluating administrators. It is interesting to note that there is a trend among organizations outside of academia to implement a performance evaluation tactic known as "upward feedback appraisal." Simply put, employees get to review their bosses. Similar to student evaluations of faculty, a list of questions is devised and workers are asked to rate the boss as very good, good, average, poor, or very poor in key areas. The questions usually center on such things as good listening skills, how well the Student Evaluation boss communicates, motivates, and displays trust and confidence in subordinates. At many companies, lack of improvement by managers from year to year will jeopardize their raises and promotion potential. Although these performance appraisals have been around for about ten years, according to Michael Seitchik, director of program development at the Wharton School's Division of Executive Education at the University of Pennsylvania, they've become "extremely popular in the last two or three years" (Vaughan, 1994). If turnabout is fair play, and given the fact that students and faculty already extensively evaluate each other, how about including annual evaluations of the administrators? Further study should also be conducted to discover whether faculty are currently satisfied with the administration, the form, the results, or the use made of teaching evaluation as conducted in college classrooms today. Primary Questions Yes Yes No No Question Frequency Percent Frequency Percent Does your institution currently use a formal system of faculty 782 88 108 12 classroom performance evaluation at the undergraduate level? Does your department (college or school) participate in a formal 801 90 88 10 system of faculty classroom performance evaluation at the undergraduate level? Does the formal faculty evaluation system that you use include the 796 95 40 5 use of some type of rating instrument? If a rating instrument is used, could it most clearly be described 437 57 324 43 as a semantic differential type instrument? If a rating instrument is used, could it most clearly be described 673 83 134 17 as a survey or questionaire? Are there open ended responses to atleast part of your instrument? 716 86 115 14 References: AAUP (1978) in Centra, John A (1979) Determining Faculty Effectiveness, San Francisco: Jossey-Bass Publishers. Aleamoni, L.M. (1981). Student ratings of instructor. In Handbook of Teacher Evaluation. Millman ed. Beverly Hills, CA, p110. Centra, John A (1979) Determining Faculty Effectiveness, San Francisco: Jossey-Bass Publishers p2 Gabbin, Alexander L, Cairns, Scott N, Benke, Ralph L. Editors (1990), Faculty Performance Appraisal - Center For Research in Accounting Education, James Madison University. Lovett, Robert (1970) "Professional Accountability in the Schools" p129-p134 in Sciara, F. J. (1970) Accountability in American Education Allyn & Bacon, Inc. : Boston Millman, Jason, Editor (1981) Handbook of Teacher Evaluation, Beverly Hills: Sage Publications Miller, Richard I (1987) Evaluating Faculty for Promotion and Tenure, San Francisco: Jossey-Bass Seldin, Peter (1980) Successful Faculty Evaluation Programs, New York: Coventry Press. Seldin (1990) Faculty Evaluations: Surveying Policy and Practices pp 3-10 Gabbin, Alexander L, Cairns, Scott. Travers, Robert (1981). Criteria of Good Teachers in Millman, Jason, editors, Handbook of Teacher Evaluation, Beverly Hills: Sage publications, p39. Sizer, J (1979) Assessing Institutional Performance: An Overview International Journal of Institutional Management in Higher Education Vol 3 No 1, pp 49-77 Vaughan, Viki (1994). Workers bet to review boss The Arizona Republic page E1 July 25 Valentine, Jerry W (1992) Principles and Practices for Effective Teacher Evaluation, Boston, Allyn & Bacon Winter Hebron, C. de (1984) An Aid for Evaluating Teaching in Higher Education - Assessment and Evaluation in Higher Education Vol (9) No (2) Summer, pp 145-163 Wright, Penny; Whittington, Ray; Whittenburg, G.E in Gabbin (Editor) (1990) Faculty Performance Appraisal - Center for Research in Accounting Education James Madison University, p55