Why Ofqual and CCEA are set to repeat the failings of 2020 in respect of 2021 awarded grades for GCSE, AS and A2.

27 Saturday Mar 2021

Tags

AQA, Belfast Live, Belfast Newsletter, Belfast Telegraph, Ben Lowry, CCEA, Dr Hugh Morrison, E.D. Hirsch, Educational Testing Service, ETS, Gavin Williamson, GCSE, GCSE changes, GCSE standards, Mike Cresswell, Northern Ireland Assembly Education Committee, Ofqual, Peter Weir MLA, Ronald K Hambleton, Standards Advisory Group

Dr Hugh G Morrison (The Queen’s University of Belfast [retired])

In 2020 the impact of Covid-19 prompted the Conservative government to replace public examinations (GCSE, AS and A2) by a process involving teacher-predicted grades. In this approach, an algorithm was used to keep teacher-predicted grades in check. This combination of teacher-predicted grades and algorithm – endorsed by Ofqual and CCEA – was attacked from all quarters in what became known as the 2020 “grades fiasco.” In Summer 2021, while teacher-assessed grades are to continue to play a pivotal role, the algorithm is to be replaced by another form of quality assurance: teacher moderation.

Ofqual hopes that the 2021 marriage of predicted grades and moderation will deliver grades which are “credible and meaningful.” Ofqual also asserts that the moderation training provided by the exam boards will help teachers “objectively and consistently assess their students’ performance” so that “the [2021] grades will be indistinguishable from grades issued by exam boards in other years.”

Anyone who has engaged in GCSE coursework moderation will be aware of its shortcomings, and one member of Ofqual’s Standards Advisory Group has warned that the 2021 approach to quality-assuring GCSE, AS and A2 grades could potentially issue in “Weimar Republic” levels of grade inflation.

It is instructive to give the broadest overview of how Ofqual and CCEA see moderation creating the circumstances whereby the 2021 grades will be “indistinguishable” from the grades issued by exam boards in past years. Consider a GCSE geography teacher. She receives training from the examining board designed to help her “internalise” the standards associated with each of the grades in GCSE geography. For example, she might be given the completed examination scripts of several different students who were graded A* in geography, several scripts graded A, and so on, down through the grades. The exam board officials might stress the key features of particular scripts that mark them out as meeting the standard represented by a particular grade. By scrutinising the completed scripts of several students graded C, for example, the teacher might gain insights into the standard: “grade C in GCSE geography.”

Our geography teacher then turns her attention to her own students’ “portfolios.” A student’s portfolio might contain his or her responses to mock examinations, so-called “mini tests” provided by the exam board, class tests, project work, examples of outstanding homework, and so on. The geography teacher then uses the scale of standards she internalised during training to assign a best-fit grade to each portfolio. Finally, external moderation might involve her exam board selecting a representative sample of her students’ portfolios to establish if her grading decisions accord with the combined professional wisdom of the exam board moderators.

Ofqual are prepared to acknowledge that moderation has some limitations: “It is often the case that two trained markers could give slightly different marks for the same answer and that both marks would be legitimate.” However, carefully conducted research paints a more depressing picture. In his book The schools we need, and why we don’t have them, E. D. Hirsch describes a study carried out by the world’s leading testing agency, the Educational Testing Service (ETS). In the study, in which “300 student papers were graded by 53 graders (a total of 15,900 readings), more than one third of the papers received every possible grade. That is, 101 of the 300 papers received all nine grades. … 94% [of the papers] received either seven, eight or nine different grades; and no essay received less than five different grades from 53 readers” (pp. 183-185).

Since a particular essay cannot be simultaneously worthy of nine different grades, for example, one is forced to the conclusion that a well-defined intrinsic worth cannot be ascribed to a particular essay. The essay’s worth can only be ascribed relative to a particular marker. In summary, the grade is not a property of the essay; rather, it is a joint property of the essay and the particular marker. To communicate unambiguously about the quality of the essay one must specify the measuring tool (in this case the marker). The grade is best construed as a property of an interaction between essay and marker rather than an intrinsic property of the essay.

It is important to stress that moderation’s difficulties extend beyond disciplines where essay-type questions predominate; all examinations are affected. In the text Fundamentals of Item Response Theory, Hambleton et al. address all tests: “examinee characteristics and test characteristics cannot be separated: each can be interpreted only in the context of the other … An examinee’s ability is defined only in terms of a particular test. When the test is “hard,” the examinee will appear to have low ability; when the test is “easy,” the examinee will appear to have higher ability. … Whether an item is hard or easy depends on the ability of the examinees being measured, and the ability of the examinees depends on whether the test items are hard or easy!” (pp. 2-3) One cannot meaningfully separate the notion of ability from the characteristics of the particular test used to measure that ability.

Now let’s return to our GCSE geography teacher. During her exam board training she studied the responses given by students who achieved specified grades in particular GCSE geography examinations. However, she must now use her judgement to assign grades to her students’ portfolios, a very different form of assessment. Dr Mike Cresswell – a past Chief Executive of the exam board AQA – underlines “the need to accept that there is no external and objective reality underpinning the comparability of results from different examinations.” This presents our geography teacher with an intractable problem: she must somehow gain insights into absolute standards which float free of any particular examination. For example, the absolute standard “B grade in GCSE geography” makes no reference to a particular test. Alas, in Cresswell’s words, “absolute examination standards are … a chimera.”

Why Ofqual and CCEA are set to repeat the failings of 2020 in respect of 2021 awarded grades for GCSE, AS and A2.

24 Wednesday Feb 2021

Posted by paceni in Grammar Schools

≈ Leave a comment

Tags

AQA, CCEA, Dr Hugh Morrison, E.D. Hirsch, Educational Testing Service, Fundamentals of Item Response Theory, Grades fiasco, Hambleton, Mike Cresswell, Ofqual, Standards Advisory Group, teacher-assessed grades, Weimar Republic

Dr Hugh G Morrison (The Queen’s University of Belfast [retired])

What CCEA & the media don’t want the public to know about the predicted grade debacle

15 Tuesday Dec 2020

Posted by paceni in Grammar Schools

≈ Leave a comment

Tags

CCEA, CCEA CEO, international standard setting for examinations, Justin Edwards, Ofqual, peer-reviewed literature, teacher predicted grades, teacher predicted rank orders

questions to Justin Edwards CEO CCEA

Mr Justin Edwards, the Chief Executive Officer of CCEA, Northern Ireland’s Council for Curriculum, Examinations & Assessment issued a reply on May 26th, 2020. It should be noted that CCEA regulates itself. Ofqual do not regulate CCEA

The first thing to notice, given the specific nature of the questions, is the ambiguous language of the replies.

Given the very specific request made by Dr Morrison in question one for CCEA to “identify a single peer-reviewed study” which would confirm that ” teachers could predict rank orders within grades with any degree of accuracy” the official reply failed to produce any evidence.

Question two raised the issue of the standards conundrum. The international literature highlights the fact that the predictions are only acceptable when teachers have access to the examination papers that the pupils would have taken.

Mr Edwards responds for CCEA without giving any reasoning. “CCEA will not be releasing any planned summer 2020 examination papers.”

CCEA reply on 2020 exam questions

MLAs and their advisors could be using guesswork to justify decisions to lockdown businesses and close schools

27 Tuesday Oct 2020

Posted by paceni in Grammar Schools

≈ Leave a comment

Tags

BBC Evening Extra, BBC GMU, BBC Northern Ireland Education Correspondent, BBC Talkback, Belfast Live, Guerra et al 2017, Jing Blakely and Smith 2011, Justin McCampbell, Michael McBride, NASUWT, Northern Ireland Assembly, Northern Ireland Education Minister, Professor Ian Young, Robin Swann, SPI-M, Stephen Elliott, The Belfast Newsletter, The Belfast Telegraph, The Daily Mirror, The Failure of R0, The Irish News, The Royal Statistical Society, The Scientific Pandemic Influenza Group on Modelling

The use of the R number in the Assembly’s release-of-evidence points to a profound misunderstanding of the limitations of that number. In particular, R is not an additive variable; one cannot meaningfully add the contribution to R of hair salons to the contribution to R of pubs and then compare with 1. This strategy makes no arithmetic sense.

Unfortunately, the R number changes with the model used to measure it. The Scientific Pandemic Influenza group on Modelling is a standing group that advises government on preparations to manage the risk of pandemics and keeps emerging evidence and research under review. SPI-M use approximately ten different models to arrive at an R number for the UK. This R number is calculated by attempting to somehow reconcile these differing values, each calculated with great uncertainty.

How can our MLAs possibly justify basing decisions which impact on people’s livelihoods, on the tiny R-related percentages published in the Assembly’s evidence? The uncertainty of R renders this unjustifiable.

These concerns about R are clear in the literature. Guerra et al. (2017) could only locate the R number for measles somewhere between 3.7 and 203.3.

Jing, Blakely and Smith (2011) published a paper entitled The Failure of R₀, in which the authors conclude, “Rarely has an idea so erroneous enjoyed such popular appeal”.

Coming right up to date, The Royal Society’s report entitled Reproduction number (R) and growth rate (r) of the COVID -19 epidemic in the UK (on page 53) struggles to make the case for R: “Given the suggested wide bounds of uncertainty that surround estimates of R in particular … are they still of value in policy formulation? The answer is definitely, yes … this is certainly a much better place to be in than just making a guess through verbal argument.”

Northern Ireland’s Health Minister and his two shields, the Chief Medical Officer and the Chief Scientific Advisor attempt to see off detractors by urging them to look at the “evidence in the round.” I am confident that no amount of additional evidence produced by the Minister and his two advisors will see off the criticisms set out in this letter.

The warning letter on centre-based moderation sent to CCEA on 23rd June

21 Friday Aug 2020

Posted by paceni in Grammar Schools

≈ Leave a comment

Tags

A-Level exams in Northern Ireland, Assessment and Testing: A survey of research, Assessment: Problems, CCEA, CCEA Regulation, Council for Curriculum Examinations and Assessment, Covid-19, Developments and Statistical Issues, Dr Hugh Morrison, Dr Hugh Morrison Queen's University Belfast, Harvey Goldstein, Jo-Anne Baird, Joel Michell, Justin Edwards, ludwig Wittgenstein, Mike Cresswell, Niels Bohr, Ofqual, P Newton, Robert, Sharon King, whatever standard of attainment it is judged by the awarders to represent, Wood

Why Centre-based Moderation cannot work

Ofqual and CCEA intend to apply a “moderation” process to teacher-predicted grades in order to prevent, for example, teachers awarding inflated grades to their students. This process – yet to be set out in detail – will focus on the Examination Centre in which each pupil would have taken his or her 2020 GCSE, AS or A2 examinations had Covid-19 not intervened. To simplify matters, consider a Centre in which, for instance, 20% to 24% of pupils have secured B grades in CCEA AS Physics for the past three years. Now suppose that in 2020 the physics teachers associated with that Centre return a B-grade prediction for 67% of their AS pupils. Does a statistical technique, or AI algorithm, or mathematical model (possibly drawing on teachers’ predicted rank orders) exist which can defensibly adjust the predicted grades to bring them into line with the 20% to 24% range of the past? Can one compute a defensible compromise position somewhere between 20% and 67%? The answer is an emphatic No.

It is difficult to escape the conclusion that Ofqual and CCEA simply interpret grades as quantities which are countable and can be assigned to individual pupils. But two of the most influential figures in UK assessment reject these claims. The UK’s examining bodies and researchers in education have been, for years, treating grades as quantifiable entities. This is because the awarding bodies have a very poor track record for in-depth thinking about the nature of the “measurements” in which they engage (see Wood[1] (1991)).

There are few individuals with Mike Cresswell’s understanding of the grading of UK examinations. Cresswell’s definition[2] of a grade as representing “whatever standard of attainment it is judged by the awarders to represent” (p. 224), indicates that counting grades as one might count pencils is indefensible. Let me be clear: I am not suggesting that the process for awarding grades needs to be abandoned. I am simply making the point that grading is not governed by strict scientific principles and that adding or subtracting grades is mathematically impermissible. As Cresswell’s definition makes clear, the grading process is a qualitative process rather than a quantitative one.

Now why is Cresswell forced to this vague qualitative definition of a grade? The reason is that the awarding bodies, education researchers, and the general public think of the grade awarded to a given pupil as a measure of the ability of that pupil. The awarding bodies think of a grade as a property of the particular pupil to whom it is awarded. But this is wrong: a grade is not an intrinsic property of the pupil but rather a joint property of the pupil and the examination from which the grade derives. In his 1996 book Assessment: Problems, Developments and Statistical Issues Harvey Goldstein[3] (a towering figure who contributed much to debates on statistical rigour in UK assessment) cautions: “[T]he object of measurement is expected to interact with the measurement in a way that may alter the state of the individual in a non-trivial fashion” (p. 54).

According to Goldstein, the examination does not merely “check up on” a pre-existing ability that the candidate had when he or she entered the examination hall. This static model is rejected for a more dynamic alternative in which the pupil’s ability is expressed through his or her responses to the questions which make up the examination. For Goldstein, ability changes as the candidate interacts with the examination questions: “Thus, on answering a sequence of test questions about quadratic equations the individual may well become “better” at solving them so that the attribute changes during the course of the test” (p. 54). Cresswell’s grade is not an intrinsic property of the candidate; rather, it’s the property of an interaction. Grades do not lend themselves to simple arithmetic manipulation and therefore quantitative procedures – such as simple regression or neural nets – are indefensible.

One can find unequivocal support for the claims of Cresswell and Goldstein in the writings of Niels Bohr and Ludwig Wittgenstein. Also Joel Michell’s research[4] can be used to establish that grades do not satisfy the seven Hölder axioms[5] and therefore are not quantifiable. There can be little doubt that the claims of Ofqual and CCEA that Centre-based statistical techniques, or algorithms, or mathematical modelling, can be used to moderate predicted grades are without foundation.

Dr Hugh Morrison (The Queen’s University of Belfast [retired])

[1] Wood, R. (1991). Assessment and testing: A survey of research. Cambridge: Cambridge University Press.

[2] Baird, J., Cresswell, M., & Newton, P. (2000). Would the real gold standard please step forward? Research Papers in Education, 15(2), 213-229.

[3] Goldstein, H. (1996). Statistical and psychometric models for assessment. In H. Goldstein & T. Lewis (Eds.), Assessment: problems, developments and statistical issues (pp. 41-55). Chichester: John Wiley & Sons.

[4] Michell, J. (1999). Measurement in psychology. Cambridge: Cambridge University Press.

[5] Michell, J., & Ernst, C. (1996). The axioms of quantity and the theory of measurement, Part 1, An English translation of Hölder (1901), Part 1, Journal of Mathematical Psychology, 40, 235-52.

(1997), The axioms of quantity and the theory of measurement, Part II, An English translation of Hölder (1901), Part II, Journal of Mathematical Psychology, 41, 345-56.

Hölder, O. (1901). Die axiome der quatität und die lehre vom mass, Berichte der Sachsischen Gesellschaft der Wissenschaften, Mathematische-Physicke Klasse, 53, 1-64.

Why the Ofqual/CCEA proposal for using teacher judgement to grade 2020 GCSE/A level examinations is indefensible.

Featured

Posted by paceni in Grammar Schools

≈ Leave a comment

Introduction

The claim made in this essay is that the academic literature clearly indicates that the capacity of teachers to predict their pupils’ grades falls far below acceptable levels. Furthermore, the evidence that teachers can rank-order their pupils within-grade is scant to non-existent. Indeed, the Awarding Bodies couldn’t stand up the claim that any of their examinations rank-order pupils on the construct they purport to measure. It follows that the only defensible solution is to provide two measures per examination: (i) a teacher-predicted grade (without an associated rank-order); and (ii) a test that could be used – if the pupil so decides – to overwrite the teacher prediction, in relevant cases. Where a pupil cannot take the test, he or she must accept the teacher-predicted grade. There is no credible evidence in the literature that one can mobilize some “standardization” algorithm (which has yet to be detailed by Ofqual or CCEA) to somehow correct for any excesses in teacher judgement.

The perils of expert prediction of all types

As far back as 1954 Paul Meehl (Clinical versus Statistical Prediction: A theoretical analysis and a review of the evidence) analysed the ability of a range of teachers to predict measures of academic success, and found scant evidence that this could meet acceptable standards. Meehl’s book ranged far beyond teachers’ predictions of grades to consider, for example, expert predictions of an individual’s probability of violating parole, predictions of success in pilot training, predictions of criminal recidivism, and so on. In his book Thinking, Fast and Slow the Nobel Laureate Daniel Kahneman (2011, p. 225) endorsed Meehl’s findings and stressed that the range of studies demonstrating the limitations of experts’ abilities to predict the future had expanded greatly since Meehl’s book was published:

“Another reason for the inferiority of expert judgement is that humans are incorrigibly inconsistent in making summary judgements of complex information. When asked to evaluate the same information twice, they frequently give different answers. The extent of the inconsistency is often a matter of real concern. Experienced radiologists who evaluate chest X-rays as “normal” or “abnormal” contradict themselves 20% of the time when they see the same picture on separate occasions. A study of 101 independent auditors who were asked to evaluate the reliability of internal corporate audits revealed a similar degree of inconsistency. A review of 41 separate studies of the reliability of judgements made by auditors, psychologists, pathologists, organizational managers, and other professionals suggests that this level of inconsistency is typical, even when a case is re-evaluated within a few minutes. Unreliable judgements cannot be valid predictors of anything.”

The perils of predicting within-grade rank order

Ofqual and CCEA are requiring teachers to rank order their pupils according to their achievement in mathematics, English, biology, and so on. However, no GCSE or A level product designed by the Awarding Bodies can itself perform this feat. The rank-ordering of candidates on the construct “achievement in geography,” for example, is a validity issue and the Awarding Bodies have a very, very poor record in this area.

In 1991 an expert on the work of the examination boards, Robert Wood, summarized his conclusions in the book Assessment and Testing: A survey of research commissioned by the University of Cambridge Local Examination Syndicate (UCLES). On pages 147- 151 he wrote:

“If an examining board were to be asked point blank about the validities of its offerings or, more to the point, what steps it takes to validate the grades it awards, what might it say? … The examining boards have been lucky not to have been engaged in validity argument. … Nevertheless, the extent of the boards’ neglect of validity is plain to see once attention is focused. Whenever boards make claims that they are measuring the ability to make clear reasoned judgements, or the ability to form conclusions (both examples from IGCSE and Economics), they have a responsibility to at least attempt a validation of the measures. … The boards know so little about what they are assessing that if, for instance, it were to be said that teachers assess ability … rather than achievement, the boards would be in no position to defend themselves. … As long as examination boards make claims that they are assessing this or that ability or skill, they are vulnerable to challenge from disgruntled individuals.”

The claim that a GCSE or A level examination could rank-order candidates on some appropriate construct would require the Awarding Bodies to use Structural Equation Modelling to compute three indices: root mean-square residual, adjusted goodness-of-fit, and chi-squared divided by degrees of freedom. To claim a rank order, these three statistics would have to be demonstrated to satisfy relevant inequalities. Can it be reasonable to ask teachers to predict something that is beyond the capabilities of the GCSE and A level examinations themselves?

The resolution

Needless to say, staff at Ofqual and CCEA are mandated to provide young people with grades that are as error-free as possible. They should take heed of Paul Meehl’s counsel in respect of teachers’ capacities to anticipate the future: “When one is dealing with human lives and life opportunities, it is immoral to adopt a mode of decision-making which has been demonstrated repeatedly to be … inferior.” If teacher judgement (omitting the requirement to rank-order) is to be used to forecast grades, pupils must also be offered speedy access to a public examination which protects them from the well-documented vagaries of teacher prediction.

Stephen Elliott

Lyra McKee

12 Sunday May 2019

Posted by paceni in Grammar Schools

≈ Leave a comment

This is the sort of honest communication between the generations that has the potential to do more good than the wasted hundreds of millions spent on conflict resolution ever could.

seftonblog

I’ve waited some time before putting pen to paper.

The death of a young woman, not much older than my daughter, is hard.

Murder is harder still.

When I got the news, at two in the morning, I did not sleep again that night.

I was introduced to Lyra about five years ago.There could be be fewer similarities.

I met this small, owlish, slightly diffident girl, in a Victoria Square coffee shop. She met a grumpy old man , with issues and a background. She had many difficulties with technology, which we laughed about. She was softly spoken, and I’m slightly deaf.

I was hoping that she could introduce me to contacts that might progress my enquiries into the murders of my parents. This she did.

Despite the disparity in our ages and in our experience of the world, she dispensed sage advice about me and my predicament. She was…

View original post 402 more words

“Research intensity” – fake news in higher education

25 Sunday Feb 2018

Posted by paceni in Grammar Schools

≈ Leave a comment

Tags

Albert Einstein, CalTech, Gravitational Wave, Hanfod Observatory, Intensity-weighted GPA, LIGO, Livingston Observatory, MIT, Neutron Stars, Paul Jump, Phil Baty, Queens University Belfast, REF, Research Intensity, Robert Bowman, Times Higher Education

Ligo Caltech

One of the greatest advances in modern physics – the detection of gravitational waves first postulated, a century ago, by Einstein in his general theory of relativity – was made by physicists at the Laser Interferometer Gravitational-Wave Observatory (LIGO). To paraphrase Richard Feynman, LIGO’s measurement precision can be expressed as follows: If you were to measure the distance between earth and the nearest star with this precision, it would be exact to the thickness of a human hair. Such incredible accuracy alone would more than justify the £150 million construction costs of LIGO as a feat of engineering alone.

THE Intensity rankings

It is instructive to contrast the measurement properties of the UK’s Research Excellence Framework (REF), which aims to rank-order the research quality of UK universities, with those of LIGO. While the REF league table has no discernible measurement properties whatsoever, its cost far exceeds LIGO’s construction costs, coming in at a staggering one quarter of a billion pounds.

THE REF winners

A recent issue of the Times Higher Education (1 – 7 February 2018) included a booklet published by the Queen’s University of Belfast which illustrates the extremes to which universities are prepared to go in using highly questionable data derived from REF ranks for the purposes of self-promotion. Page 5 of the Queen’s booklet consists of a single statement. At the centre of a black A4 page the words “Ranked in the top 10 in the UK for research intensity” (in white print and large font) sit in isolation. (In a much smaller font the university attributes this ranking to the Times Higher Education.) Pages 6 to 9 offer pen portraits of nine “world-leading academics” employed by Queen’s University. The university has used this “research intensity” claim to market the university ever since the publication of the 2014 REF. One cannot browse the university’s website without encountering the claim at every turn. It has appeared on university billboards, in promotional materials and was central to the university’s ubiquitous claim: “we are exceptional.” Why did no one notice that research intensity is a meaningless concept?

QUB THE insert cover

In the Research Excellence Framework, the research quality of journal articles, books etc. is assessed and reported on a four-point scale (five-point if one includes the ‘unclassified’ category). The scale is ordinal in the sense that a 3* article is deemed superior to one rated 1* or 2*, and inferior to an article rated 4*. Any appeal to arithmetic is impermissible because an article rated 4* is not 4 times the quality of a 1* article; a 2* article is not two-thirds the quality of a 3* article, and so on. The rules of arithmetic do not apply. (Needless to say, the challenge of assessing research quality would remain unchanged if numbers were abandoned for the grades A, B, C and D.) These whole-number ratings are then used by the Times Higher Education to compute a university’s all-important “research intensity” (reported to two decimal place accuracy) using simple arithmetic. But, as every sixth-form statistician knows, arithmetical operations are not meaningful when applied to an ordinal scale.

QUB THE p7

How can it be that no world-class scientist at Queen’s seems to have pointed out to those charged with marketing the university that “research intensity” is a highly questionable measure? Are we to believe that no UK scientist has written to the Times Higher Education pointing out the magazine’s error? One gets the clear impression from those charged with marketing Queen’s University that the Belfast campus is crammed to the rafters with ”world-leading” scientists. How could any scientist worthy of the label endorse the notion of “research intensity” when his or her sixth-form mathematics training would identify the notion as nonsensical. In particular, why have none of the university’s world-leading academics, listed on pages 6 through 9 of the Queen’s promotional booklet, questioned the nonsensical claim printed on page 5?”

The Queen’s University of Belfast and the Times Higher Education must clarify their positions on this matter.

QUB THE insert p5

Plagiarism: One law for pupils and another for teachers

04 Sunday Feb 2018

Posted by paceni in Grammar Schools

≈ Leave a comment

Tags

BBC Northern Ireland, CCEA, Council for Curriculum Examinations and Assessment, Dermot Mullan, dr neill morton, General Teaching Council Northern Ireland, GTCNI, Our Lady & St Patrick's College Knock, plagiarism, Robbie Meredith, TES

Stephen Elliott – Chair: Parental Alliance for Choice in Education

In the closing days of January 2018 it was revealed that Dermot Mullan, headteacher at Our Lady and St Patrick’s College in Belfast, was accused on plagiarising the work of another teacher. Mr Mullan immediately confessed to the offence and that, it would seem, is to be the end of the matter. His Board of Governors made no comment, the Catholic Church made no comment, and – most concerning of all – Northern Ireland’s General Teaching Council remained silent. This silence is puzzling given that Mr Mullan heads a school which makes much of its lofty Catholic principles. How does a plagiarist urge honesty and integrity on pupils in general (and pupils taking GCSE and GCE examinations, in particular)? How does Mr Mullan discipline a pupil suspected of copying the coursework of another pupil? Surely the parents of the culprit will detect a double standard here: there seems to be one rule for the children and another for their principal?

Dr Neill Morton pictured with Professor Tony Gallagher at QUB graduation.

The existence of a disturbing double standard is nowhere better illustrated than in the intervention of Neill Morton, the self-styled “emeritus” headmaster of Portora Royal School. Despite being the Education Chair of Northern Ireland’s Examination Council (CCEA), Dr Morton appeared on BBCNI television Newsline on Monday 29th January 2018 to assure the public that the whole issue of Mr Mullan’s plagiarism was overblown. This clearly demonstrates one law for pupils taking examinations and another for their teachers: if Dr Morton’s view of Mr Mullan’s indiscretion were applied to pupils, then the entire concept of public examinations would collapse. In short, Dr Morton’s comments on Mr Mullan’s plagiarism should immediately disqualify him from any public office concerned with public examinations.

Dr Morton’s failure to condemn Mr Mullan’s activities outright is even more surprising given that he has recently completed a Doctorate in Education at The Queen’s University of Belfast. A glance at that university’s website or a random walk through its McClay library will quickly reveal the seriousness with which it views plagiarism.

When pupils are charged with plagiarism the consequences can be drastic: their grades can be deleted; they may be expelled and the pupil whose work was plagiarised may fall under suspicion. One doesn’t seem to encounter the same clarity of decision-making when it comes to settling the fate of a highly-salaried headteacher like Mr Mullan. One encounters the same imbalance in respect of university students and their teachers: one can spend many hours searching for a well-defined Queens policy on staff accused of appropriating the work of other academics.

The claims advanced here deserve a response. It is completely unacceptable that Dr Morton’s judgement of Mr Mullan’s plagiarism is entirely at odds with the treatment of examination candidates guilty of the same offence. How must the parents of children judged to have plagiarised in an assessment have reacted to CCEA’s Education Chair making little of a headteacher facing the same charge? Why have the Governors of Mr Mullan’s school not made a statement? Why is the Catholic Church silent on what is a failure in morality in a person charged with leading by example? Finally, why are teachers, pupils and parents yet to hear a word from Northern Ireland’s General Teaching Council?

The AQE CEA and GL Assessment Test Results: Advice to parents: 2018

27 Saturday Jan 2018

Posted by paceni in Grammar Schools

≈ Leave a comment

Tags

Belfast Newsletter, Belfast Telegraph, Irish News, Suzanne Breen, The Parental Alliance for Choice in Education, transfer from primary school, transfer test, transfer test results 2017/18

Undoubtedly, thanks mainly to media pressure, the results of the 2017/18 transfer tests will be the subject of conversations in families all over Northern Ireland this weekend and for months beyond. The Parental Alliance for Choice in Education wish to offer our congratulations to all pupils who took the tests and express our hope that pupils are offered a place in the school of their choice. Unfortunately as with any competition based on opportunity not everyone will be able to avoid some disappointment.

Politicians, teacher unions and school principals are determined to end testing for transfer at 11

Perhaps an expression of thanks should be offered by parents & guardians for the provision of these “unofficial” or “unregulated” tests. Without the dedication, commitment, psychometric expertise, and adherence to available international standards all pupils would be attending comprehensive schools. This was the expressed aim and intention of successive Education Ministers and remains the aim of the Department of Education. This is particularly relevant given the collapse of the Executive and Assembly. Indeed it is remarkable that very little support for academic selection to grammar schools can be found in the media. This stands in stark contrast to the commercial greed of newspapers promoting the publication of schools league tables, transfer test practice booklets (while AQE provide all past papers at no cost) and just this week a transfer test guide suggesting, albeit inaccurately, scores that will get your child a place in a particular school.

As has been widely reported the number of applications for the AQE and GL Assessment tests has continued to grow. In the academic year 2016/17 14,491 test entries were received. This resulted in 11,570 applications to grammar schools for the 8,743 places available. Therefore 3 out of 4 applications succeeded. This year will be similar.

BT PPTC single test

Victoria College, Belfast have operated the questionable and non-transparent practice of dualling (accepting applications from pupils who have taken both tests or either) yet in this Belfast Telegraph article the principal Patricia Slevin proposes a single test. The dualling practice will have undoubtedly created misclassifications, resulting inmany pupils being denied a place in the school through the error of suggesting that two different tests, measuring different constructs, can simply be merged into one.

Single Test Impossible_20170128_0001_NEW

SingleTest Robinson 2012

The Parental Alliance has sought engagement with both AQE and GL Assessment, the test providers. GL Assessment refuse to engage citing their customer, PPTC, in a commercial contract. Since much is made of the fact that GL Assessment tests are free to pupils, who pays GL Assessment their charge for providing the multiple-choice, computer scanned and marked test? This raises the question of why this high stakes transfer test remains shrouded in secrecy. Recognised international standards suggest that pupils and their parents should be provided with exemplars of the questions likely to appear on tests. Neither the PPTC nor GL Assessment meet the standard. Indeed no past paper from GL Assessment has ever been published. The media have conspicuously not sought answers to this issue. Every year the BBCNI will broadcast a package on results day, invariably it will be from a school using the AQE test. No questions have been raised by the media about the dualling tables and their origin.

PPTC Practice Papers

There is no shortage of commercial Practice Papers available to purchase. Note the term “PPTC-style” All AQE past papers are made available to primary schools at no charge.

Another major distinction between the two tests is that GLA pupils only have one attempt at the examination. The time required for familiarisation, practice and the actual assessments in English & Maths exceeds that for many GCSE examinations.

Pupils taking AQE have three opportunities, allowing for a possible “off day” due to testing anxiety.

Details of GLA tests

Concerns have been raised this year about the use of content from the work of Charles Dickens in the PPTC GL Assessment English paper. Most pupils may have difficulty distinguishing between the author of fourteen and a half novels and the contemporary magician pictured below. Charles Dickens was famously known for being paid by the word published. The version of David Copperfield featured has 745 pages of text. The two exams are contrasted for parents to discuss.

Cover to DC book DC back cover David Cooperfield Magician

Contrast the above passage (randomly selected from the 745 pages in the book imaged above with the prose passage taken from an AQE 2017 paper. AQE tests are always unique; never repeated.

AQE 2017 Prose

The quest and motivation for a single transfer test must be critically examined by parents. In whose interests has the project been adopted? When the CCEA Transfer Test was ended, without a replacement examination in place, by Caitriona Ruane the prospect of compulsory comprehensive post primary schools loomed. A single (one provider) test was offered by AQE. This was quickly rejected by those mainly representing Catholic grammars. To be clear, the single test project is a manufactured crisis, clearly in the hands of politicians, civil servants, and school principals. Former DUP Education Minister kept the project alive by inviting Peter Tymms of Durham University to report on the matter. Tymms has a history with Northern Ireland primary school pupils via the now abandoned Incas assessments used in primary schools. (see blog search engine for articles).

The report from Peter Tymms was published by the Northern Ireland Executive Office close to the last day of the collapsed Assembly

Concerns raised with AQE joint CEO, Stephen Connolly about entering any process proposed by the Department of Education to work on a single test were met with a promise to express further reservations. It is understood that Stephen Connolly subsequently continued to meet with DENI officials

Screenshot_2

Another difficulty for parents is the fact that many grammar schools are not using academic selection for all pupils. Read the admission criteria carefully before applying to schools. It may be the case that your child is denied a place in favour of a pupil who did not take the tests. In the graphic below it is clear that Royal School Dungannon, Royal School Armagh and Sullivan Upper do not select 100% of their pupils by academic testing. Strathern School use bands rather than rank order of marks so that it will be impossible to reassure a child getting results today that their score will get them a place.

BT AQEGLA2018

The problem is even more acute when the dualling schools are examined. The obvious issue of the integrated schools pretending to be grammars can only be matched by those Catholic grammar schools which no longer use academic selection.

Lagan and Slemish are not grammar schools. They were permitted by the anti-selection DENI to use 11-plus tests to select 35% of their pupils.

Wallace High School in Lisburn, another grammar school which uses bands to report test scores only selects 87% of year 8 pupils. The minimum score reported is 101. Wallace High admit 170 pupils to year 8 so a total of 22 pupils get places without use of academic selection.

During October 2017 Wallace High School attracted attention for restricting the number of primary 7 pupils allowed to sit AQE tests at the school. It became clear that this was not a matter of physical capacity but the willingness of teachers to make themselves available on Saturday mornings.

https://www.lisburntoday.co.uk/news/schools-are-criticised-over-their-handling-of-aqe-test-situation-1-8249218

The Parental Alliance for Choice in Education blog

~ Education analysis and commentary

Category Archives: Grammar Schools

Why Ofqual and CCEA are set to repeat the failings of 2020 in respect of 2021 awarded grades for GCSE, AS and A2.

What CCEA & the media don’t want the public to know about the predicted grade debacle

The warning letter on centre-based moderation sent to CCEA on 23rd June

Why the Ofqual/CCEA proposal for using teacher judgement to grade 2020 GCSE/A level examinations is indefensible.

Featured

Lyra McKee

“Research intensity” – fake news in higher education

Plagiarism: One law for pupils and another for teachers

Rate this:

Rate this:

Rate this:

Rate this:

Rate this:

Rate this:

Rate this:

Rate this:

Rate this:

Rate this: