Checklist for reporting a piece of empirical work on language

A checklist of some important things often included when reporting a piece of classical quantitative empirical research professionally in research reports and journal articles,... and assignments and dissertations. Much is similar for qualitative writeups too.

Using a professional structure and headings makes your work look so much more impressive!

o   Information given here does NOT apply where your assignment or dissertation is purely sourced from reading about what other people say or have researched, or from your own experience, introspection etc. Follow the departmental ‘essay’ guidelines for that

o   Information given here does NOT apply where your assignment is doing exercises of some sort (e.g. phonetic transcription or drawing syntactic trees)

o   Information given here DOES apply wherever a dissertation or assignment has involved you gathering and analysing data from people, or even from texts, material previously taped by others, corpora etc. Especially where you have something to express as numbers at the end. I.e. wherever you have counted things in documents, interviewed people, given questionnaires or tests, etc. Anything called a project is likely to fall in this category. I.e. what you are writing is a report of (empirical) research, rather than an essay.

The list of things to report is of course a list of the things one should do as well!

Broadly any data gathering project has three parts, reflected in the three major parts of the write-up…. The checklist below elaborates on this nutshell:

o   The Background. What you are researching, your research questions etc., why it is of interest and where it ‘comes from’ in other people’s theories or research studies

o   The Method. How you gather the data to carry the project out

o   The Results. What you found and what it means, in relation to what was said in Background

One does not necessarily write the report from start to finish in the logical order it has at the end, as presented here. Often people read some background sources on their research topic, write some research questions, work on the method of data gathering, then gather the data, and only then, while looking at the results, actually write the background review and the description of method.

Some of what I list may not be entirely comprehensible without attending LG575 or similar, or further reference to research methods books.

I cannot of course cover every eventuality, so watch for what you may need to mention in addition to what is below! Note also that everything listed here will not of course be relevant in every instance where someone does a piece of empirical work: generally the more reactive and experimental the methods used, the more there will be to say. Everything here would certainly not be appropriate in a non-classical research study (e.g. an ethnographic study or one relying for data purely on expert linguistic introspection).

Where, as often, several variables are measured from one lot of data, and/or more than one question/ hypothesis investigated, much of the following may all have to be gone through separately for each.

Finally, though sections below, reflect what seems a reasonable consensus on 'good practice' as far as labelling of sections goes and their contents, there is considerable variation on the ground in reports you will read both in how sections are labelled and what is mentioned in each (and indeed in how thorough the overall account is!). For APA conventions, see appendix in Christensen, or the Fifth Edition (2001) of the APA Publication Manual (It does not seem to be available online).

! marks common mistakes people make (other than of course missing out whole chunks, jumbled order, and general inaccuracy, vagueness, typos etc.) - mistakes in what is reported, and maybe also in what was done in the research.

******


TITLE

·        containing all the relevant keywords to identify the topic

! Too long - can't put everything in the title

! Too vague and general, perhaps overclaiming the scope of the work

ABSTRACT

·        a summary of the whole thing

! You give what is really an introduction, missing out what the results were

! You tell us what each section of the writeup is going to talk about (e.g. 'In the third section we will describe the method'), not briefly what you did (e.g. 'The method we used was...')

! You include wording that refers forward  like ‘… as we shall see…’. Again it is not an introduction. It should read as referring back to the whole completed project

INTRODUCTION

·       what the topic is, in brief

                 

! You  start telling us a lot of detail about the method and your results at this point

! Detailed research questions and hypotheses… premature to give them here

 

·      reasons for doing the work, e.g.

 

importance as a research topic in itself, in the context of current knowledge in the relevant field. This entails saying a bit about what general areas of ELT, linguistics or whatever the study relates to

importance for local situation of researcher (esp. if teacher). This entails possibly a detailed description of what that situation or context is (e.g. if your study is on writing, then how that is taught throughout the educational system etc.)

! The research is presented as having interest only for the researcher's school/country. Classical research needs to be presented as having wider implications

! Long account of 'problems of teaching in my country'...none of which turn out to be the subject of your research  

! Unfounded generalisations with no sources like ‘standards of English have become poor in recent years’

! Multiple sections with titles like Importance of the study, Significance of the study, which are really not differentiatedin content

·      outline of what will come in the chapters/sections that follow

·      maybe brief definitions of some key terms to be used later

 REVIEW OF LITERATURE  covering these things but not necessarily in this order

·        review and critique of previous research in the same general area (shortcomings of methods or argumentation previously used, new areas to look at suggested by previous results). Their findings, esp. with respect to variables you are interested in. This should at every point be explicitly connected to your specific project.

! The background review reads like an MA survey essay on some area of investigation, cataloguing other people's studies, with no comparison of them with each other, or critique, and no use explicitly made of them to connect to your own work by showing what they suggested for it.

! Too broad… need to focus rapidly on just what bits of articles and books are relevant to your study

! You report previous work as ‘important’ when actually it has no relevance to your own research (though it may be highly regarded in the field generally).

! You retail other people’s criticisms of each other’s research but do not resolve opposing views, argue your own view, or draw implications for your research.

! Review feels like the literature got on top of you, rather than that you are on top of the literature, and is too long (more than a third of the writeup)

! You mention the results of your own later research in your review

! see also http://privatewww.essex.ac.uk/~scholp/litrevsarc.htm

·       theoretical background(s) or 'models' from which the ideas come (both pure and applied linguistic, and maybe in psychology, sociology...), or which you hope to shed light on

! Ostrich: you stick with one model you have learnt about and don't cover the rival theories or look in other disciplines that have something to say.

·        Discussion of definitions of key terms… esp, vague ones (e.g. in ELT ‘communicative’, ‘function’, ‘strategy’, ‘task’ etc….) where you disentangle different opinions of scholars

! You catalogue a lot of people’s definitions of X but fail to show where they agree/differ or which one you are adopting for your work and why.

·        a review of methods used previously to gather relevant data, justifying yours (e.g. merits of interviews versus questionnaires etc.). Better here than in Method chapter/section if it is substantial.

RESEARCH QUESTIONS/HYPOTHESES

·        the specific research questions/hypotheses this piece of research aims to deal with, mentioning the main variables in the study. Sometimes expressed as ‘aims’ or ‘purposes’.

! The hypotheses/questions seem to have no connection with the literature review, or the reader has to work hard to find exactly what prompted them in a mass of review.

! Hypotheses vaguely worded, e.g. what is being compared with what exactly? If you say one group will ‘do better in the cloze test’ then make sure you say ‘than who’ or ‘than in what other test’

! Hypotheses unclear as to which ones the researcher actually expects to be confirmed. E.g. if they are given as null hypotheses, is it the null hypothesis that the researcher actually expects to be confirmed or not?

METHOD

a) Cases/Subjects/Informants and Sampling

·       explanatory variables (EVs) that consist of groupings of cases that are required for the comparisons that the study has to make may be mentioned here, though the measurements or manipulations involved to define them may be described elsewhere -

groupings by inherent features of subjects, such as age group, L1, sex, type of school attended, class in school

groupings related to conditions imposed by an intervention or experimental regime, such as a division between subjects taught reading by one method and others taught by another:

how subjects were assigned to such groups – intact classes? random assignment?

·       for each group of cases involved –

how chosen:

random sample - what procedure for random selection used

quasi-random - how chosen (friends?...)

purposely selected individual(s) - on what basis

whether paid or otherwise rewarded

permissions obtained – informed consent (e.g. from parents, school)

whether subjects such as schoolchildren were free to decline to participate if they did not wish to (ethics)

how many

what sort of people (common features of the populations sampled), hence what wider populations the chosen subjects can be claimed to be representative of, e.g.:

native language, dialect etc., target language

age, social class, sex, school grade etc.

from what institution (school, university, hospital, etc.)

English learning/teaching history

subject variables that were controlled (i.e. deliberately eliminated from having any potential effect) and how each measured (distinguish subject variables used as EVs), e.g.:

intelligence

social class

reading age

etc.

any subjects rejected for any reason

! Insufficient information about what the subjects have in common and what population they could reasonably be regarded as representative of.

! Overclaiming the populations represented

! Just saying you used random samples, as a matter of routine, when obviously they weren’t

! Lack of clarity over what aspects of the cases you are controlling (eliminating), which you are comparing as variables in the study

b) Variables and their Observation/Measurement/Manipulation using Instruments

For each variable of any sort that is involved – dependent/response variable (DV), or one used to establish groups of subjects (EV), or one imposed on subjects to create conditions (independent variable, such as stimuli of three types to be responded to in an experiment, or a special teaching intervention), or one used just to screen subjects (control). How they were all manipulated or measured; instruments:

·        Materials / Stimuli 

standard published tests/questionnaires/observation schedules/ checklists etc. used, or own measures

how items, tests, questionnaires etc. were selected or constructed or adapted (random/deliberate, from what sources)

language or non-linguistic form of stimuli and/or responses

procedure for cloze gap choice, item selection, checklist point selection or the like

form of print/visual display; person whose voice was taped

response mode – written, spoken etc.; multiple choice, open choice…

variables controlled in choice of items, texts etc. (length, frequency, familiarity, readability etc. etc.)

distractors/fillers etc. introduced

incorporation of open/unstructured questions/items

realia - pictures, dolls etc.

topics / texts / task types etc.; interview questions

how many items used to measure a given variable or condition, and overall

how stimuli / materials etc. for different conditions were distinguished from each other; how they were ordered

equivalent, but different,  versions of the same text, stimuli  or whatever required for different groups

materials needed for teaching intervention, if any (texts, exercises, websites, practice materials, where from, how chosen, why created how they were…)

any piloting of materials and revision by item analysis before the current investigation .. how done and what was learnt from it and changed in the main study

! Insufficient justification for using the questionnaire, test or whatever that you did, rather than an alternative instrument

! Lack of detailed account of where individual items, texts etc. used came from, or how they were made, or why they were needed, given the research questions

! Pilot inadequately described, and what was learnt from it and improved in the main study not reported

! In teaching interventions, failure to be clear how exactly the ‘new’ materials being used differ from the usual ones, and if there is a control group, failure to be clear what materials they are using.

·       Procedure

the measurer, or rater, judge, assessor, teacher ..etc.:

whether a proxy for the real measurer/researcher

sex, age, race, etc.

status (teacher/researcher etc.)

institution attached to

how introduced or known to subjects

the situation:

postal / take-away or done on the spot

place where measurements obtained (lab/home/school, description of room etc.)

time of day

activity subjects would otherwise have been doing (e.g. leisure, class...)

onlookers (peers, mother etc.)

any manipulation or separate attention to the situation to create different conditions – e.g. formal vs informal, in class versus out of class

the measurement itself, and the intervention, or imposition of different treatments, if there was any:

how much was explained to subjects about the real purpose of the research and what would be done with the data before or after; any risks/benefits for participants; assurance of confidentiality (ethics)

instructions given (quote verbatim), in what language

practice items or training in the instruments provided first

administration to cases singly or in groups together

measurer present or not

exact sequence of events - what equipment was turned on and off when, order of presentation of stimuli, different tasks etc.  Maybe a table showing the timing and order in which procedures for various tests and treatments or teaching sessions were performed

exact task of subjects, learning schedule etc.

any separate procedures performed to create different conditions/treatments, e.g. different teaching provided to two groups, different instructions, etc.

disguise / distraction introduced or actual misdirection/lying about the task (ethical issues)

precautions taken against collusion

help/feedback provided during task (e.g. any correction after each item, second chances, encouragement etc.)

encouragement given to guess or not when in difficulty

time allowed per item/overall

opportunity for subjects to comment after on the task

            any piloting of any aspects of procedure before the main data gathering

! Lack of detail: in principle the account should be explicit and detailed enough to enable someone else to repeat exactly what you or the investigator did by following it.

! Impression given that everything went perfectly when in practice we know it rarely does

! Account of how things were measured mixed up unclearly with account of an intervention

! In teaching interventions, failure to be clear how exactly the ‘new’ procedure  being used differs from the usual one students had before, and in what respects it is identical; and if there is a control group, failure to be clear what procedure was followed there and where exactly it differed from the experimental one.

·       Apparatus / Equipment

make and specification of anything like these used to present material, record responses etc:

tape / cassette recorder and player and microphone, earphones

language laboratory

video camera and player

earphones

slide projector

voice operated relay (to detect start of spoken response to a stimulus)

timer

computer and monitor (keys specially labelled?) and software used either to capture data or analyse it

 

·        Scoring / Categorisation / Coding / Data Analysis

kind of scale on which each variable is recorded (interval numerical scores, categories of some sort etc.)

how the data was transcribed, translated etc., if relevant

what was counted as what when categorising etc. - coding of open responses to questionnaire items, observations, unstructured interview data, think aloud protocols etc.. Source of coding scheme - standard one used by others or made up/adapted in the light of data obtained

any data omitted and why (unanalysable, aberrant, subject uncooperative...)

how totals and % scores were derived

any procedure for combining scores from subtests or sets of items etc. to produce overall scores for each person

any standardisation of scores

how missing values were dealt with - non-responses, absences

reliability checks performed:

kind of check (test-retest, internal, etc.) and resulting coefficient of reliability

timing of test-retest check, on what cases, etc.

% of tests/protocols rescored by another measurer and who that measurer was, whether he/she knew purpose of measurement, his/her training etc.

validity checks - content, concurrent etc.

what was done to ensure anonymity of cases for reporting, and keep any information on real identities secure (ethics)

! Coding of data was done just by the researcher, not crosschecked by anyone else

! Linguistic aspects are not labelled with proper linguistic labels

! Things are counted as errors that are acceptable to an NS of English

! No account given of how the categories used to classify the data were arrived at

! Unclear how exactly the figures that appear in later tables were derived from the measures used

c) Design

·       overall specification of how all the (groups) of cases and variables of all sorts interlock:

which are explanatory or independent variables, dependent variables, and controlled (eliminated) variables, with what values

which are experimental 'made' variables (including teacher interventions), which just 'measured' ex post facto

which variables are within-subject 'conditions' or ‘treatments’ (matched groups, repeated measures), which between-subjects 'groupings' (independent groups)

labels like:

single-subject or case study

cross-sectional or between-groups study

longitudinal study

correlational

factorial

multivariate

·       where a lot of variables and conditions are involved, esp. in experiments, the overall plan and order in which measurements were made and stimuli presented, interventions made etc.; counterbalancing to take care of order effects, and so on

! Design is not mentioned, and it is unclear just how many variables there were, or which were potentially explaining which etc.

! Unclear which variables were controlled, in the sense of eliminated, as against which were the focus of research

! Calling every empirical study an 'experiment'.

RESULTS

·        often usefully organised around each research hypothesis or question in turn

! The account follows the types of statistic used, e.g. all descriptive statistics like averages given first, then all the ANOVA test results, when in fact this means that information related to the same research question gets fragmented. The stats are the servant not the master.

! Account slavishly  follows the instruments, e g. all questionnaire results first, then all interview results on the same issues, etc. This may make sense but if different instruments cover the same ground it can be more informative to combine the accounts. E.g. cover what all the instruments said about one thing in one place

·       graphs and tables showing results for groups and conditions and distribution shapes

! Graphs and tables have unclear or abbreviated labels like 'NNSPC1' such as maybe you had to use for the computer, but which should not carry through to the write-up

! Graphs and tables not numbered and not referred to by number from the text (‘See below’ is no good)

·        summary and dispersion statistics for groups and conditions:

means, percentages etc.

SDs, variances etc.

 

·       descriptive measures and inferential tests for comparisons of results of different groups, conditions, variables etc.:

differences between means, percentages etc. of different groups or conditions

correlations, associations etc.

significance tests for differences and correlations

 

! Results are said to be 'significant' when they aren't so in the statistical sense, or a sig test has not been performed.

 

·       qualitative account of things not reducible to figures, often in the form of summaries of what was said in interviews, or profiles of selected best and worst cases

! Insufficient examples cited from the data, where the data consisted of recordings, compositions etc. So the evidence is not supplied for the conclusions

·       interpretation of all the above, as you go along,  in the light of the original hypotheses / questions under investigation

! DIY: masses of figures/tables/graphs are given, but not much explanation of what they are telling us. Reader is bombed with detail.

! There are thirty pages of results without explanation, then thirty pages of the explanation starting over, by which time the reader has forgotten what the first results were.

! The interpretation of results does not connect back to the original hypotheses.

! Pseudo-explanations of results like 'The hypothesis was not met because there was not much difference in performance' (Why?!)

! Many new references are made to other research, which should have been in the original literature review

! No use is made of whatever was reviewed in the literature review. No comparisons with previous studies’  results.

·       interesting things you noticed in the data that it had not been planned to look for - exploratory

DISCUSSION

·        integrated account of the findings, pulling together what may have been covered in different places in the preceding coverage of Results

·        wider implications of the results in relation to:

previous work already discussed above

possible future research work by oneself or someone else

theory – has the work added to or altered anything?

practical applications (e.g. to teaching, therapy)

! There is only a feeble gesture in two paragraphs to give 'practical implications for teaching' or the like. Vague statements that nobody could make any practical use of.

! Under the name of implications various perfectly sensible recommendations are made about teaching or whatever, but they are not derived from your study. They derive from the general literature on the subject. Implications should clearly come from YOUR findings.

! Overgeneralisation of findings to 'all learners' or the like, when only University English majors in one city in your country were studied

! Failure to show clearly what was the new information from this study and what was confirming what had been found elsewhere (lit review).

 

·        what might better have been done differently, with hindsight

! Overconfidence in findings, despite shortcomings of the method

CONCLUSION

·        brief underlining of main points again. Often combined with Discussion.

! Tedious repetition of what has already been said three times over.

! The summary is only of the factual results, not of your interpretation and discussion of what the results ‘mean’

REFERENCES/BIBLIOGRAPHY

·        in the right format for the recipient (see guide in course booklet, targeted journal etc.)

! There are references in the text which are not in the bibliography, or vice versa.

! Names of authors are spelt differently/wrongly in different places.

! References not cited in the text in an accepted form or in different forms in different places

! Confusion of first and second name, such as reference to the works of Noam.

APPENDICES

·       tables of all the individual scores of all the individual cases measured, properly labelled

·        copies of any teaching materials, lesson plans etc. used

·       one copy of every test, questionnaire, observation sheet, checklist etc. that was used, in English and in the language in which it was administered, if different

·       samples of actual protocols (not all of them, unless they are very few). I.e. filled in questionnaires, compositions the subjects wrote, transcriptions of taped interviews and think aloud sessions....

      ! Far too much put in. One does not need every completed test script or questionnaire, or transcription of 20 hours of tape.  

rev. 11