Methodology tutorial - qualitative data analysis: Difference between revisions

The educational technology and digital learning wiki
Jump to navigation Jump to search
 
(22 intermediate revisions by 2 users not shown)
Line 1: Line 1:
{{Incomplete}}
{{Incomplete}}
{{under construction}}
<!-- <pageby nominor="false" comments="false"/> -->


<pageby nominor="false" comments="false"/>
This is part of the [[methodology tutorial]] (see its table of contents).


== Qualitative data analysis ==
== Introduction ==


This is part of the [[methodology tutorial]] (see its table of contents).
<div class="tut_goals">
; Learning goals:
: Understanding the purpose of code data and creating code books
: Learn about some descriptive analysis of variables techniques (including situations and roles)
: Learn about some causal analysis techniques
 
; Prerequisites
* [[Methodology tutorial - theory-finding research designs]]
* [[Methodology tutorial - qualitative data acquisition methods]]
 
; Level
* It aims at beginners. As with any method, you will have to practise and you likely will need some (human) guidance or at least serious additional reading. The purpose of this tutorial is to help you understand what qualitative data analysis could mean, not to present ready-made solutions.


== Introduction: classify, code and index ==
; Quality
* Slide style, should be expanded
</div>


In short qualitative data analysis usually implies to related and iterative steps.
In short, qualitative data analysis usually implies to related and iterative steps.
See [[Methodology tutorial - theory-finding research designs]] for the general principle.
See [[Methodology tutorial - theory-finding research designs]] for the general principle.


Line 21: Line 34:
(3) Interpret these
(3) Interpret these


; Before we start: Keep your documents and ideas safe !
;Before we start - Keep your documents and ideas safe !


Write memos (conservation of your thoughts). It is useful to write short memos (vignettes) when an interesting idea pops up, when you looked at something and want to remember your thoughts
(1) Write memos (conservation of your thoughts). It is useful to write short memos (vignettes) when an interesting idea pops up, when you looked at something and want to remember your thoughts


Write contact sheets to allow remembering and finding things. After each contact (telephone, interviews, observations, etc.), make a short data sheet:
(2) Create contact sheets that allow you remember your field work. After each contact (telephone, interviews, observations, etc.), make a short data sheet that should include:
* Indexed by a clear filename or tag on paper, e.g. CONTACT_senteni_2005_3_25.doc
* A clear tag for [[indexing]] purposes (filename or tag on paper), e.g. CONTACT_senteni_2005_3_25.doc.
* type of contact, date, place, and a link to the interview notes, transcripts.
* Type of contact, date, place, and a link to the interview notes, transcripts.
* principal topics discussed and research variables addressed (or pointer to the
* Principal topics discussed and research variables addressed (or pointer to the interview sheet).
interview sheet)
* Initial interpretative remarks, new speculations, things to discuss next time.
* initial interpretative remarks, new speculations, things to discuss next time


Index your interview notes:
(3) Index your interview notes:
* Put your transcription (or tapes) in a safe place
* Put your transcription (or audio/video files or audio tapes) in a safe place
* Assign a code to each "text", e.g. INT-1 or INTERVIEW_senteni_3_28-1
* Assign a code to each "text", e.g. INT-1 or INTERVIEW_senteni_3_28-1. The same code could be used as filename.
* You also may insert the contact sheet (see above)
* You also may insert the contact sheet (see above)
* Number the pages if you take notes manually
* Number the pages if you take notes manually (they can fall down ...)
 
(4) Do not trust your hard-disk !


== Codes and categories ==
== Codes and categories ==
First step in qualitative data analysis is coding.


[[Image:icon-hand-right.png]] A code is a “label” to tag a variable (concept) and/or a value found in a "text"
[[Image:icon-hand-right.png]] A code is a “label” to tag a variable (concept) and/or a value found in a "text"


;Basics
=== The coding principle ===


# A code is assigned to each (sub)category you work on
# A code is assigned to each (sub)category, i.e. theoretical variable you work on
#* In other words: you must identify ''variable names''
#* In other words: you must identify ''variable names''
# In addition, you can for each code assign a set of possible values (e.g.: “positive”/”neutral/”negative)
# In addition, you can for each code assign a set of possible values (e.g.: “positive”/”neutral/”negative)
Line 55: Line 71:
; Benefit of coding
; Benefit of coding


* Coding will allow you to find all informations regarding variables of interest to your
* Coding will allow you to find all informations regarding variables of interest to your research
research
* Reliability will be improved
* Reliability will be improved


Line 63: Line 78:
[[Image:quali-coding-principle.png]]
[[Image:quali-coding-principle.png]]


=== Technical Aspects ===
=== Technical Aspects of coding ===


* The safest way to code is to use some specialized software
* The safest and most reliable way to code is to use some specialized software
** e.g. Atlas or Nvivo (NuDist),
** e.g. Atlas or Nvivo (NuDist),
** however, this takes a lot of time !
** however, this takes a lot of time !
Line 81: Line 96:
** either your code book is determined by you research questions and associated theories, frameworks, analysis grids
** either your code book is determined by you research questions and associated theories, frameworks, analysis grids
** or you really learn how to use an inductive strategy like "grounded theory".
** or you really learn how to use an inductive strategy like "grounded theory".
=== Coding reliability ===
Assigning a code to a "text" segment is not always obvious and coding similar passages exactly the same way even less. In other words, we have a reliability problem.
There are two ways of improving reliability:
* Use clear and operational categories
* Use two or three coders (yourself and a friend) and compute intercoding index. If it is low, you will have to revise your coding scheme.
There exist several formula to compute intercoder (inter-rater) reliability. The most simple one is:
  reliability =  number of agreements (same coding)
                / total codes (agreements plus disagreements)
Read [http://www.temple.edu/sct/mmc/reliability/ this] for a very good introduction.


== Code-book creation and management ==
== Code-book creation and management ==
Line 93: Line 123:


{| border="1"
{| border="1"
! rowspan="1" colspan="1" |
! rowspan="1" colspan="2" |Categories
 
! rowspan="1" colspan="1" |Codes
! rowspan="1" colspan="1" |
! rowspan="1" colspan="1" |Theoretical references
categories
! rowspan="1" colspan="1" |
codes
! rowspan="1" colspan="1" |
theoreticalreferences
|-
|-
| rowspan="1" colspan="2" |
| rowspan="1" colspan="2" |properties of the innovation
properties of the innovation
| rowspan="1" colspan="1" |PI
| rowspan="1" colspan="1" |
PI
| rowspan="10" colspan="1" |
| rowspan="10" colspan="1" |
....(fill for your own code book).....
....(fill for your own code book).....
|-
|-
| rowspan="1" colspan="2" |
| rowspan="1" colspan="2" |external context
external context
| rowspan="1" colspan="1" |EC
| rowspan="1" colspan="1" |
CE
|-
|-
| rowspan="1" colspan="1" |
| rowspan="1" colspan="1" |&nbsp;&nbsp;&nbsp;


| rowspan="1" colspan="1" |
| rowspan="1" colspan="1" |demography
demography
| rowspan="1" colspan="1" |CE-D
| rowspan="1" colspan="1" |
CE-D
|-
|-
| rowspan="1" colspan="1" |
| rowspan="1" colspan="1" |


| rowspan="1" colspan="1" |
| rowspan="1" colspan="1" |support for the reform
support for the reform
| rowspan="1" colspan="1" |CE-S
| rowspan="1" colspan="1" |
CE-S
|-
|-
| rowspan="1" colspan="2" |
| rowspan="1" colspan="2" |internal context
internal context
| rowspan="1" colspan="1" |IC
| rowspan="1" colspan="1" |
CI
|-
|-
| rowspan="1" colspan="2" |
| rowspan="1" colspan="2" |adoption processes
adoption processes
| rowspan="1" colspan="1" |AP
| rowspan="1" colspan="1" |
PA
|-
|-
| rowspan="1" colspan="1" |
| rowspan="1" colspan="1" |&nbsp;&nbsp;&nbsp;


| rowspan="1" colspan="1" |
| rowspan="1" colspan="1" |official chronology
official chronology
| rowspan="1" colspan="1" |PA-OC
| rowspan="1" colspan="1" |
PA-CO
|-
|-
| rowspan="1" colspan="2" |
| rowspan="1" colspan="2" |dynamics of the studied site
dynamics of the studied site
| rowspan="1" colspan="1" |DS
| rowspan="1" colspan="1" |
DS
|-
|-
| rowspan="1" colspan="2" |
| rowspan="1" colspan="2" |external and internal assistance
external and internal assistance
| rowspan="1" colspan="1" |EIA
| rowspan="1" colspan="1" |
AEI
|-
|-
| rowspan="1" colspan="2" |
| rowspan="1" colspan="2" |causal links
causal links
| rowspan="1" colspan="1" |CL
| rowspan="1" colspan="1" |
LC
|}
|}
Before you start think about your own code book, you '''really''' should go through the relevant literature and try to find existing code books (that you then may adapt). E.g. below is an
an example of codes use to analyze [http://www.ifets.info/journals/10_4/13.pdf types of job-related problems of Turkish computer teachers] (Deryakulu & Olkun, 2007).
# Role Conflict
# Inadequate Teacher Induction Policies
# Lack of Required Technological Infrastructure and Technical Support
# The Status of Computer Subject in School Curriculum
# Lack of Appreciation and Positive Feedback from Colleagues
# Unsupportive Administrators
# Rapidly Changing Nature of Content Knowledge in Computer Education
# Lack of Cohesive Computer Curriculum
# Insufficiency of Pre-Service Teacher Training Programs
# Large Class Sizes
# Indifferent Students
# Inadequate Supervision and Inspection


=== Coding by induction (according to “grounded theory”) ===
=== Coding by induction (according to “grounded theory”) ===


The principle is the following one:
Grounded theory (Glaser, Strauss) refers to a set of approaches that that focus on interpretation and theory building, i.e. it is a fully '''inductive''' approach. The researcher starts by coding a small data set and then increases the sample in function of emerging theoretical questions. Categories (codes) can be revised at any time.
* The researcher starts by coding a small data set and then increases the sample in function of emerging theoretical questions  
* Categories (codes) can be revised at any time


Starting point = 4 big abstract observation categories:
{{quotation|Grounded theory begins with a research situation.  Within that situation, your task as researcher is to understand what is happening there, and how the players manage their roles.  You will mostly do this through observation, conversation and interview.  After each bout of data collection you note down the key issues: this I have labelled "note-taking".
 
Constant comparison is the heart of the process.  At first you compare interview (or other data) to interview (or other data).  Theory emerges quickly.  When it has begun to emerge you compare data to theory.
 
The results of this comparison are written in the margin of the note-taking as coding.  Your task is to identify categories (roughly equivalent to themes or variables) and their properties (in effect their sub-categories). [http://www.scu.edu.au/schools/gcm/ar/arp/grounded.html Grounded theory: a thumbnail sketch], retrieved 13:40, 15 October 2008 (UTC))}}
 
Typically, you'd both code phenomena in isolation and relations (so-called axial coding).
On starting point for axial coding could be these big abstract observation categories:
* conditions (causes of a perceived phenomenon)
* conditions (causes of a perceived phenomenon)
* interactions between actors
* interactions between actors
Line 173: Line 199:
* consequences of actions
* consequences of actions


(... many more details: to use this approach you ''really'' must document yourself)
Read more in [Introduction to Grounded Theory].
 
To use this approach you ''really'' should document yourself, as beginner you'd likely fall into various traps, in particular selection and confirmation biases, i.e. you only look at things that will interest you for one or another reason.


=== Coding by ontological categories ===
=== Coding by ontological categories ===


Example:
Instead of initially creating a code book from variables found in your research questions or "inductive" coding à la grounded theory, you also start by creating or using a vocabulary for a given domain. This strategy is a compromise between “grounded theory” and “theory driven” approaches.
 
Below is a table that lists things you could observe in an organization (Bogdan and Biklen, cited by Miles & Huberman:1994 61)


{| border="1"
{| border="1"
! rowspan="1" colspan="1" |
! rowspan="1" colspan="1" |Types
Types
! rowspan="1" colspan="1" |Explanation
! rowspan="1" colspan="1" |
|-
| rowspan="1" colspan="1" |Context/Situation
| rowspan="1" colspan="1" |information on the context
|-
| rowspan="1" colspan="1" |Definition of the situation
| rowspan="1" colspan="1" |interpretation of the analyzed situation by people
|-
| rowspan="1" colspan="1" |Perspectives
| rowspan="1" colspan="1" |global views of the situation
|-
| rowspan="1" colspan="1" |Ways to look at people and objects
| rowspan="1" colspan="1" |detailed perceptions of certain elements
|-
| rowspan="1" colspan="1" |Processes
| rowspan="1" colspan="1" |sequences of events, flow, transitions, turning points, etc.
|-
| rowspan="1" colspan="1" |Activities
| rowspan="1" colspan="1" |structures of regular behaviors
|-
| rowspan="1" colspan="1" |Events
| rowspan="1" colspan="1" |specific activities (non regular ones)
|-
| rowspan="1" colspan="1" |Strategies
| rowspan="1" colspan="1" |ways of tackling a problem (strategies, methods, techniques)
|-
| rowspan="1" colspan="1" |Relations and social structure
| rowspan="1" colspan="1" |informal links
|-
| rowspan="1" colspan="1" |Methods
| rowspan="1" colspan="1" |comments (annotations) of the researcher
|}
 
In the literature, you may find several other such "accounting schemes". In educational technology, for example, there is a range of relatively simple code books for conversation and asynchronous discussion groups (forum) analysis. De Wever et al. (2006) provide a good overview.
Some coding schemes are fairly simple. E.g. Cobos and Pifarré (2008) analyzed "collaborative knowledge construction in the web" with the following coding scheme:
 
{| border="1"
!Explanation
!Asks for clarifying some parts of the document
!“The following link, which appears in your document, doesn't work now, but it worked a week ago”
|-
|Support
|Express explicit agreement with the document's ideas or information organisation
|“In my opinion this document is very useful and is easy to read”
|-
|Addition
|Suggests additions to the document: ideas, opinions or information organisation
|“I think that an index of the sections of the article should be added”
|-
|Delete
|Suggests deletions from the document: ideas, opinions or information organisation
|“In the summary there are some examples, which perhaps were not necessary”
|-
|Correction
|Suggests changes to the document. They refer to ideas, opinions or information organisation
|“I think that there is an error in the first paragraph in the conclusion section, where it says 'motor' it should be 'motivation'”
|}
 
[http://dx.doi.org/10.1016/j.compedu.2003.08.003 Pena and Nichols (2004)] used the following categories:
 
* Questions
* Reply
* Clarification
* Interpretation
* Conflict
* Assertion
* Consensus Building
* Judgment
* Reflection
* Support
* Other
 
There exist more complex code books:
As an example we produce a summary of students' messages code book by Eilon and Kliachko (2004).


{| border="1"
!"Level A" Group
! These categories indicate knowledge construction and a significant contribution to peer learning 
|-
|Comprehension
|Shows written evidence about the comprehension of the subjects studied by the following categories:
|-
|Reproducing- 1                   
|Reproduces the main points, ideas, arguments or messages found in the incoming information with reference to its source and with critical evaluation.
|-
|Directing
|Directs others to relevant sources for the subjects studied (both printed and online).
|-
|Clarifying by questions
|Locates ambiguous, difficult or problematic areas in the new material. Describes the context of the question or the reason for asking it.
|-                             
|Clarifying by answers
|Gives correct, relevant and comprehensive explanations as answers. Bases the answers on retrieved information while citing its origin.
|-
|-
| rowspan="1" colspan="1" |
|Reflection
Context/Situation
|Shows written evidence of metacognitive processes that the learner applies when studying new subjects by the following categories:
| rowspan="1" colspan="1" |
|-                               
information on the context
|Linking/extending
|Links the new information with his or her own previous knowledge. Extends the new knowledge to other domains, especially to STS issues.
|-
|Critical evaluation
|Evaluates the new information critically.
|-
|Transformation
|Applies the new information in an original and creative way, draws inferences, and gives original examples.
|-
!"Level B" Group
!These categories indicate a probable contribution to peer learning, by the following categories:
|-
|Documenting
|Documents l earning exper iences or indivi dual contributions to the group.
|-
|-
| rowspan="1" colspan="1" |
|Reproducing- 2                 
Definition of the situation
|Reproduces the main points, ideas, arguments or messages found in the incoming information without any evaluation or original input to it.
| rowspan="1" colspan="1" |
interpretation of the analyzed situation by people
|-
|-
| rowspan="1" colspan="1" |
|Learning outcomes             
Perspectives
|Presents group and individual learning outcomes.
| rowspan="1" colspan="1" |
global views of the situation
|-
|-
| rowspan="1" colspan="1" |
|Technical questions/answers
Ways to look at people and objects
|Questions or remarks about any subject that does not relate directly to understanding the subjects studied.
| rowspan="1" colspan="1" |
detailed perceptions of certain elements
|-
|-
| rowspan="1" colspan="1" |
|Personal knowledge
Processes
|Presents personal knowledge or daily life experiences.  
| rowspan="1" colspan="1" |
sequences of events, flow, transitions, turning points, etc.
|-
|-
| rowspan="1" colspan="1" |
!"Level C" Group
Activities
!These categories indicate no contr ibution (or adverse contribution) to peer learning, by the following categories:
| rowspan="1" colspan="1" |
structures of regular behaviors
|-
|-
| rowspan="1" colspan="1" |
|Irrelevant/unexplained questions
Events
|Poses questions without giving the context or reason for asking them.
| rowspan="1" colspan="1" |
specific activities (non regular ones)
|-
|-
| rowspan="1" colspan="1" |
|Casual quotations
Strategies
|Includes quotations without their context and without further explanation.                               
| rowspan="1" colspan="1" |
ways of tackling a problem (strategies, methods, techniques)
|-
|-
| rowspan="1" colspan="1" |
|Irrelevant/incorrect answers
Relations and social structure
|Offers Irrelevant or incorrect answers to questions sent by other students.
| rowspan="1" colspan="1" |
informal links
|-
|-
| rowspan="1" colspan="1" |
|Emotional/personal comments
Methods
|Includes personal comments, which should have been sent by e-mail as instructed by the teacher .                               
| rowspan="1" colspan="1" |
comments (annotations) of the researcher
|}
|}
* This is a compromise between “grounded theory” and “theory driven” approaches


=== Pattern codes ===
=== Pattern codes ===


Some researchers also code patterns (relationships). '' Simple'' encoding (above) breaks data down to atoms, categories)
Some researchers also code patterns (relationships). ''Simple'' encoding (above) breaks data down to atoms, categories), whereas '' pattern coding'' identifies relationships between atoms. Pattern coding is also one the steps in the inductive grounded theory approach.


“'' pattern coding'' ” identifies relationships between atoms.
The ultimate goal is to detect (and code) regularities, but also variations and singularities.
 
The ultimate goal is to detect (and code) regularities, but also variations and
singularities.


Some suggested operations:
Some suggested operations:


# Detection of '' co-presence'' between two values of two variables
# Detection of ''co-presence'' between two values of two variables
#* E.g. people in favor of a new technology (e.g. ICT in the classroom) have a tendency to use it.
#* E.g. people in favor of a new technology (e.g. ICT in the classroom) have a tendency to use it.
# Detection of '' exceptions''
# Detection of ''exceptions''
#* e.g. technology-friendly teachers who don’t use it in the classroom
#* e.g. technology-friendly teachers who don’t use it in the classroom
#* In this case you may introduce new variable to explain the exception, e.g. the attitude of the superior., of the group culture, the administration, etc.
#* In this case you may introduce new variable to explain the exception, e.g. the attitude of the superior., of the group culture, the administration, etc.
Line 276: Line 386:
;Some advice:
;Some advice:
* When you use these techniques always keep a link to the source (coded data)
* When you use these techniques always keep a link to the source (coded data)
* Try to fit each matrix or graph on a ''single page'' (or make sure that you can print things made by computer on a A3 pages)
* Try to fit each matrix or graph on a ''single page'' (or make sure that you can print things made by computer on a A3 page)
* you have to favor synthetic vision, but still preserve enough detail to make your artifact interpretable
* you have to favor synthetic vision, but still preserve enough detail to make your artifact interpretable
* Consult specialized manuals e.g. Miles &amp; Huberman, 1994 for recipes or get inspirations from qualitative research in the same domain
* Consult specialized manuals e.g. Miles &amp; Huberman, 1994 for recipes or get inspirations from qualitative research in the same domain as yours.
 
[[image:icon-warning.png|left]] In this tutorial we can not cover all possible types of analysis, but just provide a '''few''' examples of what can be done. Before you start doing any sort of analysis, '''think''' about what you need to answer your research questions !


=== The “context chart”,Miles &amp; Huberman (1994:102) ===
=== The “context chart”,Miles &amp; Huberman (1994:102) ===
Line 290: Line 402:
There exist codified "languages" for this type of analysis, e.g. [[UML]] or OSSAD
There exist codified "languages" for this type of analysis, e.g. [[UML]] or OSSAD


Once you clearly identifed and clarified formal relations, you can use the graph to make annotations (like below)
Once you clearly identified and clarified formal relations, you can use the graph to make annotations (like below)


[[Image:new-pedagogies-context-chart-annotated.png]]
[[Image:new-pedagogies-context-chart-annotated.png]]
Line 301: Line 413:


{| border="1"
{| border="1"
! rowspan="1" colspan="1" |
! rowspan="1" colspan="1" |Examples for external support
Examples for external support
! rowspan="1" colspan="1" |At counselor level
! rowspan="1" colspan="1" |
! rowspan="1" colspan="1" |At teacher level
At counselor level
! rowspan="1" colspan="1" |
At teacher level
|-
|-
| rowspan="1" colspan="1" |
| rowspan="1" colspan="1" |Analysis of deficiencies
Analysis of deficiencies
| rowspan="4" colspan="2" |Fill in each cell as below
| rowspan="4" colspan="2" |
Fill in each cell as below
|-
|-
| rowspan="1" colspan="1" |
| rowspan="1" colspan="1" |Teaching training
Teaching training
|-
|-
| rowspan="1" colspan="1" |
| rowspan="1" colspan="1" |Change monitoring
Change monitoring
|-
|-
| rowspan="1" colspan="1" |
| rowspan="1" colspan="1" |Incentives
Incentives
|-
|-
| rowspan="1" colspan="1" |
| rowspan="1" colspan="1" |Group dynamics
Group dynamics
| rowspan="1" colspan="1" |adequate: “we have met an organizer 3 times and it has helped us” (ENT-12:10)
| rowspan="1" colspan="1" |
| rowspan="1" colspan="1" |not adequate: “we just have informed” (ENT-13:20)
adequate: “we have met an organizer 3 times and it has helped us” (ENT-12:10)
| rowspan="1" colspan="1" |
not adequate: “we just have informed” (ENT-13:20)
|-
|-
| rowspan="1" colspan="1" |etc. ..
| rowspan="1" colspan="1" |
| rowspan="1" colspan="1" |
etc. ..
| rowspan="1" colspan="1" |
| rowspan="1" colspan="1" |
| rowspan="1" colspan="1" |
|}
|}


Line 531: Line 629:
! rowspan="1" colspan="1" |Assistance received
! rowspan="1" colspan="1" |Assistance received
! rowspan="1" colspan="1" |Immediate effects
! rowspan="1" colspan="1" |Immediate effects
! rowspan="1" colspan="1" |Long termeffects
! rowspan="1" colspan="1" |Long term effects
! rowspan="1" colspan="1" |Explanation of the researcher
! rowspan="1" colspan="1" |Explanation of the researcher
|-
|-
Line 583: Line 681:
| rowspan="1" colspan="1" |
| rowspan="1" colspan="1" |
| rowspan="1" colspan="1" |“don’t coordinate very much” (1)
| rowspan="1" colspan="1" |“don’t coordinate very much” (1)
| rowspan="1" colspan="1" |doesn’t receive all the information (2)
| rowspan="1" colspan="1" |dosn’t receive all the information (2)
|-
|-
| rowspan="1" colspan="1" |rôle 3
| rowspan="1" colspan="1" |rôle 3
Line 592: Line 690:




== Techniques to hunt correlations ==
== Techniques to study relations ==
 
Often qualitative analysis stops with simple descriptive analysis ([[User:Daniel K. Schneider|Daniel K. Schneider]] believes that this is the case with popular forum analysis). However, you also may use qualitative data to do some kind of "correlational analysis" as you typically do in quantitative data analysis.


=== Matrices ordered according to concepts (variables) ===
=== Matrices ordered according to concepts (variables) ===


=== Clusters (co-variances of variables, case typologies) ===
; Clusters (co-variances of variables, case typologies)


* An idea that certain values should "go together": Hunt co-occurrences in cells
* An idea that certain values should "go together": Hunt co-occurrences in cells
Line 612: Line 712:
||case 1
||case 1
||
||
||important
| bgcolor="red"|important
||important
| bgcolor="green"|important
||important
||important
|-
|-
||case 2
||case 2
||
||
||not important
| bgcolor="red"|not important
||not important
| bgcolor="green"|not important
||not important
||not important
|-
|-
||case 3
||case 3
||
||
||important
| bgcolor="red"|important
||important
| bgcolor="green"|important
||important
||important
|-
|-
||case 4
||case 4
||yyy
||yyy
||not important
| bgcolor="red"|not important
||not important
| bgcolor="green"|not important
||not important
||not important
|-
|-
||case 5
||case 5
||.....
||.....
||important
| bgcolor="red"|important
||important
| bgcolor="green" |important
||important
||important
|-
|-
||case 6....
||case 6....
||
||
||important
| bgcolor="red"|important
||not important
| bgcolor="green"|not important
||not important
||not important
|}
|}


* This table shows e.g. that nedd for support and need for training seem to go together,
* This table shows e.g. that need for support and need for training seem to go together,
e.g. cases 1,3,5 have association of "important", cases 2 and 4 have association of "not
e.g. cases 1,3,5 have association of "important", cases 2 and 4 have association of "not
important".
important".
* See next page how we can summarize this sort of information in a crosstab
* See next page how we can summarize this sort of information in a crosstab


 
; Co-variance expressed in a corresponding crosstab
 
=== Co-variance expressed in a corresponding crosstab: ===


{| border="1"
{| border="1"
! rowspan="2" colspan="2" |
! rowspan="2" colspan="2" |training needs * support needs
training needs * support needs
! rowspan="1" colspan="2" bgcolor="red"|need for support
! rowspan="1" colspan="2" |
need for support
|-
|-
! rowspan="1" colspan="1" |
! rowspan="1" colspan="1" bgcolor="red"|yes
yes
! rowspan="1" colspan="1" bgcolor="red"|no
! rowspan="1" colspan="1" |
no
|-
|-
| rowspan="2" colspan="1" |
| rowspan="2" colspan="1" bgcolor="green"|need for<br /> training
need for<br /> training
| rowspan="1" colspan="1" bgcolor="green" |yes
| rowspan="1" colspan="1" |
| rowspan="1" colspan="1" bgcolor="blue"|3
yes
| rowspan="1" colspan="1" bgcolor="magenta"|1
| rowspan="1" colspan="1" |
3
| rowspan="1" colspan="1" |
1
|-
|-
| rowspan="1" colspan="1" |
| rowspan="1" colspan="1" bgcolor="green" |no
no
| rowspan="1" colspan="1" bgcolor="magenta" |1
| rowspan="1" colspan="1" |
| rowspan="1" colspan="1" bgcolor="blue" |2
1
| rowspan="1" colspan="1" |
2
|}
|}


* .... we can observer a correlation here: "blue cells" (symmetry) is stronger than
We can observer a correlation here: "blue cells" (symmetry) is stronger than "magenta"!
"magenta"!
* check with the data on last slide
 


You should check the data above to see if we did this right ...


=== Example typology with the same data: ===
=== Example typology with the same data: ===
Line 694: Line 779:
{| border="1"
{| border="1"
! rowspan="1" colspan="1" |
! rowspan="1" colspan="1" |
 
! rowspan="1" colspan="1" |Type 1:<br /> "anxious"
! rowspan="1" colspan="1" |
! rowspan="1" colspan="1" |Type 2:<br /> "dependent"
Type 1:<br /> "anxious"
! rowspan="1" colspan="1" |Type 3: <br /> "bureaucrats"
! rowspan="1" colspan="1" |
! rowspan="1" colspan="1" |Type 4:<br /> "autonomists"
Type 2:<br /> "dependent"
! rowspan="1" colspan="1" |
Type 3: <br /> "bureaucrats"
! rowspan="1" colspan="1" |
Type 4:<br /> "autonomists"
|-
|-
| rowspan="1" colspan="1" |case 1
| rowspan="1" colspan="1" |X
| rowspan="1" colspan="1" |
| rowspan="1" colspan="1" |
case 1
| rowspan="1" colspan="1" |
X
| rowspan="1" colspan="1" |
| rowspan="1" colspan="1" |
| rowspan="1" colspan="1" |
| rowspan="1" colspan="1" |
| rowspan="1" colspan="1" |
|-
|-
| rowspan="1" colspan="1" |case 2
| rowspan="1" colspan="1" |
| rowspan="1" colspan="1" |
case 2
| rowspan="1" colspan="1" |
| rowspan="1" colspan="1" |
| rowspan="1" colspan="1" |
| rowspan="1" colspan="1" |
| rowspan="1" colspan="1" |
| rowspan="1" colspan="1" |
X
| rowspan="1" colspan="1" |X
|-
|-
| rowspan="1" colspan="1" |case 3
| rowspan="1" colspan="1" |X
| rowspan="1" colspan="1" |
| rowspan="1" colspan="1" |
case 3
| rowspan="1" colspan="1" |
X
| rowspan="1" colspan="1" |
| rowspan="1" colspan="1" |
| rowspan="1" colspan="1" |
| rowspan="1" colspan="1" |
| rowspan="1" colspan="1" |
|-
|-
| rowspan="1" colspan="1" |case 4
| rowspan="1" colspan="1" |
| rowspan="1" colspan="1" |
case 4
| rowspan="1" colspan="1" |
| rowspan="1" colspan="1" |
| rowspan="1" colspan="1" |
| rowspan="1" colspan="1" |
| rowspan="1" colspan="1" |
 
| rowspan="1" colspan="1" |X
| rowspan="1" colspan="1" |
X
|-
|-
| rowspan="1" colspan="1" |case 5
| rowspan="1" colspan="1" |X
| rowspan="1" colspan="1" |
| rowspan="1" colspan="1" |
case 5
| rowspan="1" colspan="1" |
X
| rowspan="1" colspan="1" |
| rowspan="1" colspan="1" |
| rowspan="1" colspan="1" |
| rowspan="1" colspan="1" |
| rowspan="1" colspan="1" |
|-
|-
| rowspan="1" colspan="1" |case 6
| rowspan="1" colspan="1" |
| rowspan="1" colspan="1" |
case 6
| rowspan="1" colspan="1" |X
| rowspan="1" colspan="1" |
 
| rowspan="1" colspan="1" |
| rowspan="1" colspan="1" |
X
| rowspan="1" colspan="1" |
| rowspan="1" colspan="1" |
| rowspan="1" colspan="1" |
|-
|-
| rowspan="1" colspan="1" |
| rowspan="1" colspan="1" |Total individuals<br/>per type
Total
| rowspan="1" colspan="1" |3
| rowspan="1" colspan="1" |
| rowspan="1" colspan="1" |1
3
| rowspan="1" colspan="1" |0
| rowspan="1" colspan="1" |
| rowspan="1" colspan="1" |2
1
| rowspan="1" colspan="1" |
0
| rowspan="1" colspan="1" |
2
|}
|}


* we can observe emergence of 3 types to which we assign "labels"
We can observe emergence of 3 types to which we assign "labels"
* Note: for more than 3 variables use a cluster analysis program
 
Note: for more than 3 variables use a cluster analysis program


Additional example
;Additional example


The table shows co-occurrence between values of 2 variables. The idea is to find out what
The table shows co-occurrence between values of 2 variables. The idea is to find out what
Line 792: Line 838:
{| border="1"
{| border="1"
! rowspan="1" colspan="1" |
! rowspan="1" colspan="1" |
 
! rowspan="1" colspan="5" |Strategies of a school
! rowspan="1" colspan="4" |
Strategies of a school
! rowspan="1" colspan="1" |
===  ===
|-
|-
! rowspan="1" colspan="1" |
! rowspan="1" colspan="1" |Type of pressure
Type of pressure
! rowspan="1" colspan="1" |strategy 1:no reaction
! rowspan="1" colspan="1" |
! rowspan="1" colspan="1" |strategy 2:a task force is created
strategy 1:no reaction
! rowspan="1" colspan="1" |strategy 3:internal training programs are created
! rowspan="1" colspan="1" |
! rowspan="1" colspan="1" |strategy 4:resources are reallocated
strategy 2:a task force is created
! rowspan="1" colspan="1" |strategy 5: .....
! rowspan="1" colspan="1" |
strategy 3:internal training programs are created
! rowspan="1" colspan="1" |
strategy 4:resources are reallocated
! rowspan="1" colspan="1" |
strat 5: .....
|-
|-
| rowspan="1" colspan="1" |Letters written by parents
| rowspan="1" colspan="1" |(N=4)(p=0.8)
| rowspan="1" colspan="1" |(N=1)(p=0.2)
| rowspan="1" colspan="1" |
| rowspan="1" colspan="1" |
Letters written by parents
| rowspan="1" colspan="1" |
(N=4)(p=0.8)
| rowspan="1" colspan="1" |
(N=1)(p=0.2)
| rowspan="1" colspan="1" |
| rowspan="1" colspan="1" |
| rowspan="1" colspan="1" |
| rowspan="1" colspan="1" |
| rowspan="1" colspan="1" |
|-
|-
| rowspan="1" colspan="1" |Letters written by supervisory boards
| rowspan="1" colspan="1" |
| rowspan="1" colspan="1" |
Letters written by supervisory boards
| rowspan="1" colspan="1" |(N=2)(p=0.4)
| rowspan="1" colspan="1" |
| rowspan="1" colspan="1" |(N=3)(p=0.6)
 
| rowspan="1" colspan="1" |
(N=2)(p=0.4)
| rowspan="1" colspan="1" |
(N=3)(p=0.6)
| rowspan="1" colspan="1" |
| rowspan="1" colspan="1" |
| rowspan="1" colspan="1" |
| rowspan="1" colspan="1" |
|-
|-
| rowspan="1" colspan="1" |newspaper articles
| rowspan="1" colspan="1" |
| rowspan="1" colspan="1" |
newspaper articles
| rowspan="1" colspan="1" |
| rowspan="1" colspan="1" |
| rowspan="1" colspan="1" |
| rowspan="1" colspan="1" |
 
| rowspan="1" colspan="1" |(N=1)(p=100%)
| rowspan="1" colspan="1" |
| rowspan="1" colspan="1" |
| rowspan="1" colspan="1" |
(N=1)(p=100%)
| rowspan="1" colspan="1" |
|-
|-
| rowspan="1" colspan="1" |type ...
| rowspan="1" colspan="1" |.....
| rowspan="1" colspan="1" |....
| rowspan="1" colspan="1" |
| rowspan="1" colspan="1" |
type ...
| rowspan="1" colspan="1" |
.....
| rowspan="1" colspan="1" |
| rowspan="1" colspan="1" |
....
| rowspan="1" colspan="1" |
| rowspan="1" colspan="1" |
| rowspan="1" colspan="1" |
| rowspan="1" colspan="1" |
|}
|}


=== Recall: Interpretation of crosstabulation ===


See also: [[Methodology_tutorial_-_quantitative_data_analysis#Crosstabulation|Methodology tutorial - quantitative data analysis]] (Cross-tabulation)


=== Recall: Interpretation of crosstabulation ===
We would like to estimate the probability that a given value of the independent (explaining) variable entails a given value of the dependent (explained) variable.


Procedure
; The procedure


* calculate the % for each value of the independent variable
# calculate the % for each value of the independent variable
** Note: this can be either the line or the column depending on how you orient your table
#* Note: this can be either the line or the column depending on how you orient your table
* compute the % in the other direction
# compute the % in the other direction
* We would like to estimate the probability that a given value of the independent
(explaining) variable entails a given value of the dependent (explained) variable


{| border="1"
{| border="1"
! rowspan="1" colspan="1" |
! rowspan="1" colspan="1" |
===  ===
! rowspan="1" colspan="4" |Variable y to explain = Strategies of action
! rowspan="1" colspan="4" |
Variable y to explain = Strategies of action
|-
|-
! rowspan="1" colspan="1" |
! rowspan="1" colspan="1" |Explaining variable x<br />
Explaining variable x<br />
! rowspan="1" colspan="1" |do nothing
! rowspan="1" colspan="1" |
! rowspan="1" colspan="1" |send a mail
do nothing
! rowspan="1" colspan="1" |write a short tutorial
! rowspan="1" colspan="1" |
! rowspan="1" colspan="1" |Total
send a mail
! rowspan="1" colspan="1" |
write a short tutorial
! rowspan="1" colspan="1" |
Total
|-
|-
| rowspan="1" colspan="1" |Students making indirect suggestion
| rowspan="1" colspan="1" |4 (80%)
| rowspan="1" colspan="1" |1 (20%)
| rowspan="1" colspan="1" |
| rowspan="1" colspan="1" |
Students making indirect suggestion
| rowspan="1" colspan="1" |5 (100 %)
| rowspan="1" colspan="1" |
4 (80%)
| rowspan="1" colspan="1" |
1 (20%)
| rowspan="1" colspan="1" |
 
| rowspan="1" colspan="1" |
5 (100 %)
|-
|-
| rowspan="1" colspan="1" |Students explicitly <br /> complaining
| rowspan="1" colspan="1" |
| rowspan="1" colspan="1" |
Students explicitly <br /> complaining
| rowspan="1" colspan="1" |2 (40%)
| rowspan="1" colspan="1" |
| rowspan="1" colspan="1" |3 (60%)
 
| rowspan="1" colspan="1" |5 (100%)
| rowspan="1" colspan="1" |
2 (40%)
| rowspan="1" colspan="1" |
3 (60%)
| rowspan="1" colspan="1" |
5 (100%)
|}
|}


Interpretation: “... if students explicitly complain, the tutor will react more strongly
Interpretation: “... if students explicitly complain, the tutor will react more strongly
and engage in more helpful acitities.”
and engage in more helpful activities.”


* See: quantitative data analysis.
* See: quantitative data analysis.


== Typology and causality graphs ==
== Typology and causality graphs ==
There are no limits of what you can draw. Basically such analysis just use a more or less precise language to draw [[concept map]]s.
Below we just show two examples.


=== Typology graphs ===
=== Typology graphs ===
Line 927: Line 927:
* Display attributes of types in a tree-based manner
* Display attributes of types in a tree-based manner


=== Perception of a new program by different implementation agencies (e.g. schools) and
Example: Perception of a new program by different implementation agencies (e.g. schools) and
its actors (e.g. teachers) ===
its actors (e.g. teachers)


[[Image:book-research-design-214.png]]
[[Image:typology-tree-diagram.png]]
 
 
=== Dialog and message structures ===
 
Message structure can be rendered with "message maps"  (e.g. Pena-Shaff and Nicholls, 2004).
[[image:pena-shaff-nichols-message-map.png|frame|none|Source: Student Interaction and Meaning Construction in Computer Bulletin Board Systems (BBS) http://www.sloanconsortium.org/conference/proceedings/2004/ppt/1087.ppt - reproduced without permission]]
 
Of course, you may use another language, such as [[UML activity diagram]]s to draw such maps.


=== Subjective causality graphs ===
=== Subjective causality graphs ===


[[Image:book-research-design-215.png]]
A simple causality graphs relates variables (concepts) with directed arrows.
 
[[Image:simple-causality-graph.png]]
 
There exist many variants. One older method is “operational coding” (Axelrod, 1976) and is somewhat popular in political science. It allows to compute outcomes of reasoning chains
 
Example: Teacher talking about active pedagogies, ICT connections, Forums
 
[[Image:operational-coding-graph.png]]
 
== Software ==
 
Depending on your discipline of reference, you may be familiar with different software families that help drawing graphs:
 
* For concept mapping software, see [[Concept map]]
* For UML-supporting diagram software, see [[Unified modeling language]]
 
We also can recommend a good general purpose free diagram software:
* [http://live.gnome.org/Dia DIA]
 
Finally, for people who hate to draw, there exist useful free [[visualization]] software, in particular:
* [[Graphviz]]
 
== Links ==
 
; Ground theory
* Bob Dick, [http://www.scu.edu.au/schools/gcm/ar/arp/grounded.html Grounded theory: a thumbnail sketch]
* Steve Borgatti, [http://www.analytictech.com/mb870/introtoGT.htm Introduction to Grounded Theory]
* [http://www.groundedtheory.com/ Glaser's Home page]
* [http://en.wikipedia.org/wiki/Grounded_theory Grounded theory], [http://en.wikipedia.org/wiki/Grounded_theory_(Glaser) Grounded theory (Glaser)], [http://en.wikipedia.org/wiki/Grounded_theory_(Strauss) Grounded theory (Strauss)] (Wikipedia)
 
== Bibliography ==
 
=== Introductions and Cookbooks ===
 
* Denzin, Norman K. and Yvonna S. Lincoln (eds) (). The SAGE Handbook of Qualitative Research, Sage Publications. ISBN 0761927573
 
* Dey, I. (1993). Qualitative Data Analysis. Routledge, London.
 
* Marshall, C. & Rossman, G. B. (1995) , Designing Qualitative Research, second edition, Sage, London
 
* Miles, M. B. &amp; Huberman, A. (1994). Qualitative Data Analysis: An Expanded Sourcebook. Sage. ISBN 0803955405 (This is still '''the''' cookbook for structured qualitative analysis.)
 
=== Other ===
 
* Amanda Coffey, A., Holbrook,B. Atkinson, P., Qualitative Data Analysis: Technologies and Representations, School of Social and Administrative Studies, University of Wales, Cardiff [http://www.socresonline.org.uk/1/1/4.html HTML]. (This is an overview of the literature without any practical advice)
 
* Braun, V., and Clarke, V. (2006). Using thematic analysis in psychology. Qualitative Research in Psychology 3 (2), 77-101. Retrieved from http://www.tandfonline.com/doi/abs/10.1191/1478088706qp063oa
 
* De Wever, B.;  T. Schellens, M. Valcke and H. Van Keer. Content analysis schemes to analyze transcripts of online asynchronous discussion groups: A review. ''Computers & Education'' 46 (2006) 6–28 [http://portal.acm.org/citation.cfm?id=1122922 Abstract/PDF] {{ar}}.
 
* Glaser, Barney G. (1992) Basics of grounded theory analysis: emergence vs forcing.  Mill Valley, Ca.: Sociology Press.
 
* Henri, F. (1992). Computer conferencing and content analysis. In A. R. Kaye (Ed.), Collaborative learning through computer conferencing. The Najadan Papers (pp. 117–136). London: Springer-Verlag.
 
* Lombard, Matthew; Jennifer Snyder-Duch and Cheryl Campanella Bracken (2008). Practical Resources for Assessing and Reporting Intercoder Reliability in Content Analysis Research Projects, Retrieved from http://www.temple.edu/sct/mmc/reliability/, accessed 20:59, 7 October 2008 (UTC).
 
* Strauss, Anselm, and Corbin, Juliet (1990) Basics of qualitative research: grounded theory procedures and techniques.  Newbury Park: Sage.
 
=== Examples ===
 
* Cobos, Ruth and Manoli Pifarre, Collaborative knowledge construction in the web supported by the KnowCat system, ''Computers & Education'', Volume 50, Issue 3, , April 2008, Pages 962-978. [http://www.sciencedirect.com/science/article/B6VCJ-4MGVHYM-1/2/1720d969d3cae93cb13c5ab5828fc3f0 HTML/PDF] {{ar}}.
 
* Deryakulu, Deniz & Olkun, Sinan (2007). Analysis of Computer Teachers’ Online Discussion Forum Messages about their Occupational Problems. ''Educational Technology & Society'', 10 (4), 131-142. [http://www.ifets.info/journals/10_4/13.pdf PDF]


* Cognitive maps à la “operational coding”, AXELROD, 1976
* Eilon, Batia and Kliachko, Sarah (2004). The Contribution of a Substance-Oriented Forum to the Study of Human Biology in Science Teacher Education, ''Journal of Technology and Teacher Education'' (2004) 12(1), 5-24.
* Allow to compute outcomes of reasoning chains
* Example: Teacher talking about active pedagogies, ICT connections, Forums


[[Image:book-research-design-216.png]]
* Lockhorst, D., Admiraal, W., Pilot, A., & Veen, W. (2003). Analysis of electronic communication using 5 different perspectives. Paper presented at ORD 2003 (in Heerlen).


* Pena-Shaff and C. Nicholls (2004). Analyzing student interactions and meaning construction in computer bulletin board discussions, Computers & Education 42 (2004), pp. 243–265. [http://dx.doi.org/10.1016/j.compedu.2003.08.003  doi:10.1016/j.compedu.2003.08.003]


[[Category: research methodologies]]
[[Category: research methodologies]]
[[Category: tutorials]]
[[Category:Research methodology tutorials]]

Latest revision as of 15:49, 16 May 2018

This is part of the methodology tutorial (see its table of contents).

Introduction

Learning goals
Understanding the purpose of code data and creating code books
Learn about some descriptive analysis of variables techniques (including situations and roles)
Learn about some causal analysis techniques
Prerequisites
Level
  • It aims at beginners. As with any method, you will have to practise and you likely will need some (human) guidance or at least serious additional reading. The purpose of this tutorial is to help you understand what qualitative data analysis could mean, not to present ready-made solutions.
Quality
  • Slide style, should be expanded

In short, qualitative data analysis usually implies to related and iterative steps. See Methodology tutorial - theory-finding research designs for the general principle.

(1) Data needs to be coded and indexed so that you can find it for data analysis. More particularly:

  • Information coding allows to identify variables and values, therefore allows for systematic analysis of data (and therefore reliability)
  • ensures enhanced construction validity, i.e. that you look at things that will allow you to measure concept

(2) You then can do visualizations, matrices, grammars, etc.

(3) Interpret these

Before we start - Keep your documents and ideas safe !

(1) Write memos (conservation of your thoughts). It is useful to write short memos (vignettes) when an interesting idea pops up, when you looked at something and want to remember your thoughts

(2) Create contact sheets that allow you remember your field work. After each contact (telephone, interviews, observations, etc.), make a short data sheet that should include:

  • A clear tag for indexing purposes (filename or tag on paper), e.g. CONTACT_senteni_2005_3_25.doc.
  • Type of contact, date, place, and a link to the interview notes, transcripts.
  • Principal topics discussed and research variables addressed (or pointer to the interview sheet).
  • Initial interpretative remarks, new speculations, things to discuss next time.

(3) Index your interview notes:

  • Put your transcription (or audio/video files or audio tapes) in a safe place
  • Assign a code to each "text", e.g. INT-1 or INTERVIEW_senteni_3_28-1. The same code could be used as filename.
  • You also may insert the contact sheet (see above)
  • Number the pages if you take notes manually (they can fall down ...)

(4) Do not trust your hard-disk !

Codes and categories

First step in qualitative data analysis is coding.

Icon-hand-right.png A code is a “label” to tag a variable (concept) and/or a value found in a "text"

The coding principle

  1. A code is assigned to each (sub)category, i.e. theoretical variable you work on
    • In other words: you must identify variable names
  2. In addition, you can for each code assign a set of possible values (e.g.: “positive”/”neutral/”negative)
  3. You then will systematically scan all your texts (documents, interview transcripts, dialogue captures, etc.) and tag all occurrences of variables.
  4. Three very different coding strategies exist as we shall see later:
    • Code-book creation according to theory
    • Coding by induction (according to “grounded theory”)
    • Coding by ontological categories
Benefit of coding
  • Coding will allow you to find all informations regarding variables of interest to your research
  • Reliability will be improved
The procedure with a picture

Quali-coding-principle.png

Technical Aspects of coding

  • The safest and most reliable way to code is to use some specialized software
    • e.g. Atlas or Nvivo (NuDist),
    • however, this takes a lot of time !
  • For a smaller piece (e.g. a master thesis), we suggest to simply tag the text on paper
    • you can make a reduced photocopy of the texts to gain some space in the margins
    • overline or circle the text elements you can match to a variable
    • make sure to distinguish between codes and other marks you may leave.
  • Don’t use "flat" and long code-books, introduce hierarchy (according to dimensions identified)
  • Each code should be short but also mnemonic (optimize)
    • e.g. to code according to a schema “principal category” - “sub-category” (“value”) use a code like:
CE-CLIM(+)

instead of:

external_context -climate (positive)
  • Don’t start coding before you have good idea on your coding strategy !
    • either your code book is determined by you research questions and associated theories, frameworks, analysis grids
    • or you really learn how to use an inductive strategy like "grounded theory".

Coding reliability

Assigning a code to a "text" segment is not always obvious and coding similar passages exactly the same way even less. In other words, we have a reliability problem.

There are two ways of improving reliability:

  • Use clear and operational categories
  • Use two or three coders (yourself and a friend) and compute intercoding index. If it is low, you will have to revise your coding scheme.

There exist several formula to compute intercoder (inter-rater) reliability. The most simple one is:

 reliability =  number of agreements (same coding)
               / total codes (agreements plus disagreements)

Read this for a very good introduction.

Code-book creation and management

Code-book creation according to theory

Icon-hand-right.png The list of variables (and their codes), is defined through theoretical reasoning, e.g.

  • analytical frameworks, analysis grids
  • concepts found in the list of research questions and/or hypothesis

Example from an innovation study (about 100 codes):

Categories Codes Theoretical references
properties of the innovation PI

....(fill for your own code book).....

external context EC
    demography CE-D
support for the reform CE-S
internal context IC
adoption processes AP
    official chronology PA-OC
dynamics of the studied site DS
external and internal assistance EIA
causal links CL

Before you start think about your own code book, you really should go through the relevant literature and try to find existing code books (that you then may adapt). E.g. below is an an example of codes use to analyze types of job-related problems of Turkish computer teachers (Deryakulu & Olkun, 2007).

  1. Role Conflict
  2. Inadequate Teacher Induction Policies
  3. Lack of Required Technological Infrastructure and Technical Support
  4. The Status of Computer Subject in School Curriculum
  5. Lack of Appreciation and Positive Feedback from Colleagues
  6. Unsupportive Administrators
  7. Rapidly Changing Nature of Content Knowledge in Computer Education
  8. Lack of Cohesive Computer Curriculum
  9. Insufficiency of Pre-Service Teacher Training Programs
  10. Large Class Sizes
  11. Indifferent Students
  12. Inadequate Supervision and Inspection

Coding by induction (according to “grounded theory”)

Grounded theory (Glaser, Strauss) refers to a set of approaches that that focus on interpretation and theory building, i.e. it is a fully inductive approach. The researcher starts by coding a small data set and then increases the sample in function of emerging theoretical questions. Categories (codes) can be revised at any time.

“Grounded theory begins with a research situation. Within that situation, your task as researcher is to understand what is happening there, and how the players manage their roles. You will mostly do this through observation, conversation and interview. After each bout of data collection you note down the key issues: this I have labelled "note-taking".

Constant comparison is the heart of the process. At first you compare interview (or other data) to interview (or other data). Theory emerges quickly. When it has begun to emerge you compare data to theory.

The results of this comparison are written in the margin of the note-taking as coding. Your task is to identify categories (roughly equivalent to themes or variables) and their properties (in effect their sub-categories). Grounded theory: a thumbnail sketch, retrieved 13:40, 15 October 2008 (UTC))”

Typically, you'd both code phenomena in isolation and relations (so-called axial coding). On starting point for axial coding could be these big abstract observation categories:

  • conditions (causes of a perceived phenomenon)
  • interactions between actors
  • strategies and tactics used by actors
  • consequences of actions

Read more in [Introduction to Grounded Theory].

To use this approach you really should document yourself, as beginner you'd likely fall into various traps, in particular selection and confirmation biases, i.e. you only look at things that will interest you for one or another reason.

Coding by ontological categories

Instead of initially creating a code book from variables found in your research questions or "inductive" coding à la grounded theory, you also start by creating or using a vocabulary for a given domain. This strategy is a compromise between “grounded theory” and “theory driven” approaches.

Below is a table that lists things you could observe in an organization (Bogdan and Biklen, cited by Miles & Huberman:1994 61)

Types Explanation
Context/Situation information on the context
Definition of the situation interpretation of the analyzed situation by people
Perspectives global views of the situation
Ways to look at people and objects detailed perceptions of certain elements
Processes sequences of events, flow, transitions, turning points, etc.
Activities structures of regular behaviors
Events specific activities (non regular ones)
Strategies ways of tackling a problem (strategies, methods, techniques)
Relations and social structure informal links
Methods comments (annotations) of the researcher

In the literature, you may find several other such "accounting schemes". In educational technology, for example, there is a range of relatively simple code books for conversation and asynchronous discussion groups (forum) analysis. De Wever et al. (2006) provide a good overview. Some coding schemes are fairly simple. E.g. Cobos and Pifarré (2008) analyzed "collaborative knowledge construction in the web" with the following coding scheme:

Explanation Asks for clarifying some parts of the document “The following link, which appears in your document, doesn't work now, but it worked a week ago”
Support Express explicit agreement with the document's ideas or information organisation “In my opinion this document is very useful and is easy to read”
Addition Suggests additions to the document: ideas, opinions or information organisation “I think that an index of the sections of the article should be added”
Delete Suggests deletions from the document: ideas, opinions or information organisation “In the summary there are some examples, which perhaps were not necessary”
Correction Suggests changes to the document. They refer to ideas, opinions or information organisation “I think that there is an error in the first paragraph in the conclusion section, where it says 'motor' it should be 'motivation'”

Pena and Nichols (2004) used the following categories:

  • Questions
  • Reply
  • Clarification
  • Interpretation
  • Conflict
  • Assertion
  • Consensus Building
  • Judgment
  • Reflection
  • Support
  • Other

There exist more complex code books: As an example we produce a summary of students' messages code book by Eilon and Kliachko (2004).

"Level A" Group These categories indicate knowledge construction and a significant contribution to peer learning
Comprehension Shows written evidence about the comprehension of the subjects studied by the following categories:
Reproducing- 1 Reproduces the main points, ideas, arguments or messages found in the incoming information with reference to its source and with critical evaluation.
Directing Directs others to relevant sources for the subjects studied (both printed and online).
Clarifying by questions Locates ambiguous, difficult or problematic areas in the new material. Describes the context of the question or the reason for asking it.
Clarifying by answers Gives correct, relevant and comprehensive explanations as answers. Bases the answers on retrieved information while citing its origin.
Reflection Shows written evidence of metacognitive processes that the learner applies when studying new subjects by the following categories:
Linking/extending Links the new information with his or her own previous knowledge. Extends the new knowledge to other domains, especially to STS issues.
Critical evaluation Evaluates the new information critically.
Transformation Applies the new information in an original and creative way, draws inferences, and gives original examples.
"Level B" Group These categories indicate a probable contribution to peer learning, by the following categories:
Documenting Documents l earning exper iences or indivi dual contributions to the group.
Reproducing- 2 Reproduces the main points, ideas, arguments or messages found in the incoming information without any evaluation or original input to it.
Learning outcomes Presents group and individual learning outcomes.
Technical questions/answers Questions or remarks about any subject that does not relate directly to understanding the subjects studied.
Personal knowledge Presents personal knowledge or daily life experiences.
"Level C" Group These categories indicate no contr ibution (or adverse contribution) to peer learning, by the following categories:
Irrelevant/unexplained questions Poses questions without giving the context or reason for asking them.
Casual quotations Includes quotations without their context and without further explanation.
Irrelevant/incorrect answers Offers Irrelevant or incorrect answers to questions sent by other students.
Emotional/personal comments Includes personal comments, which should have been sent by e-mail as instructed by the teacher .

Pattern codes

Some researchers also code patterns (relationships). Simple encoding (above) breaks data down to atoms, categories), whereas pattern coding identifies relationships between atoms. Pattern coding is also one the steps in the inductive grounded theory approach.

The ultimate goal is to detect (and code) regularities, but also variations and singularities.

Some suggested operations:

  1. Detection of co-presence between two values of two variables
    • E.g. people in favor of a new technology (e.g. ICT in the classroom) have a tendency to use it.
  2. Detection of exceptions
    • e.g. technology-friendly teachers who don’t use it in the classroom
    • In this case you may introduce new variable to explain the exception, e.g. the attitude of the superior., of the group culture, the administration, etc.
    • Exceptions also may provoke a change of analysis level (e.g. from individual to organization)

Attention: a co-presence does not prove causality

Descriptive matrices and graphics

Icon-hand-right.png Qualitative analysis attempts to put structure to data (as in exploratory quantitative techniques)

In short: Analysis = visualization

There exist 2 popular types of analysis:

  1. A matrix is a tabulation engaging at least one variable, e.g.
    • Tabulations of central variables by case (equivalent to simple descriptive statistics like histograms)
    • Crosstabulations allowing to analyze how 2 variables interact
  2. Graphs ( networks ) allow to visualize links:
    • temporal links between events
    • causal links between several variables
    • etc.
Some advice
  • When you use these techniques always keep a link to the source (coded data)
  • Try to fit each matrix or graph on a single page (or make sure that you can print things made by computer on a A3 page)
  • you have to favor synthetic vision, but still preserve enough detail to make your artifact interpretable
  • Consult specialized manuals e.g. Miles & Huberman, 1994 for recipes or get inspirations from qualitative research in the same domain as yours.
Icon-warning.png

In this tutorial we can not cover all possible types of analysis, but just provide a few examples of what can be done. Before you start doing any sort of analysis, think about what you need to answer your research questions !

The “context chart”,Miles & Huberman (1994:102)

This technique allows to visualize relations and information flows between rôles and groups

Example - Work flow for a "new pedagogies" program at some university

New-pedagogies-context-chart.png

There exist codified "languages" for this type of analysis, e.g. UML or OSSAD

Once you clearly identified and clarified formal relations, you can use the graph to make annotations (like below)

New-pedagogies-context-chart-annotated.png

Check-lists, Miles & Huberman (1994:105)

Check lists allow to make detailed summary for an analysis of an important variable.

Example: "external support is important for succeeding a reform project"

Examples for external support At counselor level At teacher level
Analysis of deficiencies Fill in each cell as below
Teaching training
Change monitoring
Incentives
Group dynamics adequate: “we have met an organizer 3 times and it has helped us” (ENT-12:10) not adequate: “we just have informed” (ENT-13:20)
etc. ..

Such a table displays various dimensions of and important variable (external support). E.g. in the example above the values of the variable "external support" are listed in the left column

In the other columns we insert summarized facts as reported by different roles.

Review Question: Imagine how you would build such a grid to summarize teacher’s, student’s and assistant’s opinion about technical support for an e-learning platform

Chronological tables Miles & Huberman (1994:110)

  • Can summarize a studied object’s most important events in time

Example: Task assignments for a blended project-oriented class

Activity

Date

imposed tools (products)

1

Get familiar with the subject

21-NOV-2002

links, wiki, blog

2

project ideas, Q&R

29-NOV-2002

classroom

3

Students formulate project ideas

02-DEC-2002

news engine, blog

4

Start project definition

05-DEC-2002

ePBL, blog

5

Finish provisional research plan

06-DEC-2002

ePBL, blog

6

Finish research plan

11-DEC-2002

ePBL, blog

7

Sharing

17-DEC-2002

links, blog, annotation

8

audit

20-DEC-2002

ePBL, blog

9

audit

10-JAN-2003

ePBL, blog

10

Finish paper and product

16-JAN-2003

ePBL, blog

11

Presentation of work

16-JAN-2003

classroom

  • This type of table is useful to identify important events.
  • You can add other information, e.g. tools used in this example


Matrices for roles (function in an organization or program)

Miles & Huberman (1994:124)

Crossing social roles with one or more variables

The abstract principle can be summarized as follows (see below for an example):

roles persons variable 1 variable 2 variable 3
role 1 person 1 cells are filled in with values
(pointing to the source)
person 2
.....
rôle 2 person 9
person 10
..... .....
role n person n

.....

Crossing roles with roles
role 1 ... role 3
role 1 fill in all sorts of informations about interactions
...
role 3

Example: Evaluation of the implementation of a help desk software

Actor Evaluation assistance provided Assistance received Immediate effects Long term effects Explanation of the researcher
Manager - - - demotivating threatened the program Felt threatened by new procedures
Consultant + help choosing the right soft. involved himself - contributed to the start of the experiment - ....
“Help-desk worker” +/- debugging of machines, little help with software better job satisfaction because of the tool slight improvement of throughput is still overloaded with work
Users +/- A few users provided help to peers with the tool debugging of machines, little help with software Were made aware of the high amount of unanswered questions slight improvement of work performance ....

Crossing between roles to visualize relations:

role 1 trainers role 3
rôle 1
trainers “don’t coordinate very much” (1) dosn’t receive all the information (2)
rôle 3


Techniques to study relations

Often qualitative analysis stops with simple descriptive analysis (Daniel K. Schneider believes that this is the case with popular forum analysis). However, you also may use qualitative data to do some kind of "correlational analysis" as you typically do in quantitative data analysis.

Matrices ordered according to concepts (variables)

Clusters (co-variances of variables, case typologies)
  • An idea that certain values should "go together": Hunt co-occurrences in cells
  • E.g.: “Can we observe a correlation between expressed needs for support and

expressed needs for training for a new collaborative platform (data from teachers’s interviews)?

case var 1 need for support need for training need for directives
case 1 important important important
case 2 not important not important not important
case 3 important important important
case 4 yyy not important not important not important
case 5 ..... important important important
case 6.... important not important not important
  • This table shows e.g. that need for support and need for training seem to go together,

e.g. cases 1,3,5 have association of "important", cases 2 and 4 have association of "not important".

  • See next page how we can summarize this sort of information in a crosstab
Co-variance expressed in a corresponding crosstab
training needs * support needs need for support
yes no
need for
training
yes 3 1
no 1 2

We can observer a correlation here: "blue cells" (symmetry) is stronger than "magenta"!

You should check the data above to see if we did this right ...

Example typology with the same data:

Type 1:
"anxious"
Type 2:
"dependent"
Type 3:
"bureaucrats"
Type 4:
"autonomists"
case 1 X
case 2 X
case 3 X
case 4 X
case 5 X
case 6 X
Total individuals
per type
3 1 0 2

We can observe emergence of 3 types to which we assign "labels"

Note: for more than 3 variables use a cluster analysis program

Additional example

The table shows co-occurrence between values of 2 variables. The idea is to find out what effect different types of pressure have on ICT strategies adopted by a school.

Strategies of a school
Type of pressure strategy 1:no reaction strategy 2:a task force is created strategy 3:internal training programs are created strategy 4:resources are reallocated strategy 5: .....
Letters written by parents (N=4)(p=0.8) (N=1)(p=0.2)
Letters written by supervisory boards (N=2)(p=0.4) (N=3)(p=0.6)
newspaper articles (N=1)(p=100%)
type ... ..... ....

Recall: Interpretation of crosstabulation

See also: Methodology tutorial - quantitative data analysis (Cross-tabulation)

We would like to estimate the probability that a given value of the independent (explaining) variable entails a given value of the dependent (explained) variable.

The procedure
  1. calculate the % for each value of the independent variable
    • Note: this can be either the line or the column depending on how you orient your table
  2. compute the % in the other direction
Variable y to explain = Strategies of action
Explaining variable x
do nothing send a mail write a short tutorial Total
Students making indirect suggestion 4 (80%) 1 (20%) 5 (100 %)
Students explicitly
complaining
2 (40%) 3 (60%) 5 (100%)

Interpretation: “... if students explicitly complain, the tutor will react more strongly and engage in more helpful activities.”

  • See: quantitative data analysis.


Typology and causality graphs

There are no limits of what you can draw. Basically such analysis just use a more or less precise language to draw concept maps.

Below we just show two examples.

Typology graphs

  • Display attributes of types in a tree-based manner

Example: Perception of a new program by different implementation agencies (e.g. schools) and its actors (e.g. teachers)

Typology-tree-diagram.png


Dialog and message structures

Message structure can be rendered with "message maps" (e.g. Pena-Shaff and Nicholls, 2004).

Source: Student Interaction and Meaning Construction in Computer Bulletin Board Systems (BBS) http://www.sloanconsortium.org/conference/proceedings/2004/ppt/1087.ppt - reproduced without permission

Of course, you may use another language, such as UML activity diagrams to draw such maps.

Subjective causality graphs

A simple causality graphs relates variables (concepts) with directed arrows.

Simple-causality-graph.png

There exist many variants. One older method is “operational coding” (Axelrod, 1976) and is somewhat popular in political science. It allows to compute outcomes of reasoning chains

Example: Teacher talking about active pedagogies, ICT connections, Forums

Operational-coding-graph.png

Software

Depending on your discipline of reference, you may be familiar with different software families that help drawing graphs:

We also can recommend a good general purpose free diagram software:

Finally, for people who hate to draw, there exist useful free visualization software, in particular:

Links

Ground theory

Bibliography

Introductions and Cookbooks

  • Denzin, Norman K. and Yvonna S. Lincoln (eds) (). The SAGE Handbook of Qualitative Research, Sage Publications. ISBN 0761927573
  • Dey, I. (1993). Qualitative Data Analysis. Routledge, London.
  • Marshall, C. & Rossman, G. B. (1995) , Designing Qualitative Research, second edition, Sage, London
  • Miles, M. B. & Huberman, A. (1994). Qualitative Data Analysis: An Expanded Sourcebook. Sage. ISBN 0803955405 (This is still the cookbook for structured qualitative analysis.)

Other

  • Amanda Coffey, A., Holbrook,B. Atkinson, P., Qualitative Data Analysis: Technologies and Representations, School of Social and Administrative Studies, University of Wales, Cardiff HTML. (This is an overview of the literature without any practical advice)
  • De Wever, B.; T. Schellens, M. Valcke and H. Van Keer. Content analysis schemes to analyze transcripts of online asynchronous discussion groups: A review. Computers & Education 46 (2006) 6–28 Abstract/PDF (Access restricted).
  • Glaser, Barney G. (1992) Basics of grounded theory analysis: emergence vs forcing. Mill Valley, Ca.: Sociology Press.
  • Henri, F. (1992). Computer conferencing and content analysis. In A. R. Kaye (Ed.), Collaborative learning through computer conferencing. The Najadan Papers (pp. 117–136). London: Springer-Verlag.
  • Lombard, Matthew; Jennifer Snyder-Duch and Cheryl Campanella Bracken (2008). Practical Resources for Assessing and Reporting Intercoder Reliability in Content Analysis Research Projects, Retrieved from http://www.temple.edu/sct/mmc/reliability/, accessed 20:59, 7 October 2008 (UTC).
  • Strauss, Anselm, and Corbin, Juliet (1990) Basics of qualitative research: grounded theory procedures and techniques. Newbury Park: Sage.

Examples

  • Cobos, Ruth and Manoli Pifarre, Collaborative knowledge construction in the web supported by the KnowCat system, Computers & Education, Volume 50, Issue 3, , April 2008, Pages 962-978. HTML/PDF (Access restricted).
  • Deryakulu, Deniz & Olkun, Sinan (2007). Analysis of Computer Teachers’ Online Discussion Forum Messages about their Occupational Problems. Educational Technology & Society, 10 (4), 131-142. PDF
  • Eilon, Batia and Kliachko, Sarah (2004). The Contribution of a Substance-Oriented Forum to the Study of Human Biology in Science Teacher Education, Journal of Technology and Teacher Education (2004) 12(1), 5-24.
  • Lockhorst, D., Admiraal, W., Pilot, A., & Veen, W. (2003). Analysis of electronic communication using 5 different perspectives. Paper presented at ORD 2003 (in Heerlen).
  • Pena-Shaff and C. Nicholls (2004). Analyzing student interactions and meaning construction in computer bulletin board discussions, Computers & Education 42 (2004), pp. 243–265. doi:10.1016/j.compedu.2003.08.003