Content analysis: Difference between revisions

The educational technology and digital learning wiki
Jump to navigation Jump to search
 
(5 intermediate revisions by the same user not shown)
Line 64: Line 64:
=== List of tools ===
=== List of tools ===


(needs completion)
See: '''''[[Portal: Data mining and learning analytics tools]]''''


* [http://www.tropes.fr/ Tropes] logiciel d'analyse sémantique de textes
'''Undocumented tools in the portal above'''
 
* [http://www.toko-sigmund.org/ tOKo] is an open source tool for text analysis and browsing a corpus of documents. It implements a wide variety of text analysis and browsing functions in an interactive user interface. It can for instance be used to analyse the exchange of information, for example in a community forum or through a collection of interconnected weblogs.


* [http://www.leximancer.com/ Leximancer], allows to summarize and navigate large text data (e.g. a wiki site) with various visualization tools.  (commercial, $750 AUD single license or $150 one-month online)
* [http://www.leximancer.com/ Leximancer], allows to summarize and navigate large text data (e.g. a wiki site) with various visualization tools.  (commercial, $750 AUD single license or $150 one-month online)


* [http://cran.at.r-project.org/web/views/NaturalLanguageProcessing.html CRAN Task View: Natural Language Processing] contains a list of packages useful for natural language processing.
* [http://www.cs.cmu.edu/~cprose/TagHelper.html TagHelper] is one of more popular tools for analysing texts in education. It also has been integrated into learning systems (Dönmez, 2005; Kumar, 2007). [[LightSide]] can be considered a successor.
 
* [http://rapid-i.com/content/view/181/190/ RapidMiner]. Quote: {{quotation|is unquestionably the world-leading open-source system for data mining. It is available as a stand-alone application for data analysis and as a data mining engine for the integration into own products.}}. The system is based on the earlier [http://sfbci.uni-dortmund.de/index.php?option=com_content&task=view&id=112&Itemid=144 YALE] system. Commercial versions of RapidMiner can do more than the free community edition.
** See [[RapidMiner]]
 
* [http://www.cs.waikato.ac.nz/ml/weka/ Weka 3: Data Mining Software in Java] is quote: {{quotation|a collection of machine learning algorithms for data mining tasks. The algorithms can either be applied directly to a dataset or called from your own Java code. Weka contains tools for data pre-processing, classification, regression, clustering, association rules, and visualization. It is also well-suited for developing new machine learning schemes.}}
 
* [http://gate.ac.uk/ GATE] is quote an {{quotation|open source software capable of solving almost any text processing problem}}. See the [http://gate.ac.uk/2mins.html 2 Minute Guide]. It can be compared to RapidMiner.
 
* [http://www.cs.cmu.edu/~cprose/TagHelper.html TagHelper] is one of more popular tools for analysing texts in education. It also has been integrated into learning systems (Dönmez, 2005; Kumar, 2007). Lightside (below) can be considered a successor.


* [http://lightsidelabs.com/what/research/ Lightside] is a free open source text minin and machine learning tool that provides features to assess student writing. Quote: {{quotation| LightSide began as an open source text mining and machine learning tool. Our core technology is freely available. We offer quick-start tutorials on machine learning for beginners and an introduction to error analysis. The workbench, as well as the core technology for machine learning and feature extraction, was developed under grants from National Science Foundation and the Office of Naval Research, through Carnegie Mellon University’s Language Technologies Institute.}}
'''Sites with more links to tools'''
** [[http://ankara.lti.cs.cmu.edu/side/LightSide_Researchers_Manual.pdf Researcher's Manual] (PDF)


More should be added here ! In the meantime, see
* [http://dirtdirectory.org/categories/text-mining Dirt Directory], text mining category. Probably the best source.
* [http://www.textanalysis.info/ Text Analysis Info]. Overview on software that analyses texts and other sources of human communication
* [http://alias-i.com/lingpipe/web/competition.html LigPipe's comptition]
* [http://alias-i.com/lingpipe/web/competition.html LigPipe's comptition]
* [http://www.tapor.ca/ TAPoR], text analysis portal
* [http://www.tapor.ca/ TAPoR], text analysis portal
* [http://cogcomp.cs.illinois.edu/page/software Software] (Cognitive Computation Group, University of Illinois)
* [http://en.wikipedia.org/wiki/Text_mining Text Mining] at Wikipedia.
* [http://en.wikipedia.org/wiki/Text_mining Text Mining] at Wikipedia.



Latest revision as of 20:38, 30 November 2014

Draft

Definition

Content analysis refers to a family of qualitative data analysis methods or to various forms of quantitative analysis.

“Content analysis (sometimes called textual analysis when dealing exclusively with text) is a standard methodology in the social sciences for studying the content of communication. Earl Babbie defines it as "the study of recorded human communications, such as books, websites, paintings and laws." Harold Lasswell formulated the core questions of content analysis: "Who says what, to whom, why, to what extent and with what effect?." Ole Holsti (1969) offers a broad definition of content analysis as "any technique for making inferences by objectively and systematically identifying specified characteristics of messages."” (Wikipedia, retrieved nov 1 2007)

See also:

This entry should be split into two different articles: qualitative content analysis and machine analysis (e.g. text mining) - Daniel K. Schneider 14:01, 12 March 2012 (CET).

Links

(Semi-) manual qualitative data analysis

Quantitative analysis of large corpus

  • TAPoR is a reimagining of the original TAPoR (Text Analysis Portal for Research). It is a both a resource for discovery and a community. The TAPoR team has created a place for Humanities scholars, students and others interested in applying digital tools to their textual research to find the tools they need, contribute their experience and share new tools they have developed or used with others. Best overal resource for textual research.
  • Text mining (German Wikipedia). Better, if you speak German.
  • Text Insight. serves as a research and academic portal for those doing qualitative analysis and text analytics. Main focus of the site is the Leximancer tool However, all researchers, students, academics, and commercial entities are welcome to use this portal and its resources.

Software

See other wiki pages of interest

These pages include specialised technologies

  • latent semantic analysis and indexing, a family of analysis techniques that that assume that a text contains a semantic structure through a kind vector space model and some kind of factor analysis that identifies relationships between terms.

List of tools

See: Portal: Data mining and learning analytics tools'

Undocumented tools in the portal above

  • Leximancer, allows to summarize and navigate large text data (e.g. a wiki site) with various visualization tools. (commercial, $750 AUD single license or $150 one-month online)
  • TagHelper is one of more popular tools for analysing texts in education. It also has been integrated into learning systems (Dönmez, 2005; Kumar, 2007). LightSide can be considered a successor.

Sites with more links to tools

Simple word graphics

Bibliography

(to do )

  • Rose, C. P., Wang, Y.C., Cui, Y., Arguello, J., Stegmann, K., Weinberger, A., Fischer, F. (In Press). Analyzing Collaborative Learning Processes Automatically: Exploiting the Advances of Computational Linguistics in Computer-Supported Collaborative Learning , International Journal of Computer Supported Collaborative Learning.
    • Taghelper tools
  • Dönmez, P., Rosé, C., Stegmann, K., Weinberger, A., & Fischer, F. (2005). Supporting CSCL with automatic corpus analysis technology. Paper presented at the Proceedings of th 2005 Conference on Computer Support for Collaborative Learning: Learning 2005: The Next 10 Years! (pp. 125 – 134), Taipei, Taiwan.
    • Use of TagHelper
  • Kumar, R., Rosé, C., Wang, Y.-C., Joshi, M., & Robinson, A. (2007). Tutorial dialogue as adaptive collaborative learning support. Paper presented at the Proceeding of the 2007 Conference on Artificial Intelligence in Education: Building Technology Rich Learning Contexts That Work (pp. 383 – 390).
    • Use of TagHelper

Text and data mining

  • Xiaojun Chen, Yunming Ye, Graham Williams, and Xiaofei Xu, A Survey of Open Source Data Mining Systems. PDF


Analysis of text quality

....

Analysis of on-line interactions

  • De Wever, B., Schellens, T., Valcke, M., and Van Keer, H. 2006. Content analysis schemes to analyze transcripts of online asynchronous discussion groups: a review. Comput. Educ. 46, 1 (Jan. 2006), 6-28. DOI= http://dx.doi.org/10.1016/j.compedu.2005.04.005
  • Pena-Shaff, J. B. and Nicholls, C. 2004. Analyzing student interactions and meaning construction in computer bulletin board discussions. Computers and Education 42, 3 (Apr. 2004), 243-265. DOI= http://dx.doi.org/10.1016/j.compedu.2003.08.003
  • Rourke, L., Anderson, T., Garrison, D. R., & Archer, W. (2001). Methodological Issues in the Content Analysis of Computer Conference Transcripts. International Journal of Artificial Intelligence in Education, 12(1), 8-22. PDF