Web-harvest

From EduTech Wiki
Jump to: navigation, search
Webharvest logo.jpg


Web-harvest 2.0 (2010/02/17)

Web-harvest.jpg

Developed by: Web-Harvest Team
License: BSD license (original version)
Web page : Tool homepage
Tool type :

Tool.png

The last edition of this page was on: 2014/02/26

The Completion level of this page is : Low


SHORT DESCRIPTION

Web-Harvest is Open Source Web Data Extraction tool written in Java. It offers a way to collect desired Web pages and extract useful data from them. In order to do that, it leverages well established techniques and technologies for text/xml manipulation such as XSLT, XQuery and Regular Expressions. Web-Harvest mainly focuses on HTML/XML based web sites which still make vast majority of the Web content. On the other hand, it could be easily supplemented by custom Java libraries in order to augment its extraction capabilities.


TOOL CHARACTERISTICS

Usability

Authors of this page consider that this tool is '.

Tool orientation

This tool is designed for general purpose analysis.

Data mining type

This tool is made for '.

Manipulation type

This tool is designed for Data extraction.

IMPORT FORMAT :

EXPORT FORMAT :


Tool objective(s) in the field of Learning Sciences

Analysis & Visualisation of data
Predicting student performance
Student modelling
Social Network Analysis (SNA)
Constructing courseware

Providing feedback for supporting instructors:
Recommendations for students
Grouping students:
Developing concept maps:
Planning/scheduling/monitoring
Experimentation/observation

Tool can perform:

  • Data extraction of type:
  • Transformation of type:
  • Data analysis of type:
  • Data visualisation of type: (These visualisations can be updated in "real time" )



ABOUT USERS

Tool is suitable for:

Students/Learners/Consumers
Teachers/Tutors/Managers
Researchers
Developers/Designers
Organisations/Institutions/Firms
Others

Required skills:

STATISTICS:

PROGRAMMING:

SYSTEM ADMINISTRATION:

DATA MINING MODELS:



FREE TEXT


Tool version : Web-harvest 2.0 2010/02/17
(blank line)

Developed by : Web-Harvest Team
(blank line)
Tool Web page : http://web-harvest.sourceforge.net/
(blank line)
Tool type :
(blank line)
License:BSD license (original version)

Web-harvest.jpg

1 SHORT DESCRIPTION


Web-Harvest is Open Source Web Data Extraction tool written in Java. It offers a way to collect desired Web pages and extract useful data from them. In order to do that, it leverages well established techniques and technologies for text/xml manipulation such as XSLT, XQuery and Regular Expressions. Web-Harvest mainly focuses on HTML/XML based web sites which still make vast majority of the Web content. On the other hand, it could be easily supplemented by custom Java libraries in order to augment its extraction capabilities.

2 TOOL CHARACTERISTICS


Tool orientation Data mining type Usability
This tool is designed for general purpose analysis. This tool is designed for . Authors of this page consider that this tool is .
Data import format Data export format
. .
Tool objective(s) in the field of Learning Sciences

☐ Analysis & Visualisation of data
☐ Predicting student performance
☐ Student modelling
☐ Social Network Analysis (SNA)
☐ Constructing courseware

☐ Providing feedback for supporting instructors:
☐ Recommendations for students
☐ Grouping students:
☐ Developing concept maps:
☐ Planning/scheduling/monitoring
Experimentation/observation

Can perform data extraction of type:

Can perform data transformation of type:

Can perform data analysis of type:

Can perform data visualisation of type:
(These visualisations can be updated in "real time" )


3 ABOUT USER


Tool is suitable for:
Students/Learners/Consumers:☐ Teachers/Tutors/Managers:☐ Researchers:☐ Organisations/Institutions/Firms:☐ Others:☐
Required skills:
Statistics: Programming: System administration: Data mining models:

4 OTHER TOOL INFORMATION


Web-harvest.jpg
Web-harvest.jpg
Webharvest logo.jpg
Web-harvest
BSD license (original version)
Free&Open source
Web-Harvest Team
2010/02/17
2.0
http://web-harvest.sourceforge.net/
Web-Harvest is Open Source Web Data Extraction tool written in Java. It offers a way to collect desired Web pages and extract useful data from them. In order to do that, it leverages well established techniques and technologies for text/xml manipulation such as XSLT, XQuery and Regular Expressions. Web-Harvest mainly focuses on HTML/XML based web sites which still make vast majority of the Web content. On the other hand, it could be easily supplemented by custom Java libraries in order to augment its extraction capabilities.
General analysis
Data extraction
Low
Facts about "Web-harvest"
Analysis orientationGeneral analysis +
Data manipulation typeData extraction +
Free software licenseBSD license (original version) +
Has completion levelLow +
Has descriptionWeb-Harvest is Open Source Web Data Extrac
Web-Harvest is Open Source Web Data Extraction tool written in Java. It offers a way to collect desired Web pages and extract useful data from them. In order to do that, it leverages well established techniques and technologies for text/xml manipulation such as XSLT, XQuery and Regular Expressions. Web-Harvest mainly focuses on HTML/XML based web sites which still make vast majority of the Web content. On the other hand, it could be easily supplemented by custom Java libraries in order to augment its extraction capabilities.
er to augment its extraction capabilities. +
Has last revision number2.0 +
Has logoWebharvest logo.jpg +
Has nameWeb-harvest +
Has screenshotWeb-harvest.jpg +
Has websitehttp://web-harvest.sourceforge.net/ +
Is developed byWeb-Harvest Team +
Last editionFebruary 26, 2014 +
License typeFree&Open source +
Was last released onFebruary 17, 2010 +