Web-harvest: Difference between revisions
Jump to navigation
Jump to search
(Created page with "{{Data mining and learning analytics tools |field_screenshot= |field_name=Web-harvest |field_developers= |field_license_type=Free&Open source |field_free_software_licence=BSD ...") |
No edit summary |
||
(2 intermediate revisions by the same user not shown) | |||
Line 1: | Line 1: | ||
{{Data mining and learning analytics tools | {{Data mining and learning analytics tools | ||
|field_screenshot= | |field_logo=Webharvest logo.jpg | ||
|field_screenshot=Web-harvest.jpg | |||
|field_name=Web-harvest | |field_name=Web-harvest | ||
|field_developers= | |field_developers=Web-Harvest Team | ||
|field_license_type=Free&Open source | |field_license_type=Free&Open source | ||
|field_free_software_licence=BSD license (original version) | |field_free_software_licence=BSD license (original version) | ||
Line 8: | Line 9: | ||
|field_last_version=2.0 | |field_last_version=2.0 | ||
|field_website=http://web-harvest.sourceforge.net/ | |field_website=http://web-harvest.sourceforge.net/ | ||
|field_description=Web-Harvest is Open Source Web Data Extraction tool written in Java. It offers a way to collect desired Web pages and extract useful data from them. In order to do that, it leverages well established techniques and technologies for text/xml manipulation such as XSLT, XQuery and Regular Expressions. Web-Harvest mainly focuses on HTML/XML based web sites which still make vast majority of the Web content. On the other hand, it could be easily supplemented by custom Java libraries in order to augment its extraction capabilities. | |field_description=Web-Harvest is Open Source Web Data Extraction tool written in Java. It offers a way to collect desired Web pages and extract useful data from them. In order to do that, it leverages well established techniques and technologies for text/xml manipulation such as XSLT, XQuery and Regular Expressions. Web-Harvest mainly focuses on HTML/XML based web sites which still make vast majority of the Web content. On the other hand, it could be easily supplemented by custom Java libraries in order to augment its extraction capabilities. | ||
|field_data_tool_type= | |field_data_tool_type= | ||
|field_plugin_of= | |field_plugin_of= |