Gensim

From EduTech Wiki
Jump to: navigation, search


Gensim

No image.png

Developed by:
License:
Web page : [ Tool homepage]
Tool type :

Tool.png

The last edition of this page was on: 2016/12/23

The Completion level of this page is : Low


SHORT DESCRIPTION

Quote from the about page (12/2016): Gensim started off as a collection of various Python scripts for the Czech Digital Mathematics Library dml.cz in 2008, where it served to generate a short list of the most similar articles to a given article (gensim = “generate similar”). I also wanted to try these fancy “Latent Semantic Methods”, but the libraries that realized the necessary computation were not much fun to work with.

By now, gensim is—to my knowledge—the most robust, efficient and hassle-free piece of software to realize unsupervised semantic modelling from plain text. It stands in contrast to brittle homework-assignment-implementations that do not scale on one hand, and robust java-esque projects that take forever just to run “hello world”.


TOOL CHARACTERISTICS

Usability

Authors of this page consider that this tool is '.

Tool orientation

This tool is designed for general purpose analysis.

Data mining type

This tool is made for Text mining.

Manipulation type

This tool is designed for '.

IMPORT FORMAT :

EXPORT FORMAT :


Tool objective(s) in the field of Learning Sciences

Analysis & Visualisation of data
Predicting student performance
Student modelling
Social Network Analysis (SNA)
Constructing courseware

Providing feedback for supporting instructors:
Recommendations for students
Grouping students:
Developing concept maps:
Planning/scheduling/monitoring
Experimentation/observation

Tool can perform:

  • Data extraction of type:
  • Transformation of type:
  • Data analysis of type:
  • Data visualisation of type: (These visualisations can be updated in "real time" )



ABOUT USERS

Tool is suitable for:

Students/Learners/Consumers
Teachers/Tutors/Managers
Researchers
Developers/Designers
Organisations/Institutions/Firms
Others

Required skills:

STATISTICS: Medium

PROGRAMMING: N/A

SYSTEM ADMINISTRATION: N/A

DATA MINING MODELS: Medium



FREE TEXT


Tool version : Gensim
(blank line)

Developed by :
(blank line)
Tool Web page : N/A
(blank line)
Tool type :
(blank line)

No image.png

1 SHORT DESCRIPTION


Quote from the about page (12/2016): Gensim started off as a collection of various Python scripts for the Czech Digital Mathematics Library dml.cz in 2008, where it served to generate a short list of the most similar articles to a given article (gensim = “generate similar”). I also wanted to try these fancy “Latent Semantic Methods”, but the libraries that realized the necessary computation were not much fun to work with.

By now, gensim is—to my knowledge—the most robust, efficient and hassle-free piece of software to realize unsupervised semantic modelling from plain text. It stands in contrast to brittle homework-assignment-implementations that do not scale on one hand, and robust java-esque projects that take forever just to run “hello world”.

2 TOOL CHARACTERISTICS


Tool orientation Data mining type Usability
This tool is designed for general purpose analysis. This tool is designed for Text mining. Authors of this page consider that this tool is .
Data import format Data export format
. .
Tool objective(s) in the field of Learning Sciences

☐ Analysis & Visualisation of data
☐ Predicting student performance
☐ Student modelling
☐ Social Network Analysis (SNA)
☐ Constructing courseware

☐ Providing feedback for supporting instructors:
☐ Recommendations for students
☐ Grouping students:
☐ Developing concept maps:
☐ Planning/scheduling/monitoring
Experimentation/observation

Can perform data extraction of type:

Can perform data transformation of type:

Can perform data analysis of type:

Can perform data visualisation of type:
(These visualisations can be updated in "real time" )


3 ABOUT USER


Tool is suitable for:
Students/Learners/Consumers:☐ Teachers/Tutors/Managers:☐ Researchers:☐ Organisations/Institutions/Firms:☐ Others:☐
Required skills:
Statistics: MEDIUM Programming: System administration: Data mining models: MEDIUM

4 OTHER TOOL INFORMATION


No screenshot.jpg
Gensim
Quote from the about page (12/2016): Gensim started off as a collection of various Python scripts for the Czech Digital Mathematics Library dml.cz in 2008, where it served to generate a short list of the most similar articles to a given article (gensim = “generate similar”). I also wanted to try these fancy “Latent Semantic Methods”, but the libraries that realized the necessary computation were not much fun to work with.

By now, gensim is—to my knowledge—the most robust, efficient and hassle-free piece of software to realize unsupervised semantic modelling from plain text. It stands in contrast to brittle homework-assignment-implementations that do not scale on one hand, and robust java-esque projects that take forever just to run “hello world”.

General analysis
Medium
N/A
N/A
Medium
Text mining
Low

@inproceedings{rehurek_lrec,

     title = Template:Software Framework for Topic Modelling with Large Corpora,
     author = {Radim {\v R}eh{\r u}{\v r}ek and Petr Sojka},
     booktitle = {{Proceedings of the LREC 2010 Workshop on New
          Challenges for NLP Frameworks}},
     pages = {45--50},
     year = 2010,
     month = May,
     day = 22,
     publisher = {ELRA},
     address = {Valletta, Malta},
     note={\url{http://is.muni.cz/publication/884893/en}},
     language={English}
}
Facts about "Gensim"
Analysis orientationGeneral analysis +
Has completion levelLow +
Has descriptionQuote from the
Quote from the [http://radimrehurek.com/gensim/about.html about page (12/2016): Gensim started off as a collection of various Python scripts for the Czech Digital Mathematics Library dml.cz in 2008, where it served to generate a short list of the most similar articles to a given article (gensim = “generate similar”). I also wanted to try these fancy “Latent Semantic Methods”, but the libraries that realized the necessary computation were not much fun to work with. By now, gensim is—to my knowledge—the most robust, efficient and hassle-free piece of software to realize unsupervised semantic modelling from plain text. It stands in contrast to brittle homework-assignment-implementations that do not scale on one hand, and robust java-esque projects that take forever just to run “hello world”.
at take forever just to run “hello world”. +
Has nameGensim +
Last editionDecember 23, 2016 +
Mining tool typeText mining +
User data mining models levelMedium +
User programming levelN/A +
User statistics levelMedium +
User system engineering levelN/A +