Automated Tagging for the Retrieval of Software Resources in Grid and Cloud Infrastructures
A key challenge for Grid and Cloud infrastructures is to make their services easily accessible and attractive to end-users. In this paper, the authors introduce tagging capabilities to the Minersoft system, a powerful tool for software search and discovery, in order to help end-users locate application software suitable to their needs. Minersoft is now able to predict and automatically assign tags to the software resources it indexes. To achieve this, they model tag prediction as a multi-label classification problem. Using data extracted from production-quality Grid and Cloud computing infrastructures, they evaluate a considerable number of multi-label classifiers and discuss which classifier, and with what settings, is most appropriate for this particular problem.
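
To illustrate what treating tag prediction as a multi-label classification problem can look like, the following is a minimal sketch and not the Minersoft implementation. It assumes the binary relevance transformation (one independent yes/no decision per tag), which is only one of several common multi-label approaches; the abstract does not say which classifiers were evaluated. The training data, the tag names, the function names, and the similarity-based decision rule are all hypothetical and chosen only to keep the example self-contained.

```python
# Hypothetical toy data: (textual description of a software resource, its tags).
# None of these descriptions or tags come from Minersoft.
TRAIN = [
    ("mpi parallel linear algebra library", {"hpc", "math"}),
    ("sequence alignment genome tool", {"bioinformatics"}),
    ("parallel genome assembly mpi", {"hpc", "bioinformatics"}),
    ("matrix solver linear algebra", {"math"}),
]


def tokens(text):
    """Split a description into a set of lowercase words."""
    return set(text.lower().split())


def jaccard(a, b):
    """Jaccard similarity of two token sets (0.0 when both are empty)."""
    union = a | b
    return len(a & b) / len(union) if union else 0.0


def mean(values):
    """Arithmetic mean, defined as 0.0 for an empty sequence."""
    values = list(values)
    return sum(values) / len(values) if values else 0.0


def predict_tags(description, train=TRAIN):
    """Binary relevance: make one independent binary decision per tag.

    A tag is assigned when the description is, on average, more similar
    to training resources that carry the tag than to those that do not.
    This decision rule is an illustrative stand-in for a real classifier.
    """
    query = tokens(description)
    all_tags = set().union(*(tags for _, tags in train))
    predicted = set()
    for tag in sorted(all_tags):
        pos = mean(jaccard(query, tokens(d)) for d, t in train if tag in t)
        neg = mean(jaccard(query, tokens(d)) for d, t in train if tag not in t)
        if pos > neg:
            predicted.add(tag)
    return predicted


if __name__ == "__main__":
    print(sorted(predict_tags("mpi parallel solver")))
    print(sorted(predict_tags("genome alignment")))
```

On this toy data, the first query receives two tags (`hpc` and `math`) and the second receives one (`bioinformatics`), which is the defining property of the multi-label setting: a resource may carry any number of tags at once. In practice the per-tag rule would be replaced by a trained classifier, and the choice of classifier and its settings is exactly what the evaluation described above is meant to determine.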