|Title||R Ultimate Multilabel Dataset Repository|
|Publication Type||Conference Paper|
|Year of Publication||2016|
|Authors||Charte, Francisco, Charte David, Rivera Antonio J., del Jesus M. J., and Herrera F.|
|Conference Name||11th International Conference on Hybrid Artificial Intelligent Systems, HAIS 2016|
|Conference Location||Seville (Spain)|
Multilabeled data is everywhere on the Internet. From news on digital media and entries published in blogs, to videos hosted in Youtube, every object is usually tagged with a set of labels. This way they can be categorized into several non-exclusive groups. However, publicly available multilabel datasets (MLDs) are not so common. There is a handful of websites providing a few of them, using disparate file formats. Finding proper MLDs, converting them into the correct format and locating the appropriate bibliographic data to cite them are some of the difficulties usually confronted by researchers and practitioners. In this paper RUMDR (R Ultimate Multilabel Dataset Repository), a new multilabel dataset repository aimed to fuse all public MLDs, is introduced, along with mldr.datasets, an R package which eases the process of retrieving MLDs and their bibliographic information, exporting them to the desired file formats and partitioning them.
R Ultimate Multilabel Dataset Repository