herraiz.org | Blog
Main | Blog | Research papers | PhD thesis | GnuPG (PGP)
Main | Blog | Research papers | PhD thesis | GnuPG (PGP)
Last week, Daniel RodrÃguez (Information Engineering Research Unit, UAH) visited our department to talk about how to start to collaborate in the field Mining Software Repositories, where to get data, what topics we could do join works on. I prepared a set of slides with practical information about datasets, conferences and journals, to be used as a facilitator for discussion. The slides are available in SlideShare:
The presentation contains some links to datasets that can be easily used for empirical studies, and that makes it possible to conduct replicable studies. Also, there is paper at MSR 2010 that describes the data sources used for the MSR Challenge; the paper is entitled Mining Challenge 2010: FreeBSD, GNOME Desktop and Debian/Ubuntu and contains description of the FreeBSD repositories, of FLOSSMetrics data about GNOME and of the Ultimate Debian Database. If you use the paper for your research, please consider citing it (download the BibTeX citation as text file):
@InProceedings{challenge_msr2010,
author = {Abram Hindle
and Israel Herraiz
and Emad Shihab
and Zheng Ming Jiang},
title = {Mining {C}hallenge 2010:
{F}ree{BSD}, {GNOME} {D}esktop
and {D}ebian/{U}buntu},
booktitle = {Proceedings of the
7th IEEE International Working Conference
on Mining Software Repositories},
pages = {82--85},
year = {2010},
publisher = {IEEE Computer Society},
}