herraiz.org | Blog

Main | Blog | Research papers | PhD thesis | GnuPG (PGP)

 Subscribe to this blog in a reader

Intensive Metrics for Software Evolution

In natural sciences, intensive metrics do not depend on the size of the system (or amount of material) being measured. For instance, the temperature of an object or the hardness of a material are intensive properties.

In the case of software projects, many of the metrics used to study them are extensive, that is, the metrics depend on the size of the system. Thus, it is hard to compare different software projects using these metrics.

For the case of the Apache Software Foundation, we have found some ratios that are intensive properties, and we show how to use them to study the evolution of different projects, comparing them regardless their size, intended audience, technology used or maturity state. The metrics are obtained from the ratios between communication and development activies in the project. Interested? See a tag cloud of the keywords of this paper:

activity annotated apache around asf code commits community data developer different discussion email evolution fig figure found hadoop httpd information intensive international issue lines list mailing main measures messages metric number open paper pp projects ratio related releases results shows size software source study system total traffic used vertical web
created at TagCrowd.com

The paper is available in PDF, and it can be cited using this BibTeX code (also available as a text file for your convenience):

@InProceedings{gala2013:apache,
  author    = 	 {Santiago Gala-Perez
		  and Gregorio Robles
		  and Jesus M. Gonzalez-Barahona 
		  and Israel Herraiz},
  title     = 	 {Intensive Metrics for the Study of the Evolution of Open Source Projects},
  booktitle =    {10th IEEE Working Conference on Mining Software Repositories},
  year      = 	 {2013},
  address   = 	 {San Francisco, California, USA},
}

We are presenting this paper at the 10th IEEE Working Conference on Mining Software Repositories, with the following slides (see the slides at SlideShare.net):

Finally, this work is derived from Santiago Gala's master thesis, available in PDF format and including full details about the study of intensive metrics for software evolution. There is also a replication package that you can use to verify and extend our empirical study.

Written on May 17 2013 | Tags: #research, #msr, #software evolution
blog comments powered by Disqus