Joint Source and Schema Evolution: Insights from a Study of 195 FOSS Projects

Panos Vassiliadis, Fation Shehaj, George Kalampokis, Apostolos V. Zarras.

Summary

In this paper, we address the problem of the co-evolution of Free Open Source Software projects with the relational schemata that they encompass. We exploit a data set of 195 publicly available schema histories of FOSS projects hosted in Github, for which we locally cloned their respective project and measured their evolution progress. Our first research question asks which percentage of the projects demonstrates a “hand-in-hand” schema and source code co-evolution? To address this question, we defined synchronicity by allowing a bounded amount of lag between the cumulative evolution of the schema and the entire project. A core finding is that there are all kinds of behaviors with respect to project and schema co-evolution, resulting in only a small number of projects where the evolution of schema and project progress in sync. Moreover, we discovered that after exceeding a 5-year threshold of project life, schemata gravitate to lower rates of evolution, which practically means that, with time, the schemata stop evolving as actively as they originally did. To answer a second question, on whether evolution comes early in the life of a schema, we measured how often does the cumulative progress of schema evolution exceed the respective progress of source change, as well as the respective progress of time. The results indicate that a large majority of schemata demonstrates early advance of schema change with respect to code evolution, and, an even larger majority is also demonstrating an advance of schema evolution with respect to time, too. Third, we asked at which time point in their lives do schemata attain a substantial percentage of their evolution. Although there are exceptions to the general trend, a large number of projects attracts a large percentage of their schema evolution disproportionately early with respect to their project life span. Indicatively, 98 of the 195 projects attained 75% of the evolution in just the first 20% of their project’s lifetime.

Texts

Panos Vassiliadis, Fation Shehaj, George Kalampokis, Apostolos V. Zarras. Joint Source and Schema Evolution: Insights from a Study of 195 FOSS Projects. 26th International Conference on Extending Database Technology (EDBT 2023), pp. 27–39, March 28 - March 31, Ioannina, Greece.

Local copy of the paper (PDF) [Schema Biographies: research on Schema Evolution]

Links, Videos, etc

A 20' video presentation is available as a youtube video.