Articles | Volume 25, issue 10
https://doi.org/10.5194/nhess-25-3853-2025
https://doi.org/10.5194/nhess-25-3853-2025
Research article
 | 
08 Oct 2025
Research article |  | 08 Oct 2025

Insights into tectonic zonation models from the clustering analysis of seismicity in southern and south-eastern Spain

David Montiel-López, Antonella Peresan, Elisa Varini, and Sergio Molina
Abstract

The southern and south-eastern parts of Spain exhibit the highest seismicity rate in the country. However, although the recently developed Quaternary Active Fault database of Iberia (QAFI, García-Mayordomo et al.2012) collected the available information existing in the study area regarding fault data for their use in seismic hazard applications, this information is of limited use since data quality is very heterogeneous: few earthquakes are associated with specific fault segments, and occurrence time periods (when indicated) are affected by high uncertainties (Gaspar-Escribano et al.2015). This has motivated the definition of alternative tectonic zonation models, to be used for evaluating the seismic hazard. So far, the clustering properties have not been considered in this regard, though they can provide essential information about the features of seismic energy release, depending on the tectonic style of a region (Talebi et al.2024). This is why in this work the properties of the seismicity in terms of clustering are evaluated by applying the nearest-neighbour (NN) algorithm to the south-eastern Spain region. The scale parameters needed for the NN algorithm are optimised through the study of the z score and the temporal anomalies between events in the identified clusters for each run. The tree structure, using graph theory notation, has proved useful in the determination of the critical threshold that separates the background (independent) seismicity from the clustered (dependent) seismicity in the NN algorithm. Once the clusters have been identified, their properties have been quantified in terms of a selection of complexity measures: outdegree, closeness and average node depth. This procedure has been applied by considering two different completeness magnitudes: Mw 3.0 (the mean completeness magnitude for the entire catalogue) and Mw 2.1 (accounting for the most recent part of the catalogue). The results are similar in terms of proportion of foreshocks, mainshocks and aftershocks, and indicate a clear distinction between the western-most part (higher complexity) and eastern-most part (lower complexity). To check this result, three different zonation models have been examined and cross-compared; two of them passed the Kolmogorov–Smirnov (KS) test, meaning the distributions of the selected complexity measures are not the same for the different zones defined in the models. These zonations can be used in order to assess the seismic hazard, as they account for the influence of the tectonic setting on the patterns of earthquake occurrence, including the features of background and clustered seismicity components.

Share
1 Introduction

Southern and south-eastern Spain are the areas within the Iberian Peninsula with the highest seismic hazard (IGN-UPM Working Group2013; Kharazian et al.2021). The tectonic setting in this region can be related to the geological features. In this sense, the main geological domains are the Betic Cordillera to the north, divided into internal and external zones. The external zone, divided into Prebetic and Subbetic, originally formed the southern and south-eastern Mesozoic and Tertiary sedimentary cover of the Iberian shield and is arranged in many tectonic units (López-Casado et al.2001). The main geological domains can be seen in Fig. 1.

https://nhess.copernicus.org/articles/25/3853/2025/nhess-25-3853-2025-f01

Figure 1Main geological domains of southern and south-eastern Spain (adapted from Buforn et al.1995). The red-edged dotted-filled polygons identify the internal zone, whereas the blue-edged polygons with a strip pattern fill mark the location of the external zone. The coloured stars represent the most damaging earthquakes in the area since the pre-instrumental era of the catalogue. The triangles represent the seismic stations' location, obtained from González (2017), and are colour-coded by the first year of operation.

Southern and south-eastern Spain experiences low to moderate seismic activity due to the collision of the Africa and Eurasia plates. Seismic energy is released mostly through small and occasional moderate earthquakes, typically at shallow depths, with a few rare deep events. The region's seismic history began with detection by local stations in the early 20th century, leading to the development of a national seismic network in the 1960s, with further improvements in the 1980s. Most earthquakes in the region can be classified as low magnitude, with the exception of notable events such as the 1910 Adra coast earthquake (Mw 6.2) and the deep 1954 Durcal earthquake (Mw 7.0). Some of the most damaging earthquakes in the recent instrumental period have occurred in the Murcia region, i.e. Lorca's 2011 Mw 5.1 earthquake, Mula's 1999 Mw 4.9 and Bulla's 2002 Mw 5.0. All of them caused damage to the buildings and the Lorca earthquake even caused nine deaths (Molina et al.2018).

The historical seismicity of the region from the 15th to 20th centuries includes significant damaging earthquakes with onshore epicentres, such as those in 1431 (Granada), 1518 (Vera, Almería), 1680 (Alhaurín el Grande, Málaga), and 1804 (Dalias, Almería) with intensity VIII–IX (EMS-98) and estimated magnitude Mw>6.0 and the two most damaging earthquakes of our seismic catalogue: the 1829 Torrevieja earthquake and the 1884 Arena del Rey earthquakes, both with intensity IX–X (EMS-98) and estimated magnitude Mw>6.5 (Vidal-Sánchez1993).

The update of the Spanish seismic hazard map carried out in 2012 started with the identification of zones with different seismogenic characteristics. It is important to state that the high uncertainties in the Quaternary Active Fault database of Iberia (QAFI) (García-Mayordomo et al.2012) and the lack of earthquakes related to certain fault segments as pointed out by Gaspar-Escribano et al. (2015, p. 67) rule out using a fault-based seismic source model. The ZESIS model (García-Mayordomo2015a; Gaspar-Escribano et al.2015) originated from a previous one created following the expert judgement methodology after the cooperation of a large number of earth science researchers from Spain, Portugal and France, in the framework of the first Iberian Meeting on Active Faults and Paleoseismology (Iberfault-2010), the European project SHARE (Seismic Hazard Harmonization in Europe) (García-Mayordomo et al.2010) and, eventually, by the Advisory Board for the New Seismic Hazard Map of Spain. The seismogenic source zones model can be consulted and downloaded from the Instituto Geológico y Minero de España (IGME) web under the name of ZESIS database (IGME2015): http://info.igme.es/zesis/ (last access: 5 February 2025). Although some of the tectonic characteristics are shared between all of the subregions that were defined for the ZESIS zonation in southern and south-eastern Spain, regions such as the Granada Basin are more prone to exhibit swarm-like seismic activity (Saccorotti et al.2002; Stich et al.2024). In this sense, it is important to be able to identify the clustering characteristics of seismicity in different areas, as it could affect the seismic hazard analysis studies.

The declustering of the seismicity is one of the most important steps regarding seismic hazard analysis, as one of the hypotheses is that the seismicity in the area follows a Poisson distribution (i.e. all of the events are independent). This assumption cannot hold if the catalogue contains clustered seismicity data. Since the late 20th century, different declustering algorithms have been proposed: window methods, such as Reasenberg and Jones' (Reasenberg1985; Reasenberg and Jones1989), Gardner and Knopoff's (Gardner and Knopoff1974) or Uhrhammer's (Uhrhammer1986); stochastic declustering methods (Zhuang et al.2002; Zhuang2006) based on the Epidemic-Type Aftershock Sequence model (Ogata1998); and correlation methods such as the nearest-neighbour (NN) algorithm (Zaliapin et al.2008; Zaliapin and Ben-Zion2013a, 2020). For a more detailed explanation of the declustering methods, we refer the reader to van Stiphout et al. (2012).

Performing a clustering analysis on the seismic catalogue has several benefits: (1) it enables working with a background seismicity catalogue (with independent events), (2) it enables the study of the time-dependent seismic hazard in seismic series, and (3) it could shed light on the mechanisms behind the seismic behaviour of certain areas by identifying the events in the clusters.

In this work, we apply the NN algorithm to the seismic catalogue of southern and south-eastern Spain from 1970 up to the end of 2023. Our focus is to identify the main clusters present in the region. Then, we study the characteristics of the main clusters to see whether there are important differences inside the region regarding the complexity and magnitude of the mainshocks. As a result of this analysis, a declustered catalogue is obtained, which can be used for subsequent seismic hazard analysis.

2 Spanish seismic catalogue

Southern Spain is characterised by low to moderate shallow seismicity with rare high-magnitude earthquakes. The catalogue from the southern part of Spain (retrieved from https://www.ign.es/web/sis-catalogo-terremotos, last access: 25 September 2025, IGN2022) contains 46 296 earthquakes inside the region constrained by longitudes [7.0205° W, 1.5526° E] and latitudes [35.8762° N, 39.8548° N], from 1970 until the end of 2023 and with depths shallower than 50 km. It has been homogenised following the equations from IGN-UPM Working Group (2013). The ranges of local magnitude for the application of such equations have been ignored for magnitude types 1, 2, and 4. The local magnitude or intensity scales can be checked at https://www.ign.es/web/resources/docs/IGNCnig/SIS-Tipo-Magnitud.pdf (last access: 22 May 2025) (IGN2025). A discussion on the homogenisation process can be read in Appendix A. Only earthquakes belonging to tectonic zones in our study region with similar behaviour (crustal shortening direction) as defined in the ZESIS zonation (García-Mayordomo2015a) have been considered, as shown in Fig. 2.

https://nhess.copernicus.org/articles/25/3853/2025/nhess-25-3853-2025-f02

Figure 2Catalogue of southern and south-eastern Spain from 1970 to the end of 2023. It can be seen that faulting system determines the location of the epicentres for the most relevant earthquakes (Mw between 5.0 and 6.0 and marked as red stars) as most of them are located near these structures. The fault traces have been obtained from the QAFI database (García-Mayordomo et al.2012; IGME2022) and the tectonic zonation polygons from the ZESIS database (IGME2015). The coloured stars represent the most damaging earthquakes in the area since the pre-instrumental era of the catalogue.

Figure 3 shows the depth–energy distribution along with the depth histogram as an inset. As can be seen, the seismicity is concentrated around the 0 and 10 km range, and decreases exponentially with depth. The magnitude ranges from 0.8 to Mw 5.4.

https://nhess.copernicus.org/articles/25/3853/2025/nhess-25-3853-2025-f03

Figure 3Magnitude–depth distribution for the chosen catalogue. The histogram in the inset shows the depth distribution.

Download

The detection sensitivity and, therefore, the completeness magnitude (Mc) of the catalogue have changed over time due to upgrades in the seismic network. This fact is thoroughly discussed in the work of González (2017), the data from which have made it possible to identify four periods with distinct seismic network sensitivity: 1970–1984, 1985–1998, 1999–2013, and 2014–2023. These periods are also evident in Fig. 5 by analysing the slope changes in the cumulative number of stations per year. Figure 4 shows the events of the Spanish catalogue for the area of study. The number of events with magnitudes lower than 3.0 increased from 1970–1984 to 1985 on, and then the number of events with magnitudes lower than 2.0 spiked from 1999 on, reflecting an improvement in the sensitivity of the Spanish seismic network.

https://nhess.copernicus.org/articles/25/3853/2025/nhess-25-3853-2025-f04

Figure 4Spatial distribution of earthquake magnitudes in the four time intervals of the catalogue, identified as having different seismic network sensitivities: 1970–1984 (a), 1985–1998 (b), 1999–2013 (c), and 2014–2023 (d).

3 Nearest-neighbour (NN) algorithm for clustering analysis

In this work, the NN algorithm is applied to extract information about the clusters in the region of study. The NN algorithm (Zaliapin et al.2008; Zaliapin and Ben-Zion2013a, 2020), classifies the events in either background (those independent, typically assumed to follow a Poisson distribution) and clustered (those whose occurrence is related to other earthquakes, i.e. foreshocks and aftershocks). To do this, the spatiotemporal distances between each pair of events are computed following Baiesi and Paczuski (2004), using the times tk at which events occur, along with their hypocentre's coordinates ϕk,λk,zk and magnitude mk, for k=1,,N and N the number of seismic events.

This distance between any pair of earthquakes k and j is defined as follows:

(1) η k j = t k j r k j d 10 - w m k , t k j 0 , , t k j < 0 ,

where tkj=tj-tk are the times between events, rkj are the distances between the hypocentres or epicentres of the events, d is the fractal dimension of the events in the catalogue and w is the input parameter that defines the magnitude in the spatiotemporal distance computation. The parameter w usually equals the b value of the Gutenberg–Richter (G–R) law (Gutenberg and Ritcher1944). The NN distance of each earthquake j is then defined as ηij=minkηkj, thus identifying its NN i. The NN distance ηij can be further decomposed (Zaliapin et al.2008) into rescaled time, Tij, and rescaled space, Rij, components (with ηij=TijRij), which are defined as follows:

(2) T i j = t i j 10 - q b m i , R i j = r d 10 - p b m i , with q + p = 1 .

Importantly, Zaliapin and Ben-Zion (2013a) noticed that the NN distance exhibits a bimodal distribution that can be approximately decomposed into two Gaussian distributions: one corresponding to the background seismicity, and the other to the clustered seismicity. After calculating the threshold value η0, which is typically the distance at which the probability densities of the two Gaussian distributions intersect, each event j is classified as background seismicity if it is more than η0 away from its nearest event i (i.e. ηij>η0). Otherwise, it is classified as a clustered event and assigned a label identifying it as a foreshock, aftershock or mainshock, where the mainshock is defined as the largest-magnitude event in the cluster, and foreshocks and aftershocks are the events that precede or follow it in time within the cluster. Each clustered event is also labelled by its parent, here represented by the mainshock. This information is needed to decluster the catalogue (keeping only the mainshocks and background events) and studying the cluster structure, by using the cluster data and the labels from the events along with their time and magnitude.

3.1 Parameters for the NN algorithm

The computation of the rescaled time and space of the NN distance requires both the b value as defined in the G–R law and the fractal dimension of the events in the catalogue. Given that the seismic network's sensitivity in south-eastern Spain has changed over time, the b value and completeness magnitude have been computed for each of the periods provided in Fig. 4, as well as for the entire catalogue.

In Fig. 5 the values obtained for each of the parameters are compared with the changes in the seismic network near the study area using the supplementary data provided by González (2017). The b value and the completeness magnitude have been computed using ZMAP software (Wiemer2001). It can be seen that the completeness magnitude steadily decreases and stabilises, whereas the b value increases sharply around 1980 when the seismic network starts its further development, and then decreases again to stay around the value obtained for the whole catalogue (b=1.12).

https://nhess.copernicus.org/articles/25/3853/2025/nhess-25-3853-2025-f05

Figure 5Changes in the b value and completeness magnitude over time along with the evolution in the number of seismic stations near the area of study for the period 1970–2014. The blue circle markers represent the b values and the red cross markers the completeness magnitude values from the year in which they are plotted to the next marker's year location. For example, the first blue marker would be the b value from 1970 to 1984 (location of the next marker). The last marker would comprise the period from 2013 to 2023. The dashed lines indicate the values of the parameters for the whole catalogue (1970–2023).

Download

Regarding the fractal dimension d, two different approaches have been used: (1) optimising the box-counting fractal dimension and (2) using the correlation integral approach as implemented in ZMAP.

To optimise the box-counting fractal dimension, two different parameters have been studied: the box size and the minimum number of events that a box must contain to be counted. The latter parameter has been restricted between a minimum number of one event and a maximum number given by the floor of l, where l is the longitude of the box in kilometres. This value has been selected by expert judgement.

Figure 6 shows the evolution of the fractal dimension when the minimum number of events per box and the size of the boxes are varied. Each iteration considers a higher value for the maximum number of events a box should contain to be considered in the computation of the fractal dimension.

https://nhess.copernicus.org/articles/25/3853/2025/nhess-25-3853-2025-f06

Figure 6Fractal dimension computed with (a) decreasing sizes using l/2n, with l the maximum box size and n the step of reduction and (b) decreasing by 10 km the size of the box from l until 10 km. In this case l has been set as 200 km. The dashed lines indicate the median value of the fractal dimension and the filled area around the median is 1 absolute median standard deviation.

Download

Using a constant decreasing size of box (Fig. 6b) for the computation of the box-counting fractal dimension results in a more stable value for it, db)=1.45±0.06, when compared with the exponential decrease approach, da)=1.37±0.09.

The fractal dimension has also been obtained using the correlation integral method as implemented in the ZMAP software, obtaining a value of 1.5.

The influence on the variation of these parameters within the uncertainty limits has been studied in Appendix A section, so two sets of parameters for the catalogue are obtained and listed in Table 1. These two sets of parameters define two subsets of earthquakes extracted from the catalogue, which we refer to as the first and second datasets. The first dataset contains 1806 events, and the second dataset, which includes the first, contains 20 057 events. Each dataset, equipped with its set of parameters, is analysed using the NN algorithm to associate each event j with its NN i and NN distance ηij.

Table 1Optimal sets of parameters to be used in further cluster structure analysis.

Download Print Version | Download XLSX

Then, for both sets of parameters a further comparison on the anomalies due to the completeness magnitude can be seen in Appendix C.

Finally, we note that to compute the rescaled space component, Rij, due to the large uncertainties in hypocentral depth determination, only epicentres have been considered. Following Zaliapin and Ben-Zion (2020), the analysis is performed considering epicentral coordinates and thus is not affected by the depth uncertainties. When dealing with global-scale analysis, where very deep earthquakes are reported, events are eventually selected within specified depth ranges (e.g. Zaliapin and Ben-Zion2016), while for narrow-scale studies, where reliable depth information is available, more precise depth data can be readily used (e.g. Martínez‐Garzón et al.2018).

3.2 Role of η0 in the identification of clusters

After associating each event with its NN and calculating NN distances, a threshold distance η0 is chosen to separate background events from clustered events as well as to identify earthquake clusters.

Figure 7 shows the histogram of the NN distances obtained from the first dataset (panel a) and the second dataset (panel b). As expected, the distribution of the NN distances is bimodal in both cases and can be approximated by a mixture of two Gaussian distributions, one relating to background seismicity (yellow line) and the other to clustered seismicity (orange line).

The threshold distance is set equal to the value in which the two estimated Gaussian distributions intersect: η0=-3.4 for the first dataset and η0=-3.5 for the second dataset (black solid vertical lines). After that, earthquake clusters are identified in both datasets and those with the largest sizes are analysed to assess the choice of η0 and gain a deeper understanding of its role in the declustering algorithm.

https://nhess.copernicus.org/articles/25/3853/2025/nhess-25-3853-2025-f07

Figure 7Histogram of the NN distances for (a) the first dataset and (b) the second dataset. Mixture model of two Gaussian density functions, one for background seismicity (yellow line) and the other for clustered seismicity (orange line), fitted to the datasets. Then, panels (c) and (d) show the 2D joint distribution of rescaled time and space for both datasets. The threshold distance η0 is given by the estimated values −3.4 in panels (a) and (c) and −3.5 in panels (b) and (d) (solid vertical line) or is given by the fixed value −4.5 (dashed vertical line in all of the panels).

Download

To illustrate the analysis we performed on the most populated clusters, we consider two large clusters identified in both datasets: the Adra sequence, whose mainshock occurred on 23 December 1993, Mw 5.2; and the Granada swarm, whose mainshock occurred on 23 January 2021, Mw 4.4. Figure 8a and 8c show the cluster structures of the Adra sequence identified in the first dataset with η0=-3.4 and in the second dataset with η0=-3.5, respectively. In Fig. 9a and c, the spatial distributions of the epicentres are also shown for these clusters. Similarly, the results of the Granada swarm identified in both datasets by using the estimated values of η0 are shown in Figs. 10a and c and 11a and c.

An obvious note is that, for both seismic sequences, the events are more numerous in the cluster of the second dataset than in that of the first dataset (the lower the completeness magnitude, the more events in the cluster).

In Fig. 8a the mainshock of the cluster is preceded by a foreshock. A closer inspection of the spatial distribution of magnitude reveals that this foreshock must be an anomaly, as there is more than 1° difference in longitude between the foreshock and the mainshock (Fig. 9a). The same argument holds for those aftershocks that directly follow the mainshock, but which in turn generate few aftershocks.

Note that even the Granada swarm clusters identified in the two datasets have events anomalously distant from the mainshock (Fig. 10a and c).

To prevent the erroneous classification of distant events into clusters, which are probably background events, we manually adjusted the estimated distance threshold values for the two datasets by gradually lowering them until a satisfactory fixed value of η0=-4.5 was reached. Appendix D shows the difference in the anomaly count in the clusters when fixing a lower η0 value.

The right-hand panels in Figs. 89 show the resulting clusters of the Adra sequence obtained from the two datasets with tuned parameter η0=-4.5. Similarly, the right-hand panels in Figs. 1011 show the resulting clusters of the Granada swarm.

From the comparison of the pairwise panels on the left- and right-hand sides in Figs. 811 (the clusters comes from the same dataset, but with different η0 values), we note that the complexity in the cluster structures slightly changes by using the estimated (−3.4 and −3.5) or tuned (−4.5) η0 values. Nevertheless, the choice of η0 influences the spatial dispersion of the events belonging to the cluster: events that are anomalously distant from their mainshocks are excluded from the clusters when using η0=-4.5.

As a result of this study, we focus only on the clustering results obtained by the following parameters: (1) the completeness magnitude Mc=3.2 (first dataset), since only minor changes in the cluster structures are observed when it is lowered to Mc=3.0; (2) the tuned value η0=-4.5, as it more effectively defines the cluster structure (eliminating anomalous foreshock activity and/or distant events) and prevents background events from being misclassified as clustered events, which could significantly affect the cluster characterisation. In the following figures, the fault names have been shortened in order for them not to hinder the visualisation of the plots. The complete names according to QAFI 4.0 database (García-Mayordomo et al.2012; IGME2022) are listed in the following Table 2.

Table 2Notation used to identify the faults appearing in south-eastern Spain.

Download Print Version | Download XLSX

https://nhess.copernicus.org/articles/25/3853/2025/nhess-25-3853-2025-f08

Figure 8The Adra sequence, whose mainshock occurred in 23 December 1993, Mw 5.2. Cluster structures obtained from the different set of parameters and critical threshold values: (a) first dataset, estimated η0=-3.4; (b) first dataset, tuned η0=-4.5; (c) second dataset, estimated η0=-3.5; (d) second dataset, tuned η0=-4.5.

Download

https://nhess.copernicus.org/articles/25/3853/2025/nhess-25-3853-2025-f09

Figure 9The Adra sequence, whose mainshock occurred in 23 December 1993, Mw 5.2. Spatial distributions of the earthquakes' epicentres in the clusters obtained from the different set of parameters and critical threshold values: (a) first dataset, estimated η0=-3.4; (b) first dataset, tuned η0=-4.5; (c) second dataset, estimated η0=-3.5; (d) second dataset, tuned η0=-4.5. The central inset shows the location of the cluster's mainshock in Spain and the purple lines in each of the subplots mark the position of the active faults from QAFI v4.0 (García-Mayordomo et al.2012; IGME2022) in the area. The blue labels indicate the different confirmed fault names and the black labels with the inverted triangle mark the position of the main cities in the area (population greater than 30 000).

https://nhess.copernicus.org/articles/25/3853/2025/nhess-25-3853-2025-f10

Figure 10The Granada swarm, whose mainshock occurred in 23 January 2021, Mw 4.4. Cluster structures obtained from the different set of parameters and critical threshold values: (a) first dataset, estimated η0=-3.4; (b) first dataset, tuned η0=-4.5; (c) second dataset, estimated η0=-3.5; (d) second dataset, tuned η0=-4.5.

Download

https://nhess.copernicus.org/articles/25/3853/2025/nhess-25-3853-2025-f11

Figure 11The Granada swarm, whose mainshock occurred in 23 January 2021, Mw 4.4. Spatial distributions of the earthquake's epicentres in the clusters obtained from the different set of parameters and critical threshold values: (a) first dataset, estimated η0=-3.4; (b) first dataset, tuned η0=-4.5; (c) second dataset, estimated η0=-3.5; (d) second dataset, tuned η0=-4.5. The central inset shows the location of the cluster's mainshock in Spain and the purple lines in each of the subplots mark the position of the active faults from QAFI v4.0 (García-Mayordomo et al.2012; IGME2022) in the area. The blue labels indicate the different confirmed fault names and the black labels with the inverted triangle mark the position of the main cities in the area (population greater than 30 000).

4 Graph analysis of earthquake clusters

An earthquake cluster identified by the NN algorithm can be structured as a tree graph, illustrating the links between events and their NNs.

Clusters can be studied from a graph-theory point of view to identify their features and assess the complexity of their structures. Based on graph analysis, Zaliapin and Ben-Zion (2013b) found two distinct types of cluster sequences, referred to as burst-like (with an umbrella-like shape) and swarm-like (with a chain-like tree graph shape) sequences, whose spatial variability helps characterise Californian regions with different heat-flow and other viscosity-related properties. Similarly, Varini et al. (2020) used two other scalar measures of tree graphs to describe the structure of the clusters and classified them according to their complexity in north-eastern Italy. Some of these scalar measures will be used in this study to assess the complexity of clusters containing five or more events and to examine spatial patterns in the study area.

4.1 Scalar measures of tree graphs

Let the tree graph representation of an earthquake cluster be denoted by T, where the nodes of T are the events within the cluster, and the edges are set between the nodes and their NNs. Let |T| be the number of nodes in T, i.e. the cluster size.

4.1.1 Outdegree centralisation

The number of outgoing edges from a node v is named the outdegree of v and denoted by δ(vT). The outdegree centrality of v is then defined as cδ(vT)=δvT/(|T|-1), that is, the ratio between the outdegree of v and the maximum possible outdegree |T|-1 of a node in a general tree of size |T|. The outdegree centrality of v takes values [0,1]. The higher the outdegree centrality of v, the more important the node is, meaning that v is more central in the tree T as it has more outgoing edges than other nodes. Given the node v* with highest outdegree centrality in the tree T, the outdegree centralisation of T is given by

(3) C δ ( T ) = v T c δ ( v * T ) - c δ ( v T ) | T | - 1 ,

which represents the difference between the outdegree centrality of the node with highest centrality and that of all other nodes, normalised with respect to the maximum possible analogous difference in a general tree of size |T|. The outdegree centralisation is also an index in [0,1] that summarises the outdegree centralities of all nodes in T. A high outdegree centralisation of T indicates the presence of nodes with high outdegree centralities. For example, burst-like clusters have outdegree centralisations close to 1.

4.1.2 Closeness centralisation

The closeness centrality cc(v|T), as defined by Bavelas (1950), gives an idea of how close a node v is to the rest of the nodes w in the tree T. It can be calculated as the reciprocal of the ratio between the sum of the lengths d(v,w), and the minimum possible analogous sum in a general tree of size |T|: ccv|T=(|T|-1)/wTd(v,w). Closeness centrality of node v also ranges [0,1]. The higher the closeness centrality of v, the more important the node is, meaning that v is more central in the tree T because it is well connected to the other nodes by paths. Given the node v with highest closeness centrality in the tree T, the closeness centralisation of T is given by

(4) C c ( T ) = v T c c ( v * T ) - c c ( v T ) ( | T | - 1 ) 2 / | T | ,

which represents the difference between the closeness centrality of the node with highest centrality and that of all other nodes, normalised with respect to the maximum possible analogous difference in a general tree of size |T|. The closeness centralisation also varies as [0,1]. A high closeness centralisation of T indicates the presence of nodes that are well connected to the others. For example, burst-like clusters have closeness centralisations close to 1.

4.1.3 Average leaf depth

The average leaf depth d was introduced by Zaliapin and Ben-Zion (2013b) and is evaluated by considering some special nodes in T: the root and the leaves. The root corresponds to the first event in time within the cluster, while the leaves represent events that are not NNs of other nodes. In other words, the root is linked to all other nodes by a path and the leaves have a null outdegree. Given a leaf of T, its leaf depth is the length of the shortest path between the root and this leaf. Therefore, the average leaf depth d is given by the sum of all leaf depths divided by the number of leaves in T. In general, clusters with a burst-like structure have low average leaf depth values (closer to 1), as most of the nodes sprout from the root of the tree; chain-like clusters, on the other hand, have high average leaf depth values (closer to |T|-1).

In summary, values close to 1 for both centralisation measures indicate simple burst-like cluster structures, referred to as burst-like sequences by Zaliapin and Ben-Zion (2013b), because the mainshock node is directly connected to most of the aftershock nodes. Smaller values of the centralisation measures correspond to more complex chain-like/swarm-like clusters. In contrast, small (large) average leaf depth values denote simple burst-like (complex swarm-like) clusters. However, note that this last measure is not normalised (it does not have a constant upper bound) and, unlike the centralisation measures, is influenced by the cluster size.

Figure 12 shows an example of such cluster structure types according to their tree graphs. In reality more complex cluster structures are expected due to combinations of these typologies. For instance, the Adra sequence (Fig. 8) could be classified as a double burst-like cluster whereas the Granada swarm (Fig. 10) has a more pronounced chain-like component.

https://nhess.copernicus.org/articles/25/3853/2025/nhess-25-3853-2025-f12

Figure 12Cluster types according to their tree graph structures. The events are represented with sizes and grey-shaded colours according to their magnitude.

Download

4.2 Results for completeness magnitude Mw 3.2

Based on previous results, earthquake clusters containing five or more events are considered, as obtained from the NN algorithm with a tuned threshold distance η0=-4.5 applied to the south-eastern Spain catalogue with completeness magnitude Mw 3.2.

Figure 13 shows four maps where the mainshock epicentres of these clusters are coloured according to the values of the outdegree centralisation (top left), closeness centralisation (top right) and average leaf depth (bottom left) for their corresponding clusters, as well as to the mainshock magnitudes (bottom right). The yellow colour corresponds to simple burst-like clusters, whereas clusters become more complex with swarm-like features as the colour shifts towards dark red. Based on the outdegree and closeness centralisation maps, the south-western part of the region is dominated by more swarm-like clusters (e.g. the Mw 4.4 Granada swarm) than elsewhere. By comparing the centralisation maps with the magnitude map, we observe that the north-eastern part of the region was affected by the strongest earthquakes that occurred during the study period (1970–2023) and their centralisation values mostly approach simple burst-like behaviours (e.g. the Mw 5.2 Adra sequence). This aligns with the behaviour commonly associated with strong earthquakes, which are known to generate numerous aftershocks. All of these remarks are also consistent with the results in the average leaf depth map, which closely resemble those of closeness centralisation. This was expected, as both measures are based on selected path lengths between nodes.

https://nhess.copernicus.org/articles/25/3853/2025/nhess-25-3853-2025-f13

Figure 13Maps of the mainshock epicentres of the clusters identified by NN algorithm with η0=-4.5 and completeness magnitude Mw 3.2. Colours represent the cluster values of the (a) outdegree centralisation, (b) closeness centralisation, average leaf depth (c), and mainshock magnitude (d). The mainshocks of both Adra sequence (north-west) and Granada swarm (south-east) have been marked with stars. The red lines mark the position of the active faults from QAFI v4.0 (García-Mayordomo et al.2012; IGME2022) in the area and the blue lines the tectonic zonation for Spain (García-Mayordomo2015b; IGME2015). The centre inset shows the location of the epicentres within Spain.

4.3 Results for completeness magnitude Mw 2.1

The same analysis was conducted by lowering the completeness magnitude to Mw 2.1 and shortening the study period from 1999 (instead of 1970) to 2023, according to the results on the completeness of the catalogue in Fig. 5. The motivation lies in verifying whether the results obtained so far remain valid in a complete catalogue that includes low-magnitude events, even if the time period is necessarily shorter.

Table 3a compares the NN declustering of the two catalogues with completeness thresholds, Mw 3.2 and Mw 2.1, presenting the total number of events and those classified as background activity (singles) and clustered activity, respectively. The clustered events are further classified as foreshocks, mainshocks, and aftershocks. On the one hand, the smaller size of the previously studied dataset with Mc 3.2 made it easier to manage and visually display the clusters. On the other hand, the new dataset with Mc 2.1 offers the advantage of including a significantly larger number of events, including those of lower magnitude. It can be seen that the clustered to background seismicity ratio increases as the completeness magnitude decreases. This is to be expected, as lower-magnitude earthquakes are more frequently associated with seismic sequences in the aftershock category. It is also in agreement with the increase in the ratio of aftershocks (17 % for the Mw 3.2 completeness magnitude vs. 25 % for the Mw 2.1 completeness magnitude).

Table 3Summary of the clustering statistics for the two different completeness magnitudes.

CtoB: clustered seismicity to background seismicity ratio.

Download Print Version | Download XLSX

The maps in Fig. 14 illustrate the results of the new additional analysis. The comparison with Fig. 13 shows that the new findings are consistent with previous results: the mainshocks in the south-western part of the region exhibit low to moderate magnitude and their associated scalar measures suggest swarm-like behaviour. In contrast, the north-eastern part of the region has experienced recent strong earthquakes, with the scalar measures indicating burst-like activity.

https://nhess.copernicus.org/articles/25/3853/2025/nhess-25-3853-2025-f14

Figure 14Maps of the mainshock epicentres of the clusters identified by NN algorithm with η0=-4.5 and completeness magnitude Mw 2.1. Colours represent the cluster values of the (a) outdegree centralisation, (b) closeness centralisation, average leaf depth (c), and mainshock magnitude (d). The mainshocks of both Adra sequence (north-west) and Granada swarm (south-east) have been marked with stars. The red lines mark the position of the active faults from QAFI v4.0 (García-Mayordomo et al.2012) in the area and the blue lines the tectonic zonation for Spain (García-Mayordomo2015b; IGME2015). The centre inset shows the location of the epicentres within Spain.

5 Probing seismic zonation through cluster analysis

It is interesting to compare the results for both completeness magnitude thresholds in terms of average node depth and cluster size (Fig. 15), as this representation can provide some insights into the possible relation between the cluster size and their different types. It can be observed that, especially for Mc=2.1, the average node depth <d> naturally increases with cluster size for swarm-like sequences, while for burst-like sequences <d> remains quite low even for large clusters, composed by more than a hundred events. Moreover, considering the geographical information (longitude values) it can be seen that in the western zone (longitude <-3°) complex clusters (higher average node depth) are more common than in the eastern zone, even for similar cluster sizes.

https://nhess.copernicus.org/articles/25/3853/2025/nhess-25-3853-2025-f15

Figure 15Average leaf depth vs. cluster size for completeness magnitude 3.2 (a) and completeness magnitude 2.1 (b). The red markers are events at the western side of the area of study (longitude <-3°) and those blue-coloured belong to the eastern side (longitude >-3°).

Download

This motivates us to proceed with the clustering analysis of the dataset with completeness magnitude Mw 2.1 in order to propose possible zonation models for the study area complementing the information provided by the ZESIS tectonic zonation (García-Mayordomo2015a). In Fig. 16, the study area was divided into square cells of 0.5° in length, with only those cells containing at least four mainshocks being considered.

Using the information from Fig. 14 the median values of the scalar measures within the cells are calculated, along with the median magnitudes of the mainshocks. The resulting median values are then represented by colours in the cells according to a specified colour scale (Fig. 16); the background areas represent the external zones (blue area) and internal zones (red area) as defined in Fig. 1.

All maps in Fig. 16 show a clear separation between two different regions: the western region is characterised by complex swarm-like clusters with mainshocks of low to moderate magnitude (dominance of red-orange cells in the scalar measure maps and of blue–light-blue cells in the magnitude map), while the eastern region displays simple burst-like clusters with even stronger mainshocks (dominance of yellow–light-orange cells in the scalar measure maps and of green cells in the magnitude map).

https://nhess.copernicus.org/articles/25/3853/2025/nhess-25-3853-2025-f16

Figure 16Gridded median values of the outdegree centralisation (a), closeness centralisation (b), average leaf depth (c), and mainshock magnitude (d). The background areas represent the external zones (blue area) and internal zones (red area) as defined in Fig. 1. The red lines mark the position of the active faults from QAFI v4.0 (García-Mayordomo et al.2012; IGME2022) in the region.

Based on the spatial distribution of the scalar measures under consideration (Fig. 16) and on the geographic map of the geological domains (Fig. 2), we propose three zonation models, using the ZESIS tectonic zones as building pieces. These zonations are illustrated in Fig. 17. It can be seen that there is a slight difference between Zonation 1 and Zonation 2, as Zonation 2 includes an additional ZESIS tectonic zone. Zonation 3 is a modified version of Zonation 2, whose zone 1 is further divided into an eastern zone and a western zone. This has been done in order to check if further subdivisions of the most populated zone (in terms of seismic events) show different characteristics in the regional scale for the studied parameters. If such variations exist, a more complex zonation model could be proposed.

https://nhess.copernicus.org/articles/25/3853/2025/nhess-25-3853-2025-f17

Figure 17The three proposed zonation models of the study area based on the identified clustering features and the ZESIS tectonic zonation (García-Mayordomo2015b; IGME2015).

To assess the consistency of the proposed zonation models with the assumptions made, we investigate whether the values of the analysed scalar measures (i.e. outdegree centralisation, closeness centralisation and average leaf depth) differ significantly across the zones of a zonation. The two-sample Kolmogorov–Smirnov (KS) test (Rohatgi and Ehsanes Saleh2015) is used to verify the null hypothesis that two samples originate from the same population, versus the alternative hypothesis that they come from different populations. The results of KS tests for Zonation 1 are summarised in Table 4. Since the p value is below the significance level 0.05 for all scalar measures, the null hypotheses are rejected, indicating that the distributions of the scalar measures are significantly different between zones 1 and 2. The KS tests applied to Zonation 2 provide similar results (Table 5).

Similarly, Table 6 shows the comparison between pairs of the three zones for Zonation 3. The analysis confirms that zones 1 and 2 display significantly different distributions in relation to the scalar measures under consideration. The same holds true for zones 2 and 3. In contrast, zones 1 and 3 do not exhibit such differences; this suggests that these two zones of Zonation 3 might be better combined into one, as in Zonation 2, rather than kept separate. In conclusion, Zonation 2 is preferred due to its broader coverage of the study region compared to Zonation 1, although Zonation 1 has proven equally valid in this analysis.

Table 4Zonation 1: output of the KS tests comparing the distributions of scalar measure values in its zones 1 and 2, whose sample sizes are 115 and 67, respectively.

Download Print Version | Download XLSX

Table 5Zonation 2: output of the KS tests comparing the distributions of scalar measure values in its zones 1 and 2, whose sample sizes are 155 and 67, respectively.

Download Print Version | Download XLSX

Table 6Zonation 3: output of the KS tests comparing the distributions of scalar measure values for all pairwise combinations of its zones 1, 2, and 3, with sample sizes 72, 67, and 83, respectively.

Download Print Version | Download XLSX

6 Conclusions

In this work the seismic clusters of southern and south-eastern Spain are identified through the NN algorithm. Then, under the formalism of graph theory, the relations between the events of the clusters are used to both represent the cluster structures and compute the centrality measures, i.e. the outdegree centralisation, the closeness centralisation and the average leaf depth. These measures are useful towards the understanding of the tectonic behaviour and classification of the regions according to the complexity of the clustered seismicity (Peresan and Gentili2018, 2020; Talebi et al.2024).

Three different models for the zonation have been proposed based on the Spanish ZESIS tectonic zonation units as building blocks. According to the KS test, two out of the three proposed zonations (Zonation 1 and Zonation 2) reveal statistically significant differences between zones with respect to clustering characteristics. In other words, for each of the two zonations, the distributions of the considered scalar measures differ across zones. Given the results exposed in the previous section and the fact that Zonation 2 covers most of the investigated region and includes more events, and hence increases the statistics for clusters and background seismicity investigations, we consider it as the preferred option.

The eastern-most zone is characterised by prevailing burst-like structure for the clusters related to the external zone tectonic setting. The western-most zone, instead, exhibits a prevalent chain-like cluster structure (with relatively high complexity) that could be related to the internal zone tectonic setting, which is characterised by a dense fault system and where swarms are prone to happen.

The study shows that the clustering properties could help redefine the seismic zonation, particularly if a declustered catalogue is to be used in the seismic hazard assessment, as the regions that share the clustering properties might have a common background seismic activity rate due to their underlying tectonic setting.

Although the focus of this work is set on the study of the cluster structure properties and the classification of regions using the aforementioned complexity measures, it is worth noting that several studies have related the regional heat flow and fluid balance to swarm behaviour in seismicity and the occurrence of high-magnitude earthquakes. For instance, Martínez‐Garzón et al. (2018) found a positive correlation between the increase of geothermal activity (fluid balance) with the number of aftershocks in seismic series. Other works such as the one from Papadakis et al. (2016) relate relatively high heat-flow values with strong earthquakes with focal depths lower than 40 km in Greece. In the area of study, Luque-Espinar and Mateos (2023) found notable changes in the geochemistry of thermal waters during the 2020–2021 Granada swarm including temperature changes. They highlighted the role of the variations in the SO4 concentration as precursor; this signal increased during the seismic sequence peaking before the highest-magnitude earthquakes occurred.

Future work could explore the use of these zonations to assess the seismic hazard by computing peak ground acceleration for the areas that share similar clustering properties. These results should then be compared with those existing in the literature to see whether they better represent the seismicity in each region. In addition, the possible temporal changes in background seismicity rates within the identified zones could be explored, following (Benali et al.2020), so as to develop time-dependent hazard maps.

Appendix A: Homogenisation of the Spanish seismic catalogue

Regarding the homogenisation process, which can also affect the results, there are more than six types of magnitudes that can be found when the catalogue is downloaded after issuing a query in the IGN database web page: https://www.ign.es/web/ign/portal/sis-catalogo-terremotos (last access: 25 September 2025) (IGN2022). Each local magnitude scale (https://www.ign.es/web/resources/docs/IGNCnig/SIS-Tipo-Magnitud.pdf, last access: 22 May 2025) has a linear function that relates it to the moment magnitude (Mw) scale and a range of application (minimum and maximum local magnitude scale value) as seen in the last update of the Spanish Seismic Hazard Map (IGN-UPM Working Group2013). Table A1 summarises the equations for the four intensity/magnitude scales that appear in the area of study.

Table A1Homogenisation equation for magnitude types present in the area of study. Obtained from IGN-UPM Working Group (2013, pp. 28–29).

Download Print Version | Download XLSX

For this work, the equations listed in Table A1 were used to convert the local magnitude and intensity measures into moment magnitude. As for the magnitude range of the conversions, we used the one in the corresponding column given that the ranges are narrow for some of the conversion functions and following them would have resulted in a bias of the completeness magnitudes for the most recent periods.

The IGN-UPM Working Group (2013, pp. 28–29) supplies the histograms with the difference between the computed magnitudes and the supplied local magnitudes, and it can be seen that, for types 2 and 4, these differences are centred around 0. This is not the case for type 3, which also shows a higher uncertainty. Plotting the conversion function in Fig. A1 allows us to see how type 2 and 4 magnitudes do not deviate greatly from the Mw. This is not the case for type 3, as the slope is higher for this conversion function.

https://nhess.copernicus.org/articles/25/3853/2025/nhess-25-3853-2025-f18

Figure A1Example of application of the type 1, 2, 3, and 4 conversion to Mw from local energy measure. It can be seen that the type 3 slope is higher, so extrapolating this function to a wider range would result in high uncertainty. The red-coloured section indicate the measure range as indicated in Table A1. The uncertainty in the conversion is shown as a coloured fill around the lines.

Download

Figure A2 shows the event distribution for the whole catalogue depending on their magnitude type before conversion. Following the referee's question we have decided to add a new section in the Appendix, including this figure, so as to highlight the importance of the homogenisation step in the catalogue preparation. Type 5 comprises those events already in Mw in the catalogue.

https://nhess.copernicus.org/articles/25/3853/2025/nhess-25-3853-2025-f19

Figure A2Magnitude type distribution for the catalogue before conversion to Mw. A magnitude equal to −1 has been assigned to those events with only intensity values in order to differentiate them.

Download

Appendix B: Scale parameter optimisation

In order to check the influence of these parameters in the cluster analysis, a variation in these has been used as input in the NN method. Table B1 lists the set of parameters used in this work.

Table B1Optimal sets of parameters to be used in further cluster structure analysis.

Download Print Version | Download XLSX

We use different statistics to evaluate the influence of these parameters on the results. For instance, regarding the spatial distribution of the events identified as a member of a cluster, the z score for the parametric distribution of the distances from the rest of the events of the cluster to the mainshock has been computed:

(B1) z = d i k - d ̃ i k IQR ,

where dik is the distance from the event i to the mainshock of the cluster k, d̃ik is the median of these distances and IQR is the interquartile range.

When a z score is higher than or equal to 2, that event is regarded as a spatial anomaly.

In the case of the time distribution of the events, we have considered that, when 6 months or more have passed from one event to the next one in the cluster, the latter event and those after it are time anomalies. This approach is taken for those events with magnitudes lower than Mw 4.0.

With these definitions the parameter influence on the results will be evaluated in a preliminary analysis by computing the number of spatial and time anomalies in each cluster. The initial parameter configuration will correspond with the “Computed value” column in Table B1.

B1 Influence of the b value

Figure B1a shows that the influence of the b value in the total number of both time and spatial anomalies is not clear. However, in Fig. B1b it can be seen that, the higher the b value, the higher the maximum number of anomalies in a cluster.

B2 Influence of the completeness magnitude

Figure B2a and b show that, in general, both the anomalies and maximum number of anomalies per cluster decrease with increasing completeness magnitude. This could be related to the fact that an increasing completeness magnitude effectively means fewer events in the catalogue.

B3 Influence of the fractal dimension

As in the case of the b value it can be seen in Fig. B3a that the number of clusters with spatial and time anomalies does not change significantly with the fractal dimension for the considered range. Nevertheless, the maximum number of spatial anomalies (Fig. B3b) rises with the fractal dimension. This is directly related with Eq. (1) and the physical meaning of this parameter. A fractal dimension closer to 2 would mean the structure approaches to covering the whole 2D surface, which in turn involves greater spatial dispersion around the mainshocks.

Finally, the minimum distance to be considered in the NN distance algorithm will be analysed. In principle, the only constraint on this parameter is the epicentral uncertainty in the catalogue. This uncertainty has been already studied for some of the periods the catalogue covers (González2017) so it can be bounded by 10 km as the highest value and 2 km as the lowest value (computed as the mean epicentral error of the catalogue from 2000 onwards). The tendency is not clear for the time anomalies as seen in Fig. B4a and b, but it seems to be optimal for 7.5 km in the case of the maximum spatial anomalies in Fig. B4b.

https://nhess.copernicus.org/articles/25/3853/2025/nhess-25-3853-2025-f20

Figure B1(a) Clusters with spatial and time anomalies for different b values and (b) maximum spatial and time anomalies per cluster for different b values.

Download

https://nhess.copernicus.org/articles/25/3853/2025/nhess-25-3853-2025-f21

Figure B2(a) Clusters with spatial and time anomalies for different completeness magnitudes and (b) maximum spatial and time anomalies per cluster for different completeness magnitudes.

Download

https://nhess.copernicus.org/articles/25/3853/2025/nhess-25-3853-2025-f22

Figure B3(a) Clusters with spatial and time anomalies for different completeness magnitudes and (b) maximum spatial and time anomalies per cluster for different completeness magnitudes.

Download

https://nhess.copernicus.org/articles/25/3853/2025/nhess-25-3853-2025-f23

Figure B4(a) Clusters with spatial and time anomalies for different minimum neighbour distances and (b) maximum spatial and time anomalies per cluster for different minimum neighbour distances.

Download

Appendix C: Anomaly study for the main set of parameters

Figures C1 and C2 summarise the main anomaly analysis for these sets of parameters. Both sets of parameters minimise the number of clusters with time anomalies, although there is still one cluster with 10 or more anomalous events. As for the spatial anomalies, most of the clusters with spatial anomalies have only one anomaly.

https://nhess.copernicus.org/articles/25/3853/2025/nhess-25-3853-2025-f24

Figure C1Anomaly analysis for the clusters identified with NN algorithm using the following set of parameters: b value, 1.0; completeness magnitude, Mw 3.2; fractal dimension, 1.5; minimum neighbour distance, 7.5 km. The red dots represent the clusters with four or more events.

Download

https://nhess.copernicus.org/articles/25/3853/2025/nhess-25-3853-2025-f25

Figure C2Anomaly analysis for the clusters identified with NN algorithm using the following set of parameters: b value, 1.0; completeness magnitude, Mw 3.0; fractal dimension, 1.5; minimum neighbour distance, 7.5 km. The red dots represent the clusters with four or more events.

Download

Appendix D: Anomaly study for the critical threshold, η0

Figures D1 and D2 compare the spatial and time anomalies in the clusters when using the fixed η0 value of −4.5 for the two main set of parameters: (1) b value, 1.0; completeness magnitude, Mw 3.2; fractal dimension, 1.5; and minimum neighbour distance, 7.5 km; (2) b value, 1.0; completeness magnitude, Mw 3.0; fractal dimension, 1.5; and minimum neighbour distance, 7.5 km. These results should be compared with those in the section Appendix C as they were obtained by using a free η0 value (η0=-3.4 for the first set of parameters and η0=-3.5 for the second one).

https://nhess.copernicus.org/articles/25/3853/2025/nhess-25-3853-2025-f26

Figure D1Anomaly analysis for the clusters identified with NN algorithm using the following set of parameters: b value, 1.0; completeness magnitude, Mw 3.2; fractal dimension, 1.5; minimum neighbour distance, 7.5 km; critical threshold, η0, −4.5. The red dots represent the clusters with four or more events.

Download

https://nhess.copernicus.org/articles/25/3853/2025/nhess-25-3853-2025-f27

Figure D2Anomaly analysis for the clusters identified with NN algorithm using the following set of parameters: b value, 1.0; completeness magnitude, Mw 3.0; fractal dimension, 1.5; minimum neighbour distance, 7.5 km; and critical threshold, η0, −4.5. The red dots represent the clusters with four or more events.

Download

Code availability

The code used in this study is available upon reasonable request.

Data availability

The catalogue used in this work and the catalogue with the parents, founders, and members of the clusters for completeness magnitudes 3.2 and 2.1 can be found in the following Zenodo repository: https://doi.org/10.5281/zenodo.17234389 (last access: 30 September 2025).

Author contributions

Conceptualisation and original idea: AP and DML; methodology: DML, AP, EV, and SM. EV wrote the code regarding the clustering properties, AP wrote the code for the statistical testing, and DML wrote the code for the fractal dimension computation (box counting), spatial and temporal anomaly checking, and catalogue analysis and plotting. DML, AP, and SM performed the data curation. Writing – original draft: DML, SM, AP, and EV; writing – review and editing: DML, SM, AP, and EV. All of the authors have read and agreed to the published version of the paper.

Competing interests

The contact author has declared that none of the authors has any competing interests.

Disclaimer

Publisher’s note: Copernicus Publications remains neutral with regard to jurisdictional claims made in the text, published maps, institutional affiliations, or any other geographical representation in this paper. While Copernicus Publications makes every effort to include appropriate place names, the final responsibility lies with the authors. Views expressed in the text are those of the authors and do not necessarily reflect the views of the publisher.

Acknowledgements

The authors thank the anonymous referee and Patricia Martínez-Garzón for their insight, as it helped to improve this paper. Antonella Peresan benefited from financial support from the RETURN Extended Partnership (Next-Generation EU).

Financial support

This research has been supported by the Ministerio de Ciencia e Innovación (grant no. PID2021-123135OB-C21) and the Conselleria de Innovación, Universidades, Ciencia y Sociedad Digital, Generalitat Valenciana (grant no. CIAICO/2022/038).

Review statement

This paper was edited by Filippos Vallianatos and reviewed by Patricia Martínez-Garzón and one anonymous referee.

References

Baiesi, M. and Paczuski, M.: Scale-free networks of earthquakes and aftershocks, Phys. Rev. E, 69, 066106, https://doi.org/10.1103/PhysRevE.69.066106, 2004. a

Bavelas, A.: Communication Patterns in Task‐Oriented Groups, J. Acoust. Soc. Am., 22, 725–730, https://doi.org/10.1121/1.1906679, 1950. a

Benali, A., Peresan, A., Varini, E., and Talbi, A.: Modelling background seismicity components identified by nearest neighbour and stochastic declustering approaches: the case of Northeastern Italy, Stoch. Env. Res. Risk A., 34, 775–791, https://doi.org/10.1007/s00477-020-01798-w, 2020. a

Buforn, E., Sanz de Galdeano, C., and Udías, A.: Seismotectonics of the Ibero-Maghrebian region, Tectonophysics, 248, 247–261, https://doi.org/10.1016/0040-1951(94)00276-F, 1995. a

García-Mayordomo, J.: Modelo de Zonas Sismogénicas para el Cálculo de la Peligrosidad Sísmica en España, no. 5 in Riesgos geológicos/geotecnica, Instituto Geológico y Minero de España (IGME), C/ La Caleram 1 Tres Cantos – 28760 Madrid, 1 edn., ISBN 978-84-7840-964-8, http://hdl.handle.net/10261/273473, 2015a. a, b, c

García-Mayordomo, J.: Creación de un modelo de zonas sismogénicas para el cálculo del Mapa de Peligrosidad Sísmica de España, Instituto Geológico y Minero de España (IGME), https://digital.csic.es/handle/10261/273473, 2015b. a, b, c

García-Mayordomo, J., Insua-Arévalo, J., Martínez-Díaz, J., Perea, H., Álvarez Gómez, J., Martín-González, F., González, A., Lafuente, P., Pérez-López, R., Rodríguez-Pascua, M., Giner-Robles, J., Azañón, J., Masana, E., and Moreno, X.: Modelo integral de zonas sismogénicas de España, in: Resúmenes de la 1a Reunión Ibérica sobre Fallas Activas y Paleosismología Sigüenza, España (2010), 193–196, http://hdl.handle.net/10261/82470, 2010. a

García-Mayordomo, J., Insua-Arévalo, J. M., Martínez-Díaz, J. J., Jiménez-Díaz, A., Martín-Banda, R., Martín-Alfageme, S., Álvarez Gómez, J. A., Rodríguez-Peces, M., Pérez-López, R., Rodríguez-Pascua, M. A., Masana, E., Perea, H., Martín-González, F., Giner-Robles, J., Nemser, E. S., Cabral, J., and QAFI compilers: The Quaternary Active Faults Database of Iberia (QAFI v.2.0), J. Iber. Geol., 38, 285–302, https://doi.org/10.5209/rev_JIGE.2012.v38.n1.39219, 2012. a, b, c, d, e, f, g, h, i

Gardner, J. K. and Knopoff, L.: Is the sequence of earthquakes in Southern California, with aftershocks removed, Poissonian?, B. Seismol. Soc. Am., 64, 1363–1367, https://doi.org/10.1785/BSSA0640051363, 1974. a

Gaspar-Escribano, J. M., Rivas-Medina, A., Parra, H., Cabañas, L., Benito, B., Ruiz-Barajas, S., and Martínez-Solares, J. M.: Uncertainty assessment for the seismic hazard map of Spain, Eng. Geol., 199, 62–73, https://doi.org/10.1016/j.enggeo.2015.10.001, 2015. a, b, c

González, Á.: The Spanish National Earthquake Catalogue: Evolution, precision and completeness, J. Seismol., 21, 435–471, https://doi.org/10.1007/s10950-016-9610-8, 2017. a, b, c, d

Gutenberg, B. and Ritcher, C. F.: Frequency of earthquakes in California, B. Seismol. Soc. Am., 34, 185–188, 1944. a

IGME: ZESIS: Base de Datos de Zonas Sismogénicas de la Península Ibérica y territorios de influencia para el cálculo de la peligrosidad sísmica en España, Insituto Geológico y Minero de España (IGME), https://info.igme.es/zesis (last access: 5 February 2025), 2015. a, b, c, d, e

IGME: QAFI: Quaternary Active Faults Database of Iberia, Instituto Geológico y Minero de España (IGME), https://info.igme.es/QAFI (last access: 5 February 2025), 2022. a, b, c, d, e, f

IGN: Catálogo de terremotos, Instituto Geográfico Nacional (IGN) [data set], https://doi.org/10.7419/162.03.2022, 2022. a, b

IGN: Descripción del tipo de Magnitud [in Spanish], Instituto Geográfico Nacional (IGN), https://www.ign.es/web/resources/docs/IGNCnig/SIS-Tipo-Magnitud.pdf, (last access: 22 May 2025), 2025. a

IGN-UPM Working Group: Actualización de mapas de peligrosidad sísmica de España 2012, Centro Nacional de Información Geográfica, https://www.ign.es/resources/acercaDe/libDigPub/ActualizacionMapasPeligrosidadSismica2012.pdf (last access: 25 September 2025), 2013. a, b, c, d, e

Kharazian, A., Molina, S., Galiana-Merino, J. J., and Agea-Medina, N.: Risk-targeted hazard maps for Spain, B. Earthq. Eng., 19, 5369–5389, https://doi.org/10.1007/s10518-021-01189-8, 2021. a

Luque-Espinar, J. A. and Mateos, R. M.: Hydrochemical changes in thermal waters related to seismic activity: The case of the 2020–2021 seismic sequence in Granada (S Spain), J. Hydrol., 627, 130390, https://doi.org/10.1016/j.jhydrol.2023.130390, 2023. a

López-Casado, C., Peláez-Montilla, J., and Henares-Romero, J.: Sismicidad en la Cuenca de Granada, in: La cuenca de Granada: estructura, tectónica activa, sismicidad, geomorfología y dataciones existentes, edited by: Sanz de Galdeano, C., Peláez-Montilla, J., and López-Garrido, A. C., 218, Granada, ISBN 8469955616, 2001. a

Martínez‐Garzón, P., Zaliapin, I., Ben‐Zion, Y., Kwiatek, G., and Bohnhoff, M.: Comparative Study of Earthquake Clustering in Relation to Hydraulic Activities at Geothermal Fields in California, J. Geophys. Res.-Sol. Ea., 123, 4041–4062, https://doi.org/10.1029/2017JB014972, 2018. a, b

Molina, S., Navarro, M., Martínez-Pagán, P., Pérez-Cuevas, J., Vidal, F., Navarro, D., and Agea-Medina, N.: Potential damage and losses in a repeat of the 1910 Adra (Southern Spain) earthquake, Nat. Hazards, 92, 1547–1571, https://doi.org/10.1007/s11069-018-3263-6, 2018. a

Ogata, Y.: Space-Time Point-Process Models for Earthquake Occurrences, Ann. I. Stat. Math., 50, 379–402, https://doi.org/10.1023/A:1003403601725, 1998. a

Papadakis, G., Vallianatos, F., and Sammonds, P.: Non-extensive statistical physics applied to heat flow and the earthquake frequency–magnitude distribution in Greece, Physica A, 456, 135–144, https://doi.org/10.1016/j.physa.2016.03.022, 2016. a

Peresan, A. and Gentili, S.: Seismic clusters analysis in Northeastern Italy by the nearest-neighbor approach, Phys. Earth Planet. In., 274, 87–104, https://doi.org/10.1016/j.pepi.2017.11.007, 2018. a

Peresan, A. and Gentili, S.: Identification and characterisation of earthquake clusters: a comparative analysis for selected sequences in Italy and adjacent regions, B. Geofis. Teor. Appl., 61, 57–80, https://doi.org/10.4430/bgta0249, 2020. a

Reasenberg, P. A.: Second-order moment of central California seismicity, 1969–1982, J. Geophys. Res.-Sol. Ea., 90, 5479–5495, https://doi.org/10.1029/JB090iB07p05479, 1985. a

Reasenberg, P. A. and Jones, L. M.: Earthquake Hazard After a Mainshock in California, Science, 243, 1173–1176, https://doi.org/10.1126/science.243.4895.1173, 1989. a

Rohatgi, V. K. and Ehsanes Saleh, A. K. M.: An Introduction to Probability and Statistics, John Wiley & Sons, Inc., ISBN 9781118799642, https://doi.org/10.1002/9781118799635, 2015. a

Saccorotti, G., Carmona, E., Ibàñez, J. M., and Del Pezzo, E.: Spatial characterization of Agron, southern Spain, 1988–1989 seismic series, Phys. Earth Planet. In., 129, 13–29, https://doi.org/10.1016/S0031-9201(01)00203-5, 2002. a

Stich, D., Morales, J., López-Comino, J. Á., Araque-Pérez, C., Azañón, J. M., Dengra, M. Á., Ruiz, M., and Weber, M.: Seismogenic structures and active creep in the Granada Basin (S-Spain), Tectonophysics, 882, 230368, https://doi.org/10.1016/j.tecto.2024.230368, 2024. a

Talebi, M., Zare, M., and Peresan, A.: Quantifying the features of earthquake clusters in north-Central Iran, based on nearest-neighbor distances and network analysis, Phys. Earth Planet. In., 353, 107215, https://doi.org/10.1016/j.pepi.2024.107215, 2024. a, b

Uhrhammer, R. A.: Characteristics of northern and southern California seismicity, Earthquake Notes, 57, 21–37, 1986. a

van Stiphout, T., Zhuang, J., and Marsan, D.: Seismicity declustering, in: Community Online Resource for Statistical Seismicity Analysis (CORSSA), CORSSA, 1–25, https://doi.org/10.5078/corssa-52382934, 2012. a

Varini, E., Peresan, A., and Zhuang, J.: Topological Comparison Between the Stochastic and the Nearest-Neighbor Earthquake Declustering Methods Through Network Analysis, J. Geophys. Res.-Sol. Ea., 125, e2020JB019718, https://doi.org/10.1029/2020JB019718, 2020. a

Vidal-Sánchez, F.: Terremotos relevantes y su impacto en Andalucía., in: Curso sobre Prevención Sísmica-Instituto Andaluz de Geofísica, 24, Universidad de Granada, https://raco.cat/index.php/AnuariVerdaguer/article/view/262467, 1993. a

Wiemer, S.: A Software Package to Analyze Seismicity: ZMAP, Seismol. Res. Lett., 72, 373–382, https://doi.org/10.1785/gssrl.72.3.373, 2001. a

Zaliapin, I. and Ben-Zion, Y.: Earthquake clusters in southern California I: Identification and stability, J. Geophys. Res.-Sol. Ea., 118, 2847–2864, https://doi.org/10.1002/jgrb.50179, 2013a. a, b, c

Zaliapin, I. and Ben-Zion, Y.: Earthquake clusters in southern California II: Classification and relation to physical properties of the crust, J. Geophys. Res.-Sol. Ea., 118, 2865–2877, https://doi.org/10.1002/jgrb.50178, 2013b. a, b, c

Zaliapin, I. and Ben-Zion, Y.: A global classification and characterization of earthquake clusters, Geophys. J. Int., 207, 608–634, https://doi.org/10.1093/gji/ggw300, 2016. a

Zaliapin, I. and Ben-Zion, Y.: Earthquake Declustering Using the Nearest-Neighbor Approach in Space-Time-Magnitude Domain, J. Geophys. Res.-Sol. Ea., 125, e2018JB017120, https://doi.org/10.1029/2018JB017120, 2020. a, b, c

Zaliapin, I., Gabrielov, A., Keilis-Borok, V., and Wong, H.: Clustering Analysis of Seismicity and Aftershock Identification, Phys. Rev. Lett., 101, 018501, https://doi.org/10.1103/PhysRevLett.101.018501, 2008. a, b, c

Zhuang, J.: Second-Order Residual Analysis of Spatiotemporal Point Processes and Applications in Model Evaluation, J. Roy. Stat. Soc. B, 68, 635–653, https://doi.org/10.1111/j.1467-9868.2006.00559.x, 2006. a

Zhuang, J., Ogata, Y., and Vere-Jones, D.: Stochastic Declustering of Space-Time Earthquake Occurrences, J. Am. Stat. Assoc., 97, 369–380, https://doi.org/10.1198/016214502760046925, 2002. a

Download
Short summary
South and south-eastern Spain has the highest seismicity in the country, but inconsistent fault data limit its use in seismic hazard assessment. This study applies the nearest-neighbour (NN) algorithm and graph theory to analyse clustering patterns. Two regions (western and eastern) with higher and lower (respectively) clustering complexities are identified. The results suggest alternative seismic zonation models, which could improve seismic hazard assessment.
Share
Altmetrics
Final-revised paper
Preprint