Appraising the Early-est earthquake monitoring system for tsunami alerting at the Italian Candidate Tsunami Service Provider

In this paper we present and discuss the performance of the procedure for earthquake location and characterization implemented in the Italian Candidate Tsunami Service Provider at the Istituto Nazionale di Geofisica e Vulcanologia (INGV) in Rome. Following the ICG/NEAMTWS guidelines, the first tsunami warning messages are based only on seismic information, i.e., epicenter location, hypocenter depth, and magnitude, which are automatically computed by the software Early-est. Early-est is a package for rapid location and seismic/tsunamigenic characterization of earthquakes. The Early-est software package operates using offline-event or continuous-real-time seismic waveform data to perform trace processing and picking, and, at a regular report interval, phase association, event detection, hypocenter location, and event characterization. Early-est also provides mb, Mwp, and Mwpd magnitude estimations. mb magnitudes are preferred for events with Mwp.5.8, while Mwpd estimations are valid for events with Mwp&7.2. In this paper we present the earthquake parameters computed by Early-est between the beginning of March 2012 and the end of December 2014 on a global scale for events with magnitude M ≥ 5.5, and we also present the detection timeline. We compare the earthquake parameters automatically computed by Early-est with the same parameters listed in reference catalogs. Such reference catalogs are manually revised/verified by scientists. The goal of this work is to test the accuracy and reliability of the fully automatic locations provided by Early-est. In our analysis, the epicenter location, hypocenter depth and magnitude parameters do not differ significantly from the values in the reference catalogs. Both mb and Mwp magnitudes show differences to the reference catalogs. We thus derived correction functions in order to minimize the differences and correct biases between our values and the ones from the reference catalogs. Correction of theMwp distance dependency is particularly relevant, since this magnitude refers to the larger and probably tsunamigenic earthquakes. Mwp values at stations with epicentral distance 1.30 are significantly overestimated with respect to the CMT-global solutions, whereas Mwp values at stations with epicentral distance 1&90 are slightly underestimated. After applying such distance correction the Mwp provided by Early-est differs from CMT-global catalog values of about δMwp ≈ 0.0∓ 0.2. Early-est continuously acquires time-series data and updates the earthquake source parameters. Our analysis shows that the epicenter coordinates and the magnitude values converge within less than 10 min (5 min in the Mediterranean region) toward the stable values. Our analysis shows that we can compute Mwp magnitudes that do not display short epicentral distance dependency overestimation, and we can provide robust and reliable earthquake source parameters to compile tsunami warning messages within less than 15 min after the event origin time.

mb, M wp , and M wpd magnitude estimations.mb magnitudes are preferred for events with M wp 5.8, while M wpd estimations are valid for events with M wp 7.2.In this paper we present the earthquake parameters computed by Early-est between the beginning of March 2012 and the end of December 2014 on a global scale for events with magnitude M ≥ 5.5, and we also present the detection timeline.We compare the earthquake parameters automatically computed by Early-est with the same parameters listed in reference catalogs.Such reference catalogs are manually revised/verified by scientists.The goal of this work is to test the accuracy and reliability of the fully automatic locations provided by Early-est.In our analysis, the epicenter location, hypocenter depth and magnitude parameters do not differ significantly from the values in the reference catalogs.Both mb and M wp magnitudes show differences to the reference catalogs.We thus derived correction functions in order to minimize the differences and correct biases between our values and the ones from the reference catalogs.Correction of the M wp distance dependency is particularly relevant, since this magnitude refers to the larger and probably tsunamigenic earthquakes.M wp values at stations with epicentral distance 30 • are significantly overestimated with respect to the CMT-global solutions, whereas M wp values at stations with epicentral distance 90 • are slightly underestimated.After applying such distance correction the M wp provided by Early-est differs from CMT-global catalog values of about δM wp ≈ 0.0 ∓ 0.2.Early-est continuously acquires time-series data and updates the earthquake source parameters.Our analysis shows that the epicenter coordinates and the magnitude values converge within less than 10 min (5 min in the Mediterranean region) toward the stable values.Our analysis shows that we can compute M wp magnitudes that do not display short epicentral distance dependency overestimation, and we can provide robust and reliable earthquake source parameters to compile tsunami warning messages within less than 15 min after the event origin time.

Introduction
Tsunamis may produce dangerous coastal flooding and inundations accompanied by powerful currents which can cause significant damage and casualties.A tsunami may be generated when a large or great earthquake occurs in oceans or inland close to the coast.When such earthquakes occur, a tsunami warning should be issued to alert national authorities and emergency management officials to take action for the entire tsunami hazard zone, such as evacuating the pop-Published by Copernicus Publications on behalf of the European Geosciences Union.
F. Bernardi: Early-est performance ulation or securing critical facilities such as nuclear power plants.With advance evacuation plans and well-informed communities, tsunami warnings could also be sent directly to the population.
Reliable tsunami warnings should be disseminated as fast as possible in order to also be effective for the coastal areas very close to the earthquake source, since a tsunami may arrive at these areas within the first few minutes after the event origin time.Populations exposed to tsunami hazards in the field near to the source, however, should be aware that the time between warning issuance and tsunami impact may be too short to escape the tsunami; warning may arrive even after the tsunami, or the system may be subject to failure for several reasons.Hence, the population should know how to self-evacuate relying on natural warnings when they are present, such as strong and/or unusually long shaking, ocean withdrawal, an anomalously rising tide, roaring sounds from the ocean, etc.
To provide the earliest possible alerts, initial warnings from regional tsunami warning systems are normally only based on seismic information.Thus, fast, precise, and reliable earthquake source parameters like epicenter coordinates, hypocenter depth, and magnitude are crucial for seismologically based tsunami early warning procedures.This is particularly important in the Mediterranean Sea, where the tsunami wave travel times between source regions and coastlines are short and dedicated deep-sea instruments, such as DART ® buoys (http://nctr.pmel.noaa.gov/Dart/), to be in place.
The Istituto Nazionale di Geofisica e Vulcanologia (INGV) in Italy is a Candidate Tsunami Service Provider (CTSP) in the framework of ICG/NEAMTWS (NEAMTWS, 2011), which is the tsunami early warning and mitigation system established by IOC/UNESCO for the northeastern Atlantic, the Mediterranean and connected seas.For this reason, the Centro Allerta Tsunami (CAT) (Italian for "tsunami alert center"), was established at the INGV headquarter in Rome at the end of 2013.The CAT mission is to implement and maintain a 24/7 service alongside the ordinary seismic surveillance of the national territory, and to work towards a probabilistic seismic hazard assessment (PSHA) for the Italian coasts, that is a tsunami hazard map for seismically induced tsunamis (Basili et al., 2013).CAT-INGV started operations on a 24/7 basis as a CTSP in October 2014.Monthly communication tests are performed with national authorities, subscriber IOC member states, and other institutions, such as the DG-ECHO Emergency Response Coordination Center in Brussels.In the NEAM region there are three other CTSPs in operation: CENALT in France, NOA in Greece, and KOERI in Turkey.IPMA, in Portugal, should begin operations soon.Each of these CTSPs has its specific competence source areas within the NEAM region.
At the national level, INGV is responsible for issuing messages to the Civil Protection authority, which is presently responsible for alert dissemination.INGV also maintains the national seismic network and exchanges seismic data in real time with a number of international seismic data providers.The Istituto Superiore per la Protezione e Ricerca Ambientale (ISPRA) maintains the national sea level network and provides real-time data to the INGV monitoring room.The implemented tsunami warning procedure uses the Earlyest software developed by Lomax and Michelini (2009a, b, 2011, 2012) to rapidly detect, locate, and determine the magnitude for large to great regional and teleseismic earthquakes.
The purpose of this paper is to analyze the performance of Early-est regarding past events, in order to evaluate its reliability for the near-real-time tsunami warnings disseminated by the INGV, and eventually tune the procedure as a whole.
INGV CTSP follows the ICG/NEAMTWS guidelines.ICG/NEAMTWS rules establish that a CTSP must disseminate a tsunami message, with warning levels that depend on location, magnitude, and depth of the earthquake according to a decision matrix, for all earthquakes with magnitudes M ≥ 5.5 in their zone of competence.Messages are sent for earthquakes that are large and shallow enough, and which occur in sea areas or inland but are sufficiently close to the coast to possibly generate a tsunami.INGV is responsible for the earthquake and tsunami source zone extending from the Gibraltar Strait in the west, to Marmara and Levantine seas to the east.
The seismicity in the Mediterranean region is moderate to high but also includes M = 8+ earthquakes that occurred in the past and generated significant tsunamis (Maramai et al., 2014;Lorito et al., 2015).It is difficult to assess if M = 9class earthquakes might occur, and these can not be excluded (Kagan and Jackson, 2013).Even if tsunamigenic earthquakes are likely to occur, their time recurrence intervals are however quite long (Koravos et al., 2003;Jenny et al., 2004;Bungum and Lindholm, 2007); moreover, the Mediterranean Sea is a relatively small area, and earthquakes with M ≥ 5.5 do not occur very frequently.The Global CMT catalogs (Dziewonski et al., 1981;Ekström et al., 2012) include about 125 earthquakes with M w ≥ 5.5 within the Mediterranean region, which implies an occurrence rate of ≈ 30 every 10 years.Early-est has now been running for several years, but only since the beginning of March 2012 has its current major version release been online and its solutions have been able to be systematically archived; thus we have few events to analyze for tuning our tsunami alert procedure (Table 1).For this reason, we perform our analysis using all earthquakes which have occurred worldwide and have been located by Early-est since March 2012.To perform the analysis and tune our procedure, we proceed by comparing the epicenters, the hypocenter depths, and the estimation of magnitudes provided fully automatically by Early-est with the same parameters provided by other agencies taken as a reference.Such agencies provide manually validated/revised locations and magnitude estimations for earthquakes on a global scale.
This paper is structured as follows: in the next section, we give a brief overview of the Early-est algorithm, in Sect. 3 we Table 1.List of earthquakes that occurred in the Mediterranean region located by Early-est with M ≥ 5.5 between March 2012 and December 2014.For each event we have listed the computed event origin time, epicenter coordinates, hypocenter depth, the maximum 68 % confidence error in xyz space (in kilometers), the preferred magnitude (mb, M wp or M wpd ), and a reference magnitude, i.e., when the first Early-est locations were available (in seconds) after the event origin time, and when the magnitudes stabilize (in minutes) after the first location was available.A magnitude is stable when the difference to the final magnitude is ≤ ∓0.2.describe the data set used in our analysis, and in the three sections following that, we then analyze and compare the earthquake source parameters provided by Early-est with the ones provided by the reference agencies; first the epicenter location (Sect.4), then the hypocenter depth (Sect.5), and lastly the magnitude (Sect.6).In Sect.7 we will analyze the speed performances of Early-est with respect to the location and the magnitude parameters, in order to set the timeline of our automatic tsunami warning procedure.Lastly, we present the discussions and conclusions.

Early-est algorithm description
Early-est is a software package for rapid location and seismic/tsunamigenic characterization of earthquakes.The Early-est software package operates using offline-event or continuous-real-time seismic waveform data to perform trace processing and picking, and, at a regular report interval, phase association, event detection, hypocenter location, and event characterization.This characterization (Table A1) includes mb and M wp magnitudes, the determination of apparent rupture duration, T 0 , large earthquake magnitude, M wpd , and the assessment of tsunamigenic potential using T d and T 50 Ex, as described in Lomax andMichelini (2009a, b, 2011).The Early-est program reads Mini-SEED data packets from a file or a SeedLink server (http://ds.iris.edu/ds/nodes/dmc/services/seedlink, http://www.seiscomp3.org/wiki, doc/applications/seedlink), and passes each packet to a trace-processing module.The program also runs an associate/locate-reporting module at regular reporting intervals (e.g., after all data are read by Mini-SEED; every 1 min for SeedLink).The Early-est software maintains a persistent pick list for the current reporting window (e.g., the last hour before real time) and an event list for a specified archive interval (e.g., the last 10 days).The pick list is updated continuously as picking and trace processing are applied to new data packets.The event list is updated at each reporting interval as new event locations are found or previous locations are deleted.At each reporting interval, the associate/locate module processes the current pick list from scratch, without making use of previous associations or location information from the event list; this memory-less procedure simplifies the associate/locate module and makes it very robust with respect to changes in the pick list, but increases the computational load.To reduce this load, the persistence of association and location information for well located events is currently being added to Early-est.

Trace-processing module
The trace-processing module processes each new data packet passed by the Early-est program.This processing includes channel identification, quality control, filtering for picking, picking, and further filtering and pre-processing as required for seismic and tsunamigenic event characterization (Table A1).Picking in Early-est is performed by FilterPicker (Lomax and Michelini, 2012;Vassallo et al., 2012), a general purpose, broadband, phase detector and picker which is applicable to real-time seismic monitoring and earthquake early warning.FilterPicker uses an efficient algorithm which operates stably on continuous, real-time, broadband signals, avoids excessive picking during large events, and produces onset timing, realistic timing uncertainty, onset polarity, and amplitude information.In practice, it operates on a predefined number of frequency bands by generating a set of bandpassed time series with different center frequencies.Characteristic functions are determined for each frequency band and a pick is declared if and when, within a window of predefined time width, the integral of the maximum of the characteristic functions exceeds a predefined threshold.
After picking for each new data packet, for each pick in the pick list for the current packet channel, the trace-processing module applies various analyses on the channel data and updates values needed for event characterization.Recursive, time-domain algorithms are used for all filtering and other time-series processing.

Associate/locate-reporting module
The Early-est associate/locate-reporting module runs an octree associate/locate module with the current pick list, and then the reporting module which determines event characterization results and generates graphical and alpha-numeric reporting output.The octree associate/locate module efficiently and robustly associates picks, and detects and locates seismic events over the whole Earth from 0 to 700 km depth using the efficient, nonlinearized, probabilistic and global, octree importance-sampling search (Lomax et al., 2001(Lomax et al., , 2009)).See Appendix A for more details.
The Early-est reporting module processes the current pick list and event list to determine event characterization results (Table A1) and generate graphical, alpha-numeric, XML, HTML, and other reporting output for events, picks, stations, etc.An e-mail or other alert message can be generated for each event with magnitudes or tsunamigenic potential that exceed preset thresholds.Figure A1 shows the main graphical display of Early-est, which summarizes the evolving trace processing, associate/locate module and event characterization results in real time.

Data set
The Early-est catalog (EEc in this paper) includes fully automatic and unrevised location and magnitude estimations for 5449 events from around the globe recorded at regional and teleseismic distance with magnitude M 5.0.The current major version release of Early-est has been running since the beginning of March 2012.Our analysis will use locations and magnitudes for events which occurred between the beginning of March 2012 and the end of December 2014.At the beginning of March 2012, Early-est was using about 300 seismic broadband stations.The number of stations has continuously been increasing, and at the end of September 2014 the Early-est software was using a virtual station network of 494 stations (Fig. 1).
We use the following as reference catalogs: (i) the catalog provided by GEOFON project of the Deutches GeoForschungsZentrum (Gc in this paper, http://geofon.gfz-potsdam.de/eqinfo/form.php);(ii) the catalog provided by the US National Earthquake Information Center (Nc in this paper); (iii) the catalog provided by the EMSC-CSEM (Cc in this paper) (http://www.emsc-csem.org/Earthquake),(Godey et al., 2007); (iv) the catalog provided by the Global CMT project (CMTc in this work) (Dziewonski et al., 1981;Ekström et al., 2012); (v) and the catalog provided by the Pacific Tsunami Warning Center (Pc in this paper) provided to the authors of this paper courtesy of Barry Hirshorn of the Pacific Tsunami Warning Center (http://ptwc.weather.gov).The CMTc and the Pc will be used specifically to compare and assess the M wp and M wpd magnitudes.
The above-mentioned observatories and centers provide manually verified and/or revised earthquakes source parameters for different time periods.Table 2 summarizes the abbreviations and time windows for each catalog used in this work.The ICG/NEAMTWS guidelines indicate that tsunami warning must be disseminated for all events in the Mediterranean and northeastern Atlantic regions with M ≥ 5.5.For this reason, although Early-est locates events with magnitude M 5.0, our analysis will focus only on worldwide earthquakes with magnitude M ≥ 5.5.

Epicenter location
In this section, we use the three reference catalogs Nc, Gc, and Cc, and the Early-est catalog EEc.
We first build three couples with the three reference catalogs (Gc-Cc, Cc-Nc and Gc-Nc) and we compute the distance between the epicenter coordinates for each earthquake listed in both catalogs for each couple.
The top panel in Fig. 2 shows the histograms representing the distributions of the location differences in each couple from the reference catalogs.The M ≥ 5. located with distance differences of δ ref 50 km.We did not find evidence for geographical and/or tectonic dependence of this uncertainty.
We then compare the epicenter coordinates between the earthquakes listed in the EEc and each of the three reference catalogs (Fig. 2, bottom panels), i.e., we build the couples EEc-Cc, EEc-Nc, and EEc-Gc.The histograms show that the epicenter location differences between the EEc and the reference catalogs δ EEc are similar to the differences plotted on the top panels.The mean location difference between the EEc and the reference catalogs is about δ EEc 20 ∓ 20 km and 95% of all events in the data set show differences δ EEc 45 km.
Generally our analysis showed that earthquakes with M ≥ 5.5 can be located by using seismic data from global networks, with an empirical uncertainty, defined as the mean location difference with respect to the reference catalogs, of about ν ≈ 20 ∓ 25 km.

Hypocenter depth
In this section we proceed as described in the section above: we use the three reference catalogs Nc, Gc, and Cc and the Early-est catalog EEc to build the catalog couples used in the previous section.We then compute the depth difference between the hypocenters for each earthquake listed in both catalogs of each couple.
Figure 3 (top panels) shows histograms that represent the distribution of the depth differences in each couple from the reference catalogs.The hypocenter depth estimation for earthquakes with magnitude M ≥ 5.5 listed in global catalogs is generally well resolved: the mean and standard deviations difference are δZ ref ≈ 0∓25 km for all catalog couples.We did not find evidence for geographical and/or tectonic dependence of these differences.
We then compare the hypocenter depths between the EEc and each of the three reference catalogs (Fig. 3 bottom panels; couples EEc-Cc, EEc-Nc, and EEc-Gc).The bottom panels show that the hypocenter depth estimation between the Early-est catalog and the reference catalogs do not differ significantly: the mean difference distributions are about δZ EEc ≈ 0 ∓ 30 km.Epicenter location difference distributions for the events listed in the reference and in the Early-est catalogs.The epicenter location difference is expressed in kilometers on the x axis.The y axis refers to the number of events for each bin; the bins are 5 km each.The top panels show the location difference between the locations of the three reference catalogs: Nc, Gc, and Cc.The bottom panels show the location difference between Early-est and the reference catalogs.The gray color scale indicates magnitude ranges as follows: dark gray 5.5 ≤ M < 6, middle dark gray 6.0 ≤ M < 6.5, middle light gray 6.5 ≤ M < 7.0 and light gray M ≥ 7.0.The mean and the standard deviation and the 95 % percentiles for the entire dates (i.e., regardless of the magnitude) are indicated on the top right of each panel.

F. Bernardi: Early-est performance
Generally, our analysis showed that hypocenter depth of earthquakes with M ≥ 5.5 can be precisely estimated when using seismic data from global networks, with an empirical uncertainty of about ν ≈ 00 ∓ 30 km.

Magnitude
Early-est provides three different types of magnitude: mb, M wp , and M wpd (Lomax and Michelini, 2011) and then automatically decides each minute which magnitude type is the most significant, following the rules in Table 3.The criteria to assign the best magnitude listed in Table 3 follow two simple principles: (i) a minimum number of observations is required to obtain reliable magnitude estimations, and (ii) magnitude types are reliable within magnitude ranges.Following Lomax andMichelini (2009a, b, 2011) we set the validity range 5.8 ≤ M wp < 7.2 for the best magnitude; mb is assigned to best magnitude when M wp < 5.8 and M wpd is assigned to best magnitude when M wp > 7.2 In this work we compare the Early-est magnitude types M wp and M wpd with the reference magnitude types M wp , and M w .Since the Table 3. Early-est computes mb, M wp , and M wpd .This table summarize the rules used by Early-est to define the best magnitude (i.e., the most significative magnitude type) between mb, M wp , and M wpd .The magnitude mb is computed using the 30 s time window or the apparent source duration T o as a time window when T o < 30 s and using the IASPEI WWSSN-SP response for convolution.The magnitude M wp is scaled to the largest of the first two maxima on integrated displacement within the window from t P to t P + T o time or 120 s after t P , where t P is the P -arrival time -whichever window is the shortest.The magnitude M wpd (duration-amplitude), which can be viewed as an extension of the M wp moment-magnitude algorithm, is computed following the M wp procedure and corrections described in Lomax and Michelini (2012).ICG/NEAMTWS guidelines prescribe that for earthquakes with depth Z ≤ 100 km, a standard general warning should only be delivered for events with M ≥ 5.5, and no action should be taken for smaller magnitudes, we only analyze the magnitude comparisons for events with Z ≤ 100 km in this section.
As in Sects.4 and 5 we first compare the magnitudes provided by the reference catalogs.Then, we compare the magnitudes provided by Early-est with the magnitudes listed in the reference catalogs.First we will compare all best magnitude (i.e., mb or M wp ) results, only considering couples between catalogs where the magnitude types are identical (Fig. 4).This comparison will provide a general overview on how the best magnitudes of Early-est match with the magnitudes from the reference catalogs.
Figure 4 shows the distribution of the magnitude differences δM EEc = M EEc − M ref between the values of the EEc and the ones from the reference catalogs.
When comparing the Early-est magnitudes with the magnitudes from the two reference catalogs (Fig. 4), Early-est seems to overestimate the magnitudes by about δM EEc ≈ 0.1 ∓ 0.2.The percentiles show that more than 10 % of the magnitudes provided by Early-est differ significantly from the magnitudes provided by the reference catalogs.The overestimation and the wider distribution appear to be homogeneously distributed among all magnitude ranges.
In the next subsections, we will analyze the magnitude values for each single magnitude type mb and M wp separately in more detail.

mb
In this subsection we compare the mb EE magnitudes provided by Early-est to the mb magnitudes provided by NEIC (mb Nc ) and EMSC (mb Cc ).We use the mb EEc only when Early-Est assigns the best magnitude to mb, following the rules of Table 3.
Figure 5 shows the mb EEc with respect to the mb Nc (top left panel) and with respect to the mb Cc (top right panel).These two plots show scattered and sparse distributed values, which are coherent with the magnitude differences of the histograms in Fig. 4c and d Differences are computed only when the same magnitude type is provided for the same event in the two compared catalogs.The magnitude difference is on the x axis.The y axis refers to the number of events for each bin; the bins are 0.1 magnitude each.The color scale refers to the same magnitude ranges as in Figs. 3 and 2, and not to the magnitude type.The gray color scale indicate magnitude ranges as follows: dark gray 5.5 ≤ M < 6, middle dark gray 6.0 ≤ M < 6.5, middle light gray 6.5 ≤ M < 7.0, and light gray M ≥ 7.0.The mean and the standard deviation and the 95 % percentiles for the entire dates (i.e., regardless of the magnitude) are indicated on the top left of each panel.
In order to correct such scattered and sparse distribution, we computed a linear regression function for each panel (thick dashed lines on the top panels).These functions are computed for f 1 = mb EEc → mb Nc and for f 2 = mb EEc → mb Cc .The constant a and b of the linear function are shown in the upper left corners of Fig. 4a and b.We then applied the regression functions f 1 and f 2 to the mb EEc values and we recompute the differences (third row of the histograms).Both new distributions have mean values close to 0 and smaller standard deviation and percentiles compared to the original ones.
The two functions appear similar but show different a and b constants.In order to test if such differences are significant, first we applied the function f 1 , derived for mb EEc  → mb Nc , and we computed the differences with respect to the mb Cc values.Secondly, we applied the function f 2 , derived for mb EE  → mb Cc , and we computed the residuals with respect to the mb Nc values.Applying these corrections, we obtain two new difference distributions: δmb EEc→Nc From the percentiles of the corrected distributions, particularly on the left side, we observe that when applied, the regression function f 1 produces a narrower magnitude difference distribution with respect to the function f 2 .
Generally, after applying the linear corrections, the resulting mb EE uncertainty (ν ≈ 0.00 ∓ 0.14), with respect to the reference catalogs, is coherent with the overall magnitude uncertainty between the two reference catalogs (Fig. 4, left panel).

M wp
As a reference, we first compare the magnitudes M Pc wp values provided by the Pacific Tsunami Warning Center (PTWC) using the correction of Whitmore et al. (2002) with the M CMTc w of the CMT-Harvard catalog (Fig. 6).The magnitudes compare well with a mean difference µ = 0.04 ∓ 0.19 for events with magnitude about M wp 7.0-7.5.For larger events, the magnitudes M Pc wp begin to overestimate with respect to the M CMTc w .We now compare the magnitudes M EEc wp with the M CMTc w (Fig. 7).The M EEc wp magnitudes appear to be significantly overestimated (> 0.2 magnitude unit) for earthquakes with M CMT w ≤ 6.5.M wp is based on the far-field approximation to the P wave displacement due to a double-couple point source (Tsuboi et al., 1995), thus we should consider that the result of M wp computed in the field near to the source may be biased.In fact Hirshorn et al. (2012) showed that single station M wp values measured at stations at epicentral distances ≤ 15 • have positive residuals with respect to the Harvard centroid moment tensor M w .Nevertheless, our procedure is built to obtain reliable M wp estimates as fast as possible, thus we aim to also use M wp measured from stations close to the epicenter.
To test if our M EEc wp values may be dependent as a function of the distance between station and epicenter, we plotted the station residuals at each station for each event with respect to the epicenter distance (Fig. 8).Station residuals are defined as δM i wp = M EEc,i wp − M CMTc w , where i indicates the M wp values measured at each station.The top left of Figure the residuals δM i wp (gray dots) for all events with hypocenter depth ≤ 100 km plotted with respect to the epicentral distance in degrees.From these residuals we compute the regression function (dashed line in Fig. 8): Figure 8 and Eq. ( 1) show that the δM i wp are overestimated for distances 30 • and slightly underestimated for distances 90 • .After applying the regression function f ( ) to the station values, the distance dependency of M i wp is removed (Fig. 8 top right panel).
The distance dependency of the measured M EEc,i wp at each station results in a general overestimation of the M EEc wp with respect to the M CMTc w (Fig. 7 bottom left).The overestimation of M EEc wp could of course be removed using only M wp , measured at stations with epicentral distance 30 • ≤ ≤ 90 • .Nevertheless, Early-est is designed to provide au-tomatic magnitude estimations within a few minutes after the event origin time in order to disseminate early tsunami warnings.Thus the closer stations are relevant and must be used.
For this reason we apply Eq. ( 1) to remove the distance dependency of the measured M EEc,i wp and we then recompute the magnitude events M EEc wp, corr .To recompute the M EEc wp, corr we follow the Early-est procedure as follows: we cut off stations with M EEc,i wp < 10th percentile and with M EEc,i wp > 10th percentile.The event magnitude is M wp = 50th percentile of the remaining values.The bottom right histogram of Fig. 8 shows the corrected magnitude differences δM EEc wp, corr .The right shift of the original magnitude differences distribution (Fig. 8, bottom left) is corrected.The resulting magnitude M EEc wp uncertainty with respect to the M CMTc w is δM wp = 0.0 ∓ 0.2, which is consistent with the uncertainty of the M wp provided by the PTWC with respect to the global CMT-Harvard catalog.

Speed performance and tsunami warning alert timeline
In the previous section, we analyzed the final epicenter location, hypocenter depth, and magnitude values provided by Early-est, i.e., the values obtained about 20 min to 1 h after the event origin time.A tsunami alert however, is meaningful when delivered within a short time after the event origin time and with reliable earthquake source parameters.In order to plan the timeline procedure at the CAT-INGV, we want to know how fast the earthquake source parameters computed by Early-est converge toward a stable value.We thus first analyze how fast Early-est provides a first automatic location, and second, how fast the epicenter coordinates and the magnitudes stabilize.
The histogram in Fig. 9 shows the delay time after the event origin time when a first automatic location of Early-est becomes available.We generally have to wait at least 2 min in order to have a first automatic solution; within 7 and 10 min Figure 9. Early-est first location performance.This figure shows how fast a first location for global events is available through Earlyest.The bins (25 s wide) on the x axis refer to the seconds after the event origin time (OT) when a first location is available.On the top right, the mean, the standard deviation, and four representative percentiles are indicated.
after the event origin time, about 95 and 100 % of all earthquakes are located, respectively.On a global scale, a large number of earthquakes are located along the oceanic ridges and trenches, which are far away from most of the seismic stations.In the Mediterranean region, the distances between earthquake sources and seismic stations are generally shorter than on a global scale.Table 1 lists the 12 events with magnitude M ≥ 5.5 that occurred in the Mediterranean region between March 2012 and the end of December 2014.These 12 events do not form a reliable statistic, but from Table 1 we may reasonably expect to locate an event in the Mediterranean region with magnitude M ≥ 5.5 within 2-3 min after the event origin time.
Figure 10 shows how fast a first location (top panel) and magnitude (bottom panel) stabilizes towards the final and stable values.
Both panels indicate that for most of the events, the epicenter coordinates and magnitudes within the first 8-10 min after the first available location may be considered stable and significantly close to the final values, since the magnitudes are for each run, we computed the distance in kilometers between the current epicenter and the epicenter of the last location.Bottom panel: for each run we computed the absolute magnitude difference between the current magnitude and the final magnitude.In this panel, most of the magnitudes are available 2 min after the event origin time, since often the first automatic location may not provide a magnitude value.The magnitude refers to the "best" magnitude decided by Early-est (Table 3) at each run.In both panels difference values (depicted by black crosses) are plotted on the y axis with respect to the minutes after the first location is established (0 value at the x axis).The black line depicts the mean value computed for each minute and the dashed line shows the mean plus the standard deviation.
The CAT-INGV uses the earthquake source parameters provided by Early-est to compile the tsunami warning messages to be disseminated to the civil authorities.The mission of the CAT is to provide tsunami warnings for earthquakes with M ≥ 5.5 which occur in the Mediterranean region according to the ICG/NEAMTWS guidelines.
Based on the speed performances of Early-Est on computing reliable earthquake source parameters (Fig. 10) and on the minimum delay time after the event origin time to localize an event in the Mediterranean (Table 1), we set the timeline described below to allow fast enough production of warning messages based on robust seismic estimates.
Based on Fig. 10 we decided to automatically compile a tsunami warning alert message always for the second, the fifth and the eighth locations available after the first location is established.Considering that the first location in the Mediterranean region may be available within 2-3 min after the event origin time, the second, the fifth and the eighth locations may be available by about 5, 8, and 11 min after the event origin time.Therefore, in the case of an earthquake in the Mediterranean region, the continuous monitoring of Early-est provides information to the seismologists for issuing tsunami warnings.Based on Fig. 10 and Table 1, such a procedure may be executed within about ≈ 15 min after the event origin time.The messages are delivered via several media such as mail, fax, GTS (https://www.wmo.int/pages/prog/www/TEM/GTS), and SMS.This ensures that messages are typically delivered to authorities within seconds.

Discussions and final remarks
Early-est is able to provide a first location within about 7 min from the origin time for almost 95 % of all worldwide earthquakes.In the Mediterranean region, where the epicentral distance between the earthquake and the seismic station is smaller, we may expect a first automatic location result within 2-3 min after the event origin time.Generally within less than 10 min after the first location is established, the estimations converge to stable values.
In our analysis, the automatic locations and source depth estimates provided by Early-est for global M ≥ 5.5 earthquakes are robust and reliable; in fact the epicenter source parameters estimates by Early-est are coherent with the epicenter source parameters provided after manual revision/validation by other agencies (NEIC, GFZ, and CSEM-EMSC) that locate earthquakes on a global scale.
Generally our analysis showed that earthquakes with M ≥ 5.5 can be located by using seismic data from global networks, with an empirical uncertainty, defined as the mean location difference with respect to the reference catalogs, of about ν ≈ 20 ∓ 25 km.The locations provided by Early-est show differences to the locations from the reference catalogs, that are comparable to the location differences among the reference catalogs.
A similar conclusion is valid for the mean Early-est focal depth difference for global M ≥ 5.5 earthquakes, which is about ν ≈ 0 ∓ 25 km, which is also coherent with the focal depth differences between the reference catalogs.
Early-est uses only a subset of all worldwide, public, realtime stations, and the fact that the available number of stations sometimes may be reduced because of latencies does not seem to affect the quality of the estimated epicenter coordinates and hypocenter depth.
The magnitude is a key earthquake parameter to determine the tsunami alert level (see Sect. 1).The decision matrix defined by the NEAMTWS (2011) sets the tsunami warning level on the basis of the magnitude, hypocenter depth, and the distance between the epicenter and the coastal forecast points.The automatic mb and M wp magnitudes provided by Early-est show differences to the reference values used, that in some cases may be significant in the context of a tsunami warning.
The mb magnitudes provided by Early-est compare well with the mb values provided by reference catalogs from the point of view of the mean differences, but show sparse and scattered distributions that can be larger than ∓0.3 units of magnitude.Such sparse distribution can be corrected by increasing the signal-to-noise ratio threshold for the mb station values.On the other hand, a higher signal-to-noise ratio threshold may reduce the number of station readings, and would require more stations to obtain a reliable mb value.This would result in a slower magnitude estimation, which may affect the efficiency and the speed required for the dissemination of tsunami warnings.A linear correction of the computed mb values produces indeed a reduction of the standard deviation to about ∓0.15 units of magnitude.Both f 1 and f 2 corrections help the avoidance of large magnitude over-and underestimations.The f 1 correction function shows slightly more narrow distribution than the f 2 correction function.
Nevertheless, the mb magnitude starts to saturate from a magnitude of mb 6.0, and for this reason Early-est does not use mb when M wp ≥ 5.8.Thus, mb values apply to earthquakes which are not generally expected to be tsunamigenic.
The Early-est magnitude M wp values are reliable when computed using only stations with epicentral distance 30 • ≤ ≤ 90 • .As expected (Tsuboi et al., 1995;Hirshorn et al., 2012), single station M i wp measurements at distance ≤ 30 • are significantly overestimated (Fig. 8).The observed distance-dependent bias at each station results in a general overestimation of the final M wp (Fig. 7).Earlyest is designed to provide automatic magnitude estimation within a few minutes after the event origin time in order to disseminate early tsunami warnings, thus the closer stations are relevant and must be used.For this reason we prefer to correct the station M wp values to remove the overestimation of the single station M wp values at distance ≤ 30 • , instead of introducing a minimum distance cut off.
Since the assignment rules for the best magnitude depend on the number of stations measuring reliable mb, M wp , and M wpd and the magnitude value for each one (Table 3), the assigned best magnitude may vary between mb, M wp , and M wpd at each run.This is particularly true within the first few minutes after the event origin time, when the number of available waveforms may still be small, and the magnitude values may not be stable yet (Fig. 10).The linear correction for mb and the distance-dependent correction for M wp will thus produce a stable and reliable best magnitude useful for seismologically based tsunami early warning procedures.
The CAT-INGV provides seismologically based tsunami early warnings when earthquakes with magnitude M ≥ 5.5 occur in the Mediterranean region.Such tsunami warning messages are based on the fully automatically location and magnitude estimations provided by the Early-est software.The analysis of a data set of 3 years of worldwide earthquakes showed that Early-est is a robust, reliable, and efficient piece of software for automatic real-time earthquake source parameter estimation, which provides reliable and robust location parameters and magnitude estimations within a few minutes after the event origin time.

F. Bernardi: Early-est performance Appendix A: Octree associate/locate module
The octree associate/locate module (Fig. A2) efficiently and robustly associates picks, and detects and locates seismic events over the whole Earth from 0 to 700 km depth using the efficient, nonlinearized, probabilistic, and global octree importance-sampling search (Lomax et al., 2001(Lomax et al., , 2009)).The objective function for the octree search is a probability function, P (x), based on the stacking of implicit origin times for each pick for each potential source x test .Given a seismic wave velocity model (currently ak135 Kennett et al., 1995), a pick time t p at a seismic station, and assuming a seismic phase type that may have produced the pick, the phase traveltime from the source x test to the station T x can be calculated and thus the implicit origin time T 0 for the source and phase can be determined by back projection (e.g., T 0 = T p − T x ).The set of stacks of T 0 for all picks forms a histogram over potential origin times for a source at x test .If the maximum histogram value exceeds a specified threshold, and if the associated picks for the maximum pass tests on amplitudes and station distributions, then P (x test ) is retained to drive the octree search further to find a maximum x max = max[P (x)] and define a seismic event at x max and associated picks.
The octree search is direct and nonlinearized -it does not involve linearization of the equations relating the pick times to the source location, and is global and probabilistic; it samples throughout the prior probability density function (PDF) for the seismic location problem.The search uses an initial, coarse, regular grid-search followed by recursive, octal sub-division, and sampling of cells in three-dimensional, latitude/longitude/depth space to generate a cascaded, octree structure of sampled cells.The octree search produces approximate importance-sampling -the spatial density of sampled cells follows the objective function P .
For each latitude/longitude/depth cell of volume v scanned by the octree search, a histogram-like stack over implicit origin times for first-arrival, P phases (currently Pg, P , Pdiff, PKPdf), is constructed for all picks in the pick list.Each origin-time value T 0 is assigned a distance and pick-qualityweighted amplitude A between 0 and 1.0, and an uncertainty σ determined by the sum of half the maximum travel-time range across the cell volume with the travel-time and pick uncertainties.Each implicit origin time is included in the origin-time stack with amplitude A using two step-function time limits at T 0 ∓ σ inserted in time order.After all picks have been processed, the maximum of the origin-time stack is found by a systematic scan over the available time limits; the use of step-function time limits and time ordering makes this scan very fast.All picks whose origin time limits overlap the stack maximum time are flagged as associated.The stack value, combined with the variance of the implicit origin times from all associate picks, is converted to a probability, P (x, v).If the maximum stack value exceeds a specified threshold (currently 4.5), and if the associated picks for the maximum pass tests on amplitude attenuation, station dis-tance, and azimuth distributions, then P (x, v) is stored for use in the progression of the octree search.If any of these conditions are not met, then the octree associate/locate module returns with a flag that no event has been associated.P (x, v) represents the relative probability that an event is located within a cell of volume v at position x.
The octree search to associate/locate is paused when the subdivided cells reach an adaptively determined, minimum size (e.g., ≤ 5 km for a location constrained by regional to globally distributed stations, ≤ 1 km for a location constrained by locally distributed stations); in this pause, uncertainty measures (e.g., PDF scatter samples) are generated in the association stage.The octree search and cell subdivision is then continued for a fixed number of samples (currently about 4600) to obtain a refined, precise location by fixing the associated phases to those corresponding to the maximum of the P (x, v) found in the association stage.The fixing of the associated phases is necessary for small cell sizes since a decreasing cell volume combined with the step-function limits on origin time leads to a continuous reduction in P (x, v) values and eventual instability and nonconvergence of the octree search near and at the optimal source location.The precise octree results provide uncertainty measures (e.g., PDF scatter samples, uncertainty ellipsoid) for the location.
When the octree associate/locate module returns an event, the associated picks for this event are masked in the pick list and the octree associate/locate module is run again using the remaining, non-associated picks, until no further events are returned.Thus multiple events can be associated and located within a report interval, and, in general, the events are identified in order of the number of associated picks and better location constraint.
Early-est runs the octree associate/locate module every 1 min using all picks from the past hour, without knowledge of or preserving information from previously associations and event locations.This procedure makes Early-est relatively simple algorithmically, and robust with regards to changes in the set of available picks and the number of associated picks defining locations.In particular, this procedure allows early stage locations with few associated picks to easily move in space or origin time, or to split in multiple events, or to be absorbed in other events, or to disappear as more pick data become available.However, this procedure is inefficient for later stage event locations which are defined by a larger number of associated picks, e.g., more than 10-20 picks, since such locations are very unlikely to change; much processing effort is repeated each minute to reobtain a previous result.This inefficiency can be problematic after large earthquakes, when the repeated re-processing of hundreds of picks from a mainshock and large aftershock can cause Early-est to fall behind real time.This filter is applied to the BRB velocity, so effectively gives: integrate → simulate the WWSSN-SP response → differentiate, without doing the integration and differentiation.
-measure V max -the peak from T p to the lesser of T p + T o or T p + 30 s -apply: mb(V max ) = log 10 (V max /2π ) + Q( , h) M wp Tsuboi et al. (1995Tsuboi et al. ( , 1999) ) M wp magnitude, modified as follows: -applied from T p to the lesser of T p + T o or T p + 120 s M wpd (RT ) Lomax and Michelini (2009a) M wpd duration-amplitude, large earthquake magnitude, modified as follows to allow simple and robust real-time application without event type determination: -use constant k = 4.213e19; PREM depth correction; no geometrical spreading or attenuation corrections. - 5 earthquakes are generally located with mean distance differences smaller than δ ref 20 ∓ 25 km; almost 95 % of all earthquakes are Global map with the 494 seismic broadband stations used by Early-est (the list is updated at the end of September 2014).The stations belong to 45 different networks providing data in real time.When working in real time, latencies in the data stream and/or connection problems may occur, reducing the number of waveform available for location and magnitude estimation.
Figure2.Epicenter location difference distributions for the events listed in the reference and in the Early-est catalogs.The epicenter location difference is expressed in kilometers on the x axis.The y axis refers to the number of events for each bin; the bins are 5 km each.The top panels show the location difference between the locations of the three reference catalogs: Nc, Gc, and Cc.The bottom panels show the location difference between Early-est and the reference catalogs.The gray color scale indicates magnitude ranges as follows: dark gray 5.5 ≤ M < 6, middle dark gray 6.0 ≤ M < 6.5, middle light gray 6.5 ≤ M < 7.0 and light gray M ≥ 7.0.The mean and the standard deviation and the 95 % percentiles for the entire dates (i.e., regardless of the magnitude) are indicated on the top right of each panel.

Figure 3 .
Figure3.Hypocenter depth difference distributions for the events listed in the reference and in the Early-est catalogs.The hypocenter depth difference is expressed in kilometers on the x axis.The y axis refers to the number of events for each bin; the bins are 5 km each.The top panels show the hypocenter depth difference distribution between the locations of the three reference catalogs Nc, Gc, and Cc.The bottom panels show the hypocenter depth difference between Early-est and the reference catalogs.The gray color scale indicates magnitude ranges as follows: dark gray 5.5 ≤ M < 6, middle dark gray 6.0 ≤ M < 6.5, middle light gray 6.5 ≤ M < 7.0, and light gray M ≥ 7.0.The mean and the standard deviation and the 95% percentiles for the entire dates (i.e., regardless of the magnitude) are indicated on the top right of each panel.

Figure 4 .
Figure 4. Magnitude difference distributions for the events listed in the EEc catalog compared to the two Nc and Cc reference catalogs.Differences are computed only when the same magnitude type is provided for the same event in the two compared catalogs.The magnitude difference is on the x axis.The y axis refers to the number of events for each bin; the bins are 0.1 magnitude each.The color scale refers to the same magnitude ranges as in Figs.3 and 2, and not to the magnitude type.The gray color scale indicate magnitude ranges as follows: dark gray 5.5 ≤ M < 6, middle dark gray 6.0 ≤ M < 6.5, middle light gray 6.5 ≤ M < 7.0, and light gray M ≥ 7.0.The mean and the standard deviation and the 95 % percentiles for the entire dates (i.e., regardless of the magnitude) are indicated on the top left of each panel.
and right panels).The δmb EE→Nc and δmb EEc→Nc Cc distributions, and the δmb EE→Cc and the δmb EE→Cc Nc distributions appear to be significantly different.We performed a t test between δmb EE→Nc and the δmb EEc→Nc hypothesis H o is rejected at more than 95 %.

Figure 5 .FFigure 6 .
Figure 5. Magnitude mb differences between the Early-est catalog and the reference catalogs (Nc on the left and Cc on the right).Top row panels (a) and (b) depict mb magnitude comparison between the Early-est values (x axis) and the reference catalog values (y axis).The dashed lines refer to the linear regression functions, the a and b constant are indicated on the upper left corner, and the thin black line refers to the 1 : 1 proportion.Second row panels (c) and (d) depict mb magnitude difference distribution; each bin is 0.05 magnitude units wide.The black line refers to the theoretical distribution derived from measured mean µ and standard deviation σ with = 1.Third row panels (e) and (f) are as in the second row panels, but after applying the correction function, shown in the top panels, to the Early-est mb.Fourth row panels (g) and (h) are as in the third row panels, but on the left panel, the EEc-Cc derived correction is applied and on the right panel, the EEc-Nc derived correction is applied.

Figure 7 .
Figure 7. Early-est magnitudes M wp compared with the CMT-Harvard M w from the CMT-Harvard catalog.The continuous line indicates the 1 : 1 ratio and dashed lines indicate uncertainty.

Figure 8 .
Figure 8. Epicentral distance dependence of the M wp for events with hypocentral depth ≤ 100 km.The top left panel shows station residuals δM i wp = M EE,i wp − M CMT w (gray dots), plotted with respect to the epicentral distance in degree, and the dashed line, which represents a third degree polynomial regression function (Eq. 1) which best fits the data.The top right panel indicates station residuals δM i wp = M EEcorr,i wp − M CMT w (gray dots), after applying the regression function (Eq.1), plotted with respect to the epicentral distance in degree, and the dashed line, which is a third degree polynomial regression function, which best fits the corrected residuals with respect to the distance.Bottom left panel: event magnitude difference M wp distribution before the distance correction.These distribution are similar to Fig. 7 as follows: mean, standard deviation, and percentiles are indicated on the left of the histogram, bins are 0.5 magnitude wide each, and the black solid line refers to theoretical distribution with = 1.The bottom right panel shows event magnitude difference M corr wp distribution after the distance correction using Eq.(1).

Figure 10 .
Figure10.Early-est location and magnitude estimation stability performances.This figure shows how fast a first location estimation (top panel) and magnitude (bottom panel) estimations evolve towards stable values.Top panel: for each run, we computed the distance in kilometers between the current epicenter and the epicenter of the last location.Bottom panel: for each run we computed the absolute magnitude difference between the current magnitude and the final magnitude.In this panel, most of the magnitudes are available 2 min after the event origin time, since often the first automatic location may not provide a magnitude value.The magnitude refers to the "best" magnitude decided by Early-est (Table3) at each run.In both panels difference values (depicted by black crosses) are plotted on the y axis with respect to the minutes after the first location is established (0 value at the x axis).The black line depicts the mean value computed for each minute and the dashed line shows the mean plus the standard deviation.

Table 2 .
Global earthquake catalogs used for the analysis in this work.For each catalog we have indicated the begin and end time of the time window of the data set included in this work.Catalog abbreviations used in this paper are in brackets in the first column.

Table A1 .
Early-est parameter specifications.Max.dominantperiodsmoothed over 5 s in window from T p to T p + 55.-reduced T 50 Ex minimum distance to 5 •Td •T 50 Ex Lomax and Michelini (2011) Period-duration discriminant for tsunami potential, modified as follows: -reduced Td •T 50 Ex minimum distance to 5 • T o Lomax and Michelini (2009a, b, 2011) High frequency, apparent source duration, modified as follows: -removed smoothing window width of 10 s from T o for short durations, applied with a linear ramp from 10 → 0s for initial durations of 20 → 60 s and minimum duration is at its highest frequency in HF stream (0.2 s) -reduced T o minimum distance to 5 • -added reference of T o duration to S arrival time (T s ) if raw duration end time T o,end is after T s (e.g., if T o,end > T s + (T s − T p )/3 then T o = T o,end − T s ).mb(V max ) Bormann and Saul (2008, 2009) mb body wave magnitude using V max formulation: -apply a recursive, time-domain filter to BRB velocity that implements the WWSSN-SP displacement response -WWSSN-SP displacement response from Working Group on Magnitudes (Magnitude WG) of the International Association of Seismology and Physics of the Earth's Interior (IASPEI) Commission on Seismological Observation and Interpretation (CoSOI) 2011 moment correction applied to all event types if T o > 80 s -moment correction applied to all event types if T o > 80 s -reduced M wpd minimum distance to 5 • (stable since added reference of T o duration to T s ) P arrival, first-motion and amplitude focal mechanism algorithm (fmamp).Uses octree search; solution quality based on weighted distribution (quasi-PDF) of P and T axis.(note: under development; not included yet in Early-est distribution.)