<?xml version="1.0" encoding="UTF-8"?>
<!DOCTYPE article PUBLIC "-//NLM//DTD Journal Publishing with OASIS Tables v3.0 20080202//EN" "https://jats.nlm.nih.gov/nlm-dtd/publishing/3.0/journalpub-oasis3.dtd">
<article xmlns:xlink="http://www.w3.org/1999/xlink" xmlns:mml="http://www.w3.org/1998/Math/MathML" xmlns:oasis="http://docs.oasis-open.org/ns/oasis-exchange/table" xml:lang="en" dtd-version="3.0" article-type="research-article">
  <front>
    <journal-meta><journal-id journal-id-type="publisher">NHESS</journal-id><journal-title-group>
    <journal-title>Natural Hazards and Earth System Sciences</journal-title>
    <abbrev-journal-title abbrev-type="publisher">NHESS</abbrev-journal-title><abbrev-journal-title abbrev-type="nlm-ta">Nat. Hazards Earth Syst. Sci.</abbrev-journal-title>
  </journal-title-group><issn pub-type="epub">1684-9981</issn><publisher>
    <publisher-name>Copernicus Publications</publisher-name>
    <publisher-loc>Göttingen, Germany</publisher-loc>
  </publisher></journal-meta>
    <article-meta>
      <article-id pub-id-type="doi">10.5194/nhess-26-1161-2026</article-id><title-group><article-title>The EAWS matrix, a decision support tool to determine the regional avalanche danger level (Part B): operational testing and use</article-title><alt-title>EAWS Matrix (Part B)</alt-title>
      </title-group>
      <contrib-group>
        <contrib contrib-type="author" corresp="yes" rid="aff1">
          <name><surname>Techel</surname><given-names>Frank</given-names></name>
          <email>techel@slf.ch</email>
        <ext-link>https://orcid.org/0000-0001-5686-6127</ext-link></contrib>
        <contrib contrib-type="author" corresp="no" rid="aff2">
          <name><surname>Müller</surname><given-names>Karsten</given-names></name>
          
        <ext-link>https://orcid.org/0000-0001-5746-0962</ext-link></contrib>
        <contrib contrib-type="author" corresp="no" rid="aff3">
          <name><surname>Marquardt</surname><given-names>Christopher</given-names></name>
          
        </contrib>
        <contrib contrib-type="author" corresp="no" rid="aff3">
          <name><surname>Mitterer</surname><given-names>Christoph</given-names></name>
          
        </contrib>
        <aff id="aff1"><label>1</label><institution>WSL Institute for Snow and Avalanche Research SLF, Davos, Switzerland</institution>
        </aff>
        <aff id="aff2"><label>2</label><institution>Norwegian Water Resources and Energy Directorate, Oslo, Norway</institution>
        </aff>
        <aff id="aff3"><label>3</label><institution>Avalanche Warning Service Tirol, Innsbruck, Austria</institution>
        </aff>
      </contrib-group>
      <author-notes><corresp id="corr1">Frank Techel (techel@slf.ch)</corresp></author-notes><pub-date><day>5</day><month>March</month><year>2026</year></pub-date>
      
      <volume>26</volume>
      <issue>3</issue>
      <fpage>1161</fpage><lpage>1181</lpage>
      <history>
        <date date-type="received"><day>11</day><month>July</month><year>2025</year></date>
           <date date-type="rev-request"><day>21</day><month>July</month><year>2025</year></date>
           <date date-type="rev-recd"><day>23</day><month>December</month><year>2025</year></date>
           <date date-type="accepted"><day>16</day><month>February</month><year>2026</year></date>
      </history>
      <permissions>
        <copyright-statement>Copyright: © 2026 Frank Techel et al.</copyright-statement>
        <copyright-year>2026</copyright-year>
      <license license-type="open-access"><license-p>This work is licensed under the Creative Commons Attribution 4.0 International License. To view a copy of this licence, visit <ext-link ext-link-type="uri" xlink:href="https://creativecommons.org/licenses/by/4.0/">https://creativecommons.org/licenses/by/4.0/</ext-link></license-p></license></permissions><self-uri xlink:href="https://nhess.copernicus.org/articles/nhess-26-1161-2026.html">This article is available from https://nhess.copernicus.org/articles/nhess-26-1161-2026.html</self-uri><self-uri xlink:href="https://nhess.copernicus.org/articles/nhess-26-1161-2026.pdf">The full text article is available as a PDF file from https://nhess.copernicus.org/articles/nhess-26-1161-2026.pdf</self-uri>
      <abstract><title>Abstract</title>

      <p id="d2e121">To support public safety and risk management in snow-covered mountains, regional avalanche forecasts must deliver reliable information on avalanche conditions, including regional danger levels representing the avalanche danger across warning regions. To promote greater transparency and consistency in avalanche danger level assessment across European avalanche warning services, a revised version of the EAWS Matrix was developed based on expert elicitation. The Matrix, a structured decision support tool that combines the Matrix input factors snowpack stability, the frequency of snowpack stability, and avalanche size, is used to determine the regional danger level. To support the development of the Matrix described in detail in the companion paper <xref ref-type="bibr" rid="bib1.bibx23" id="paren.1"/>, we analyzed its operational use over the first three winters following implementation by 26 European avalanche warning services. Our aim was to identify inconsistencies in Matrix application and to provide empirically based guidance for further refinement. In operational use, forecasters predominantly assigned a consistent single danger level to most Matrix input factor combinations. However, two factor combinations (<italic>poor-some-size 2</italic> and <italic>very poor-some-size3</italic>) were commonly assigned to one of two adjacent danger levels, indicating that these combinations function as transition zones between danger levels. Analyses based on finer-grained assessments of the input factors, that is, using sub-classes of the predefined coarse factor categories, revealed systematic tendencies within these classes. While application of the Matrix was relatively consistent for avalanche problems relating to dry-snow conditions, pronounced inconsistencies emerged in the classification of snowpack stability for wet-snow and gliding snow avalanche problems. These findings underscore the need for community-wide discussion and harmonization in Matrix application, particularly with respect to stability assessment practices. Assessing input factors at a finer scale shows potential for preserving important nuances in expert judgment and may enable more targeted guidance on when to assign the higher or lower of two danger levels indicated by the Matrix. However, because neither the danger level nor its input factors can be measured independently, a formal validation of Matrix logic and operational application is not possible. Despite some inconsistencies, our results suggest that European forecasters generally align with the Matrix logic, supporting its operational utility.</p>
  </abstract>
    </article-meta>
  </front>
<body>
      

<sec id="Ch1.S1" sec-type="intro">
  <label>1</label><title>Introduction</title>
      <p id="d2e142">Public avalanche forecasts play a key role in informing both authorities and recreational backcountry users about avalanche conditions at the regional scale. A primary objective of these forecasts is to communicate the severity of avalanche conditions clearly and efficiently (e.g., <xref ref-type="bibr" rid="bib1.bibx10" id="altparen.2"/>). To help users focus on the most important information first, avalanche forecasts are structured according to the concept of an information pyramid, with the most synthesized information placed at the top <xref ref-type="bibr" rid="bib1.bibx9" id="paren.3"/>. The danger level, a single categorical value summarizing the hazard using a standardized five-level scale ranging from 1 (low) to 5 (very high) <xref ref-type="bibr" rid="bib1.bibx7" id="paren.4"/>, sits at the top of this pyramid. It offers an immediate, easily understood signal to communicate the severity of avalanche conditions and serves as the entry point to more detailed forecast content. Recreational backcountry users often rely on this information, particularly during the planning stage, to guide their risk management strategies (e.g., <xref ref-type="bibr" rid="bib1.bibx13 bib1.bibx14 bib1.bibx27" id="altparen.5"/>).</p>
      <p id="d2e157">Consistent and accurate assessment of danger levels, both within and across regional or national warning services, is essential to their effectiveness and is a prerequisite for providing valuable forecasts to end users <xref ref-type="bibr" rid="bib1.bibx24" id="paren.6"/>. Yet, forecasting avalanche danger at a regional scale is inherently complex. It requires synthesizing diverse data sources, including field observations and model predictions, that are often sparse, unevenly distributed in time and space, and available in both structured and unstructured formats. This process culminates in an expert judgment of the danger level, integrating both the probability of avalanche occurrence and the expected size of potential avalanches in a region. As with most expert estimation tasks, differences in danger level assessments may occur even when the same information is available during the forecast process (e.g., <xref ref-type="bibr" rid="bib1.bibx18 bib1.bibx33" id="altparen.7"/>). To improve the consistency and transparency of danger level assessments across Europe, the European Avalanche Warning Services (EAWS) introduced decision-support tools to support the determination of regional avalanche danger levels (Bavarian matrix and EAWS-Matrix; <xref ref-type="bibr" rid="bib1.bibx23" id="altparen.8"/>). Recently, this framework was revised including updated definitions, a structured operational workflow, and a new version of the EAWS-Matrix <xref ref-type="bibr" rid="bib1.bibx6" id="paren.9"/> (Fig. <xref ref-type="fig" rid="F1"/>).</p>

      <fig id="F1" specific-use="star"><label>Figure 1</label><caption><p id="d2e176">EAWS-Matrix, as accepted by the EAWS General Assembly in 2022 <xref ref-type="bibr" rid="bib1.bibx23" id="paren.10"><named-content content-type="pre">taken from</named-content></xref>. The integer values shown in the Matrix cells refer to the danger levels. We refer to the first danger level shown in the Matrix as <inline-formula><mml:math id="M1" display="inline"><mml:mrow><mml:msup><mml:mi>D</mml:mi><mml:mn mathvariant="normal">1</mml:mn></mml:msup></mml:mrow></mml:math></inline-formula> or the Matrix-suggested danger level, and to the danger level shown in brackets as <inline-formula><mml:math id="M2" display="inline"><mml:mrow><mml:msup><mml:mi>D</mml:mi><mml:mn mathvariant="normal">2</mml:mn></mml:msup></mml:mrow></mml:math></inline-formula>. For a detailed explanation refer to the text.</p></caption>
        <graphic xlink:href="https://nhess.copernicus.org/articles/26/1161/2026/nhess-26-1161-2026-f01.png"/>

      </fig>

      <p id="d2e213">The revised EAWS-Matrix was developed through expert elicitation in a non-operational setting, using a survey-based approach, with the final version reached through expert consensus <xref ref-type="bibr" rid="bib1.bibx23" id="paren.11"/>. Although a broad group of professional forecasters contributed to its development, the Matrix was not systematically evaluated under real-world operational conditions, where forecasts are issued under uncertainty and may have serious consequences if incorrect. Observing how the Matrix is used in operations can help demonstrate its value in supporting and harmonizing danger level assessments, while also revealing potential weaknesses in its structure. Ideally, such evaluation would consider both accuracy, whether the Matrix structure and input factors reflect real avalanche conditions, and consistency, in terms of how reliably it supports similar danger level assessments across services, which are key attributes of forecast goodness <xref ref-type="bibr" rid="bib1.bibx24" id="paren.12"/>. However, since both the input factors and the resulting danger level are based on expert judgment and are not directly observable or measurable, such an evaluation is inherently difficult. Moreover, analyzing Matrix use in an operational context may reflect compliance with its design rather than an independent judgment of avalanche conditions, potentially masking design flaws.</p>
      <p id="d2e222">Given these limitations, we do not attempt to validate the Matrix in an absolute sense. Instead, we examine how it was used during daily operations to (i) identify differences in how avalanche danger is characterized using Matrix terminology across warning services; (ii) analyze how the Matrix was applied across the range of issued danger levels; and (iii) explore differences in situations with dry-snow conditions or when wet-snow or gliding snow avalanche problems were of concern. These analyses provide insight into the operational implementation of the Matrix and highlight potential areas for refinement.</p>
      <p id="d2e225">The presented study is part of the iterative process described in the companion paper <xref ref-type="bibr" rid="bib1.bibx23" id="paren.13"/>, which details the conceptual development of the revised Matrix and the accompanying workflow. Here, we focus on their operational implementation: how the Matrix was used in 26 warning services across multiple countries during the first three winters following its introduction, and how these real-world applications can inform ongoing improvements to the framework.</p>
</sec>
<sec id="Ch1.S2">
  <label>2</label><title>Determining the regional avalanche danger level – approach in Europe</title>
      <p id="d2e239">In the following, we briefly review the key principles of regional avalanche danger assessment in Europe. We refer the interested reader for details on the methodological background and conceptual derivation of the Matrix to <xref ref-type="bibr" rid="bib1.bibx23" id="text.14"/>.</p>
      <p id="d2e245">The revised conceptual framework for the determination of the danger levels introduced by EAWS includes updated definitions of the factors (snowpack stability, frequency of snowpack stability, and avalanche size) determining avalanche danger, a structured operational workflow, and the EAWS-Matrix <xref ref-type="bibr" rid="bib1.bibx6" id="paren.15"/>. While conceptually aligned with the Conceptual Model of Avalanche Hazard <xref ref-type="bibr" rid="bib1.bibx30" id="paren.16"><named-content content-type="pre">CMAH,</named-content></xref>, the framework is tailored to the regional scale, where avalanche danger must be evaluated across entire warning regions encompassing diverse terrain types, slope aspects, and elevation bands, culminating in the determination and communication of a danger level.</p>
      <p id="d2e256">The workflow begins by identifying all relevant avalanche problems <xref ref-type="bibr" rid="bib1.bibx8" id="paren.17"/>. For each problem, forecasters assess both its presence (including affected slope aspects and elevation bands) and its contribution to the overall danger within a micro-region, the smallest spatial units used for danger level assessment. This includes estimating the probability of avalanche occurrence and avalanche size. The avalanche occurrence probability is further decomposed into two factors <xref ref-type="bibr" rid="bib1.bibx6" id="paren.18"/>: <list list-type="bullet"><list-item>
      <p id="d2e267"><italic>Snowpack stability</italic> describes the local propensity of the snowpack to avalanche <xref ref-type="bibr" rid="bib1.bibx26" id="paren.19"/> and is categorized into four classes: <italic>very poor</italic>, <italic>poor</italic>, <italic>fair</italic>, and <italic>good</italic> (Table <xref ref-type="table" rid="TA1"/>). These classes correspond to typical triggering mechanisms. For instance, avalanches releasing due to natural causes, such as loading from new snow or weakening from rain or melt water, are linked to <italic>very poor</italic> stability, while <italic>poor</italic> stability is commonly associated with human triggering.</p></list-item><list-item>
      <p id="d2e297"><italic>Frequency of snowpack stability (classes)</italic> refers to the proportion of avalanche terrain within a region where a given stability class occurs. Frequency is categorized into four classes: <italic>many</italic>, <italic>some</italic>, <italic>a few</italic>, and <italic>none or nearly none</italic> (Table <xref ref-type="table" rid="TA2"/>). The last class indicates that the stability class is either absent or so rare that it is not considered relevant for the avalanche danger level assessment.</p></list-item></list></p>
      <p id="d2e316">In practice, estimating the full spatial distribution of snowpack stability across a region, where conditions vary by elevation, aspect, and slope, is not feasible <xref ref-type="bibr" rid="bib1.bibx32" id="paren.20"/>. In fact, forecasters must synthesize limited information to estimate the prevalence of the lowest stability class(es), which forms the basis for judging the probability of avalanche occurrence. Combined with the largest avalanche size which can reasonably be expected given the conditions, categorized into five classes (Table <xref ref-type="table" rid="TA3"/>), these factors inform the danger level. Thus, the three inputs (stability, frequency, and avalanche size) constitute the core of the EAWS-Matrix.</p>
      <p id="d2e325">The EAWS-Matrix is a look-up table providing guidance for assigning the danger level based on these assessments (Fig. <xref ref-type="fig" rid="F1"/>). It consists of three panels, one for each of the lowest relevant stability class (<italic>very poor</italic>, <italic>poor</italic>, and <italic>fair</italic>), with each showing combinations of frequency (<inline-formula><mml:math id="M3" display="inline"><mml:mi>y</mml:mi></mml:math></inline-formula> axis) and avalanche size (<inline-formula><mml:math id="M4" display="inline"><mml:mi>x</mml:mi></mml:math></inline-formula> axis). Each cell contains one or two danger levels: the primary value (<inline-formula><mml:math id="M5" display="inline"><mml:mrow><mml:msup><mml:mi>D</mml:mi><mml:mn mathvariant="normal">1</mml:mn></mml:msup></mml:mrow></mml:math></inline-formula>), shown first and corresponding to the cell's color, represents the majority expert view and is referred to as the Matrix-suggested danger level. A secondary danger level (<inline-formula><mml:math id="M6" display="inline"><mml:mrow><mml:msup><mml:mi>D</mml:mi><mml:mn mathvariant="normal">2</mml:mn></mml:msup></mml:mrow></mml:math></inline-formula>), shown in brackets, is included when at least 30 % of experts selected a different level <xref ref-type="bibr" rid="bib1.bibx7 bib1.bibx23" id="paren.21"/>.</p>
      <p id="d2e379">To determine the danger level, forecasters begin with the lowest assessed stability class and evaluate whether its frequency is relevant (i.e., not <italic>none or nearly none</italic>). If not, they proceed to the next stability class, which is congruent with an increase in triggering level needed to release an avalanche, following the arrows between panels (Fig. <xref ref-type="fig" rid="F1"/>), and so on. If stability is assessed as <italic>good</italic> throughout the region, the danger level is set to 1 (low) by default. Forecasters locate the Matrix cell that best represents the conditions for each avalanche problem. Avalanche size is assessed independently and reflects the largest avalanche that could reasonably be expected under the expected conditions.</p>
      <p id="d2e390">The final communicated danger level for a warning region is the highest level assigned across all relevant avalanche problems. When different problems occur in a way that the combined frequency of potential triggering spots increases in the same aspects and elevations, their combined effect may result in a higher overall danger level than any single problem would suggest in isolation <xref ref-type="bibr" rid="bib1.bibx23" id="paren.22"/>.</p>
</sec>
<sec id="Ch1.S3">
  <label>3</label><title>Data</title>
<sec id="Ch1.S3.SS1">
  <label>3.1</label><title>Data overview</title>
      <p id="d2e411">In total, 26 avalanche warning services recorded their factor choices (snowpack stability, frequency of the lowest snowpack stability class, expected avalanche size) during operational forecasting, covering one or more winter seasons or extended parts thereof. Table <xref ref-type="table" rid="T1"/> provides an overview, including the three-letter abbreviations used when referring to specific warning services.</p>

<table-wrap id="T1" specific-use="star"><label>Table 1</label><caption><p id="d2e419">Data overview. Warning services are ordered according to the groups they were assigned to for the analysis (see also Sect. <xref ref-type="sec" rid="Ch1.S4.SS2"/>).</p></caption><oasis:table frame="topbot"><oasis:tgroup cols="8">
     <oasis:colspec colnum="1" colname="col1" align="left"/>
     <oasis:colspec colnum="2" colname="col2" align="left"/>
     <oasis:colspec colnum="3" colname="col3" align="left"/>
     <oasis:colspec colnum="4" colname="col4" align="right"/>
     <oasis:colspec colnum="5" colname="col5" align="right"/>
     <oasis:colspec colnum="6" colname="col6" align="center"/>
     <oasis:colspec colnum="7" colname="col7" align="left"/>
     <oasis:colspec colnum="8" colname="col8" align="left"/>
     <oasis:thead>
       <oasis:row>
         <oasis:entry colname="col1">Country</oasis:entry>
         <oasis:entry colname="col2">Warning service</oasis:entry>
         <oasis:entry colname="col3">Code</oasis:entry>
         <oasis:entry rowsep="1" namest="col4" nameend="col5" align="center"><inline-formula><mml:math id="M9" display="inline"><mml:mi>N</mml:mi></mml:math></inline-formula> cases (<inline-formula><mml:math id="M10" display="inline"><mml:mi>N</mml:mi></mml:math></inline-formula> days) </oasis:entry>
         <oasis:entry colname="col6">Matrix used</oasis:entry>
         <oasis:entry rowsep="1" namest="col7" nameend="col8" align="center">Group </oasis:entry>
       </oasis:row>
       <oasis:row>
         <oasis:entry colname="col1"/>
         <oasis:entry colname="col2"/>
         <oasis:entry colname="col3"/>
         <oasis:entry colname="col4">Dry-snow</oasis:entry>
         <oasis:entry colname="col5">Wet-/gliding</oasis:entry>
         <oasis:entry colname="col6"/>
         <oasis:entry colname="col7">Compliance</oasis:entry>
         <oasis:entry colname="col8">Wet-/gliding</oasis:entry>
       </oasis:row>
       <oasis:row rowsep="1">
         <oasis:entry colname="col1"/>
         <oasis:entry colname="col2"/>
         <oasis:entry colname="col3"/>
         <oasis:entry colname="col4"/>
         <oasis:entry colname="col5">snow</oasis:entry>
         <oasis:entry colname="col6"/>
         <oasis:entry colname="col7"/>
         <oasis:entry colname="col8">snow</oasis:entry>
       </oasis:row>
     </oasis:thead>
     <oasis:tbody>
       <oasis:row>
         <oasis:entry colname="col1">Austria</oasis:entry>
         <oasis:entry colname="col2">Oberösterreich</oasis:entry>
         <oasis:entry colname="col3">OBE</oasis:entry>
         <oasis:entry colname="col4">394 (282)</oasis:entry>
         <oasis:entry colname="col5">170 (141)</oasis:entry>
         <oasis:entry colname="col6">yes</oasis:entry>
         <oasis:entry colname="col7">compl-hi</oasis:entry>
         <oasis:entry colname="col8">wet-P</oasis:entry>
       </oasis:row>
       <oasis:row>
         <oasis:entry colname="col1">Austria</oasis:entry>
         <oasis:entry colname="col2">Vorarlberg</oasis:entry>
         <oasis:entry colname="col3">VOR</oasis:entry>
         <oasis:entry colname="col4">675 (391)</oasis:entry>
         <oasis:entry colname="col5">255 (168)</oasis:entry>
         <oasis:entry colname="col6">yes</oasis:entry>
         <oasis:entry colname="col7">compl-hi</oasis:entry>
         <oasis:entry colname="col8">wet-P</oasis:entry>
       </oasis:row>
       <oasis:row>
         <oasis:entry colname="col1">Italy</oasis:entry>
         <oasis:entry colname="col2">Lombardia</oasis:entry>
         <oasis:entry colname="col3">LOM</oasis:entry>
         <oasis:entry colname="col4">759 (147)</oasis:entry>
         <oasis:entry colname="col5">311 (90)</oasis:entry>
         <oasis:entry colname="col6">yes</oasis:entry>
         <oasis:entry colname="col7">compl-hi</oasis:entry>
         <oasis:entry colname="col8">wet-P</oasis:entry>
       </oasis:row>
       <oasis:row>
         <oasis:entry colname="col1">Italy</oasis:entry>
         <oasis:entry colname="col2">Val d’Aosta</oasis:entry>
         <oasis:entry colname="col3">VDA</oasis:entry>
         <oasis:entry colname="col4">267 (132)</oasis:entry>
         <oasis:entry colname="col5">48 (42)</oasis:entry>
         <oasis:entry colname="col6">yes</oasis:entry>
         <oasis:entry colname="col7">compl-hi</oasis:entry>
         <oasis:entry colname="col8">wet-P</oasis:entry>
       </oasis:row>
       <oasis:row>
         <oasis:entry colname="col1">Italy</oasis:entry>
         <oasis:entry colname="col2">Meteomont<sup>a</sup></oasis:entry>
         <oasis:entry colname="col3">MMT</oasis:entry>
         <oasis:entry colname="col4">5290 (263)</oasis:entry>
         <oasis:entry colname="col5">2718 (249)</oasis:entry>
         <oasis:entry colname="col6">yes</oasis:entry>
         <oasis:entry colname="col7">compl-hi</oasis:entry>
         <oasis:entry colname="col8">wet-P</oasis:entry>
       </oasis:row>
       <oasis:row>
         <oasis:entry colname="col1">Slovakia</oasis:entry>
         <oasis:entry colname="col2">Slovakia</oasis:entry>
         <oasis:entry colname="col3">SVK</oasis:entry>
         <oasis:entry colname="col4">300 (201)</oasis:entry>
         <oasis:entry colname="col5">168 (116)</oasis:entry>
         <oasis:entry colname="col6">yes</oasis:entry>
         <oasis:entry colname="col7">compl-hi</oasis:entry>
         <oasis:entry colname="col8">wet-P</oasis:entry>
       </oasis:row>
       <oasis:row>
         <oasis:entry colname="col1">Austria</oasis:entry>
         <oasis:entry colname="col2">Steiermark</oasis:entry>
         <oasis:entry colname="col3">STE</oasis:entry>
         <oasis:entry colname="col4">605 (311)</oasis:entry>
         <oasis:entry colname="col5">248 (161)</oasis:entry>
         <oasis:entry colname="col6">yes</oasis:entry>
         <oasis:entry colname="col7">compl-hi</oasis:entry>
         <oasis:entry colname="col8">wet-P/VP</oasis:entry>
       </oasis:row>
       <oasis:row>
         <oasis:entry colname="col1">Italy</oasis:entry>
         <oasis:entry colname="col2">Marche</oasis:entry>
         <oasis:entry colname="col3">MAR</oasis:entry>
         <oasis:entry colname="col4">214 (80)</oasis:entry>
         <oasis:entry colname="col5">203 (96)</oasis:entry>
         <oasis:entry colname="col6">yes</oasis:entry>
         <oasis:entry colname="col7">compl-hi</oasis:entry>
         <oasis:entry colname="col8">wet-P/VP</oasis:entry>
       </oasis:row>
       <oasis:row>
         <oasis:entry colname="col1">Italy</oasis:entry>
         <oasis:entry colname="col2">Trentino</oasis:entry>
         <oasis:entry colname="col3">TRE</oasis:entry>
         <oasis:entry colname="col4">549 (336)</oasis:entry>
         <oasis:entry colname="col5">181 (110)</oasis:entry>
         <oasis:entry colname="col6">yes</oasis:entry>
         <oasis:entry colname="col7">compl-hi</oasis:entry>
         <oasis:entry colname="col8">wet-P/VP</oasis:entry>
       </oasis:row>
       <oasis:row>
         <oasis:entry colname="col1">Italy</oasis:entry>
         <oasis:entry colname="col2">Veneto</oasis:entry>
         <oasis:entry colname="col3">VEN</oasis:entry>
         <oasis:entry colname="col4">384 (121)</oasis:entry>
         <oasis:entry colname="col5">155 (47)</oasis:entry>
         <oasis:entry colname="col6">yes</oasis:entry>
         <oasis:entry colname="col7">compl-hi</oasis:entry>
         <oasis:entry colname="col8">wet-P/VP</oasis:entry>
       </oasis:row>
       <oasis:row>
         <oasis:entry colname="col1">Spain</oasis:entry>
         <oasis:entry colname="col2">Catalunya</oasis:entry>
         <oasis:entry colname="col3">CAT</oasis:entry>
         <oasis:entry colname="col4">151 (39)</oasis:entry>
         <oasis:entry colname="col5">136 (31)</oasis:entry>
         <oasis:entry colname="col6">yes</oasis:entry>
         <oasis:entry colname="col7">compl-hi</oasis:entry>
         <oasis:entry colname="col8">wet-P/VP</oasis:entry>
       </oasis:row>
       <oasis:row>
         <oasis:entry colname="col1">Germany</oasis:entry>
         <oasis:entry colname="col2">Bayern</oasis:entry>
         <oasis:entry colname="col3">BAY</oasis:entry>
         <oasis:entry colname="col4">500 (281)</oasis:entry>
         <oasis:entry colname="col5">354 (220)</oasis:entry>
         <oasis:entry colname="col6">yes</oasis:entry>
         <oasis:entry colname="col7">compl-hi</oasis:entry>
         <oasis:entry colname="col8">wet-VP</oasis:entry>
       </oasis:row>
       <oasis:row>
         <oasis:entry colname="col1">Norway</oasis:entry>
         <oasis:entry colname="col2">Norway<sup>a</sup></oasis:entry>
         <oasis:entry colname="col3">NOR</oasis:entry>
         <oasis:entry colname="col4">10316 (543)</oasis:entry>
         <oasis:entry colname="col5">3995 (355)</oasis:entry>
         <oasis:entry colname="col6">yes</oasis:entry>
         <oasis:entry colname="col7">compl-hi</oasis:entry>
         <oasis:entry colname="col8">wet-VP</oasis:entry>
       </oasis:row>
       <oasis:row>
         <oasis:entry colname="col1">Italy</oasis:entry>
         <oasis:entry colname="col2">Friuli Venezia Giulia</oasis:entry>
         <oasis:entry colname="col3">FRI</oasis:entry>
         <oasis:entry colname="col4">314 (104)</oasis:entry>
         <oasis:entry colname="col5">118 (39)</oasis:entry>
         <oasis:entry colname="col6">yes</oasis:entry>
         <oasis:entry colname="col7">compl-mid</oasis:entry>
         <oasis:entry colname="col8">wet-P</oasis:entry>
       </oasis:row>
       <oasis:row>
         <oasis:entry colname="col1">Austria</oasis:entry>
         <oasis:entry colname="col2">Niederösterreich</oasis:entry>
         <oasis:entry colname="col3">NIE</oasis:entry>
         <oasis:entry colname="col4">389 (247)</oasis:entry>
         <oasis:entry colname="col5">120 (96)</oasis:entry>
         <oasis:entry colname="col6">yes</oasis:entry>
         <oasis:entry colname="col7">compl-mid</oasis:entry>
         <oasis:entry colname="col8">wet-P/VP</oasis:entry>
       </oasis:row>
       <oasis:row>
         <oasis:entry colname="col1">Austria</oasis:entry>
         <oasis:entry colname="col2">Salzburg</oasis:entry>
         <oasis:entry colname="col3">SAL</oasis:entry>
         <oasis:entry colname="col4">802 (382)</oasis:entry>
         <oasis:entry colname="col5">398 (221)</oasis:entry>
         <oasis:entry colname="col6">yes</oasis:entry>
         <oasis:entry colname="col7">compl-mid</oasis:entry>
         <oasis:entry colname="col8">wet-VP</oasis:entry>
       </oasis:row>
       <oasis:row>
         <oasis:entry colname="col1">Austria</oasis:entry>
         <oasis:entry colname="col2">Tirol</oasis:entry>
         <oasis:entry colname="col3">TIR</oasis:entry>
         <oasis:entry colname="col4">1040 (370)</oasis:entry>
         <oasis:entry colname="col5">402 (202)</oasis:entry>
         <oasis:entry colname="col6">yes</oasis:entry>
         <oasis:entry colname="col7">compl-mid</oasis:entry>
         <oasis:entry colname="col8">wet-VP</oasis:entry>
       </oasis:row>
       <oasis:row>
         <oasis:entry colname="col1">Italy</oasis:entry>
         <oasis:entry colname="col2">Bozen–Südtirol<sup>b</sup></oasis:entry>
         <oasis:entry colname="col3">BOZ</oasis:entry>
         <oasis:entry colname="col4">594 (318)</oasis:entry>
         <oasis:entry colname="col5">152 (99)</oasis:entry>
         <oasis:entry colname="col6">yes</oasis:entry>
         <oasis:entry colname="col7">compl-mid</oasis:entry>
         <oasis:entry colname="col8">wet-VP</oasis:entry>
       </oasis:row>
       <oasis:row>
         <oasis:entry colname="col1">Spain</oasis:entry>
         <oasis:entry colname="col2">Val d’Aran</oasis:entry>
         <oasis:entry colname="col3">VAR</oasis:entry>
         <oasis:entry colname="col4">411 (304)</oasis:entry>
         <oasis:entry colname="col5">249 (220)</oasis:entry>
         <oasis:entry colname="col6">yes</oasis:entry>
         <oasis:entry colname="col7">compl-mid</oasis:entry>
         <oasis:entry colname="col8">wet-VP</oasis:entry>
       </oasis:row>
       <oasis:row>
         <oasis:entry colname="col1">Italy</oasis:entry>
         <oasis:entry colname="col2">Piemonte</oasis:entry>
         <oasis:entry colname="col3">PIE</oasis:entry>
         <oasis:entry colname="col4">793 (159)</oasis:entry>
         <oasis:entry colname="col5">111 (31)</oasis:entry>
         <oasis:entry colname="col6">yes</oasis:entry>
         <oasis:entry colname="col7">compl-lo</oasis:entry>
         <oasis:entry colname="col8">wet-P</oasis:entry>
       </oasis:row>
       <oasis:row>
         <oasis:entry colname="col1">Andorra</oasis:entry>
         <oasis:entry colname="col2">Andorra</oasis:entry>
         <oasis:entry colname="col3">AND</oasis:entry>
         <oasis:entry colname="col4">266 (169)</oasis:entry>
         <oasis:entry colname="col5">201 (158)</oasis:entry>
         <oasis:entry colname="col6">yes</oasis:entry>
         <oasis:entry colname="col7">compl-lo</oasis:entry>
         <oasis:entry colname="col8">wet-P/VP</oasis:entry>
       </oasis:row>
       <oasis:row>
         <oasis:entry colname="col1">Sweden</oasis:entry>
         <oasis:entry colname="col2">Sweden</oasis:entry>
         <oasis:entry colname="col3">SWE</oasis:entry>
         <oasis:entry colname="col4">1419 (412)</oasis:entry>
         <oasis:entry colname="col5">133 (71)</oasis:entry>
         <oasis:entry colname="col6">yes</oasis:entry>
         <oasis:entry colname="col7">compl-lo</oasis:entry>
         <oasis:entry colname="col8">wet-P/VP</oasis:entry>
       </oasis:row>
       <oasis:row>
         <oasis:entry colname="col1">Austria</oasis:entry>
         <oasis:entry colname="col2">Kärnten</oasis:entry>
         <oasis:entry colname="col3">KAE</oasis:entry>
         <oasis:entry colname="col4">627 (272)</oasis:entry>
         <oasis:entry colname="col5">239 (125)</oasis:entry>
         <oasis:entry colname="col6">yes</oasis:entry>
         <oasis:entry colname="col7">compl-lo</oasis:entry>
         <oasis:entry colname="col8">wet-VP</oasis:entry>
       </oasis:row>
       <oasis:row>
         <oasis:entry colname="col1">Italy</oasis:entry>
         <oasis:entry colname="col2">Livigno</oasis:entry>
         <oasis:entry colname="col3">LIV</oasis:entry>
         <oasis:entry colname="col4">318 (244)</oasis:entry>
         <oasis:entry colname="col5">157 (119)</oasis:entry>
         <oasis:entry colname="col6">yes</oasis:entry>
         <oasis:entry colname="col7">compl-lo</oasis:entry>
         <oasis:entry colname="col8">wet-VP</oasis:entry>
       </oasis:row>
       <oasis:row>
         <oasis:entry colname="col1">Switzerland</oasis:entry>
         <oasis:entry colname="col2">Switzerland</oasis:entry>
         <oasis:entry colname="col3">SWI</oasis:entry>
         <oasis:entry colname="col4">3876 (423)</oasis:entry>
         <oasis:entry colname="col5">431 (169)</oasis:entry>
         <oasis:entry colname="col6">no</oasis:entry>
         <oasis:entry colname="col7">compl-lo</oasis:entry>
         <oasis:entry colname="col8">wet-VP</oasis:entry>
       </oasis:row>
       <oasis:row>
         <oasis:entry colname="col1">Great Britain</oasis:entry>
         <oasis:entry colname="col2">Scotland</oasis:entry>
         <oasis:entry colname="col3">SCO</oasis:entry>
         <oasis:entry colname="col4">1126 (348)</oasis:entry>
         <oasis:entry colname="col5">–</oasis:entry>
         <oasis:entry colname="col6">yes</oasis:entry>
         <oasis:entry colname="col7">compl-lo</oasis:entry>
         <oasis:entry colname="col8">–</oasis:entry>
       </oasis:row>
     </oasis:tbody>
   </oasis:tgroup></oasis:table><table-wrap-foot><p id="d2e424"><sup>a</sup> not possible to filter unique cases as described in text. <sup>b</sup> Bozen–Südtirol/Bolzano–Alto Adige.</p></table-wrap-foot></table-wrap>

</sec>
<sec id="Ch1.S3.SS2">
  <label>3.2</label><title>Varying forecasting practices and data harmonization</title>
      <p id="d2e1317">Forecasting practices varied both across and within services over the 3-year study period, requiring careful standardization to enable meaningful comparisons. These differences concerned not only the format of the data provided for this analysis, but also how the EAWS-Matrix and the factor assessments were applied in practice. While we do not have detailed knowledge of service-specific procedures, the following examples illustrate key variations, some of which may reflect differences in data availability for this study rather than differences in actual workflows, and how we addressed them.</p>
<sec id="Ch1.S3.SS2.SSS1">
  <label>3.2.1</label><title>Avalanche problem-specific assessments</title>
      <p id="d2e1327">Most services assessed each avalanche problem individually and assigned it a corresponding danger level. However, some issued a single overarching danger level despite assessing factors per problem (e.g., BAY in first year), grouped problems into broader categories (e.g., SWI separated dry-snow avalanche problems from wet- or gliding snow avalanche problems), or assessed the factors without assigning them to a specific problem (e.g., SCO). In certain services (e.g., SWE), no factor assessments were made when no avalanche problem was identified, a situation commonly observed for danger level 1 (low). Others documented such situations using the label <italic>no distinct avalanche problem</italic> (e.g., SVK, SWI).</p>
</sec>
<sec id="Ch1.S3.SS2.SSS2">
  <label>3.2.2</label><title>Matrix integration in workflow</title>
      <p id="d2e1341">Most services had integrated the EAWS-Matrix into their operational workflows, either using it to directly link factor assessments to a suggested danger level or as a reference framework. A notable exception was Switzerland (SWI), where forecasters assessed factors and danger levels independently, with the latter determined during group discussions rather than derived from the Matrix <xref ref-type="bibr" rid="bib1.bibx35" id="paren.23"/>.</p>
</sec>
<sec id="Ch1.S3.SS2.SSS3">
  <label>3.2.3</label><title>Assessment of Matrix input factors</title>
      <p id="d2e1355">The majority of services used the EAWS-defined ordinal classes for estimating the three Matrix input factors <xref ref-type="bibr" rid="bib1.bibx6" id="paren.24"/> (Table <xref ref-type="table" rid="T2"/>), though some used North American terminology (as defined in the CMAH; <xref ref-type="bibr" rid="bib1.bibx30" id="altparen.25"/>) for assessing stability and frequency (e.g., SCO, SWE). These were subsequently mapped to EAWS-classes based on <xref ref-type="bibr" rid="bib1.bibx22" id="text.26"/>. In Livigno (LIV), the Matrix was applied operationally, but the published forecasts followed North American terminology for describing the sensitivity to triggers. Additionally, some services implemented tools that allowed forecasters to express tendencies within a class. These included services using the ALBINA forecasting software platform <xref ref-type="bibr" rid="bib1.bibx21" id="paren.27"/>, as used for instance in BOZ, TIR, and TRE, as well as the Swiss forecasting software <xref ref-type="bibr" rid="bib1.bibx35" id="paren.28"/>. In ALBINA, tendencies within classes can be expressed via sliders allowing near-continuous values between 0 and 100 for snowpack stability, frequency of the lowest snowpack stability class, and avalanche size (Fig. <xref ref-type="fig" rid="F2"/>a). Class boundaries and underlying numerical values allow unambiguous conversion into the EAWS classes, for example, <italic>poor</italic> corresponds to values from 50 to 74, and <italic>very poor</italic> to values from 75 to 100. In Switzerland, forecasters make distinctions within classes using modifiers such as <italic>minus (−)</italic>, <italic>neutral (=)</italic>, or <italic>plus (+)</italic> (e.g., <italic>poor−</italic>, <italic>poor=</italic>, <italic>poor+</italic>), or selected intermediate labels straddling two adjacent classes (e.g., <italic>fair/poor</italic>) (Fig. <xref ref-type="fig" rid="F2"/>b). In SWI, factor assessments are used for internal purposes only, in ALBINA factors are published. For the purpose of this study, modifier-based entries (e.g., <italic>poor+</italic>) were aggregated into their primary class (<italic>poor</italic>), and intermediate labels (e.g., <italic>fair/poor</italic>) were randomly assigned to either the lower (<italic>fair</italic>) or higher (<italic>poor</italic>) class.</p>

      <fig id="F2" specific-use="star"><label>Figure 2</label><caption><p id="d2e1426">Examples of the user interface for assessing snowpack stability, frequency of snowpack stability, and expected avalanche size in the operational forecasting software used by some European warning services. Sliders allow forecasters to indicate tendencies within each factor class. Panel <bold>(a)</bold> shows the implementation in ALBINA (e.g., BOZ, TIR, TRE) with near-continuous sliders; panel <bold>(b)</bold> shows the interface used in Switzerland (SWI) with underlying modifier-based selection.</p></caption>
            <graphic xlink:href="https://nhess.copernicus.org/articles/26/1161/2026/nhess-26-1161-2026-f02.png"/>

          </fig>

</sec>
<sec id="Ch1.S3.SS2.SSS4">
  <label>3.2.4</label><title>Individual vs. group forecasting</title>
      <p id="d2e1450">In most warning services, a single forecaster was responsible for the forecast for either the entire domain or a specific part of the domain containing one or several micro-regions (e.g., in NOR or SCO), sometimes supported by a second forecaster for review (e.g. TIR). In contrast, SWI followed a fundamentally different approach: the forecasting process always involved at least two (but up to four) forecasters who independently prepared full forecast drafts for the entire forecast domain (referred to as <italic>suggestions</italic>). These suggestions were then consolidated in a group discussion prior to publication. For this study, we derived the median factor assessment from the individual suggestions and linked these to the issued danger level resulting from the group consensus.</p>
      <p id="d2e1456">To harmonize data across services, we applied the following procedure: for each day and micro-region, we selected the decisive avalanche problem, typically the first listed in the bulletin or the one associated with the highest danger level. If neither criterion was available, the first entry was retained. This step was performed separately for dry-snow avalanche problems and for wet-snow or gliding-snow avalanche problems. After selecting the decisive problem, we retained unique entries defined by the combination of date, issuing warning service and/or forecaster, avalanche problem, danger level, and the associated Matrix factors. If the same combination appeared in multiple micro-regions on the same day within the domain of a service, only one instance was retained. This approach was feasible for all services except NOR and MMT, where the data structure did not permit such filtering.</p>

<table-wrap id="T2" specific-use="star"><label>Table 2</label><caption><p id="d2e1463">Forecast parameters and their possible values. For brief descriptions, see Tables <xref ref-type="table" rid="TA1"/>–<xref ref-type="table" rid="TA3"/> in Appendix A, for full definitions, refer to <xref ref-type="bibr" rid="bib1.bibx23" id="text.29"/>.</p></caption><oasis:table frame="topbot"><oasis:tgroup cols="2">
     <oasis:colspec colnum="1" colname="col1" align="left"/>
     <oasis:colspec colnum="2" colname="col2" align="left"/>
     <oasis:thead>
       <oasis:row rowsep="1">
         <oasis:entry colname="col1">Forecast parameter</oasis:entry>
         <oasis:entry colname="col2">Possible values</oasis:entry>
       </oasis:row>
     </oasis:thead>
     <oasis:tbody>
       <oasis:row>
         <oasis:entry colname="col1">Avalanche problem</oasis:entry>
         <oasis:entry colname="col2">dry snow<sup>*</sup>: new snow, wind slab, persistent weak layer, no distinct problem</oasis:entry>
       </oasis:row>
       <oasis:row rowsep="1">
         <oasis:entry colname="col1"/>
         <oasis:entry colname="col2">wet snow, gliding snow</oasis:entry>
       </oasis:row>
       <oasis:row rowsep="1">
         <oasis:entry colname="col1">Snowpack stability</oasis:entry>
         <oasis:entry colname="col2">very poor, poor, fair, good</oasis:entry>
       </oasis:row>
       <oasis:row rowsep="1">
         <oasis:entry colname="col1">Frequency of snowpack stability</oasis:entry>
         <oasis:entry colname="col2">many, some, a few, (nearly) none</oasis:entry>
       </oasis:row>
       <oasis:row>
         <oasis:entry colname="col1">Avalanche size</oasis:entry>
         <oasis:entry colname="col2">5 – extremely large, 4 – very large, 3 – large, 2 – medium, 1 – small</oasis:entry>
       </oasis:row>
     </oasis:tbody>
   </oasis:tgroup></oasis:table><table-wrap-foot><p id="d2e1473"><sup>*</sup> These avalanche problems are analyzed together as problems relating to dry-snow conditions.</p></table-wrap-foot></table-wrap>

</sec>
</sec>
</sec>
<sec id="Ch1.S4">
  <label>4</label><title>Methods</title>
      <p id="d2e1571">The analysis was conducted in three main steps. First, we examined Matrix usage in aggregate, without distinguishing between individual warning services or specific avalanche problems. This allowed us to evaluate whether forecasters assigned danger levels consistently to factor combinations and whether  some combinations of factors resulted in a wider range of danger level assignments. In the second step, we grouped warning services by shared operational characteristics and analyzed differences in Matrix usage across two avalanche problem categories: (i) <italic>dry-snow</italic> avalanche problems, including <italic>new snow</italic>, <italic>wind slab</italic>, <italic>persistent weak layer</italic>, and <italic>no distinct avalanche problem</italic>; and (ii) <italic>wet-snow</italic> and <italic>gliding-snow</italic> avalanche problems <xref ref-type="bibr" rid="bib1.bibx8" id="paren.30"/>. This stratification enabled us to investigate whether observed patterns were consistent across services and groups of avalanche problems, or whether they reflected local practices, compliance with the Matrix, or differing conceptual models. Finally, we took advantage of the finer-granularity assessments of input factors available in some warning services (BOZ, SWI, TIR, TRE) and explored whether tendencies within classes helped explain variations in danger level assignments within specific Matrix cells.</p>
<sec id="Ch1.S4.SS1">
  <label>4.1</label><title>Matrix compliance</title>
      <p id="d2e1606">We analyzed each individual forecast to assess compliance with the Matrix, i.e., how often the issued danger level matched the Matrix-suggested level (<inline-formula><mml:math id="M16" display="inline"><mml:mrow><mml:msup><mml:mi>D</mml:mi><mml:mn mathvariant="normal">1</mml:mn></mml:msup></mml:mrow></mml:math></inline-formula>). To do so, we used the indicated snowpack stability, frequency of snowpack stability, and avalanche size as inputs to the Matrix to derive the Matrix-suggested danger level (<inline-formula><mml:math id="M17" display="inline"><mml:mrow><mml:msup><mml:mi>D</mml:mi><mml:mn mathvariant="normal">1</mml:mn></mml:msup></mml:mrow></mml:math></inline-formula>) resulting from a strict application of the Matrix. We then compared <inline-formula><mml:math id="M18" display="inline"><mml:mrow><mml:msup><mml:mi>D</mml:mi><mml:mn mathvariant="normal">1</mml:mn></mml:msup></mml:mrow></mml:math></inline-formula> with the danger level issued in the forecast (<inline-formula><mml:math id="M19" display="inline"><mml:mrow><mml:msup><mml:mi>D</mml:mi><mml:mi mathvariant="normal">fx</mml:mi></mml:msup></mml:mrow></mml:math></inline-formula>), calculating the proportion of disagreements (<inline-formula><mml:math id="M20" display="inline"><mml:mrow><mml:msup><mml:mi>D</mml:mi><mml:mi mathvariant="normal">fx</mml:mi></mml:msup></mml:mrow></mml:math></inline-formula> <inline-formula><mml:math id="M21" display="inline"><mml:mo>≠</mml:mo></mml:math></inline-formula> <inline-formula><mml:math id="M22" display="inline"><mml:mrow><mml:msup><mml:mi>D</mml:mi><mml:mn mathvariant="normal">1</mml:mn></mml:msup></mml:mrow></mml:math></inline-formula>), <inline-formula><mml:math id="M23" display="inline"><mml:mrow><mml:msub><mml:mi>P</mml:mi><mml:mi mathvariant="normal">disagree</mml:mi></mml:msub></mml:mrow></mml:math></inline-formula>, for each warning service and avalanche problem category (Table <xref ref-type="table" rid="T2"/>). We use <inline-formula><mml:math id="M24" display="inline"><mml:mrow><mml:msub><mml:mi>P</mml:mi><mml:mi mathvariant="normal">disagree</mml:mi></mml:msub></mml:mrow></mml:math></inline-formula> as measure of Matrix compliance.</p>
</sec>
<sec id="Ch1.S4.SS2">
  <label>4.2</label><title>Grouping of warning services</title>
      <p id="d2e1715">Warning services were grouped to simplify analysis and presentation while retaining sufficient detail to examine key differences in how the Matrix was applied. Grouping also increased the size of the resulting data subsets relative to analyses at the level of individual services, thereby enabling more statistically robust findings. To preserve meaningful distinctions between services, we applied two grouping strategies:</p>
      <p id="d2e1718">First, for dry-snow avalanche problems, we grouped services based on their degree of Matrix compliance (Sect. <xref ref-type="sec" rid="Ch1.S4.SS1"/>). We defined three groups using the 50th and 75th percentiles of the disagreement rate (<inline-formula><mml:math id="M25" display="inline"><mml:mrow><mml:msub><mml:mi>P</mml:mi><mml:mi mathvariant="normal">disagree</mml:mi></mml:msub></mml:mrow></mml:math></inline-formula>) as thresholds (see Sect. <xref ref-type="sec" rid="Ch1.S5.SS2.SSS1"/>, see the <italic>compliance</italic> column in Table <xref ref-type="table" rid="T1"/>). For the purpose of analysis, we focused on the two groups with the highest and lowest compliance rates: <list list-type="bullet"><list-item>
      <p id="d2e1744">(compl-hi) Operational use with greater Matrix compliance (<inline-formula><mml:math id="M26" display="inline"><mml:mrow><mml:msub><mml:mi>P</mml:mi><mml:mi mathvariant="normal">disagree</mml:mi></mml:msub></mml:mrow></mml:math></inline-formula> <inline-formula><mml:math id="M27" display="inline"><mml:mo>&lt;</mml:mo></mml:math></inline-formula> 0.04), and</p></list-item><list-item>
      <p id="d2e1766">(compl-lo) Operational use with lower Matrix compliance (<inline-formula><mml:math id="M28" display="inline"><mml:mrow><mml:msub><mml:mi>P</mml:mi><mml:mi mathvariant="normal">disagree</mml:mi></mml:msub></mml:mrow></mml:math></inline-formula> <inline-formula><mml:math id="M29" display="inline"><mml:mo>≥</mml:mo></mml:math></inline-formula> 0.07). We included SWI in this group, the only service not using the Matrix in an operational context, as <inline-formula><mml:math id="M30" display="inline"><mml:mrow><mml:msub><mml:mi>P</mml:mi><mml:mi mathvariant="normal">disagree</mml:mi></mml:msub></mml:mrow></mml:math></inline-formula> for SWI was similar to some other services using the Matrix but with low compliance.</p></list-item></list></p>
      <p id="d2e1798">Second, for wet-snow and gliding-snow avalanche problems, we grouped services based on their use of the <italic>very poor</italic> stability class, which varied considerably between warning services (see Section <xref ref-type="sec" rid="Ch1.S5.SS2.SSS2"/>). We split services using the 33rd and 67th percentiles of the proportion of <italic>very poor</italic> classifications (see the <italic>wet-snow/gliding-snow</italic> column in Table <xref ref-type="table" rid="T1"/>), and focused on the groups with the lowest and highest usage of <italic>very poor</italic> stability: <list list-type="bullet"><list-item>
      <p id="d2e1820">(wet-P) Proportion of <italic>very poor</italic> stability <inline-formula><mml:math id="M31" display="inline"><mml:mrow><mml:msub><mml:mi>P</mml:mi><mml:mrow><mml:mi mathvariant="normal">stab</mml:mi><mml:mo>:</mml:mo><mml:mspace linebreak="nobreak" width="0.125em"/><mml:mi mathvariant="normal">very</mml:mi><mml:mspace width="0.125em" linebreak="nobreak"/><mml:mi mathvariant="normal">poor</mml:mi></mml:mrow></mml:msub></mml:mrow></mml:math></inline-formula> <inline-formula><mml:math id="M32" display="inline"><mml:mo>&lt;</mml:mo></mml:math></inline-formula> 0.25, and</p></list-item><list-item>
      <p id="d2e1854">(wet-VP) Proportion of <italic>very poor</italic> stability <inline-formula><mml:math id="M33" display="inline"><mml:mrow><mml:msub><mml:mi>P</mml:mi><mml:mrow><mml:mi mathvariant="normal">stab</mml:mi><mml:mo>:</mml:mo><mml:mspace width="0.125em" linebreak="nobreak"/><mml:mi mathvariant="normal">very</mml:mi><mml:mspace width="0.125em" linebreak="nobreak"/><mml:mi mathvariant="normal">poor</mml:mi></mml:mrow></mml:msub></mml:mrow></mml:math></inline-formula> <inline-formula><mml:math id="M34" display="inline"><mml:mo>≥</mml:mo></mml:math></inline-formula> 0.56.</p></list-item></list> Warning services representing intermediate levels of Matrix compliance (compl-mid) and intermediate use of the <italic>very poor</italic> stability class (wet-P/VP) were excluded from the main analysis to maintain focus on key contrasts. However, we present their results in the Appendix Figs. C1 and C2  for completeness.</p>
      <p id="d2e1891">Since the Scottish data (SCO) did not include avalanche problems linked to specific forecasts, we assumed that most cases represented dry-snow avalanche problems. This assumption is supported by information from the Scottish Avalanche Information Service indicating that dry-snow avalanche problems predominate (Mark Diggins, personal communication, 2025).<fn id="Ch1.Footn1"><p id="d2e1894">Across the six forecast regions covered by the Scottish Avalanche Information Service, and excluding the cornice problem, dry-snow problems accounted for 67 %–90 % of all assigned avalanche problems during the 2024/2025 season (unpublished data provided by Mark Diggins, Scottish Avalanche Information Service, 2025).</p></fn></p>
</sec>
<sec id="Ch1.S4.SS3">
  <label>4.3</label><title>Matrix cell usage</title>
      <p id="d2e1905">For each warning service and for each danger level separately, we calculated the proportion of forecast cases assigned to each Matrix cell. Danger levels that occurred in three or fewer cases during the study period were excluded from the analysis for the corresponding service. We then averaged these cell-specific proportions across all warning services and normalized the resulting distributions so that proportions summed to one.</p>
      <p id="d2e1908">In addition to this overall analysis, we repeated the same procedure stratified by group of avalanche problems (dry snow vs. wet and gliding snow; Table <xref ref-type="table" rid="T2"/>). For these strata, we further derived normalized mean cell-usage distributions for groups of warning services with shared characteristics in Matrix compliance for dry-snow problems and use of <italic>very poor</italic> stability for wet-snow or gliding snow problems (Sect. <xref ref-type="sec" rid="Ch1.S4.SS2"/>).</p>
</sec>
<sec id="Ch1.S4.SS4">
  <label>4.4</label><title>Detecting patterns within Matrix cells</title>
      <p id="d2e1927">To better understand how individual factor combinations relate to the issued danger levels, and whether tendencies within a factor class toward higher or lower classes correspond to more frequent use of alternative danger levels, we exploited the finer-grained assessments of the three Matrix factors available from SWI and the warning services in BOZ, TIR, TRE using the ALBINA software from the 2024–2025 forecast season. Focusing on the most frequently used factor combination <italic>poor-some-size 2</italic> for dry-snow avalanche problems (Fig. <xref ref-type="fig" rid="FD1"/>), we identified combinations of sub-classes, which most often predicted one of the two danger levels as shown in the Matrix. Beside presenting the respective data, we derived decision boundaries applying Classification and Regression Trees (CART, <xref ref-type="bibr" rid="bib1.bibx1" id="altparen.31"/>), as they are well suited for detecting decision patterns in multi-factor ordinal data. A detailed description of this approach can be found in  Appendix <xref ref-type="sec" rid="App1.Ch1.S2"/>.</p>
</sec>
</sec>
<sec id="Ch1.S5">
  <label>5</label><title>Results</title>
<sec id="Ch1.S5.SS1">
  <label>5.1</label><title>Overall characterization of danger levels with the Matrix</title>
      <p id="d2e1957">Our analysis of the overall Matrix use without distinguishing between degrees of Matrix compliance or considering avalanche problems, shows how different combinations of stability, frequency, and avalanche size were assigned to issued danger levels (Fig. <xref ref-type="fig" rid="F3"/>). Each cell represents a specific Matrix factor combination, with color intensity indicating the percentage of cases with which that combination was used for a particular danger level.</p>

      <fig id="F3"><label>Figure 3</label><caption><p id="d2e1964">Matrix use by danger level: percentage of cases that a specific danger level (rows from top: <inline-formula><mml:math id="M35" display="inline"><mml:mi>D</mml:mi></mml:math></inline-formula> <inline-formula><mml:math id="M36" display="inline"><mml:mo>=</mml:mo></mml:math></inline-formula> 1 (low) to <inline-formula><mml:math id="M37" display="inline"><mml:mi>D</mml:mi></mml:math></inline-formula> <inline-formula><mml:math id="M38" display="inline"><mml:mo>=</mml:mo></mml:math></inline-formula> 4 (high)) was used for a specific matrix combination. Colour intensity corresponds to the percentage. Cells with less than 1 % usage are not shown. Values highlighted bold correspond to use of Matrix-suggested danger levels (<inline-formula><mml:math id="M39" display="inline"><mml:mrow><mml:msup><mml:mi>D</mml:mi><mml:mn mathvariant="normal">1</mml:mn></mml:msup></mml:mrow></mml:math></inline-formula>, see Fig. <xref ref-type="fig" rid="F1"/>), values in italics represent the optional level (<inline-formula><mml:math id="M40" display="inline"><mml:mrow><mml:msup><mml:mi>D</mml:mi><mml:mn mathvariant="normal">2</mml:mn></mml:msup></mml:mrow></mml:math></inline-formula>).</p></caption>
          <graphic xlink:href="https://nhess.copernicus.org/articles/26/1161/2026/nhess-26-1161-2026-f03.png"/>

        </fig>

      <p id="d2e2026">At danger level 1 (low), only 17 % of cases were described using <italic>very poor</italic> stability suggesting that natural avalanches or very easy triggering conditions (Table <xref ref-type="table" rid="TA1"/>) were seldom considered. The combined percentages of cases classified as <italic>very poor</italic> or <italic>poor</italic> was 44 %. According to the intended logic of the Matrix, <italic>fair</italic> stability should only have been selected if both <italic>very poor</italic> and <italic>poor</italic> stability were assessed as <italic>none or nearly none</italic> in the region. Thus, the use of fair stability in more than half of the cases (56 %) implies that often neither natural avalanche activity nor human-triggered avalanches were considered as typical triggers. Frequency class <italic>few</italic> is used in 98 % of the cases, underlining that danger level 1 is associated with few triggering spots. Avalanche size was typically classified as size 1, with a smaller share (20 %) of size 2. It is important to note that not all warning services assessed Matrix factors when no avalanche problem was identified. Consequently, the percentages shown in Fig. <xref ref-type="fig" rid="F3"/> do not equally reflect all services. For instance, for SWE, 39 % of the danger level 1 forecasts lacked factor estimates.</p>
      <p id="d2e2059">At level 2 (moderate), usage was concentrated in the <italic>poor</italic> stability panel (62 %), which is typically associated with human-triggered avalanches (Table <xref ref-type="table" rid="TA1"/>). In 16 % of forecasts, stability was described as <italic>very poor</italic>, most often in connection with avalanche size 2. The frequency of locations with <italic>very poor</italic> or <italic>poor</italic> stability was most commonly assessed as <italic>a few</italic> or <italic>some</italic>, never <italic>many</italic>. 17 % of cases would have been classified as <italic>none or nearly none</italic> for these two stability classes, corresponding to situations where stability was assessed as <italic>fair</italic>. Avalanche size was predominantly size 2, accounting for 87 % of cases.</p>
      <p id="d2e2092">At level 3 (considerable), stability was equally often described as <italic>very poor</italic> and <italic>poor</italic>, with frequency most commonly assessed as <italic>some</italic> (82 %). Avalanche size was typically size 2 or size 3. Notably, size 3 was more frequently associated with <italic>poor</italic> stability, while size 2 was used more often in combination with <italic>very poor</italic> stability.</p>
      <p id="d2e2110">At level 4 (high), there was a strong concentration in the cell <italic>very poor-many-size 3</italic>, which was used in 60 % of cases. Nonetheless, in 16 % of the cases, forecasters chose <italic>poor</italic> stability in combination with <italic>many</italic> locations and avalanche size 3 or 4. These situations are often referred to as “skier-level 4”, situations where human triggering is the dominant triggering mechanism with few or no natural avalanches expected to occur (e.g., <xref ref-type="bibr" rid="bib1.bibx29" id="altparen.32"/>).</p>
      <p id="d2e2125">Danger level 5 (very high) was issued only once during the study period and is therefore not shown in Fig. <xref ref-type="fig" rid="F3"/>; in this case, it was described as <italic>very poor-many-size 4</italic>.</p>
      <p id="d2e2133">Across all danger levels, most Matrix cells were predominantly used for a single danger level, with two important exceptions highlighted in Fig. <xref ref-type="fig" rid="F3"/>. The cell <italic>poor-some-size 2</italic> was the most frequently used combination for level 2 (moderate) (31 %), but also accounted for 13 % of level 3 (considerable) assessments. Similarly, the cell <italic>very poor-some-size 3</italic> was used in 12 % of level 3 cases and 7 % of level 4 cases.</p>
</sec>
<sec id="Ch1.S5.SS2">
  <label>5.2</label><title>Avalanche-problem specific characterization of danger levels with the Matrix</title>
      <p id="d2e2152">We analyzed Matrix usage separately for dry-snow and for wet- or gliding snow avalanche problems. The corresponding data contributions from each warning service are summarized in Table <xref ref-type="table" rid="T1"/>.</p>
<sec id="Ch1.S5.SS2.SSS1">
  <label>5.2.1</label><title>Avalanche problems relating to dry-snow conditions</title>
      <p id="d2e2164">Comparing the issued danger level with the Matrix-suggested danger level showed that nearly all warning services occasionally deviated from the Matrix (median <inline-formula><mml:math id="M41" display="inline"><mml:mrow><mml:msub><mml:mi>P</mml:mi><mml:mi mathvariant="normal">disagree</mml:mi></mml:msub></mml:mrow></mml:math></inline-formula> <inline-formula><mml:math id="M42" display="inline"><mml:mo>=</mml:mo></mml:math></inline-formula> 4 %; Fig. <xref ref-type="fig" rid="F4"/>a). The highest disagreement rates were observed in Andorra (AND), Scotland (SCO), and Switzerland (SWI), where deviations occurred in 36 %–39 % of cases. Piemonte (PIE), Livigno (LIV), and Sweden (SWE) also showed comparably frequent deviations (10 %–21 %), while in half of the warning services, deviations were relatively rare (<inline-formula><mml:math id="M43" display="inline"><mml:mrow><mml:msub><mml:mi>P</mml:mi><mml:mi mathvariant="normal">disagree</mml:mi></mml:msub></mml:mrow></mml:math></inline-formula> <inline-formula><mml:math id="M44" display="inline"><mml:mo>&lt;</mml:mo></mml:math></inline-formula> 4 %).</p>

      <fig id="F4" specific-use="star"><label>Figure 4</label><caption><p id="d2e2207">Proportion of disagreement between Matrix-suggested and issued danger level (<inline-formula><mml:math id="M45" display="inline"><mml:mrow><mml:msub><mml:mi>P</mml:mi><mml:mi mathvariant="normal">disagree</mml:mi></mml:msub></mml:mrow></mml:math></inline-formula>) for <bold>(a)</bold> dry-snow avalanche problems and <bold>(b)</bold> for wet- and gliding snow avalanche problems. In panel <bold>(a)</bold>, use of the Matrix is shown on the <inline-formula><mml:math id="M46" display="inline"><mml:mi>x</mml:mi></mml:math></inline-formula> axis, in panel <bold>(b)</bold> the proportion of <italic>very poor</italic> stability assessments (<inline-formula><mml:math id="M47" display="inline"><mml:mrow><mml:msub><mml:mi>P</mml:mi><mml:mrow><mml:mi mathvariant="normal">stab</mml:mi><mml:mo>:</mml:mo><mml:mspace width="0.125em" linebreak="nobreak"/><mml:mi mathvariant="normal">very</mml:mi><mml:mspace width="0.125em" linebreak="nobreak"/><mml:mi mathvariant="normal">poor</mml:mi></mml:mrow></mml:msub></mml:mrow></mml:math></inline-formula>) is shown. Each point represents one warning service, colored by group as defined in Sect. <xref ref-type="sec" rid="Ch1.S4.SS2"/>.</p></caption>
            <graphic xlink:href="https://nhess.copernicus.org/articles/26/1161/2026/nhess-26-1161-2026-f04.png"/>

          </fig>

      <p id="d2e2272">Figure <xref ref-type="fig" rid="F5"/> illustrates how the issued danger levels for dry-snow avalanche problems were assigned to combinations of stability, frequency, and avalanche size. Results are shown for the groups introduced in Fig. <xref ref-type="fig" rid="F4"/>a and Sect. <xref ref-type="sec" rid="Ch1.S4.SS2"/>. As expected, warning services with high compliance with the Matrix (group compl-hi) generally assigned the Matrix-suggested danger level. In these services, alternations between the two danger levels shown in the Matrix were rare: the secondary danger level was only used in three factor combinations (<italic>fair-a few-size 1</italic>, <italic>poor-some-size 2</italic>, and <italic>very poor-some-size 3</italic>) in more than 1 % of cases. In contrast, forecasters in services with low Matrix compliance (group compl-lo, <inline-formula><mml:math id="M48" display="inline"><mml:mrow><mml:msub><mml:mi>P</mml:mi><mml:mi mathvariant="normal">disagree</mml:mi></mml:msub></mml:mrow></mml:math></inline-formula> <inline-formula><mml:math id="M49" display="inline"><mml:mo>≥</mml:mo></mml:math></inline-formula> 7 %) and presumably greater heterogeneity in how the Matrix is integrated into the forecasting process, showed a much broader distribution of danger levels across Matrix cells and, consequently, greater overlap in the use of the same factor combinations for different danger levels.</p>

      <fig id="F5" specific-use="star"><label>Figure 5</label><caption><p id="d2e2312">Matrix use for avalanche problems relating to dry-snow conditions: proportion of cases that a specific danger level (rows from top: <inline-formula><mml:math id="M50" display="inline"><mml:mi>D</mml:mi></mml:math></inline-formula> <inline-formula><mml:math id="M51" display="inline"><mml:mo>=</mml:mo></mml:math></inline-formula> 1 (low) to <inline-formula><mml:math id="M52" display="inline"><mml:mi>D</mml:mi></mml:math></inline-formula> <inline-formula><mml:math id="M53" display="inline"><mml:mo>=</mml:mo></mml:math></inline-formula> 4 (high)) was used for a specific matrix combination. Colour intensity corresponds to the proportion of cases. Cells with less than 1 % usage are not shown. Values highlighted bold correspond to use of Matrix-suggested danger levels, values in italics represent the second level. Group <italic>compl-hi</italic> forecasters rarely deviated from the Matrix-suggested danger level (high compliance), group <italic>compl-lo</italic> forecasters deviated <inline-formula><mml:math id="M54" display="inline"><mml:mo>≥</mml:mo></mml:math></inline-formula> 7 % of the time. The respective figure for warning services lying in between is shown in Fig. <xref ref-type="fig" rid="FC1"/> in Appendix C. Absolute use of Matrix cells, regardless of danger level, is shown in Fig. <xref ref-type="fig" rid="FD1"/> in Appendix D.</p></caption>
            <graphic xlink:href="https://nhess.copernicus.org/articles/26/1161/2026/nhess-26-1161-2026-f05.png"/>

          </fig>

      <p id="d2e2367">Two factor combinations, highlighted in Figure <xref ref-type="fig" rid="F5"/>, stood out for both groups: <italic>poor-some-size 2</italic> and <italic>very poor-some-size 3</italic>. These combinations were among those where deviations from the Matrix-suggested danger level were more frequent, even among forecasters with high Matrix compliance. According to the Matrix (Fig. <xref ref-type="fig" rid="F1"/>), the suggested danger level for <italic>poor-some-size 2</italic> is 2 (moderate), with 3 (considerable) as the second option. Warning services with low Matrix compliance used this factor combination equally often to describe danger levels 3 (considerable) (26 % of level 3 cases) as 2 (moderate) (27 %) (Fig. <xref ref-type="fig" rid="F5"/>b). In contrast, warning services with high Matrix compliance rarely used the <italic>poor-some-size 2</italic> cell when choosing 3 (considerable). Instead, they typically selected neighbouring Matrix cells that differed in one factor, either <italic>very poor</italic> stability rather than <italic>poor</italic>, <italic>many</italic> rather than <italic>some</italic>, or <italic>size 3</italic> rather than <italic>size 2</italic>, when choosing 3 (considerable). Nonetheless, even in this high compliance group, 7 % of all 3 (considerable) assessments fell into the <italic>poor-some-size 2</italic> combination. A second notable difference involved the factor combination <italic>very poor-some-size 3</italic>, where the Matrix suggests 3 (considerable) and 4 (high) as the secondary level. Services with low Matrix compliance used this combination for 4 (high) in 25 % of cases, while it was less frequently used for 3 (considerable) (10 %). In contrast, warning services with high Matrix compliance used this combination in 11 % of cases for 3 (considerable), and only 3 % of cases for 4 (high). These two factor combinations appear to represent key transition zones between these danger levels, where danger level assignment differs depending on the degree of Matrix compliance.</p>
      <p id="d2e2414">Despite these differences, both groups shared several patterns. For 1 (low), the most frequently used combinations were <italic>fair-a few-size 1</italic> or <italic>poor-a few-size 1</italic>, for 2 (moderate), the typical combinations were <italic>poor-some-size 2</italic> and <italic>poor-a few-size 2</italic>. And when issuing 4 (high), both groups predominantly used the factor combination <italic>very poor-many-size 3</italic>.</p>
      <p id="d2e2432">Although the Matrix required forecasters to classify conditions into a small number of discrete categories, the underlying processes are inherently continuous. In BOZ, TIR, and TRE (using the ALBINA software), as well as in SWI, forecasters were able to assess the Matrix factors with finer granularity (Fig. <xref ref-type="fig" rid="F2"/>). This allowed a closer examination of how subtle differences in factor assessments translated into differences in the issued danger level. We focused on the combination <italic>poor-some-size 2</italic>, the most frequently used Matrix cell (Fig. <xref ref-type="fig" rid="FD1"/>), which also showed considerable variation in the issued danger level (Fig. <xref ref-type="fig" rid="F5"/>), either 2 (moderate), as suggested by the Matrix, or 3 (considerable).</p>
      <p id="d2e2444">For the three warning services BOZ, TIR, TRE (using ALBINA) and for SWI separately, Fig. <xref ref-type="fig" rid="F6"/> shows how the issued danger levels varied within this cell. Panels (a) and (b) display the distribution of danger levels as a function of snowpack stability and frequency, while avalanche size was fixed to <italic>size 2</italic>. Panels (c) and (d) focus on cases where stability was classified as <italic>poor</italic>, showing how danger levels varied as a function of frequency and size, corresponding directly to the Matrix cell shown in Fig. <xref ref-type="fig" rid="F1"/>.</p>

      <fig id="F6" specific-use="star"><label>Figure 6</label><caption><p id="d2e2460">Distribution of issued danger levels as a function of factor combinations for different warning services, illustrating tendencies within coarse Matrix categories for the Matrix cell <italic>poor-some-size 2</italic>. Panels <bold>(a)</bold> and <bold>(b)</bold> show stability and frequency (with avalanche size fixed to 2), while panels <bold>(c)</bold> and <bold>(d)</bold> show frequency and size (with stability fixed to <italic>poor</italic>). The left column <bold>(a, c)</bold> presents data from BOZ, TIR, and TRE, which use the ALBINA forecasting software (see also Fig. <xref ref-type="fig" rid="F2"/>); the right column <bold>(b, d)</bold> shows data from SWI. Issued danger levels are indicated by circles <bold>(a, c)</bold> or pie segments <bold>(b, d)</bold>. Background colors reflect the predicted danger level based on the CART model.</p></caption>
            <graphic xlink:href="https://nhess.copernicus.org/articles/26/1161/2026/nhess-26-1161-2026-f06.png"/>

          </fig>

      <p id="d2e2502">In SWI, the issued danger levels were either 2 (moderate) (42 %) or 3 (considerable) (57 %); 1 % (12) of the 964 cases were assigned to level 1 (low). In contrast, the ALBINA services issued level 2 in 75 % and level 3 in 25 % of 282 cases. Despite this difference in the relative preference for level 2 versus level 3, similar tendencies emerged across groups. Slider positions leaning toward <italic>very poor</italic> stability and <italic>many</italic> locations were more often associated with level 3 (considerable) (Fig. <xref ref-type="fig" rid="F6"/>a, b), whereas shifts toward <italic>fair</italic> stability and a <italic>few</italic> locations tended to result in level 2 (moderate). The services BOZ, TIR, TRE showed a pronounced diagonal pattern in slider use, highlighting the interplay between stability and frequency (Fig. <xref ref-type="fig" rid="F6"/>a). In SWI, the proportion of level 2 (moderate) assignments decreased systematically with decreasing stability and increasing frequency (Fig. <xref ref-type="fig" rid="F6"/>b). A similar trend was observed for the combination of frequency and size (Fig. <xref ref-type="fig" rid="F6"/>c, d): danger level 3 (considerable) was more often given when both factors tended towards higher categories. The CART predictions shown in the figure backgrounds closely mirrored these tendencies. While the prediction surfaces were generally well defined, some scatter was present (notably in Fig. <xref ref-type="fig" rid="F6"/>a).</p>
</sec>
<sec id="Ch1.S5.SS2.SSS2">
  <label>5.2.2</label><title>Wet-snow and gliding snow avalanche problems</title>
      <p id="d2e2536">Figure <xref ref-type="fig" rid="F4"/>b illustrates the use of <italic>very poor</italic> stability to describe wet-snow or gliding snow avalanche problems and the proportion of disagreements between the issued danger level and the Matrix-suggested danger level. While forecasters in several services (notably SWI, TIR, and BAY) predominantly assessed wet-snow and gliding snow avalanche problems using <italic>very poor</italic> stability, a few services never used this stability class. This was striking, given that wet-snow and gliding snow avalanches are typically associated with natural avalanche release <xref ref-type="bibr" rid="bib1.bibx28" id="paren.33"/>, and are thus conceptually linked to <italic>very poor</italic> stability (Table <xref ref-type="table" rid="TA1"/>, see also Tables A2 and A3 in <xref ref-type="bibr" rid="bib1.bibx23" id="altparen.34"/>). This divergence in the use of the stability classes was also reflected in the degree of disagreement with the Matrix-suggested danger level. Services that used <italic>very poor</italic> stability more frequently also tended to show greater disagreement, and vice versa (Spearman <inline-formula><mml:math id="M55" display="inline"><mml:mi mathvariant="italic">ρ</mml:mi></mml:math></inline-formula> <inline-formula><mml:math id="M56" display="inline"><mml:mo>=</mml:mo></mml:math></inline-formula> 0.38, <inline-formula><mml:math id="M57" display="inline"><mml:mi>p</mml:mi></mml:math></inline-formula> <inline-formula><mml:math id="M58" display="inline"><mml:mo>&lt;</mml:mo></mml:math></inline-formula> 0.1). Notable exceptions included BAY, where stability was described as <italic>very poor</italic> in most situations (92 %) while high agreement with the Matrix was maintained (99 %), and Piemonte (PIE), where <inline-formula><mml:math id="M59" display="inline"><mml:mrow><mml:msub><mml:mi>P</mml:mi><mml:mi mathvariant="normal">disagree</mml:mi></mml:msub></mml:mrow></mml:math></inline-formula> was high (25 %) even though stability was essentially never assessed as <italic>very poor</italic> (1 %). In the case of BAY, forecasters typically assigned smaller avalanche sizes to the same danger levels compared to, for instance, SWI and TIR, who showed much lower agreement with the Matrix-suggested danger levels. For example, at 1 (low), 93 % of BAY cases were size 1, compared to 37 % in SWI and 13 % in TIR. Overall, for wet-snow and gliding snow problems, the median disagreement rate <inline-formula><mml:math id="M60" display="inline"><mml:mrow><mml:msub><mml:mi>P</mml:mi><mml:mi mathvariant="normal">disagree</mml:mi></mml:msub></mml:mrow></mml:math></inline-formula> was 7 %, slightly higher than the 4 % observed for dry-snow avalanche problems.</p>
      <p id="d2e2619">These differences in stability assessments and levels of compliance with the Matrix-suggested danger level carried over into the final danger level assignments, leading to a broad range of factor combinations and considerable variation in how the same danger level was described. Splitting the warning services into three groups (Sect. <xref ref-type="sec" rid="Ch1.S4.SS2"/>, Fig. <xref ref-type="fig" rid="F4"/>b) based on their use of <italic>very poor</italic> stability, and focusing on the two groups that either used this class infrequently (group wet-P, <inline-formula><mml:math id="M61" display="inline"><mml:mrow><mml:msub><mml:mi>P</mml:mi><mml:mrow><mml:mi mathvariant="normal">stab</mml:mi><mml:mo>:</mml:mo><mml:mspace width="0.125em" linebreak="nobreak"/><mml:mi mathvariant="normal">very</mml:mi><mml:mspace width="0.125em" linebreak="nobreak"/><mml:mi mathvariant="normal">poor</mml:mi></mml:mrow></mml:msub></mml:mrow></mml:math></inline-formula> <inline-formula><mml:math id="M62" display="inline"><mml:mo>&lt;</mml:mo></mml:math></inline-formula> 0.25) or often (group wet-VP, <inline-formula><mml:math id="M63" display="inline"><mml:mrow><mml:msub><mml:mi>P</mml:mi><mml:mrow><mml:mi mathvariant="normal">stab</mml:mi><mml:mo>:</mml:mo><mml:mspace width="0.125em" linebreak="nobreak"/><mml:mi mathvariant="normal">very</mml:mi><mml:mspace linebreak="nobreak" width="0.125em"/><mml:mi mathvariant="normal">poor</mml:mi></mml:mrow></mml:msub></mml:mrow></mml:math></inline-formula> <inline-formula><mml:math id="M64" display="inline"><mml:mo>≥</mml:mo></mml:math></inline-formula> 0.56), revealed substantial differences in Matrix cell usage (Fig. <xref ref-type="fig" rid="F7"/>). Group wet-P forecasters most often used <italic>poor</italic> and <italic>fair</italic> stability across danger levels 1 (low) to 3 (considerable). Even for 3 (considerable), 62 % of cases involved <italic>poor</italic> stability. In contrast, group wet-VP forecasters predominantly used the <italic>very poor</italic> stability panel, even when describing 1 (low) (66 % of cases). Furthermore, group wet-P forecasters did not use danger level 4 (high) at all when assessing wet- or gliding snow problems. As a result, similarities between these groups related primarily to avalanche size, which increased with rising danger level in both groups, from size 1 at 1 (low), to size 2 at 2 (moderate), and size 3 at 3 (considerable) in group wet-P and size 2 to size 3 in group wet-VP.</p>

      <fig id="F7" specific-use="star"><label>Figure 7</label><caption><p id="d2e2701">Wet-snow and gliding snow avalanche problems: percentage of cases that a specific danger level (rows: <inline-formula><mml:math id="M65" display="inline"><mml:mi>D</mml:mi></mml:math></inline-formula> <inline-formula><mml:math id="M66" display="inline"><mml:mo>=</mml:mo></mml:math></inline-formula> 1 (low) to <inline-formula><mml:math id="M67" display="inline"><mml:mi>D</mml:mi></mml:math></inline-formula> <inline-formula><mml:math id="M68" display="inline"><mml:mo>=</mml:mo></mml:math></inline-formula> 4 (high)) was used for a specific matrix combination. Colour intensity corresponds to the percentage of cases. Cells with less than 1 % usage are not shown. Values highlighted bold correspond to use of Matrix-suggested danger levels, values in italics represent the second level. Group wet-P forecasters used very poor stability <inline-formula><mml:math id="M69" display="inline"><mml:mo>&lt;</mml:mo></mml:math></inline-formula> 25 % of the time, group wet-VP forecasters <inline-formula><mml:math id="M70" display="inline"><mml:mo>≥</mml:mo></mml:math></inline-formula> 56 % of the time (see Fig. <xref ref-type="fig" rid="F4"/>b). The respective figure for warning services lying in between these two is shown in Fig. <xref ref-type="fig" rid="FC2"/> in Appendix C. Absolute use of Matrix cells, regardless of danger level, is shown in Fig. <xref ref-type="fig" rid="FD1"/> in Appendix D.</p></caption>
            <graphic xlink:href="https://nhess.copernicus.org/articles/26/1161/2026/nhess-26-1161-2026-f07.png"/>

          </fig>

</sec>
</sec>
</sec>
<sec id="Ch1.S6">
  <label>6</label><title>Discussion</title>
<sec id="Ch1.S6.SS1">
  <label>6.1</label><title>Interpretation of findings</title>
      <p id="d2e2776">We examined how the Matrix was used in daily operations with the goal of (i) identifying differences in how avalanche danger is characterized using Matrix terminology across warning services; (ii) analyzing how the Matrix was applied across the range of issued danger levels; and (iii) exploring differences in Matrix use between dry-snow and wet- or gliding-snow avalanche problems. Our findings show that Matrix usage was broadly consistent with its design, while several observed patterns and variations provide empirical insights into both the Matrix's strengths and limitations and offer guidance for refinement and further harmonization.</p>
      <p id="d2e2779">Importantly, these findings must be interpreted in light of a fundamental limitation: the EAWS Matrix ultimately rests on expert judgment, both in its original development <xref ref-type="bibr" rid="bib1.bibx23" id="paren.35"/> and in its operational application. Since avalanche danger and its defining factors cannot be measured independently, there is no external reference against which Matrix-based danger level assignments can be formally validated. This introduces an inherent circularity into the evaluation, as forecasters assess factors and danger levels through the same conceptual framework that underlies the Matrix itself. As a result, deviations from the Matrix may reflect inconsistent application, justified adaptations to specific avalanche conditions, or differences in interpretation that cannot be disentangled without additional contextual information. In this sense, our analysis describes how the Matrix is applied in practice, rather than evaluating the correctness of issued danger levels.</p>
<sec id="Ch1.S6.SS1.SSS1">
  <label>6.1.1</label><title>Dominant Matrix cells and transition zones</title>
      <p id="d2e2792">Our analyses showed that most combinations of snowpack stability, frequency, and avalanche size were consistently linked to a single danger level. This was particularly pronounced for danger levels 1 (low), 2 (moderate), and 4 (high), indicating that the Matrix provides a robust and interpretable structure for operational use. Notably, even though about half the Matrix cells show a secondary danger level (<inline-formula><mml:math id="M71" display="inline"><mml:mrow><mml:msup><mml:mi>D</mml:mi><mml:mn mathvariant="normal">2</mml:mn></mml:msup></mml:mrow></mml:math></inline-formula>, see Fig. <xref ref-type="fig" rid="F1"/>), most factor combinations were almost always used for a single consistent danger level in practice, even by services with comparatively low Matrix compliance. This suggests a broad consensus on the danger level for these factor combinations and indicates that such cells could potentially be simplified.</p>
      <p id="d2e2808">In contrast, a small number of cells, most notably <italic>poor-some-size 2</italic> and <italic>very poor-some-size 3</italic>, functioned as transition zones between adjacent danger levels. These cells showed frequent overlap in danger level assignment and played different roles across forecaster groups. For danger levels 3 (considerable) and 4 (high), respectively, they were among the most frequently used cells by services with low Matrix compliance (Fig. <xref ref-type="fig" rid="F5"/>b), even though the Matrix indicates these danger levels only as second options. The reasons for this discrepancy remain speculative. This may be due to differences in how factor classes are interpreted, particularly the frequency categories <xref ref-type="bibr" rid="bib1.bibx33" id="paren.36"/>. Alternatively, it may reflect a tendency among more Matrix-compliant services to avoid cells that do not yield the desired Matrix-suggested danger level. Notably, even services with high Matrix-compliance occasionally selected these cells issuing the second danger level suggested in the Matrix, underscoring their role as boundary regions between adjacent danger levels.</p>
      <p id="d2e2822">Beyond the dominant and transitional cells, some Matrix cells remained rarely used across all services (Figs. <xref ref-type="fig" rid="FD1"/> and <xref ref-type="fig" rid="FD2"/>). This lack of empirical support reduces confidence in the danger levels suggested for these cells and highlights that these assignments still rely solely on expert elicitation during Matrix development <xref ref-type="bibr" rid="bib1.bibx23" id="paren.37"/>.</p>
</sec>
<sec id="Ch1.S6.SS1.SSS2">
  <label>6.1.2</label><title>Granularity, relative assessments, and convergent validity</title>
      <p id="d2e2840">Several warning services applied a finer-granularity when assessing the Matrix input factors, reflecting a fundamental tension between simplicity – few, well-defined classes – and nuance – expressed through finer-grained, ordinal rankings within otherwise discrete classes. On the one hand, the Matrix relies on a limited number of discrete classes, a design choice that aligns with evidence that humans can reliably assess only a small number of categories (e.g., <xref ref-type="bibr" rid="bib1.bibx20 bib1.bibx16" id="altparen.38"/>). On the other hand, requiring forecasters to commit early to a single class is analogous to rounding intermediate results, which can discard relevant information and introduce discontinuities in subsequent decision steps. Such rounding effects are particularly relevant when assessments lie near the boundary between two neighbouring classes; expressing these tendencies through sub-classes would preserve this information for the final danger level decision.</p>
      <p id="d2e2846">At the same time, humans are comparatively good at making relative judgments, such as ranking conditions within an absolute class (e.g., <xref ref-type="bibr" rid="bib1.bibx16" id="altparen.39"/>). Combining absolute classifications with relative assessments by first selecting a discrete class and then indicating tendencies within that class therefore represents a promising way to preserve nuance while remaining within the overall structure of the Matrix. This is illustrated by the analysis of finer-granularity factor assessments for the frequently used factor combination <italic>poor-some-size 2</italic> (Sect. <xref ref-type="sec" rid="Ch1.S5.SS2.SSS1"/>). While this cell was predominantly associated with danger level 3 (considerable) in SWI and with level 2 (moderate) in BOZ, TIR, and TRE (Fig. <xref ref-type="fig" rid="F6"/>), tendencies within the cell toward either danger level were broadly comparable across services. This convergence occurred despite differences in how relative positioning within the cell was implemented (Fig. <xref ref-type="fig" rid="F2"/>) and regardless of whether the Matrix was used for danger level determination.</p>
      <p id="d2e2861">Despite these procedural differences, the observed agreement provides evidence of convergent validity <xref ref-type="bibr" rid="bib1.bibx2" id="paren.40"/>: independent approaches aimed at assessing the same underlying concept yielded broadly similar outcomes. Even in the absence of external reference data, this supports the feasibility of a harmonized interpretation across warning services. While finer granularity helps preserve variation within coarse classes, the appropriate level of resolution remains a subject for discussion. Evidence from parallel forecasting in SWI indicates that reasonably reliable estimates are achievable even with higher-resolved absolute–relative scales <xref ref-type="bibr" rid="bib1.bibx33" id="paren.41"/>. Together, this suggests that combining absolute and relative assessments can capture meaningful tendencies while remaining cognitively manageable and compatible with the Matrix framework.</p>
</sec>
<sec id="Ch1.S6.SS1.SSS3">
  <label>6.1.3</label><title>Disentangling Matrix compliance, consistency, and accuracy</title>
      <p id="d2e2879">We distinguish between Matrix compliance (agreement with the danger level suggested by the Matrix), consistency (the degree to which the same factor combination leads to the same danger level), and accuracy, the latter of which cannot be evaluated in the absence of external reference data. When viewed through the lens of Matrix compliance, differences in how warning services applied the Matrix provide insight into both the consistency of danger level assignment and its potential limitations.</p>
      <p id="d2e2882">The stratified analyses (Sect. <xref ref-type="sec" rid="Ch1.S5.SS2.SSS1"/>) confirmed that Matrix compliance influenced Matrix usage. Services, which disagreed more frequently with the Matrix-suggested danger level (<inline-formula><mml:math id="M72" display="inline"><mml:mrow><mml:msup><mml:mi>D</mml:mi><mml:mn mathvariant="normal">1</mml:mn></mml:msup></mml:mrow></mml:math></inline-formula>), exhibited broader distributions and more frequent use of alternative danger levels (<inline-formula><mml:math id="M73" display="inline"><mml:mrow><mml:msup><mml:mi>D</mml:mi><mml:mn mathvariant="normal">2</mml:mn></mml:msup></mml:mrow></mml:math></inline-formula>) than services with more standardized use. Notably, however, even the services generally in strong alignment with the Matrix for dry-snow avalanche problems, showed comparably large deviations when assessing wet- and gliding-snow avalanche problems.</p>
      <p id="d2e2909">A primary objective of integrating the Matrix into the forecasting process is to improve consistency. Our findings indicate that services with high Matrix compliance indeed used a narrower set of Matrix cells for each danger level, suggesting greater consistency in how factor combinations were linked to danger levels. However, consistency in output does not necessarily imply consistency in the assessment of the input factors. It is possible though speculative that forecasters occasionally adjusted input factor estimates to achieve a desired Matrix outcome rather than allowing factor assessments alone to determine the danger level.</p>
      <p id="d2e2912">While differences in danger levels assigned to factor input combinations clearly highlight inconsistency, they do not necessarily imply that one outcome is “wrong”. Rather, they may reflect shifts in how specific danger levels are interpreted relative to earlier practice of assessing the input factors, whether intentionally or not. Prior to the introduction of the revised Matrix in 2022, warning services did not explicitly assess the Matrix input factors or combine them systematically to derive danger levels. Since then, many services have learned to apply factor classifications within the Matrix framework, potentially anchoring factor assessments to a desired danger level. Possibly, services with low Matrix compliance applied a more independent approach when estimating factor classes and relating them to danger levels. From an operational perspective, however, a broad overlap of factor combinations across multiple danger levels, as observed for these services, remains problematic because it reduces consistency in communication and makes it more difficult for users to associate a given danger level with a set of avalanche conditions.</p>
      <p id="d2e2916">In addition, each factor estimate carries uncertainty, depending on data availability and the forecasters' ability to retrieve and correctly interpret information <xref ref-type="bibr" rid="bib1.bibx31" id="paren.42"/>. Even small inconsistencies in factor assessment can propagate through the Matrix and lead to different danger levels. For example, if two forecasters assessing identical conditions differ by only one neighbouring class for a single factor in half of all cases, while fully agreeing otherwise, the danger level suggested in the Matrix would differ in approximately 21 % of cases on average <xref ref-type="bibr" rid="bib1.bibx33" id="paren.43"/>. Without access to external validation data or forecaster rationale, it remains impossible to determine whether the Matrix promotes genuinely consistent interpretation or merely uniform outputs.</p>
      <p id="d2e2925">Even though the assessment of the input factors is subject to considerable uncertainty and subjectivity, and thus to variation in outcome, it is important to recognize that danger levels are increasingly defined by the Matrix logic and by the frequency with which the Matrix-suggested danger level is selected for specific factor combinations. Consequently, consistency in input factor estimation, within and across warning services, becomes even more important. Achieving this may require clearer factor definitions (e.g., for frequency categories) as well as improved availability of relevant data, such as snow-cover simulations, to support robust and harmonized factor assessment.</p>
</sec>
<sec id="Ch1.S6.SS1.SSS4">
  <label>6.1.4</label><title>Avalanche problem dependence and implications for harmonization</title>
      <p id="d2e2936">Inconsistencies became particularly evident when considering different avalanche problems. While Matrix use was relatively consistent for dry-snow avalanche problems, forecasts for wet-snow and gliding snow problems showed considerably more variation between services. Similar discrepancies have been documented in Canada <xref ref-type="bibr" rid="bib1.bibx5 bib1.bibx4" id="paren.44"/>, where identical combinations of likelihood of avalanches (the North American counterpart to stability and frequency in the Matrix) and avalanche size resulted in different danger levels depending on avalanche problem type and, at times, the forecasting agency. In our analysis, the most notable differences related to the classification of snowpack stability for wet-snow and gliding snow avalanche problems. Although wet-snow and gliding snow avalanches are generally associated with natural avalanche occurrence <xref ref-type="bibr" rid="bib1.bibx28 bib1.bibx15" id="paren.45"/>, and thus conceptually with <italic>very poor</italic> stability (Table <xref ref-type="table" rid="TA1"/>), services showed considerable divergence in their stability ratings (Figs. <xref ref-type="fig" rid="F4"/>b, <xref ref-type="fig" rid="F7"/>). Again, we can only speculate whether this reflects different conceptual models or deliberate adjustments of factor inputs to achieve expected Matrix outcomes. To promote harmonized application of the Matrix, a clearer, shared framework for assessing stability in wet- and gliding snow contexts appears necessary.</p>
</sec>
</sec>
<sec id="Ch1.S6.SS2">
  <label>6.2</label><title>Recommendations for improving the Matrix and workflow</title>
      <p id="d2e2964">Based on the findings outlined above, we propose the following recommendations: <list list-type="bullet"><list-item>
      <p id="d2e2969"><italic>Simplify cell content.</italic> Forecasters used several Matrix cells predominantly for a single danger level (Fig. <xref ref-type="fig" rid="F3"/>). These include the factor combinations <italic>very poor-a few-size 3</italic> and <italic>very poor-some-size 2</italic>, which forecasters assigned strongly with danger level 3 (considerable), as well as <italic>poor-a few-size 2</italic> and <italic>poor-some-size 1</italic>, which were used mostly for danger level 2 (moderate). These cells could therefore be simplified to emphasize a single danger level without a substantial loss of information.</p></list-item><list-item>
      <p id="d2e2989"><italic>Use white shading for under-supported cells.</italic> Beyond the cells already marked white in the Matrix (Fig. <xref ref-type="fig" rid="F1"/>), several additional cells were rarely used in operational practice and could also be shaded white to indicate increased uncertainty. For these cells, danger level assignments currently rely exclusively on the expert elicitation underlying the Matrix. Specifically, this applies to combinations involving <italic>fair</italic> stability with either avalanche size 4 or 5, or frequency <italic>many</italic>, as well as <italic>poor</italic> stability in combination with avalanche size 5.</p></list-item><list-item>
      <p id="d2e3006"><italic>Investigate transition zones and promote transparency in ambiguous cases.</italic> The cells <italic>poor-some-size 2</italic> and <italic>very poor-some-size 3</italic> were frequently assigned to two different danger levels, highlighting persistent ambiguity in these transition zones. These areas warrant further investigation to better understand the sources of variation and to refine the corresponding guidance. While the Matrix should be applied in line with agreed standards, these cells remain open to interpretation. Because objective validation is not feasible, we recommend that deviations from the primary danger level (<inline-formula><mml:math id="M74" display="inline"><mml:mrow><mml:msup><mml:mi>D</mml:mi><mml:mn mathvariant="normal">1</mml:mn></mml:msup></mml:mrow></mml:math></inline-formula>) be documented transparently. Such documentation helps distinguish well-founded expert judgment from inconsistent application and facilitates learning across warning services. This transparency may include practices already in use, such as indicating tendencies within factor classes, as applied and analysed for four warning services (Figs. <xref ref-type="fig" rid="F2"/> and <xref ref-type="fig" rid="F6"/>). In addition, this information could be used to further refine the Matrix itself; for example, cells such as <italic>poor-some-size 2</italic> could be subdivided to provide more detailed guidance to forecasters (Fig. <xref ref-type="fig" rid="F8"/>).</p></list-item><list-item>
      <p id="d2e3039"><italic>Clarify assessment procedures for wet-snow and gliding snow avalanche problems.</italic> A harmonized conceptual framework is needed to guide the assessment of wet-snow and gliding snow avalanche problems. This may require revised definitions, targeted forecaster training, and/or updated guidance within the forecasting workflow.</p></list-item></list></p>

      <fig id="F8"><label>Figure 8</label><caption><p id="d2e3046">Close-up of the Matrix panel for <italic>poor</italic> stability. The cell <italic>poor-some-size 2</italic> shows an internal differentiation based on finer-granularity factor assessments, as identified in Fig. <xref ref-type="fig" rid="F6"/>.</p></caption>
          <graphic xlink:href="https://nhess.copernicus.org/articles/26/1161/2026/nhess-26-1161-2026-f08.png"/>

        </fig>

</sec>
<sec id="Ch1.S6.SS3">
  <label>6.3</label><title>Limitations</title>
      <p id="d2e3071">A key limitation is that we lack precise information about how the Matrix was implemented operationally within each warning service. While some services likely provided formal internal guidance, for instance by implementing the Matrix directly in the forecasting software, or training, others allowed for greater forecaster discretion in referring to the Matrix or did not use the Matrix, but rather reached a final decision following group discussion (SWI). We also do not know the rationale behind individual forecaster decisions or the situational context in which factor assessments were made. These missing layers of information limit our ability to distinguish between genuine differences in interpretation and procedural differences across services and would be valuable for understanding when and why forecasters adhered to or deviated from the Matrix, particularly in ambiguous or transitional cells. Moreover, variations in Matrix use and data collection procedures described in Sect. <xref ref-type="sec" rid="Ch1.S3.SS2"/> required harmonizing the data across services, which may have introduced an unknown degree of information loss, including potentially relevant detail. Finally, while grouping warning services facilitated data analysis and presentation, some groups, such as the group of warning services with low Matrix compliance, were likely heterogeneous with respect to the underlying reasons for deviating from the Matrix.</p>
</sec>
</sec>
<sec id="Ch1.S7" sec-type="conclusions">
  <label>7</label><title>Conclusions</title>
      <p id="d2e3086">This study provides the first comprehensive assessment of how the revised EAWS-Matrix has been used operationally across European avalanche warning services. The findings confirm that the Matrix structure is broadly effective, with most cells supporting consistent danger level assignment – even among services with a tendency to diverge from the Matrix-suggested danger level. However, two factor combinations (<italic>poor-some-size 2</italic> and <italic>very poor-some-size 3</italic>) emerged as areas of ambiguity that warrant closer examination. Integrating finer-granularity factor assessments can reveal tendencies within Matrix fields and may offer a path toward more specific guidance in cells that currently contain two danger levels. Rarely used cells remain a source of uncertainty and should be visually marked. These findings were integrated into the final revisions of the Matrix <xref ref-type="bibr" rid="bib1.bibx23" id="paren.46"/>.</p>
      <p id="d2e3098">While Matrix use was relatively consistent for dry-snow avalanche problems, substantial inconsistencies were observed for wet-snow and gliding-snow avalanche problems, especially in the classification of snowpack stability. This highlights a need for broader harmonization of factor assessment and Matrix application practices across services, to support more reliable and consistent avalanche danger level forecasts. Overall, the results provide guidance for refining the Matrix and underscore the importance of transparent documentation and shared interpretation frameworks in domains where expert judgment plays a central role.</p>
      <p id="d2e3101">Ultimately, the assessment of avalanche danger levels should be guided as directly and transparently as possible by available data and not by individual forecasting styles, conceptual preferences, or service-specific traditions. Forecasts should reflect a shared understanding of avalanche danger and its determining factors, rather than the personality or institutional context of the forecaster. While some degree of expert interpretation is unavoidable, danger level assignment should result from a forward-looking evaluation of the evidence, not reverse reasoning from a desired outcome. Given the inherent uncertainty and subjectivity of the data, providing objective and targeted recommendations for improving the Matrix and its use is undoubtedly challenging but remains essential for producing credible and consistent public avalanche forecasts across Europe. Given these limitations, our conclusions and recommendations should not be interpreted as validation of the Matrix or its outputs, but as operational insights to support its ongoing development.</p>
</sec>

      
      </body>
    <back><app-group>

<app id="App1.Ch1.S1">
  <label>Appendix A</label><title>Definition of factors</title>

<table-wrap id="TA1"><label>Table A1</label><caption><p id="d2e3120">Snowpack stability classes and the type of triggering typically associated with these classes according to <xref ref-type="bibr" rid="bib1.bibx23" id="text.47"><named-content content-type="post">Table 2</named-content></xref>. Stability refers to the point scale. Values in parentheses indicate that the trigger or evidence is not typical but may occur.</p></caption><oasis:table frame="topbot"><oasis:tgroup cols="7">
     <oasis:colspec colnum="1" colname="col1" align="left"/>
     <oasis:colspec colnum="2" colname="col2" align="left"/>
     <oasis:colspec colnum="3" colname="col3" align="left"/>
     <oasis:colspec colnum="4" colname="col4" align="left"/>
     <oasis:colspec colnum="5" colname="col5" align="left"/>
     <oasis:colspec colnum="6" colname="col6" align="left"/>
     <oasis:colspec colnum="7" colname="col7" align="left"/>
     <oasis:thead>
       <oasis:row>
         <oasis:entry colname="col1">Snowpack</oasis:entry>
         <oasis:entry colname="col2">Description</oasis:entry>
         <oasis:entry colname="col3">Sensitivity</oasis:entry>
         <oasis:entry colname="col4">Natural</oasis:entry>
         <oasis:entry colname="col5">Human</oasis:entry>
         <oasis:entry colname="col6">Explosive/</oasis:entry>
         <oasis:entry colname="col7">Other indicators of</oasis:entry>
       </oasis:row>
       <oasis:row rowsep="1">
         <oasis:entry colname="col1">stability</oasis:entry>
         <oasis:entry colname="col2"/>
         <oasis:entry colname="col3">(CMAH<sup>*</sup>)</oasis:entry>
         <oasis:entry colname="col4">avalanches</oasis:entry>
         <oasis:entry colname="col5">triggers</oasis:entry>
         <oasis:entry colname="col6">Cornice fall</oasis:entry>
         <oasis:entry colname="col7">instability</oasis:entry>
       </oasis:row>
     </oasis:thead>
     <oasis:tbody>
       <oasis:row>
         <oasis:entry colname="col1">Very poor</oasis:entry>
         <oasis:entry colname="col2">Very easy to trigger</oasis:entry>
         <oasis:entry colname="col3">Touchy</oasis:entry>
         <oasis:entry colname="col4">yes</oasis:entry>
         <oasis:entry colname="col5">yes</oasis:entry>
         <oasis:entry colname="col6">yes</oasis:entry>
         <oasis:entry colname="col7">Shooting cracks, whumpf sounds</oasis:entry>
       </oasis:row>
       <oasis:row>
         <oasis:entry colname="col1">Poor</oasis:entry>
         <oasis:entry colname="col2">Easy to trigger</oasis:entry>
         <oasis:entry colname="col3">Reactive</oasis:entry>
         <oasis:entry colname="col4">no</oasis:entry>
         <oasis:entry colname="col5">yes</oasis:entry>
         <oasis:entry colname="col6">yes</oasis:entry>
         <oasis:entry colname="col7">(Shooting cracks, whumpf sounds)</oasis:entry>
       </oasis:row>
       <oasis:row>
         <oasis:entry colname="col1">Fair</oasis:entry>
         <oasis:entry colname="col2">Difficult to trigger</oasis:entry>
         <oasis:entry colname="col3">Stubborn</oasis:entry>
         <oasis:entry colname="col4">no</oasis:entry>
         <oasis:entry colname="col5">(yes)</oasis:entry>
         <oasis:entry colname="col6">yes</oasis:entry>
         <oasis:entry colname="col7"/>
       </oasis:row>
       <oasis:row>
         <oasis:entry colname="col1">Good</oasis:entry>
         <oasis:entry colname="col2">Stable conditions</oasis:entry>
         <oasis:entry colname="col3">Unreactive</oasis:entry>
         <oasis:entry colname="col4">no</oasis:entry>
         <oasis:entry colname="col5">no</oasis:entry>
         <oasis:entry colname="col6">no</oasis:entry>
         <oasis:entry colname="col7"/>
       </oasis:row>
     </oasis:tbody>
   </oasis:tgroup></oasis:table><table-wrap-foot><p id="d2e3128"><sup>*</sup> Terminology according to Conceptual Model of Avalanche Hazard <xref ref-type="bibr" rid="bib1.bibx30" id="paren.48"/>.</p></table-wrap-foot></table-wrap>

<table-wrap id="TA2"><label>Table A2</label><caption><p id="d2e3328">Frequency classes of snowpack stability, taken from <xref ref-type="bibr" rid="bib1.bibx23" id="text.49"><named-content content-type="post">Table 3</named-content></xref>.</p></caption><oasis:table frame="topbot"><oasis:tgroup cols="3">
     <oasis:colspec colnum="1" colname="col1" align="left"/>
     <oasis:colspec colnum="2" colname="col2" align="left"/>
     <oasis:colspec colnum="3" colname="col3" align="left"/>
     <oasis:thead>
       <oasis:row rowsep="1">
         <oasis:entry colname="col1">Frequency class</oasis:entry>
         <oasis:entry colname="col2">Description</oasis:entry>
         <oasis:entry colname="col3">Evidence (e.g., observations)</oasis:entry>
       </oasis:row>
     </oasis:thead>
     <oasis:tbody>
       <oasis:row rowsep="1">
         <oasis:entry colname="col1">Many</oasis:entry>
         <oasis:entry colname="col2">Points with this stability class are abundant.</oasis:entry>
         <oasis:entry colname="col3">Evidence for instability  is often easy to find.</oasis:entry>
       </oasis:row>
       <oasis:row>
         <oasis:entry colname="col1">Some</oasis:entry>
         <oasis:entry colname="col2">Points with this stability class are neither many nor a few,</oasis:entry>
         <oasis:entry colname="col3"/>
       </oasis:row>
       <oasis:row>
         <oasis:entry colname="col1"/>
         <oasis:entry colname="col2">but these points typically exist in terrain features with</oasis:entry>
         <oasis:entry colname="col3"/>
       </oasis:row>
       <oasis:row rowsep="1">
         <oasis:entry colname="col1"/>
         <oasis:entry colname="col2">common characteristics (i.e., close to ridgelines, in gullies).</oasis:entry>
         <oasis:entry colname="col3"/>
       </oasis:row>
       <oasis:row>
         <oasis:entry colname="col1">A few</oasis:entry>
         <oasis:entry colname="col2">Points with this stability class are rare. While rare, their number</oasis:entry>
         <oasis:entry colname="col3">Evidence for instability is hard to find.</oasis:entry>
       </oasis:row>
       <oasis:row rowsep="1">
         <oasis:entry colname="col1"/>
         <oasis:entry colname="col2">is considered relevant for stability assessment.</oasis:entry>
         <oasis:entry colname="col3"/>
       </oasis:row>
       <oasis:row>
         <oasis:entry colname="col1">None or nearly none</oasis:entry>
         <oasis:entry colname="col2">Points with this stability class do not exist, or they are so rare</oasis:entry>
         <oasis:entry colname="col3"/>
       </oasis:row>
       <oasis:row>
         <oasis:entry colname="col1"/>
         <oasis:entry colname="col2">that they are not considered relevant for stability assessment.</oasis:entry>
         <oasis:entry colname="col3"/>
       </oasis:row>
     </oasis:tbody>
   </oasis:tgroup></oasis:table></table-wrap>

<table-wrap id="TA3"><label>Table A3</label><caption><p id="d2e3456">Avalanche size classes, taken from <xref ref-type="bibr" rid="bib1.bibx23" id="text.50"><named-content content-type="post">Table 4</named-content></xref>.</p></caption><oasis:table frame="topbot"><oasis:tgroup cols="3">
     <oasis:colspec colnum="1" colname="col1" align="left"/>
     <oasis:colspec colnum="2" colname="col2" align="left"/>
     <oasis:colspec colnum="3" colname="col3" align="left"/>
     <oasis:thead>
       <oasis:row rowsep="1">
         <oasis:entry colname="col1">Size class</oasis:entry>
         <oasis:entry colname="col2">Label</oasis:entry>
         <oasis:entry colname="col3">Destructive potential</oasis:entry>
       </oasis:row>
     </oasis:thead>
     <oasis:tbody>
       <oasis:row>
         <oasis:entry colname="col1">1</oasis:entry>
         <oasis:entry colname="col2">Small</oasis:entry>
         <oasis:entry colname="col3">Unlikely to bury a person, except in run out zones with unfavorable terrain features (e.g., terrain traps).</oasis:entry>
       </oasis:row>
       <oasis:row>
         <oasis:entry colname="col1">2</oasis:entry>
         <oasis:entry colname="col2">Medium</oasis:entry>
         <oasis:entry colname="col3">May bury, injure, or kill a person.</oasis:entry>
       </oasis:row>
       <oasis:row>
         <oasis:entry colname="col1">3</oasis:entry>
         <oasis:entry colname="col2">Large</oasis:entry>
         <oasis:entry colname="col3">May bury and destroy cars, damage trucks, destroy small buildings and break a few trees.</oasis:entry>
       </oasis:row>
       <oasis:row>
         <oasis:entry colname="col1">4</oasis:entry>
         <oasis:entry colname="col2">Very large</oasis:entry>
         <oasis:entry colname="col3">May bury and destroy trucks and trains. May destroy fairly large buildings and small areas of forest.</oasis:entry>
       </oasis:row>
       <oasis:row>
         <oasis:entry colname="col1">5</oasis:entry>
         <oasis:entry colname="col2">Extremely large</oasis:entry>
         <oasis:entry colname="col3">May devastate the landscape and has catastrophic destructive potential.</oasis:entry>
       </oasis:row>
     </oasis:tbody>
   </oasis:tgroup></oasis:table></table-wrap>


</app>

<app id="App1.Ch1.S2">
  <label>Appendix B</label><title>Details on the CART model for sub-class analysis</title>
      <p id="d2e3563">Classification and Regression Trees (CART, <xref ref-type="bibr" rid="bib1.bibx1" id="altparen.51"/>) are non-parametric models that recursively partition the predictor space to construct interpretable decision trees for classification or regression. CART models are highly interpretable, as each split corresponds to a simple decision rule. The resulting tree structure provides clear insight into the relationships between predictors and the response. CART is robust to outliers, can handle both numerical and categorical predictors, and does not require assumptions about the distribution of the data. For this study, we applied the <monospace>rpartScore</monospace> package <xref ref-type="bibr" rid="bib1.bibx11" id="paren.52"/>, which extends CART to ordinal outcomes by incorporating misclassification costs that reflect the ordered structure of the response variable. Unlike standard CART procedures that treat all misclassifications equally, this approach recognizes that errors between adjacent ordinal categories are less severe than errors between distant categories. The methodology assumes that numerical scores are assigned to the ordered categories of the response variable, reflecting the inherent ordinal structure of the data.</p>
      <p id="d2e3575">The dataset consists of <inline-formula><mml:math id="M77" display="inline"><mml:mi>p</mml:mi></mml:math></inline-formula> predictors <inline-formula><mml:math id="M78" display="inline"><mml:mi>X</mml:mi></mml:math></inline-formula> <inline-formula><mml:math id="M79" display="inline"><mml:mo>=</mml:mo></mml:math></inline-formula> <inline-formula><mml:math id="M80" display="inline"><mml:mrow><mml:mfenced open="{" close="}"><mml:mrow><mml:msub><mml:mi>X</mml:mi><mml:mn mathvariant="normal">1</mml:mn></mml:msub><mml:mo>,</mml:mo><mml:mi mathvariant="normal">…</mml:mi><mml:mo>,</mml:mo><mml:msub><mml:mi>X</mml:mi><mml:mi>p</mml:mi></mml:msub></mml:mrow></mml:mfenced></mml:mrow></mml:math></inline-formula> (here: stability, frequency, size) and an ordinal target variable <inline-formula><mml:math id="M81" display="inline"><mml:mi>Y</mml:mi></mml:math></inline-formula> (here: <inline-formula><mml:math id="M82" display="inline"><mml:mi>D</mml:mi></mml:math></inline-formula>). For each observation <inline-formula><mml:math id="M83" display="inline"><mml:mi>i</mml:mi></mml:math></inline-formula> <inline-formula><mml:math id="M84" display="inline"><mml:mo>=</mml:mo></mml:math></inline-formula> <inline-formula><mml:math id="M85" display="inline"><mml:mrow><mml:mn mathvariant="normal">1</mml:mn><mml:mo>,</mml:mo><mml:mi mathvariant="normal">…</mml:mi><mml:mo>,</mml:mo><mml:mi>N</mml:mi></mml:mrow></mml:math></inline-formula>, there is a data pair <inline-formula><mml:math id="M86" display="inline"><mml:mrow><mml:mo>(</mml:mo><mml:msub><mml:mi>y</mml:mi><mml:mi>i</mml:mi></mml:msub><mml:mo>,</mml:mo><mml:msub><mml:mi>x</mml:mi><mml:mi>i</mml:mi></mml:msub><mml:mo>)</mml:mo></mml:mrow></mml:math></inline-formula>, where <inline-formula><mml:math id="M87" display="inline"><mml:mrow><mml:msub><mml:mi>y</mml:mi><mml:mi>i</mml:mi></mml:msub></mml:mrow></mml:math></inline-formula> is the target variable and <inline-formula><mml:math id="M88" display="inline"><mml:mrow><mml:msub><mml:mi>x</mml:mi><mml:mi>i</mml:mi></mml:msub></mml:mrow></mml:math></inline-formula> <inline-formula><mml:math id="M89" display="inline"><mml:mo>=</mml:mo></mml:math></inline-formula> <inline-formula><mml:math id="M90" display="inline"><mml:mrow><mml:mfenced open="(" close=")"><mml:mrow><mml:msub><mml:mi>x</mml:mi><mml:mrow><mml:mi>i</mml:mi><mml:mn mathvariant="normal">1</mml:mn></mml:mrow></mml:msub><mml:mo>,</mml:mo><mml:mi mathvariant="normal">…</mml:mi><mml:mo>,</mml:mo><mml:msub><mml:mi>x</mml:mi><mml:mrow><mml:mi>i</mml:mi><mml:mi>p</mml:mi></mml:mrow></mml:msub></mml:mrow></mml:mfenced></mml:mrow></mml:math></inline-formula> represents the predictor values.</p>
      <p id="d2e3750">Trees were grown using the <italic>Generalized Gini impurity function</italic>, which incorporates misclassification costs calculated based on the ordinal distances between categories. The splitting criterion selects the predictor variable and split point that maximizes the reduction in the Generalized Gini impurity at each node. For a given node <inline-formula><mml:math id="M91" display="inline"><mml:mi>m</mml:mi></mml:math></inline-formula> representing region <inline-formula><mml:math id="M92" display="inline"><mml:mrow><mml:msub><mml:mi>R</mml:mi><mml:mi>m</mml:mi></mml:msub></mml:mrow></mml:math></inline-formula> with <inline-formula><mml:math id="M93" display="inline"><mml:mrow><mml:msub><mml:mi>N</mml:mi><mml:mi>m</mml:mi></mml:msub></mml:mrow></mml:math></inline-formula> observations, the proportion of class <inline-formula><mml:math id="M94" display="inline"><mml:mi>k</mml:mi></mml:math></inline-formula> is

          <disp-formula id="App1.Ch1.S2.Ex1"><mml:math id="M95" display="block"><mml:mrow><mml:msub><mml:mover accent="true"><mml:mi>p</mml:mi><mml:mo stretchy="false" mathvariant="normal">^</mml:mo></mml:mover><mml:mrow><mml:mi>m</mml:mi><mml:mi>k</mml:mi></mml:mrow></mml:msub><mml:mo>=</mml:mo><mml:mstyle displaystyle="true"><mml:mfrac style="display"><mml:mn mathvariant="normal">1</mml:mn><mml:mrow><mml:msub><mml:mi>N</mml:mi><mml:mi>m</mml:mi></mml:msub></mml:mrow></mml:mfrac></mml:mstyle><mml:munder><mml:mo movablelimits="false">∑</mml:mo><mml:mrow><mml:msub><mml:mi>x</mml:mi><mml:mi>i</mml:mi></mml:msub><mml:mo>∈</mml:mo><mml:msub><mml:mi>R</mml:mi><mml:mi>m</mml:mi></mml:msub></mml:mrow></mml:munder><mml:mi>I</mml:mi><mml:mo>(</mml:mo><mml:msub><mml:mi>y</mml:mi><mml:mi>i</mml:mi></mml:msub><mml:mo>=</mml:mo><mml:mi>k</mml:mi><mml:mo>)</mml:mo><mml:mo>,</mml:mo><mml:mspace linebreak="nobreak" width="0.25em"/><mml:mspace width="0.25em" linebreak="nobreak"/><mml:mspace linebreak="nobreak" width="0.25em"/><mml:msub><mml:mover accent="true"><mml:mi>p</mml:mi><mml:mo mathvariant="normal" stretchy="false">^</mml:mo></mml:mover><mml:mrow><mml:mi>m</mml:mi><mml:mi>k</mml:mi></mml:mrow></mml:msub><mml:mo>∈</mml:mo><mml:mo>[</mml:mo><mml:mn mathvariant="normal">0</mml:mn><mml:mo>,</mml:mo><mml:mn mathvariant="normal">1</mml:mn><mml:mo>]</mml:mo><mml:mo>,</mml:mo></mml:mrow></mml:math></disp-formula>

        where <inline-formula><mml:math id="M96" display="inline"><mml:mrow><mml:mi>I</mml:mi><mml:mo>(</mml:mo><mml:mo>⋅</mml:mo><mml:mo>)</mml:mo></mml:mrow></mml:math></inline-formula> is the indicator function. The Generalized Gini impurity for node <inline-formula><mml:math id="M97" display="inline"><mml:mi>m</mml:mi></mml:math></inline-formula> is then defined as

          <disp-formula id="App1.Ch1.S2.Ex2"><mml:math id="M98" display="block"><mml:mrow><mml:msub><mml:mi>G</mml:mi><mml:mi>m</mml:mi></mml:msub><mml:mo>=</mml:mo><mml:munderover><mml:mo movablelimits="false">∑</mml:mo><mml:mrow><mml:mi>k</mml:mi><mml:mo>=</mml:mo><mml:mn mathvariant="normal">1</mml:mn></mml:mrow><mml:mi>K</mml:mi></mml:munderover><mml:munderover><mml:mo movablelimits="false">∑</mml:mo><mml:mrow><mml:mi>l</mml:mi><mml:mo>=</mml:mo><mml:mn mathvariant="normal">1</mml:mn></mml:mrow><mml:mi>K</mml:mi></mml:munderover><mml:msub><mml:mi>c</mml:mi><mml:mrow><mml:mi>k</mml:mi><mml:mi>l</mml:mi></mml:mrow></mml:msub><mml:msub><mml:mover accent="true"><mml:mi>p</mml:mi><mml:mo stretchy="false" mathvariant="normal">^</mml:mo></mml:mover><mml:mrow><mml:mi>m</mml:mi><mml:mi>k</mml:mi></mml:mrow></mml:msub><mml:msub><mml:mover accent="true"><mml:mi>p</mml:mi><mml:mo stretchy="false" mathvariant="normal">^</mml:mo></mml:mover><mml:mrow><mml:mi>m</mml:mi><mml:mi>l</mml:mi></mml:mrow></mml:msub><mml:mo>,</mml:mo></mml:mrow></mml:math></disp-formula>

        where <inline-formula><mml:math id="M99" display="inline"><mml:mi>K</mml:mi></mml:math></inline-formula> is the number of categories, and <inline-formula><mml:math id="M100" display="inline"><mml:mrow><mml:msub><mml:mi>c</mml:mi><mml:mrow><mml:mi>k</mml:mi><mml:mi>l</mml:mi></mml:mrow></mml:msub></mml:mrow></mml:math></inline-formula> is the misclassification cost between categories <inline-formula><mml:math id="M101" display="inline"><mml:mi>k</mml:mi></mml:math></inline-formula> and <inline-formula><mml:math id="M102" display="inline"><mml:mi>l</mml:mi></mml:math></inline-formula>, typically set as the absolute or squared difference between their scores (e.g., <inline-formula><mml:math id="M103" display="inline"><mml:mrow><mml:msub><mml:mi>c</mml:mi><mml:mrow><mml:mi>k</mml:mi><mml:mi>l</mml:mi></mml:mrow></mml:msub></mml:mrow></mml:math></inline-formula> <inline-formula><mml:math id="M104" display="inline"><mml:mo>=</mml:mo></mml:math></inline-formula> <inline-formula><mml:math id="M105" display="inline"><mml:mrow><mml:mo>|</mml:mo><mml:mi>k</mml:mi><mml:mo>-</mml:mo><mml:mi>l</mml:mi><mml:mo>|</mml:mo></mml:mrow></mml:math></inline-formula> or <inline-formula><mml:math id="M106" display="inline"><mml:mrow><mml:msub><mml:mi>c</mml:mi><mml:mrow><mml:mi>k</mml:mi><mml:mi>l</mml:mi></mml:mrow></mml:msub></mml:mrow></mml:math></inline-formula> <inline-formula><mml:math id="M107" display="inline"><mml:mo>=</mml:mo></mml:math></inline-formula> <inline-formula><mml:math id="M108" display="inline"><mml:mrow><mml:mo>(</mml:mo><mml:mi>k</mml:mi><mml:mo>-</mml:mo><mml:mi>l</mml:mi><mml:msup><mml:mo>)</mml:mo><mml:mn mathvariant="normal">2</mml:mn></mml:msup></mml:mrow></mml:math></inline-formula>). This impurity measure accounts for both the frequency of misclassifications and their severity based on the ordinal distance between categories, ensuring that splits are chosen to minimize not only the number but also the seriousness of classification errors in the context of ordinal data.</p>
      <p id="d2e4088">To avoid overfitting, cost-complexity pruning was applied. Two pruning criteria were evaluated: <list list-type="bullet"><list-item>
      <p id="d2e4093"><italic>Total misclassification rate</italic> (<monospace>prune = "mr"</monospace>): minimizes the proportion of misclassified observations.</p></list-item><list-item>
      <p id="d2e4102"><italic>Total misclassification cost</italic> (<monospace>prune = "mc"</monospace>): minimizes the cumulative cost of misclassifications, weighted by ordinal distance.</p></list-item></list></p>
      <p id="d2e4111">The complexity parameter (<monospace>cp</monospace>) is a central hyperparameter in the <monospace>rpartScore</monospace> framework that controls the trade-off between tree complexity and model fit <xref ref-type="bibr" rid="bib1.bibx1 bib1.bibx11" id="paren.53"/>. At each split, <monospace>cp</monospace> specifies the minimum reduction in the overall cost-complexity measure required for a split to be retained in the tree. As <monospace>cp</monospace> increases, the algorithm produces smaller, simpler trees by pruning branches that do not sufficiently decrease the impurity, thereby reducing the risk of overfitting. Conversely, a lower <monospace>cp</monospace> allows for more complex trees, which may capture more structure but risk overfitting the training data.</p>
      <p id="d2e4133">The analysis employed a comprehensive hyperparameter tuning approach using repeated cross-validation. A 10-fold cross-validation with 3 repetitions was implemented to ensure robust model evaluation. The hyperparameter grid included: <list list-type="bullet"><list-item>
      <p id="d2e4138"><italic>Complexity parameter</italic>: values ranging from 0.01 to 0.3 in increments of 0.05</p></list-item><list-item>
      <p id="d2e4144"><italic>Split functions</italic>: both absolute (<monospace>"abs"</monospace>) and squared (<monospace>"quad"</monospace>) difference approaches</p></list-item><list-item>
      <p id="d2e4156"><italic>Pruning measures</italic>: both misclassification rate (<monospace>"mr"</monospace>) and misclassification cost (<monospace>"mc"</monospace>) criteria</p></list-item></list></p>
      <p id="d2e4167">To address potential class imbalance in the ordinal response variable, the Synthetic Minority Oversampling Technique (SMOTE) was applied during the cross-validation process. This technique generates synthetic examples of minority classes to balance the training data while preserving the ordinal structure of the response variable. However, SMOTE was only used in cases where the predictors were continuous. In the case of SWI, where the predictors were categorical, no SMOTE was applied.</p>
      <p id="d2e4170">Model performance was assessed using Matthews Correlation Coefficient (MCC) as the primary evaluation metric <xref ref-type="bibr" rid="bib1.bibx19" id="paren.54"/>. MCC is well-suited for imbalanced class distributions, as it considers all elements of the confusion matrix and avoids the pitfalls of standard accuracy metrics when one class dominates. In this analysis, MCC is especially appropriate for SWI, where predictors were categorical and SMOTE was not used. Without SMOTE, traditional metrics could overestimate performance, but MCC ensures a fair and robust evaluation even with class imbalance.

          <disp-formula id="App1.Ch1.S2.Ex3"><mml:math id="M109" display="block"><mml:mrow><mml:mtext>MCC</mml:mtext><mml:mo>=</mml:mo><mml:mstyle displaystyle="true"><mml:mfrac style="display"><mml:mrow><mml:mtext>TP</mml:mtext><mml:mo>×</mml:mo><mml:mtext>TN</mml:mtext><mml:mo>-</mml:mo><mml:mtext>FP</mml:mtext><mml:mo>×</mml:mo><mml:mtext>FN</mml:mtext></mml:mrow><mml:msqrt><mml:mrow><mml:mfenced close=")" open="("><mml:mrow><mml:mtext>TP</mml:mtext><mml:mo>+</mml:mo><mml:mtext>FP</mml:mtext></mml:mrow></mml:mfenced><mml:mfenced open="(" close=")"><mml:mrow><mml:mtext>TP</mml:mtext><mml:mo>+</mml:mo><mml:mtext>FN</mml:mtext></mml:mrow></mml:mfenced><mml:mfenced open="(" close=")"><mml:mrow><mml:mtext>TN</mml:mtext><mml:mo>+</mml:mo><mml:mtext>FP</mml:mtext></mml:mrow></mml:mfenced><mml:mfenced close=")" open="("><mml:mrow><mml:mtext>TN</mml:mtext><mml:mo>+</mml:mo><mml:mtext>FN</mml:mtext></mml:mrow></mml:mfenced></mml:mrow></mml:msqrt></mml:mfrac></mml:mstyle><mml:mo>,</mml:mo></mml:mrow></mml:math></disp-formula>

        where TP, TN, FP, and FN represent true positives, true negatives, false positives, and false negatives, respectively.  While the formula above shows the binary case, for multiclass classification the MCC is computed using a generalized confusion matrix formula that accounts for all classes simultaneously. This ensures the reported MCC remains a balanced and robust measure of model performance, even when more than two classes are involved <xref ref-type="bibr" rid="bib1.bibx12 bib1.bibx3" id="paren.55"/>. Additionally, balanced accuracy was computed as a secondary metric:

          <disp-formula id="App1.Ch1.S2.Ex4"><mml:math id="M110" display="block"><mml:mrow><mml:mtext>Balanced Accuracy</mml:mtext><mml:mo>=</mml:mo><mml:mstyle displaystyle="true"><mml:mfrac style="display"><mml:mrow><mml:mtext>Sensitivity</mml:mtext><mml:mo>+</mml:mo><mml:mtext>Specificity</mml:mtext></mml:mrow><mml:mn mathvariant="normal">2</mml:mn></mml:mfrac></mml:mstyle></mml:mrow></mml:math></disp-formula></p>
      <p id="d2e4262">The optimal model configuration was selected based on the highest cross-validated MCC score, ensuring that the final model provides the best balance between predictive accuracy and model complexity while appropriately accounting for the ordinal nature of the response variable.</p>
      <p id="d2e4265">The analysis was conducted in <italic>R</italic> <xref ref-type="bibr" rid="bib1.bibx25" id="paren.56"/> using the <monospace>caret</monospace> package framework <xref ref-type="bibr" rid="bib1.bibx17" id="paren.57"/>.</p>
</app>

<app id="App1.Ch1.S3">
  <label>Appendix C</label><title>Matrix usage – warning service groups <italic>compl-mid</italic> and <italic>wet-VP/P</italic></title>

      <fig id="FC1"><label>Figure C1</label><caption><p id="d2e4297">Dry-snow avalanche problems – warning service group <italic>compl-mid</italic> with intermediate values for compliance (Table <xref ref-type="table" rid="T1"/>): proportion of cases that a specific danger level (rows: <inline-formula><mml:math id="M111" display="inline"><mml:mi>D</mml:mi></mml:math></inline-formula> <inline-formula><mml:math id="M112" display="inline"><mml:mo>=</mml:mo></mml:math></inline-formula> 1 (low) to <inline-formula><mml:math id="M113" display="inline"><mml:mi>D</mml:mi></mml:math></inline-formula> <inline-formula><mml:math id="M114" display="inline"><mml:mo>=</mml:mo></mml:math></inline-formula> 4 (high)) was used for a specific matrix combination. Colour shading corresponds to the proportion of cases. Cells with <inline-formula><mml:math id="M115" display="inline"><mml:mo>&lt;</mml:mo></mml:math></inline-formula> 1 % usage are not shown.</p></caption>
        
        <graphic xlink:href="https://nhess.copernicus.org/articles/26/1161/2026/nhess-26-1161-2026-f09.png"/>

      </fig>

<fig id="FC2"><label>Figure C2</label><caption><p id="d2e4352">Wet-snow and gliding snow avalanche problems – warning service group <italic>wet-VP/P</italic> with a mix of stability classes <italic>poor</italic> and <italic>very poor</italic> (Table <xref ref-type="table" rid="T1"/>). Refer to Fig. <xref ref-type="fig" rid="FC1"/> for details.</p></caption>
        
        <graphic xlink:href="https://nhess.copernicus.org/articles/26/1161/2026/nhess-26-1161-2026-f10.png"/>

      </fig>

</app>

<app id="App1.Ch1.S4">
  <label>Appendix D</label><title>Matrix usage regardless of danger level</title>

      <fig id="FD1"><label>Figure D1</label><caption><p id="d2e4386">Dry-snow avalanche problems. Shown are the median percentages of cell usage of the 26 warning services. The values in brackets represent the min-max range.</p></caption>
        
        <graphic xlink:href="https://nhess.copernicus.org/articles/26/1161/2026/nhess-26-1161-2026-f11.png"/>

      </fig>

      <fig id="FD2"><label>Figure D2</label><caption><p id="d2e4399">Wet-snow and gliding snow avalanche problems. Shown are the median percentages of cell usage of the 25 warning services (no data for SCO). The values in brackets represent the min-max range.</p></caption>
        
        <graphic xlink:href="https://nhess.copernicus.org/articles/26/1161/2026/nhess-26-1161-2026-f12.png"/>

      </fig>


</app>
  </app-group><notes notes-type="codedataavailability"><title>Code and data availability</title>

      <p id="d2e4416">Data and code are accessible at <ext-link xlink:href="https://doi.org/10.5281/zenodo.18030373" ext-link-type="DOI">10.5281/zenodo.18030373</ext-link> <xref ref-type="bibr" rid="bib1.bibx34" id="paren.58"/>.</p>
  </notes><notes notes-type="authorcontribution"><title>Author contributions</title>

      <p id="d2e4428">FT (study design, data curation, formal analysis, writing, reviewing), KM (project lead, reviewing), CMa (study design, formal analysis, writing, reviewing), CMi (study design, writing, reviewing).</p>
  </notes><notes notes-type="competinginterests"><title>Competing interests</title>

      <p id="d2e4434">The contact author has declared that none of the authors has any competing interests.</p>
  </notes><notes notes-type="disclaimer"><title>Disclaimer</title>

      <p id="d2e4440">Publisher's note: Copernicus Publications remains neutral with regard to jurisdictional claims made in the text, published maps, institutional affiliations, or any other geographical representation in this paper. The authors bear the ultimate responsibility for providing appropriate place names. Views expressed in the text are those of the authors and do not necessarily reflect the views of the publisher.</p>
  </notes><ack><title>Acknowledgements</title><p id="d2e4447">We thank Stefano Sofia, Petter Palmgren, Nicolas Roux, Giacomo Villa, Guillém Martin Bellido, and Lorenzo Bertranda for invaluable discussions within the EAWS working group <italic>Matrix &amp; Scale</italic>. Filip Kyzek, Mark Diggins, Igor Chiambretti, Giacomo Villa, Guillém Martin Bellido, and Lorenzo Bertranda provided data. We thank the reviewers Erich Peitzsch and Benjamin Reuter for their valuable feedback, and we thank the editor Pascal Haegeli for his feedback on the initial manuscript, which ultimately led to the decision to separate the conceptual development <xref ref-type="bibr" rid="bib1.bibx23" id="paren.59"/> and the operational analysis (this study) into two publications. The analysis was conducted using the programming languages <italic>R</italic> <xref ref-type="bibr" rid="bib1.bibx25" id="paren.60"/>. We acknowledge the use of <italic>ChatGPT-5.2</italic> (OpenAI) to support language editing of this manuscript and to assist with debugging of code.</p></ack><notes notes-type="reviewstatement"><title>Review statement</title>

      <p id="d2e4467">This paper was edited by Pascal Haegeli and reviewed by Erich Peitzsch and Benjamin Reuter.</p>
  </notes><ref-list>
    <title>References</title>

      <ref id="bib1.bibx1"><label>Breiman et al.(2017)</label><mixed-citation>Breiman, L., Friedman, J. H., Olshen, R. A., and Stone, C. J.: Classification  and Regression Trees, Chapman and Hall/CRC, New York, <ext-link xlink:href="https://doi.org/10.1201/9781315139470" ext-link-type="DOI">10.1201/9781315139470</ext-link>, ISBN 9781315139470, 2017.</mixed-citation></ref>
      <ref id="bib1.bibx2"><label>Campbell and Fiske(1959)</label><mixed-citation>Campbell, D. T. and Fiske, D. W.: Convergent and discriminant validation by the multitrait-multimethod matrix, Psychol. Bull., 56, 81–105,  <ext-link xlink:href="https://doi.org/10.1037/h0046016" ext-link-type="DOI">10.1037/h0046016</ext-link>, 1959.</mixed-citation></ref>
      <ref id="bib1.bibx3"><label>Chicco and Jurman(2020)</label><mixed-citation>Chicco, D. and Jurman, G.: The advantages of the Matthews correlation  coefficient (MCC) over F1 score and accuracy in binary classification  evaluation, BMC Genomics, 21, 6, <ext-link xlink:href="https://doi.org/10.1186/s12864-019-6413-7" ext-link-type="DOI">10.1186/s12864-019-6413-7</ext-link>, 2020.</mixed-citation></ref>
      <ref id="bib1.bibx4"><label>Clark(2019)</label><mixed-citation>Clark, T.: Exploring the Link between the Conceptual Model of Avalanche Hazard and the North American Public Avalanche Danger Scale, Master's thesis, Simon Fraser University, <uri>https://summit.sfu.ca/_flysystem/fedora/sfu_migrate/18786/etd20073.pdf</uri> (last access: 23 December 2025), 2019.</mixed-citation></ref>
      <ref id="bib1.bibx5"><label>Clark and Haegeli(2018)</label><mixed-citation>Clark, T. and Haegeli, P.: Establishing the link between the Conceptual Model  of Avalanche Hazard and the North American Public Avalanche Danger Scale:  initial explorations from Canada, in: Proceedings ISSW 2018. International  Snow Science Workshop, Innsbruck, Austria, 7–12 October 2018, 1116–1120, <uri>https://arc.lib.montana.edu/snow-science/objects/ISSW2018_O12.4.pdf</uri> (last access: 3 March 2026), 2018.</mixed-citation></ref>
      <ref id="bib1.bibx6"><label>EAWS(2025a)</label><mixed-citation>EAWS: Determination of the avalanche danger level in regional avalanche  forecasting, <uri>https://www.avalanches.org/wp-content/uploads/2025/08/EAWS_matrix_definitions_EN.pdf</uri> (last access: 23 December 2025), 2025a.</mixed-citation></ref>
      <ref id="bib1.bibx7"><label>EAWS(2025b)</label><mixed-citation>EAWS: European Avalanche Danger Scale, <uri>https://www.avalanches.org/wp-content/uploads/2022/09/European_Avalanche_Danger_Scale-EAWS.pdf</uri> (last access: 10 July 2025), 2025b.</mixed-citation></ref>
      <ref id="bib1.bibx8"><label>EAWS(2025c)</label><mixed-citation>EAWS: Typical avalanche problems, <uri>https://www.avalanches.org/wp-content/uploads/2022/09/EN_EAWS_avalanche_problems.pdf</uri> (last access, 10 July 2025), 2025c.</mixed-citation></ref>
      <ref id="bib1.bibx9"><label>EAWS(2025d)</label><mixed-citation>EAWS: Information pyramid, <uri>https://www.avalanches.org/wp-content/uploads/2022/09/Content_and_Structure_Avalanche_Bulletin-EAWS.pdf</uri> (last access: 10 July 2025), 2025d.</mixed-citation></ref>
      <ref id="bib1.bibx10"><label>Engeset et al.(2018)</label><mixed-citation>Engeset, R. V., Pfuhl, G., Landrø, M., Mannberg, A., and Hetland, A.: Communicating public avalanche warnings – what works?, Nat. Hazards Earth Syst. Sci., 18, 2537–2559, <ext-link xlink:href="https://doi.org/10.5194/nhess-18-2537-2018" ext-link-type="DOI">10.5194/nhess-18-2537-2018</ext-link>, 2018.</mixed-citation></ref>
      <ref id="bib1.bibx11"><label>Galimberti et al.(2012)</label><mixed-citation> Galimberti, G., Soffritti, G., and Di Maso, M.: Classification trees for  ordinal responses in R: the rpartScore package, J. Stat. Softw., 47, 1–25, 2012.</mixed-citation></ref>
      <ref id="bib1.bibx12"><label>Gorodkin(2004)</label><mixed-citation>Gorodkin, J.: Comparing two K-category assignments by a K-category  correlation coefficient, Comput. Biol. Chem., 28, 367–374,  <ext-link xlink:href="https://doi.org/10.1016/j.compbiolchem.2004.09.006" ext-link-type="DOI">10.1016/j.compbiolchem.2004.09.006</ext-link>, 2004.</mixed-citation></ref>
      <ref id="bib1.bibx13"><label>Haegeli et al.(2006)</label><mixed-citation>Haegeli, P., McCammon, I., Jamieson, B., Israelson, C., and Statham, G.: The Avaluator – A Canadian rule-based avalanche decision support tool for amateur recreationists, in: Proceedings International Snow Science Workshop, Telluride, Colorado, USA, 254–263 pp., <uri>https://arc.lib.montana.edu/snow-science/objects/issw-2006-254-263.pdf</uri> (last access: 3 March 2026), 2006.</mixed-citation></ref>
      <ref id="bib1.bibx14"><label>Harvey et al.(2012)H</label><mixed-citation> Harvey, S., Rhyner, H., and Schweizer, J.: Lawinenkunde, Bruckmann Verlag  GmbH, München, ISBN 978-3-7654-5779-1, 2012.</mixed-citation></ref>
      <ref id="bib1.bibx15"><label>Hutter et al.(2021)</label><mixed-citation>Hutter, V., Techel, F., and Purves, R. S.: How is avalanche danger described in textual descriptions in avalanche forecasts in Switzerland? Consistency between forecasters and avalanche danger, Nat. Hazards Earth Syst. Sci., 21, 3879–3897, <ext-link xlink:href="https://doi.org/10.5194/nhess-21-3879-2021" ext-link-type="DOI">10.5194/nhess-21-3879-2021</ext-link>, 2021.</mixed-citation></ref>
      <ref id="bib1.bibx16"><label>Kahneman et al.(2021)</label><mixed-citation> Kahneman, D., Sibony, O., and Sunstein, C.: Noise: A flaw in human judgment,  William Collins, London, U.K., ISBN 978-0-00-830899-5, 2021.</mixed-citation></ref>
      <ref id="bib1.bibx17"><label>Kuhn(2008)</label><mixed-citation>Kuhn, M.: Building Predictive Models in R Using the caret Package, J. Stat. Softw., 28, 1–26, <ext-link xlink:href="https://doi.org/10.18637/jss.v028.i05" ext-link-type="DOI">10.18637/jss.v028.i05</ext-link>, 2008.</mixed-citation></ref>
      <ref id="bib1.bibx18"><label>Lazar et al.(2016)</label><mixed-citation>Lazar, B., Trautmann, S., Cooperstein, M., Greene, E., and Birkeland, K.: North American avalanche danger scale: Do backcountry forecasters apply it  consistently?, in: Proceedings ISSW 2016. International Snow Science  Workshop, Breckenridge, 2–7 October 2016, CO, 457–465, <uri>https://arc.lib.montana.edu/snow-science/objects/ISSW16_O20.01.pdf</uri> (last access: 3 March 2026),  2016.</mixed-citation></ref>
      <ref id="bib1.bibx19"><label>Matthews(1975)</label><mixed-citation> Matthews, B. W.: Comparison of the predicted and observed secondary structure  of T4 phage lysozyme, Biochimica et Biophysica Acta (BBA)-Protein Structure,  405, 442–451, 1975.</mixed-citation></ref>
      <ref id="bib1.bibx20"><label>Miller(1956)</label><mixed-citation>Miller, G.: The magical number seven, plus or minus two: Some limits on our  capacity for processing information, Psychol. Rev., 63, 81–97,  <ext-link xlink:href="https://doi.org/10.1037/h0043158" ext-link-type="DOI">10.1037/h0043158</ext-link>, 1956.</mixed-citation></ref>
      <ref id="bib1.bibx21"><label>Mitterer et al.(2018)</label><mixed-citation>Mitterer, C., Lanzanasto, N., Nairz, P., Boninsegna, A., Munari, M., Geier, G., Rastner, L., Gheser, F., Trenti, A., Begnini, S., Tognoni, G., Pucher, A., Nell, D., Kriz, K., and Mair, R.: Project ALBINA: A conceptual framework for a consistent, cross-border and multilingual regional avalanche forecasting system, in: Proceedings ISSW 2018. International Snow Science Workshop Innsbruck, Austria, 7–12 October 2018, 1523–1530, <uri>https://arc.lib.montana.edu/snow-science/objects/ISSW2018_O17.8.pdf</uri> (last access: 3 March 2026),  2018.</mixed-citation></ref>
      <ref id="bib1.bibx22"><label>Müller et al.(2016)</label><mixed-citation>Müller, K., Mitterer, C., Engeset, R., Ekker, R., and Kosberg, S.: Combining the conceptual model of avalanche hazard with the Bavarian matrix, in: Proceedings ISSW 2016. International Snow Science Workshop, CO, USA, 2–7 October 2016, Breckenridge, 472–479, <uri>https://arc.lib.montana.edu/snow-science/objects/ISSW16_O20.03.pdf</uri> (last access: 3 March 2026),  2016.</mixed-citation></ref>
      <ref id="bib1.bibx23"><label>Müller et al.(2025)</label><mixed-citation>Müller, K., Techel, F., and Mitterer, C.: The EAWS matrix, a decision support tool to determine the regional avalanche danger level (Part A): conceptual development, Nat. Hazards Earth Syst. Sci., 25, 4503–4525, <ext-link xlink:href="https://doi.org/10.5194/nhess-25-4503-2025" ext-link-type="DOI">10.5194/nhess-25-4503-2025</ext-link>, 2025.</mixed-citation></ref>
      <ref id="bib1.bibx24"><label>Murphy(1993)</label><mixed-citation>Murphy, A. H.: What is a good forecast? An essay on the nature of goodness in weather forecasting, Weather Forecasting, 8, 281–293,  <ext-link xlink:href="https://doi.org/10.1175/1520-0434(1993)008&lt;0281:WIAGFA&gt;2.0.CO;2" ext-link-type="DOI">10.1175/1520-0434(1993)008&lt;0281:WIAGFA&gt;2.0.CO;2</ext-link>, 1993.</mixed-citation></ref>
      <ref id="bib1.bibx25"><label>R Core Team(2024)</label><mixed-citation>R Core Team: R: A Language and Environment for Statistical Computing, R  Foundation for Statistical Computing, Vienna, Austria,  <uri>https://www.R-project.org/</uri> (last access: 3 March 2026), 2024.</mixed-citation></ref>
      <ref id="bib1.bibx26"><label>Reuter and Schweizer(2018)</label><mixed-citation>Reuter, B. and Schweizer, J.: Describing snow instability by failure  initiation, crack propagation, and slab tensile support, Geophys. Res.  Lett., 45, 7019–7029, <ext-link xlink:href="https://doi.org/10.1029/2018GL078069" ext-link-type="DOI">10.1029/2018GL078069</ext-link>, 2018.</mixed-citation></ref>
      <ref id="bib1.bibx27"><label>Schmudlach and Köhler(2016)</label><mixed-citation>Schmudlach, G. and Köhler, J.: Automated avalanche risk rating of backcountry ski routes, in: Proceedings ISSW 2016. International Snow Science Workshop, 2–7 October 2016, Breckenridge, CO, 450–456, <uri>https://arc.lib.montana.edu/snow-science/objects/ISSW16_O19.04.pdf</uri> (last access: 3 March 2026),  2016. </mixed-citation></ref>
      <ref id="bib1.bibx28"><label>Schweizer et al.(2020)</label><mixed-citation>Schweizer, J., Mitterer, C., Techel, F., Stoffel, A., and Reuter, B.: On the relation between avalanche occurrence and avalanche danger level, The Cryosphere, 14, 737–750, <ext-link xlink:href="https://doi.org/10.5194/tc-14-737-2020" ext-link-type="DOI">10.5194/tc-14-737-2020</ext-link>, 2020.</mixed-citation></ref>
      <ref id="bib1.bibx29"><label>SLF(2024)</label><mixed-citation>SLF: Avalanche bulletin interpretation guide, WSL Institute for Snow and  Avalanche Research SLF, November 2024th edn., <uri>https://www.slf.ch/fileadmin/user_upload/SLF/Lawinenbulletin_Schneesituation/Wissen_zum_Lawinenbulletin/Interpretationshilfe/Interpretationshilfe_EN.pdf</uri> (last access: 10 July 2025), 2024.</mixed-citation></ref>
      <ref id="bib1.bibx30"><label>Statham et al.(2018)</label><mixed-citation>Statham, G., Haegeli, P., Greene, E., Birkeland, K., Israelson, C., Tremper,  B., Stethem, C., McMahon, B., White, B., and Kelly, J.: A conceptual model of  avalanche hazard, Nat. Hazards, 90, 663–691,  <ext-link xlink:href="https://doi.org/10.1007/s11069-017-3070-5" ext-link-type="DOI">10.1007/s11069-017-3070-5</ext-link>, 2018.</mixed-citation></ref>
      <ref id="bib1.bibx31"><label>Stewart and Lusk(1994)</label><mixed-citation>Stewart, T. and Lusk, C.: Seven components of judgmental forecasting skill:  implications for research and the improvement of forecasts, J. Forecasting, 13, 579–599, <ext-link xlink:href="https://doi.org/10.1002/for.3980130703" ext-link-type="DOI">10.1002/for.3980130703</ext-link>, 1994.</mixed-citation></ref>
      <ref id="bib1.bibx32"><label>Techel et al.(2020)</label><mixed-citation>Techel, F., Müller, K., and Schweizer, J.: On the importance of snowpack stability, the frequency distribution of snowpack stability, and avalanche size in assessing the avalanche danger level, The Cryosphere, 14, 3503–3521, <ext-link xlink:href="https://doi.org/10.5194/tc-14-3503-2020" ext-link-type="DOI">10.5194/tc-14-3503-2020</ext-link>, 2020.</mixed-citation></ref>
      <ref id="bib1.bibx33"><label>Techel et al.(2024)</label><mixed-citation>Techel, F., Lucas, C., Pielmeier, C., Müller, K., and Morreau, M.: Unreliability in expert estimates of factors determining avalanche danger and  impact on danger level estimation with the Matrix, in: Proceedings  International Snow Science Workshop, Tromsø, Norway, 23–29 September 2024, 264–271, <uri>https://arc.lib.montana.edu/snow-science/item.php?id=3144</uri> (last access: 3 March 2026), 2024.</mixed-citation></ref>
      <ref id="bib1.bibx34"><label>Techel et al.(2025)</label><mixed-citation>Techel, F., Müller, K., Mitterer, C., and Marquardt, C.: Data for publication: The EAWS Matrix, a decision support tool to determine the regional avalanche danger level (Part B): Operational testing and use, Zenodo [data set and code], <ext-link xlink:href="https://doi.org/10.5281/zenodo.18030373" ext-link-type="DOI">10.5281/zenodo.18030373</ext-link>, 2025.</mixed-citation></ref>
      <ref id="bib1.bibx35"><label>Winkler et al.(2024)</label><mixed-citation>Winkler, K., Trachsel, J., Knerr, J., Niederer, U., Weiss, G., Ruesch, M., and Techel, F.: SAFE – a layer-based avalanche forecast editor for better  integration of model predictions, in: Proceedings International Snow Science  Workshop, Tromsø, Norway, 23–29 September 2024, 124–131,  <uri>https://arc.lib.montana.edu/snow-science/item.php?id=3123</uri> (last access: 3 March 2026),  2024.</mixed-citation></ref>

  </ref-list></back>
    <!--<article-title-html>The EAWS matrix, a decision support tool to determine the regional avalanche danger level (Part B): operational testing and use</article-title-html>
<abstract-html/>
<ref-html id="bib1.bib1"><label>Breiman et al.(2017)</label><mixed-citation>
      
Breiman, L., Friedman, J. H., Olshen, R. A., and Stone, C. J.: Classification  and Regression Trees, Chapman and Hall/CRC, New York, <a href="https://doi.org/10.1201/9781315139470" target="_blank">https://doi.org/10.1201/9781315139470</a>, ISBN&thinsp;9781315139470, 2017.

    </mixed-citation></ref-html>
<ref-html id="bib1.bib2"><label>Campbell and Fiske(1959)</label><mixed-citation>
      
Campbell, D. T. and Fiske, D. W.: Convergent and discriminant validation by the multitrait-multimethod matrix, Psychol. Bull., 56, 81–105,  <a href="https://doi.org/10.1037/h0046016" target="_blank">https://doi.org/10.1037/h0046016</a>, 1959.

    </mixed-citation></ref-html>
<ref-html id="bib1.bib3"><label>Chicco and Jurman(2020)</label><mixed-citation>
      
Chicco, D. and Jurman, G.: The advantages of the Matthews correlation  coefficient (MCC) over F1 score and accuracy in binary classification  evaluation, BMC Genomics, 21, 6, <a href="https://doi.org/10.1186/s12864-019-6413-7" target="_blank">https://doi.org/10.1186/s12864-019-6413-7</a>, 2020.

    </mixed-citation></ref-html>
<ref-html id="bib1.bib4"><label>Clark(2019)</label><mixed-citation>
      
Clark, T.: Exploring the Link between the Conceptual Model of Avalanche Hazard and the North American Public Avalanche Danger Scale, Master's thesis, Simon Fraser University, <a href="https://summit.sfu.ca/_flysystem/fedora/sfu_migrate/18786/etd20073.pdf" target="_blank"/> (last access: 23 December 2025), 2019.

    </mixed-citation></ref-html>
<ref-html id="bib1.bib5"><label>Clark and Haegeli(2018)</label><mixed-citation>
      
Clark, T. and Haegeli, P.: Establishing the link between the Conceptual Model  of Avalanche Hazard and the North American Public Avalanche Danger Scale:  initial explorations from Canada, in: Proceedings ISSW 2018. International  Snow Science Workshop, Innsbruck, Austria, 7–12 October 2018, 1116–1120, <a href="https://arc.lib.montana.edu/snow-science/objects/ISSW2018_O12.4.pdf" target="_blank"/> (last access: 3 March 2026),
2018.

    </mixed-citation></ref-html>
<ref-html id="bib1.bib6"><label>EAWS(2025a)</label><mixed-citation>
      
EAWS: Determination of the avalanche danger level in regional avalanche  forecasting, <a href="https://www.avalanches.org/wp-content/uploads/2025/08/EAWS_matrix_definitions_EN.pdf" target="_blank"/>
(last access: 23 December 2025), 2025a.

    </mixed-citation></ref-html>
<ref-html id="bib1.bib7"><label>EAWS(2025b)</label><mixed-citation>
      
EAWS: European Avalanche Danger Scale, <a href="https://www.avalanches.org/wp-content/uploads/2022/09/European_Avalanche_Danger_Scale-EAWS.pdf" target="_blank"/>
(last access: 10 July 2025), 2025b.

    </mixed-citation></ref-html>
<ref-html id="bib1.bib8"><label>EAWS(2025c)</label><mixed-citation>
      
EAWS: Typical avalanche problems, <a href="https://www.avalanches.org/wp-content/uploads/2022/09/EN_EAWS_avalanche_problems.pdf" target="_blank"/>
(last access, 10 July 2025), 2025c.

    </mixed-citation></ref-html>
<ref-html id="bib1.bib9"><label>EAWS(2025d)</label><mixed-citation>
      
EAWS: Information pyramid, <a href="https://www.avalanches.org/wp-content/uploads/2022/09/Content_and_Structure_Avalanche_Bulletin-EAWS.pdf" target="_blank"/> (last access: 10 July 2025), 2025d.

    </mixed-citation></ref-html>
<ref-html id="bib1.bib10"><label>Engeset et al.(2018)</label><mixed-citation>
      
Engeset, R. V., Pfuhl, G., Landrø, M., Mannberg, A., and Hetland, A.: Communicating public avalanche warnings – what works?, Nat. Hazards Earth Syst. Sci., 18, 2537–2559, <a href="https://doi.org/10.5194/nhess-18-2537-2018" target="_blank">https://doi.org/10.5194/nhess-18-2537-2018</a>, 2018.

    </mixed-citation></ref-html>
<ref-html id="bib1.bib11"><label>Galimberti et al.(2012)</label><mixed-citation>
      
Galimberti, G., Soffritti, G., and Di Maso, M.: Classification trees for  ordinal responses in R: the rpartScore package, J. Stat. Softw., 47, 1–25, 2012.

    </mixed-citation></ref-html>
<ref-html id="bib1.bib12"><label>Gorodkin(2004)</label><mixed-citation>
      
Gorodkin, J.: Comparing two K-category assignments by a K-category  correlation coefficient, Comput. Biol. Chem., 28, 367–374,  <a href="https://doi.org/10.1016/j.compbiolchem.2004.09.006" target="_blank">https://doi.org/10.1016/j.compbiolchem.2004.09.006</a>, 2004.

    </mixed-citation></ref-html>
<ref-html id="bib1.bib13"><label>Haegeli et al.(2006)</label><mixed-citation>
      
Haegeli, P., McCammon, I., Jamieson, B., Israelson, C., and Statham, G.: The Avaluator – A Canadian rule-based avalanche decision support tool for amateur recreationists, in: Proceedings International Snow Science Workshop, Telluride, Colorado, USA, 254–263 pp., <a href="https://arc.lib.montana.edu/snow-science/objects/issw-2006-254-263.pdf" target="_blank"/> (last access: 3 March 2026), 2006.

    </mixed-citation></ref-html>
<ref-html id="bib1.bib14"><label>Harvey et al.(2012)H</label><mixed-citation>
      
Harvey, S., Rhyner, H., and Schweizer, J.: Lawinenkunde, Bruckmann Verlag  GmbH, München, ISBN&thinsp;978-3-7654-5779-1, 2012.

    </mixed-citation></ref-html>
<ref-html id="bib1.bib15"><label>Hutter et al.(2021)</label><mixed-citation>
      
Hutter, V., Techel, F., and Purves, R. S.: How is avalanche danger described in textual descriptions in avalanche forecasts in Switzerland? Consistency between forecasters and avalanche danger, Nat. Hazards Earth Syst. Sci., 21, 3879–3897, <a href="https://doi.org/10.5194/nhess-21-3879-2021" target="_blank">https://doi.org/10.5194/nhess-21-3879-2021</a>, 2021.

    </mixed-citation></ref-html>
<ref-html id="bib1.bib16"><label>Kahneman et al.(2021)</label><mixed-citation>
      
Kahneman, D., Sibony, O., and Sunstein, C.: Noise: A flaw in human judgment,  William Collins, London, U.K., ISBN&thinsp;978-0-00-830899-5, 2021.

    </mixed-citation></ref-html>
<ref-html id="bib1.bib17"><label>Kuhn(2008)</label><mixed-citation>
      
Kuhn, M.: Building Predictive Models in R Using the caret Package, J. Stat. Softw., 28, 1–26, <a href="https://doi.org/10.18637/jss.v028.i05" target="_blank">https://doi.org/10.18637/jss.v028.i05</a>, 2008.

    </mixed-citation></ref-html>
<ref-html id="bib1.bib18"><label>Lazar et al.(2016)</label><mixed-citation>
      
Lazar, B., Trautmann, S., Cooperstein, M., Greene, E., and Birkeland, K.: North American avalanche danger scale: Do backcountry forecasters apply it  consistently?, in: Proceedings ISSW 2016. International Snow Science  Workshop, Breckenridge, 2–7 October 2016, CO, 457–465, <a href="https://arc.lib.montana.edu/snow-science/objects/ISSW16_O20.01.pdf" target="_blank"/> (last access: 3 March 2026),  2016.

    </mixed-citation></ref-html>
<ref-html id="bib1.bib19"><label>Matthews(1975)</label><mixed-citation>
      
Matthews, B. W.: Comparison of the predicted and observed secondary structure  of T4 phage lysozyme, Biochimica et Biophysica Acta (BBA)-Protein Structure,  405, 442–451, 1975.

    </mixed-citation></ref-html>
<ref-html id="bib1.bib20"><label>Miller(1956)</label><mixed-citation>
      
Miller, G.: The magical number seven, plus or minus two: Some limits on our  capacity for processing information, Psychol. Rev., 63, 81–97,  <a href="https://doi.org/10.1037/h0043158" target="_blank">https://doi.org/10.1037/h0043158</a>, 1956.

    </mixed-citation></ref-html>
<ref-html id="bib1.bib21"><label>Mitterer et al.(2018)</label><mixed-citation>
      
Mitterer, C., Lanzanasto, N., Nairz, P., Boninsegna, A., Munari, M., Geier, G., Rastner, L., Gheser, F., Trenti, A., Begnini, S., Tognoni, G., Pucher, A., Nell, D., Kriz, K., and Mair, R.: Project ALBINA: A conceptual framework for a consistent, cross-border and multilingual regional avalanche forecasting system, in: Proceedings ISSW 2018. International Snow Science Workshop Innsbruck, Austria, 7–12 October 2018, 1523–1530, <a href="https://arc.lib.montana.edu/snow-science/objects/ISSW2018_O17.8.pdf" target="_blank"/> (last access: 3 March 2026),  2018.

    </mixed-citation></ref-html>
<ref-html id="bib1.bib22"><label>Müller et al.(2016)</label><mixed-citation>
      
Müller, K., Mitterer, C., Engeset, R., Ekker, R., and Kosberg, S.: Combining the conceptual model of avalanche hazard with the Bavarian matrix, in: Proceedings ISSW 2016. International Snow Science Workshop, CO, USA, 2–7 October 2016, Breckenridge, 472–479, <a href="https://arc.lib.montana.edu/snow-science/objects/ISSW16_O20.03.pdf" target="_blank"/> (last access: 3 March 2026),  2016.

    </mixed-citation></ref-html>
<ref-html id="bib1.bib23"><label>Müller et al.(2025)</label><mixed-citation>
      
Müller, K., Techel, F., and Mitterer, C.: The EAWS matrix, a decision support tool to determine the regional avalanche danger level (Part A): conceptual development, Nat. Hazards Earth Syst. Sci., 25, 4503–4525, <a href="https://doi.org/10.5194/nhess-25-4503-2025" target="_blank">https://doi.org/10.5194/nhess-25-4503-2025</a>, 2025.

    </mixed-citation></ref-html>
<ref-html id="bib1.bib24"><label>Murphy(1993)</label><mixed-citation>
      
Murphy, A. H.: What is a good forecast? An essay on the nature of goodness in weather forecasting, Weather Forecasting, 8, 281–293,  <a href="https://doi.org/10.1175/1520-0434(1993)008&lt;0281:WIAGFA&gt;2.0.CO;2" target="_blank">https://doi.org/10.1175/1520-0434(1993)008&lt;0281:WIAGFA&gt;2.0.CO;2</a>, 1993.

    </mixed-citation></ref-html>
<ref-html id="bib1.bib25"><label>R Core Team(2024)</label><mixed-citation>
      
R Core Team: R: A Language and Environment for Statistical Computing, R  Foundation for Statistical Computing, Vienna, Austria,  <a href="https://www.R-project.org/" target="_blank"/> (last access: 3 March 2026), 2024.

    </mixed-citation></ref-html>
<ref-html id="bib1.bib26"><label>Reuter and Schweizer(2018)</label><mixed-citation>
      
Reuter, B. and Schweizer, J.: Describing snow instability by failure  initiation, crack propagation, and slab tensile support, Geophys. Res.  Lett., 45, 7019–7029, <a href="https://doi.org/10.1029/2018GL078069" target="_blank">https://doi.org/10.1029/2018GL078069</a>, 2018.

    </mixed-citation></ref-html>
<ref-html id="bib1.bib27"><label>Schmudlach and Köhler(2016)</label><mixed-citation>
      
Schmudlach, G. and Köhler, J.: Automated avalanche risk rating of backcountry ski routes, in: Proceedings ISSW 2016. International Snow Science Workshop, 2–7 October 2016, Breckenridge, CO, 450–456, <a href="https://arc.lib.montana.edu/snow-science/objects/ISSW16_O19.04.pdf" target="_blank"/> (last access: 3 March 2026),  2016.


    </mixed-citation></ref-html>
<ref-html id="bib1.bib28"><label>Schweizer et al.(2020)</label><mixed-citation>
      
Schweizer, J., Mitterer, C., Techel, F., Stoffel, A., and Reuter, B.: On the relation between avalanche occurrence and avalanche danger level, The Cryosphere, 14, 737–750, <a href="https://doi.org/10.5194/tc-14-737-2020" target="_blank">https://doi.org/10.5194/tc-14-737-2020</a>, 2020.

    </mixed-citation></ref-html>
<ref-html id="bib1.bib29"><label>SLF(2024)</label><mixed-citation>
      
SLF: Avalanche bulletin interpretation guide, WSL Institute for Snow and  Avalanche Research SLF, November 2024th edn.,
<a href="https://www.slf.ch/fileadmin/user_upload/SLF/Lawinenbulletin_Schneesituation/Wissen_zum_Lawinenbulletin/Interpretationshilfe/Interpretationshilfe_EN.pdf" target="_blank"/> (last access: 10 July 2025), 2024.

    </mixed-citation></ref-html>
<ref-html id="bib1.bib30"><label>Statham et al.(2018)</label><mixed-citation>
      
Statham, G., Haegeli, P., Greene, E., Birkeland, K., Israelson, C., Tremper,  B., Stethem, C., McMahon, B., White, B., and Kelly, J.: A conceptual model of  avalanche hazard, Nat. Hazards, 90, 663–691,  <a href="https://doi.org/10.1007/s11069-017-3070-5" target="_blank">https://doi.org/10.1007/s11069-017-3070-5</a>, 2018.

    </mixed-citation></ref-html>
<ref-html id="bib1.bib31"><label>Stewart and Lusk(1994)</label><mixed-citation>
      
Stewart, T. and Lusk, C.: Seven components of judgmental forecasting skill:  implications for research and the improvement of forecasts, J. Forecasting, 13, 579–599, <a href="https://doi.org/10.1002/for.3980130703" target="_blank">https://doi.org/10.1002/for.3980130703</a>, 1994.

    </mixed-citation></ref-html>
<ref-html id="bib1.bib32"><label>Techel et al.(2020)</label><mixed-citation>
      
Techel, F., Müller, K., and Schweizer, J.: On the importance of snowpack stability, the frequency distribution of snowpack stability, and avalanche size in assessing the avalanche danger level, The Cryosphere, 14, 3503–3521, <a href="https://doi.org/10.5194/tc-14-3503-2020" target="_blank">https://doi.org/10.5194/tc-14-3503-2020</a>, 2020.

    </mixed-citation></ref-html>
<ref-html id="bib1.bib33"><label>Techel et al.(2024)</label><mixed-citation>
      
Techel, F., Lucas, C., Pielmeier, C., Müller, K., and Morreau, M.: Unreliability in expert estimates of factors determining avalanche danger and  impact on danger level estimation with the Matrix, in: Proceedings  International Snow Science Workshop, Tromsø, Norway, 23–29 September 2024, 264–271, <a href="https://arc.lib.montana.edu/snow-science/item.php?id=3144" target="_blank"/> (last access: 3 March 2026), 2024.

    </mixed-citation></ref-html>
<ref-html id="bib1.bib34"><label>Techel et al.(2025)</label><mixed-citation>
      
Techel, F., Müller, K., Mitterer, C., and Marquardt, C.: Data for publication: The EAWS Matrix, a decision support tool to determine the regional avalanche danger level (Part B): Operational testing and use, Zenodo [data set and code], <a href="https://doi.org/10.5281/zenodo.18030373" target="_blank">https://doi.org/10.5281/zenodo.18030373</a>, 2025.

    </mixed-citation></ref-html>
<ref-html id="bib1.bib35"><label>Winkler et al.(2024)</label><mixed-citation>
      
Winkler, K., Trachsel, J., Knerr, J., Niederer, U., Weiss, G., Ruesch, M., and Techel, F.: SAFE – a layer-based avalanche forecast editor for better  integration of model predictions, in: Proceedings International Snow Science  Workshop, Tromsø, Norway, 23–29 September 2024, 124–131,  <a href="https://arc.lib.montana.edu/snow-science/item.php?id=3123" target="_blank"/> (last access: 3 March 2026),  2024.

    </mixed-citation></ref-html>--></article>
