<?xml version="1.0" encoding="UTF-8"?>
<!DOCTYPE article PUBLIC "-//NLM//DTD Journal Publishing with OASIS Tables v3.0 20080202//EN" "journalpub-oasis3.dtd">
<article xmlns:xlink="http://www.w3.org/1999/xlink" xmlns:mml="http://www.w3.org/1998/Math/MathML" xmlns:oasis="http://docs.oasis-open.org/ns/oasis-exchange/table" xml:lang="en" dtd-version="3.0" article-type="research-article">
  <front>
    <journal-meta><journal-id journal-id-type="publisher">NHESS</journal-id><journal-title-group>
    <journal-title>Natural Hazards and Earth System Sciences</journal-title>
    <abbrev-journal-title abbrev-type="publisher">NHESS</abbrev-journal-title><abbrev-journal-title abbrev-type="nlm-ta">Nat. Hazards Earth Syst. Sci.</abbrev-journal-title>
  </journal-title-group><issn pub-type="epub">1684-9981</issn><publisher>
    <publisher-name>Copernicus Publications</publisher-name>
    <publisher-loc>Göttingen, Germany</publisher-loc>
  </publisher></journal-meta>
    <article-meta>
      <article-id pub-id-type="doi">10.5194/nhess-23-2133-2023</article-id><title-group><article-title>Using machine learning algorithms to identify predictors of social vulnerability in the event of a hazard: Istanbul case study</article-title><alt-title>Identifying predictors of social vulnerability using machine learning algorithms​​​​​​​</alt-title>
      </title-group><?xmltex \runningtitle{Identifying predictors of social vulnerability using machine learning algorithms​​​​​​​}?><?xmltex \runningauthor{O. Kalaycıoğlu et al.}?>
      <contrib-group>
        <contrib contrib-type="author" corresp="yes" rid="aff1 aff2">
          <name><surname>Kalaycıoğlu</surname><given-names>Oya</given-names></name>
          <email>oyakalaycioglu@ibu.edu.tr</email>
        <ext-link>https://orcid.org/0000-0003-2183-7080</ext-link></contrib>
        <contrib contrib-type="author" corresp="no" rid="aff3">
          <name><surname>Akhanlı</surname><given-names>Serhat Emre</given-names></name>
          
        </contrib>
        <contrib contrib-type="author" corresp="no" rid="aff4">
          <name><surname>Menteşe</surname><given-names>Emin Yahya</given-names></name>
          
        <ext-link>https://orcid.org/0000-0002-7187-4384</ext-link></contrib>
        <contrib contrib-type="author" corresp="no" rid="aff5">
          <name><surname>Kalaycıoğlu</surname><given-names>Mehmet</given-names></name>
          
        </contrib>
        <contrib contrib-type="author" corresp="no" rid="aff6">
          <name><surname>Kalaycıoğlu</surname><given-names>Sibel</given-names></name>
          
        </contrib>
        <aff id="aff1"><label>1</label><institution>Department of Biostatistics and Medical Informatics, Bolu Abant İzzet Baysal University, Bolu, 14030, Türkiye</institution>
        </aff>
        <aff id="aff2"><label>2</label><institution>Department of Statistical Science, University College London, London, WC1E 6BT, United Kingdom</institution>
        </aff>
        <aff id="aff3"><label>3</label><institution>Department of Statistics, Muğla Sıtkı Koçman University, Muğla, 48000, Türkiye</institution>
        </aff>
        <aff id="aff4"><label>4</label><institution>Kandilli Observatory and Earthquake Research Institute, Boğaziçi University, Istanbul, 34684, Türkiye</institution>
        </aff>
        <aff id="aff5"><label>5</label><institution>Tomorrow's Cities Research Group, Middle East Technical University, Ankara, 06800, Türkiye</institution>
        </aff>
        <aff id="aff6"><label>6</label><institution>Department of Sociology, Middle East Technical University, Ankara, 06800, Türkiye</institution>
        </aff>
      </contrib-group>
      <author-notes><corresp id="corr1">Oya Kalaycıoğlu (oyakalaycioglu@ibu.edu.tr)</corresp></author-notes><pub-date><day>15</day><month>June</month><year>2023</year></pub-date>
      
      <volume>23</volume>
      <issue>6</issue>
      <fpage>2133</fpage><lpage>2156</lpage>
      <history>
        <date date-type="received"><day>8</day><month>July</month><year>2022</year></date>
           <date date-type="rev-request"><day>20</day><month>July</month><year>2022</year></date>
           <date date-type="rev-recd"><day>3</day><month>April</month><year>2023</year></date>
           <date date-type="accepted"><day>10</day><month>May</month><year>2023</year></date>
      </history>
      <permissions>
        <copyright-statement>Copyright: © 2023 Oya Kalaycıoğlu et al.</copyright-statement>
        <copyright-year>2023</copyright-year>
      <license license-type="open-access"><license-p>This work is licensed under the Creative Commons Attribution 4.0 International License. To view a copy of this licence, visit <ext-link ext-link-type="uri" xlink:href="https://creativecommons.org/licenses/by/4.0/">https://creativecommons.org/licenses/by/4.0/</ext-link></license-p></license></permissions><self-uri xlink:href="https://nhess.copernicus.org/articles/23/2133/2023/nhess-23-2133-2023.html">This article is available from https://nhess.copernicus.org/articles/23/2133/2023/nhess-23-2133-2023.html</self-uri><self-uri xlink:href="https://nhess.copernicus.org/articles/23/2133/2023/nhess-23-2133-2023.pdf">The full text article is available as a PDF file from https://nhess.copernicus.org/articles/23/2133/2023/nhess-23-2133-2023.pdf</self-uri>
      <abstract><title>Abstract</title>

      <p id="d1e152">To what extent an individual or group will be affected by the damage of a hazard depends not just on their exposure to the event but on their social vulnerability – that is, how well they are able to anticipate, cope with, resist, and recover from the impact of a hazard. Therefore, for mitigating disaster risk effectively and building a disaster-resilient society to natural hazards, it is essential that policy makers develop an understanding of social vulnerability. This study aims to propose an optimal predictive model that allows decision makers to identify households with high social vulnerability by using a number of easily accessible household variables. In order to develop such a model, we rely on a large dataset comprising a household survey (<inline-formula><mml:math id="M1" display="inline"><mml:mi>n</mml:mi></mml:math></inline-formula> <inline-formula><mml:math id="M2" display="inline"><mml:mo>=</mml:mo></mml:math></inline-formula> 41 093) that was conducted to generate a social vulnerability index (SoVI) in Istanbul, Türkiye. In this study, we assessed the predictive ability of socio-economic, socio-demographic, and housing conditions on the household-level social vulnerability through machine learning models. We used classification and regression tree (CART), random forest (RF), support vector machine (SVM), naïve Bayes (NB), artificial neural network (ANN), <inline-formula><mml:math id="M3" display="inline"><mml:mi>k</mml:mi></mml:math></inline-formula>-nearest neighbours (KNNs), and logistic regression to classify households with respect to their social vulnerability level, which was used as the outcome of these models. Due to the disparity of class size outcome variables, subsampling strategies were applied for dealing with imbalanced data. Among these models, ANN was found to have the optimal predictive performance for discriminating households with low and high social vulnerability when random-majority under sampling was applied (area under the curve (AUC): 0.813). The results from the ANN method indicated that lack of social security, living in a squatter house, and job insecurity were among the most important predictors of social vulnerability to hazards. Additionally, the level of education, the ratio of elderly persons in the household, owning a property, household size, ratio of income earners, and savings of the household were found to be associated with social vulnerability. An open-access R Shiny web application was developed to visually display the performance of machine learning (ML) methods, important variables for the classification of households with high and low social vulnerability, and the spatial distribution of the variables across Istanbul neighbourhoods. The machine learning methodology and the findings that we present in this paper can guide decision makers in identifying social vulnerability effectively and hence let them prioritise actions towards vulnerable groups in terms of needs prior to an event of a hazard.</p>
  </abstract>
    </article-meta>
  </front>
<body>
      

<sec id="Ch1.S1" sec-type="intro">
  <label>1</label><title>Introduction</title>
      <p id="d1e185">The impacts of hazards are increasing at an unprecedented rate as the exposure of communities and individuals increases and climate change amplifies the intensity of the hazards <xref ref-type="bibr" rid="bib1.bibx110" id="paren.1"/>. Moreover, urban expansion and population growth are expected to be mostly in low- and middle-income countries <xref ref-type="bibr" rid="bib1.bibx81 bib1.bibx100" id="paren.2"/>,<?pagebreak page2134?> where vulnerability to hazards are significantly high due to a lack of proper urbanisation practices (e.g. construction codes, infrastructure quality, and infrastructure availability) and socioeconomic characteristics (e.g. poverty, lack of access to livelihoods, and low level of education attainment) <xref ref-type="bibr" rid="bib1.bibx36" id="paren.3"/>.</p>
      <p id="d1e197">In this research, we focus on the socioeconomic aspect of the vulnerability phenomenon, which will be named “social vulnerability” hereafter. Based on the vulnerability definition, “The conditions determined by physical, social, economic and environmental factors or processes which increase the susceptibility of an individual, a community, assets or systems to the impacts of hazards” by <xref ref-type="bibr" rid="bib1.bibx110" id="text.4"/>, we look at specific social factors that may increase the level of adverse impacts due to a hazard. Social vulnerability increases the risks of different social groups in relation to a set of socioeconomic conditions and needs to be determined before a particular hazard hits society <xref ref-type="bibr" rid="bib1.bibx21" id="paren.5"/>. Therefore, identification of the factors that contribute to social vulnerability is crucial for building a more resilient society <xref ref-type="bibr" rid="bib1.bibx6" id="paren.6"/>. In doing so, some characteristics of various layers of society come to the fore in explaining the concept of social vulnerability.</p>
      <p id="d1e209">There is a critical need to assess vulnerabilities for improved preparedness and ability to recover from hazards at different scales; however, only a few studies assessed vulnerability at the individual household level in developing countries <xref ref-type="bibr" rid="bib1.bibx32" id="paren.7"/>. Within this frame, we aim to understand the factors that influence social vulnerability by utilising machine learning (ML) techniques, which give us the chance to deal with big household databases. By that, our target is to provide an efficient approach that can be adopted within different spatial contexts for comprehending the determinants of social vulnerability based on easily accessible databases. ML techniques are capable of handling interactions between variables; thus, the proposed approach considers interactions between factors to reflect the multidimensional and complex nature of social vulnerability. We demonstrate this approach to the Istanbul case study area, in which we benefit from a previous social vulnerability study to test our methodology at household level. For building ML models, we rely on a large dataset of a previous study comprising a household survey (<inline-formula><mml:math id="M4" display="inline"><mml:mi>n</mml:mi></mml:math></inline-formula> <inline-formula><mml:math id="M5" display="inline"><mml:mo>=</mml:mo></mml:math></inline-formula> 41 093) and pre-constructed social vulnerability index (SoVI) of these households. We consider the SoVI scores as an indication of the social vulnerability level for each household, and our focus in this study is to assess to what extent the pre-constructed SoVI (and hence the social vulnerability of the households) can be predicted with machine learning techniques using household data that are available within databases of various institutions and public authorities.</p>
      <p id="d1e229">This study contributes to disaster risk research in several aspects. First, we propose a methodology to identify the descriptors of social vulnerability, which is generic enough to be adopted for any spatial context. The proposed method extracts representative predictors for social vulnerability, which are accessible in most spatial contexts around the world. Second, we introduce ML algorithms into vulnerability assessment practices, which is a relatively overlooked aspect as a method in the disaster risk discipline. It is seen that ML algorithms can be used efficiently to overcome the complexity of the social vulnerability concept, particularly with large datasets. Thirdly, since there are only a limited number of studies which assesses vulnerability at the household level (particularly in developing countries) <xref ref-type="bibr" rid="bib1.bibx32" id="paren.8"/>, our method is an attempt to contribute to the literature by bringing in a more precise approach for estimating social vulnerability in a household scale.</p>
      <p id="d1e236">This paper is structured into the following four sections: (i) context and motivation for this study, which involves a literature review on the social vulnerability context and the approaches developed to measure it, followed by our motivation on why we chose machine learning techniques as an approach to identify the descriptors of social vulnerability (Sect. <xref ref-type="sec" rid="Ch1.S2"/>); (ii) the materials and methods applied within our research (Sect. <xref ref-type="sec" rid="Ch1.S3"/>); (iii) the results that came out as a consequence of our methodology applied (Sect. <xref ref-type="sec" rid="Ch1.S4"/>); and (iv) conclusions and discussions, where we present our findings based on the results and discuss the limitations and room for improvement in our approach (Sects. <xref ref-type="sec" rid="Ch1.S5"/>, <xref ref-type="sec" rid="Ch1.S6"/>, <xref ref-type="sec" rid="Ch1.S7"/>).</p>
</sec>
<sec id="Ch1.S2">
  <label>2</label><title>Background for social vulnerability assessment</title>
      <p id="d1e260">The social, political, and economic characteristics of individuals influence their status of being exposed to disasters <xref ref-type="bibr" rid="bib1.bibx28" id="paren.9"/>. Therefore, the human dimension has become an increasingly popular topic in disaster risk research for comprehensively assessing and understanding the potential impacts of natural hazards <xref ref-type="bibr" rid="bib1.bibx101" id="paren.10"/>. In this regard, social science research in the hazard domain is shaped around questions such as “Which factors influence the adoption of individuals to hazards?”, “Why do people prefer to live in hazardous areas?”, and “How the individuals' risk perception influences their behaviour?” <xref ref-type="bibr" rid="bib1.bibx19" id="paren.11"/>. Answers to these questions could help to understand social indicators of vulnerability, and they explain why people with similar levels of exposure may experience very different levels of adverse impact. Social indicators of vulnerability were studied extensively in the literature (e.g. <xref ref-type="bibr" rid="bib1.bibx6 bib1.bibx46 bib1.bibx21 bib1.bibx31 bib1.bibx115" id="altparen.12"/>). Within these studies, social vulnerability expands over a diverse range of social, individual, and sometimes spatial characteristics.</p>
      <p id="d1e275">Just to mention a few, disability, for example, is one of the most common indicators within social vulnerability literature, in which it is emphasised that disabled people are more disadvantaged in terms of coping against the implications of hazards compared to non-disabled individuals. It is also empirically known that the death rate of disabled people<?pagebreak page2135?> is higher in large-scale disasters such as earthquakes, floods, and tsunamis <xref ref-type="bibr" rid="bib1.bibx103 bib1.bibx89" id="paren.13"/>. Within demographical components, gender is also one of the most commonly used ones, as women are considered more vulnerable to hazards compared to men <xref ref-type="bibr" rid="bib1.bibx71 bib1.bibx75 bib1.bibx47" id="paren.14"/>. With respect to the age dimension, it is acknowledged that children and especially elderly people over 65 who live alone are age groups that can be more affected by any disaster (e.g. <xref ref-type="bibr" rid="bib1.bibx46" id="altparen.15"/>). The responses of children, the elderly, the disabled, and patients to a hazard may not be the same as those of young, healthy people <xref ref-type="bibr" rid="bib1.bibx24" id="paren.16"/>.</p>
      <p id="d1e290">Besides  demographic properties, the characteristics that determine the socioeconomic level such as income, employment status, social security, and household size have an influence on the level of vulnerability (e.g. <xref ref-type="bibr" rid="bib1.bibx23 bib1.bibx58 bib1.bibx45" id="altparen.17"/>). <xref ref-type="bibr" rid="bib1.bibx41" id="text.18"/> showed that the distribution of labour affects the impact of disasters on mortality and morbidity. It must also be noted that socioeconomic status is mostly accompanied by “education level”, which denotes the highest education degree a person has. In several studies, it is implied that higher education level leads to more ability to cope and/or resist hazards, as higher education level enables higher income jobs and a wealthier life (e.g. <xref ref-type="bibr" rid="bib1.bibx119 bib1.bibx8" id="altparen.19"/>).</p>
      <p id="d1e302">In addition to socioeconomic and demographic properties, in some studies, the physical environment is also considered an indicator of social vulnerability, where the infrastructure quality, availability, and access to public resources such as transportation, education, and health facilities are incorporated within the concept (e.g. <xref ref-type="bibr" rid="bib1.bibx34 bib1.bibx30 bib1.bibx57" id="altparen.20"/>). It is assumed that the lack of those opportunities increases the social vulnerability of the individuals within the area of interest.</p>
      <p id="d1e309">In this context, it is seen that descriptors for social vulnerability to hazards are mainly grouped under three dimensions: (i) demographics, (ii) socioeconomics, and (iii) the physical environment. More detailed reviews on social vulnerability indicators can be found at <xref ref-type="bibr" rid="bib1.bibx84 bib1.bibx47 bib1.bibx46" id="paren.21"/>.</p>
      <p id="d1e315">Although there is more or less a consensus on the indicators of social vulnerability, measuring it is challenging due to the complexity of the concept and its latent nature <xref ref-type="bibr" rid="bib1.bibx17" id="paren.22"/>. To quantify social vulnerability as a single metric value, three main statistical modelling approaches are employed: inductive, deductive, and hierarchical. Inductive models combine a set of large indicators into latent factors and then sum these factors to construct a single-index score for social vulnerability. Deductive models contain fewer indicators, which are normalised and summed to construct the index score. Hierarchical designs aggregate indicators into groups (sub-indices) that share an underlying dimension of vulnerability. These sub-indices are then aggregated to construct a vulnerability index. The methodological comparison of these designs and various approaches to constructing a social vulnerability index are reviewed by various authors (e.g. <xref ref-type="bibr" rid="bib1.bibx106 bib1.bibx97 bib1.bibx10" id="altparen.23"/>).</p>
      <p id="d1e324">Among these approaches, the social vulnerability index (SoVI) developed by <xref ref-type="bibr" rid="bib1.bibx31" id="text.24"/> has been one of the most commonly used tools to quantify vulnerability (6840 citations according to Google Scholar by 1st April 2023). In the aforementioned study, SoVI was constructed by factor analysis based on principal components analysis (PCA) in the U.S. county scale based on 42 vulnerability variables. In <xref ref-type="bibr" rid="bib1.bibx31" id="text.25"/>, where the data from areal divisions (U.S. counties) are used, a total of 11 factors were obtained, which explains 76.4 % of the variance in social vulnerability in the U.S. counties. The SoVI scores were calculated by summing the raw metrics for each county, where the higher and lower scores represent high and low social vulnerability, respectively. Various studies thereafter assessed the indicators that could be used to measure social vulnerability for a certain location and time frame <xref ref-type="bibr" rid="bib1.bibx58 bib1.bibx16 bib1.bibx46 bib1.bibx97 bib1.bibx102 bib1.bibx72" id="paren.26"/>. It can be suggested that there is almost a consensus between those studies, where social vulnerability is defined as a function of gender, health status and access to healthcare, poverty, age, property ownership, and socio-economic indicators <xref ref-type="bibr" rid="bib1.bibx63" id="paren.27"/>. For the SoVI, which was constructed in Istanbul in 2018, similar variables and categories were used with reference to <xref ref-type="bibr" rid="bib1.bibx31" id="text.28"/>, but the data were collected via a household survey (for more information on variables see Sect. <xref ref-type="sec" rid="Ch1.S3"/> and Sect. S1 in the Supplement).</p>
      <p id="d1e345">The inductive factor analytic framework proposed by <xref ref-type="bibr" rid="bib1.bibx31" id="text.29"/> to measure social vulnerability has been widely adopted in many studies (e.g. <xref ref-type="bibr" rid="bib1.bibx6 bib1.bibx23 bib1.bibx92 bib1.bibx54 bib1.bibx65 bib1.bibx96 bib1.bibx114" id="altparen.30"/>). SoVI is a valuable tool not only for academics but also for policy makers and governmental bodies, as it allows for making spatial assessments that enable comparison of different spatial entities such as counties, districts, and neighbourhoods with respect to their social vulnerability level (e.g. <xref ref-type="bibr" rid="bib1.bibx102 bib1.bibx112 bib1.bibx40 bib1.bibx37 bib1.bibx48" id="altparen.31"/>). Although SoVI is used in many studies, the vulnerability research which assesses household-level social vulnerability is limited <xref ref-type="bibr" rid="bib1.bibx70 bib1.bibx118 bib1.bibx105" id="paren.32"/>.</p>
      <p id="d1e360">Despite the common usage of SoVI and its advantages, various studies have shown that the prediction of social vulnerability can be enhanced by empirical modelling, utilising historical event data and intensity measures for the given hazard <xref ref-type="bibr" rid="bib1.bibx115 bib1.bibx116 bib1.bibx18" id="paren.33"/>. Relying on empirical data can be considered a more realistic approach for estimating the social vulnerability of a given entity (compared to SoVI); however,<?pagebreak page2136?> the high dependence on data may become an obstacle, particularly for contexts where data scarcity is in place or data sharing protocols are missing. Another drawback of such an approach is that, when catastrophic hazard occurrence is rare, the policy makers can underestimate the impacts of a major hazard event if they rely on historical data from the smaller-scale hazardous events where the losses are much less due to infrastructural investments. Thus, data scarcity and rare occurrence of major hazards make it challenging to use historic data for a hazard-driven social vulnerability research.</p>
      <p id="d1e366">In this respect, SoVI scores are commonly used as a proxy of social vulnerability, which is independent of empirical data and which enables one to develop a more generic methodology that can be applied in different contexts. Within this scope, there are numerous studies that have examined the factors relating to social vulnerability in a hazard by using either descriptive statistics <xref ref-type="bibr" rid="bib1.bibx122 bib1.bibx113" id="paren.34"/> or traditional data analysis tools, such as linear or logistic regression <xref ref-type="bibr" rid="bib1.bibx47 bib1.bibx85 bib1.bibx104 bib1.bibx71 bib1.bibx82" id="paren.35"/>. While the former lacks the incorporation of the relationships between the vulnerability indicators, the latter relies heavily on data assumptions. In contrast, machine learning algorithms allow for a larger number of predictors, can handle complex interactions between predictors, can model nonlinear relationships, and do not make any distributional assumptions regarding the data <xref ref-type="bibr" rid="bib1.bibx98" id="paren.36"/>. In quantitative social research, particularly with large-scale survey data where relationships between socio-demographic and socio-economic variables cannot be ignored, there is an emerging interest in using ML methods for making predictions <xref ref-type="bibr" rid="bib1.bibx20" id="paren.37"/>.</p>

<?xmltex \floatpos{t}?><table-wrap id="Ch1.T1" specific-use="star"><?xmltex \currentcnt{1}?><label>Table 1</label><caption><p id="d1e385">Studies that assess factors related to social vulnerability using ML models.</p></caption><oasis:table frame="topbot"><?xmltex \begin{scaleboxenv}{.97}[.97]?><oasis:tgroup cols="7">
     <oasis:colspec colnum="1" colname="col1" align="justify" colwidth="2.1cm"/>
     <oasis:colspec colnum="2" colname="col2" align="justify" colwidth="1.5cm"/>
     <oasis:colspec colnum="3" colname="col3" align="justify" colwidth="1.6cm"/>
     <oasis:colspec colnum="4" colname="col4" align="justify" colwidth="1.7cm"/>
     <oasis:colspec colnum="5" colname="col5" align="justify" colwidth="1.1cm"/>
     <oasis:colspec colnum="6" colname="col6" align="justify" colwidth="3.4cm"/>
     <oasis:colspec colnum="7" colname="col7" align="justify" colwidth="3.7cm"/>
     <oasis:thead>
       <oasis:row rowsep="1">
         <oasis:entry colname="col1">Study</oasis:entry>
         <oasis:entry colname="col2">Type of <?xmltex \hack{\hfill\break}?>hazard</oasis:entry>
         <oasis:entry colname="col3">Region</oasis:entry>
         <oasis:entry colname="col4">Scale level</oasis:entry>
         <oasis:entry colname="col5">ML model</oasis:entry>
         <oasis:entry colname="col6">Outcome</oasis:entry>
         <oasis:entry colname="col7">Predictors</oasis:entry>
       </oasis:row>
     </oasis:thead>
     <oasis:tbody>
       <oasis:row rowsep="1">
         <oasis:entry colname="col1"><xref ref-type="bibr" rid="bib1.bibx7" id="author.38"/> <?xmltex \hack{\hfill\break}?>(<xref ref-type="bibr" rid="bib1.bibx7" id="year.39"/>)</oasis:entry>
         <oasis:entry colname="col2">Earthquake</oasis:entry>
         <oasis:entry colname="col3">Tabriz, Iran</oasis:entry>
         <oasis:entry colname="col4">Municipality zones</oasis:entry>
         <oasis:entry colname="col5">ANN</oasis:entry>
         <oasis:entry colname="col6">Five-category SoVI</oasis:entry>
         <oasis:entry colname="col7">Seven regional indicators <?xmltex \hack{\hfill\break}?>such as densities of the <?xmltex \hack{\hfill\break}?>population, men, women, <?xmltex \hack{\hfill\break}?>literate people, household, <?xmltex \hack{\hfill\break}?>employed, and unemployed <?xmltex \hack{\hfill\break}?>people</oasis:entry>
       </oasis:row>
       <oasis:row rowsep="1">
         <oasis:entry colname="col1"><xref ref-type="bibr" rid="bib1.bibx39" id="author.40"/> <?xmltex \hack{\hfill\break}?>(<xref ref-type="bibr" rid="bib1.bibx39" id="year.41"/>)</oasis:entry>
         <oasis:entry colname="col2">Earthquake</oasis:entry>
         <oasis:entry colname="col3">Perth city, <?xmltex \hack{\hfill\break}?>Australia</oasis:entry>
         <oasis:entry colname="col4">Households</oasis:entry>
         <oasis:entry colname="col5">CART</oasis:entry>
         <oasis:entry colname="col6">Two-category SV class <?xmltex \hack{\hfill\break}?>variable, assessed with a <?xmltex \hack{\hfill\break}?>risk perception <?xmltex \hack{\hfill\break}?>questionnaire applied to <?xmltex \hack{\hfill\break}?>1100 individuals</oasis:entry>
         <oasis:entry colname="col7">A total of 15 indicators <?xmltex \hack{\hfill\break}?>related to demographic and <?xmltex \hack{\hfill\break}?>economic household <?xmltex \hack{\hfill\break}?>attributes</oasis:entry>
       </oasis:row>
       <oasis:row rowsep="1">
         <oasis:entry colname="col1"><xref ref-type="bibr" rid="bib1.bibx121" id="text.42"/></oasis:entry>
         <oasis:entry colname="col2">Any single <?xmltex \hack{\hfill\break}?>hazard</oasis:entry>
         <oasis:entry colname="col3">South Korea</oasis:entry>
         <oasis:entry colname="col4">Local <?xmltex \hack{\hfill\break}?>communities</oasis:entry>
         <oasis:entry colname="col5">Random forest, cubist</oasis:entry>
         <oasis:entry colname="col6">Community vulnerability, <?xmltex \hack{\hfill\break}?>assessed with indicators <?xmltex \hack{\hfill\break}?>related to economic <?xmltex \hack{\hfill\break}?>damage</oasis:entry>
         <oasis:entry colname="col7">A total of 12 indicators <?xmltex \hack{\hfill\break}?>including social, economic, <?xmltex \hack{\hfill\break}?>and natural environment and <?xmltex \hack{\hfill\break}?>built environment</oasis:entry>
       </oasis:row>
       <oasis:row>
         <oasis:entry colname="col1"><xref ref-type="bibr" rid="bib1.bibx1" id="text.43"/></oasis:entry>
         <oasis:entry colname="col2">Any single <?xmltex \hack{\hfill\break}?>hazard</oasis:entry>
         <oasis:entry colname="col3">Andalusia</oasis:entry>
         <oasis:entry colname="col4">Dwelling units</oasis:entry>
         <oasis:entry colname="col5">CART</oasis:entry>
         <oasis:entry colname="col6">Two-category SV class <?xmltex \hack{\hfill\break}?>variable, which is <?xmltex \hack{\hfill\break}?>obtained from previous <?xmltex \hack{\hfill\break}?>database</oasis:entry>
         <oasis:entry colname="col7">A total of 66 indicators of the <?xmltex \hack{\hfill\break}?>demographic, social, labour, <?xmltex \hack{\hfill\break}?>facilities, and services, etc., <?xmltex \hack{\hfill\break}?>dimensions</oasis:entry>
       </oasis:row>
     </oasis:tbody>
   </oasis:tgroup><?xmltex \end{scaleboxenv}?></oasis:table><table-wrap-foot><p id="d1e388">SV: social vulnerability, CART: classification and regression trees, ANN: artificial neural network.</p></table-wrap-foot><?xmltex \gdef\@currentlabel{1}?></table-wrap>

      <p id="d1e616">A relatively small number of researchers have opted to use ML methodology over traditional statistical techniques in vulnerability research (Table <xref ref-type="table" rid="Ch1.T1"/>), and indeed a detailed model-based assessment of the predictors of social vulnerability to hazards seems lacking. The few studies that employ ML techniques were based on larger sampling units such as districts, neighbourhoods, or communities, in contrast to our study which was based on a household scale. Due to the low number of studies and significant variation in their methodology, scale level, and outcome type, it is difficult to make model-based recommendations. Moreover, the performances of various ML methods are rarely compared in terms of their predictive accuracy for social vulnerability in hazards <xref ref-type="bibr" rid="bib1.bibx121" id="paren.44"/>.</p>
</sec>
<sec id="Ch1.S3">
  <label>3</label><title>Materials and methods</title>
      <p id="d1e632">In our study, we attempt to contribute to social vulnerability research by identifying the most important factors that contribute to the prediction of social vulnerability of households by using the ML approaches. In this regard, we address the following research questions. (1) What is the best-performing ML method for the prediction of social vulnerability? (2) What are the most influential predictors associated with social vulnerability? We posit that, when large datasets are available at the household level, the models developed based on ML algorithms have the potential to predict socially vulnerable households with high accuracy.</p>
      <p id="d1e635">As an indication of hazard-related social vulnerability, we have adopted SoVI, which was previously constructed in Istanbul in 2017 <xref ref-type="bibr" rid="bib1.bibx60 bib1.bibx79" id="paren.45"/>. In this paper we do not intend to discuss the SoVI scores or the methodology of this previous study, but instead, we consider the SoVI scores as a proxy of the social vulnerability state for each household. We assessed to what extent the pre-constructed SoVI (and hence the social vulnerability of the households) can be predicted with machine learning techniques using quantifiable household variable data (such as socio-economic and socio-demographic characteristics and housing conditions) that are assumed to be available within publicly accessible databases provided by statistical institutes of central government agencies or local public authorities. Thus, we aimed at presenting an approach that can reduce the time and economic burden that decision makers can spend collecting data and modelling to identify households with high social vulnerability.</p>
<sec id="Ch1.S3.SS1">
  <label>3.1</label><title>Study area</title>
      <p id="d1e648">Türkiye is in a region that is prone to natural hazards, where a large-scale disaster happens every 7 to 8 years <xref ref-type="bibr" rid="bib1.bibx12" id="paren.46"/>. Among the different types of disasters, earthquakes are responsible for the most extensive losses in terms of both human life and property, accounting for 60 % of disaster-related fatalities in Türkiye <xref ref-type="bibr" rid="bib1.bibx4" id="paren.47"/>. Following earthquakes, landslides (which mostly take the form of rock falls, slides or flows, or mass movements), floods, snow avalanches, and large-scale wildfires are amongst the most commonly occurring hazardous events that have adverse impacts on human lives, as well as the environment and the economy
<xref ref-type="bibr" rid="bib1.bibx4 bib1.bibx25" id="paren.48"/>. Our case study area Istanbul city is also prone to hazardous events, such as earthquakes, flooding, landslides, tsunamis, and extreme weather events <xref ref-type="bibr" rid="bib1.bibx80" id="paren.49"/>. However, our site selection is not only related to Istanbul's location in a hazard-prone area but also mostly related to its high population density and high level of economic investments that increase the expected losses from possible hazards in the city. Istanbul is the 15th most populated city in the world, with a population of approximately 16 million, and it is also the largest metropolitan city in Türkiye <xref ref-type="bibr" rid="bib1.bibx120" id="paren.50"/>. After the 1930s, the city of Istanbul grew steadily and became the heart of Türkiye's economy, producing almost 31 % of the national GDP in 2021 <xref ref-type="bibr" rid="bib1.bibx87" id="paren.51"/>. In the last century, the economic growth triggering mass migration to the city induced uncontrolled illegal housing with low-quality building materials in hazardous areas <xref ref-type="bibr" rid="bib1.bibx107" id="paren.52"/>. Additionally, building codes were<?pagebreak page2137?> updated in 1997, and before that, even if legally constructed, buildings were built with less stringent building codes which did not consider disaster risk <xref ref-type="bibr" rid="bib1.bibx9" id="paren.53"/>. This rapid and uncontrolled urban growth increased vulnerability to hazards in the city <xref ref-type="bibr" rid="bib1.bibx53" id="paren.54"/>. Hence, our study area is selected as a suitable setting for our research on social vulnerability because it is a hazard-prone zone with high population density and poor-quality housing.</p>
</sec>
<sec id="Ch1.S3.SS2">
  <label>3.2</label><title>Data source: social vulnerability research in Istanbul in 2017</title>
<sec id="Ch1.S3.SS2.SSS1">
  <label>3.2.1</label><title>Survey sampling method and application</title>
      <p id="d1e694">To provide a basis for the social vulnerability analysis, a large-scale household survey was carried out by Istanbul Metropolitan Municipality (IMM) in 2017 to assess the disaster-related social vulnerability of the households in Istanbul. The variables used in this research were in line with the social science and disaster literature, where such research is focused generally on the social factors that increase or decrease the impact of specific hazard events on the local population. The authors of this study were given permission to use this survey data after the data were fully anonymised. The exact number of surveys is 41 093 households covering 955 neighbourhoods, with residential occupation expanding over the whole jurisdiction boundary of the metropolitan municipality of Istanbul <xref ref-type="bibr" rid="bib1.bibx60" id="paren.55"/>. The households were randomly selected from the Address Based Population Registration System Database of the Turkish Statistical Institute using the proportionate stratified sampling method. All <inline-formula><mml:math id="M6" display="inline"><mml:mn mathvariant="normal">955</mml:mn></mml:math></inline-formula> neighbourhoods within 39 districts of Istanbul were taken as strata, then households were randomly selected from each neighbourhood. The number of households in each neighbourhood taken is proportional to the neighbourhood population. The survey was conducted via face-to-face interviews with one household member, aged between 18 and 70 and capable of giving relevant and accurate information about the household. The verbal and written informed consents were obtained from the participants during the data collection stage.</p>
</sec>
<sec id="Ch1.S3.SS2.SSS2">
  <label>3.2.2</label><title>Construction of SoVI</title>
      <p id="d1e715">SoVI scores of the selected households were calculated using Cutter's factor analytic framework <xref ref-type="bibr" rid="bib1.bibx31" id="paren.56"/> in social vulnerability research funded and being used by IMM, as explained by <xref ref-type="bibr" rid="bib1.bibx79" id="text.57"/> and Sect. S1. To date, this work by the IMM has been the most comprehensive study for assessing the social vulnerability of households in the event of a hazard, which was originally constructed for earthquake-induced disasters as the most probable major hazard for Istanbul. It considers the concept of social vulnerability as a state that arises from the lack of capacity of society and individuals to cope with natural hazards.  The concept further includes the perception of and preparedness for risk and the measures taken against the risk, as well as cultural values and socio-economic status. To construct SoVI, 53<?pagebreak page2138?> indicators within seven variable clusters (socio-demography, socio-economy, access to health services, social solidarity, risk perception, actions taken to reduce risk, and values) were used, as they are regarded to be related to social vulnerability. The indicators and variable clusters were selected following extensive literature reviews and expert judgement, with a specific focus on earthquake hazards <xref ref-type="bibr" rid="bib1.bibx60" id="paren.58"/>. In the theoretical framework, social vulnerability is considered to be independent of hazard type, and exposure zones to any or all hazards are combined with SoVI to create place vulnerability <xref ref-type="bibr" rid="bib1.bibx28" id="paren.59"/>. Hence the earthquake-related (as the major hazard in Istanbul) data collected in this household survey and the indicators used for SoVI are also assumed to explain other hazard events as well.</p>
      <p id="d1e730">Here we note that it is quite challenging to access/find quality empirical information regarding disaster-related topics in Türkiye as in many developing countries and the global south context. Information related to historical data on disaster impact/losses/recovery is mostly not in place for smaller regional units in Türkiye, and then even if it is there (gathered by related institutions), it is not shared. Therefore, the <xref ref-type="bibr" rid="bib1.bibx31" id="text.60"/> index-based methodology to represent social vulnerability was opted for when constructing SoVI in the previous study by IMM.</p>
</sec>
</sec>
<sec id="Ch1.S3.SS3">
  <label>3.3</label><title>Outcome of the machine learning models: household-level social vulnerability</title>
      <p id="d1e746">In this study, we relied on the pre-constructed SoVI as an indication of the social vulnerability of the households. By that, we used SoVI as the outcome of the machine learning models we tested. The SoVI score does not have any unit, and, rather than its absolute value, its importance lies within its comparative value across various households <xref ref-type="bibr" rid="bib1.bibx29" id="paren.61"/>. Various authors dichotomised social vulnerability index scores in their research both for ease of interpretation and to identify those most vulnerable <xref ref-type="bibr" rid="bib1.bibx39 bib1.bibx1 bib1.bibx14 bib1.bibx82" id="paren.62"/>. In this research, we also aimed to discriminate between the most vulnerable households and all others. Therefore, we defined households with high social vulnerability (SV) as those with SoVI scores <inline-formula><mml:math id="M7" display="inline"><mml:mrow><mml:mo>+</mml:mo><mml:mn mathvariant="normal">1</mml:mn></mml:mrow></mml:math></inline-formula> standard deviation from the mean, which corresponds to 17.2 % of the households, whereas the rest of the households were deemed as low SV. Thus, a binary variable (with an approximate imbalance ratio of <inline-formula><mml:math id="M8" display="inline"><mml:mrow><mml:mn mathvariant="normal">1</mml:mn><mml:mo>/</mml:mo><mml:mn mathvariant="normal">5</mml:mn></mml:mrow></mml:math></inline-formula> in favour of low SV) was generated as an indication of social vulnerability level, which in turn was used as the primary outcome for all the further analyses presented in this paper. Further, from the statistical point of view, we preferred to dichotomise the outcome rather than using it as a multi-category variable, as the available performance metrics for a multi-class confusion matrix are limited compared to a binary classification problem, and the complexity of analysis increases with the increase in a number of classes <xref ref-type="bibr" rid="bib1.bibx74" id="paren.63"/>. Therefore, in accordance with our motivation and for interpretive reasons we used SoVI as a binary outcome.</p>
</sec>
<sec id="Ch1.S3.SS4">
  <label>3.4</label><title>Predictors of the machine learning models and data pre-processing</title>
      <p id="d1e788">We have restricted the variables that are used in the ML models as input variables to quantifiable predictors, which can be obtained from various institutional databases without requiring a household-based survey that is costly and time intensive. These quantifiable predictors are related to the socio-demography and socio-economy of the households as well as housing information. The list of institutions to which the variables used in this study are related is given in Sect. S2. Here we note that, although the household data used in the <xref ref-type="bibr" rid="bib1.bibx60" id="text.64"/> to construct SoVI are focused on earthquakes, the indicators used for social vulnerability classification in the present study can be implemented in a more generic way to assess the possible impact of social vulnerability to other hazards.</p>

<?xmltex \floatpos{p}?><table-wrap id="Ch1.T2" specific-use="star"><?xmltex \currentcnt{2}?><label>Table 2</label><caption><p id="d1e797">Predictors used in ML model building for prediction household-level social vulnerability.</p></caption><oasis:table frame="topbot"><oasis:tgroup cols="3">
     <oasis:colspec colnum="1" colname="col1" align="left"/>
     <oasis:colspec colnum="2" colname="col2" align="justify" colwidth="4.3cm"/>
     <oasis:colspec colnum="3" colname="col3" align="justify" colwidth="9cm"/>
     <oasis:thead>
       <oasis:row rowsep="1">
         <oasis:entry colname="col1">Themes</oasis:entry>
         <oasis:entry colname="col2">Variable</oasis:entry>
         <oasis:entry colname="col3">Definition of a variable or survey question</oasis:entry>
       </oasis:row>
     </oasis:thead>
     <oasis:tbody>
       <oasis:row>
         <oasis:entry colname="col1">Socio-</oasis:entry>
         <oasis:entry rowsep="1" colname="col2">Household size</oasis:entry>
         <oasis:entry rowsep="1" colname="col3">Number of people living in the house (HhS) (range: 1–14)</oasis:entry>
       </oasis:row>
       <oasis:row>
         <oasis:entry colname="col1">demographic</oasis:entry>
         <oasis:entry rowsep="1" colname="col2">Average age</oasis:entry>
         <oasis:entry rowsep="1" colname="col3">Average age of the household members in years (range: 8.8–85)</oasis:entry>
       </oasis:row>
       <oasis:row>
         <oasis:entry colname="col1"/>
         <oasis:entry rowsep="1" colname="col2">Number of women/HhS</oasis:entry>
         <oasis:entry rowsep="1" colname="col3">Ratio of women in the household (range: 0–1)</oasis:entry>
       </oasis:row>
       <oasis:row>
         <oasis:entry colname="col1"/>
         <oasis:entry rowsep="1" colname="col2">Number of men/HhS</oasis:entry>
         <oasis:entry rowsep="1" colname="col3">Ratio of men in the household (range: 0–1)</oasis:entry>
       </oasis:row>
       <oasis:row>
         <oasis:entry colname="col1"/>
         <oasis:entry rowsep="1" colname="col2">Number of <inline-formula><mml:math id="M9" display="inline"><mml:mrow><mml:mo>&lt;</mml:mo><mml:mn mathvariant="normal">5</mml:mn></mml:mrow></mml:math></inline-formula> year olds/HhS</oasis:entry>
         <oasis:entry rowsep="1" colname="col3">Ratio of <inline-formula><mml:math id="M10" display="inline"><mml:mrow><mml:mo>&lt;</mml:mo><mml:mn mathvariant="normal">5</mml:mn></mml:mrow></mml:math></inline-formula>-year-old children in the household (range: 0–0.67)</oasis:entry>
       </oasis:row>
       <oasis:row>
         <oasis:entry colname="col1"/>
         <oasis:entry rowsep="1" colname="col2">Number of <inline-formula><mml:math id="M11" display="inline"><mml:mrow><mml:mo>&gt;</mml:mo><mml:mn mathvariant="normal">65</mml:mn></mml:mrow></mml:math></inline-formula> years of age/HhS</oasis:entry>
         <oasis:entry rowsep="1" colname="col3">Ratio of over <inline-formula><mml:math id="M12" display="inline"><mml:mn mathvariant="normal">65</mml:mn></mml:math></inline-formula>-year-old individuals in the household (range: 0–0.1)</oasis:entry>
       </oasis:row>
       <oasis:row>
         <oasis:entry colname="col1"/>
         <oasis:entry rowsep="1" colname="col2">Average education</oasis:entry>
         <oasis:entry rowsep="1" colname="col3">Average years of education of the household members who are over <inline-formula><mml:math id="M13" display="inline"><mml:mn mathvariant="normal">15</mml:mn></mml:math></inline-formula> years old (range: 0–17)</oasis:entry>
       </oasis:row>
       <oasis:row rowsep="1">
         <oasis:entry colname="col1"/>
         <oasis:entry colname="col2">Social security</oasis:entry>
         <oasis:entry colname="col3">Are there any household members with social security? (yes/no)</oasis:entry>
       </oasis:row>
       <oasis:row>
         <oasis:entry colname="col1">Health</oasis:entry>
         <oasis:entry rowsep="1" colname="col2">Health insurance</oasis:entry>
         <oasis:entry rowsep="1" colname="col3">Are there any household members with health security or insurance? (yes/no)</oasis:entry>
       </oasis:row>
       <oasis:row>
         <oasis:entry colname="col1"/>
         <oasis:entry rowsep="1" colname="col2">Disability</oasis:entry>
         <oasis:entry rowsep="1" colname="col3">Are there any disabled or elderly persons who need care in the Hh? (yes/no)</oasis:entry>
       </oasis:row>
       <oasis:row rowsep="1">
         <oasis:entry colname="col1"/>
         <oasis:entry colname="col2">Health access</oasis:entry>
         <oasis:entry colname="col3">Do you have any hospital/health centre within close proximity to your house? (yes/no)</oasis:entry>
       </oasis:row>
       <oasis:row>
         <oasis:entry colname="col1">Socio-</oasis:entry>
         <oasis:entry rowsep="1" colname="col2">Number of income earners/HhS</oasis:entry>
         <oasis:entry rowsep="1" colname="col3">Ratio of the number of income earners in the household (range: 0–2)</oasis:entry>
       </oasis:row>
       <oasis:row>
         <oasis:entry colname="col1">economic</oasis:entry>
         <oasis:entry rowsep="1" colname="col2">Regular salary income</oasis:entry>
         <oasis:entry rowsep="1" colname="col3">Are there any household members who have regular salary income? (yes/no)</oasis:entry>
       </oasis:row>
       <oasis:row>
         <oasis:entry colname="col1"/>
         <oasis:entry rowsep="1" colname="col2">Pension income</oasis:entry>
         <oasis:entry rowsep="1" colname="col3">Are there any household members who earn pension income? (yes/no)</oasis:entry>
       </oasis:row>
       <oasis:row>
         <oasis:entry colname="col1"/>
         <oasis:entry rowsep="1" colname="col2">Rent income</oasis:entry>
         <oasis:entry rowsep="1" colname="col3">Are there any household members who earn income from rent? (yes/no)</oasis:entry>
       </oasis:row>
       <oasis:row>
         <oasis:entry colname="col1"/>
         <oasis:entry rowsep="1" colname="col2">Income support from public <?xmltex \hack{\hfill\break}?>authorities</oasis:entry>
         <oasis:entry rowsep="1" colname="col3">Are there any household members who receive income support from public authorities? (yes/no)</oasis:entry>
       </oasis:row>
       <oasis:row>
         <oasis:entry colname="col1"/>
         <oasis:entry rowsep="1" colname="col2">Job Insecurity</oasis:entry>
         <oasis:entry rowsep="1" colname="col3">Are there any household members who have job insecurity? i.e. unregistered informal work, unemployment (yes/no)</oasis:entry>
       </oasis:row>
       <oasis:row>
         <oasis:entry colname="col1"/>
         <oasis:entry rowsep="1" colname="col2">House ownership</oasis:entry>
         <oasis:entry rowsep="1" colname="col3">Do any of the household members own the house of your residence? (yes/no)</oasis:entry>
       </oasis:row>
       <oasis:row>
         <oasis:entry colname="col1"/>
         <oasis:entry rowsep="1" colname="col2">Type of the house</oasis:entry>
         <oasis:entry rowsep="1" colname="col3">What is the type of the home of your residence? (apartment flat, squatter house, detached house, gatekeepers lodge)</oasis:entry>
       </oasis:row>
       <oasis:row>
         <oasis:entry colname="col1"/>
         <oasis:entry rowsep="1" colname="col2">Natural gas heating</oasis:entry>
         <oasis:entry rowsep="1" colname="col3">Do you have natural gas heating at the home of your residence? (yes/no)</oasis:entry>
       </oasis:row>
       <oasis:row>
         <oasis:entry colname="col1"/>
         <oasis:entry rowsep="1" colname="col2">Own house in Istanbul</oasis:entry>
         <oasis:entry rowsep="1" colname="col3">Are there any household members who own a house in Istanbul, other than the home of residence? (yes/no)</oasis:entry>
       </oasis:row>
       <oasis:row>
         <oasis:entry colname="col1"/>
         <oasis:entry rowsep="1" colname="col2">Own land in Istanbul</oasis:entry>
         <oasis:entry rowsep="1" colname="col3">Are there any household members who own land in Istanbul? (yes/no)</oasis:entry>
       </oasis:row>
       <oasis:row>
         <oasis:entry colname="col1"/>
         <oasis:entry rowsep="1" colname="col2">Own house out of Istanbul</oasis:entry>
         <oasis:entry rowsep="1" colname="col3">Are there any household members who own a house outside Istanbul? (yes/no)</oasis:entry>
       </oasis:row>
       <oasis:row>
         <oasis:entry colname="col1"/>
         <oasis:entry rowsep="1" colname="col2">Own land out of Istanbul</oasis:entry>
         <oasis:entry rowsep="1" colname="col3">Are there any household members who own land outside Istanbul? (yes/no)</oasis:entry>
       </oasis:row>
       <oasis:row>
         <oasis:entry colname="col1"/>
         <oasis:entry rowsep="1" colname="col2">Saving</oasis:entry>
         <oasis:entry rowsep="1" colname="col3">Are there any household members who have savings to use for emergency situations? (yes/no)</oasis:entry>
       </oasis:row>
       <oasis:row>
         <oasis:entry colname="col1"/>
         <oasis:entry colname="col2">Debt</oasis:entry>
         <oasis:entry colname="col3">Are there any household members who have debt to banks (incl. credits, bank loans, etc.)? (yes/no)</oasis:entry>
       </oasis:row>
     </oasis:tbody>
   </oasis:tgroup></oasis:table><?xmltex \gdef\@currentlabel{2}?></table-wrap>

      <p id="d1e1170">Prior to model development, the predictors were prepared in terms of data representation, standardisation, and feature selection. As the predictors represent household characteristics, they were sought at the household level. As stated by <xref ref-type="bibr" rid="bib1.bibx5" id="text.65"/>, data representation is about enabling better interpretation of the relevant information. Therefore, the predictors which are measured at the household level, such as the number of women, men, <inline-formula><mml:math id="M14" display="inline"><mml:mrow><mml:mo>&lt;</mml:mo><mml:mn mathvariant="normal">5</mml:mn></mml:mrow></mml:math></inline-formula> year olds, <inline-formula><mml:math id="M15" display="inline"><mml:mrow><mml:mo>&gt;</mml:mo><mml:mn mathvariant="normal">65</mml:mn></mml:mrow></mml:math></inline-formula> year olds, and income earners were taken in proportion to the given household's size (HhS). Then, in order to make the variation of continuous variables comparable, these variables were standardised into the same scale with unit variance standardisation <xref ref-type="bibr" rid="bib1.bibx56" id="paren.66"/>. For the final step, we used feature selection prior to processing the data, and we identified the predictors with near-zero variance, as the predictors which take only one value may cause numerical problems during resampling <xref ref-type="bibr" rid="bib1.bibx67" id="paren.67"/>. The set of <inline-formula><mml:math id="M16" display="inline"><mml:mn mathvariant="normal">26</mml:mn></mml:math></inline-formula> variables used for model building is presented in Table <xref ref-type="table" rid="Ch1.T2"/>, along with their relevance in relation to the objectives of our study.</p>
</sec>
<sec id="Ch1.S3.SS5">
  <label>3.5</label><title>Machine learning methods</title>
      <?pagebreak page2140?><p id="d1e1220">We developed models for the classification of households in terms of their social vulnerability in the event of an earthquake using six supervised machine learning algorithms: classification and regression tree (CART), random forest (RF), artificial neural network (ANN), support vector machine (SVM), naïve Bayes (NB), and <inline-formula><mml:math id="M17" display="inline"><mml:mi>k</mml:mi></mml:math></inline-formula>-nearest neighbours (KNNs). The predictive performances of these ML models are compared to that of the logistic regression (LR) model, which is a traditional statistical technique used for binary classification. Supervised ML adopts an algorithm to learn the mapping function from the input variables to the output variable, and it is well suited to classification problems.  Models were developed using the variable set in Table <xref ref-type="table" rid="Ch1.T2"/> as the input variables, while a binary indicator of the social vulnerability level of each household was the output variable. We developed a prediction model using 90 % of the dataset to train the underlying algorithm, while 10 % was held back as independent testing data for evaluating the performance of the models. We note that these algorithms have different tuning parameters. For different tuning parameter alternatives, the choice of the optimal tuning parameter was determined by the largest area under the curve (AUC) value of the receiver operating characteristic (ROC) curve using the automated grid search. The details regarding the machine learning models and R software packages used for the analysis are provided in Sect. S3. The workflow for the model building is shown in Fig. <xref ref-type="fig" rid="Ch1.F1"/>.</p>

      <?xmltex \floatpos{t}?><fig id="Ch1.F1" specific-use="star"><?xmltex \currentcnt{1}?><?xmltex \def\figurename{Figure}?><label>Figure 1</label><caption><p id="d1e1236">Machine learning flowchart for data processing and model development.</p></caption>
          <?xmltex \igopts{width=341.433071pt}?><graphic xlink:href="https://nhess.copernicus.org/articles/23/2133/2023/nhess-23-2133-2023-f01.png"/>

        </fig>

</sec>
<sec id="Ch1.S3.SS6">
  <label>3.6</label><title>Data-level pre-processing</title>
<sec id="Ch1.S3.SS6.SSS1">
  <label>3.6.1</label><title>Resampling techniques</title>
      <p id="d1e1260">Repeated cross validation (RCV) and bootstrap resampling procedures were used to draw multiple subsamples from the original data to build machine learning models on the training data and to validate the models, in each instance, on the data that were excluded from the subsample. The tuning parameters were selected as <inline-formula><mml:math id="M18" display="inline"><mml:mn mathvariant="normal">5</mml:mn></mml:math></inline-formula>-fold, with <inline-formula><mml:math id="M19" display="inline"><mml:mn mathvariant="normal">4</mml:mn></mml:math></inline-formula> repetitions for repeated cross validation and 20 repetitions for bootstrap, resulting in the same amount of resampling. The number of resampling repetitions was kept low to diminish the computational time burden.</p>
</sec>
<sec id="Ch1.S3.SS6.SSS2">
  <label>3.6.2</label><title>Subsampling for the imbalanced class variables</title>
      <p id="d1e1285">A dataset is said to be imbalanced when the classification categories are not represented equally <xref ref-type="bibr" rid="bib1.bibx69" id="paren.68"/>. In our study, the social vulnerability dataset consists of imbalanced class variables, in which the “high SV” class has a lower frequency compared to the “low SV” class. The imbalance ratio of these two classes was approximately <inline-formula><mml:math id="M20" display="inline"><mml:mrow><mml:mn mathvariant="normal">1</mml:mn><mml:mo>/</mml:mo><mml:mn mathvariant="normal">5</mml:mn></mml:mrow></mml:math></inline-formula>. The main challenge of the imbalance problem in standard machine learning algorithms is that the minority classes can be overlooked and weighed down by the majority one <xref ref-type="bibr" rid="bib1.bibx93" id="paren.69"/>. In order to address this issue, we used various subsampling approaches during the data pre-processing steps as explained below.
<list list-type="custom"><list-item><label>i.</label>
      <p id="d1e1308"><italic>Random-majority under sampling (Under)</italic>. Under sampling randomly samples from the majority class and returns a subsample which has the same size as the minority class, thus ensuring the majority class prevalence is equal to that of minority one for subsequent modelling <xref ref-type="bibr" rid="bib1.bibx15" id="paren.70"/>. For instance, assume a binary class variable in which <inline-formula><mml:math id="M21" display="inline"><mml:mn mathvariant="normal">90</mml:mn></mml:math></inline-formula> % of training set samples belong to the majority class, while the remaining <inline-formula><mml:math id="M22" display="inline"><mml:mn mathvariant="normal">10</mml:mn></mml:math></inline-formula> % are in the minority class. Under sampling will randomly subsample from the majority class such that its prevalence is <inline-formula><mml:math id="M23" display="inline"><mml:mn mathvariant="normal">10</mml:mn></mml:math></inline-formula> %. As a result, only <inline-formula><mml:math id="M24" display="inline"><mml:mn mathvariant="normal">20</mml:mn></mml:math></inline-formula> % of the total training set will be used for the classification model. While balancing the class variable, however, in some cases this approach may remove many important or otherwise influential data points prior to modelling.</p></list-item><list-item><label>ii.</label>
      <p id="d1e1345"><italic>Over-sampling</italic>. Three different over-sampling strategies were applied.
<list list-type="bullet"><list-item>
      <p id="d1e1352"><italic>Random minority over-sampling (Over)</italic>. It aims to balance the distribution of the class variable by taking random replicates of the minority class <xref ref-type="bibr" rid="bib1.bibx15" id="paren.71"/>. Although it helps to improve the accuracy of classification in imbalanced datasets, it is prone to overfitting and computational problems when the dataset is large <xref ref-type="bibr" rid="bib1.bibx73" id="paren.72"/>.</p></list-item><list-item>
      <p id="d1e1364"><italic>Synthetic minority over-sampling technique (SMOTE)</italic>. It creates artificial minority examples by interpolating between randomly selected examples of the minority class and their nearest neighbours <xref ref-type="bibr" rid="bib1.bibx22" id="paren.73"/>. It attempts to avoid the overfitting problem by using new synthetic minority class examples instead of replicating minority samples.</p></list-item><list-item>
      <p id="d1e1373"><italic>Random over-sampling examples (ROSE)</italic>. It generates artificial balanced samples according to a smoothed bootstrap approach and aids in the phases of estimation and accuracy evaluation of a classification algorithm in the presence of an imbalanced class variable <xref ref-type="bibr" rid="bib1.bibx78" id="paren.74"/>.</p></list-item></list></p></list-item></list></p>
      <p id="d1e1381">The above procedures are independent of resampling methods such as repeated cross validation and bootstrap. On the other hand, these subsampling procedures can also be performed for the resampling techniques, so that subsampling is conducted inside of resampling. In this paper, when subsampling procedures are performed outside of resampling techniques it is referred to as “out sampling”, otherwise it is expressed as “in sampling”.</p>
      <p id="d1e1384">One could also consider creating a custom-made subsampling procedure. In this respect, we also apply the transformed version of SMOTE that use 10 nearest neighbours instead of the default of 5 by adopting a simple wrapper function, which we call the “SMOTEST”. Note that the SMOTEST function is only performed inside the resampling <xref ref-type="bibr" rid="bib1.bibx68" id="paren.75"/>.</p>
</sec>
</sec>
<sec id="Ch1.S3.SS7">
  <label>3.7</label><title>Statistical analysis and model performance assessment</title>
      <p id="d1e1399">The characteristics of the study population were summarised using descriptive statistics. Pearson's chi-square tests were used to compare categorical variables, and independent samples <inline-formula><mml:math id="M25" display="inline"><mml:mi>t</mml:mi></mml:math></inline-formula> tests or non-parametric Mann–Whitney <inline-formula><mml:math id="M26" display="inline"><mml:mi>U</mml:mi></mml:math></inline-formula> tests were used to compare continuous variables between the high and<?pagebreak page2141?> low SV groups depending on the data distribution. In studies with large sample sizes, in addition to <inline-formula><mml:math id="M27" display="inline"><mml:mi>p</mml:mi></mml:math></inline-formula> values, it is also relevant to provide effect sizes, as it can help decide whether the difference found is meaningful or not <xref ref-type="bibr" rid="bib1.bibx11" id="paren.76"/>. Thus, we have reported effect sizes in the univariate comparisons that measure the strength of the relationship between two variables along with the <inline-formula><mml:math id="M28" display="inline"><mml:mi>p</mml:mi></mml:math></inline-formula> values to assess whether the effect of a variable is real and large enough to be useful or not. Cohen's <inline-formula><mml:math id="M29" display="inline"><mml:mi>d</mml:mi></mml:math></inline-formula> statistic with sample size adjustment was used for normally distributed continuous variables, Cohen's <inline-formula><mml:math id="M30" display="inline"><mml:mi>r</mml:mi></mml:math></inline-formula> value, which is calculated by dividing the <inline-formula><mml:math id="M31" display="inline"><mml:mi>z</mml:mi></mml:math></inline-formula> value obtained from the Mann–Whitney test by the square root of the sample size, was used for non-normally distributed variables, and Cramér's <inline-formula><mml:math id="M32" display="inline"><mml:mi>V</mml:mi></mml:math></inline-formula> is used for categorical variables <xref ref-type="bibr" rid="bib1.bibx49" id="paren.77"/>.</p>
      <p id="d1e1465">For various machine learning applications, confusion matrices were generated.  Sensitivity, specificity, and accuracy with <inline-formula><mml:math id="M33" display="inline"><mml:mn mathvariant="normal">95</mml:mn></mml:math></inline-formula> % confidence intervals (CIs) were calculated for LR and each ML algorithms using different resampling and subsampling techniques. The models were fitted with two different resampling strategies and eight subsampling techniques. In addition, we fitted the models to the raw data without any subsampling, and thus we obtained results for <inline-formula><mml:math id="M34" display="inline"><mml:mn mathvariant="normal">18</mml:mn></mml:math></inline-formula> combinations of various sampling strategies for each ML algorithm.</p>
      <p id="d1e1482">In line with the objective of the study, we compared the methods in terms of their success in identifying the households with high social vulnerability, which is the minority class with a smaller prevalence in our study. Therefore, we used sensitivity <inline-formula><mml:math id="M35" display="inline"><mml:mrow><mml:mo>(</mml:mo><mml:mtext>true positives</mml:mtext><mml:mo>/</mml:mo><mml:mo>(</mml:mo><mml:mtext>true positives</mml:mtext><mml:mo>+</mml:mo><mml:mtext>false negatives</mml:mtext><mml:mo>)</mml:mo><mml:mo>)</mml:mo></mml:mrow></mml:math></inline-formula> as the primary measure for assessing the model performance. As an indication of model accuracy, we used balanced accuracy <inline-formula><mml:math id="M36" display="inline"><mml:mrow><mml:mo>(</mml:mo><mml:mo>(</mml:mo><mml:mtext>sensitivity</mml:mtext><mml:mo>+</mml:mo><mml:mtext>specificity</mml:mtext><mml:mo>)</mml:mo><mml:mo>/</mml:mo><mml:mn mathvariant="normal">2</mml:mn><mml:mo>)</mml:mo></mml:mrow></mml:math></inline-formula>, which performs better on imbalanced datasets. We identified the best-performing method as the one with the highest sensitivity and balanced accuracy, provided that the AUC of the ROC curve is greater than <inline-formula><mml:math id="M37" display="inline"><mml:mn mathvariant="normal">0.7</mml:mn></mml:math></inline-formula>, and the model could be considered acceptable to discriminate households with high SV from those with low SV <xref ref-type="bibr" rid="bib1.bibx59" id="paren.78"/>.</p>
      <?pagebreak page2142?><p id="d1e1543">The sensitivity and specificity of the best-performing method with those of other methods were compared with pairwise comparisons using McNemar's chi-square test <xref ref-type="bibr" rid="bib1.bibx64" id="paren.79"/>. In addition, AUC comparisons were performed using DeLong chi-square statistics <xref ref-type="bibr" rid="bib1.bibx33" id="paren.80"/>. Bonferroni adjustment was applied in these pairwise comparisons of ML methods, and <inline-formula><mml:math id="M38" display="inline"><mml:mrow><mml:mi mathvariant="italic">α</mml:mi><mml:mo>&lt;</mml:mo><mml:mn mathvariant="normal">0.05</mml:mn><mml:mo>/</mml:mo><mml:mn mathvariant="normal">7</mml:mn><mml:mo>=</mml:mo><mml:mn mathvariant="normal">0.007</mml:mn></mml:mrow></mml:math></inline-formula> was considered as an indication of a statistically significant difference in terms of performance metrics between two methods.</p>
</sec>
<sec id="Ch1.S3.SS8">
  <label>3.8</label><title>Variable importance analysis</title>
      <p id="d1e1580">As the final step of our analysis, the important variables of each model were assessed. Analysing variable importance is important in machine learning applications because it assists in the interpretation of the model. It can be performed in two ways: (1) by using a model-based approach which computes the contribution of the predictor variables to the model or (2) by evaluating the importance of predictors individually by conducting an ROC curve analysis for each predictor in turn <xref ref-type="bibr" rid="bib1.bibx67" id="paren.81"/>. How to choose which approach to use depends on which ML model was employed.</p>
      <p id="d1e1586">Logistic regression models rank the variables according to standardised coefficients. The regression coefficients of continuous variables are standardised by dividing each coefficient by a value twice its standard deviation, as explained in <xref ref-type="bibr" rid="bib1.bibx51" id="text.82"/>. The coefficients for factor variables are left unchanged. The relative importance of the independent variables for ANN models are computed by Garson weights <xref ref-type="bibr" rid="bib1.bibx50" id="paren.83"/>, which identify all weighted connections between the nodes of interest. In this context, the weights connecting the variables can be thought of as similar to coefficients in a regression model and are used to describe the relationships between outcome and predictor variables. In random forests, variable importance analysis is based on the prediction accuracy of the model. The average differences between the out-of-bag errors before and after permuting each predictor variable over all trees are calculated as an indication of the importance of a variable. The underlying idea is that a permutation of an important variable reduces the accuracy of the model more strongly than a permutation of an unimportant variable <xref ref-type="bibr" rid="bib1.bibx26" id="paren.84"/>. On the other hand, another tree-based method, CART, does not use the permutation technique for measuring variable importance, as it is trained on a single decision tree. Instead, CART depends on an impurity metric – which is often called the “Gini-index” – for determining the importance of a variable when the outcome is categorical <xref ref-type="bibr" rid="bib1.bibx66" id="paren.85"/>.</p>
      <p id="d1e1601">For classification models (e.g. NB, KNN, and SVM) there is no available model-specific variable importance metric. Rather, these models calculate the area under the ROC curve for each predictor variable, and this AUC statistic is considered as the measure of variable importance <xref ref-type="bibr" rid="bib1.bibx67" id="paren.86"/>.</p>
</sec>
<sec id="Ch1.S3.SS9">
  <label>3.9</label><title>Open-access R Shiny web application</title>
      <p id="d1e1616">An open-access R Shiny web application was created for visualising summary statistics and predictive performances of the LR and ML methods for the classification of households in terms of their social vulnerability level. Users are able to examine the distribution of the characteristics of the households with high and low social vulnerability, compare the performances of ML and subsampling methods based on user-defined evaluation criteria, assess variable importance rankings for each ML method, and obtain the area-based calculations of the variables on the Istanbul map. The R Shiny web application is freely available online and can be accessed at <uri>https://oyakalaycioglu.shinyapps.io/Social_Vulnerability/</uri> (last access: 13 June 2023). The components of this R Shiny application are presented in detail in Fig. <xref ref-type="fig" rid="Ch1.F2"/>. All analyses were performed in the statistical programming environment R version 4.0.3 <xref ref-type="bibr" rid="bib1.bibx94" id="paren.87"/>, and the machine learning model development was carried out using the R caret package <xref ref-type="bibr" rid="bib1.bibx67" id="paren.88"/>. The spatial distribution of the important predictors within the city scale was expressed via the 3.10 version of the QGIS software <xref ref-type="bibr" rid="bib1.bibx91" id="paren.89"/>.</p>

      <?xmltex \floatpos{t}?><fig id="Ch1.F2" specific-use="star"><?xmltex \currentcnt{2}?><?xmltex \def\figurename{Figure}?><label>Figure 2</label><caption><p id="d1e1635">The components of an open-access web application created in R Shiny interface (can be accessed from <uri>https://oyakalaycioglu.shinyapps.io/Social_Vulnerability/</uri>). The left side commands allow the user to choose which analysis to activate. <bold>(a)</bold> Summary statistics of the variables are visually compared across social vulnerability groups. Box plots and bar plots were used for continuous and categorical variables, respectively. <bold>(b)</bold> The performance metric is chosen by the user (<inline-formula><mml:math id="M39" display="inline"><mml:mi>y</mml:mi></mml:math></inline-formula> axis) in comparison to the subsampling method (<inline-formula><mml:math id="M40" display="inline"><mml:mi>x</mml:mi></mml:math></inline-formula> axis). The ML methods are displayed in different colours. Two separate plots are generated for RCV and bootstrap resampling techniques. <bold>(c)</bold> For the chosen subsampling method, LR and ML methods are compared in terms of the AUC of the ROC curve. Different coloured lines represent different methods. <bold>(d)</bold> For the chosen ML method and subsampling techniques, variable importance plots are displayed.</p></caption>
          <?xmltex \igopts{width=497.923228pt}?><graphic xlink:href="https://nhess.copernicus.org/articles/23/2133/2023/nhess-23-2133-2023-f02.jpg"/>

        </fig>

</sec>
</sec>
<sec id="Ch1.S4">
  <label>4</label><title>Results</title>
<sec id="Ch1.S4.SS1">
  <label>4.1</label><title>Descriptive statistics</title>
      <p id="d1e1690">The prevalence of households with high social vulnerability to a possible hazard in Istanbul was 7052 (17.2 %) among 41 093 households.  The median household size was <inline-formula><mml:math id="M41" display="inline"><mml:mn mathvariant="normal">3</mml:mn></mml:math></inline-formula>, with values ranging from 1 to <inline-formula><mml:math id="M42" display="inline"><mml:mn mathvariant="normal">14</mml:mn></mml:math></inline-formula> residents, and the median average age of the households varied between 8.8 to 85 years with the median being <inline-formula><mml:math id="M43" display="inline"><mml:mn mathvariant="normal">35.5</mml:mn></mml:math></inline-formula>. The median of the average education was <inline-formula><mml:math id="M44" display="inline"><mml:mn mathvariant="normal">8</mml:mn></mml:math></inline-formula> years (range: 0–17 years) in the entire survey sample, while it was <inline-formula><mml:math id="M45" display="inline"><mml:mn mathvariant="normal">8.8</mml:mn></mml:math></inline-formula> years (range: 0–17 years) in those households with low SV and <inline-formula><mml:math id="M46" display="inline"><mml:mn mathvariant="normal">6</mml:mn></mml:math></inline-formula> (range: 0–16.3 years) in those households with high SV. Additional comparisons between social vulnerability levels in terms of socio-demographic, health, and socioeconomic information are demonstrated in Table <xref ref-type="table" rid="Ch1.T3"/>. Households with high SV were often overcrowded, less educated, older, had a low number of income earners, had low levels of savings, and had less access to social security and health insurance compared to the low SV group. The statistically significant variable with the largest effect on social vulnerability was the average education of the household (Cohen's <inline-formula><mml:math id="M47" display="inline"><mml:mrow><mml:mi>d</mml:mi><mml:mo>=</mml:mo><mml:mn mathvariant="normal">0.947</mml:mn></mml:mrow></mml:math></inline-formula>), followed by the ratio of income earners (Cohen's <inline-formula><mml:math id="M48" display="inline"><mml:mrow><mml:mi>d</mml:mi><mml:mo>=</mml:mo><mml:mn mathvariant="normal">0.366</mml:mn></mml:mrow></mml:math></inline-formula>) and the ratio of over <inline-formula><mml:math id="M49" display="inline"><mml:mn mathvariant="normal">65</mml:mn></mml:math></inline-formula> year olds in the household (Cohen's <inline-formula><mml:math id="M50" display="inline"><mml:mrow><mml:mi>r</mml:mi><mml:mo>=</mml:mo><mml:mn mathvariant="normal">0.120</mml:mn></mml:mrow></mml:math></inline-formula>),  having social security (Cramér's <inline-formula><mml:math id="M51" display="inline"><mml:mrow><mml:mi>V</mml:mi><mml:mo>=</mml:mo><mml:mn mathvariant="normal">0.211</mml:mn></mml:mrow></mml:math></inline-formula>), having health security or insurance (Cramér's <inline-formula><mml:math id="M52" display="inline"><mml:mrow><mml:mi>V</mml:mi><mml:mo>=</mml:mo><mml:mn mathvariant="normal">0.226</mml:mn></mml:mrow></mml:math></inline-formula>), having natural gas heating at home (Cramér's <inline-formula><mml:math id="M53" display="inline"><mml:mrow><mml:mi>V</mml:mi><mml:mo>=</mml:mo><mml:mn mathvariant="normal">0.152</mml:mn></mml:mrow></mml:math></inline-formula>), the presence of anyone with a disability or who is elderly and needs care at home  (Cramér's <inline-formula><mml:math id="M54" display="inline"><mml:mrow><mml:mi>V</mml:mi><mml:mo>=</mml:mo><mml:mn mathvariant="normal">0.142</mml:mn></mml:mrow></mml:math></inline-formula>), and having savings for emergency situations (Cramér's <inline-formula><mml:math id="M55" display="inline"><mml:mrow><mml:mi>V</mml:mi><mml:mo>=</mml:mo><mml:mn mathvariant="normal">0.135</mml:mn></mml:mrow></mml:math></inline-formula>).</p>

<?xmltex \floatpos{p}?><table-wrap id="Ch1.T3" specific-use="star"><?xmltex \currentcnt{3}?><label>Table 3</label><caption><p id="d1e1845">Univariate analysis of the study population characteristics.</p></caption><oasis:table frame="topbot"><?xmltex \begin{scaleboxenv}{.95}[.95]?><oasis:tgroup cols="5">
     <oasis:colspec colnum="1" colname="col1" align="left"/>
     <oasis:colspec colnum="2" colname="col2" align="right"/>
     <oasis:colspec colnum="3" colname="col3" align="right"/>
     <oasis:colspec colnum="4" colname="col4" align="left"/>
     <oasis:colspec colnum="5" colname="col5" align="right"/>
     <oasis:thead>
       <oasis:row>
         <oasis:entry colname="col1"/>
         <oasis:entry rowsep="1" namest="col2" nameend="col3" align="center">Social vulnerability level </oasis:entry>
         <oasis:entry colname="col4">Effect size</oasis:entry>
         <oasis:entry colname="col5"/>
       </oasis:row>
       <oasis:row>
         <oasis:entry colname="col1">Variables</oasis:entry>
         <oasis:entry colname="col2">Low SV</oasis:entry>
         <oasis:entry colname="col3">High SV</oasis:entry>
         <oasis:entry colname="col4">(Cohen's <inline-formula><mml:math id="M63" display="inline"><mml:mrow><mml:msup><mml:mi>d</mml:mi><mml:mi mathvariant="normal">a</mml:mi></mml:msup></mml:mrow></mml:math></inline-formula> or</oasis:entry>
         <oasis:entry colname="col5"><inline-formula><mml:math id="M64" display="inline"><mml:mi>P</mml:mi></mml:math></inline-formula></oasis:entry>
       </oasis:row>
       <oasis:row>
         <oasis:entry colname="col1"/>
         <oasis:entry colname="col2">(<inline-formula><mml:math id="M65" display="inline"><mml:mi>n</mml:mi></mml:math></inline-formula> <inline-formula><mml:math id="M66" display="inline"><mml:mo>=</mml:mo></mml:math></inline-formula> 34 041)</oasis:entry>
         <oasis:entry colname="col3">(<inline-formula><mml:math id="M67" display="inline"><mml:mi>n</mml:mi></mml:math></inline-formula> <inline-formula><mml:math id="M68" display="inline"><mml:mo>=</mml:mo></mml:math></inline-formula> 7052)</oasis:entry>
         <oasis:entry colname="col4">Cohen's <inline-formula><mml:math id="M69" display="inline"><mml:mrow><mml:msup><mml:mi>r</mml:mi><mml:mi mathvariant="normal">b</mml:mi></mml:msup></mml:mrow></mml:math></inline-formula> or</oasis:entry>
         <oasis:entry colname="col5"/>
       </oasis:row>
       <oasis:row rowsep="1">
         <oasis:entry colname="col1"/>
         <oasis:entry colname="col2"/>
         <oasis:entry colname="col3"/>
         <oasis:entry colname="col4">Cramér's <inline-formula><mml:math id="M70" display="inline"><mml:mrow><mml:msup><mml:mi>V</mml:mi><mml:mi mathvariant="normal">b</mml:mi></mml:msup></mml:mrow></mml:math></inline-formula>)</oasis:entry>
         <oasis:entry colname="col5"/>
       </oasis:row>
     </oasis:thead>
     <oasis:tbody>
       <oasis:row rowsep="1">
         <oasis:entry colname="col1">Socio-demographics</oasis:entry>
         <oasis:entry colname="col2"/>
         <oasis:entry colname="col3"/>
         <oasis:entry colname="col4"/>
         <oasis:entry colname="col5"/>
       </oasis:row>
       <oasis:row>
         <oasis:entry colname="col1">Household Size (HhS)</oasis:entry>
         <oasis:entry colname="col2"/>
         <oasis:entry colname="col3"/>
         <oasis:entry colname="col4"><inline-formula><mml:math id="M71" display="inline"><mml:mrow><mml:mi>d</mml:mi><mml:mo>=</mml:mo><mml:mn mathvariant="normal">0.178</mml:mn></mml:mrow></mml:math></inline-formula></oasis:entry>
         <oasis:entry colname="col5"><inline-formula><mml:math id="M72" display="inline"><mml:mrow><mml:mo>&lt;</mml:mo><mml:mn mathvariant="normal">0.001</mml:mn></mml:mrow></mml:math></inline-formula></oasis:entry>
       </oasis:row>
       <oasis:row>
         <oasis:entry colname="col1"> Mean <inline-formula><mml:math id="M73" display="inline"><mml:mo>±</mml:mo></mml:math></inline-formula> SD</oasis:entry>
         <oasis:entry colname="col2"><inline-formula><mml:math id="M74" display="inline"><mml:mrow><mml:mn mathvariant="normal">3.28</mml:mn><mml:mo>±</mml:mo><mml:mn mathvariant="normal">1.40</mml:mn></mml:mrow></mml:math></inline-formula></oasis:entry>
         <oasis:entry colname="col3"><inline-formula><mml:math id="M75" display="inline"><mml:mrow><mml:mn mathvariant="normal">3.54</mml:mn><mml:mo>±</mml:mo><mml:mn mathvariant="normal">1.72</mml:mn></mml:mrow></mml:math></inline-formula></oasis:entry>
         <oasis:entry colname="col4"/>
         <oasis:entry colname="col5"/>
       </oasis:row>
       <oasis:row>
         <oasis:entry colname="col1"> Median (min–max)</oasis:entry>
         <oasis:entry colname="col2">3 (1–13)</oasis:entry>
         <oasis:entry colname="col3">3 (1–14)</oasis:entry>
         <oasis:entry colname="col4"/>
         <oasis:entry colname="col5"/>
       </oasis:row>
       <oasis:row>
         <oasis:entry colname="col1">Average education (years)</oasis:entry>
         <oasis:entry colname="col2"/>
         <oasis:entry colname="col3"/>
         <oasis:entry colname="col4"><inline-formula><mml:math id="M76" display="inline"><mml:mrow><mml:mi>d</mml:mi><mml:mo>=</mml:mo><mml:mn mathvariant="normal">0.947</mml:mn></mml:mrow></mml:math></inline-formula></oasis:entry>
         <oasis:entry colname="col5"><inline-formula><mml:math id="M77" display="inline"><mml:mrow><mml:mo>&lt;</mml:mo><mml:mn mathvariant="normal">0.001</mml:mn></mml:mrow></mml:math></inline-formula></oasis:entry>
       </oasis:row>
       <oasis:row>
         <oasis:entry colname="col1"> Mean <inline-formula><mml:math id="M78" display="inline"><mml:mo>±</mml:mo></mml:math></inline-formula> SD</oasis:entry>
         <oasis:entry colname="col2"><inline-formula><mml:math id="M79" display="inline"><mml:mrow><mml:mn mathvariant="normal">9.11</mml:mn><mml:mo>±</mml:mo><mml:mn mathvariant="normal">3.22</mml:mn></mml:mrow></mml:math></inline-formula></oasis:entry>
         <oasis:entry colname="col3"><inline-formula><mml:math id="M80" display="inline"><mml:mrow><mml:mn mathvariant="normal">6.11</mml:mn><mml:mo>±</mml:mo><mml:mn mathvariant="normal">2.9</mml:mn></mml:mrow></mml:math></inline-formula></oasis:entry>
         <oasis:entry colname="col4"/>
         <oasis:entry colname="col5"/>
       </oasis:row>
       <oasis:row>
         <oasis:entry colname="col1"> Median (min–max)</oasis:entry>
         <oasis:entry colname="col2">8.8 (0–17)</oasis:entry>
         <oasis:entry colname="col3">6 (0–16.3)</oasis:entry>
         <oasis:entry colname="col4"/>
         <oasis:entry colname="col5"/>
       </oasis:row>
       <oasis:row>
         <oasis:entry colname="col1">Average age of the HH</oasis:entry>
         <oasis:entry colname="col2"/>
         <oasis:entry colname="col3"/>
         <oasis:entry colname="col4"><inline-formula><mml:math id="M81" display="inline"><mml:mrow><mml:mi>d</mml:mi><mml:mo>=</mml:mo><mml:mn mathvariant="normal">0.107</mml:mn></mml:mrow></mml:math></inline-formula></oasis:entry>
         <oasis:entry colname="col5"><inline-formula><mml:math id="M82" display="inline"><mml:mrow><mml:mo>&lt;</mml:mo><mml:mn mathvariant="normal">0.001</mml:mn></mml:mrow></mml:math></inline-formula></oasis:entry>
       </oasis:row>
       <oasis:row>
         <oasis:entry colname="col1"> Mean <inline-formula><mml:math id="M83" display="inline"><mml:mo>±</mml:mo></mml:math></inline-formula> SD</oasis:entry>
         <oasis:entry colname="col2"><inline-formula><mml:math id="M84" display="inline"><mml:mrow><mml:mn mathvariant="normal">38.28</mml:mn><mml:mo>±</mml:mo><mml:mn mathvariant="normal">14.49</mml:mn></mml:mrow></mml:math></inline-formula></oasis:entry>
         <oasis:entry colname="col3"><inline-formula><mml:math id="M85" display="inline"><mml:mrow><mml:mn mathvariant="normal">39.87</mml:mn><mml:mo>±</mml:mo><mml:mn mathvariant="normal">16.65</mml:mn></mml:mrow></mml:math></inline-formula></oasis:entry>
         <oasis:entry colname="col4"/>
         <oasis:entry colname="col5"/>
       </oasis:row>
       <oasis:row>
         <oasis:entry colname="col1"> Median (min–max)</oasis:entry>
         <oasis:entry colname="col2">35.5 (10.3–85.0)</oasis:entry>
         <oasis:entry colname="col3">36.4 (8.8–84.0)</oasis:entry>
         <oasis:entry colname="col4"/>
         <oasis:entry colname="col5"/>
       </oasis:row>
       <oasis:row>
         <oasis:entry colname="col1">No. of women/HhS</oasis:entry>
         <oasis:entry colname="col2"/>
         <oasis:entry colname="col3"/>
         <oasis:entry colname="col4"><inline-formula><mml:math id="M86" display="inline"><mml:mrow><mml:mi>d</mml:mi><mml:mo>=</mml:mo><mml:mn mathvariant="normal">0.130</mml:mn></mml:mrow></mml:math></inline-formula></oasis:entry>
         <oasis:entry colname="col5"><inline-formula><mml:math id="M87" display="inline"><mml:mrow><mml:mo>&lt;</mml:mo><mml:mn mathvariant="normal">0.001</mml:mn></mml:mrow></mml:math></inline-formula></oasis:entry>
       </oasis:row>
       <oasis:row>
         <oasis:entry colname="col1"> Mean <inline-formula><mml:math id="M88" display="inline"><mml:mo>±</mml:mo></mml:math></inline-formula> SD</oasis:entry>
         <oasis:entry colname="col2"><inline-formula><mml:math id="M89" display="inline"><mml:mrow><mml:mn mathvariant="normal">0.48</mml:mn><mml:mo>±</mml:mo><mml:mn mathvariant="normal">0.23</mml:mn></mml:mrow></mml:math></inline-formula></oasis:entry>
         <oasis:entry colname="col3"><inline-formula><mml:math id="M90" display="inline"><mml:mrow><mml:mn mathvariant="normal">0.51</mml:mn><mml:mo>±</mml:mo><mml:mn mathvariant="normal">0.23</mml:mn></mml:mrow></mml:math></inline-formula></oasis:entry>
         <oasis:entry colname="col4"/>
         <oasis:entry colname="col5"/>
       </oasis:row>
       <oasis:row>
         <oasis:entry colname="col1"> Median (min–max)</oasis:entry>
         <oasis:entry colname="col2">0.5 (0–1)</oasis:entry>
         <oasis:entry colname="col3">0.5 (0–1</oasis:entry>
         <oasis:entry colname="col4"/>
         <oasis:entry colname="col5"/>
       </oasis:row>
       <oasis:row>
         <oasis:entry colname="col1">No. of men/HhS</oasis:entry>
         <oasis:entry colname="col2"/>
         <oasis:entry colname="col3"/>
         <oasis:entry colname="col4"><inline-formula><mml:math id="M91" display="inline"><mml:mrow><mml:mi>d</mml:mi><mml:mo>=</mml:mo><mml:mn mathvariant="normal">0.130</mml:mn></mml:mrow></mml:math></inline-formula></oasis:entry>
         <oasis:entry colname="col5"><inline-formula><mml:math id="M92" display="inline"><mml:mrow><mml:mo>&lt;</mml:mo><mml:mn mathvariant="normal">0.001</mml:mn></mml:mrow></mml:math></inline-formula></oasis:entry>
       </oasis:row>
       <oasis:row>
         <oasis:entry colname="col1"> Mean <inline-formula><mml:math id="M93" display="inline"><mml:mo>±</mml:mo></mml:math></inline-formula> SD</oasis:entry>
         <oasis:entry colname="col2"><inline-formula><mml:math id="M94" display="inline"><mml:mrow><mml:mn mathvariant="normal">0.52</mml:mn><mml:mo>±</mml:mo><mml:mn mathvariant="normal">0.23</mml:mn></mml:mrow></mml:math></inline-formula></oasis:entry>
         <oasis:entry colname="col3"><inline-formula><mml:math id="M95" display="inline"><mml:mrow><mml:mn mathvariant="normal">0.49</mml:mn><mml:mo>±</mml:mo><mml:mn mathvariant="normal">0.23</mml:mn></mml:mrow></mml:math></inline-formula></oasis:entry>
         <oasis:entry colname="col4"/>
         <oasis:entry colname="col5"/>
       </oasis:row>
       <oasis:row>
         <oasis:entry colname="col1"> Median (min–max)</oasis:entry>
         <oasis:entry colname="col2">0.5 (0–1)</oasis:entry>
         <oasis:entry colname="col3">0.5 (0–1)</oasis:entry>
         <oasis:entry colname="col4"/>
         <oasis:entry colname="col5"/>
       </oasis:row>
       <oasis:row>
         <oasis:entry colname="col1">No. of <inline-formula><mml:math id="M96" display="inline"><mml:mrow><mml:mo>&lt;</mml:mo><mml:mn mathvariant="normal">5</mml:mn></mml:mrow></mml:math></inline-formula>-year-old children/HhS</oasis:entry>
         <oasis:entry colname="col2"/>
         <oasis:entry colname="col3"/>
         <oasis:entry colname="col4"><inline-formula><mml:math id="M97" display="inline"><mml:mrow><mml:mi>r</mml:mi><mml:mo>=</mml:mo><mml:mn mathvariant="normal">0.130</mml:mn></mml:mrow></mml:math></inline-formula></oasis:entry>
         <oasis:entry colname="col5"><inline-formula><mml:math id="M98" display="inline"><mml:mrow><mml:mo>&lt;</mml:mo><mml:mn mathvariant="normal">0.001</mml:mn></mml:mrow></mml:math></inline-formula></oasis:entry>
       </oasis:row>
       <oasis:row>
         <oasis:entry colname="col1"> Mean <inline-formula><mml:math id="M99" display="inline"><mml:mo>±</mml:mo></mml:math></inline-formula> SD</oasis:entry>
         <oasis:entry colname="col2"><inline-formula><mml:math id="M100" display="inline"><mml:mrow><mml:mn mathvariant="normal">0.037</mml:mn><mml:mo>±</mml:mo><mml:mn mathvariant="normal">0.099</mml:mn></mml:mrow></mml:math></inline-formula></oasis:entry>
         <oasis:entry colname="col3"><inline-formula><mml:math id="M101" display="inline"><mml:mrow><mml:mn mathvariant="normal">0.039</mml:mn><mml:mo>±</mml:mo><mml:mn mathvariant="normal">0.088</mml:mn></mml:mrow></mml:math></inline-formula></oasis:entry>
         <oasis:entry colname="col4"/>
         <oasis:entry colname="col5"/>
       </oasis:row>
       <oasis:row>
         <oasis:entry colname="col1"> Median (min–max)</oasis:entry>
         <oasis:entry colname="col2">0 (0–0.7)</oasis:entry>
         <oasis:entry colname="col3">0 (0–0.7)</oasis:entry>
         <oasis:entry colname="col4"/>
         <oasis:entry colname="col5"/>
       </oasis:row>
       <oasis:row>
         <oasis:entry colname="col1">No. of <inline-formula><mml:math id="M102" display="inline"><mml:mrow><mml:mo>&gt;</mml:mo><mml:mn mathvariant="normal">65</mml:mn></mml:mrow></mml:math></inline-formula>-year-old individuals/HhS</oasis:entry>
         <oasis:entry colname="col2"/>
         <oasis:entry colname="col3"/>
         <oasis:entry colname="col4"><inline-formula><mml:math id="M103" display="inline"><mml:mrow><mml:mi>r</mml:mi><mml:mo>=</mml:mo><mml:mn mathvariant="normal">0.120</mml:mn></mml:mrow></mml:math></inline-formula></oasis:entry>
         <oasis:entry colname="col5"><inline-formula><mml:math id="M104" display="inline"><mml:mrow><mml:mo>&lt;</mml:mo><mml:mn mathvariant="normal">0.001</mml:mn></mml:mrow></mml:math></inline-formula></oasis:entry>
       </oasis:row>
       <oasis:row>
         <oasis:entry colname="col1"> Mean <inline-formula><mml:math id="M105" display="inline"><mml:mo>±</mml:mo></mml:math></inline-formula> SD</oasis:entry>
         <oasis:entry colname="col2"><inline-formula><mml:math id="M106" display="inline"><mml:mrow><mml:mn mathvariant="normal">0.09</mml:mn><mml:mo>±</mml:mo><mml:mn mathvariant="normal">0.24</mml:mn></mml:mrow></mml:math></inline-formula></oasis:entry>
         <oasis:entry colname="col3"><inline-formula><mml:math id="M107" display="inline"><mml:mrow><mml:mn mathvariant="normal">0.15</mml:mn><mml:mo>±</mml:mo><mml:mn mathvariant="normal">0.30</mml:mn></mml:mrow></mml:math></inline-formula></oasis:entry>
         <oasis:entry colname="col4"/>
         <oasis:entry colname="col5"/>
       </oasis:row>
       <oasis:row>
         <oasis:entry colname="col1"> Median (min–max)</oasis:entry>
         <oasis:entry colname="col2">0 (0–1)</oasis:entry>
         <oasis:entry colname="col3">0 (0–01)</oasis:entry>
         <oasis:entry colname="col4"/>
         <oasis:entry colname="col5"/>
       </oasis:row>
       <oasis:row>
         <oasis:entry colname="col1">Number of income earners/HhS</oasis:entry>
         <oasis:entry colname="col2"/>
         <oasis:entry colname="col3"/>
         <oasis:entry colname="col4"><inline-formula><mml:math id="M108" display="inline"><mml:mrow><mml:mi>d</mml:mi><mml:mo>=</mml:mo><mml:mn mathvariant="normal">0.366</mml:mn></mml:mrow></mml:math></inline-formula></oasis:entry>
         <oasis:entry colname="col5"><inline-formula><mml:math id="M109" display="inline"><mml:mrow><mml:mo>&lt;</mml:mo><mml:mn mathvariant="normal">0.001</mml:mn></mml:mrow></mml:math></inline-formula></oasis:entry>
       </oasis:row>
       <oasis:row>
         <oasis:entry colname="col1"> Mean <inline-formula><mml:math id="M110" display="inline"><mml:mo>±</mml:mo></mml:math></inline-formula> SD</oasis:entry>
         <oasis:entry colname="col2"><inline-formula><mml:math id="M111" display="inline"><mml:mrow><mml:mn mathvariant="normal">0.53</mml:mn><mml:mo>±</mml:mo><mml:mn mathvariant="normal">0.28</mml:mn></mml:mrow></mml:math></inline-formula></oasis:entry>
         <oasis:entry colname="col3"><inline-formula><mml:math id="M112" display="inline"><mml:mrow><mml:mn mathvariant="normal">0.43</mml:mn><mml:mo>±</mml:mo><mml:mn mathvariant="normal">0.24</mml:mn></mml:mrow></mml:math></inline-formula></oasis:entry>
         <oasis:entry colname="col4"/>
         <oasis:entry colname="col5"/>
       </oasis:row>
       <oasis:row>
         <oasis:entry colname="col1"> Median (min–max)</oasis:entry>
         <oasis:entry colname="col2">0.5 (0–2)</oasis:entry>
         <oasis:entry colname="col3">0.3 (0–2)</oasis:entry>
         <oasis:entry colname="col4"/>
         <oasis:entry colname="col5"/>
       </oasis:row>
       <oasis:row>
         <oasis:entry colname="col1">Social security, <inline-formula><mml:math id="M113" display="inline"><mml:mi>n</mml:mi></mml:math></inline-formula> (%)</oasis:entry>
         <oasis:entry colname="col2">30 956 (90.9)</oasis:entry>
         <oasis:entry colname="col3">5118 (72.6)</oasis:entry>
         <oasis:entry colname="col4"><inline-formula><mml:math id="M114" display="inline"><mml:mrow><mml:mi>V</mml:mi><mml:mo>=</mml:mo><mml:mn mathvariant="normal">0.211</mml:mn></mml:mrow></mml:math></inline-formula></oasis:entry>
         <oasis:entry colname="col5"><inline-formula><mml:math id="M115" display="inline"><mml:mrow><mml:mo>&lt;</mml:mo><mml:mn mathvariant="normal">0.001</mml:mn></mml:mrow></mml:math></inline-formula></oasis:entry>
       </oasis:row>
       <oasis:row rowsep="1">
         <oasis:entry colname="col1">Membership to a non-governmental organisation, <inline-formula><mml:math id="M116" display="inline"><mml:mi>n</mml:mi></mml:math></inline-formula> (%)</oasis:entry>
         <oasis:entry colname="col2">872 (2.6)</oasis:entry>
         <oasis:entry colname="col3">70 (1.0)</oasis:entry>
         <oasis:entry colname="col4"><inline-formula><mml:math id="M117" display="inline"><mml:mrow><mml:mi>V</mml:mi><mml:mo>=</mml:mo><mml:mn mathvariant="normal">0.040</mml:mn></mml:mrow></mml:math></inline-formula></oasis:entry>
         <oasis:entry colname="col5"><inline-formula><mml:math id="M118" display="inline"><mml:mrow><mml:mo>&lt;</mml:mo><mml:mn mathvariant="normal">0.001</mml:mn></mml:mrow></mml:math></inline-formula></oasis:entry>
       </oasis:row>
       <oasis:row rowsep="1">
         <oasis:entry colname="col1">Health</oasis:entry>
         <oasis:entry colname="col2"/>
         <oasis:entry colname="col3"/>
         <oasis:entry colname="col4"/>
         <oasis:entry colname="col5"/>
       </oasis:row>
       <oasis:row>
         <oasis:entry colname="col1">Health insurance, <inline-formula><mml:math id="M119" display="inline"><mml:mi>n</mml:mi></mml:math></inline-formula> (%)</oasis:entry>
         <oasis:entry colname="col2">33 563 (99.9)</oasis:entry>
         <oasis:entry colname="col3">6206 (88.0)</oasis:entry>
         <oasis:entry colname="col4"><inline-formula><mml:math id="M120" display="inline"><mml:mrow><mml:mi>V</mml:mi><mml:mo>=</mml:mo><mml:mn mathvariant="normal">0.226</mml:mn></mml:mrow></mml:math></inline-formula></oasis:entry>
         <oasis:entry colname="col5"><inline-formula><mml:math id="M121" display="inline"><mml:mrow><mml:mo>&lt;</mml:mo><mml:mn mathvariant="normal">0.001</mml:mn></mml:mrow></mml:math></inline-formula></oasis:entry>
       </oasis:row>
       <oasis:row>
         <oasis:entry colname="col1">Any disabled or elderly who needs care in the Hh, <inline-formula><mml:math id="M122" display="inline"><mml:mi>n</mml:mi></mml:math></inline-formula> (%)</oasis:entry>
         <oasis:entry colname="col2">1112 (3.3)</oasis:entry>
         <oasis:entry colname="col3">789 (11.2)</oasis:entry>
         <oasis:entry colname="col4"><inline-formula><mml:math id="M123" display="inline"><mml:mrow><mml:mi>V</mml:mi><mml:mo>=</mml:mo><mml:mn mathvariant="normal">0.142</mml:mn></mml:mrow></mml:math></inline-formula></oasis:entry>
         <oasis:entry colname="col5"><inline-formula><mml:math id="M124" display="inline"><mml:mrow><mml:mo>&lt;</mml:mo><mml:mn mathvariant="normal">0.001</mml:mn></mml:mrow></mml:math></inline-formula></oasis:entry>
       </oasis:row>
       <oasis:row rowsep="1">
         <oasis:entry colname="col1">Health access, <inline-formula><mml:math id="M125" display="inline"><mml:mi>n</mml:mi></mml:math></inline-formula> (%)</oasis:entry>
         <oasis:entry colname="col2">28 309 (83.2)</oasis:entry>
         <oasis:entry colname="col3">5682 (80.6)</oasis:entry>
         <oasis:entry colname="col4"><inline-formula><mml:math id="M126" display="inline"><mml:mrow><mml:mi>V</mml:mi><mml:mo>=</mml:mo><mml:mn mathvariant="normal">0.026</mml:mn></mml:mrow></mml:math></inline-formula></oasis:entry>
         <oasis:entry colname="col5"><inline-formula><mml:math id="M127" display="inline"><mml:mrow><mml:mo>&lt;</mml:mo><mml:mn mathvariant="normal">0.001</mml:mn></mml:mrow></mml:math></inline-formula></oasis:entry>
       </oasis:row>
       <oasis:row rowsep="1">
         <oasis:entry colname="col1">Socio-economic</oasis:entry>
         <oasis:entry colname="col2"/>
         <oasis:entry colname="col3"/>
         <oasis:entry colname="col4"/>
         <oasis:entry colname="col5"/>
       </oasis:row>
       <oasis:row>
         <oasis:entry colname="col1">Regular salary income, <inline-formula><mml:math id="M128" display="inline"><mml:mi>n</mml:mi></mml:math></inline-formula> (%)</oasis:entry>
         <oasis:entry colname="col2">27 342 (80.3)</oasis:entry>
         <oasis:entry colname="col3">4899 (69.5)</oasis:entry>
         <oasis:entry colname="col4"><inline-formula><mml:math id="M129" display="inline"><mml:mrow><mml:mi>V</mml:mi><mml:mo>=</mml:mo><mml:mn mathvariant="normal">0.100</mml:mn></mml:mrow></mml:math></inline-formula></oasis:entry>
         <oasis:entry colname="col5"><inline-formula><mml:math id="M130" display="inline"><mml:mrow><mml:mo>&lt;</mml:mo><mml:mn mathvariant="normal">0.001</mml:mn></mml:mrow></mml:math></inline-formula></oasis:entry>
       </oasis:row>
       <oasis:row>
         <oasis:entry colname="col1">Pension income, <inline-formula><mml:math id="M131" display="inline"><mml:mi>n</mml:mi></mml:math></inline-formula> (%)</oasis:entry>
         <oasis:entry colname="col2">11 283 (33.1)</oasis:entry>
         <oasis:entry colname="col3">2320 (32.9)</oasis:entry>
         <oasis:entry colname="col4"><inline-formula><mml:math id="M132" display="inline"><mml:mrow><mml:mi>V</mml:mi><mml:mo>=</mml:mo><mml:mn mathvariant="normal">0.002</mml:mn></mml:mrow></mml:math></inline-formula></oasis:entry>
         <oasis:entry colname="col5"><inline-formula><mml:math id="M133" display="inline"><mml:mn mathvariant="normal">0.668</mml:mn></mml:math></inline-formula></oasis:entry>
       </oasis:row>
       <oasis:row>
         <oasis:entry colname="col1">Rent income, <inline-formula><mml:math id="M134" display="inline"><mml:mi>n</mml:mi></mml:math></inline-formula> (%)</oasis:entry>
         <oasis:entry colname="col2">1794 (5.3)</oasis:entry>
         <oasis:entry colname="col3">180 (2.6)</oasis:entry>
         <oasis:entry colname="col4"><inline-formula><mml:math id="M135" display="inline"><mml:mrow><mml:mi>V</mml:mi><mml:mo>=</mml:mo><mml:mn mathvariant="normal">0.048</mml:mn></mml:mrow></mml:math></inline-formula></oasis:entry>
         <oasis:entry colname="col5"><inline-formula><mml:math id="M136" display="inline"><mml:mrow><mml:mo>&lt;</mml:mo><mml:mn mathvariant="normal">0.001</mml:mn></mml:mrow></mml:math></inline-formula></oasis:entry>
       </oasis:row>
       <oasis:row>
         <oasis:entry colname="col1">Income support from public authorities, <inline-formula><mml:math id="M137" display="inline"><mml:mi>n</mml:mi></mml:math></inline-formula> (%)</oasis:entry>
         <oasis:entry colname="col2">646 (1.9)</oasis:entry>
         <oasis:entry colname="col3">470 (6.7)</oasis:entry>
         <oasis:entry colname="col4"><inline-formula><mml:math id="M138" display="inline"><mml:mrow><mml:mi>V</mml:mi><mml:mo>=</mml:mo><mml:mn mathvariant="normal">0.111</mml:mn></mml:mrow></mml:math></inline-formula></oasis:entry>
         <oasis:entry colname="col5"><inline-formula><mml:math id="M139" display="inline"><mml:mrow><mml:mo>&lt;</mml:mo><mml:mn mathvariant="normal">0.001</mml:mn></mml:mrow></mml:math></inline-formula></oasis:entry>
       </oasis:row>
       <oasis:row>
         <oasis:entry colname="col1">Job insecurity in Hh, <inline-formula><mml:math id="M140" display="inline"><mml:mi>n</mml:mi></mml:math></inline-formula> (%)</oasis:entry>
         <oasis:entry colname="col2">11 808 (34.7)</oasis:entry>
         <oasis:entry colname="col3">2790 (39.6)</oasis:entry>
         <oasis:entry colname="col4"><inline-formula><mml:math id="M141" display="inline"><mml:mrow><mml:mi>V</mml:mi><mml:mo>=</mml:mo><mml:mn mathvariant="normal">0.038</mml:mn></mml:mrow></mml:math></inline-formula></oasis:entry>
         <oasis:entry colname="col5"><inline-formula><mml:math id="M142" display="inline"><mml:mrow><mml:mo>&lt;</mml:mo><mml:mn mathvariant="normal">0.001</mml:mn></mml:mrow></mml:math></inline-formula></oasis:entry>
       </oasis:row>
       <oasis:row>
         <oasis:entry colname="col1">Ownership of the house of residence, <inline-formula><mml:math id="M143" display="inline"><mml:mi>n</mml:mi></mml:math></inline-formula> (%)</oasis:entry>
         <oasis:entry colname="col2">22 105 (64.9)</oasis:entry>
         <oasis:entry colname="col3">4057 (57.5)</oasis:entry>
         <oasis:entry colname="col4"><inline-formula><mml:math id="M144" display="inline"><mml:mrow><mml:mi>V</mml:mi><mml:mo>=</mml:mo><mml:mn mathvariant="normal">0.058</mml:mn></mml:mrow></mml:math></inline-formula></oasis:entry>
         <oasis:entry colname="col5"><inline-formula><mml:math id="M145" display="inline"><mml:mrow><mml:mo>&lt;</mml:mo><mml:mn mathvariant="normal">0.001</mml:mn></mml:mrow></mml:math></inline-formula></oasis:entry>
       </oasis:row>
       <oasis:row>
         <oasis:entry colname="col1">Status of the house of residence, <inline-formula><mml:math id="M146" display="inline"><mml:mi>n</mml:mi></mml:math></inline-formula> (%)</oasis:entry>
         <oasis:entry colname="col2"/>
         <oasis:entry colname="col3"/>
         <oasis:entry colname="col4"><inline-formula><mml:math id="M147" display="inline"><mml:mrow><mml:mi>V</mml:mi><mml:mo>=</mml:mo><mml:mn mathvariant="normal">0.087</mml:mn></mml:mrow></mml:math></inline-formula></oasis:entry>
         <oasis:entry colname="col5"><inline-formula><mml:math id="M148" display="inline"><mml:mrow><mml:mo>&lt;</mml:mo><mml:mn mathvariant="normal">0.001</mml:mn></mml:mrow></mml:math></inline-formula></oasis:entry>
       </oasis:row>
       <oasis:row>
         <oasis:entry colname="col1"> Apartment flat</oasis:entry>
         <oasis:entry colname="col2">30 453 (89.5)</oasis:entry>
         <oasis:entry colname="col3">5797 (82.2)</oasis:entry>
         <oasis:entry colname="col4"/>
         <oasis:entry colname="col5"/>
       </oasis:row>
       <oasis:row>
         <oasis:entry colname="col1"> Squatter house</oasis:entry>
         <oasis:entry colname="col2">912 (2.7)</oasis:entry>
         <oasis:entry colname="col3">379 (5.4)</oasis:entry>
         <oasis:entry colname="col4"/>
         <oasis:entry colname="col5"/>
       </oasis:row>
       <oasis:row>
         <oasis:entry colname="col1"> Detached/semi-detached house</oasis:entry>
         <oasis:entry colname="col2">2578 (7.6)</oasis:entry>
         <oasis:entry colname="col3">851 (12.1)</oasis:entry>
         <oasis:entry colname="col4"/>
         <oasis:entry colname="col5"/>
       </oasis:row>
       <oasis:row>
         <oasis:entry colname="col1"> Gate keepers lodge</oasis:entry>
         <oasis:entry colname="col2">98 (0.3)</oasis:entry>
         <oasis:entry colname="col3">25 (0.4)</oasis:entry>
         <oasis:entry colname="col4"/>
         <oasis:entry colname="col5"/>
       </oasis:row>
       <oasis:row>
         <oasis:entry colname="col1">Natural gas heating at home, <inline-formula><mml:math id="M149" display="inline"><mml:mi>n</mml:mi></mml:math></inline-formula> (%)</oasis:entry>
         <oasis:entry colname="col2">31 164 (91.5)</oasis:entry>
         <oasis:entry colname="col3">5580 (79.1)</oasis:entry>
         <oasis:entry colname="col4"><inline-formula><mml:math id="M150" display="inline"><mml:mrow><mml:mi>V</mml:mi><mml:mo>=</mml:mo><mml:mn mathvariant="normal">0.152</mml:mn></mml:mrow></mml:math></inline-formula></oasis:entry>
         <oasis:entry colname="col5"><inline-formula><mml:math id="M151" display="inline"><mml:mrow><mml:mo>&lt;</mml:mo><mml:mn mathvariant="normal">0.001</mml:mn></mml:mrow></mml:math></inline-formula></oasis:entry>
       </oasis:row>
       <oasis:row>
         <oasis:entry colname="col1">Ownership of any other house in Istanbul, <inline-formula><mml:math id="M152" display="inline"><mml:mi>n</mml:mi></mml:math></inline-formula> (%)</oasis:entry>
         <oasis:entry colname="col2">5667 (16.6)</oasis:entry>
         <oasis:entry colname="col3">585 (8.3)</oasis:entry>
         <oasis:entry colname="col4"><inline-formula><mml:math id="M153" display="inline"><mml:mrow><mml:mi>V</mml:mi><mml:mo>=</mml:mo><mml:mn mathvariant="normal">0.088</mml:mn></mml:mrow></mml:math></inline-formula></oasis:entry>
         <oasis:entry colname="col5"><inline-formula><mml:math id="M154" display="inline"><mml:mrow><mml:mo>&lt;</mml:mo><mml:mn mathvariant="normal">0.001</mml:mn></mml:mrow></mml:math></inline-formula></oasis:entry>
       </oasis:row>
       <oasis:row>
         <oasis:entry colname="col1">Land ownership in Istanbul, <inline-formula><mml:math id="M155" display="inline"><mml:mi>n</mml:mi></mml:math></inline-formula> (%)</oasis:entry>
         <oasis:entry colname="col2">2669 (7.8)</oasis:entry>
         <oasis:entry colname="col3">282 (4.0)</oasis:entry>
         <oasis:entry colname="col4"><inline-formula><mml:math id="M156" display="inline"><mml:mrow><mml:mi>V</mml:mi><mml:mo>=</mml:mo><mml:mn mathvariant="normal">0.056</mml:mn></mml:mrow></mml:math></inline-formula></oasis:entry>
         <oasis:entry colname="col5"><inline-formula><mml:math id="M157" display="inline"><mml:mrow><mml:mo>&lt;</mml:mo><mml:mn mathvariant="normal">0.001</mml:mn></mml:mrow></mml:math></inline-formula></oasis:entry>
       </oasis:row>
       <oasis:row>
         <oasis:entry colname="col1">House ownership outside Istanbul, <inline-formula><mml:math id="M158" display="inline"><mml:mi>n</mml:mi></mml:math></inline-formula> (%)</oasis:entry>
         <oasis:entry colname="col2">4210 (12.4)</oasis:entry>
         <oasis:entry colname="col3">491 (7.0)</oasis:entry>
         <oasis:entry colname="col4"><inline-formula><mml:math id="M159" display="inline"><mml:mrow><mml:mi>V</mml:mi><mml:mo>=</mml:mo><mml:mn mathvariant="normal">0.078</mml:mn></mml:mrow></mml:math></inline-formula></oasis:entry>
         <oasis:entry colname="col5"><inline-formula><mml:math id="M160" display="inline"><mml:mrow><mml:mo>&lt;</mml:mo><mml:mn mathvariant="normal">0.001</mml:mn></mml:mrow></mml:math></inline-formula></oasis:entry>
       </oasis:row>
       <oasis:row>
         <oasis:entry colname="col1">Land ownership outside Istanbul, <inline-formula><mml:math id="M161" display="inline"><mml:mi>n</mml:mi></mml:math></inline-formula> (%)</oasis:entry>
         <oasis:entry colname="col2">7092 (20.8)</oasis:entry>
         <oasis:entry colname="col3">889 (12.6)</oasis:entry>
         <oasis:entry colname="col4"><inline-formula><mml:math id="M162" display="inline"><mml:mrow><mml:mi>V</mml:mi><mml:mo>=</mml:mo><mml:mn mathvariant="normal">0.064</mml:mn></mml:mrow></mml:math></inline-formula></oasis:entry>
         <oasis:entry colname="col5"><inline-formula><mml:math id="M163" display="inline"><mml:mrow><mml:mo>&lt;</mml:mo><mml:mn mathvariant="normal">0.001</mml:mn></mml:mrow></mml:math></inline-formula></oasis:entry>
       </oasis:row>
       <oasis:row>
         <oasis:entry colname="col1">Savings for emergency situation, <inline-formula><mml:math id="M164" display="inline"><mml:mi>n</mml:mi></mml:math></inline-formula> (%)</oasis:entry>
         <oasis:entry colname="col2">5499 (16.2)</oasis:entry>
         <oasis:entry colname="col3">260 (3.7)</oasis:entry>
         <oasis:entry colname="col4"><inline-formula><mml:math id="M165" display="inline"><mml:mrow><mml:mi>V</mml:mi><mml:mo>=</mml:mo><mml:mn mathvariant="normal">0.135</mml:mn></mml:mrow></mml:math></inline-formula></oasis:entry>
         <oasis:entry colname="col5"><inline-formula><mml:math id="M166" display="inline"><mml:mrow><mml:mo>&lt;</mml:mo><mml:mn mathvariant="normal">0.001</mml:mn></mml:mrow></mml:math></inline-formula></oasis:entry>
       </oasis:row>
       <oasis:row>
         <oasis:entry colname="col1">Any debt of Hh members, <inline-formula><mml:math id="M167" display="inline"><mml:mi>n</mml:mi></mml:math></inline-formula> (%)</oasis:entry>
         <oasis:entry colname="col2">11 009 (32.3)</oasis:entry>
         <oasis:entry colname="col3">2728 (38.7)</oasis:entry>
         <oasis:entry colname="col4"><inline-formula><mml:math id="M168" display="inline"><mml:mrow><mml:mi>V</mml:mi><mml:mo>=</mml:mo><mml:mn mathvariant="normal">0.051</mml:mn></mml:mrow></mml:math></inline-formula></oasis:entry>
         <oasis:entry colname="col5"><inline-formula><mml:math id="M169" display="inline"><mml:mrow><mml:mo>&lt;</mml:mo><mml:mn mathvariant="normal">0.001</mml:mn></mml:mrow></mml:math></inline-formula></oasis:entry>
       </oasis:row>
     </oasis:tbody>
   </oasis:tgroup><?xmltex \end{scaleboxenv}?></oasis:table><?xmltex \begin{scaleboxenv}{.95}[.95]?><table-wrap-foot><p id="d1e1848"><?xmltex \hack{\vspace{1mm}}?><inline-formula><mml:math id="M56" display="inline"><mml:msup><mml:mi/><mml:mi mathvariant="normal">a</mml:mi></mml:msup></mml:math></inline-formula> 0.2 is a small effect, 0.5 is a medium effect, and 0.8 is a large effect. <inline-formula><mml:math id="M57" display="inline"><mml:msup><mml:mi/><mml:mi mathvariant="normal">b</mml:mi></mml:msup></mml:math></inline-formula> 0.1 is a small effect, 0.3 is a medium effect, and 0.5 is a large effect. HhS: household size. No: number. Where Cohen's <inline-formula><mml:math id="M58" display="inline"><mml:mi>d</mml:mi></mml:math></inline-formula> is given, independent samples <inline-formula><mml:math id="M59" display="inline"><mml:mi>t</mml:mi></mml:math></inline-formula> tests is used; where Cohen's <inline-formula><mml:math id="M60" display="inline"><mml:mi>r</mml:mi></mml:math></inline-formula> is given Mann–Whitney <inline-formula><mml:math id="M61" display="inline"><mml:mi>U</mml:mi></mml:math></inline-formula> test is used; and where Cramér's <inline-formula><mml:math id="M62" display="inline"><mml:mi>V</mml:mi></mml:math></inline-formula> is given, Pearson's chi-square test is used.</p></table-wrap-foot><?xmltex \end{scaleboxenv}?><?xmltex \gdef\@currentlabel{3}?></table-wrap>

<?xmltex \hack{\newpage}?>
</sec>
<?pagebreak page2144?><sec id="Ch1.S4.SS2">
  <label>4.2</label><title>Comparison of machine learning methods</title>
      <p id="d1e3754">The comparison of the machine learning models in terms of their sensitivity, specificity, balanced accuracy, and AUC for different subsampling methods are presented in Fig. <xref ref-type="fig" rid="Ch1.F3"/>. The additional comparisons of models using other evaluation metrics (e.g. positive prediction value, negative prediction value, accuracy, <inline-formula><mml:math id="M170" display="inline"><mml:mrow><mml:mi>F</mml:mi><mml:mn mathvariant="normal">1</mml:mn></mml:mrow></mml:math></inline-formula> score, etc.) can be found in the R Shiny application. Within these comparisons, no substantial differences were observed in the model performance indicators of LR and different ML strategies between RCV and bootstrap resampling methods. Therefore, we present the results that were obtained with repeated 5-fold cross validation.</p>

      <?xmltex \floatpos{t}?><fig id="Ch1.F3" specific-use="star"><?xmltex \currentcnt{3}?><?xmltex \def\figurename{Figure}?><label>Figure 3</label><caption><p id="d1e3771">Model performance comparisons. LR and ML methods are visualised in different colours in all figures. <bold>(a)</bold> Sensitivity (<inline-formula><mml:math id="M171" display="inline"><mml:mi>y</mml:mi></mml:math></inline-formula> axis) in comparison to subsampling technique (<inline-formula><mml:math id="M172" display="inline"><mml:mi>x</mml:mi></mml:math></inline-formula> axis). <bold>(b)</bold> Specificity (<inline-formula><mml:math id="M173" display="inline"><mml:mi>x</mml:mi></mml:math></inline-formula> axis) in comparison to subsampling technique (<inline-formula><mml:math id="M174" display="inline"><mml:mi>y</mml:mi></mml:math></inline-formula> axis). <bold>(c)</bold> Balanced accuracy ((sensitivity <inline-formula><mml:math id="M175" display="inline"><mml:mo>+</mml:mo></mml:math></inline-formula> specificity) <inline-formula><mml:math id="M176" display="inline"><mml:mo>/</mml:mo></mml:math></inline-formula> 2) (<inline-formula><mml:math id="M177" display="inline"><mml:mi>x</mml:mi></mml:math></inline-formula> axis) in comparison to subsampling technique (<inline-formula><mml:math id="M178" display="inline"><mml:mi>y</mml:mi></mml:math></inline-formula> axis). <bold>(d)</bold> Using the under(in) imbalanced subsampling technique, ML methods are compared in terms of the AUC of the ROC curve.</p></caption>
          <?xmltex \igopts{width=361.35pt}?><graphic xlink:href="https://nhess.copernicus.org/articles/23/2133/2023/nhess-23-2133-2023-f03.png"/>

        </fig>

      <p id="d1e3850">As mentioned earlier, the dataset suffered from imbalanced class variables, particularly the outcome variable, and as such significant differences were observed when subsampling strategies were applied. Using the standard algorithm without subsampling (referred to as “Original”) resulted in poor sensitivity (Fig. <xref ref-type="fig" rid="Ch1.F3"/>a), and inflated specificity (Fig. <xref ref-type="fig" rid="Ch1.F3"/>b) rates, due to the class imbalance in the studied sample where the negative class is dominant. Based on the criteria that <inline-formula><mml:math id="M179" display="inline"><mml:mrow><mml:mi mathvariant="normal">AUC</mml:mi><mml:mo>&gt;</mml:mo><mml:mn mathvariant="normal">0.7</mml:mn></mml:mrow></mml:math></inline-formula>, overall, the methods fitted with under subsampling inside the resampling procedure (referred as under(in)) performed better in terms of model performance metrics when compared to other subsampling methods. The highest balanced accuracy for each method was also obtained with under(in) subsampling (Fig. <xref ref-type="fig" rid="Ch1.F3"/>c).</p>

<?xmltex \floatpos{t}?><table-wrap id="Ch1.T4" specific-use="star"><?xmltex \currentcnt{4}?><label>Table 4</label><caption><p id="d1e3875">Comparison of the model performances of LR and ML methods using raw data and under(in) subsampling.</p></caption><oasis:table frame="topbot"><oasis:tgroup cols="7">
     <oasis:colspec colnum="1" colname="col1" align="left"/>
     <oasis:colspec colnum="2" colname="col2" align="right"/>
     <oasis:colspec colnum="3" colname="col3" align="right"/>
     <oasis:colspec colnum="4" colname="col4" align="right"/>
     <oasis:colspec colnum="5" colname="col5" align="right"/>
     <oasis:colspec colnum="6" colname="col6" align="right"/>
     <oasis:colspec colnum="7" colname="col7" align="right"/>
     <oasis:thead>
       <oasis:row>
         <oasis:entry colname="col1">ML models</oasis:entry>
         <oasis:entry colname="col2">AUC</oasis:entry>
         <oasis:entry colname="col3">Accuracy</oasis:entry>
         <oasis:entry colname="col4">Balanced accuracy</oasis:entry>
         <oasis:entry colname="col5">Sensitivity</oasis:entry>
         <oasis:entry colname="col6">Specificity</oasis:entry>
         <oasis:entry colname="col7">Diff sens<inline-formula><mml:math id="M182" display="inline"><mml:msup><mml:mi/><mml:mo>∗</mml:mo></mml:msup></mml:math></inline-formula></oasis:entry>
       </oasis:row>
       <oasis:row rowsep="1">
         <oasis:entry colname="col1"/>
         <oasis:entry colname="col2">(<inline-formula><mml:math id="M183" display="inline"><mml:mn mathvariant="normal">95</mml:mn></mml:math></inline-formula> % CI)</oasis:entry>
         <oasis:entry colname="col3">(<inline-formula><mml:math id="M184" display="inline"><mml:mn mathvariant="normal">95</mml:mn></mml:math></inline-formula> % CI)</oasis:entry>
         <oasis:entry colname="col4">(<inline-formula><mml:math id="M185" display="inline"><mml:mn mathvariant="normal">95</mml:mn></mml:math></inline-formula> % CI)</oasis:entry>
         <oasis:entry colname="col5">(<inline-formula><mml:math id="M186" display="inline"><mml:mn mathvariant="normal">95</mml:mn></mml:math></inline-formula> % CI)</oasis:entry>
         <oasis:entry colname="col6">(<inline-formula><mml:math id="M187" display="inline"><mml:mn mathvariant="normal">95</mml:mn></mml:math></inline-formula> % CI)</oasis:entry>
         <oasis:entry colname="col7">(<inline-formula><mml:math id="M188" display="inline"><mml:mn mathvariant="normal">95</mml:mn></mml:math></inline-formula> % CI)</oasis:entry>
       </oasis:row>
     </oasis:thead>
     <oasis:tbody>
       <oasis:row rowsep="1">
         <oasis:entry namest="col1" nameend="col7">Original data (no subsampling) </oasis:entry>
       </oasis:row>
       <oasis:row>
         <oasis:entry colname="col1">LR</oasis:entry>
         <oasis:entry colname="col2">0.798</oasis:entry>
         <oasis:entry colname="col3">0.842</oasis:entry>
         <oasis:entry colname="col4">0.598</oasis:entry>
         <oasis:entry colname="col5">0.224</oasis:entry>
         <oasis:entry colname="col6">0.971</oasis:entry>
         <oasis:entry colname="col7">n/a</oasis:entry>
       </oasis:row>
       <oasis:row rowsep="1">
         <oasis:entry colname="col1"/>
         <oasis:entry colname="col2">(0.776–0.820)</oasis:entry>
         <oasis:entry colname="col3">(0.830–0.853)</oasis:entry>
         <oasis:entry colname="col4">(0.573–0.623)</oasis:entry>
         <oasis:entry colname="col5">(0.194–0.257)</oasis:entry>
         <oasis:entry colname="col6">(0.965–0.976)</oasis:entry>
         <oasis:entry colname="col7"/>
       </oasis:row>
       <oasis:row>
         <oasis:entry colname="col1">CART</oasis:entry>
         <oasis:entry colname="col2">0.771</oasis:entry>
         <oasis:entry colname="col3">0.823</oasis:entry>
         <oasis:entry colname="col4">0.629</oasis:entry>
         <oasis:entry colname="col5">0.332</oasis:entry>
         <oasis:entry colname="col6">0.926</oasis:entry>
         <oasis:entry colname="col7">n/a</oasis:entry>
       </oasis:row>
       <oasis:row rowsep="1">
         <oasis:entry colname="col1"/>
         <oasis:entry colname="col2">(0.752–0.790)</oasis:entry>
         <oasis:entry colname="col3">(0.811–0.835)</oasis:entry>
         <oasis:entry colname="col4">(0.610–0.649)</oasis:entry>
         <oasis:entry colname="col5">(0.297–0.368)</oasis:entry>
         <oasis:entry colname="col6">(0.916–0.934)</oasis:entry>
         <oasis:entry colname="col7"/>
       </oasis:row>
       <oasis:row>
         <oasis:entry colname="col1">RF</oasis:entry>
         <oasis:entry colname="col2">0.795</oasis:entry>
         <oasis:entry colname="col3">0.842</oasis:entry>
         <oasis:entry colname="col4">0.615</oasis:entry>
         <oasis:entry colname="col5">0.268</oasis:entry>
         <oasis:entry colname="col6">0.963</oasis:entry>
         <oasis:entry colname="col7">n/a</oasis:entry>
       </oasis:row>
       <oasis:row rowsep="1">
         <oasis:entry colname="col1"/>
         <oasis:entry colname="col2">(0.775–0.815)</oasis:entry>
         <oasis:entry colname="col3">(0.830–0.853)</oasis:entry>
         <oasis:entry colname="col4">(0.598–0.632)</oasis:entry>
         <oasis:entry colname="col5">(0.236–0.303)</oasis:entry>
         <oasis:entry colname="col6">(0.955–0.969)</oasis:entry>
         <oasis:entry colname="col7"/>
       </oasis:row>
       <oasis:row>
         <oasis:entry colname="col1">SVM</oasis:entry>
         <oasis:entry colname="col2">0.738</oasis:entry>
         <oasis:entry colname="col3">0.836</oasis:entry>
         <oasis:entry colname="col4">0.573</oasis:entry>
         <oasis:entry colname="col5">0.170</oasis:entry>
         <oasis:entry colname="col6">0.976</oasis:entry>
         <oasis:entry colname="col7">n/a</oasis:entry>
       </oasis:row>
       <oasis:row rowsep="1">
         <oasis:entry colname="col1"/>
         <oasis:entry colname="col2">(0.709–0.767)</oasis:entry>
         <oasis:entry colname="col3">(0.825–0.848)</oasis:entry>
         <oasis:entry colname="col4">(0.560–0.586)</oasis:entry>
         <oasis:entry colname="col5">(0.144–0.200)</oasis:entry>
         <oasis:entry colname="col6">(0.970–0.981)</oasis:entry>
         <oasis:entry colname="col7"/>
       </oasis:row>
       <oasis:row>
         <oasis:entry colname="col1">NB</oasis:entry>
         <oasis:entry colname="col2">0.784</oasis:entry>
         <oasis:entry colname="col3">0.832</oasis:entry>
         <oasis:entry colname="col4">0.654</oasis:entry>
         <oasis:entry colname="col5">0.382</oasis:entry>
         <oasis:entry colname="col6">0.926</oasis:entry>
         <oasis:entry colname="col7">n/a</oasis:entry>
       </oasis:row>
       <oasis:row rowsep="1">
         <oasis:entry colname="col1"/>
         <oasis:entry colname="col2">(0.767–0.801)</oasis:entry>
         <oasis:entry colname="col3">(0.820–0.843)</oasis:entry>
         <oasis:entry colname="col4">(0.635–0.673)</oasis:entry>
         <oasis:entry colname="col5">(0.346–0.419)</oasis:entry>
         <oasis:entry colname="col6">(0.917–0.935)</oasis:entry>
         <oasis:entry colname="col7"/>
       </oasis:row>
       <oasis:row>
         <oasis:entry colname="col1">K-NN</oasis:entry>
         <oasis:entry colname="col2">0.805</oasis:entry>
         <oasis:entry colname="col3">0.838</oasis:entry>
         <oasis:entry colname="col4">0.547</oasis:entry>
         <oasis:entry colname="col5">0.102</oasis:entry>
         <oasis:entry colname="col6">0.992</oasis:entry>
         <oasis:entry colname="col7">n/a</oasis:entry>
       </oasis:row>
       <oasis:row rowsep="1">
         <oasis:entry colname="col1"/>
         <oasis:entry colname="col2">(0.772–0.838)</oasis:entry>
         <oasis:entry colname="col3">(0.826–0.849)</oasis:entry>
         <oasis:entry colname="col4">(0.535–0.559)</oasis:entry>
         <oasis:entry colname="col5">(0.081–0.127)</oasis:entry>
         <oasis:entry colname="col6">(0.989–0.995)</oasis:entry>
         <oasis:entry colname="col7"/>
       </oasis:row>
       <oasis:row>
         <oasis:entry colname="col1">ANN</oasis:entry>
         <oasis:entry colname="col2">0.820</oasis:entry>
         <oasis:entry colname="col3">0.851</oasis:entry>
         <oasis:entry colname="col4">0.626</oasis:entry>
         <oasis:entry colname="col5">0.281</oasis:entry>
         <oasis:entry colname="col6">0.971</oasis:entry>
         <oasis:entry colname="col7">n/a</oasis:entry>
       </oasis:row>
       <oasis:row rowsep="1">
         <oasis:entry colname="col1"/>
         <oasis:entry colname="col2">(0.801–0.839)</oasis:entry>
         <oasis:entry colname="col3">(0.840–0.862)</oasis:entry>
         <oasis:entry colname="col4">(0.609–0.643)</oasis:entry>
         <oasis:entry colname="col5">(0.248–0.316)</oasis:entry>
         <oasis:entry colname="col6">(0.964–0.976)</oasis:entry>
         <oasis:entry colname="col7"/>
       </oasis:row>
       <oasis:row rowsep="1">
         <oasis:entry namest="col1" nameend="col7">Using under (in) subsampling </oasis:entry>
       </oasis:row>
       <oasis:row>
         <oasis:entry colname="col1">LR</oasis:entry>
         <oasis:entry colname="col2">0.798</oasis:entry>
         <oasis:entry colname="col3">0.704</oasis:entry>
         <oasis:entry colname="col4">0.713</oasis:entry>
         <oasis:entry colname="col5">0.726</oasis:entry>
         <oasis:entry colname="col6">0.699</oasis:entry>
         <oasis:entry colname="col7">0.502</oasis:entry>
       </oasis:row>
       <oasis:row rowsep="1">
         <oasis:entry colname="col1"/>
         <oasis:entry colname="col2">(0.785–0.811)</oasis:entry>
         <oasis:entry colname="col3">(0.690–0.718)</oasis:entry>
         <oasis:entry colname="col4">(0.689–0.737)</oasis:entry>
         <oasis:entry colname="col5">(0.691–0.759)</oasis:entry>
         <oasis:entry colname="col6">(0.683–0.715)</oasis:entry>
         <oasis:entry colname="col7">(0.483–0.520)</oasis:entry>
       </oasis:row>
       <oasis:row>
         <oasis:entry colname="col1">CART</oasis:entry>
         <oasis:entry colname="col2">0.782<inline-formula><mml:math id="M189" display="inline"><mml:msup><mml:mi/><mml:mi mathvariant="normal">a</mml:mi></mml:msup></mml:math></inline-formula>​​​​​​​</oasis:entry>
         <oasis:entry colname="col3">0.704</oasis:entry>
         <oasis:entry colname="col4">0.712</oasis:entry>
         <oasis:entry colname="col5">0.725</oasis:entry>
         <oasis:entry colname="col6">0.699</oasis:entry>
         <oasis:entry colname="col7">0.393</oasis:entry>
       </oasis:row>
       <oasis:row rowsep="1">
         <oasis:entry colname="col1"/>
         <oasis:entry colname="col2">(0.768–0.796)</oasis:entry>
         <oasis:entry colname="col3">(0.690–718)</oasis:entry>
         <oasis:entry colname="col4">(0.690–0.734)</oasis:entry>
         <oasis:entry colname="col5">(0.690–0.757)</oasis:entry>
         <oasis:entry colname="col6">(0.684–0.715)</oasis:entry>
         <oasis:entry colname="col7">(0.373–0.413)</oasis:entry>
       </oasis:row>
       <oasis:row>
         <oasis:entry colname="col1">RF</oasis:entry>
         <oasis:entry colname="col2">0.803</oasis:entry>
         <oasis:entry colname="col3">0.722</oasis:entry>
         <oasis:entry colname="col4">0.713</oasis:entry>
         <oasis:entry colname="col5">0.711</oasis:entry>
         <oasis:entry colname="col6">0.724</oasis:entry>
         <oasis:entry colname="col7">0.443</oasis:entry>
       </oasis:row>
       <oasis:row rowsep="1">
         <oasis:entry colname="col1"/>
         <oasis:entry colname="col2">(0.790–0.816)</oasis:entry>
         <oasis:entry colname="col3">(0.708–736)</oasis:entry>
         <oasis:entry colname="col4">(0.692–0.734)</oasis:entry>
         <oasis:entry colname="col5">(0.676–0.744)</oasis:entry>
         <oasis:entry colname="col6">(0.709–0.738)</oasis:entry>
         <oasis:entry colname="col7">(0.421–0.465)</oasis:entry>
       </oasis:row>
       <oasis:row>
         <oasis:entry colname="col1">SVM</oasis:entry>
         <oasis:entry colname="col2">0.799</oasis:entry>
         <oasis:entry colname="col3">0.707</oasis:entry>
         <oasis:entry colname="col4">0.715</oasis:entry>
         <oasis:entry colname="col5">0.72</oasis:entry>
         <oasis:entry colname="col6">0.702</oasis:entry>
         <oasis:entry colname="col7">0.559</oasis:entry>
       </oasis:row>
       <oasis:row rowsep="1">
         <oasis:entry colname="col1"/>
         <oasis:entry colname="col2">(0.786–0.812)</oasis:entry>
         <oasis:entry colname="col3">(0.693–721)</oasis:entry>
         <oasis:entry colname="col4">(0.693–0.737)</oasis:entry>
         <oasis:entry colname="col5">(0.694–0.761)</oasis:entry>
         <oasis:entry colname="col6">(0.687–0.718)</oasis:entry>
         <oasis:entry colname="col7">(0.541–0.576)</oasis:entry>
       </oasis:row>
       <oasis:row>
         <oasis:entry colname="col1">NB</oasis:entry>
         <oasis:entry colname="col2">0.778<inline-formula><mml:math id="M190" display="inline"><mml:msup><mml:mi/><mml:mi mathvariant="normal">b</mml:mi></mml:msup></mml:math></inline-formula></oasis:entry>
         <oasis:entry colname="col3">0.566<inline-formula><mml:math id="M191" display="inline"><mml:msup><mml:mi/><mml:mi mathvariant="normal">a</mml:mi></mml:msup></mml:math></inline-formula></oasis:entry>
         <oasis:entry colname="col4">0.690</oasis:entry>
         <oasis:entry colname="col5">0.871<inline-formula><mml:math id="M192" display="inline"><mml:msup><mml:mi/><mml:mi mathvariant="normal">a</mml:mi></mml:msup></mml:math></inline-formula></oasis:entry>
         <oasis:entry colname="col6">0.502<inline-formula><mml:math id="M193" display="inline"><mml:msup><mml:mi/><mml:mi mathvariant="normal">a</mml:mi></mml:msup></mml:math></inline-formula></oasis:entry>
         <oasis:entry colname="col7">0.489</oasis:entry>
       </oasis:row>
       <oasis:row rowsep="1">
         <oasis:entry colname="col1"/>
         <oasis:entry colname="col2">(0.763–0.793)</oasis:entry>
         <oasis:entry colname="col3">(0.550–0.581</oasis:entry>
         <oasis:entry colname="col4">(0.671–0.710)</oasis:entry>
         <oasis:entry colname="col5">(0.843–0.894)</oasis:entry>
         <oasis:entry colname="col6">(0.485–0.519)</oasis:entry>
         <oasis:entry colname="col7">(0.471–0.507)</oasis:entry>
       </oasis:row>
       <oasis:row>
         <oasis:entry colname="col1">K-NN</oasis:entry>
         <oasis:entry colname="col2">0.800</oasis:entry>
         <oasis:entry colname="col3">0.720</oasis:entry>
         <oasis:entry colname="col4">0.719</oasis:entry>
         <oasis:entry colname="col5">0.719</oasis:entry>
         <oasis:entry colname="col6">0.720</oasis:entry>
         <oasis:entry colname="col7">0.617</oasis:entry>
       </oasis:row>
       <oasis:row rowsep="1">
         <oasis:entry colname="col1"/>
         <oasis:entry colname="col2">(0.786–0.814)</oasis:entry>
         <oasis:entry colname="col3">(0.705–0.733)</oasis:entry>
         <oasis:entry colname="col4">(0.697–0.742)</oasis:entry>
         <oasis:entry colname="col5">(0.684–0.752)</oasis:entry>
         <oasis:entry colname="col6">(0.704–0.735)</oasis:entry>
         <oasis:entry colname="col7">(0.600–0.633)</oasis:entry>
       </oasis:row>
       <oasis:row>
         <oasis:entry colname="col1">ANN</oasis:entry>
         <oasis:entry colname="col2">0.813<inline-formula><mml:math id="M194" display="inline"><mml:msup><mml:mi/><mml:mrow><mml:mi mathvariant="normal">a</mml:mi><mml:mo>,</mml:mo><mml:mi mathvariant="normal">b</mml:mi></mml:mrow></mml:msup></mml:math></inline-formula></oasis:entry>
         <oasis:entry colname="col3">0.724<inline-formula><mml:math id="M195" display="inline"><mml:msup><mml:mi/><mml:mi mathvariant="normal">a</mml:mi></mml:msup></mml:math></inline-formula></oasis:entry>
         <oasis:entry colname="col4">0.730</oasis:entry>
         <oasis:entry colname="col5">0.740<inline-formula><mml:math id="M196" display="inline"><mml:msup><mml:mi/><mml:mi mathvariant="normal">a</mml:mi></mml:msup></mml:math></inline-formula></oasis:entry>
         <oasis:entry colname="col6">0.720<inline-formula><mml:math id="M197" display="inline"><mml:msup><mml:mi/><mml:mi mathvariant="normal">a</mml:mi></mml:msup></mml:math></inline-formula></oasis:entry>
         <oasis:entry colname="col7">0.459</oasis:entry>
       </oasis:row>
       <oasis:row>
         <oasis:entry colname="col1"/>
         <oasis:entry colname="col2">(0.800–0.826)</oasis:entry>
         <oasis:entry colname="col3">(0.710–0.737)</oasis:entry>
         <oasis:entry colname="col4">(0.709–0.752)</oasis:entry>
         <oasis:entry colname="col5">(0.706–0.772)</oasis:entry>
         <oasis:entry colname="col6">(0.705–0.735)</oasis:entry>
         <oasis:entry colname="col7">(0.440–0.478)</oasis:entry>
       </oasis:row>
     </oasis:tbody>
   </oasis:tgroup></oasis:table><table-wrap-foot><p id="d1e3878">Diff sens: the difference in sensitivity between the same ML method with and without subsampling strategy for imbalanced problem. <inline-formula><mml:math id="M180" display="inline"><mml:msup><mml:mi/><mml:mrow><mml:mi mathvariant="normal">a</mml:mi><mml:mo>,</mml:mo><mml:mi mathvariant="normal">b</mml:mi></mml:mrow></mml:msup></mml:math></inline-formula> The same superscript letters indicate statistically significant difference in a performance measure between two methods, at <inline-formula><mml:math id="M181" display="inline"><mml:mrow><mml:mi mathvariant="italic">α</mml:mi><mml:mo>&lt;</mml:mo><mml:mn mathvariant="normal">0.05</mml:mn><mml:mo>/</mml:mo><mml:mn mathvariant="normal">7</mml:mn><mml:mo>=</mml:mo><mml:mn mathvariant="normal">0.007</mml:mn></mml:mrow></mml:math></inline-formula> significance level. CI: confidence interval. n/a​​​​​​​: not applicable.</p></table-wrap-foot><?xmltex \gdef\@currentlabel{4}?></table-wrap>

      <p id="d1e4814">In Table <xref ref-type="table" rid="Ch1.T4"/>, all ML methods using under(in) subsampling were compared to their counterpart using the original data without imbalanced subsampling. Here we remind the reader that the priority in this study was to assess the performance of the models in terms of their success in identifying the households with high social vulnerability, which is the minority class but therefore also the positive class. Using the under(in) subsampling strategy demonstrated superior sensitivity and balanced accuracy rates compared to using original data and other subsampling strategies. Therefore, the results obtained with under(in) subsampling are considered for further comparisons between ML methods. Classification results for the ML models using under(in) subsampling are presented with ROC curves in Fig. <xref ref-type="fig" rid="Ch1.F3"/>d. The ROC curves for all other subsampling strategies with all other methods can be found in the R Shiny web application.</p>
      <p id="d1e4821">The best-performing method in terms of AUC, accuracy, balanced accuracy, and sensitivity was the artificial neural network using the under(in) subsampling<?pagebreak page2145?> strategy (AUC: 0.813 (0.800–0.826), accuracy: 0.724 (0.710–0.737), balanced accuracy: 0.730 (0.790–0.752), sensitivity: 0.740 (0.706–0.772), specificity: 0.720 (0.705–0.735)). Naïve Bayes (NB) also produced a high sensitivity rate of 0.871 (0.843–0.894); however, it resulted in significantly lower specificity (0.502 (0.485–0.519)) and overall accuracy 0.566 (0.550–0.581) compared to ANN (<inline-formula><mml:math id="M198" display="inline"><mml:mrow><mml:mi>p</mml:mi><mml:mo>=</mml:mo><mml:mn mathvariant="normal">0.003</mml:mn></mml:mrow></mml:math></inline-formula> and <inline-formula><mml:math id="M199" display="inline"><mml:mrow><mml:mi>p</mml:mi><mml:mo>&lt;</mml:mo><mml:mn mathvariant="normal">0.001</mml:mn></mml:mrow></mml:math></inline-formula>, respectively). While ANN balances sensitivity (0.740) and specificity (0.720), NB emphasises sensitivity (0.871) over specificity (0.502). All other methods using under(in) sampling provided similar sensitivity rates between the range of <inline-formula><mml:math id="M200" display="inline"><mml:mn mathvariant="normal">71.9</mml:mn></mml:math></inline-formula> % and <inline-formula><mml:math id="M201" display="inline"><mml:mn mathvariant="normal">72.9</mml:mn></mml:math></inline-formula> % and specificity rates between <inline-formula><mml:math id="M202" display="inline"><mml:mn mathvariant="normal">69.9</mml:mn></mml:math></inline-formula> % and <inline-formula><mml:math id="M203" display="inline"><mml:mn mathvariant="normal">72.4</mml:mn></mml:math></inline-formula> %. When AUC was considered, CART was also significantly worse than ANN (0.782 (0.768–0.796) vs. 0.813 (0.800–0.826), <inline-formula><mml:math id="M204" display="inline"><mml:mrow><mml:mi>p</mml:mi><mml:mo>=</mml:mo><mml:mn mathvariant="normal">0.005</mml:mn></mml:mrow></mml:math></inline-formula>). Logistic regression, random forest, support vector machine, and <inline-formula><mml:math id="M205" display="inline"><mml:mi>k</mml:mi></mml:math></inline-formula>-nearest neighbours did not show significant differences from ANN in terms of performance metrics.</p>
</sec>
<sec id="Ch1.S4.SS3">
  <label>4.3</label><title>Important predictors for the machine learning methods</title>
      <p id="d1e4904">In Fig. <xref ref-type="fig" rid="Ch1.F4"/>, a visual summary of variable importance analysis is presented as the relative importance of the predictors, as indicated by the ML methods using under(in) sampling. As the methodologies used for analysing variable importance vary across different models, we averaged the variable importance rankings obtained with all models in Fig. <xref ref-type="fig" rid="Ch1.F4"/>a. The most important variable for every model is given a score of <inline-formula><mml:math id="M206" display="inline"><mml:mn mathvariant="normal">100</mml:mn></mml:math></inline-formula> %, followed by the next important variable which takes a relative value between 0 and 100. The variables which appeared in the top 10 most influential variables in all seven models were education, having social security, the ratio of income earners in the household, and having savings for emergency situations. Of these variables, the variable with the highest average importance was education.</p>

      <?xmltex \floatpos{t}?><fig id="Ch1.F4" specific-use="star"><?xmltex \currentcnt{4}?><?xmltex \def\figurename{Figure}?><label>Figure 4</label><caption><p id="d1e4920">Important predictors for the assessment of social vulnerability. <bold>(a)</bold> The average relative importance of the predictors obtained with ML methods using under(in) sampling. Average ranking of the predictor across all models (<inline-formula><mml:math id="M207" display="inline"><mml:mi>y</mml:mi></mml:math></inline-formula> axis) in comparison to the number of models that the predictor appeared in the top 10 most important variables (<inline-formula><mml:math id="M208" display="inline"><mml:mi>x</mml:mi></mml:math></inline-formula> axis). <bold>(b)</bold> Variable importance for the ANN-under(in) model.</p></caption>
          <?xmltex \igopts{width=341.433071pt}?><graphic xlink:href="https://nhess.copernicus.org/articles/23/2133/2023/nhess-23-2133-2023-f04.png"/>

        </fig>

      <?pagebreak page2146?><p id="d1e4949"><?xmltex \hack{\newpage}?>In Fig. <xref ref-type="fig" rid="Ch1.F4"/>b we investigated the relative importance of the independent variables within the top-performing model, ANN under(in), using the approach suggested by <xref ref-type="bibr" rid="bib1.bibx50" id="text.90"/>. Based on this model, the most important variable for the classification of households’ social vulnerability appeared to be having social security. The other predictors with over <inline-formula><mml:math id="M209" display="inline"><mml:mn mathvariant="normal">50</mml:mn></mml:math></inline-formula> % of relative importance were a mixture of demographic and economic variables including living in a squatter house, job insecurity, ratio of the over 65 year olds in the household, owning a house outside of Istanbul, household size, the ratio of income earners in the household and having savings for emergency situations.</p>
</sec>
<sec id="Ch1.S4.SS4">
  <label>4.4</label><title>Spatial distribution of the important predictors of the ANN model</title>
      <p id="d1e4973">Based on the variable importance analysis with the top-performing model, ANN under(in), we performed area-based calculations to compare the neighbourhood characteristics in Istanbul. For categorical variables, the prevalence in the neighbourhood was calculated, while neighbourhood<?pagebreak page2147?> averages were used for the continuous variables. The three most important predictors of social vulnerability level were subsequently displayed as a five-category map in Fig. <xref ref-type="fig" rid="Ch1.F5"/>.</p>

      <?xmltex \floatpos{p}?><fig id="Ch1.F5" specific-use="star"><?xmltex \currentcnt{5}?><?xmltex \def\figurename{Figure}?><label>Figure 5</label><caption><p id="d1e4980">The five-category neighbourhood map of the three most important predictors of social vulnerability. <bold>(a)</bold> Neighbourhood prevalence of having social security. <bold>(b)</bold> Neighbourhood prevalence of living in squatter houses. <bold>(c)</bold> Neighbourhood prevalence of job insecurity of any household member.</p></caption>
          <?xmltex \igopts{width=355.659449pt}?><graphic xlink:href="https://nhess.copernicus.org/articles/23/2133/2023/nhess-23-2133-2023-f05.jpg"/>

        </fig>

      <p id="d1e4998">For Fig. <xref ref-type="fig" rid="Ch1.F5"/>a, the areas represented with dark red colours, below <inline-formula><mml:math id="M210" display="inline"><mml:mn mathvariant="normal">70</mml:mn></mml:math></inline-formula> %, indicate those neighbourhoods with the lowest social security, and these areas are prevalent in the outer regions of the metropolitan area. On the other hand, those neighbourhoods close to the central region mostly cover households with a higher prevalence of social security benefits. The number of neighbourhoods with a high density of squatter housing (<inline-formula><mml:math id="M211" display="inline"><mml:mrow><mml:mo>&gt;</mml:mo><mml:mn mathvariant="normal">20</mml:mn></mml:mrow></mml:math></inline-formula> %) was 27 (Fig. <xref ref-type="fig" rid="Ch1.F5"/>b). These neighbourhoods are scattered throughout the city and are not concentrated in any specific region. The households with job insecurity are mainly located in the central region of the city (Fig. <xref ref-type="fig" rid="Ch1.F5"/>c). The distribution of all other variables across neighbourhoods of Istanbul can be found in the R Shiny web application.</p>
</sec>
</sec>
<sec id="Ch1.S5">
  <label>5</label><title>Discussion</title>
<sec id="Ch1.S5.SS1">
  <label>5.1</label><title>The selection of the optimal ML method</title>
      <?pagebreak page2149?><p id="d1e5041">In this study, we demonstrated that it is possible to predict the social vulnerability of households with a certain degree of precision using household indicators available within the databases of various institutions and public authorities. Based on our results, the best-performing ML method for identifying households with high social vulnerability was ANN using under subsampling within the resampling procedure to address the problem of class imbalance (AUC = 0.813, balanced accuracy is <inline-formula><mml:math id="M212" display="inline"><mml:mn mathvariant="normal">73</mml:mn></mml:math></inline-formula> %, sensitivity is <inline-formula><mml:math id="M213" display="inline"><mml:mn mathvariant="normal">74</mml:mn></mml:math></inline-formula> %, and specificity is <inline-formula><mml:math id="M214" display="inline"><mml:mn mathvariant="normal">72</mml:mn></mml:math></inline-formula> %). ANN is often considered an effective and useful tool for identifying hidden relationships between socio-demographic and socio-economic variables that arise in social science research <xref ref-type="bibr" rid="bib1.bibx77 bib1.bibx35" id="paren.91"/>. This may imply that the interrelated social relations between the variables in our dataset may be best handled by ANN. Apart from CART and NB, all methods provided similar AUC results (<inline-formula><mml:math id="M215" display="inline"><mml:mn mathvariant="normal">0.80</mml:mn></mml:math></inline-formula>) with no significant differences. There was no significant difference between the ML methods, except NB, in terms of the performance of identifying households with high social vulnerability (i.e. sensitivity).</p>
      <p id="d1e5075">A model with an AUC greater than 0.80 was considered to have an excellent discriminative ability by <xref ref-type="bibr" rid="bib1.bibx59" id="text.92"/>. Therefore, our proposed ANN model, with AUC of 0.813, indicated a good ability to discriminate households with high social vulnerability in a hazard event in Istanbul from those with low social vulnerability. Similarly, the AUC values achieved with RF and KNN were greater than 0.8. In terms of predictive accuracy, we obtained the largest balanced accuracy (<inline-formula><mml:math id="M216" display="inline"><mml:mn mathvariant="normal">73</mml:mn></mml:math></inline-formula> %) with ANN. Further, the accuracy obtained with ANN and other models did not differ significantly. We considered the accuracy of our optimal ANN model to be acceptable, as the value is halfway between <inline-formula><mml:math id="M217" display="inline"><mml:mn mathvariant="normal">50</mml:mn></mml:math></inline-formula> %, which is useless, and <inline-formula><mml:math id="M218" display="inline"><mml:mn mathvariant="normal">100</mml:mn></mml:math></inline-formula> %, which is perfect <xref ref-type="bibr" rid="bib1.bibx90" id="paren.93"/>.</p>
      <p id="d1e5105">A limited number of studies have used ML to predict hazard-related social vulnerability and reported performance metrics. <xref ref-type="bibr" rid="bib1.bibx1" id="text.94"/> achieved an AUC of 0.780 using the CART model to predict the social vulnerability of residential units in Andalusia with dwelling variables. Similarly, we obtained an AUC of 0.782 with the CART model when under sampling was used. When demographic and social indicators were used with an ANN model, <xref ref-type="bibr" rid="bib1.bibx1" id="text.95"/>  obtained a balanced accuracy of <inline-formula><mml:math id="M219" display="inline"><mml:mn mathvariant="normal">86.1</mml:mn></mml:math></inline-formula> %. <xref ref-type="bibr" rid="bib1.bibx7" id="text.96"/> reported a high accuracy of <inline-formula><mml:math id="M220" display="inline"><mml:mn mathvariant="normal">95.6</mml:mn></mml:math></inline-formula> % with ANN using  regional indicators when predicting the social vulnerability of municipal zones in Tabriz, Iran. Compared to these studies, we obtained a relatively low accuracy with our ML models, as we focused on proposing an optimal modelling strategy using readily available household variables. Thus, our modelling approach can be useful for decision makers to take immediate action for the most vulnerable households, and there is no doubt that the predictive performance of our models would benefit from incorporating more predictor variables.</p>
</sec>
<sec id="Ch1.S5.SS2">
  <label>5.2</label><title>The importance of subsampling for imbalanced class variables</title>
      <p id="d1e5139">An important aspect of our study was to find the most viable solution for the imbalance problem in our dataset, as the imbalance ratio between the high and low SV groups was around <inline-formula><mml:math id="M221" display="inline"><mml:mrow><mml:mn mathvariant="normal">1</mml:mn><mml:mo>/</mml:mo><mml:mn mathvariant="normal">5</mml:mn></mml:mrow></mml:math></inline-formula>. When no subsampling strategy was applied to handle imbalance problem, we obtained poor sensitivity rates. A <inline-formula><mml:math id="M222" display="inline"><mml:mn mathvariant="normal">39.3</mml:mn></mml:math></inline-formula> % to <inline-formula><mml:math id="M223" display="inline"><mml:mn mathvariant="normal">61.7</mml:mn></mml:math></inline-formula> % gain in sensitivity was achieved with different ML models when under(in) subsampling was applied, and therefore the imbalance was being addressed, compared to using the original raw data without subsampling.</p>
      <p id="d1e5168">In our study, when ML models without subsampling strategies were used, the overall accuracy was higher due to the inflated specificity compared to the models using subsampling strategies. The standard application of ML model targets is to maximise the overall accuracy. Therefore, if they are trained on imbalanced data without considering imbalanced classes, they tend to over predict the class with higher frequency <xref ref-type="bibr" rid="bib1.bibx44" id="paren.97"/>, which is the low vulnerability group in our dataset. This increases specificity and therefore reduces sensitivity. Therefore, the models based on the original imbalanced data resulted in lower sensitivity and failed to identify households with high social vulnerability, and they failed to meet our aims in the study.</p>
      <p id="d1e5174">Among subsampling methods, the random-majority under-sampling approach resulted in the best performance for all ML methods. This method discards data points from the majority class (i.e. low vulnerability group) at random until a more balanced distribution is reached, while training the models. Our dataset was sufficiently large to not be negatively affected by the discarding of data. Our results obtained with random under sampling are consistent with the ML literature, in the sense that if the size of the dataset is large then it is better to employ an under-sampling method <xref ref-type="bibr" rid="bib1.bibx38" id="paren.98"/>.</p>
</sec>
<sec id="Ch1.S5.SS3">
  <label>5.3</label><title>Important variables and their theoretical implications</title>
      <p id="d1e5188">Variable importance rankings tended to differ depending on the technique employed. Therefore, initially we aggregated the results of the variable importance analysis. On average, education was found to be the most important variable in all methods, followed by having social security, the ratio of income earners in the household, and having savings to be used in emergency situations. Within the top-performing model, ANN, the most important variable was found to be social security, followed by living in a squatter house, and job insecurity. When we discuss these results based on socio-urban conditions in Türkiye, we can easily comprehend that education and social security are interrelated factors, as more educated citizens tend to work in jobs with social security. Second, income and savings represent households’ economic power to cope with hazards.</p>
      <?pagebreak page2150?><p id="d1e5191">Social security refers to the right to have the guarantee of unemployment benefits, retirement pensions, public protection from job injuries, and access to public health coverage, gained through regular work and employment <xref ref-type="bibr" rid="bib1.bibx95" id="paren.99"/>. The lack of social security and insurance, particularly in a demonstrably unstable economy, increases vulnerability to many kinds of crises, including disasters and health emergencies such as pandemics. In our research, having social security actually means being able to get different kinds of socio-economic and health support in sudden shocks, which also covers the aftermath of a hazard, as the individual is registered in the public health system. In Türkiye, the rate of unregistered labourers who are not affiliated with the Social Security Institution in total employment was <inline-formula><mml:math id="M224" display="inline"><mml:mn mathvariant="normal">27.4</mml:mn></mml:math></inline-formula> % <xref ref-type="bibr" rid="bib1.bibx108" id="paren.100"/>, while most unregistered  labourers were found in the agriculture and service sectors <xref ref-type="bibr" rid="bib1.bibx86" id="paren.101"/>. Unregistered employment means that no social insurance premiums are paid by the employer; thus, employees cannot benefit from social security <xref ref-type="bibr" rid="bib1.bibx109" id="paren.102"/>. However, people in agriculture are mostly self-employed and do not have social security because they cannot afford to pay social security premiums regularly. Hence, the map we have presented on the different social security status of neighbourhoods with respect to the household survey indicates the northwest of Istanbul as having lower social security, which may be due to a large number of agricultural areas in that region. However, those neighbourhoods close to the centre of the Istanbul metropolitan area are mostly inhabited by people employed in the services and industrial sectors, with a higher rate of registered employment and thus a higher prevalence of social security benefits. Moreover, in the data presented, the prevalence of social security in the high vulnerability group is around <inline-formula><mml:math id="M225" display="inline"><mml:mn mathvariant="normal">72</mml:mn></mml:math></inline-formula> %, whereas it is as high as <inline-formula><mml:math id="M226" display="inline"><mml:mn mathvariant="normal">91</mml:mn></mml:math></inline-formula> % in the households with low vulnerability.</p>
      <p id="d1e5228">Based on our findings, living in a squatter house was the second most important variable of social vulnerability using the ANN method. Squatter housing comprises houses that are assembled quickly and do not conform to the technical and legal standards (called “gecekondu”, as the Turkish name for poor squatter settlements). Hence, this type of housing represents at-high-risk buildings in the event of geological and climatic hazards and is more likely to be damaged in such events, which implies higher vulnerability to hazards. One of the large-scale hazardous events anticipated for Istanbul is an earthquake with a magnitude greater than 7 MW, which is predicted to strike the city within the next 30 years with 42 %–47 % probability <xref ref-type="bibr" rid="bib1.bibx83" id="paren.103"/>. Previous studies inform that a large proportion of buildings in Istanbul, including squatter settlements, are not earthquake resistant
<xref ref-type="bibr" rid="bib1.bibx61 bib1.bibx88 bib1.bibx43 bib1.bibx42 bib1.bibx9" id="paren.104"/>. Furthermore, squatter housing is linked to a poor socio-economic household profile. It is known that poorer people are more vulnerable to natural hazards, as they settle in buildings that are at higher risk but more affordable to them because of cheap rents <xref ref-type="bibr" rid="bib1.bibx99" id="paren.105"/>. In particular, squatter houses are very low-quality buildings, and when taken together with the poor socio-economic characteristics of their residents, they represent high social vulnerability for households. A study by <xref ref-type="bibr" rid="bib1.bibx1" id="text.106"/> in Andalusia, which used CART, showed the importance of dwelling variables on social vulnerability, such as the average age of constructions and the density of buildings in a particular district of an urban area. In our study, the age of the buildings was not available in the data; however, the type of housing was found to be an important predictor of social vulnerability.</p>
      <p id="d1e5243">With the ANN method, the third-highest-ranked variable was job insecurity. The spatial distribution of neighbourhoods in terms of job insecurity indicates that the centre of Istanbul close to the Marmara Sea is densely populated, with households with job insecurity representing the possible unemployment figures in those crowded areas. Further, as mentioned above in the social security indicator, the labour market opportunities in Türkiye are highly dominated by the casual or seasonal employment opportunities <xref ref-type="bibr" rid="bib1.bibx86" id="paren.107"/>. Such forms of casual employment are highly fragile since the labourers are not in full employment and not registered in the social insurance system. A recent study showed that casual and unregistered employment increases social vulnerability to natural hazards <xref ref-type="bibr" rid="bib1.bibx76" id="paren.108"/>. These may be either in the form of casual, seasonal employment or self-employment, where social security and social insurance registrations are not provided by the employers, and the employees could not afford to pay their premiums regularly by themselves. These types of employees and small businesses mostly fall below the poverty line even if they may be observed as working <xref ref-type="bibr" rid="bib1.bibx3" id="paren.109"/>. Those households which depend on casual, unregistered employment and small businesses have a high probability of experiencing vulnerability when a disaster strikes, as they may experience loss of any economic means in that situation. There is an important difference between the job insecurity and social security variables. Job insecurity actually reflects the situation where the individual has no regular income; on the other hand, social security is covering all kinds of support and compensation mechanisms not only limited to the economic means of regular income. Although not limited to these, there might be several reasons for the difference between neighbourhoods in terms of these two variables. For example, it may be that, in the rural areas of northwest Istanbul, the individuals may not have social security, but they own their land and small businesses, and their jobs are more secure even though they may have a limited income <xref ref-type="bibr" rid="bib1.bibx2" id="paren.110"/>. In contrast, in the centre of the city  most of the  population is in wage employment, where a major group is in regular registered employment beside a significant group of the unemployed or those working on a daily basis in casual jobs <xref ref-type="bibr" rid="bib1.bibx2" id="paren.111"/>. Hence, unemployed or those in daily jobs may suffer job insecurity and a high risk of losing employment and/or income if caught by a hazard. Moreover, the individuals working in the service sector, which is common in Istanbul neighbourhoods, may suffer more from the possibility of work closures after a major hazard. For example, during the COVID-19 pandemic, when small workplaces have been required to close or restrict their services for a long period of time, most working people suffered severe job and income losses; hence, high vulnerability emerged <xref ref-type="bibr" rid="bib1.bibx13 bib1.bibx52" id="paren.112"/>. While Istanbul took a <inline-formula><mml:math id="M227" display="inline"><mml:mn mathvariant="normal">41.9</mml:mn></mml:math></inline-formula> % share of the total services sector in Türkiye in 2021, the share of the services sector in Istanbul's total gross domestic product was <inline-formula><mml:math id="M228" display="inline"><mml:mn mathvariant="normal">33.7</mml:mn></mml:math></inline-formula> % <xref ref-type="bibr" rid="bib1.bibx108" id="paren.113"/>.</p>
      <p id="d1e5283">The other variables among the top 10 most important predictors that contribute to the model performance of the ANN model were a mixture of demographic and economic<?pagebreak page2151?> variables. These included the ratio of over 65 year olds in the household, owning a house outside of Istanbul, household size, the ratio of income earners in the household, having savings for emergency situations, owning land outside of Istanbul, and the level of education of the residents. The demographic variable of having elderly (<inline-formula><mml:math id="M229" display="inline"><mml:mrow><mml:mo>&gt;</mml:mo><mml:mn mathvariant="normal">65</mml:mn></mml:mrow></mml:math></inline-formula> years) people in the household being an important predictor of social vulnerability to hazards is also highlighted in the literature <xref ref-type="bibr" rid="bib1.bibx24 bib1.bibx46" id="paren.114"/>. High education which lowers social vulnerability is a factor that is related to both having social security, as mentioned before, and an increase of awareness of taking precautions for possible hazards.  The other significant variables like having property and savings are both related to income, where property outside the city may give more chances for the households to have a safe shelter after a major hazard. Furthermore, the associations between income and level of education are strong and consistent; that is, children from poorer family backgrounds have a tendency towards achieving a lower level of education <xref ref-type="bibr" rid="bib1.bibx117" id="paren.115"/>. Also, the poor have less access to resources which may be effective  in  reducing risks, such as extra savings for preparing their houses for a hazard or accessing risk preparation information, and therefore cannot take as many precautions to cope with a disaster when it occurs <xref ref-type="bibr" rid="bib1.bibx55" id="paren.116"/>.</p>
</sec>
</sec>
<sec id="Ch1.S6">
  <label>6</label><title>Limitations and recommendations</title>
      <p id="d1e5314">Socially, economically, and environmentally vulnerable communities are more likely to suffer disproportionately from disasters <xref ref-type="bibr" rid="bib1.bibx27 bib1.bibx55" id="paren.117"/>. However, our analysis was based solely on quantifiable household data, since variables related to environmental factors, historical hazard data, and building infrastructure were not available in our survey-based dataset. Another important limitation is the fact that we are using social vulnerability index scores that are pre-constructed in previous social vulnerability research. As we aim to assist the social vulnerability assessment process of local authorities, which is IMM in our case, we do not tend to discuss their scoring scheme, as it is part of their official policy-making process, but we try to present them with a methodological approach based on machine learning techniques to identify the best possible predictors of social vulnerability. However, as urban growth and migration are common experiences in a vibrant city like Istanbul, by regeneration and renewal processes accelerating the trend, the location of residents is continuously changing, similar to the change in socio-economic positions of neighbourhoods, both upward and downward. This may result in a continuous change of status and a dynamic social vulnerability of households and neighbourhoods, which needs to be studied in further research.</p>
      <p id="d1e5320">Although assessing social vulnerability is a complex process that takes many personal and environmental factors into account, our predictors in the ML models were limited to quantifiable household data, as our aim in this paper is to present an optimal modelling strategy capable of processing readily available large databases. Therefore, the model accuracy with the final ANN model was relatively low compared to other studies which assessed social vulnerability to hazards with machine learning techniques. For future studies, we recommend using household data along with community-level spatial predictors to enhance the predictive ability of the models. We note that we could not perform an external validation of the ML models using an independent dataset due to the unavailability of such household data derived from another source. Although the models were tested using independent testing data from our survey data, the model predictions may benefit from validation studies which could be conducted using independent datasets.</p>
</sec>
<sec id="Ch1.S7" sec-type="conclusions">
  <label>7</label><title>Conclusions</title>
      <p id="d1e5331">This research presents a new and alternative approach for public authorities to develop ideas for future governance mechanisms to cope with social vulnerability based on interdisciplinarity as a combination of social and statistical science.  To address the social vulnerability predictors by using ML, we compared six different supervised machine learning techniques and logistic regression, which can be employed for binary classification with imbalanced class variables. We demonstrated that an ANN using majority under sampling was the optimum method in terms of sensitivity, AUC, and other relevant performance metrics. The variable importance results showed that economically deprived households which do not have social security and experience job insecurity, the ones living in squatter houses, and less educated individuals are more likely to have a high social vulnerability to hazards. We stress strongly that our research outcomes and demonstration of employing machine learning with large household-level data have the potential to support decision makers in developing more effective policies by making use of quantifiable household data, which are available across various institutions and public bodies. More explicitly, a policy maker can make use of our proposed final ANN model to discriminate between households with low and high social vulnerability by inputting the variables found significantly important in the study. Thus, the groups with certain characteristics which are more vulnerable may be prioritised by decision makers in terms of their needs in order to develop new schemes that are specifically targeted to reducing disaster-related vulnerabilities. This kind of targeted assistance is missing in Türkiye's local and national disaster risk reduction policies, though it is a part of the Sendai Framework <xref ref-type="bibr" rid="bib1.bibx111" id="paren.118"/>. Therefore, the local authorities, mainly municipalities, can benefit from the results of this study, to target poor groups to accommodate them in affordable disaster-resistant<?pagebreak page2152?> housing within urban renewal schemes; for improving social assistance for the elderly, children, youth, and the poor; and for increasing awareness-raising events. Also, the central authorities may define new policies for increasing access to education and to social security of the poor and the vulnerable groups. This study made use of machine learning methodology and assessed their performances on social data based on an interdisciplinary collaboration where the statistics, urban planning, and sociology disciplines intersect to understand the significance of assessing social vulnerability at the household level and how to build a society more resilient to disasters.</p>
</sec>

      
      </body>
    <back><notes notes-type="codeavailability"><title>Code availability</title>

      <p id="d1e5341">R codes can be obtained by contacting Oya Kalaycioglu at her e-mail address: oyakalaycioglu@ibu.edu.tr. Codes for the R Shiny web application (<uri>https://oyakalaycioglu.shinyapps.io/Social_Vulnerability/</uri>, <xref ref-type="bibr" rid="bib1.bibx62" id="altparen.119"/>) can be obtained by contacting Serhat Emre Akhanli at his e-mail address: serhatakhanli@mu.edu.tr.​​​​​​​</p>
  </notes><notes notes-type="dataavailability"><title>Data availability</title>

      <p id="d1e5353">Data are available from the authors with the permission of Istanbul Metropolitan Municipality, Directorate of Earthquake and Ground Research.</p>
  </notes><app-group>
        <supplementary-material position="anchor"><p id="d1e5356">The supplement related to this article is available online at: <inline-supplementary-material xlink:href="https://doi.org/10.5194/nhess-23-2133-2023-supplement" xlink:title="pdf">https://doi.org/10.5194/nhess-23-2133-2023-supplement</inline-supplementary-material>.</p></supplementary-material>
        </app-group><notes notes-type="authorcontribution"><title>Author contributions</title>

      <p id="d1e5365">OK and EYM planned the initial concept of the study. OK led the writing of the paper, with contributions from all the co-authors. OK and SEA implemented the data analysis, trained ML models, and designed the tables, figures, and R Shiny web application. EYM obtained the data and designed Fig. 5. MK and SK wrote literature review on social vulnerability and the discussion on important predictors. All the authors critically reviewed the paper.</p>
  </notes><notes notes-type="competinginterests"><title>Competing interests</title>

      <p id="d1e5371">The contact author has declared that none of the authors has any competing interests.</p>
  </notes><notes notes-type="disclaimer"><title>Disclaimer</title>

      <p id="d1e5378">Publisher's note: Copernicus Publications remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.</p>
  </notes><notes notes-type="sistatement"><title>Special issue statement</title>

      <p id="d1e5384">This article is part of the special issue “Advances in machine learning for natural hazards risk assessment”. It is not associated with a conference.</p>
  </notes><ack><title>Acknowledgements</title><p id="d1e5390">The authors wish to thank research colleague Kezban Celik. The authors also gratefully acknowledge Istanbul Metropolitan Municipality, Directorate of Earthquake and Ground Investigation, for providing permission to use the survey data.</p></ack><notes notes-type="reviewstatement"><title>Review statement</title>

      <p id="d1e5395">This paper was edited by Sabine Loos and reviewed by Yi Victor Wang and Jocelyn West.</p>
  </notes><ref-list>
    <title>References</title>

      <ref id="bib1.bibx1"><?xmltex \def\ref@label{{Abarca-Alvarez et~al.(2019)}}?><label>Abarca-Alvarez et al.(2019)</label><?label abarca-alvarez_decision_2019?><mixed-citation>Abarca-Alvarez, F. J., Reinoso-Bellido, R., and Campos-Sánchez, F. S.: Decision Model for Predicting Social Vulnerability Using Artificial Intelligence, ISPRS Int. J. Geo-Inf., 8, 575, <ext-link xlink:href="https://doi.org/10.3390/ijgi8120575" ext-link-type="DOI">10.3390/ijgi8120575</ext-link>, 2019.</mixed-citation></ref>
      <ref id="bib1.bibx2"><?xmltex \def\ref@label{{Acar et~al.(2022)}}?><label>Acar et al.(2022)</label><?label acar_ilcelerin_2022?><mixed-citation>Acar, s., Karagoz, T., Meydan, M. C., Sahin Cinoglu, D., Kaygisiz, G., and
Isik, M.: Ilcelerin sosyo-ekonomik gelismislik siralamasi arastirmasi –
SEGE 2022 (Research on the socio-econimic development ranking of
districts), Tech. Rep. 35, Republic Of Turkey Ministry of Industry and
Technology, General Directorate of Development Agencies,
<uri>https://www.sanayi.gov.tr/merkez-birimi/b94224510b7b/sege</uri> (last access: 20 March 2023), 2022.</mixed-citation></ref>
      <ref id="bib1.bibx3"><?xmltex \def\ref@label{{Adaman et~al.(2015)}}?><label>Adaman et al.(2015)</label><?label adaman_espn_2015?><mixed-citation>Adaman, F., Aslan, D., Erus, B., and Sayan, S.: ESPN Thematic Report on  in-work poverty in Turkey, Tech. rep., European Commission, Brussels, <uri>https://ec.europa.eu/social/BlobServlet?docId=21089&amp;langId=en</uri>​​​​​​​ (last access: 20 March 2023), 2015.</mixed-citation></ref>
      <ref id="bib1.bibx4"><?xmltex \def\ref@label{{AFAD(2019)}}?><label>AFAD(2019)</label><?label afad_disaster_2019?><mixed-citation>AFAD: Disaster and Management Presidency of Turkey – 2019 Overview of Disaster Management and Natural Disaster Statistics, Tech. rep., AFAD, <uri>https://en.afad.gov.tr/kurumlar/en.afad/Afet_Istatistikleri_2020_eng_1.pdf</uri>​​​​​​​ (last access: 26 March 2023), 2019.</mixed-citation></ref>
      <ref id="bib1.bibx5"><?xmltex \def\ref@label{{Akhanli and Hennig(2020)}}?><label>Akhanli and Hennig(2020)</label><?label akhanli_comparing_2020?><mixed-citation>Akhanli, S. E. and Hennig, C.: Comparing clusterings and numbers of clusters by aggregation of calibrated clustering validity indexes, Stat. Comput., 30, 1523–1544, <ext-link xlink:href="https://doi.org/10.1007/s11222-020-09958-2" ext-link-type="DOI">10.1007/s11222-020-09958-2</ext-link>, 2020.</mixed-citation></ref>
      <ref id="bib1.bibx6"><?xmltex \def\ref@label{{Aksha et~al.(2019)}}?><label>Aksha et al.(2019)</label><?label aksha_analysis_2019?><mixed-citation>Aksha, S. K., Juran, L., Resler, L. M., and Zhang, Y.: An Analysis of Social Vulnerability to Natural Hazards in Nepal Using a Modified Social Vulnerability Index, Int. J. Disast. Risk Sc., 10, 103–116, <ext-link xlink:href="https://doi.org/10.1007/s13753-018-0192-7" ext-link-type="DOI">10.1007/s13753-018-0192-7</ext-link>, 2019.</mixed-citation></ref>
      <ref id="bib1.bibx7"><?xmltex \def\ref@label{{Alizadeh et~al.(2018)}}?><label>Alizadeh et al.(2018)</label><?label alizadeh_social_2018?><mixed-citation>Alizadeh, M., Alizadeh, E., Asadollahpour Kotenaee, S., Shahabi, H., Beiranvand Pour, A., Panahi, M., Bin Ahmad, B., and Saro, L.: Social Vulnerability Assessment Using Artificial Neural Network (ANN) Model for Earthquake Hazard in Tabriz City, Iran, Sustainability,
10, 3376, <ext-link xlink:href="https://doi.org/10.3390/su10103376" ext-link-type="DOI">10.3390/su10103376</ext-link>, 2018.</mixed-citation></ref>
      <ref id="bib1.bibx8"><?xmltex \def\ref@label{{Armaş(2008)}}?><label>Armaş(2008)</label><?label armas_social_2008?><mixed-citation>Armaş, I.: Social vulnerability and seismic risk perception. Case study: the historic center of the Bucharest Municipality/Romania, Nat. Hazards, 47, 397–410, <ext-link xlink:href="https://doi.org/10.1007/s11069-008-9229-3" ext-link-type="DOI">10.1007/s11069-008-9229-3</ext-link>, 2008.</mixed-citation></ref>
      <ref id="bib1.bibx9"><?xmltex \def\ref@label{{Atun and Menoni(2014)}}?><label>Atun and Menoni(2014)</label><?label atun_vulnerability_2014?><mixed-citation>Atun, F. and Menoni, S.: Vulnerability to earthquake in Istanbul: application of the ENSURE methodology, Orhan Hacihasanoglu ITU Faculty of Architecture, A/Z ITU Journal of the Faculty of Architecture, 11, 99–116, <uri>https://research.utwente.nl/en/publications/vulnerability-to-earthquake-in-istanbul-application-of-the-ensure</uri> (last access: 18 March 2023), 2014.</mixed-citation></ref>
      <ref id="bib1.bibx10"><?xmltex \def\ref@label{{Bakkensen et~al.(2017)}}?><label>Bakkensen et al.(2017)</label><?label bakkensen_validating_2017?><mixed-citation>
Bakkensen, L. A., Fox‐Lent, C., Read, L. K., and Linkov, I.: Validating
resilience and vulnerability indices in the context of natural disasters,
Risk Anal., 37, 982–1004, 2017.</mixed-citation></ref>
      <?pagebreak page2153?><ref id="bib1.bibx11"><?xmltex \def\ref@label{{Bakker et~al.(2019)}}?><label>Bakker et al.(2019)</label><?label bakker_beyond_2019?><mixed-citation>Bakker, A., Cai, J., English, L., Kaiser, G., Mesa, V., and Van Dooren, W.:
Beyond small, medium, or large: points of consideration when interpreting
effect sizes, Educ. Stud. Math., 102, 1–8,
<ext-link xlink:href="https://doi.org/10.1007/s10649-019-09908-4" ext-link-type="DOI">10.1007/s10649-019-09908-4</ext-link>, 2019.</mixed-citation></ref>
      <ref id="bib1.bibx12"><?xmltex \def\ref@label{{Baris(2009)}}?><label>Baris(2009)</label><?label baris_effectiveness_2009?><mixed-citation>
Baris, M.: Effectiveness of Turkish disaster management system and recommendations, Biotechnol. Biotec. Eq., 23, 1391–1398, 2009.</mixed-citation></ref>
      <ref id="bib1.bibx13"><?xmltex \def\ref@label{{Bartik et~al.(2020)}}?><label>Bartik et al.(2020)</label><?label bartik_impact_2020?><mixed-citation>
Bartik, A. W., Bertrand, M., Cullen, Z., Glaeser, E. L., Luca, M., and Stanton, C.: The impact of COVID-19 on small business outcomes and expectations, P. Natl. Acad. Sci. USA, 117, 17656–17666, 2020.</mixed-citation></ref>
      <ref id="bib1.bibx14"><?xmltex \def\ref@label{{Basile~Ibrahim et~al.(2021)}}?><label>Basile Ibrahim et al.(2021)</label><?label basile_ibrahim_association_2021?><mixed-citation>Basile Ibrahim, B., Barcelona, V., Condon, E. M., Crusto, C. A., and Taylor,
J. Y.: The Association Between Neighborhood Social Vulnerability
and Cardiovascular Health Risk among Black/African American
Women in the InterGEN Study, Nurs. Res., 70, S3–S12,
<ext-link xlink:href="https://doi.org/10.1097/NNR.0000000000000523" ext-link-type="DOI">10.1097/NNR.0000000000000523</ext-link>, 2021.</mixed-citation></ref>
      <ref id="bib1.bibx15"><?xmltex \def\ref@label{{Batista et~al.(2004)}}?><label>Batista et al.(2004)</label><?label batista_study_2004?><mixed-citation>Batista, G. E. A. P. A., Prati, R. C., and Monard, M. C.: A Study of the
Behavior of Several Methods for Balancing Machine Learning
Training Data, SIGKDD Explor. Newsl., 6, 20–29,
<ext-link xlink:href="https://doi.org/10.1145/1007730.1007735" ext-link-type="DOI">10.1145/1007730.1007735</ext-link>, 2004.</mixed-citation></ref>
      <ref id="bib1.bibx16"><?xmltex \def\ref@label{{Bergstrand et~al.(2015)}}?><label>Bergstrand et al.(2015)</label><?label bergstrand_assessing_2015?><mixed-citation>Bergstrand, K., Mayer, B., Brumback, B., and Zhang, Y.: Assessing the
Relationship Between Social Vulnerability and Community
Resilience to Hazards, Soc. Indic. Res., 122, 391–409,
<ext-link xlink:href="https://doi.org/10.1007/s11205-014-0698-3" ext-link-type="DOI">10.1007/s11205-014-0698-3</ext-link>, 2015.</mixed-citation></ref>
      <ref id="bib1.bibx17"><?xmltex \def\ref@label{{Birkmann and Wisner(2006)}}?><label>Birkmann and Wisner(2006)</label><?label birkmann_measuring_2006?><mixed-citation>
Birkmann, J. and Wisner, B.: Measuring the unmeasurable: the challenge of vulnerability, UNU-EHS – United Nations University – Institute for Environment and Human Security, vol. 5, Bonn, Germany, 64 pp., ISBN 3981058267, 2006.</mixed-citation></ref>
      <ref id="bib1.bibx18"><?xmltex \def\ref@label{{Bjarnadottir et~al.(2011)}}?><label>Bjarnadottir et al.(2011)</label><?label bjarnadottir_social_2011?><mixed-citation>
Bjarnadottir, S., Li, Y., and Stewart, M. G.: Social vulnerability index for
coastal communities at risk to hurricane hazard and a changing climate,
Nat. Hazards, 59, 1055–1075, 2011.</mixed-citation></ref>
      <ref id="bib1.bibx19"><?xmltex \def\ref@label{{Burton et~al.(2018)}}?><label>Burton et al.(2018)</label><?label burton_social_2018?><mixed-citation>Burton, C., Rufat, S., and Tate, E.: Social vulnerability: Conceptual
Foundations and Geospatial Modeling, Vulnerability and resilience to
natural hazards, Cambridge University Press, 53–81, <ext-link xlink:href="https://doi.org/10.1017/9781316651148" ext-link-type="DOI">10.1017/9781316651148</ext-link>, 2018.</mixed-citation></ref>
      <ref id="bib1.bibx20"><?xmltex \def\ref@label{{Buskirk et~al.(2018)}}?><label>Buskirk et al.(2018)</label><?label buskirk_introduction_2018?><mixed-citation>Buskirk, T. D., Kirchner, A., Eck, A., and Signorino, C. S.: An Introduction to Machine Learning Methods for Survey Researchers, Survey Practice, 11, <ext-link xlink:href="https://doi.org/10.29115/SP-2018-0004" ext-link-type="DOI">10.29115/SP-2018-0004</ext-link>, 2018.</mixed-citation></ref>
      <ref id="bib1.bibx21"><?xmltex \def\ref@label{{Cannon(2008)}}?><label>Cannon(2008)</label><?label cannon_reducing_2008?><mixed-citation>Cannon, T.: Reducing People's Vulnerability to Natural Hazards: Communities and Resilience, World Institute for Development Economic Research (UNU-WIDER), WIDER Working Paper Series, RP2008-34, <uri>https://ideas.repec.org/p/unu/wpaper/rp2008-34.html</uri> (last access: 20 December 2022), 2008.</mixed-citation></ref>
      <ref id="bib1.bibx22"><?xmltex \def\ref@label{{Chawla et~al.(2002)}}?><label>Chawla et al.(2002)</label><?label chawla_smote_2002?><mixed-citation>Chawla, N. V., Bowyer, K. W., Hall, L. O., and Kegelmeyer, W. P.: SMOTE:
Synthetic Minority Over-sampling Technique, J. Artif. Intell. Res.​​​​​​​, 16, 321–357, <ext-link xlink:href="https://doi.org/10.1613/jair.953" ext-link-type="DOI">10.1613/jair.953</ext-link>, 2002.</mixed-citation></ref>
      <ref id="bib1.bibx23"><?xmltex \def\ref@label{{Chen et~al.(2013)}}?><label>Chen et al.(2013)</label><?label chen_measuring_2013?><mixed-citation>
Chen, W., Cutter, S. L., Emrich, C. T., and Shi, P.: Measuring social
vulnerability to natural hazards in the Yangtze River Delta region,
China, Int. J. Disast. Risk Sc., 4, 169–181, 2013.</mixed-citation></ref>
      <ref id="bib1.bibx24"><?xmltex \def\ref@label{{Chou et~al.(2004)}}?><label>Chou et al.(2004)</label><?label chou_who_2004?><mixed-citation>
Chou, Y.-J., Huang, N., Lee, C.-H., Tsai, S.-L., Chen, L.-S., and Chang, H.-J.: Who is at risk of death in an earthquake?, Am. J. Epidemiol., 160, 688–695, 2004.</mixed-citation></ref>
      <ref id="bib1.bibx25"><?xmltex \def\ref@label{{Çolak and Sunar(2020)}}?><label>Çolak and Sunar(2020)</label><?label colak_importance_2020?><mixed-citation>Çolak, E. and Sunar, F.: The importance of ground-truth and crowdsourcing data for the statistical and spatial analyses of the NASA FIRMS active fires in the Mediterranean Turkish forests, Remote Sensing Applications: Society and Environment, 19, 100327, <ext-link xlink:href="https://doi.org/10.1016/j.rsase.2020.100327" ext-link-type="DOI">10.1016/j.rsase.2020.100327</ext-link>,
2020.</mixed-citation></ref>
      <ref id="bib1.bibx26"><?xmltex \def\ref@label{{Couronné et~al.(2018)}}?><label>Couronné et al.(2018)</label><?label couronne_random_2018?><mixed-citation>Couronné, R., Probst, P., and Boulesteix, A.-L.: Random forest versus logistic regression: a large-scale benchmark experiment, BMC Bioinformatics, 19, 270, <ext-link xlink:href="https://doi.org/10.1186/s12859-018-2264-5" ext-link-type="DOI">10.1186/s12859-018-2264-5</ext-link>, 2018.</mixed-citation></ref>
      <ref id="bib1.bibx27"><?xmltex \def\ref@label{{Cureton(2011)}}?><label>Cureton(2011)</label><?label cureton_environmental_2011?><mixed-citation>Cureton, S.: Environmental victims: environmental injustice issues that
threaten the health of children living in poverty, Rev. Environ. Health, 26, 141–147, <ext-link xlink:href="https://doi.org/10.1515/reveh.2011.021" ext-link-type="DOI">10.1515/reveh.2011.021</ext-link>, 2011.</mixed-citation></ref>
      <ref id="bib1.bibx28"><?xmltex \def\ref@label{{Cutter et~al.(2009)}}?><label>Cutter et al.(2009)</label><?label cutter_social_2009?><mixed-citation>Cutter, S., Emrich, C., Haney (Webb), J., and Morath, D.: Social Vulnerability to Climate Variability Hazards: A Review of the Literature, Final Report to Oxfam America, 1–44, <uri>https://citeseerx.ist.psu.edu/document?repid=rep1&amp;type=pdf&amp;doi=e0708976f51536074aba4cf7fd5375d9c8f58c2b</uri>
(last access: 20 March 2023), 2009.</mixed-citation></ref>
      <ref id="bib1.bibx29"><?xmltex \def\ref@label{{Cutter and Finch(2008)}}?><label>Cutter and Finch(2008)</label><?label cutter_temporal_2008?><mixed-citation>
Cutter, S. L. and Finch, C.: Temporal and spatial changes in social
vulnerability to natural hazards, P. Natl. Acad. Sci. USA, 105, 2301–2306, 2008.</mixed-citation></ref>
      <ref id="bib1.bibx30"><?xmltex \def\ref@label{{Cutter et~al.(2000)}}?><label>Cutter et al.(2000)</label><?label cutter_revealing_2000?><mixed-citation>Cutter, S. L., Mitchell, J. T., and Scott, M. S.: Revealing the Vulnerability of People and Places: A Case Study of Georgetown County, South Carolina, Ann. Assoc. Am. Geogr., 90, 713–737, <ext-link xlink:href="https://doi.org/10.1111/0004-5608.00219" ext-link-type="DOI">10.1111/0004-5608.00219</ext-link>, 2000.</mixed-citation></ref>
      <ref id="bib1.bibx31"><?xmltex \def\ref@label{{Cutter et~al.(2003)}}?><label>Cutter et al.(2003)</label><?label cutter_social_2003?><mixed-citation>Cutter, S. L., Boruff, B. J., and Shirley, W. L.: Social Vulnerability to
Environmental Hazards, Soc. Sci. Quart., 84, 242–261,
<ext-link xlink:href="https://doi.org/10.1111/1540-6237.8402002" ext-link-type="DOI">10.1111/1540-6237.8402002</ext-link>, 2003.</mixed-citation></ref>
      <ref id="bib1.bibx32"><?xmltex \def\ref@label{{Debesai(2020)}}?><label>Debesai(2020)</label><?label debesai_factors_2020?><mixed-citation>Debesai, M. G.: Factors affecting vulnerability level of farming households to climate change in developing countries: evidence from Eritrea, IOP Conf. Ser.-Mat. Sci., 1001, 012093, <ext-link xlink:href="https://doi.org/10.1088/1757-899x/1001/1/012093" ext-link-type="DOI">10.1088/1757-899x/1001/1/012093</ext-link>, 2020.</mixed-citation></ref>
      <ref id="bib1.bibx33"><?xmltex \def\ref@label{{DeLong et~al.(1988)}}?><label>DeLong et al.(1988)</label><?label delong_comparing_1988?><mixed-citation>DeLong, E. R., DeLong, D. M., and Clarke-Pearson, D. L.: Comparing the Areas under Two or More Correlated Receiver Operating Characteristic Curves: A Nonparametric Approach, Biometrics, 44, 837–845, <ext-link xlink:href="https://doi.org/10.2307/2531595" ext-link-type="DOI">10.2307/2531595</ext-link>, 1988.</mixed-citation></ref>
      <ref id="bib1.bibx34"><?xmltex \def\ref@label{{de~Oliveira~Mendes(2009)}}?><label>de Oliveira Mendes(2009)</label><?label de_oliveira_mendes_social_2009?><mixed-citation>de Oliveira Mendes, J. M.: Social vulnerability indexes as planning tools:
beyond the preparedness paradigm, J. Risk Res., 12, 43–58,
<ext-link xlink:href="https://doi.org/10.1080/13669870802447962" ext-link-type="DOI">10.1080/13669870802447962</ext-link>, 2009.</mixed-citation></ref>
      <ref id="bib1.bibx35"><?xmltex \def\ref@label{{Di~Franco and Santurro(2020)}}?><label>Di Franco and Santurro(2020)</label><?label di_franco_machine_2020?><mixed-citation>Di Franco, G. and Santurro, M.: Machine learning, artificial neural networks
and social research, Qual. Quant., 55, 1007–1025, <ext-link xlink:href="https://doi.org/10.1007/s11135-020-01037-y" ext-link-type="DOI">10.1007/s11135-020-01037-y</ext-link>, 2020.</mixed-citation></ref>
      <ref id="bib1.bibx36"><?xmltex \def\ref@label{{Dodman et~al.(2013)}}?><label>Dodman et al.(2013)</label><?label dodman_understanding_2013?><mixed-citation>Dodman, D., Brown, D., Francis, K., Hardoy, J., Johnson, C., and Satterthwaite, D.: Understanding the nature and scale of urban risk in low- and middleincome countries and its implications for humanitarian preparedness, planning and response, Tech. rep., International Institute for Environment and Development, <uri>http://pubs.iied.org/10624IIED.html</uri> (last access: 1 April 2023), 2013.</mixed-citation></ref>
      <ref id="bib1.bibx37"><?xmltex \def\ref@label{{Dunning and Durden(2011)}}?><label>Dunning and Durden(2011)</label><?label dunning_social_2011?><mixed-citation>Dunning, C. and Durden, S.: Social vulnerability analysis methods for Corps planning, Tech. rep., US Army Corps of Engineers, <uri>https://www.iwr.usace.army.mil/portals/70/docs/iwrreports/2011-r-07.pdf</uri> (last access: 15 December 2022), 2011.</mixed-citation></ref>
      <ref id="bib1.bibx38"><?xmltex \def\ref@label{{Durahim(2016)}}?><label>Durahim(2016)</label><?label durahim_comparison_2016?><mixed-citation>
Durahim, A. O.: Comparison of sampling techniques for imbalanced learning,
Yönetim Bilişim Sistemleri Dergisi, 2, 181–191, 2016.</mixed-citation></ref>
      <ref id="bib1.bibx39"><?xmltex \def\ref@label{{Dwyer et~al.(2004)}}?><label>Dwyer et al.(2004)</label><?label dwyer_quantifying_2004?><mixed-citation>Dwyer, A., Zoppou, C., Nielsen, O., Day, S., and Roberts, S.: Quantifying social vulnerability: a methodology for identifying those at risk to natural hazards, Geoscience Australia Canberra, <uri>https://d28rz98at9flks.cloudfront.net/61168/Rec2004_014.pdf</uri>​​​​​​​ (last access: 3 December 2022), 2004.</mixed-citation></ref>
      <ref id="bib1.bibx40"><?xmltex \def\ref@label{{Emrich et~al.(2014)}}?><label>Emrich et al.(2014)</label><?label emrich_climate-sensitive_2014?><mixed-citation>Emrich, C., Morath, D., Morath, G., and Reeves, R.: Climate-sensitive hazards
in Florida: identifying and prioritizing threats to b<?pagebreak page2154?>uild resilience
against climate effects, Hazard Vulnerability Res. Inst.
Columbia, Columbia, SC, USA, <uri>https://www.floridahealth.gov/environmental-health/climate-and-health/_documents/climate-sensitive-hazards-in-florida-final-report.pdf</uri> (last access: 16 December 2023), 2014.</mixed-citation></ref>
      <ref id="bib1.bibx41"><?xmltex \def\ref@label{{Enarson et~al.(2018)}}?><label>Enarson et al.(2018)</label><?label enarson_gender_2018?><mixed-citation>Enarson, E., Fothergill, A., and Peek, L.: Gender and disaster: Foundations and new directions for research and practice, Handbook of disaster research, Springer, 205–223, <ext-link xlink:href="https://doi.org/10.1007/978-3-319-63254-4_11" ext-link-type="DOI">10.1007/978-3-319-63254-4_11</ext-link>, 2018.</mixed-citation></ref>
      <ref id="bib1.bibx42"><?xmltex \def\ref@label{{Erdik et~al.(2003)}}?><label>Erdik et al.(2003)</label><?label erdik_earthquake_2003?><mixed-citation>Erdik, M., Aydinoglu, N., Fahjan, Y., Sesetyan, K., Demircioglu, M., Siyahi, B., Durukal, E., Ozbey, C., Biro, Y., Akman, H., and Yuzugullu, O.: Earthquake risk assessment for Istanbul metropolitan area, Earthq. Eng. Eng. Vib., 2, 1–23, <ext-link xlink:href="https://doi.org/10.1007/BF02857534" ext-link-type="DOI">10.1007/BF02857534</ext-link>, 2003.</mixed-citation></ref>
      <ref id="bib1.bibx43"><?xmltex \def\ref@label{{Ersoy and Ko\c{c}ak(2016)}}?><label>Ersoy and Koçak(2016)</label><?label ersoy_disasters_2016?><mixed-citation>Ersoy, S. and Koçak, A.: Disasters and earthquake preparedness of children and schools in Istanbul, Turkey, Geomat. Nat. Haz. Risk​​​​​​​, 7, 1307–1336, <ext-link xlink:href="https://doi.org/10.1080/19475705.2015.1060637" ext-link-type="DOI">10.1080/19475705.2015.1060637</ext-link>, 2016.</mixed-citation></ref>
      <ref id="bib1.bibx44"><?xmltex \def\ref@label{{Esposito et~al.(2021)}}?><label>Esposito et al.(2021)</label><?label esposito_ghost_2021?><mixed-citation>Esposito, C., Landrum, G. A., Schneider, N., Stiefl, N., and Riniker, S.:
GHOST: Adjusting the Decision Threshold to Handle Imbalanced
Data in Machine Learning, J. Chem. Inf. Model., 61, 2623–2640,
<ext-link xlink:href="https://doi.org/10.1021/acs.jcim.1c00160" ext-link-type="DOI">10.1021/acs.jcim.1c00160</ext-link>, 2021.</mixed-citation></ref>
      <ref id="bib1.bibx45"><?xmltex \def\ref@label{{Evans and Kantrowitz(2002)}}?><label>Evans and Kantrowitz(2002)</label><?label evans_socioeconomic_2002?><mixed-citation>
Evans, G. W. and Kantrowitz, E.: Socioeconomic status and health: the potential role of environmental risk exposure, Annu. Rev. Publ. Health, 23, 303–331, 2002.</mixed-citation></ref>
      <ref id="bib1.bibx46"><?xmltex \def\ref@label{{Fatemi et~al.(2017)}}?><label>Fatemi et al.(2017)</label><?label fatemi_social_2017?><mixed-citation>Fatemi, F., Ardalan, A., Aguirre, B., Mansouri, N., and Mohammadfam, I.: Social vulnerability indicators in disasters: Findings from a systematic review, Int. J. Disast. Risk Re., 22, 219–227,
<ext-link xlink:href="https://doi.org/10.1016/j.ijdrr.2016.09.006" ext-link-type="DOI">10.1016/j.ijdrr.2016.09.006</ext-link>, 2017.</mixed-citation></ref>
      <ref id="bib1.bibx47"><?xmltex \def\ref@label{{Fekete(2009)}}?><label>Fekete(2009)</label><?label fekete_validation_2009?><mixed-citation>Fekete, A.: Validation of a social vulnerability index in context to river-floods in Germany, Nat. Hazards Earth Syst. Sci., 9, 393–403, <ext-link xlink:href="https://doi.org/10.5194/nhess-9-393-2009" ext-link-type="DOI">10.5194/nhess-9-393-2009</ext-link>, 2009.</mixed-citation></ref>
      <ref id="bib1.bibx48"><?xmltex \def\ref@label{{Flanagan et~al.(2011)}}?><label>Flanagan et al.(2011)</label><?label flanagan_social_2011?><mixed-citation>Flanagan, B. E., Gregory, E. W., Hallisey, E. J., Heitgerd, J. L., and Lewis, B.: A social vulnerability index for disaster management, J. Homel. Secur. Emerg., 8, <ext-link xlink:href="https://doi.org/10.2202/1547-7355.1792" ext-link-type="DOI">10.2202/1547-7355.1792</ext-link>, 2011.</mixed-citation></ref>
      <ref id="bib1.bibx49"><?xmltex \def\ref@label{{Fritz et~al.(2012)}}?><label>Fritz et al.(2012)</label><?label fritz_effect_2012?><mixed-citation>Fritz, C. O., Morris, P. E., and Richler, J. J.: Effect size estimates: Current use, calculations, and interpretation, J. Exp. Psychol. Gen., 141, 2–18, <ext-link xlink:href="https://doi.org/10.1037/a0024338" ext-link-type="DOI">10.1037/a0024338</ext-link>, 2012.</mixed-citation></ref>
      <ref id="bib1.bibx50"><?xmltex \def\ref@label{{Garson(1991)}}?><label>Garson(1991)</label><?label garson_interpreting_1991?><mixed-citation>
Garson, G. D.: Interpreting Neural-Network Connection Weights, AI Expert, 6, 46–51, 1991.</mixed-citation></ref>
      <ref id="bib1.bibx51"><?xmltex \def\ref@label{{Gelman(2008)}}?><label>Gelman(2008)</label><?label gelman_scaling_2008?><mixed-citation>Gelman, A.: Scaling regression inputs by dividing by two standard deviations,
Stat. Med., 27, 2865–2873, <ext-link xlink:href="https://doi.org/10.1002/sim.3107" ext-link-type="DOI">10.1002/sim.3107</ext-link>, 2008.</mixed-citation></ref>
      <ref id="bib1.bibx52"><?xmltex \def\ref@label{{Gray et~al.(2022)}}?><label>Gray et al.(2022)</label><?label gray_characteristics_2022?><mixed-citation>
Gray, B. J., Kyle, R. G., Song, J., and Davies, A. R.: Characteristics of those most vulnerable to employment changes during the COVID-19 pandemic: a
nationally representative cross-sectional study in Wales, J. Epidemiol.
Commun. H., 76, 8–15, 2022.</mixed-citation></ref>
      <ref id="bib1.bibx53"><?xmltex \def\ref@label{{Green(2008)}}?><label>Green(2008)</label><?label green_unauthorised_2008?><mixed-citation>Green, R. A.: Unauthorised development and seismic hazard vulnerability: a
study of squatters and engineers in Istanbul, Turkey, Disasters, 32,
358–376, <ext-link xlink:href="https://doi.org/10.1111/j.1467-7717.2008.01044.x" ext-link-type="DOI">10.1111/j.1467-7717.2008.01044.x</ext-link>,
2008.</mixed-citation></ref>
      <ref id="bib1.bibx54"><?xmltex \def\ref@label{{Guillard-Gonçalves et~al.(2015)}}?><label>Guillard-Gonçalves et al.(2015)</label><?label guillard-goncalves_application_2015?><mixed-citation>
Guillard-Gonçalves, C., Cutter, S. L., Emrich, C. T., and Zêzere, J. L.: Application of Social Vulnerability Index (SoVI) and delineation of natural risk zones in Greater Lisbon, Portugal, J. Risk Res., 18, 651–674, 2015.</mixed-citation></ref>
      <ref id="bib1.bibx55"><?xmltex \def\ref@label{{Hallegatte et~al.(2020)}}?><label>Hallegatte et al.(2020)</label><?label hallegatte_poverty_2020?><mixed-citation>Hallegatte, S., Vogt-Schilb, A., Rozenberg, J., Bangalore, M., and Beaudet, C.: From Poverty to Disaster and Back: a Review of the Literature,
EconDisCliCha, 4, 223–247, <ext-link xlink:href="https://doi.org/10.1007/s41885-020-00060-5" ext-link-type="DOI">10.1007/s41885-020-00060-5</ext-link>, 2020.</mixed-citation></ref>
      <ref id="bib1.bibx56"><?xmltex \def\ref@label{{Hennig and Liao(2013)}}?><label>Hennig and Liao(2013)</label><?label hennig_how_2013?><mixed-citation>Hennig, C. and Liao, T. F.: How to find an appropriate clustering for mixed-type variables with application to socio-economic stratification, J. Roy. Stat. Soc. C-App., 62, 309–369, <ext-link xlink:href="https://doi.org/10.1111/j.1467-9876.2012.01066.x" ext-link-type="DOI">10.1111/j.1467-9876.2012.01066.x</ext-link>, 2013.</mixed-citation></ref>
      <ref id="bib1.bibx57"><?xmltex \def\ref@label{{Holand and Lujala(2013)}}?><label>Holand and Lujala(2013)</label><?label holand_replicating_2013?><mixed-citation>Holand, I. S. and Lujala, P.: Replicating and Adapting an Index of Social Vulnerability to a New Context: A Comparison Study for Norway, Prof. Geogr., 65, 312–328, <ext-link xlink:href="https://doi.org/10.1080/00330124.2012.681509" ext-link-type="DOI">10.1080/00330124.2012.681509</ext-link>, 2013.</mixed-citation></ref>
      <ref id="bib1.bibx58"><?xmltex \def\ref@label{{Holand et~al.(2011)}}?><label>Holand et al.(2011)</label><?label holand_social_2011?><mixed-citation>Holand, I. S., Lujala, P., and Rød, J. K.: Social vulnerability assessment for Norway: A quantitative approach, Norsk Geogr. Tidsskr.​​​​​​​, 65, 1–17, <ext-link xlink:href="https://doi.org/10.1080/00291951.2010.550167" ext-link-type="DOI">10.1080/00291951.2010.550167</ext-link>, 2011.</mixed-citation></ref>
      <ref id="bib1.bibx59"><?xmltex \def\ref@label{{Hosmer et~al.(2013)}}?><label>Hosmer et al.(2013)</label><?label hosmer_applied_2013?><mixed-citation>Hosmer, D. W., Lemeshow, S., and Sturdivant, R. X.: Applied Logistic Regression, Wiley Series in Probability and Statistics, Wiley, 1st edn., <uri>https://onlinelibrary.wiley.com/doi/book/10.1002/9781118548387</uri> (last access: 15 November 2022), 2013.</mixed-citation></ref>
      <ref id="bib1.bibx60"><?xmltex \def\ref@label{{IMM(2018)}}?><label>IMM(2018)</label><?label imm_afetler_2018?><mixed-citation>Istanbul Metropolitan Municipality (IMM): Afetler Karsisinda Sosyal Hasargörebilirlik Sonuç Raporu (Final Report of Survey Study for Social Vulnerability to Natural Disasters), Istanbul Metropolitan Municipality (IMM)  Directorate of Earthquake and Ground Research, Tech. rep., <ext-link xlink:href="https://depremzemin.ibb.istanbul/calismalarimiz/tamamlanmiscalismalar/istanbul-ili-genelinde-afetler-karsisinda-sosyalhasar-gorebilirlik-arastirmasi/">https://depremzemin.ibb.istanbul/ calismalarimiz/tamamlanmiscalismalar/istanbul-ili-genelinde-afetler-karsisinda-sosyalhasar-gorebilirlik-arastirmasi/</ext-link> (last access: 20 April 2023), 2018.</mixed-citation></ref>
      <ref id="bib1.bibx61"><?xmltex \def\ref@label{{IMM and KOERI(2019)}}?><label>IMM and KOERI(2019)</label><?label imm_istanbul_2019?><mixed-citation>Istanbul Metropolitan Municipality (IMM)  and Kandilli Observatory Earthquake Research Institution (KOERI): İstanbul İli Olası Deprem Kayıp Tahminlerinin Güncellenmesi Projesi (Updating The Earthquake Loss Estimation for Istanbul), Istanbul Metropolitan Municipality (IMM)  and Kandilli Observatory Earthquake Research Institution (KOERI), <ext-link xlink:href="https://depremzemin.ibb.istanbul/calismalarimiz/tamamlanmis-calismalar/istanbul-ili-olasi-deprem-kayip-tahminlerinin-guncellenmesi-projesi/">https://depremzemin.ibb.istanbul/calismalarimiz/tamamlanmis-calismalar/istanbul-ili-olasi-deprem-kayip-tahminlerinin-guncellenmesi-projesi/</ext-link> (last access: 26 April 2023), 2019.</mixed-citation></ref>
      <ref id="bib1.bibx62"><?xmltex \def\ref@label{{Kalaycıo\u{g}lu et~al.(2022)}}?><label>Kalaycıoğlu et al.(2022)</label><?label kalaycioglu_data_2022?><mixed-citation>Kalaycıoğlu, O., Akhanlı, S. E., Menteşe, E. Y., Kalaycıoğlu, M., and Kalaycıoğlu, S.: R Shiny web application, shinyapps.io​​​​​​​ [data set]​​​​​​​, <uri>https://oyakalaycioglu.shinyapps.io/Social_Vulnerability/</uri> (last access: 13 June 2023), 2022.</mixed-citation></ref>
      <ref id="bib1.bibx63"><?xmltex \def\ref@label{{Kalaycioglu et~al.(2006)}}?><label>Kalaycioglu et al.(2006)</label><?label kalaycioglu_integrated_2006?><mixed-citation>Kalaycioglu, S., Rittersberger, H., Çelik, K., and Gunes, F.: Integrated natural disaster risk assessment: The socio-economic dimension of earthquake risk in the urban area, in: Geohazards, Okinawa, Japan, 18–21 June 2006, Engineering Conferences International Symposium Series, <uri>http://dc.engconfintl.org/geohazards/23/</uri> (last access: 18 December 2023), 2006.</mixed-citation></ref>
      <ref id="bib1.bibx64"><?xmltex \def\ref@label{{Kim and Lee(2017)}}?><label>Kim and Lee(2017)</label><?label kim_does_2017?><mixed-citation>Kim, S. and Lee, W.: Does McNemar's test compare the sensitivities and
specificities of two diagnostic tests?, Stat. Methods Med. Res., 26, 142–154, <ext-link xlink:href="https://doi.org/10.1177/0962280214541852" ext-link-type="DOI">10.1177/0962280214541852</ext-link>, 2017.</mixed-citation></ref>
      <ref id="bib1.bibx65"><?xmltex \def\ref@label{{Krishnan et~al.(2019)}}?><label>Krishnan et al.(2019)</label><?label krishnan_framework_2019?><mixed-citation>
Krishnan, P., Ananthan, P. S., Purvaja, R., Joyson Joe Jeevamani, J., Amali Infantina, J., Srinivasa Rao, C., Anand, A., Mahendra, R. S., Sekar, I., and Kareemulla, K.: Framework for mapping the drivers of coastal vulnerability and spatial decision making for climate-change adaptation: A
case study from Maharashtra, India, Ambio, 48, 192–212, 2019.</mixed-citation></ref>
      <ref id="bib1.bibx66"><?xmltex \def\ref@label{{Krzywinski and Altman(2017)}}?><label>Krzywinski and Altman(2017)</label><?label krzywinski_classification_2017?><mixed-citation>Krzywinski, M. and Altman, N.: Classification and regression trees, Nat. Methods, 14, 757–758, <ext-link xlink:href="https://doi.org/10.1038/nmeth.4370" ext-link-type="DOI">10.1038/nmeth.4370</ext-link>, 2017.</mixed-citation></ref>
      <ref id="bib1.bibx67"><?xmltex \def\ref@label{{Kuhn(2008)}}?><label>Kuhn(2008)</label><?label kuhn_building_2008?><mixed-citation>Kuhn, M.: Building Predictive Models in R Using the caret Package,
J. Stat. Softw., 1, 1–26, <ext-link xlink:href="https://doi.org/10.18637/jss.v028.i05" ext-link-type="DOI">10.18637/jss.v028.i05</ext-link>, 2008.</mixed-citation></ref>
      <?pagebreak page2155?><ref id="bib1.bibx68"><?xmltex \def\ref@label{{Kuhn and Johnson(2013)}}?><label>Kuhn and Johnson(2013)</label><?label kuhn_applied_2013?><mixed-citation>Kuhn, M. and Johnson, K.: Applied predictive modeling, vol. 26, Springer, <ext-link xlink:href="https://doi.org/10.1007/978-1-4614-6849-3" ext-link-type="DOI">10.1007/978-1-4614-6849-3</ext-link>, 2013.</mixed-citation></ref>
      <ref id="bib1.bibx69"><?xmltex \def\ref@label{{Lin and Nguyen(2020)}}?><label>Lin and Nguyen(2020)</label><?label lin_boosting_2020?><mixed-citation>Lin, H.-I. and Nguyen, M. C.: Boosting Minority Class Prediction on
Imbalanced Point Cloud Data, Appl. Sci., 10, 973,
<ext-link xlink:href="https://doi.org/10.3390/app10030973" ext-link-type="DOI">10.3390/app10030973</ext-link>, 2020.</mixed-citation></ref>
      <ref id="bib1.bibx70"><?xmltex \def\ref@label{{Liu and Li(2016)}}?><label>Liu and Li(2016)</label><?label liu_social_2016?><mixed-citation>Liu, D. and Li, Y.: Social vulnerability of rural households to flood hazards in western mountainous regions of Henan province, China, Nat. Hazards Earth Syst. Sci., 16, 1123–1134, <ext-link xlink:href="https://doi.org/10.5194/nhess-16-1123-2016" ext-link-type="DOI">10.5194/nhess-16-1123-2016</ext-link>, 2016.</mixed-citation></ref>
      <ref id="bib1.bibx71"><?xmltex \def\ref@label{{Llorente-Marrón et~al.(2020)}}?><label>Llorente-Marrón et al.(2020)</label><?label llorente-marron_social_2020?><mixed-citation>Llorente-Marrón, M., Díaz-Fernández, M., Méndez-Rodríguez, P., and González Arias, R.: Social Vulnerability, Gender and Disasters. The Case of Haiti in 2010, Sustainability, 12, 3574, <ext-link xlink:href="https://doi.org/10.3390/su12093574" ext-link-type="DOI">10.3390/su12093574</ext-link>, 2020.</mixed-citation></ref>
      <ref id="bib1.bibx72"><?xmltex \def\ref@label{{Mahbubur~Rahman et~al.(2023)}}?><label>Mahbubur Rahman et al.(2023)</label><?label mahbubur_rahman_social_2022?><mixed-citation>Mahbubur Rahman, M., Sadequr Rahman, M., and Jerin, T.: Social vulnerability to earthquake disaster: insights from the people of 48th ward of Dhaka South City, Bangladesh, Environ. Hazards, 22, 116–135, <ext-link xlink:href="https://doi.org/10.1080/17477891.2022.2085075" ext-link-type="DOI">10.1080/17477891.2022.2085075</ext-link>, 2023.</mixed-citation></ref>
      <ref id="bib1.bibx73"><?xmltex \def\ref@label{{Maheshwari et~al.(2017)}}?><label>Maheshwari et al.(2017)</label><?label maheshwari_review_2017?><mixed-citation>
Maheshwari, S., Jain, D. R., and Jadon, D. S.: A Review on Class Imbalance Problem: Analysis and Potential Solutions, International Journal Of Computer Science Issues, 14, 43–51, 2017.</mixed-citation></ref>
      <ref id="bib1.bibx74"><?xmltex \def\ref@label{{Markoulidakis et~al.(2021)}}?><label>Markoulidakis et al.(2021)</label><?label markoulidakis_multiclass_2021?><mixed-citation>Markoulidakis, I., Rallis, I., Georgoulas, I., Kopsiaftis, G., Doulamis, A.,
and Doulamis, N.: Multiclass Confusion Matrix Reduction Method and
Its Application on Net Promoter Score Classification Problem,
Technologies, 9, 81, <ext-link xlink:href="https://doi.org/10.3390/technologies9040081" ext-link-type="DOI">10.3390/technologies9040081</ext-link>, 2021.​​​​​​​</mixed-citation></ref>
      <ref id="bib1.bibx75"><?xmltex \def\ref@label{{Martins et~al.(2012)}}?><label>Martins et al.(2012)</label><?label martins_social_2012?><mixed-citation>Martins, V. N., e Silva, D. S., and Cabral, P.: Social vulnerability assessment to seismic risk using multicriteria analysis: the case study of Vila Franca do Campo (São Miguel Island, Azores, Portugal), Nat. Hazards, 62, 385–404, <ext-link xlink:href="https://doi.org/10.1007/s11069-012-0084-x" ext-link-type="DOI">10.1007/s11069-012-0084-x</ext-link>, 2012.</mixed-citation></ref>
      <ref id="bib1.bibx76"><?xmltex \def\ref@label{{Mavhura and Manyangadze(2021)}}?><label>Mavhura and Manyangadze(2021)</label><?label mavhura_comprehensive_2021?><mixed-citation>Mavhura, E. and Manyangadze, T.: A comprehensive spatial analysis of social vulnerability to natural hazards in Zimbabwe: Driving factors and policy implications, Int. J. Disast. Risk Re., 56, 102139, <ext-link xlink:href="https://doi.org/10.1016/j.ijdrr.2021.102139" ext-link-type="DOI">10.1016/j.ijdrr.2021.102139</ext-link>, 2021.</mixed-citation></ref>
      <ref id="bib1.bibx77"><?xmltex \def\ref@label{{Meade et~al.(1970)}}?><label>Meade et al.(1970)</label><?label meade_demography_1970?><mixed-citation>Meade, J. E., Wrigley, E. A., Brass, W., Boreham, A. J., Glass, D. V., and
Grebenik, E.: Demography and Economics, Popul. Stud., 24, 25–31,
<ext-link xlink:href="https://doi.org/10.2307/2172399" ext-link-type="DOI">10.2307/2172399</ext-link>, 1970.</mixed-citation></ref>
      <ref id="bib1.bibx78"><?xmltex \def\ref@label{{Menardi and Torelli(2014)}}?><label>Menardi and Torelli(2014)</label><?label menardi_training_2014?><mixed-citation>Menardi, G. and Torelli, N.: Training and assessing classification rules with
imbalanced data, Data Min. Knowl. Disc., 28, 92–122,
<ext-link xlink:href="https://doi.org/10.1007/s10618-012-0295-5" ext-link-type="DOI">10.1007/s10618-012-0295-5</ext-link>, 2014.</mixed-citation></ref>
      <ref id="bib1.bibx79"><?xmltex \def\ref@label{{Mente\c{s}e et~al.(2019)}}?><label>Menteşe et al.(2019)</label><?label mentese_understanding_2019?><mixed-citation>Menteşe, E. Y., Kalaycıoğlu, S., Çelik, K., Türkyılmaz, A. S., Çelen, U., Kara, S., Kılıç, O., Baş, M., and Uğur, C.:  Understanding Social Vulnerability Against Disasters in Istanbul, in: Geophysical Research Abstracts, vol. 21, <ext-link xlink:href="https://doi.org/10.13140/RG.2.2.28128.64005" ext-link-type="DOI">10.13140/RG.2.2.28128.64005</ext-link>, 2019.</mixed-citation></ref>
      <ref id="bib1.bibx80"><?xmltex \def\ref@label{{Mente\c{s}e et~al.(2022)}}?><label>Menteşe et al.(2022)</label><?label mentese_stakeholder_2022?><mixed-citation>Menteşe, E. Y., Trogrlić, R. Š., Hussein, E., Thompson, H., Öner, E., Yolcu, A., and Malamud, B. D.: Stakeholder Perceptions of Multi-hazards and Implications for Urban Disaster Risk Reduction in Istanbul, EGU General Assembly 2022, Vienna, Austria, 23–27 May 2022, EGU22-10895, <ext-link xlink:href="https://doi.org/10.5194/egusphere-egu22-10895" ext-link-type="DOI">10.5194/egusphere-egu22-10895</ext-link>, 2022.</mixed-citation></ref>
      <ref id="bib1.bibx81"><?xmltex \def\ref@label{{Mesta et~al.(2022)}}?><label>Mesta et al.(2022)</label><?label mesta_urban_2022?><mixed-citation>Mesta, C., Cremen, G., and Galasso, C.: Urban growth modelling and social
vulnerability assessment for a hazardous Kathmandu Valley, Sci. Rep.​​​​​​​, 12, 1–16, <ext-link xlink:href="https://doi.org/10.1038/s41598-022-09347-x" ext-link-type="DOI">10.1038/s41598-022-09347-x</ext-link>, 2022.</mixed-citation></ref>
      <ref id="bib1.bibx82"><?xmltex \def\ref@label{{Mtintsilana et~al.(2022)}}?><label>Mtintsilana et al.(2022)</label><?label mtintsilana_social_2022?><mixed-citation>Mtintsilana, A., Dlamini, S. N., Mapanga, W., Craig, A., Du Toit, J., Ware, L. J., and Norris, S. A.: Social vulnerability and its association with food
insecurity in the South African population: findings from a National
Survey, J. Public Health Pol., 43, 575–592,
<ext-link xlink:href="https://doi.org/10.1057/s41271-022-00370-w" ext-link-type="DOI">10.1057/s41271-022-00370-w</ext-link>, 2022.</mixed-citation></ref>
      <ref id="bib1.bibx83"><?xmltex \def\ref@label{{Murru et~al.(2016)}}?><label>Murru et al.(2016)</label><?label murru_m_2016?><mixed-citation>Murru, M., Akinci, A., Falcone, G., Pucci, S., Console, R., and Parsons, T.: <inline-formula><mml:math id="M230" display="inline"><mml:mrow><mml:mi>M</mml:mi><mml:mo>≥</mml:mo><mml:mn mathvariant="normal">7</mml:mn></mml:mrow></mml:math></inline-formula> earthquake rupture forecast and time-dependent probability for the sea of Marmara region, Turkey, J. Geophys. Res.-Sol. Ea., 121, 2679–2707, <ext-link xlink:href="https://doi.org/10.1002/2015JB012595" ext-link-type="DOI">10.1002/2015JB012595</ext-link>, 2016.</mixed-citation></ref>
      <ref id="bib1.bibx84"><?xmltex \def\ref@label{{Nor~Diana et~al.(2021)}}?><label>Nor Diana et al.(2021)</label><?label nor_diana_social_2021?><mixed-citation>Nor Diana, M. I., Muhamad, N., Taha, M. R., Osman, A., and Alam, M. M.: Social Vulnerability Assessment for Landslide Hazards in Malaysia: A Systematic Review Study, Land, 10, 315, <ext-link xlink:href="https://doi.org/10.3390/land10030315" ext-link-type="DOI">10.3390/land10030315</ext-link>, 2021.</mixed-citation></ref>
      <ref id="bib1.bibx85"><?xmltex \def\ref@label{{Noriega and Ludwig(2012)}}?><label>Noriega and Ludwig(2012)</label><?label noriega_social_2012?><mixed-citation>Noriega, G. R. and Ludwig, L. G.: Social vulnerability assessment for
mitigation of local earthquake risk in Los Angeles County, Nat.
Hazards, 64, 1341–1355, <ext-link xlink:href="https://doi.org/10.1007/s11069-012-0301-7" ext-link-type="DOI">10.1007/s11069-012-0301-7</ext-link>, 2012.</mixed-citation></ref>
      <ref id="bib1.bibx86"><?xmltex \def\ref@label{{Ocal and Senel(2021)}}?><label>Ocal and Senel(2021)</label><?label ocal_turkiyekayit_2021?><mixed-citation>Ocal, M. and Senel, D.: Türkiye’de Kayıt Dışı İstihdamın Bölgesel Analizi, Çalışma ve Toplum Dergisi, 2, 1201–1232, <uri>https://www.calismatoplum.org/makale/turkiyede-kayit-disiistihdamin-bolgesel-analizi</uri> (last access: 16 November 2022), 2021.</mixed-citation></ref>
      <ref id="bib1.bibx87"><?xmltex \def\ref@label{{{OECD}(2021)}}?><label>OECD(2021)</label><?label oecd_oecd_2021?><mixed-citation>OECD: OECD Economic Surveys: Turkey 2021, OECD Economic  Surveys: Turkey Series, OECD, <uri>https://www.oecd.org/economy/surveys/TURKEY-2021-OECD-economic-survey-overview.pdf</uri> (last access: 26 April 2023), 2021.</mixed-citation></ref>
      <ref id="bib1.bibx88"><?xmltex \def\ref@label{{Parsons(2004)}}?><label>Parsons(2004)</label><?label parsons_recalculated_2004?><mixed-citation>Parsons, T.: Recalculated probability of <inline-formula><mml:math id="M231" display="inline"><mml:mrow><mml:mi>M</mml:mi><mml:mo>≥</mml:mo><mml:mn mathvariant="normal">7</mml:mn></mml:mrow></mml:math></inline-formula> earthquakes beneath the
Sea of Marmara, Turkey, J. Geophys. Res.-Sol. Ea.,
109, B05304, <ext-link xlink:href="https://doi.org/10.1029/2003JB002667" ext-link-type="DOI">10.1029/2003JB002667</ext-link>, 2004.</mixed-citation></ref>
      <ref id="bib1.bibx89"><?xmltex \def\ref@label{{Peek and Stough(2010)}}?><label>Peek and Stough(2010)</label><?label peek_children_2010?><mixed-citation>Peek, L. and Stough, L. M.: Children with disabilities in the context of
disaster: A social vulnerability perspective, Child Dev., 81,
1260–1270, <ext-link xlink:href="https://doi.org/doi.org/10.1111/j.1467-8624.2010.01466.x" ext-link-type="DOI">doi.org/10.1111/j.1467-8624.2010.01466.x</ext-link>, 2010.</mixed-citation></ref>
      <ref id="bib1.bibx90"><?xmltex \def\ref@label{{Power et~al.(2013)}}?><label>Power et al.(2013)</label><?label power_principles_2013?><mixed-citation>Power, M., Fell, G., and Wright, M.: Principles for high-quality, high-value
testing, Evid. Based Med., 18, 5–10, <ext-link xlink:href="https://doi.org/10.1136/eb-2012-100645" ext-link-type="DOI">10.1136/eb-2012-100645</ext-link>, 2013.</mixed-citation></ref>
      <ref id="bib1.bibx91"><?xmltex \def\ref@label{{QGIS Development Team(2021)}}?><label>QGIS Development Team(2021)</label><?label qgis_qgis_2021?><mixed-citation>QGIS Development Team: QGIS Geographic Information System, Open Source Geospatial Foundation Project, <uri>http://qgis.osgeo.org</uri> (last access: 20 December 2022), 2021.</mixed-citation></ref>
      <ref id="bib1.bibx92"><?xmltex \def\ref@label{{Rabby et~al.(2019)}}?><label>Rabby et al.(2019)</label><?label rabby_social_2019?><mixed-citation>Rabby, Y. W., Hossain, M. B., and Hasan, M. U.: Social vulnerability in the
coastal region of Bangladesh: An investigation of social vulnerability
index and scalar change effects, Int. J. Disast. Risk
Re., 41, 101329, <ext-link xlink:href="https://doi.org/10.1016/j.ijdrr.2019.101329" ext-link-type="DOI">10.1016/j.ijdrr.2019.101329</ext-link>, 2019.</mixed-citation></ref>
      <ref id="bib1.bibx93"><?xmltex \def\ref@label{{Ramyachitra and Manikandan(2014)}}?><label>Ramyachitra and Manikandan(2014)</label><?label ramyachitra_imbalanced_2014?><mixed-citation>
Ramyachitra, R. and Manikandan, P.: Imbalanced dataset classification and  solutions: a review, International Journal of Computing and Business Research, 5, 1–29, 2014.</mixed-citation></ref>
      <ref id="bib1.bibx94"><?xmltex \def\ref@label{{R Core Team(2021)}}?><label>R Core Team(2021)</label><?label r_core_team_r_2021?><mixed-citation>R Core Team: R: A Language and Environment for Statistical Computing, R Foundation for Statistical Computing, Vienna, Austria, <uri>https://www.R-project.org/</uri> (last access: 1 April 2023), 2021.</mixed-citation></ref>
      <ref id="bib1.bibx95"><?xmltex \def\ref@label{{Republic of~Türkiye Ministry~of Labour and Social Security(2021)}}?><label>Republic of Türkiye Ministry of Labour and Social Security(2021)</label><?label the_republic_of_turkey?><mixed-citation>Republic of Türkiye Ministry of Labour and Social Security​​​​​​​: The European Code Of Social Security – Country Report (Article 74), Council of Europe, <uri>https://rm.coe.int/turkey-reportcode-art74-2021/1680a51194</uri> (last access: 26 April 2023), 2021.</mixed-citation></ref>
      <ref id="bib1.bibx96"><?xmltex \def\ref@label{{Roncancio et~al.(2020)}}?><label>Roncancio et al.(2020)</label><?label roncancio_social_2020?><mixed-citation>Roncancio, D. J., Cutter, S. L., and Nardocci, A. C.: Social vulnerability in
Colombia, Int. J. Disast. Risk Re., 50, 101872, <ext-link xlink:href="https://doi.org/10.1016/j.ijdrr.2020.101872" ext-link-type="DOI">10.1016/j.ijdrr.2020.101872</ext-link>, 2020.</mixed-citation></ref>
      <?pagebreak page2156?><ref id="bib1.bibx97"><?xmltex \def\ref@label{{Rufat et~al.(2019)}}?><label>Rufat et al.(2019)</label><?label rufat_how_2019?><mixed-citation>Rufat, S., Tate, E., Emrich, C. T., and Antolini, F.: How Valid Are
Social Vulnerability Models?, Ann. Am. Assoc. Geogr.​​​​​​​, 109, 1131–1153, <ext-link xlink:href="https://doi.org/10.1080/24694452.2018.1535887" ext-link-type="DOI">10.1080/24694452.2018.1535887</ext-link>, 2019.</mixed-citation></ref>
      <ref id="bib1.bibx98"><?xmltex \def\ref@label{{Ryo and Rillig(2017)}}?><label>Ryo and Rillig(2017)</label><?label ryo_statistically_2017?><mixed-citation>Ryo, M. and Rillig, M. C.: Statistically reinforced machine learning for
nonlinear patterns and variable interactions, Ecosphere, 8, e01976,
<ext-link xlink:href="https://doi.org/10.1002/ecs2.1976" ext-link-type="DOI">10.1002/ecs2.1976</ext-link>, 2017.</mixed-citation></ref>
      <ref id="bib1.bibx99"><?xmltex \def\ref@label{{Salami et~al.(2015)}}?><label>Salami et al.(2015)</label><?label salami_disasters_2015?><mixed-citation>
Salami, R., Von Meding, J., Giggins, H., and Olotu, A.: Disasters,
vulnerability and inadequate housing in Nigeria: A viable strategic
framework, in: 5th International Conference on Building Resilience, Newcastle, Australia, 15–17 July 2015, Proceedings ANDROID Residential Doctoral School, 2015.</mixed-citation></ref>
      <ref id="bib1.bibx100"><?xmltex \def\ref@label{{Schipper et~al.(2016)}}?><label>Schipper et al.(2016)</label><?label schipper_linking_2016?><mixed-citation>Schipper, E. L. F., Thomalla, F., Vulturius, G., Davis, M., and Johnson, K.:
Linking disaster risk reduction, climate change and development,
International Journal of Disaster Resilience in the Built Environment, 7,
216–228, <ext-link xlink:href="https://doi.org/10.1108/IJDRBE-03-2015-0014" ext-link-type="DOI">10.1108/IJDRBE-03-2015-0014</ext-link>, 2016.</mixed-citation></ref>
      <ref id="bib1.bibx101"><?xmltex \def\ref@label{{Shen et~al.(2018)}}?><label>Shen et al.(2018)</label><?label shen_visualized_2018?><mixed-citation>Shen, S., Cheng, C., Yang, J., and Yang, S.: Visualized analysis of developing trends and hot topics in natural disaster research, PLOS ONE, 13, e0191250, <ext-link xlink:href="https://doi.org/10.1371/journal.pone.0191250" ext-link-type="DOI">10.1371/journal.pone.0191250</ext-link>, 2018.</mixed-citation></ref>
      <ref id="bib1.bibx102"><?xmltex \def\ref@label{{Spielman et~al.(2020)}}?><label>Spielman et al.(2020)</label><?label spielman_evaluating_2020?><mixed-citation>Spielman, S. E., Tuccillo, J., Folch, D. C., Schweikert, A., Davies, R., Wood, N., and Tate, E.: Evaluating social vulnerability indicators: criteria and their application to the Social Vulnerability Index, Nat. Hazards,
100, 417–436, <ext-link xlink:href="https://doi.org/10.1007/s11069-019-03820-z" ext-link-type="DOI">10.1007/s11069-019-03820-z</ext-link>, 2020.</mixed-citation></ref>
      <ref id="bib1.bibx103"><?xmltex \def\ref@label{{Stough and Kelman(2018)}}?><label>Stough and Kelman(2018)</label><?label stough_people_2018?><mixed-citation>Stough, L. M. and Kelman, I.: People with disabilities and disasters, in:
Handbook of disaster research, Springer International Publishing,
Cham, 225–242, <ext-link xlink:href="https://doi.org/10.1007/978-3-319-63254-4_12" ext-link-type="DOI">10.1007/978-3-319-63254-4_12</ext-link>, 2018.</mixed-citation></ref>
      <ref id="bib1.bibx104"><?xmltex \def\ref@label{{Syed and Kumar~Routray(2014)}}?><label>Syed and Kumar Routray(2014)</label><?label syed_vulnerability_2014?><mixed-citation>Syed, A. and Kumar Routray, J.: Vulnerability assessment of earthquake prone communities in Baluchistan, International Journal of Disaster Resilience in
the Built Environment, 5, 144–162, <ext-link xlink:href="https://doi.org/10.1108/IJDRBE-12-2010-0053" ext-link-type="DOI">10.1108/IJDRBE-12-2010-0053</ext-link>, 2014.</mixed-citation></ref>
      <ref id="bib1.bibx105"><?xmltex \def\ref@label{{Tasnuva et~al.(2021)}}?><label>Tasnuva et al.(2021)</label><?label tasnuva_employing_2021?><mixed-citation>
Tasnuva, A., Hossain, M., Salam, R., Islam, A. R. M., Patwary, M. M., and Ibrahim, S. M.: Employing social vulnerability index to assess household social vulnerability of natural hazards: An evidence from southwest coastal Bangladesh, Environ. Dev. Sustain., 23, 10223–10245, 2021.</mixed-citation></ref>
      <ref id="bib1.bibx106"><?xmltex \def\ref@label{{Tate(2012)}}?><label>Tate(2012)</label><?label tate_social_2012?><mixed-citation>
Tate, E.: Social vulnerability indices: a comparative assessment using
uncertainty and sensitivity analysis, Nat. Hazards, 63, 325–347, 2012.</mixed-citation></ref>
      <ref id="bib1.bibx107"><?xmltex \def\ref@label{{Taubenböck et~al.(2006)}}?><label>Taubenböck et al.(2006)</label><?label taubenbock_assessing_2006?><mixed-citation>
Taubenböck, H., Kemper, T., Roth, A., and Voigt, S.: Assessing vulnerability
in Istanbul: An example to support disaster management with remote
sensing at ZKI-DLR, 1–9, ISBN 3-9809030-4-4, 2006.</mixed-citation></ref>
      <ref id="bib1.bibx108"><?xmltex \def\ref@label{{{Turkish Statistics
Institute}(2021)}}?><label>Turkish Statistics
Institute(2021)</label><?label turkish_statistics_institute_labour_2021?><mixed-citation>Turkish Statistics Institute: Labour Force Statistics, Tech. rep.,
<uri>https://data.tuik.gov.tr/Bulten/Index?p=Labour-Force-Statistics-February-2021-37487&amp;dil=2</uri> (last access: 18 March 2023), 2021.</mixed-citation></ref>
      <ref id="bib1.bibx109"><?xmltex \def\ref@label{{Turkoglu(2013)}}?><label>Turkoglu(2013)</label><?label turkoglu_sosyal_2013?><mixed-citation>
Turkoglu, I.: Sosyal devlet bağlamında Türkiye'de sosyal yardım ve sosyal güvenlik, Akademik İncelemeler Dergisi, 8, 275–305, 2013.</mixed-citation></ref>
      <ref id="bib1.bibx110"><?xmltex \def\ref@label{{{UNDRR}(2022)}}?><label>UNDRR(2022)</label><?label undrr_united_2022?><mixed-citation>UNDRR: Global Assessment Report on Disaster Risk Reduction 2022: Our World at Risk: Transforming Governance for a Resilient Future, United Nations Office for Disaster Risk Reduction,   UNDRR, Geneva, Switzerland, <uri>https://www.undrr.org/gar2022-our-world-risk-gar#container-downloads</uri>
(last access: 18 March 2023), 2022.
</mixed-citation></ref><?xmltex \hack{\newpage}?>
      <ref id="bib1.bibx111"><?xmltex \def\ref@label{{UNISDR Terminology on Disaster Risk Reduction(2015)}}?><label>UNISDR Terminology on Disaster Risk Reduction(2015)</label><?label unisdr_sandai_2015?><mixed-citation>UNISDR Terminology on Disaster Risk Reduction​​​​​​​: Sandai Framework for Disaster Risk Reduction 2015–2030, Tech. rep., <uri>https://www.undrr.org/publication/sendai-framework-disaster-risk-reduction-2015-2030</uri> (last access: 18 March 2023), 2015.</mixed-citation></ref>
      <ref id="bib1.bibx112"><?xmltex \def\ref@label{{{U.S. Environmental Protection
Agency}(2015)}}?><label>U.S. Environmental Protection
Agency(2015)</label><?label us_environmental_protection_agency_climate_2015?><mixed-citation>U.S. Environmental Protection Agency: Climate change in the United States – benefits of global action, Tech. Rep. EPA 430-R-15-001, Enviromental Protection Agency, Office of Atmospheric Programs,
<uri>https://www.epa.gov/cira</uri> (last access: 20 March 2023), 2015.</mixed-citation></ref>
      <ref id="bib1.bibx113"><?xmltex \def\ref@label{{Walker et~al.(2019)}}?><label>Walker et al.(2019)</label><?label walker_risk_2019?><mixed-citation>Walker, T., Kawasoe, Y., and Shrestha, J.: Risk and Vulnerability in Nepal, Risk and Vulnerability Assessment, World Bank,  <ext-link xlink:href="https://doi.org/10.1596/33365" ext-link-type="DOI">10.1596/33365</ext-link>, 2019.</mixed-citation></ref>
      <ref id="bib1.bibx114"><?xmltex \def\ref@label{{Wang et~al.(2022)}}?><label>Wang et al.(2022)</label><?label wang_urbanrural_2022?><mixed-citation>
Wang, S., Zhang, M., Huang, X., Hu, T., Sun, Q. C., Corcoran, J., and Liu, Y.: Urban–rural disparity of social vulnerability to natural hazards in
Australia, Sci. Rep.​​​​​​​, 12, 1–15, 2022.</mixed-citation></ref>
      <ref id="bib1.bibx115"><?xmltex \def\ref@label{{Wang and Sebastian(2021)}}?><label>Wang and Sebastian(2021)</label><?label wang_community_2021?><mixed-citation>Wang, Y. V. and Sebastian, A.: Community flood vulnerability and risk
assessment: An empirical predictive modeling approach, J. Flood
Risk Manage., 14, e12739, <ext-link xlink:href="https://doi.org/10.1111/jfr3.12739" ext-link-type="DOI">10.1111/jfr3.12739</ext-link>, 2021.</mixed-citation></ref>
      <ref id="bib1.bibx116"><?xmltex \def\ref@label{{Wang et~al.(2021)}}?><label>Wang et al.(2021)</label><?label wang_empirical_2021?><mixed-citation>Wang, Y. V., Gardoni, P., Murphy, C., and Guerrier, S.: Empirical Predictive Modeling Approach to Quantifying Social Vulnerability to Natural Hazards, Ann. Am. Assoc. Geogr., 111, 1559–1583, <ext-link xlink:href="https://doi.org/10.1080/24694452.2020.1823807" ext-link-type="DOI">10.1080/24694452.2020.1823807</ext-link>, 2021.</mixed-citation></ref>
      <ref id="bib1.bibx117"><?xmltex \def\ref@label{{West(2007)}}?><label>West(2007)</label><?label west_poverty_2007?><mixed-citation>West, A.: Poverty and educational achievement: why do children from low-income families tend to do less well at school?, Benefits: A Journal of Poverty and Social Justice, 15, 283–297, <ext-link xlink:href="https://doi.org/10.51952/XLJA4165" ext-link-type="DOI">10.51952/XLJA4165</ext-link>, 2007.</mixed-citation></ref>
      <ref id="bib1.bibx118"><?xmltex \def\ref@label{{Wilson(2019)}}?><label>Wilson(2019)</label><?label wilson_overrun_2019?><mixed-citation>Wilson, B. S.: Overrun by averages: An empirical analysis into the
consistency of social vulnerability components across multiple scales,
Int. J. Disast. Risk Re., 40, 101268, <ext-link xlink:href="https://doi.org/10.1016/j.ijdrr.2019.101268" ext-link-type="DOI">10.1016/j.ijdrr.2019.101268</ext-link>, 2019.</mixed-citation></ref>
      <ref id="bib1.bibx119"><?xmltex \def\ref@label{{Wisner and Luce(1993)}}?><label>Wisner and Luce(1993)</label><?label wisner_disaster_1993?><mixed-citation>Wisner, B. and Luce, H. R.: Disaster vulnerability: Scale, power and daily
life, GeoJournal, 30, 127–140, <ext-link xlink:href="https://doi.org/10.1007/BF00808129" ext-link-type="DOI">10.1007/BF00808129</ext-link>, 1993.</mixed-citation></ref>
      <ref id="bib1.bibx120"><?xmltex \def\ref@label{{WUP(2023)}}?><label>WUP(2023)</label><?label wup_world_2023?><mixed-citation>WUP: United Nations population estimates and projections of major Urban Agglomerations, World Urbanization Prospects, Tech. rep.,
<uri>https://worldpopulationreview.com/world-cities</uri>​​​​​​​ (last access: 17 March 2023), 2023.</mixed-citation></ref>
      <ref id="bib1.bibx121"><?xmltex \def\ref@label{{Yoon and Jeong(2016)}}?><label>Yoon and Jeong(2016)</label><?label yoon_assessment_2016?><mixed-citation>Yoon, D. K. and Jeong, S.: Assessment of Community Vulnerability to Natural Disasters in Korea by Using GIS and Machine Learning Techniques, in: Quantitative Regional Economic and Environmental Analysis for Sustainability in Korea, Springer,  Singapore, vol. 25, 123–140, <ext-link xlink:href="https://doi.org/10.1007/978-981-10-0300-4_7" ext-link-type="DOI">10.1007/978-981-10-0300-4_7</ext-link>, 2016.</mixed-citation></ref>
      <ref id="bib1.bibx122"><?xmltex \def\ref@label{{Yücel and Arun(2010)}}?><label>Yücel and Arun(2010)</label><?label yucel_earthquake_2010?><mixed-citation>
Yücel, G. and Arun, G.: Earthquake and Physical and Social Vulnerability Assessment for Settlements: Case Study Avcılar District, Megaron, 5, 23–32, 2010.</mixed-citation></ref>

  </ref-list></back>
    <!--<article-title-html>Using machine learning algorithms to identify predictors of social vulnerability in the event of a hazard: Istanbul case study</article-title-html>
<abstract-html/>
<ref-html id="bib1.bib1"><label>Abarca-Alvarez et al.(2019)</label><mixed-citation>
      
Abarca-Alvarez, F. J., Reinoso-Bellido, R., and Campos-Sánchez, F. S.: Decision Model for Predicting Social Vulnerability Using Artificial Intelligence, ISPRS Int. J. Geo-Inf., 8, 575, <a href="https://doi.org/10.3390/ijgi8120575" target="_blank">https://doi.org/10.3390/ijgi8120575</a>, 2019.

    </mixed-citation></ref-html>
<ref-html id="bib1.bib2"><label>Acar et al.(2022)</label><mixed-citation>
      
Acar, s., Karagoz, T., Meydan, M. C., Sahin Cinoglu, D., Kaygisiz, G., and
Isik, M.: Ilcelerin sosyo-ekonomik gelismislik siralamasi arastirmasi –
SEGE 2022 (Research on the socio-econimic development ranking of
districts), Tech. Rep. 35, Republic Of Turkey Ministry of Industry and
Technology, General Directorate of Development Agencies,
<a href="https://www.sanayi.gov.tr/merkez-birimi/b94224510b7b/sege" target="_blank"/> (last access: 20 March 2023), 2022.

    </mixed-citation></ref-html>
<ref-html id="bib1.bib3"><label>Adaman et al.(2015)</label><mixed-citation>
      
Adaman, F., Aslan, D., Erus, B., and Sayan, S.: ESPN Thematic Report on  in-work poverty in Turkey, Tech. rep., European Commission, Brussels, <a href="https://ec.europa.eu/social/BlobServlet?docId=21089&amp;langId=en" target="_blank"/>​​​​​​​ (last access: 20 March 2023), 2015.

    </mixed-citation></ref-html>
<ref-html id="bib1.bib4"><label>AFAD(2019)</label><mixed-citation>
      
AFAD: Disaster and Management Presidency of Turkey – 2019 Overview of Disaster Management and Natural Disaster Statistics, Tech. rep., AFAD, <a href="https://en.afad.gov.tr/kurumlar/en.afad/Afet_Istatistikleri_2020_eng_1.pdf" target="_blank"/>​​​​​​​ (last access: 26 March 2023), 2019.

    </mixed-citation></ref-html>
<ref-html id="bib1.bib5"><label>Akhanli and Hennig(2020)</label><mixed-citation>
      
Akhanli, S. E. and Hennig, C.: Comparing clusterings and numbers of clusters by aggregation of calibrated clustering validity indexes, Stat. Comput., 30, 1523–1544, <a href="https://doi.org/10.1007/s11222-020-09958-2" target="_blank">https://doi.org/10.1007/s11222-020-09958-2</a>, 2020.

    </mixed-citation></ref-html>
<ref-html id="bib1.bib6"><label>Aksha et al.(2019)</label><mixed-citation>
      
Aksha, S. K., Juran, L., Resler, L. M., and Zhang, Y.: An Analysis of Social Vulnerability to Natural Hazards in Nepal Using a Modified Social Vulnerability Index, Int. J. Disast. Risk Sc., 10, 103–116, <a href="https://doi.org/10.1007/s13753-018-0192-7" target="_blank">https://doi.org/10.1007/s13753-018-0192-7</a>, 2019.

    </mixed-citation></ref-html>
<ref-html id="bib1.bib7"><label>Alizadeh et al.(2018)</label><mixed-citation>
      
Alizadeh, M., Alizadeh, E., Asadollahpour Kotenaee, S., Shahabi, H., Beiranvand Pour, A., Panahi, M., Bin Ahmad, B., and Saro, L.: Social Vulnerability Assessment Using Artificial Neural Network (ANN) Model for Earthquake Hazard in Tabriz City, Iran, Sustainability,
10, 3376, <a href="https://doi.org/10.3390/su10103376" target="_blank">https://doi.org/10.3390/su10103376</a>, 2018.

    </mixed-citation></ref-html>
<ref-html id="bib1.bib8"><label>Armaş(2008)</label><mixed-citation>
      
Armaş, I.: Social vulnerability and seismic risk perception. Case study: the historic center of the Bucharest Municipality/Romania, Nat. Hazards, 47, 397–410, <a href="https://doi.org/10.1007/s11069-008-9229-3" target="_blank">https://doi.org/10.1007/s11069-008-9229-3</a>, 2008.

    </mixed-citation></ref-html>
<ref-html id="bib1.bib9"><label>Atun and Menoni(2014)</label><mixed-citation>
      
Atun, F. and Menoni, S.: Vulnerability to earthquake in Istanbul: application of the ENSURE methodology, Orhan Hacihasanoglu ITU Faculty of Architecture, A/Z ITU Journal of the Faculty of Architecture, 11, 99–116, <a href="https://research.utwente.nl/en/publications/vulnerability-to-earthquake-in-istanbul-application-of-the-ensure" target="_blank"/> (last access: 18 March 2023), 2014.

    </mixed-citation></ref-html>
<ref-html id="bib1.bib10"><label>Bakkensen et al.(2017)</label><mixed-citation>
      
Bakkensen, L. A., Fox‐Lent, C., Read, L. K., and Linkov, I.: Validating
resilience and vulnerability indices in the context of natural disasters,
Risk Anal., 37, 982–1004, 2017.

    </mixed-citation></ref-html>
<ref-html id="bib1.bib11"><label>Bakker et al.(2019)</label><mixed-citation>
      
Bakker, A., Cai, J., English, L., Kaiser, G., Mesa, V., and Van Dooren, W.:
Beyond small, medium, or large: points of consideration when interpreting
effect sizes, Educ. Stud. Math., 102, 1–8,
<a href="https://doi.org/10.1007/s10649-019-09908-4" target="_blank">https://doi.org/10.1007/s10649-019-09908-4</a>, 2019.

    </mixed-citation></ref-html>
<ref-html id="bib1.bib12"><label>Baris(2009)</label><mixed-citation>
      
Baris, M.: Effectiveness of Turkish disaster management system and recommendations, Biotechnol. Biotec. Eq., 23, 1391–1398, 2009.

    </mixed-citation></ref-html>
<ref-html id="bib1.bib13"><label>Bartik et al.(2020)</label><mixed-citation>
      
Bartik, A. W., Bertrand, M., Cullen, Z., Glaeser, E. L., Luca, M., and Stanton, C.: The impact of COVID-19 on small business outcomes and expectations, P. Natl. Acad. Sci. USA, 117, 17656–17666, 2020.

    </mixed-citation></ref-html>
<ref-html id="bib1.bib14"><label>Basile Ibrahim et al.(2021)</label><mixed-citation>
      
Basile Ibrahim, B., Barcelona, V., Condon, E. M., Crusto, C. A., and Taylor,
J. Y.: The Association Between Neighborhood Social Vulnerability
and Cardiovascular Health Risk among Black/African American
Women in the InterGEN Study, Nurs. Res., 70, S3–S12,
<a href="https://doi.org/10.1097/NNR.0000000000000523" target="_blank">https://doi.org/10.1097/NNR.0000000000000523</a>, 2021.

    </mixed-citation></ref-html>
<ref-html id="bib1.bib15"><label>Batista et al.(2004)</label><mixed-citation>
      
Batista, G. E. A. P. A., Prati, R. C., and Monard, M. C.: A Study of the
Behavior of Several Methods for Balancing Machine Learning
Training Data, SIGKDD Explor. Newsl., 6, 20–29,
<a href="https://doi.org/10.1145/1007730.1007735" target="_blank">https://doi.org/10.1145/1007730.1007735</a>, 2004.

    </mixed-citation></ref-html>
<ref-html id="bib1.bib16"><label>Bergstrand et al.(2015)</label><mixed-citation>
      
Bergstrand, K., Mayer, B., Brumback, B., and Zhang, Y.: Assessing the
Relationship Between Social Vulnerability and Community
Resilience to Hazards, Soc. Indic. Res., 122, 391–409,
<a href="https://doi.org/10.1007/s11205-014-0698-3" target="_blank">https://doi.org/10.1007/s11205-014-0698-3</a>, 2015.

    </mixed-citation></ref-html>
<ref-html id="bib1.bib17"><label>Birkmann and Wisner(2006)</label><mixed-citation>
      
Birkmann, J. and Wisner, B.: Measuring the unmeasurable: the challenge of vulnerability, UNU-EHS – United Nations University – Institute for Environment and Human Security, vol. 5, Bonn, Germany, 64 pp., ISBN 3981058267, 2006.

    </mixed-citation></ref-html>
<ref-html id="bib1.bib18"><label>Bjarnadottir et al.(2011)</label><mixed-citation>
      
Bjarnadottir, S., Li, Y., and Stewart, M. G.: Social vulnerability index for
coastal communities at risk to hurricane hazard and a changing climate,
Nat. Hazards, 59, 1055–1075, 2011.

    </mixed-citation></ref-html>
<ref-html id="bib1.bib19"><label>Burton et al.(2018)</label><mixed-citation>
      
Burton, C., Rufat, S., and Tate, E.: Social vulnerability: Conceptual
Foundations and Geospatial Modeling, Vulnerability and resilience to
natural hazards, Cambridge University Press, 53–81, <a href="https://doi.org/10.1017/9781316651148" target="_blank">https://doi.org/10.1017/9781316651148</a>, 2018.

    </mixed-citation></ref-html>
<ref-html id="bib1.bib20"><label>Buskirk et al.(2018)</label><mixed-citation>
      
Buskirk, T. D., Kirchner, A., Eck, A., and Signorino, C. S.: An Introduction to Machine Learning Methods for Survey Researchers, Survey Practice, 11, <a href="https://doi.org/10.29115/SP-2018-0004" target="_blank">https://doi.org/10.29115/SP-2018-0004</a>, 2018.

    </mixed-citation></ref-html>
<ref-html id="bib1.bib21"><label>Cannon(2008)</label><mixed-citation>
      
Cannon, T.: Reducing People's Vulnerability to Natural Hazards: Communities and Resilience, World Institute for Development Economic Research (UNU-WIDER), WIDER Working Paper Series, RP2008-34, <a href="https://ideas.repec.org/p/unu/wpaper/rp2008-34.html" target="_blank"/> (last access: 20 December 2022), 2008.

    </mixed-citation></ref-html>
<ref-html id="bib1.bib22"><label>Chawla et al.(2002)</label><mixed-citation>
      
Chawla, N. V., Bowyer, K. W., Hall, L. O., and Kegelmeyer, W. P.: SMOTE:
Synthetic Minority Over-sampling Technique, J. Artif. Intell. Res.​​​​​​​, 16, 321–357, <a href="https://doi.org/10.1613/jair.953" target="_blank">https://doi.org/10.1613/jair.953</a>, 2002.

    </mixed-citation></ref-html>
<ref-html id="bib1.bib23"><label>Chen et al.(2013)</label><mixed-citation>
      
Chen, W., Cutter, S. L., Emrich, C. T., and Shi, P.: Measuring social
vulnerability to natural hazards in the Yangtze River Delta region,
China, Int. J. Disast. Risk Sc., 4, 169–181, 2013.

    </mixed-citation></ref-html>
<ref-html id="bib1.bib24"><label>Chou et al.(2004)</label><mixed-citation>
      
Chou, Y.-J., Huang, N., Lee, C.-H., Tsai, S.-L., Chen, L.-S., and Chang, H.-J.: Who is at risk of death in an earthquake?, Am. J. Epidemiol., 160, 688–695, 2004.

    </mixed-citation></ref-html>
<ref-html id="bib1.bib25"><label>Çolak and Sunar(2020)</label><mixed-citation>
      
Çolak, E. and Sunar, F.: The importance of ground-truth and crowdsourcing data for the statistical and spatial analyses of the NASA FIRMS active fires in the Mediterranean Turkish forests, Remote Sensing Applications: Society and Environment, 19, 100327, <a href="https://doi.org/10.1016/j.rsase.2020.100327" target="_blank">https://doi.org/10.1016/j.rsase.2020.100327</a>,
2020.

    </mixed-citation></ref-html>
<ref-html id="bib1.bib26"><label>Couronné et al.(2018)</label><mixed-citation>
      
Couronné, R., Probst, P., and Boulesteix, A.-L.: Random forest versus logistic regression: a large-scale benchmark experiment, BMC Bioinformatics, 19, 270, <a href="https://doi.org/10.1186/s12859-018-2264-5" target="_blank">https://doi.org/10.1186/s12859-018-2264-5</a>, 2018.

    </mixed-citation></ref-html>
<ref-html id="bib1.bib27"><label>Cureton(2011)</label><mixed-citation>
      
Cureton, S.: Environmental victims: environmental injustice issues that
threaten the health of children living in poverty, Rev. Environ. Health, 26, 141–147, <a href="https://doi.org/10.1515/reveh.2011.021" target="_blank">https://doi.org/10.1515/reveh.2011.021</a>, 2011.

    </mixed-citation></ref-html>
<ref-html id="bib1.bib28"><label>Cutter et al.(2009)</label><mixed-citation>
      
Cutter, S., Emrich, C., Haney (Webb), J., and Morath, D.: Social Vulnerability to Climate Variability Hazards: A Review of the Literature, Final Report to Oxfam America, 1–44, <a href="https://citeseerx.ist.psu.edu/document?repid=rep1&amp;type=pdf&amp;doi=e0708976f51536074aba4cf7fd5375d9c8f58c2b" target="_blank"/>
(last access: 20 March 2023), 2009.

    </mixed-citation></ref-html>
<ref-html id="bib1.bib29"><label>Cutter and Finch(2008)</label><mixed-citation>
      
Cutter, S. L. and Finch, C.: Temporal and spatial changes in social
vulnerability to natural hazards, P. Natl. Acad. Sci. USA, 105, 2301–2306, 2008.

    </mixed-citation></ref-html>
<ref-html id="bib1.bib30"><label>Cutter et al.(2000)</label><mixed-citation>
      
Cutter, S. L., Mitchell, J. T., and Scott, M. S.: Revealing the Vulnerability of People and Places: A Case Study of Georgetown County, South Carolina, Ann. Assoc. Am. Geogr., 90, 713–737, <a href="https://doi.org/10.1111/0004-5608.00219" target="_blank">https://doi.org/10.1111/0004-5608.00219</a>, 2000.

    </mixed-citation></ref-html>
<ref-html id="bib1.bib31"><label>Cutter et al.(2003)</label><mixed-citation>
      
Cutter, S. L., Boruff, B. J., and Shirley, W. L.: Social Vulnerability to
Environmental Hazards, Soc. Sci. Quart., 84, 242–261,
<a href="https://doi.org/10.1111/1540-6237.8402002" target="_blank">https://doi.org/10.1111/1540-6237.8402002</a>, 2003.

    </mixed-citation></ref-html>
<ref-html id="bib1.bib32"><label>Debesai(2020)</label><mixed-citation>
      
Debesai, M. G.: Factors affecting vulnerability level of farming households to climate change in developing countries: evidence from Eritrea, IOP Conf. Ser.-Mat. Sci., 1001, 012093, <a href="https://doi.org/10.1088/1757-899x/1001/1/012093" target="_blank">https://doi.org/10.1088/1757-899x/1001/1/012093</a>, 2020.

    </mixed-citation></ref-html>
<ref-html id="bib1.bib33"><label>DeLong et al.(1988)</label><mixed-citation>
      
DeLong, E. R., DeLong, D. M., and Clarke-Pearson, D. L.: Comparing the Areas under Two or More Correlated Receiver Operating Characteristic Curves: A Nonparametric Approach, Biometrics, 44, 837–845, <a href="https://doi.org/10.2307/2531595" target="_blank">https://doi.org/10.2307/2531595</a>, 1988.

    </mixed-citation></ref-html>
<ref-html id="bib1.bib34"><label>de Oliveira Mendes(2009)</label><mixed-citation>
      
de Oliveira Mendes, J. M.: Social vulnerability indexes as planning tools:
beyond the preparedness paradigm, J. Risk Res., 12, 43–58,
<a href="https://doi.org/10.1080/13669870802447962" target="_blank">https://doi.org/10.1080/13669870802447962</a>, 2009.

    </mixed-citation></ref-html>
<ref-html id="bib1.bib35"><label>Di Franco and Santurro(2020)</label><mixed-citation>
      
Di Franco, G. and Santurro, M.: Machine learning, artificial neural networks
and social research, Qual. Quant., 55, 1007–1025, <a href="https://doi.org/10.1007/s11135-020-01037-y" target="_blank">https://doi.org/10.1007/s11135-020-01037-y</a>, 2020.

    </mixed-citation></ref-html>
<ref-html id="bib1.bib36"><label>Dodman et al.(2013)</label><mixed-citation>
      
Dodman, D., Brown, D., Francis, K., Hardoy, J., Johnson, C., and Satterthwaite, D.: Understanding the nature and scale of urban risk in low- and middleincome countries and its implications for humanitarian preparedness, planning and response, Tech. rep., International Institute for Environment and Development, <a href="http://pubs.iied.org/10624IIED.html" target="_blank"/> (last access: 1 April 2023), 2013.

    </mixed-citation></ref-html>
<ref-html id="bib1.bib37"><label>Dunning and Durden(2011)</label><mixed-citation>
      
Dunning, C. and Durden, S.: Social vulnerability analysis methods for Corps planning, Tech. rep., US Army Corps of Engineers, <a href="https://www.iwr.usace.army.mil/portals/70/docs/iwrreports/2011-r-07.pdf" target="_blank"/> (last access: 15 December 2022), 2011.

    </mixed-citation></ref-html>
<ref-html id="bib1.bib38"><label>Durahim(2016)</label><mixed-citation>
      
Durahim, A. O.: Comparison of sampling techniques for imbalanced learning,
Yönetim Bilişim Sistemleri Dergisi, 2, 181–191, 2016.

    </mixed-citation></ref-html>
<ref-html id="bib1.bib39"><label>Dwyer et al.(2004)</label><mixed-citation>
      
Dwyer, A., Zoppou, C., Nielsen, O., Day, S., and Roberts, S.: Quantifying social vulnerability: a methodology for identifying those at risk to natural hazards, Geoscience Australia Canberra, <a href="https://d28rz98at9flks.cloudfront.net/61168/Rec2004_014.pdf" target="_blank"/>​​​​​​​ (last access: 3 December 2022), 2004.

    </mixed-citation></ref-html>
<ref-html id="bib1.bib40"><label>Emrich et al.(2014)</label><mixed-citation>
      
Emrich, C., Morath, D., Morath, G., and Reeves, R.: Climate-sensitive hazards
in Florida: identifying and prioritizing threats to build resilience
against climate effects, Hazard Vulnerability Res. Inst.
Columbia, Columbia, SC, USA, <a href="https://www.floridahealth.gov/environmental-health/climate-and-health/_documents/climate-sensitive-hazards-in-florida-final-report.pdf" target="_blank"/> (last access: 16 December 2023), 2014.

    </mixed-citation></ref-html>
<ref-html id="bib1.bib41"><label>Enarson et al.(2018)</label><mixed-citation>
      
Enarson, E., Fothergill, A., and Peek, L.: Gender and disaster: Foundations and new directions for research and practice, Handbook of disaster research, Springer, 205–223, <a href="https://doi.org/10.1007/978-3-319-63254-4_11" target="_blank">https://doi.org/10.1007/978-3-319-63254-4_11</a>, 2018.

    </mixed-citation></ref-html>
<ref-html id="bib1.bib42"><label>Erdik et al.(2003)</label><mixed-citation>
      
Erdik, M., Aydinoglu, N., Fahjan, Y., Sesetyan, K., Demircioglu, M., Siyahi, B., Durukal, E., Ozbey, C., Biro, Y., Akman, H., and Yuzugullu, O.: Earthquake risk assessment for Istanbul metropolitan area, Earthq. Eng. Eng. Vib., 2, 1–23, <a href="https://doi.org/10.1007/BF02857534" target="_blank">https://doi.org/10.1007/BF02857534</a>, 2003.

    </mixed-citation></ref-html>
<ref-html id="bib1.bib43"><label>Ersoy and Koçak(2016)</label><mixed-citation>
      
Ersoy, S. and Koçak, A.: Disasters and earthquake preparedness of children and schools in Istanbul, Turkey, Geomat. Nat. Haz. Risk​​​​​​​, 7, 1307–1336, <a href="https://doi.org/10.1080/19475705.2015.1060637" target="_blank">https://doi.org/10.1080/19475705.2015.1060637</a>, 2016.

    </mixed-citation></ref-html>
<ref-html id="bib1.bib44"><label>Esposito et al.(2021)</label><mixed-citation>
      
Esposito, C., Landrum, G. A., Schneider, N., Stiefl, N., and Riniker, S.:
GHOST: Adjusting the Decision Threshold to Handle Imbalanced
Data in Machine Learning, J. Chem. Inf. Model., 61, 2623–2640,
<a href="https://doi.org/10.1021/acs.jcim.1c00160" target="_blank">https://doi.org/10.1021/acs.jcim.1c00160</a>, 2021.

    </mixed-citation></ref-html>
<ref-html id="bib1.bib45"><label>Evans and Kantrowitz(2002)</label><mixed-citation>
      
Evans, G. W. and Kantrowitz, E.: Socioeconomic status and health: the potential role of environmental risk exposure, Annu. Rev. Publ. Health, 23, 303–331, 2002.

    </mixed-citation></ref-html>
<ref-html id="bib1.bib46"><label>Fatemi et al.(2017)</label><mixed-citation>
      
Fatemi, F., Ardalan, A., Aguirre, B., Mansouri, N., and Mohammadfam, I.: Social vulnerability indicators in disasters: Findings from a systematic review, Int. J. Disast. Risk Re., 22, 219–227,
<a href="https://doi.org/10.1016/j.ijdrr.2016.09.006" target="_blank">https://doi.org/10.1016/j.ijdrr.2016.09.006</a>, 2017.

    </mixed-citation></ref-html>
<ref-html id="bib1.bib47"><label>Fekete(2009)</label><mixed-citation>
      
Fekete, A.: Validation of a social vulnerability index in context to river-floods in Germany, Nat. Hazards Earth Syst. Sci., 9, 393–403, <a href="https://doi.org/10.5194/nhess-9-393-2009" target="_blank">https://doi.org/10.5194/nhess-9-393-2009</a>, 2009.

    </mixed-citation></ref-html>
<ref-html id="bib1.bib48"><label>Flanagan et al.(2011)</label><mixed-citation>
      
Flanagan, B. E., Gregory, E. W., Hallisey, E. J., Heitgerd, J. L., and Lewis, B.: A social vulnerability index for disaster management, J. Homel. Secur. Emerg., 8, <a href="https://doi.org/10.2202/1547-7355.1792" target="_blank">https://doi.org/10.2202/1547-7355.1792</a>, 2011.

    </mixed-citation></ref-html>
<ref-html id="bib1.bib49"><label>Fritz et al.(2012)</label><mixed-citation>
      
Fritz, C. O., Morris, P. E., and Richler, J. J.: Effect size estimates: Current use, calculations, and interpretation, J. Exp. Psychol. Gen., 141, 2–18, <a href="https://doi.org/10.1037/a0024338" target="_blank">https://doi.org/10.1037/a0024338</a>, 2012.

    </mixed-citation></ref-html>
<ref-html id="bib1.bib50"><label>Garson(1991)</label><mixed-citation>
      
Garson, G. D.: Interpreting Neural-Network Connection Weights, AI Expert, 6, 46–51, 1991.

    </mixed-citation></ref-html>
<ref-html id="bib1.bib51"><label>Gelman(2008)</label><mixed-citation>
      
Gelman, A.: Scaling regression inputs by dividing by two standard deviations,
Stat. Med., 27, 2865–2873, <a href="https://doi.org/10.1002/sim.3107" target="_blank">https://doi.org/10.1002/sim.3107</a>, 2008.

    </mixed-citation></ref-html>
<ref-html id="bib1.bib52"><label>Gray et al.(2022)</label><mixed-citation>
      
Gray, B. J., Kyle, R. G., Song, J., and Davies, A. R.: Characteristics of those most vulnerable to employment changes during the COVID-19 pandemic: a
nationally representative cross-sectional study in Wales, J. Epidemiol.
Commun. H., 76, 8–15, 2022.

    </mixed-citation></ref-html>
<ref-html id="bib1.bib53"><label>Green(2008)</label><mixed-citation>
      
Green, R. A.: Unauthorised development and seismic hazard vulnerability: a
study of squatters and engineers in Istanbul, Turkey, Disasters, 32,
358–376, <a href="https://doi.org/10.1111/j.1467-7717.2008.01044.x" target="_blank">https://doi.org/10.1111/j.1467-7717.2008.01044.x</a>,
2008.

    </mixed-citation></ref-html>
<ref-html id="bib1.bib54"><label>Guillard-Gonçalves et al.(2015)</label><mixed-citation>
      
Guillard-Gonçalves, C., Cutter, S. L., Emrich, C. T., and Zêzere, J. L.: Application of Social Vulnerability Index (SoVI) and delineation of natural risk zones in Greater Lisbon, Portugal, J. Risk Res., 18, 651–674, 2015.

    </mixed-citation></ref-html>
<ref-html id="bib1.bib55"><label>Hallegatte et al.(2020)</label><mixed-citation>
      
Hallegatte, S., Vogt-Schilb, A., Rozenberg, J., Bangalore, M., and Beaudet, C.: From Poverty to Disaster and Back: a Review of the Literature,
EconDisCliCha, 4, 223–247, <a href="https://doi.org/10.1007/s41885-020-00060-5" target="_blank">https://doi.org/10.1007/s41885-020-00060-5</a>, 2020.

    </mixed-citation></ref-html>
<ref-html id="bib1.bib56"><label>Hennig and Liao(2013)</label><mixed-citation>
      
Hennig, C. and Liao, T. F.: How to find an appropriate clustering for mixed-type variables with application to socio-economic stratification, J. Roy. Stat. Soc. C-App., 62, 309–369, <a href="https://doi.org/10.1111/j.1467-9876.2012.01066.x" target="_blank">https://doi.org/10.1111/j.1467-9876.2012.01066.x</a>, 2013.

    </mixed-citation></ref-html>
<ref-html id="bib1.bib57"><label>Holand and Lujala(2013)</label><mixed-citation>
      
Holand, I. S. and Lujala, P.: Replicating and Adapting an Index of Social Vulnerability to a New Context: A Comparison Study for Norway, Prof. Geogr., 65, 312–328, <a href="https://doi.org/10.1080/00330124.2012.681509" target="_blank">https://doi.org/10.1080/00330124.2012.681509</a>, 2013.

    </mixed-citation></ref-html>
<ref-html id="bib1.bib58"><label>Holand et al.(2011)</label><mixed-citation>
      
Holand, I. S., Lujala, P., and Rød, J. K.: Social vulnerability assessment for Norway: A quantitative approach, Norsk Geogr. Tidsskr.​​​​​​​, 65, 1–17, <a href="https://doi.org/10.1080/00291951.2010.550167" target="_blank">https://doi.org/10.1080/00291951.2010.550167</a>, 2011.

    </mixed-citation></ref-html>
<ref-html id="bib1.bib59"><label>Hosmer et al.(2013)</label><mixed-citation>
      
Hosmer, D. W., Lemeshow, S., and Sturdivant, R. X.: Applied Logistic Regression, Wiley Series in Probability and Statistics, Wiley, 1st edn., <a href="https://onlinelibrary.wiley.com/doi/book/10.1002/9781118548387" target="_blank"/> (last access: 15 November 2022), 2013.

    </mixed-citation></ref-html>
<ref-html id="bib1.bib60"><label>IMM(2018)</label><mixed-citation>
      
Istanbul Metropolitan Municipality (IMM): Afetler Karsisinda Sosyal Hasargörebilirlik Sonuç Raporu (Final Report of Survey Study for Social Vulnerability to Natural Disasters), Istanbul Metropolitan Municipality (IMM)  Directorate of Earthquake and Ground Research, Tech. rep., <a href="https://depremzemin.ibb.istanbul/calismalarimiz/tamamlanmiscalismalar/istanbul-ili-genelinde-afetler-karsisinda-sosyalhasar-gorebilirlik-arastirmasi/" target="_blank">https://depremzemin.ibb.istanbul/ calismalarimiz/tamamlanmiscalismalar/istanbul-ili-genelinde-afetler-karsisinda-sosyalhasar-gorebilirlik-arastirmasi/</a> (last access: 20 April 2023), 2018.

    </mixed-citation></ref-html>
<ref-html id="bib1.bib61"><label>IMM and KOERI(2019)</label><mixed-citation>
      
Istanbul Metropolitan Municipality (IMM)  and Kandilli Observatory Earthquake Research Institution (KOERI): İstanbul İli Olası Deprem Kayıp Tahminlerinin Güncellenmesi Projesi (Updating The Earthquake Loss Estimation for Istanbul), Istanbul Metropolitan Municipality (IMM)  and Kandilli Observatory Earthquake Research Institution (KOERI), <a href="https://depremzemin.ibb.istanbul/calismalarimiz/tamamlanmis-calismalar/istanbul-ili-olasi-deprem-kayip-tahminlerinin-guncellenmesi-projesi/" target="_blank">https://depremzemin.ibb.istanbul/calismalarimiz/tamamlanmis-calismalar/istanbul-ili-olasi-deprem-kayip-tahminlerinin-guncellenmesi-projesi/</a> (last access: 26 April 2023), 2019.

    </mixed-citation></ref-html>
<ref-html id="bib1.bib62"><label>Kalaycıoğlu et al.(2022)</label><mixed-citation>
      
Kalaycıoğlu, O., Akhanlı, S. E., Menteşe, E. Y., Kalaycıoğlu, M., and Kalaycıoğlu, S.: R Shiny web application, shinyapps.io​​​​​​​ [data set]​​​​​​​, <a href="https://oyakalaycioglu.shinyapps.io/Social_Vulnerability/" target="_blank"/> (last access: 13 June 2023), 2022.

    </mixed-citation></ref-html>
<ref-html id="bib1.bib63"><label>Kalaycioglu et al.(2006)</label><mixed-citation>
      
Kalaycioglu, S., Rittersberger, H., Çelik, K., and Gunes, F.: Integrated natural disaster risk assessment: The socio-economic dimension of earthquake risk in the urban area, in: Geohazards, Okinawa, Japan, 18–21 June 2006, Engineering Conferences International Symposium Series, <a href="http://dc.engconfintl.org/geohazards/23/" target="_blank"/> (last access: 18 December 2023), 2006.

    </mixed-citation></ref-html>
<ref-html id="bib1.bib64"><label>Kim and Lee(2017)</label><mixed-citation>
      
Kim, S. and Lee, W.: Does McNemar's test compare the sensitivities and
specificities of two diagnostic tests?, Stat. Methods Med. Res., 26, 142–154, <a href="https://doi.org/10.1177/0962280214541852" target="_blank">https://doi.org/10.1177/0962280214541852</a>, 2017.

    </mixed-citation></ref-html>
<ref-html id="bib1.bib65"><label>Krishnan et al.(2019)</label><mixed-citation>
      
Krishnan, P., Ananthan, P. S., Purvaja, R., Joyson Joe Jeevamani, J., Amali Infantina, J., Srinivasa Rao, C., Anand, A., Mahendra, R. S., Sekar, I., and Kareemulla, K.: Framework for mapping the drivers of coastal vulnerability and spatial decision making for climate-change adaptation: A
case study from Maharashtra, India, Ambio, 48, 192–212, 2019.

    </mixed-citation></ref-html>
<ref-html id="bib1.bib66"><label>Krzywinski and Altman(2017)</label><mixed-citation>
      
Krzywinski, M. and Altman, N.: Classification and regression trees, Nat. Methods, 14, 757–758, <a href="https://doi.org/10.1038/nmeth.4370" target="_blank">https://doi.org/10.1038/nmeth.4370</a>, 2017.

    </mixed-citation></ref-html>
<ref-html id="bib1.bib67"><label>Kuhn(2008)</label><mixed-citation>
      
Kuhn, M.: Building Predictive Models in R Using the caret Package,
J. Stat. Softw., 1, 1–26, <a href="https://doi.org/10.18637/jss.v028.i05" target="_blank">https://doi.org/10.18637/jss.v028.i05</a>, 2008.

    </mixed-citation></ref-html>
<ref-html id="bib1.bib68"><label>Kuhn and Johnson(2013)</label><mixed-citation>
      
Kuhn, M. and Johnson, K.: Applied predictive modeling, vol. 26, Springer, <a href="https://doi.org/10.1007/978-1-4614-6849-3" target="_blank">https://doi.org/10.1007/978-1-4614-6849-3</a>, 2013.

    </mixed-citation></ref-html>
<ref-html id="bib1.bib69"><label>Lin and Nguyen(2020)</label><mixed-citation>
      
Lin, H.-I. and Nguyen, M. C.: Boosting Minority Class Prediction on
Imbalanced Point Cloud Data, Appl. Sci., 10, 973,
<a href="https://doi.org/10.3390/app10030973" target="_blank">https://doi.org/10.3390/app10030973</a>, 2020.

    </mixed-citation></ref-html>
<ref-html id="bib1.bib70"><label>Liu and Li(2016)</label><mixed-citation>
      
Liu, D. and Li, Y.: Social vulnerability of rural households to flood hazards in western mountainous regions of Henan province, China, Nat. Hazards Earth Syst. Sci., 16, 1123–1134, <a href="https://doi.org/10.5194/nhess-16-1123-2016" target="_blank">https://doi.org/10.5194/nhess-16-1123-2016</a>, 2016.

    </mixed-citation></ref-html>
<ref-html id="bib1.bib71"><label>Llorente-Marrón et al.(2020)</label><mixed-citation>
      
Llorente-Marrón, M., Díaz-Fernández, M., Méndez-Rodríguez, P., and González Arias, R.: Social Vulnerability, Gender and Disasters. The Case of Haiti in 2010, Sustainability, 12, 3574, <a href="https://doi.org/10.3390/su12093574" target="_blank">https://doi.org/10.3390/su12093574</a>, 2020.

    </mixed-citation></ref-html>
<ref-html id="bib1.bib72"><label>Mahbubur Rahman et al.(2023)</label><mixed-citation>
      
Mahbubur Rahman, M., Sadequr Rahman, M., and Jerin, T.: Social vulnerability to earthquake disaster: insights from the people of 48th ward of Dhaka South City, Bangladesh, Environ. Hazards, 22, 116–135, <a href="https://doi.org/10.1080/17477891.2022.2085075" target="_blank">https://doi.org/10.1080/17477891.2022.2085075</a>, 2023.

    </mixed-citation></ref-html>
<ref-html id="bib1.bib73"><label>Maheshwari et al.(2017)</label><mixed-citation>
      
Maheshwari, S., Jain, D. R., and Jadon, D. S.: A Review on Class Imbalance Problem: Analysis and Potential Solutions, International Journal Of Computer Science Issues, 14, 43–51, 2017.

    </mixed-citation></ref-html>
<ref-html id="bib1.bib74"><label>Markoulidakis et al.(2021)</label><mixed-citation>
      
Markoulidakis, I., Rallis, I., Georgoulas, I., Kopsiaftis, G., Doulamis, A.,
and Doulamis, N.: Multiclass Confusion Matrix Reduction Method and
Its Application on Net Promoter Score Classification Problem,
Technologies, 9, 81, <a href="https://doi.org/10.3390/technologies9040081" target="_blank">https://doi.org/10.3390/technologies9040081</a>, 2021.​​​​​​​

    </mixed-citation></ref-html>
<ref-html id="bib1.bib75"><label>Martins et al.(2012)</label><mixed-citation>
      
Martins, V. N., e Silva, D. S., and Cabral, P.: Social vulnerability assessment to seismic risk using multicriteria analysis: the case study of Vila Franca do Campo (São Miguel Island, Azores, Portugal), Nat. Hazards, 62, 385–404, <a href="https://doi.org/10.1007/s11069-012-0084-x" target="_blank">https://doi.org/10.1007/s11069-012-0084-x</a>, 2012.

    </mixed-citation></ref-html>
<ref-html id="bib1.bib76"><label>Mavhura and Manyangadze(2021)</label><mixed-citation>
      
Mavhura, E. and Manyangadze, T.: A comprehensive spatial analysis of social vulnerability to natural hazards in Zimbabwe: Driving factors and policy implications, Int. J. Disast. Risk Re., 56, 102139, <a href="https://doi.org/10.1016/j.ijdrr.2021.102139" target="_blank">https://doi.org/10.1016/j.ijdrr.2021.102139</a>, 2021.

    </mixed-citation></ref-html>
<ref-html id="bib1.bib77"><label>Meade et al.(1970)</label><mixed-citation>
      
Meade, J. E., Wrigley, E. A., Brass, W., Boreham, A. J., Glass, D. V., and
Grebenik, E.: Demography and Economics, Popul. Stud., 24, 25–31,
<a href="https://doi.org/10.2307/2172399" target="_blank">https://doi.org/10.2307/2172399</a>, 1970.

    </mixed-citation></ref-html>
<ref-html id="bib1.bib78"><label>Menardi and Torelli(2014)</label><mixed-citation>
      
Menardi, G. and Torelli, N.: Training and assessing classification rules with
imbalanced data, Data Min. Knowl. Disc., 28, 92–122,
<a href="https://doi.org/10.1007/s10618-012-0295-5" target="_blank">https://doi.org/10.1007/s10618-012-0295-5</a>, 2014.

    </mixed-citation></ref-html>
<ref-html id="bib1.bib79"><label>Menteşe et al.(2019)</label><mixed-citation>
      
Menteşe, E. Y., Kalaycıoğlu, S., Çelik, K., Türkyılmaz, A. S., Çelen, U., Kara, S., Kılıç, O., Baş, M., and Uğur, C.:  Understanding Social Vulnerability Against Disasters in Istanbul, in: Geophysical Research Abstracts, vol. 21, <a href="https://doi.org/10.13140/RG.2.2.28128.64005" target="_blank">https://doi.org/10.13140/RG.2.2.28128.64005</a>, 2019.

    </mixed-citation></ref-html>
<ref-html id="bib1.bib80"><label>Menteşe et al.(2022)</label><mixed-citation>
      
Menteşe, E. Y., Trogrlić, R. Š., Hussein, E., Thompson, H., Öner, E., Yolcu, A., and Malamud, B. D.: Stakeholder Perceptions of Multi-hazards and Implications for Urban Disaster Risk Reduction in Istanbul, EGU General Assembly 2022, Vienna, Austria, 23–27 May 2022, EGU22-10895, <a href="https://doi.org/10.5194/egusphere-egu22-10895" target="_blank">https://doi.org/10.5194/egusphere-egu22-10895</a>, 2022.

    </mixed-citation></ref-html>
<ref-html id="bib1.bib81"><label>Mesta et al.(2022)</label><mixed-citation>
      
Mesta, C., Cremen, G., and Galasso, C.: Urban growth modelling and social
vulnerability assessment for a hazardous Kathmandu Valley, Sci. Rep.​​​​​​​, 12, 1–16, <a href="https://doi.org/10.1038/s41598-022-09347-x" target="_blank">https://doi.org/10.1038/s41598-022-09347-x</a>, 2022.

    </mixed-citation></ref-html>
<ref-html id="bib1.bib82"><label>Mtintsilana et al.(2022)</label><mixed-citation>
      
Mtintsilana, A., Dlamini, S. N., Mapanga, W., Craig, A., Du Toit, J., Ware, L. J., and Norris, S. A.: Social vulnerability and its association with food
insecurity in the South African population: findings from a National
Survey, J. Public Health Pol., 43, 575–592,
<a href="https://doi.org/10.1057/s41271-022-00370-w" target="_blank">https://doi.org/10.1057/s41271-022-00370-w</a>, 2022.

    </mixed-citation></ref-html>
<ref-html id="bib1.bib83"><label>Murru et al.(2016)</label><mixed-citation>
      
Murru, M., Akinci, A., Falcone, G., Pucci, S., Console, R., and Parsons, T.: <i>M</i> ≥ 7 earthquake rupture forecast and time-dependent probability for the sea of Marmara region, Turkey, J. Geophys. Res.-Sol. Ea., 121, 2679–2707, <a href="https://doi.org/10.1002/2015JB012595" target="_blank">https://doi.org/10.1002/2015JB012595</a>, 2016.

    </mixed-citation></ref-html>
<ref-html id="bib1.bib84"><label>Nor Diana et al.(2021)</label><mixed-citation>
      
Nor Diana, M. I., Muhamad, N., Taha, M. R., Osman, A., and Alam, M. M.: Social Vulnerability Assessment for Landslide Hazards in Malaysia: A Systematic Review Study, Land, 10, 315, <a href="https://doi.org/10.3390/land10030315" target="_blank">https://doi.org/10.3390/land10030315</a>, 2021.

    </mixed-citation></ref-html>
<ref-html id="bib1.bib85"><label>Noriega and Ludwig(2012)</label><mixed-citation>
      
Noriega, G. R. and Ludwig, L. G.: Social vulnerability assessment for
mitigation of local earthquake risk in Los Angeles County, Nat.
Hazards, 64, 1341–1355, <a href="https://doi.org/10.1007/s11069-012-0301-7" target="_blank">https://doi.org/10.1007/s11069-012-0301-7</a>, 2012.

    </mixed-citation></ref-html>
<ref-html id="bib1.bib86"><label>Ocal and Senel(2021)</label><mixed-citation>
      
Ocal, M. and Senel, D.: Türkiye’de Kayıt Dışı İstihdamın Bölgesel Analizi, Çalışma ve Toplum Dergisi, 2, 1201–1232, <a href="https://www.calismatoplum.org/makale/turkiyede-kayit-disiistihdamin-bolgesel-analizi" target="_blank"/> (last access: 16 November 2022), 2021.

    </mixed-citation></ref-html>
<ref-html id="bib1.bib87"><label>OECD(2021)</label><mixed-citation>
      
OECD: OECD Economic Surveys: Turkey 2021, OECD Economic  Surveys: Turkey Series, OECD, <a href="https://www.oecd.org/economy/surveys/TURKEY-2021-OECD-economic-survey-overview.pdf" target="_blank"/> (last access: 26 April 2023), 2021.

    </mixed-citation></ref-html>
<ref-html id="bib1.bib88"><label>Parsons(2004)</label><mixed-citation>
      
Parsons, T.: Recalculated probability of <i>M</i> ≥ 7 earthquakes beneath the
Sea of Marmara, Turkey, J. Geophys. Res.-Sol. Ea.,
109, B05304, <a href="https://doi.org/10.1029/2003JB002667" target="_blank">https://doi.org/10.1029/2003JB002667</a>, 2004.

    </mixed-citation></ref-html>
<ref-html id="bib1.bib89"><label>Peek and Stough(2010)</label><mixed-citation>
      
Peek, L. and Stough, L. M.: Children with disabilities in the context of
disaster: A social vulnerability perspective, Child Dev., 81,
1260–1270, <a href="https://doi.org/doi.org/10.1111/j.1467-8624.2010.01466.x" target="_blank">https://doi.org/doi.org/10.1111/j.1467-8624.2010.01466.x</a>, 2010.

    </mixed-citation></ref-html>
<ref-html id="bib1.bib90"><label>Power et al.(2013)</label><mixed-citation>
      
Power, M., Fell, G., and Wright, M.: Principles for high-quality, high-value
testing, Evid. Based Med., 18, 5–10, <a href="https://doi.org/10.1136/eb-2012-100645" target="_blank">https://doi.org/10.1136/eb-2012-100645</a>, 2013.

    </mixed-citation></ref-html>
<ref-html id="bib1.bib91"><label>QGIS Development Team(2021)</label><mixed-citation>
      
QGIS Development Team: QGIS Geographic Information System, Open Source Geospatial Foundation Project, <a href="http://qgis.osgeo.org" target="_blank"/> (last access: 20 December 2022), 2021.

    </mixed-citation></ref-html>
<ref-html id="bib1.bib92"><label>Rabby et al.(2019)</label><mixed-citation>
      
Rabby, Y. W., Hossain, M. B., and Hasan, M. U.: Social vulnerability in the
coastal region of Bangladesh: An investigation of social vulnerability
index and scalar change effects, Int. J. Disast. Risk
Re., 41, 101329, <a href="https://doi.org/10.1016/j.ijdrr.2019.101329" target="_blank">https://doi.org/10.1016/j.ijdrr.2019.101329</a>, 2019.

    </mixed-citation></ref-html>
<ref-html id="bib1.bib93"><label>Ramyachitra and Manikandan(2014)</label><mixed-citation>
      
Ramyachitra, R. and Manikandan, P.: Imbalanced dataset classification and  solutions: a review, International Journal of Computing and Business Research, 5, 1–29, 2014.

    </mixed-citation></ref-html>
<ref-html id="bib1.bib94"><label>R Core Team(2021)</label><mixed-citation>
      
R Core Team: R: A Language and Environment for Statistical Computing, R Foundation for Statistical Computing, Vienna, Austria, <a href="https://www.R-project.org/" target="_blank"/> (last access: 1 April 2023), 2021.

    </mixed-citation></ref-html>
<ref-html id="bib1.bib95"><label>Republic of Türkiye Ministry of Labour and Social Security(2021)</label><mixed-citation>
      
Republic of Türkiye Ministry of Labour and Social Security​​​​​​​: The European Code Of Social Security – Country Report (Article 74), Council of Europe, <a href="https://rm.coe.int/turkey-reportcode-art74-2021/1680a51194" target="_blank"/> (last access: 26 April 2023), 2021.

    </mixed-citation></ref-html>
<ref-html id="bib1.bib96"><label>Roncancio et al.(2020)</label><mixed-citation>
      
Roncancio, D. J., Cutter, S. L., and Nardocci, A. C.: Social vulnerability in
Colombia, Int. J. Disast. Risk Re., 50, 101872, <a href="https://doi.org/10.1016/j.ijdrr.2020.101872" target="_blank">https://doi.org/10.1016/j.ijdrr.2020.101872</a>, 2020.

    </mixed-citation></ref-html>
<ref-html id="bib1.bib97"><label>Rufat et al.(2019)</label><mixed-citation>
      
Rufat, S., Tate, E., Emrich, C. T., and Antolini, F.: How Valid Are
Social Vulnerability Models?, Ann. Am. Assoc. Geogr.​​​​​​​, 109, 1131–1153, <a href="https://doi.org/10.1080/24694452.2018.1535887" target="_blank">https://doi.org/10.1080/24694452.2018.1535887</a>, 2019.

    </mixed-citation></ref-html>
<ref-html id="bib1.bib98"><label>Ryo and Rillig(2017)</label><mixed-citation>
      
Ryo, M. and Rillig, M. C.: Statistically reinforced machine learning for
nonlinear patterns and variable interactions, Ecosphere, 8, e01976,
<a href="https://doi.org/10.1002/ecs2.1976" target="_blank">https://doi.org/10.1002/ecs2.1976</a>, 2017.

    </mixed-citation></ref-html>
<ref-html id="bib1.bib99"><label>Salami et al.(2015)</label><mixed-citation>
      
Salami, R., Von Meding, J., Giggins, H., and Olotu, A.: Disasters,
vulnerability and inadequate housing in Nigeria: A viable strategic
framework, in: 5th International Conference on Building Resilience, Newcastle, Australia, 15–17 July 2015, Proceedings ANDROID Residential Doctoral School, 2015.

    </mixed-citation></ref-html>
<ref-html id="bib1.bib100"><label>Schipper et al.(2016)</label><mixed-citation>
      
Schipper, E. L. F., Thomalla, F., Vulturius, G., Davis, M., and Johnson, K.:
Linking disaster risk reduction, climate change and development,
International Journal of Disaster Resilience in the Built Environment, 7,
216–228, <a href="https://doi.org/10.1108/IJDRBE-03-2015-0014" target="_blank">https://doi.org/10.1108/IJDRBE-03-2015-0014</a>, 2016.

    </mixed-citation></ref-html>
<ref-html id="bib1.bib101"><label>Shen et al.(2018)</label><mixed-citation>
      
Shen, S., Cheng, C., Yang, J., and Yang, S.: Visualized analysis of developing trends and hot topics in natural disaster research, PLOS ONE, 13, e0191250, <a href="https://doi.org/10.1371/journal.pone.0191250" target="_blank">https://doi.org/10.1371/journal.pone.0191250</a>, 2018.

    </mixed-citation></ref-html>
<ref-html id="bib1.bib102"><label>Spielman et al.(2020)</label><mixed-citation>
      
Spielman, S. E., Tuccillo, J., Folch, D. C., Schweikert, A., Davies, R., Wood, N., and Tate, E.: Evaluating social vulnerability indicators: criteria and their application to the Social Vulnerability Index, Nat. Hazards,
100, 417–436, <a href="https://doi.org/10.1007/s11069-019-03820-z" target="_blank">https://doi.org/10.1007/s11069-019-03820-z</a>, 2020.

    </mixed-citation></ref-html>
<ref-html id="bib1.bib103"><label>Stough and Kelman(2018)</label><mixed-citation>
      
Stough, L. M. and Kelman, I.: People with disabilities and disasters, in:
Handbook of disaster research, Springer International Publishing,
Cham, 225–242, <a href="https://doi.org/10.1007/978-3-319-63254-4_12" target="_blank">https://doi.org/10.1007/978-3-319-63254-4_12</a>, 2018.

    </mixed-citation></ref-html>
<ref-html id="bib1.bib104"><label>Syed and Kumar Routray(2014)</label><mixed-citation>
      
Syed, A. and Kumar Routray, J.: Vulnerability assessment of earthquake prone communities in Baluchistan, International Journal of Disaster Resilience in
the Built Environment, 5, 144–162, <a href="https://doi.org/10.1108/IJDRBE-12-2010-0053" target="_blank">https://doi.org/10.1108/IJDRBE-12-2010-0053</a>, 2014.

    </mixed-citation></ref-html>
<ref-html id="bib1.bib105"><label>Tasnuva et al.(2021)</label><mixed-citation>
      
Tasnuva, A., Hossain, M., Salam, R., Islam, A. R. M., Patwary, M. M., and Ibrahim, S. M.: Employing social vulnerability index to assess household social vulnerability of natural hazards: An evidence from southwest coastal Bangladesh, Environ. Dev. Sustain., 23, 10223–10245, 2021.

    </mixed-citation></ref-html>
<ref-html id="bib1.bib106"><label>Tate(2012)</label><mixed-citation>
      
Tate, E.: Social vulnerability indices: a comparative assessment using
uncertainty and sensitivity analysis, Nat. Hazards, 63, 325–347, 2012.

    </mixed-citation></ref-html>
<ref-html id="bib1.bib107"><label>Taubenböck et al.(2006)</label><mixed-citation>
      
Taubenböck, H., Kemper, T., Roth, A., and Voigt, S.: Assessing vulnerability
in Istanbul: An example to support disaster management with remote
sensing at ZKI-DLR, 1–9, ISBN 3-9809030-4-4, 2006.

    </mixed-citation></ref-html>
<ref-html id="bib1.bib108"><label>Turkish Statistics
Institute(2021)</label><mixed-citation>
      
Turkish Statistics Institute: Labour Force Statistics, Tech. rep.,
<a href="https://data.tuik.gov.tr/Bulten/Index?p=Labour-Force-Statistics-February-2021-37487&amp;dil=2" target="_blank"/> (last access: 18 March 2023), 2021.

    </mixed-citation></ref-html>
<ref-html id="bib1.bib109"><label>Turkoglu(2013)</label><mixed-citation>
      
Turkoglu, I.: Sosyal devlet bağlamında Türkiye'de sosyal yardım ve sosyal güvenlik, Akademik İncelemeler Dergisi, 8, 275–305, 2013.

    </mixed-citation></ref-html>
<ref-html id="bib1.bib110"><label>UNDRR(2022)</label><mixed-citation>
      
UNDRR: Global Assessment Report on Disaster Risk Reduction 2022: Our World at Risk: Transforming Governance for a Resilient Future, United Nations Office for Disaster Risk Reduction,   UNDRR, Geneva, Switzerland, <a href="https://www.undrr.org/gar2022-our-world-risk-gar#container-downloads" target="_blank"/>
(last access: 18 March 2023), 2022.


    </mixed-citation></ref-html>
<ref-html id="bib1.bib111"><label>UNISDR Terminology on Disaster Risk Reduction(2015)</label><mixed-citation>
      
UNISDR Terminology on Disaster Risk Reduction​​​​​​​: Sandai Framework for Disaster Risk Reduction 2015–2030, Tech. rep., <a href="https://www.undrr.org/publication/sendai-framework-disaster-risk-reduction-2015-2030" target="_blank"/> (last access: 18 March 2023), 2015.

    </mixed-citation></ref-html>
<ref-html id="bib1.bib112"><label>U.S. Environmental Protection
Agency(2015)</label><mixed-citation>
      
U.S. Environmental Protection Agency: Climate change in the United States – benefits of global action, Tech. Rep. EPA 430-R-15-001, Enviromental Protection Agency, Office of Atmospheric Programs,
<a href="https://www.epa.gov/cira" target="_blank"/> (last access: 20 March 2023), 2015.

    </mixed-citation></ref-html>
<ref-html id="bib1.bib113"><label>Walker et al.(2019)</label><mixed-citation>
      
Walker, T., Kawasoe, Y., and Shrestha, J.: Risk and Vulnerability in Nepal, Risk and Vulnerability Assessment, World Bank,  <a href="https://doi.org/10.1596/33365" target="_blank">https://doi.org/10.1596/33365</a>, 2019.

    </mixed-citation></ref-html>
<ref-html id="bib1.bib114"><label>Wang et al.(2022)</label><mixed-citation>
      
Wang, S., Zhang, M., Huang, X., Hu, T., Sun, Q. C., Corcoran, J., and Liu, Y.: Urban–rural disparity of social vulnerability to natural hazards in
Australia, Sci. Rep.​​​​​​​, 12, 1–15, 2022.

    </mixed-citation></ref-html>
<ref-html id="bib1.bib115"><label>Wang and Sebastian(2021)</label><mixed-citation>
      
Wang, Y. V. and Sebastian, A.: Community flood vulnerability and risk
assessment: An empirical predictive modeling approach, J. Flood
Risk Manage., 14, e12739, <a href="https://doi.org/10.1111/jfr3.12739" target="_blank">https://doi.org/10.1111/jfr3.12739</a>, 2021.

    </mixed-citation></ref-html>
<ref-html id="bib1.bib116"><label>Wang et al.(2021)</label><mixed-citation>
      
Wang, Y. V., Gardoni, P., Murphy, C., and Guerrier, S.: Empirical Predictive Modeling Approach to Quantifying Social Vulnerability to Natural Hazards, Ann. Am. Assoc. Geogr., 111, 1559–1583, <a href="https://doi.org/10.1080/24694452.2020.1823807" target="_blank">https://doi.org/10.1080/24694452.2020.1823807</a>, 2021.

    </mixed-citation></ref-html>
<ref-html id="bib1.bib117"><label>West(2007)</label><mixed-citation>
      
West, A.: Poverty and educational achievement: why do children from low-income families tend to do less well at school?, Benefits: A Journal of Poverty and Social Justice, 15, 283–297, <a href="https://doi.org/10.51952/XLJA4165" target="_blank">https://doi.org/10.51952/XLJA4165</a>, 2007.

    </mixed-citation></ref-html>
<ref-html id="bib1.bib118"><label>Wilson(2019)</label><mixed-citation>
      
Wilson, B. S.: Overrun by averages: An empirical analysis into the
consistency of social vulnerability components across multiple scales,
Int. J. Disast. Risk Re., 40, 101268, <a href="https://doi.org/10.1016/j.ijdrr.2019.101268" target="_blank">https://doi.org/10.1016/j.ijdrr.2019.101268</a>, 2019.

    </mixed-citation></ref-html>
<ref-html id="bib1.bib119"><label>Wisner and Luce(1993)</label><mixed-citation>
      
Wisner, B. and Luce, H. R.: Disaster vulnerability: Scale, power and daily
life, GeoJournal, 30, 127–140, <a href="https://doi.org/10.1007/BF00808129" target="_blank">https://doi.org/10.1007/BF00808129</a>, 1993.

    </mixed-citation></ref-html>
<ref-html id="bib1.bib120"><label>WUP(2023)</label><mixed-citation>
      
WUP: United Nations population estimates and projections of major Urban Agglomerations, World Urbanization Prospects, Tech. rep.,
<a href="https://worldpopulationreview.com/world-cities" target="_blank"/>​​​​​​​ (last access: 17 March 2023), 2023.

    </mixed-citation></ref-html>
<ref-html id="bib1.bib121"><label>Yoon and Jeong(2016)</label><mixed-citation>
      
Yoon, D. K. and Jeong, S.: Assessment of Community Vulnerability to Natural Disasters in Korea by Using GIS and Machine Learning Techniques, in: Quantitative Regional Economic and Environmental Analysis for Sustainability in Korea, Springer,  Singapore, vol. 25, 123–140, <a href="https://doi.org/10.1007/978-981-10-0300-4_7" target="_blank">https://doi.org/10.1007/978-981-10-0300-4_7</a>, 2016.

    </mixed-citation></ref-html>
<ref-html id="bib1.bib122"><label>Yücel and Arun(2010)</label><mixed-citation>
      
Yücel, G. and Arun, G.: Earthquake and Physical and Social Vulnerability Assessment for Settlements: Case Study Avcılar District, Megaron, 5, 23–32, 2010.

    </mixed-citation></ref-html>--></article>
