In her article, Krynsky Baal (2021) highlights several persistent challenges to data on forcibly displaced persons, the first of which is the lack of comparability between statistics on forcibly displaced persons both within and across countries. The 2018 International Recommendations on Refugee Statistics (IRRS) and the 2020 International Recommendations on Internally Displaced Persons Statistics (IRIS), both developed by the Expert Groups on Refugee, IDP and Statelessness Statistics (EGRISS), have provided clear definitions for addressing this shortcoming (which are outlined in Part I, Chapter 4).
Second, the common perception of displacement as a temporary or short-lived phenomenon has resulted in forcibly displaced persons being excluded from national development planning and budgeting. This is because most data collection among forcibly displaced persons does not include socio-economic indicators aligned with existing statistical standards which facilitate comparisons with the host population and because refugees, IDPs and stateless individuals are largely omitted from the official numbers of national statistical systems. The Uganda Refugee and Host Communities 2018 Household Survey took measures to address the lack of socio-economic indicators by aligning itself with national survey instruments. The survey demonstrated that while poverty and unemployment rates are generally higher among refugees than among the host population, access to certain resources is sometimes lower among the host population, highlighting that the former should also be included in humanitarian programming (World Bank, 2018).
Third, age-, gender-, disability- and other diversity criteria-disaggregated data needed to inform targeted interventions are often missing. Even the breakdown of refugee and IDP population data by age and sex in the UNHCR Global Trends is based on statistical modelling instead of actual numbers (UNHCR, 2020). Developing sampling strategies for producing overall statistics based on a smaller representative sample of refugees, IDPs or stateless persons (from registration, flow monitoring or geospatial sampling frames) is a way of generating more disaggregated data without having to conduct a complete census (Krynsky Baal, 2021).
Finally, the data landscape in humanitarian contexts is often characterized by challenging operational realities, limited coordination between agencies and poor technical skills. Access to potential respondents is restricted for reasons of insecurity or inadequate infrastructure. Limited coordination between agencies and duplication of data collection efforts may contribute to "survey fatigue” among affected populations or lead to contradictory data making decisionmakers more sceptical of the need to act. Furthermore, data literacy, or the ability to master a basic understanding of data sources and analytical methods, is not widespread among humanitarian actors (Krynsky Baal, 2021).