4 June 2026
13:15 – 14:00
ŠIBENIK VI
Presentation title
Whatever affects one directly, affects all indirectly.
When microdata cannot be publicly disseminated, detailed documentation becomes essential.
Read more
Read less
For machine learning applications in official statistics, transparent dataset description directly supports several quality dimensions: relevance, by clarifying the dataset’s analytical purpose; accuracy and reliability, by enabling replication of preprocessing and modelling procedures; and coherence and comparability, by situating the dataset within established statistical frameworks. This paper therefore provides a structured description of the AIML4OS–WP11-PT dataset, an administrative-based representation of the full population of inter-firm commercial transactions in Portugal for 2022. Although the confidential microdata cannot be shared, documenting its construction is crucial for its responsible use in ML-driven statistical research.
The dataset portrays the Portuguese production and supply-chain system as a directed, weighted network in which firms are nodes and business-to-business transactions form edges. Its comprehensive, threshold-free design enables analysis of how economic linkages emerge, how shocks propagate through production chains, and where systemic vulnerabilities may arise. Such a representation supports both descriptive analyses of economic connectivity and predictive modelling of networked behaviour, offering a foundation that can be compared and adapted across statistical systems.
Because of the high sensitivity of the underlying data, the dataset itself cannot be released. However, its processing algorithms and analytical methods are fully reproducible. This constraint highlights the importance of producing transparent summary network statistics, which convey structural information without revealing confidential firm-level details. Such summaries enhance accessibility and clarity, allowing users to understand the dataset’s key characteristics while respecting confidentiality protections.
The empirical analysis includes standard network measures—degree distributions, reciprocity, clustering coefficients, path lengths, and weighted connectivity—that capture fundamental features of complex systems. These indicators provide interpretable representations of network structure and dynamics. For machine learning, they are particularly valuable in assessing representativeness, sparsity, and potential sources of bias, thereby reinforcing the dataset’s accuracy and reliability even when direct access to microdata is restricted.
To explore structural heterogeneity, seven subnetworks were derived, organised by firm size (DIM), geographical region (NUTS3), and economic activity (NACE). These subnetworks reveal how connectivity patterns vary across sectors and locations, supporting more relevant and targeted analytical applications.
Finally, the characteristics of the AIML4OS–WP11-PT dataset align closely with those observed in other national production networks, such as those of Ecuador and Hungary. This convergence validates the dataset’s structure and enhances the comparability and generalisability of ML models built upon it, enabling the Portuguese case to contribute meaningfully to broader analyses of economic interdependencies.
Alexandre Cunha
Instituto Nacional de Estatística
Read more
Read less
Alexandre Cunha holds a degree in Applied Statistics from Universidade do Minho. Has been working at Statistics Portugal (INE) since 2021, within the Data Collection Department, where he focuses on the processing, integration and analytical use of administrative data in support of official statistics.
CO-AUTHORS:
Presentation title
Strengthening User Trust through Quality Management in Official Statistics: Lessons from Somalia
Building and maintaining user trust in official statistics depends fundamentally on the perceived quality, accessibility, and relevance of data.
Read more
Read less
This paper examines how improvements in statistical quality have contributed to strengthening user trust in Somalia’s official statistics between 2017 and 2025, drawing on global performance metrics and national user feedback. The analysis focuses on evidence from the Open Data Inventory (ODIN), the World Bank’s Statistical Performance Indicators (SPI), and findings from the first and second National User Satisfaction Surveys conducted by the Somalia National Bureau of Statistics (SNBS).
According to the Open Data Inventory 2024, Somalia ranked 157th globally with an overall score of 40 out of 100, reflecting low data coverage (32) and relatively stronger data openness (46) . These results indicate persistent challenges in data completeness and frequency, despite progress in publishing machine-readable and accessible datasets. In contrast, the World Bank’s Statistical Performance Indicators show substantial system-wide progress, with Somalia’s SPI increasing by approximately 147 percent from 19.6 in 2019 to 48.4 in 2022, and further improving to 49.7 in 2023. This improvement reflects gains across the five SPI pillars—data use, services, products, sources, and infrastructure—signaling a maturing national statistical system.
User perceptions provide critical insight into how these quality improvements translate into trust. The first National User Satisfaction Survey found that 55 percent of users were satisfied with the overall quality of official statistics, although satisfaction with timeliness, accuracy, frequency, disaggregation, and coverage remained below 45 percent . The survey also revealed gaps in responsiveness to data requests and limited awareness of official statistics among some users. Findings from the second User Satisfaction Survey indicate increased stakeholder engagement and improved satisfaction with accessibility, usability, and service delivery, particularly for first-time users, while highlighting continued demand for stronger analytical quality and interpretation .
By triangulating international quality indicators with national user feedback, this paper demonstrates that targeted improvements in statistical quality—especially openness, accessibility, and user engagement—are central to building and sustaining trust in official statistics in fragile contexts. The Somalia experience underscores the importance of continuous quality management, transparent communication, and responsiveness to user needs as foundations for credibility and trust in official statistics.
Abdulkadir Gedi
Somali National Bureau of Statistics
Read more
Read less
I am a data science and data management specialist at the Somali National Bureau of Statistics (SNBS), with over eight years of experience in the production, management, and dissemination of official statistics at national and international levels. My work focuses on data quality management, SDG monitoring and reporting, statistical data portals, and the implementation of Statistical Data and Metadata Exchange (SDMX) standards.
I hold a Master of Science (M.Sc.) in Computer Science and Engineering (Data Science) from United International University, Dhaka, Bangladesh. I have completed advanced professional training in machine learning for official statistics, big data analytics for policy planning, data anonymization, spatial data infrastructures, and data leadership through programs delivered by UNSD, WB, ESCWA, IMF, SIAP, ADB, and ILO. My professional interests include strengthening data credibility, transparency, and user trust in fragile and post-conflict statistical systems.
Presentation title
STRENGTHENING DATA QUALITY IN OFFICIAL STATISTICS: THE CASE OF THE TÜRKİYE VIOLENCE AGAINST WOMEN (VAW) SURVEY
In the compilation of official statistics, data collection of high quality is of great importance, notably in the case of socially sensitive issues such as the “Violence Against Women Survey” (VAW) , which used various methodological and quality-enhancing measures throughout the implementation process to maintain accuracy, consistency, and efficiency.
Read more
Read less
This paper examines these measures taken in the official statistics production process and evaluates the overall contribution of this study to improving data quality within the national statistical system.
In the sampling stage, the research was designed to decrease the non-response rate by selecting households with at least one woman aged 15-59, which was the target group of the research. By sampling only from such households, the research minimized unsuccessful interview attempts, increased overall response rates, and helped field teams use their time and resources more effectively.
The length of each interviewer's interview was tracked during the interview process and regularly reported to the Regional Directorates involved in the implementation process to allow irregular interview patterns to be detected early and interventions to be supported when necessary. Interview duration monitoring served as a function of quality control and ensured that interviews were conducted and in accordance with established standards.
In the 2024 survey, consisting of many modules and questions, consistency between questions was monitored throughout the fieldwork using software developed specific to the study, and feedback was sent to Regional Directorates in real time so that corrections could be made. Some basic demographic information obtained from administrative records was previously entered into the CAPI system, reducing data entry mistakes and shortening survey time. The consistency among survey responses and administrative records was then checked to make sure the final set of data was high-quality and reliable.
Eda Evin Şahin
TurkStat
Read more
Read less
I have been working at the Turkish Statistical Institute since 2004 and currently hold the position of Specialist. I am a PhD candidate in Demography, with extensive experience in population registers and household surveys. I have contributed to the implementation of major official statistics projects, most recently the Surveys on Violence Against Women and Crime Victimization. My professional responsibilities include questionnaire design, development of data entry systems, data processing and analysis, preparation of statistical publications, and reporting.
CO-AUTHOR:
Presentation title
"Official Statistics and Digital Society Empowerment for Policy Support and Decision-Making: A Strategic Vision from Egypt’s Statistical Authority Experience"
Official statistics constitute a fundamental pillar for evidence-based policymaking frameworks, particularly amid the rapid transition toward a digital society (CAPMAS, 2024).
Read more
Read less
This study aims to present a strategic vision derived from the experience of the Central Agency for Public Mobilization and Statistics (CAPMAS) in Egypt, seeking to advance official statistics and enhance digital accessibility to support policy formulation and decision-making (CAPMAS, 2025a). The study adopts a descriptive-analytical approach, reviewing official documents, national strategies, and statistical reports, while aligning with international frameworks and standards for statistical quality and digital transformation (UN, 2014; OECD, 2020).
Study results demonstrate that operationalizing this vision fundamentally relies on modernizing the statistical production system and governing digital pathways, while fostering institutional coordination among various government entities and expanding the accessibility of data and metadata through digital platforms to facilitate utilization by policymakers, researchers, and civil society (CAPMAS, 2025). The transition toward cloud-based systems and interactive platforms further enhances the reliability and real-time availability of metadata for decision-makers, researchers, and civil society. Additionally, leveraging administrative records as complementary knowledge assets contributes to streamlining institutional processes and reducing redundancy, thus improving the efficiency of the National Statistical System. Furthermore, aligning statistical indicators with "Egypt Vision 2030" transforms statistics into forward-looking drivers for evaluating public policies and measuring developmental impact (Abdel-Fattah, 2023; El-Saadani, 2022; Youssef, 2021; Egypt Vision 2030, 2016).
The study concludes that empowering the digital society with access to high-quality official statistics, alongside building the digital capacities of both users and practitioners, serves as the cornerstone for transforming data from traditional descriptive outputs into interactive strategic tools that support evidence-based decision-making and sustainable development (UN DESA, 2024; CAPMAS, 2024). This shift toward a 'Data as a Service' (DaaS) model reinforces the Egyptian experience as a benchmark for developing digital statistical systems nationally and internationally, contributing to narrowing the knowledge gap and consolidating good governance in accordance with modern global standards (OECD, 2023; UN-GGIM, 2023; Youssef, 2021).
Yasmin Hassan Ali
Central Agency for Public Mobilization and Statistics (CAPMAS)
Read more
Read less
I am a dedicated First-Degree Statistician and Senior Researcher at the Central Agency for Public Mobilization and Statistics (CAPMAS) in Egypt, holding an MSc in Statistics from Cairo University. My career is driven by a deep passion for advancing statistical education and its pivotal role in modern society. Currently, I serve within the Sustainable Development Unit, where I am responsible for monitoring and calculating SDG indicators according to international methodologies and coordinating with organizations such as UN-ESCWA to develop the 'SDG Observatory.'
My expertise includes field management and data analysis for vital national surveys, such as the Informal Labor Survey, the Agricultural Survey, and the Solid Waste Survey. I have also contributed significantly to the 2nd and 3rd National Statistical Reports for monitoring the 2030 SDGs (2019 & 2023). I aim to bridge the gap between theory and practice, fostering a data-literate society that utilizes information ethically for informed decision-making.