The leading source of labour statistics

Home > Statistical standards and methods > Concepts and definitions

ILOSTAT database description

ILO Modelled Estimates (ILOEST database)

The ILO modelled estimates series provides a complete set of internationally comparable labour statistics, including both nationally reported observations and imputed data for countries with missing data. The imputations are produced through a series of econometric models maintained by the ILO. The purpose of estimating labour market indicators for countries with missing data is to obtain a balanced panel data set so that, every year, regional and global aggregates with consistent country coverage can be computed. These allow the ILO to analyse global and regional estimates of key labour market indicators and related trends. Moreover, the resulting country-level data, combining both reported and imputed observations, constitutes a unique, internationally comparable dataset of key labour market indicators.

Estimates for countries with very limited labour market information have a high degree of uncertainty. Hence, estimates for countries with limited nationally reported data should not be considered as “observed” data, and great care needs to be applied when using these data for analysis, especially at the country level.

For more information on the ILO modelled estimates, refer to this:

ILO modelled estimates methodological overview

Data collection and evaluation

The ILO modelled estimates are generally derived for 189 countries, disaggregated by sex and age as appropriate. For selected indicators, an additional disaggregation by rural/urban areas is performed. Before running the models to obtain the estimates, labour market information specialists from the ILO Department of Statistics, in cooperation with the Research Department, evaluate existing country‑reported data and select only those observations deemed sufficiently comparable across countries.

The recent efforts by the ILO to produce harmonized indicators from country-reported microdata have greatly increased the comparability of the observations. Nonetheless, it is still necessary to select the data based on the following four criteria: (1) type of data source; (2) geographical coverage; (3) age-group coverage; and (4) presence of methodological breaks or outliers.

Data selection and revision to historical estimates

The ILO maintains a series of econometric models used to produce estimates of labour market indicators for countries and years where country-reported data are unavailable and to produce forecasts (see descriptions below). As in previous years, the ILO modelled estimates have been updated to account for new information. It is important to note that the integration of new information can impact and revise older historical data if newer data are a more trusted type of data source, or it creates methodological breaks. This may lead to the removal of previously included data. Consequently, historical trends reflected in the ILO modelled estimates of November 2024 may be different from those published in November 2023 due to the inclusion of updated data inputs.

An important difference between the ILO modelled estimates of November 2023 and those of November 2024 arises from India’s Periodic Labour Force Survey (PLFS) data. In the November 2023 edition, PLFS data for 2020, 2021, 2022, and the first half of 2023 became available and were included as model inputs, while data from 2018 and 2019 were excluded as they appear to present limited comparability with both the previous NSS results and the newer PLFS results. The PLFS does not strictly follow international labour standards (neither the 13^th nor 19^thICLS resolutions), particularly in applying the priority rule, the one-hour employment criterion, and the structured separation of key criteria like job search and availability. Additionally, the reliance on semi-structured interviews and interviewer judgment increases the potential for inconsistencies, reduces transparency and makes the assessment of data quality more difficult. These factors undermine the PLFS’s comparability both with other countries and across different time periods. To ensure that the trends reflected in the estimates are consistent with the broader patterns observed in the NSS and PLFS datasets, the November 2024 edition adopted a revised approach for the labour force participation rates. This approach incorporates NSS and PLFS data points, excluding data from 2018 and 2019, and applies a smoothing function to reduce year-to-year variability and enhance the international comparability of the estimates. Given the country’s size, this has a sizeable impact on the global aggregates.

Country groupings

The UN does not have a standardized set of regional groupings. Groupings in ILOSTAT are based on the regions used for administrative purposes by the ILO, which may differ from those of other organizations. These usually do not change over time.

ILOSTAT also presents income groupings based on the World Bank’s classification. The world’s economies are assigned to one of four income groups: low, lower-middle, upper-middle, and high-income countries. The classifications are updated each year on July 1 and are based on GNI per capita in current USD of the previous year. Aggregates in the ILO modelled estimates from the November edition reflect the World Bank’s income classifications from July that year. Hence, estimates from different editions will not reflect the same income groupings.

Country, territory and area groupings

See the country groupings by ILO region and World Bank income group for data in ILOSTAT. Regional groups vary across UN organizations despite similar names.

F.A.Q.

Why does the ILO use estimates?

Conducting labour force surveys is a complicated and costly task which some countries are unable to do on a systematic basis. Consequently, significant data gaps remain in most international labour statistics databases. To be able to produce reliable global and regional estimates of key labour indicators, the ILO has developed statistical models that produce estimates for countries in years for which no data have been reported. These models have been tested for statistical accuracy and allow the ILO to forecast changes in key labour market indicators as well as to produce global and regional aggregates. The end result of these models is a complete set of national labour statistics alongside the global and regional aggregates. In the interest of transparency, the ILO publishes the resulting country-level and global and regional estimates in the ILO modelled estimates series.

What kind of figures are used in your estimates?

Not all countries submit statistically comparable data. Before running the models to obtain the estimates, ILO labour market information specialists evaluate country-reported data and select only those observations deemed sufficiently comparable across countries. The recent efforts by the ILO to produce harmonized indicators from country-reported microdata have greatly increased the comparability of the observations. Nonetheless, it is still necessary to select the data on the basis of the following four criteria: (a) type of data source; (b) geographical coverage; (c) age-group coverage; and (d) presence of methodological breaks or outliers.

Our models also include country-level data on population, economic growth, poverty and other economic indicators from the following sources:

United Nations World Population Prospects
IMF/World Bank data on macroeconomic indicators
World Bank poverty estimates from the PovcalNet database

How do you calculate these estimates?

The estimates are produced using a series of models, which establish statistical relationships between observed labour market indicators and explanatory variables. These relationships are used to impute missing observations and to make projections for the indicators.

There are many potential statistical relationships, also called “model specifications” that could be used to predict labour market indicators. The key to obtaining accurate and unbiased estimates is to select the best model specification in each case. The ILO modelled estimates generally rely on a procedure called cross-validation, which is used to identify those models that minimize the expected error and variance of the estimation. This procedure involves repeatedly computing a number of candidate model specifications using random subsets of the data: the missing observations are predicted and the prediction error is calculated for each iteration. This makes it possible to identify the statistical relationship that provides the best estimate of a given labour market indicator.

Your estimates about a specific country are different from the ones on its official statistics website. Why is that?

The aim of the ILO modelled estimates is to provide a complete set of internationally comparable labour statistics, without missing observations, across countries and over time. To achieve this, a number of harmonisation and modelling procedures are applied, which can result in differences between ILO estimates and figures published in national statistical sources. The most common sources of these differences are the following:

Benchmarking the working-age population to the estimates of the United Nations World Population Prospects. Many labour market indicators are estimated as rates or ratios, such as labour force participation rates or unemployment rates. These estimated rates are then combined with population data from the United Nations World Population Prospects to derive the corresponding levels. As a result, even when estimated rates are similar to nationally reported rates, the corresponding levels may differ due to differences between national population estimates and the United Nations population benchmarks used by the ILO.

Application of international statistical standards. The ILO applies the standards adopted by the International Conference of Labour Statisticians to ensure international comparability. National estimates may be based on definitions, concepts or measurement practices that differ from these standards, leading to differences with ILO modelled estimates.

Adjustment of classifications for standardization purposes. Data may be adjusted to account for differences in classifications or coverage across countries, such as differences in age coverage, sectoral classifications or population scope. For example, a national statistical office may report total labour force participation rates for ages 15 to 64, while the ILO produces totals for persons aged 15 and over. In such cases, participation rates for ages 65 and over are estimated, and the resulting total may differ from that published nationally.

Internal consistency procedures. Each edition of the ILO modelled estimates is internally consistent by construction, ensuring that components, such as sex or age groups, add up to the corresponding totals across all related indicators. These normalization procedures may result in differences compared with figures published by national sources.
Mitigation of breaks in time series. To ensure comparability over time, adjustments may be applied to national data to account for changes in survey methodology, coverage or other structural breaks in the underlying series. Such adjustments can result in differences between ILO estimates and nationally reported figures.

What do time stamps like “Nov. 2025” or “Nov. 2024” mean?

The ILO modelled estimates cover a wide variety of indicators. Hence, input updates and methodological improvements are implemented in a staggered manner. The time stamp indicates the production date of the estimates, also referred to as the edition. The production date is important because it indicates approximately the cut-off date for inclusion of nationally reported observations as input into the models. Additionally, estimates with the same production date have undergone normalization to ensure that they are internally consistent. For instance, the sum of employment across all economic sectors will equal the sum across all occupations. Nonetheless, for estimates with different production dates, this will not be the case.

Do you ever revise your model?

We are constantly improving the ILO modelled estimates. Revisions usually happen for one of three reasons:

Countries make new data available. The ILOSTAT database is constantly updated as new national labour statistics become available. In some cases, this may only happen after a significant delay, requiring the ILO to replace estimates for that year with the reported statistics.
Revisions are made to other databases used by our statistical model. The ILO’s econometrics models use databases maintained by other international organizations such as the UN’s World Population Prospects and the IMF’s World Economic Outlook. These databases are periodically subject to their own revisions, which can lead to revisions in the ILO modelled estimates.
Historical data needs to be revised. Periodically, data from prior years needs to be revised as new information emerges.

How can I cite this dataset?

Please see different options on our dissemination and analysis page.

Labour market indicators

Labour market indicators are estimated using a series of models that establish statistical relationships between observed labour market indicators and explanatory variables. These relationships are used to impute missing observations and to make projections for the indicators.

There are many potential statistical relationships, also called “model specifications”, that could be used to predict labour market indicators. The key to obtaining accurate and unbiased estimates is to select the best model specification in each case. The ILO modelled estimates generally rely on a procedure called “cross-validation”, which is used to identify those models that minimize the expected error and variance of the estimation. This procedure involves repeatedly computing a number of candidate model specifications using random subsets of the data: the missing observations are predicted and the prediction error is calculated for each iteration. Each candidate model is assessed based on the pseudo-out-of-sample root mean square error, although other metrics such as result stability are also assessed depending on the model. This makes it possible to identify the statistical relationship that provides the best estimate of a given labour market indicator. It is worth noting that the most appropriate statistical relationship for this purpose may differ according to country.

The benchmark for the ILO modelled estimates is the 2024 Revision of the United Nations World Population Prospects, which provides estimates and projections of the total population broken down into five-year age groups. The working-age population comprises everyone who is at least 15 years of age. Although the same basic approach is followed in the models used to estimate all the indicators, there are differences between the various models because of specific features of the underlying data. Further details are provided for each model in this methodological description, while an overview is provided below.

Conflict countries

Within the series of econometric models used to produce estimates of labour market indicators in the countries and years for which country-reported data are unavailable and to produce forecasts, the ILO includes an econometric model for countries during years of conflict. The econometric model measures the elasticity of the target variable of interest and employment and GDP per capita (during 2020, a period of severe supply and demand shocks) for all countries with available data. The model then uses these estimated elasticities to reflect changes in the target variable using changes in employment and GDP per capita during conflict years. An example of this methodology applied to Ukraine can be found in the ILO Monitor on the world of work, Tenth edition. Given the exceptional situation, including the scarcity of relevant data, the estimates for countries in years of conflict are subject to exceptionally high uncertainty.

Labour force, employment structure and labour underutilization

To track the participation in the labour market of the working-age population, estimates of the labour force are produced, disaggregated by sex and age. The labour force measures active participation in the labour market: the sum of persons employed and the unemployed. To analyse the employment structure, the distribution of employment as a function of four different breakdowns is estimated: employment status, economic activity (sector), occupation, economic class (working poverty), and informality. To measure labour underutilization, there are numerous series available disaggregated by sex and age: unemployment rate (LU1), labour underutilization rates (LU2 and LU4), the NEET rate (youth not in employment, education or training), time-related underemployment rate, and others. For some of the indicators described a breakdown by rural/urban areas is produced. Projections are only available for selected indicators. Moreover, for the estimates of informality, LU3, and the jobs gap, only regional and global aggregate estimates are available.

Hours worked

The ILO nowcasting model pertaining to changes in quarterly hours worked during the pandemic has now been replaced with a yearly model of hours worked. This new series of hours related indicators includes total weekly hours worked of employed persons, the ratio of total weekly hours worked to population, mean weekly hours actually worked per employed person, and the number of full-time equivalent jobs. Specifically, the series are:

Total weekly hours worked by employed persons.
Mean weekly hours actually worked per employed person.
Ratio of total weekly hours worked to population aged 15-64.
Number of full-time equivalent jobs (assuming 40 or 48 workweek hours). This measure is constructed by dividing the total number of weekly hours actually worked by 40 or 48.

Labour income

The dataset covers 189 countries as well as global and regional aggregates. The data are based on the ILO Harmonized Microdata collection. To produce consistent time series for all countries, statistical models are used to extrapolate and impute missing data points. The dataset contains two key indicators: the labour income share and the labour income distribution, following the recommendation of the ILO Global Commission on the Future of Work to develop new distributional indicators. Furthermore, the new internationally comparable labour share data will be used to monitor progress towards the United Nations’ Sustainable Development Goals.

Wages

The methodology to estimate global and regional wage trends was developed by the ILO for the previous editions of the Global Wage Report (GWR) in collaboration between technical departments and the Department of Statistics, following four peer reviews conducted by five independent experts. The appendix of the GWR describes the methodology adopted as a result of this process.

Global estimates on wages are not published on ILOSTAT.

International migrant workers

The fourth edition of the ILO Global Estimates on International Migrant Workers presents the most recent estimates on the stock of international migrant workers, disaggregated by age, sex, country-income group and region, and the estimation methodology. The reference year is 2022. The report predates the onset of the COVID-19 crisis, which has affected the magnitude and characteristics of international labour migration.

Global estimates on labour migration are not published in ILOSTAT databases but are available in Excel format from the topic page on migrant workers.

Child labour

The current (sixth) edition of the Global Estimates of Child Labour provides updated estimates for 2020 and has been produced for the first time in partnership with UNICEF. The ILO-UNICEF estimates are based on the international standards concerning statistics on child labour, which were adopted by the 20th International Conference of Labour Statisticians (ICLS) in October 2018. These standards outline statistical definitions of child labour and its components, hazardous work by children and the worst forms of child labour other than hazardous work. To gauge trends in child labour and other related indicators at the regional and global levels, a series of econometric models were developed to account for the non-randomness in missing data. These efforts improve the accuracy of the estimates and also ensure replicability of the estimation process, thereby facilitating updates and the development of subsequent global estimates. A report presents the methodological protocols used for the development of the 2020 ILO-UNICEF Global Estimates of Child Labour.

Publications

Note: Many publications are available only in English. If available in other languages, a new page will open displaying these options.

Estimation methodology – ILO Modelled Estimates of International Migrants in the Labour Force

February 20, 2026

This note presents an overview of the technical foundations of the estimation methodology. It describes the models used, outlines the sources of input data and summarizes the approaches applied to assess the quality of the estimates. Where available, data based on foreign-born population as international migrants were used as input in the models.

ILO modelled estimates methodological overview

January 25, 2023

Employment and economic class in the developing world

June 19, 2013

This paper introduces a model for generating national estimates and projections of the distribution of the employed across five economic classes for 142 developing countries over the period 1991 to 2017. The national estimates are used to produce aggregate estimates for eight developing regions and for the developing world as a whole.

ILO Modelled Estimates (ILOEST database)

ILO modelled estimates methodological overview

Data collection and evaluation

Data selection and revision to historical estimates

Country groupings

Country, territory and area groupings

F.A.Q.

Labour market indicators

Conflict countries

Labour force, employment structure and labour underutilization

Hours worked

Labour income

Wages

International migrant workers

Child labour

Publications

Estimation methodology – ILO Modelled Estimates of International Migrants in the Labour Force

ILO modelled estimates methodological overview

Employment and economic class in the developing world

Related pages

International Classifications of Status in Employment and Status at Work (ICSE and ICSaW)

International Standard Industrial Classification of All Economic Activities (ISIC)

Country, territory and area groupings

International Standard Classification of Occupations (ISCO)

Work Statistics – 19th ICLS (WORK database)

Labour Force Statistics (LFS and RURBAN databases)

Prices, Costs and Currency Conversions (PRICES database)

Worker and sector profiles (PROFILES database)

Statistics on youth