Entrepreneurs International (Compendia)

 General   Dimensions 
 Sources   Data Overview 
 Variable(s)   Missing Data 
 Variable Description(s)   Additional information

General

The data set Entrepreneurs International (Compendia) contains harmonized data on 18 variables for 30 OECD countries starting with the year 1970. The alternative name COMPENDIA stands for Comparative Entrepreneurship Data for International Analysis. The figures in this set are comparable across countries and over time.

Sources

The main data sources are:

  • OECD National Accounts
  • OECD Labour force statistics
  • OECD Economic Outlook
  • OECD Employment and Labour Market Statistics
  • OECD Main Economics Indicators
  • OECD Revenue Statistics: Comparative tables
  • OECD Science, Technology and R&D Database
  • OECD - Tax-benefit models
  • OECD Employment Outlook june 2000
  • LO Yearbook of labour statistics
  • The Wordl Bank statistical database 'EdStats'
  • The European Observatory for SME's: sixth report

Variable(s)

The dataset contains 18 variables:

  • business owners (total private sector; private sector excluding agriculture, hunting forestry and fishing; and agriculture, hunting forestry and fishing)
  • total labour force
  • business ownership rate (total private sector; private sector excluding agriculture, hunting forestry and fishing; and agriculture, hunting forestry and fishing)
  • gross domestic product (GDP)
  • population
  • gross domestic product per capita
  • harmonised unemployment rate
  • employment
  • share of services in total employment
  • labour productivity
  • femal labour share
  • population density
  • total tax revenue as a percentage of GDP
  • gross domestic expenditure on RenD as a percentage of GDP
  • gross Replacement rate
  • gross enrollment rate primary education
  • gross enrollment rate secundary education
  • gross enrollment rate tertiary education

Variable Description(s)

The variables can be described in the following way:

Business owners (self-employed)
The dataset contains the number of business owners by country (expressed in thousands of persons), separated for the total private sector, the private sector ex-cluding agriculture, hunting forestry and fishing, and agriculture, hunting forestry and fishing. Only persons who are self-employed as their main occupation are in-cluded in the figures. The OECD Labour Force Statistics form the main data source. EIM completed the missing data by using ratios derived from various other sources. Furthermore, EIM made a unified dataset of business owners as the definitions of business owners or self-employed (these terms are used interchangeably) in the OECD statistics are not fully compatible between countries. In some countries, business owners are defined as individuals owning a business that is not legally in-corporated. In other countries, owner/managers of an incorporated business (OMIBs) who enjoy profits as well as a salary are also considered as self-employed. There are also countries that classify a part of the OMIBs as self-employed and an-other part as employee. This results from a different set-up of labour force surveys in different countries; see Chapter 5 of OECD Employment Outlook June 2000. By and large, Australia, Japan, Norway and the United States use a narrow business ownership definition (excluding OMIBs or excluding most OMIBs), while the other countries apply a broader characterization (including OMIBs or including most OMIBs). Business owners in the dataset are defined to include OMIBs. For the coun-tries not following this definition, EIM made an estimation of the number of OMIBs using information derived from The European Observatory for SMEs: Sixth Report (published in 2000 by EIM), or using information from domestic sources for the non-European countries (for the United States, sources include The State of Small Business; a Report of the President 1996). Another difference in definition is that for some countries, unpaid family workers are included in the self-employment data as well, mostly for early years. For these years, the unpaid family workers were removed from the database by using ratios from more recent years for which sepa-rate data on unpaid family workers are available. Finally, for countries where im-portant unclarified trend breaks occur, these trend breaks were corrected for. For more information, see Research report H200302 and H201019.
Data are missing for Czech Republic, Hungary and Slovak Republic (1972-1988), Korea (1972-1979), Mexico (1972-1990), Poland (1972-1980), and Turkey (1972-1987). In the agricultural sector – and hence in the total private sector as well – data are also missing for Czech Republic (1989-1992), Hungary and Slovak Repub-lic (1989-1993) and Korea (1980).

Total labour force.
Data on total labour force, expressed in thousands of persons, are also from OECD Labour Force Statistics. Again, some missing data have been filled up from various other sources. Total labour force consists of employees, self-employed persons (including OMIBs), unpaid family workers, people employed by the Army and unemployed persons.
Data are missing for Czech Republic, Hungary and Slovak Republic (1972-1988), Korea (1972), Mexico (1972-1990) and Poland (1972-1980).

Business ownership rate.
This is the number of business owners divided by total labour force. This variable is available for the total private sector, the private sector excluding agriculture, hunt-ing forestry and fishing, and agriculture, hunting forestry and fishing.
Data are missing for Czech Republic, Hungary and Slovak Republic (1972-1988), Korea (1972-1979), Mexico (1972-1990), Poland (1972-1980), and Turkey (1972-1987). In the agricultural sector – and hence in the total private sector as well – data are also missing for Czech Republic (1989-1992), Hungary and Slovak Repub-lic (1989-1993) and Korea (1980).

Gross domestic product.
This variable is taken from OECD National Accounts. GDP is measured in millions of US $ at constant prices of 2000. Furthermore, purchasing power parities of 2000 are used to make the monetary units comparable between countries.
Data are missing for Czech Republic and Poland (1970-1989), Hungary (1970-1990), and Slovak Republic (1970-1991).

Population.
The population of a country, expressed in thousands of persons. The main source for this variable is OECD Labour Force Statistics.
Data are missing for Czech Republic, Hungary, Korea, Mexico, Poland and Turkey (1970-1972), and Slovak Republic (1970-1977).

Gross domestic product per capita.
This is Gross Domestic Product divided by population.
Data are missing for Czech Republic and Poland (1970-1989), Hungary (1970-1990), Korea, Mexico and Turkey (1970-1972), and Slovak Republic (1970-1991).

Harmonised Unemployment rate.
This variable measures the number of unemployed as a fraction of the labour force. The main source for this variable is OECD Main Economic Indicators. Some missing data on the number of unemployed have been filled up with help of data from the OECD Labour Force Statistics and the Yearbook of Labour Statistics from the Inter-national Labour Office.
Data are missing for Czech Republic and Poland (1970-1989), Hungary (1970-1991), Mexico (1970-1986), and Slovak Republic (1970-1993).

Employment.
This variable, expressed in thousands of persons, is computed as total labour force multiplied by the complement of the harmonised unemployment rate.
Data are missing for Czech Republic and Poland (1972-1989), Hungary (1972-1991), Korea (1972), Mexico (1972-1990), and Slovak Republic (1972-1993).

Share of services in total employment.
This variable measures the share of services in total employment. The main source for this variable is OECD Employment and Labour Market Statistics, which provides data on employment in the services sector, which in Compendia is broadly defined as including Wholesale and retail trade, restaurants and hotels; Transport, storage and communication; Financing, insurance, real estate and business services, and Community, social and personal services. To obtain the share of services in total employment, employment in services is divided by employment.
Data are missing for Czech Republic (1972-1989), Hungary (1972-1991), Korea (1972), Mexico (1972-1990), Poland (1972-1992), and Slovak Republic (1972-1993).

Labour productivity.
This is Gross Domestic Product divided by employment.
Data are missing for Czech Republic and Poland (1972-1989), Hungary (1972-1991), Korea (1972-1979), Mexico (1972-1990), Slovak Republic (1972-1993), and Turkey (1972-1987).

Female Labour share.
This is the share of women in the total labour force. The variable is derived from OECD Labour Force Statistics.
Data are missing for Czech Republic (1970-1989), Hungary (1970-1991), Korea (1970-1980), Mexico (1970-1990), Poland (1970-1991), Slovak Republic (1970-1993), and Turkey (1970-1987).

Population density.
This is population divided by surface.
Data are missing for Czech Republic, Hungary, Korea, Mexico, Poland and Turkey (1970-1972), and Slovak Republic (1970-1977).

Total tax revenue as a percentage of GDP
This variable measures total tax revenue as a percentage of GDP. Data are taken from OECD Revenue Statistics: Comparative tables. Some missing data on total tax revenue as a percentage of GDP have been interpolated.
Data are missing for The Netherlands, Portugal, Japan, Australia and Poland (2009) Czech Republic (1970-1992), Hungary and Poland (1970-1990), Korea (1970-1971), Mexico (1970-1979), and Slovak Republic (1970-1987).

Gross domestic expenditure on RenD as a percentage of GDP
This variable measures gross domestic expenditure on R&D as a percentage of GDP. Data on gross domestic expenditure on R&D (in national currency) are taken from the OECD Science, Technology and R&D database, and data on GDP (in na-tional currency) are taken from the OECD Economic Outlook No. 88. To obtain gross domestic expenditure on R&D as a percentage of GDP, gross domestic expenditure on R&D is divided by GDP. Some missing data on gross domestic expenditure on R&D as a percentage of GDP have been interpolated.
Data are missing for all countries prior to 1981. In addition, data are missing for Greece and Mexico (2008-2009), Iceland, Switzerland, United States, Japan, Aus-tralia and Korea (2009), Belgium (1981-1982), Portugal (1981), Luxembourg (1981-1999), Czech Republic, Mexico and Slovak Republic (1981-1992), Hungary and Korea (1981-1990), and Poland and Turkey (1981-1989).

Gross replacement rate
This variable measures gross unemployment benefit replacement rates, as reflected by the OECD summary measure of benefit entitlements. Data for uneven years are taken from the OECD Tax-Benefit Models www.oecd.org/els/social/workincentives. Data on gross replacement rate for even years have been interpolated.
Data are missing for all countries in 2008-2009. In addition, data are missing for Luxembourg, Iceland, Czech Republic, Hungary, Korea, Poland and Slovak Republic (1970-2000), and Turkey (1970-2004). No data is available for Mexico.

Gross enrollment rate primary education.
This variable measures total gross enrollment rates (%) in primary education. The source for this variable is The World Bank statistical database ‘EdStats’. Some miss-ing data on gross enrollment rate primary education have been interpolated.
Data are missing for all countries in 1970 (except for New Zealand) and 2009. In addition, data are missing for Greece (2008), Canada (2007-2008), Germany (1971-1990), Canada (1971), Australia (1971-1974), and Slovak Republic (1971-1992).

Gross enrollment rate secundary education
This variable measures total gross enrollment rates (%) in secondary education. The source for this variable is The World Bank statistical database ‘EdStats’. Some missing data on gross enrollment rate secondary education have been interpolated.
Data are missing for all countries in 1970 (except for New Zealand) and 2009. In addition, data are missing for Greece (2008), Canada (1971 and 2007-2008), Ger-many (1971-1990), and Australia and Slovak Republic (1971-1992).

Gross enrollment rate tertiary education
This variable measures total gross enrollment rates (%) in tertiary education. The source for this variable is The World Bank statistical database ‘EdStats’. Some miss-ing data on gross enrollment rate tertiary education have been interpolated.
Data are missing for all countries prior to 1975 (except for Australia and New Zea-land) and 2009. In addition, data are missing for Germany (1975-1990 and 1998-2008), Greece (2008), Luxembourg (2007-2008), Canada (1975-1989 and 2005-2008), and Slovak Republic (1975-1992).

Dimensions

Countries
The dataset contains 30 OECD-countries.

Austria

Germany

The Netherlands

Iceland

Canada

Belgium

Greece

Portugal

Norway

Australia

Denmark

Ireland

Spain

Switzerland

New Zealand

Finland

Italy

Sweden

United States

Mexico

France

Luxembourg

United Kingdom

Japan

Korea

Hungary

Czech Republic

Slovak Republik

Turkey

Poland

Size-class distribution
The dataset has no size-class distribution.

Sectors
In general, the dataset has three complementary sectors:
- agriculture, hunting, forestry and fishing (=agriculture)
- private sector excluding agriculture
- government
as well as total, which is the aggregate of the three sectors mentioned above.

For the variables business owners and business ownership rate, the dataset distinguishes between the sectors:
- agriculture, hunting, forestry and fishing
- private sector excluding agriculture

Years
Data are available from 1970 onwards.
For some variables data are available from 1972 onwards. Other exceptions are mentioned in the variable descriptions above.