Structure of Earnings Survey

Statistical domain

Structure of Earnings – national level, 4-year periodicity

ESS Standard for Quality Reports Structure (ESQRS)

Structure of earnings survey - average hourly earnings, average monthly earnings, average annual earnings, employees, paid hours, days of annual paid leave - national and regional (statistical zones) levels

Contact

Contact organisation

National Statistical Institute

Contact organisation unit

Statistics on Labour costs, Research and Development, Innovation and Information Society Department

Contact name

Todor Davidkov

Contact person function

head of unit

Contact mail address

2, P. Volov Str.; 1038 Sofia, Bulgaria

Contact email address

[email protected]

Contact phone number

+35929857568

Contact fax number

Statistical presentation

Data description

The Structure of Earnings Survey aims to provide detailed information, comparable at European Union level, on the distribution of, and relationships between, the level of remuneration, the individual characteristics of employees and those of their employer.

Objects of the survey are: characteristics of employer (enterprise, local unit) - number of employees in the local unit, type of ownership, existence and type of collective pay agreement, size of the enterprise; individual characteristics of employees - age, sex, educational level, occupation, length of service, mode of employment (full-time/part-time), type of employment contract, annual gross earnings, annual bonuses, annual payments in kind, annual days of paid holiday leave, monthly gross earnings, earnings related to overtime, earnings related to shift work, employee's compulsory social security contributions and income tax, number of paid monthly hours, number of paid overtime hours.

Classification system

· Classification of Economic Activities (CEA-2008, for international use NACE.BG 2008);

· National Classification of Occupations and Duties-2011 (NCOD-2011) - Data were collected under NCOD-2005, which was the classification in force in 2010, and were then recoded into NCOD-2011 to comply with the international classification ISCO-08, as required by Commission Regulation (EC) No 1738/2005.

· Nomenclature of Educational Levels - in compliance with the International Standard Classification of Education, 1997 version (ISCED 1997);

· Classification of Territorial Units for Statistical Purposes in Bulgaria - NUTS.

Sector coverage

Enterprises with 1 or more employees in economic activity within sections B to S of NACE.BG-2008 are covered.

Statistical concepts and definitions

Employees are all persons who have a direct employment contract with their employer and receive remuneration in cash or in kind for certain quality and quantity of work done, irrespective of the type of work performed, the number of working hours (full or part-time) and the duration of the employment contract (fixed or indefinite).

Gross earnings are the remuneration in cash paid to the employee directly and regularly by the employer at the time of each pay period, before deductions of any tax and social security contributions payable by employee and withheld by the employer.

The gross monthly earnings of employees include:

basic wage or salary for work done or time worked in the reference month;
earnings related to annual paid holiday leave and other periods of absences paid for entirely by the employer at a full rate;
payment for overtime work;
special payment for shift work - night shifts, week-end or public-holiday shifts;
payments for seniority, harmful and other specific working conditions, high personal qualification or other collectively or individually agreed additional payments;
bonuses and gratuities paid regularly in each pay period, even if the amount varies from month to month.

The annual gross earnings are the total amount of the regular payments in cash received by the employee for the work performed during the reference year, including:

· the value of annual payments in kind (goods and services) made available to employees by employer;

· all irregular payments as quarterly bonuses, 13th or 14th salaries and other gratuities not received at each pay period.

Paid hours cover the total number of normal and overtime hours to which the gross monthly earnings in the reference month relate. The number of paid hours includes: actually worked normal hours, worked and paid overtime hours, hours not worked but nevertheless paid by the employer at a full rate (annual leave, work stoppages and other hours paid such as for medical examinations).

Statistical unit

Local units (territorial structures) with 1 or more employees belonging to enterprises with 1 or more employees.

Statistical population

The structure of earnings statistics relates to enterprises with at least one employee in economic activities within sections B to S of NACE.BG-2008, including section O "Public administration".

Reference area

Area of Republic of Bulgaria.

Time coverage

2002, 2006, 2010, 2014, 2018, 2022

Base period

Not applicable.

Statistical processing

Source data

Source of the data is a sampling statistical survey. The sampling procedure used for the SES contains two stages. In the first stage, a stratified random sample of local units without replacement is drawn. Stratification criteria used include:

· economic activity – divisions (2-digit level) of NACE.BG 2008;

· the number of employees in the local unit: 1 to 9 employees; 10 to 49 employees; 50 to 249 employees; 250 to 499 employees; 500 to 999 employees; 1000 and more employees;

· regional breakdown – level 1 of national Classification of Territorial Units for Statistical Purposes in Bulgaria, in force since 2009:

- BG3 - Severna i Yugoiztochna Bulgaria

- BG4 – Yugozapadna i Yuzhna Tsentralna Bulgaria.

At the second stage, a systematic sample of employees is taken within each of the selected local units.

At the first sampling stage, 19 182 local units were selected (9.1% of the total population). Of these, 17 331 units responded (90% of the sample) and provided data for approximately 217 500 employees (9.2% of the total population).
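
The two-stage procedure described above can be illustrated with a minimal Python sketch. It is not the NSI implementation: the record layout (`id`, `stratum`, `employees`), the fixed number of units per stratum and the fixed number of employees per unit are all assumptions made for the example.

```python
import random
from collections import defaultdict

def two_stage_sample(local_units, units_per_stratum, employees_per_unit, seed=0):
    """Stage 1: random sample of local units without replacement in each stratum.
    Stage 2: systematic sample of employees within each selected unit.
    All parameter names and the record layout are hypothetical."""
    rng = random.Random(seed)
    strata = defaultdict(list)
    for unit in local_units:
        strata[unit["stratum"]].append(unit)

    selected = {}
    for units in strata.values():
        n_units = min(units_per_stratum, len(units))
        for unit in rng.sample(units, n_units):      # without replacement
            employees = unit["employees"]
            k = min(employees_per_unit, len(employees))
            if k == 0:
                continue                             # nothing to sample
            step = len(employees) / k                # sampling interval
            start = rng.random() * step              # random start in [0, step)
            selected[unit["id"]] = [
                employees[int(start + i * step)] for i in range(k)
            ]
    return selected
```

In this sketch a stratum would correspond to one cell of economic activity, size class and statistical zone; the real sampling plan allocates different sample sizes to different cells.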

Frequency of data collection

Once per four years.

Data collection

Data are collected through a tailor-made questionnaire consisting of part A, which collects information on the sampled local units, and part B, which collects information on each sampled employee. The questionnaire is accompanied by a dispatch note to each respondent about the purposes of the survey, explanatory notes on sampling employees within the local units, and instructions on the information required. The SES questionnaire was developed in paper and electronic format and uploaded to the official web page of NSI. The Head office of NSI provides methodological assistance to the respondents and to the Regional Statistical Offices (RSOs) on completing and processing the information.

Data validation

Approximately 100 controls are applied to the micro data to validate completeness, plausibility of values, and logical and arithmetic coherence between the collected variables. Data are checked at three levels: (1) during the initial data entry, in the electronic questionnaire by respondents and by the RSOs from paper questionnaires; (2) at the RSOs, on the joint regional micro data set; (3) at NSI level, on the joint national micro data set. Besides micro data validation, survey results are compared with other sources of similar information.
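
The kinds of controls named above (completeness, plausibility, logical and arithmetic coherence) might look like the following Python sketch. The field names, the age bounds and the four specific rules are assumptions chosen for illustration; they are not the actual list of about 100 controls.

```python
def validate_record(rec):
    """Return the list of failed checks for one employee record (a dict).
    Field names and thresholds are hypothetical."""
    errors = []

    # Completeness: key variables must not be blank.
    required = ["sex", "age", "occupation", "paid_hours", "gross_monthly_earnings"]
    for field in required:
        if rec.get(field) in (None, ""):
            errors.append(f"missing:{field}")

    # Plausibility: assumed bounds for the age of an employee.
    age = rec.get("age")
    if age is not None and not 15 <= age <= 90:
        errors.append("implausible:age")

    # Logical coherence: paid hours include paid overtime hours.
    hours = rec.get("paid_hours") or 0
    overtime = rec.get("overtime_hours") or 0
    if overtime > hours:
        errors.append("overtime_hours>paid_hours")

    # Arithmetic coherence: gross monthly earnings include overtime and shift pay.
    gross = rec.get("gross_monthly_earnings") or 0
    parts = (rec.get("overtime_earnings") or 0) + (rec.get("shift_earnings") or 0)
    if parts > gross:
        errors.append("components>gross_monthly_earnings")

    return errors
```

The two coherence rules follow from the definitions of paid hours and gross monthly earnings given under "Statistical concepts and definitions".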

Data compilation

The data processing goes through the following stages:

· entry of the initial information into electronic format;

· data validation;

· data editing and imputation on the base of additional information from respondents and/or other statistical and administrative sources;

· weighting of the sampling data to gross-up results over the total surveyed population;

· producing of summarized table results.
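
The weighting step can be sketched as follows. This is a simplified, single-stage illustration in Python, assuming that each stratum weight is the population count divided by the number of responding units; the actual SES weights also reflect the second-stage selection of employees, which is not modelled here. Function and field names are hypothetical.

```python
def grossing_up_weights(population, respondents):
    """population[stratum] = N_h (units in the frame),
    respondents[stratum] = r_h (responding units).
    Returns the grossing-up factor N_h / r_h per stratum."""
    weights = {}
    for stratum, n_pop in population.items():
        r = respondents.get(stratum, 0)
        if r == 0:
            raise ValueError(f"no respondents in stratum {stratum!r}")
        weights[stratum] = n_pop / r
    return weights

def estimate_total(records, weights):
    """Weighted (grossed-up) total of 'value' over the responding records."""
    return sum(weights[rec["stratum"]] * rec["value"] for rec in records)
```

Dividing by the number of respondents rather than by the number of sampled units is one simple way to compensate for unit non-response within a stratum.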

The software used for data processing:

· completion and processing of the individual data by respondents - on-line based questionnaire;

· integration of individual files into a database, validation and export of outputs from the database by different dimensions, checks and formatting of the macro data to be sent to Eurostat – MS Access;

· processing of the integrated national database, table outputs and analyses – SPSS.

Adjustment

Not applicable.

Quality management

Quality assurance

According to Article 2, Para 3 of the Bulgarian Law on Statistics statistical information shall be produced in compliance with the following criteria for quality: adequacy, accuracy, timeliness, punctuality, accessibility and clarity, comparability and logical consistency.

According to Art. 10 of Council Regulation 530/1999, the national authorities shall ensure that the results reflect the true situation of the total population of units with a sufficient degree of representativity. The national authorities submit to Eurostat, at its request, a report after each reference period to enable the quality of the statistics to be evaluated.

Quality assessment

According to Commission Regulation 698/2006, having regard to Council Regulation (EC) No 530/1999, each Member State shall prepare a quality report for evaluating the quality of structure of earnings statistics no later than 24 months after the end of the reference period.

Relevance

User needs

The user groups are defined on the base of the data requests received by NSI. The customers of the SES results can be classified as follows:

· National institutions - Ministries, Agencies, Councils, other governmental bodies and public;

· International institutions - Eurostat, ILO, OECD, UNICEF, UNECE;

· Social partners: Trade unions and employment associations

· Private institutions and businesses, incl. media

· Researchers and students

Internal to NSI: other units of NSI, e.g. dealing with LFS, Classifications, National Accounts, Household Budgets etc. for purpose of comparison or other.

User satisfaction

NSI has not carried out a specific user survey to establish users' information needs concerning SES or their satisfaction with the published results. Users usually prefer more detailed data at the lowest levels of the classifications applied in the survey, which is problematic because of the limited number of observations and the correspondingly lower precision.

Completeness

The survey covers all mandatory variables according to the Commission Regulation (EC) No 1738/2005. There is full coverage as well in terms of size of the enterprises (with 1+ employees) and of economic activities (NACE.BG 2008 sections B - S, including O).

Data completeness - rate

The survey covers all (100%) mandatory variables according to the Commission Regulation (EC) No 1738/2005 and 5 out of 10 optional variables.

Accuracy and reliability

Overall accuracy

The overall accuracy of the survey results depends on:

· number of the surveyed units

To achieve the desired accuracy of the survey results, a sampling plan of employees and local units is drawn up. First, the total number of persons to be observed is calculated. The calculations are made at a 95% confidence level so that the maximum error of the estimate lies within a preset interval. The resulting number of persons is distributed proportionally to the population by three stratification criteria: the size of the local unit, its economic activity and its territory (location). Based on the parameters of the population and the proportionally distributed number of employees to be observed, the number of local units to be selected from each cell is calculated.

· the survey framework

To construct the framework population from which the sample is selected, data on the local units and their number of employees from the comprehensive Annual survey of employees, hours worked, wages and salaries and other labor costs for 2017 were used. For the grossing-up of the sample data, the parameters of the population are updated with information for 2018, which becomes available approximately 12 months after the reference period.

· survey tools

The survey questionnaire is developed in paper and electronic format. The electronic questionnaire is online-based, with built-in logical controls that allow data to be validated as they are entered.

· methods of identifying and addressing possible errors

Approximately 100 checks are applied to verify the completeness of responses, data plausibility, and arithmetic and logical consistency between the collected variables. Data editing is done by: referring back to the persons who filled in the information; using information from administrative sources (Personal register of insured persons of the National Social Security Institute); using other statistical surveys containing information about the surveyed units; and applying statistical methods and techniques for estimating missing values (mean value imputation, most frequent value imputation, etc.).
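
The two imputation techniques named above, mean value imputation and most frequent value imputation, can be sketched in Python as follows. The record layout and the function name are hypothetical, and in practice imputation would normally be carried out within homogeneous groups rather than over the whole data set.

```python
from statistics import mean, mode

def impute(records, field, method):
    """Fill missing (None) values of `field` with the mean or the most
    frequent observed value. Returns new records; the input is not modified."""
    observed = [r[field] for r in records if r.get(field) is not None]
    if not observed:
        raise ValueError(f"no observed values for {field!r}")
    fill = mean(observed) if method == "mean" else mode(observed)
    return [
        dict(r, **{field: fill}) if r.get(field) is None else dict(r)
        for r in records
    ]
```

Mean imputation suits numeric variables such as days of leave, while the most frequent value is the natural choice for categorical variables such as the type of contract.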

Sampling error

Coefficients of variation (relative standard errors) are calculated using the Horvitz-Thompson estimator. Coefficients of variation (CVs) are low for most of the relevant items and important classification levels.
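
As an illustration of how such a coefficient of variation can be obtained, the Python sketch below computes the Horvitz-Thompson estimate of a mean and its CV under a simplifying assumption: a single-stage stratified simple random sample without replacement. The actual survey design has two stages, so this is not the formula used by NSI; the function name and input layout are hypothetical.

```python
from math import sqrt
from statistics import mean, variance

def cv_of_mean(strata):
    """strata: list of (N_h, sample_values) with at least 2 values per stratum.
    Returns (estimated mean, coefficient of variation in %)."""
    total_pop = sum(n_pop for n_pop, _ in strata)
    est_total = 0.0
    var_total = 0.0
    for n_pop, values in strata:
        n = len(values)
        est_total += n_pop * mean(values)             # N_h * sample mean
        fpc = 1 - n / n_pop                           # finite population correction
        var_total += n_pop ** 2 * fpc * variance(values) / n
    est_mean = est_total / total_pop
    cv = 100 * sqrt(var_total) / est_total            # same CV for total and mean
    return est_mean, cv
```

Because the population size is treated as known, the CV of the estimated mean equals the CV of the estimated total.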

Coefficients of variations by working time schedule and gender - %

	Gross monthly earnings	Gross hourly earnings
Total population	0.21	0.20
Full-time employees	0.21	0.21
Male	0.33	0.33
Female	0.24	0.24
Part-time	0.76	0.78
Male	1.08	1.07
Female	1.06	1.10

The highest CVs appeared for small heterogeneous populations with low sampling probability and in cases of high unit non-response rate (small number of observations).

The following criteria were agreed for data publishing:

· cells with earnings (monthly/hourly/annual) showing a CV between 20 and 30% were put in parentheses;

· cells with earnings (monthly/hourly/annual) showing a CV higher than 30% were hidden (deleted) and marked with a forward slash '/';

· cells with a number of observations between 4 and 9 were put in parentheses;

· cells with a number of observations between 1 and 3 were hidden (deleted) and marked with a forward slash '/'.
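
The publication criteria above can be expressed as a small Python function. The function name is hypothetical, and two details are assumptions: the 20% and 30% boundaries are treated as inclusive for the parentheses rule, and a cell with no observations is suppressed like a cell with 1 to 3 observations.

```python
def publish_cell(value, cv_percent, n_obs):
    """Return the cell as it would be shown in a published table."""
    if n_obs <= 3 or cv_percent > 30:
        return "/"                      # hidden: too few observations or CV > 30%
    if n_obs <= 9 or cv_percent >= 20:
        return f"({value})"             # low reliability: shown in parentheses
    return str(value)                   # published without a flag
```

For example, `publish_cell(1234, 25.0, 50)` gives `(1234)` and `publish_cell(1234, 5.0, 2)` gives `/`.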

Extreme values were removed from the dataset and the grossing-up factors were recalculated. Records with extreme values of hourly earnings, 0.09% of all records, were excluded from the calculations.

Sampling errors - indicators

Coefficients of variations by NACE Rev. 2 sections -%

Economic activity (NACE Rev. 2)	Gross monthly earnings	Gross hourly earnings
B Mining and quarrying	0.77	0.77
C Manufacturing	0.28	0.28
D Electricity, gas, steam and air conditioning supply	1.05	1.12
E Water supply, sewerage, waste management and remediation activities	0.90	0.92
F Construction	0.82	0.79
G Wholesale and retail trade; repair of motor vehicles and motorcycles	0.56	0.54
H Transportation and storage	0.87	0.83
I Accommodation and food service activities	0.71	0.63
J Information and communication	0.79	0.78
K Financial and insurance activities	1.05	1.04
L Real estate activities	2.26	2.27
M Professional, scientific and technical activities	1.13	1.09
N Administrative and support service activities	0.85	0.89
O Public administration and defence; compulsory social security	0.62	0.61
P Education	0.41	0.40
Q Human health and social work activities	0.70	0.71
R Arts, entertainment and recreation	2.15	2.02
S Other service activities	1.27	1.08

Coefficients of variations by occupations (ISCO-08, 1-digit level) -%

Occupations (ISCO-08)	Gross monthly earnings	Gross hourly earnings
Managers	0.80	0.77
Professionals	0.38	0.37
Technicians and associate professionals	0.51	0.51
Clerical support workers	0.41	0.38
Service and sales workers	0.26	0.24
Skilled agricultural, forestry and fishery workers	4.82	4.18
Craft and related trades workers	0.36	0.35
Plant and machine operators, and assemblers	0.32	0.31
Elementary occupations	0.29	0.26

Coefficients of variations by age groups -%

Age groups	Gross monthly earnings	Gross hourly earnings
< 20 years	1.53	1.23
20 - 29 years	0.48	0.46
30 - 39 years	0.43	0.42
40 - 49 years	0.39	0.38
50 - 59 years	0.37	0.37
60 years and over	0.54	0.50

Non-sampling error

Non-sampling errors are described in detail in the Quality Report, which assesses:

· over-coverage - the percentage of units covered in the survey that are out of scope of the target population;

· measurement errors - from the wrong interpretation of the survey questionnaire and respondents’ errors when not complying with explanatory notes for completion of the questionnaire - measured through the percentage of the wrong cases identified by the applied arithmetic and logic controls.

· the relative share of non-response among sampled units, which is 7.3%. Non-responding units fall into two basic groups by reason: (a) lack of an up-to-date framework at the time of sample selection (restructured or closed units, units without activity during the reference period, lack of contact); (b) units that refused to respond. To neutralize the errors resulting from non-response by some of the units in the sample, the weights are recalculated using the number of responding units.

· percentage of non-responses of individual variables or individual employees in the observation unit;

· percentage of corrected cases of key variables.

Coverage error

The sample of local units was taken from the local units’ population as of 31.12.2021. The sampling frame represented the most current situation of the Business Register available at the time of the sampling. In the sampling frame population were included all local units with 1 or more employees that belonged to enterprises with 1 or more employees within the NACE Rev. 2 sections B to S, including O.

The under-coverage refers to the situation when newly emerged or units with renewed activity with 1 or more employees within NACE sections B to S were not included in the sampling frame. The under-coverage was not quantified. To offset the errors that might arise from under-coverage and for purposes of the weighting procedure the framework population was updated where appropriate with the most recent situation of Business Register in 2022 to reflect major changes and fluctuations between NACE divisions and size classes of enterprises.

Over-coverage refers to sampled local units that, during the reference period, had already closed down, were dormant or had no employees. The overall over-coverage rate is 5.9%. Where over-coverage occurred, no new units were sampled as replacements.

Over-coverage - rate

NACE Rev.2 divisions	Number of local units in the frame	Number of local units in the sample	Number of local units out of scope	Over-coverage rate in the sample - %
05	24	19	0	0.0
06	3	3	0	0.0
07	20	17	0	0.0
08	198	26	0	0.0
09	23	14	0	0.0
10	4295	435	16	3.7
11	378	44	0	0.0
12	13	12	0	0.0
13	456	51	1	2.0
14	3416	377	19	5.0
15	399	51	4	7.8
16	1320	114	3	2.6
17	384	46	1	2.2
18	651	63	1	1.6
19	8	8	0	0.0
20	465	58	2	3.4
21	47	30	2	6.7
22	1305	141	2	1.4
23	944	101	1	1.0
24	145	37	0	0.0
25	2351	240	13	5.4
26	277	41	1	2.4
27	389	61	1	1.6
28	774	100	2	2.0
29	90	40	0	0.0
30	48	26	1	3.8
31	1613	150	5	3.3
32	836	80	3	3.8
33	1320	120	9	7.5
35	695	79	5	6.3
36	212	43	0	0.0
37	39	13	0	0.0
38	542	70	2	2.9
39	51	21	0	0.0
41	4166	392	29	7.4
42	1175	141	3	2.1
43	6013	503	33	6.6
45	7826	641	46	7.2
46	19486	1651	103	6.2
47	46271	3753	283	7.5
49	11813	1004	49	4.9
50	39	16	1	6.3
51	37	16	1	6.3
52	1470	152	5	3.3
53	848	88	2	2.3
55	2660	257	18	7.0
56	13262	1088	117	10.8
58	522	49	1	2.0
59	419	38	5	13.2
60	145	29	1	3.4
61	449	50	1	2.0
62	3483	321	23	7.2
63	824	82	5	6.1
64	1502	156	4	2.6
65	50	19	0	0.0
66	1198	105	9	8.6
68	5649	459	34	7.4
69	7855	628	43	6.8
70	1970	166	20	12.0
71	3381	273	14	5.1
72	379	53	1	1.9
73	1605	136	8	5.9
74	3041	249	22	8.8
75	272	29	1	3.4
77	765	71	6	8.5
78	380	46	3	6.5
79	1001	85	2	2.4
80	1089	144	5	3.5
81	1566	161	6	3.7
82	1048	103	7	6.8
84	1593	292	0	0.0
85	6296	714	11	1.5
86	7841	737	20	2.7
87	492	58	1	1.7
88	742	94	0	0.0
90	453	54	2	3.7
91	409	39	0	0.0
92	853	83	3	3.6
93	2398	199	17	8.5
94	5041	411	15	3.6
95	1030	90	9	10.0
96	6622	526	56	10.6
Total	211160	19182	1139	5.9
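
The rate in the last column is the number of out-of-scope local units divided by the number of sampled local units. A short Python check, using three rows taken from the table above, reproduces the published figures; the function name is only illustrative.

```python
def over_coverage_rate(sampled, out_of_scope):
    """Over-coverage rate in the sample, in percent, rounded to one decimal."""
    return round(100 * out_of_scope / sampled, 1)

# (sampled local units, out-of-scope local units) from the table above
rows = {"10": (435, 16), "56": (1088, 117), "Total": (19182, 1139)}
for division, (sampled, out_of_scope) in rows.items():
    print(division, over_coverage_rate(sampled, out_of_scope))
```

This prints 3.7 for division 10, 10.8 for division 56 and 5.9 for the total, matching the table.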

Common units - proportion

Not applicable.

Measurement error

To avoid measurement errors detailed explanatory notes with illustrative examples were attached to the questionnaire. To further help the respondents a list with contact information was posted on Internet and telephone consultations on methodological and technical issues were provided. The Regional Offices were also provided by the Head office of NSI with written and telephone guidance how to process data and deal with arising problems.

Main sources of measurement and processing errors are:

• the way questions are asked in the survey questionnaire. For example, for annual days of holiday leave, some respondents provided the number of days actually taken rather than the total number of days due.

• respondents keep data differently and do not make further efforts to comply with statistical requirements, or do not read or understand the explanatory notes. An example of such errors is var. 3.2, which is among the most corrected items because, instead of the number of hours paid during the representative month, respondents provided: paid days during the month; paid hours during the year; working hours per day; paid hours excluding paid overtime hours (when available).

• data entry errors - these errors made up a very low proportion compared with the first two types.

Non response error

The overall unit response rate for in-scope respondents with 1 or more employees is 95.0%, and for the mandatory size class of enterprises with 10 or more employees it is 96.4%. The lowest unit response rate, 94.4%, is for small units with 1 to 9 employees. The lower response rate for enterprises with 1 to 9 employees can be explained by their dynamic nature: frequent structural changes and instability as regards location, economic activity, financial status and employment make it difficult to keep the business register up to date. The main reasons for non-response reported by the regional offices of NSI are “not found (out of date contact information)”, “closed down/sleeping” and “no employees in the reference period” rather than explicit refusals. Regional offices reported that nearly 40% of respondents were reminded of their duty to reply by phone calls, e-mail and follow-up letters. In the official period of data collection (May - June 2019) only 60% of responses were received. To improve the response rate, the deadline was extended by two months. Reminders were sent to the non-responding units, with special attention paid to cells (NUTSxNACExSize) with low response rates.

Unit non-response - rate

The table below presents two types of unit response rates: the first is calculated relative to the total number of sampled units, and the second relative to the total number of in-scope respondents (enterprises with one or more employees with earnings in October 2018). Rates are broken down by divisions of NACE Rev. 2 and by size classes - 1 or more employees, 1 to 9 employees (optional), 10 or more employees (mandatory).

NACE Rev.2	Response rate - % of total sample			Response rate - % of in-scope units
NACE Rev.2	1+	1_9	10+	1+	1_9	10+
05	94.7	83.3	100.0	94.7	83.3	100.0
06	100.0	-	100.0	100.0	-	100.0
07	94.1	100.0	92.3	94.1	100.0	92.3
08	88.5	77.8	94.1	88.5	77.8	94.1
09	92.9	87.5	100.0	92.9	87.5	100.0
10	93.3	89.5	97.6	96.9	96.2	97.6
11	95.5	100.0	92.9	95.5	100.0	92.9
12	100.0	100.0	100.0	100.0	100.0	100.0
13	94.1	95.5	93.1	96.0	100.0	93.1
14	91.8	86.4	95.8	96.6	95.2	97.6
15	88.2	70.6	97.1	95.7	85.7	100.0
16	93.0	92.9	93.2	95.5	97.0	93.2
17	93.5	94.1	93.1	95.6	100.0	93.1
18	96.8	94.3	100.0	98.4	97.1	100.0
19	100.0	-	100.0	100.0	-	100.0
20	89.7	78.3	97.1	92.9	85.7	97.1
21	93.3	75.0	100.0	100.0	100.0	100.0
22	96.5	94.0	98.6	97.8	96.9	98.6
23	96.0	93.0	98.3	97.0	95.2	98.3
24	100.0	100.0	100.0	100.0	100.0	100.0
25	93.8	88.5	99.2	99.1	98.2	100.0
26	97.6	90.9	100.0	100.0	100.0	100.0
27	96.7	93.3	97.8	98.3	100.0	97.8
28	97.0	93.8	98.5	99.0	100.0	98.5
29	85.0	88.9	83.9	85.0	88.9	83.9
30	92.3	77.8	100.0	96.0	87.5	100.0
31	93.3	90.8	98.1	96.6	94.7	100.0
32	96.3	94.9	100.0	100.0	100.0	100.0
33	90.0	86.6	97.4	97.3	97.3	97.4
35	93.7	86.8	100.0	100.0	100.0	100.0
36	100.0	100.0	100.0	100.0	100.0	100.0
37	84.6	87.5	80.0	84.6	87.5	80.0
38	94.3	85.0	98.0	97.1	94.4	98.0
39	95.2	87.5	100.0	95.2	87.5	100.0
41	80.9	74.0	88.0	87.3	84.1	90.4
42	91.5	84.8	94.7	93.5	88.6	95.7
43	86.5	84.2	93.5	92.6	91.1	96.7
45	88.6	87.8	93.3	95.5	95.7	94.4
46	90.1	88.0	96.3	96.1	95.6	97.4
47	88.8	88.5	91.5	96.0	96.4	92.9
49	90.0	87.9	97.7	94.7	93.8	97.7
50	81.3	71.4	88.9	86.7	83.3	88.9
51	87.5	100.0	81.8	93.3	100.0	90.0
52	86.8	79.3	95.7	89.8	83.3	97.1
53	90.9	85.7	95.7	93.0	90.0	95.7
55	87.2	83.0	93.9	93.7	93.0	94.8
56	84.0	84.4	82.4	94.1	95.2	89.6
58	93.9	91.2	100.0	95.8	93.9	100.0
59	73.7	70.0	87.5	84.8	84.0	87.5
60	96.6	87.5	100.0	100.0	100.0	100.0
61	94.0	91.7	96.2	95.9	95.7	96.2
62	88.2	83.7	96.5	95.0	94.1	96.5
63	90.2	88.7	93.1	96.1	97.9	93.1
64	92.3	84.2	100.0	94.7	88.9	100.0
65	94.7	80.0	100.0	94.7	80.0	100.0
66	91.4	88.9	100.0	100.0	100.0	100.0
68	84.7	83.9	90.9	91.5	91.4	92.6
69	86.0	86.0	86.7	92.3	92.6	86.7
70	76.5	73.7	89.7	87.0	85.6	92.9
71	85.0	84.5	88.2	89.6	89.8	88.2
72	98.1	93.3	100.0	100.0	100.0	100.0
73	76.5	73.9	88.0	81.3	79.6	88.0
74	81.9	80.8	90.0	89.9	89.8	90.0
75	93.1	90.9	100.0	96.4	95.2	100.0
77	88.7	87.0	94.1	96.9	97.9	94.1
78	76.1	70.0	80.8	81.4	82.4	80.8
79	87.1	84.3	100.0	89.2	86.8	100.0
80	84.7	75.0	89.0	87.8	82.5	89.9
81	85.7	78.3	95.7	89.0	83.7	95.7
82	88.3	85.3	94.3	94.8	93.5	97.1
84	99.7	100.0	99.6	99.7	100.0	99.6
85	97.3	91.3	99.6	98.9	96.2	99.8
86	95.5	93.8	100.0	98.2	97.4	100.0
87	96.6	92.3	97.8	98.2	100.0	97.8
88	97.9	96.2	98.5	97.9	96.2	98.5
90	94.4	89.7	100.0	98.1	96.3	100.0
91	100.0	100.0	100.0	100.0	100.0	100.0
92	92.8	90.7	95.0	96.3	97.5	95.0
93	86.4	85.7	90.3	94.5	95.4	90.3
94	93.2	92.8	97.3	96.7	96.7	97.3
95	84.4	82.7	100.0	93.8	93.1	100.0
96	83.5	83.0	92.6	93.4	93.5	92.6
Total	89.4	86.9	95.5	95.0	94.4	96.4
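
The relationship between the two response rates can be sketched in Python as follows. The sketch assumes that in-scope units are the sampled units minus the out-of-scope units from the over-coverage table. The respondent count used in the example (406 for division 10) is not published in this report; it is back-calculated from the 93.3% rate and serves only as an illustration.

```python
def response_rates(sampled, out_of_scope, respondents):
    """Return (rate as % of total sample, rate as % of in-scope units)."""
    of_sample = 100 * respondents / sampled
    of_in_scope = 100 * respondents / (sampled - out_of_scope)
    return round(of_sample, 1), round(of_in_scope, 1)

# Division 10: 435 sampled units, 16 out of scope, 406 respondents (assumed).
print(response_rates(435, 16, 406))
```

With these inputs the function returns 93.3 and 96.9, which agrees with the row for division 10 in the table above.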

Item non-response - rate

Normally, no item non-response (blank or zero values) has been accepted for any of the key variables. Only eight[1] of the collected items could possibly be zero and therefore item non-response could be supposed.

Processing error

Quality at regional level was evaluated by means of a questionnaire covering a number of issues. As regards measurement and processing errors, the Regional Offices were asked which variables were most often corrected because they were wrong or missing. The following table lists the variables that the 28 Regional offices (ROs) of NSI reported as most problematic.

SES2014 variables most often corrected by the 28 Regional offices of NSI

Variable number according to Reg.1738/2005	Variable label	% of ROs that reported variables as problematic
2.3	Occupation (ISCO08)	42%
3.1	Number of weeks to which the gross annual earnings relate	74%
3.2	Number of hours paid during the representative month	89%
3.3	Annual days of holiday leave	63%

In addition ROs reported that approximately 20% of responded units were contacted for reference on completeness, compliance and consistency of the data.

Methods applied for correction of data that were identified as wrong (inconsistent, impossible values, missing values, not corresponding to definition, wrong format) differ depending on the type, seriousness of error and willingness of respondents to cooperate:

logical correction - applied when required information is available but format is wrong or the error is obvious - suitable correction was performed, e.g. format of data of entry into enterprise is not correct, overtime hours are not included in total hours paid in reference month, etc.;
reference to respondent - when problem is more complex inquiry for validity of data was undertaken, asking for confirmation of the nature of the error or for new delivery of given variables;
reference to other statistical sources – it is used to validate information received by surveyed units or when not possible to contact respondents, or respondents refuse to give further information. Possible sources of supporting information are Quarterly Survey on Labour and Annual Survey on Labour that provide data on total number of employees, existence of irregular bonuses, payments in kind, overtime earnings, distribution by occupations, distribution by sex, etc.
reference to administrative sources – it is used to validate information received by surveyed units or when not possible to contact respondents, or respondents refuse to give further information. In such cases the Register of socially insured persons in 2018 was employed. This source contains many of the key SES variables like sex, age, hourly, monthly and annual earnings, working time, paid hours, paid periods during the year.
deletion of the out of scope records (local units with no employees with earnings in reference month, employees without earnings in October, unreliable or many missing data).

Imputation - rate

The following table presents the number of imputed cases and the rate of imputed item non-response for the variables that allow blank or zero values.

Item imputation rates

Variable number according to Reg.1738/ 2005	Variable label	Number of imputed cases	Rate of imputed item non-response - %
3.2.1	Number of overtime hours paid in the representative month	11	0.01
3.3	Annual days of holiday leave	478	0.23
4.1.1	Total Annual Bonuses	1055	0.50
4.1.2	Annual payments in kind	200	0.09
4.2.1	Earnings related to overtime	11	0.01
4.2.2	Special payments for shift work	49	0.02
	Irregular bonuses paid in reference month	793	0.38
	Payments in kind paid in reference month	162	0.08

In addition, 1574 employees’ records (0.71% of all cases) were imputed for local units that provided data for significantly fewer employees than required or whose data were completely unreliable (e.g. one record duplicated a number of times). The Register of insured persons, which contains many of the key variables, was used as the main data donor.

Model assumption error

Not applicable

Seasonal adjustment

Not applicable.

Data revision - policy

Not applicable.

Data revision - practice

Not applicable.

Data revision - average size

Not applicable.

Timeliness and punctuality

Timeliness

Publication of survey results at national level: 15 July 2024

Submission of micro data to Eurostat: 1 July 2024

Issue of paper and electronic publication with detailed survey results: January 2025

Time lag - first results

18 months

Time lag - final results

20 months

Punctuality

Micro data were sent to Eurostat on 1 July 2024.
The reasons for the delays from the scheduled deadlines are:
• difficulty to collect data from respondents and to achieve a good response rate;
• lack of time and human resources.

Punctuality - delivery and publication

1 day delay

Coherence and comparability

Comparability - geographical

The national and regional data (level 1 - statistical zones) conform to the Classification of Territorial Units for Statistical Purposes currently in force, which applies the European classification NUTS.

Asymmetry for mirror flows statistics - coefficient

Not applicable

Comparability - over time

The comparability over time is influenced mainly by changes in definitions and classifications as result of amendments of Community legislation as well as by change in coverage of enterprises.

Characteristic	2002	2006	2010	2014-2022
Coverage in terms of size of the enterprise	10+ employees	1+ employees	1+ employees	1+ employees
Definition of gross annual earnings	excluding annual payments in kind	including annual payments in kind *	including annual payments in kind	including annual payments in kind
Classification of economic activities	NCEA-2001 (in compliance with NACE Rev.1)	NCEA-2003 (in compliance with NACE Rev.1.1)	CEA-2008 (in compliance with NACE Rev.2)	CEA-2008 (in compliance with NACE Rev.2)
Classification of occupations	NCO-1996 (in compliance with ISCO-88 COM)	NCOD-2005 (in compliance with ISCO-88 COM)	NCOD-2005, NCOD-2011 (in compliance with ISCO-08)	NCOD-2011 (in compliance with ISCO-08)
Classification of territorial units for statistical purposes	NUTS1 - corresponds to national level	NUTS1 - Bulgaria is divided into two statistical zones (BG3, BG4)*	NUTS1 - Bulgaria is divided into two statistical zones (BG3, BG4)*	NUTS1 - Bulgaria is divided into two statistical zones (BG3, BG4)*

Length of comparable time series

2006, 2010, 2014, 2018, 2022

Coherence - cross domain

There are other statistical sources that produce information on number of employees, earnings and working time:

Quarterly Survey on Number of Employees, Time Worked, Wages and Salaries and Other Labour Costs – number of employees; wages and salaries; worked days and worked hours;
Annual Survey on Employed Persons, Wages and Salaries and Other Labour Costs - number of employees; wages and salaries; worked days and worked hours;
Labour Force Survey - number of employees; worked hours;
National Accounts - number of employees; worked hours.

The results of the 2022 Structure of Earnings Survey are only relatively comparable with those from the above-cited surveys because of the methodological characteristics of each information source with regard to: goals, unit of observation, definitions, coverage of economic activities, and the statistical methods used for collection and estimation of the variables surveyed.

Coherence - sub annual and annual statistics

Not applicable

Coherence - National Accounts

Annual earnings from the SES for the total of sections B to S, and for almost all individual NACE sections, are lower than the corresponding values of wages and salaries from National Accounts (NA); section C is the exception. Although the definitions of the compared variables are similar, the two sources have many methodological and conceptual differences that explain the disparities in earnings levels.

Among the main reasons are:

• inclusion in 'Wages and salaries' of some kinds of imputed social contributions, such as guaranteed remuneration in the event of sickness, which is paid by the employer at a reduced rate (70%), and compensation paid to dismissed workers (severance pay and compensation in lieu of notice);

• coverage by NA of employees working under non-labour (civil) contracts and their earnings in the form of fees and commissions;

• inclusion by NA of more types of wages and salaries in kind than in the SES: part of the daily allowances for business travel; provision of recreation or holiday facilities for employees and their families;

• adjustments made by NA for exhaustiveness on the following components: tips, which are estimated for activities such as restaurants, bars, transport and other service activities (hairdressing and other beauty treatment); non-reported wages of employees working without any contract (informal employment).

Coherence - internal

Indicators within the data set are internally coherent.

Accessibility and clarity

News release

None.

Publications

On-line database

Detailed results are available to all users of the NSI website under the heading Labour Market - Structural (four yearly) statistics on earnings and labour costs - Structure of Earnings - national level, 4-year periodicity: http://www.nsi.bg/en/node/6520

Information System INFOSTAT: https://infostat.nsi.bg/infostat/pages/module.jsf?x_2=95

Data tables - consultations

Micro-data access

Access to the anonymised micro data is granted according to the Rules for granting access to anonymised micro-data for scientific and research purposes set by NSI.

Other

Not applicable.

Metadata - consultations

200 to 400 in the data collection period

Documentation on methodology

Metadata completeness – rate

100%

Quality documentation

A quality report is prepared according to the requirements of Commission Regulation (EC) No 698/2006.

Cost and burden

No survey on respondents’ burden is carried out.

Confidentiality

Confidentiality - policy

· Law on Statistics (Statistics Act);

· Regulation (EC) No 223/2009 of 11 March 2009 on European statistics (recital 24 and Article 20(4)) (OJ L 87, p. 164), which stipulates the need to establish common principles and guidelines ensuring the confidentiality of data used for the production of European statistics and the access to those confidential data, with due account for technical developments and the requirements of users in a democratic society.

Confidentiality – data treatment

In accordance with Art. 25 of the Statistics Act, individual data are not published. Individual data are disseminated only in accordance with Art. 26 of the Statistics Act.

Comment

Download in SDMX 2.1 file format: Structure of earnings survey - average hourly earnings, average monthly earnings, average annual earnings, employees, paid hours, days of annual paid leave - national and regional (statistical zones) levels

Metadata Structure Definition in SDMX 2.1: ESQRS_MSD+BNSI+2.0+SDMX.2.1.xml

Download in SDMX 2.0 file format: Structure of earnings survey - average hourly earnings, average monthly earnings, average annual earnings, employees, paid hours, days of annual paid leave - national and regional (statistical zones) levels

Metadata Structure Definition in SDMX 2.0: ESQRS_MSD+BNSI+2.0+SDMX.2.0.xml