View of A ratio-cum-product estimator of population mean in stratified random sampling using two auxiliary variables

(1)

A RATIO-CUM-PRODUCT ESTIMATOR OF POPULATION MEAN IN STRATIFIED RANDOM SAMPLING USING TWO AUXILIARY VARIABLES

R. Tailor, S. Chouhan, R. Tailor, N. Garg

1. INTRODUCTION

A country or state frequently requires estimates of agricultural production to assess status of grain and to make future policies regarding export and import of grains according to need. It requires the estimates of total production, average production and per hectare production of any crop which corresponds to the problem of estimation of population total, population mean and ratio of two population means respectively. This paper discusses the problem of estimation of finite population mean using information on two auxiliary variates. Auxiliary in-formation is often used by researchers in order to improve the efficiencies of es-timators. Cochran (1940) used auxiliary information at estimation stage and en-visaged ratio method of estimation that provides ratio estimator. Ratio estimator has higher efficiency when study variate and auxiliary variates are positively corre-lated. Robson (1957) developed product method of estimation that provides product estimator. When study variate and auxiliary variate are negatively corre-lated, product estimator gives higher efficiency in comparison to simple mean es-timator provided correlation coefficient between study variate and auxiliary vari-ate is grevari-ater than half of the ratio of coefficient of variation of auxiliary varivari-ate and coefficient of variation of study variate. In both, ratio and product methods of estimation, population mean of the auxiliary variate is assumed to be known. Singh (1967) utilized information on two auxiliary variates, one is positively corre-lated and another is negatively correcorre-lated with the study variate and suggested ra-tio-cum-product estimator of population mean in simple random sampling. Later many authors proposed various ratio and product type estimators in simple ran-dom sampling, for instance see Sisodia and Dwivedi (1981), Pandey and Dubey (1988), Upadhyaya and Singh (1999), Singh and Tailor (2003), Singh et al. (2004), Singh and Tailor (2005), Kadilar and Cingi (2006), Singh et al. (2009), Singh et al. (2011), etc.. Hansen et al. (1946) defined combined ratio estimator using auxiliary information in stratified random sampling. Many authors including Kadilar and Cingi (2003, 2005, 2006) and Singh et al. (2008) worked out ratio type estimators in stratified random sampling.

(2)

Simple random sampling technique has some shortcomings like less represen-tative of different sections of the population, administrative inconvenience and less efficiency in case of heterogeneous population. Literature reveals that ratio-cum-product estimator performs better than ratio and product type estimators in simple random sampling under certain conditions.

This motivates authors to work out Singh (1967) ratio-cum-product estimator in stratified random sampling and study its properties.

Consider a finite population U{ ,U U1 2,...,UN} of size N and it is di- vided into L strata of size N hh( 1, 2,..., ).L Let Y be the study variate and

X and Z be two auxiliary variates taking values yhi,x and hi z hi (h1, 2,..., ;L i1, 2,...,Nh) on i unit of the th h stratum. A sample of size th n h is drawn from each stratum which constitutes a sample of size

1 L h h n n  



and we define: 1 1 Nh h hi i h Y y N 





:h stratum mean for the study variate Y , th

1 1 Nh h hi i h X x N 





: h stratum mean for the auxiliary variate X , th

1 1 Nh h hi i h Z z N 





: h stratum mean for the auxiliary variate Z , th

1 1 1 1 1 L Nh 1 L L hi h h h h h i h h Y y N Y W Y N   N  













: population mean of the study vari-

ate Y , 1 1 1 1 L Nh 1 L hi h h h i h X x W X N   N 









: population mean of the auxiliary variate X ,

1 1 1 1 L Nh 1 L hi h h h i h Z z W Z N   N 









: population mean of the auxiliary variate Z ,

1 1 nh h hi i h y y n 





: sample mean of the study variate Y for _{h stratum,}th

1 1 nh h hi i h x x n 

(3)

1 1 nh h hi i h z z n 





: sample mean of the auxiliary variate Z for h stratum, th

h h

N W

N

 : stratum weight of _{h stratum.}th

Usual unbiased estimators of population means Y X and Z in stratified , random sampling are defined respectively as

1 L st h h h y W y  



, (1) 1 L st h h h x W x  



, (2) 1 . L st h h h z W z  



(3)

In the line of Cochran (1940) ratio estimator, Hansen et al. (1946) utilized known value of population mean X of auxiliary variate X and defined com-bined ratio estimator for population mean Y as

ˆ RC st st X Y y x    _ _  . (4)

Here it is assumed that the study variate Y and the auxiliary variate X are positively correlated.

When the study variate Y and the auxiliary variate Z are negatively correlated, assuming that the population mean Z of auxiliary variate Z is known, combined product estimator is defined as

ˆ st PC st z Y y Z    _ _  . (5)

The bias and mean squared error of ˆY and ˆRC Y , up to the first degree of PC approximation, are obtained as

2 2 1 1 1 ˆ ( _RC) L _h _h( _xh _yxh) h B Y W R S S X   



 , (6)

(4)

2 1 1 ˆ ( _PC) L _h _h _yzh h B Y W S Z   



, (7) 2 2 2 2 1 1 1 ˆ ( _RC) L _h _h( _yh _xh 2 _yxh) h MSE Y W  S R S R S  



  , (8) 2 2 2 2 2 2 1 ˆ ( _PC) L _h _h( _yh _zh 2 _yzh) h MSE Y W  S R S R S  



  (9) where 2 2 1 1 ( ) 1 h N yh hi h i h S y Y N    



, 2 2 1 1 ( ) 1 h N xh hi h i h S x X N    



, 2 2 1 1 ₍ ₎ 1 h N zh hi h i h S z Z N    



, ₁ 1 ₍ ₎₍ ₎ 1 h N yxh hi h hi h i h S y Y x X N     



, 1 1 ( )( ) 1 h N yzh hi h hi h i h S y Y z Z N     



, ₁ 1 ( )( ) 1 h N xzh hi h hi h i h S x X z Z N     



, 1 Y R X  , R₂ Y Z  and _h 1 1 h h n N  _  _  . 2. PROPOSED ESTIMATOR

Assuming that the population means of auxiliary variates X and Z are known, Singh (1967) proposed a ratio-cum-product estimator for population mean Y as ˆ RP z X Y y x Z     _ _{ }_{ }   (10) where 1 1 n i i y y n  



and 1 1 n i i x x n 





are unbiased estimates of population means Y and X in simple random sampling without replacement.

We propose Singh (1967) ratio-cum- product estimator ˆYRP in stratified ran-dom sampling as

(5)

1 1 1 1 1 ˆ L L h h h h L ST st h h RP st h h L L h st h h h h h h W X W z z X Y y W Y x Z _{W x} _{W Z}             _ _ _ _ _  _ _ _             



. (11)

To compare the efficiency of the proposed estimator in comparison to other es-timators, bias and mean squared error of the proposed estimator are obtained. To obtain the bias and mean squared error expressions of the proposed estimator

ST RP

Yˆ

, we write 0 1 2 (1 ), (1 ) and (1 ) h h h h h h h h h y Y e x X e z Z e such that E e( 0h)E e( )1h E e( 2h) 0 and

2 2 0 ( h) h yh E e  C , 2 2 1 ( )h h xh E e  C , 2 2 2 ( h) h zh E e  C , 0 1 ( h h) h yxh yh xh , E e e   C C E e e( _{0 2}h h) h yzhC Cyh zh 1 2 ( _h _h) _h _xzh _xh _zh E e e   C C .

Expressing (11) in terms of e s , we have _i'

2 1 1 0 1 1 1 1 (1 ) ˆ ₍₁ ₎ (1 ) L L h h h h h L ST h h RP h h h L L h h h h h h h h W X W Z e Y W Y e W X e W Z                   _        



1 0 1 2 ˆST ₍₁ ₎₍₁ ₎₍₁ _{) ,} RP Y Y e e e  (12) where 0 1 0 L h h h h W Y e e Y  



, 1 1 1 L h h h h W X e e X  



and 2 1 2 L h h h h W Z e e Z  



such that 0 1 2 ( ) ( ) ( ) 0 E e E e E e  and 2 2 2 0 2 1 1 ( ) L _h _h _yh h E e W S Y   



, 2 2 2 1 2 1 1 ( ) L _h _{h xh} h E e W S X   



, 2 2 2 2 2 1 1 ( ) L _h _{h zh} h E e W S Z   



,

(6)

2 0 1 1 1 ( ) L _h _h _yxh h E e e W S Y X   



, 0 2 2 1 1 ( ) L _h _h _yzh h E e e W S Y Z   



and 2 1 2 1 1 ( ) L h h xzh h E e e W S X Z   



.

After solving (12) we get the bias and mean squared error of proposed estima-tor upto the first degree of approximation as

2 2 2 1 ˆ ( ST) L xh yxh xzh yzh RP h h h S S S S B Y Y W Y X X Z Y Z X      _    _  



, (13) 2 2 2 2 2 2 1 2 1 2 1 2 1 ˆ ( ST) L { 2( )} . RP h h yh xh zh yxh yzh xzh h MSE Y W  S R S R S R S R S R R S  



     (14) 3. EFFICIENCY COMPARISONS

To see the efficiency of the proposed estimator in comparison to other con-sidered estimators, we compare the mean squared error of the proposed estima-tor with variance or mean squared errors of other estimaestima-tors. Variance of usual unbiased estimator in stratified random sampling yst is

2 2 1 ( st) L h h yh . h V y W  S  



(15)

Comparison of (14) and (15) shows that the proposed estimator ˆST RP

Y would

be more efficient than usual unbiased estimator y if st

2 2 2 2 2 1 2 1 2 1 2 1 { 2( )} 0 L h h xh zh yxh yzh xzh h W  R S R S R S R S R R S      



. (16)

From (8) and (14), it is observed that the proposed estimator ˆYRPST would be more efficient than combined ratio estimator ˆYRC if

2 2 2 2 2 1 1 { 2 ( )} 0 . L h h zh yzh xzh h W  R S R S R S    



(17)

Comparison of (9) and (14) reveals that the proposed estimator ˆST RP

Y would be more efficient than combined product estimator ˆYPC if

(7)

2 2 2 1 1 2 1 { 2 ( )} 0 . L h h xh yxh xzh h W  R S R S R S    



(18)

Expressions (16), (17) and (18) provide the conditions under which the pro-posed estimator ˆYRPST would have less mean squared error in comparison to mean squared error of usual unbiased estimator y , combined ratio estimator ˆst YRC and combined product estimator ˆYPC.

4. EFFICIENCY COMPARISONS IN CASE OF PROPORTIONAL ALLOCATION

When the units from the h stratum are selected according to proportional al-th location i.e. nh Nh then, h .

h

n n

N  N

In case of proportional allocation, variance of unbiased estimator y , mean st squared error of combined ratio estimator ˆYRC, combined product estimator

ˆ PC

Y and ratio-cum-product estimator ˆYRPST are obtained as 2 1 1 1 ( _{st prop}) L _h _yh h V y W S n N    _  _  



, (19) 2 2 2 1 1 1 1 1 ˆ ( _{RC prop}) L _h( _yh _xh 2 _yxh) h MSE Y W S R S R S n N    _  _    



, (20) 2 2 2 2 2 1 1 1 ˆ ( _{PC prop}) L _h ( _yh _zh 2 _yzh) h MSE Y W S R S R S n N    _  _    



, (21) 2 2 2 2 2 1 2 1 1 2 1 2 1 1 ˆ ( ) { 2( )} . L ST RP prop h yh xh zh h yxh yzh xzh MSE Y W S R S R S n N R S R S R R S    _  _       



(22) From (19), (20), (21) and (22), it is observed that in case of proportional alloca-tion, proposed estimator ˆY would be more efficient than RPST

(i) y if st 2 2 2 2 1 2 1 2 1 2 1 { 2( )} 0 L h xh zh yxh yzh xzh h W R S R S R S R S R R S      



, (23)

(8)

(ii) ˆY if RC 2 2 2 2 1 1 { 2 ( )} 0 , L h zh yzh xzh h W R S R S R S    



(24) (iii) ˆY if PC 2 2 1 1 2 1 { 2 ( )} 0 . L h xh yxh xzh h W R S R S R S    



(25)

Expressions (23), (24) and (25) provides the conditions under which proposed estimator ˆST

RP

Y would have less mean squared error in comparison to usual unbi-ased estimator y , combined ratio estimator ˆst Y and combined product estima-RC tor ˆYPC in case of proportional allocation.

5. EMPERICAL STUDY

To see the performance of the proposed estimator empirically in comparison to other estimators, we consider two natural population data sets. Description of the populations are given below.

Population I [Source: Murthy (1967)] Y : Output, X : Fixed capital, Z: Number of workers. 1 n =2 n =3 2 N =5 1 N =5 2 1 Y = 1925.80 Y =315.60 2 X =214.40 1 X =333.80 2 1 Z =51.80 Z =60.60 2 Sy1=615.92 Sy2=340.38 1 x S =74.87 Sx₂=66.35 Sz1=0.75 Sz2=4.84 1 yx S =39360.68 Syx₂=22356.50 Syz1=411.16 Syz2=1536.24 N=10 n=5 1 zx S =38.08 Szx₂=287.92

Population II [Source: National Horticulture Board (2010)] Y : Productivity (MT/Hectare),

X : Production in ‘000 Tons, Z: Area in ‘000 Hectare.

(9)

1 n =3 n =4 2 N =10 1 N =10 2 1 Y = 1.70 Y =3.67 2 X =10.41 1 X =309.14 2 1 Z =6.20 Z =80.67 2 Sy₁=0.54 Sy₂=1.41 1 x S =3.53 Sx₂=80,54 Sz1=1.19 Sz2=10.81 1 yx S =1.60 Syx₂=83.47 Syz₁=-0.02 Syz₂=-7.06 N=20 n=7 1 zx S =1.75 Szx₂=68.57 TABLE 1

Empirical presentation of conditions (16), (17) and (18) under which the suggested estimator ˆST RP

Y is more efficient than y , ˆst Y and ˆRC Y PC

TABLE 2

Percent relative efficiencies of y , ˆst Y , ˆRC Y and ˆPC YRPST with respect to y st

Estimators y st YˆRC YˆPC YˆRPST

Population 1 100.00 263.19 78.18 293.21

Population 2 100.00 179.76 121.32 311.30

TABLE 3

Percent relative efficiencies of y , ˆst Y , ˆRC Y and ˆPC YRPST with respect to y st

(in case of proportional allocation)

Estimators y st YˆRC YˆPC YˆRPST

Population 1 100.00 239.88 68.95 308.58

Population 2 100.00 184.86 123.06 343.16

5. CONCLUSION

Section 3 provides the conditions under which the proposed estimator ˆYRPST has less mean squared error in comparison to usual unbiased estimator y , com-st bined ratio estimator ˆYRC and combined product estimator ˆYPC. Section 4 that deals with the efficiency comparisons in case of proportional allocation, provides the conditions (23), (24) and (25) under which proposed estimator ˆST

RP

Y is more efficient than usual unbiased estimator, combined ratio estimator and combined product estimator in case of proportional allocation.

The conditions given in section 3 have been checked empirically and given in

Estimators y st YˆRC YˆPC

Population 1 -22913.00 < 0 -30309.70 < 0 -1257.22 < 0

(10)

Table 1, which shows that all conditions are satisfied for both the populations. Table 2 exhibits that the proposed estimator ˆST

RP

Y has highest percent relative ef-ficiency in comparison to y , ˆ_st Y and ˆ_RC Y . Therefore, it is concluded that pro-_PC posed estimator ˆYRPST is more efficient than y , ˆst Y and ˆRC Y provided that PC conditions (16), (17) and (18) are satisfied

Table 3 shows that in case of proportional allocation, the proposed estimator has highest percent relative efficiency in comparison to other considered estima-tors in both populations. It is important to note that proportional allocation pro-vides higher percent relative efficiency as compared to non-proportional case.

Thus the proposed estimator ˆYRPST is recommended for use in practice instead of other conventional estimators when conditions given in section 3 and 4 are satisfied.

This study uses information on the population mean of two auxiliary variates. The same study may be extended using various known parameters of auxiliary va-riates such as coefficient of variation, coefficient of kurtosis, correlation coeffi-cient between two auxiliary variates etc. see Singh and Tailor (2005). Using simu-lation study impact of use of various parameters of auxiliary variates can also be studied in the estimation of population parameters.

School of studies in statistics, RJESH TAILOR

Vikram University Ujjain

Department of Statistics, SUNIL CHOUHAN

Shri Vaishnav Institute of Management

Wood Properties and Uses Division, RITESH TAILOR

Institute of Wood Science and Technology, Bangalore

School of Sciences NEHA GARG

Indira Gandhi National Open University (IGNOU)

ACKNOWLEDGEMENT

Authors are thankful to the referee for his valuable suggestions regarding the im-provement of the paper.

REFERENCES

B.N. PANDEY, V. DUBEY, (1988), Modified product estimator using coefficient of variation of auxiliary

variable, “Assam Statistical Review”, 2, pp. 64-66.

B.V.S. SISODIA, V.K.. DWIVEDI, (1981), A modified ratio estimator using coefficient of variation of

auxil-iary variable, “Journal of Indian Society of agricultural Statistics”, 33, 2, pp. 13-18.

C. KADILAR, H. CINGI, (2003), Ratio estimators in stratified random sampling, “Biometrical

(11)

C. KADILAR, H. CINGI, (2005), A New Ratio Estimator in Stratified Random Sampling,

“Commu-nications in Statistics-Theory and Methods”, 34, pp. 597-602.

C. KADILAR, H. CINGI, (2006), An improvement in estimating the population mean by using the

correla-tion coefficient, “Hacettepe Journal of Mathematics and Statistics”, 35, 1, pp. 103-109.

D.S. ROBSON, (1957), Application of multivariate polykays to the theory of unbiased ratio-type

estima-tion, “J. Amer. Statist. Assoc.”, 52, pp. 511-522.

H.P. SINGH, L.N. UPADHYAYA, R. TAILOR (2009). Ratio-cum-product type exponential estimator.

“Sta-tistica”, 69, 4, pp. 299-310.

H.P. SINGH, R. TAILOR, (2003), Use of known correlation coefficient in estimating the finite population

mean, “Statistics in Transition”, 6, 4, pp. 555-560.

H.P. SINGH, R. TAILOR, (2005), Estimation of finite population mean using known correlation coefficient

between auxiliary characters, “Statistica”, anno 65, 4, pp. 407-418.

H.P. SINGH, R. TAILOR, R. TAILOR, M.S. KAKRAN, (2004), An improved estimator of population mean

using power transformation, “Journal of Indian Society of Agricultural Statistics”, 58, 2, pp.

223-230.

H.P. SINGH, R. TAILOR, S. SINGH, and J.M. KIM, (2008), A modified estimator of population mean using

power transformation, “Statistical Papers”, 49, pp. 37-58.

H.P. SINGH, R. TAILOR, and R. TAILOR (2011). Estimation of finite population mean in two - phase

sampling with known coefficient of variation of an auxiliary character. Accepted for publication

in Statistica.

L.N. UPADHYAYA, H.P. SINGH, (1999), Use of transformed auxiliary variable in estimating the finite

population mean. “Biometrical Journal”, 41, 5, pp. 627-636.

M. N. MURTHY, (1967), Sampling theory and methods, Statistical Publishing Society, Calcutta,

India, pp. 228.

M. P. SINGH, (1967), Ratio cum product method of estimation, “Metrika”, 12, pp. 34-42. M.H. HANSEN, W.N. HURWITZ, and M. GURNEY, (1946). Problems and methods of the sample survey

of business, “Journal of American Statistical Association. Amer. Statist. Assoc.”, 41, pp.

173-189.

NATIONAL HORTICULTURE BOARD (2010).

http://nhb.gov.in/statistics/area-production-statistics.html

W.G. COCHRAN, (1940), The estimation of yield of cereal experiments by sampling for the ratio of gain to

total produce, “Journal of Agricultural Science.”, 30, pp. 262-275.

SUMMARY

A ratio-cum-product estimator of population mean in stratified random sampling using two auxiliary variables

This paper proposes a ratio-cum-product estimator of finite population mean in strati-fied random sampling using information on population means of two auxiliary variables. The bias and mean squared error expressions are derived under large sample approxima-tions. Proposed estimator has been compared with usual unbiased estimator in stratified sampling, combined ratio estimator and combined product estimator theoretically as well as empirically.