View of On the relationship between partial sufficiency, invariance and conditional independence

(1)

ON THE RELATIONSHIP BETWEEN PARTIAL SUFFICIENCY,

INVARIANCE AND CONDITIONAL INDEPENDENCE (*)1

J. Montanero, A. G. Nogales, J. A. Oyola, P. Pérez

1. INTRODUCTION AND MATHEMATICAL DEFINITIONS

Conditional independence is a well known and very useful concept, not only in probability theory, but also in the theory of statistical inference, where it can be used as a basic tool to express many of the important concepts of statistics (such as sufficiency, ancillarity, adequacy, etc), given an unified treatment to many areas that are, at first sight, different. We refer the reader to Dawid (1979), where an excellent justification of these statements can be found. A pioneer work in the use of conditional independence in statistical theory is Hall et al. (1965) in the study of the relationship between sufficiency and invariance; see also Nogales and Oyola (1996), where the Lemma 3.3 of Hall et al. (1965) on conditional independ-ence adopts a more appropriate formulation. Nevertheless, the main result of Hall et al. (1965) is the Theorem 3.1, that they called Stein theorem.

We also refer to Florens et al. (1990) where an extensive use of conditional in-dependence is made in a Bayesian context. The four references just cited contain many examples about the use of conditional independence in statistical theory, in general, and in the relationship of sufficiency and invariance, in particular.

Recall that two well known data reduction methods in the exact theory of sta-tistical inference are sufficiency – where no information is lost – and invariance – where the loss of information is justified by an argument of symmetry –. While the paper Hall et al. (1965) deals with the relationship between sufficiency and in-variance, Berk (1972) solves the analogous problem for the almost invariant case; we also refer to Berk et al. (1996), where some remarks on their relationship with conditional independence are given.

In this paper, a similar study is given for the concept of partial sufficiency. While sufficiency is usually recognized as the main contribution of Fisher to theo-retical statistics, the appealing intuitive concept of partial sufficient statistic (a sta-tistic keeping all the relevant information about a subparameter) is more elusive, as Fisher himself pointed out. The problem of fixing a natural mathematical

defi-(*) This work was supported by the Spanish Ministerio de Ciencia y Tecnología under the pro-ject BFM2002-01217.

(2)

nition of partial sufficiency in a classical setting has been considered in a large number of famous papers, but a satisfactory and definitive solution is not avail-able.

This paper follows the Fraser approach that the interested reader can find in Basu (1978), where a fairly detailed review of the usefulness of partial sufficiency in statistical theory, together with other approaches to this concept (like H-sufficiency of Hájek, Q-H-sufficiency and L-H-sufficiency of Raiffa and Schlaifer, or the related concept of Bandorff-cut introduced in Bandorff-Nielsen (1973)), is given. We can also find several illustrative examples and some justifications via Rao-Blackwell type theorems. Godambe (1980) introduces the concept of suffi-ciency for T ignoring I, and studies its relationship with Fisher's information; this problem is also considered in Kung-Yee Liang (1983), in Rémon (1984), where partial sufficiency also is named L-sufficiency, and, in a different frame-work, in Kabaila (1998). See also Zhu and Reid (1994), where a notion of partial sufficiency (named P-sufficiency and including the definitions of Fraser (1956) and Bhapkar (1991)) based on partial information is presented. Finally, the reader is referred to Montanero et al. (2003), where the theorem of Stein for partial suffi-ciency is obtained.

Let us fix the notations to be used throughout the paper. ( , , ): ( will be a statistical experiment, i.e., ( , ): is a measurable space and ( a family of prob-ability measures on ( , ): .

Usually, we shall suppose the family ( written in the form

,

{PT I : ( , )T I u4 )},

( (1)

where 4 and ) are nonempty sets. T will be considered as the parameter of interest, while I remains as a nuisance parameter. The family ( will be sup-posed identifiable, in the sense that PT I_, zPT I_{', '} if ( , ) ( ', ')T I z T I .

Given, I ) we shall write (_I {P_{T I}_, :T 4 }; & (resp., &_I) will denote the family of the (-null (resp., (I-null) events. For two statistics, f and g, we shall write f ~ g (resp.,_{) if {}I f zg} belongs to & (resp., &_I). In this case, f and g are said to be -equivalent( (resp., (I-equivalent). For a sub-V-field , [ ] (resp., [ ] ) will denote the class of the -measurable (resp., -measurable and

non negative) functions, and we shall write for the completion of with the

(-null sets. Given two sub-V -fields and of , we say that both are equivalent (we write ~ ) when .

Recall that a sub-V -field of is said to be sufficient when, for all A , PPP A( | ) Ø, z where ( | )P A denotes the conditional probabil- ity of A given , i.e., the class of all real functions f[ ] such that E (_P I_A f) P A( B), for all B  . Writing the family ( as in (1), is said to

(3)

be -orientedT when the restriction B_,

PT I of the probability PT I, to does not

de-pend on I. is said to be specific -sufficientT if it is sufficient for the statistical experiments ( , , ),: (_I I ) . is said to be partially -sufficientT (in the sense of Fraser, 1956) if it is -orientedT and specific -sufficient.T

A transformation on a set : is a bijective map from : onto itself. We say

that a group G of bimeasurable transformations on ( , ): leaves invariant the statistical experiment ( , , ),: (_I when, for all gG and P (, the probability distribution Pg

of g with respect to P lies in (. Thus, for every transformation ,

g there exists a bijective map :g (o( such that Pg g( ),P for all P (. A statistic :( , , )f : ( o( ', ' ): is said to be _G-invariant if f Dg f, for all

; G

g f is said to be almost -invariantG (resp., -almostI G-invariant for a given I ) ) if f _~f Dg (resp., f I f Dg ), for all gG. An event A is said to be -invariant,G almost -invariantG or -almostI G-invariant when so is its indicator function I_A. In the next, _Ɗ, _Ƃ and _ƂI will denote, respectively, the V-fields of the G -invariant almost G -invariant or I-almost G -invariant

events. Obviously, every G-invariant, almost G -invariant or I-almost

-invariant

G statistic is _Ɗ-measurable, _A-measurable or _ƂI -measurable,

resp.; we can find in Florens et al. (1990) some conditions of regularity under which the converse implications are also true. A sub-V-field is said to be stable (resp., essentially stable) when g (resp., g ~ ), for all gG.

Let us also recall the concept of conditional independence. Given three sub-V-fields ₁, ₂, ₃ , 1 and 2 are said to be conditionally inde-

pendent given ₃ (we write ₁A ₂| ₃) when, for every A₁₁,

2 2

A  and P  ,( P A( 1A2| )3 P A( 1| ) (3 P A2| );3 it is well known

that this is equivalent to the fact that, for all A ₁ ₁, and P  ,(

1 3 1 2 3

( | ) ( | ) Ø,

P A P A z where ₂ ₃ is the least V-field

con-taining ₂ and ₃.

The relationship between invariance and sufficiency is studied in Hall et al. (1965), whose main theorem, attributed to Stein, is the following: “Given a statis-tical experiment which remains invariant under the action of a group G and a sufficient and essentially G -stableV-field such that _Ƃ ~_Ɗ, the V -field _Ɗ is sufficient for _Ɗ ”. The main part of their paper deals with the four propositions below:

(SI1) For all A _Ɗ, there exists an invariant version in _PPP A( | ) .

(4)

(SI3) _Ɗ is sufficient for _Ɗ .

(SI4) For all A _Ɗ, there exists an almost-invariant version in

( | ) P P A

.

Although it is asserted in Hall et al. (1965) that (SI2) implies (SI3), it is shown in Nogales and Oyola (1996) that it is not true, and that the relationship between these propositions are the following: (SI1) (SI2) + (SI3) and (SI2) (SI4).

If the principle of invariance is understood as a reduction to the V-field _Ƃ of the almost invariant events, the following result of Berk (1972) (see the discussion after Lemma 3; see also Nogales and Oyola (1996, p. 907, Remark 1(v))) solves the main problem considered in Hall et al. (1965):

“If is sufficient and essentially stable, _Ƃ is sufficient for _Ƃ”. 2. PARTIAL SUFFICIENCY AND INVARIANCE

In this section we are interested in the relationship between partial sufficiency and invariance. First, we consider the following propositions, the partial suffi-ciency analogue of propositions (SI1), (SI2), (SI3) and (SI4). will be a partially sufficient V-field.

(PSI1) For all I and ) A _Ɗ, there exists an invariant version in

, ( | )

P A

T4 T I

.

(PSI2) A _Ɗ| _Ɗ .

(PSI3) _Ɗ is partially T-sufficient for _Ɗ .

(PSI4) For all I and ) A _Ɗ, there exists a I-almost-invariant version in

, ( | )

P Ƃ

T 4 T I

.

Equivalent formulations of propositions (PSI2) and (PSI4) are: (PSI2) ( , )T I u ,4 ) A _Ɗ, PT I, ( |A Ɗ)PT I, ( | )A .

(PSI4) ,I ) A _Ɗ, T4PT I_, ( | ) [A I_Ƃ] .

It is our aim to study the relationship between these four propositions. We shall need the following lemma, whose proof can be found in Montanero et al. (2003).

Lemma 1. If is a V-field T-oriented and essentially G-stable, then ,

ƂI Ƃ

for all I .)

The following proposition is a direct application of the result of Berk cited above to the statistical experiment ( , , ): (I .

(5)

Proposition 2. Let be a specific -sufficientT and essentially stable V-field. If G leaves invariant every family ,(I I ) , then (PSI4) holds.

Now, we are ready to obtain the main result of the paper. See Nogales and Oyola (1996) for a similar result for sufficiency.

Theorem 3. If is partial sufficient and G leaves invariant every family (I, ,

I ) then

(i) (PSI1) (PSI2) + (PSI3) (ii) (PSI2) (PSI4).

Proof. (i) Given I ) and A _Ɗ, let be pI_Ƃ an invariant statistic in

, ( | )

P Ƃ

T4 T I

. (PSI2) follows immediately from this. Moreover,

, ( | )

Ƃ P A Ɗ

I

T4 T I

p , and (PSI3) also holds.

For the converse, choose I ) and A _Ɗ. By (PSI3), there exists a statistic

, ( | )

Ƃ P A Ɗ

I

T4 T I

p . Then, pI_Ƃ_T₄_P_{T I}_, ( | )_A follows from (PSI2).

This gives the proof as pI_Ƃ is invariant.

(ii) Let I ) and A _Ɗ. Since is specific -sufficient,T there exists

, ( | )

Ƃ P A

I

T4 T I

p . Given gG and T 4 let f_ƂT I, and f_Ƃg( , )T I be versions

of PT I_, ( |A _Ɗ) and PT Ig, ( |A Ɗ), respectively. Then, we have that ( , ) , ({ Ƃ Ƃ}) , ({ Ƃ Ƃ }) PT I pI Dg zpI dPT I pI Dg zfg T I Dg PT I_, ({f_Ƃg( , )T I Dg zf_Ƃg( , )T I }) (2) PT I_, ({f_Ƃg( , )T I zf_ƂT I, }) PT I_, ({f_ƂT I, z pI_Ƃ})

By hypothesis, G leaves invariant the family .(_I So, there exists g₁( )T 4 such that g₁( , ) ( ( ), )T I g₁₁ T I . Then, by (PSI2) we have that f_AT I, PT I_, ( | )A and

1 ( , ) ( ), ( | ) A P A T I T I fg g . So, 1 ( , ) ( , ) , ({ Ƃ Ƃ }) ( ), ({ Ƃ Ƃ }) 0 PT I pI Dg zfg T I Dg Pg T I pI zfg T I Analogously, , , ({ Ƃ Ƃ}) 0 PT I fT I zpI

(6)

The second term of the sum in the right-hand side of (2) is also null, because

( , )

Ƃ T I fg _is

Ɗ

-measurable. Finally, if B  _Ɗ then

1 1 , , , ( , ) ( , ) , _{( )} , ( , ) , A A A B B B Ƃ Ƃ B B Ƃ B I dP I dP I dP dP dP dP T I T I T I T I T I T I T I T I T I

³

D D g g g g f f g f g g g g g (3)

Thus, f_Ƃg( , )T I PT I_, ( |A _Ɗ ), and the third term is 0. So, we have shown that Ƃ

I

p is I-almost-invariant.

Remark 4. As a consequence of the previous result we get the version for V-fields of the Stein theorem for partial sufficiency, that the reader can find in Montanero et al. (2003). Namely, if G leaves invariant every family (I and is a partially

-sufficient

T and essentially stable V-field such that _Ƃ ~_Ɗ, then the

propositions (PSI1), (PSI2), (PSI3) and (PSI4) hold.

The paper Nogales and Oyola (1996) contains an example showing that (SI2) does not imply (SI3), as it is assured in Hall et al. (1965). The following proposi-tion shows how this example can be adapted to prove that (PSI2) does not imply (PSI3).

Proposition 5. There exist a statistical experiment ( , ,{: PT I, :( , )T I 4 ) u }), a

group of bimeasurable transformations leaving invariant every family (I and a partially T-sufficient V-field satisfying (PSI2) but not (PSI3).

Proof. Let ( ,:₁ ₁,{ :PT T 4 }) be a statistical experiment, G1 be a group of

bi-measurable transformations leaving it invariant and ₁ be a sufficient V-field such that (SI2) holds and (SI3) does not hold (see Nogales and Oyola (1996) for such an example).

Let us consider the statistical experiment

1 1

(: u`, Pu ` ,{PT uH_n : ( , )T n u4 `})

and the group G {( ,g₁ id_`) : g₁G₁} where H_n is the probability distribution degenerated at n and id_` denotes the identity map on ` . It is clear that G leaves invariant each family {PT uH_n : T 4 },n` . We shall show the V-field

{ , }

u _`

is partially -sufficientT and (PSI2) holds for it, but (PSI3) doesn’t.

First, we note that _Ɗ _Ɗ1u `P( ), _Ɗ1 and _Ɗ being the V-fields of the G1

(7)

1G1, g if g: ( ,g₁ id_`) and A_n : {Z : Z ₁: ( , )n A}, then (gA)_n g₁A_n; so 1 n { } ( )n { } n { }, n n n A n A n A A A n u u u ` ` `

*

g

*

g g

*

which proves that g₁(A_n) A_n, for all n ` and hence, , A_,1u `_P( ). The inclusion _Ɗ1uP( )` _Ɗ is obvious.

Now, fix ( , )T n u4 `, A₁_Ɗ1 and N P( )` . From (SI2), the existence of a _Ɗ -measurable statistic

1

A T

p in ( | )P AT follows. Since the map

1 1 , 1 : ( , ') ( ) ( ) n A N n A pT u Z n u: ` 6H N pT Z

is a _Ɗ-measurable version of (PT uH_n)(A₁uN|u{ , }), ` the theorem of Dynkin proves that, for every A _Ɗ _Ɗ1u `P( ), there exists a Ɗ-measurable version of (PT uH_n)( |A u{ , }), ` . Thus (PSI2) holds.

An analogous argument applied to a version

1 ( 1| ) Ƃ T 4 T P Ƃ p shows that 1 ( , ') : ( ) 1( ) n A N n A p u Z n H N p T Z belongs to T 4 (PT uHn)(A N1u | { , })u ` ;

hence, u{ , } _` is specific -sufficient.T Moreover, being also -oriented,T it is partially -sufficient.T

Finally, let us see that (PSI3) does not hold: otherwise, given A _Ɗ _Ɗ1 and ,

n ` there would exist

1 ( )( 1 |( { , }) ). n A PT n A Ɗ T 4 H

u u_` u _` q

Then, being (u{ , }) _` _Ɗ (_Ɗ1) { , },u _` there would exist a

1 (_Ɗ )-measurable statistic n ₁ A q such that 1( ) 1( , '), n n A Z A Z n p p for all Z : ₁

and all 'n ` It would follow that . p_Ƃn₁ T 4 T P ( |Ƃ _Ɗ1), which leads to a contradiction, since (SI3) does not hold.

Remark 6. Under the equality_Ƃ _Ɗ,the propositions (PSI1), (PSI2), (PSI3) and (PSI4) are equivalent. In Lehmann (1986), section 6.5, sufficient conditions are given to get this equality. To illustrate this situation, let us recall an example from Montanero et al. (2003). Consider the statistical experiment

(8)

2 2 2 2 ₂ 0 ( ) ,( ) , , : , 0, 0 n n n N T \ T [ \ \ [ § _° _§_{§ ·} _§ _·_· ½_°· ¨ _® _¨ _¨ _¸_¸ _! _¾¸ ¨ ¸ ¨ ¸ ¨ ¨ ¸ ¸ ¨ _°_¯ _©© ¹ _© _¹_¹ _°_¿¸ © ¹ \ R \

corresponding to a n sized sample of a bivariate normal distribution with mean 0 and unknown covariance matrix. Setting E \/T2 and V2 [ \/ ,T2 define I ( ,E V2). Let G {g/ : /O_n}, where On is the group of orthogonal

matrices of order n and g/( , ) : (x y / /x, y), for ( , ) (x y  \ and n)2 /On. It can be easily checked that G leaves invariant each family (I and that

{ , }

n_u _\n

* is a G -stable, partially -sufficientT V-field. Since all the regularity

conditions are satisfied, the propositions (PSI1), (PSI2), (PSI3) and (PSI4) hold. 3. THE ALMOST-INVARIANT CASE

Replacing invariance by almost invariance, the four propositions (PSI1), (PSI2), (PSI3) and (PSI4) become:

(PSA1) For all I ) and A _Ƃ, there exists an almost-invariant statistic

, ( | ) P A I T 4 T I $ p . (PSA2) A _Ƃ| _Ƃ.

(PSA3) _Ƃ is partially T-sufficient for _Ƃ.

(PSA4) For all I ) and A _Ƃ, there exists an -almost-invariantI statistic

, ( | ) Ƃ P A I T4 T I p .

The next theorem states the relationship between them.

Theorem 7. If is partially -sufficientT and G leaves invariant every family (I, then:

(i) (PSA1) (PSA2) + (PSA3).

(ii) (PSA2) (PSA4).

Proof. (i) Given I ) and A_Ƃ, let be pI_A an almost-invariant statistic in T4PT I_, ( | )A . (PSA2) follows immediately from this. Moreover,

, ( | ),

A P A A

I

T4 T I

p and (PSA3) also holds. For the converse,

choose I ) and A_Ƃ. By (PSA3), there exists a statistic

, ( | ).

A P A A

I

T4 T I

p From (PSA2), we have pI_AT4PT I_, ( | ).A This

(9)

(ii) Let I ) and A_Ƃ. being specific -sufficient,T there exists , ( | ). A P A I T4 T I

p Given gG and T T let , f$T I, PT I_, ( |A _Ƃ) and

( , )

, ( | ).

Ƃ T I PT I A Ƃ

fg g _As _in _(2), _it _can _be _shown _that

, ({ A A}) 0,

PT I pI Dg zpI and this shows that A

I

p is I-almost invariant, which gives the proof.

Remark 8. As the previous theorem is the almost-invariant analogue of Theorem 3, we can derive from it an analogue to Remark 4; namely, if G leaves invariant every family (I and is a partially -sufficient,T and essentially stable V -field, then the four propositions (PSA1), (PSA2), (PSA3) and (PSA4) hold.

Departmento de Matemáticas JESÚS MONTANERO

Universidad de Extremadura AGUSTÍN G. NOGALES

JOSÉ A. OYOLA PALOMA PÉREZ

REFERENCES

BANDORFF-NIELSEN, O.(1973), On M-ancillarity, “Biometrika”, 72 (3), pp. 447-455.

BASU, D.(1978), On the elimination of nuisance parameters, “Journal of the American Statistical Asociation”, 72 (358), pp. 355-366.

BERK, R.H.(1972), A note on Sufficiency and Invariance, “Annals of Mathematical Statistics”, 39, pp. 647-650.

BERK, R.H., A.G. NOGALES, J.A. OYOLA(1996), Some counterexamples concerning sufficiency and invari-ance, “Annals of Statistics”, 24 (2), pp. 902-905.

BHAPKAR, V.P.(1991), Loss of information in the presence of nuisance parameters and partial sufficiency, “Journal of Statistical Planning and Inference”, 28, pp. 185-203.

DAWID, A.P.(1979), Conditional independence in statistical theory, “The Journal of the Royal Sta-tistical Society”, Series B 41, pp. 1-31.

FLORENS, J.P., M., MOUCHART, J.M., ROLIN(1990), Elements of Bayesian Statistics, Marcel Dekker Inc., New York.

FRASER, D.A.S.(1956), Sufficient statistics with nuisances parameters, “Annals of Mathematical Sta-tistics”, 27, pp. 838-842.

GHOSH, J.K.(1988), Statistical Information and Likelihood'. A collection of critical essays by Dr. D. Basu, “Lecture Notes in Statistics”, 45, Springer Verlag, New York.

GODAMBE, V. (1980), On sufficiency and ancillarity in the presence of a nuisance parameters, “Bio-metrika”, 67 (1), pp. 155-162.

HALL, W.J., R.A. WIJSMAN and J.K. GHOSH (1965), The relationship between sufficiency and invariance with applications in sequential analysis, “Annals of Mathematical Statistics”, 36, 575-614. KABAILA, P.(1998), A note on sufficiency and information loss, “Statistics & Probability Letters”,

37, pp. 111-114.

KUNG-YEE LIANG (1983), On information and ancillarity in the presence of a nuisances parameter, “Biometrika”, 70 (3), pp. 607-612.

MONTANERO, J., NOGALES, A.G., OYOLA, J.A., P. PÉREZ(2003), On partial sufficiency and invariance, “Statistica Sinica”, 13 (4), pp. 1269-1280.

(10)

NOGALES, A.G., J.A. OYOLA(1996), Some remarks on sufficiency, invariance and conditional independ-ence,“The Ann. Statist.” 24 (2), pp. 906-909.

RÉMON, M. (1984), On a concept of partial sufficiency: L-sufficiency, “International Statistical Re-view”, 52 (2), pp. 127-135.

ZHU, Y., REID, N.(1994), Information, ancillarity, and sufficiency in the presence of nuisance parameters, “Canadian Journal of Statistics”, 22 (1), pp. 111-123.

RIASSUNTO

Sulla relazione fra parziale sufficienza, invarianza e independenza condizionale

Molte proposizioni che appaiono in modo naturale nella letteratura sulla sufficienza e invarianza, includendo la dipendenza condizionale delle sigma-algebra invarianti e suffi-cienti data la loro intersezione, sono adattate per la sufficienza parziale (nel senso di Fra-ser) ed é studiata la relazione tra esse.

SUMMARY

On the relationship between partial sufficiency, invariance and conditional independence

Several propositions that appear in a natural way in the literature on sufficiency and invariance, including the conditional independence of the invariant and the sufficient

V-fields given its intersection, are adapted for partial sufficiency (in the sense of Fraser), and the relationship between them is studied.

View of On the relationship between partial sufficiency, invariance and conditional independence

³

³

³

³

³

³

*

*

*