• Non ci sono risultati.

View of On the relationship between partial sufficiency, invariance and conditional independence

N/A
N/A
Protected

Academic year: 2021

Condividi "View of On the relationship between partial sufficiency, invariance and conditional independence"

Copied!
10
0
0

Testo completo

(1)

ON THE RELATIONSHIP BETWEEN PARTIAL SUFFICIENCY,

INVARIANCE AND CONDITIONAL INDEPENDENCE (*)1

J. Montanero, A. G. Nogales, J. A. Oyola, P. Pérez

1. INTRODUCTION AND MATHEMATICAL DEFINITIONS

Conditional independence is a well known and very useful concept, not only in probability theory, but also in the theory of statistical inference, where it can be used as a basic tool to express many of the important concepts of statistics (such as sufficiency, ancillarity, adequacy, etc), given an unified treatment to many areas that are, at first sight, different. We refer the reader to Dawid (1979), where an excellent justification of these statements can be found. A pioneer work in the use of conditional independence in statistical theory is Hall et al. (1965) in the study of the relationship between sufficiency and invariance; see also Nogales and Oyola (1996), where the Lemma 3.3 of Hall et al. (1965) on conditional independ-ence adopts a more appropriate formulation. Nevertheless, the main result of Hall et al. (1965) is the Theorem 3.1, that they called Stein theorem.

We also refer to Florens et al. (1990) where an extensive use of conditional in-dependence is made in a Bayesian context. The four references just cited contain many examples about the use of conditional independence in statistical theory, in general, and in the relationship of sufficiency and invariance, in particular.

Recall that two well known data reduction methods in the exact theory of sta-tistical inference are sufficiency – where no information is lost – and invariance – where the loss of information is justified by an argument of symmetry –. While the paper Hall et al. (1965) deals with the relationship between sufficiency and in-variance, Berk (1972) solves the analogous problem for the almost invariant case; we also refer to Berk et al. (1996), where some remarks on their relationship with conditional independence are given.

In this paper, a similar study is given for the concept of partial sufficiency. While sufficiency is usually recognized as the main contribution of Fisher to theo-retical statistics, the appealing intuitive concept of partial sufficient statistic (a sta-tistic keeping all the relevant information about a subparameter) is more elusive, as Fisher himself pointed out. The problem of fixing a natural mathematical

defi-(*) This work was supported by the Spanish Ministerio de Ciencia y Tecnología under the pro-ject BFM2002-01217.

(2)

nition of partial sufficiency in a classical setting has been considered in a large number of famous papers, but a satisfactory and definitive solution is not avail-able.

This paper follows the Fraser approach that the interested reader can find in Basu (1978), where a fairly detailed review of the usefulness of partial sufficiency in statistical theory, together with other approaches to this concept (like H-sufficiency of Hájek, Q-H-sufficiency and L-H-sufficiency of Raiffa and Schlaifer, or the related concept of Bandorff-cut introduced in Bandorff-Nielsen (1973)), is given. We can also find several illustrative examples and some justifications via Rao-Blackwell type theorems. Godambe (1980) introduces the concept of suffi-ciency for T ignoring I, and studies its relationship with Fisher's information; this problem is also considered in Kung-Yee Liang (1983), in Rémon (1984), where partial sufficiency also is named L-sufficiency, and, in a different frame-work, in Kabaila (1998). See also Zhu and Reid (1994), where a notion of partial sufficiency (named P-sufficiency and including the definitions of Fraser (1956) and Bhapkar (1991)) based on partial information is presented. Finally, the reader is referred to Montanero et al. (2003), where the theorem of Stein for partial suffi-ciency is obtained.

Let us fix the notations to be used throughout the paper. ( , , ):  ( will be a statistical experiment, i.e., ( , ):  is a measurable space and ( a family of prob-ability measures on ( , ):  .

Usually, we shall suppose the family ( written in the form

,

{PT I : ( , )T I  u4 )},

( (1)

where 4 and ) are nonempty sets. T will be considered as the parameter of interest, while I remains as a nuisance parameter. The family ( will be sup-posed identifiable, in the sense that PT I, zPT I', ' if ( , ) ( ', ')T I z T I .

Given, I ) we shall write (I {PT I, :T 4 }; & (resp., &I) will denote the family of the (-null (resp., (I-null) events. For two statistics, f and g, we shall write f ~ g (resp., ) if {I f zg} belongs to & (resp., &I). In this case, f and g are said to be -equivalent( (resp., (I-equivalent). For a sub-V-field  , [ ] (resp., [ ] ) will denote the class of the -measurable  (resp., -measurable  and

non negative) functions, and we shall write  for the completion of  with the

(-null sets. Given two sub-V -fields  and  of , we say that both are equivalent (we write ~ ) when  .

Recall that a sub-V -field  of  is said to be sufficient when, for all A , PPP A( | ) Ø, z where ( | )P A  denotes the conditional probabil- ity of A given , i.e., the class of all real functions f[ ] such that E (P IA ˜ f) P A( ˆB), for all B  . Writing the family ( as in (1),  is said to

(3)

be -orientedT when the restriction B,

PT I of the probability PT I, to  does not

de-pend on I.  is said to be specific -sufficientT if it is sufficient for the statistical experiments ( , , ),:  (I I ) .  is said to be partially -sufficientT (in the sense of Fraser, 1956) if it is -orientedT and specific -sufficient.T

A transformation on a set : is a bijective map from : onto itself. We say

that a group G of bimeasurable transformations on ( , ):  leaves invariant the statistical experiment ( , , ),:  (I when, for all gG and P (, the probability distribution Pg

of g with respect to P lies in (. Thus, for every transformation ,

g there exists a bijective map :g (o( such that Pg g( ),P for all P (. A statistic :( , , )f :  ( o( ', ' ):  is said to be G-invariant if f Dg f, for all

; G 

g f is said to be almost -invariantG (resp., -almostI G-invariant for a given I ) ) if f ~f Dg (resp., f I f Dg ), for all gG. An event A  is said to be -invariant,G almost -invariantG or -almostI G-invariant when so is its indicator function IA. In the next,  Ɗ, Ƃ and ƂI will denote, respectively, the V-fields of the G -invariant almost G -invariant or I-almost G -invariant

events. Obviously, every G-invariant, almost G -invariant or I-almost

-invariant

G statistic is Ɗ-measurable, A-measurable or ƂI -measurable,

resp.; we can find in Florens et al. (1990) some conditions of regularity under which the converse implications are also true. A sub-V-field  is said to be stable (resp., essentially stable) when g  (resp., g ~ ), for all gG.

Let us also recall the concept of conditional independence. Given three sub-V-fields   1, 2, 3  , 1 and 2 are said to be conditionally inde-

pendent given 3 (we write 1A 2| 3) when, for every A11,

2 2

A  and P  ,( P A( 1ˆA2| )3 P A( 1| ) (3 ˜P A2| );3 it is well known

that this is equivalent to the fact that, for all A 1 1, and P  ,(

1 3 1 2 3

( | ) ( | ) Ø,

P A  ˆ P A  ›  z where 2 › 3 is the least V-field

con-taining 2 and 3.

The relationship between invariance and sufficiency is studied in Hall et al. (1965), whose main theorem, attributed to Stein, is the following: “Given a statis-tical experiment which remains invariant under the action of a group G and a sufficient and essentially G -stableV-field  such that ˆƂ ~ˆƊ, the V -field ˆƊ is sufficient for Ɗ ”. The main part of their paper deals with the four propositions below:

(SI1) For all A Ɗ, there exists an invariant version in ˆPPP A( | ) .

(4)

(SI3)  ˆƊ is sufficient for Ɗ .

(SI4) For all A Ɗ, there exists an almost-invariant version in

( | ) P P A

ˆ   .

Although it is asserted in Hall et al. (1965) that (SI2) implies (SI3), it is shown in Nogales and Oyola (1996) that it is not true, and that the relationship between these propositions are the following: (SI1) œ (SI2) + (SI3) and (SI2) Ÿ (SI4).

If the principle of invariance is understood as a reduction to the V-field Ƃ of the almost invariant events, the following result of Berk (1972) (see the discussion after Lemma 3; see also Nogales and Oyola (1996, p. 907, Remark 1(v))) solves the main problem considered in Hall et al. (1965):

“If  is sufficient and essentially stable,  ˆƂ is sufficient for Ƃ”. 2. PARTIAL SUFFICIENCY AND INVARIANCE

In this section we are interested in the relationship between partial sufficiency and invariance. First, we consider the following propositions, the partial suffi-ciency analogue of propositions (SI1), (SI2), (SI3) and (SI4).  will be a partially sufficient V-field.

(PSI1) For all I and ) A Ɗ, there exists an invariant version in

, ( | )

P A

T4 T I

ˆ  .

(PSI2)  A  Ɗ| ˆƊ .

(PSI3)  ˆƊ is partially T-sufficient for Ɗ .

(PSI4) For all I and ) A Ɗ, there exists a I-almost-invariant version in

, ( | )

P Ƃ

T 4 T I

ˆ  .

Equivalent formulations of propositions (PSI2) and (PSI4) are: (PSI2) ( , )T I  u ,4 )  A Ɗ, PT I, ( |A ˆƊPT I, ( | )A  .

(PSI4)   ,I )  A Ɗ, ˆT4PT I, ( | ) [A   IƂ] .

It is our aim to study the relationship between these four propositions. We shall need the following lemma, whose proof can be found in Montanero et al. (2003).

Lemma 1. If  is a V-field T-oriented and essentially G-stable, then ,

ƂI Ƃ

ˆ ˆ

    for all I .)

The following proposition is a direct application of the result of Berk cited above to the statistical experiment ( , , ):  (I .

(5)

Proposition 2. Let  be a specific -sufficientT and essentially stable V-field. If G leaves invariant every family ,(I I ) , then (PSI4) holds.

Now, we are ready to obtain the main result of the paper. See Nogales and Oyola (1996) for a similar result for sufficiency.

Theorem 3. If  is partial sufficient and G leaves invariant every family (I, ,

I ) then

(i) (PSI1) œ (PSI2) + (PSI3) (ii) (PSI2) Ÿ (PSI4).

Proof. (i) Given I ) and A Ɗ, let be pIƂ an invariant statistic in

, ( | )

P Ƃ

T4 T I

ˆ  . (PSI2) follows immediately from this. Moreover,

, ( | )

Ƃ P A Ɗ

I

T4 T I

ˆ ˆ

p   , and (PSI3) also holds.

For the converse, choose I ) and A Ɗ. By (PSI3), there exists a statistic

, ( | )

Ƃ P A Ɗ

I

T4 T I

ˆ ˆ

p   . Then, pIƂˆT4PT I, ( | )A  follows from (PSI2).

This gives the proof as pIƂ is invariant.

(ii) Let I ) and A Ɗ. Since  is specific -sufficient,T there exists

, ( | )

Ƃ P A

I

T4 T I ˆ

p  . Given gG and T 4 let fƂT I, and fƂg( , )T I be versions

of PT I, ( |A ˆƊ) and PT Ig, ( |A ˆƊ), respectively. Then, we have that ( , ) , ({ Ƃ Ƃ}) , ({ Ƃ Ƃ }) PT I pI Dg zpI dPT I pI Dg zfg T I Dg PT I, ({fƂg( , )T I Dg zfƂg( , )T I }) (2) PT I, ({fƂg( , )T I zfƂT I, }) PT I, ({fƂT I, z pIƂ})

By hypothesis, G leaves invariant the family .(I So, there exists g1( )T 4 such that g1( , ) ( ( ), )T I g11 T I . Then, by (PSI2) we have that fAT I, PT I, ( | )A  and

1 ( , ) ( ), ( | ) A P A T I T I  fg  g . So, 1 ( , ) ( , ) , ({ Ƃ Ƃ }) ( ), ({ Ƃ Ƃ }) 0 PT I pI Dg zfg T I Dg Pg T I pI zfg T I Analogously, , , ({ Ƃ Ƃ}) 0 PT I fT I zpI

(6)

The second term of the sum in the right-hand side of (2) is also null, because

( , )

Ƃ T I fg is

Ɗ

 -measurable. Finally, if B  ˆƊ then

1 1 , , , ( , ) ( , ) , ( ) , ( , ) , A A A B B B Ƃ Ƃ B B Ƃ B I dP I dP I dP dP dP dP T I T I T I T I T I T I T I T I T I  

³

³

³

³

³

³

D D g g g g f f g f g g g g g (3)

Thus, fƂg( , )T I PT I, ( |A ˆƊ ), and the third term is 0. So, we have shown that Ƃ

I

p is I-almost-invariant.

Remark 4. As a consequence of the previous result we get the version for V-fields of the Stein theorem for partial sufficiency, that the reader can find in Montanero et al. (2003). Namely, if G leaves invariant every family (I and  is a partially

-sufficient

T and essentially stable V-field such that ˆƂ ~ˆƊ, then the

propositions (PSI1), (PSI2), (PSI3) and (PSI4) hold.

The paper Nogales and Oyola (1996) contains an example showing that (SI2) does not imply (SI3), as it is assured in Hall et al. (1965). The following proposi-tion shows how this example can be adapted to prove that (PSI2) does not imply (PSI3).

Proposition 5. There exist a statistical experiment ( , ,{:  PT I, :( , )T I 4 ) u }), a

group of bimeasurable transformations leaving invariant every family (I and a partially T-sufficient V-field  satisfying (PSI2) but not (PSI3).

Proof. Let ( ,:1 1,{ :PT T 4 }) be a statistical experiment, G1 be a group of

bi-measurable transformations leaving it invariant and  1 be a sufficient V-field such that (SI2) holds and (SI3) does not hold (see Nogales and Oyola (1996) for such an example).

Let us consider the statistical experiment

1 1

(: u`, Pu ` ,{PT uHn : ( , )T n  u4 `})

and the group G {( ,g1 id`) : g1G1} where Hn is the probability distribution degenerated at n and id` denotes the identity map on ` . It is clear that G leaves invariant each family {PT uHn : T 4 },n` . We shall show the V-field

{ , }‡

u `

 is partially -sufficientT and (PSI2) holds for it, but (PSI3) doesn’t.

First, we note that Ɗ Ɗ1u `P( ), Ɗ1 and Ɗ being the V-fields of the G1

(7)

G1, g if g: ( ,g1 id`) and An : {Z : Z 1: ( , )n A}, then (gA)n g1An; so 1 n { } ( )n { } n { }, n n n A n A n A A A n    u u u ` ` `

*

g

*

g g

*

which proves that g1(An) An, for all n ` and hence, , A,1u `P( ). The inclusion Ɗ1uP( )` Ɗ is obvious.

Now, fix ( , )T n  u4 `, A1Ɗ1 and N P( )` . From (SI2), the existence of a Ɗ -measurable statistic

1

A T

p in ( | )P AT  follows. Since the map

1 1 , 1 : ( , ') ( ) ( ) n A N n A pT u Z n  u: ` 6H N ˜ pT Z

is a Ɗ-measurable version of (PT uHn)(A1uN|u{ , }),‡ ` the theorem of Dynkin proves that, for every A Ɗ Ɗ1u `P( ), there exists a Ɗ-measurable version of (PT uHn)( |A u{ , }),‡ ` . Thus (PSI2) holds.

An analogous argument applied to a version

1 ( 1| ) Ƃ ˆT 4 T P Ƃ p  shows that 1 ( , ') : ( ) 1( ) n A N n A p u Z n H N p˜ T Z belongs to ˆT 4 (PT uHn)(A N1u | { , })u ‡ ` ;

hence, u{ , }‡ ` is specific -sufficient.T Moreover, being also -oriented,T it is partially -sufficient.T

Finally, let us see that (PSI3) does not hold: otherwise, given A Ɗ Ɗ1 and ,

n ` there would exist

1 ( )( 1 |( { , }) ). n A PT n A Ɗ T 4 H  ‡ 



u u` u ` ˆ q  

Then, being (u{ , })‡ ` ˆƊ (ˆƊ1) { , },u ‡ ` there would exist a

1 (ˆƊ )-measurable statistic n 1 A q such that 1( ) 1( , '), n n A Z A Z n p p for all Z : 1

and all 'n ` It would follow that . pƂn1 ˆT 4 T P ( |Ƃ ˆƊ1), which leads to a contradiction, since (SI3) does not hold.

Remark 6. Under the equalityƂ Ɗ,the propositions (PSI1), (PSI2), (PSI3) and (PSI4) are equivalent. In Lehmann (1986), section 6.5, sufficient conditions are given to get this equality. To illustrate this situation, let us recall an example from Montanero et al. (2003). Consider the statistical experiment

(8)

2 2 2 2 2 0 ( ) ,( ) , , : , 0, 0 n n n N T \ T [ \ \ [ § ­° §§ · § ·· ½°· ¨ ® ¨ ¨ ¸¸ !  ¾¸ ¨ ¸ ¨ ¸ ¨ ¨ ¸ ¸ ¨ °¯ ©© ¹ © ¹¹ °¿¸ © ¹ \ R \

corresponding to a n sized sample of a bivariate normal distribution with mean 0 and unknown covariance matrix. Setting E \/T2 and V2 [ \/ ,T2 define I ( ,E V2). Let G {g/ : /On}, where On is the group of orthogonal

matrices of order n and g/( , ) : (x y / /x, y), for ( , ) (x y  \ and n)2 /On. It can be easily checked that G leaves invariant each family (I and that

{ , }

nu ‡ \n

* is a G -stable, partially -sufficientT V-field. Since all the regularity

conditions are satisfied, the propositions (PSI1), (PSI2), (PSI3) and (PSI4) hold. 3. THE ALMOST-INVARIANT CASE

Replacing invariance by almost invariance, the four propositions (PSI1), (PSI2), (PSI3) and (PSI4) become:

(PSA1) For all I ) and A Ƃ, there exists an almost-invariant statistic

, ( | ) P A I T 4 T I $ˆ  p  . (PSA2)  A Ƃ|  ˆƂ.

(PSA3)  ˆƂ is partially T-sufficient for Ƃ.

(PSA4) For all I ) and A Ƃ, there exists an -almost-invariantI statistic

, ( | ) Ƃ P A I T4 T I ˆ p  .

The next theorem states the relationship between them.

Theorem 7. If  is partially -sufficientT and G leaves invariant every family (I, then:

(i) (PSA1) (PSA2) + (PSA3).

(ii) (PSA2) (PSA4).

œ Ÿ

Proof. (i) Given I ) and AƂ, let be pIA an almost-invariant statistic in ˆT4PT I, ( | )A  . (PSA2) follows immediately from this. Moreover,

, ( | ),

A P A A

I

T4 T I

ˆ ˆ

p   and (PSA3) also holds. For the converse,

choose I ) and AƂ. By (PSA3), there exists a statistic

, ( | ).

A P A A

I

T4 T I

ˆ ˆ

p   From (PSA2), we have pIAˆT4PT I, ( | ).A  This

(9)

(ii) Let I ) and AƂ.  being specific -sufficient,T there exists , ( | ). A P A I T4 T I ˆ

p  Given gG and T T let , f$T I, PT I, ( |A ˆƂ) and

( , )

, ( | ).

Ƃ T I PT I A ˆ Ƃ

fg g   As in (2), it can be shown that

, ({ A A}) 0,

PT I pI Dg zpI and this shows that A

I

p is I-almost invariant, which gives the proof.

Remark 8. As the previous theorem is the almost-invariant analogue of Theorem 3, we can derive from it an analogue to Remark 4; namely, if G leaves invariant every family (I and  is a partially -sufficient,T and essentially stable V -field, then the four propositions (PSA1), (PSA2), (PSA3) and (PSA4) hold.

Departmento de Matemáticas JESÚS MONTANERO

Universidad de Extremadura AGUSTÍN G. NOGALES

JOSÉ A. OYOLA PALOMA PÉREZ

REFERENCES

BANDORFF-NIELSEN, O.(1973), On M-ancillarity, “Biometrika”, 72 (3), pp. 447-455.

BASU, D.(1978), On the elimination of nuisance parameters, “Journal of the American Statistical Asociation”, 72 (358), pp. 355-366.

BERK, R.H.(1972), A note on Sufficiency and Invariance, “Annals of Mathematical Statistics”, 39, pp. 647-650.

BERK, R.H., A.G. NOGALES, J.A. OYOLA(1996), Some counterexamples concerning sufficiency and invari-ance, “Annals of Statistics”, 24 (2), pp. 902-905.

BHAPKAR, V.P.(1991), Loss of information in the presence of nuisance parameters and partial sufficiency, “Journal of Statistical Planning and Inference”, 28, pp. 185-203.

DAWID, A.P.(1979), Conditional independence in statistical theory, “The Journal of the Royal Sta-tistical Society”, Series B 41, pp. 1-31.

FLORENS, J.P., M., MOUCHART, J.M., ROLIN(1990), Elements of Bayesian Statistics, Marcel Dekker Inc., New York.

FRASER, D.A.S.(1956), Sufficient statistics with nuisances parameters, “Annals of Mathematical Sta-tistics”, 27, pp. 838-842.

GHOSH, J.K.(1988), Statistical Information and Likelihood'. A collection of critical essays by Dr. D. Basu, “Lecture Notes in Statistics”, 45, Springer Verlag, New York.

GODAMBE, V. (1980), On sufficiency and ancillarity in the presence of a nuisance parameters, “Bio-metrika”, 67 (1), pp. 155-162.

HALL, W.J., R.A. WIJSMAN and J.K. GHOSH (1965), The relationship between sufficiency and invariance with applications in sequential analysis, “Annals of Mathematical Statistics”, 36, 575-614. KABAILA, P.(1998), A note on sufficiency and information loss, “Statistics & Probability Letters”,

37, pp. 111-114.

KUNG-YEE LIANG (1983), On information and ancillarity in the presence of a nuisances parameter, “Biometrika”, 70 (3), pp. 607-612.

MONTANERO, J., NOGALES, A.G., OYOLA, J.A., P. PÉREZ(2003), On partial sufficiency and invariance, “Statistica Sinica”, 13 (4), pp. 1269-1280.

(10)

NOGALES, A.G., J.A. OYOLA(1996), Some remarks on sufficiency, invariance and conditional independ-ence,“The Ann. Statist.” 24 (2), pp. 906-909.

RÉMON, M. (1984), On a concept of partial sufficiency: L-sufficiency, “International Statistical Re-view”, 52 (2), pp. 127-135.

ZHU, Y., REID, N.(1994), Information, ancillarity, and sufficiency in the presence of nuisance parameters, “Canadian Journal of Statistics”, 22 (1), pp. 111-123.

RIASSUNTO

Sulla relazione fra parziale sufficienza, invarianza e independenza condizionale

Molte proposizioni che appaiono in modo naturale nella letteratura sulla sufficienza e invarianza, includendo la dipendenza condizionale delle sigma-algebra invarianti e suffi-cienti data la loro intersezione, sono adattate per la sufficienza parziale (nel senso di Fra-ser) ed é studiata la relazione tra esse.

SUMMARY

On the relationship between partial sufficiency, invariance and conditional independence

Several propositions that appear in a natural way in the literature on sufficiency and invariance, including the conditional independence of the invariant and the sufficient

V-fields given its intersection, are adapted for partial sufficiency (in the sense of Fraser), and the relationship between them is studied.

Riferimenti

Documenti correlati

Chronic osmotic diarrhoea derives from enteroendocrine cell dysgenesis, due to defects in the stem cell differentiation process Stem cell..

When very compliant elements (pads) are included in the mechanical structure, the overall compliance is still obtained by simply summing up the contribution of each element

these findings indicate that the ApC/C Cdh1 ubiquitin ligase targets e2F3 for proteasome-dependent degradation during cell cycle exit and neuronal differentiation.. APC/C Cdh1

Keywords: silymarin; drug delivery; polymeric nanomicelles; mixed nanomicelles; antioxidant activity; PAMPA; Caco-2 cell line; oral

In order to evaluate the feasibility and potential benefit of such a study design, we developed a method that automatically retrieves relevant data from PubMed abstracts in order

Since endothelial metabolism has proved essential in the regulation of the angiogenic switch, many recent patents focus on agents able to inhibit specific

Data are% variation with respect to mean value of MSCs kept under standard conditions for 3-days (MSC-3-N). In normoxia only free-VEGF-A pre-treatment induced a significant increase

Per maggiore chiarezza, riassumo i termini che ricavo dal ragiona- mento di Imperatori: 1) la Sindone esiste, e può essere studiata scien- tificamente; 2) secondo i sindonologi