Testing Finitary Probabilistic Processes - Semantics of Probabilistic Processes: An Operational Approach

Information Technology Reference

In-Depth Information

ʴ , r

max ( s )

ʔ |

⃒ ʴ ʔ }

sup

{

−ₒ

( ʔ 0 +

ʔ 0 , ʔ 0

ʔ )

ʔ 0

ʔ 1 , and ʔ 1 ⃒ ʴ ʔ

for some ʔ 0

sup

{

, ʔ 0 , ʔ 1 , ʔ }

−ₒ

ʔ 0 +

ʔ 0 , ʔ 0

ʔ |

ʔ 0

ʔ 1 , and ʔ 1 ⃒ ʴ ʔ

for some ʔ 0 , ʔ 0 , ʔ 1 , ʔ }

sup

{

−ₒ

ʔ 0 +

ʔ 0 and ʔ 0

{

ʔ |

ʔ 1 ⃒ ʴ ʔ }|

ʔ 0

sup

ʔ 1

for some ʔ 0

, ʔ 0 , ʔ 1 }

−ₒ

ʔ 0 +

ʔ 0

ʴ , max ( ʔ 1 )

ʔ 0

and ʔ 0

sup

{

· P

ʔ 1

for some ʔ 0 , ʔ 0 , ʔ 1 }

−ₒ

{

−

· P

ʴ , max ( ʔ 1 )

∈

ʔ 1 for some ʔ 1 }

sup

p )

r ( s )

pʴ

[0, 1] and s

[ s can be split into ps

−

p ) s only]

−ₒ

ʴ , max ( ʔ 1 )

sup

{

−

p )

r ( s )

pʴ

· P

∈

[0, 1] and s

ʔ 1

for some ʔ 1 }

−ₒ

sup

{

−

p )

r ( s )

pʴ

sup

ʴ , max ( ʔ 1 )

ʔ 1 }|

∈

[0, 1]

}

−ₒ

ʴ , r

max ( r ( s ), ʴ

sup

max ( ʔ 1 )

ʔ 1 }

)

ʴ , max ( ʔ )

· P

[as dp is max-seeking]

F ʴ , dp , r (

ʴ , max )( s )

Definition 6.17 Let ʔ be a subdistribution and d p a static derivative policy. We

define a collection of subdistributions ʔ k as follows.

ʔ 0 = ʔ

ʔ k + 1 = {

ʔ k ( s )

dp ( s )

∈

ʔ k

and dp ( s )

↓}

for all k

≥

0 .

Then ʔ k

is obtained from ʔ k by letting

⊧

⊨

if dp ( s )

↓

ʔ k ( s )

⊩

ʔ k ( s )

otherwise

⃒ ʴ , dp ʔ for the discounted weak derivation that

for all k

≥

0. Then we write ʔ

determines a unique subdistribution ʔ with ʔ = k = 0 ʴ k ʔ k .

In other words, if ʔ

⃒ ʴ , dp ʔ then ʔ comes from the discounted weak derivation

⃒ ʴ ʔ that is constructed by following the derivative policy d p when choosing

˄ transitions from each state. In the special case when the discount factor ʴ

1, we

see that

⃒ 1, dp becomes

⃒ dp as defined in page 176.

Semantics of Probabilistic Processes: An Operational Approach

Search WWH ::

Custom Search

Home