The intuition for an efficient coercion composition function came from thinking about types, not coercions. We'll start with the UD blame tracking strategy and then later consider D. Also, for now we'll stick with lazy cast checking.
Consider the following sequence of casts:
e:T1⇒ℓ1T2⇒ℓ2⋯⇒ℓn−1Tn
We'd like some way to summarize the sequence of types without loosing
any important information. That is, we'd like to come up with
something that can catch the same cast errors as the entire sequence,
blaming the appropriate label, but using less space. Imagine the n
types as a line of differently colored trees on the side of a road.
If you're next to the road, staring down the line of trees, you see
what looks like one tree with branches of many colors. Some of the
branches from further-away trees are hidden from view by closer trees,
but some are visible. Now, suppose we wanted to maintain the same view
from your standpoint, but save on water. We could replace the line of
trees with a single multi-colored tree that includes all the branches
visible to you. The figure below depicts
three differently-colored trees getting merged into a single
multi-color tree. The nodes without color should be considered
transparent.
Mapping this idea back to types, the colors are blame labels and transparent nodes are the type ⋆. Because we need to color individual branches, we need blame labels on every internal node of a type. In particular, we need the notion of a labeled type: optional labelsp,q::=ϵ∣ℓlabeled typesP,Q::=Bp∣P→pQ∣⋆∣(Ip;⊥ℓ) (These labeled types are for the UD strategy. We'll discuss the labeled types for D later. Also, the labeled type Ip;⊥ℓ deserves some explanation, which we'll get to soon.) The label function returns the top-most label of a labeled type: label(Bp)=plabel(P→pQ)=plabel(⋆)=ϵlabel(Ip;⊥ℓ)=p
We'll define a function for composing two labeled types P and Q to produce a new labeled type P′, using semicolon as the syntax for this composition function: P;Q=P′ We replace each cast with a threesome, that is, a cast annotated with a labeled type. The labeled type is computed by a simple function L that we define below. e:T1⇒ℓT2 becomes e:T1P⟹T2 where P=L(T1⇒ℓT2) L(B⇒ℓB)=BL(⋆⇒ℓ⋆)=⋆L(B⇒ℓ⋆)=BL(⋆⇒ℓB)=BℓL(T1⇒ℓT2)=I;⊥ℓwhere I∼T1L(T1→T2⇒ℓT3→T4)=L(T3⇒ℓT1)→L(T2⇒ℓT4)L(T1→T2⇒ℓ⋆)=L(⋆⇒ℓT1)→L(T2⇒ℓ⋆)L(⋆⇒ℓT3→T4)=L(T3⇒ℓ⋆)→ℓL(⋆⇒ℓT4) A sequence of threesomes is compressed to a single threesome using the composition function: e:T1P1⟹T2P2⟹⋯Pn−1⟹Tnbecomese:T1P⟹Tnwhere P=P1;P2;⋯;Pn−1
Before we go into the details of the composition function, it helps to see how (well-formed) threesomes correspond to coercions in normal form. With this correspondence in place, we can use the coercion reduction rules to help guide the definition of threesome composition. The function TC defined below maps threesomes to coercions in normal form. This function is an isomorphism, so it's inverse maps normal coercions back to threesomes. TC(BB⟹B)=ιBTC(⋆⋆⟹⋆)=ι⋆TC(⋆Bℓ⟹B)=B?ℓTC(BB⟹⋆)=B!TC(⋆Bℓ⟹⋆)=B?ℓ;B!TC(T1I;⊥ℓ⟹T2)=FailℓTC(T1Iℓ1;⊥ℓ2⟹T2)=I?ℓ1;Failℓ2TC(T1→T2P1→P2⟹T3→T4)=TC(T3P1⟹T1)→TC(T2P2⟹T4)TC(⋆P1→ℓP2⟹T3→T4)=(⋆→⋆)?ℓ;TC(T3P1⟹⋆)→TC(⋆P2⟹T4)TC(T1→T2P1→P2⟹⋆)=TC(⋆P1⟹T1)→TC(T2P2⟹⋆);(⋆→⋆)!TC(⋆P1→ℓP2⟹⋆)=(⋆→⋆)?ℓ;TC(⋆P1⟹⋆)→TC(⋆P2⟹⋆);(⋆→⋆)!
We're ready to make precise how two labeled types can be composed to form a single labeled type. The two cases in which one of the labeled types is ⋆ are easy: just return the other type: ⋆;Q=QP;⋆=P Next, suppose we have Intℓ1 followed by Intℓ2. These should compose to Intℓ1 because if the first cast succeeds, so will the second, making the blame label ℓ2 redundant. In general, for labeled basic types we have the following rule. Bp;Bq=Bp Suppose instead that the basic types don't match. The right-hand side of the following rule is a bit tricky, so let's think about this in terms of coercions. Bp1;Bq2=Bp1;⊥qif B1≠B2 Suppose p=ℓ1,q=ℓ2 and these two labeled types come from the threesomes ⋆Bℓ11⟹⋆Bℓ22⟹B2 The corresponding coercion sequence is B1?ℓ1;B1!;B2?ℓ2 which reduces to B1?ℓ1;Failℓ2 and that corresponds to the labeled type for errors, Bℓ11;⊥ℓ2. We also need to consider a mismatch between basic types and function types: Bp;(P→qQ)=Bq;⊥q(P→pQ);Bq=(⋆→⋆)p;⊥q The rule for labeled function types takes the label p for the label in the result and it recursively composes the domain and codomain types. The contra-variance in the parameter type is important for getting the right blame and coincides with the contra-variance in the reduction rule for composing function coercions. (P1→pP2);(Q1→qQ2)=(Q1;P1)→p(P2;Q2) The following figure shows an example similar to the previous figure, but with function types instead of pair types. The analogy with real trees and line-of-sight breaks down because you have to flip to looking at the trees from back-to-front instead of front-to-back for negative positions within the type.
Lastly we need several rules to handle when the error type is on the left or right. (Ip;⊥ℓ);Q=(Ip;⊥ℓ)P;(Iq;⊥ℓ)=Ip;⊥ℓif I∼P and label(P)=pP;(Iq;⊥ℓ)=Ip;⊥qif I≁P and label(P)=p
What's extra cool about labeled types and their composition function is that each rule covers many different rules if you were to formulate them in terms of coercions. For example, the single rule Bp;Bq=Bp covers four situations when viewed as threesomes or coercions: (BB⟹B;BB⟹B)=BB⟹BιB;ιB⟶ιB(⋆Bℓ⟹B;BB⟹B)=⋆Bℓ⟹BB?ℓ;ιB⟶B?ℓ(BB⟹B;BB⟹⋆)=BB⟹⋆ιB;B!⟶B!(⋆Bℓ⟹B;BB⟹⋆)=⋆Bℓ⟹⋆B?ℓ;B! is already in normal form
We define a threesomes as a source type, middle labeled typed, and a target type. threesomesτ::=TP⟹T We define the sequencing of threesomes as follows (T1P⟹T2);(T2Q⟹T3)=T1P;Q⟹T3 Similarly, we define the notation τ1→τ2 as (T3P⟹T1)→(T2Q⟹T4)=T1→T2P→Q⟹T3→T4
We can now go back to the ECD machine and replace the coercions with threesomes. Here's the syntax in A-normal form. expressionse::=k∣x∣λx:T.sdefinitionsd::=x=op(e)∣x:T=e(e)∣x=e:τstatementss::=d;s∣returne∣returne(e):τsimple values˜v::=k∣⟨λx:T.s,ρ⟩valuesv::=˜v∣˜v:τ The cast function, of course, needs to change. cast(k,τ)={kif τ=(BB⟹B)blameℓif τ=BBp;⊥ℓ⟹Tk:τotherwisecast(⟨λx:T.s,ρ⟩,τ)=⟨λx:T.s,ρ⟩:τcast(k:τ1,τ2)={kif (τ1;τ2)=BB⟹Bblameℓif (τ1;τ2)=BBp;⊥ℓ⟹Tk:(τ1;τ2)otherwisecast(⟨λx:T.s,ρ⟩:τ1,τ2)={⟨λx:T.s,ρ⟩:(τ1;τ2)if middle(τ1;τ2)≠(Ip;⊥ℓ)blameℓif middle(τ1;τ2)=(Ip;⊥ℓ) And last but not least, here's the transitions for the ECD machine, but with threesomes instead of coercions. (x:T1=e1(e2);s,ρ,κ)⟼(s′,ρ′[y:=v2],(T1T1⟹T1,(x,s,ρ)::κ))where V(e1,ρ)=⟨λy:T2.s′,ρ′⟩,V(e2,ρ)=v2(x:T1=e1(e2);s,ρ,κ)⟼(s′,ρ′[y:=v′2],(τ2,(x,s,ρ)::κ))where V(e1,ρ)=⟨λy:T2.s′,ρ′⟩:τ1→τ2,V(e2,ρ)=v2, and cast(v2,τ1)=v′2(x=op(¯e);s,ρ,κ)⟼(s,ρ[x:=v],κ)where v=δ(op,V(e,ρ))(x=e:τ;s,ρ,κ)⟼(s,ρ[x:=v′],κ)where V(e,ρ)=v,cast(v,τ)=v′(returne,ρ,(τ,(x,s,ρ′)::κ))⟼(s,[x:=v′]ρ′,κ)where V(e,ρ)=v,cast(v,τ)=v′(returne1(e2):τ1,ρ,(τ2,σ))⟼(s,ρ′[y:=v2],((τ1;τ2),σ))where V(e1,ρ)=⟨λy:T.s,ρ′⟩,V(e2,ρ)=v2(returne1(e2):τ1,ρ,(τ2,σ))⟼(s,ρ′[y:=v′2],(τ5,σ))where V(e1,ρ)=⟨λy:T1.s,ρ′⟩:τ3→τ4,V(e2,ρ)=v2,cast(v2,τ3)=v′2, and(τ4;τ1;τ2)=τ5(x=e:τ;s,ρ,κ)⟼blameℓwhere V(e,ρ)=v,cast(v,τ)=blameℓ(returne,ρ,(τ,(x,s,ρ′)::κ))⟼blameℓwhere V(e,ρ)=v,cast(v,τ)=blameℓ
We now have an implementation of Lazy UD that is space efficient and relatively efficient in time as well. However, there is one nagging issue regarding the speed of statically-typed code. Notice how there are two transition rules for each kind of function call. The source of the problem is that there are two kinds of values that have function type, closures and closures wrapped in a threesome. In the next post I'll define a unified representation for closures and wrapped closures so that we don't need to dispatch at runtime.
No comments:
Post a Comment