The ICFP paper submission deadline kept me busy for much of February,
but now I'm back to thinking about the simple denotational semantics
of the lambda calculus. In previous posts I showed that this semantics
is equivalent to standard operational semantics when considering the
behavior of whole programs. However, sometimes it is necessary to
reason about the behavior of program fragments and we would like to
use the denotational semantics for this as well. For example, an
optimizing compiler might want to exchange one expression for another
less-costly expression that does the same job.
The formal notion of two such ``exchangeable'' expressions is
contextual equivalence (Morris 1968). It says that two
expression are equivalent if plugging them into an arbitrary context
produces programs that behave the same.
Definition (Contextual Equivalence)
Two expressions e1 and e2 are contextually equivalent,
written e1≃e2, iff for any closing context C,
eval(C[e1])=eval(C[e2]).
We would like to know that when two expressions are denotationally
equal, then they are also contextually equivalent.
Theorem (Sound wrt. Contextual Equivalence)
If E[e1]Γ=E[e2]Γ for any Γ,
then e1≃e2.
The rest of the blog post gives an overview of the proof (except
for the discussion of related work at the very end). The details
of the proof are in the Isabelle mechanization. But first we need to
define the terms used in the above statements.
Definitions
Recall that our denotational semantics is defined in terms of an
intersection type system. The meaning of an expression is the set of
all types assigned to it by the type system.
E[e]Γ≡{A∣Γ⊢2e:A}
Recall that the types include singletons, functions, intersections,
and a top type:
A,B,C::=n∣A→B∣A∧B∣⊤
I prefer to think of these types as values, where the function,
intersection, and top types are used to represent finite tables that
record the input-output values of a function.
The intersection type system that we use here differs from the one in
the previous post in that we remove the subsumption rule and sprinkle
uses of subtyping elsewhere in a standard fashion (Pierce 2002).
Γ⊢2n:nΓ⊢2λx.e:⊤Γ⊢2λx.e:AΓ⊢2λx.e:BΓ⊢2λx.e:A∧Bx:A∈ΓΓ⊢2x:AΓ,x:A⊢2BΓ⊢2λx.e:A→BΓ⊢2e1:CC<:A→BΓ⊢2e2:AΓ⊢2e1e2:BΓ⊢2e1:AA<:n1Γ⊢2e2:BB<:n2[|op|](n1,n2)=n3Γ⊢2op(e1,e2):n3Γ⊢2e1:AA<:0Γ⊢2e3:BΓ⊢2ife1thene2elsee3:BΓ⊢2e1:AA<:nn≠0Γ⊢2e2:BΓ⊢2ife1thene2elsee3:B
Regarding subtyping, we make a minor change and leave out the
rule
(C→A)∧(C→B)<:C→(A∧B)
because I had a hunch that it wasn't needed to prove Completeness with
respect to the small step semantics, and indeed it was not.
So the subtyping relation is defined as follows.
n<:n⊤<:⊤A→B<:⊤A′<:AB<:B′A→B<:A′→B′C<:AC<:BC<:A∧BA∧B<:AA∧B<:B
This type system is equivalent to the one with subsumption
in the following sense.
Theorem (Equivalent Type Systems)
- If Γ⊢e:A, then Γ⊢2e:A′
and A′<:A for some A′.
- If Γ⊢2e:A, then Γ⊢e:A.
Proof
The proofs of the two parts are straightforward inductions on the
derivations of the typing judgments.
QED
This type system satisfies the usual progress and preservation
properties.
Theorem (Preservation)
If Γ⊢2e:A and e⟶e′,
then Γ⊢ee′:A′ and A′<:A for some A′.
Proof
The proof of preservation is by induction on the derivation of the
reduction. The case for β reduction relies on lemmas about
substitution and type environments.
QED
Theorem (Progress)
If ∅⊢2e:A and FV(e)=∅,
then e is a value or e⟶e′
for some e′.
Proof
The proof of progress is by induction on the typing derivation.
As usual it relies on a canonical forms lemma.
QED
Lemma (Canonical forms)
Suppose ∅⊢2v:A.
- If A<:n, then v=n.
- If A<:B→C, then v=λx.e for some x,e.
Next we turn to the definition of eval. As usual, we
shall define the behavior of a program in terms of the operational
(small-step) semantics and an observe function.
eval(e)={observe(v)if e⟶∗vbadotherwiseobserve(n)=nobserve(λx.e)=fun
In the above we categorize programs as bad if they do not
produce a value. Thus, we are glossing over the distinction between
programs that diverge and programs that go wrong (e.g., segmentation
fault). We do this because our denotational semantics does not make
such a distinction. However, I plan to circle back to this issue in
the future and develop a version of the semantics that does.
Soundness wrt. Contextual Equivalence
We assume that E[e1]Γ=E[e2]Γ for any Γ
and need to show that e1≃e2. That is, we need to show
that eval(C[e1])=eval(C[e2]) for any
closing context C. We shall prove Congruence which lets us lift the denotational
equality of e1 and e2 through any context, so we have
E[C[e1]]∅=E[C[e2]]∅(1)
Now let us consider the cases for eval(C[e1]).
- Case eval(C[e1])=observe(v) and
C[e1]⟶∗v:
By Completeness of the
intersection type system we
have ∅⊢2C[e1]:A and ∅⊢2v:A′ for some A,A′ such that A′<:A. Then with
(1) we have
∅⊢2C[e2]:A(2)
The type system is sound wrt. the big-step semantics, so ∅⊢C[e2]⇓v′ for some v′. Therefore C[e2]⟶∗v″ because the big-step semantics is sound
wrt. the small-step semantics. It remains to show that
observe(v″)=observe(v). From
(2) we have ∅⊢2v″:A″ for some
A″ where A″<:A, by Preservation. Noting that we already have
∅⊢2v:A′, ∅⊢2v″:A″,
A′<:A, and A″<:A, we conclude that
observe(v)=observe(v″) by
the Lemma Observing values of subtypes.
- Case eval(C[e1])=bad:
So C[e1] either diverges or gets stuck.
In either case, we have E[C[e1]]∅=∅
(Lemmas Diverging programs have no meaning and
Programs that get stuck have no meaning).
So by (1) we have
E[C[e2]]∅=∅.
We conclude that C[e2] either diverges or gets stuck
by Lemma (Programs with no meaning diverge or get stuck). Thus,
eval(C[e2])=bad.
QED
Lemma (Congruence)
Let C be an arbitrary context.
If E[e1]Γ′=E[e2]Γ′ for any Γ′,
then E[C[e1]]Γ=E[C[e2]]Γ.
Proof
We prove congruence by structural induction on the context
C, using the induction hypothesis and the appropriate
Compatibility lemma for each kind of expression.
QED
Most of the Compatibility lemmas are straightforward, though the one
for abstraction is worth discussing.
Lemma (Compatibility for abstraction)
If E[e1]Γ′=E[e2]Γ′ for any Γ′,
then E[λx.e1]Γ=E[λx.e2]Γ.
Proof
To prove compatibility for abstractions, we first prove that
If Γ′⊢2e1:B implies Γ′⊢2e2:B
for any Γ′,B,
then Γ⊢2λx.e1:C
implies Γ⊢2λx.e2:C.
This is a straightforward induction on the type
C.
Compatibility follows by two uses this fact.
QED
Theorem (Completeness wrt. small-step semantics)
If e⟶∗v then
∅⊢2e:A and ∅⊢2v:A′
for some A,A′ such that A′<:A.
Proof
We have ∅⊢e:B and ∅⊢v:B by Completeness of the type system with subsumption.
Therefore ∅⊢2e:A and A<:B
by Theorem Equivalent Type Systems.
By preservation
we conclude that ∅⊢2v:A′ and A′<:A.
QED
In a previous blog post, we proved soundness with respect to big-step
semantics for a slightly different denotational semantics. So we
update that proof for the denotational semantics defined above. We
shall make use of the following logical relation G in
this proof.
G[n]={n}G[A→B]={⟨λx.e,ρ⟩∣∀v∈G[A].ρ(x:=v)⊢e⇓v′ and v′∈G[B]}G[A∧B]=G[A]∩G[B]G[⊤]={v∣v∈Values}G[∅]={∅}G[Γ,x:A]={ρ(x:=v)∣v∈G[A] and ρ∈G[Γ]}
We shall need two lemmas about this logical relation.
Lemma (Lookup in G)
If x:A∈Γ and ρ∈G[Γ],
then ρ(x)=v and v∈G[A].
Lemma (G preserves subtyping )
If A<:B and v∈G[A], then v∈G[B].
Theorem (Soundness wrt. big-step semantics)
If Γ⊢2e:A and ρ∈G[Γ],
then ρ⊢e⇓v and v∈G[A].
Proof
The proof is by induction on the typing derivation. The case for
variables uses the Lookup Lemma and all of the elimination
forms use the above Subtyping Lemma (because their typing rules use
subtyping).
QED
Lemma (Observing values of subtypes)
If ∅⊢2v:A, ∅⊢2v′:B, A<:C, and B<:C, then observe(v)=observe(v′).
Proof
The proof is by cases of v and v′. We use Lemmas
about the symmetry of subtyping for singletons,
an inversion lemma for functions, and
that subtyping preserves function types.
QED
Lemma (Subtyping symmetry for singletons)
If n<:A, then A<:n.
For the next lemma we need to characterize the types for functions.
fun(A→B)fun(A)fun(B)fun(A∧B)fun(⊤)
Lemma (Inversion on Functions)
If Γ⊢2λx.e:A, then fun(A).
Lemma (Subtyping preserves functions)
If A<:B and fun(A),
then fun(B).
Lemma (Diverging Programs have no meaning)
If e diverges, then E[e]∅=∅.
Proof
Towards a contradiction, suppose E[e]∅≠∅.
Then we have ∅⊢2e:A for some A. Then by
soundness wrt. big-step semantics, we have ∅⊢e⇓v and so also e⟶∗v′. But this contradicts the
premise that e diverges.
QED
Lemma (Programs that get stuck have no meaning)
Suppose that e⟶∗e′ and e′ is stuck
(and not a value).
Then E[e]∅=∅.
Proof
Towards a contradiction, suppose E[e]∅≠∅.
Then we have ∅⊢2e:A for some A. Therefore
∅⊢2e′:A′ for some A′<:A. By Progress,
either e′ is a value or it can
take a step. But that contradicts the premise.
QED
Lemma (Programs with no meaning diverge or gets stuck)
If E[e]∅=∅, then e diverges or reduces to
a stuck non-value.
Proof
Towards a contradiction, suppose that e does not diverge
and does not reduce to a stuck non-value.
So e⟶∗v for some v.
But then by Completeness wrt. the small-step semantics, we have
∅⊢2e:A for some A, which
contradicts the premise E[e]∅=∅.
QED
Related Work
The proof method used here, of proving Compatibility and Congruence
lemmas to show soundness wrt. contextual equivalence, is adapted from
Gunter's book (1992), where he proves that the standard model for PCF
(CPO's and continuous functions) is sound. This approach is also
commonly used to show that logical relations are sound wrt. contextual
equivalence (Pitts 2005).
The problem of full abstraction is to show that denotational
equivalence is both sound (aka. correct):
E[e1]=E[e2]impliese1≃e2
and complete:
e1≃e2impliesE[e1]=E[e2]
with respect to contextual equivalence (Milner 1975). Here we showed
that the simple denotational semantics is sound. I do not know whether
it is complete wrt. contextual equivalence.
There are famous examples of denotational semantics that are not
complete. For example, the standard model for PCF is not
complete. There are two expressions in PCF that are contextually
equivalent but not denotationally equivalent (Plotkin 1977). The idea
behind the counter-example is that parallel-or cannot be defined in
PCF, but it can be expressed in the standard model. The two
expressions are higher-order functions constructed to behave
differently only when applied to parallel-or.
Rocca and Paolini (2004) define a filter model V for the
call-by-value lambda calculus, similar to our simple denotational
semantics, and prove that it is sound wrt. contextual equivalence
(Theorem 12.1.18). Their type system and subtyping relation differs
from ours in several ways. Their ∧intro rule is not
restricted to λ, they include subsumption, their ⊤
type is a super-type of all types (not just function types), they
include the distributivity rule discussed at the beginning of this
post, and they include a couple other rules (labeled (g) and
(v) in Fig. 12.1). I'm not sure whether any of these differences
really matter; the two systems might be equivalent. Their proof is
quite different from ours and more involved; it is based on the notion
of approximants. They also show that V is incomplete
wrt. contextual equivalence, but go on to create another model based
on V that is. The fact that V is
incomplete leads me suspect that E is also incomplete.
This is certainly worth looking into.
Abramsky (1990) introduced a domain logic whose formulas are
intersetion types:
ϕ::=⊤∣ϕ∧ϕ∣ϕ→ϕ
and whose proof theory is an intersection type system designed to
capture the semantics of the lazy lambda calculus. Abramsky proves
that it is sound with respect to contextual equivalence. As far as I
can tell, the proof is different than the approach used here, as
it shows that the domain logic is sound with respect to a denotational
semantics that solves the domain equation D=(D→D)⊥, then
shows that this denotational semantics is sound wrt. contextual
equivalence. (See also Alan Jeffrey (1994).)