Let Polymorphism in Mini ML

9/2/21

Let polymorphism

We would like to have polymorphism in certain circumstances. For example, we would like the same identity function λx.x to be usable on both Booleans and Integers, without having to have two separate identity functions, one of type B->B and the other of type I->I .

The usual way to do this, in this setting, is via a 'let' construction. Most functional programming languages have an expression form similar to

let <variable> = <expression1> in <expression2>

For example, the following is a well-formed expression in the computer language Haskell

let identity = \x->x in (identity 7, identity False)     (and a Haskell interpreter will return (7,False) as the value for this.)

What we are seeing here is a local definition of 'identity'. The expression \x->x is Haskell's version of λx.x. Haskell writes ordered pairs as ( , ). The expression being evaluated is an ordered pair whose first component is identity applied to 7 and whose second component is identity applied to False. For Haskell this is well-typed and has the value (7,False). This means that the identity function that has been defined is polymorphic: it has been applied both to an Integer and to a Boolean without any problem or issue.

Haskell, of course, has a much more sophisticated type system than mini-ML. But a local-definition let construction is a way of bringing polymorphism into mini-ML.

In a general hand-waving way, local definitions are not hard to do in lambda calculus. Say we wanted to define variableX as being expressionY (i.e. variableX = expressionY) and have expressionY substituted for the occurrences of variableX throughout expressionZ. All we need to do is wrap a lambda binding variableX around expressionZ and apply that to expressionY, i.e.

(λvariableX.expressionZ expressionY)   ... on reduction this substitutes expressionY for the occurrences of variableX in expressionZ

However, there can be details and complications. For example, can variableX appear within expressionY (as might be the case with recursion)? What about capture and other problems with substitution (where, perhaps, several lets are nested one within another)? Also, will this be able to provide the polymorphism we are looking for?
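The last of those questions can be previewed in Haskell itself. As a quick experiment (assuming a standard GHC with no extensions turned on), the let-bound identity is polymorphic, but the same function passed as a lambda-bound argument is not, and the second expression below is rejected by the type checker:

   let identity = \x -> x in (identity 7, identity False)     -- accepted, evaluates to (7,False)
   (\identity -> (identity 7, identity False)) (\x -> x)      -- rejected: identity is lambda-bound, hence monomorphic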

For this extension to lambda calculus, the construction

let <variable> = <expression1> in <expression2>

will be accepted as creating well-formed formulas in the extended lambda calculus. [This does introduce questions of how that construction is to be reduced to normal form. However, at present the interest is with types, so we will set that aside.] Also, for simplicity, we'll not let the <variable> appear within <expression1>.
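To make the extended syntax concrete, here is one way the well-formed formulas might be represented in Haskell (a sketch only; the type and constructor names are our own choices, not part of mini-ML):

   -- Abstract syntax for the lambda calculus extended with let
   data Term = Var String            -- a variable
             | Lam String Term       -- λ<variable>. <body>
             | App Term Term         -- application of one term to another
             | Let String Term Term  -- let <variable> = <expression1> in <expression2>
             deriving Show

Constants such as True and 2 are left out of the sketch; the point is only the shape of the let form.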

As mentioned, the let addition is similar to a plain lambda calculus expression with form

(λvarX. e2  e1)

How might that entire expression form be typed? What might be a typing rule for it? One obvious approach is to work out the type of e1 and then calculate the type of e2 using the type of e1 as the type for all the occurrences of varX in e2. That would give a typing rule like Possible Typing Rule 0:

Γ ⊢ e1:τ    and    Γ ∪ {x:τ} ⊢ e2:σ
------------------------------------
     Γ ⊢ (let x = e1 in e2) : σ

Unfortunately this will not produce the effect we are looking for. The reason is that every occurrence of x in e2 will have the same type τ, and there are expressions where different occurrences of x should have different types.

The Haskell example above uses an ordered pair, which is not available in our lambda calculus. But there are suitable expressions to illustrate the problem.

let identity = λx. x in
   let ignore = (identity True) in
      (identity 2)

We would expect this expression to have a value of (or to reduce to) 2, which is an Integer, i.e. of type I. Using Possible Typing Rule 0:

Γ, the final environment, is empty; x is identity; e1 is λx. x; and e2 is (let ignore = (identity True) in (identity 2))

⊢ (λx. x):τ    and    {identity:τ} ⊢ (let ignore = (identity True) in (identity 2)) : σ
-----------------------------------------------------------------------------------------
     ⊢ (let identity = λx. x in (let ignore = identity True in identity 2)) : σ

Then, on the right-hand branch, for {identity:τ} ⊢ (let ignore = identity True in identity 2) : σ, Γ is {identity:τ}; x is ignore; e1 is (identity True); and e2 is (identity 2). The type playing the role of τ in this application of the rule is written τ' below, since it need not be the same as identity's τ.

{identity:τ} ⊢ (identity True):τ'    and    {identity:τ} ∪ {ignore:τ'} ⊢ (identity 2) : σ
------------------------------------------------------------------------------------------
     {identity:τ} ⊢ (let ignore = identity True in identity 2) : σ

But we are in trouble. (identity True) and (identity 2) are applications, and the Application rule cannot type identity in both of them: the first requires τ to be a function type accepting a Boolean (something of the form B->...), the second requires τ to be a function type accepting an Integer (something of the form I->...), and no single monomorphic τ can be both.

Let polymorphism offers a definite way out here.

An 'easy' explanation of Let polymorphism.

1. All lambda expressions are going to be monomorphic. That means that were we to annotate the lambda variable to get a view of it, all lambda expressions would look similar to this.

λx:<type>. <scope>

The important point here is that <type> is a plain type and not a universally quantified type-scheme. So the <type> is monomorphic. It may have type variables in it, but if those are valued (perhaps by unification) they will be valued only once (and uniquely).

2. The variable in a let expression is polymorphic. That means that were we to annotate the let variable to get a view of it, all let expressions would look similar to this.

let x:<type-scheme> = e1 in e2

The type scheme here likely will be a full-blown type scheme with a universal quantifier and a list of quantified type variables. Which type variables, you may ask? Well, there is a little excursion needed here into free type variables (ftvs).

The type for e1, were e1 to be typed on its own, might have free type variables in it; denote these by ftv(e1). And, in the general case for finding types, there will be an environment Γ. Now, Γ is a collection of pairs of term variables and type schemes. Those type schemes may have free type variables in them, and all of those in Γ can be collected up; these may be denoted by ftv(Γ). Our interest is in the free type variables of e1 which are not free type variables of Γ, i.e. ftv(e1)-ftv(Γ). Those are the ones that should be used as the quantified variables in the corresponding type scheme. So, for example, if the type of e1 is typeVar1->typeVar2 and neither of these is free in the environment, then the scheme is ∀typeVar1, typeVar2 (typeVar1->typeVar2). Were we to annotate the opening segment of the let, it would look like this, schematically

let x:∀typeVar1, typeVar2 (typeVar1->typeVar2) = e1 in e2

3. The next step is to use instantiations of the type scheme, instantiations that have all new type variables. The scheme ∀typeVar1, typeVar2 (typeVar1->typeVar2), for example, has instantiations (typeVar100->typeVar101), (typeVar102->typeVar103), and many more.
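Here is how instantiation with all-new type variables might be sketched in Haskell (the data types and function names are illustrative assumptions, not a fixed mini-ML API): a counter supplies fresh names, and instantiating a scheme replaces each quantified variable with one of them.

   import qualified Data.Map as Map
   import Control.Monad.State (State, get, put, evalState)

   data Type   = TVar String | TInt | TBool | TArrow Type Type  deriving Show
   data Scheme = Forall [String] Type                           deriving Show

   -- Produce a brand new type variable from an integer supply
   fresh :: State Int Type
   fresh = do n <- get; put (n + 1); return (TVar ("t" ++ show n))

   -- Instantiate a scheme: every quantified variable becomes a fresh, unused variable
   instantiate :: Scheme -> State Int Type
   instantiate (Forall vars ty) = do
     news <- mapM (const fresh) vars
     let sub = Map.fromList (zip vars news)
         go t@(TVar a)   = Map.findWithDefault t a sub
         go (TArrow a b) = TArrow (go a) (go b)
         go t            = t
     return (go ty)

Running instantiate twice (with evalState and a running counter) on Forall ["typeVar1","typeVar2"] (TArrow (TVar "typeVar1") (TVar "typeVar2")) produces two arrow types with completely disjoint type variables, which is the behaviour described above.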

For each occurrence of x in e2, when considering its type, a new instantiation will be used. This means, to hark back to the earlier example, that when considering the types of the sub-expressions (identity True) and (identity 2), they will involve entirely different type variables from one another (and this is what allows their type constraints to be solved simultaneously).
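And indeed the earlier nested-let expression behaves this way in Haskell (a quick check, assuming a standard GHC): the two uses of identity get separate instantiations, so the whole thing is well-typed and evaluates to 2.

   let identity = \x -> x in
      let ignore = identity True in
         identity 2              -- accepted, evaluates to 2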

4. This can all be rolled into a single rule

Γ ⊢ e1:τ    and    Γ ∪ {x:∀A(τ)} ⊢ e2:σ
----------------------------------------
     Γ ⊢ (let x = e1 in e2) : σ

where  i) the term variable x must not already have an entry in Γ, and
       ii) A is the list of free type variables ftv(e1)-ftv(Γ)
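Finally, the ∀A(τ) step, i.e. the generalization in condition ii), might be sketched in Haskell as follows (reusing the Type and Scheme declarations from the earlier sketch; the names are again illustrative assumptions): collect the free type variables of the type, remove those free in the environment, and quantify over whatever remains.

   import qualified Data.Map as Map
   import qualified Data.Set as Set

   type TypeEnv = Map.Map String Scheme      -- term variables paired with type schemes

   -- Free type variables of a plain type
   ftvType :: Type -> Set.Set String
   ftvType (TVar a)     = Set.singleton a
   ftvType (TArrow a b) = ftvType a `Set.union` ftvType b
   ftvType _            = Set.empty

   -- Free type variables of a scheme: those of its body minus the quantified ones
   ftvScheme :: Scheme -> Set.Set String
   ftvScheme (Forall vars ty) = ftvType ty `Set.difference` Set.fromList vars

   -- Free type variables of an environment: the union over all of its schemes
   ftvEnv :: TypeEnv -> Set.Set String
   ftvEnv env = Set.unions (map ftvScheme (Map.elems env))

   -- Generalize: quantify over the variables free in the type but not free in the environment
   generalize :: TypeEnv -> Type -> Scheme
   generalize env ty = Forall (Set.toList (ftvType ty `Set.difference` ftvEnv env)) ty

With an empty environment, generalize Map.empty (TArrow (TVar "typeVar1") (TVar "typeVar2")) gives Forall ["typeVar1","typeVar2"] (TArrow (TVar "typeVar1") (TVar "typeVar2")), matching the schematic annotation in point 2.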