Reversibility and focalization

From LLWiki
(Difference between revisions)
Jump to: navigation, search
(Creation)
 
(Generalized connectives and rules)
Line 162: Line 162:
 
inductively as
 
inductively as
   
{|
+
<math> \mathcal{I}_{P\tens Q} := [ I+J \mid I\in\mathcal{I}_P, J\in\mathcal{I}_Q ] </math>,
|-
+
<math> \mathcal{I}_{P\plus Q} := \mathcal{I}_P + \mathcal{I}_Q </math>,
| <math>
+
<math> \mathcal{I}_\one := [[]] </math>,
\mathcal{I}_{P\tens Q} </math>
+
<math> \mathcal{I}_\zero := [] </math>,
| <math>:= [ I+J \mid I\in\mathcal{I}_P, J\in\mathcal{I}_Q ]
+
<math> \mathcal{I}_{X_i} := [[i]] </math>.
</math>
 
| <math> \mathcal{I}_{P\plus Q} </math>
 
| <math>:= \mathcal{I}_P + \mathcal{I}_Q
 
</math>
 
| <math> \mathcal{I}_\one </math>
 
| <math>:= [[]]
 
</math>
 
| <math> \mathcal{I}_\zero </math>
 
| <math>:= []
 
</math>
 
| <math> \mathcal{I}_{X_i} </math>
 
| <math>:= [[i]]
 
</math>
 
|}
 
   
 
The branching of a negative generalized connective is the branching of its
 
The branching of a negative generalized connective is the branching of its

Revision as of 22:27, 31 August 2012

Reversibility

Theorem

Negative connectives are reversible:

  • A sequent \vdash\Gamma,A\parr B is provable if and only if \vdash\Gamma,A,B is provable.
  • A sequent \vdash\Gamma,A\with B is provable if and only if \vdash\Gamma,A and \vdash\Gamma,B are provable.
  • A sequent \vdash\Gamma,\bot is provable if and only if \vdash\Gamma is provable.
  • A sequent \vdash\Gamma,\forall\xi A is provable if and only if \vdash\Gamma,A is provable, for some fresh variable ξ.

Proof.  We start with the case of the \parr connective. If \vdash\Gamma,A,B is provable, then by the introduction rule for \parr we know that \vdash\Gamma,A\parr B is provable. For the reverse implication we proceed by induction on a proof π of \vdash\Gamma,A\parr B.

  • If the last rule of π is the introduction of the \parr in A\parr B, then the premiss is exacty \vdash\Gamma,A,B so we can conclude.
  • The other case where the last rule introduces A\parr B is when π is an axiom rule, hence \Gamma=A\orth\tens B\orth. Then we can conclude with the proof
  • Otherwise A\parr B is in the context of the last rule. If the last rule is a tensor, then π has the shape
  • or the same with A\parr B in the conclusion of π2 instead. By induction hypothesis on π1 we get a proof π'1 of \vdash\Gamma_1,A,B,C, then we can conclude with the proof
  • The case of the cut rule has the same structure as the tensor rule.
  • In the case of the \with rule, we have A\with B in both premisses and we conclude similarly, using the induction hypothesis on both π1 and π2.
  • If A\parr B is in the context of a rules for \parr, \plus, \bot or quantifiers, or in the context of a dereliction, weakening or contraction, the situation is similar as for \tens except that we have only one premiss.
  • If A\parr B is in the context of \top rules, we can freely change the context of the rule to get the expected one.
  • The two remaining cases are if the last rule is the rule for 1 or a promotion. By the constraints these rules impose on the contexts, these cases cannot happen.

The \with connective is treated in the same way. In this cases where A\with B is in the context of a rule with two premisses, the premiss where this formula is not present will be duplicated, with one copy in the premiss for A and one in the premiss for B.

The \forall connective is also treated similarly. Its peculiarity is that introducing \forall\xi requires that ξ does not appear free in the context. For all rules with one premiss except the quantifier rules, the set of fresh variables in the same in the premiss and the conclusion, so everything works well. Other rules might change the set of free variables, but problems are avoided by choosing for ξ a variable that is fresh for the whole proof we are considering.

Remark that this result is proved using only commutation rules, except when the formula is introduced by an axiom rule. Furthermore, if axioms are applied only on atoms, this particular case disappears.

A consequence of this fact is that, when searching for a proof of some sequent \vdash\Gamma, one can always start by decomposing negative connectives in Γ without losing provability. Applying this result to successive connectives, we can get generalized formulations for more complex formulas. For instance:

  • \vdash\Gamma,(A\parr B)\parr(B\with C) is provable
  • iff \vdash\Gamma,A\parr B,B\with C is provable
  • iff \vdash\Gamma,A\parr B,B and \vdash\Gamma,A\parr B,C are provable
  • iff \vdash\Gamma,A,B,B and \vdash\Gamma,A,B,C are provable

So without loss of generality, we can assume that any proof of \vdash\Gamma,(A\parr B)\parr(B\with C) ends like


  \AxRule{ \vdash \Gamma, A, B, B }
  \UnaRule{ \vdash \Gamma, A\parr B, B }
  \AxRule{ \vdash \Gamma, A, B, C }
  \UnaRule{ \vdash \Gamma, A\parr B, C }
  \BinRule{ \vdash \Gamma, A\parr B, B\with C }
  \UnaRule{ \vdash \Gamma, (A\parr B)\parr(B\with C) }
  \DisplayProof

In order to define a general statement for compound formulas, as well as an analogous result for positive connectives, we need to give a proper status to clusters of connectives of the same polarity.

Generalized connectives and rules

Definition

A positive generalized connective is a parametrized formula P[X_1,\ldots,X_n] made from the variables Xi using the connectives \tens, \plus, \one, \zero.

A negative generalized connective is a parametrized formula N[X_1,\ldots,X_n] made from the variables Xi using the connectives \parr, \with, \bot, \top.

If C[X_1,\ldots,X_n] is a generalized connectives (of any polarity), the dual of C is the connective C * such that C^*[X_1\orth,\ldots,X_n\orth]=C[X_1,\ldots,X_n]\orth.

It is clear that dualization of generalized connectives is involutive and exchanges polarities. We do not include quantifiers in this definition, mainly for simplicity. Extending the notion to quantifiers would only require taking proper care of the scope of variables.

Sequent calculus provides introduction rules for each connective. Negative connectives have one rule, positive ones may have any number of rules, namely 2 for \plus and 0 for \zero. We can derive introduction rules for the generalized connectives by combining the different possible introduction rules for each of their components.

Considering the previous example N[X_1,X_2,X_3]=(X_1\parr X_2)\parr(X_2\with X_3), we can derive an introduction rule for N as


  \AxRule{ \vdash \Gamma, X_1, X_2, X_2 }
  \UnaRule{ \vdash \Gamma, X_1\parr X_2, X_2 }
  \AxRule{ \vdash \Gamma, X_1, X_2, X_3 }
  \UnaRule{ \vdash \Gamma, X_1\parr X_2, X_3 }
  \BinRule{ \vdash \Gamma, X_1\parr X_2, X_2\with X_3 }
  \UnaRule{ \vdash \Gamma, (X_1\parr X_2)\parr(X_2\with X_3) }
  \DisplayProof
\quad\text{or}\quad
  \AxRule{ \vdash \Gamma, X_1, X_2, X_2 }
  \AxRule{ \vdash \Gamma, X_1, X_2, X_3 }
  \BinRule{ \vdash \Gamma, X_1, X_2, X_2\with X_3 }
  \UnaRule{ \vdash \Gamma, X_1\parr X_2, X_2\with X_3 }
  \UnaRule{ \vdash \Gamma, (X_1\parr X_2)\parr(X_2\with X_3) }
  \DisplayProof

but these rules only differ by the commutation of independent rules. In particular, their premisses are the same. The dual of N is P[X_1,X_2,X_3]=(X_1\tens X_2)\tens(X_2\plus X_3), for which we have two possible derivations:


  \AxRule{ \vdash \Gamma_1, X_1 }
  \AxRule{ \vdash \Gamma_2, X_2 }
  \BinRule{ \vdash \Gamma_1, \Gamma_2, X_1\tens X_2 }
  \AxRule{ \vdash \Gamma_3, X_2 }
  \UnaRule{ \vdash \Gamma_3, X_2\plus X_3 }
  \BinRule{ \vdash \Gamma_1, \Gamma_2, \Gamma_3, (X_1\tens X_2)\tens(X_2\plus X_3) }
  \DisplayProof
\qquad
  \AxRule{ \vdash \Gamma_1, X_1 }
  \AxRule{ \vdash \Gamma_2, X_2 }
  \BinRule{ \vdash \Gamma_1, \Gamma_2, X_1\tens X_2 }
  \AxRule{ \vdash \Gamma_3, X_3 }
  \UnaRule{ \vdash \Gamma_3, X_2\plus X_3 }
  \BinRule{ \vdash \Gamma_1, \Gamma_2, \Gamma_3, (X_1\tens X_2)\tens(X_2\plus X_3) }
  \DisplayProof

These are actually different, in particular their premisses differ. Each possible derivation corresponds to the choice of one side of the \plus connective.

We can remark that the branches of the negative inference precisely correspond to the possible positive inferences:

  • the first branch of the negative inference has a premiss X1,X2,X2 and the first positive inference has three premisses, holding X1, X2 and X2 respectively.
  • the second branch of the negative inference has a premiss X1,X2,X3 and the second positive inference has three premisses, holding X1, X2 and X3 respectively.

This phenomenon extends to all generalized connectives.

Definition

The branching of a generalized connective P[X_1,\ldots,X_n] is the multiset \mathcal{I}_P of multisets over \{1,\ldots,n\} defined inductively as

 \mathcal{I}_{P\tens Q} := [ I+J \mid I\in\mathcal{I}_P, J\in\mathcal{I}_Q ] ,  \mathcal{I}_{P\plus Q} := \mathcal{I}_P + \mathcal{I}_Q ,  \mathcal{I}_\one := [[]] ,  \mathcal{I}_\zero := [] ,  \mathcal{I}_{X_i} := [[i]] .

The branching of a negative generalized connective is the branching of its dual. Elements of a branching are called branches.

In the example above, the branching will be [[1,2,2],[1,2,3]], which corresponds to the granches of the negative inference and to the cases of positive inference.

Definition

Let \mathcal{I} be a branching. Write \mathcal{I} as [I_1,\ldots,I_k] and write each Ij as [i_{j,1},\ldots,i_{j,\ell_j}]. The derived rule for a negative generalized connective N with branching \mathcal{I} is


    \AxRule{ \vdash \Gamma, A_{i_{1,1}}, \ldots, A_{i_{1,\ell_1}} }
    \AxRule{ \cdots }
    \AxRule{ \vdash \Gamma, A_{i_{k,1}}, \ldots, A_{i_{k,\ell_k}} }
    \LabelRule{N}
    \TriRule{ \vdash \Gamma, N[A_1,\ldots,A_n] }
    \DisplayProof

For each branch I=[i_1,\ldots,i_\ell] of a positive generalized connective P, the derived rule for branch I of P is


    \AxRule{ \vdash \Gamma_1, A_{i_1} }
    \AxRule{ \cdots }
    \AxRule{ \vdash \Gamma_\ell, A_{i_\ell} }
    \LabelRule{P_I}
    \TriRule{ \vdash \Gamma_1, \ldots, \Gamma_\ell, P[A_1,\ldots,A_n] }
    \DisplayProof

The reversibility property of negative connectives can be rephrased in a generalized way as

Theorem

Let N be a negative generalized connective. A sequent \vdash\Gamma,N[A_1,\ldots,A_n] is provable if and only if, for each [i_1,\ldots,i_k]\in\mathcal{I}_N, the sequent \vdash\Gamma,A_{i_1},\ldots,A_{i_k} is provable.

The corresponding property for positive connectives is the focalization property, defined in the next section.

Focalization

Definition

A formula is positive if it has a main connective among \tens, \plus, \one, \zero. It is called negative if it has a main connective among \parr, \with, \bot, \top. It is called neutral if it is neither positive nor negative.

If we extended the theory to include quantifiers in generalized connectives, then the definition of positive and negative formulas would be extended to include them too.

Definition

A proof \pi\vdash\Gamma,A is said to be focalized on A if it has the shape


    \AxRule{ \pi_1 \vdash \Gamma_1, A_{i_1} }
    \AxRule{ \cdots }
    \AxRule{ \pi_\ell \vdash \Gamma_\ell, A_{i_\ell} }
    \LabelRule{P_{[i_1,\ldots,i_\ell]}}
    \TriRule{ \vdash  \Gamma_1, \ldots, \Gamma_\ell, P[A_1,\ldots,A_n] }
    \DisplayProof

where P is a positive generalized connective, the Ai ar non-positive and A=P[A_1,\ldots,A_n]. The formula A is called the focus of the proof π.

In other words, a proof is focalized on a conclusion A if its last rules build A from some of its non-positive subformulas in one cluster of inferences. Note that this notion only makes sense for a conclusion made only of positive formulas, since a proof is obviously focalized on any of its non-positive conclusions, using the degenerate generalized connective P[X] = X.

Theorem

A sequent \vdash\Gamma is cut-free provable if and only if it is provable by a cut-free focalized proof.

Proof.  We reason by induction on a proof π of Γ. As noted above, the result trivially holds if Γ has a non-positive formula. We can thus assume that Γ contains only positive formulas and reason on the nature of the last rule, which is necessarily the introduction of a positive connective (it cannot be an axiom rule because an axiom always has at least on non-positive conclusion).

Suppose that the last rule of π introduces a tensor, so that π is


    \AxRule{ \rho \vdash \Gamma, A }
    \AxRule{ \theta \vdash \Delta, B }
    \BinRule{ \vdash \Gamma, \Delta, A\tens B }
    \DisplayProof

By induction hypothesis, there are focalized proofs \rho'\vdash\Gamma,A and \theta'\vdash\Delta,B. If A is the focus of ρ' and B is the focus of θ', then the proof


    \AxRule{ \rho' \vdash \Gamma, A }
    \AxRule{ \theta' \vdash \Delta, B }
    \BinRule{ \vdash \Gamma, \Delta, A\tens B }
    \DisplayProof

is focalized on A\tens B, so we can conclude. Otherwise, one of the two proofs is focalized on another conclusion. Without loss of generality, suppose that ρ' is not focalized on A. Then it decomposes as


    \AxRule{ \rho_1 \vdash \Gamma_1, C_{i_1} }
    \AxRule{ \cdots }
    \AxRule{ \rho_\ell \vdash \Gamma_\ell, C_{i_\ell} }
    \TriRule{ \vdash  \Gamma_1, \ldots, \Gamma_\ell, P[C_1,\ldots,C_n] }
    \DisplayProof

where the Ci are not positive and A belongs to some context Γj that we will write Γ'j,A. Then we can conclude with the proof


    \AxRule{ \rho_1 \vdash \Gamma_1, C_{i_1} \quad\cdots }
    \AxRule{ \rho_j \vdash \Gamma_j, A, C_{i_j} }
    \AxRule{ \theta \vdash \Delta, B }
    \BinRule{ \vdash \Gamma_j, \Delta, A\tens B, C_{i_j} }
    \AxRule{ \cdots\quad \rho_\ell \vdash \Gamma_\ell, C_{i_\ell} }
    \TriRule{ \vdash \Gamma_1, \ldots, \Gamma_\ell, \Delta, A\tens B, P[C_1,\ldots,C_n] }
    \DisplayProof

which is focalized on P[C_1,\ldots,C_n].

If the last rule of π introduces a \plus, we proceed the same way except that there is only one premiss. If the last rule of π introduces a \one, then it is the only rule of π, which is thus focalized on this \one.

As in the reversibility theorem, this proof only makes use of commutation of independent rules.

These results say nothing about exponential modalities, because they respect neither reversibility nor focalization. However, if we consider the fragment of LL which consists only of multiplicative and additive connectives, we can restrict the proof rules to enforce focalization without loss of expressiveness.

Personal tools