wneuper/isa: doc-src/Ref/theories.tex@20e3616b2d9c

     2 \chapter{Theories, Terms and Types} \label{theories}

     3 \index{theories|(}

     5 \section{The theory loader}\label{sec:more-theories}

     6 \index{theories!reading}\index{files!reading}

     8 Isabelle's theory loader manages dependencies of the internal graph of theory

     9 nodes (the \emph{theory database}) and the external view of the file system.

    10 See \S\ref{sec:intro-theories} for its most basic commands, such as

    11 \texttt{use_thy}.  There are a few more operations available.

    13 \begin{ttbox}

    14 use_thy_only    : string -> unit

    15 update_thy_only : string -> unit

    16 touch_thy       : string -> unit

    17 remove_thy      : string -> unit

    18 delete_tmpfiles : bool ref \hfill\textbf{initially true}

    19 \end{ttbox}

    21 \begin{ttdescription}

    22 \item[\ttindexbold{use_thy_only} "$name$";] is similar to \texttt{use_thy},

    23   but processes the actual theory file $name$\texttt{.thy} only, ignoring

    24   $name$\texttt{.ML}.  This might be useful in replaying proof scripts by hand

    25   from the very beginning, starting with the fresh theory.

    27 \item[\ttindexbold{update_thy_only} "$name$";] is similar to

    28   \texttt{update_thy}, but processes the actual theory file

    29   $name$\texttt{.thy} only, ignoring $name$\texttt{.ML}.

    31 \item[\ttindexbold{touch_thy} "$name$";] marks theory node $name$ of the

    32   internal graph as outdated.  While the theory remains usable, subsequent

    33   operations such as \texttt{use_thy} may cause a reload.

    35 \item[\ttindexbold{remove_thy} "$name$";] deletes theory node $name$,

    36   including \emph{all} of its descendants.  Beware!  This is a quick way to

    37   dispose a large number of theories at once.  Note that {\ML} bindings to

    38   theorems etc.\ of removed theories may still persist.

    40 \end{ttdescription}

    42 \medskip Theory and {\ML} files are located by skimming through the

    43 directories listed in Isabelle's internal load path, which merely contains the

    44 current directory ``\texttt{.}'' by default.  The load path may be accessed by

    45 the following operations.

    47 \begin{ttbox}

    48 show_path: unit -> string list

    49 add_path: string -> unit

    50 del_path: string -> unit

    51 reset_path: unit -> unit

    52 with_path: string -> ('a -> 'b) -> 'a -> 'b

    53 no_document: ('a -> 'b) -> 'a -> 'b

    54 \end{ttbox}

    56 \begin{ttdescription}

    57 \item[\ttindexbold{show_path}();] displays the load path components in

    58   canonical string representation (which is always according to Unix rules).

    60 \item[\ttindexbold{add_path} "$dir$";] adds component $dir$ to the beginning

    61   of the load path.

    63 \item[\ttindexbold{del_path} "$dir$";] removes any occurrences of component

    64   $dir$ from the load path.

    66 \item[\ttindexbold{reset_path}();] resets the load path to ``\texttt{.}''

    67   (current directory) only.

    69 \item[\ttindexbold{with_path} "$dir$" $f$ $x$;] temporarily adds component

    70   $dir$ to the beginning of the load path while executing $(f~x)$.

    72 \item[\ttindexbold{no_document} $f$ $x$;] temporarily disables {\LaTeX}

    73   document generation while executing $(f~x)$.

    75 \end{ttdescription}

    77 Furthermore, in operations referring indirectly to some file (e.g.\

    78 \texttt{use_dir}) the argument may be prefixed by a directory that will be

    79 temporarily appended to the load path, too.

    82 \section{Basic operations on theories}\label{BasicOperationsOnTheories}

    84 \subsection{*Theory inclusion}

    85 \begin{ttbox}

    86 subthy      : theory * theory -> bool

    87 eq_thy      : theory * theory -> bool

    88 transfer    : theory -> thm -> thm

    89 transfer_sg : Sign.sg -> thm -> thm

    90 \end{ttbox}

    92 Inclusion and equality of theories is determined by unique

    93 identification stamps that are created when declaring new components.

    94 Theorems contain a reference to the theory (actually to its signature)

    95 they have been derived in.  Transferring theorems to super theories

    96 has no logical significance, but may affect some operations in subtle

    97 ways (e.g.\ implicit merges of signatures when applying rules, or

    98 pretty printing of theorems).

   100 \begin{ttdescription}

   102 \item[\ttindexbold{subthy} ($thy@1$, $thy@2$)] determines if $thy@1$

   103   is included in $thy@2$ wrt.\ identification stamps.

   105 \item[\ttindexbold{eq_thy} ($thy@1$, $thy@2$)] determines if $thy@1$

   106   is exactly the same as $thy@2$.

   108 \item[\ttindexbold{transfer} $thy$ $thm$] transfers theorem $thm$ to

   109   theory $thy$, provided the latter includes the theory of $thm$.

   111 \item[\ttindexbold{transfer_sg} $sign$ $thm$] is similar to

   112   \texttt{transfer}, but identifies the super theory via its

   113   signature.

   115 \end{ttdescription}

   118 \subsection{*Primitive theories}

   119 \begin{ttbox}

   120 ProtoPure.thy  : theory

   121 Pure.thy       : theory

   122 CPure.thy      : theory

   123 \end{ttbox}

   124 \begin{description}

   125 \item[\ttindexbold{ProtoPure.thy}, \ttindexbold{Pure.thy},

   126   \ttindexbold{CPure.thy}] contain the syntax and signature of the

   127   meta-logic.  There are basically no axioms: meta-level inferences

   128   are carried out by \ML\ functions.  \texttt{Pure} and \texttt{CPure}

   129   just differ in their concrete syntax of prefix function application:

   130   $t(u@1, \ldots, u@n)$ in \texttt{Pure} vs.\ $t\,u@1,\ldots\,u@n$ in

   131   \texttt{CPure}.  \texttt{ProtoPure} is their common parent,

   132   containing no syntax for printing prefix applications at all!

   134 %% FIXME

   135 %\item [\ttindexbold{extend_theory} $thy$ {\tt"}$T${\tt"} $\cdots$] extends

   136 %  the theory $thy$ with new types, constants, etc.  $T$ identifies the theory

   137 %  internally.  When a theory is redeclared, say to change an incorrect axiom,

   138 %  bindings to the old axiom may persist.  Isabelle ensures that the old and

   139 %  new theories are not involved in the same proof.  Attempting to combine

   140 %  different theories having the same name $T$ yields the fatal error

   141 %extend_theory  : theory -> string -> \(\cdots\) -> theory

   142 %\begin{ttbox}

   143 %Attempt to merge different versions of theory: \(T\)

   144 %\end{ttbox}

   145 \end{description}

   147 %% FIXME

   148 %\item [\ttindexbold{extend_theory} $thy$ {\tt"}$T${\tt"}

   149 %      ($classes$, $default$, $types$, $arities$, $consts$, $sextopt$) $rules$]

   150 %\hfill\break   %%% include if line is just too short

   151 %is the \ML{} equivalent of the following theory definition:

   152 %\begin{ttbox}

   153 %\(T\) = \(thy\) +

   154 %classes \(c\) < \(c@1\),\(\dots\),\(c@m\)

   155 %        \dots

   156 %default {\(d@1,\dots,d@r\)}

   157 %types   \(tycon@1\),\dots,\(tycon@i\) \(n\)

   158 %        \dots

   159 %arities \(tycon@1'\),\dots,\(tycon@j'\) :: (\(s@1\),\dots,\(s@n\))\(c\)

   160 %        \dots

   161 %consts  \(b@1\),\dots,\(b@k\) :: \(\tau\)

   162 %        \dots

   163 %rules   \(name\) \(rule\)

   164 %        \dots

   165 %end

   166 %\end{ttbox}

   167 %where

   168 %\begin{tabular}[t]{l@{~=~}l}

   169 %$classes$ & \tt[("$c$",["$c@1$",\dots,"$c@m$"]),\dots] \\

   170 %$default$ & \tt["$d@1$",\dots,"$d@r$"]\\

   171 %$types$   & \tt[([$tycon@1$,\dots,$tycon@i$], $n$),\dots] \\

   172 %$arities$ & \tt[([$tycon'@1$,\dots,$tycon'@j$], ([$s@1$,\dots,$s@n$],$c$)),\dots]

   173 %\\

   174 %$consts$  & \tt[([$b@1$,\dots,$b@k$],$\tau$),\dots] \\

   175 %$rules$   & \tt[("$name$",$rule$),\dots]

   176 %\end{tabular}

   179 \subsection{Inspecting a theory}\label{sec:inspct-thy}

   180 \index{theories!inspecting|bold}

   181 \begin{ttbox}

   182 print_syntax        : theory -> unit

   183 print_theory        : theory -> unit

   184 parents_of          : theory -> theory list

   185 ancestors_of        : theory -> theory list

   186 sign_of             : theory -> Sign.sg

   187 Sign.stamp_names_of : Sign.sg -> string list

   188 \end{ttbox}

   189 These provide means of viewing a theory's components.

   190 \begin{ttdescription}

   191 \item[\ttindexbold{print_syntax} $thy$] prints the syntax of $thy$

   192   (grammar, macros, translation functions etc., see

   193   page~\pageref{pg:print_syn} for more details).

   195 \item[\ttindexbold{print_theory} $thy$] prints the logical parts of

   196   $thy$, excluding the syntax.

   198 \item[\ttindexbold{parents_of} $thy$] returns the direct ancestors

   199   of~$thy$.

   201 \item[\ttindexbold{ancestors_of} $thy$] returns all ancestors of~$thy$

   202   (not including $thy$ itself).

   204 \item[\ttindexbold{sign_of} $thy$] returns the signature associated

   205   with~$thy$.  It is useful with functions like {\tt

   206     read_instantiate_sg}, which take a signature as an argument.

   208 \item[\ttindexbold{Sign.stamp_names_of} $sg$]\index{signatures}

   209   returns the names of the identification \rmindex{stamps} of ax

   210   signature.  These coincide with the names of its full ancestry

   211   including that of $sg$ itself.

   213 \end{ttdescription}

   216 \section{Terms}\label{sec:terms}

   217 \index{terms|bold}

   218 Terms belong to the \ML\ type \mltydx{term}, which is a concrete datatype

   219 with six constructors:

   220 \begin{ttbox}

   221 type indexname = string * int;

   222 infix 9 $;

   223 datatype term = Const of string * typ

   224               | Free  of string * typ

   225               | Var   of indexname * typ

   226               | Bound of int

   227               | Abs   of string * typ * term

   228               | op $  of term * term;

   229 \end{ttbox}

   230 \begin{ttdescription}

   231 \item[\ttindexbold{Const} ($a$, $T$)] \index{constants|bold}

   232   is the \textbf{constant} with name~$a$ and type~$T$.  Constants include

   233   connectives like $\land$ and $\forall$ as well as constants like~0

   234   and~$Suc$.  Other constants may be required to define a logic's concrete

   235   syntax.

   237 \item[\ttindexbold{Free} ($a$, $T$)] \index{variables!free|bold}

   238   is the \textbf{free variable} with name~$a$ and type~$T$.

   240 \item[\ttindexbold{Var} ($v$, $T$)] \index{unknowns|bold}

   241   is the \textbf{scheme variable} with indexname~$v$ and type~$T$.  An

   242   \mltydx{indexname} is a string paired with a non-negative index, or

   243   subscript; a term's scheme variables can be systematically renamed by

   244   incrementing their subscripts.  Scheme variables are essentially free

   245   variables, but may be instantiated during unification.

   247 \item[\ttindexbold{Bound} $i$] \index{variables!bound|bold}

   248   is the \textbf{bound variable} with de Bruijn index~$i$, which counts the

   249   number of lambdas, starting from zero, between a variable's occurrence

   250   and its binding.  The representation prevents capture of variables.  For

   251   more information see de Bruijn \cite{debruijn72} or

   252   Paulson~\cite[page~376]{paulson-ml2}.

   254 \item[\ttindexbold{Abs} ($a$, $T$, $u$)]

   255   \index{lambda abs@$\lambda$-abstractions|bold}

   256   is the $\lambda$-\textbf{abstraction} with body~$u$, and whose bound

   257   variable has name~$a$ and type~$T$.  The name is used only for parsing

   258   and printing; it has no logical significance.

   260 \item[$t$ \$ $u$] \index{$@{\tt\$}|bold} \index{function applications|bold}

   261 is the \textbf{application} of~$t$ to~$u$.

   262 \end{ttdescription}

   263 Application is written as an infix operator to aid readability.  Here is an

   264 \ML\ pattern to recognize FOL formulae of the form~$A\imp B$, binding the

   265 subformulae to~$A$ and~$B$:

   266 \begin{ttbox}

   267 Const("Trueprop",_) $ (Const("op -->",_) $ A $ B)

   268 \end{ttbox}

   271 \section{*Variable binding}

   272 \begin{ttbox}

   273 loose_bnos     : term -> int list

   274 incr_boundvars : int -> term -> term

   275 abstract_over  : term*term -> term

   276 variant_abs    : string * typ * term -> string * term

   277 aconv          : term * term -> bool\hfill\textbf{infix}

   278 \end{ttbox}

   279 These functions are all concerned with the de Bruijn representation of

   280 bound variables.

   281 \begin{ttdescription}

   282 \item[\ttindexbold{loose_bnos} $t$]

   283   returns the list of all dangling bound variable references.  In

   284   particular, \texttt{Bound~0} is loose unless it is enclosed in an

   285   abstraction.  Similarly \texttt{Bound~1} is loose unless it is enclosed in

   286   at least two abstractions; if enclosed in just one, the list will contain

   287   the number 0.  A well-formed term does not contain any loose variables.

   289 \item[\ttindexbold{incr_boundvars} $j$]

   290   increases a term's dangling bound variables by the offset~$j$.  This is

   291   required when moving a subterm into a context where it is enclosed by a

   292   different number of abstractions.  Bound variables with a matching

   293   abstraction are unaffected.

   295 \item[\ttindexbold{abstract_over} $(v,t)$]

   296   forms the abstraction of~$t$ over~$v$, which may be any well-formed term.

   297   It replaces every occurrence of \(v\) by a \texttt{Bound} variable with the

   298   correct index.

   300 \item[\ttindexbold{variant_abs} $(a,T,u)$]

   301   substitutes into $u$, which should be the body of an abstraction.

   302   It replaces each occurrence of the outermost bound variable by a free

   303   variable.  The free variable has type~$T$ and its name is a variant

   304   of~$a$ chosen to be distinct from all constants and from all variables

   305   free in~$u$.

   307 \item[$t$ \ttindexbold{aconv} $u$]

   308   tests whether terms~$t$ and~$u$ are \(\alpha\)-convertible: identical up

   309   to renaming of bound variables.

   310 \begin{itemize}

   311   \item

   312     Two constants, \texttt{Free}s, or \texttt{Var}s are \(\alpha\)-convertible

   313     if their names and types are equal.

   314     (Variables having the same name but different types are thus distinct.

   315     This confusing situation should be avoided!)

   316   \item

   317     Two bound variables are \(\alpha\)-convertible

   318     if they have the same number.

   319   \item

   320     Two abstractions are \(\alpha\)-convertible

   321     if their bodies are, and their bound variables have the same type.

   322   \item

   323     Two applications are \(\alpha\)-convertible

   324     if the corresponding subterms are.

   325 \end{itemize}

   327 \end{ttdescription}

   329 \section{Certified terms}\index{terms!certified|bold}\index{signatures}

   330 A term $t$ can be \textbf{certified} under a signature to ensure that every type

   331 in~$t$ is well-formed and every constant in~$t$ is a type instance of a

   332 constant declared in the signature.  The term must be well-typed and its use

   333 of bound variables must be well-formed.  Meta-rules such as \texttt{forall_elim}

   334 take certified terms as arguments.

   336 Certified terms belong to the abstract type \mltydx{cterm}.

   337 Elements of the type can only be created through the certification process.

   338 In case of error, Isabelle raises exception~\ttindex{TERM}\@.

   340 \subsection{Printing terms}

   341 \index{terms!printing of}

   342 \begin{ttbox}

   343      string_of_cterm :           cterm -> string

   344 Sign.string_of_term  : Sign.sg -> term -> string

   345 \end{ttbox}

   346 \begin{ttdescription}

   347 \item[\ttindexbold{string_of_cterm} $ct$]

   348 displays $ct$ as a string.

   350 \item[\ttindexbold{Sign.string_of_term} $sign$ $t$]

   351 displays $t$ as a string, using the syntax of~$sign$.

   352 \end{ttdescription}

   354 \subsection{Making and inspecting certified terms}

   355 \begin{ttbox}

   356 cterm_of       : Sign.sg -> term -> cterm

   357 read_cterm     : Sign.sg -> string * typ -> cterm

   358 cert_axm       : Sign.sg -> string * term -> string * term

   359 read_axm       : Sign.sg -> string * string -> string * term

   360 rep_cterm      : cterm -> \{T:typ, t:term, sign:Sign.sg, maxidx:int\}

   361 Sign.certify_term : Sign.sg -> term -> term * typ * int

   362 \end{ttbox}

   363 \begin{ttdescription}

   365 \item[\ttindexbold{cterm_of} $sign$ $t$] \index{signatures} certifies

   366   $t$ with respect to signature~$sign$.

   368 \item[\ttindexbold{read_cterm} $sign$ ($s$, $T$)] reads the string~$s$

   369   using the syntax of~$sign$, creating a certified term.  The term is

   370   checked to have type~$T$; this type also tells the parser what kind

   371   of phrase to parse.

   373 \item[\ttindexbold{cert_axm} $sign$ ($name$, $t$)] certifies $t$ with

   374   respect to $sign$ as a meta-proposition and converts all exceptions

   375   to an error, including the final message

   376 \begin{ttbox}

   377 The error(s) above occurred in axiom "\(name\)"

   378 \end{ttbox}

   380 \item[\ttindexbold{read_axm} $sign$ ($name$, $s$)] similar to {\tt

   381     cert_axm}, but first reads the string $s$ using the syntax of

   382   $sign$.

   384 \item[\ttindexbold{rep_cterm} $ct$] decomposes $ct$ as a record

   385   containing its type, the term itself, its signature, and the maximum

   386   subscript of its unknowns.  The type and maximum subscript are

   387   computed during certification.

   389 \item[\ttindexbold{Sign.certify_term}] is a more primitive version of

   390   \texttt{cterm_of}, returning the internal representation instead of

   391   an abstract \texttt{cterm}.

   393 \end{ttdescription}

   396 \section{Types}\index{types|bold}

   397 Types belong to the \ML\ type \mltydx{typ}, which is a concrete datatype with

   398 three constructor functions.  These correspond to type constructors, free

   399 type variables and schematic type variables.  Types are classified by sorts,

   400 which are lists of classes (representing an intersection).  A class is

   401 represented by a string.

   402 \begin{ttbox}

   403 type class = string;

   404 type sort  = class list;

   406 datatype typ = Type  of string * typ list

   407              | TFree of string * sort

   408              | TVar  of indexname * sort;

   410 infixr 5 -->;

   411 fun S --> T = Type ("fun", [S, T]);

   412 \end{ttbox}

   413 \begin{ttdescription}

   414 \item[\ttindexbold{Type} ($a$, $Ts$)] \index{type constructors|bold}

   415   applies the \textbf{type constructor} named~$a$ to the type operand list~$Ts$.

   416   Type constructors include~\tydx{fun}, the binary function space

   417   constructor, as well as nullary type constructors such as~\tydx{prop}.

   418   Other type constructors may be introduced.  In expressions, but not in

   419   patterns, \hbox{\tt$S$-->$T$} is a convenient shorthand for function

   420   types.

   422 \item[\ttindexbold{TFree} ($a$, $s$)] \index{type variables|bold}

   423   is the \textbf{type variable} with name~$a$ and sort~$s$.

   425 \item[\ttindexbold{TVar} ($v$, $s$)] \index{type unknowns|bold}

   426   is the \textbf{type unknown} with indexname~$v$ and sort~$s$.

   427   Type unknowns are essentially free type variables, but may be

   428   instantiated during unification.

   429 \end{ttdescription}

   432 \section{Certified types}

   433 \index{types!certified|bold}

   434 Certified types, which are analogous to certified terms, have type

   435 \ttindexbold{ctyp}.

   437 \subsection{Printing types}

   438 \index{types!printing of}

   439 \begin{ttbox}

   440      string_of_ctyp :           ctyp -> string

   441 Sign.string_of_typ  : Sign.sg -> typ -> string

   442 \end{ttbox}

   443 \begin{ttdescription}

   444 \item[\ttindexbold{string_of_ctyp} $cT$]

   445 displays $cT$ as a string.

   447 \item[\ttindexbold{Sign.string_of_typ} $sign$ $T$]

   448 displays $T$ as a string, using the syntax of~$sign$.

   449 \end{ttdescription}

   452 \subsection{Making and inspecting certified types}

   453 \begin{ttbox}

   454 ctyp_of          : Sign.sg -> typ -> ctyp

   455 rep_ctyp         : ctyp -> \{T: typ, sign: Sign.sg\}

   456 Sign.certify_typ : Sign.sg -> typ -> typ

   457 \end{ttbox}

   458 \begin{ttdescription}

   460 \item[\ttindexbold{ctyp_of} $sign$ $T$] \index{signatures} certifies

   461   $T$ with respect to signature~$sign$.

   463 \item[\ttindexbold{rep_ctyp} $cT$] decomposes $cT$ as a record

   464   containing the type itself and its signature.

   466 \item[\ttindexbold{Sign.certify_typ}] is a more primitive version of

   467   \texttt{ctyp_of}, returning the internal representation instead of

   468   an abstract \texttt{ctyp}.

   470 \end{ttdescription}

   473 \index{theories|)}

   476 %%% Local Variables:

   477 %%% mode: latex

   478 %%% TeX-master: "ref"

   479 %%% End:

author	Walther Neuper <neuper@ist.tugraz.at>
	Thu, 12 Aug 2010 15:03:34 +0200
branch	isac-from-Isabelle2009-2
changeset 37913	20e3616b2d9c
parent 30184	37969710e61f
child 40277	eb47307ab930
permissions	-rw-r--r--