doc-src/IsarRef/Thy/Outer_Syntax.thy
author Walther Neuper <neuper@ist.tugraz.at>
Thu, 12 Aug 2010 15:03:34 +0200
branchisac-from-Isabelle2009-2
changeset 37913 20e3616b2d9c
parent 35841 94f901e4969a
child 40536 47f572aff50a
permissions -rw-r--r--
prepare reactivation of isac-update-Isa09-2
wenzelm@27037
     1
theory Outer_Syntax
wenzelm@27050
     2
imports Main
wenzelm@27037
     3
begin
wenzelm@27037
     4
wenzelm@27040
     5
chapter {* Outer syntax *}
wenzelm@27037
     6
wenzelm@27037
     7
text {*
wenzelm@27037
     8
  The rather generic framework of Isabelle/Isar syntax emerges from
wenzelm@27037
     9
  three main syntactic categories: \emph{commands} of the top-level
wenzelm@27037
    10
  Isar engine (covering theory and proof elements), \emph{methods} for
wenzelm@27037
    11
  general goal refinements (analogous to traditional ``tactics''), and
wenzelm@27037
    12
  \emph{attributes} for operations on facts (within a certain
wenzelm@27037
    13
  context).  Subsequently we give a reference of basic syntactic
wenzelm@27037
    14
  entities underlying Isabelle/Isar syntax in a bottom-up manner.
wenzelm@27037
    15
  Concrete theory and proof language elements will be introduced later
wenzelm@27037
    16
  on.
wenzelm@27037
    17
wenzelm@27037
    18
  \medskip In order to get started with writing well-formed
wenzelm@27037
    19
  Isabelle/Isar documents, the most important aspect to be noted is
wenzelm@27037
    20
  the difference of \emph{inner} versus \emph{outer} syntax.  Inner
wenzelm@27037
    21
  syntax is that of Isabelle types and terms of the logic, while outer
wenzelm@27037
    22
  syntax is that of Isabelle/Isar theory sources (specifications and
wenzelm@27037
    23
  proofs).  As a general rule, inner syntax entities may occur only as
wenzelm@27037
    24
  \emph{atomic entities} within outer syntax.  For example, the string
wenzelm@27037
    25
  @{verbatim "\"x + y\""} and identifier @{verbatim z} are legal term
wenzelm@27037
    26
  specifications within a theory, while @{verbatim "x + y"} without
wenzelm@27037
    27
  quotes is not.
wenzelm@27037
    28
wenzelm@27037
    29
  Printed theory documents usually omit quotes to gain readability
wenzelm@27037
    30
  (this is a matter of {\LaTeX} macro setup, say via @{verbatim
wenzelm@27037
    31
  "\\isabellestyle"}, see also \cite{isabelle-sys}).  Experienced
wenzelm@27037
    32
  users of Isabelle/Isar may easily reconstruct the lost technical
wenzelm@27037
    33
  information, while mere readers need not care about quotes at all.
wenzelm@27037
    34
wenzelm@27037
    35
  \medskip Isabelle/Isar input may contain any number of input
wenzelm@27037
    36
  termination characters ``@{verbatim ";"}'' (semicolon) to separate
wenzelm@27037
    37
  commands explicitly.  This is particularly useful in interactive
wenzelm@27037
    38
  shell sessions to make clear where the current command is intended
wenzelm@27037
    39
  to end.  Otherwise, the interpreter loop will continue to issue a
wenzelm@27037
    40
  secondary prompt ``@{verbatim "#"}'' until an end-of-command is
wenzelm@27037
    41
  clearly recognized from the input syntax, e.g.\ encounter of the
wenzelm@27037
    42
  next command keyword.
wenzelm@27037
    43
wenzelm@27037
    44
  More advanced interfaces such as Proof~General \cite{proofgeneral}
wenzelm@27037
    45
  do not require explicit semicolons, the amount of input text is
wenzelm@27037
    46
  determined automatically by inspecting the present content of the
wenzelm@27037
    47
  Emacs text buffer.  In the printed presentation of Isabelle/Isar
wenzelm@27037
    48
  documents semicolons are omitted altogether for readability.
wenzelm@27037
    49
wenzelm@27037
    50
  \begin{warn}
wenzelm@27037
    51
    Proof~General requires certain syntax classification tables in
wenzelm@27037
    52
    order to achieve properly synchronized interaction with the
wenzelm@27037
    53
    Isabelle/Isar process.  These tables need to be consistent with
wenzelm@27037
    54
    the Isabelle version and particular logic image to be used in a
wenzelm@27037
    55
    running session (common object-logics may well change the outer
wenzelm@27037
    56
    syntax).  The standard setup should work correctly with any of the
wenzelm@27037
    57
    ``official'' logic images derived from Isabelle/HOL (including
wenzelm@27037
    58
    HOLCF etc.).  Users of alternative logics may need to tell
wenzelm@27037
    59
    Proof~General explicitly, e.g.\ by giving an option @{verbatim "-k ZF"}
wenzelm@27037
    60
    (in conjunction with @{verbatim "-l ZF"}, to specify the default
wenzelm@27037
    61
    logic image).  Note that option @{verbatim "-L"} does both
wenzelm@27037
    62
    of this at the same time.
wenzelm@27037
    63
  \end{warn}
wenzelm@27037
    64
*}
wenzelm@27037
    65
wenzelm@27037
    66
wenzelm@28774
    67
section {* Lexical matters \label{sec:outer-lex} *}
wenzelm@27037
    68
wenzelm@28775
    69
text {* The outer lexical syntax consists of three main categories of
wenzelm@28776
    70
  syntax tokens:
wenzelm@27037
    71
wenzelm@28775
    72
  \begin{enumerate}
wenzelm@27037
    73
wenzelm@28775
    74
  \item \emph{major keywords} --- the command names that are available
wenzelm@28775
    75
  in the present logic session;
wenzelm@28775
    76
wenzelm@28775
    77
  \item \emph{minor keywords} --- additional literal tokens required
wenzelm@28775
    78
  by the syntax of commands;
wenzelm@28775
    79
wenzelm@28776
    80
  \item \emph{named tokens} --- various categories of identifiers etc.
wenzelm@28775
    81
wenzelm@28775
    82
  \end{enumerate}
wenzelm@28775
    83
wenzelm@28776
    84
  Major keywords and minor keywords are guaranteed to be disjoint.
wenzelm@28775
    85
  This helps user-interfaces to determine the overall structure of a
wenzelm@28775
    86
  theory text, without knowing the full details of command syntax.
wenzelm@28776
    87
  Internally, there is some additional information about the kind of
wenzelm@28776
    88
  major keywords, which approximates the command type (theory command,
wenzelm@28776
    89
  proof command etc.).
wenzelm@28775
    90
wenzelm@28775
    91
  Keywords override named tokens.  For example, the presence of a
wenzelm@28775
    92
  command called @{verbatim term} inhibits the identifier @{verbatim
wenzelm@28775
    93
  term}, but the string @{verbatim "\"term\""} can be used instead.
wenzelm@28775
    94
  By convention, the outer syntax always allows quoted strings in
wenzelm@28775
    95
  addition to identifiers, wherever a named entity is expected.
wenzelm@28775
    96
wenzelm@28776
    97
  When tokenizing a given input sequence, the lexer repeatedly takes
wenzelm@28776
    98
  the longest prefix of the input that forms a valid token.  Spaces,
wenzelm@28776
    99
  tabs, newlines and formfeeds between tokens serve as explicit
wenzelm@28776
   100
  separators.
wenzelm@28776
   101
wenzelm@28775
   102
  \medskip The categories for named tokens are defined once and for
wenzelm@28775
   103
  all as follows.
wenzelm@28775
   104
wenzelm@28776
   105
  \begin{center}
wenzelm@28775
   106
  \begin{supertabular}{rcl}
wenzelm@28775
   107
    @{syntax_def ident} & = & @{text "letter quasiletter\<^sup>*"} \\
wenzelm@28775
   108
    @{syntax_def longident} & = & @{text "ident("}@{verbatim "."}@{text "ident)\<^sup>+"} \\
wenzelm@28775
   109
    @{syntax_def symident} & = & @{text "sym\<^sup>+  |  "}@{verbatim "\\"}@{verbatim "<"}@{text ident}@{verbatim ">"} \\
wenzelm@28775
   110
    @{syntax_def nat} & = & @{text "digit\<^sup>+"} \\
wenzelm@28775
   111
    @{syntax_def var} & = & @{verbatim "?"}@{text "ident  |  "}@{verbatim "?"}@{text ident}@{verbatim "."}@{text nat} \\
wenzelm@28775
   112
    @{syntax_def typefree} & = & @{verbatim "'"}@{text ident} \\
wenzelm@28775
   113
    @{syntax_def typevar} & = & @{verbatim "?"}@{text "typefree  |  "}@{verbatim "?"}@{text typefree}@{verbatim "."}@{text nat} \\
wenzelm@28775
   114
    @{syntax_def string} & = & @{verbatim "\""} @{text "\<dots>"} @{verbatim "\""} \\
wenzelm@28775
   115
    @{syntax_def altstring} & = & @{verbatim "`"} @{text "\<dots>"} @{verbatim "`"} \\
wenzelm@28775
   116
    @{syntax_def verbatim} & = & @{verbatim "{*"} @{text "\<dots>"} @{verbatim "*"}@{verbatim "}"} \\[1ex]
wenzelm@28775
   117
wenzelm@28775
   118
    @{text letter} & = & @{text "latin  |  "}@{verbatim "\\"}@{verbatim "<"}@{text latin}@{verbatim ">"}@{text "  |  "}@{verbatim "\\"}@{verbatim "<"}@{text "latin latin"}@{verbatim ">"}@{text "  |  greek  |"} \\
wenzelm@28775
   119
          &   & @{verbatim "\<^isub>"}@{text "  |  "}@{verbatim "\<^isup>"} \\
wenzelm@28775
   120
    @{text quasiletter} & = & @{text "letter  |  digit  |  "}@{verbatim "_"}@{text "  |  "}@{verbatim "'"} \\
wenzelm@28775
   121
    @{text latin} & = & @{verbatim a}@{text "  | \<dots> |  "}@{verbatim z}@{text "  |  "}@{verbatim A}@{text "  |  \<dots> |  "}@{verbatim Z} \\
wenzelm@28775
   122
    @{text digit} & = & @{verbatim "0"}@{text "  |  \<dots> |  "}@{verbatim "9"} \\
wenzelm@28775
   123
    @{text sym} & = & @{verbatim "!"}@{text "  |  "}@{verbatim "#"}@{text "  |  "}@{verbatim "$"}@{text "  |  "}@{verbatim "%"}@{text "  |  "}@{verbatim "&"}@{text "  |  "}@{verbatim "*"}@{text "  |  "}@{verbatim "+"}@{text "  |  "}@{verbatim "-"}@{text "  |  "}@{verbatim "/"}@{text "  |"} \\
wenzelm@28775
   124
    & & @{verbatim "<"}@{text "  |  "}@{verbatim "="}@{text "  |  "}@{verbatim ">"}@{text "  |  "}@{verbatim "?"}@{text "  |  "}@{verbatim "@"}@{text "  |  "}@{verbatim "^"}@{text "  |  "}@{verbatim "_"}@{text "  |  "}@{verbatim "|"}@{text "  |  "}@{verbatim "~"} \\
wenzelm@28775
   125
    @{text greek} & = & @{verbatim "\<alpha>"}@{text "  |  "}@{verbatim "\<beta>"}@{text "  |  "}@{verbatim "\<gamma>"}@{text "  |  "}@{verbatim "\<delta>"}@{text "  |"} \\
wenzelm@28775
   126
          &   & @{verbatim "\<epsilon>"}@{text "  |  "}@{verbatim "\<zeta>"}@{text "  |  "}@{verbatim "\<eta>"}@{text "  |  "}@{verbatim "\<theta>"}@{text "  |"} \\
wenzelm@28775
   127
          &   & @{verbatim "\<iota>"}@{text "  |  "}@{verbatim "\<kappa>"}@{text "  |  "}@{verbatim "\<mu>"}@{text "  |  "}@{verbatim "\<nu>"}@{text "  |"} \\
wenzelm@28775
   128
          &   & @{verbatim "\<xi>"}@{text "  |  "}@{verbatim "\<pi>"}@{text "  |  "}@{verbatim "\<rho>"}@{text "  |  "}@{verbatim "\<sigma>"}@{text "  |  "}@{verbatim "\<tau>"}@{text "  |"} \\
wenzelm@28775
   129
          &   & @{verbatim "\<upsilon>"}@{text "  |  "}@{verbatim "\<phi>"}@{text "  |  "}@{verbatim "\<chi>"}@{text "  |  "}@{verbatim "\<psi>"}@{text "  |"} \\
wenzelm@28775
   130
          &   & @{verbatim "\<omega>"}@{text "  |  "}@{verbatim "\<Gamma>"}@{text "  |  "}@{verbatim "\<Delta>"}@{text "  |  "}@{verbatim "\<Theta>"}@{text "  |"} \\
wenzelm@28775
   131
          &   & @{verbatim "\<Lambda>"}@{text "  |  "}@{verbatim "\<Xi>"}@{text "  |  "}@{verbatim "\<Pi>"}@{text "  |  "}@{verbatim "\<Sigma>"}@{text "  |"} \\
wenzelm@28775
   132
          &   & @{verbatim "\<Upsilon>"}@{text "  |  "}@{verbatim "\<Phi>"}@{text "  |  "}@{verbatim "\<Psi>"}@{text "  |  "}@{verbatim "\<Omega>"} \\
wenzelm@28775
   133
  \end{supertabular}
wenzelm@28776
   134
  \end{center}
wenzelm@27037
   135
wenzelm@28778
   136
  A @{syntax_ref var} or @{syntax_ref typevar} describes an unknown,
wenzelm@28778
   137
  which is internally a pair of base name and index (ML type @{ML_type
wenzelm@28778
   138
  indexname}).  These components are either separated by a dot as in
wenzelm@28778
   139
  @{text "?x.1"} or @{text "?x7.3"} or run together as in @{text
wenzelm@28778
   140
  "?x1"}.  The latter form is possible if the base name does not end
wenzelm@28778
   141
  with digits.  If the index is 0, it may be dropped altogether:
wenzelm@28778
   142
  @{text "?x"} and @{text "?x0"} and @{text "?x.0"} all refer to the
wenzelm@28778
   143
  same unknown, with basename @{text "x"} and index 0.
wenzelm@28778
   144
wenzelm@28778
   145
  The syntax of @{syntax_ref string} admits any characters, including
wenzelm@27037
   146
  newlines; ``@{verbatim "\""}'' (double-quote) and ``@{verbatim
wenzelm@27037
   147
  "\\"}'' (backslash) need to be escaped by a backslash; arbitrary
wenzelm@27037
   148
  character codes may be specified as ``@{verbatim "\\"}@{text ddd}'',
wenzelm@27037
   149
  with three decimal digits.  Alternative strings according to
wenzelm@28778
   150
  @{syntax_ref altstring} are analogous, using single back-quotes
wenzelm@28778
   151
  instead.
wenzelm@28778
   152
wenzelm@28778
   153
  The body of @{syntax_ref verbatim} may consist of any text not
wenzelm@27037
   154
  containing ``@{verbatim "*"}@{verbatim "}"}''; this allows
wenzelm@28778
   155
  convenient inclusion of quotes without further escapes.  There is no
wenzelm@28778
   156
  way to escape ``@{verbatim "*"}@{verbatim "}"}''.  If the quoted
wenzelm@28778
   157
  text is {\LaTeX} source, one may usually add some blank or comment
wenzelm@28778
   158
  to avoid the critical character sequence.
wenzelm@28778
   159
wenzelm@28778
   160
  Source comments take the form @{verbatim "(*"}~@{text
wenzelm@28778
   161
  "\<dots>"}~@{verbatim "*)"} and may be nested, although the user-interface
wenzelm@28778
   162
  might prevent this.  Note that this form indicates source comments
wenzelm@28778
   163
  only, which are stripped after lexical analysis of the input.  The
wenzelm@28778
   164
  Isar syntax also provides proper \emph{document comments} that are
wenzelm@28778
   165
  considered as part of the text (see \secref{sec:comments}).
wenzelm@27037
   166
wenzelm@27037
   167
  Common mathematical symbols such as @{text \<forall>} are represented in
wenzelm@27037
   168
  Isabelle as @{verbatim \<forall>}.  There are infinitely many Isabelle
wenzelm@27037
   169
  symbols like this, although proper presentation is left to front-end
wenzelm@27037
   170
  tools such as {\LaTeX} or Proof~General with the X-Symbol package.
wenzelm@30045
   171
  A list of predefined Isabelle symbols that work well with these
wenzelm@30045
   172
  tools is given in \appref{app:symbols}.  Note that @{verbatim "\<lambda>"}
wenzelm@30045
   173
  does not belong to the @{text letter} category, since it is already
wenzelm@30045
   174
  used differently in the Pure term language.
wenzelm@27037
   175
*}
wenzelm@27037
   176
wenzelm@27037
   177
wenzelm@27037
   178
section {* Common syntax entities *}
wenzelm@27037
   179
wenzelm@27037
   180
text {*
wenzelm@27037
   181
  We now introduce several basic syntactic entities, such as names,
wenzelm@27037
   182
  terms, and theorem specifications, which are factored out of the
wenzelm@27037
   183
  actual Isar language elements to be described later.
wenzelm@27037
   184
*}
wenzelm@27037
   185
wenzelm@27037
   186
wenzelm@27037
   187
subsection {* Names *}
wenzelm@27037
   188
wenzelm@27037
   189
text {*
wenzelm@27037
   190
  Entity \railqtok{name} usually refers to any name of types,
wenzelm@27037
   191
  constants, theorems etc.\ that are to be \emph{declared} or
wenzelm@27037
   192
  \emph{defined} (so qualified identifiers are excluded here).  Quoted
wenzelm@27037
   193
  strings provide an escape for non-identifier names or those ruled
wenzelm@27037
   194
  out by outer syntax keywords (e.g.\ quoted @{verbatim "\"let\""}).
wenzelm@27037
   195
  Already existing objects are usually referenced by
wenzelm@27037
   196
  \railqtok{nameref}.
wenzelm@27037
   197
wenzelm@27037
   198
  \indexoutertoken{name}\indexoutertoken{parname}\indexoutertoken{nameref}
wenzelm@27037
   199
  \indexoutertoken{int}
wenzelm@27037
   200
  \begin{rail}
wenzelm@27037
   201
    name: ident | symident | string | nat
wenzelm@27037
   202
    ;
wenzelm@27037
   203
    parname: '(' name ')'
wenzelm@27037
   204
    ;
wenzelm@27037
   205
    nameref: name | longident
wenzelm@27037
   206
    ;
wenzelm@27037
   207
    int: nat | '-' nat
wenzelm@27037
   208
    ;
wenzelm@27037
   209
  \end{rail}
wenzelm@27037
   210
*}
wenzelm@27037
   211
wenzelm@27037
   212
wenzelm@27037
   213
subsection {* Comments \label{sec:comments} *}
wenzelm@27037
   214
wenzelm@27037
   215
text {*
wenzelm@27037
   216
  Large chunks of plain \railqtok{text} are usually given
wenzelm@27037
   217
  \railtok{verbatim}, i.e.\ enclosed in @{verbatim "{"}@{verbatim
wenzelm@27037
   218
  "*"}~@{text "\<dots>"}~@{verbatim "*"}@{verbatim "}"}.  For convenience,
wenzelm@27037
   219
  any of the smaller text units conforming to \railqtok{nameref} are
wenzelm@27037
   220
  admitted as well.  A marginal \railnonterm{comment} is of the form
wenzelm@27037
   221
  @{verbatim "--"} \railqtok{text}.  Any number of these may occur
wenzelm@27037
   222
  within Isabelle/Isar commands.
wenzelm@27037
   223
wenzelm@27037
   224
  \indexoutertoken{text}\indexouternonterm{comment}
wenzelm@27037
   225
  \begin{rail}
wenzelm@27037
   226
    text: verbatim | nameref
wenzelm@27037
   227
    ;
wenzelm@27037
   228
    comment: '--' text
wenzelm@27037
   229
    ;
wenzelm@27037
   230
  \end{rail}
wenzelm@27037
   231
*}
wenzelm@27037
   232
wenzelm@27037
   233
wenzelm@27037
   234
subsection {* Type classes, sorts and arities *}
wenzelm@27037
   235
wenzelm@27037
   236
text {*
wenzelm@27037
   237
  Classes are specified by plain names.  Sorts have a very simple
wenzelm@27037
   238
  inner syntax, which is either a single class name @{text c} or a
wenzelm@27037
   239
  list @{text "{c\<^sub>1, \<dots>, c\<^sub>n}"} referring to the
wenzelm@27037
   240
  intersection of these classes.  The syntax of type arities is given
wenzelm@27037
   241
  directly at the outer level.
wenzelm@27037
   242
wenzelm@27037
   243
  \indexouternonterm{sort}\indexouternonterm{arity}
wenzelm@27037
   244
  \indexouternonterm{classdecl}
wenzelm@27037
   245
  \begin{rail}
wenzelm@27037
   246
    classdecl: name (('<' | subseteq) (nameref + ','))?
wenzelm@27037
   247
    ;
wenzelm@27037
   248
    sort: nameref
wenzelm@27037
   249
    ;
wenzelm@27037
   250
    arity: ('(' (sort + ',') ')')? sort
wenzelm@27037
   251
    ;
wenzelm@27037
   252
  \end{rail}
wenzelm@27037
   253
*}
wenzelm@27037
   254
wenzelm@27037
   255
wenzelm@27037
   256
subsection {* Types and terms \label{sec:types-terms} *}
wenzelm@27037
   257
wenzelm@27037
   258
text {*
wenzelm@27037
   259
  The actual inner Isabelle syntax, that of types and terms of the
wenzelm@27037
   260
  logic, is far too sophisticated in order to be modelled explicitly
wenzelm@27037
   261
  at the outer theory level.  Basically, any such entity has to be
wenzelm@27037
   262
  quoted to turn it into a single token (the parsing and type-checking
wenzelm@27037
   263
  is performed internally later).  For convenience, a slightly more
wenzelm@27037
   264
  liberal convention is adopted: quotes may be omitted for any type or
wenzelm@27037
   265
  term that is already atomic at the outer level.  For example, one
wenzelm@27037
   266
  may just write @{verbatim x} instead of quoted @{verbatim "\"x\""}.
wenzelm@27037
   267
  Note that symbolic identifiers (e.g.\ @{verbatim "++"} or @{text
wenzelm@27037
   268
  "\<forall>"} are available as well, provided these have not been superseded
wenzelm@27037
   269
  by commands or other keywords already (such as @{verbatim "="} or
wenzelm@27037
   270
  @{verbatim "+"}).
wenzelm@27037
   271
wenzelm@27037
   272
  \indexoutertoken{type}\indexoutertoken{term}\indexoutertoken{prop}
wenzelm@27037
   273
  \begin{rail}
wenzelm@27037
   274
    type: nameref | typefree | typevar
wenzelm@27037
   275
    ;
wenzelm@27037
   276
    term: nameref | var
wenzelm@27037
   277
    ;
wenzelm@27037
   278
    prop: term
wenzelm@27037
   279
    ;
wenzelm@27037
   280
  \end{rail}
wenzelm@27037
   281
wenzelm@27037
   282
  Positional instantiations are indicated by giving a sequence of
wenzelm@27037
   283
  terms, or the placeholder ``@{text _}'' (underscore), which means to
wenzelm@27037
   284
  skip a position.
wenzelm@27037
   285
wenzelm@27037
   286
  \indexoutertoken{inst}\indexoutertoken{insts}
wenzelm@27037
   287
  \begin{rail}
wenzelm@27037
   288
    inst: underscore | term
wenzelm@27037
   289
    ;
wenzelm@27037
   290
    insts: (inst *)
wenzelm@27037
   291
    ;
wenzelm@27037
   292
  \end{rail}
wenzelm@27037
   293
wenzelm@27037
   294
  Type declarations and definitions usually refer to
wenzelm@27037
   295
  \railnonterm{typespec} on the left-hand side.  This models basic
wenzelm@27037
   296
  type constructor application at the outer syntax level.  Note that
wenzelm@27037
   297
  only plain postfix notation is available here, but no infixes.
wenzelm@27037
   298
wenzelm@27037
   299
  \indexouternonterm{typespec}
wenzelm@35841
   300
  \indexouternonterm{typespecsorts}
wenzelm@27037
   301
  \begin{rail}
wenzelm@27037
   302
    typespec: (() | typefree | '(' ( typefree + ',' ) ')') name
wenzelm@27037
   303
    ;
wenzelm@35841
   304
wenzelm@35841
   305
    typespecsorts: (() | (typefree ('::' sort)?) | '(' ( (typefree ('::' sort)?) + ',' ) ')') name
wenzelm@35841
   306
    ;
wenzelm@27037
   307
  \end{rail}
wenzelm@27037
   308
*}
wenzelm@27037
   309
wenzelm@27037
   310
wenzelm@28754
   311
subsection {* Term patterns and declarations \label{sec:term-decls} *}
wenzelm@28754
   312
wenzelm@28754
   313
text {*
wenzelm@28754
   314
  Wherever explicit propositions (or term fragments) occur in a proof
wenzelm@28754
   315
  text, casual binding of schematic term variables may be given
wenzelm@28754
   316
  specified via patterns of the form ``@{text "(\<IS> p\<^sub>1 \<dots>
wenzelm@28754
   317
  p\<^sub>n)"}''.  This works both for \railqtok{term} and \railqtok{prop}.
wenzelm@28754
   318
wenzelm@28754
   319
  \indexouternonterm{termpat}\indexouternonterm{proppat}
wenzelm@28754
   320
  \begin{rail}
wenzelm@28754
   321
    termpat: '(' ('is' term +) ')'
wenzelm@28754
   322
    ;
wenzelm@28754
   323
    proppat: '(' ('is' prop +) ')'
wenzelm@28754
   324
    ;
wenzelm@28754
   325
  \end{rail}
wenzelm@28754
   326
wenzelm@28754
   327
  \medskip Declarations of local variables @{text "x :: \<tau>"} and
wenzelm@28754
   328
  logical propositions @{text "a : \<phi>"} represent different views on
wenzelm@28754
   329
  the same principle of introducing a local scope.  In practice, one
wenzelm@28754
   330
  may usually omit the typing of \railnonterm{vars} (due to
wenzelm@28754
   331
  type-inference), and the naming of propositions (due to implicit
wenzelm@28754
   332
  references of current facts).  In any case, Isar proof elements
wenzelm@28754
   333
  usually admit to introduce multiple such items simultaneously.
wenzelm@28754
   334
wenzelm@28754
   335
  \indexouternonterm{vars}\indexouternonterm{props}
wenzelm@28754
   336
  \begin{rail}
wenzelm@28754
   337
    vars: (name+) ('::' type)?
wenzelm@28754
   338
    ;
wenzelm@28754
   339
    props: thmdecl? (prop proppat? +)
wenzelm@28754
   340
    ;
wenzelm@28754
   341
  \end{rail}
wenzelm@28754
   342
wenzelm@28754
   343
  The treatment of multiple declarations corresponds to the
wenzelm@28754
   344
  complementary focus of \railnonterm{vars} versus
wenzelm@28754
   345
  \railnonterm{props}.  In ``@{text "x\<^sub>1 \<dots> x\<^sub>n :: \<tau>"}''
wenzelm@28754
   346
  the typing refers to all variables, while in @{text "a: \<phi>\<^sub>1 \<dots>
wenzelm@28754
   347
  \<phi>\<^sub>n"} the naming refers to all propositions collectively.
wenzelm@28754
   348
  Isar language elements that refer to \railnonterm{vars} or
wenzelm@28754
   349
  \railnonterm{props} typically admit separate typings or namings via
wenzelm@28754
   350
  another level of iteration, with explicit @{keyword_ref "and"}
wenzelm@28754
   351
  separators; e.g.\ see @{command "fix"} and @{command "assume"} in
wenzelm@28754
   352
  \secref{sec:proof-context}.
wenzelm@28754
   353
*}
wenzelm@28754
   354
wenzelm@28754
   355
wenzelm@27037
   356
subsection {* Attributes and theorems \label{sec:syn-att} *}
wenzelm@27037
   357
wenzelm@28754
   358
text {* Attributes have their own ``semi-inner'' syntax, in the sense
wenzelm@28754
   359
  that input conforming to \railnonterm{args} below is parsed by the
wenzelm@28754
   360
  attribute a second time.  The attribute argument specifications may
wenzelm@28754
   361
  be any sequence of atomic entities (identifiers, strings etc.), or
wenzelm@28754
   362
  properly bracketed argument lists.  Below \railqtok{atom} refers to
wenzelm@28754
   363
  any atomic entity, including any \railtok{keyword} conforming to
wenzelm@28754
   364
  \railtok{symident}.
wenzelm@27037
   365
wenzelm@27037
   366
  \indexoutertoken{atom}\indexouternonterm{args}\indexouternonterm{attributes}
wenzelm@27037
   367
  \begin{rail}
wenzelm@27037
   368
    atom: nameref | typefree | typevar | var | nat | keyword
wenzelm@27037
   369
    ;
wenzelm@27037
   370
    arg: atom | '(' args ')' | '[' args ']'
wenzelm@27037
   371
    ;
wenzelm@27037
   372
    args: arg *
wenzelm@27037
   373
    ;
wenzelm@27037
   374
    attributes: '[' (nameref args * ',') ']'
wenzelm@27037
   375
    ;
wenzelm@27037
   376
  \end{rail}
wenzelm@27037
   377
wenzelm@27037
   378
  Theorem specifications come in several flavors:
wenzelm@27037
   379
  \railnonterm{axmdecl} and \railnonterm{thmdecl} usually refer to
wenzelm@27037
   380
  axioms, assumptions or results of goal statements, while
wenzelm@27037
   381
  \railnonterm{thmdef} collects lists of existing theorems.  Existing
wenzelm@27037
   382
  theorems are given by \railnonterm{thmref} and
wenzelm@27037
   383
  \railnonterm{thmrefs}, the former requires an actual singleton
wenzelm@27037
   384
  result.
wenzelm@27037
   385
wenzelm@27037
   386
  There are three forms of theorem references:
wenzelm@27037
   387
  \begin{enumerate}
wenzelm@27037
   388
  
wenzelm@27037
   389
  \item named facts @{text "a"},
wenzelm@27037
   390
wenzelm@27037
   391
  \item selections from named facts @{text "a(i)"} or @{text "a(j - k)"},
wenzelm@27037
   392
wenzelm@27037
   393
  \item literal fact propositions using @{syntax_ref altstring} syntax
wenzelm@27037
   394
  @{verbatim "`"}@{text "\<phi>"}@{verbatim "`"} (see also method
wenzelm@28754
   395
  @{method_ref fact}).
wenzelm@27037
   396
wenzelm@27037
   397
  \end{enumerate}
wenzelm@27037
   398
wenzelm@27037
   399
  Any kind of theorem specification may include lists of attributes
wenzelm@27037
   400
  both on the left and right hand sides; attributes are applied to any
wenzelm@27037
   401
  immediately preceding fact.  If names are omitted, the theorems are
wenzelm@27037
   402
  not stored within the theorem database of the theory or proof
wenzelm@27037
   403
  context, but any given attributes are applied nonetheless.
wenzelm@27037
   404
wenzelm@27037
   405
  An extra pair of brackets around attributes (like ``@{text
wenzelm@27037
   406
  "[[simproc a]]"}'') abbreviates a theorem reference involving an
wenzelm@27037
   407
  internal dummy fact, which will be ignored later on.  So only the
wenzelm@27037
   408
  effect of the attribute on the background context will persist.
wenzelm@27037
   409
  This form of in-place declarations is particularly useful with
wenzelm@27037
   410
  commands like @{command "declare"} and @{command "using"}.
wenzelm@27037
   411
wenzelm@27037
   412
  \indexouternonterm{axmdecl}\indexouternonterm{thmdecl}
wenzelm@27037
   413
  \indexouternonterm{thmdef}\indexouternonterm{thmref}
wenzelm@27037
   414
  \indexouternonterm{thmrefs}\indexouternonterm{selection}
wenzelm@27037
   415
  \begin{rail}
wenzelm@27037
   416
    axmdecl: name attributes? ':'
wenzelm@27037
   417
    ;
wenzelm@27037
   418
    thmdecl: thmbind ':'
wenzelm@27037
   419
    ;
wenzelm@27037
   420
    thmdef: thmbind '='
wenzelm@27037
   421
    ;
wenzelm@27037
   422
    thmref: (nameref selection? | altstring) attributes? | '[' attributes ']'
wenzelm@27037
   423
    ;
wenzelm@27037
   424
    thmrefs: thmref +
wenzelm@27037
   425
    ;
wenzelm@27037
   426
wenzelm@27037
   427
    thmbind: name attributes | name | attributes
wenzelm@27037
   428
    ;
wenzelm@27037
   429
    selection: '(' ((nat | nat '-' nat?) + ',') ')'
wenzelm@27037
   430
    ;
wenzelm@27037
   431
  \end{rail}
wenzelm@27037
   432
*}
wenzelm@27037
   433
wenzelm@27037
   434
end