doc-src/Sledgehammer/sledgehammer.tex
author blanchet
Fri, 20 May 2011 12:47:59 +0200
changeset 43757 4da581400b69
parent 43756 771be1dcfef6
child 43781 f838586ebec2
permissions -rw-r--r--
added see also
blanchet@36918
     1
\documentclass[a4paper,12pt]{article}
blanchet@36918
     2
\usepackage[T1]{fontenc}
blanchet@36918
     3
\usepackage{amsmath}
blanchet@36918
     4
\usepackage{amssymb}
blanchet@36918
     5
\usepackage[english,french]{babel}
blanchet@36918
     6
\usepackage{color}
blanchet@36918
     7
\usepackage{footmisc}
blanchet@36918
     8
\usepackage{graphicx}
blanchet@36918
     9
%\usepackage{mathpazo}
blanchet@36918
    10
\usepackage{multicol}
blanchet@36918
    11
\usepackage{stmaryrd}
blanchet@36918
    12
%\usepackage[scaled=.85]{beramono}
wenzelm@43382
    13
\usepackage{../../lib/texinputs/isabelle,../iman,../pdfsetup}
blanchet@36918
    14
blanchet@36918
    15
%\oddsidemargin=4.6mm
blanchet@36918
    16
%\evensidemargin=4.6mm
blanchet@36918
    17
%\textwidth=150mm
blanchet@36918
    18
%\topmargin=4.6mm
blanchet@36918
    19
%\headheight=0mm
blanchet@36918
    20
%\headsep=0mm
blanchet@36918
    21
%\textheight=234mm
blanchet@36918
    22
blanchet@36918
    23
\def\Colon{\mathord{:\mkern-1.5mu:}}
blanchet@36918
    24
%\def\lbrakk{\mathopen{\lbrack\mkern-3.25mu\lbrack}}
blanchet@36918
    25
%\def\rbrakk{\mathclose{\rbrack\mkern-3.255mu\rbrack}}
blanchet@36918
    26
\def\lparr{\mathopen{(\mkern-4mu\mid}}
blanchet@36918
    27
\def\rparr{\mathclose{\mid\mkern-4mu)}}
blanchet@36918
    28
blanchet@36918
    29
\def\unk{{?}}
blanchet@36918
    30
\def\undef{(\lambda x.\; \unk)}
blanchet@36918
    31
%\def\unr{\textit{others}}
blanchet@36918
    32
\def\unr{\ldots}
blanchet@36918
    33
\def\Abs#1{\hbox{\rm{\flqq}}{\,#1\,}\hbox{\rm{\frqq}}}
blanchet@36918
    34
\def\Q{{\smash{\lower.2ex\hbox{$\scriptstyle?$}}}}
blanchet@36918
    35
blanchet@36918
    36
\urlstyle{tt}
blanchet@36918
    37
blanchet@36918
    38
\begin{document}
blanchet@36918
    39
blanchet@36918
    40
\selectlanguage{english}
blanchet@36918
    41
blanchet@36918
    42
\title{\includegraphics[scale=0.5]{isabelle_sledgehammer} \\[4ex]
blanchet@36918
    43
Hammering Away \\[\smallskipamount]
blanchet@36918
    44
\Large A User's Guide to Sledgehammer for Isabelle/HOL}
blanchet@36918
    45
\author{\hbox{} \\
blanchet@36918
    46
Jasmin Christian Blanchette \\
blanchet@36918
    47
{\normalsize Institut f\"ur Informatik, Technische Universit\"at M\"unchen} \\
blanchet@36918
    48
\hbox{}}
blanchet@36918
    49
blanchet@36918
    50
\maketitle
blanchet@36918
    51
blanchet@36918
    52
\tableofcontents
blanchet@36918
    53
blanchet@36918
    54
\setlength{\parskip}{.7em plus .2em minus .1em}
blanchet@36918
    55
\setlength{\parindent}{0pt}
blanchet@36918
    56
\setlength{\abovedisplayskip}{\parskip}
blanchet@36918
    57
\setlength{\abovedisplayshortskip}{.9\parskip}
blanchet@36918
    58
\setlength{\belowdisplayskip}{\parskip}
blanchet@36918
    59
\setlength{\belowdisplayshortskip}{.9\parskip}
blanchet@36918
    60
blanchet@36918
    61
% General-purpose enum environment with correct spacing
blanchet@36918
    62
\newenvironment{enum}%
blanchet@36918
    63
    {\begin{list}{}{%
blanchet@36918
    64
        \setlength{\topsep}{.1\parskip}%
blanchet@36918
    65
        \setlength{\partopsep}{.1\parskip}%
blanchet@36918
    66
        \setlength{\itemsep}{\parskip}%
blanchet@36918
    67
        \advance\itemsep by-\parsep}}
blanchet@36918
    68
    {\end{list}}
blanchet@36918
    69
blanchet@36918
    70
\def\pre{\begingroup\vskip0pt plus1ex\advance\leftskip by\leftmargin
blanchet@36918
    71
\advance\rightskip by\leftmargin}
blanchet@36918
    72
\def\post{\vskip0pt plus1ex\endgroup}
blanchet@36918
    73
blanchet@36918
    74
\def\prew{\pre\advance\rightskip by-\leftmargin}
blanchet@36918
    75
\def\postw{\post}
blanchet@36918
    76
blanchet@36918
    77
\section{Introduction}
blanchet@36918
    78
\label{introduction}
blanchet@36918
    79
blanchet@36918
    80
Sledgehammer is a tool that applies first-order automatic theorem provers (ATPs)
blanchet@41190
    81
and satisfiability-modulo-theories (SMT) solvers on the current goal. The
blanchet@40254
    82
supported ATPs are E \cite{schulz-2002}, SPASS \cite{weidenbach-et-al-2009},
blanchet@43628
    83
Vampire \cite{riazanov-voronkov-2002}, SInE-E \cite{sine}, SNARK
blanchet@43628
    84
\cite{snark}, and ToFoF-E \cite{tofof}. The ATPs are run either locally or
blanchet@43628
    85
remotely via the System\-On\-TPTP web service \cite{sutcliffe-2000}. In addition
blanchet@43628
    86
to the ATPs, the SMT solvers Z3 \cite{z3} is used by default, and you can tell
blanchet@43628
    87
Sledgehammer to try Yices \cite{yices} and CVC3 \cite{cvc3} as well; these
blanchet@43628
    88
are run either locally or on a server in Munich.
blanchet@36918
    89
blanchet@40254
    90
The problem passed to the automatic provers consists of your current goal
blanchet@40254
    91
together with a heuristic selection of hundreds of facts (theorems) from the
blanchet@40254
    92
current theory context, filtered by relevance. Because jobs are run in the
blanchet@40254
    93
background, you can continue to work on your proof by other means. Provers can
blanchet@40254
    94
be run in parallel. Any reply (which may arrive half a minute later) will appear
blanchet@40254
    95
in the Proof General response buffer.
blanchet@37517
    96
blanchet@40254
    97
The result of a successful proof search is some source text that usually (but
blanchet@40254
    98
not always) reconstructs the proof within Isabelle. For ATPs, the reconstructed
blanchet@40254
    99
proof relies on the general-purpose Metis prover \cite{metis}, which is fully
blanchet@40254
   100
integrated into Isabelle/HOL, with explicit inferences going through the kernel.
blanchet@40254
   101
Thus its results are correct by construction.
blanchet@36918
   102
blanchet@39566
   103
In this manual, we will explicitly invoke the \textbf{sledgehammer} command.
blanchet@39566
   104
Sledgehammer also provides an automatic mode that can be enabled via the
blanchet@39566
   105
``Auto Sledgehammer'' option from the ``Isabelle'' menu in Proof General. In
blanchet@39566
   106
this mode, Sledgehammer is run on every newly entered theorem. The time limit
blanchet@39566
   107
for Auto Sledgehammer and other automatic tools can be set using the ``Auto
blanchet@39566
   108
Tools Time Limit'' option.
blanchet@39566
   109
blanchet@36918
   110
\newbox\boxA
blanchet@36918
   111
\setbox\boxA=\hbox{\texttt{nospam}}
blanchet@36918
   112
blanchet@43628
   113
\newcommand\authoremail{\texttt{blan{\color{white}nospam}\kern-\wd\boxA{}chette@\allowbreak
blanchet@43628
   114
in.\allowbreak tum.\allowbreak de}}
blanchet@43628
   115
blanchet@40937
   116
To run Sledgehammer, you must make sure that the theory \textit{Sledgehammer} is
blanchet@40937
   117
imported---this is rarely a problem in practice since it is part of
blanchet@40937
   118
\textit{Main}. Examples of Sledgehammer use can be found in Isabelle's
blanchet@36918
   119
\texttt{src/HOL/Metis\_Examples} directory.
blanchet@36918
   120
Comments and bug reports concerning Sledgehammer or this manual should be
blanchet@43752
   121
directed to the author at \authoremail.
blanchet@36918
   122
blanchet@36918
   123
\vskip2.5\smallskipamount
blanchet@36918
   124
blanchet@36918
   125
%\textbf{Acknowledgment.} The author would like to thank Mark Summerfield for
blanchet@36918
   126
%suggesting several textual improvements.
blanchet@36918
   127
blanchet@36918
   128
\section{Installation}
blanchet@36918
   129
\label{installation}
blanchet@36918
   130
blanchet@36918
   131
Sledgehammer is part of Isabelle, so you don't need to install it. However, it
blanchet@43628
   132
relies on third-party automatic theorem provers (ATPs) and SMT solvers.
blanchet@43628
   133
blanchet@43628
   134
\subsection{Installing ATPs}
blanchet@43628
   135
blanchet@40254
   136
Currently, E, SPASS, and Vampire can be run locally; in addition, E, Vampire,
blanchet@43628
   137
SInE-E, SNARK, and ToFoF-E are available remotely via System\-On\-TPTP
blanchet@43628
   138
\cite{sutcliffe-2000}. If you want better performance, you should at least
blanchet@43628
   139
install E and SPASS locally.
blanchet@36918
   140
blanchet@38289
   141
There are three main ways to install ATPs on your machine:
blanchet@36918
   142
blanchet@36918
   143
\begin{enum}
blanchet@36918
   144
\item[$\bullet$] If you installed an official Isabelle package with everything
blanchet@36918
   145
inside, it should already include properly setup executables for E and SPASS,
blanchet@38289
   146
ready to use.%
blanchet@38289
   147
\footnote{Vampire's license prevents us from doing the same for this otherwise
blanchet@38289
   148
wonderful tool.}
blanchet@36918
   149
blanchet@38289
   150
\item[$\bullet$] Alternatively, you can download the Isabelle-aware E and SPASS
blanchet@36918
   151
binary packages from Isabelle's download page. Extract the archives, then add a
blanchet@42618
   152
line to your \texttt{\$ISABELLE\_HOME\_USER/etc/components}%
blanchet@42618
   153
\footnote{The variable \texttt{\$ISABELLE\_HOME\_USER} is set by Isabelle at
blanchet@42618
   154
startup. Its value can be retrieved by invoking \texttt{isabelle}
blanchet@42618
   155
\texttt{getenv} \texttt{ISABELLE\_HOME\_USER} on the command line.}
blanchet@42618
   156
file with the absolute
blanchet@40444
   157
path to E or SPASS. For example, if the \texttt{components} does not exist yet
blanchet@40444
   158
and you extracted SPASS to \texttt{/usr/local/spass-3.7}, create the
blanchet@40444
   159
\texttt{components} file with the single line
blanchet@36918
   160
blanchet@36918
   161
\prew
blanchet@36918
   162
\texttt{/usr/local/spass-3.7}
blanchet@36918
   163
\postw
blanchet@36918
   164
blanchet@38289
   165
in it.
blanchet@38289
   166
blanchet@38289
   167
\item[$\bullet$] If you prefer to build E or SPASS yourself, or obtained a
blanchet@38289
   168
Vampire executable from somewhere (e.g., \url{http://www.vprover.org/}),
blanchet@38289
   169
set the environment variable \texttt{E\_HOME}, \texttt{SPASS\_HOME}, or
blanchet@38289
   170
\texttt{VAMPIRE\_HOME} to the directory that contains the \texttt{eproof},
blanchet@38309
   171
\texttt{SPASS}, or \texttt{vampire} executable. Sledgehammer has been tested
blanchet@43714
   172
with E 1.0 and 1.2, SPASS 3.5 and 3.7, and Vampire 0.6 and 1.0%
blanchet@38309
   173
\footnote{Following the rewrite of Vampire, the counter for version numbers was
blanchet@43714
   174
reset to 0; hence the (new) Vampire versions 0.6 and 1.0 are more recent than,
blanchet@43714
   175
say, Vampire 11.5.}%
blanchet@38309
   176
. Since the ATPs' output formats are neither documented nor stable, other
blanchet@43628
   177
versions of the ATPs might or might not work well with Sledgehammer. Ideally,
blanchet@43628
   178
also set \texttt{E\_VERSION}, \texttt{SPASS\_VERSION}, or
blanchet@43628
   179
\texttt{VAMPIRE\_VERSION} to the ATP's version number (e.g., ``1.2'').
blanchet@36918
   180
\end{enum}
blanchet@36918
   181
blanchet@43628
   182
To check whether E and SPASS are successfully installed, follow the example in
blanchet@43628
   183
\S\ref{first-steps}. If the remote versions of E and SPASS are used (identified
blanchet@43628
   184
by the prefix ``\emph{remote\_}''), or if the local versions fail to solve the
blanchet@43628
   185
easy goal presented there, this is a sign that something is wrong with your
blanchet@43628
   186
installation.
blanchet@36918
   187
blanchet@37517
   188
Remote ATP invocation via the SystemOnTPTP web service requires Perl with the
blanchet@39388
   189
World Wide Web Library (\texttt{libwww-perl}) installed. If you must use a proxy
blanchet@39388
   190
server to access the Internet, set the \texttt{http\_proxy} environment variable
blanchet@39389
   191
to the proxy, either in the environment in which Isabelle is launched or in your
blanchet@42618
   192
\texttt{\char`\~/\$ISABELLE\_HOME\_USER/etc/settings} file. Here are a few examples:
blanchet@39388
   193
blanchet@39388
   194
\prew
blanchet@39389
   195
\texttt{http\_proxy=http://proxy.example.org} \\
blanchet@39389
   196
\texttt{http\_proxy=http://proxy.example.org:8080} \\
blanchet@39389
   197
\texttt{http\_proxy=http://joeblow:pAsSwRd@proxy.example.org}
blanchet@39388
   198
\postw
blanchet@37517
   199
blanchet@43628
   200
\subsection{Installing SMT Solvers}
blanchet@43628
   201
blanchet@43628
   202
CVC3, Yices, and Z3 can be run locally or remotely on a Munich server. If you
blanchet@43628
   203
want better performance and get the ability to replay proofs that rely on the
blanchet@43628
   204
\emph{smt} proof method, you should at least install Z3 locally.
blanchet@43628
   205
blanchet@43628
   206
There are two main ways of installing SMT solvers locally.
blanchet@43628
   207
blanchet@43628
   208
\begin{enum}
blanchet@43628
   209
\item[$\bullet$] If you installed an official Isabelle package with everything
blanchet@43628
   210
inside, it should already include properly setup executables for CVC3 and Z3,
blanchet@43628
   211
ready to use.%
blanchet@43628
   212
\footnote{Yices's license prevents us from doing the same for this otherwise
blanchet@43628
   213
wonderful tool.}
blanchet@43628
   214
For Z3, you additionally need to set the environment variable
blanchet@43628
   215
\texttt{Z3\_NON\_COMMERCIAL} to ``yes'' to confirm that you are a noncommercial
blanchet@43628
   216
user.
blanchet@43628
   217
blanchet@43628
   218
\item[$\bullet$] Otherwise, follow the instructions documented in the \emph{SMT}
blanchet@43628
   219
theory (\texttt{\$ISABELLE\_HOME/src/HOL/SMT.thy}).
blanchet@43628
   220
\end{enum}
blanchet@43628
   221
blanchet@36918
   222
\section{First Steps}
blanchet@36918
   223
\label{first-steps}
blanchet@36918
   224
blanchet@36918
   225
To illustrate Sledgehammer in context, let us start a theory file and
blanchet@36918
   226
attempt to prove a simple lemma:
blanchet@36918
   227
blanchet@36918
   228
\prew
blanchet@36918
   229
\textbf{theory}~\textit{Scratch} \\
blanchet@36918
   230
\textbf{imports}~\textit{Main} \\
blanchet@36918
   231
\textbf{begin} \\[2\smallskipamount]
blanchet@36918
   232
%
blanchet@36918
   233
\textbf{lemma} ``$[a] = [b] \,\longleftrightarrow\, a = b$'' \\
blanchet@36918
   234
\textbf{sledgehammer}
blanchet@36918
   235
\postw
blanchet@36918
   236
blanchet@37517
   237
Instead of issuing the \textbf{sledgehammer} command, you can also find
blanchet@37517
   238
Sledgehammer in the ``Commands'' submenu of the ``Isabelle'' menu in Proof
blanchet@37517
   239
General or press the Emacs key sequence C-c C-a C-s.
blanchet@37517
   240
Either way, Sledgehammer produces the following output after a few seconds:
blanchet@36918
   241
blanchet@36918
   242
\prew
blanchet@36918
   243
\slshape
blanchet@40241
   244
Sledgehammer: ``\textit{e}'' for subgoal 1: \\
blanchet@36918
   245
$([a] = [b]) = (a = b)$ \\
blanchet@36918
   246
Try this command: \textbf{by} (\textit{metis hd.simps}). \\
blanchet@38289
   247
To minimize the number of lemmas, try this: \\
blanchet@40240
   248
\textbf{sledgehammer} \textit{minimize} [\textit{prover} = \textit{e}] (\textit{hd.simps}). \\[3\smallskipamount]
blanchet@36918
   249
%
blanchet@40241
   250
Sledgehammer: ``\textit{spass}'' for subgoal 1: \\
blanchet@36918
   251
$([a] = [b]) = (a = b)$ \\
blanchet@36918
   252
Try this command: \textbf{by} (\textit{metis insert\_Nil last\_ConsL}). \\
blanchet@38289
   253
To minimize the number of lemmas, try this: \\
blanchet@40240
   254
\textbf{sledgehammer} \textit{minimize} [\textit{prover} = \textit{spass}] (\textit{insert\_Nil last\_ConsL}). \\[3\smallskipamount]
blanchet@36918
   255
%
blanchet@40254
   256
Sledgehammer: ``\textit{vampire}'' for subgoal 1: \\
blanchet@36918
   257
$([a] = [b]) = (a = b)$ \\
blanchet@43715
   258
Try this command: \textbf{by} (\textit{metis eq\_commute last\_snoc}). \\
blanchet@38289
   259
To minimize the number of lemmas, try this: \\
blanchet@40254
   260
\textbf{sledgehammer} \textit{minimize} [\textit{prover} = \textit{vampire}]~(\textit{eq\_commute last\_snoc}). \\[3\smallskipamount]
blanchet@40254
   261
%
blanchet@40254
   262
Sledgehammer: ``\textit{remote\_sine\_e}'' for subgoal 1: \\
blanchet@40254
   263
$([a] = [b]) = (a = b)$ \\
blanchet@43715
   264
Try this command: \textbf{by} (\textit{metis hd.simps}). \\
blanchet@40254
   265
To minimize the number of lemmas, try this: \\
blanchet@43715
   266
\textbf{sledgehammer} \textit{minimize} [\textit{prover} = \textit{remote\_sine\_e}]~(\textit{hd.simps}). \\[3\smallskipamount]
blanchet@41190
   267
%
blanchet@41190
   268
Sledgehammer: ``\textit{remote\_z3}'' for subgoal 1: \\
blanchet@41190
   269
$([a] = [b]) = (a = b)$ \\
blanchet@43715
   270
Try this command: \textbf{by} (\textit{metis hd.simps}). \\
blanchet@41190
   271
To minimize the number of lemmas, try this: \\
blanchet@41190
   272
\textbf{sledgehammer} \textit{minimize} [\textit{prover} = \textit{remote\_sine\_e}]~(\textit{hd.simps}).
blanchet@36918
   273
\postw
blanchet@36918
   274
blanchet@41190
   275
Sledgehammer ran E, SPASS, Vampire, SInE-E, and Z3 in parallel. Depending on
blanchet@41190
   276
which provers are installed and how many processor cores are available, some of
blanchet@41190
   277
the provers might be missing or present with a \textit{remote\_} prefix.
blanchet@36918
   278
blanchet@40254
   279
For each successful prover, Sledgehammer gives a one-liner proof that uses the
blanchet@40254
   280
\textit{metis} or \textit{smt} method. You can click the proof to insert it into
blanchet@40254
   281
the theory text. You can click the ``\textbf{sledgehammer} \textit{minimize}''
blanchet@40254
   282
command if you want to look for a shorter (and probably faster) proof. But here
blanchet@40254
   283
the proof found by E looks perfect, so click it to finish the proof.
blanchet@36918
   284
blanchet@36918
   285
You can ask Sledgehammer for an Isar text proof by passing the
blanchet@43752
   286
\textit{isar\_proof} option (\S\ref{output-format}):
blanchet@36918
   287
blanchet@36918
   288
\prew
blanchet@36918
   289
\textbf{sledgehammer} [\textit{isar\_proof}]
blanchet@36918
   290
\postw
blanchet@36918
   291
blanchet@36918
   292
When Isar proof construction is successful, it can yield proofs that are more
blanchet@36918
   293
readable and also faster than the \textit{metis} one-liners. This feature is
blanchet@40254
   294
experimental and is only available for ATPs.
blanchet@36918
   295
blanchet@37517
   296
\section{Hints}
blanchet@37517
   297
\label{hints}
blanchet@37517
   298
blanchet@43753
   299
This section presents a few hints that should help you get the most out of
blanchet@43753
   300
Sledgehammer and Metis. Frequently (and infrequently) asked questions are
blanchet@43753
   301
answered in \S\ref{frequently-asked-questions}.
blanchet@43753
   302
blanchet@43628
   303
\newcommand\point[1]{{\sl\bfseries#1}\par\nopagebreak}
blanchet@43628
   304
blanchet@43628
   305
\point{Presimplify the goal}
blanchet@43628
   306
blanchet@37517
   307
For best results, first simplify your problem by calling \textit{auto} or at
blanchet@37517
   308
least \textit{safe} followed by \textit{simp\_all}. None of the ATPs contain
blanchet@37517
   309
arithmetic decision procedures. They are not especially good at heavy rewriting,
blanchet@37517
   310
but because they regard equations as undirected, they often prove theorems that
blanchet@37517
   311
require the reverse orientation of a \textit{simp} rule. Higher-order problems
blanchet@37517
   312
can be tackled, but the success rate is better for first-order problems. Hence,
blanchet@37517
   313
you may get better results if you first simplify the problem to remove
blanchet@37517
   314
higher-order features.
blanchet@37517
   315
blanchet@43628
   316
\point{Make sure at least E, SPASS, Vampire, and Z3 are installed}
blanchet@43628
   317
blanchet@43628
   318
Locally installed provers are faster and more reliable than those running on
blanchet@43628
   319
servers. See \S\ref{installation} for details on how to install them.
blanchet@43628
   320
blanchet@43628
   321
\point{Familiarize yourself with the most important options}
blanchet@43628
   322
blanchet@43628
   323
Sledgehammer's options are fully documented in \S\ref{command-syntax}. Many of
blanchet@43628
   324
the options are very specialized, but serious users of the tool should at least
blanchet@43628
   325
familiarize themselves with the following options:
blanchet@43628
   326
blanchet@43628
   327
\begin{enum}
blanchet@43753
   328
\item[$\bullet$] \textbf{\textit{provers}} (\S\ref{mode-of-operation}) specifies
blanchet@43753
   329
the automatic provers (ATPs and SMT solvers) that should be run whenever
blanchet@43753
   330
Sledgehammer is invoked (e.g., ``\textit{provers}~= \textit{e spass
blanchet@43753
   331
remote\_vampire}'').
blanchet@43628
   332
blanchet@43753
   333
\item[$\bullet$] \textbf{\textit{timeout}} (\S\ref{mode-of-operation}) controls
blanchet@43753
   334
the provers' time limit. It is set to 30 seconds, but since Sledgehammer runs
blanchet@43753
   335
asynchronously you should not hesitate to raise this limit to 60 or 120 seconds
blanchet@43753
   336
if you are the kind of user who can think clearly while ATPs are active.
blanchet@43628
   337
blanchet@43753
   338
\item[$\bullet$] \textbf{\textit{full\_types}} (\S\ref{problem-encoding})
blanchet@43753
   339
specifies whether type-sound encodings should be used. By default, Sledgehammer
blanchet@43753
   340
employs a mixture of type-sound and type-unsound encodings, occasionally
blanchet@43753
   341
yielding unsound ATP proofs. (SMT solver proofs should always be sound, although
blanchet@43753
   342
we occasionally find soundness bugs in the solvers.)
blanchet@43628
   343
blanchet@43753
   344
\item[$\bullet$] \textbf{\textit{max\_relevant}} (\S\ref{relevance-filter})
blanchet@43753
   345
specifies the maximum number of facts that should be passed to the provers. By
blanchet@43753
   346
default, the value is prover-dependent but varies between about 150 and 1000. If
blanchet@43753
   347
the provers time out, you can try lowering this value to, say, 100 or 50 and see
blanchet@43753
   348
if that helps.
blanchet@43628
   349
blanchet@43753
   350
\item[$\bullet$] \textbf{\textit{isar\_proof}} (\S\ref{output-format}) specifies
blanchet@43753
   351
that Isar proofs should be generated, instead of one-liner Metis proofs. The
blanchet@43753
   352
length of the Isar proofs can be controlled by setting
blanchet@43753
   353
\textit{isar\_shrink\_factor} (\S\ref{output-format}).
blanchet@43628
   354
\end{enum}
blanchet@43628
   355
blanchet@43753
   356
Options can be set globally using \textbf{sledgehammer\_params}
blanchet@43753
   357
(\S\ref{command-syntax}). Fact selection can be influenced by specifying
blanchet@43753
   358
``$(\textit{add}{:}~\textit{my\_facts})$'' after the \textbf{sledgehammer}
blanchet@43753
   359
call to ensure that certain facts are included, or simply
blanchet@43753
   360
``$(\textit{my\_facts})$'' to force Sledgehammer to run only with
blanchet@43753
   361
$\textit{my\_facts}$.
blanchet@43628
   362
blanchet@43628
   363
\section{Frequently Asked Questions}
blanchet@43628
   364
\label{frequently-asked-questions}
blanchet@43628
   365
blanchet@43628
   366
\point{Why does Metis fail to reconstruct the proof?}
blanchet@43628
   367
blanchet@43752
   368
There are many reasons. If Metis runs seemingly forever, that is a sign that the
blanchet@43752
   369
proof is too difficult for it. Metis is complete, so it should eventually find
blanchet@43752
   370
it, but that's little consolation. There are several possible solutions:
blanchet@43628
   371
blanchet@43628
   372
\begin{enum}
blanchet@43752
   373
\item[$\bullet$] Try the \textit{isar\_proof} option (\S\ref{output-format}) to
blanchet@43752
   374
obtain a step-by-step Isar proof where each step is justified by Metis. Since
blanchet@43752
   375
the steps are fairly small, Metis is more likely to be able to replay them.
blanchet@43628
   376
blanchet@43628
   377
\item[$\bullet$] Try the \textit{smt} proof method instead of \textit{metis}. It
blanchet@43628
   378
is usually stronger, but you need to have Z3 available to replay the proofs,
blanchet@43628
   379
trust the SMT solver, or use certificates. See the documentation in the
blanchet@43628
   380
\emph{SMT} theory (\texttt{\$ISABELLE\_HOME/src/HOL/SMT.thy}) for details.
blanchet@43628
   381
blanchet@43628
   382
\item[$\bullet$] Try the \textit{blast} or \textit{auto} proof methods, passing
blanchet@43628
   383
facts via \textbf{unfolding}, \textbf{using}, \textit{intro}{:},
blanchet@43628
   384
\textit{elim}{:}, \textit{dest}{:}, or \textit{simp}{:}, as appropriate.
blanchet@43628
   385
\end{enum}
blanchet@43628
   386
blanchet@43752
   387
In some rare cases, Metis fails fairly quickly. This usually indicates that
blanchet@43752
   388
Sledgehammer found a type-incorrect proof. Sledgehammer erases some type
blanchet@43752
   389
information to speed up the search. Try Sledgehammer again with full type
blanchet@43752
   390
information: \textit{full\_types} (\S\ref{problem-encoding}), or choose a
blanchet@43752
   391
specific type encoding with \textit{type\_sys} (\S\ref{problem-encoding}). Older
blanchet@43752
   392
versions of Sledgehammer were frequent victims of this problem. Now this should
blanchet@43752
   393
very seldom be an issue, but if you notice many unsound proofs, contact the
blanchet@43752
   394
author at \authoremail.
blanchet@43628
   395
blanchet@43752
   396
\point{How can I tell whether a Sledgehammer proof is sound?}
blanchet@43752
   397
blanchet@43752
   398
First, if \emph{metis} (or \emph{metisFT}) can reconstruct it, the proof is
blanchet@43752
   399
sound (modulo soundness of Isabelle's inference kernel). If it fails or runs
blanchet@43752
   400
seemingly forever, you can try
blanchet@43752
   401
blanchet@43752
   402
\prew
blanchet@43752
   403
\textbf{apply}~\textbf{--} \\
blanchet@43752
   404
\textbf{sledgehammer} [\textit{type\_sys} = \textit{poly\_tags}] (\textit{metis\_facts})
blanchet@43752
   405
\postw
blanchet@43752
   406
blanchet@43752
   407
where \textit{metis\_facts} is the list of facts appearing in the suggested
blanchet@43752
   408
Metis call. The automatic provers should be able to refind the proof very
blanchet@43752
   409
quickly if it is sound, and the \textit{type\_sys} $=$ \textit{poly\_tags}
blanchet@43752
   410
option (\S\ref{problem-encoding}) ensures that no unsound proofs are found.
blanchet@43752
   411
blanchet@43752
   412
The \textit{full\_types} option (\S\ref{problem-encoding}) can also be used
blanchet@43752
   413
here, but it is unsound in extremely rare degenerate cases such as the
blanchet@43752
   414
following:
blanchet@43752
   415
blanchet@43752
   416
\prew
blanchet@43752
   417
\textbf{lemma} ``$\forall x\> y\Colon{'}a.\ x = y \,\Longrightarrow \exists f\> g\Colon\mathit{nat} \Rightarrow {'}a.\ f \not= g$'' \\
blanchet@43752
   418
\textbf{sledgehammer} [\textit{full\_types}] (\textit{nat.distinct\/}(1))
blanchet@43752
   419
\postw
blanchet@43752
   420
blanchet@43752
   421
\point{How does Sledgehammer select the facts that should be passed to the
blanchet@43752
   422
automatic provers?}
blanchet@43752
   423
blanchet@43752
   424
Briefly, the relevance filter assigns a score to every available fact (lemma,
blanchet@43752
   425
theorem, definition, or axiom)\ based upon how many constants that fact shares
blanchet@43752
   426
with the conjecture; this process iterates to include facts relevant to those
blanchet@43752
   427
just accepted, but with a decay factor to ensure termination. The constants are
blanchet@43752
   428
weighted to give unusual ones greater significance. The relevance filter copes
blanchet@43752
   429
best when the conjecture contains some unusual constants; if all the constants
blanchet@43752
   430
are common, it is unable to discriminate among the hundreds of facts that are
blanchet@43752
   431
picked up. The relevance filter is also memoryless: It has no information about
blanchet@43752
   432
how many times a particular fact has been used in a proof, and it cannot learn.
blanchet@43752
   433
blanchet@43752
   434
The number of facts included in a problem varies from prover to prover, since
blanchet@43752
   435
some provers get overwhelmed quicker than others. You can show the number of
blanchet@43752
   436
facts given using the \textit{verbose} option (\S\ref{output-format}) and the
blanchet@43752
   437
actual facts using \textit{debug} (\S\ref{output-format}).
blanchet@43752
   438
blanchet@43752
   439
Sledgehammer is good at finding short proofs combining a handful of existing
blanchet@43752
   440
lemmas. If you are looking for longer proofs, you must typically restrict the
blanchet@43753
   441
number of facts, by setting the \textit{max\_relevant} option
blanchet@43753
   442
(\S\ref{relevance-filter}) to, say, 50 or 100.
blanchet@43752
   443
blanchet@43752
   444
\point{Why are the Isar proofs generated by Sledgehammer so ugly?}
blanchet@43752
   445
blanchet@43752
   446
The current implementation is experimental and explodes exponentially in the
blanchet@43752
   447
worst case. Work on a new implementation has begun. There is a large body of
blanchet@43752
   448
research into transforming resolution proofs into natural deduction proofs (such
blanchet@43752
   449
as Isar proofs), which we hope to leverage. In the meantime, a workaround is to
blanchet@43752
   450
set the \textit{isar\_shrink\_factor} option (\S\ref{output-format}) to a larger
blanchet@43752
   451
value or to try several provers and keep the nicest-looking proof.
blanchet@43752
   452
blanchet@43752
   453
\point{Should I let Sledgehammer minimize the number of lemmas?}
blanchet@43752
   454
blanchet@43752
   455
In general, minimization is a good idea, because proofs involving fewer lemmas
blanchet@43752
   456
tend to be shorter as well, and hence easier to re-find by Metis. But the
blanchet@43752
   457
opposite is sometimes the case.
blanchet@43752
   458
blanchet@43752
   459
\point{Why does the minimizer sometimes starts of its own?}
blanchet@43752
   460
blanchet@43752
   461
There are two scenarios in which this can happen. First, some provers (e.g.,
blanchet@43752
   462
CVC3 and Yices) do not provide proofs or provide incomplete proofs. The
blanchet@43752
   463
minimizer is then invoked to find out which facts are actually needed from the
blanchet@43752
   464
(large) set of facts that was initinally given to the prover. Second, if a
blanchet@43752
   465
prover returns a proof with lots of facts, the minimizer is invoked
blanchet@43752
   466
automatically since Metis is unlikely to refind the proof.
blanchet@43752
   467
blanchet@43752
   468
\point{What is metisFT?}
blanchet@43752
   469
blanchet@43752
   470
The \textit{metisFT} proof method is the fully-typed version of Metis. It is
blanchet@43752
   471
much slower than \textit{metis}, but the proof search is fully typed, and it
blanchet@43752
   472
also includes more powerful rules such as the axiom ``$x = \mathit{True}
blanchet@43752
   473
\mathrel{\lor} x = \mathit{False}$'' for reasoning in higher-order places (e.g.,
blanchet@43752
   474
in set comprehensions). The method kicks in automatically as a fallback when
blanchet@43752
   475
\textit{metis} fails, and it is sometimes generated by Sledgehammer instead of
blanchet@43752
   476
\textit{metis} if the proof obviously requires type information.
blanchet@43752
   477
blanchet@43752
   478
If you see the warning
blanchet@43752
   479
blanchet@43752
   480
\prew
blanchet@43752
   481
\textsl
blanchet@43752
   482
Metis: Falling back on ``\textit{metisFT}''.
blanchet@43752
   483
\postw
blanchet@43752
   484
blanchet@43752
   485
in a successful Metis proof, you can advantageously replace the \textit{metis}
blanchet@43752
   486
call with \textit{metisFT}.
blanchet@43719
   487
blanchet@43628
   488
\point{I got a strange error from Sledgehammer---what should I do?}
blanchet@43628
   489
blanchet@43628
   490
Sledgehammer tries to give informative error messages. Please report any strange
blanchet@43752
   491
error to the author at \authoremail. This applies double if you get the message
blanchet@43628
   492
blanchet@43752
   493
\prew
blanchet@43628
   494
\slshape
blanchet@43746
   495
The prover found a type-unsound proof involving ``\textit{foo}'',
blanchet@43746
   496
``\textit{bar}'', ``\textit{baz}'' even though a supposedly type-sound encoding
blanchet@43746
   497
was used (or, less likely, your axioms are inconsistent). You might want to
blanchet@43746
   498
report this to the Isabelle developers.
blanchet@43752
   499
\postw
blanchet@43628
   500
blanchet@43628
   501
\point{Auto can solve it---why not Sledgehammer?}
blanchet@43628
   502
blanchet@43628
   503
Problems can be easy for \textit{auto} and difficult for automatic provers, but
blanchet@43628
   504
the reverse is also true, so don't be discouraged if your first attempts fail.
blanchet@39566
   505
Because the system refers to all theorems known to Isabelle, it is particularly
blanchet@39566
   506
suitable when your goal has a short proof from lemmas that you don't know about.
blanchet@37517
   507
blanchet@43752
   508
\point{Why are there so many options?}
blanchet@43752
   509
blanchet@43752
   510
Sledgehammer's philosophy should work out of the box, without user guidance.
blanchet@43752
   511
Many of the options are meant to be used mostly by the Sledgehammer developers
blanchet@43752
   512
for experimentation purposes. Of course, feel free to experiment with them if
blanchet@43752
   513
you are so inclined.
blanchet@43752
   514
blanchet@36918
   515
\section{Command Syntax}
blanchet@36918
   516
\label{command-syntax}
blanchet@36918
   517
blanchet@36918
   518
Sledgehammer can be invoked at any point when there is an open goal by entering
blanchet@36918
   519
the \textbf{sledgehammer} command in the theory file. Its general syntax is as
blanchet@36918
   520
follows:
blanchet@36918
   521
blanchet@36918
   522
\prew
blanchet@36918
   523
\textbf{sledgehammer} \textit{subcommand\/$^?$ options\/$^?$ facts\_override\/$^?$ num\/$^?$}
blanchet@36918
   524
\postw
blanchet@36918
   525
blanchet@36918
   526
For convenience, Sledgehammer is also available in the ``Commands'' submenu of
blanchet@36918
   527
the ``Isabelle'' menu in Proof General or by pressing the Emacs key sequence C-c
blanchet@36918
   528
C-a C-s. This is equivalent to entering the \textbf{sledgehammer} command with
blanchet@36918
   529
no arguments in the theory text.
blanchet@36918
   530
blanchet@36918
   531
In the general syntax, the \textit{subcommand} may be any of the following:
blanchet@36918
   532
blanchet@36918
   533
\begin{enum}
blanchet@40444
   534
\item[$\bullet$] \textbf{\textit{run} (the default):} Runs Sledgehammer on
blanchet@40444
   535
subgoal number \textit{num} (1 by default), with the given options and facts.
blanchet@36918
   536
blanchet@36918
   537
\item[$\bullet$] \textbf{\textit{minimize}:} Attempts to minimize the provided facts
blanchet@36918
   538
(specified in the \textit{facts\_override} argument) to obtain a simpler proof
blanchet@36918
   539
involving fewer facts. The options and goal number are as for \textit{run}.
blanchet@36918
   540
blanchet@40444
   541
\item[$\bullet$] \textbf{\textit{messages}:} Redisplays recent messages issued
blanchet@40444
   542
by Sledgehammer. This allows you to examine results that might have been lost
blanchet@40444
   543
due to Sledgehammer's asynchronous nature. The \textit{num} argument specifies a
blanchet@36918
   544
limit on the number of messages to display (5 by default).
blanchet@36918
   545
blanchet@42591
   546
\item[$\bullet$] \textbf{\textit{supported\_provers}:} Prints the list of
blanchet@42588
   547
automatic provers supported by Sledgehammer. See \S\ref{installation} and
blanchet@42588
   548
\S\ref{mode-of-operation} for more information on how to install automatic
blanchet@42588
   549
provers.
blanchet@36918
   550
blanchet@40240
   551
\item[$\bullet$] \textbf{\textit{running\_provers}:} Prints information about
blanchet@40240
   552
currently running automatic provers, including elapsed runtime and remaining
blanchet@40240
   553
time until timeout.
blanchet@36918
   554
blanchet@40240
   555
\item[$\bullet$] \textbf{\textit{kill\_provers}:} Terminates all running
blanchet@40240
   556
automatic provers.
blanchet@36918
   557
blanchet@36918
   558
\item[$\bullet$] \textbf{\textit{refresh\_tptp}:} Refreshes the list of remote
blanchet@36918
   559
ATPs available at System\-On\-TPTP \cite{sutcliffe-2000}.
blanchet@36918
   560
\end{enum}
blanchet@36918
   561
blanchet@36918
   562
Sledgehammer's behavior can be influenced by various \textit{options}, which can
blanchet@36918
   563
be specified in brackets after the \textbf{sledgehammer} command. The
blanchet@36918
   564
\textit{options} are a list of key--value pairs of the form ``[$k_1 = v_1,
blanchet@36918
   565
\ldots, k_n = v_n$]''. For Boolean options, ``= \textit{true}'' is optional. For
blanchet@36918
   566
example:
blanchet@36918
   567
blanchet@36918
   568
\prew
blanchet@36918
   569
\textbf{sledgehammer} [\textit{isar\_proof}, \,\textit{timeout} = 120$\,s$]
blanchet@36918
   570
\postw
blanchet@36918
   571
blanchet@36918
   572
Default values can be set using \textbf{sledgehammer\_\allowbreak params}:
blanchet@36918
   573
blanchet@36918
   574
\prew
blanchet@36918
   575
\textbf{sledgehammer\_params} \textit{options}
blanchet@36918
   576
\postw
blanchet@36918
   577
blanchet@36918
   578
The supported options are described in \S\ref{option-reference}.
blanchet@36918
   579
blanchet@36918
   580
The \textit{facts\_override} argument lets you alter the set of facts that go
blanchet@36918
   581
through the relevance filter. It may be of the form ``(\textit{facts})'', where
blanchet@36918
   582
\textit{facts} is a space-separated list of Isabelle facts (theorems, local
blanchet@36918
   583
assumptions, etc.), in which case the relevance filter is bypassed and the given
blanchet@39566
   584
facts are used. It may also be of the form ``(\textit{add}:\ \textit{facts}$_1$)'',
blanchet@39566
   585
``(\textit{del}:\ \textit{facts}$_2$)'', or ``(\textit{add}:\ \textit{facts}$_1$\
blanchet@39566
   586
\textit{del}:\ \textit{facts}$_2$)'', where the relevance filter is instructed to
blanchet@36918
   587
proceed as usual except that it should consider \textit{facts}$_1$
blanchet@36918
   588
highly-relevant and \textit{facts}$_2$ fully irrelevant.
blanchet@36918
   589
blanchet@39566
   590
You can instruct Sledgehammer to run automatically on newly entered theorems by
blanchet@39566
   591
enabling the ``Auto Sledgehammer'' option from the ``Isabelle'' menu in Proof
blanchet@40240
   592
General. For automatic runs, only the first prover set using \textit{provers}
blanchet@43601
   593
(\S\ref{mode-of-operation}) is considered, fewer facts are passed to the prover,
blanchet@43601
   594
\textit{slicing} (\S\ref{mode-of-operation}) is disabled, \textit{timeout}
blanchet@40254
   595
(\S\ref{mode-of-operation}) is superseded by the ``Auto Tools Time Limit'' in
blanchet@43601
   596
Proof General's ``Isabelle'' menu, \textit{full\_types}
blanchet@43601
   597
(\S\ref{problem-encoding}) is enabled, and \textit{verbose}
blanchet@43601
   598
(\S\ref{output-format}) and \textit{debug} (\S\ref{output-format}) are disabled.
blanchet@43601
   599
Sledgehammer's output is also more concise.
blanchet@39566
   600
blanchet@36918
   601
\section{Option Reference}
blanchet@36918
   602
\label{option-reference}
blanchet@36918
   603
blanchet@36918
   604
\def\flushitem#1{\item[]\noindent\kern-\leftmargin \textbf{#1}}
blanchet@36918
   605
\def\qty#1{$\left<\textit{#1}\right>$}
blanchet@36918
   606
\def\qtybf#1{$\mathbf{\left<\textbf{\textit{#1}}\right>}$}
blanchet@36918
   607
\def\optrue#1#2{\flushitem{\textit{#1} $\bigl[$= \qtybf{bool}$\bigr]$\quad [\textit{true}]\hfill (neg.: \textit{#2})}\nopagebreak\\[\parskip]}
blanchet@36918
   608
\def\opfalse#1#2{\flushitem{\textit{#1} $\bigl[$= \qtybf{bool}$\bigr]$\quad [\textit{false}]\hfill (neg.: \textit{#2})}\nopagebreak\\[\parskip]}
blanchet@36918
   609
\def\opsmart#1#2{\flushitem{\textit{#1} $\bigl[$= \qtybf{bool\_or\_smart}$\bigr]$\quad [\textit{smart}]\hfill (neg.: \textit{#2})}\nopagebreak\\[\parskip]}
blanchet@36918
   610
\def\opsmartx#1#2{\flushitem{\textit{#1} $\bigl[$= \qtybf{bool\_or\_smart}$\bigr]$\quad [\textit{smart}]\hfill\\\hbox{}\hfill (neg.: \textit{#2})}\nopagebreak\\[\parskip]}
blanchet@36918
   611
\def\opnodefault#1#2{\flushitem{\textit{#1} = \qtybf{#2}} \nopagebreak\\[\parskip]}
blanchet@36918
   612
\def\opdefault#1#2#3{\flushitem{\textit{#1} = \qtybf{#2}\quad [\textit{#3}]} \nopagebreak\\[\parskip]}
blanchet@36918
   613
\def\oparg#1#2#3{\flushitem{\textit{#1} \qtybf{#2} = \qtybf{#3}} \nopagebreak\\[\parskip]}
blanchet@36918
   614
\def\opargbool#1#2#3{\flushitem{\textit{#1} \qtybf{#2} $\bigl[$= \qtybf{bool}$\bigr]$\hfill (neg.: \textit{#3})}\nopagebreak\\[\parskip]}
blanchet@36918
   615
\def\opargboolorsmart#1#2#3{\flushitem{\textit{#1} \qtybf{#2} $\bigl[$= \qtybf{bool\_or\_smart}$\bigr]$\hfill (neg.: \textit{#3})}\nopagebreak\\[\parskip]}
blanchet@36918
   616
blanchet@36918
   617
Sledgehammer's options are categorized as follows:\ mode of operation
blanchet@39228
   618
(\S\ref{mode-of-operation}), problem encoding (\S\ref{problem-encoding}),
blanchet@39228
   619
relevance filter (\S\ref{relevance-filter}), output format
blanchet@39228
   620
(\S\ref{output-format}), and authentication (\S\ref{authentication}).
blanchet@36918
   621
blanchet@36918
   622
The descriptions below refer to the following syntactic quantities:
blanchet@36918
   623
blanchet@36918
   624
\begin{enum}
blanchet@36918
   625
\item[$\bullet$] \qtybf{string}: A string.
blanchet@36918
   626
\item[$\bullet$] \qtybf{bool\/}: \textit{true} or \textit{false}.
blanchet@40444
   627
\item[$\bullet$] \qtybf{bool\_or\_smart\/}: \textit{true}, \textit{false}, or
blanchet@40444
   628
\textit{smart}.
blanchet@36918
   629
\item[$\bullet$] \qtybf{int\/}: An integer.
blanchet@43589
   630
%\item[$\bullet$] \qtybf{float\/}: A floating-point number (e.g., 2.5).
blanchet@40584
   631
\item[$\bullet$] \qtybf{float\_pair\/}: A pair of floating-point numbers
blanchet@40584
   632
(e.g., 0.6 0.95).
blanchet@38814
   633
\item[$\bullet$] \qtybf{int\_or\_smart\/}: An integer or \textit{smart}.
blanchet@40584
   634
\item[$\bullet$] \qtybf{float\_or\_none\/}: An integer (e.g., 60) or
blanchet@40584
   635
floating-point number (e.g., 0.5) expressing a number of seconds, or the keyword
blanchet@40584
   636
\textit{none} ($\infty$ seconds).
blanchet@36918
   637
\end{enum}
blanchet@36918
   638
blanchet@36918
   639
Default values are indicated in square brackets. Boolean options have a negated
blanchet@39228
   640
counterpart (e.g., \textit{blocking} vs.\ \textit{non\_blocking}). When setting
blanchet@36918
   641
Boolean options, ``= \textit{true}'' may be omitted.
blanchet@36918
   642
blanchet@36918
   643
\subsection{Mode of Operation}
blanchet@36918
   644
\label{mode-of-operation}
blanchet@36918
   645
blanchet@36918
   646
\begin{enum}
blanchet@40240
   647
\opnodefault{provers}{string}
blanchet@40240
   648
Specifies the automatic provers to use as a space-separated list (e.g.,
blanchet@40240
   649
``\textit{e}~\textit{spass}''). The following provers are supported:
blanchet@36918
   650
blanchet@36918
   651
\begin{enum}
blanchet@36918
   652
\item[$\bullet$] \textbf{\textit{e}:} E is an ATP developed by Stephan Schulz
blanchet@36918
   653
\cite{schulz-2002}. To use E, set the environment variable
blanchet@36918
   654
\texttt{E\_HOME} to the directory that contains the \texttt{eproof} executable,
blanchet@36918
   655
or install the prebuilt E package from Isabelle's download page. See
blanchet@36918
   656
\S\ref{installation} for details.
blanchet@36918
   657
blanchet@36918
   658
\item[$\bullet$] \textbf{\textit{spass}:} SPASS is an ATP developed by Christoph
blanchet@36918
   659
Weidenbach et al.\ \cite{weidenbach-et-al-2009}. To use SPASS, set the
blanchet@36918
   660
environment variable \texttt{SPASS\_HOME} to the directory that contains the
blanchet@36918
   661
\texttt{SPASS} executable, or install the prebuilt SPASS package from Isabelle's
blanchet@37389
   662
download page. Sledgehammer requires version 3.5 or above. See
blanchet@37389
   663
\S\ref{installation} for details.
blanchet@36918
   664
blanchet@36918
   665
\item[$\bullet$] \textbf{\textit{vampire}:} Vampire is an ATP developed by
blanchet@36918
   666
Andrei Voronkov and his colleagues \cite{riazanov-voronkov-2002}. To use
blanchet@36918
   667
Vampire, set the environment variable \texttt{VAMPIRE\_HOME} to the directory
blanchet@41190
   668
that contains the \texttt{vampire} executable. Sledgehammer has been tested with
blanchet@41190
   669
versions 11, 0.6, and 1.0.
blanchet@36918
   670
blanchet@42611
   671
\item[$\bullet$] \textbf{\textit{cvc3}:} CVC3 is an SMT solver developed by
blanchet@42611
   672
Clark Barrett, Cesare Tinelli, and their colleagues \cite{cvc3}. To use CVC3,
blanchet@42611
   673
set the environment variable \texttt{CVC3\_SOLVER} to the complete path of the
blanchet@42611
   674
executable, including the file name. Sledgehammer has been tested with version
blanchet@42611
   675
2.2.
blanchet@41190
   676
blanchet@41190
   677
\item[$\bullet$] \textbf{\textit{yices}:} Yices is an SMT solver developed at
blanchet@41190
   678
SRI \cite{yices}. To use Yices, set the environment variable
blanchet@41190
   679
\texttt{YICES\_SOLVER} to the complete path of the executable, including the
blanchet@41190
   680
file name. Sledgehammer has been tested with version 1.0.
blanchet@41190
   681
blanchet@42611
   682
\item[$\bullet$] \textbf{\textit{z3}:} Z3 is an SMT solver developed at
blanchet@42611
   683
Microsoft Research \cite{z3}. To use Z3, set the environment variable
blanchet@42611
   684
\texttt{Z3\_SOLVER} to the complete path of the executable, including the file
blanchet@42611
   685
name. Sledgehammer has been tested with versions 2.7 to 2.18.
blanchet@42611
   686
blanchet@42611
   687
\item[$\bullet$] \textbf{\textit{z3\_atp}:} This version of Z3 pretends to be an
blanchet@42611
   688
ATP, exploiting Z3's undocumented support for the TPTP format. It is included
blanchet@43313
   689
for experimental purposes. It requires version 2.18 or above.
blanchet@40254
   690
blanchet@38824
   691
\item[$\bullet$] \textbf{\textit{remote\_e}:} The remote version of E runs
blanchet@36918
   692
on Geoff Sutcliffe's Miami servers \cite{sutcliffe-2000}.
blanchet@36918
   693
blanchet@36918
   694
\item[$\bullet$] \textbf{\textit{remote\_vampire}:} The remote version of
blanchet@38824
   695
Vampire runs on Geoff Sutcliffe's Miami servers. Version 9 is used.
blanchet@36918
   696
blanchet@43406
   697
\item[$\bullet$] \textbf{\textit{remote\_tofof\_e}:} ToFoF-E is a metaprover
blanchet@43406
   698
developed by Geoff Sutcliffe \cite{tofof} based on E running on his Miami
blanchet@43406
   699
servers. This ATP supports a fragment of the TPTP many-typed first-order format
blanchet@43406
   700
(TFF). It is supported primarily for experimenting with the
blanchet@43725
   701
\textit{type\_sys} $=$ \textit{simple} option (\S\ref{problem-encoding}).
blanchet@43406
   702
blanchet@38824
   703
\item[$\bullet$] \textbf{\textit{remote\_sine\_e}:} SInE-E is a metaprover
blanchet@38824
   704
developed by Kry\v stof Hoder \cite{sine} based on E. The remote version of
blanchet@38824
   705
SInE runs on Geoff Sutcliffe's Miami servers.
blanchet@38824
   706
blanchet@38824
   707
\item[$\bullet$] \textbf{\textit{remote\_snark}:} SNARK is a prover
blanchet@38824
   708
developed by Stickel et al.\ \cite{snark}. The remote version of
blanchet@38824
   709
SNARK runs on Geoff Sutcliffe's Miami servers.
blanchet@40254
   710
blanchet@42609
   711
\item[$\bullet$] \textbf{\textit{remote\_cvc3}:} The remote version of CVC3 runs
blanchet@42609
   712
on servers at the TU M\"unchen (or wherever \texttt{REMOTE\_SMT\_URL} is set to
blanchet@42609
   713
point).
blanchet@42609
   714
blanchet@41190
   715
\item[$\bullet$] \textbf{\textit{remote\_z3}:} The remote version of Z3 runs on
blanchet@41190
   716
servers at the TU M\"unchen (or wherever \texttt{REMOTE\_SMT\_URL} is set to
blanchet@41190
   717
point).
blanchet@40254
   718
blanchet@42611
   719
\item[$\bullet$] \textbf{\textit{remote\_z3\_atp}:} The remote version of ``Z3
blanchet@42611
   720
as an ATP'' runs on Geoff Sutcliffe's Miami servers.
blanchet@36918
   721
\end{enum}
blanchet@36918
   722
blanchet@41190
   723
By default, Sledgehammer will run E, SPASS, Vampire, SInE-E, and Z3 (or whatever
blanchet@43089
   724
the SMT module's \textit{smt\_solver} configuration option is set to) in
blanchet@40254
   725
parallel---either locally or remotely, depending on the number of processor
blanchet@40254
   726
cores available. For historical reasons, the default value of this option can be
blanchet@40254
   727
overridden using the option ``Sledgehammer: Provers'' from the ``Isabelle'' menu
blanchet@40254
   728
in Proof General.
blanchet@36918
   729
blanchet@40240
   730
It is a good idea to run several provers in parallel, although it could slow
blanchet@40254
   731
down your machine. Running E, SPASS, Vampire, and SInE-E together for 5 seconds
blanchet@40254
   732
yields a better success rate than running the most effective of these (Vampire)
blanchet@40254
   733
for 120 seconds \cite{boehme-nipkow-2010}.
blanchet@40240
   734
blanchet@40240
   735
\opnodefault{prover}{string}
blanchet@40240
   736
Alias for \textit{provers}.
blanchet@40240
   737
blanchet@43753
   738
%\opnodefault{atps}{string}
blanchet@43753
   739
%Legacy alias for \textit{provers}.
blanchet@36918
   740
blanchet@43753
   741
%\opnodefault{atp}{string}
blanchet@43753
   742
%Legacy alias for \textit{provers}.
blanchet@36918
   743
blanchet@40584
   744
\opdefault{timeout}{float\_or\_none}{\upshape 30}
blanchet@40582
   745
Specifies the maximum number of seconds that the automatic provers should spend
blanchet@43719
   746
searching for a proof. This excludes problem preparation and is a soft limit.
blanchet@43719
   747
For historical reasons, the default value of this option can be overridden using
blanchet@43719
   748
the option ``Sledgehammer: Time Limit'' from the ``Isabelle'' menu in Proof
blanchet@43719
   749
General.
blanchet@39228
   750
blanchet@39227
   751
\opfalse{blocking}{non\_blocking}
blanchet@39227
   752
Specifies whether the \textbf{sledgehammer} command should operate
blanchet@39227
   753
synchronously. The asynchronous (non-blocking) mode lets the user start proving
blanchet@39227
   754
the putative theorem manually while Sledgehammer looks for a proof, but it can
blanchet@39227
   755
also be more confusing.
blanchet@39227
   756
blanchet@43314
   757
\optrue{slicing}{no\_slicing}
blanchet@43314
   758
Specifies whether the time allocated to a prover should be sliced into several
blanchet@43314
   759
segments, each of which has its own set of possibly prover-dependent options.
blanchet@43317
   760
For SPASS and Vampire, the first slice tries the fast but incomplete
blanchet@43314
   761
set-of-support (SOS) strategy, whereas the second slice runs without it. For E,
blanchet@43317
   762
up to three slices are tried, with different weighted search strategies and
blanchet@43314
   763
number of facts. For SMT solvers, several slices are tried with the same options
blanchet@43317
   764
each time but fewer and fewer facts. According to benchmarks with a timeout of
blanchet@43317
   765
30 seconds, slicing is a valuable optimization, and you should probably leave it
blanchet@43317
   766
enabled unless you are conducting experiments. This option is implicitly
blanchet@43314
   767
disabled for (short) automatic runs.
blanchet@43314
   768
blanchet@43314
   769
\nopagebreak
blanchet@43314
   770
{\small See also \textit{verbose} (\S\ref{output-format}).}
blanchet@43314
   771
blanchet@36918
   772
\opfalse{overlord}{no\_overlord}
blanchet@36918
   773
Specifies whether Sledgehammer should put its temporary files in
blanchet@36918
   774
\texttt{\$ISA\-BELLE\_\allowbreak HOME\_\allowbreak USER}, which is useful for
blanchet@36918
   775
debugging Sledgehammer but also unsafe if several instances of the tool are run
blanchet@36918
   776
simultaneously. The files are identified by the prefix \texttt{prob\_}; you may
blanchet@36918
   777
safely remove them after Sledgehammer has run.
blanchet@36918
   778
blanchet@36918
   779
\nopagebreak
blanchet@36918
   780
{\small See also \textit{debug} (\S\ref{output-format}).}
blanchet@36918
   781
\end{enum}
blanchet@36918
   782
blanchet@36918
   783
\subsection{Problem Encoding}
blanchet@36918
   784
\label{problem-encoding}
blanchet@36918
   785
blanchet@36918
   786
\begin{enum}
blanchet@36918
   787
\opfalse{explicit\_apply}{implicit\_apply}
blanchet@36918
   788
Specifies whether function application should be encoded as an explicit
blanchet@40254
   789
``apply'' operator in ATP problems. If the option is set to \textit{false}, each
blanchet@40254
   790
function will be directly applied to as many arguments as possible. Enabling
blanchet@40254
   791
this option can sometimes help discover higher-order proofs that otherwise would
blanchet@40254
   792
not be found.
blanchet@36918
   793
blanchet@36918
   794
\opfalse{full\_types}{partial\_types}
blanchet@43551
   795
Specifies whether full type information is encoded in ATP problems. Enabling
blanchet@43601
   796
this option prevents the discovery of type-incorrect proofs, but it can slow
blanchet@43601
   797
down the ATP slightly. This option is implicitly enabled for automatic runs. For
blanchet@43601
   798
historical reasons, the default value of this option can be overridden using the
blanchet@43601
   799
option ``Sledgehammer: Full Types'' from the ``Isabelle'' menu in Proof General.
blanchet@43089
   800
blanchet@43089
   801
\opdefault{type\_sys}{string}{smart}
blanchet@43756
   802
Specifies the type system to use in ATP problems. Some of the type systems are
blanchet@43756
   803
unsound, meaning that they can give rise to spurious proofs (unreconstructible
blanchet@43756
   804
using Metis). The supported type systems are listed below, with an indication of
blanchet@43756
   805
their soundness in parentheses:
blanchet@43089
   806
blanchet@43089
   807
\begin{enum}
blanchet@43756
   808
\item[$\bullet$] \textbf{\textit{erased} (very unsound):} No type information is
blanchet@43756
   809
supplied to the ATP. Types are simply erased.
blanchet@43453
   810
blanchet@43756
   811
\item[$\bullet$] \textbf{\textit{poly\_preds} (sound):} Types are encoded using
blanchet@43756
   812
a predicate \textit{has\_\allowbreak type\/}$(\tau, t)$ that restricts the range
blanchet@43756
   813
of bound variables. Constants are annotated with their types, supplied as extra
blanchet@43756
   814
arguments, to resolve overloading.
blanchet@43555
   815
blanchet@43756
   816
\item[$\bullet$] \textbf{\textit{poly\_tags} (sound):} Each term and subterm is
blanchet@43756
   817
tagged with its type using a function $\mathit{type\_info\/}(\tau, t)$.
blanchet@43756
   818
blanchet@43756
   819
\item[$\bullet$] \textbf{\textit{poly\_args} (unsound):}
blanchet@43756
   820
Like for \textit{poly\_preds} constants are annotated with their types to
blanchet@43587
   821
resolve overloading, but otherwise no type information is encoded.
blanchet@43555
   822
blanchet@43587
   823
\item[$\bullet$]
blanchet@43587
   824
\textbf{%
blanchet@43756
   825
\textit{mono\_preds}, \textit{mono\_tags} (sound);
blanchet@43756
   826
\textit{mono\_args} (unsound):} \\
blanchet@43587
   827
Similar to \textit{poly\_preds}, \textit{poly\_tags}, and \textit{poly\_args},
blanchet@43587
   828
respectively, but the problem is additionally monomorphized, meaning that type
blanchet@43587
   829
variables are instantiated with heuristically chosen ground types.
blanchet@43587
   830
Monomorphization can simplify reasoning but also leads to larger fact bases,
blanchet@43587
   831
which can slow down the ATPs.
blanchet@43453
   832
blanchet@43587
   833
\item[$\bullet$]
blanchet@43587
   834
\textbf{%
blanchet@43587
   835
\textit{mangled\_preds},
blanchet@43756
   836
\textit{mangled\_tags} (sound); \\
blanchet@43756
   837
\textit{mangled\_args} (unsound):} \\
blanchet@43587
   838
Similar to
blanchet@43587
   839
\textit{mono\_preds}, \textit{mono\_tags}, and \textit{mono\_args},
blanchet@43587
   840
respectively but types are mangled in constant names instead of being supplied
blanchet@43587
   841
as ground term arguments. The binary predicate $\mathit{has\_type\/}(\tau, t)$
blanchet@43587
   842
becomes a unary predicate $\mathit{has\_type\_}\tau(t)$, and the binary function
blanchet@43460
   843
$\mathit{type\_info\/}(\tau, t)$ becomes a unary function
blanchet@43460
   844
$\mathit{type\_info\_}\tau(t)$.
blanchet@43453
   845
blanchet@43756
   846
\item[$\bullet$] \textbf{\textit{simple} (sound):} Use the prover's support for
blanchet@43756
   847
simply typed first-order logic if available; otherwise, fall back on
blanchet@43756
   848
\textit{mangled\_preds}. The problem is monomorphized.
blanchet@43551
   849
blanchet@43551
   850
\item[$\bullet$]
blanchet@43551
   851
\textbf{%
blanchet@43756
   852
\textit{poly\_preds}?, \textit{poly\_tags}?, \textit{mono\_preds}?, \textit{mono\_tags}?, \\
blanchet@43756
   853
\textit{mangled\_preds}?, \textit{mangled\_tags}?, \textit{simple}? (quasi-sound):} \\
blanchet@43608
   854
The type systems \textit{poly\_preds}, \textit{poly\_tags},
blanchet@43756
   855
\textit{mono\_preds}, \textit{mono\_tags}, \textit{mangled\_preds},
blanchet@43756
   856
\textit{mangled\_tags}, and \textit{simple} are fully typed and sound. For each
blanchet@43756
   857
of these, Sledgehammer also provides a lighter, virtually sound variant
blanchet@43756
   858
identified by a question mark (`{?}')\ that detects and erases monotonic types,
blanchet@43756
   859
notably infinite types. (For \textit{simple}, the types are not actually erased
blanchet@43725
   860
but rather replaced by a shared uniform type of individuals.)
blanchet@43460
   861
blanchet@43756
   862
\item[$\bullet$]
blanchet@43756
   863
\textbf{%
blanchet@43756
   864
\textit{poly\_preds}!, \textit{poly\_tags}!, \textit{mono\_preds}!, \textit{mono\_tags}!, \\
blanchet@43756
   865
\textit{mangled\_preds}!, \textit{mangled\_tags}!, \textit{simple}! \\
blanchet@43756
   866
(mildly unsound):} \\
blanchet@43756
   867
The type systems \textit{poly\_preds}, \textit{poly\_tags},
blanchet@43756
   868
\textit{mono\_preds}, \textit{mono\_tags}, \textit{mangled\_preds},
blanchet@43756
   869
\textit{mangled\_tags}, and \textit{simple} also admit a mildly unsound (but
blanchet@43756
   870
very efficient) variant identified by an exclamation mark (`{!}') that detects
blanchet@43756
   871
and erases erases all types except those that are clearly finite (e.g.,
blanchet@43756
   872
\textit{bool}). (For \textit{simple}, the types are not actually erased but
blanchet@43756
   873
rather replaced by a shared uniform type of individuals.)
blanchet@43756
   874
blanchet@43098
   875
\item[$\bullet$] \textbf{\textit{smart}:} If \textit{full\_types} is enabled,
blanchet@43756
   876
uses a sound or virtually sound encoding; otherwise, uses any encoding. The actual
blanchet@43756
   877
encoding used depends on the ATP and should be the most efficient for that ATP.
blanchet@43089
   878
\end{enum}
blanchet@43089
   879
blanchet@43725
   880
In addition, all the \textit{preds} and \textit{tags} type systems are available
blanchet@43725
   881
in two variants, a lightweight and a heavyweight variant. The lightweight
blanchet@43725
   882
variants are generally more efficient and are the default; the heavyweight
blanchet@43725
   883
variants are identified by a \textit{\_heavy} suffix (e.g.,
blanchet@43725
   884
\textit{mangled\_preds\_heavy}{?}).
blanchet@43394
   885
blanchet@43725
   886
For SMT solvers and ToFoF-E, the type system is always \textit{simple},
blanchet@43725
   887
irrespective of the value of this option.
blanchet@43757
   888
blanchet@43757
   889
\nopagebreak
blanchet@43757
   890
{\small See also \textit{max\_new\_mono\_instances} (\S\ref{relevance-filter})
blanchet@43757
   891
and \textit{max\_mono\_iters} (\S\ref{relevance-filter}).}
blanchet@38814
   892
\end{enum}
blanchet@36918
   893
blanchet@38814
   894
\subsection{Relevance Filter}
blanchet@38814
   895
\label{relevance-filter}
blanchet@38814
   896
blanchet@38814
   897
\begin{enum}
blanchet@40584
   898
\opdefault{relevance\_thresholds}{float\_pair}{\upshape 0.45~0.85}
blanchet@38985
   899
Specifies the thresholds above which facts are considered relevant by the
blanchet@38985
   900
relevance filter. The first threshold is used for the first iteration of the
blanchet@38985
   901
relevance filter and the second threshold is used for the last iteration (if it
blanchet@38985
   902
is reached). The effective threshold is quadratically interpolated for the other
blanchet@40584
   903
iterations. Each threshold ranges from 0 to 1, where 0 means that all theorems
blanchet@40584
   904
are relevant and 1 only theorems that refer to previously seen constants.
blanchet@36918
   905
blanchet@40584
   906
\opsmart{max\_relevant}{int\_or\_smart}
blanchet@38985
   907
Specifies the maximum number of facts that may be returned by the relevance
blanchet@38985
   908
filter. If the option is set to \textit{smart}, it is set to a value that was
blanchet@40240
   909
empirically found to be appropriate for the prover. A typical value would be
blanchet@40240
   910
300.
blanchet@43051
   911
blanchet@43753
   912
\opdefault{max\_new\_mono\_instances}{int}{\upshape 400}
blanchet@43753
   913
Specifies the maximum number of monomorphic instances to generate beyond
blanchet@43753
   914
\textit{max\_relevant}. The higher this limit is, the more monomorphic instances
blanchet@43753
   915
are potentially generated. Whether monomorphization takes place depends on the
blanchet@43753
   916
type system used.
blanchet@43753
   917
blanchet@43753
   918
\nopagebreak
blanchet@43753
   919
{\small See also \textit{type\_sys} (\S\ref{problem-encoding}).}
blanchet@43753
   920
blanchet@43753
   921
\opdefault{max\_mono\_iters}{int}{\upshape 3}
blanchet@43753
   922
Specifies the maximum number of iterations for the monomorphization fixpoint
blanchet@43753
   923
construction. The higher this limit is, the more monomorphic instances are
blanchet@43753
   924
potentially generated. Whether monomorphization takes place depends on the
blanchet@43753
   925
type system used.
blanchet@43753
   926
blanchet@43753
   927
\nopagebreak
blanchet@43753
   928
{\small See also \textit{type\_sys} (\S\ref{problem-encoding}).}
blanchet@36918
   929
\end{enum}
blanchet@36918
   930
blanchet@36918
   931
\subsection{Output Format}
blanchet@36918
   932
\label{output-format}
blanchet@36918
   933
blanchet@36918
   934
\begin{enum}
blanchet@36918
   935
blanchet@36918
   936
\opfalse{verbose}{quiet}
blanchet@36918
   937
Specifies whether the \textbf{sledgehammer} command should explain what it does.
blanchet@41456
   938
This option is implicitly disabled for automatic runs.
blanchet@36918
   939
blanchet@36918
   940
\opfalse{debug}{no\_debug}
blanchet@40444
   941
Specifies whether Sledgehammer should display additional debugging information
blanchet@40444
   942
beyond what \textit{verbose} already displays. Enabling \textit{debug} also
blanchet@41456
   943
enables \textit{verbose} and \textit{blocking} (\S\ref{mode-of-operation})
blanchet@41456
   944
behind the scenes. The \textit{debug} option is implicitly disabled for
blanchet@41456
   945
automatic runs.
blanchet@36918
   946
blanchet@36918
   947
\nopagebreak
blanchet@36918
   948
{\small See also \textit{overlord} (\S\ref{mode-of-operation}).}
blanchet@36918
   949
blanchet@36918
   950
\opfalse{isar\_proof}{no\_isar\_proof}
blanchet@36918
   951
Specifies whether Isar proofs should be output in addition to one-liner
blanchet@36918
   952
\textit{metis} proofs. Isar proof construction is still experimental and often
blanchet@36918
   953
fails; however, they are usually faster and sometimes more robust than
blanchet@36918
   954
\textit{metis} proofs.
blanchet@36918
   955
blanchet@40584
   956
\opdefault{isar\_shrink\_factor}{int}{\upshape 1}
blanchet@36918
   957
Specifies the granularity of the Isar proof. A value of $n$ indicates that each
blanchet@36918
   958
Isar proof step should correspond to a group of up to $n$ consecutive proof
blanchet@36918
   959
steps in the ATP proof.
blanchet@36918
   960
blanchet@36918
   961
\end{enum}
blanchet@36918
   962
blanchet@39228
   963
\subsection{Authentication}
blanchet@39228
   964
\label{authentication}
blanchet@36918
   965
blanchet@36918
   966
\begin{enum}
blanchet@39228
   967
\opnodefault{expect}{string}
blanchet@39228
   968
Specifies the expected outcome, which must be one of the following:
blanchet@39228
   969
blanchet@39228
   970
\begin{enum}
blanchet@40444
   971
\item[$\bullet$] \textbf{\textit{some}:} Sledgehammer found a (potentially
blanchet@40444
   972
unsound) proof.
blanchet@39228
   973
\item[$\bullet$] \textbf{\textit{none}:} Sledgehammer found no proof.
blanchet@40444
   974
\item[$\bullet$] \textbf{\textit{unknown}:} Sledgehammer encountered some
blanchet@40444
   975
problem.
blanchet@39228
   976
\end{enum}
blanchet@39228
   977
blanchet@39228
   978
Sledgehammer emits an error (if \textit{blocking} is enabled) or a warning
blanchet@39228
   979
(otherwise) if the actual outcome differs from the expected outcome. This option
blanchet@39228
   980
is useful for regression testing.
blanchet@39228
   981
blanchet@39228
   982
\nopagebreak
blanchet@39228
   983
{\small See also \textit{blocking} (\S\ref{mode-of-operation}).}
blanchet@36918
   984
\end{enum}
blanchet@36918
   985
blanchet@36918
   986
\let\em=\sl
blanchet@36918
   987
\bibliography{../manual}{}
blanchet@36918
   988
\bibliographystyle{abbrv}
blanchet@36918
   989
blanchet@36918
   990
\end{document}