wneuper/isa: doc-src/TutorialI/Misc/AdvancedInd.thy@028f54cd2cc9 (annotated)

nipkow@9645	1	(<)
nipkow@9645	2	theory AdvancedInd = Main:;
nipkow@9645	3	(>)
nipkow@9645	4
nipkow@9645	5	text{*\noindent
nipkow@9645	6	Now that we have learned about rules and logic, we take another look at the
nipkow@9645	7	finer points of induction. The two questions we answer are: what to do if the
nipkow@9645	8	proposition to be proved is not directly amenable to induction, and how to
nipkow@9645	9	utilize and even derive new induction schemas.
nipkow@9689	10	*};
nipkow@9645	11
nipkow@10217	12	subsection{Massaging the proposition};
nipkow@9645	13
nipkow@10217	14	text{*\label{sec:ind-var-in-prems}
nipkow@9645	15	So far we have assumed that the theorem we want to prove is already in a form
nipkow@9645	16	that is amenable to induction, but this is not always the case:
nipkow@9689	17	*};
nipkow@9645	18
nipkow@9933	19	lemma "xs \<noteq> [] \<Longrightarrow> hd(rev xs) = last xs";
nipkow@9645	20	apply(induct_tac xs);
nipkow@9645	21
nipkow@9645	22	txt{*\noindent
nipkow@9792	23	(where @{term"hd"} and @{term"last"} return the first and last element of a
nipkow@9645	24	non-empty list)
nipkow@9645	25	produces the warning
nipkow@9645	26	\begin{quote}\tt
nipkow@9645	27	Induction variable occurs also among premises!
nipkow@9645	28	\end{quote}
nipkow@9645	29	and leads to the base case
nipkow@9723	30	\begin{isabelle}
nipkow@9645	31	\ 1.\ xs\ {\isasymnoteq}\ []\ {\isasymLongrightarrow}\ hd\ (rev\ [])\ =\ last\ []
nipkow@9723	32	\end{isabelle}
nipkow@9645	33	which, after simplification, becomes
nipkow@9723	34	\begin{isabelle}
nipkow@9645	35	\ 1.\ xs\ {\isasymnoteq}\ []\ {\isasymLongrightarrow}\ hd\ []\ =\ last\ []
nipkow@9723	36	\end{isabelle}
nipkow@10242	37	We cannot prove this equality because we do not know what @{term hd} and
nipkow@10242	38	@{term last} return when applied to @{term"[]"}.
nipkow@9645	39
nipkow@9645	40	The point is that we have violated the above warning. Because the induction
nipkow@10242	41	formula is only the conclusion, the occurrence of @{term xs} in the premises is
nipkow@9645	42	not modified by induction. Thus the case that should have been trivial
nipkow@10242	43	becomes unprovable. Fortunately, the solution is easy:\footnote{A very similar
nipkow@10242	44	heuristic applies to rule inductions; see \S\ref{sec:rtc}.}
nipkow@9645	45	\begin{quote}
nipkow@9645	46	\emph{Pull all occurrences of the induction variable into the conclusion
nipkow@9792	47	using @{text"\<longrightarrow>"}.}
nipkow@9645	48	\end{quote}
nipkow@9645	49	This means we should prove
nipkow@9689	50	*};
nipkow@9689	51	(<)oops;(>)
nipkow@9933	52	lemma hd_rev: "xs \<noteq> [] \<longrightarrow> hd(rev xs) = last xs";
nipkow@9645	53	(<)
nipkow@9689	54	by(induct_tac xs, auto);
nipkow@9645	55	(>)
nipkow@9645	56
nipkow@9645	57	text{*\noindent
nipkow@9645	58	This time, induction leaves us with the following base case
nipkow@9723	59	\begin{isabelle}
nipkow@9645	60	\ 1.\ []\ {\isasymnoteq}\ []\ {\isasymlongrightarrow}\ hd\ (rev\ [])\ =\ last\ []
nipkow@9723	61	\end{isabelle}
nipkow@9792	62	which is trivial, and @{text"auto"} finishes the whole proof.
nipkow@9645	63
nipkow@9792	64	If @{thm[source]hd_rev} is meant to be a simplification rule, you are
nipkow@9792	65	done. But if you really need the @{text"\<Longrightarrow>"}-version of
nipkow@9792	66	@{thm[source]hd_rev}, for example because you want to apply it as an
nipkow@9792	67	introduction rule, you need to derive it separately, by combining it with
nipkow@9792	68	modus ponens:
nipkow@9689	69	*};
nipkow@9645	70
nipkow@9689	71	lemmas hd_revI = hd_rev[THEN mp];
nipkow@9645	72
nipkow@9645	73	text{*\noindent
nipkow@9645	74	which yields the lemma we originally set out to prove.
nipkow@9645	75
nipkow@9645	76	In case there are multiple premises $A@1$, \dots, $A@n$ containing the
nipkow@9645	77	induction variable, you should turn the conclusion $C$ into
nipkow@9645	78	\[ A@1 \longrightarrow \cdots A@n \longrightarrow C \]
nipkow@9645	79	(see the remark?? in \S\ref{??}).
nipkow@9645	80	Additionally, you may also have to universally quantify some other variables,
nipkow@9645	81	which can yield a fairly complex conclusion.
nipkow@9792	82	Here is a simple example (which is proved by @{text"blast"}):
nipkow@9689	83	*};
nipkow@9645	84
nipkow@9933	85	lemma simple: "\<forall>y. A y \<longrightarrow> B y \<longrightarrow> B y & A y";
nipkow@9689	86	(<)by blast;(>)
nipkow@9645	87
nipkow@9645	88	text{*\noindent
nipkow@9645	89	You can get the desired lemma by explicit
nipkow@9792	90	application of modus ponens and @{thm[source]spec}:
nipkow@9689	91	*};
nipkow@9645	92
nipkow@9689	93	lemmas myrule = simple[THEN spec, THEN mp, THEN mp];
nipkow@9645	94
nipkow@9645	95	text{*\noindent
nipkow@9792	96	or the wholesale stripping of @{text"\<forall>"} and
wenzelm@9941	97	@{text"\<longrightarrow>"} in the conclusion via @{text"rule_format"}
nipkow@9689	98	*};
nipkow@9645	99
wenzelm@9941	100	lemmas myrule = simple[rule_format];
nipkow@9645	101
nipkow@9645	102	text{*\noindent
nipkow@9689	103	yielding @{thm"myrule"[no_vars]}.
nipkow@9645	104	You can go one step further and include these derivations already in the
nipkow@9645	105	statement of your original lemma, thus avoiding the intermediate step:
nipkow@9689	106	*};
nipkow@9645	107
wenzelm@9941	108	lemma myrule[rule_format]: "\<forall>y. A y \<longrightarrow> B y \<longrightarrow> B y & A y";
nipkow@9645	109	(<)
nipkow@9689	110	by blast;
nipkow@9645	111	(>)
nipkow@9645	112
nipkow@9645	113	text{*
nipkow@9645	114	\bigskip
nipkow@9645	115
nipkow@9645	116	A second reason why your proposition may not be amenable to induction is that
nipkow@9645	117	you want to induct on a whole term, rather than an individual variable. In
nipkow@10217	118	general, when inducting on some term $t$ you must rephrase the conclusion $C$
nipkow@10217	119	as
nipkow@10217	120	\[ \forall y@1 \dots y@n.~ x = t \longrightarrow C \]
nipkow@10217	121	where $y@1 \dots y@n$ are the free variables in $t$ and $x$ is new, and
nipkow@10217	122	perform induction on $x$ afterwards. An example appears in
nipkow@10217	123	\S\ref{sec:complete-ind} below.
nipkow@10217	124
nipkow@10217	125	The very same problem may occur in connection with rule induction. Remember
nipkow@10217	126	that it requires a premise of the form $(x@1,\dots,x@k) \in R$, where $R$ is
nipkow@10217	127	some inductively defined set and the $x@i$ are variables. If instead we have
nipkow@10217	128	a premise $t \in R$, where $t$ is not just an $n$-tuple of variables, we
nipkow@10217	129	replace it with $(x@1,\dots,x@k) \in R$, and rephrase the conclusion $C$ as
nipkow@10217	130	\[ \forall y@1 \dots y@n.~ (x@1,\dots,x@k) = t \longrightarrow C \]
nipkow@10217	131	For an example see \S\ref{sec:CTL-revisited} below.
nipkow@9689	132	*};
nipkow@9645	133
nipkow@9689	134	subsection{Beyond structural and recursion induction};
nipkow@9645	135
nipkow@10217	136	text{*\label{sec:complete-ind}
nipkow@9645	137	So far, inductive proofs where by structural induction for
nipkow@9645	138	primitive recursive functions and recursion induction for total recursive
nipkow@9645	139	functions. But sometimes structural induction is awkward and there is no
nipkow@9645	140	recursive function in sight either that could furnish a more appropriate
nipkow@9645	141	induction schema. In such cases some existing standard induction schema can
nipkow@9645	142	be helpful. We show how to apply such induction schemas by an example.
nipkow@9645	143
nipkow@9792	144	Structural induction on @{typ"nat"} is
nipkow@9645	145	usually known as ``mathematical induction''. There is also ``complete
nipkow@9645	146	induction'', where you must prove $P(n)$ under the assumption that $P(m)$
nipkow@9933	147	holds for all $m<n$. In Isabelle, this is the theorem @{thm[source]nat_less_induct}:
nipkow@9933	148	@{thm[display]"nat_less_induct"[no_vars]}
nipkow@9645	149	Here is an example of its application.
nipkow@9689	150	*};
nipkow@9645	151
nipkow@9689	152	consts f :: "nat => nat";
nipkow@9689	153	axioms f_ax: "f(f(n)) < f(Suc(n))";
nipkow@9645	154
nipkow@9645	155	text{*\noindent
nipkow@9645	156	From the above axiom\footnote{In general, the use of axioms is strongly
nipkow@9645	157	discouraged, because of the danger of inconsistencies. The above axiom does
nipkow@9645	158	not introduce an inconsistency because, for example, the identity function
nipkow@9645	159	satisfies it.}
nipkow@9792	160	for @{term"f"} it follows that @{prop"n <= f n"}, which can
nipkow@9645	161	be proved by induction on @{term"f n"}. Following the recipy outlined
nipkow@9645	162	above, we have to phrase the proposition as follows to allow induction:
nipkow@9689	163	*};
nipkow@9645	164
nipkow@9933	165	lemma f_incr_lem: "\<forall>i. k = f i \<longrightarrow> i \<le> f i";
nipkow@9645	166
nipkow@9645	167	txt{*\noindent
nipkow@9933	168	To perform induction on @{term"k"} using @{thm[source]nat_less_induct}, we use the same
nipkow@9645	169	general induction method as for recursion induction (see
nipkow@9645	170	\S\ref{sec:recdef-induction}):
nipkow@9689	171	*};
nipkow@9645	172
wenzelm@9923	173	apply(induct_tac k rule: nat_less_induct);
nipkow@9645	174	(<)
nipkow@9689	175	apply(rule allI);
nipkow@9645	176	apply(case_tac i);
nipkow@9645	177	apply(simp);
nipkow@9645	178	(>)
nipkow@9645	179	txt{*\noindent
nipkow@9645	180	which leaves us with the following proof state:
nipkow@9723	181	\begin{isabelle}
nipkow@9645	182	\ 1.\ {\isasymAnd}\mbox{n}.\ {\isasymforall}\mbox{m}.\ \mbox{m}\ <\ \mbox{n}\ {\isasymlongrightarrow}\ ({\isasymforall}\mbox{i}.\ \mbox{m}\ =\ f\ \mbox{i}\ {\isasymlongrightarrow}\ \mbox{i}\ {\isasymle}\ f\ \mbox{i})\isanewline
nipkow@9645	183	\ \ \ \ \ \ \ {\isasymLongrightarrow}\ {\isasymforall}\mbox{i}.\ \mbox{n}\ =\ f\ \mbox{i}\ {\isasymlongrightarrow}\ \mbox{i}\ {\isasymle}\ f\ \mbox{i}
nipkow@9723	184	\end{isabelle}
nipkow@9792	185	After stripping the @{text"\<forall>i"}, the proof continues with a case
nipkow@9792	186	distinction on @{term"i"}. The case @{prop"i = 0"} is trivial and we focus on
nipkow@9792	187	the other case:
nipkow@9723	188	\begin{isabelle}
nipkow@9645	189	\ 1.\ {\isasymAnd}\mbox{n}\ \mbox{i}\ \mbox{nat}.\isanewline
nipkow@9645	190	\ \ \ \ \ \ \ {\isasymlbrakk}{\isasymforall}\mbox{m}.\ \mbox{m}\ <\ \mbox{n}\ {\isasymlongrightarrow}\ ({\isasymforall}\mbox{i}.\ \mbox{m}\ =\ f\ \mbox{i}\ {\isasymlongrightarrow}\ \mbox{i}\ {\isasymle}\ f\ \mbox{i});\ \mbox{i}\ =\ Suc\ \mbox{nat}{\isasymrbrakk}\isanewline
nipkow@9645	191	\ \ \ \ \ \ \ {\isasymLongrightarrow}\ \mbox{n}\ =\ f\ \mbox{i}\ {\isasymlongrightarrow}\ \mbox{i}\ {\isasymle}\ f\ \mbox{i}
nipkow@9723	192	\end{isabelle}
nipkow@9689	193	*};
nipkow@9645	194
wenzelm@9923	195	by(blast intro!: f_ax Suc_leI intro: le_less_trans);
nipkow@9645	196
nipkow@9645	197	text{*\noindent
nipkow@9645	198	It is not surprising if you find the last step puzzling.
nipkow@9792	199	The proof goes like this (writing @{term"j"} instead of @{typ"nat"}).
nipkow@9792	200	Since @{prop"i = Suc j"} it suffices to show
nipkow@9792	201	@{prop"j < f(Suc j)"} (by @{thm[source]Suc_leI}: @{thm"Suc_leI"[no_vars]}). This is
nipkow@9792	202	proved as follows. From @{thm[source]f_ax} we have @{prop"f (f j) < f (Suc j)"}
nipkow@9792	203	(1) which implies @{prop"f j <= f (f j)"} (by the induction hypothesis).
nipkow@9792	204	Using (1) once more we obtain @{prop"f j < f(Suc j)"} (2) by transitivity
nipkow@9792	205	(@{thm[source]le_less_trans}: @{thm"le_less_trans"[no_vars]}).
nipkow@9792	206	Using the induction hypothesis once more we obtain @{prop"j <= f j"}
nipkow@9792	207	which, together with (2) yields @{prop"j < f (Suc j)"} (again by
nipkow@9792	208	@{thm[source]le_less_trans}).
nipkow@9645	209
nipkow@9645	210	This last step shows both the power and the danger of automatic proofs: they
nipkow@9645	211	will usually not tell you how the proof goes, because it can be very hard to
nipkow@9645	212	translate the internal proof into a human-readable format. Therefore
nipkow@9645	213	\S\ref{sec:part2?} introduces a language for writing readable yet concise
nipkow@9645	214	proofs.
nipkow@9645	215
nipkow@9792	216	We can now derive the desired @{prop"i <= f i"} from @{text"f_incr"}:
nipkow@9689	217	*};
nipkow@9645	218
wenzelm@9941	219	lemmas f_incr = f_incr_lem[rule_format, OF refl];
nipkow@9645	220
nipkow@9689	221	text{*\noindent
nipkow@9792	222	The final @{thm[source]refl} gets rid of the premise @{text"?k = f ?i"}. Again,
nipkow@9792	223	we could have included this derivation in the original statement of the lemma:
nipkow@9689	224	*};
nipkow@9645	225
wenzelm@9941	226	lemma f_incr[rule_format, OF refl]: "\<forall>i. k = f i \<longrightarrow> i \<le> f i";
nipkow@9689	227	(<)oops;(>)
nipkow@9645	228
nipkow@9645	229	text{*
nipkow@9645	230	\begin{exercise}
nipkow@9792	231	From the above axiom and lemma for @{term"f"} show that @{term"f"} is the
nipkow@9792	232	identity.
nipkow@9645	233	\end{exercise}
nipkow@9645	234
nipkow@10236	235	In general, @{text induct_tac} can be applied with any rule $r$
nipkow@9792	236	whose conclusion is of the form ${?}P~?x@1 \dots ?x@n$, in which case the
nipkow@9645	237	format is
nipkow@9792	238	\begin{quote}
nipkow@9792	239	\isacommand{apply}@{text"(induct_tac"} $y@1 \dots y@n$ @{text"rule:"} $r$@{text")"}
nipkow@9792	240	\end{quote}\index{*induct_tac}%
nipkow@9792	241	where $y@1, \dots, y@n$ are variables in the first subgoal.
nipkow@10236	242	A further example of a useful induction rule is @{thm[source]length_induct},
nipkow@10236	243	induction on the length of a list:\indexbold{*length_induct}
nipkow@10236	244	@{thm[display]length_induct[no_vars]}
nipkow@10236	245
nipkow@9792	246	In fact, @{text"induct_tac"} even allows the conclusion of
nipkow@9792	247	$r$ to be an (iterated) conjunction of formulae of the above form, in
nipkow@9645	248	which case the application is
nipkow@9792	249	\begin{quote}
nipkow@9792	250	\isacommand{apply}@{text"(induct_tac"} $y@1 \dots y@n$ @{text"and"} \dots\ @{text"and"} $z@1 \dots z@m$ @{text"rule:"} $r$@{text")"}
nipkow@9792	251	\end{quote}
nipkow@9689	252	*};
nipkow@9645	253
nipkow@9689	254	subsection{Derivation of new induction schemas};
nipkow@9689	255
nipkow@9689	256	text{*\label{sec:derive-ind}
nipkow@9689	257	Induction schemas are ordinary theorems and you can derive new ones
nipkow@9689	258	whenever you wish. This section shows you how to, using the example
nipkow@9933	259	of @{thm[source]nat_less_induct}. Assume we only have structural induction
nipkow@9689	260	available for @{typ"nat"} and want to derive complete induction. This
nipkow@9689	261	requires us to generalize the statement first:
nipkow@9689	262	*};
nipkow@9689	263
nipkow@9933	264	lemma induct_lem: "(\<And>n::nat. \<forall>m<n. P m \<Longrightarrow> P n) \<Longrightarrow> \<forall>m<n. P m";
nipkow@9689	265	apply(induct_tac n);
nipkow@9689	266
nipkow@9689	267	txt{*\noindent
nipkow@9792	268	The base case is trivially true. For the induction step (@{prop"m <
nipkow@9933	269	Suc n"}) we distinguish two cases: case @{prop"m < n"} is true by induction
nipkow@9933	270	hypothesis and case @{prop"m = n"} follows from the assumption, again using
nipkow@9689	271	the induction hypothesis:
nipkow@9689	272	*};
nipkow@9689	273	apply(blast);
nipkow@9933	274	by(blast elim:less_SucE)
nipkow@9689	275
nipkow@9689	276	text{*\noindent
nipkow@9792	277	The elimination rule @{thm[source]less_SucE} expresses the case distinction:
nipkow@9689	278	@{thm[display]"less_SucE"[no_vars]}
nipkow@9689	279
nipkow@9689	280	Now it is straightforward to derive the original version of
nipkow@9933	281	@{thm[source]nat_less_induct} by manipulting the conclusion of the above lemma:
nipkow@9792	282	instantiate @{term"n"} by @{term"Suc n"} and @{term"m"} by @{term"n"} and
nipkow@9792	283	remove the trivial condition @{prop"n < Sc n"}. Fortunately, this
nipkow@9689	284	happens automatically when we add the lemma as a new premise to the
nipkow@9689	285	desired goal:
nipkow@9689	286	*};
nipkow@9689	287
nipkow@9933	288	theorem nat_less_induct: "(\<And>n::nat. \<forall>m<n. P m \<Longrightarrow> P n) \<Longrightarrow> P n";
nipkow@9689	289	by(insert induct_lem, blast);
nipkow@9689	290
nipkow@9933	291	text{*
nipkow@10186	292
nipkow@9645	293	Finally we should mention that HOL already provides the mother of all
paulson@10241	294	inductions, \textbf{well-founded
paulson@10241	295	induction}\indexbold{induction!well-founded}\index{well-founded
paulson@10241	296	induction\|see{induction, well-founded}} (@{thm[source]wf_induct}):
nipkow@10186	297	@{thm[display]wf_induct[no_vars]}
paulson@10241	298	where @{term"wf r"} means that the relation @{term r} is well-founded
paulson@10241	299	(see \S\ref{sec:well-founded}).
nipkow@9933	300	For example, theorem @{thm[source]nat_less_induct} can be viewed (and
nipkow@9933	301	derived) as a special case of @{thm[source]wf_induct} where
nipkow@10186	302	@{term r} is @{text"<"} on @{typ nat}. The details can be found in the HOL library.
paulson@10241	303	For a mathematical account of well-founded induction see, for example, \cite{Baader-Nipkow}.
nipkow@9689	304	*};
nipkow@9645	305
nipkow@9645	306	(<)
nipkow@9645	307	end
nipkow@9645	308	(>)

author	nipkow
	Wed, 18 Oct 2000 17:19:18 +0200
changeset 10242	028f54cd2cc9
parent 10241	e0428c2778f1
child 10281	9554ce1c2e54
permissions	-rw-r--r--