next up previous


Agent Based Mathematical Reasoning
Case for Support

Manfred Kerber
15 July 1999

Description of the Proposed Research and its Context




Motivation

A classical approach to model intelligence consists of carefully selecting a knowledge representation formalism and then modelling parts of a domain in this formalism. The power of the reasoner depends on the ability to reason in the knowledge representation formalism. In classical approaches to planning, the situation calculus [31] and in STRIPS [15], the domain is modelled by an initial state, a goal state, and planning operators. Proof planning with abstraction also makes use of a model of the world and can be viewed as a deliberative approach to solve reasoning tasks.

As an antithesis to this classical AI paradigm (and in particular to the planning paradigm) an approach has been developed that explicitly does not make use of knowledge representation and complicated deliberations (such as planning the best next step in the proof search). Brooks phrased it as ``Intelligence without Reason'' [8]. In this approach it is possible to obtain complex, apparently goal directed and intentional behaviour which has no long term internal state and no internal communication. This is referred to as a reactive form of modelling behaviour. Its key notions are: Situatedness (``The world is its own best model.''), embodiment (``The world grounds regress.''), Intelligence (``Intelligence is determined by the dynamics of interaction with the world.''), and Emergence (``Intelligence is in the eye of the observer.'').

Although the machine-oriented approaches were not designed in the light of recent work in reactive systems, they can be reinterpreted in this framework. The main aspect is the locality of the search for a solution. For instance, when we consider binary resolution theorem proving, the decision on which two literals to perform a resolution step is often made on the basis of the knowledge of the current proof state only. That is, it does not depend on what has been done previously (of course this view simplifies matters and is not true in a strict sense for all strategies). In particular, there is no overall long term strategy to derive the empty clause. We can view the behaviour of the theorem prover as a reactive process: the world consists of clauses and there is no abstract model of these clauses. The theorem prover acts directly in this world, and the behaviour is determined by the interaction with the world. It is a characteristic of reactiveness that some reactive systems such as  normally do not do any backtracking. Furthermore, some reactive systems do complete restarts when the search for a proof is lost in the search space. Such restarts can also be viewed as a typical characteristic of reactive systems. For a detailed discussion of Brooks' approach and its relationship to theorem proving see [10].

Recent years have seen an attempt to reconcile the deliberative and the reactive approaches in single agent architectures [36]. This is partly motivated by looking at the human way of acting and reasoning which can be better explained as a combination of the two cases rather than by any one of them alone. Also, practical issues play an important rôle: in certain cases reactive behaviour is computationally more efficient, while in others reactive behaviour gets stuck. In the latter case deliberative behaviour can sometimes prevent blocking of a reasoning process.

Agent based mathematical reasoning

A weakness of most state of the art reasoning systems is that they usually follow rigid and inflexible solution strategies in their search for proofs. Instead, human mathematicians use -- depending on their level of expertise -- ``a colourful mixture of proof strategies'' (as Wittgenstein phrases it). In an attempt to prove a mathematical theorem they typically first try a well known standard technique in the focus of the mathematical theory. If this technique does not lead to the wanted results in a reasonable amount of time, they may doubt that the theorem holds at all and look for a counterexample. If this also fails, they may try again by widening or deepening the proof search.

The aim of this research project is to emulate this flexible problem solving behaviour of human mathematicians in an agent based reasoning approach. Thus, our system shall reflect at least some of the ideas of a sophisticated and experienced problem solver as described by Pólya in [34], p. 64: $\ldots$ ``when he does not succeed in guessing the whole answer, he tries to guess some part of the answer, some feature of the solution some approach to the solution, or some feature of an approach to the solution. Then he seeks to expand his guess, and so he seeks to adapt his guess to the best information he can get at the moment.''

Agents allow a number of proof search attempts to be executed in parallel. Each agent may try a different proof strategy to find the proof of a conjecture. Hence, a number of different proof strategies are used at the same time in the proof search. However, following all the available strategies simultaneously would quickly consume the available system resources consisting of computation time and memory space. In order to prevent this, and furthermore to guide the proof search we propose to develop and employ a resource management concept into proof search. Resource management is a technique which distributes the available resources amongst the available agents. Periodically, it assesses the state of the proof search process, evaluates the progress and redistributes the available resources accordingly. Hence, only successful or promising proof attempts will be allowed to continue searching for the proof. This process is repeated until the proof is found, or some other terminating condition is reached. Of course, the evaluation of the success of a proof strategy is crucial to determine the amount of resources that is allocated for an agent. This evaluation is based on the contribution that the agent has made in the proof attempt as well as on its prospect for the rest of the search. For example, a favourable contribution is a partial problem solution.

As will be illustrated below, the proposed system shall tackle mathematical problems that are currently not automatically solvable by any of the embedded systems alone (neither by any other system). In one particular example of [3] after splitting the initial proof goal into three subgoals, higher-order equality and extensionality reasoning is required. Whereas the first two subgoals can be solved by the higher-order prover  [1], the third subproblem, which requires a fair amount of extensionality reasoning, should be solvable in a cooperation of the higher-order extensionality prover  [4,2] with a first-order system like  [37]. The system we propose shall be able to organise the required cooperation between the integrated systems in a goal oriented way in order to solve such kinds of problems automatically.

Classical first-order theorem provers (e.g. , , , ), analytical provers (e.g.  or ), and completion methods (e.g. ) have reached a considerable reasoning power. This is underlined by the recent solution of the Robbins problem by  [30]. However, these traditional systems follow fixed search strategies, which are unlikely to fully model the problem solving expertise of real mathematicians. Also classical higher-order theorem provers like  [1], , or the -system [4,2] get lost in the enormous search spaces stretched on a fine-grained calculus level.

As an alternative approach Bundy developed proof planning [9], which instead of a fine-grained calculus level search looks for promising proof plans on a more abstract level by chaining proof methods. State of the art proof planner are , , and the proof planner of  [7].

Agent architectures for automatic and interactive theorem proving are discussed in [16,14] and the preliminary work of the research fellow in this research field is described in [5,6].

An advantage of (which will form the basis of this project) is that it already provides various integrated classical reasoning systems (e.g. all the ones mentioned above) as well as some specialised decision procedures (e.g. a constraint solver and the integrated computer algebra systems Maple and $\mu$ CAS [28]), and a basic analogy module [33]. Additional features are a multi-modal graphical user interface [35], a proof verbalisation tool [21] and a connected database of mathematical theories. Using the agent architecture [17], most of these integrated system can be distributed over the internet. Information on successful or unsuccessful proof attempts of the integrated systems (e.g. partial proofs) can be translated back into 's central proof data structure, which is based on a higher-order variant of Gentzen's natural deduction calculus.

Human proof search behaviour can best be modelled as a mixture of proof planning, classical theorem proving, computing, and model generation. In (and related systems like  [12], which integrates only rather homogeneous first-order reasoning systems) this has mainly to be done by the user and rather poor support is provided for a fruitful and guided cooperation of the available subsystem. this weakness will be addressed in the proposed project. A particular problem of automated reasoning system is their ``brittleness'', that is, the feature that minor changes in the search strategy of in the problem formulation may totally change the behaviour of the system. The approach advocated in this proposal should at least partly remedy this problem, since by its explicit resource handling it enables explicit redirection of resources without abandoning lines of search altogether.

Description of the research project

The system will provide a powerful architecture for reasoning systems and a society of specialised reasoning agents (which are aware of their own capabilities and probably even of those of the others). Initially the given mathematical problem is investigated in order to estimate and classify the potential of the available solution strategies, i.e., agents, for solving this problem. Depending on the evaluation process an initial resource distribution is computed, e.g. the main strategy line is manifested. In a first approach we will develop an infrastructure allowing the user to distribute the resources. An automatic evaluation module will be added to the system later. The goal is not to remove the human in this process, but to build an automated mathematical assistant. The evolution approach and resource approach should strongly facilitate the communication between the human and the machine.

After consuming the available resources, the reasoning agents terminate and investigate whether they have produced useful information or not. For instance the -agent could look for the shortest derived clauses in order to estimate how close it is to a proof. The contributions of the  system, which already returns a partial proof-plan to the -system, may be evaluated using criteria like the complexity and number of the remaining open subproblems (e.g. the only open subproblem might be a first-order goal whereas the original problem was a higher-order one; then the partial proof may be communicated to other systems and the open subgoal can be passed to first-order provers). Depending on the evaluation of the agents contributions, a new resource distribution is computed.

  
agent-planning-figure
Figure: The reasoning process - iterative allocation of resources to proof agents (PAx) by assessment/evaluation, and the subsequent construction of a proof of a given theorem.

The most challenging research tasks are:
(i) The extension of 's underlying -architecture [17] by suitable resource distribution and communication facilities. Whereas the -architecture provides an excellent basis for distributing reasoning systems over the Internet it does not solve the question how to guide a fruitful co-operation between the involved systems. Therefore we want to adapt the blackboard mechanism underlying 's interactive suggestion mechanism [5,6].
(ii) The development and realisation of suitable evaluation criteria; obvious candidates are the simplicity/complexity of partial proofs, the theory/logic a subproblem belongs to (e.g. first-order logic, set theory), or the similarity of open subproblems to already solved problems stored in the database (note that already provides a basic analogy component).
(iii) The extension of the system, such that it allows a grouping of homogeneous agents tackling similar kinds of problems into one single meta-agent. For instance, it may be useful to group classical first-order reasoners together to form a centre of expertise for classical first-order logic. Optimally such centres of expertise may use an analogous mechanism to the overall system to organise the communication between its systems (sub-agents) and to further distribute the resources it obtains at the upper level. The systems in a centre of expertise can be evaluated using more fine-grained evaluation criteria and more homogeneous system communications can be realised. In particular, for quite homogeneous (e.g. first-order) theorem provers experiments of this kind have been successful (cf. the [14] or [16]).

Example

The proposed system should be able to tackle mathematical problems that are currently not automatically solvable by any of the embedded systems alone (nor by any other system). An example of such a problem is described in detail in [3], it is briefly summarised here. The problem states that if there is a partition p of some set, then there is an equivalence relation q whose equivalence classes are exactly the elements of p:

\begin{displaymath}\arraycolsep0pt\begin{array}{ll}
\forall p \sdot \mbox{\it pa...
...\wedge\\
&(\mbox{\it equivalence-classes}(q) = p))
\end{array}\end{displaymath}

Note that partition, equivalence-classes, and equivalence-rel are derived higher-order concepts defined in the knowledge base of mathematical theories.

The search for a proof of this theorem has not been automated yet. We indicate here how the proof might be found within our proposed architecture. First, the initial proof goal is split into three subgoals (e.g., with the help of a proof planner). Namely, from a given partition p we can derive the existence of an equivalence relation q, constituting subgoal (1). For the same equivalence relation q it holds that its equivalence classes are exactly p. The two directions of the set equality give us subgoals (2) and (3). Next, higher-order equality and extensionality reasoning is required. The first two subgoals (1) and (2) can be solved automatically by the higher-order prover  [1]. The last subproblem (3), which requires a fair amount of extensionality reasoning, we expect to be solvable by cooperation between the higher-order extensionality prover  [4] and a first-order automated theorem prover. provides the necessary higher-order extensionality treatment, however, it cannot cope with the large number of first-order clauses that are generated subsequently. Therefore, this set of clauses could be passed via to the first-order specialist available within our agent society. Our proposed system will be able to organise the sketched cooperation between the integrated systems in a goal oriented way in order to solve such kinds of problems automatically.

Work plan

Months 1-3: Realisation of a suitable integration and extension (resource concept, communication aspects) of the agent based architectures described in [5,17,6] as a basis architecture for the proposed system. Elaboration of the detailed system architecture.
Months 4-6: Integration of some reasoners already provided by in the new architecture (e.g., 's proof planner, , , , ).
Months 7-9: First cooperatively developed proofs (e.g., for the examples in [3]) based on user defined resource distributions. Development and realisation of automatic evaluation criteria.
Months 10-12: First cooperatively developed proofs based on automatic resource distributions. Development of additional evaluation criteria and fine-tuning. Integration of additional systems, e.g., a model-checker and a computer algebra system. Experiments on conditions for the stability of reasoning.

Expected results

The project will demonstrate whether a flexible agent approach is capable of modelling significant features of human problem solving behaviour. If yes, the system will be able to improve the capacity of to prove difficult mathematical theorems in an interactive as well as in a stand alone mode. Not only should it be possible to prove a certain class of them for the first time fully automatically, interaction and user guidance should be strongly extended.

In contrast to former agent-based reasoning approaches (e.g. [14,38,13,12]) the proposed system focuses on the integration of quite heterogeneous systems (e.g. proof planner, higher-order prover, first-order prover, and model generators) on a more abstract communication layer.

Bibliography

1
P. Andrews et al.
TPS: A theorem proving system for classical type theory.
Journal of Automated Reasoning, 16(3):321-353, 1996.

2
C. Benzmüller.
Equality and Extensionality in Automated Higher-Order Theorem Proving, PhD thesis, FB Informatik, Universität des Saarlandes, submitted in April 1999.

3
C. Benzmüller, M. Bishop, and V. Sorge.
Integrating TPS and $\Omega$MEGA.
Journal of Universal Computer Science, 5(3):188-207, 1999.

4
C. Benzmüller and M. Kohlhase.
LEO -- a higher-order theorem prover.
In C. Kirchner and H. Kirchner, eds., Proc. of the 15th Conference on Automated Deduction, LNAI 1421, pp. 139-144, Lindau, Germany, 1998. Springer.

5
C. Benzmüller and V. Sorge.
A blackboard architecture for guiding interactive proofs.
In F. Giunchiglia, ed.
Proc. of the of the 8th International Conference (AIMSA'98), LNAI 1480, pp. 102-114, Sozopol, Bulgaria, 1998. Springer.

6
C. Benzmüller and V. Sorge.
Critical agents supporting interactive theorem proving.
SEKI-Report SR-99-02, FB Informatik, Universität des Saarlandes, 1999.

7
C. Benzmüller et al.
: Towards a mathematical assistant.
In W. McCune, ed., of the 14th , pp. 252-255, Townsville, , 1997. Springer, 1249.

8
R. A. Brooks.
Intelligence without reason.
Memo No. 1293, MIT, 1991.

9
A. Bundy.
The use of explicit plans to guide inductive proofs.
In E. Lusk and R. Overbeek, eds., of the 9th , pp. 111-120, Argonne, Illinois, USA, 1988. Springer, 310.

10
A. Bundy.
A comparison of proof planning methodologies.
Blue Book Note 966, University of Edinburgh, Edinburgh, , 1994.

11
L. Cheikhrouhou.
Planning diagonalization proofs.
In G. Brewka, C. Habel, and B. Nebel, eds., of the 21st Annual German Conference on Artificial Intelligence, KI-97, pp. 377-380, Freiburg, Germany, 1997. Springer, LNAI 1303.

12
I. Dahn.
Integration of automated and interactive theorem proving in ILF.
In McCune [29], pp. 57-60.

13
J. Denzinger.
Knowledge-based distributed search using teamwork.
In Proc. ICMAS-95, pp. 81-88, San Francisco, 1995.

14
J. Denzinger and I. Dahn. Cooperating theorem provers. In W. Bibel and P. H. Schmitt, eds., Automated Deduction, a Basis for Applications, Vol. II, 1989. Kluwer.

15
R. E. Fikes and N. J. Nilsson.
STRIPS: A new approach to the application of theorem proving to problem solving.
, 2:189-208, 1971.

16
M. Fisher.
An open approach to concurrent theorem proving.
In J. Geller, H. Kitano, and C. Suttner, eds., Parallel Processing for Artificial Intelligence, volume 3. Elsevier/North Holland, 1997.

17
A. Franke and M. Kohlhase.
System description: Mathweb, an agent-based layer for distributed automated theorem proving.
In the Proc. of the 16th International Conference on Automated Deduction (CADE-16), Trento, Italy, 1999.

18
X. Huang and M. Kerber.
Reuse of proofs by meta-methods.
In J. Köhler, F. Giunchiglia, C. Green, and C. Walther, eds., Working Notes of the IJCAI-Workshop on Formal Approaches to the Reuse of Plans, Proofs, and Programs, pp. 46-50, Montréal, , 1995.

19
X. Huang, M. Kerber, and L. Cheikhrouhou.
Adapting the diagonalization method by reformulations.
In A. Y. Levy and P. P. Nayak, eds., of SARA-95 - Symposium on Abstraction, Reformulation, and Approximation, pp. 78-85, Ville d'Esterel, , 1995.

20
X. Huang, M. Kerber, M. Kohlhase, E. Melis, D. Nesmith, J. Richts, and J. Siekmann.
: A proof development environment.
In Alan Bundy, ed., of the 12th , pp. 788-792, Nancy, 1994. Springer, 814.

21
X. Huang and A. Fiedler.
Proof verbalization in PROVERB.
In Proc. of the First International Workshop on Proof Transformation and Presentation, pp. 35-36, Schloss Dagstuhl, Germany, 1997.

22
X. Huang, M. Kerber, and L. Cheikhrouhou.
Adaptation of Declaratively Represented Methods in Proof Planning. Annals of Mathematics and Artificial Intelligence, 23, (3-4), 299-320, 1998.

23
M. Jamnik.
Automating Diagrammatic Proofs of Arithmetic Arguments.
PhD thesis, University of Edinburgh, 1999.

24
M. Kerber.
Proof planning: A practical approach to mechanized reasoning in mathematics.
In W. Bibel and P. H. Schmitt, eds., Automated Deduction, a Basis for Application, chapter III.4, pp. 77-95. Kluwer, 1998.

25
M. Kerber, M. Kohlhase, and V. Sorge.
Integrating computer algebra with proof planning.
In J. Calmet and C. Limogelli, eds., Design and Implementation of Symbolic Computation Systems, DISCO'96, LNCS 1128, pp. 204-215, Karlsruhe, Germany, 1996. Springer.

26
M. Kerber, M. Kohlhase, and V. Sorge.
An integration of mechanised reasoning and computer algebra that respects explicit proofs.
Technical Report CSRP-96-9, University of Birmingham, School of Computer Science, June 1996.
also as: SEKI Report SR-96-06.

27
M. Kerber and A. C. Sehn.
Proving ground completeness of resolution by proof planning.
In D. D. Dankel, ed., FLAIRS-97 - Proceedings of the Tenth International Florida Artificial Intelligence Research Symposium, pp. 372-376, Daytona Beach, Florida, USA, 1997.

28
M. Kerber, M. Kohlhase, and V. Sorge.
Integrating Computer Algebra Into Proof Planning.
Journal of Automated Reasoning, 21(3):327-355, 1998.

29
W. McCune, ed.
Proc. of the 14th Conference on Automated Deduction, LNAI 1249, Townsville, Australia, 1997. Springer.

30
W. McCune.
Solution of the Robbins problem.
Journal of Automated Reasoning, 19(3):263-276, 1997.

31
J. McCarthy and P. Hayes.
Some philosophical problems from the standpoint of artificial intelligence.
Machine Intelligence, 4:463-502, 1969.

32
E. Melis.
Representing and reformulating diagonalization methods.
Technical Report CMU-CS-94-174, School of Computer Science, Carnegie Mellon University, Pittsburgh, Pennsylvania, USA, 1994.

33
E. Melis.
A model of analogy-driven proof-plan construction.
In C. S. Mellish, ed., Proc. of the 14th International Joint Conference on Artificial Intelligence (IJCAI95), pp. 182-189, Montréal, Canada, 1995. Morgan Kaufmann.

34
G. Polya.
Mathematical Discovery.
John Wiley & Sons, 3rd edition, 1981.

35
J. Siekmann et al.
$\LOUI$: Lovely $\OMEGA$ User Interface.
Special Issue on Integrated Systems of the Journal of Formal Aspects of Computer Science, 1999.
forthcoming.

36
A. Sloman.
Architectural requirements for human-like agents - both natural and artificial. (what sorts of machines can love?).
In K. Dautenhahn, ed., Human Cognition and Social Agent Technology. John Benjamins Publishing, 1999.

37
C. Weidenbach.
SPASS: Version 0.49.
Journal of Automated Reasoning, 18(2):247-252, 1997.

38
A. Wolf
p-SETHEO: Strategy Parallelism in Automated Theorem Proving.
In Proc. TABLEAUX 1998, pp. 320-324, LNCS 1397, 1998. Springer.

About this document ...

Agent Based Mathematical Reasoning
Case for Support

This document was adapted from a file using the LaTeX2HTML translator Version 98.1p1 release (March 2nd, 1998)

Copyright © 1993, 1994, 1995, 1996, 1997, Nikos Drakos, Computer Based Learning Unit, University of Leeds.

The command line arguments were:
latex2html -split 0 epsrc-html.tex.

The translation was initiated by Manfred Kerber on 2001-03-28


next up previous
Manfred Kerber
2001-03-28