#### AN ABSTRACT OF THE THESIS OF | Abstract approved: | Redacted for privacy | |---------------------|---------------------------------------------| | | Q. | | Title: DESIGNING A | HIGH LEVEL MICROPROGRAMMING LANGUAGE | | inELECTRICAL AND CO | MPUTER ENGINEERING presented on 4 JUNE 1979 | | KAMRAN MALIK | for the degree of DOCTOR OF PHILOSOPHY | TED LEWIS The purpose of this research is to design a high level language (HLL) suitable for microprogramming. A top down design technique has been adopted which makes the language design process simple and accurate. The primitive operations of a high level language for producing emulators is shown to include special purpose features specific to virtual machine implementations. A hierarchy of data types, short reliable language constructs, and control structures that minimize emulator complexity are suggested by a goal-directed, structural design methodology. In addition, structural clues generated by the language compiler assist in producing portable yet efficient horizontal microcode for partially encoded host architectures. Furthermore, software tools in the form of a simulator-compiler combination are presented which provide features for design, development, test and eventual certification of microprograms. C Copyright by Kamran Malik 4 June 1979 All Rights Reserved # DESIGNING A HIGH LEVEL MICROPROGRAMMING LANGUAGE by #### KAMRAN MALIK A THESIS submitted to Oregon State University in partial fulfullment of the requirements for the degree of Doctor of Philosophy Completed 4 June 1979 Commencement June 1980 | Þ. | | | | | |----|--|--|--|--| | | | | | | | | | | | | ## Redacted for privacy Associate Professor of Computer Science in charge of major # Redacted for privacy Head of Department - Electrical and Computer Engineering # Redacted for privacy | <b>v</b> <sub>A</sub> | | one African | |-------------------------|-----------------------------------------|-------------| | Dean of Graduate School | *************************************** | | Date thesis is presented 4 June 1979 Typed by Cheryl DeHart for KAMRAN MALIK #### ACKNOWLEDGEMENT I would like first of all, to thank my supervisor, Dr. Ted Lewis, for his help, guidance and constant encouragement during the preparation of this dissertation. The innumerable conversations, which I have had with him over the years, have so vastly influenced my philosophical views on computers and computation, that it is an intellectual debt I shall always carry with me. I also thank the members of my committee and, in particular, Dr. V. M. Powers for his careful reading and critical comments on this dissertation. I cannot express the thanks I owe my parents through whom I have received everything. The computer programs were made possible by the facilities and staff of the O.S.U. Computing Center. In addition, the work reported in this dissertation was supported by N.S.F. grant MCS 76-20710. Last but not least, I would like to thank Cheryl DeHart for her relentless typing of this thesis. #### TABLE OF CONTENTS | | ACKN | OWLEDGEME | NTS | PAGE<br>i | |----|------|-----------|-----------------------------------------------|-----------| | | | E OF CONT | | ii | | | | OF FIGUR | | V | | | | OF TABLE | | vii | | 1. | INTR | ODUCTION | | . 1 | | | 1.1 | EVOLUTIO | N OF MICROPROGRAMMING | 1 | | | | 1.1.1 E | CONOMIC ADVANTAGE | 2 | | | | 1.1.2 A | RCHITECTURAL COMPATIBILITY | 2 | | | | 1.1.3 D | ELAYED BINDING | 2 | | | 1.2 | MOTIVATI | ОИ | 4 | | | | 1.2.1 H | ARDWARE DESCRIPTION LANGUAGES | 6 | | | | 1.2.2 T | AILORED LANGUAGES | 6 | | | | 1.2.3 M | ACHINE INDEPENDENT LANGUAGES | 7 | | | 1.3 | OBJECTIV | ES | 12 | | | | | | | | 2. | A LA | NGUAGE FO | R MICROPROGRAMMING | 13 | | | 2.1 | INTRODUC | TION | 13 | | | 2.2 | BASIS OF | LANGUAGE | 14 | | | 2.3 | LANGUAGE | DESIGN | 15 | | | 2.4 | DEFINITI | ON OF VMPL | 34 | | | | 2.4.1 D | ECLARATIONS | . 34 | | | | 2 | .4.1.1 VARIABLE DECLARATIONS | 34 | | | | 2 | .4.1.2 INFORMATION DECLARATIONS | 37 | | | | 2.4.2 A | SSIGNMENT STATEMENTS, OPERATORS & EXPRESSIONS | 47 | | | | 2.4.3 C | ONTROL CONSTRUCTS | 54 | | | | 2.4.4 M | ISCELLANEOUS STATEMENTS | 58 | | | | 2.4.5 P | ROGRAM STRUCTURE | 58 | | | 2.5 | CONCLUST | ONS | 60 | | 3. | MACH | INE CONSIDERATIONS | 62 | |----|------|-------------------------------------------|-----| | | 3.1 | INTRODUCTION | 62 | | | 3.2 | m-COMPUTER ARCHITECTURAL FEATURES | 64 | | | - | 3.2.1 STORAGE FACILITIES | 64 | | | | 3.2.2 FUNCTIONAL UNITS | 68 | | | | 3.2.3 DATA WIDTHS | 70 | | | | 3.2.4 BUSSES | 72 | | | | 3.2.5 MICROINSTRUCTIONS | 72 | | | | 3.2.6 INPUT, OUTPUT & INTERRUPT STRUCTURE | 73 | | | 3.3 | m-COMPUTER PRIMITIVES | 76 | | | 3.4 | m-COMPUTER CLASSES | 80 | | | | 3.4.1 CLASS 1. | 81 | | | | 3.4.2 CLASS 2. | 81 | | | | 3.4.3 CLASS 3. | 82 | | | | 3.4.4 CLASS 4. | 83 | | | | 3.4.5 CLASS 5. | 84 | | | | 3.4.6 CLASS 6. | 85 | | | 3.5 | CONCLUSIONS | 85 | | 4. | ₽∩₽Ͳ | ABILITY | 86 | | • | | INTRODUCTION | 86 | | | | CLASSIFICATION OF TECHNIQUES | 86 | | | | 4.2.1 CONVERSION TECHNIQUES | 86 | | | | 4.2.2 INHERENTLY PORTABLE TECHNIQUES | 87 | | | 4.3 | APPLICATION TO MICROPROGRAM PORTABILITY | 90 | | | | 4.3.1 THE MALLETT-LEWIS MODEL | 91 | | | 4.4 | CONCLUSIONS | 91 | | | | | | | 5. | INTE | RMEDIATE MACHINE LANGUAGE | 94 | | | 5.1 | INTRODUCTION | 94 | | | 5.2 | IML FORMATS & ATTRIBUTES | 94 | | | 5.3 | INTERMEDIATE LANGUAGE | 107 | | | | 5.3.1 IISG | 108 | | | | | iv | |----|------|---------------------------------------|-----| | | | 5.3.2 IESG | 110 | | | | 5.3.1.1 IESG FORMATS | 110 | | | | 5.3.1.2 IESG STATEMENTS | 116 | | | 5.4 | CONCLUSIONS | 118 | | | | | | | 5. | RESU | ITS | 123 | | | 6.1 | SIMULATOR | 123 | | | 6.2 | VMPL COMPILER | 125 | | | 6.3 | VMPL-IML ANALYSIS | 126 | | | | 6.3.1 COUNTING EXPERIMENTS | 127 | | | | 6.3.2 VARIABLE ASSIGNMENT EXPERIMENTS | 132 | | | 6.4 | VMPL EXTENSIONS | 136 | | | 6.5 | CONCLUSIONS & FUTURE WORK | 138 | | | REFE | RENCES | 139 | | | | NDIX A | 144 | | | | NDIX B | 147 | | | | NDIX C | 150 | | | | | | GLOSSARY 172 #### LIST OF FIGURES | 1.1 | COST OF MICROPROGRAMMED CONTROL VS. CONVENTIONAL CONTROL. | 3 | |------|----------------------------------------------------------------------------------------------|-------| | 1.2 | AN EXAMPLE PROGRAM WRITTEN IN MPL. | . 8 | | 1.3 | AN EXAMPLE DATA TYPE WRITTEN IN EMPL. | 10 | | 2.1 | CARRY FLAG SETUP SCHEME. | 19 | | 2.la | CARRY FLAG SETUP AS A SUBROUTINE | 20 | | 2.2 | OVERFLOW FLAG SETUP SCHEME. | 21 | | 2.3 | DEFINITION OF D'STRUCTURES | 27/28 | | 2.4 | VARIABLE ALLOCATION PROBLEM. | 32 | | 2.5 | VMPL GLOBAL AND LOCAL VARIABLE DECLARATIONS. | 38 | | 2.6 | VMPL GLOBAL INFORMATION DECLARATIONS. | 40 | | 2.7 | WORDSIZE DECLARATION EQUIVALENTS. | 41 | | 2.8 | VMPL LOCAL INFORMATION DECLARATIONS. | 43 | | 2.9 | EXPECT & RETURN DECLARATION EXAMPLES. | 45/46 | | 2.10 | EXAMPLES OF VMPL FIELD FACILITY. | 51 | | 2.11 | PRECEDENCE RULES FOR VMPL OPERATORS. | 53 | | 2.12 | ENABLING VMPL FLAGS. | 55 | | 2.13 | VMPL CONTROL CONSTRUCTS. | 57 | | 2.14 | MISCELLANEOUS VMPL STATEMENTS. | 59 | | 3.1 | MICROPROGRAMMED IMPLEMENTATION OF HIGH LEVEL LANGUAGES. | 65 | | 3.2 | OVERVIEW OF m-COMPUTER ARCHITECTURAL FEATURES. | 66 | | 3.3 | HIERARCHY OF INSTRUCTIONS. | 69 | | 3.4 | SOME ALU STRUCTURES. | 71 | | 3.5 | NO ENCODING (a) vs. SINGLE LEVEL ENCODING (b). | 74 | | 3.6 | MICROINSTRUCTION ENCODINGS. | 75 | | 3.7 | A GENERAL ALU STRUCTURE. | 77 | | 4.1 | HIGH LEVEL LANGUAGES AND COMPLICATION. | 89 | | 4.2 | THE MALLETT-LEWIS MODEL OF A TRANSLATION SYSTEM FOR PRODUCING INHERENTLY PORTABLE MICROCODE. | 92 | | 5.la | A SMALL PART OF A VMPL PROGRAM. | 100 | |------|------------------------------------------------------|-----| | 5.1b | THREE ADDRESS CODE FOR SOURCE PROGRAM OF Fig. 5.la. | 101 | | 5.1c | SINGLE ADDRESS CODE FOR SOURCE PROGRAM OF Fig. 5.la. | 102 | | 5.2 | OTHERINFORMATION VALUES FOR IISG. | 111 | | 5.3 | VMPL-IML EQUIVALENTS. | 112 | | 5.4 | CLASS 6 IESG STATEMENTS. | 120 | | 5.5 | CLASS 7 IESG OPERATIONS. | 121 | | | | | | 6.1 | MICROCODE CORRECTNESS MODEL. | 124 | | 6.2 | RESULTS OF COUNTING EXPERIMENT CE(2). | 129 | | 6.3a | RESULTS OF VARIABLE ASSIGNMENT EXPERIMENT ON VMl. | 133 | | 6.3b | RESULTS OF VARIABLE ASSIGNMENT EXPERIMENT ON VM2. | 134 | | 6.3c | RESULTS OF VARIABLE ASSIGNMENT EXPERIMENT ON VM3. | 135 | #### LIST OF TABLES | 2.1 | VMPL STANDARD FLAGS | 22 | |------|----------------------------------------------------------------------------------------|-----| | 2.2 | ELSHOFF'S ANALYSIS OF 120 COMMERCIAL PL/I PROGRAMS. | 24 | | 2.3 | STACK OPERATIONS AND THEIR VMPL DESCRIPTOR REPRESENTATIONS. | 36 | | 2.4 | VMPL DIADIC OPERATORS. | 48 | | 2.5 | VMPL SHIFT OPERATORS. | 48 | | 2.6 | VMPL ROTATE OPERATORS. | 50 | | 5.1 | INSTRUCTION STREAM COMPARISON FOR THE ASSIGNMENT STATEMENT. | 96 | | 5.2 | BASIC QUANTITIES TO BE MEASURED FOR DEVELOPING SOFTWARE SCIENCE CRITERIA FOR PROGRAMS. | 103 | | 5.3 | BASIC QUANTITIES FOR THE PROGRAMS OF Fig. 5.1. | 104 | | 5.4a | CALCULATED MEASURES FOR PROGRAMS OF Fig. 5.1. | 105 | | 5.4b | CALCULATED MEASURES FOR VMPL PROGRAM AND ITS EQUIVALENT OF APPENDIX C(a). | 105 | | 5.5 | DECLARATIONTAG VALUES FOR IISG. | 109 | | 5.6 | IML COLUMN DESIGNATIONS. | 114 | | 5.7 | OPERAND MODIFIERS. | 115 | | 5.8 | CLASS 1 IESG OPERATIONS. | 117 | | 5.9 | CLASS 2 IESG OPERATIONS. | 117 | | 5.10 | CLASS 3 IESG OPERATIONS. | 119 | | 5.11 | CLASS 4 IESG OPERATIONS. | 119 | | 5.12 | CLASS 5 IESG OPERATIONS. | 119 | | 6.1 | RESULTS OF COUNTING EXPERIMENT CE(1). | 128 | | 6.2 | RESULTS OF COUNTING EXPERIMENT CE(3). | 131 | #### 1. INTRODUCTION #### 1.1 EVOLUTION OF MICROPROGRAMMING Microprogramming was introduced by Wilkes (1) in 1951. His paper (1) introduces the fundamental ideas which lie behind the concept of microprogramming and can be summarized in his words as: "....consider the control proper, that is, the part of the machine which supplies the pulses for operating the gates associated with the arithmetical and control registers. The designer of this part of a machine usually proceeds in an ad hoc manner, drawing block diagrams until he sees an arrangement which satisfies his requirements and appears to be reasonably economical. I would like to suggest a way in which the control can be made systematic, and therefore less complex. Each operation called for by an order code of the machine involves a sequence of steps which may include transfers from the store to control or arithmetical registers, or vice versa, and transfers from one register to another. Each of these steps is achieved by pulsing certain of the wires associated with the control and arithmetical registers and I will refer to it as a 'micro-operation'. Each true machine operation is thus made up of a sequence or 'micro-program' or micro-operations." In a microprogrammed control unit a memory is used to hold a program comprised of microinstructions. The execution of this program performs the function of the control unit--fetching and interpreting machine language instructions and then activating the appropriate control lines to execute the instructions. Each instruction in this program is termed a microinstruction since the level of control exercised by each is at the gate level. Although microprogramming received some attention during the 1950's (2), (3), it wasn't until the early 1960's that technological development made it economically feasible to be used in computer design. IBM in the mid 1960's employed microprogrammed processors in most of the models of the system 360. The reason for microprogrammed processors given by IBM (4) was: "....it has been used to help design a fixed instruction set capable of reaching across a compatible line of machines in a wide range of performances in the cheapest way possible." Some general reasons for building microprogrammed processors are: #### 1.1.1 ECONOMIC ADVANTAGE There is a clear economic superiority of microprogrammed control over conventional logic. One of the most widely used graphs (3) to illustrate this fact is reproduced in Fig. 1.1. Here by complexity we mean the richness of the instruction repertoire. From this curve we see that after a heavy initial investment incurred for microprogram control, the cost of implementation by microprogram control increase slowly as compared to conventional logic. This curve and the obvious cost advantage which it indicates is the leading force behind the implementation of microprogrammed control units for commercially produced computers. #### 1.1.2 ARCHITECTURAL COMPATIBILITY Looking at economic advantages from a software point of view, we see that by providing a basic set of machine instructions in a whole line of new computers we can run the same software with little modifications on all the 'different' computers. As a result we get architectural compatibility achieved through the use of microprogrammed control units. The IBM 360 series, as mentioned earlier, is a good example of this concept. Furthermore, because of the changing trend of software being costly or 'hard' and hardware being cheap or 'soft', we would rather duplicate computers with different performance levels, etc. but having the same machine instructions—this being achieved via microprogramming, so that we can execute the same software on all of them. #### 1.1.3 DELAYED BINDING Architectural extensions and modifications are possible when we use microprogrammed control units. This results in what is known as the 'delayed binding' of the computer to its hardware. The hardware Fig. 1.1 COST OF MICROPROGRAMMED CONTROL VS CONVENTIONAL CONTROL design primitives are known and are used to build the machine. The processing primitives may be defined at any later time and even changed or modified. This aspect of microprogramming also introduces the concept of 'amorphous' machines. Continued technological improvements have facilitated the development of user microprogrammable computers. In these computers the control memory is of the read/write type. As such, new microprograms can be readily introduced into this memory by the user and thus user-microprogramming is possible. A large number of user-microprogrammable computers are available in the market today, with a variety of architectures (5). #### 1.2 MOTIVATION The rapidly increasing acceptance of user-microprogramming because of the availability of a large number of user-microprogrammable computers, has focused increasing attention on the problems associated with writing microprograms. Traditionally, a user was provided a high level language translating system for programming. Some reasons which have been offered (6) for using high level languages are: - They do not require the user to be aware of such specific machine features as registers, internal representation of data, etc. - 2) They offer the possibility of transferring programs from one machine to another. This means that they provide a degree of independence from a particular machine or system. - 3) They allow programs to be written more easily than is possible in machine language. Furthermore, the whole software development cycle involves less time and effort and hence is efficient and cheap. - 4) They allow programs to be written in problem-oriented terms. Examples of this include the ability to give symbolic names to data and the inclusion of mathematical operators and expressions in programming languages. From the reasons enumerated above we see that a high level language translating system should also be provided for microprogramming. A microprogrammer would thus be free from machine register allocations, primitive I/O referencing, concurrency recognition, etc. In short, the microprogrammer would be able to concentrate more on the programming tasks of implementing his algorithm rather than the intricate features of the particular computer being used. The earliest reference to a high level microprogramming language is made by Husson (3). The basic language is procedural in nature and is an adaption of the FORTRAN and PL/I languages with additional statements and declarations. The language is implemented via a multipass translator system. In the first pass the high level language statements are converted into a sequence of intermediate language statements. This intermediate language is a macro language. In the second pass the compiler 'expands' the macros to produce machine dependent microcode. Thus the compiler must have access to a library of macros which contain the microcode equivalent for each macro statement. Several other efforts have been carried out in the direction of high level microprogramming languages since Husson's proposal. These languages can be classified into three basic groups (7) which are: - 1) Hardware description. - 2) Tailored. - 3) Machine independent. We review this classification scheme for two reasons: - 1) This review will assist the reader in getting a better perspective of the various microprogramming languages which have been designed in the past. Thus he will see what can be done and what has been done to solve the problem of user microprogramming support and he will become aware of the richness of the attempts in several competing directions. - 2) Furthermore, this review will indicate that no current microprogramming language is truly high level and machine independent. This will in part justify this research. #### 1.2.1 HARDWARE DESCRIPTION LANGUAGES Hardware description languages are basically used in the design of computers but have been proposed as high level microprogramming languages (8). APL, CDL and ISP belong to this class of languages (8). Specific resource assignments are done by the microprogrammer who is also responsible for taking care of all machine details. The microprogrammer must have a complete knowledge of the host computer in order to program in these languages. Therefore, these languages are not high level. #### 1.2.2 TAILORED LANGUAGES Most of the high level microprogramming languages belong to the tailored language group. The language lets the microprogrammer interact with the machine at the register transfer level. The syntax of these languages resembles the syntax of traditional high level languages like ALGOL or PL/I. Control constructs and a limited use of symbolic variables are allowed. This level of abstraction shields the microprogrammer from the trivial details of the computer's environment, but constraints him to the functions which are available on the computer since they are the only ones available in the language. SIMPL is a tailored language proposed by Ramamoorthy (9). It is an ALGOL-like language with the conventional ALGOL reserved words. All variables are predefined and explicitly name all host machine resources. Lloyd (10) describes a high level microprogramming language which was designed for the microprogrammed control unit (MCU) of the AN/UKY-17 Signal Processing Element. The syntax of the language resembles LSD (11) with several added built-in functions which correspond directly to testable conditions available on the MCU. Tailored languages are designed to fit a particular machine. As such they are successful in producing microprograms for a spacific machine. This means a different high level language for each different machine. Besides being impractical, this approach also goes against the basic objectives underlying machine independent high level languages. #### 1.2.3 MACHINE INDEPENDENT LANGUAGES Theoretically, any existing high level language like ALGOL, PL/I, etc. could be included in this group. However, these languages are often complex and require a large runtime support system which is difficult to maintain on most microprogrammed computers. This difficulty primarily arises from the high cost of control memory, resulting in the restriction of the amount of microcode which can be stored in it. A high level microprogramming language should be simple and not incorporate complex data structures or operations found in languages for general purpose use. MPL, a microprogramming language designed by Eckhouse (12), was the first true attempt to define a machine independent microprogramming language. MPL is a procedure oriented language and is essentially a dialect of PL/I. There are basically six types of data items: - 1) Machine registers and their parts--both real and virtual. - 2) Memory--main and control. - 3) Local and auxiliary storage. - 4) Events which correspond to testable machine conditions. - 5) Constants. - 6) Variables which take on constant values. There are three types of statements in MPL: - a) <u>DECLARATIVE STATEMENTS</u>: These statements are used to declare various data items along with their various attributes. - allowing at most two operands on the right hand side of the assignment operator. A provision has been made in the syntax of the language to allow concatenated registers as single operands. - c) <u>CONTROL STATEMENTS</u>: An IF and DO statement is provided. The IF statement permits branching on the value of some testable event which is previously declared. An example of part of a program written in MPL and taken from (13) is given in Fig. 1.2. The following comments can be made about MPL. 1) Although Eckhouse claims that his language is machine #### INTERDATA3: PROCEDURE OPTIONS (MAIN): DECLARE (RO,R1,R2,R3,R4,R5,R6,Ar,DFR,MDR) BIT (8) MS (0:32767) BIT (16) MAR BIT (16) MAH BIT (8) DEFINED MAR POSITION (1) MAL BIT (8) DEFINED MAR POSITION (9) "LOCCNT" BIT (16) (CARRY, SINGL, CATN, TRUE, FALSE) EVENT: #### FETCH: PROCEDURE /\*INSTRUCTION FETCH, LOC CNTR UPDATE & OP CODE DECODE\*/ /\*INSTRUCTION ADDRESS\*/ MAR = RO//R1; MDR = MS(MAR); RO//Rl = RO//Rl+2 /\*INCREMENT LOCATION COUNTER\*/ R4//R3 = MDR /\*GET OP CODE\*/ R5 = R3.RSH.3; /\*RIGHT JUSTIFY R1/X1\*/ AR = (R3.LSH.1)/1; /\*LEFT SHIFT REGISTERS R2/X2\*/ /\*OF THE EMULATED 360 MACHINE\*/ R2,DFR = R4.RSH.4 /\*INTO AR WITH LSB SET\*/ IF CARRY THEN GO TO RXFORM: RRFORM: R6 = AR&1; /\*REG-REG FORMAT\*/ R4 = 0; DECODE: IF SNGL/CATN THEN GO TO SUPORT: SUPRET: R3 = R4&OFX: /\*MASK OP CODE\*/ AR = R3+(R3.LSH.1); /\*MULTIPLY BY 3\*/ DFR = R2: IF TRUE THEN GO TO ILLEG: ELSE IF FALSE/CARRY THEN GO TO TROUBL: END/FETCH: END INTERDATA3: Fig. 1.2 AN EXAMPLE PROGRAM WRITTEN IN MPL. (INCOMPLETE PROGRAM FRAGMENT) independent, it is so in only a very limited sense. Since real machine registers show up as predefined variables, programs written in MPL are not transportable from machine to machine. - 2) Semantic interpretation of some of the basic operations is drawn from the particular implementation. For example, shifts may or may not be circular. In other words, the actual interpretation is implementation dependent. This makes the language machine dependent. - 3) MPL programs are compiled down into a simple machine language (SML). This language resembles the code of a single address computer. Arguments needed for operations must first be loaded into argument or Aregisters. Results of operations are left in result or R-registers, and may be stored in other registers or memory locations. Temporary or T-registers are available for intermediate results. This process generates redundant loads and stores which produce more microcode than is necessary. A recent addition to the family of machine independent languages is EMPL (14). This language is based on the core approach to language design (15) which allows the language to include only very basic features. Thus the user is permitted to extend the set of data structures and operations included in the core to customize the language. Syntactically, EMPL is very much like MPL with the added statements which help define user data types. These statements are known as EXTENSION statements. An extension statement includes the new data structure, the operations on that data structure and the microperations which correspond to these operations. An example of a data type named STACK taken from (14) is shown in Fig. 1.3. Analyzing both these machine independent languages, we see that: 1) MPL specifically is machine dependent since real machine registers show up as predefined variables in the language. EMPL on the other hand becomes machine dependent because #### TYPE STACK DECLARE STK (16) FIXED; DECLARE STKPTR FIXED; DECLARE VALUE FIXED; INITIALLY DO; STKPTR=O; END; PUSH: OPERATION ACCEPTS (VALUE) MICROOP: PUSH 3 O; IF STKPTR=16 THEN ERROR; /\*OVERFLOW\*/ ELSE DO; /\* PUSH VALUE \*/ STKPTR=STKPTR+1; STK (STKPTR) = VALUE; END; END, POP: OPERATION RETURNS (VALUE) MICROOP: POP 3 O; IF STKPTR=O THEN ERROR; /\*underflow\*/ ELSE DO; /\* POP VALUE \*/ VALUE=STK (STKPTR); STKPTR=STKPTR-1; END; END, ENDTYPE: Fig. 1.3 AN EXAMPLE DATA TYPE WRITTEN IN EMPL. - of the customizing feature which is primarily just a variation of the real resource associations found in MPL. - 2) Furthermore, the languages are machine dependent due to the early binding to their host machine architecture. - 3) The languages are designed without any consideration of the fact that code will eventually be produced for microprogrammable computers, i.e., no primitives were identified for writing code on microprogrammable computers. We believe that certain features of a special-purpose high level language for microprogramming must reflect the fact that efficient microcode is a design and implementation goal. Because of these reasons the compilers for these languages fail to produce efficient code. In general the earlier tailored and machine independent languages can be viewed as just another set of languages. Some of them are tailored to certain machines and produce efficient code, while others are partially portable but incapable of being translated into efficient microcode for a variety of target machines. Furthermore, in both these cases we see that a general microprogramming support system has not been provided. A microprogramming support system consists of utilities that support the construction, debugging and testing of microprograms. When a microprogram has been produced from a program written in a high level language, then it should be tested for correct-This can be achieved by providing a simulator for the microprogrammable machine. This would require a simulator for each microprogrammable machine for which microcode is to be produced. An alternate way is to compile the high level language program into an intermediate code and provide a simulator for this code. An even better situation would be to provide an interpretive system for the high level language so that the user could debug his programs in an interactive mode. #### 1.3 OBJECTIVES The problem we have identified in the previous sections are: PROBLEM 1: In order to produce low cost microprograms, a machine independent high level microprogramming language is needed, yet according to the analysis above, no such language exists. PROBLEM 2: The languages which have been designed have not solved PROBLEM 1 effectively. This is because of two reasons: - 1) The language designers have not identified the primitives for writing code on microprogrammable computers. - 2) No design principles for designing high level microprogramming languages have been fabricated or adhered to. As such earlier language design is not justified in any manner. PROBLEM 3: A general tool has not been provided. Specifically, the problem of portability has not been addressed or solved effectively. PROBLEM 4: No testing mechanism has been provided; e.g. a topdown design cycle for firmware engineering which includes testing of microprograms etc. before implementation. In Chapter 2 we present the formal definition of a new high level language for microprogramming. The design of this special purpose language includes constructs specifically suited for emulation, and simultaneously, the constructs that foster production of 'correct' emulators. Chapter 3 considers the characteristics of current m-computers. The architectural features of m-computers are studied in detail and a formal classification scheme for m-computers is presented. Chapter 4 discusses the various problems associated with producing portable software. Since our goal is to produce microcode for a variety of m-computers, we discuss the portability model which we have adopted to attain this goal. In Chapter 5 we present a variety of intermediate language formats and their attributes. A number of experiments are done to find out the best possible format. We select the quadruple format and formally present all the intermediate language constructs. In Chapter 6 we discuss the language translator and the intermediate language simulator. Based on well known software engineering techniques we present the results of a number of experiments which were done on the high level language and the intermediate language. A summary of the overall significance and contribution of this research is presented, along with suggestions for further study. Finally, there are several appendices which give in-depth presentations of the material discussed in Chapters 2 through 6. A list of references and a glossary of terms follows the appendices. #### 2. A LANGUAGE FOR MICROPROGRAMMING #### 2.1 INTRODUCTION Before we study the various m-computer (we abbreviate: m-computers for microprogrammable computers) hardware features that have such an influence on microprogramming, we have to ask ourselves: ### What is the language going #### to be used for i.e., what kinds of programs are we going to write in the language. By knowing the kind of problems we are going to program in the language, we can put constructs in the language which will make this task, i.e., of programming these problems in the language, simpler. If the problems are numerical in nature, then maybe the language should be similar to FORTRAN (40). If it is business programming we have to do, then a COBOL (41) like syntax is more appropriate. Then again, maybe we want to do all these things, i.e., make a universal language more on the lines of UNCOL (42) or PL/I (43). But is all of this really practical? The answer is no, and there are two reasons for this. - 1) All the languages pointed to above require a large run time support system. This requires a lot of memory. Control store in m-computers is very limited (2-8 K words) and as such not enough to support a large runtime support system. - 2) Scientific, business programming, etc., are not the most common applications of microprogramming. Historically, microprogramming has been viewed as an emulation tool or a means of extending hardware instruction sets (44). According to (5) the most common application of microprogramming is <u>emulation</u>. This leads us to conclude that the most frequent program written in our language for microprogrammable computers will be an emulator, i.e. a program which realizes the functional characteristices of a computer. The usual term for such a system is a <u>virtual machine</u>. The computer, i.e., the m-computer on which the virtual machine is realized, is known as the <a href="host">host</a> and the virtual machine itself is known as the <a href="target">target</a> machine. We have thus constrained the problem to writing emulators. Problems 1 and 2 (chapter 1) can now be restated in terms of the following six objectives and constraints referred to as the basis on which the language is to be designed. #### 2.2 BASIS OF LANGUAGE PREMISE 1: The language should be 'high level' and facilitate the writing of programs which realize virtual machines. PREMISE 2: The language should have simple, reliable constructs. PREMISE 3: The language should be compilable into compact microcode for a variety of microprogrammable computers. PREMISE 4: The microprogrammable computers taken into consideration are those with horizontal microinstructions. PREMISE 5: The class of target machines emulated are register oriented, word addressable machines. The first premise is a general statement of the problem as outlined in PROBLEM 1 Chapter 1. Premise 2 is an objective of any good high level language design experiment. The reliability data available about syntactical entities of computer languages should be used in designing the language. Simplicity, of course, is a key to good design. Premise 3 is motivated by the need for fast execution of target machine emulators. Efficient microcode is defined as horizontal (parallel) microcode that takes maximal advantage of host machine concurrency. Efficiency is contradictory to generality, however, and the goal poses a problem for portability (PROBLEM 3 Chapter 1). Premise 4 is used to make the problem more general. The problem of producing code for m-computers with vertical microinstructions is a degenerate case of the more general problem of producing code for m-computers with horizontal microinstructions. Premise 5 is included to solve the following problem: Although we have narrowed the scope of problem 1, by constraining the universality of the language, we are still covering a very wide range of targets, i.e., the sub problem is still nearly universal. Should the language have the capability of writing emulators of targets as distinct and far apart as the IBM 360/50 and the INTEL 8080 (45)? No. IBM 360/50 is a very complex machine and a very difficult one to emulate. Furthermore, things like the I/O channels and interrupt structure of IBM 360 like machines (28) will be very difficult to emulate. INTEL 8080 on the other hand is a simple machine to emulate. This leads us to modify our constraint of virtual machine realization to one of word addressable, register oriented second generation machines as the targets which we will use in writing emulators. The qualifiers 'Word addressable, register oriented' have been included only to define what we mean by second generation machines. In the rest of this chapter we refer to the language and programs written in the language interchangeably. The language is called <u>VMPL</u> which is an acronym for <u>Virtual MicroProgramming Language</u>. A reference to machine instructions of a target machine refer to the instruction set processor (46) representation of the target machine. The premises are reference by numbers Pl (Premise 1), P2 (Premise 2), etc. #### 2.3 LANGUAGE DESIGN We are now at the stage where the problem has been completely defined and are ready to proceed with the design of the language. In the previous section we have specified a set of premises which form the basis of the language. Next, we specify eight <u>implications</u> or design features of the language. Each feature is justified by reference to the premises and what is known about programming language in general. In following this approach we 'derive' the language. IMPLICATION 1: VMPL must have facilities for declaring variables, performing basic operations on the declared variables, and executing a basic set of control structures. JUSTIFICATION 1: Since VMPL is going to be used for writing virtual machines (P1), it should be able to describe and represent storage and transformation/control entities which comprise a virtual machine. What this means is that: - VMPL must be able to describe all the storage resources of the target machine like registers, memory, etc. - 2) VMPL should be able to describe (perform) the various functional processes of a computer (target), e.g., instruction fetch, instruction decode, operations on storage resources, etc. This can be achieved in VMPL in the following way: - Declare variable types in the language which can be associated with the various storage resources of the target. - 2) Provide a set of operators which can operate on the variables and a set of control structures which can be used to control the flow of the program. IMPLICATION 2: The variables declared in VMPL should have various attributes associated with them. JUSTIFICATION 2: Although some programming languages allow the programmer to associate sizes with the declared variables, these sizes are a multiple of a fixed size, e.g., FORTRAN's (40) single and double precision words. Furthermore, this size is implicit and dependent upon the computer on which FORTRAN is being executed, i.e., the size of a double precision FORTRAN variable on the CDC 6600 (47) is 120 bits whereas on a PDP 11/40 (48) it is only 32 bits. In VMPL since the variables declared are used to represent registers, etc., they should have an explicit SIZE attribute associated with them. This attribute indicates the width of a register or in other words the precision of a variable explicity. This is required because of Pl and P2 which require high level machine independence. Since the language is used to write emulators for different targets (P5), each with (possibly) different word widths, we cannot have an implicit (built in) precision associated with all variables of a given program. Therefore, each individual declaration of a variable should carry the information about its size. But in case it doesn't, for programmer convenience, a default size for a variable is taken from a globally defined parameter. To declare storage entities like memory, stack, etc., a <u>DIMENSION</u> attribute has to be associated with some variables. The dimension attribute then gives the number of elements contained in the array. Target machine registers often serve both general and special purposes. As an example, a stack pointer register is incremented or decremented automatically before or after a push or a pull operation. Thus, a variable which represents the stack pointer register should have this information associated with it as a DESCRIPTOR attribute. Finally, we describe the <u>CLASS</u> attribute through the following illustration. Suppose a particular host machine and target machine both have a stack. Further, suppose the target machine stack can be mapped down exactly into the host's stack. Pl and P2 forbid us to make this association, directly. In fact, as far as P2 is concerned, we are not even be aware that the host has a stack. How can the 'virtual' stack be mapped into the real stack? The only sensible or efficient way to do this is to retain the identity of the storage resource which a particular variable is representing. This is achieved by the <u>CLASS</u> attribute of a variable. Thus, the compiler retains the identity of the target resources and may (if possible) associate a certain variable with a certain resource on the host. The target memory is an obvious and trivial example of this attribute. Target memory should be (has to be) mapped into the host memory for a successful emulation of the target. IMPLICATION 3: The language should have a provision to declare a standard set of condition code flags. Furthermore, there should be a mechanism to indicate directly which flags are to be affected in a particular operation. For example, an action like an assignment statement in the language must be able to affect the condition codes. JUSTIFICATION 3: Condition code flags are an important part of a computer. The setting/clearing of these flags is a function of the various machine instructions of a computer. A number of approaches can be adopted to represent the target flags in the emulator program. The flags can be declared as variables. Each time a flag is used, i.e., set or cleared, emulation code has to be written corresponding to the usage of the flag. Consider the example of a carry flag used during the emulation of an addition instruction Fig. 2.1. The addition operation is followed by code which emulates carry generation. A result of zero or one is placed in the carry flag. Adapting this approach for a flag like overflow becomes even more complex Fig. 2.2. This approach is thus complex and costly, producing a lot of microcode contradictory to P3. An alternate procedure is to implement flag processing code as a sub-procedure which is called when needed Fig. 2.la. This approach has the cost of subroutine linkage and parameter passing associated with it. Host machines usually lack subroutining facilities and as such trying to create this environment is costly and impractical. Parameter passing is specially costly because of the allocation/deallocation scheme used for assigning VMPL variables to host machine registers discussed later on. Both approaches neglect the fact that the host may provide a set of target flags that can be enabled directly. All that is required then, is to declare the flags using some standard (reserved) names. Then when an operation happens, a way should be provided to indicate which flag is to be affected by the operation. The exact way in which the desired results (on the flags) are obtained, is left to the compiler until the point when the compiler knows the machine for which microcode is being produced. Four standard flags have been provided for in VMPL Table 2.1. Lunde (49) suggests that a limited number of conditions are sufficient for an instruction set processor. Therefore, the four standard flags used in VMPL appear to be adequate. ADDITION OF : SRC1, SRC2 RESULT TO BE STORED : DEST. CARRY FLAG : CARRY. CODE PRODUCED BEGIN DEST = SRC1 + SRC2 ; IF DEST < SRC1\* THEN CARRY = 1 ELSE CARRY = 0; END \*NOTE: IN CASE DEST IS DISTINCT FROM BOTH SRC1 AND SRC2 THEN DEST CAN BE COMPARED AGAINST EITHER ONE, OTHERWISE DEST HAS TO BE COMPARED AGAINST THE 'SOURCE' WHICH IS DISTINCT FROM IT. Fig. 2.1 CARRY FLAG SET-UP SCHEME ``` PROCEDURE CARRY (SRC, DEST); BEGIN IF DEST < SRC1 THEN CARRY = 1 ELSE CARRY = 0; END: PROCEDURE MAIN; BEGIN DEST = SRC1 + SRC2; CARRY (SRC1, DEST); /*THIS INVOKES THE CARRY PROCEDURE*/ END ``` Fig. 2.la CARRY FLAG SET-UP AS A SUBROUTINE ADDITION OF : SRC1, SRC2 RESULT TO BE STORED : DEST. OVERFLOW FLAG : OVFLAG FUNCTION AVAILABLE : MSB-GETS THE MOST SIGNIFICANT BIT OF ITS PARAMETER, i.e., THE SIGN BIT CODE PRODUCED : BEGIN SIGN1 = MSB(SRC1), SIGN2 = MST(SRC2); DEST = SRC1 + SRC2; IF ((SIGN1=SIGN2) ^ (SIGN1=MSB(DEST))) THEN OVFLAG = 1 ELSE OVFLAG = 0; END Fig. 2.2 OVERFLOW FLAG SET-UP SCHEME TABLE 2.1 VMPL STANDARD FLAGS | FLAG | VMPL NAME | |------|-----------------| | С | CARRY | | 0 | OVERFLOW | | N | NEGATIVE (SIGN) | | Z | ZERO | IMPLICATION 4: The language should support very simple assignment statements using infix notation. It should allow only one variable on the left hand side and at most two operands and a diadic operator on the right hand side. However, the operands on the right hand side can have one of a number of monadic operators associated with each of them. JUSTIFICATION 4: A study carried out by Elshoff (50) on 120 commercial PL/I programs found that assignment statements account for 41.2% of all program statements Table 2.2. In this same study it is revealed that 98% of all expressions (right hand side of assignment statements) have zero or one operator (excluding the assignment operator). Another study of 440 Fortran programs (51) produced similar results: 60% of all assignment statements contained no operator other than the assignment (=) operator. Based on this data, VMPL is designed to support very simple assignment statements. VMPL allows only one variable on the left hand side, and at most two operands and one diadic operator on the right hand side of the assignment operator. There can be two objections to using the above mentioned data as the basis of our justification. The first one is that the data was gathered from the pre-structured program era and as such the sample programs are 'bad'. Secondly, some of the data comes from business programs rather than emulators. The first objection is overruled simply because no other data exists. Until data is collected for emulators, we are obliged to use what is known. The second objection is overruled on the basis that data collected for business and scientific applications seems to agree, even for diverse applications and languages such as PL/I and FORTRAN. This indicates that there is something consistent about the use of assignment statements in at least two samples. We assume that our results will be the same. The next question deals with the number and type of operators that should go into an emulator-writing language. An instruction set processor has a large number of operators. Allowing a large number of operators TABLE 2.2 ELSHOFF'S ANALYSIS OF 120 COMMERCIAL PL/I PROGRAMS | STATEMENT_ | PERCENTAGE | |--------------|------------| | ASSIGNMENT | 41.2 | | IF | 17.8 | | GOTO | 11.7 | | END | 7.5 | | DO | 7.2 | | DECLARE | 6.3 | | WRITE | 2.6 | | CALL | 2.0 | | READ | 0.5 | | PUT | 0.4 | | NULL | 0.5 | | PROCEDURE | 0.2 | | PREPROCESSOR | 0.1 | | OTHER | 2.0 | in the language will increase its complexity and affect reliability (P2). An alternate is to include a primitive set of operators only. There are two different considerations for selecting these operators. One is to use the data available on frequency of usage of various operators in programs and the effect they have on the reliability of the language. The other consideration is to determine which operators are best for the language based on their similarity with the instruction set processor of target machines to be emulated. The 'rotate' and 'shift' instructions, for example, are common target machine instructions. Although relational operators are present in VMPL they are not used in assignment statement expressions but instead limited to boolean expressions. Gannon (52) shows that this makes expressions more reliable and reduces errors in assignment statements by 20%. Similarly, limiting the assignment operator to assignment statements and not allowing it in expressions reduces errors by 20% (52). It is also found (52) that the 'traditional' infix precedence expression evaluation technique is 5% more reliable than 'APL' (52) like expression evaluation technique and much more readable. The model of an assignment statement based on Elshoff's and Gannon's data-single variable on the left and two operands and a diadic operator on the right of the assignment operator—is an extremely simple and reliable one and as indicated in the implication above is the basic one adopted by VMPL assignment statements. However, we augment this model based on the characteristics of host ALU's. Since masking and shifting capabilities are available on host computers, VMPL expressions are modified to reflect these resources. We handle these 'side-effect' operators by a number of 'monadic' operators. These operators are 'shift', 'rotate', 'mask', 'field extraction', 'indirection' and 'register concatenate'. Each of the two operands on the right hand side can be modified or operated upon by one of these six operators. 'Shift' and 'rotate' make use of the shifter associated with the ALU. 'Mask' uses the masker and 'field extraction' uses both the shifter and the masker. 'Indirection' and 'register concatenate' makes use of the iterative capabilities of the host machine. 'Indirection' is the fetching of data from (host) memory via an indirect address and 'register-concatenate' operates on two more registers or data words as if they were a cascaded pair, i.e., a single long data word. IMPLICATION 5: The control constructs of the language should be simple in nature and tailored to the needs of emulator writing. JUSTIFICATION 5: The control constructs of the language should be of the D' type according to Kosaraju's hierarchy of control structures (53). The reason for choosing these structures is that they are found to be reliable, structured, simple (P2) and as pointed out by Ledgard (54) - 'the need for higher level (above D') control structures remains unproven'. The D' structures include the sequence, conditional, loop and case statements Fig. 2.3. However, in VMPL these structures have been modified for emulator writing. For specifying loop operations two constructs are provided. One allows the execution of the loop, a fixed number of times and can take advantage of the iterative mechanism available on the host. The other is a conditional looping mechanism. Before any computer (target) executes any instruction, it has to decode it. This process is done by making a decision based on the opcode of the instruction (macro) and then executing it by enabling the appropriate circuit. The equivalent of this in VMPL is a special form of the case control structure. A jump to a multiple statement case statement is made based on the value of a variable. The statement to which the jump is made is another jump statement which mades a jump to a procedure (part of the program) which executes the instruction. This form of the case statement fits nicely into the N-way branch instruction available on hosts. At times an equivalent operation is done wherein a number of conditions are checked one after the other and if any of them are true, to execute the code associated with them and then check the next condition. The case statement won't do this as it executes code corresponding to only one branch, i.e., one true condition. This is provided by an alternate version of the case statement. Fig. 2.3 DEFINITION OF D'STRUCTURE g, CASE Fig. 2.3 DEFINITION OF D'STRUCTURES In VMPL boolean expressions only the three relational operators which are found to be used 85% of the time by Elshoff (50) are used. Because of target machines capabilities of doing bit testing, which can be achieved on hosts by masking and shifting, we have explicitly provided for a way of directly indicating a certain bit of a variable as the one under test in a boolean expression (explicit feature). This can be considered as programmer convenience since the same effect can be obtained by using the 'shift' and 'mask' monadic operators discussed earlier (implicit feature). However, since some hosts do provide direct bit test capability, it is better (efficient) to keep this feature in the language explicitly and let the compiler decide on what to do. IMPLICATION 6: The language should have the provision of declaring external procedures, variables, flags, etc. These are 'implemented' in runtime support microcode and linked to the rest of the emulator program. JUSTIFICATION 6: There are two aspects of emulation which have not been discussed until now. These are Input/Output and interrupts. It is extremely difficult to match host I/O and interrupt structure to target machine I/O and interrupt structure without knowing the target machine in advance. The true solution to this problem lies in the design of host machines. Host machines should provide a 'soft' (38) I/O and interrupt structure that can be mapped into primitive functions (in VMPL) which can in turn be used to describe the I/O and interrupt structures of various targets. Without soft I/O and interrupt structures, m-computers do not share a common set of properties that can be characterized for a class of machines. This aspect of emulation is further complicated by the wide variation of I/O and interrupt structures in various target machines. Although this wide variation is somewhat narrowed down because of P5, by limiting ourselves to targets which do not have I/O channels, I/O processors, direct memory access and other sophisticated I/O and interrupt structures (P5), we still have not found enough common primitives in hosts and target machines to have some kind of primitive language constructs which can be used to specify target I/O and interrupts. This problem is temporarily 'solved' in VMPL by the use of externals. Each external name has associated with it the descriptor attribute which indicates whether the name is a procedure, a flag or a simple variable. Any peculiarity of the target which cannot be described in VMPL can thus be described in this way. IMPLICATION 7: The overall program is broken into arbitrary sized blocks called procedures or subprocedures. All variables used within the block are declared at the start of the block. A priority status is associated with the variables used in a block by the programmer. The code is executed sequentially from one block to the next. There are two cases where control is nonsequential: 1) when a subprocedure is called, or 2) a 'case' statement is executed which jumps to another block. JUSTIFICATION 7: Register allocation/deallocation is a major problem in producing compact microcode (P3) for m-computers. Briefly, the problem is as follows. Suppose either the number, the size, or both of the target registers defined as variables in the VMPL program are greater than the local storage registers available on the host. Because of this disparity, the compiler is unable to permanently store these variables in local storage. The variables will have a copy somewhere in the main memory as the need arises. Now, when the compiler is binding (allocating) a variable to a register in the local storage, it may have to produce microcode to store the register (deallocate) in main memory. This may be necessary because the register may be associated with another variable and its copy in main memory is not fresh (active variable). Suppose at this point the compiler has the option of allocating/ deallocating more than one register. Which one should the compiler choose? Similarly, the compiler has a very limited amount of look ahead and doesn't know what requirements for variables will be in the next statements. As such it might generate a large number of load-store instructions and hence a lot of microcode which goes against P3. This situation is depicted in Fig. 2.4 and illustrated by the following example. A four register (RO, Rl, R2, R3) local storage host is assumed to have variables Al, A2, A3, A4, respectively, at some point within the compilation process. The compiler needs to put variable A0 in one of the four registers. However, if it stores the variable in any of the registers, it has to store the variable already present there back in main memory. Which register should the compiler choose? For example, if the compiler knew that variable A2 is not going to be used in the next statements, then it will store it back in main memory and allocate register R2 to variable A0. How is the compiler supposed to know this? Sophisticated schemes have been developed to minimize register-variable swapping (14). One thing overlooked in these schemes is the fact that the programmer knows what variables are used in a certain 'block' of code. Why not take advantage of programmer 'look-ahead' with a compiler that uses this information when allocating/deallocating registers? This is achieved by making the program modular. All variables used throughout the program are declared at the start of the program. These are the global variables. At the start of each block the programmer specifies which of the globals are going to be used in a block of code and declares them as such. He also declares variables which are local to the particular block and will only be needed as long as the block is executing. The compiler uses this declarative information when allocating/deallocating variables. Since the variables which have not been declared will not be used in a block, they need not be kept in local storage. An implicit priority associated with the local-global classification of variables. Globals are given a higher priority than locals. By this, we mean that if the compiler has to allocate both a global and a local variable, it will first allocate the global variable. Similarly, if it has the similar options of deallocating variables, it will deallocate the local variable first. Further levels of priorities can be associated as tags with various variables. This provides the compiler with a hierarchy of look ahead. The compiler thus has finer control over which Fig. 2.4 VARIABLE ALLOCATION PROBLEM variables to allocate/deallocate based on the information provided by the programmer. Furthermore, the reliability of emulators is improved by introducing the redundant declarative information. The compiler can use the redundant declarative information to check for program consistency by seeing that only those variables which have been declared at the head of a block are used within the block. Since the VMPL program is broken up into arbitrary sized blocks, program execution proceeds from one block to the next in a sequential manner. However, some blocks can be declared as subprocedures. These are the same as FORTRAN's subroutine (40). The sequential execution of a VMPL program is thus broken when a call to a subprocedure is made. Subprocedures return control to the place of their invocation. Another situation of non-sequential execution is the execution of a 'case' like statement. This will be discussed later on. IMPLICATION 8: A subprocedure should be declared before it is used. JUSTIFICATION 8: The reason for this is that the state of the machine (host), in terms of what is contained in its local storage at the entry and exit to a subprocedure, has to be known before a call is made to it from any other place in the program. Thus, the subprocedure is declared before it is used so that it gets compiled first. Then information about the state in which subprocedures expects and leaves the machine is known. The global variables that are expected by a subprocedure and those that are returned by it must be explicitly declared as such. The only variables that are passed as parameters are the local variables of the block making the call to the subprocedure. This is because the only variables not 'accessible' to the subprocedure are the locals of the calling block, since the globals are global to all the blocks. The idea presented here is not in any way associated with the problems which arise with one pass compilers (55). As we will see later on, ours is not a one pass compiler. The only point of importance is that the subprocedure has to be compiled before it is used. 'Declaring' it before it is used also helps the programmer in structuring his whole program. #### 2.4 DEFINITION OF VMPL In this section we specify the syntax of VMPL. The complete BNF representation of VMPL is given in Appendix A. The various syntactic entities of the language have been grouped together according to the implications presented in the previous section. Any syntactic feature of VMPL which is not covered by the implications of the previous section is justified as being in the language for programmer convenience. All implications discussed in the previous section are referred to as II, I2, I3, I4, I5, I6, I7 and I8. # 2.4.1 DECLARATIONS Declarations for the various data items provide the attributes of the items. Declarations can be grouped into the following four types: - 1) Global variable declarations. - 2) Global information declarations. - 3) Local variable declarations. - 4) Local information declarations. # 2.4.1.1 VARIABLE DECLARATIONS The syntactic description of VMPL variable declarations is: DCL TYPE PRIORITY CLASS NAME : DIMENSION : DESCRIPTOR : SIZE The TYPE of the variable is either LOCAL or GLOBAL. Variables which are used throughout the program are declared at the start of the program and are of the GLOBAL type. Variables used in any particular block (discussed later on) are of type LOCAL. Globals have a higher priority than locals (I7). In VMPL a further level of PRIORITY is associated with variables. The PRIORITY of a variable can be either PERMANENT or TEMPORARY. Permanents have a higher priority than temporaries. We will see in Chapter 6 how this hierarchy of priorities helps the compiler in producing efficient code. There are six classes of data items or variables (I2 & I6). This is given by the CLASS entity of the variable declaration. The six classes are: - 1) MEMORY - 2) STACK - 3) PSTACK - 4) FLAG - 5) SIMPLE - 6) EXTERNAL The MEMORY variable is used to define the target memory. It has associated with it the dimension attribute which is an integer defining the number of words (elements) which the memory has. In case the target has a stack, it can be described by the STACK variable. The stack pointer is described by the PSTACK class. In VMPL, we can only declare one stack and one stack pointer (target machines with one stack at most). stack declaration requires the stack pointer declaration. However, the stack pointer declaration does not necessarily require the stack declaration. This covers those targets which have stack pointers but no explicit stack. In this case, the variable declared as MEMORY will be the default stack. In these machines the stack resides in the main memory. The question is why we need the PSTACK class, i.e., why treat the stack pointer as a special register. It could be covered by the same declaration which covers the other registers of the target. The reason is that the most frequent operations done on a stack are the PUSH & POP operations via the stack pointer. The PUSH & POP operations along with the knowledge about whether the stack grows upwards (from higher address towards low addresses) or downwards (from lower address towards higher addresses) completely determine the actions of the stack pointer. These actions are grouped together and shown in Table 2.3. The notation given in the last two columns of Table 2.3 is used in the DESCRIPTOR part of the PSTACK variable. Thus, by declaring the PSTACK variable, the compiler knows what code to generate to emulate the PUSH & POP operations which are a part of VMPL syntax. It may also match the stack pointer of the host (if one is present) to that of the target since it has all relevant information. TABLE 2.3 STACK OPERATIONS AND THEIR VMPL DESCRIPTOR REPRESENTATION | FOR PUSHING DATA INTO STACK | FOR POPPING DATA FROM THE STACK | CORRESPONDING PUSHING | VMPL | NOTATION<br>POPPING | |-------------------------------------------------|---------------------------------------------|--------------------------------------|------|-------------------------------------| | (PUSH, INC) (INC, PUSH) (PUSH, DEC) (DEC, PUSH) | (DEC, POP) (POP, DEC) (INC, POP) (POP, INC) | (↓, +)<br>(+, ↓)<br>(↓, -)<br>(-, ↓) | | (-, ↑)<br>(↑, -)<br>(+, ↑)<br>(↑,+) | (a) (b) Single bit flags are described in VMPL via the FLAG class. One of four descriptors may be associated with a flag variable. These descriptors are C, O, N, Z. The C descriptor makes the flag equivalent to the CARRY flag. For example, flag variable 'LINK' has associated with it the descriptor C. 'LINK' can now be used in conjunction with an assignment statement. Suppose the operation in the assignment statements is the add operation. Any carry generated would be stored in the flag variable 'LINK'. All the programmer has to do is to specify whether he wants LINK to be effected or not by a certain operation. The compiler then generates the code (I3) which actually affects the flag. The O, N and Z descriptors are associated with the overflow, negative and zero flags, respectively. Target registers are described in VMPL via the SIMPLE class. The use of the reserved word SIMPLE is optional however. LOCAL variables can only be of the SIMPLE class or FLAG class. The external problem as outlined in (I6) is covered by the EXTERNAL attribute in VMPL. The external data element can be a flag, a subprocedure or a simple variable. This qualification is done via the descriptors F, P or S, respectively. The DIMENSION and SIZE entities of variable declaration are given by positive integers. The DIMENSION entity is enclosed in rectangular ([]) brackets. Examples of VMPL declarations which illustrate the various aspects discussed above are shown in Fig. 2.5. # 2.4.1.2 INFORMATION DECLARATIONS There are three global information declaration statements in VMPL and four local information declaration statements. ## 2.4.1.2.1 GLOBAL INFORMATION DECLARATIONS Since the size attribute is associated with data items of different types and since it usually is the same for most of them in one program (the width of target stack registers memory words, etc. is usually the same), it seems better and convenient to make a global declaration of | | DCL GLOBAL PERMANENT MEMORY | MEM: [4096]:12 ; | |-------------|-----------------------------|--------------------------| | | DCL GLOBAL PERMANENT STACK | STK+[32]:12 ; | | | DCL GLOBAL PERMANENT PSTACK | PSTK: ( <b>↓</b> ,+):5 ; | | | DCL GLOBAL TEMPORARY FLAG | LINK:C, OV:O ; | | EQUIVALENT | DCL GLOBAL PERMANENT SIMPLE | ACC: 8,T1:12 ; | | EQUI VALENI | DCL GLOBAL PERMANENT | B1:8, B2:12 ; | | | DCL EXTERNAL PERMANENT | REG1:S:4,IOHAND:P; | | | DCL LOCAL PERMANENT SIMPLE | REG2:8 ; | | | DCL LOCAL TEMPORARY FLAG | FLG ; | Fig. 2.5 VMPL GLOBAL AND LOCAL VARIABLE DECLARATIONS this fact. This is done via the WORDSIZE declaration as shown in Fig. 2.6a. Each individual declaration of a data item may carry the information about its size, but in case it doesn't, and the information is required for that type, then the default value of the size is taken from the globally provided information. Fig. 2.7 illustrates this wherein both (a) and (b) are equivalent declarations for an 8-bit register Rl. The two basic ALU arithmetic modes for computers (targets & hosts) are one's-complement and two's complement arithmetic. The host ALU may perform arithmetic differently than the target ALU. In order to keep VMPL machine independent, possible conflicts must be handled at compile time. The VMPL programmer declares the mode of arithmetic to execute under, (the mode of arithmetic done on the target) and the compiler then handles any mismatches which may occur. The declarations are shown in Fig. 2.6b and c. In VMPL bits of a n bit data item are numbered from 0 to n-1 starting at the right or the least significant bit. The 'field extract' operation is frequently done in the instruction decode part of a virtual machine. A field is meant to be the result of the logical AND operation between a data item and a mask followed by a shift operation. The shift may be left or right. Fields are declared and given a name in VMPL. They have two or three integer parameters associated with them. The first two (equivalent to the mask) indicate the range in bit position which has to be masked out. The third optional integer indicates the shift amount. Its absence indicates no shift. A left shift is indicated by a negative number and a right shift by a positive number. The field OPCODE in Fig. 2.6d refers to bits 9, 10 & 11 to be extracted and shifted right by 9 positions. ADDRES on the other hand is a reference to the extracted bits 0 to 8 but doesn't shift them. The way fields are used is discussed later on. # 2.4.1.2.2 LOCAL INFORMATION DECLARATIONS As discussed in I7, we expect to improve the efficiency of the microcode produced by indicating to the compiler the global variables | DCL | WORDSIZE | 12 ; | | |-----|------------|-----------------|---| | | (a) | | | | DCL | ARITHMETIC | 1 ; | | | | (b) | | | | DCL | ARITHMETIC | 2 ; | | | | (c) | | | | DCL | FIELD | OPCODE (9,11,9) | ; | | | | ADDRES (0,8) | ; | | | (d) | | | Fig. 2.6 VMPL GLOBAL INFORMATION DECLARATIONS DCL GLOBAL PERMANENT R1:8; (a) DCL WORDSIZE 8; DCL GLOBAL PERMANENT R1; (b) Fig. 2.7 WORDSIZE DECLARATION EQUIVALENTS which will be used in a certain block. This information is provided by importing the global variables used in a certain block, see Fig. 2.8a (global USE variables). The next two information declarations are associated only with subprocedure blocks. The only variables passed as parameters to a subprocedure are the variables which will be local to the block from which the subprocedure is called. Global variables (like FORTRAN's (40) COMMON) needn't be passed as parameters since they can be declared as global USE variables as discussed above. Thus, within the subprocedure we have three kinds of variables being used; the subprocedure parameters, the global variables imported with the USE statement, and any variables LOCAL to the subprocedure block. We have already discussed the reasons why a subprocedure has to be compiled before a call to it is made (18). The non-local variables (i.e., variables not declared by the LOCAL statement) can be used in three ways within the subprocedure. - a) Their value is used somewhere within the subprocedure block, i.e., the subprocedure expects their value when a call is made to it or it doesn't modify the variables value. - b) They are assigned a value somewhere within the subprocedure block, i.e., the subprocedure returns their value when a call is made to it or it modifies the variables value. - c) Both a & b happen. We will discuss these three cases by the VMPL code shown in Fig. 2.9. The subprocedure block is named SUB. It has one parameter PAR. The procedure block which 'calls' SUB is block MAIN. It has a local variable LOCL which it passes to SUB, i.e., PAR is the formal parameter and LOCL is the actual argument. We assume that SUB doesn't use any global variables. The m-machine for which code is being produced has two registers R1 & R2 in its local storage. # CASE\_a SUB expects the value of PAR, i.e., LOCL when it is called from MAIN. Since SUB is compiled before MAIN, we associate a certain register, DCL GLOBAL USE MEM, ACC, T1; (a) DCL GLOBAL EXPECT REG2; DCL LOCAL EXPECT TMP1; DCL LOCAL RETURN TMP1; DCL LOCAL PERMANENT VAL1, VAL2:3; (b) DCL SPROC USE SUB; (c) Fig. 2.8 VMPL LOCAL INFORMATION DECLARATIONS say R1, with the parameter PAR. This means that when we compile SUB and we want the value of PAR (statement 9), we assume it to be in R1 and pick it up from there. Thus, when we call SUB from MAIN, we first store the value of LOCL in R1 and then do the call operation. It also means that when we compile SUB, we do not use R1 for any other variable being used in SUB until statement 9, i.e., until the time when we first use PAR (R1), we want to preserve it since it has the value which was passed from MAIN to SUB. From there onwards all registers are handled by the compiler in the same manner. All this can be done if within SUB we make a declaration of the fact that the value of the variable PAR is EXPECTED from the block which makes the call. Thus, all global variables and parameters which are expected by a subprocedure are declared as having this property. ## CASE b SUB returns the value of PAR, i.e., LOCL when it is called from MAIN. After having compiled SUB, we know that PAR is in register R2. Within MAIN after producing the code for making the call to SUB (statement 24), we have to produce code for statement 25. For this statement and the following statements, we know LOCL is in R2 since that is where SUB leaves it or returns it. This information is available if we declare PAR within SUB to have the RETURN attribute. Now when the compiler compiles SUB, it will preserve R2 from the point where it last stores the value of PAR in it to the point where it leaves and returns to MAIN. Fig. 2.9 EXPECT & RETURN DECLARATION EXAMPLES SPROC: SUB(PAR:8); DCL LOCAL EXPECT PAR; DCL LOCAL RETURN PAR; DCL LOCAL PERMANENT T1,T2; -- PAR = T1; -- T2 = PAR; -- T-- T2 = PAR; CASE c Fig. 2.9 EXPECT & RETURN DECLARATION EXAMPLES #### CASE c This is just a combination of cases a and b. For this case a variable has to be declared both as an EXPECT & RETURN variable. Examples for these declarations are shown in Fig. 2.9b. Using the same arguments, we can say that upon entry to a block if the compiler knows which, if any, subprocedure will be called from within that block, then it can produce more efficient This is because while producing code for the block, the compiler knows at entry which variables are going to be expected and returned by the subprocedure which is going to be called from within the block. It knows the registers which will hold those values when the call is made to the subprocedure since the subprocedure has been compiled first. Thus, for the statements before the call statement in the block, it tries to utilize the machines registers in a way which will make the change to the subprcedure's environment efficient. As in our previous example, case a, if the compiler knows that SUB is going to be called from within MAIN, then while producing code for statements 1 through 23, it will try to keep LOCL in Rl so that when the call is made to SUB, the value of LOCL needn't be loaded into Rl (extra code). ## 2.4.2 ASSIGNMENT STATEMENTS, OPERATORS & EXPRESSIONS A very simple model for the assignment statement is discussed in I4. Expressions in assignment statements are limited to diadic arithmetic operators and special monadic operators. Logical and relational operators are limited to boolean expressions. The basic arithmetic, logical and relational operators available in VMPL are shown in Table 2.4. Each operand of an assignment statement can have a number of monadic operators associated with it. These monadic operators are: TABLE 2.4 VMPL DIADIC OPERATORS | ARITHMETIC | LOGICAL | RELATIONAL | |------------|---------|------------| | + | .AND. | .GT. | | - | .OR. | .LT. | | * | .XOR. | .EQ. | | / | | | TABLE 2.5 VMPL SHIFT OPERATORS | SHIFT DIRECTION | FILL UP BIT | VMPL NOTATION | |-----------------|-------------|---------------| | RIGHT | Ø | .SHTRØ. | | RIGHT | 1 | .SHTRl. | | LEFT | 0 | .SHTLØ. | | LEFT | 1 | .SHTLl. | (a) A = B.SHTRØ.6; X = Y.SHTL1.2 + 3; a) SHIFT AND ROTATE: There are four monadic shift operators available in VMPL. A data item can be shifted left or right and the bit positions which become available can be filled by either a one (1) or a zero (0) Table 2.5a. In reality, a shift operator is diadic because it requires another operand which specifies the shift amount. In VMPL this operand has to be an integer, i.e., the shift amount is fixed at compile time. Table 2.5b illustrates the use of the 'monadic' shift operator. In the first example, variable B is right shifted and zero filled by six positions and the result stored in A. In the second example Y is shifted as indicated by the shift operator and the shifted result is added to 3. The result is then stored in X. Similarly, two other operators which correspond to rotation are available. These are used for left and right rotation of data items. ROTL is used for left rotation and ROTR is used for right rotation. Like shift, the rotate operator also requires an integer specifying the amount of rotation Table 2.6. - b) FIELD EXTRACTION: As mentioned in the section on global information declarations, in VMPL a programmer can define field operators, which effectively mask and shift various bits out of a data item, i.e., perform the extract operation. The statements in Fig. 2.10b use the fields defined in Fig. 2.10a. The first statement extracts the OPCODE field, i.e., (bits 9, 10 and 11 shifted 9 places to the right) from the variable IR and stores the result in OPCD. The second statement extracts the ADDRES field from IR, adds the result to 20 and stores this result in PAGE. - c) INDIRECTION: One of the most common addressing modes found on various target machines is the indirect memory mode. In VMPL we define an indirection operator by the @ sign. It can only be used on the MEMORY class of data item. If M is a memory variable and ADDR & IR are simple variables, then IR = @M [ADDR] is equivalent to IR = M [M[ADDR]] which by itself is not allowed in VMPL. The square brackets ([]) are TABLE 2.6 VMPL ROTATE OPERATORS | ROTATE DIRECTION | VMPL NOTATION | |------------------|---------------| | LEFT | .ROTL. | | RIGHT | .ROTR. | A = B.ROTL.7; R1 = REG1.ROTR.3; (c) DCL FIELD OPCODE (9,11,9) , ADDRES (0,2) ; (a) OPCD = OPCODE (IR) ; PAGE = 20 + ADDRES (IR) ; (b) Fig. 2.10 EXAMPLES OF VMPL FIELD FACILITY themselves a monadic operator which are used to index into dimensioned variables, i.e., variables which have the DIMENSION attribute. In the example above, once the data at the ADDR index of M is obtained, it is used to index again (indirection) into M to fetch new data which is then assigned to IR. d) CONCATENATION: The concatenation operator allows concatenated items to appear on either side of an assignment. Two SIMPLE variables A and B, can be concatenated together and then logically used as if they existed as a single entity, i.e., $$A//B = A//B+2$$ The integer 2 is added to the logically formed variable A//B and the result stored back in A//B. e) BIT SELECTION: Last of all an operator is available which can test a selected bit of a data item. This operator is limited to boolean expression only. It returns a value of true if the bit in question is a 1, otherwise it returns a false value: This means that the sixth bit of IR is to be tested to see if it is true or false. Since we have defined a number of diadic and monadic operators, precedence rules have to be established for expression evaluation. The primary rule for expression evaluation is: # All monadic operators are executed before the diadic operator is executed We present these rules in the form of a chart as shown in Fig. 2.11. The blank entries mean that the corresponding entries cannot occur simultaneously in an assignment statement or a boolean expression. There is only one predefined variable in VMPL. This is the reserved name POP. Its value is the value of whatever gets popped off the stack, i.e., it is a way of getting data off the stack via the pop function. POP is treated as a simple variable and it can be used anywhere a simple | | = | [ ] | () | @ | // | + | · - | * | / | .AND. | OR. | . XOR. | .NOT. | .GT. | .LT. | .Ед. | .SHTXX. | .ROTY. | |----------|---|-----|----|---|----|---|-----|---|---|-------|-----|--------|-------|------|----------|------|---------|--------| | = | | > | > | > | > | > | > | > | > | > | > | > | > | | | | > | > | | [] | | | < | < | | < | 4 | 4 | < | < | < | < | < | 4 | < | < | < | 4 | | () | | | | > | | < | 4 | < | < | < | 4 | < | < | | | | < | < | | @ | | | | | | < | < | 4 | < | < | < | 4 | < | < | < | < | < | < | | // | | Ţ . | | | | ( | < | < | < | 4 | 4 | < | < | | | | < | < | | + | | | | | | | | | | | _ | | 7 | | | | | | | - | | | | | | | | | | | | | 7 | | | | | | | * | | | | | | | | | | | | | 7 | | | | | | | / | | | | | | | | | | | | | > | | <u> </u> | | > | > | | .AND. | | | | | | | | | | | | | > | | | | > | > | | .OR. | | | | | | | | | | | | | > | | | | > | > | | .XOR. | | | | | | | | | | | | | > | | | | > | > | | .NOT. | | | | | | | | | | | | | | < | < | < | > | > | | .GT. | | | | | | | | | | | | | | | | | 7 | > | | .LT. | | | | | | | | | | | | | | | | | > | > | | .EQ. | | | | | | | | | | | | | | | | | 7 | > | | .SHTXX.* | | | | | | | | | | | | | | | | | | | | .ROTY.** | | | | | | | | | | | | | | | | | | | Fig. 2.11 PRECEDENCE RULES FOR VMPL OPERATORS variable is used, i.e., an operand in an expression, index to the variable of class MEMORY, etc. However, it should be remembered that using this variable has a side effect on the PSTACK variable. The PSTACK variable gets incremented or decremented as defined by its descriptor. In I3 and I5, we mentioned that the setting/clearing of VMPL's predefined flags can be directly indicated in an assignment statement. This is shown in Fig. 2.12. For this example we assume CARY and ZRO are user defined flags associated with VMPL defined carry and zero flags (via the descriptors C and Z). Variable B is added to C and the result is stored in A. The carry and zero conditions generated by this operation is reflected by the setting/clearing of the CARY and ZRO flags. The code to achieve this is generated by the compiler. ## 2.4.3 CONTROL CONSTRUCTS In I5 we indicated the need for a simple and complete set of control constructs modified to the needs of emulator writing. Conditional statements, which according to Elshoff's (50) study make up 17.8% of a program, are provided for in VMPL by the IF...THEN... ELSE....construct with certain modifications. Two IF statements are provided in VMPL; IFTRUE which causes the THEN portion to be executed if the boolean expression is true, and the IFFALSE, which avoids confusion over negative predicates by executing the THEN clause when a predicate is false. This will hopefully remove the semantic difficulty for a programmer, when deciding between: IF NOT (Boolean expression) or #### IF (Boolean expression) The FOR statement allows the repetition of a loop a fixed number of times. As such the FOR statement only allows integers to be used as it indexes—so it may be easily mapped into loop counter mechanisms available on some host machines. The WHILE statement provides a way for conditional looping, executing the loop as long as the condition is true. As mentioned in I5, the case statement of Pascal (56) has been modified so that it can be directly mapped into a N-way branch Fig. 2.12 ENABLING VMPL FLAGS ; instruction found on many host machines. VMPL's case statement is the SELECT statement. An example is shown in Fig. 2.12. The selection in the example is done based on the value of OPCD (first operand). If its value is 1 (target operand), a jump is made to the procedure block LAB1 (target label 1); if its value is 2, a jump is made to LAB2, etc. digit 4 (second operand) following the identifier OPCD is a count of the number of labels available to the SELECT statement. This provides two pieces of information to the compiler. One, it indicates to the compiler how many bits of the first operand (in our example OPCD) will be used for the jump selection. This quantity (the number of bits of the first operand used for jump selection) is Log2 of the second operand (in the example $Log_24=2$ ). It is used by the compiler to mask any extra bits from the second operand (OPCD) if it is wider than it (in the example if OPCD is wider than 2 bits, it's significant bits are masked to make it only 2 bits wide). Secondly, it indicates to the compiler the number of jumps involved in the select operation. The host may have a larger or smaller n-way branch instruction and as such has to handle these cases appropriately. Again, a multipass compiler can handle this situation by counting the number of labels a jump has to be made to, and as such the presence of the second operand indicates redundancy. As mentioned earlier redundancy leads to reliability, one of the requirements put on VMPL by P2. The CONDition statement is equivalent to the SELECT statement in some respects. Each boolean expression is evaluated and if true, the corresponding block of code is executed Fig. 2.12. However, unlike the SELECT statement, the CONDition statement after executing the block of code tests the next condition to see if it is true, and so on, until the end of the statement. Also, unlike the SELECT statement the CONDition statement requires each block of code to follow the statement rather than as a separate procedure block. Two things are obvious from Fig. 2.12. One is the consistent use of 'BEGIN - END' brackets for all five control constructs. Since the 'END' recognizes the statement to which it belongs (ENDIF, ENDFOR, ENDWHILE, ENDSELECT, ENDCOND), it solves the dangling END problem which reduces errors by atleast 5-9% (52) and makes VMPL more reliable. The other is ``` IFTRUE (A .EQ. B) THEN ; BEGIN ; ---- END ; ELSE ; BEGIN ; ---- END ; ENDIF ; WHILE (A .LT. B) ; BEGIN ; ---- END ; ENDWHILE ; FOR A = 3 TO 6 ; BEGIN ; ---- END ; ENDFOR; SELECT (OPCD,4) FROM; COND ; (0,LAB1) ; (A.EQ.B) ; BEGIN ; ---- END ; (1,LAB2) ; (P.EQ.Q) ; BEGIN ; ---- END ; (2,LAB3) ; ENDCOND ; (3,LAB4) ; ENDSELECT ; ``` Fig. 2.13 VMPL CONTROL CONSTRUCTS the consistent use of the BEGIN-END bracket within four of the control constructs. This helps to contain and delimit the statements if more than one of them is present within the control structure. ## 2.4.4 MISCELLANEOUS STATEMENTS There are a few more statements available in VMPL. Elshoff (50) and Knuth (51) determined that increment and decrement accounts for many of the operations in assignment statements. As such an INCrement and DECrement statement is provided. Since register clear (all zeroes) and set (all ones) is a common instruction found in most target instruction set processors, VMPL provides the CLEAR & SET instructions. In order to be able to push data into the stack, the PUSH statement is available in VMPL. As mentioned earlier, data always pops out of the stack when POP is referenced. EXECUTE and RETURN statements are provided to transfer control to and from a subprocedure. VMPL does have a GOTO statement. However, we have limited the scope of the GOTO destination to its block, i.e., the label to which a GOTO transfer's control has to be in the same block as the GOTO statement. The LEAVE statement is used to leave one block of code and enter another. The only other statement which can transfer control from within one block to another is the SELECT statement since its labels are names of other blocks. Thus, the compiler knows the place within a block from where control passes from one block to another. The default place for this transfer is the last statement within the block. A HALT statement is provided to map the targets halt instruction into the host's halt instruction. Examples of these statements are given in Fig. 2.13. # 2.4.5 PROGRAM STRUCTURE I2, I7 and I8 indicate that the VMPL program is made up of two parts. First come the global declarations. These are followed by a set of ``` INC A // B; DEC A[POP] ; SET ADDR ; CLEAR IR ; PUSH DATA ; LEAVE LABEL1 ; HALT ; GOTO IOEND; ``` Fig. 2.14 MISCELLANEOUS VMPL STATEMENTS blocks each with their own set of declarations and executable code. The blocks are of two types: PROCedures and SubPROCedures, and each has a unique name. Each block declares all the variables it is going to use from the global variable set, all the subprocedures it is going to call and any LOCAL variables it requires. Local variables exist only for the execution life time of the block. All identifier and labels in VMPL can have any number of characters. The first character has to be alphabet and the first seven should be a unique combination. A colon (:) must follow a label to indicate the termination of the label. Comments can be placed anywhere in a VMPL program. They have to be enclosed by ((\*) and (\*)) brackets. The only restriction is that they should not split a basic token in the language, e.g., an identifier, a reserved word or an integer. ## 2.6 CONCLUSIONS In this section we have presented the design and syntax of VMPL. The language design is based on a set of 'bases' which are used to derive a set of implications, each of which have been rigorously examined and justified. By providing a basis for the language and a set of implications, we provide a uniform way for making modifications and extensions to the language. This may be required for two reasons. 1) Once a sufficient number of programs have been written in VMPL, we should have enough information about its strong and weak points. We can then go back to our original bases and see which are the ones which require modification. Having done that we can go through the whole process of redesigning the language in the way we did originally. The point is that, since we tried to justify the constructs of the language when we designed them, it will be easier to change them based on the data we obtain about the language at a later time. 2) We may want to make extensions to the language by removing some of the constraints which we put on the problems we are trying to solve, e.g., we may want to include third generation computers with I/O processors in the list of target machines we can emulate using VMPL. This can be done by making the appropriate extensions to the original set of 'bases' and then design the constructs needed to fit this new requirement. Since we have the complete 'design plan' of the language in the form of an implication-justification 'chart', we can see which parts of the language are affected by the new extensions and which are not. Thus, the language can grow easily or be extended according to new demands without requiring the language designer to start from ground zero every time he has to make some changes in the language. The only bias of the author which becomes obvious in this language design experiment is the usage of the instruction use and reliability data available about other programming languages. We feel that it is necessary to use this data to design programming languages. This helps the language designer in producing a language each of whose syntactical constructs have justified their inclusion. It also tells him all the constructs which should not be included. Already we see that this approach is being used in designing a major universal programming language (65) which may revolutionize the whole software industry. #### 3. MACHINE CONSIDERATIONS ## 3.1 INTRODUCTION As indicated in chapter 1 problem 2, one of the reasons why earlier efforts have not effectively solved the high level microprogramming language problem is the lack of identification of primitives for writing code on m-computers. What are the characteristics of the host machine? Can we develop sufficient generalizations about the host? Can these generalizations be used to transport a higher-level language emulator which in turn effectively manipulates the host? Are there any common characteristics among the various machines which defines a class of host machines for which a portable VMPL system is suitable? If we want to design an efficient compiler for a high-level machine-independent language for a m-machines, we must classify the host machines before generating code Eckhouse (13) was the first one to make an attempt in this direction. He took sixty-five architectural features and seven m-computers and made a table of features appearing in the various computers. However, his investigation has the following problems: Pour out of the seven m-computers investigated were microprogrammed computers (INTERDATA 3 & 4, IBM 2025, and the SPIRAS 65). Another one of the computers was Rosin's (16) hypothetical MPP. Another was the IBM 2050, host for the IBM System 360 Model 50 and as such not truly user microprogrammable. His analysis thus included only one computer, the Standard MLP-900, which is user microprogrammable. Although this m-computer was designed for general purpose emulation, it is not widely used. In fact, the only know MLP-900 is the PRIM project machine at the University of Southern California (17). The results which Eckhouse obtained were truly based on one machine, which is not widely used. 2) A closer look at the features which Eckhouse selected indicate that they are tailored for the seven machines chosen for the tabulation. As such some of the results which he obtained in terms of the specific categorizations or the general features of m-computer architecture, are biased and not general enough. A valuable result obtained by Eckhouse related to Input/Output states: "I/O is generally integrated into the basic machine (m-machine) architecture. Therefore, I/O programming is a task which requires specific knowledge of timing constraints, device idiosyncracies, etc." However, while designing MPL (his language) Eckhouse doesn't specifically deal with the problem of I/O and gets around it by saying: "...problem with I/O is the particular device-dependent structure of I/O at the microprogram level which tends to prevent a generalized, higher level language approach to the problem. However, if we consider the simpler machines such as MPP and the INTERDATA 3, we can easily discover several ways to treat I/O." This further reinforces our earlier claim of the machine dependent nature of MPL. Furthermore, it indicates that the results about m-computer architecture which Eckhouse derives were subsequently not used in designing the language, or modified and narrowed down to specific cases. In order to recognize the architectural features of m-computers, we studied approximately 30 m-computers (18, 19, 20, 21, 22, 23, 24, 25, 26, 27, 28, 29, 30, 31, 32, 33, 34, 35, 36, 37). In the next section we will describe and analyze briefly the architectural features of m-machines with particular emphasis on those features that impact on generality. The notion of "generality" is included to make conclusions about m-machine architecture general enough to be machine-independent. This will then lead to a classification of machines acceptable to WMPL's compiler. #### 3.2 m-COMPUTER ARCHITECTURAL FEATURES The features of m-computer architecture which we will discuss are: - 1) Storage facilities. - 2) Functional units. - 3) Data widths. - 4) Bus structure. - 5) Microinstructions. - 6) Input, Output & Interrupt Structure. Although we discuss these features in general, we make specific references to emulation. Emulation is not only the most common application of microprogramming (5), The execution of a program in some language is sometimes performed as an emulation of a virtual machine (target machine) that interprets some high level language. Microprograms are used to implement high level language programs in a variety of ways as shown in Fig. 3.1. All facilities where 'data' (program + data) can reside are known as storage facilities. In m-computers, we find a hierarchy of storage units quite different from other computers. The functional units are usually combinational circuits which operate on data in various ways. The data width along with the bus structure determines the computer's real usefulness in emulation. Microinstructions are the mechanism through which the programmer exercises control over the various facilities available on the machine. Input, Output & Interrupts are sources through which the m-computer communicates with the external world. An overview of the various features of m-computers taken into account is shown in Fig. 3.2. ### 3.2.1 STORAGE FACILITIES The storage facilities of an m-computer can be classified into the following four categories. a) LOCAL STORAGE: Local storage consists of host machine registers usually used by the target machine as working registers. Efficient emulation requires atleast as many dedicated and general purpose registers as the target machine being emulated. A number of single bit Fig. 3.1 MICROPROGRAMMED IMPLEMENTATION OF HIGH LEVEL LANGUAGES (HLL) # 1- STORAGE LOCAL STORAGE MAIN MEMORY CONTROL STORE NANO STORE # 2- FUNCTIONAL UNITS ALU SHIFTERS MASKERS EXTRACTERS OTHER FUNCTIONAL UNITS ### 3- DATA WIDTH FIXED ITERATIVE CAPABILITY FLOATING ### 4- BUSSES FIXED FLOATING # 5- MICROINSTRUCTIONS VERTICAL DIAGONAL HORIZONTAL ENCODING RESIDUAL CONTROL SEQUENCING # 6- INPUT, OUTPUT AND INTERRUPTS FIXED FLEXIBLE SOFT # Fig. 3.2 OVERVIEW OF m-COMPUTER ARCHITECTURAL FEATURES. registers usable for storing condition codes and other status bits can also be of great utility. Special purpose registers or storage hardware frequently improve the efficiency of a machine. For example, the presence of a hardware stack has two advantages. First, it can be used to facilitate the emulation of target machines that have stacks. Second, it can be used for subroutine linkage at the microprogram level. Another type of local storage, present in the market these days, is the availability of intergrated circuits which form a 2, 4 or more bit wide local storage register. However, the physical units (IC's) containing this local storage can be concatenated together to form a local storage registers of any width. - b) MAIN MEMORY: Arbitrary target machine instructions (macro instructions) reside in main memory. Thus, word width, total number of words available and the addressing modes are important factors for general purpose emulation. - c) CONTROL STORE: Control store refers to the memory containing microinstructions of the m-computer. The structure, size and volatility of the control store is a large contributing factor in determining different classes of m-computers. Depending on volatility this memory can be read only (ROM) or read/write (RAM). In order to have concurrent fetch & execution of microinstructions, the control store support registers (e.g. the address register, data register, etc.) should be separate from other local storage registers. Further support units may be provided to address this memory in different ways, i.e., a separate ALU may be provided to do address calculations, etc. - d) NANO STORAGE: This is a read/write memory, one level below the control store. In the nano storage are stored nano-instructions which make up the nano program. Each microinstruction is associated with a nano-program. A microinstruction is thus executed by executing its corresponding nano program. The nano program is thus the micro-architecture emulator. This is equivalent to a macro-instruction being executed by its corresponding microprogram which is made up of a set of microinstructions. This level or hierarchy of instructions is shown in Fig. 3.3. The reason why some machines (38) are provided with a nano-storage is that the user can have a much more 'finer' control over the machine. With microinstructions he can define his own macro-instructions. With nano-instructions the user can define his own micro- instructions. This helps him in designing the microinstructions which are most suitable for designing the macro-instructions. ## 3.3.3 FUNCTIONAL UNITS The most common and dominant of the functional units is the arithmetic logic unit (ALU). One or more ALU's provide the basic functional operations (ADD, SUBTRACT, AND, OR, etc.) done by the m-machine on data. Generality of ALU's requires them to process data in different arithmetic conventions (one's and two's complement), process data of different arithmetic modes (binary, decimal, etc.) and process data of varying length (word, byte, etc.). The availability and generality of various kinds of status flags (CARRY, ZERO, etc.) associated with the ALU are also important characteristics of the machine. The presence of a separate set of flags for the target-virtual flags, is a definite help for target flag emulation. At times secondary ALU's are provided for a fixed set of tasks like evaluating the effective address, operating in a certain mode, etc. Typically, a fixed size of ALU is available, i.e., it can operate on operands of a certain length. However, one or more of the ALU's can be physically cascaded to form an ALU of larger width, if it is required to operate on operands of a larger width. This effect can also be achieved in some m-computers which have an iterative or/and residual control mechanism for the ALU (20). A shifter unit (or units may be associated with either the inputs or the outputs of the ALU (implicit shifting) or it may be a separate unit in the machine (explicit shifting). Shifters may be single or multiple bit shifters, arithmetic, logical or circular shifters and may shift single or double words. Fig. 3.3 HIERARCHY OF INSTRUCTIONS Masking of data before sending it to the ALU or after receiving it from the ALU is achieved by units known as maskers. They can also be implicit or explicit units. As mentioned earlier, this facility of the machine improves the generality of the ALU functions. It can also, along with the shifter, be used to extract a field of arbitrary length and position from a word or a register. This 'extract' feature can also be provided in a variety of different ways including a special register designed to provide extract capabilities or have hardware assisted features which decode target machine instructions by extracting various fields. In the case of implicit shifters, the capability of pre or post shifting, along with pre or post masking of data to and from the ALU is useful for calculations involving variable bit arithmetic, bit testing, etc. Fig. 3.4 illustrates some common ALU structures found in current m-computers (5). Other types of functional units are also available on m-computers. A common set of units found on some machines is the multiply and divide unit. This may also be present implicitly in a machine by providing a microperation which does the basic multiply-step/divide-step used in a repeated addition/subtraction algorithm for multiplication/division. #### 3.2.3 DATA WIDTHS Data widths characterize the machine according to the width of the previous two features and the busses connecting them. The target data width can be smaller, equal to or greater than the data width available on the host. This poses a problem in emulation which can be solved in various ways, e.g., by masking, shifting, iteration, etc. Alternately, physical concatenation of modular units can be used to achieve wider data widths. However, once configured, the host data width is permanent since it is hardwired, but the emulated target machine data width may be multiples of the host machine. Fig. 3.4 SOME ALU STRUCTURES #### 3.2.4 BUSSES often, all the functional units and storage units are connected to each other by a number of data paths called busses. The bus structures, in terms of the number of busses and their width, affects the flexibility and ease of microprogramming the machine. By generality we mean that this aspect of host machines can ease the process of emulating targets with a wide variety of (macro) instruction formats and addressing modes. These two aspects of the bus structure also affect the addressing and microinstruction format which in turn leads to generality of the machine. At times, the bus structure is alterable in that connections can be set up and altered under microprogram control through special purpose local storage registers called residual control registers. # 3.2.5 MICROINSTRUCTIONS The mechanism through which the microprogrammer exercises control of the various facilities available on the machine, mentioned in the previous features, is obtained through microinstructions. Each facility in an m-computer is either directly or indirectly controlled by micro-operations which in turn are combined together in different ways to produce microinstructions. We can classify microinstructions depending on the fixed number of micro-operations per microinstruction. - Vertical microinstruction format represents at most one or two micro-operations and resembles classical machine language instructions. - 2) Horizontal microinstruction format represents several micro-operations. Thus a single horizontal micro-instruction controls a number of hardware facilities at the same time. - 3) <u>Diagonal</u> microinstruction format lies between the two extremes. It has a limited capability to perform a few micro-operations in parallel. Micro-operations within microinstructions are represented either by single bits or multiple bits known as fields. The latter approach is known as encoding of micro-operations. In the simplest design, reminiscent of Wilkes' original proposal (1), there is no encoding at all and each bit represents one micro-operation (direct control), i.e., controls one resource or operation in the machine. Typically, single level or direct encoding of micro-operations is represented as a field within the microinstruction. Bits that control mutually exclusive resources, such as operations that an ALU can perform, are combined into fields as shown in Fig. 3.5. The next step is to have two level or indirect encoding wherein mutually exclusive micro-operations may be combined to form fields. The meaning of a field depends on the value of another field. This scheme is known as bit steering. The various forms of microinstruction encoding are shown in Fig. 3.6. The most common technique for microinstruction sequencing is to include the address of the next microinstruction within the current microinstruction. A second technique is to provide an incrementing capability for the microinstruction address register. Conditional and unconditional branches are implemented in both sequencing techniques. By storing the next address in the microinstruction, multiple or n-way branch capability is obtained. This is a useful feature to have in m-machines used for general purpose emulation. ### 3.2.6 INPUT, OUTPUT & INTERRUPT STRUCTURE Requirements for input/output facilities and an interrupt structure depends on the environment in which the machine is to be used. For general purpose microprogramming any 'reasonable' I/O and interrupt structure provides the required support. However, if the machine is to be used for general purpose emulation the requirements for I/O and interrupt structure are very demanding. Mapping of target I/O facilities to host I/O facilities must be complete. This means that the host should allow the emulation of all target I/O facilities and I/O activities. Interrupts should be soft (38) for the sake of generality. This means that interrupts set up bits for examination rather than forcing | NO ENCODING | ENCODED | ALU | OPERATION | |-------------|---------|-----|----------------------| | 00000001 | 000 | 1- | ADD | | 00000010 | 001 | 2~ | SUBTRACT | | 00000100 | 010 | 3- | ADD WITH CARRY | | 00001000 | 011 | 4- | SUBTRACT WITH BORROW | | 00010000 | 100 | 5~ | AND | | 00100000 | 101 | 6- | OR | | 01000000 | 110 | 7- | COMPLEMENT | | 10000000 | 111 | 8 | COMPLEMENT + 1 | Fig. 3.5 NO ENCODING (a) VS SINGLE LEVEL ENCODING Fig. 3.6 MICROINSTRUCTION ENCODINGS: - a, NO ENCODING - b, SINGLE LEVEL ENCODING - c, BIT STEERING predetermined control storage transfers or any other equivalent predetermined (built in) activity. Thus the microprogrammer has the flexibility in handling them according to his needs or to be more accurate, according to the needs of his emulated target. ### 3.3 m-COMPUTER PRIMITIVES We can arrive at some interesting and more specific conclusions about common characteristics of m-computers, now that we have identified their various architectural features. These characteristics form or identify the primitives of the host for which we seek to generate efficient VMPL code. We will make use of them when we identify primitives to manipulate the hardware. Essentially, this latter design is a syntactical and semantic description of a language. Hence, the design of microprogramming language primitives is achieved by associating or mapping the code generation primitives that we derived in chapter 2 onto the 'hardware'primitives that exist for a class of m-machines. The basic characteristics of m-computers belonging to the class that VMPL is portable over are: - 1) Word-oriented, Von-Neuman type fixed data width m-machines (however, there are large and small data width machines). - 2) Local storage limited to a few (8-16) general purpose registers. - 3) Input-Output and interrupt structure mostly hard and very much integrated into the machine architecture. - 4) Microinstruction format (horizontal, diagonal or vertical) not limited to one type. - 5) Microinstructions belong to four classes: 1) data modifications, 2) data transfer, 3) transfer of control and 4) machine control. - 6) Subroutining and functional calls may not be present at the host level. - 7) The most general ALU structure which completely covers all ALU structures is shown in Fig. 3.7. Most m-computer ALU's are a subset of this structure. Fig. 3.7 A GENERAL ALU STRUCTURE - 8) An n-way branching instruction may be included in the microinstruction repertoire. - 9) Iterative, shifting and masking capabilities, which might be present would make the effective data widths variable in nature. - 10) Generally, only one set of status flags is present. Furthermore, flag generation for variable length operations is not available (implementation can be obtained because of 8 above). The first characteristic means that the m-machines are not stack or parallel in nature. In general the machines are word oriented in that the operations on the data, and the data widths available on the machines are equal to the word size of the machine. As such, byte (or any other word size multiple) operations have to be explicitly executed by the microprogrammer by masking and shifting. What this characteristic actually indicates is that the experience gained from the design and implementation of compilers for higher level languages for conventional computers applies to the design of a compiler for VMPL too. Data widths of m-computers reflect on the class of computers which can be considered as targets for the purpose of emulation. As an example, if all m-computers were eight bits wide with no capability of repetitive (or pseudo iterative obtained via masking and shifting) control then the only computers which could be emulated (serve as targets) would be eight bit computers. Since local storage is generally limited to a few registers, emulation of computers with a large number of registers becomes difficult. However, there are exceptions to this case. MATHILDA (35), a dynamically microprogrammable m-computer, has large amounts of local storage organized as a hierarchy of registers. Special purpose registers are very limited in nature, like the stack control registers on the MICRODATA 3200, the floating point registers on the INTERDATA 8/32, the extract registers on the Burroughs 1700. The problem with I/O is its particular machine dependent structure at the micro-level. A few m-computers (HP 21-MX, VARIAN 73, PDP 11/40E, INTERDATA 8/32, etc.) do their I/O via a set of busses (command, data, address, status, etc.) a few others (MICRODATA 3200, CAL DATA PROCESSOR, AMP, etc.) use memory mapped I/O, and yet another group (QM-1, META 4, INTERDATA 85, PRIME 300, MATHILDA, etc.) do their I/O via special purpose registers. All these variations suggest a lack of a primitive I/O structure associated with m-computers. In fact, the memory mapped case is very restrictive as far as emulation of general target machines is concerned. Interrupt structures also vary a lot between different machines. Interrupts can be soft (MICRODATA 3200, MATHILDA, QM-1) in that they are under the control of the microprogrammer, partially soft (VARIAN 73, HP21MX) wherein software or hardware logic can be provided to mask out or conditionally jump on interrupts, or hard (PRIME 300, PDP 11/40-E) wherein hardware logic determines a fixed set of trap address in case of an interrupt. This last case is obviously the most restrictive for general purpose emulation. With respect to the fourth characteristic, microinstruction format, we can assume that all m-computers have horizontal microinstructions, since this structure essentially covers the other two cases. That is, diagonal microinstructions are a degenerate case of horizontal microinstructions and vertical microinstructions are a degenerate case of diagonal microinstructions. Thus we can limit our treatment to m-computers with horizontal microinstructions. The fifth characteristic, classes of microinstructions, indicates that the same type of microinstructions are found in most m-computers. Thus the task of designing a high level language processor is simplified if an intermediate language compatible with these microinstructions is designed. For example, the branch-on-decode opcode process of emulation can be efficiently realize via an appropriate intermediate language construct because of the presence of the n-way branch microinstruction found in most m-computers. Iteration, shifting and masking help achieve emulation of a target machine on a host whose data widths do not match the target. This feature is a requirement for the so-called universal emulator. (39) Conversely, the sixth characteristic, subroutining, points to those features that are not available to the microprogrammer. A high level language creates a virtual machine with language constructs such as subroutine, function calls. Furthermore, most m-computers do not have all operations associated with programming languages, e.g., multiply and divide. At what level should high level language constructs not present in the host machine be handled? Should the programmer (in the high level language) be concerned about these things or should the compiler handle these deficiencies? In order to keep the programmer away from the machine the compiler should handle all those situations which can be recognized and solved effectively and in an efficient manner. Although a variety of ALU structures are present, we can basically recognize the presence of an ALU, shifter and masking units. The shifts generally are single bit and can be achieved separately from the ALU. The masking unit is usually associated with the output of the ALU but in general is implicitly provided via the AND microinstruction of the ALU. Status flags create a problem for general purpose emulation. Not only are the ways in which the status flags get activated non-uniform across the different m-machines but no machine other than the MICRODATA 3200 actually provides two sets of flags which can be used separately by the host and the target machine. Furthermore, even if iterative control is available, status flag generation is limited to fixed length data. Thus variable precision operations—because of host target data width mismatches—at the microlevel are difficult to achieve. All these statements illustrate the need for sophisticated software which can minimize the differences, supplement the deficiencies and recognize the variations inherent in m-computers. #### 3.4 m-COMPUTER CLASSES We can classify m-computers by their architectural features according to the following overlapping classes: - 1) <u>CLASS 1</u> Computers that are microprogrammed. - 2) <u>CLASS 2</u> Computers that are virtual machine dependent. - 3) <u>CLASS 3</u> User microprogrammable machines. - 4) CLASS 4 General purpose emulation computers. - 5) CLASS 5 High level language support computers. - 6) CLASS 6 Bit slice architecture based machines. # 3.4.1 CLASS 1 Class 1 m-computers cannot be microprogrammed by their users. Computers belonging to this class have a control store made up of read-only-memory. The IBM 2050 'host' for the System 360 Model 50, and the PDP 11/40 are two computers that belong to this class. ## 3.4.2 CLASS 2 The next class of m-computers to be considered are those that have been designed with the primary purpose of implementing a particular target machine architecture. These machines often support limited user microprogramming. Thus this is the class of virtual machine dependent hosts. Most of the features like data widths, word and byte operations, bus structure, input & output, interrupt structure, in these computers are reflections or direct mappings of the architecture of the target machine. In general these machines have more to offer than the original machine they emulate. They may have more hardware functional units and software, to implement extra user level machine instructions. Furthermore, additional read write control is provided for the user to implement his own instructions. These enhancements are the primary reason why such machines are available in the market and are in general a cost effective buy over the original machine. Although these machines are called 'general purpose' emulators by their manufacturers, they are not so because of their 'machine dependent' nature. INTERDATA 8/32 and CALDATA 1/35 are two machines belonging to this class. INTERDATA 8/32 has the same basic system level architecture as the system 360 series. Data width, ALU operations, input & output and interrupt handling is the same as in the 360's. Special hardware features are provided for machine language instruction decoding and are tailored for the 360 instruction set. Provisions for more than a single set of registers useful in a multiprogramming environment and hardware multiply & divide are enhancements over the earlier models of the 360. The standard emulator for the 360 instruction set uses only part of the total available control store, so the rest is available for user microprogramming. CALDATA 1/35 emulates a PDP 11/35 which is a low end version of the PDP 11/40. Keeping in line with Digital Equipment Corporation (manufacturers of PDP computers), all devices on the CALDATA are addressed via the MACROBUS (equivalent to DEC's UNIBUS). The word size, number of registers, etc., are the same. Input-output and interrupt handling is hardwired according to PDP 11/35 specifications. Of course, a writable control store is provided which makes the computer user microprogrammable. # 3.4.3 CLASS 3 This is the class of user microprogrammable computers. There are two categories of these machines. To one category belong those computers which are general purpose machines. They are provided with a variety of microinstructions (horizontal or diagonal in nature), some general processing facilities, 8 to 16 general purpose registers, etc. Interrupt and I/O handling is partially soft. Usually an emulator for a machine language and other software development aids are provided for user microprogramming. It is important to note that the emulator is designed 'after' the machine has been constructed, i.e., the micro-architecture is quite general and not restricted to the requirements of the machine language. The other category is of special purpose machines which are simple in structure (usually vertical microinstructions) and are used as peripheral processors, signal processors, etc. They offer the flexibility which a hardwired machine doesn't have and are cost effective alternatives to the general purpose m-computers discussed above. HP 21-MX and CASH 8 belong to this class. HP 21-MX is a general purpose 16 bit computer with diagonal microinstructions. It does provide some facilities to handle machine language instructions. However, these facilities are very general and can be used effectively by a microprogrammer. Interrupt handling is partially soft and input & output is done via a main I/O bus. CASH 8 is an inexpensive microprogrammable computer. It is used most frequently as a substitute for hardwired process controllers. It is extremely simple in architecture (doesn't have a main memory) with very few restrictive facilities. ## 3.4.4 CLASS 4 This is the class of computers which can be used as general purpose emulation machines. There are essentially two categories of architectures which fit this class. In the first category, we have m-computers with a large and flexible local storage and enough processing facilities to efficiently emulate various user level architectures with ease and efficiency. Emulation of various word widths can be done easily by providing residually controlled masking and shifting capabilities. The input & output and the interrupt structure is soft to be tailored according to requirements. In the m-machines of the second class the microarchitecture can be designed by the user. This is done by providing an architectural level below the micro level known as the nano level and correspondingly nano-architecture. Data widths, input & output and interrupt handling are determined by nano-instructions which make the machine at the micro-architecture look very soft. This two level structure permits flexibility in the design of upper level microinstructions which in turn make the design of machine language instructions simpler. MATHILDA and QM-1 are two computers belonging to this class. MATHILDA is a general purpose dynamically user microprogrammable computer with an abundance of local storage and provision for masking and iterative control that make this computer suitable for general purpose emulation. Input-output registers are provided for general purpose I/O transfers and a soft interrupt structure is available. QM-1 is also a general purpose m-machine employing two levels of control. The microlevel architecture is defined by nanoprograms residing at the nano level. Data widths are fixed but iteration is available via automatic repeated execution of the nano instructions. I/O and interrupt handling are soft. ## 3.4.5 CLASS 5 This class of machines is the class of so-called high level language (hll) support machines. These machines have features built into them which support a particular style of language. Essentially, the various data types of a language require some kind of iterative or residual control for 'varying' the data width dynamically. A block structure language is easily implemented on a stack-like architecture, i.e., the hardware support of a stack at the microarchitecture level. BURROUGHS 1700 and MICRODATA 3200 are high level support machines. The BURROUGHS 1700 was designed to support a variety of intermediate languages known as s-languages (34). Various high level lanaguages are compiled to different s-languages which are then executed on the computer. Main memory is bit addressable and the processing facilities can be used iteratively, providing any required effective data width. Residual control is provided to configure the machine into a structure which is most efficient for a particular s-language. All I/O and interrupts are soft. MICRODATA 3200 is provided with an emulator for a stack oriented machine. The emulator is known as the 32/S. The architecture of the 32/S was designed in conjunction with MPL-Microdata Programming Language, a machine independent high level programming language. Since MPL is a block structured language, the stack on the 3200 is used extensively. This is an example of direct hardware support for a high level language. ## 3.6.6 CLASS 6 This class of machines is a set of families of integrated circuit 'chips' available in the market and microprogrammable by their nature. They are known as bit slice processors. The ALVs are essentially parts of registers, functional processing units, etc., 2 to 4 bits wide. Any number of these can be cascaded together to produce the desired word width. Also any support circuitry comprising extra registers, input & output and interrupt handling can be attached to the machine to form a variety of architectures. Thus tailoring is achieved using other support chips. Many architectures can be emulated as long as the chips are connected together as required. INTEL 3000 and AMD 2900 series of IC's belong to this class of machines. # 3.5 CONCLUSIONS In this chapter we have recognized some of the hardware primitives and constraints we are going to work with. These can be summarized as follows: - 1) Word oriented, Von Neuman type machine architecture. - 2) Fixed data widths. - 3) Limited local storage. - 4) Hard Input & Output and interrupt structure. - 5) Conventional microinstructions. - 6) No subroutining facilities. - 7) Limited number of functional units. - 8) N-way branching. - 9) Iteration, shifting & masking capabilities available. - 10) Only host status flags available. We also categorized m-computers into 6 classes. In the rest of this thesis all references to m-computers will mean computers belonging to classes 2 to 6, i.e., those m-computers which are user microprogrammable in nature. # 4. PORTABILITY ## 4.1 INTRODUCTION This chapter is concerned with solving the third problem which is identified in Chapter 1. The statement of the problem is: A general tool (for microprogramming) has not been provided. Specifically, the problem of portability has not been addressed or solved effectively (in earlier attempts for high level microprogramming languages). Portability has been defined in several ways. According to Waite (57), the portability of a program is a measure of the ease with which it can be implemented on a new machine. Another definition is that portability is the property of a system which permits it to be mapped from one environment to a different environment (58). Although these qualitative definitions sufficiently describe the problem, Brown (59) established a more quantitative measure: A <u>program</u> or programming system is called <u>portable</u> if the effort required to move it into a new environment is much less than the effort that would be required to reprogram it for the new environment. # 4.2 CLASSIFICATION OF TECHNIQUES Techniques for transferring software from one environment to another can be divided into two general categories. These two categories are: - 1) Conversion techniques - 2) Inherently portable techniques #### 4.2.1 CONVERSION TECHNIQUES Within this category come all techniques which accept programs which were written for a particular system without any forethought of portability. This means that the requirement to transfer the programming system is not considered at system development time so as to provide inherent portability properties. Consequently, such systems are transferred by various conversion techniques. Some of these techniques produce an equivalent program for the new machine via decompilers, while others simply attempt to execute the original program (program written for machine A) via emulation or simulation (for machine B). Although these techniques have been used for some programming systems, it seems obvious that inherently portable systems provide a more general and desirable solution to the portability problem. # 4.2.2 INHERENTLY PORTABLE TECHNIQUES From the discussion on conversion techniques, one comes to the conclusion that those techniques are not the real solutions to the problem of portability simply because the problem is not an initial design constraint. As indicated by Halstead (58), a procedure which avoids a problem instead of solving it is often the best solution after all. Inherently portable techniques as such attempt to facilitate the transfer problem by writing the original code in such a way that it will 'inherently' (by its very nature) be easy to transfer. This is very much the way that 'machine independent high level' languages attempt to solve the portability problem. However, because of the various and varied implementations of these languages on different computer systems, the portability problem is only partially solved (60). Lack of universal standards, provisions of extensions by vendors which are usually machine dependent and efficiency considerations have made simple recompilation of software to facilitate the transfer from one system to another impractical. Thus, programs written in these languages cause invalid results when run on one system even though they ran correctly on the original system. The problem of providing compilers for many high level languages on many machines is the basic goal behind the UNCOL (Universal Computer Oriented Language) concept (42). UNCOL was a language proposal intended to provide a common path between many high level languages and many machine languages. Each high level language could be translated into UNCOL once and in turn one UNCOL translator could be written for each machine. Thus, in order to implement M high level languages on N distinct machines, we would require M + N compilers. This implementation effort would be considerably less than that of conventional compilation techniques which would require M \* N compilers (42) Fig. 4.1. Although several attempts have been made in this direction, the need for UNCOL to be 'universal' is the main reason for its lack of success. The only viable example of this approach is the implementation of one high level language on various machines. PASCAL (56) is easily, i.e. with little effort, implemented on different systems. A compiler for PASCAL written in PASCAL is available in an abstract intermediate language called p-code. In order to implement PASCAL on a system, the implementor has to write a simulator for p-code. He can then transfer the available p-code program (actually the PASCAL compiler) to his system. At this point, he has a working PASCAL compiler on his system. The effort involved in writing the simulator (for p-code) is quite small as compared to writing a compiler for PASCAL. In bootstrapping the compiler for a language, L is realized in a subset of that language. Then the compiler for this subset is realized in a low level language for the machine on which it is desired to implement the language L. The language is portable because the only effort required to implement L on any machine is the realization of the translator for the subset. The main drawback of this technique is that most high level languages (least of all subsets of high level languages) are not well suited for writing compilers for themselves. As such, a variety of modifications to the pure bootstrapping techniques have evolved (61). One variation is the development of several special languages which are well suited for the first stage in the bootstrapping technique. Another variation makes use of several languages in a hierarchical structure. Only the lowest language need be mapped into a new machine in order to initiate the bootstrapping sequence. ### (a) UNCOL USED AS INTERMEDIATE LANGUAGE # (b) DIRECT IMPLEMENTATION OF LANGUAGES Fig. 4.1 HIGH LEVEL LANGUAGES AND COMPILATION 'Abstract machine modelling' is described by Newey (62) as a procedure based on the concept that the fundamental data types and operations required to solve a particular problem define a special purpose computer (the abstract model) which is ideally suited to that problem. The algorithm for obtaining the solution can be encoded as a program for this 'abstract machine model.' In order to obtain a running version, the abstract machine model is realized on an existing computer by implementing its basic data types and operations. The term 'descriptive language' instead of 'abstract machine' has been used by others (59). We will refer to these models as the Intermediate Language Models and the language describing the model as the intermediate language. In some cases only one level of intermediate language is required to implement the compiler. This results in a very simple variation of the bootstrap technique described previously. However, an attractive alternative exists in the development of an entire hierarchy of intermediate languages to provide convenient steps from the high level language for describing the compiler to the lowest level to be bootstrapped. The objective of the hierarchy is to ease the implementation of each intermediate language and to obtain a 'base' intermediate language which is easy to implement on the target computer. It is important that this base language be easy to implement because it is at this level that true portability is determined. This is so because after the hierarchy has been completed once, the only requirement for transferring the system to a new machine is the realization of the base intermediate language on the new machine (63). # 4.3 APPLICATION TO MICROPROGRAM PORTABILITY We have reviewed some of the techniques used to transfer computer software from one machine to another. These include methods of conversion and the production of inherently portable software. Unfortunately, none of these techniques provide a complete and general solution to the microprogram portability problem. Perhaps a combination of techniques will provide a reasonable solution to transfer programs; namely, a machine independent language using a multiphase compilation system. In such a combination, the compiler works downwards through a series of intermediate microprogramming languages for abstract machines. In the last phase, the compiler generates target machine dependent code. This last phase would have to be created for each target m-computer. This procedure is adopted by Mallett and Lewis (7) and is the one we use for producing inherently portable microprograms. In the next section we briefly describe their model for the compilation system. # 4.3.1 THE MALLETT-LEWIS MODEL This compilation system has three phases as shown in Fig. 4.2. The first phase is the machine independent part. This consists of a compiler that does the syntax and semantic analysis, translating the high level language through various levels of intermediate languages, as needed, to produce a sequential string of micro-operations for an abstract m-computer called the intermediate microprogramming language machine (IMLM). Next, a machine (IMLM) dependent concurrency recognition and optimization algorithm composes the sequential micro-operations in the IMLM horizontal format. The last phase is the code generation part of the system and is machine dependent. For each host m-computer a machine dependent interface is needed. The interface programs will decompose (if necessary) the optimized IMLM microinstructions and translate them into host microinstructions. A final code optimization pass produces horizontal microcode. The most important component of this system is the intermediate microprogramming language (IML). In the next chapter we describe the particular IML which we have used for our compilation system. #### 4.4 CONCLUSIONS In this chapter we have discussed the portability problem in general and looked at the various solutions available. We have adopted the model of multiphase compilation proposed by Mallett and Lewis (7). Thus, the third problem in this chapter (and Chapter 1) as stated is solved via Fig. 4.2 THE MALLETT-LEWIS MODEL OF A TRANSLATION SYSTEM FOR PRODUCING INHERENTLY PORTABLE MICROCODE the following implication: IMPLICATION 9: Machine independency (portability) is obtained by compiling the language into an abstract intermediate machine language (IML). The intermediate language is then compiled into microcode. #### 5. INTERMEDIATE MACHINE LANGUAGE #### 5.1 INTRODUCTION The use of an intermediate machine language (IML) in this research has greatly facilitated the translation process from a higher-level machine independent language into the final machine-dependent microcode. The Mallett-Lewis model (chapter 4) requires an intermediate language. The question now is what kind of intermediate language format should be used? In the next section we present a brief summary of the various intermediate language formats which are available and results obtained by various people as to their usage. ## 5.2 IML FORMATS & ATTRIBUTES - In (66) Lawson discusses the pragmatics of various intermediate language text representations (formats), along with their hardware implications. The five types of IML representations which he describes are: - a) QUADRUPLE OR THREE OPERAND FORMAT: A quadruple is composed of an operation code, and one to three operands. The basic structure for the quadruple intermediate text is: op-code 1st operand 2nd operand 3rd operand The first two operands are essentially the sources and the third operand is the destination of the result. b) TRIPLE OR TWO OPERAND FORMAT: A triple is composed of an operation code and one or two operands. The basic structure for a triple is: op-code 1st operand 2nd operand In the case of the operation specified for the triple requires more than two operands, the third operand is on the top of the stack. Thus, this representation assumes the presence of a stack. c) DUO OR ONE OPERAND FORMAT: A duo is composed of an operation code followed by one operand. Its basic structure is: op-code operand When the specified operation requires more than one operand, the other operands are assumed on top of the stack. - d) POLISH NOTATION: There is no rigid structure that determines a fixed relationship between operation codes and operands. Each position of the IML stream is examined to determine whether it is an operation or an operand. However, the representation does require the presence of an operand stack. - e) PROGRAM TREE: The elements of the IML stream (the IML instructions comprising the program) are not necessarily located in contiguous locations. Each node in the instruction stream must be examined to determine whether it is an operation or an operand. In this respect, it is similar to the polish notation. This notation also requires an operand stack. Four measurement units are used to indicate the differences between the five IML text representation. These are: - Number of instruction elements: This is simply the number of unique units of information in the IML stream, i.e., an operation code, operand, etc. - 2) <u>Instruction increment size</u>: This measurement indicates the number of units that must be skipped by the instruction fetch-decode unit of a computer to access the next element of the instruction stream. - 3) <u>Number of units directly examined</u>: This measurement indicates the number of units that must be examined by the instruction fetch-decode unit of a computer to determine the composition of the instruction. - 4) Stack requirements: This measurement indicates what stacks are required for each of the IML format and the type of stack usage. Lawson's (66) results for the five IML formats are given in Table 5.1. These results are based on the IML representation of one assignment statement which is similar to those used in several high-level programming languages: $$X = A * B/(C+D)$$ With these measurements in mind, we may now evaluate and contrast the attributes of the various IML formats. TABLE 5.1 INSTRUCTION-STREAM COMPARISON FOR THE ASSIGNMENT STATEMENT | INSTRUCTION-<br>STREAM<br>FORM | NUMBER OF<br>INSTRUCTION<br>ELEMENTS | INSTRUCTION<br>INCREMENT<br>SIZE | NUMBER OF<br>UNITS<br>DIRECTLY<br>EXAMINED | STACK<br>REQUIREMENTS | |--------------------------------|--------------------------------------|----------------------------------|--------------------------------------------|---------------------------------------------| | THREE ADDRESS | 16 | 4 | 4 | NONE | | TWO ADDRESS | 12 | 3 | 12 | l<br>(For Temporaries) | | ONE ADDRESS | 12 | 2 | 12 | l<br>(For a Push-<br>Down Accumu-<br>lator) | | POLISH | 9 | 1 | 9 | l<br>(Operand<br>Stack) | | PROGRAM-TREE | 27 | VARIABLE | 9 | 2<br>(Operand &<br>Operator<br>Stack) | # 1) CONCISENESS Conciseness is a measure of the number of IML elements required to represent the decomposed form of the high level language statement. The order of conciseness is as follows: Polish; one address, two addresses, three addresses, program tree. # 2) COMPLEXITY IN INTERPRETATION This complexity deals with the work which the instruction fetch-decode unit of a computer has to perform in decoding the instruction and preparing the operands for the execution of the instruction. The three address format is the least complex since the instruction fetch-decode unit does not have to examine each unit (operation code, operand) to determine its class. The polish and program tree notation on the other hand, are the most complex since each unit of the IML stream has to be examined to determine the class of the element (i.e., operation-code or operand). The complexity ratings for the five formats are: three addresses; one address, two address; Polish; program tree. In another paper by Merwin (67) three IML formats are compared for execution speeds. The three formats selected are quadruples, triples and duos. Four programs written in FORTRAN were used to evaluate the performance of the three IML representations. It was found that the quadruple format is the most 'compact' when compactness is related to the execution time of the IML instruction stream of the four programs. From the data which is collected in these two papers, we come to the following conclusions: - The quadruple (three address) IML format takes the least amount of execution time. - The polish IML format is the most concise. This means that when compiling a high level language statement into the IML statement, the polish notation produces the least number of IML units. In other words, the resulting polish statements are nearest to the original high level language statement. We will show later how this 'nearest' criteria is an important requirement for the IML which we select. Other than the quadruple format, all IML formats require a stack. Eckhouse (12) was the first to use the concept of an IML for the compilation of a high level microprogramming language to microcode. He uses SML (13) as an intermediate language for the conversion of MPL into microcode. SML resembles the code of a single-address computer. There are only two types of instructions in SML--register loads and stores, and execute operations. Arguments needed for operations must first be loaded in argument or A-registers. Results of operations are left in result or R-registers, and may be stored in other registers or memory locations. Temporary or T-registers are available for intermediate results. SML fails in solving the register allocation-deallocation problem. The compiler can use any number of A & T registers and no information is kept of the variables and the corresponding A & T registers in which they reside. This produces a lot of redundant load and store instructions at the SML level. These redundant load and store instructions then migrate to the microcode level producing non-optimum microcode. Furthermore, the SML code produced is not highly suited for two or three address m-computers. From the initial discussion, it is obvious that we should not use the two address, one address, polish & program tree IML formats. The main reason for this decision is the requirement of a stack for their execution. Thus, the microcode which would be produced on further compilation of these formats would be well suited (efficient) for m-computers with stacks available at the microlevel. The availability of a stack at that level is not a primary characteristic of m-computer architecture (Chapter 3). MPL (18) provides us with a case where a one address IML format is used as the intermediate language. The intermediate language is executed on the MICRODATA 3200 by a set of microprograms. Central to execution of this one address IML (as indicated earlier) is the stack capability which is a hardware feature of the 3200 at the microarchitecture level. Thus, the hardware feature (stack) is used by the micro-programs in executing the IML instructions. We did some tests on the two formats to further justify the usage of the three address IML format over the one used by Eckhouse. A high level VMPL program (VM1-Appendix C) was taken and broken into three equal parts. These three parts were then coded or compiled down to a three address IML and a single address SML like language. Fig. 5.1 shows a piece of the VMPL codes and its corresponding three address and single address equivalents. The three address IML constructs shown are the actual three address IML constructs designed in the next section. Analysis of the resulting code is based on the software science criteria developed by Halstead (68). The basic quantities which have to be measured are given in Table 5.2. Table 5.3 indicates these quantities for the programs shown in Fib. 5.1. The three measuring units are: a) PROGRAM VOLUME (V): Whenever a given algorithm is translated from one language to another, its size changes. This size change is reflected by the volume that the algorithm occupies in the various languages. Volume V can be defined as: $$V = Nlog_2 n$$ b) PROGRAM LEVEL (L): The level of a program is a measure of the relative 'goodness' of a certain representation of a program as compared to another representation. The 'higher-level' language concept can be measured by this criteria through the equivalent program representation. $$L = (2+n_2^*) Log_2 (2+n_2^*)/V$$ c) INTELLIGENCE CONTENT (I): It seems fundamental to measure how much is said in a program in terms of its information content. This measure is called the intelligence content and is defined as: $$I = L \times V$$ The results are tabulated in Table 5.4. Comparing the two IML format results, we come to the following conclusions: 1) Based on the volume criteria V, the three operand format has a lower volume than the single address format. This means that when VMPL is translated to the three address IML format, we ``` IFTRUE (IR,7) THEN; BEGIN: PCTEMP=PC-1; PCTEMP=CRNTPG(PCTEMP); MAR=PCTEMP.OR.ADR; MDR=MEM[MAR]; END; ELSE; BEGIN; MAR=ADR; END; ENDIF; IFTRUE (IR,8) THEN; BEGIN; MART=MEM [MAR]; IFTRUE (MAR.GT.7) THEN; BEGIN; IFTRUE (MAR.LT.16) THEN; BEGIN; INC MEM[MAR]; END; ENDIF; END; ENDIF; MAR=MART; END; ENDIF; ``` Fig. 5.1(a) A SMALL PART OF A VMPL PROGRAM | | CONDF | .IR.7 | TL.001 | | |-------|-------|---------|---------|--------| | | SUB | PC | C 1 | PCTEMP | | | EXTR | CRNTPG | PCTEMP | PCTEMP | | | OR | PCTEMP | ADR | MAR | | | RMOVE | MEM | MAR | MDR | | | BRCH | E L.002 | | | | L.001 | | | | | | | MOVE | MAR | ADR | | | L.002 | | | | | | | CONDF | .IR.8 | T L.003 | | | | RMOVE | MEM | MAR | MART | | | COMP | MAR | C 7 | | | | CONDT | N | T L.004 | | | | COMP | MAR | C 16 | | | | CONDF | N | T L.004 | | | | RMOVE | MEM | MAR | +T.001 | | | INC | +T.001 | | | | | WMOVE | MEM | MAR | -T.001 | | L.004 | MOVE | MART | MAR | | | L.003 | | | | | Fig. 5.1(b) THREE ADDRESS CODE FOR SOURCE PROGRAM OF Fig. 5.1(a) | | LOAD | Rl,IR | L-002 | LOAD | Rl,IR | |-------|-------|------------|-------|-------|----------------| | | LOAD | R9,*64 | | LOAD | R2,*128 | | | AND | | | AND | | | | BNZ | L-001 | | BNZ | L.003 | | | LOAD | Pl,PC | | LOAD | Rl,MAR | | | LOAD | R9,*1 | | READ | MEM | | | SUB | | | STORE | Rl,MART | | | STORE | RO, PCTEMP | | LOAD | Rl,MAR | | | LOAD | Rl,CRNTPG | | LOAD | R9 <b>,*</b> 7 | | | LOAD | R2,PCTEMP | | SUB | | | | AND | | | BIN | L.004 | | | STORE | RØ,PCTEMP | | LOAD | Rl,MAR | | | LOAD | R1,ADR | | LOAD | R2,*16 | | | LOAD | R2,PCTEMP | | SUB | | | | OR | | | BNN | L.004 | | | STORE | RO, MAR | | LOAD | Rl,MAR | | | LOAD | Rl,MAR | | READ | MEM | | | READ | MEM | | LOAD | Rl,R0 | | | STORE | RØ,MDR | | LOAD | R2,*1 | | | BRH | L.002 | | ADD | | | L.001 | LOAD | Rl,MAR | | LOAD | Rl,MAR | | | STORE | Rl,ADR | | WRITE | MEM | | | | | L.004 | LOAD | Rl,MAR | | | | | | STORE | Rl,MART | | | | | L.003 | | | Fig. 5.1(c) SINGLE ADDRESS CODE FOR SOURCE PROGRAM OF Fig. 5.2(a) # TABLE 5.2 BASIC QUANTITIES TO BE MEASURED FOR DEVELOPING SOFTWARE SCIENCE CRITERIA FOR PROGRAMS $n_1$ = NUMBER OF UNIQUE OR DISTINCT OPERATORS n2 = NUMBER OF UNIQUE OR DISTINCT OPERANDS $N_1$ = TOTAL USAGE OF ALL OF THE OPERATORS $\mathrm{N}_2$ = TOTAL USAGE OF ALL OF THE OPERANDS $n_2^*$ = NUMBER OF UNIQUE OR DISTANCE I/O PARAMETERS $n = n_1 + n_2$ $N = N_1 + N_2$ TABLE 5.3 BASIC QUANTITIES FOR THE PROGRAMS OF Fig. 5.1 | INSTRUCTION-<br>STREAM<br>FORM | n <sub>1</sub> | n <sub>2</sub> | N <sub>1</sub> | N <sub>2</sub> | n <sub>2</sub> * | n | N | |--------------------------------|----------------|----------------|----------------|----------------|------------------|----|----| | VMPL | 14 | 10 | 59 | 26 | 7 | 24 | 85 | | THREE ADDRESS | 11 | 12 | 17 | 31 | 7 | 23 | 48 | | SINGLE ADDRESS | 12 | 13 | 46 | 31 | 7 | 25 | 77 | TABLE 5.4(a) CALCULATED MEASURES FOR PROGRAMS OF Fig. 5.1 | INSTRUCTION-<br>STREAM<br>FORM | V<br>(VOLUME) | L<br>(LEVEL) | I<br>(INTELLIGENCE<br>CONTENT) | |--------------------------------|---------------|--------------|--------------------------------| | VMPL | 389.72 | .0732 | 28.53 | | THREE ADDRESS | 217.13 | .1314 | 28.53 | | SINGLE ADDRESS | 357.5 | .0797 | 28.49 | TABLE 5.4(b) CALCULATED MEASURES FOR VMPL PROGRAM AND ITS EQUIVALENT OF APPENDIX C | INSTRUCTION-<br>STREAM<br>FORM | V<br>(VOLUME) | L<br>(LEVEL) | I<br>(INTELLIGENCE<br>CONTENT) | |--------------------------------|---------------|--------------|--------------------------------| | VMPL | 3039.82 | .0741 | 225.25 | | THREE ADDRESS | 1737.04 | .1331 | 231.20 | | SINGLE ADDRESS | 2806.38 | .0811 | 227.60 | - produce the least number of statements. Thus, on a per statement basis, each three address IML statement is 'nearer' to the original VMPL statement than a single address IML statement. - 2) The level criterion L indicated that the three address IML format is at a higher level than the single address IML format. This further reinforces the claim of the 'nearness' of the three operand IML format to VMPL. By keeping the same power in the instruction set of the three & single address IML's and changing only the addressability of the underlying machine, the level of the language changes, i.e., 'reducing addressability of a language reduces the level of the program written in the language.' - The intelligence content of the two formats indicates that this criteria holds invariant to within 10% under the translation process. This indicates an equivalence in the algorithmic power of the two IML formats. A third observation is based on the level figures of VMPL program segments compared to the corresponding IML figures. The VMPL level is found to be less than the three address IML notation! This is a strange result and can be explained in the following way. Halstead's level calculations are based on the information—theoretic point of view of a program, i.e., essentially its alphabet. VMPL has a lot of redundancy for reliability which inflates the alphabet count. Thus, the level of the representative programs in VMPL is less than the level of the corresponding three address IML format. However, in spite of this redundancy, the VMPL representation has a higher level than the equivalent single address IML format. Thus, we reach an important result that: 'Redundancy in a language reduces the level of the programs written in the language.' This result is also confirmed by Elshoff (69). The general result from all this analysis is that the three operand IML format is 'nearer' to VMPL, than the single address VMPL format. By keeping 'nearer' to VMPL, IML statements reflect as much as possible the semantic content of VMPL statements. Thus, each IML statement, on a per statement basis, has a lot of information about the original VMPL statement for which it was produced. The three address IML statements bind the translated code to a high level register oriented machine. Keeping the IML code high level helps in the later stages of compilation. The compilation model selected in Chapter 4, based on portability considerations, requires a high level IML which can easily be decomposed into a variety of low level microinstruction formats corresponding to various m-computers. Since most m-computers are register oriented, (Chapter 2) having an IML which is register oriented further helps us in achieving the goal of portability and efficiency. By selecting the three address format, the IML achieves the following things: - 1) Provides the most opportunities for subsequent economization of micro-code. - 2) Resembles the code of a general class of register oriented m-machines. - 3) Because of 1 & 2 tight micro-code can be produced for a variety of m-machines. All this can be summarized by the following implication. IMPLICATION 10: The IML format selected for the compilation model is the three address format or quadruples. Furthermore, the IML statements are high level in that they resemble VMPL statements as much as possible. Known characteristics of m-computer architecture and instruction repertoire can be used in designing various facets of IML statements. # 5.3 INTERMEDIATE LANGUAGE A program written in VMPL gets translated into an abstract quadruple like intermediate language. The various statements of the IML are discussed here. In discussing the IML, reference to VMPL statements has been made, since, IML is highly dependent on VMPL (I-10). Basically there are two kinds of statements in IML. One group is associated with the various declaration statements of VMPL and is known as the Intermediate Information Statement Group (IISG). The other group is associated with the actual executable statements of VMPL and is known as the Intermediate Executable Statement Group (IESG). #### 5.3.1 IISG An IISG statement is made up of five objects. The basic format of the statement is: DECLARATIONTAG, IDENTIFIER, DIMENSION, LENGTH, OTHERINFORMATION where DECLARATIONTAG - is a unique number of the form NMA. N and M are numeric and A is an alphanumeric digit. This number uniquely identifies the corresponding VMPL declaration statement to which the IDENTIFIER belongs, i.e., identifies the type of the IDENTIFIER. A list is shown in Table 5.5. IDENTIFIER - is the identifier (as defined in VMPL syntax) which is associated with the VMPL declaration. DIMENSION - is an integer which gives the number of unique elements associated with the identifier. Its absence, a zero (0) or a one (1) all mean that the IDENTIFIER is dimensionless. Essentially, the only time this object is used is with the VMPL, MEMORY and STACK type variables. LENGTH - is the length of the identifier, i.e., the number of bits. This object is used with most of the declarative statements of VMPL. OTHERINFORMATION - is only associated with a few DECLARATIONTAG's. Its meaning and format varies with the tag and all the cases are discussed individually below: > When this object is associated with the tag belonging to the FIELD declaration statement of VMPL, it consists of three integers. These three integers are the three integers associated with the FIELD declaration which identify the bit positions and the shift amount (Fig. 5.2). TABLE 5.5 DECLARATIONTAG VALUES FOR IISG # DECLARATIONTAG # NMA | N | MEANING | |---|--------------------| | 0 | NAME OF THE OTHERS | | 1 | LOCAL | | 2 | GLOBAL | | 4 | SPROC | | | М | MEANING | |---|---|--------------------| | | 0 | NONE OF THE OTHERS | | - | 1 | TEMPORARY | | - | 2 | PERMANENT | | - | | | | | ſ | | |---|---|----------| | | A | MEANING | | | 0 | SIMPLE | | - | 1 | MEMORY | | | 2 | STACK | | - | 3 | PSTACK | | | 4 | FLAG | | | 5 | FIELD | | | 6 | USE | | | 7 | EXPECT | | | 8 | RETURN | | - | 9 | EXTERNAL | | - | | | | A | MEANING | |--------|--------------------------| | A | EMULATOR NAME | | В | PROGRAM START | | С | PROGRAM END | | D | WORDSIZE | | E | ARITHMETIC | | F | PROC NAME | | G | SPROC NAME | | H | BLOCK CODE START | | I | BLOCK CODE END | | J<br>K | BEGIN BLOCK<br>END BLOCK | - 2) The other information object which gets associated with the stack pointer (PSTACK) VMPL declaration, indicates the push-pop sequence associated with the stack. It consists of four distinct symbols which were shown in Chapter 2 Table 2.3. - When this object gets associated with the VMPL, EXTERNAL variables it consists of a single alphabet. The single alphabet indicates whether the external IDENTIFIER is a simple variable, a flag variable or a procedure(Fig. 5.2.). - 4) In the case of the global FIAG declaration statement of VMPL, this object (an integer) indicates which of the predefined four flags does the IDENTIFIER represent. In case the IDENTIFIER is a general flag, the value of this integer is O(Fig. 5.2.). Examples of VMPL declaration statements and the corresponding IISG statements are shown in Fig. 5.3. # 5.3.2 IESG The IESG statements are based on quadruples with an operation and three operands. All three operands are optional in that some statements have none, some one, some two and some all three operands. First, the overall format is discussed and then the individual statements are discussed. #### 5.3.2.1 IESG FORMATS The IESG statements are fixed formats with labels starting in the first column and existing by themselves in a line. A star (\*) in the first column indicates a continuation of the previous statement. If the line with the star is empty, it indicates the end of the continuation. #### OTHER INFORMATION Fig. 5.2 OTHERINFORMATION VALUES FOR IISG | DCL GLOBAL PERMAN | ENT MEMORY | MEM: [4096]:8; | (VMPL) | |--------------------|----------------|----------------|--------| | 221 MEM | ,4096,8 | | (IML) | | DCL GLOBAL TEMPORA | RY REG1:8, | REG2:3; | (VMPL) | | | 1, ,8<br>2, ,3 | | (IML) | | DCL LOCAL PERMANE | NT REG3:8; | | (VMPL) | | 120 REG3 | , ,8 | | (IML) | | DCL FIELD OPCODE | (11,9,8); | | (VMPL) | | 005 OPC | ODE, , ,11,9 | 9,8 | (IML) | Fig. 5.3 VMPL - IML EQUIVALENTS All other statements start in column seven or eight. The various column designation are shown in Table 5.6. The general format of an IESG statement is: OPERATION OPERAND1 OPERAND2 OPERAND3 FLAGS The OPERATION object recognizes the action which the IESG statement has to take using the three operands which follow, i.e., OPERAND1, OPERAND2 and OPERAND3. The FLAGS object indicate the predefined VMPL flags which have to be enabled during this operation. Another aspect of these IESG statements is the presence of modifiers with the OPERATION and OPERAND objects. This is done to retain the information contained in VMPL statements that will be useful in the production of compact, machine dependent microcode. A simple example of this are the compiler generated temporary variables. All such variables are tagged with modifiers (+) or (-). This helps the register allocation/ deallocation phase of compilation know if the temporary is used later on in a sequence of IML statements (+), or not used (-). Since the compiler generates the temporaries, it knows their 'life-time' and can easily generate these modifiers. Another group of tags are associated with the various labels associated with the GO TO, FOR, LEAVE & IF-THEN-ELSE statements. These along with some of the IISG statements recognize each sequential block of IML statements. The later phases of compilation uses these tags for microcode compaction by noting the beginning and end of every straight-line code segment. The heuristic underlying this approach is that there cannot be more parallelism at the microcode level than is available in the HLL (VMPL) program structure. The compiler is able to reveal the inherent concurrency of the high level algorithm by tagging the intermediate code with 'clues'. These 'clues' are subsequently used by the compiler. We believe that maximal code concurrency can be detected and used in producing a minimal number of horizontal microinstructions over straight line code segments. This aspect of code compaction is given in (72). The various modifiers are listed in Table 5.7. TABLE 5.6 IML COLUMN DESIGNATIONS | COLUMNS | VALUES | |----------|--------------------| | 8-14 | OPERATION | | 17-23 | OPERAND 1 | | 20-32 | OPERAND 2 | | 35-41 | OPERAND 3 | | 43-46 | FLAG SETTINGS | | 7 | OPERATION MODIFIER | | 16,25,35 | OPERAND MODIFIER | TABLE 5.7 OPERAND MODIFIERS | SYMBOL | MEANING | |--------|--------------------------------------| | • | BIT OPERAND | | 1 | CONCATENATED OPERAND | | + | TEMPORARY NEEDED | | - | TEMPORARY NOT NEEDED | | С | CONSTANT | | P | PARAMETER IDENTIFIER | | Т | LABEL FOR FIRST IF-THEN-ELSE BRANCH | | E | LABEL FOR SECOND IF-THEN-ELSE BRANCH | | G | LABEL FOR A GOTO | | F | LABEL FOR A FOR | | L | LABEL FOR A LEAVE | | S | LABEL FOR A SELECT | | А | ARGUMENT IDENTIFIER | #### 5.3.2.2 IESG STATEMENTS There are seven classes of IESG statements. Each class is treated separately. # CLASS 1 IESG STATEMENTS OPERATION The general format of statements belonging to Class l is: SRC1 SRCI The OPERATION is either an arithmetic or a logical operation (one to one correspondence with VMPL arithmetic and logical operators Table 2.4) which require two sources, SRC1 & SRC2 and produce a result which is stored in DEST. The NOT logical operation only requires one source. As such, it does not have the SRC2 object. The various OPERATION's are listed in Table 5.8. SRC2 DEST DEST # CLASS 2 IESG STATEMENTS OPERATION The shift and rotate instructions belong to Class 2 corresponding to the shift and rotate instructions of VMPL Table 2.5. The format is: COUNT, VALUE In case the operation is one of the shift instructions, SRCl is shifted by COUNT places filling the empty places by VALUE and storing the result in DEST. If the operation is a rotate instruction, the VALUE part from the above format is missing Table 5.9. # CLASS 3 IESG STATEMENTS Class 3 operations are for reading and writing into the MEMORY class variable of VMPL. The two operations are associated with reading from and writing into memory Table 5.10. The format is: OPERATION MEMVAR **INDEX** VALUE The object INDEX is used as an index into the memory array MEMVAR to point to a location. VALUE either receives the value or sends the value to this location. TABLE 5.8 CLASS 1 IESG OPERATIONS | OPERATION | MEANING | | | | |-----------|-------------|--|--|--| | ADD | ADDITION | | | | | SUB | SUBTRACTION | | | | | MPY | MULTIPLY | | | | | DVD | DIVIDE | | | | | AND | LOGICAL AND | | | | | OR | LOGICAL OR | | | | | XOR | LOGICAL XOR | | | | TABLE 5.9 CLASS 2 IESG OPERATIONS | OPERATION | MEANING | |-----------|--------------| | SHIL | SHIFT LEFT | | SHTR | SHIFT RIGHT | | ROTL | ROTATE LEFT | | ROTR | ROTATE RIGHT | # CLASS 4 IESG STATEMENTS Class 4 deals with the various branch operations. It can have one or two operands with the second (if two) or first (if one) always being a label. A list of operations is given in Table 5.11. # CLASS 5 IESG STATEMENTS The seven statements which belong to **Cl**ass 5 are there for the translation of some of the miscellaneous statements of VMPL (INC, DEC, POP, etc.). In all cases there is a one-to-one relation with the corresponding VMPL statement Table 5.12. They are one, two or implicit operand statements. #### CLASS 6 IESG STATEMENTS Class 6 contains two statements which are translated from the FOR and SELECT, VMPL statements. Their format is shown in Fig. 5.4. Note that for the SLCT statement we use the continuation aspect of IML statement format. # CLASS 7 IESG STATEMENTS The three statements in Class 7 have again a one-to-one relation with corresponding VMPL statements and deal with the control aspect of program execution. They are shown in Fig. 5.5. This finishes the design of the IML constructs. #### 5.4 CONCLUSIONS Comparing the designed IML constructs and the various constraints and requirements on the IML from in Chapter 5 and the earlier sections of this chapter (IlO), we come to the following conclusions: - The IML constructs are high level in that most of them have a one-to-one correspondence with VMPL statements. - 2) The IML constructs are register oriented, i.e., they can be thought of as the machine language of an abstract register oriented machine. TABLE 5.10 CLASS 3 IESG OPERATIONS | OPERATION | MEANING | | | | |-----------|--------------|--|--|--| | RMOVE | READ MEMORY | | | | | WMOVE | WRITE MEMORY | | | | TABLE 5.11 CLASS 4 IESG OPERATIONS | OPERATION | MEANING | |-----------|-----------------| | COMP | COMPARE | | BRCH | BRANCH | | CONDF | CONDITION FALSE | | CONDT | CONDITION TRUE | TABLE 5.12 CLASS 5 IESG OPERATIONS | OPERATION | MEANING | | | | | |-----------|----------------------|--|--|--|--| | INC | INCREMENT (ADD 1) | | | | | | DEC | DECREMENT (SUB 1) | | | | | | CLR | SET TO ALL ZEROES | | | | | | SET | SET TO ALL ONES | | | | | | MOVE | MOVE SRC1 TO SRC2 | | | | | | PUSH | PUSH SRC1 INTO STACK | | | | | | POP | POP STACK INTO DEST | | | | | Fig. 5.4 CLASS 6 IESG STATEMENTS OPERATION MEANING HALT HALT RET RETURN EXECUTE SRC1 (PAR1, PAR2) ; (VMPL) XEQ SRC1 PARl PAR2 (IML) Fig. 5.5 CLASS 7 IESG OPERATIONS 3) Furthermore, the IML code contains tags or 'clues' which recognize its various properties. These tags are used for the production of compact microcode. From 1, 2 and 3 we see that the designed IML meets the requirements which were placed on it due to I9 and IlO. #### 6. RESULTS #### 6.1 SIMULATOR In Chapter 1 the objectives of this research were identified in terms of four problems. Problems 1, 2 and 3 were solved in the earlier chapters. According to the statement of problem 4, a testing methodology for microprogram correctness should be designed. We achieve this aspect of microprogram development via the provision of a simulator for the intermediate language, and testing as diagramed in Figure 6.1. A program (Pl) (emulator) written in VMPL is translated into the intermediate language (IML1). Say the program Pl emulates a machine Ml (target). Next we give the IML1 code to the simulator. We also provide the simulator with code which correspond to machine code programs for the machine Ml. These programs (DPl) essentially correspond to a set of diagnostics written for the target machine and get stored in the MEMORY variable of the emulator. The simulator then executes (simulates) the IML1 code. This corresponds to executing the diagnostic programs DPl on the machine Ml. If the 'diagnostics' produce correct results, we suspect the correctness of the emulator (Pl or IML1). Now if this same IML1 code is translated into microcode of a host machine (H1), we suspect the correctness of this microcode. Since we would like the simulator to run in an interactive fashion, simulator commands are added (embedded) to the original VMPL program, i.e. Pl. This requires the addition of a simulator command statement to the VMPL and IML instruction repertoire. The format for a simulator command statement in VMPL is: #### ? COMMAND PARAMETERS where the COMMAND object is a three letter mnemonic identifying a unique simulator command and the PARAMETERS object is a set of parameters which the COMMAND object requires. The simulator along with the commands, etc. is described in Appendix B. The IML representation of a command follows the general IML format, with the command mnemonic as the op-code and the parameters acting as the operands. Thus, by the provision of a simulator for the IML, we have achieved Fig. 6.1 MICROCODE CORRECTNESS MODEL a solution to problem 4. This solution is an indirect approach in that we do not prove the correctness of the microcode directly. We try to prove the correctness of the code which is one level above the microcode; namely, the IML code which will ultimately be converted into microcode. Proving correctness at this level is found to be a simple task via simulation. To make the final assertion that the microcode is correct, we have to prove two things. The correctness of the simulator and the compiler which converts the IML code into microcode. This is a one time task and can be done by the compiler implementor. # 6.2 VMPL COMPILER A compiler for VMPL has been written using the OSU META system (71) (APPENDIX B) available on the OSU CYBER 73. The META system is based on Schorre's META II (70) and consists of three main components: - 1) The META language - 2) The META compiler - 3) A package of support routines called METASUBS. The META language is designed specifically for compiler-writing. Basically, the notation is BNF with slight modifications. To write a compiler, the user expresses the syntax of the language and the code to be produced in a series of META statements called rules. Each rule consists of items called 'recognizers' to be searched for in the input source language. The recognizers are the names of other rules, specific characters to be looked for, or basic recognizers suppled by META, i.e., .ID recognizes an identifier; .STRING, a string of characters enclosed by apostrophies; 'BEGIN', the word BEGIN, etc. In addition to items to be recognized, each rule may also contain actions to be taken if the items are found. These actions may look up an item in the symbol table, put a string of characters on a stack maintained by META, or put something on the output buffer. Once a rule or set of rules has recognized a statement, these actions output the low level language equivalent for that statement. Once the set of rules comprising the user's compiler is written in META language, the META compiler produces a sequence of calls to the METASUBS package. The sequence of calls which comprise the user's compiler can then be loaded into the machine (CYBER 73) along with METASUBS to process the user's language. METASUBS scans the source code in a top down, recursive-descent manner according to the user's rules and produces code for the associated actions. In addition, the package handles input and output, syntax error messages, a symbol table facility, listing control, etc. The symbol table facility of META was found to be too primitive for successful compilation of VMPL programs. Thus, a symbol-table program (STP) was written using PASCAL (56). A program written in VMPL is translated into IML statements by a META-VMPL compiler. This output is then fed into STP which does all the symbol table cross checking of the IML code or indirectly the VMPL code. In doing this checking STP generates a series of symbol tables which can later on be used by the simulator described in the previous section. According to the compilation model selected in Chapter 4, there is still another phase of compilation. This phase deals with the conversion of the IML code into microcode. This part of the compiler was written by P.Y. Ma and is discussed in (72). In the next section, we discuss some of the results obtained on the compilation of various VMPL programs into IML programs. #### 6.3 VMPL-IML ANALYSIS Three emulators were written using VMPL as source language. The three emulators are: - 1) VMl PDP-8 minicomputer - 2) VM2 INTEL 8080 microprocessor - 3) VM3 An abstract register oriented machine. These three emulator programs were translated into IML code. Two groups of experiments were then conducted on this resulting code. #### 6.3.1 COUNTING EXPERIMENTS (CE) Several different counting experiments were performed. These experiments essentially consist of counting various objects of the IML code. #### CE(1) The first counting experiment consisted of counting the number of occurrences of the various variables used in the program. There are three types of variables: - 1) Global variables - 2) Local variables - 3) Temporary variables Temporary variables are the variables generated by the compiler. The results are shown in Table 6.1. We find that the percentage of usage of local plus temporary variables is about the same as the percentage of usage of global variables. This locality of usage reflects on the block structured nature of emulator programs and can be used as an indicator of the <u>locality of</u> variable usage at the block level, i.e.: 'Local variables on a percentage basis are used as much (if not more) as global variables' # CE(2) The next experiment we did was to measure the distance between successive references to a particular variable. IML statements were numbered in a sequential manner, say 1 to N. Then for each variable of all the three emulators we did the following: We counted the number of times the variable was used twice in the same IML statement (distance between usage of variable is 0), the number of times the variable was used in two consecutive IML statements (distance between usage of variable is 1), the number of times the variable was used in two IML statements, separated by one IML statement which did not use it (distance between usage of variable is 2) etc. This count was then averaged for all the variables of all three emulators to come up with a common characteristic of variable usage. This characteristic is plotted as a curve in Fig. 6.2. TABLE 6.1 RESULTS OF COUNTING EXPERIMENT CE(1) | EMULATOR | GLOBAL<br>VARIABLES | | LOCAL<br>VARIABLES | | TEMPORARY<br>VARIABLES | | LOCAL plus<br>TEMPORARY<br>VARIABLES | | |----------|---------------------|-------|--------------------|-------|------------------------|-------|--------------------------------------|-------| | | TOTAL | 8 | TOTAL | 00 | TOTAL | % | TOTAL | બ | | VM1 | 121 | 52.4% | 54 | 23.4% | 56 | 24.2% | 110 | 47.6% | | VM2 | 183 | 49.6% | 97 | 26,3% | 89 | 24.1% | 186 | 50.4% | | VM3 | 159 | 55.4% | 68 | 23.7% | 60 | 20.9% | 128 | 44.6% | Fig. 6.2 RESULTS OF COUNTING EXPERIMENT CE(2) Taking a point on this curve (for example) we see that on the average there is only 1 variable which is used every 18 IML statements. From the shape of this curve we see that the average distance between the usage of variables is between three and five. This curve does not include the figures for the temporary variables. The distance of usage for temporary variables was consistent and equal to 1, i.e., a temporary is always used immediately in the IML statement following the IML statement which generated it (a property of the way in which the compiler generates temporary variables). Thus, in general, we can conclude that: 'The average distance of usage for all variables is small' This statement can then be interpreted as a <u>locality of variable</u> usage at the statement level. # CE(3) Next, we did some counting experiments similar to the ones done by Elshoff (50) and Gannon (52) on the IML programs. It may be recalled that their data was used in the first place to design VMPL. Our results, Table 6.2, are more or less similar to their results. Some differences can be explained as follows: - 1) The large number of branch statements is explained by the block nature of VMPL programs, the structured VMPL constructs and the overall structure of an emulator program. If we count CONDF, CONDT and BRCH as the only kind of branch statements, our figures match the Elshoff, Gannon data. This condition corresponds to a SELECT-less, non blocked program. - We get a low value for 'memory reference instructions.' This is primarily because the emulator (programs) are of register oriented machines. We suspect they will be higher for memory oriented target machines. Thus, we see that programs written in VMPL produce the same kind of results for instruction usage as programs written in Fortran and PL/I. In general, we can come to the following conclusion from this experiment: 'The usage of the Elshoff-Gannon data for the design of VMPL is valid' TABLE 6.2 RESULTS OF COUNTING EXPERIMENT CE (3) | INSTRUCTION<br>TYPE | VMI | VM2 | VM3 | |----------------------------------------------|--------|-------|-------| | 3 OPERAND<br>(ARITHMETIC 8<br>LOGICAL OPER.) | 29.41% | 31.2% | 30.1% | | 2 OPERAND (MOVE etc.) | 21% | 20.8% | 19.8% | | 1 OPERAND<br>(VMPL MISC) | 6% | 8.7% | 7.8% | | BRANCH | 37.6% | 35% | 39.9% | | MEMORY<br>REFERENCE | 5.99% | 4.3% | 2.4% | ### 6.3.2 VARIABLE ASSIGNMENT EXPERIMENT In Chapter 3 we made the assertion that by having global-local variable declaration (variable priority) and a block structure for VMPL programs, we will produce tighter microcode. This was to happen because of a smaller number of load and store operation. The variable assignment experiment was conducted at the IML level, we believe that fewer IML statements produce fewer microinstruction statements (microcode). There are two sets of variables in the experiment, the number of available 'host' registers and the register allocation scheme which is adopted. Four register allocation schemes were selected: - All program variables are treated as global variables. In case a register is needed for allocation, the first available register (registers are numbered in an arbitrary sequential form) is deallocated. (Type a) - 2) The same as above except a count on the frequency of usage of a variable is kept. In case of deallocation, the variable with the smallest frequency count is deallocated. (Type b) - 3) The VMPL designed priority structure is adopted. If two variables having the same priority have to be deallocated, the first one (based on an arbitrary number sequence) is deallocated. (Type c) - 4) Same as above except a count on the frequency of usage of a variable is kept. In case of deallocation of two variables having the same priority, the variable with the smallest frequency count is deallocated. (Type d) TYPE a allocation scheme is one adopted by most present day compilers. TYPE b is a step forward in variable allocation where some information about variable usage is used. TYPE c is the priority scheme outlined by us in the design of VMPL. TYPE d is a further enhancement of this technique. The results for the three emulators are plotted in Fig. 6.3. The dependent variable is the number of load/store instructions required Fig. 6.3(a) RESULTS OF VARIABLE ASSIGNMENT EXPERIMENT ON VM3 Fig. 6.3(b) RESULTS OF VARIABLE ASSIGNMENT EXPERIMENT ON VM2 Fig. 6.3(c) RESULTS OF VARIABLE ASSIGNMENT EXPERIMENT ON VMl for each emulator. For all three cases we can rate the register allocation schemes as follows: TYPE d, TYPE c, TYPE b, TYPE a with TYPE d producing the least number of load/store instructions and TYPE a producing the greatest number of load/store instructions. The difference between TYPE d and TYPE c is not large but both are significantly better than TYPE b or TYPE a. Thus, we have proved (for the cases under consideration) that: 'Using a VMPL variable priority scheme produces fewer load/store instructions than other schemes' From the Counting Experiments and the Variable Assignment Experiments done on the three emulator programs, we come to the general conclusion that VMPL has successfully met its design goals of producing tight microcode. ### 6.4 VMPL EXTENSIONS While designing VMPL, we also pointed out the modular top down design approach we had followed. To test this aspect of VMPL, we do the following three experiments on VMPL definition and syntax. These experiments are done by changing PREMISE 5 (Chapter 3) to include target machine which have a) both word and byte operations, b) multiple stacks, and c) multiple memories. ## BYTE OPERATIONS The effects of including byte operations as a basic design goal on VMPL and IML would be: Similar to the global size declaration, we have a global byte declaration which gives the size of a byte in terms of the number of bits. The number of bytes in a word would then be: number of bytes = $$\frac{\text{size of word}}{\text{size of byte}}$$ -1 The least significant byte would be byte zero. 2) In order to recognize a byte of a certain variable (word), a digit would have to be appended to the variable identifier in VMPL statements. The exact format may be similar to a FORTRAN index, a PASCAL structured element, etc., i.e., ACC (0) ACC.0 referring to the first byte of the variable ACC. The second format would be preferred because we already use square brackets to refer to memory variables and a set of two brackets may be less reliable. 3) The modification of variables as pointed out in statement b above would have to be migrated down the the IML level. Thus, the integer identifying the byte could be used as one of the operand modifiers in the IML statement. #### MULTIPLE STACKS The effect of including multiple stacks on VMPL and IML syntax would be: - 1) The STACK and PSTACK global declaration statements would have to be modified. This would have to be done to recognize which stack pointer is associated with which stack. - 2) The PUSH and POP statements, both in VMPL and IML, would similarly have to be modified to recognize the stack associated with the operation. #### MULTIPLE MEMORIES To include this aspect of target machines, the only change required of VMPL is the modification of the global MEMORY declaration. More than one memory variable would be allowed in the declaration. The RMOVE & WMOVE IML statements already include the name of the memory variable as part of their syntax. From these three experiments, we infer that making extensions to VMPL, via the redefinition of the original premises on which the language was designed, does not effect the parts of VMPL and IML already designed. In fact, the designed parts make it easy to modify and extend VMPL and consequently IML. Thus, because the additions were done easily, we consider this another aspect of a successful language design experiment. #### 6.5 CONCLUSIONS AND FUTURE WORK The original objectives in developing a high level machine independent microprogramming language have been met as described in the body and appendices of this thesis. All through this language design experiment we have followed a top down structured approach using all the data available on language syntax and statements. The language must be used to produce a number of large-scale emulators before the practical benefits of this design approach can be used as feedback to improve upon some of the language implications and maybe even modify some of the constraints. Questions which remain to be considered in further research in this area include: - 1) How does one produce even more efficient and reliable microcode from a high level microprogramming language? - 2) How can high level languages be used for m-machines with two or more levels of microinstruction interpretation? Should the nano programs directly interpret the IML statements or not? - 3) The study of universal emulators or hosts which are suitable for the emulation of a variety of targets if needed. - 4) Finally, a more detailed study of I/O is needed for making the whole emulation process successful and accurate. #### REFERENCES - 1) M. V. Wilkes, 'The Best Way to Design an Automatic Calculating Machine', Report of the Manchester University Computer Inaugral Conference, Manchester, England, July 1951. - M. V. Wilkes, 'The Growth of Interest in Microprogramming: A Literature Survey'. Computing Surveys, September 1969. - 3) S. S. Husson, Microprogramming Principles and Practices, Prentice Hall, Englewood Cliffs, New Jersey, 1970. - 4) S. G. Tucker, 'Microprogram Control for System 1360', IBM Systems Journal, October 1967. - A. K. Agrawala, T. G. Rausher, <u>Foundations of Microprogramming:</u> <u>Architecture</u>, <u>Software and Applications</u>, Academic Press, New York 1976. - 6) J. E. Nicholls, <u>The Structure</u> and Design of Programming Languages, Addison-Wesley, Menlo Park, California, 1975. - 7) P. W. Mallett, T. G. Lewis, 'Considerations for Implementing a High Level Microprogramming Language Translation System', Computer Magazine, August 1975. - 8) Y. Chu, Computer Organization and Microprogramming, Prentice Hall, Englewood Cliffs, New Jersey 1972. - 9) C.V. Ramamoorthy, T. Masahiro, 'A High Level Language for Horizontal Microprogramming', IEEE Transactions on Computers, August 1974. - 10) G. R. Lloyd, 'PUMPKIN (Another) Microprogramming Language', SIGMICRO Newsletter, April 1974. - 11) G. R. Lloyd, A. Van Dam, 'Design Considerations for Microprogramming Languages', SIGMICRO Newsletter, April 1974. - 12) R. Eckhouse, 'A High Level Microprogramming Language', Sprin Joint Computer Conference, AFIPS Press, Montvale, New Jersey, 1971. - R. Eckhouse, 'A High Level Microprogramming Language (MPL)', Ph.D. Thesis, State University of New York at Buffalo, June 1971. - 14) D. J. DeWitt, 'A Machine Independent Approach to the Production of Horizontal Microcode', Ph.D. Thesis, University of Michigan, June 1976. - 15) J. E. Cheatem, et. all., 'On the Basis of ELF--an Extensive Language Facility', Spring Joint Computer Conference, AFIPS Press, Montvale, New Jersey, 1969. - R. F. Rosen, 'MPP--A Tool for Teaching and Research in Micro-programming', Technical Report PN-3R, SUNY at Buffalo, 1970. - 17) L. C. Richardson, 'PRIM--Overview', ISI/RR-76-19, University of Southern California, February 1974. - 18) MICRODATA 3200 COMPUTER, MICRO 32/S COMPUTER REFERENCE MANUAL, Microdata Corporation, May 1974. - 19) MODEL 8/32 MICRO-PROGRAM DESCRIPTION, no. 05-058Al5, Interdata Incorporated, December 1974. - 20) BURROUGHS B1700 SYSTEMS REFERENCE MANUAL, Burroughs Corporation 1972. - 21) QM-1 HARDWARE LEVEL USER'S MANUAL, Nano-data Corporation, March 1974. - 22) CASH-8 REFERENCE MANUAL, Standard Logic Incorporated, June, 1973. - 23) 21MX COMPUTER SERIES REFERENCE MANUAL, Manual Part No. 02108-90002, Interdata Incorporation. - H. W. Lawson, B. Magnhagen, "Advantages of Structured Hardware", Second Annual Symposium on Computer Architecture, IEE, January 1975, (DATASAAB FCPU). - D. R. Oestreicher, J. Goldberg, "MLP-900 Reference Manual", Information Sciences Institute, University of Southern California, March 1974. - 28) CONTROL DATA 5000 SERIES OF MICROPROGRAMMABLE PROCESSORS REFERENCE MANUAL, Publication No. 14232000, Control Data Corporation, August 1972. - "Data General Corporation Introduces New Eclipse Line of Small Computers", SIGMICRO Newsletter, October 1974. - 30) CAL DATA 1 COMPUTER FAMILY, California Data Products, June 1974. - 31) PRIME 300 COMPUTER, Prime Computer Incorporated. - 32) VARIAN 73 SYSTEM HANDBOOK, Varian Data Machines, June 1972. - R. G. Barr, et. all,. "A Research-Oriented Dynamic Microprocessor", IEEE Transactions on Computers, November 1973. - D-MACHINE USERS MANUAL, Burroughs Corporation, April 1971, (BURROUGHS INTERPRETER). - 35) B.D. Shriver, P. Kornerup, "An Overview of the MATHILDA System", Department of Computer Science, University of Aarhus, Aarhus, Denmark, 1975. - 36) WRITABLE CONTROL STORE FOR PDP-11/40, 3 Rivers Corporation, June 1977. - 37) SCHOTTKY BIPOLAR LSI MICROCOMPUTER SET: 3001 MICROPROGRAM CONTROL UNIT AND 3002 CENTRAL PROCESSING ELEMENT, Intel Corporation, 1975. - A. B. Salisbury, Microprogrammable Computer Architectures, Elsevier Computer Science Library, New York, 1976. - 39) R. F. Rosin, 'Contemporary Concepts of Microprogramming and Emulation', Computing Surveys, December 1969. - 40) J. T. Golden, FORTRAN IV Programming and Computing, Prentice Hall, Englewood Cliffs, New Jersey, 1965. - 41) A. Lysegard, Introduction to COBOL, Studentlitteratur, Lund, Sweden, 1908. - 42) M. E. Conway, 'Proposal for an UNCOL', Communications of the ACM, October 1958. - F. Bates and M. L. Douglas, Programming Language/One, Prentice Hall, Englewood Cliffs, New Jersey, 1970. - 44) E. G. Mallach, 'Emulation: A survey', Honewell Computer Journal, Volume 6 Number 4, 1972. - 45) Intel 8080 Reference Manual, Intel Corporation, Santa Clara, California, 1976. - 46) C. G. BEll and A. Newall, Computer Structures, Readings and Examples, McGraw Hill, New York, 1971. - 47) CDC 6600 REFERENCE MANUAL, Control Data Corporation, June 1976. - PDP 11/40 REFERENCE MANUAL, Digital Equipment Corporation, Maynard, Massachusetts, 1972. - 49) A. Lunde, 'Empirical Evaluation of Some Features of Instruction Set Processor Architecture', Communications of the ACM, March 1977. - J. L. Elshoff, 'An Analysis of Some Commercial PL/I Programs;, IEEE Transactions on Software Engineering, June 1976. - D. E. Knuth, 'An Empirical Study of FORTRAN Programs', Software Practice and Experience, 1971. - 52) J. D. Gannon, and J. J. Horning, 'The Impact of Language Design on the Production of Reliable Software', Proceedings International Conference on Reliable Software, ACM SIGPLAN Notices, June 1975. - R. Kosaraju, 'An Analysis of Structured Programs', Journal of Computing and Systems Science, December 1974. - 54) H. F. Ledgard, and M. Marcotty, 'A Genealogy of Control Structures', Communications of the ACM, November 1975. - 75) R. M. Lewis, D. J. Rosenkrantz, R. E. Stearns, Compiler Design Theory, Addison-Wesley, Menlo Park, California, 1976. - 56) K. Jensen, N. Wirth, PASCAL Users Manual and Report, Springer-Verlag, New York, 1974. - 57) W. M. Waite, R. J. Orgas, 'A Base for a Mobile Programming System', Communications of the ACM, September 1969. - 58) M. H. Halstead, 'Using the Computer for Computer Conversion', Datamation, May 1970. - 59) P. J. Brown, 'Levels of Languages for Portable Software', Communications of the ACM, December 1972. - 60) F. L. Alt, 'The Standardization of Programming Languages', Proceedings of the ACM 19th National Conference, 1964. - 61) R. C. Smeder, 'An Investigation of the Bootstrapping Process as Applied to Compiler Generation', A.U.S. Government Research Report, AD-727673, 1971. - M. C. Newey, P. D. Poole and W. M. Waite, 'Abstract Machine Modelling to Produce Portable Software', Software-Practice and Experience, Vol. 2, April 1972. - P. C. Poole, 'Hierarchical Abstract Machines;, Proceedings of the Software Engineering Conference, Culham, England 1971. - P. C. Poole, W. M. Waite, 'Machine Independent Software', Proceedings of the ACM, 2nd Symposium on Operating System Principles, 1969. - A. D. Fisher, 'A Common Programming Language for the DOD-Technical Requirements;, DOD Report No. AD-A028 297/Owc. - 66) H. W. Lawson, 'Programming-Language-Oriented Instruction Streams', IEEE Transactions on Computers, May 1963. - 67) R. E. Merwin et. all., 'Direct-Micorprogrammed Execution of the Intermediate Text From a High-Level Language Compiler'. - 68) M. H. Halstead, Elements of Software Science, Elsevier North-Holland, 1977. - 69) J. L. Elshoff, 'An Investigation into the Effect of the Counting Methods used on Software Science Measurements', SIGPLAN Notices, February 1978. - 70) Schorre, 'META-II. A Syntax Oriented Compiler Writing System', Proceedings ACM 19th National Conference 1964. - 71) G. A. Bachelor, 'META/CYBER 73 Reference Manual, Department of Computer Science, Oregon State University, 1975. - 72) P. Y. Ma, "Optimizing Microcode Produced from a High Level Language", Ph.D. Thesis, Oregon State University, August 1978. - 73) K. Malik, IML Simulator Reference Manual, Oregon State University, 1979. #### APPENDIX A #### VMPL SYNTAX This appendix defines the syntax of Virtual Microprogramming Language (VMPL) in a slightly modified BNF form. The differences from the standard BNF are: In order to save space and repetition, the following statement is used $$\langle A \rangle$$ , $\langle B \rangle$ , $\langle C \rangle$ ::= $\langle E \rangle$ which means $$\langle A \rangle ::= \langle E \rangle$$ $$\langle B \rangle ::= \langle E \rangle$$ $$\langle C \rangle ::= \langle E \rangle$$ 2) In case one of the meta symbols has to be used as part of the syntax of VMPL, it is enclosed in apostrophes, i.e. '< ' meaning that the angle bracket ( < ) is used as a VMPL symbol and not a meta symbol. #### B.N.F. ``` PROGRAM: = (PROGRAM HEADING) { (GLOBAL DECL) } (BLOCKS) PROGRAM ENDING ⟨PROGRAM HEADING⟩ ::=EMULATOR: ⟨ID⟩ ; ⟨PROGRAM ENDING⟩ ::=ENDEMULATOR; ⟨GLOBAL DECL〉::=DCL ⟨GLOBAL INFO DECL〉;/DCL ⟨GLOBAL VAR DECL〉; (GLOBAL INFO DECL) ::= (WORDSIZE DECL) / (ARITHMETIC DECL) / (FIELD DCL) \WORDSIZE DECL\> ::=WORDSIZE \( \)INTEGER\> (ARITHMETIC DECL) ::= ARITHMETIC (ARITHMETIC TYPE) (FIELD DCL) ::=FIELD (FIELD) (,FIELD) \[ \( \forall \text{TIELD} \) ::= \( \forall \text{D} \) ( \( \forall \text{BP} \) , \ \langle \texttt{FBP} \rangle , \langle \texttt{LBP} \rangle ::= \langle \texttt{INTEGER} \rangle ⟨SHIFT COUNT⟩ ::= ⟨INTEGER⟩ / ⟨NEG INTEGER⟩ (GLOBAL VAR DECL) ::=PERMANENT (GLOBAL TYPE DECL) /TEMPORARY (GLOBAL TYPE DECL) ⟨GLOBAL TYPE DECL⟩ ::= ⟨MEMORY DECL⟩ / ⟨STACK DECL⟩ / ⟨PSTACK DECL⟩ / (EXTERNAL DECL) / (FLAG DECL) / (SIMPLE DECL) 〈MEMORY DECL〉::=MEMORY 〈MEMORY〉 〈SIZE DECL〉 〈MEMORY〉 ::= ⟨ID⟩ : [ ⟨INTEGER⟩ ] (STACK DECL) ::=STACK (STACK) (SIZE DECL) ``` ``` (STACK) ::= <ID> :[ <INTEGER> ]/ <ID> 〈PSTACK DECL〉::=PSTACK \( PSTACK \) \( SIZE DECL \) \( STACK OPS \) ⟨PSTACK⟩ ::= ⟨ID⟩ ⟨EXTERNAL DECL⟩ ::=EXTERNAL ⟨EXTERNAL⟩ ┤ ,⟨EXTERNAL⟩∫ 〈EXTERNAL〉::= 〈ID〉 〈EXT TYPE〉 ⟨FLAG DECL⟩::=FLAG ⟨FLAG⟩ {,⟨FLAG⟩ } ⟨FLAG⟩ ::= ⟨ID⟩ ⟨FLG TYPE⟩ ⟨SIMPLE DECL⟩ ::=⟨ID⟩ ⟨SIZE DECL⟩ {, ⟨ID SIZE DECL⟩ } ⟨SIZE DECL⟩ ::=: ⟨INTEGER⟩ / ⟨EMPTY⟩ \langle STACK OPS \rangle ::= \uparrow \langle SOP1 \rangle / \downarrow \langle SOP1 \rangle / + \langle SOP2 \rangle / - \langle SOP2 \rangle (SOP1) ::= +/- ⟨SOP2⟩ ::= 1/↓ (EXT TYPE) ::= :P/:F/L (INTEGER) / (EMPTY) ⟨FLG TYPE⟩ ::=:C/:0/:N/:Z/ ⟨EMPTY⟩ \langle BLOCKS \rangle ::= \langle BLOCK \rangle \{\langle BLOCK \rangle \} ⟨BLOCK⟩ ::=PROC: ⟨ID⟩ : ⟨PROC BLOCK⟩ SPROC: ⟨ID⟩ ⟨SPROC ARG⟩; ⟨SPROC BLOCK⟩ \langle PROC BLOCK \rangle ::= \{\langle PROC DECL \rangle\}; \# \langle CODE \rangle \# ⟨SPROC BLOCK⟩ ::= {⟨SPROC DECL⟩ ;} # ⟨CODE⟩ # ⟨SPROC ARG⟩ ::= ⟨EMPTY⟩ /( ⟨ARG⟩ { , ⟨ARG⟩ } ) 〈ARG〉::= 〈ID〉 〈SIZE DECL〉 ⟨PROC DECL⟩ ::=DCL ⟨GLB USE DECL⟩ / DCL ⟨LOCAL DECL⟩ / DCL⟨SPR USE DECL⟩ ⟨GLB USE DECL⟩ ::=GLOBAL USE ⟨ID⟩ { , ⟨ID⟩ } LOCAL DECL> ::=PERMANENT \SIMPLE DECL> / TEMPORARY (SIMPLE DECL) ⟨SPR USE DECL⟩ ::=SPROC USE ⟨ID⟩ ⟨ , ⟨ID⟩ ⟩ (SPROC DECL) ::=DCL GLOBAL (ER DECL) / DCL LOCAL (ER DECL) / DCL (SPR USE DECL)/ (GLB USE DECL) \( \text{ER DECL} \) ;; = EXPECT \( \text{ID} \) \( \text{RETURN \( \text{ID} \) } \\ \( \text{CODE} \) ::= \( \text{STATEMENT} \) ; \tex <STATEMENT> ::= (UNLABELLED STMT> / (LABEL) : (UNLABELLED STMT>) UNLABELLED STMT⟩ ::=⟨SIMPLE STMT⟩ / ⟨STRUCTURED STMT⟩ (SIMPLE STMT) ::= (ASSIGNMENT STMT) (SET FLAGS) / (SET STMT) <PUSH STMT> / <RETURN STMT> / COMMENT STMT ⟨ASSIGNMENT STMT⟩ ::= ⟨CONCAT ASSIGN⟩ / ⟨SIMPLE ASSIGN⟩ ⟨CONCAT ASSIGN⟩ ::= ⟨CONCAT VAR⟩ ⟨ASSIGNMENT OPERATOR⟩ (CONCAT EXPR) ⟨CONCAT EXPR⟩ ::= ⟨CONCAT FACTOR⟩ / ⟨CONCAT FACTOR⟩ ⟨PM OPERATOR⟩ (INTEGER) (CONCAT FACTOR) ::= (VAR) / (CONCAT VAR) (CONCAT VAR) ::= (ID) // (ID) ⟨SIMPLE ASSIGN〉 ::= ⟨VAR⟩ ⟨ASSIGNMENT OPERATOR⟩ ⟨RIGHT HAND SIDE⟩/ (SIMPLE CONCAT ASSIGN) ⟨VAR⟩ ::= ⟨ID⟩ / ⟨ARRAY VAR⟩ ⟨ARRAY VAR⟩ ::= ⟨ARRAY ID⟩ ⟨ARRAY SUBSCRIPT⟩ \langle ARRAY ID \rangle ::= @ \langle ID \rangle / \langle ID \rangle <array subscript> ::= <id> / <integer> / <pop> <array subscript> ::= <id> / <integer> / <pop> <array subscript> ::= <id> / <integer> / <id> / <TERM> ::= (VAR) / (INTEGER) / (UNARY) / (SHIFT VAR) / (FIELD VAR) / (POP) / (ROTVAR) ``` ``` (UNARY) ::= .NOT. (VAR) ⟨SHIFT VAR⟩ ::= ⟨VAR⟩ ⟨SHIFT OPERATOR⟩ ⟨INTEGER⟩ ⟨FIELD VAR⟩ ::= ⟨ID⟩ ( ⟨VAR⟩ ) ⟨ROT VAR⟩ ::= ⟨VAR⟩ ⟨ROT OPERATOR⟩ ⟨INTEGER⟩ <SET STMT> ::= SET <VAR> <CLR STMT> ::= CLAR (VAR) UNC STMT> ::= INC ⟨VAR> 〈DEC STMT〉::= DEC 〈VAR〉 <HLT STMT> ::= HALT 〈LEAVE STMT〉::= LEAVE 〈LABEL〉 ⟨GOTO STMT⟩ ::= GOTO ⟨LABEL⟩ \langle XEQ STMT \rangle ::= EXECUTE \langle ID \rangle / EXECUTE \langle ID \rangle (\langle ID \rangle ) ⟨PUSH STMT⟩ ::= PUSH ⟨ID⟩ ⟨RETURN STMT⟩ ::= RETURN ⟨COMMENT STMT⟩ ::= [* {⟨ALPHABET⟩} {⟨DIGIT⟩} {⟨SYMBOL⟩} *] ⟨SET FLAGS⟩ ::= EMPTY /'⟨' ID {, ⟨ID⟩ '⟩'} ⟨STRUCTURED STMT⟩ ::= ⟨IFTRUE STMS⟩ / ⟨IFFALSE STMT⟩ / ⟨SELECT STMT⟩ / (COND STMT) / (FOR STMT) / (WHILE STMT) (IFTRUE STMT) ::= IFTRUE (IFSTMT) (IFFALSE STMT) ::= IFFALSE (IF STMT) 〈IF STMT〉::= ( 〈BOOLEAN EXPRESSION〉 ) THEN; 〈COMPOUND STMT〉 ELSE (STMT) ENDIF ⟨SELECT STMT⟩ ::= SELECT ( (ID ) , (INTEGER> ) FROM; ⟨SLCT ITEM⟩ ; ⟨ ⟨SLCT ITEM⟩ ; } ENDSELECT ⟨COND STMT⟩ ::= COND; ⟨COND ITEM⟩; {⟨COND ITEM⟩;} END COND (FOR STMT) ::= FOR (ID) = (INTEGER) ; COMPOUND STMT> WHILE STMT> ::= WHILE ( \ COOLEAN EXPRESSION > ); \ COMPOUND STMT > ENDWHILE ⟨COMPOUND STMT⟩ ::= BEGIN; ⟨CODE⟩ END; ⟨ELSE STMT⟩ ::= EMPTY / ELSE; ⟨COMPOUND STMT⟩ ⟨BOOLEAN EXPRESSION⟩ ::= ⟨BOOL FACTOR⟩ / ⟨BOOL FACTOR⟩ .EQ. ⟨BOOL FACTOR⟩/ 〈BOOL FACTOR〉 .LT. 〈BOOL FACTOR〉 /⟨BOOL FACTOR〉 .GT. (BOOL FACTOR) / (BIT VAR) ⟨BOOL FACTOR⟩ ::= ⟨TERM⟩ ⟨SLCT ITEM⟩ ::= ( ⟨INTEGER⟩ , ⟨LABEL⟩ ) ⟨COND ITEM⟩ ::= ( ⟨BOOLEAN EXPRESSION⟩ ); ⟨COMPOUND⟩ 〈LABEL〉 ::= ⟨ID⟩ 〈PM OPERATOR〉 ::= +/- ⟨ARITHMETIC TYPE⟩ ::= 1/2 ⟨NEGATIVE INTEGER⟩ ::= - ⟨INTEGER⟩ <OPERATOR> ::= +/-/*/ // .AND./.OR./.XOR. \( SHIFT OPERATOR \) ::= .SHTLO./.SHTLI./.SHTRO./.SHTRI. <ROT OPERATOR> ::= .ROTL./.ROTR. (ASSIGNMENT OPERATOR) ::= '=' 〈ID〉 ::= 〈ALPHABET〉 / 〈DIGIT〉} <EMPTY> := \langle ALPHABET \rangle := A/B/C/D/E/F/G/H/I/J/K/L/M/N/O/P/Q/R/S/T/U/V/W/X/Y/Z <DIGIT> :=0/]/2/3/4/5/6/7/8/9 \langle SMBOL \rangle ::= ./// (/) /[/]/^/\/\\/\/// +/-/*/./?/ /&/ ``` #### APPENDIX B ### SIMULATOR DESCRIPTION The simulator for the IML is written in PASCAL. It is logically broken up into two parts. The first part sets up the symbol table, checks to see if all the variables have been used according to the requirements of VMPL, and sets up a binary equivalent of the IML. The second part then executes the binary code set up by the first part. As indicated in Chapter 6, VMPL is enhanced by the COMMAND statement. The modification which this statement makes to VMPL BNF is shown in Fig. B.1. All commands are given by three unique alphabets. While translating commands to IML, the IML operation is the command itself. However, in column 1, we have a down arrow ( $\checkmark$ ) symbol which indicates to the later part of compilation (which produces microcode) that this IML statement is essentially a comment. The simulator commands can be broken down into types: - 1) DISPLAY - 2) INSERT - 3) CONTROL The DISPLAY commands display the value of the various variables declared in the VMPL program. The INSERT command inserts (sets) values into the various variables declared in the VMPL program. The CONTROL command covers a number of commands. These are: - a) Set and clear break points. The VMPL variables can be tagged such that whenever they are used, control returns to the simulator user. - b) <u>Single step</u>. This is for single step control of the IML program. - c) <u>Symbol table</u>. Display symbol tables set up in part 1 of simulator. - d) Error history. Display error history. When error occurs, this command gives information about the original VMPL statement which generated the IML statement which generated the error. PARAMETERS ::= SEE (73) Fig. B.1 MODIFICATION TO VMPL BNF - e) Option control. A number of options associated with error control can be enabled or disabled. - f) Load. Activate the loader to load data into the emulator memory. - g) <u>Execution Control</u>. This starts and stops monitor and starts and stops the simulation. A detailed description of these commands, their parameters is given in (73). # APPENDIX C This Appendix contains the VMPL-META compiler and the source of the emulator programs written in VMPL and referenced in Chapter 6 as VMl, VM2 and VM3. The programs are in the following order: - 1) VMPL-META compiler. - 2) VMl. - 3) VM2. - 4) VM3. 59 50 ``` META-LINGUISTICAL TRANSLATOR WRITING SYSTEM 78/04/03. 23 HETA V3.12 #META COMPILER FOR V M P L .# .SYNTAX VMPL .VARS SIZE, STK1, STK2 .VARS TMP, GL, PT, BUM, CUM1 .VARS GP1, GP2, N1, N2, N3 5 .VARS CM,ZM.NM,OM 7 .FLAGS ART, C1, C2, C3, NOPR, POP1. CAT1, CAT2 ., 3 VMPL = .LENID 6 .COMFLAG =[* 9 #EMULATOR# #:# .IO .CUT( .LB #00A # # .COL 72) .ONERROR RECVR1 #:# 10 S(GLOBALDECL) .OUT( .L8 ±008 PROGRAMSTART# .COL 72) $(#PROC# PROS / #SPROC# SPROC /#ENGEMULATOR# .OUT( .LS 12 13 #GOC PROGRAMEND# .COL 72) #;# .TRETURN) 14 RECVR1 = .EARROW .MESSAGE #SYNTAX ERROR. SCANNING RESUMES AT NEXT <:># 15 .SCAN #:# ., 16 17 13 [* GLOBAL CECLARATIONS *1 19 23 GLOSALDECL = #DCL + . SELECT 21 (#WCRDSIZE# WORDEF #:# / 22 #:# / #ARITHMETIC# ARTCEF 23 z: = / #GLOBAL# GLEDEF 24 #:# / 25 #IPROC# IPROEF #:# ) 25 ≠FIELO≠ FLODEF 27 23 WORSEF = .INTEGER .OUT(.LB \pm000 ,.,\pm * .COL 72) .SET(SIZE, *) ., 29 30 ARTOEF = .PUT(.LB ±30E±) (\pm1± .OUT(\pm0NE± .COL 72) / 31 #2# .GUT(#TWG + .COL 72) / #CNE# .GUT(#ONE# .COL 72) / 32 3.3 #TW0# .CLT (#TW0# .COL 72)) .. 34 35 IPROEF = IPRO1 3(\pm, \pm IPRO1) 35 37 IPRC1 = .PUT(.L3 #306 #) .SELECT (±0± .OUT(±0± .COL 72) 33 #0# .0UT(#0# .COL 72) 39 ±N≠ .OUT(≠N≠ .COL 72) 1 40 #Z# .OUT(#Z# .COL 72) 41 42 FLOGEF = FLOCUT \$(\pm,\pm) FLOGUT) 43 44 FLOCUT = .10 .PUT(.LB \pm005 \pm * \pm,...\pm) \pm(\pm.INTEGEP .PUT(* \pm,\pm) \pm,\pm .INTEGE 45 .PUT(* ±,±) (x,±(.INTEGER .PUT(*) ±)± / ±-± .INTEGER .PUT(±-± *) : 46 ) / ±) ± .PUT(±0±) ) .CUT(.COL 72) 47 43 49 GLEGEF = .SET(GL. #2#) PTOEF GLED1 .. 53 PTCEF = (#PEPMANENT# .SET(PT,#2#) / #TEMPORARY# .SET(PT,#1#)) 51 52 GLED1 = (#MEMORY# .SET(TMP,#1 #) MEMORE 53 #STACK# .SET (TMP,#2 #) -STKDEF 54 #PSTACK# .SET(TMP,#3 #)PSTKOEF 55 #EXTERNAL# .SET(TMP,#9 #) EXTOSE 3(#,# EXTOSE) 56 #FLAG = .SET.TMP, #4 #) FLAGDEF $(#,# FLAGDEF) 57 #SIMPLE# .SET(TMP, #0 #) SIMDEF 3(#, # SIMDEF) 53 ``` .EMPTY .SET(TMP, #8 #) SIMDEF 3(#,# SIMDEF) ``` MEMBER = .ID TAGRUT .PUT(* #,#) ### #(# .INTEGER .PUT(* #,#) #)# SIZEDECL 51 52 .OUT(.COL 72) ., 63 STKCEF = .IO TAGPUT .PUT(*\pm,\pm) \pm1\pm2(\pm .INTEGER .FUT(*\pm,\pm) \pm1\pm SIZEDECL 64 . 65 .OUT(.COL 72) 66 PSTROEF = .ID TAGPUT .PUT(* \pm,,\pm) \pm1\pm1 (.INTEGER .PUT(* \pm,\pm) \pm1\pm4(\pm) 67 #(# .PUT(SIZE #,#)) STACKOPS #)# .CUT(.COL 72) 58 59 70 STACKOPS = .SELECT ( #+# .PUT(#+,#) .SET(STK1,#+#) STOP1 / 71 72 ### .PUT(##,#) .SET(STK1,###) STOP1 / z+z .PUT(z+,z) .SET(STK1,z-z) STOP2 / 73 \pm \pm \pm .PUT(\pm \pm \pm \pm) .SET(STK1, \pm \pm \pm) STOP2 ) 74 75 75 FLAGDEF = TAGPUT .ID .SET(DUM,*) .PUT(* ±,,1,±) 77 (### ( #C# .SET(CM, EUM) .. PUT(#1#) / #Q# .SET(OM, QUM) .PUT(#2#) 73 79 #N# .SET(NM, CUM) .PUT(#3#) #Z# .SET(ZM, DUM) .PUT(#4#) 80 ١, .EMPTY (.IFEQUAL(DUM, ±C±) .SET(CM, ±C±) .PUT(±1±) 81 .IFEQUAL(DUM, #0#) .SET(OM, #0#) .PUT(#2#) 32 .IFEQUAL(QUM, #N#) .SET(NM, #N#) .PUT(#3#) 9.7 .IFEQUAL(DUM, #Z#) .SET(ZM, #Z#) .PUT(#4#) 84 )) .guf(.cgl 72) .EMPTY .PUT(±0±) 85 85 EXTOEF = .ID TAGRUT .PUT(* \pm,,\pm)(\pm!\pm)(\pmP\pm .PUT(\pm,P\pm)/ \pmF\pm .PUT(\pm1.E\pm)/ 87 .INTEGER .PUT(*) ) / .EMPTY .PUT(SIZE)) .OUT(.COL 72) ., 88 89 911 SIMBEF = .10 TAGPUT .PLT(* \pm,,\pm) SIZEDECL .CUT(.CQL 72) 91 92 TAGPUT = .FUT(.LB GL PT TMP) 93 STOP1 = (\pm + \pm \cdot PUT(\pm + \pm) \cdot SET(STK2 \cdot \pm - \pm) / 94 \pm\pm\pm .PUT(\pm\pm\pm) .SET(STK2,\pm\pm\pm) ) .PUT(\pm\pm\pm STK2 \pm\pm\pm STK1) 95 95 97 t+t .PUT(t+t) .SET(STK2, t+t) ) .PUT(t, t STK2 t, t STK1) 93 99 SIZEDECL = (### .INTEGER .PUT(*) / .EMPTY .PUT(SIZE)) 100 101 102 103 E* MAIN BLOCKS *1 104 105 PROC = \pm i \pm .ID .OUT(.L3 \pm 0.0F \pm * .COL 72) \pm i \pm 5(\pm 0.0L\pm PROCDECL ) CODE ., 106 187 SPROC = \pm1# .ID .PUT(.L8 #80G #*) ARG #1# 5(#80CL# SPROCDECL ) CORE 103 109 ARG = ( \pm(\pm ARGCD \cdot OUT(\cdot GOL 1 \pm \pm \pm \cdot GOL 72) / \cdot EMPTY \cdot OUT(\cdot GOL 72)) 110 ARGCD = S(.ID .PUT(.OOL 25 \pm Az +) (t,t .ID .OUT(.COL 34 \pm Az + .COL 72) 111 / #) # .OUT(.COL 72) .TRETURN) 112 (z, \pm .PUT(.Col i \pm+\pm)/\pm) .TRETURN )) ., CODE = \pm3\pm .OUT(.L9 \pm00H\pm .COL 72) EXECONE \pm5\pm .OUT(.L9 \pm00H\pm .COL 72) 113 114 115 FROCOFOL = .SELECT 116 (#GLOBAL# #USE# GLUDEF #:# 117 #LOCAL# LOLGEF #:# 118 #SPROC# #USE# SPROEF ### 119 120 GLUDEF = GLUD1 S(\pm,\pm) GLUD1) ., 121 122 GLUD1 = .IG .CUT(.L3 #206 # * .COL 72). ., 123 ``` ``` 124 125 LOLDEF = .SET(GL, #1#) PIDEF (#SIMPLE#/.EMPTY) ..SET(TMP, #0 #) SIMDEF 125 $(#,# SIMDEF) ., 127 123 SPROEF = SPRO1 \delta(\pm, \pm) SPRO1) 129 131 SPR01 = .ID .OUT(.L3 #406 # * .COL 72) 131 132 SPROGDECL = (#SPROC# #USE# SPROEF #:# 133 #GLOBAL# GLERD #:# 134 #LOCAL# LOLSERD #1# 135 135 GLERO = .SET(GL, #2 *) .SET(PT, #0 #) (#EXPECT # .SET(TMP, #7 #) / 137 / #RETURN# .SET(TMP,#8 #)) GLER01 3(#,# GLER01) ., 133 139 GLERD1 = .IO TAGPUT .OUT(* .COL 72) 140 141 LCLSERO = .SET(GL. #1#) .SET(PT, #0#) (#EXPECT# .SET(TMP. #7 #) LCLER / 142 #RETURN# .SET(TMP,#8 #) LOLER / .EMPTY PTDEF (#SIMPLE# / 143 .EMPTY) .SET(TMP, #0 #) LCLS ) 144 145 LOLER = GLERO1 \delta(z, z) GLERO1) ., 146 147 LOUS = SIMCEF $(#, # SIMDEF) 148 149 EXECODE = .ONEPROR FECURI R(.ID + #1# .OUT(.CCL 1 * .COL 72) 150 / STMT1 151 SIMT = (NOTASSIGN \pm : \pm / ASSIGN \pm : \pm) .. 152 153 NOTASSIGN = .SELECT 154 (#SET# SETSTMT 155 #CLEAR# CLRSTYT 155 #INC # INCSTMT 157 ≠9EC≠ DECSTAT 153 #HAL T # HALTSTMT #RETURN# 159 RETSTAT 160 #LEAVE# LEAVSTMT 161 #IFTRUE# IFTRSTMT 162 #IFFALSE# IFFLSTMT 163 #COND# CONDSTMT 164 #BEGIN# COMPOUND 165 #SELECT# SELTSTMT 165 ≠GOTO± GOTOSTMT 167 ≠EXECUTE≠ EXECSIMI 163 #PUSH# PUSHSTMT 169 #F02# FORSTMI 173 CMOSTMT 171 #WHILE# WILESTMI ) 172 COMPOUND = #:# .OUT(.COL 1 #88J# .COL 72) .ONERROR RECVR1 3(#END# .EXIT 173 / .ID+#:# .OUT(.COL 1 * .COL 72) / STMT) .OUT(.COL 1 174 #80K# .COL 72) 175 175 SETSTMT = .SET(OP1, #SET#) SCOUT1 177 CLRSTMT = .SET(OP1, #CLR#) SCOUT1 173 SCCUT2 = (#POF+± .T+ .OUT(.COL & #POP+ .COL 16 #+# *T .COL 72) .SET(N2.*T) 179 / .ID .SET(N2,*) / .INTEGER .SET(N2,f) .SETFLAG C1 ) ., 180 181 $COUT3 = .T+ .OUT(.OCL 8 OF1 .COL 16 #+# #T .COL 72) $ETFLAGS .PUT( .COL 8 #WMOVE# .COL 17 N1 .COL 26 N2 .COL 34 #-# #T) ( 132 183 .IFTEMP N2 .PUT(.COL 25 ±-±) / .IFFLAG C1 .PUT(.COL 25 134 (YTRME. \ (#3# 185 135 SCOUT1 = .CLRFLAG C1 (.ID+#(# .SET(N1,*) SCOUT2 #1# SCOUT3 / ``` ``` #5# .ID+#(# .SET(N1,*) SCOUT2 #1# SCOUT4 SCOUT3 / .ID SETHLAGS 197 .PUT(.COL 8 OP1 .COL 17 *) ) .OUT(.COL 72) ., 1.83 189 SCOUT4 = .T+ .PUT(.SOL 8 #RMOVE# .COL 17 N1 .COL 26 N2 .COL 34 #+# *T 190 191 (.IFTEMP N2 .PUT(.COL 25 #-#)/ .EMPTY)( .IFFLAG C1 .PUT(.COL 25 192 #C#)/ .EMPTY) .OUT(.COL 72) .SET(N2.*T) 193 194 INCSTMT = .SET(0P1, #4000#) .SET(0P2, #INC#) IDOUT1 195 196 DECSTMT = .SET(OP1, #SUEC#) .SET(OP2, #0EC#) IDCUT1 197 193 IDCUT1 = .CLPFLAG C1 (.ID+#(# .SET(N1,*) SCOUT2 #1# IDOUT2 / 199 #≤# .IO+#(# .SET(M1,*) 3CQUT2 #1# SCQUT4 IJQUT2 / 200 .ID+#//# .SET(N1,*) .ID .OUT(.COL 7 #+# OP2 .COL 17 * 201 .COL 43 #0# .CCL 72) SETFLAGS .CUT(.COL 8 OP1 .COL 17 N1 .COL 25 202 .CCL 72 ) / .I3 SETFLAGS .PUT(.COL & CP2 #C0# .COL 35 N1 203 .CCL 17 *) ) .OUT(.COL 72) 2114 IDCUT2 = .T+ .PUT(.CCL 8 #RMCVE# .COL 17 N1 .COL 26 N2 .COL 34 #+# 205 *T) (.IFTEM® NZ .PUT( .CCL 25 ±+±) / .IFFLAG C1 .PUT(.COL 25 ±C±) / .EMPTY ) .OUT(.CCL 72) SETFLAGS .OUT(.COL 8 OP2 .COL 16 ±+± *T .COL 72) .PUT(.COL 8 206 207 208 \pmWMOVE\pm .COL 17 N1 .COL 26 N2 .COL 34 \pm-\pm *T) (.IFTEMP N2 .PUT(.COL 25 \pm-\pm) / .IFFLAG C1 .PU 209 / .IFFLAG C1 .PUT(.COL 25 210 211 #C#) / .EMPTY ) 212 HALTSTMT = .GUT(.COL 8 #HALT# .COL 72 ) 213 RETSTMT = .OUT(.COL 8 #RET# .COL 72 ) 214 215 LEAVSTMT = .IO .OUT(.COL 8 #38CH# .COL 16 #3# * .COL 72 ) ., 215 217 IFTRSTMT = .CLRFLAG C2 # (#IFTR#) # IFOUT ., 213 219 IFTR = TERM1 .STACK N2 (.IFFLAG C2 .SETFLAG C3 .CLRFLAG C2 223 / .EMPTY) (±,± .puT(.CCL 12 ±F±) BITOUT .TRETURN 221 222 / #.# .SELECT (#EG#.SET(OP1,#Z#) #.# TERM1 GPR2 .PUT(.CCL 12 #F#) IFOUT1 / 223 #LT#.SET(CF1,#N#) #.# TERM1 OPR2 . PUT(.COL 12 #F#) IFOUT1 / 224 #GE# .SET(OP1,#Z,N#) #.# TERM1 .UNSTACK DUM .STACK 42 225 .SET (N2'.DUM) CHFLAG OPR2 .PUT(.COL 12 #F#) IFOUT1 / 226 #LE# .SET(OP1,#Z,N#) #.# TERM1 OPR2 .PUT(.COL 12 #F#) IFOUT1 / 227 #NE# .SET(OP1, #Z#) #.# TERM1 OPR2 .PUT(.COL 12 #T#) IFOUT1 / 223 #GT#.SET(OP1,#N#) #.# TERM1 OPR2 .=UT(.CCL 12 #T#) IFOUT1 ) 229 230 . . 231 .CLRFLAG C2 #(# IFFL #)# IFOLT 232 TEFISTMT = CHFLAG = .CLRFLAG C1 (.IFFLAG C2 .SETFLAG C1 / .EMPTY) 233 (.IFFLAG C3 .SETFLAG C2 /.CLRFLAG C2) 234 (.IFFLAG C1 .SETFLAG C3 /.CLRFLAG C3) 235 236 IFFL = TERM1 .STACK N2 (.IFFLAG C2 .SETFLAG C3 .CLRFLAG C2 237 / .EMPTY) (#,# .PUT(.COL 12 #F#) BITOUT .TRETURN / 278 ≠.≠ .SELECT 239 (#EG# .SET(OP1,#Z#) #.# TERM1 OPR2 .PUT(.CCL 12 #T#) IFGUT1 / 249 #LT# .SET(OP1,#N#) #.# TERM1 OFR2 .PUT(.CCL 12 #T#) IFGUT1 / 241 INE# .SET(GP1,#Z#) #.# TERM1 OPR2 .PUT(.COL 12 #F#) IFOUT1 / 242 #LE# .SET(GP1,#Z,N#) #.# TERM1 OPR2 .PUT(.COL 12 #T#) IFOUT1 / 243 #GE# .SET(OP1,#Z,N#) #.# TERM1 .UNSTACK DUM .STACK N2 244 .SET(N2, DUM) CHFL46 CPR2 .PUT(.COL 12 ±T±) IFOUT1 / 245 #GT# .SET(OP1,#N#) #.# TEPM1 OFR2 .PUT(.COL 12 #F#) IFCUT1 ) 246 247 } 248 IFOUT1 = . PUT(.COL 3 #COND# .CCL 17 0P1) .. 2.0 ``` ``` 250 IFOUT = .OUT(.COL 25 \pmT\pm *1 .COL 72) \pmTHEN\pm \pm1\pm1 (.IO+\pm1\pm .OUT(.COL 1 * .COL 72) / .EMPTY) NOTASSIGN \pm1\pm1 (\pm5LSE\pm2 251 252 #:# .OUT(.GGL 3 #3RCH# 253 254 .COL 16 #E# .COL 17 *2 .COL 72 ) .OUT(.COL 1 *1 .COL 72 ) 255 (.IO+### .OUT(.COL 1 * .COL 72) / .EMPTY) NOTASSIGN #;# .OUT( 256 .COL 1 *2 .COL 72) / .EMPTY .OUT(.COL 1 *1 .COL 72)) #ENDIF# 257 258 259 CPR2 = .UNSTACK N1 .PUT(.COL 7 #+COMP# .COL 17 N1 .COL 25 N2 260 .COL 43 OP1) / .EMPTY } (.IFTEMP N1 .PUT(.COL 16 #-#) 261 (.IFTEMP N2 .PUT(.COL 25 ±-±) / .EMPTY) (.IFFLAG C3 .PLT(.COL 16 ±C±) .CLRFLAG C3 / .EMPTY) 262 263 264 (.IFFLAG C2 .PUT(.COL 25 ±0±) .CLRFLAG C2 / .EMPTY) 265 .OUT(.COL 72) 266 267 RITCUT = .INTEGER .PUT(.COL & #COND# .COL 16 #.# N2 #,# *) 258 (.IFTEMP NO .PUT(.COL 15 ±-±) / .EMPTY) 259 273 CONDSTMT = #:# $( #(# CONDITEM / #ENDCONE# .TRETURN ) .. 271 272 CONDITEM = IFTR .OUT(.COL 25 #L# *1 .COL 72) #1# #:# 273 {.ID+#:# .OUT(.COL 1 * .COL 72) / .EMPTY) NCTASSIGN #:# .OUT( .COL 1 *1 .COL 72) 274 275 276 SELTSTMT =.PUT(.COL A \pmSLCT\pm) \pm(\pm .IO .PUT( .COL 17 \pm) \pm,\pm .INTEGER \pm) 277 .OUT(.COL 25 ±G± * .CCL 72) #FRCM# #:# 3(#(# SLITEM #)# #:# / 273 #ENDSELECT# .OUT(.COL 1 ### .COL 72 ) .TRETURN ) 279 280 SLITEM = .INTEGER .PUT(.COL 1 x** .COL.7 #C# *) #.# .IG .OUT(.COL 16 #S# 281 * .COL 72} ., 282 233 GOTOSTMT = .ID .OUT(.COL 8 #BROH# .COL 16 #G# * .COL 72) 284 295 EXECSTMT = .ID .PUT(.COL 8 #XEG# .COL 17 *) ( #(# EXECPAR .OUT(.COL 1 286 ### .COL 72) / .EMPTY .CUT(.COL 72)) 237 288 289 EXECPAR =*(.ID .PUT(.COL 25 ±P± *)(±,± .ID .OUT(.COL 34 ±P± * .COL 72) 290 / #) # .OUT(.COL 72) .TRETURN) (#,# .PUT(.COL 1 ###) / #)# 291 .TFETURN )) 292 293 PUSHSTMT = .CLRFLAG 02 (.INTEGER .SET(N2, *) .SETFLAG 03 / .EMPTY TERM2) 294 .PUT(.COL 8 ≠PSH≠ .COL 17 N2) 295 (.IFFLAG C2 .PUT(.CCL 15 ±C±) / .EMPTY) 296 (.IFTEMP N2 .PUT(.COL 16 #-#) / .EMPTY) 297 SETFLAGS .QUT(.COL 72) 298 299 FORSTMT = .DUT(.COL 1 *1 .COL 72) .IO .PUT(.COL 3 ±LOOP± .COL17 *) ±=± •INTEGER .FUT(.COL 25 ≠C≠ +) ≠TO≠ •INTEGER .OUT(.COL 34 ≠C≠ + .COL 72 300 ) \pm; \pm (.10+\pm; \pm .CUT(.COL 1 * .COL 72) / .EMPTY) NOTASSIGN \pm; \pm 301 #ENDFOR= .CLT(.COL 8 #8RCH# .CCL 16 #F# *1 .COL 72) 302 303 304 WILESTMT = ±(± .0UT(.COL 1 *1 .COL 72) IFT9 .CUT(.COL 25 ≠L± *2 .GOL 72 305 ) t) t t! t (.ID+#:± .OUT(.COL 1 * .COL 72) / .EMPTY) NOTASSIGN #:± .CUT(.COL 3 ±8RCH± .OOL 16 ±L± *1 .COL 72 / .COL 1 *2 306 307 308 .COL 72) #ENOWHILE# 309 713 OMOSTMT = .ID .OUT(.COL 1 ## # * .SCL 72) #+# ASSIGN = .JLRFLAG C2 .GLRFLAG C3 .GLRFLAG POP1 .GLFFLAG NOPR 311 STAS BAFFLAG ART . CLRFLAG C1 . GLRFLAG CAT1 . CLRFLAG CAT2 312 ``` ``` ( .IC+±//± CATASSG / .EMPTY SIMASSG ) 313 314 SIMASSG =(.I)+#[# .STACK * SBVAR1 #]# (#=## .SETFLAG ART/#=# 315 .EMPTY) EXPR2 (.IFFLAG NOPR .EMPTY / .EMPTY EXPR1) ASS1 / 315 .ID .STACK * (#=## .SETFLAG ART / #=# .EMPTY) EXPRZ 317 (.IFFLAG NCPR EXMY / .EMPTY EXPR1) ASS2 / ± \(\pm \). ID+±(\pm \). STACK * SBVAR1 \(\pm \). \(\pm \). SETFLAG ART / \(\pm \). 313 319 .EMPTY) EXPR2 (.IFFLAG NOPR .EMPTY / .EMPTY EXPR1) ASS3) 320 321 322 SBVAR1 = (#POP+# .SETFLAG POP1 /.IO .STACK * / .INTEGER .STACK * 323 324 .SETFLAG C1 ) 325 EXPR1 = (.IFFLAG NOFR .EMPTY / .EMPTY .UNSTACK N3 .PUT(.COL 3 GP1 326 327 .COL 17 N3 .COL 26 N2) (.IFFLAG 03 .PUT(.COL 16 #G#) / .EMPTY) 329 (.IFFLAG G2 .PUT(.COL 25 ±C±) / .EMFTY) 329 (.IFTEMP N3 .PUT(.COL 16 ±-±) / .EMPTY) 330 (.IFTEMP N2 .PLT(.COL 25 ±-±) / .EMPTY) 331 (.IFFLAG CAT1 .PUT(.COL 25 ±/±) / .EMPTY) 332 (.IFFLAG GAT2 .PUT(.COL 16 #/#) / .EMPTY) 333 SETFLAGS (.IFFLAG ART .PUT(.COL 7 #%#) / .EMPTY)) ., 334 335 336 337 EXPR2 = TERM1 .STACK NZ (.IFFLAG C2 .SETFLAG C3 .CLRFLAG C2 / .EMPTY ) 338 (.IFFLAG CAT1 .SETFLAG CAT2 .CLRFLAG CAT1 / .EMPTY) 339 ( #+# TERM1 .SET (OP1, #ABC#) 340 t=t TERM1 .SET(OP1, #SUB #) 341 1 ### TERM1 .SET(OP1,#MPY#) 342 #/# TERM1 .SET(OP1,#DVD#) 343 1 #.AND.# TERM1 .SET(OP1, #AND#) 344 #.0R.# TERM1 .SET(0P1,#0R#) 345 #.XOR. # TERM1 .SET(OP1, #XOR#) 346 .EMPTY .SETFLAG NOPR 347 348 TERM1 = (\pm.NGT.\pm TERM2 .T+ .PUT(.COL 3 \pmNOT\pm.COL 17 N2 .CCL 25 \pm+\pm *T ) 349 (.IFTEMP N2 .PUT(.COL 16 ±-±) / .EMPTY) .OUT(.COL 72) 350 .SET(N2, *T) .TR (TURN / 351 .INTEGER .SET(N2,*) .SETFLAG 02 .TRETURN / .EMPTY TERM2 ) 352 ( #.SHTL0.# .INTEGER .SET(QP1,#L#) .SET(OP2,#0#) SHOUT 353 #.SHTPO. # .INTEGER .SET (OP1, #R#) .SET(CP2, #8#) SHOUT 354 #.SHTL1. # .INTEGER .STT(0P1, #L#) .SET(CP2, #1#) SHOUT 355 #.SHTR1.# .INTEGER .SET(OP1, #R#) .SET(CP2, #1#) SHOUT 356 #.ROTR.# .INTEGER .SET(OP1,#R#) ROUT / 357 #. POTL. # . INTEGER . SET(OP1, #L#) ROUT / 353 359 .EMPTY ) 360 ROUT = .T+ .FUT(.COL 3 #RO# OF1 .COL 17 N2 .COL 26 * 361 .CGL 34 #+# *T) (.IFTEMP N2 .PUT(.COL 16 #+#) / .EMPTY) 362 (.IFFLAG CAT1 .PUT(.COL 16 #/#) / .EMPTY) 363 .OUT(.COL 72) .SET(N2,*T) 364 . . -T+ .PUT(.COL 8 #SH#OP1 .COL 17 N2 .COL 26 # #,# OP2 .COL 34 #+# 365 *T ) ( .IFTEMR N2 .PUT(.COL 16 #-#) / .EMPTY) 366 (.IFFLAG CAT1 .PUT(.COL 15 \pm/\pm) / .EMPTY) .CUT(.COL 72) 367 .SET(N2.*T) 368 369 TEPM2 = (. ID+#[# SBVAR3 .. SET(N2, *T) 379 .ID+#(# FLEVAF .SET(N2, *T) 371 .IO+#//# .SET(CUM1,*) .IO .SET(N2,DUM1 #,# *) .SETFLAG CAT1 / 372 #POP+# POP2 .SET(N2,*T) 373 .ID .SET(N2,*) 374 #5# .IO+#[# S8VAR3 S8VAR5 .SET(N2,*T) 375 ``` ``` 375 FLOVAR = .STACK * (.IC+#[# SBVAR3 FL91 377 378 #POP+# POP2 .SET(N2,*T) FLO1 .ID .SET(N2,*) FLD1 379 383 #S# .ID+#E# SBVAR3 SEVAR5 FLO1 ) #)# FLO1 = .T+ .PUT(.COL 8 #EXTR#) .UNSTACK BUM1 .PUT(.COL 17 DUM1 381 .COL 34 ±+± *T) (.IFTEMP N2 .PUT(.COL 25 ±-±) / .EMPTY) .CUT( 392 333 .CCL 26 N2 .CCL 72) POP2 = .T+ .OUT(.SOL 8 #POP# .COL 16 #+# *T.COL 72 ) ., 384 785 SBVAR3 = .SET(N1.*) SBVAR4 #1# 386 SBVAR4 = .T+ (±POP+±.QUT(.COL 8 ±POP± .COL 16 ±+± *T .COL 72).PUT(.COL 8 387 #RMOVE# .COL 17 N1 .COL 25 #-# *T .COL 34 #+#) .T+ .OUT(*T .COL 72) / .IO .OUT(.CCL 8 #RMOVE# .COL 17 N1 .COL 26 * .COL 34 #+# *T .COL 72) 383 389 / .INTEGER .QUT(.COL 8 ±RMOVE≠ .COL 17 N1 .COL 25 ≠G≠ ₹ .COL 34 390 #+# *T .COL 72 )) 391 392 SBVAR5 = .FUT(.COL 3 \pmRMOVE\pm .COL 17 N1 .COL 25 \pm-\pm \pmT) .T+ .OUT(.COL 34 \pm+\pm \pmT .COL 72) ., 393 394 395 EXMV =.UNSTACK DUM .UNSTACK DUM .PUT(.COL 3 \pmMOVE\pm.COL 17 N2 .COL 26 DUM) (.IFFLAG G3 .PUT(.COL 16 \pmG\pm) / .EMPTY) SETFLAGS 396 397 (.IFTEMP N2 .PUT(.COL 16 #-#) / .EMPTY) 393 (.IFFLAG ART .PUT(.COL 7 #%# ) 399 (.IFFLAG CAT1 .PUT(.COL 16 #/#) / .EMPTY) 400 401 (YTEMB. \ ASS1 = (.IFFLAG NOPR .UNSTACK N1 / .I+ .CUT(.COL 34 #+# *T .COL 72) 402 .SET(N1, +T)) (.IFFLAG POP1 .T+ .OUT(.COL 3 #POP# .COL 16 #+# 403 .COL 72 *T) .SET(N2,*T) / .EMFTY .UNSTACK N2) .UNSTACK N3 .PUT( 404 .COL 8 #WMOVE# .COL 17 N3 .COL 26 N2 .COL 35 N1) 405 (.IFTEMP N1 .PUT(.COL 34 ±+±) / .EMPTY) 406 (.IFFLAG NORR SETFLAGS (.IFFLAG C3 .PUT(.COL 34 #C#) / 407 .EMPTY) (.IFFLAG CAT2 .PUT(.COL 34 ±/±) / .EMPTY) / .EMFTY) 403 (.IFTEMP N2 .PUT(.COL 25 ±-±) / .EMPTY) 409 (.IFFLAG C1 .PUT(.COL 25 #G#) / .EMPTY) .CUT(.COL 72) 410 ASS2 = (.IFFLAG NOPR .EMPTY / .UNSTACK DUM .FUT(.COL 35 DUM)) 411 412 .OUT (.COL 72) ASS3 = (.IFFL&G NOPR .UNSTACK N1 / .T+ .OUT(.COL 34 #+# FT .CCL 72) 413 414 .SET(N1.*T)) (.IFFLAG POP1 .T+ .OUT(.COL 8 #PCP# .COL 15 #+# 415 +T .CCL 72) .SET(N2,+T) / .EMPTY .UNSTACK N2) .UNSTACK N3 .T+ 415 .PUT(.COL 8 #RMCVE# 417 .COL 17 N3 .COL 25 N2 .COL 34 ### #T) (.IFTEMP N2 .PUT(.COL 25 #-#) 418 / .EMPTY) (.IFFLAG C1 .PUT(.COL 25 #C#)/ .EMPTY) .OUT(.COL 72) 419 .FUT(.COL 8 #WMOVE# .COL 17 N3 .COL 25 #-# #T .COL 35 N1) 420 (.IFTEMP N1 .PUT(.COL 34 ±-±) / .EMPTY) (.IFFLAG NOPR SETFLAGS 421 (.IFFLAG C3 .PUT(.COL 34 ±C±) / .EMPTY) (.IFFLAG CA72 .PUT(.CCL 34 ±/±) / .EMPTY) / .EMPTY 422 423 ) .OUT(.COL 72) 424 425 426 CATASSG = .SET(STK1,*) .ID .SET(STK2,*) (#### .SETFLAG ART / ### .EMPTY) EXPR2 427 (.IFFLAG NOPR .SET(OP1, #HOVE#) .SET(N2,# 428 ±) / .EMPTY) .UNSTACK DUM1 .PUT(.COL 3 OP1 429 .COL 17 DUM1 .COL 26 N2) 430 (.IFFLAG NOPF .PUT(.COL 25 #/# STK1 #,# STK2) / .EMPTY 431 .FUT(.COL 34 #/# STK1 #,# STK2)) 432 (.IFFLAG CAT2 .PUT(.COL 16 #/#) / .EMFTY) 433 (.IFFLAG C3 .PUT(.COL 16 ≠C≠) / .EMPTY) 434 (.IFFLAG CAT1 .PUT(.COL 25 ±/#) / .EMFTY) 435 (.IFFLAG C2 .PUT(.COL 25 ≠C±) / .EMPTY) 435 (.IFTEYP DUM1 .PUT(.COL 16 ≠-±) / .EMPTY) 437 (.IFTEMP N2 .PUT(.CCL 16 #+#) / .EMPTY) SETFLAGS 433 ``` ``` (.IFFLAG ART .PUT(.COL 7 #4#) / .EMPTY) .OUT(.COL 72) 439 440 SETFLAGS = .PUT(.GCL 43) (.IF \pm: \pm .TRETURN / \pm<\pm .EMPTY) 3(.ID SYMTEST /\pm>\pm .TRETURN / \pm, \pm .EMPTY) ., 441 442 443 SYMTEST = (.IFEQUAL(CM,*) .PUT(#C#) UPARW / 444 .IFEQUAL(ZM,*) .PUT(±Z±) UPARW / .IFEQUAL(NM,*) .PUT(±N±) UPARW / .IFEQUAL(OM,*) .PUT(±O±) UPARW ) ., 445 446 447 UPARW = .PUT(.COL 7 #+#) ., 448 449 450 .END .EOF ``` ``` EMULATOR: POP8 : SETUPNOCEALM 6-POR EHT FOR POTALUME AN ZE ZEHT MAIN DECLAPATIONS *] 3 COL WORDSIZE 12 : COL ARITHMETIC 2 : 3 1.3 TOL GLOBAL PERMANENT MEMORY MEMICAJORI : 11 DOL GLOBAL PERMAMENT SIMPLE ACCHIPCIMAR : 12 COL GLOSAL TEMPORARY IR, MOR, OPCD:3: COL GLOSAL TEMPOPARY FLAG LINK:C: 13 14 DOL GLOBAL PERMAN NT EXTERNAL IDINST: P.DATASWH: 12 : COL FIELD CPCODE (9, 11, -9), ORYTRG (8, 11), 15 15 =GAG=(0.6) , RCTFL3 (1,3,+1), 17 OSC (3,4,-3) ,058(0,2): 13 19 23 START OF FIRST PROCEDURE - INSTRUCTION FETCH 21 #1 22 PROC: INF : 23 [ * 24 25 PROCEDURE DECLARATIONS * 1 25 27 DOL GLOBAL USE MEM.IR.PO : 23 23 EXECUTABLE CODE - 1 33 31 32 IR=MEM(PCI : INC PC : 33 34 35 [ * INSTRUCTION DECODE 35 * 1 37 33 PROC: INSTOCO : DOL GLOSAL USE IR. CPCO : 33 40 CPCD=OPCODF(IR) : 41 SELECT (GPCG.3) FRC* : 42 43 (IPM, D) (1, MRI) 44 (2, MRI) 45 (3,0CA) : 45 (4.JMS) 47 43 (5,JM0) (6, IO) 49 51 (7,0PT) : ENGSELECT : 51 53 53 THIS PROCECURE CALCULATES THE REFECTIVE ACCRESS FOR REMORY REFERENCE INSTRUCTIONS E . 55 * ] 55 SPROCIEFTACR : 57 DOL GLOBAL EXPERT MEM,IR,PR : COL GLOBAL RETURN MAR : DOL LOCAL PERMANENT ADRIF,PROTEMP,MART : 53 53 50 ``` ``` 61 62 IS CURRENT PAGE. PAGE ZERO 63 54 ADR=PGEADR(IR) : 55 IFTRUE (IR,7) THEN : 66 57 BEGIN : POTEMP= PC-1 : 63 69 POTEMP=CRNTPG(POTEMP) : 70 MAR=POTEMP.OR.AGR : 71 END : ELSE : 72 73 BEGIN : 74 MAR=40= : END : ENDIF : 75 75 73 DIRECT-INDIRECT ADDRESSING 79 3 G IFTRUE (IR,8) THEN : 31 BEGIN : 82 MART=MEM[MAR] : IFTRUE (MAR.GT.7) THEN : 83 84 BEGIN : IFTRUE (MAR.LT.16) THEN : 85 35 BEGIN : 87 MEMEMARIA MEMEMARIAL : 33 END : ENDIF : 99 ENO : 30 ENGIF : 91 92 MAR=MART : 93 END : ENDIF : 94 95 RETURN : 95 97 MAR NOW CONTAINS THE EFFECTIVE ADDRESS * 1 93 99 [* 100 101 MEMORY REFERENCE INSTRUCTIONS * ] 102 PROCIMEI : 103 104 OCL GLOBAL USE MAR, MEM, MOR : DOL SPROC USE EFTAGE 1 105 105 EXECUTE EFTAGR : 107 MOR=MEM(MAP) : SELECT (OPCD+2) FROM : 103 109 (CNA, 0) 110 111 (1,TAO) 112 (2.ISZ) : ENDSELECT : 113 114 PROC:AND : 115 DOL GLOSAL USE ACOM, MOR, LINK : 115 117 ACCM=ACCM.AND.MDR : 113 113 LEAVE INF : 123 121 PROCETAD : COL GLOBAL USE AGG 4, MOR, LINK : 122 123 ``` ``` 124 1 <>NIJ> ROM+MODA = MODA : LEAVE INF : 125 125 127 PROC:ISZ : DOL GLOBAL USE ACCH. MOR, MAR, PO : 123 129 MEMEMART = MEMEMART+1 : 130 131 IFTRUE (MERCMARI.EG. 0) THEN 1 132 BEGIN : INC PC : END : ENDIF : 133 134 135 LEAVE INF : 136 137 PROC:DCA : 133 DOL GLOBAL USE MEM, ACOM : 139 140 141 EXECUTE EFTADR : MEMEMARIHACOM : 142 143 CLEAR ACCY : LEAVE INF : 144 145 PROCIJMS : 146 147 DOL GLOBAL USE MEM, MAR, PO : 143 149 : RCATPE BTUDBXE 150 MEMEMARIERS : 151 INC MAR : PO=MAR : 152 153 LEAVE INF : 154 155 FRGC:JMO : 156 BOL GLOBAL USE PO, MAR : 157 153 EXECUTE SETABR : 153 PC=MAR : LEAVE INF : 160 161 emoc:Io : 162 DOL GLOSAL USE IR : 153 DOL SPROC USE TOINST : DOL LOCAL PERMANENT DS:5,05:3 : 1ó+ 165 155 GS=8SC(IF) : 167 153 CS=OS8(I*) : : : (20.20)TZMICI STUDBXE 169 170 PROC:OPT : 171 DOL GLOBAL USE IR : 172 173 IFTAUE (IR,8) THEY : 174 175 BEGIN : 175 LEAVE CPRI : 177 END : 173 ELSE : 179 REGIN 4 LEAVE OPER : 131 181 132 ENCIF : 183 134 FRCC: OPR1 : DOL GLOBAL USE IP. ACCA, LINK : 135 COL LOCAL PERMAN NT ROTACTIS : 135 ``` ``` 187 188 COND: (IR,7) : 183 BEGIN : 190 CLEAR ACCM : 191 192 ENO : 193 (IR,6) : BEGIN : 194 195 CLEAR LINK : END : 196 197 (IR,5) : BEGIN : 193 ACCH= .NOT . ACCM : 199 200 END : (IR,4) : 201 202 BEGIN : LINK= .NOT.LINK : 203 2.04 END : (1R,0) ; 235 206 BEGIN : ACCM= ACCM+1 : 297 208 END : # GNOCONE 209 ROTAGT=ROTFLD(IR) : SELECT(ROTAGT,7) FROM : 210 211 (0, INF) : 212 (1, INF) 213 (2, RAL) 214 (3, RTL) 215 216 (4, RAR) (5,272) : 217 213 (6, INF) : (7, INF) : ENGSELECT : 219 223 221 222 PROGRAL : DOL GLOBAL USE LINK. ACCM : 223 224 LINK//ACCM=LINK//ACCh.ROTL.1 : 225 225 227 FROC:RTL : 223 OCL GLOBAL USE LINK. ACC1 : 223 230 LINK//400 H=LINK//400 N. ROTL.2 ; 231 232 PPOCIRAR : 233 DOL GLOSAL USE ACCM.LINK : 234 235 LINK//ACCM=LINK//ACCM.ROTR.1 : 236 237 PROCIPTR : DOL GLOSAL USE LINK, ACCM ; 233 239 240 LINK//ACCM=LINK//ACCM.ROTR.2 : 241 242 PROCIOPR2 : DOL GLOBAL USE IR, 400 x, 70, LINK, DATASWH : 243 DOL LOCAL PERMAN NT COUNT:2, CHECK:2 : 24+ 245 CLEAR COUNT : CLEAR CHECK : 246 24? COND: 243 249 (12.6) : ``` ``` BEGIN : 250 251 COUNT=1 : IFTRUE (ACCM.LT.8) THEN : 252 BEGIN : 253 254 CHECK=1 : 253 END : 256 ENDIF : 257 END : 253 (IR,5) : 259 BEGIN : 260 COUNT=GCUNT+1 : 251 IFTRUE (ACCM.EG.G) THEN : BEGIN : 262 CHECK=CHECK+1 : 263 264 END : 265 ENDIF : 265 END : 267 (IR,4) : 263 BEGIN : 269 COUNT=COUNT+1 : 270 IFTRUE (LINK.EQ.1) THEN : 271 BEGIN : CHECK=CHECK+1 : 272 273 ENO : ENDIF : 274 275 ENO : ENDCOND : 275 277 IFFALSE (IP,3) THEN : 273 BEGIN : 279 IFTRUE (CHECK.NE.O) THEN : 280 BEG IN : 231 ING FG : 282 END : 283 ENDIF : END : 284 235 ELSE : BEGIN : 285 237 IFTRUE (CHECK.EG.COUNT) THEN : 283 SEGIN : INC PC : 289 END : 290 ENDIF : 231 ENO : 292 ENDIF : 293 : CM00 294 295 (IR,7) : 296 BEGIN : 297 CLEAR ACCM : ENC : 298 299 (IR,2) : BEGIN : 300 ACCH=ACCM.OR. CATASWH : 301 END : 302 (IR.1) : 303 304 SEGIN : 305 H4LT : END : 306 307 ENGCOND : 308 LEAVE INF : 309 310 ENGEMULATOR : ``` ``` FMH ATOR: INDARA : E * THIS IS AN EMPLATOR FOR THE THE THE RIPO MICROPROCESSUR *1 GIOBAL DECLAPTIONS *1 1: no wordside a : Ξ DCI ADITHMETIC : ; DCI GLOBAL DERMANENT MENORY MEM-1655361 : DCI GLOBAL OFPMANENT CIMPLE A.B.C.D.E.H.I. +FC:16.DT:3.58:3 : ы (2 ACT GLOBAL +FUPARARY TR : ACT GLOBAL DERMANENT CA+C+7.N ; 10 DOL FIELD MITTER(6,7.-6) , aITS-4(3,4,-2) , aITS-414,5,-41 . 1.1 * ulBY+(15*8*=R) ; 12 1 ARYT (7+A) 1 3 DCI GLOBAL TEMPORARY PSTACK SP: 161 (A++) : 1 11 DCI GLOPAL EXTERNAL OKAVIP : 15 PROC: THE : 16 DCL GLOBAL HSE MEM.PC.IP : 17 ACL SPROCHISE OKAY : 12 # FXECUTE ONAY : ر 1 I # THIS EVERNAL PROCEDURE IFSES FOR INTEURTS FIC .. TO SEE 20 TE ITE IS TOKAYT TO FEECH THE NEXT INSTRUCTION #1 21 TPEMEMENCHCT ; つつ INC DC 4 つる PROCETUSEO : 211 25 NCL GLOBAL HSF TOINT, SR : BCL LOCAL TEMPORARY ODCOME:2 : 26 27 # OPCODE #MTVPF(TA) : 24 OTEDEST(to) : 20 SRECOUNCE(19) : 30 SELECT(Opconf.4) FROM : (n.TynF1) : 31 32 (1 . INE ) ; 2 3 (2. INC) : 7 (1 (3.I+c) ; 35 FNOCEL ECT : # 36 SPROC: ADRET/ADDa:16) : 27 DOL GLOBAL FYDECT MEM.PC : 4 9 DCL GLOBAL OFTURN PC + 30 NCL LOCAL RETHRN ADDR ; L O NCL LOCAL TEMPORARY LEYTE, HRY+E ; # I BYTE=MEMCPCT : INC Pc ; 11.1 HRYTE=MEMPPCT : INC Pr ; u 2 43 APDo=HnYTE/// aYTF : # SPHOC:STORF(RESTR) : Li (1 115 DCI, GLOBAL FYDFCT H.L : DCL GLOBAL RETURN MEM : DCL LOCAL FYPECT DGSTD : 116 117 OCL LOCAL TEMPORAY TEMPINE : # TEMPIHEH// : MEMETEMPIHEGEN # 11 Q na PROC: TYPE1 . 50 51 DCL GLUBAL HSF IR.SR : 50 # SFLECT(Sm.8) #80% ; 53 (n.INC) : 54 (1.TYOF11) ; (2.Tv-F12) ; 55 56 (3.TVDF13) : ``` ``` 57 (4) TypFiul ; 50 (SITYOFIS) 1 <u>= 0</u> (A)TVEFIAL I 60 (7.TvmF17) : د 1 FMDcFLECT : E 60 PROC: TYPE1: : 63 DOL GLOBAL HSF TD. P.C.D. F. H., . SP. CA : DOL LOCAL TEMBORARY ODCODE: 2. AUDR:16 : 64 65 DCL SPROC USE ADDRET : a OPCODE=81+Sus(IR) : 22 47 TETOUR (In.3) THEN : AFGIN : ڍے 49 COND . 70 (Opconf.fo.n) : 71 GEGIN : 70 HIM =HIM +HIMC CCAL : LEAVE INF : フス END : 74 (Opronf.F0.1) ; 75 HEGIN : 74 HILLEHILLEDIZE COAD : LEAVE INF : 77 END : フス IMPCODE.FA.2) ; 79 HEGIN : HI/LEHI/LIH// (CA) | LEAVE IME : ផ្ស s 1 END : 42 (APCONF.FA.3) ; ፍ 3 HEGIN : HIVLEHIVLISP ICAS ; LEAVE THE : 111 45 FMO : בייחרטאים : 26 27 Fid) ; FISE : 6.0 ų Q AFGIN : 0.0 EXECUTE ADDRET(ADDR) : COND : 0.5 (Cornne.FO.a) : REGIN : B//CHANDE : LEAVE THE : EMB : 02 (Oprone.FO.1) ; HEGAN : A//ETADDR : I FAVE THE : END : (OPCORF.FR. 2) : REGIN :U//L FADDR : LEAVE THE I. END : oц (Oprone.FQ.x) : DEGIN : SO=ADDR : LEAVE INE : END : 05 ڪن EMOCAMO 3 97 END : ENDIE : = Q.A DROC: TYPE1: oα DOL GLOBAL USE TRABACADAEAHALASP ; 100 BCL LOCAL TEMBORARY OCCODE:2 . # OPCODETRITECTED : 101 TETOUS ( (to.3) THEN : 1112 REGIN 1 103 1 114 COND . (Opconf.FO.n) ; HERTN : b//CTR//C-1 : 1FAVE INF : END : 165 (Opcome.eq.1) ; REGIN : U//E=D//E-1 : LEAVE INE ; END ; 165 (Opcore.FQ.5) ; REGIN ; H//(FH//I-) ; I FAVE INF : = 110 : 107 (Opronf.FO.3) ; HERTN : DEC SP : I FAVE INF : FNO ; 108 109 FNOCOME 110 F-17 : 111 FISE : HEGIN : 112 113 COND : ``` ``` 114 (OOCODE FO.A) ; BEGIN ; B//C=B//C+1 ; FEAVE INE ; END ; 115 (Opconf.FQ.1) : BEGTN : D//F=D//F+1 : LFAVE INF : CND : 116 (OOCODE.ER. 2) ; BEGIN ; H//L=H//1+1 ; I FAVE INF ; FMD ; 117 (OPCORF.FQ.3) ; BEATN ; INC PC : I FAVE INF : END : 112 ENDCOMD : 119 FNO : FNOTE : # 120 PPOC: TYPF1# : DCL GLOBAL HISE B.C.D.E.H.L.A. 121 122 MEMODTICA. ZINE : 123 DCL LOCAL TEMPORARY TEMP. AUDR. 16 : 124 # COND : 125 (DT.FQ.A) : BEGIN : INC B <AA.Z.N > : LEAVE INF : END : 126 (OT.EQ.1) ; REGIN : INC C KCA.Z.N > : LEAVE INF : END : 127 (DT.EQ. D) : HEGIN : INC D <CA.Z.N > : LEAVE INF : END : 128 (DT.EQ.x) ; REGIN ; INC & <CA.Z.N > : LEAVE INF ; END : 100 (OT.EQ.") : REGIN : INC H < A.Z.N > : LEAVE INF : ENU : (DT.EQ. =) : REGIN : INC L <CA.Z.N > : LEAVE INF ; ENU : 130 131 (DT.EQ.4) : REGIN : 132 ADDREHIZE : NEMEADOR TEMEMEAUDR THE COARTINE : 1 2 2 I FAVE THE FIRM ! 134 (OT.FO.T) : HEGIN : INC A <CA.Z.N > : LEAVE INE : EMD : 135 EMBCOMB : = 136 PROC: TYPEIR : 137 DCL GLOBAL USE A.C.D.F.H.L.A. 138 HEM.DT.CA.T.N : 139 OCL LOCAL TEMPORARY TEMP. AUDR. 16 : 140 a COND : (OT.EQ.A) : BEGIN : DEC & CAAZAN > : LEAVE INF : ENG : 1 (1.1 142 (DT.FO.+) : REGIN : DEC 4 COA.Z.N > : LEAVE INF : END + (DT.EQ.5) : BEGIN : DEC U CCA.Z.N > : LEAVE IME : END : 1/13 1114 (DT.EQ. a) : BEGIN : DEC & KOA, Z.N > : LEAVE IME : END : 145 (DT.FQ. ") : HEGIN : DEC H < CA.Z.N > : LEAVE INF : END : (DT.FR. a) : BEGIN : DEC L (CA.Z.N > : LEAVE INF : END : 1/16 147 (DT.EQ.4) ; BEGIN ; ADDREHIM : VEME ADDRIEMEME AUDRIEL COATTINE : 140 I FAVE THE ; END : 150 (OT.EO. -) : HEGIN : DEC A COA, Z, N > : LEAVE INF : END : 151 FNDCOND : 4 150 PROCITYPE1: 153 DCL GLOBAL USE TO MEMBER B. C. D. E. H. L. A : 154 NCL LOCAL PERMANENT OC:2. AUDR:16 : 155 NCL SPROC USE AUMPET : 155 # 00 - HITC+4(To) ; 157 TETOLIF ((TD, E) THEN ; HEGIN : 158 150 EXECUTE ADDRET (ADDR) : 160 COMP : 161 (OC.Fo.n) : REGIN : MEMCADOD JEL ; INC ADDO : HEMEMCADOR) ; 162 IFAVE INE : END ; 163 (OC.Fo.1); HEGAN; I =MEMEADUR]; INC ADDR; HEMEMEADD]; 1611 I FAVE INE ; END ; 165 (OC.Fo.2) : BEG+N : MEMCADDO JEA : LEAVE INF : FMD : 144 (OC.Fo.3) : BEGIN : AEMEMEADURE : LEAVE INF : END : 167 END-OND : 1 44 FSO : F1 5F : 140 170 AFGIN : ``` ``` 171 COND + (OC.FO.O) :DEGIN :ADDRED//C :MENLAUDRIEA :LEAVE INE: END : 170 (OC.FA.1) :PEGIN | ADDRED//C | AEHEMEAUDRILLEAVE INF: END | (OC.FA.1) :PEGIN | ADDRED//F | MEHEAUDRILLEAVE INF: END | 173 174 (OC.Fo.3) : FEGTM : ADDRED//F: AENEMEADDRET : LEAVE THE: END : 175 175 EMIDCOMID : 177 FND : ENDIE : # PPOC:16 : 178 DCL GLOBAL USE B.C.D.E.H.I.DT.MEM ; 179 190 DOL LOCAL TEMPORAY OPCODE: 3.DATA : 1 1 1 DOL SPHOR HEE STORE : 1 42 H DATA=MEMOBOT : INC PC : 193 (NT.EQ.A) : HEGIN : HEDATA : LEAVE INF : FND : 1 24 (NT.EG.+) : REGIN : C=DATA : LEAVE THE : FNN : 195 (DT.EQ. 0) : HEGIN : DEDATA : LEAVE INF : FND : 186 (DT.EG. ) ; REGIN ; FEDATA : LEAVE THE : FAD : (DT.EG. ) ; REGIN : HEDATA : LEAVE THE : FAD : 197 1 8 8 (nt.EQ.s) : REGIN : LEDATA : LEAVE INF : FND : 140 (DT.FQ.4) ; HEGIN : EXECUTE STURE(DATA) : LEAVE THE : END ; 100 (DT.EQ. 7) : REGIN : AFDATA . LEAVE INF : FND : 191 FNOCOND : " 100 PROCITYPE17 : 143 DOL GLOBAL HSF A.CAINT ! 194 105 DOL LOCAL PERVANENT TEMP : 104 a COND I (DT.EQ a) ; REGINITEMPER.7 : AFA. SHILT.1 : CAFTEMP ; 197 ATTEMP. OP.A : LEAVE THE : FN7 : 100 (nt.Eq. 1) : HEGIN; TEMPEA . 0 : ARA . SHTQ . 1 : CARTEUP ; 100 ATTEMP OR . A ! LEAVE INF : FNO : 200 (DT.FO.2) ; REGINITEMPECA ; CA//AECA//A.SHTI.1 : 201 A=A.CO.TEMP : LEAVE INF : END : 20:2 .(nt.Eq. a) : REGINITEMPEA. 0 : CA//A=CA//A.CHTR.1 : 203 CA-TEMO : LEAVE THE : END : 2014 (DT.FR. H) : OFGIN: | FAVE THE :END ; 205 (DT.EU.E) : REGIN: A=.NOT.A : LEAVE INF : FND : 206 (AT.FN.A) : BEGIN: CAS.NAT.CA : LEAVE INF : FNO : 207 (OT.EO.7) : AFGIN: SET CA ; END ; 20.8 2010 FNDCAND : " ENDEMHEATOD : 210 ``` ``` FMHLATORIXCHPTD : • 2 THIS IS AN EMPLATOR FOR A HYDOTHETICAL TO BIT . . PEGISTER COMPLITER. EACH INSTRUCTION (EXCEPT A FFW) HAS PASICALLY THREE 5 FIFL Oc: 1- OPCODE FIFLM: 4 BITC (BIT 11 TO BIT 8) 4 2- FIRST SOMPOR FIGEN: 4 BITS (MIT 7 TO MIT 4) 3- SECOND SOURCE / DESTINATION FIELD: 4 SITS KBIT 3 TO BIT 0> a THE 4 BIT SOURCE/DESTINATION FIELD IS AUTUALLY MADE HP OF 10 TWO FIFE OS. THE MOST STENTEY CANT BIT OF THIS FIFED IS A 1 1 DIRECT/INDIDECT BIT . WHILE THE 3 LEAST SIGNIFICANT BITS 12 POINT TO OME OF THE FIGHT MACHINE REGISTES. IF THE DIRECT! 13 INDIRECT BIT IS A 111 THEN THE APERAND (SOUDCE/DESTINATION) 1 (1 IS IN MEMORY AND THE ADDRESS IS IN THE REGISTER . IF II IS 15 A IN THE ODERAND IS IN THE REGISTER. 16 THE FOLIAMING INSTRICTIONS HAVE THE APOVE MENTIONED FORMAT 17 ADD SOURCET TO SOURCES AND STORE RESULT IN SOURCES 1 2 A- !ADn! 1.0 8- 'SHR' SURTACT SUBTACT ...... AND WITH CARRY ..... 20 C_ 'ADDCR' 0- 'SHACR' SUBTRACT WITH CAPRY ..... 21 E_ 'ANO' AND 22 23 F_ '0R' OR COMPLEMENT SOURCES AND PHT IT IN DESTINATION G- 'NOT' 211 H_ 14001 MOVE SOURCEL TO DESTINATION 25 THOSE INSTRUCTIONS HAVE THE COLLOWING FORMAT : 24 1- OPCODE CRIT 11 TO BIT 6> 27 つみ 2_ ADDoFSC KRIT 7 TO BIT 0> THE FFFENTIVE ADDRESS IS CALCULATED BY ADIMG THE ADORESS 29 FIFTH TO THE BROBAM CONNIER, TWO'S COMPLEMENT SIGN EXTENHED 30 ADDITION IS DONE. THE INSTRUCTIONS ARE : 31 GO TO THE CUBROUTINE AT THE EFFECTIVE ADDRESS. A- 10011 1 32 THE PO IS BUSHFU IN THE STACK. 23 H_ I,IMP! JIMP TO THE AUDRESS. 3/1 EXECUTE AN TO OPERATION AS SPECIFIED BY THE ADDRESS マニ C- 'TO' FIELD. THIS FIELD HASNY BEEN OFFINED FOR THIS BROGGAM. 36 THOFF INCTOLICTIONS AND THE FIFLDS APE: 37 WORD .. 1- GPCODE ZBIT 11 TO BTT AS 30 >- n call → To bit 4> 34 THE DEST FIELD POIT 3 TO RIT 02 6.0 WORD 2. 1- DATA / ADDRESS FIELD CRIT 1: TO RIT 0> 14 1 40 THE THREE INCTRICTIONS ADE: MAVE THE CONTENTS OF THE MEMORY LOCATION A_ IMOVEMI 43 PAINTED BY THE SECOND WORD INTO THE DESTINATION 11 4 CORRESPONDING TO THE DEST FIFLD 1:5 MAVE THE CECOND WORD INTO THE DEST FIELD B- 'MOUT' 116 JUMP ON COMUITION. THE REGISTEDS POINTED BY THE C- 1,1001 117 REGISTER FIELD APP COMPAPER AND THE TWO DIRECTY US INDIDECT FIELD SPECIFY ONE OF FOUR CANDITIONS. THE 1: 0 CONDITIONS ARE 1) LESS THAN DIGREATED THAN 3) EQUAL 411 TO AND 4) NOT EGHAL TO. THE JUMP AUDDESS IS THE 61 SECOND WOOD . 65 THE LAST THE INCTPLICTIONS ARE 53 WHICH DOES A RETURN FROM A SUBBOULINE BY BORING A. 'DFT' 54 THE STACK INTO THE PL. 44 8- INOD! NO OPERATION 56 ``` ``` THE STACK IS RESIDENT IN THE MAIN MEMORY AND REGISTER 7 SERVES AS THE STACK POINTER \pm 1 57 58 50 DECLARTIONS GIOBAL 40 × 1 61 40 DOL WORDSIDE 12 : ۲ ۲ NOL ADITHMETIC TWO : DCI NEMORY WEM: [4096] : 64 DCI GLOBAL DERMANENT DO.RI.DZ.RY.R4.R5.R6.ID : 45 DOL GLORAL TEMPORARY PC 1 44 DOL GLOBAL DEPMANENT FLAG C.C.7:7 ; 47 42 DCI GLOBAL DERMANENT PSTACK (R7: (A)+) ; DCI EYTERNAL TOPROCIP : 40 OCI FIELD COCI(7,4,-4), CRC2(3,0),OC(10,8,-6), 70 ABDRES(7.0), RITOn(9,8,-8), SEC10(6,4,-4), 71 cRC2n(2.d).BtTS7(7.7.-7),BtTC3(3.3.+3). 70 73 01TCn2(2, n) : PROCEMENT : 74 I* THIS PROCEDULE FETCHES THE NEXT INSTRUCTION AND 75 DOES SOME POIMARY DECODE *1 74 77 DOL GLOBAL HEE MEMIPOITE : DCI FOCAL -EMPORARY OPPODE+4 1 72 # IRENEMEPOR : INC OC : 74 IFTOUF (10.11) THEN : REGIN LEAVE TYPE1 ; FAID: a n FISE : BEGIN : LEAVE TYPES : END; EMPTE : # ឆ្នាំ SPROC: FFTCH (ADDELD: 1. VALUE) 82 DCI GLOBAL USE MEM. PO. 01. Par R3. R4. 95, R6. P7 ; in 2 Q li DOL 1 OCAL EXPECT ADRESD : DOL DETURN VALUE : 45 DCI LOCAL DEPMANENT RITA: 1. DITO2:2 ; 24 # ATTREMITER (ANDFLI)) : 47 PA ATTARENTIAN (ADREID) : 60 COND : (SITO2.EQ.O) : GEGIN : WALUE = RU : GO TO IMPRET : END ; cin (alth2.ed.1) : AFGIN : WALLE = R1 : GO TO INDRCT : FHD : C 1 (SITHE FR. 2) : REGIN : WALLE = P2 : GO TO INDECT : END : 40 (mITH2.ma.3) | PEGIN | WHITH = P3 : G0 TO INDRCT 0.3 (OITH)2.EQ.4) ; REGIN ; VALUE = 84 : GO TO INDECT ; END ; Ca/1 (BITU2.ER.S) : BEGIN : WALUE = R5 : GO TO INDRCT : FND : GG (SITU2.FQ.A) ; REGIN ; VALUE = R6 ; GO TO INDECT ; END ; 44 (SITHE FR. 7) : PEGIN : VALUE = R7 : 07 FMCCONI) ; Q.O INDRCT : ICTRUF (STTR.1) THEN ; REGIN : 00 VALUETMENT VALUET : 160 FMD : FNDTF : # 101 SPROC: STURE (ADRELD: 4. VALUE) ; 102 DCI ALOBAL HEF MEM.RO. 01, Pa. R3. R4. R5. R6. P7 : 103 DCI TOCAL EXPENT ADDELD . VALUE ! 104 DCI LOCAL OFRMANENT BIT3:1. DITA2:2. TEMP. VALUE1 : 105 106 DOL SPROCHEF FETCH : # HTT%=BiTc%(AnoFLD) : 107 102 RITO2=HITCO2(4DRFID) IFTOLIF (BT+3.FA.A) THEN : 100 AFAIN ! 110 C0MB) : 1 1 1 (HITO2.FO.0) : HEGIM : ROTVALUE : LEAVE NEYT : END : 112 (HITO2.FO.1) ; REGIN ; RIZVALUE : LLAVE MEXT ; END : 113 ``` ``` (hita2.Fa.2) : REGIN ; RETVALUE ; LEAVE NEYT ; END ; (hita2.Fa.3) : REGIN ; RETVALUE ; LEAVE NEYT ; END ; 114 115 (MITO2.FO.4) : REGIM : RHEVALUE : LEAVE NEXT : END : 116 (HITO2.FO.5) : HEGIN : RETVALUE : LEAVE NEXT : END : 117 (BITAD. FO. 6) : REGIN : RETVALUE : LEAVE MEYT : EMD : 110 (BITO2.FA.7) : BEGIN ; REAVALUE ; LEAVE MEXT : END : 119 ENDCOMD : 120 FMI) : 191 FISF : 122 123 REGIN: FXECUTE RETCH (ADDELD . VALUE1) : 124 MEMEVALUET TEVALUE : END : ENGIF : H 125 PROC: TYPE1 126 NCI GENHAL LICE IR : 127 DOL LOCAL DEPMANENT ADDED: 4.AnREUS: 4.ADPMAN. 124 OPPN05:0P:2:00:3 : 129 130 DOL SPROCHISE FFTCHISTARE : a ADREDIESOCI(To) : 131 130 ADRED2=Sec2(To) : on=oC(IR) : 1 7 7 1 34 FYECUTE FETCH (ADRED1 . APRNO1) . TETOHE (On. En. 7) THEN : BEGIN : ; 35 F* NAT *] OPPMETS NOT OPPMET COZO : FND : 134 137 END IF : TETOUF (On.GT.S) THEN : BEGIN : 138 E + ∀OV . NOT +1 130 STARF (AnnEng. OPRNO1) : 100 10.1 FNn : FISE : 142 143 MEGIN : C# ADO.SIIR.ADDCR.SIIRCP.AND.CR *1 1114 FXECUTE FETCH(ADRED2+DBRNDs) : 1115 で(02年) t 1/15 (OD. FO. A) PERIN ; 1 /1 7 CPOND2=OPRHD2+OPRNO1 (C+7) : GO TO STRELT : END : 142 (00 EQ.1) ; REGIN ; 100 Ubanusaubstustubanu1 : 150 OPEND2=OPRND2+c (C.Z) ; GO TO STOSET ; FMO ; 151 152 (OD FR. 2) | REGIN ! OPPND2=OPRND2-OPRND1 (C+7) : GO TO STRELT : END : 1 = 3 (00.50.3) : SEGIN : 154 OPENDATORNOS-APENAT : 155 OPONDSEOPRHOSEC (C.Z) : GO TO STOSET : FMU ; 156 (UD. -0.4) | PEGIN | 167 OPONDO=OPRHO2.xND.oPRND1 <</p> 158 (OD, EQ.5) ; PEGIN ; 150 OPONDOZOPRIJOZ.OR.ODRNO1 (C.7) : GO TO CIPSLI + ENO ; 160 ENDCOMP : 161 162 STRELT : F* STORE RESULT 41 EVECUTE STORF (ADREDS: ODRNOS) 163 FND : FADTE : H 164 PROC: TYPE2 : 165 DCI GEOHAI HISFO TO PC ... FM . o 7 ; 155 DOL LOCAL REPMANENT OC. 2. ADBRIG : 157 DOL SPRUC HISE TOPROC ; 1 4 4 160 a TEFALSE (10:10) THEN : HEGIN : 170 ``` ``` T* CALL. IND. TO: NOP *7 171 OC-HITIOC(IR) : ADDR-ADDOES(+R) : 172 173 COMD : (OC.FO.A) : EFGIN : 174 PHSHIPC) : PC=PC_ADDS : (EAVE MEXT ; FND ; 175 176 (OC.FR.+) : AFGIN ; 177 PC=PC_ADDB : I FAVE NEVT : END : 178 (OC.FO.S) : SFGIN : EXECUTE INDROCIADDO) : LEAVE NEXT ; FIND ; 170 140 (OC.FR. X) : REGIN : | FAVE MEVT : EMD : FNDCOMD : 1 : 1 FND : ENDIF : # 162 PROCITYPEPA : 103 184 DCI GLOBAL USF MEM. PC. TR : DCI 1 OCAL OFOMANENT RYTE2. ALREALIA. ADRED218 OC:2: 105 HIT3 . 1 . PITT 7:1 . TEMP, TEUP1 . ADRES : 184 197 DOI SPRUC HISE METCH STARE : 1 22 4 0C=0IT98(TR) : IF(AC.EQ.A) THEN : BEAIN : 100 T* RET +1 DCEPOP : LEAVE NEXT : END : 1 0/1 FISE : 101 PFOIN : 102 103 E * MOVEM . MOVIT . INC 43 SYTE2-MEMERCI: THE PE ; 104 105 CON1) : (OC. FR. 1) : REGIN : +EMP-MEMEBYTE2] : 104 angeno=Seco((a) : EXEAUTE STORE(Angeno,TEMP) 147 102 LEAVE NEXT : END : (OC.ER.2) : REGIN : AURENZESRC2(TB) : EVECUTE STORE/ADDCU2/BYTE2) : LEAVE NEXT : FND : 100 200 (OC.EG.3) ; AFGIN ; AURECEPC+BYTF2 ; 201 21.2 AnnEntesRoth(In) : 203 AneFna=SRCon(In) : EVECUTE FETCH (ADPENTATIONE) : 264 EVECUTE FETCH (ADPENSITEMPI) : 2115 BitTEHITS7(IR) ; HITEPITS3(IR) ; 206 OC-HITS7//PITS: 207 Cox n : 202 Inc.Fo.8) : REGIN : IETRUF(TEMP.GT.TEMP1) THEN : 200 DEGIN : DCHADRES : END : ENDIF : 210 LEAVE DEXT : FND : 211 (AC.Fa.1) : HEBIN : IFTRUF(TEMP.LT.TEMP1) THEM ; 212 BEGIN : BCHADRES : END : EMDIE ; 213 I FAVE NEXT : FND : 214 (AC.FA.2) : HEGIN : IFTRUF(TEMP.EA.TEMP1) THEN : 215 DEGTH : OCHADRES : END : ENDTE : 216 LEAVE NEXT : END : 217 INC.FA. 3) : DEGIN ; TEFALSE (TEMP.FA.TEMP1) THEN : 212 BEGIN : BCHADRES : END : ENDIF : 219 LEAVE NEX+ ; FND : 220 ENDCOND : 221 FND: 222 223 FNDCOND : FARE: 224 225 FNITTE : # FAMFMIR ATOD : 226 ``` ### GLOSSARY ARCHITECTURE: Those facilities of a computer that are visible to a programmer. <u>COMPILER</u>: A program that converts a high level representation of a program into a low level representation. CONTROL STORE: The memory of a m-computer which holds the micro-instructions. DYNAMICALLY MICROPROGRAMMABLE COMPUTER: A microprogrammable machine with the capability of swapping microprograms in and out of a writable control store at a speed which matches the basic processor clock speed. This capability can be looked upon as the dynamic redesign of the architecture of the machine to meet the needs of the immediate job to be done. EMULATOR: A collection of microprograms which when stored in control store, define a computer, i.e., its machine instruction set, is known as an emulator. The machine doing (supporting) the emulation is known as the HOST and the machine which is emulated is known as the TARGET or the VIRTUAL machine. FIRMWARE: Firmware is described as microprograms which are resident in the control memory of a computer. ITERATIVE CONTROL: Some machines have the capability of repeatedly doing an operation (corresponding to some microoperation) until some condition (loop count termination, flag generation etc.) makes them stop. This is a useful feature for multiple precision operations. INTERPRETER: A program that performs the instructions of another program. It differs from a compiler in that it produces the results directly while a compiler produces a representation of a program that must be interpreted by a program or directly executed by a machine to produce results. m-COMPUTER: A microprogrammed or a microprogrammable computer. MACRO-INSTRUCTIONS: These are the instructions which reside in main memory i.e., the conventional machine instructions. The highest level of control over the CPU is exercised by these instructions. Thus they are one level above microinstructions just as nanoinstructions are one level below microinstructions. MICRO ARCHITECTURE: Those facilities of a microprogrammable computer that are visible to a microprogrammer. MICRO CONTROL PROCESSOR: A microprocessor with a control store as part of its control unit. MICROINSTRUCTION: A word contained in the control store of a micro-programmed control unit. It consists of a number of fields; some of which are microoperations and some are literal data like a constant, an address, etc. MICROOPERATIONS: The most primitive or elementary operations which a machine can execute. According to Wilke's original model, these are the signals which go over a single wire to a well defined destination. MICROPROCESSOR: A large scale integrated (LSI) circuit processor on a single chip or a couple of chips. The term 'micro' in microprocessor refers to the physical size of the unit involved. MICROPROGRAMMABLE COMPUTER: When the control store is made up of read-write memory and facilities are provided for changing the contents of the control store, the computer is said to be microprogrammable. The extent of these facilities on a computer (both hardware and software) determines Microprogrammability of the computer. MICROPROGRAMMED COMPUTER: A computer is microprogrammed if the microinstructions which the computer executes are stored in a read only memory. MICROPROGRAMMING: A technique for designing and implementing the control function of a computer, as a sequence of control signals to interpret fixed or dynamically changeable instruction set of the computer. In more general terms it is the activity of programming using microinstructions. NANO ARCHITECTURE: A level below microarchitecture. The nanoprograms residing in NANO STORE define the microarchitecture of a computer. This means a sequence of NANOINSTRUCTIONS are executed to emulate a microinstruction. This is similar to the execution of a sequence of microinstructions to emulate a machine (macro) instruction. NANOPROGRAMMED COMPUTER: A computer is nanoprogrammed if the nanoinstructions which the computer executes are stored in a read only memory. NANOPROGRAMMING: The activity of programming using nanoinstructions. NANO STORE: The memory where the nanoinstructions reside. RESIDUAL CONTROL: In m-computers the microoperations are converted into control signals which directly and immediately control machine resources. This is known as <a href="IMMEDIATE CONTROL">IMMEDIATE CONTROL</a>. An alternative is the residual control scheme where microoperations do not control resources directly, but rather use several <a href="SETUP REGISTERS">SETUP REGISTERS</a> to control hardware resources. The value of a setup register may indicate the microoperation a functional unit has to perform or the address of a register etc. Microinstructions are used to control the values in these setup registers. SIMULATOR: An interpreter in which the interpreted instructions are machine language instructions for some machine (real or abstract). UNIVERSAL HOST: A computer that can be microprogrammed to emulate any desired target machine. WRITABLE CONTROL STORE: The control store of a microprogrammable computer.