#### **Control & Execution**

Finite State Machines for Control

**MIPS** Execution

### Synchronous Systems



On the leading edge of the clock, the input of a latch is transferred to the output and held.

We must be sure the output of the combinational logic has *settled* before the next leading clock edge.

### Asynchronous Systems



No clock!

The data carries a "valid" signal along with it

System goes at greatest possible speed.

Only "computes" when necessary.

Everything we look at in this class will be synchronous

# Fetching Sequential Instructions



How about branch?

### Datapath for R-type Instructions



#### **MUX Blocks**



The select signal determines which of the inputs is connected to the output

# Inside there is a 32 way MUX per bit



And this is just one port! Remember, we have data1 and data2 coming out of the register file!

## Our Register File has 3 ports



# **Implementing Logical Functions**

Suppose we want to map M input bits to N output bits

For example, we need to take the OPCODE field from the instruction and determine what OPERATION to send to the ALU.



#### Remember our ALU?



# **Implementing Logical Functions**

Suppose we want to map M input bits to N output bits

For example, we need to take the OPCODE field from the instruction and determine what OPERATION to send to the ALU.



## We can get 1 bit out with a MUX



#### Or use a ROM





Think of the SUM of PRODUCTS form.

The AND layer generates the products of various input bits

The OR layer combines the products into various outputs

You could also use two NAND layers instead

Could be implemented using Boolean gates, or also using a "programmable logic array" (PLA) [similar to a PROM, but both the AND and the OR parts are programmable].

### Finite State Machines

- •A set of STATES
- •A set of INPUTS
- •A set of OUTPUTS

•A function to map the STATE and the INPUT into the next STATE and an OUTPUT

Remember automata?

### **Traffic Light Controller**



### Controller State Table

| Inputs  |       |       | Outputs |         |         |
|---------|-------|-------|---------|---------|---------|
| State   | NScar | EWcar | Next    | NSlight | EWlight |
| NSgreen | 0     | 0     | NSgreen | 1       | 0       |
| NSgreen | 0     | 1     | EWgreen | 0       | 1       |
| NSgreen | 1     | 0     | NSgreen | 1       | 0       |
| NSgreen | 1     | 1     | EWgreen | 0       | 1       |
| EWgreen | 0     | 0     | EWgreen | 0       | 1       |
| EWgreen | 0     | 1     | EWgreen | 0       | 1       |
| EWgreen | 1     | 0     | NSgreen | 1       | 0       |
| EWgreen | 1     | 1     | NSgreen | 1       | 0       |

# Implementing an FSM



# FSM Example: Recognizing Numbers

Recognize the regular expression for floating point numbers

```
[\t]^{*} [-+]?[0-9]^{*}(. [0-9]^{*})? (e[-+]?[0-9]+)?
```

Examples: +123.456e23 .456 1.5e-10 -123

"a" matches itself
"[abc]" matches one of a, b, or c
"[a-z]" matches one of a, b, c, d, ..., x, y, or z
"0\*" matches zero or more 0's ("", "0", "00", "0000")
"Z?" matches zero or 1 Z's

### FSM Diagram



### FSM Table

- IN : STATE → NEW STATE
- i': start  $\rightarrow$  start  $(0' | (1' | ... | (9' : start \rightarrow whole))$ '+' | '-' : start  $\rightarrow$  sign  $\therefore$  start  $\rightarrow$  frac  $(0' | (1' | ... | (9' : sign \rightarrow whole))$ '.': sign  $\rightarrow$  frac  $(0' | (1' | ... | (9' : whole \rightarrow whole))$  $\therefore$  whole  $\rightarrow$  frac '': whole  $\rightarrow$  done 'e': whole  $\rightarrow \exp$
- 'e' : frac  $\rightarrow$  exp  $(0' | (1' | \dots | (9') : \text{frac} \rightarrow \text{frac})$ '': frac  $\rightarrow$  done  $(0' | (1' | \dots | (9' : exp \rightarrow exp))$ '': exp  $\rightarrow$  done STATE ASSIGNMENTS start = 0 = 000sign = 1 = 001whole = 2 = 010frac = 3 = 011exp = 4 = 100done = 5 = 101error = 6 = 110

# **FSM** Implementation



- Our ROM has:
- •10 inputs
- •5 outputs

### FSM Summary

- With JUST a register and some logic, we can implement complicated sequential functions like recognizing a FP number.
- This is useful in its own right for compilers, input routines, etc.
- The reason we're looking at it here is to see how designers implement the complicated sequences of events required to implement instructions
- Think of the OP-CODE as playing the role of the input character in the recognizer. The character AND the state determine the next state (and action).

### **Five Execution Steps**

- 1. Instruction Fetch
- 2. Instruction Decode and Register Fetch
- 3. Execution, Memory Address Computation, or Branch Completion
- 4. Memory Access or R-type instruction completion
- 5. Memory Read Completion

INSTRUCTIONS TAKE FROM 3 - 5 CYCLES! An FSM looks at the op-code to determine how many...

### Step 1: Instruction Fetch

Use PC to get instruction and put it in the Instruction Register. Increment the PC by 4 and put the result back in the PC. Can be described succinctly using RTL "Register-Transfer Language"

IR = Memory[PC]; IR is "Instruction Register"
PC = PC + 4;

#### Step 2: Instruction Decode and Register Fetch

Read registers rs and rt in case we need them Compute the branch address in case the instruction is a branch

RTL:

```
A = Reg[IR[25-21]];
B = Reg[IR[20-16]];
ALUOut = PC + (sign-extend(IR[15-0]) << 2);</pre>
```

We aren't setting any control lines based on the instruction type (we are busy "decoding" it in our control logic)

# Step 3 (instruction dependent)

ALU is performing one of three functions, based on instruction type

Memory Reference: ALUOut = A + sign-extend(IR[15-0]);

R-type: ALUOut = A op B;

**Branch:** 

if (A==B) PC = ALUOut;

# Step 4 (R-type or memory-access)

Loads and stores access memory

R-type instructions finish

Reg[IR[15-11]] = ALUOut;

# Step 5 Memory Read Completion

Reg[IR[20-16]] = MDR;

#### Summary:

|                         | Action for R-type                           | Action for memory-reference | Action for      | Action for         |  |  |
|-------------------------|---------------------------------------------|-----------------------------|-----------------|--------------------|--|--|
| Step name               | instructions                                | instructions                | branches        | jumps              |  |  |
| Instruction fetch       | IR = Memory[PC]                             |                             |                 |                    |  |  |
|                         |                                             | PC = PC + 4                 |                 |                    |  |  |
| Instruction             | A = Reg [IR[25-21]]                         |                             |                 |                    |  |  |
| decode/register fetch   | le/register fetch B = Reg [IR[20-16]]       |                             |                 |                    |  |  |
|                         | ALUOut = PC + (sign-extend (IR[15-0]) << 2) |                             |                 |                    |  |  |
| Execution, address      | ALUOut = A op B                             | ALUOut = A + sign-extend    | if (A ==B) then | PC = PC [31-28] II |  |  |
| computation, branch/    |                                             | (IR[15-0])                  | PC = ALUOut     | (IR[25-0]<<2)      |  |  |
| jump completion         |                                             |                             |                 |                    |  |  |
| Memory access or R-type | Reg [IR[15-11]] =                           | Load: MDR = Memory[ALUOut]  |                 |                    |  |  |
| completion              | ALUOut                                      | or                          |                 |                    |  |  |
|                         |                                             | Store: Memory [ALUOut] = B  |                 |                    |  |  |
| Memory read completion  |                                             | Load: Reg[IR[20-16]] = MDR  |                 |                    |  |  |