PXN20RM Datasheet, PDF(369/1376 Page) Freescale Semiconductor, Inc – PXN20 Microcontroller

English

English German Russian Spanish Italian Polish Chinese Japanese Korean French Portuguese	Language :

PXN20RM Datasheet, PDF (369/1376 Pages) Freescale Semiconductor, Inc – PXN20 Microcontroller

◁

13.1.3.5 L1 Cache Features

The features of the cache are as follows:

â¢ 32 KB, 4- or 8-way set associative unified cache

â¢ Copyback and writethrough support

â¢ Eight-entry store buffer

â¢ Push buffer

â¢ Linefill buffer

â¢ 32-bit address bus plus attributes and control

â¢ Separate unidirectional 64-bit read data bus and 64-bit write data bus

â¢ Supports cache line locking

â¢ Supports way allocation

â¢ Cache power usage can be minimized

e200z6 Core (Z6)

13.1.3.6 BIU Features

The features of the e200z6 BIU are as follows:

â¢ 32-bit address bus plus attributes and control

â¢ Separate unidirectional 64-bit read data bus and 64-bit write data bus

â¢ Overlapped, in-order accesses

13.1.4 Microarchitecture Summary

The e200z6 processor utilizes a seven stage pipeline for instruction execution. The instruction fetch 1,

instruction fetch 2, instruction decode/register file read, execute1, execute2/memory access1,

execute3/memory access2, and register writeback stages operate in an overlapped fashion, allowing single

clock instruction execution for most instructions.

The integer execution unit consists of a 32-bit arithmetic unit (AU), a logic unit (LU), a 32-bit barrel shifter

(shifter), a mask-insertion unit (MIU), a condition register manipulation unit (CRU), a count-leading-zeros

unit (CLZ), a 32 x 32 hardware multiplier array, result feed-forward hardware, and support hardware for

division.

Most arithmetic and logical operations are executed in a single cycle with the exception of multiply, which

is implemented with a pipelined hardware array, and the divide instructions. A count-leading-zeros unit

operates in a single clock cycle.

The instruction unit contains a PC incrementer and a dedicated branch address adder to minimize delays

during change of flow operations. Sequential prefetching is performed to ensure a supply of instructions

into the execution pipeline. Branch target prefetching is performed to accelerate taken branches.

Prefetched instructions are placed into an instruction buffer capable of holding six sequential instructions.

Branch target addresses are calculated in parallel with branch instruction decode, resulting in execution

time of three clocks. Conditional branches which are not taken execute in a single clock. Branches with

successful lookahead and target prefetching have an effective execution time of one clock.

Freescale Semiconductor

PXN20 Microcontroller Reference Manual, Rev. 1

13-5

▷