16 bit RISC processor for JPEG decoding Team
16 -bit RISC processor for JPEG decoding Team: DARTS Deepesh John, Smitha Shyam, Sujay Phadke, Tejasvi Kachru
JPEG flow Input Image Quantizer Compressed Stream Entropy Encoder Entropy Decoder Compressed Stream DCT Inverse Quantizer IDCT Image
JPEG-block diagram PC Data-path Controller Instruction Memory Booth Multiplier Data Memory Huffman Decoder ROM (Look-up table) Run Length Decoder
Implementation of JPEG Decoding Huffman decoding HUFF_AC HUFF_DC Run length Decoding IDCT
Huffman Decoding Implementation Req data Serial data Huffman decoder R Serial clock data control
Run length Decoding Huffman decoding Run length Decoding MRLD_AC MRLD_DC IDCT
{ Num_of_zero, coeff } Instruction flow DC or AC? 1 st cycle Num >0 RAM Writeback get de-quantization values from memory Put zeroes to RAM multiply 2 nd cycle 3 rd cycle RAM Writeback
Implementation - MRLD 2 nd cycle 1 st cycle 3 rd cycle R MUX T T RAM R RLD MUL data control address ROM 1 st cycle 2 nd cycle
JPEG decoding Huffman decoding Run length Decoding IDCT MUL Rsrc, Rdest SHIFT ADD
Booth Multiplier Booth encoder Shift + Add Partial products
Adder & shifter Adder l 16 -bit Carry Bypass l Carry Chain: Inversion property, Transmission gate XOR for Propagate l Reduction in area achieved by using Wang XNOR Shifter l Logarithmic shifter
Simulation Scripts for all eldo testing l Baseline instructions tested exhaustively l New instructions tested with partial reproduction of jpeg algorithm l
Critical Path l Serial clock – 5 cycles Huffman block -5. 7 ns l ROM read + MUL(MRLD) =2+6 = 8 ns
Layout Data memory Lookup ` table Multiplier Program memory control Huffman & RLD Custom datapath
Statistics Area 1. 2 mm x 1. 2 mm Clock Period 18 ns Serial Clock Period 6 ns I/O pads 16 used
Delay profile for modules Module Register File ALU Shifter Multiplier Huffman Decoder RLD Delay (ns) 0. 45 3. 5 1. 1 3. 5 12 1
System level Diagram JPEG Decoder reqdata sclk Master Processor
Highlights Parallelism of multiply and rom read l Multiply bypass write back to ram for zero values l Adder- wang xnor l Huffman decoding l Scan chain through PC, IREG, PSR l
Improvements DMA port l Interrupts l Peripheral interface l Dynamic changing of Huffman tables l
- Slides: 19