Architectures for Wideband CDMA Software Radios - PowerPoint PPT Presentation

1 / 14
About This Presentation
Title:

Architectures for Wideband CDMA Software Radios

Description:

for 0.25u process. Throughput is dominated by the. MACP satellite ... Formulated based on the data dependencies in the kernel mapping ... Future Work ... – PowerPoint PPT presentation

Number of Views:56
Avg rating:3.0/5.0
Slides: 15
Provided by: vanp
Category:

less

Transcript and Presenter's Notes

Title: Architectures for Wideband CDMA Software Radios


1
Architectures for Wideband CDMA Software Radios
  • CS 252 Final Project
  • Rhett Davis Vandana Prabhu
  • December 9, 1998

2
Motivation for Software Radios
  • General purpose architecture can support many
    standards
  • Can we build them?
  • multi-user detection
  • adaptive interference cancellation
  • low power / portable

Mitola, IEEE Comm. 95
3
Project Goals
TI 'C54x implementation of LMS algorithm would
require 56 processors in parallel (35 with
C6x) Zhang et al., Allerton 98
  • Are these numbers believable?
  • How would a believable mapping of the algorithm
    look on IRAM and Pleiades? Will they meet the
    performance spec?

1.6M LMS loop iterations/sec
4
Wireless Receiver System Overview
cos2p(1.96 GHz)t
RF input
I (50 MS/s)
RF Filter
LNA
Q (50 MS/s)
sin2p(1.96 GHz)t
Teuscher et al., ISCAS 98
5
Digital Baseband Receiver
Adaptive Pilot Correlator
Adaptive Pilot Correlator
Data In
. . .
Acquisition Timing Recovery
Signal Update Block
Adaptive Data Correlator
Data Out
6
Adaptive Pilot Correlator (LMS Algorithm)
Reg
S
S
gtgt
u
S
c
m
Reg
7
The Search for Parallelism Data Dependency Graph
nth iteration
vector ops.
scalar ops.
vector ops.
n1th iteration
8
VIRAM Simulation of Critical Path
  • Assumptions
  • all other operations for this loop have
    completed
  • vectors of length 15
  • input data already stored in memory
  • suggested clock speed is 200 MHz
  • max CPU speed is 300 MHz
  • max Memory speed is 250 MHz

Critical Path
9
Next Step Examine Outer Loop
10
A Reconfigurable Solution?
  • Architecture Overview
  • Data intensive computation executed
  • on a dataflow architecture
  • ARM8 for overall control flow
  • configuration of satellites(operation
    connectivity)
  • Satellites are AGP, SRAM, MACP, ALUP, I/O Ports,
  • reconfig. network
  • Approach for mapping
  • Extract the dataflow graph for the algorithm
  • Parallelise the mapping to reduce latency
  • meet throughput requirements
  • Estimate timing based on configuration overhead
  • kernel execution

AGP
AGP
Arm8
SRAM
SRAM
MACP
ALUP
11
Kernel mapping for Pleiades
  • Key aspects
  • Broadcast
  • Duplicating data streams to decrease latency
  • Writeback mechanism
  • for the X_ri loop
  • 12 MACPs
  • 14 ALUPs
  • 10 MEMs
  • 10 AGPs

X_ri
X_ii
Y_ii
Y_ri
Si
Zx_i1
Zx_r1
Zx_r2
Zx_i2
Zmf_r
Zmf_i
Si
Ys_ii
Ys_ri
Y_ii
Z_i
Z_r
Zx_r
Zx_i
C_i
C_r
Y_ri
?
E_r
E_i
Yx_ii
U_r
U_i
Yx_ri
X_ri
X ? _i2i
X? _i1i
X_ii
X_ri
X _ii
X ? _ii
X ? _ri
C_r

CONSTANT
AGP SRAM
ALUP

MACP
?

12
Performance on Pleiades
  • Architectural Assumptions
  • Memory is updated with input data(yi) at 16MHz
  • Reconfiguration time insignificant compared to
    kernel execution time occurs very rarely
  • Timing Estimations
  • for 0.25u process
  • Throughput is dominated by the
  • MACP satellite transport time
  • on the network is bounded by throughput
  • Formulated based on the data dependencies in the
    kernel mapping

13
Performance Analysis
  • Critical path is the update of X_r limited by
    MAC throughput
  • Available computation time for Zi is 625ns
  • Meets the 1.6 MHz spec at 2.5 V
  • At 1.2V computation times are
  • 422 245 667ns for Zi
  • 245 ns to update xi
  • Hence increase Vdd to 2.5 to increase MACP
    throughput

14
Conclusions and Future Work
  • Seems likely that LMS algorithm can be
    implemented on a single VIRAM processor
  • Can first half of the algorithm be implemented on
    VIRAM in less than 28 cycles/iteration?
  • Worthwhile to explore MIMD
  • Pleiades architecture exploits the dataflow fully
  • Architecture of satellites can be better tuned
    for LMS algorithm to reduce hardware
  • Complete mapping/consequences of the rest of the
    algorithm
Write a Comment
User Comments (0)
About PowerShow.com