Table 1. Table of ultra high speed analog-to-digital converter chips, the last three entries are inexpensive comparators, essentially 1-bit ADCs. fs (GSa/s) BW (GHz) # bits Manuf. Part # ⇠cost 5 2.0 8 e2v EV8AQ160 $300 10 20 4 Adsantec ASNT7120-KMA $800 3500 12.5 8 8 Maxtek — $17k 20 8 5 e2v EV5AS210 $7k 3000 20 13 8 Agilent — https://www.cfa.harvard.edu/twiki5/view/SMAwideband 20 10 3+ oflow Hittite HMC5401LC5 $5k 30 14 6 Micram ADC30 $10k 2500 56 15 8 Fujitsu CHAIS — 20 10 flexibly 1 Hittite HMC874LC3C $40 The SMA Wideband Astronomical ROACH2 Machine (SWARM) processes 4 GHz usable in single polarization mode, reconfigurable as full Stokes dual 2000 12.5 14 1 Inphi 1385DX — SMA polarization. It uses open source technology from the Collaboration for Astronomy Signal Processing and (CASPER). A25707CP 4.6 GSa/s analog25 Electronics 18 Research 1 Inphi — SWARM: A Compact High Resolution CASPER Correlator and Wideband VLBI Phased Array AD EM IA SINIC A SMITHSONI AN OPHYSICA TR L O AS VATORY ER BS C A Instantiated Multipliers SWARM Development Team, SAO & ASIAA & SMA, Cambridge, MA and Hilo, HI A IA 1500 to-digital converter board uses a commercial chip from e2v on a PCB designed by ASIAA. The DSP platform is the second generation Reconfigurable Open Architecture Computing Hardware (ROACH2). Each of two channels per ROACH2 sample a 2.3 GHz Nyquist band from a1000custom block downconverter. due to misalignment in o↵set, gain, phase (OGP), or threshold Integral Non-Linearity (INL). All the cores are clocked by the same external clock input. In the single-channel mode, the D = 32 500 in-phase 1.25 GHz clock is sent to ADC core A and an inverted 1.25 GHz clock is sent to core B. D = 64 The 90 and 270 degree phase shifted clock signals are sent to cores C and D respectively. This D = 128 0 interleaved mode has an equivalent sampling frequency GSa/s.2000A CASPER compatible 500 1000of 5 1500 2500 3000 3500 4000 of PFB channels printed circuit board is available based on this ADC, designed Number by Jiang et al. [28]. Reference [27] details how the OGP and INL corrections are derived, and figure 8 shows the improvement 3500 in Spurious Free Dynamic Range (SFDR) Signal Noise and Distortion (SINAD) whichfor various Fig. 3. and Number of to multipliers versus number of PFB channels can be obtained using these corrections. values of demux. The dashed line is the upper limit if multipliers are 3000 Looking ahead, we are presently developing PCB based onROACH the 262. GSa/s Hittite implemented usinga DSP slices on the HMCAD5831 3-bit single core ADC with an FMC (Vita 57.1) standard interface to the Xilinx VC709 evaluation board. The VC709 2500 is based on the XC7VX690T FPGA which includes 80 GTH multi-gigabit serial transceivers, ten of which are brought out to the FMC. The GTH transceiver is a sophisticated subsystem capable of transmitting and receiving serial data at rates 2000 M = D have log N D + |{z} TD 2D for (3) PFB up to 13.1 Gb/s per channel. Both transmitter and receiver blocks essential |{z} | {z2 functional } reliable transmission of data across this interface—these necessary are not available in optimization FIR FFT features prior generations of SERDES transceiver, 1500 for example those on the Xilinx Virtex 6 family which is used on the CASPER ROACH2. The most important thing to note from this equation is that EV8AQ160 reening • Temperature Range for Packaged Device • Commercial C Grade: 0° C < Tamb < 70° C plications • High-speed Oscilloscopes CASPER Packetized FX Correlator and Beamformer (https://casper.berkeley.edu/) Figure 1-1. Simplified Block Diagram Block Diagram 2.5 GHz Clock Clock Buffer + Selection + SDA LVDS Buffers 1:1 or 1:2 DMUX 8-bit 1.25 Gsps ADC core Gain 8-bit 1.25 Gsps ADC core Gain T/H Offset Quad Description Gain 8-bit 1.25 Gsps ADC core Gain T/H Phase Offset LVDS Buffers 1:1 or 1:2 DMUX 8-bit 1.25 Gsps ADC core T/H Phase Serial Peripheral Interface LVDS Buffers 1:1 or 1:2 DMUX Phase Offset Instantiated Multipliers LVDS Buffers 1:1 or 1:2 DMUX T/H Phase Offset SWARM installation in laboratory, fed by sophisticated phase agile antenna simulator (J. Test poster) Analog MUX (Cross Point Switch) since 1000 the pipelined stages divide their computation over N/D 4.2. SMA Wideband Astronomical ROACH2 Machine clock cycles the total number of multipliers only grows with core interleaved e2v ADC EV8AQ160 The Quad ADC is constituted by four 8-bit ADC cores which can be considered independently (fourchannel mode) or grouped by two cores (two-channel mode with the ADCs interleaved two by two or one-channel mode where all four ADCs are all interleaved). D = 32 is N 500 as log N . The main contributer to multiplier utilization All four ADCs are clocked by the same external input clock signal and controlled via an SPI (Serial Peripheral Interface). An analog multiplexer (cross-point switch) is used to select the analog input depending on the mode the Quad ADC is used. SWARM block (above) and specifications demux factor.(below). Tablediagram 2. instead Table ofthe SWARM specifications Feature Number of antennas Bandwidth per receiver Number of sidebands Simultaneous receivers Baselines Spectral resolution Fastest dump rate Phased array bandwidth The clock circuit is common to all four ADCs. This block receives an external 2.5 GHz clock (maximum frequency) and preferably a low jitter symmetrical signal. In this block, the external clock signal is then divided by two in order to generate the internal sampling clocks: • In four-channel mode, the same 1.25 GHz clock is directed to all four ADC cores and T/H • In two-channel mode, the in-phase 1.25 GHz clock is sent to ADC A or C and the inverted 1.25 GHz clock is sent to ADC B or D, while the analog input is sent to both ADCs, resulting in an interleaved mode with an equivalent sampling frequency of 2.5 Gsps 0846I–BDC–12/10 e2v semiconductors SAS 2010 D = 64 D = 128 Specification Remarks Assuming an 8-tap FIR filter, the 2500 total number of instanti0 500 1000 1500 2000 3000 3500 4000 8 2 receivers each. Number of PFB channels ated multipliers is governed by D log2 DN +6D. A plot of this 2 GHz Dual polarization in each side band. relationship is shown in Fig. 3 for demux of 32, 64, and 128 2 90-270 Walsh splits SBs, Rx are DSB along the upper limit forpolarization implementing multipliers in Fig. 3. with Number of multipliers number of channels 2 Dual frequency or dualversus 230PFB &the 345 GHz for variou 2 values of28demux. line the DSP slices, ondashed the ROACH 2.upper limit if multipliers ar 56 per2016, Rx,The full Stokes, 112 is total implemented usingNyquist DSP slices on thechannels ROACH 2. 140 kHz 2.3 GHz / 16384 0.65 s B. Adders Single full Walsh cycle 4 GHz 2 GHz ⇥dual pol. To find the total adders we go through a similar calculation but noting that the butterflies have 32 as many adders as =D logonly ND + TD 2Dtaps and thus(3 PFBthe multipliersMand FIR needs to sum all |{z} | {z2 } |{z} performs D(T 1) adds.FFT The total is shown below, again with optimization FIR the contributions pointed out, The most important thing to note from this equation is tha since the pipelined3stages divide their computation over N/D APFB = D log2 N D + D(T 1) + |{z} D (4) clock cycles the total {z } only grows wit |2 number {z }of |multipliers reorder F IR N as log N . The mainFFT contributer to multiplier utilization i instead the demux factor. The added D results from extra adders in the block needing an we 8-tap FIR filter, the offitinstant FPGA DSP Utilization: Before committingAssuming to the ROACH2 needed to understand thattotal the bitnumber code would the to do the final reordering of the FFT output. A Asimilar plot Virtex 6 SX475T FPGA. Utilization is dominated by the PFB, (an FFT with improved channel isolation). hard reality ated multipliers is governed by D log2 DN +6D. A plot of thi to Fig. 3 can be shown for the adders however it would look is that the FPGA cannot run nearly as fast as the ADC. So D (demultiplex factor) of parallel streams are processed. D is relationship is shown in Fig. 3 for demux of 32, 64, and 12 also the ratio of ADC to FPGA clock rate.essentially Bandwidth (~D) much more expensive than spectral resolution (~1/N). the issame. along with the upper limit for implementing the multipliers i See https://www.cfa.harvard.edu/twpub/SMAwideband/MemoSeries/sma_wideband_utilization_1.pdf 3500 3000 SWARM ROACH2 assembly with two ADCs installed, and dual SFP+ networking mezzanine cards. Instantiated Multipliers 2500 Spurious Free Dynamic Range (SFDR), Signal to Noise and Distortion Ratio (SINAD) and Noise Power Ratio (NPR), and the impact of of quad core alignment (offset, gain and phase, OGP, and Integral NonLinearity, INL) (Patel et al., JAI, 3, 1) 2000 1500 1000 D = 32 D = 64 D = 128 500 0 SWARM on Mauna Kea, and first light fringe on maser source MWC349. 500 1000 1500 2000 2500 Number of PFB channels 3000 3500 4000 2