Home

document - Indico

image

Contents

1. 13 2 Implementation of a dual Gigabit Ethernet on the board The implementation is using the dual channel Ethernet MAC PHY chip from PMC Sierra The chip is equipped with the PL3 interface on the Link layer side and is driving independent GMII interfaces to each PHY For the PHY chip the Marvell 74 Alaska II 88E1020 Dual Port Transceiver can be chosen To control the PM3386 the local parallel bus from the ECS Glue Card is used in 16 bit multiplexed mode Parameter settings and statistics of the MAC transactions can be directly accessed by the ECS On the Link layer side only the transmission interface from the SyncLink FPGA to the MAC PHY is implemented The receiving data path is not used on the FPGA The PL3 compliant interface is used in 32 bit 104M Hz The firmware on 22The final choice for the physical implementation of the L1T event building network has not been made yet Gigabit ethernet is nevertheless assumed for this design 3POS PHY is a standard developed by PMC Sierra http www pmc sierra com 4nttp www marvell com 23 the SyncLink FPGA can be either developed specific for our application or using the PL3 to Atlantic interface FPGA core from Altera 13 3 Upgrade interface A PL4 interface is provided on a connector using the high speed I Os available on the SyncLink FPGA to allow future upgrades of the DAQ and LIT link The bandwidth of this interface is depending on the clock frequency of the LVDS signa
2. RegName Functionality Comment SyncLink FPGA access 0x00 Brdld Defines a unique board number 0x01 BrdType Defines a unique board Type 0x02 BrdRev Defines the firmware revision 0x03 BrdVersion Defines the firmware version 0x04 0x07 Reserved 0x08 ResetReg Issue resets board wide 0x09 ModeReg Sets the processing mode board wide Ox0A BrdStatusReg0 Status of the board See definition 0x0B BrdStatusReg1 Status of the board 0x0F FEMStatusReg Status of the FEM See definition 0x10 0x13 TTCErrCntReg Error counters of the TTCrx chip See definition 0x60 SyncLinkChipld Identifier for the chip 0x61 SyncLinkData0 Low byte data register on each FPGA 0x62 SyncLinkDatal Second byte data register on each FPGA 0x63 SyncLinkData2 Third byte data register on each FPGA 0x64 SyncLinkData3 High byte data register on each FPGA 0x65 SyncLinkAddr0 Low byte address register on each FPGA 0x66 SyncLinkAddr1 Second byte address register on each FPGA 0x67 SyncLinkAddr2 Third byte address register on each FPGA 0x68 SyncLinkAddr3 High byte address register on each FPGA 0x69 SyncLinkTCtrlReg Transfer control register on each FPGA 0x6A SyncLinkStatus Status of the chip PPO FPGA access 0x80 PPOChipld Identifier for the chip local address space access on each FPGA PP1 FPGA access 0x AO PP1Chipld Identifier for the chip local address space access
3. 4 GPIOto SynkLink FPGA LVTTL 36 Analyzer interface LVTTL 2 ECS I2C 3 3V LVTTL 3 Device address LVTTL 3x2 Reference voltages 1 25V 8x2 Terminationresister reference R 482 Total Table 10 The number of I O s used for the PP FPGA with the proposed partitioning of the board with 4 PP FPGA s The high pin count makes the use of low cost FPGA s which are only available in smaller packages impossible 31 Signals Purpose I O standard 4x 16 2 DAQ link interface LVTTL 4x 32 2 L1T link interface LVTTL 4x 6 1 SyncData link to PP FPGA s LVTTL 32 17 To RO Tx POS PHY L3 2 5V LVTTL 8 7 ECS 3 3V LVTTL 12 FEM interface 3 3VLVTTL 33 TTCrx interface 3 3V LVTTL 4 2 Throttle LVTTL 4x 1 1 LIA EvID LVTTL 4 Clock 40M Hz distribution LVTTL 4 Clock 80M Hz distribution LVTTL 4 Clock LVTTL 2 Processing mode LVTTL 4x2 LIT processing sync LVTTL 4x2 DAQ processing sync LVTTL 5x1 Initialization done LVTTL 4 Resets LVTTL 16 GPIO from PP FPGA s LVTTL 36 Analyzer interface LVTTL 2 ECS I2C 3 3V LVTTL 3 Device address LVTTL 8x2 Termination resister reference R 464 Total Table 11 The number of I O s used for the SyncLink FPGA with the proposed parti tioning of the board of 4 PP FPGA s 32 B Register and local parallel bus address space
4. 8 2 16 bit DAQ fragment link 2 54 win Sao ee ER RS 8 3 VeLo cluster formats for fragment links 8 4 Conversion of Strip number to physical position for the Velo 8 5 Outer Tracker cluster format for fragment links TTCrx interface 10 FEM for Beetle based read out 11 ECS interface 11 1 PLX parallel local bus ese a 4p Oty te a ep ee es 12 FPGA technology TD TV ANGORA Ana ha oe at a Sgn ae oR ae gee gece 12 2 Xilinx VirtexI 00 200 0000000000 12 3 Device choice 0 0 0 0000 eee 13 L1 trigger and DAQ interface RO Tx 13 1 Gigabit Ethernet 2 3 2 4 lt 454 fas ge eed Bk Gay 13 2 Implementation of a dual Gigabit Ethernet on the board 13 3 Upgrade interface 2 oie ae al ee og amp Se oF ores Ga ee 14 Resets 11 11 11 11 11 12 13 15 15 16 16 17 18 18 20 21 21 22 22 23 23 23 24 24 15 Clock distribution and signal termination on the board 16 FPGA configuration 17 JTAG boundary scan 18 Power requirements 19 Physical aspects of the ROB 20 FPGA implementation guidelines 21 Open questions A I O Tables B Register and local parallel bus address space definition C I2C address definition D Signal tables E Pin out for connectors on the board 24 25 26 26 27 27 28 31 33 36 37 41 1 Introduction Several sub detectors of LHCb as Velo ST TT Veto and OT have decided to use a very similar read out schemes for their dete
5. EP1S25 469 593 EP1830 593 EP1C20 301 biggest Cyclone device Table 3 The 780 pin FBGA package allows to migrate between several devices 10 7 Data synchronization and event synchronization For a better understanding of the synchronization mechanism on the board it is use ful to distinguished between data and event synchronization In this context data synchronization comprises the generation of the sampling clock of the ADC for the A RxCard selecting the valid data out of the continuous sampled analog signals and changing the clock domain on the input stage of the FPGA to the on chip common clock domain For the optical receiver card the data synchronization is given by the interface of the deserializer The event synchronization is a second step and performs consistency checks between the transmitted event identification in the header and the local reference This separation can be understood as a two layer transmission model where the data synchronization is on the Physical layer and the event synchronization is the DataLink layer of the OSI model 7 1 Data synchronization for the Velo The analog signal transmission over 40m twisted pair copper links suffer from a skew among channels on the same cable of order 5ns which has to be compensated by using channel individual phase adjustable clocks for sampling the signals These clocks are generated using the PLL circuits on the PP FPGAs The details of its implementation
6. 16 j ClusterData 1 L1 16 DL ClusterData N Ln 16 Figure 8 Link format for LIT and DAQ between PP FPGA and SyncLink FPGA 8 3 VeLo cluster formats for fragment links The LIT cluster in case of the Velo has the following format The DAQ event fragment Bit Description Comment 0 Cluster size Is 0 for clusters of one hit and 1 for two hits lt 12 1 gt Strip number Unique strip number on the ROB 13 Second threshold If one of the hits in the cluster ex ceeded the second threshold level this bit is set lt 15 14 gt Unused Table 5 Velo cluster format for the L1 trigger 13With a L1 accept rate of 40kHz 15 do have a more complicated structure In case for the Velo the following information is sent to the DAQ wrapped in 16 bit wide words Size Description 4 bit Cluster size Number of data words in the cluster 8 bit LIT information The result of the LIT pre processor algorithm for each strip value hit and second threshold 12 bit NStrip Strip number of the first hit in the cluster Nh x 8 bit Strip values Nh is the number of hits in a cluster 4 x 8 bit Neighbor strip values The two right and left neigh boring strip values to the cluster Table 6 Velo cluster format for the DAQ 8 4 Conversion of Strip number to physical position for the Velo Each strip number on the board 11 bit for 2048 has to be converted in a physical locat
7. 2 x O12 RxCard 2 8A 2 4A option 4 x A RxCard 3 9A 2 4A 1A option 4 x PP FPGA 8A 6A 2A power calc 12 x DDR SDRAM 1 4A 1 x SyncLink FPGA 2A 2A 0 5A power calc 1 x EEPROM for OA Used for FPGA config only 1 x TTCrx 50mA 1 x Optical Rev 9 mA 1 x FEM Beetle ImA estimation 1 x MAC PHY 0 37A 0 2 A 2 x PHY A A 3W 1 x OSC 125MHz 35mA 2 x Magnetics 6mA 1 x Upgrade module 0 2 A 1A 1 x CC PC 0 2A 0 5A estimate 1 x Glue Card 0 2A estimate 1 x EEPROM BrdID 0 4mA Total 10A 0 4A 12 2A 9 7A 3 9A 1A Table 9 Table of estimated currents for all components on the board and 5V analog power supply are distributed on the backplane and are separated from the digital The low voltage power supplies as 1 5V 10A 1 8V 0 4A 2 5V 12 2A and 6The number of erase cycle for the smaller EPC devices is significantly lower 100 26 3 3V 5 8A have to be generated on the motherboard 7 In order keep the power dissipation low these need to be implemented with PWM switched power supplies which have a typical efficiency of 85 to 90 The 1 8V is used only for the RO Tx and uses linear regulators Option 1 The low voltage power supplies 1 5V 20A 2 5V 20A and 3 3V 10A are located on the motherboard These supplies do run on 48V input voltage which leads to an estimated current on this supply of 1 5A per board Option 2 Distribute 5V and 3 3V on the backplane and use non isolated DC DC c
8. are given in 3 The data valid signal available on the Velo FE chip Beetle is not transmitted to the read out electronics over the analog links The principle to select the valid data from the continuous sampled signals is based on the data valid signal regenerated by the local FEM The data has further to be synchronized to the common 40M Hz clock domain which is done by the use of FIFOs on the input stage The synchronization is illustrated in figure 6 7 2 Data synchronization for the O RxCards The TLK2501 SERDES chip used on the O RxCards is generating the clock data enable and an error signal which are used to synchronize the data on the input of the PP FPGA The multiplexed data is de multiplexed and written to an input FIFO to allow the change of clock domain for the following processing stages The independent synchronization of each optical input link makes the skew between the optical channels not an issue 7 3 Event synchronization for the Velo After the valid data is masked the header words representing the pipeline column number 8 bit PCN can be verified This is done among the neighboring channels and the reference from the FEM 7 4 Event synchronization for OT The data headers from the OTIS TDCs include a 8 bit BCnt that will be compared to the BCnt generated on the board On error the BCntError bit will be set The BCnt will be used together with the TDC ID to do the de serialization from 80M Hz 11 PP F
9. b 001xxxx b 011xxxx b 101xxxx b 111xxxx b 00xxxxx b 01xxxxx b 10xxxxx b 11xxxxx b 0000000 b 000000x b 1010000 FPGA Firmware EEPROM Figure 11 Overview of the 4 I2C buses and their address spaces defined by hardwired pins on the motherboard Only one JTAG controlled device is on the boards e 12C for the RxCards RxSda RxScl e 2c for the TTCrx TTCSda TTCScl The serial EPROM for the board identification is connected as well to this bus 19 e 2C for the front end emulator Beetle chip FEMSda FEMScl e 2C FPGA all FPGAs are connected on I2C for debugging purpose FP GASda FPGAScl Parallel local bus The local bus generated by the PLX9030 18 PCI bridge provides a simple parallel bus Three chip select signals have to be made available The chip selects are used in the following way see figure 12 nCS1 SyncLink FPGA s99 gt h 7F ncs1 h 80 h 9F PP FPGA h A0 h BF h CO h DF h E0 h FF nCSaux0 h 0000 h FFFF nCSaux1 EA h 0000 h FFFF Local parallel bus Multiplexed mode 10 MHz Figure 12 Overview of the 3 local parallel nCS and their address spaces e nCS1 for the SyncLink FPGA and the PP FPGA s to access the the reg isters described in table 16 L1B on chip RAM and many other registers are accessed indirect through the defined address data
10. 1 Q L1A 1 L1AStr a Generator Clk_40 e EvCntReset BCntReset LOReset SyncData 6 SyncDataStr Sie Throttle OR N x oO ice x lt oO a u a a 2 Ww Analyzer IF Figure 7 Data flow overview of the SyncLink FPGA are chosen 32 bit wide for the L1T and 16 bit for the DAQ The functionality foreseen at the present to be implemented for the Velo ST TT and OT does not demand a high amount of logic resources on this FPGA An estimation is given in table 4 8 1 32 bit LIT fragment link With a transfer rate of 80M Hz and a cluster size of 16 bit the data transfer is restricted to 128 clusters plus the header each event leaving a margin of 10 cycles for start and 13 Functional block Logic El Block Block Block DSP PLL ements memory memory memory blocks LE 512 bit 4k 4k x 144 L1T fragment link 1000 0 8 0 0 1 L1T location conversion 0 0 8 0 0 0 DAQ fragment link 1000 0 8 0 0 0 Control generators 2000 0 8 0 0 2 RO Tx interface 1000 0 8 0 0 1 Total 5000 0 40 0 0 4 Available in 1820 18460 194 82 2 80 6 Table 4 Estimation of needed resources on the SyncLink F PGA stop the transfer Additional hits or clusters need to be discarded to allow the linking to be performed with a fixed latency of 900ns Fixing the event linking latency prevents from possible buffer overflows at the FIFO buffers The fragments with disc
11. TxCard DDR SDRAM SERDES L1T ZSupp OSI MAC PHY POS PHY Level 3 POS PHY Level 4 GMII Analog Receiver Card Optical Receiver Card Receiver Card stands for A RxCard and O RxCard Pre Processor FPGA Synchronization and Link FPGA Read Out Board Front End Emulator L1 Buffer L1 Trigger L1 Accept Data Acquisition TTC receiver chip Experimental Control System Texas Instruments SERDES chip Cern implementation of a radiation hard 1 6 Gbit s serializer Read Out Transmitter Card Double Data Rate Synchronous Dynamic RAM Serializer and de serializer circuit L1 trigger zero suppression Open Systems Interconnect Model Medium Access Controller Gigabit Ethernet terminology Physical layer device Gigabit Ethernet terminology Saturn compatible Packet Over Sonet interface level 3 used for 1 Gigabit Ethernet Saturn compatible Packet Over Sonet interface level 4 used for 10 Gigabit Ethernet Gigabit Medium Independent Interface 8 bit parallel PHY interface 3 Requirements The L1 ROB is used by several sub detectors in LHCb Special requirements are given by different sub detectors concerning interconnection and synchronization In most aspects the Velo imposes the strongest requirements and is therefore taken to guide the implementation In the following list important aspects for the various sub detectors are summarized to give a general overview of the most demanding aspects of each sub detector e Velo T
12. and transfer control registers e nCSaux0 is used for the local bus to the RO Tx e nCSaux l is reserved for a local bus going to the control interface connector 11 1 PLX parallel local bus The parallel local bus is used to access all user registers on chip memories and the L1Bs It is used in the 8 bit multiplexed mode running at 10M Hz The 8 bit address space is used to access the address data and transfer control registers in order to access the local address spaces on the FPGAs Each FPGA has an individual local address space with an address width of 32 bits To simplify the access of different size of registers the read and write operation on the chip can be done with a data width of 8 bit 16 bit or 32 bit The transfer type is marked in the transfer control register for each transaction To do a 32 bit wide access to the local memory space the following operations have to be issued 18See the documentation of the PLX9030 for the functionality of the local bus http www plxtech com 19The two auxiliary chip selects have to be generated from the the GPIO pins 20 Local address space access registers 7 0 31 24 23 16 15 7 7 0 31 24 23 16 15 7 7 0 TCtrl Addr3 Addr2 Addr1 Addr0 Data3 Data2 Datal Data0 Table 7 Data address and transfer control registers to access the local 32 bit address space e If it is a data write operation w
13. def inition SyncLinkTCtrlReg 0x69 PPOTCtrlReg 0x89 PP1TCtrlReg 0xA9 PP2TCtrlReg 0xC9 PP3TCtrlReg 0xE9 Bit Description Read Write Reset 0 Read If set to write to this register issues a read trans Yes Yes 0 fer 1 Write If set to write to this register issues a write trans Yes Yes 0 fer 4 Width8 Set for 8 bit wide ECS access Only the lowest Yes Yes 0 order byte is used 5 Width16 Set for 16 bit wide ECS access Only the two Yes Yes 0 lowest order bytes are used 6 Width32 Set for 32 bit wide ECS access All bytes are Yes Yes 0 used Table 12 FEMStatusReg 0x0F Bit Description Read Write Reset 0 FEMOKk This is the overall status bit to confirm the Yes No 0 proper operation of the FEM 1 FifoFull FifoFull bit from Beetle Yes No 2 nError nError bit from Beetle Yes No 3 WriteMon WriteMon bit from Beetle Yes No 4 TrigMon TrigMon bit from Beetle Yes No Table 13 33 TTCErrCntReg 0x10 0x13 Bit Description Read Write Reset 31 16 DbErrCnt Counter of the registered double errors of Yes No 0x0 the TTCrx Does not overflow 15 0 SinErrCnt Counter of the registered single errors of the Yes No 0x0 TTCrx Does not overflow Table 14 BrdStatusReg0 0x0A Bit Description Read Write Reset 0 TTCReady Indicates the proper operation of the Yes No TTCrx Table 15 34 Addr
14. memory bus systems is making use of parallel termination and is fully integrated in the memory and the I O cells of the FPGA With the use of the SSTL 2 I O standard and the TLK2501 uses 50 Ohm transmission between the optical receiver and the serializer all signal layers on the board are chosen to be 50 Ohm The clock distribution on the board is accomplished with PLL circuits on the FPGAs for de skewing and multiplying the clock signals see figure 13 The Clock40Des1 40M Hz clock from the TTCrx is taken as the reference for all circuits using the LHC system clock and is connected to the SyncLink FPGA The distribution to the various circuits on the board the PLL circuits on the SyncLink FPGA are used This allows to adjust the clock phase individual for each external circuit and ensures the proper timing between the them In addition 24 RxCard Mezzanine card Only for A RxCard for O RxCard clocks are inputs to PP FPGA 160MHz 60 100MHz 125MHz ECS TTC Gigabit Ethernet Figure 13 Overview of the clock distribution on the ROB Only clock signals are drawn to the 40M Hz system clock a x2 multiplied 80M Hz clock is distributed to the PP FPGAs This clock is used for the link interfaces for the LIT and the DAQ With this distribution scheme no external clock buffers are needed and a maximal flexibility can be achieved Event the ECS local parallel bus is running at 10M Hz only care has to be taken that no fast signal edges ar
15. to encode 11 Available cycles 900ns 12 5ns 76 Cycles not used 76 2 128 2 10 12The point to point links are terminated with serial resisters on the source side which is implemented in the FPGA I O structure The perfect termination allows to run at high frequencies 14 multiple hits in one cluster a significant data reduction can be obtained With the assumption that only binary information per hit needs to be sent to the L1T the non zero suppressed information on the fragment link is 384 bit or 24 16 bit words which is equal to an occupancy with single hit encoding of 6 25 This value is an upper limit to the necessary bandwidth 8 2 16 bit DAQ fragment link To link and transfer the DAQ fragments an average time of 25ys is allowed 3 The 16 bit wide links permit to transfer the event fragments without the need of deep FIFO buffers Also the bandwidth allows to transfer very long fragments a restriction on the length of the fragments can help to avoid overflows and is foreseen to be implemented as a value to be set on the FPGA In figure 8 the event fragment format is given for both links L1TFragmentFormat 32 DAQFragmentFormat 16 ErrorFlag 8 TriggType 4 DataLength DL 12 EvCnt 16 EvCnt 3 0 4 Cluster 2 16 Cluster 1 16 EvCnt 23 4 16 Cluster 4 16 Cluster 3 16 ErrorFlag 8 PCN 8 Cluster 5 16 Zz Cluster 6 16 ClusterData 1 2 16 f ClusterData 1 3 16 Unused or Cluster N 16 Cluster N
16. to the FEMDataValid signal as shown in the timing diagram figure 10 The FEMData bits have to be re ordered as indicated the upper 4 bits are not used The distribution of the 8 bit PCN to PP FPGA is done on the SyncData bus The strobe signal SyncPCNStr generated is used by the PP FPGA to latch the data on the SyncData bus The PCN is transmitted over the SyncData bus to the PP FPGAs and has to be multiplexed on the SyncLink FPGA The definition of the header data bits sent by the Beetle FE chip is given in the specification of that chip The PCN is sent over two clock cycles via 6 bit wide bus from the SyncLink FPGA to the PP FPGAs 11 ECS interface With the use of the LHCb specific CC PC and the adaption Glue Card all necessary interfaces are provided http literature agilent com litweb pdf 5988 2576EN pdf 18 Clk_40 LOAccept FemData lt 3 0 gt DataValid SyncData lt 5 0 gt SyncStr Figure 10 Timing for the FEM signals in the upper part and its output from the SyncLink FPGA to the PP FPGA in the lower part JTAG It is used to program the EEPROM containing the firmware code for the FPGAs JTAG is also used for boundary scan used for production testing but this chain is separate from the programming chain and not connected to the ECS I2C Multiple I2C buses are used on the board Four independent buses are provided by the Glue Card allowing individual buses for see figure 11 SyncLink FPGA b 00xxxxx
17. Data 80MHz Throttle L E ka ka FEM Sync a DataValid i LOAccept TTC Throttle To event building network Figure 2 Data flow of the ROB for the Velo read out Only one A RxC and PP FPGA is shown The FIFO data buffers on the in and output of the logic blocks are indicated as small dark rectangles 4 2 DAQ readout The readout starts with the L1T decision distributed over the TTC broadcast command which is interpreted on the SyncLink FPGA Over a serial link the EvCnt and trigger i A A a A A A 6 single o C or 1 2 12 way X2 or X4 optical fibers from FE O RxCard Mezzanine card DDR soMHz 16 pee se set et et X4 120MHz 16 bit data 13 bit addr Sync Sync sync Sync Syne 40MHz PE FTT 80MHz To and from all 4 PP FPGA From all 4 PP FPGA Data 80MHz Throttle d L1T Lin P 120MHz To PP FPGA left and right 8 oon i PP FPGA Data 80MHz a a Fe SyncLink FPGA TTC Throttle To event building network Figure 3 Data flow of the ROB for the optical read out The diagram shows the data flow for 6 optical links and the linking on the board type is transmitted to the PP FPGA The Arbiter reads the requested events stored at the start address given by the EvCnt With the EvCnt also stored in the event header the correct operation can be checked for each event The events are collected in the DAQ PPL
18. HCD ACY IPHE LHCb 2003 007 IPHE 2003 02 February 17 2003 Common L1 read out board for LHCb specification Aurelio Bay a Jorgen Christiansen b 7 Guido Haefeli a Federica Legger a Laurent Locatelli a Ulrich Uwer c Dirk Wiedner c a Institut de Physique des Hautes Energies Universit de Lausanne b Cern Geneva c Kirchhoff Institut fur Physik University Heidelberg Abstract This document specifies the the L1 readout board used by several sub detectors of LHCb It specifies the interface to the sub detector specific receiver cards and all the common interfaces for the LHCb environment 1B mail Aurelio Bay iphe unil ch E mail Jorgen Christiansen cern ch 3E mail Guido Haefeli iphe unil ch 4E mail uwer physi uni heidelberg de 5E mail dwiedner physi uni heidelberg de Contents 9 Introduction Shortcuts Requirements L1 data flow architecture 4 1 L1 buffer Tac Sek ee at hee a hay a a ha 42 DAQ reado t s o mira sar etn Sho see te a hit a le A RxCard and O RxCard Hilt SECS ACGess anced e Pu ea amp diweeg a AA R AA E PP FPGA Gel PO count seca ea eR oe og Be eg Be Data synchronization and event synchronization 7 1 Data synchronization for the Velo 2 7 2 Data synchronization for the O RxCards 7 3 Event synchronization for the Velo 7 4 Event synchronization for OT Se do ge x We ae SyncLink FPGA 8 1 32 bit LIT fragment link
19. PGA Clk Generator ADC_clk Analog input data ADC lt 8 10 bit Check PCN ADD BCnt EvCnt SyncPCNStr sycsons _ smen _ E SyncData lt 5 0 gt Coa T em Taco T Bete omae T Tn T TTCrx interface Figure 6 ADC input data synchronization for the Velo to 40M Hz After error checking and verification of the correct data header for each Otis the EvCnt and BCnt will be added to the data and stored in the L1B 8 SyncLink FPGA Cluster A cluster is formed when one or multiple neighboring detector channel carry a signal The proposed cluster size for the L1T is one 16 bit word The cluster size for the DAQ is variable depending on the number of hits in the cluster but is transmitted in 16 bit words Event fragment All clusters for one event on one PP FPGA is called an event frag ment This FPGA is used to distribute control signals interfacing the TTCrx and the FEM linking the cluster fragments from the whole board and sending the data to the RO Tx The cluster collection uses FIFO based interfaces from the PP FPGAs to the SyncLink FPGA The FIFOs are either located on the input stage of the SyncLink FPGA as a baseline or optional for sub detectors using the SyncLink FPGA for the DAQ zero suppression on the PP FPGA The links to collect the clusters on the L1T interface 12 BrdRst Reset Generator a in LiEviDReset S LiAccept
20. RxAddr5 3 3V RxClkO 23 193 195 197 199 Data0 1 Data0 3 Data0 5 Data0 7 Data0 9 Data0 11 Data0 13 Data0 15 RxDv0 Enabled 24 NC 3 3V RxScl RxAddr6 3 3V Channels are not connected for 3 input receiver card Figure 16 Pin out for the O RxCard signal connector 42 Power Plates Plate Name A RxCard O RxCard Figure 17 Power plate signal definition 43
21. TTC LIT Figure 1 An overview of the building blocks on the L1 ROB constant tables and the L1 buffers The TTCrx A RxCard and the FEM are connected to individual I C buses for direct ECS access 4 1 L1 buffer A block diagram of the principle of the L1 buffer controller is shown in figure 4 The data coming from two synchronization channels are written to the L1B by one L1B controller Its Arbiter allows to schedule the required transaction It checks on the state of the InFifo indicated with the UsedWords signal and performs the read out of the L1 accepted events only if the InFifo does not risk to overflow A possible sequence can be seen in figure 5 With a clock frequency of 120M Hz 7 enough cycles are available for arbitration and refreshing In table 1 the necessary cycle count for each task on the SDRAM is given The chosen SDRAM frequency leads to a sufficient high bandwidth of the memory and allows to keep the InFifos small about 4 events 2 8 With a clock frequency 120M Hz the data transfer rate is at 240M Hz 8The simulation for the L1B access with this scheme has to be verified by simulation 16 Analog Electrical link F fN Nooo from FE X4 A RxCard eel card pat at 40MHz X4 DDR 32 f 120MHz 16 bit data Clk Sync Gen PP To PP FPGA left and right oante oh er os P PP FPGA To and from all 4 ff Toandttom al 4PP FPGA From all4PP FPGA From all 4 PP FPGA
22. accessible on the ECS registers TTCErrCntReg TTCReady is accessible on BoardStatReg Clock The Clock40Des1 is used for the board wide 40M Hz clock called clk_40 The PLL based clock management circuit on the SyncLink FPGA allows to distribu tion the system clock to all necessary locations without external clock buffers The Clock40 and Clock40Des2 are also connected to the SyncLink FPGA but are not used yet BCnt The bunch counter is available on the BCnt bus during the BCntStr high synchronized to the Clock40Des1 and reset by BCntRes The bunch counter is transmitted to the PP FPGAs via 6 bit wide SyncData bus and therefore has to be multiplexed during two clock cycles The timing diagram for the SyncData and its strobe signal SyncStr is given in 10 EvCnt The low part 12 bit of the event counter is available on the BCnt bus during the EvCntLStr high and the high part during EvCntHStr high The BCnt signals are synchronized to the Clock40Des1 and reset by EvCntRes The event counter is also transmitted on the SyncData bus over four clock cycles Brcst The setting of the TTCrx is made such that the broadcast command data signals are all synchronous to Clock40Des1 the appropriate settings are made on the control registers of the TTCrx The broadcast command is used to decode the LHCb L1 accepted events see the broadcast command figure 9 L1 accept The TTCrx signal called Llaccept is named LOacceptLHCb to avoid any problems with the LHCb n
23. aming convention It is used for the Velo FEM and is also used to generate the EvCnt independent of the TTC BCnt which allows to verify the correct synchronization 14The chip is packaged in an 144 pin FBGA package 13mm x 13mm Signals named Str at the end are strobe signals and are used to latch the corresponding data bus 16 At overflow the value of the counter remains at Oxffff 17 For the optical receiver the Agilent HFBR 2316T is used which is recommended for the use for the TTC system in the counting room Trigger type Reject LSB of Physics event ID L1 front LO front end reset Calibration pulse type 0 Default cmD2 Reso o o 1 o x cmos Res o o 1 1 x Figure 9 Broadcast command interpreted by the SyncLink FPGA defined for LHCb 10 FEM for Beetle based read out The FEM used by the sub detectors with the Beetle FE chip 10 is controlled with I C and interfaced to the SyncLink FPGA Its task is to generate the DataValid signal which is not transmitted with the detector data In addition the the PCN is extracted to check the synchronization between the FEM and the data from the FE In addition the available status signals from the Beetle are also connected to the SyncLink FPGA and made available in a register for status monitoring In table 22 in the appendix the signals on the FEM interface are given The PCN is available on the FEMData bus and has to be sampled with respect
24. arded clusters are flagged as such in the ErrorF lag word If a higher bandwidth on these links is needed the clock frequency can be increased up to 160MHz without any changes on the link hardware 1 The increase of the bandwidth on the board is general less critical then the link bandwidth to the L1T event building network To transmit 128 16 bit clusters event needs a minimal bandwidth of 4 Gbit s Velo To find the most appropriate cluster encoding schema the distribution for 1 2 or multiple hit clusters has been simulated and the most appropriate data model has been discussed 4 With an expected occupancy of order 0 6 or an average of 15 clusters board event 8 the most reasonable cluster encoding is the following e One hit clusters are marked as of size 1 and its strip number is transmitted e Two hit clusters are marked as size 2 and the strip number of only the first strip is transmitted e Clusters with three and more hits are split up into clusters of size one and two To allow a flexible limitation of the readout data the maximal number of clusters sent to the L1T can be limited at the two linking stages The limits can be set per PP FPGA and for the whole ROB on the SyncLink FPGA ST TT As for the Velo OT With 6 optical links per PP FPGA a total of 1536 channels are processed on the ROB The restriction to 128 hits event per PP FPGA allows to read out a maximal local occupancy of 33 With a zero suppression that allows
25. as been defined for the L1T The physical implementation of the MAC and the PHY is chosen to be implemented directly on the motherboard This reduces significantly the cost and the occupied space on the board To overcome the disadvantage of a not upgradeable implementation a second interface for future use is implemented 13 1 Gigabit Ethernet With a direct implementation of the link on the motherboard several possibilities are feasible which can not be done on a mezzanine card due to the electrical constraints of interface signals A large number of devices exist with a PCI interface used on basically all Gigabit Ethernet NICs The disadvantage is the high number of pins on the FPGA used for the PCI X interface 97 and the need of an additional PCI bridge to implement a control interface from the CC PC to the MAC PCI bus For the switch market dedicated interfaces for Link and PHY layer devices POS PHY Level 3 PL3 POS PHY Level 4 PL4 Utopia Level 2 SPI Level 3 are currently used The POS PHY Level 3 is a unidirectional FIFO like point to point interface running at up to 104M Hz using a 8 16 or 32 bit wide bus The maximal transfer rate is up to 2 4 Gbit s and is sufficient to transfer data of two Gigabit Ethernet channel All recent FPGAs also support the PL4 interface which is the corresponding interface for 10 Gigabit Ethernet The Level 4 interface is based on a 16 bit data bus signalling with differential LVDS at up to 622M Hz
26. ctors In order to minimize the amount of L1 electronics in the radiation area the LO accepted data is transmitted directly over long analog copper in case of the Velo or digital optical links for all the other sub detectors to the counting room With the use of the same read out chip for Velo ST and TT the development of a common L1 electronics read out board has been started already in the early design phase Due to the circumstance of having two different link implementations the L1 electronics read out board L1 ROB has been designed to be adaptable for the two link system In case of the Velo the receiver side digitizes the analog signals and for the optical links the data serialized with the GOL chip close to the detector is de serialized with the TLK2501 SERDES from Texas Instruments Lack of choice several other sub detectors do also foresee the GOL TLK2501 transmission leading to an identical hardware interface to the L1 ROB For synchronization L1 buffering L1T zero suppression and DAQ zero suppression several large FPGAs are placed on the motherboard This allows the adaption of the board to the special needs of processing the data Also some parts of the FPGA firmware has to be developed specific to the sub detectors a common framework including all interfaces can be used by all user of the board 2 Shortcuts A RxCard O RxCard RxCard PP FPGA SyncLink FPGA ROB FEM L1B LIT L1A DAQ TTCrx ECS TLK2501 GOL RO
27. ded in one part to be common to all sub detectors and an other part with specific firmware In figure 20 the blocks in the data flow diagram are shown with a box with color gradient red 27The 3 3V can be distributed on the backplane since the current is not particularly high 27 A RxCard for 16 ADC Channel PWR PWR A RxCard for 16 ADC Channel A RxCard for 16 ADC Channel m Hm A RxCard for 16 ADC Channel v m lE Ethernet EA PWR 1 8V 21 Open questions Question to sub detectors Is the cluster size of 16 bit suitable for all sub detectors Question to ECS Can we easily create two or three more chip selects for the local bus Remark ECS Address for board identification is 1010000 given by serial EEPROM References 1 Aurelio Bay Guido Haefeli Patrick Koppenburg LHCb VeLo Off Detector Elec tronics Preprocessor and Interface to the Level 1 Trigger LHCb Note 2001 043 2 P Vazquez J Christiansen Simulation of the LHCb L1 front end LHCb Note 2001 126 3 Guido Haefeli FPGA based clock delay generator for multichannel processing on LHCb VeLo L1 ROB Note in preparation 4 Mike Koratzinos The Vertex Detector Trigger Data Model LHCb Note 89 070 28 11 Figure 14 Sideview of the O RxCard Jorgen Christiansen Requirements to the L1 front end el
28. e causing overshoot and undershoot that can destroy the devices on the bus 7 Signal integrity simulation need to be done in order to ensure its proper functioning 16 FPGA configuration For the configuration of the Altera Stratix FPGA one enhanced configuration device EPC16 is sufficient with the assumption that the PP FPGAs do have the identi cal firmware The EPC16 device is programmed over JTAG controlled by the ECS Optional two connecters are available on the motherboard to download the firmware directly to the PP FPGAs or the SyncLink FPGA The EEPROM used on the EPC16 25Remark that the PLX9030 is one of the driver of the local bus Because the local bus is specified to operate at a frequency of up to 60M Hz the edges of the local bus can be much faster than it is needed for the 10M Hz operation 25 is a 16 Mbit flash memory manufactured by Sharp The minimal number of erase cycle is 100 000 7 17 JTAG boundary scan All devices supporting JTAG boundary scan are chained together For production testing the external boundary scan cable device is connected to this chain with a 10 pin connecter 18 Power requirements A list of all power supplies and its estimated current is given in table 9 For the FPGAs a power calculation spread sheet has been used for the estimation The 5V Description LEVI LSY ZEV eae 5V 5V Comment 2 x O6 RxCard 1 4A 1 2A option
29. ectronics LHCb Note 2001 127 Raymond Frei Guido Gagliardi A long analog transmission line for the VELO read out LHCb Note 2001 072 B Jost N Neufeld Raw data transport format LHCb Note 2003 014 Niels Tuning Velo cluster studies LHCb Note 2002 J Christiansen A Marchioro P Moreira and T Toifl TTCrx Reference Manual CERN EP MIC Geneva Switzerland Niels van Bakel Daniel Baumeister Jo van den Brand Martin Feuerstack Raible Neiville Harnew Werner Hofmann Karl Tasso Kn fle Sven L chner Michael Schmelling Edgar Sexauer Nigel Smale Ulrich Trunk Hans Verkoojen The Beetle Reference Manual Prentice Hall 1993 Howard W Johnson Martin Graham High Speed Digital Design a handbook of black magic LHCb 2001 046 29 Input from RxCard 16 errr Syne To PP FPGA left and right PP FPGA Reset a Generator SyncData Generator Throttle 12Cc Parallel To RO Tx 30 A I O Tables Signals Purpose I O standard 16x11 RxCard 3 3V 2 5V LVTTL 3x43 DDR SDRAM 16 bit 2 5V SSTL_2 2x10 PP FPGA to PP FPGA LVTTL 8 6 ECS 3 3V LVTTL 1 Throttle LVTTL 1 1 L1A EvID LVTTL 16 2 PP DAQ link LVTTL 3242 PP L1T link LVTTL 6 1 Event synchronization LVTTL 3 Clock LVTTL 2 Processing mode LVTTL 2 L1Tprocessing sync LVTTL 2 DAQ processing sync LVTTL 1 Initialization done LVTTL 4 Resets LVTTL
30. he L1 electronics interface is analog and digitization must be done on the receiver part A RxCard The number of input data and clocks signals is higher than for other sub detectors since the receiver card is working as digitizer and the data is sent out to the motherboard at 40M Hz on 32 bit wide buses The Velo must provide the information to the L1 trigger An advanced common mode suppression algorithm is foreseen to be imple mented for the L1 trigger pre processor and the DAQ interface The synchronization of the sampled data needs a local front end emulator to generate a data valid signal e ST TT The data is sent multiplexed on 16 2 bit wide buses at 80M Hz from the optical receiver card O RxCard to the motherboard The higher frequency and fast signal edges of these signals need to be taken into account With 24 optical links on the ROB the L1B needs to be designed for this data stream which is higher than for the Velo TT must provide information to the L1 trigger e OT The high occupancy on this detector imposes a high bandwidth for the whole readout path 4 Li data flow architecture In figure 1 a block diagram of the ROB is given to show its partitioning in different daughter cards and FPGAs Four or two independent receiver mezzanine cards A RxCard or O RxCard can be plugged onto the motherboard The receiver card is directly connected to the PP FPGA which is the main processing uni
31. ink and zero suppressed in DAQ ZSupp The data from all PP FPGAs on RdData Data to DAQ_PPLink WrEvCnt L1B controller RdEvCnt From RxCard 32 bit 40MHz To L1T PPLINK UsedWords From SyncLink_FPGA Rd Wr access for ECS Figure 4 Detailed L1B controller block diagram Rate Task Cycle count Remark Each event Write CHO 2 5 34 Data transfer is 2 word per cy cle Each event Write CH1 34 Performed after CHO writing Every 2 events Active 3 Activate the row open Every 2 events Precharge 3 Deactivate the row close Every 25 events Read CHO 6 2 5 34 Every 25 events Read CH1 34 Performed after CHO reading Every 8 events Refresh 10 Refresh once per 7 8us Cyles available 108 per event 900ns 8 3ns Average Cycle 78 per event 72 Table 1 SDRAM cycle access statistic 4 5 time 900ns Figure 5 Example how the Arbiter schedules the required transactions the board are linked and encapsulated on the SyncLink FPGA to be finally sent to the RO Tx To unload the task of the PP FPGA and its need of resources the DAQ zero suppression can optionally also be performed on the SyncLink FPGA Doing so all raw data from the whole ROB can be linked before being zero suppressed 5 A RxCard and O RxCard At present two different receiver daughter card implementations are foreseen to be plugged on the motherboard This is necessary due
32. ion according to the alignment tables calculated off line and downloaded to the SyncLink FPGA The size of the alignment table of 2k x 12 bit or 32kbit which corresponds to a physical resolution of about 5um This table can be implemented either in 8 x 4kbit memory blocks or in one 512 kbit also available on the SyncLink FPGA 8 5 Outer Tracker cluster format for fragment links In preparation 16 9 TTCrx interface The TTC receiver chip synchronization signals are connected to the SyncLink FPGA table 21 The distribution of clock trigger and event synchronization signals is done with point to point links to each PP FPGA The clocks can be individually phase adjusted to ensure the correct clock phase between the FPGAs on the board The configuration registers can be loaded over an ECS driven C bus For production testing the JTAG boundary scan interface is connected to the overall JTAG chain The used of a configuration EEPROM is not foreseen and the configuration registers have to be loaded at each power up The TTCrx is directly mounted on the board to reduce cost and board space For further documentation refer to the TTCrx user manual 9 The The following synchronization tasks are implemented on the SyncLink FPGA using the TTTrx signals 1 TTCrx reset All resets on the board are distributed from the SyncLink FPGA see section Resets TTCrx status DbErrStr and SinErrStr are counted with saturated 16 16 bit counters and are
33. ls up to 16 x 622 Mbit s 10 Gbit s In addition a second connector with the local parallel bus control interface is placed such that a PMC sized mezzanine card can be connected The connector for the PL4 interface can not comply to the PMC standard for signal integrity reasons 14 Resets e All resets are distributed from the SyncLink FPGA e Each FPGA has a local reset accessible by ECS e One board push button 15 Clock distribution and signal termination on the board Special care has to be taken for the clock and fast signal distribution on the board The typical rise fall time for fast signals from and to the FPGAs and ASICS as the TLK2501 is Ins This leads to a maximal trace length of 2 4cm that can be considered as electrical short with the 1 6 rule 11 All long signals have to be terminated in an appropriate fashion The preferred termination scheme for LVTTL signals is to use point to point interconnects with source termination The value of the serial resistor is depending on the driver s impedance and the trace impedance In most the cases on this PCB a serial resistor of 33 Ohms is appropriate Parallel termination can not be applied due to the lack of driving strength and too high power dissipation that results by choosing an other electrical standard All signals driven by the FPGAs can be terminated by programming the I O cell to use the on chip termination option For the DDR SDRAM the SSTL 2 I O standard developed for
34. number of required pins on the connecter 5 1 ECS access I C is used for the slow control the RxCard The four cards share the address space of one dedicated I C bus where the two highest address bits bit 6 and 7 are defined on the motherboard see table in appendix 17 All other bits have to be set on the receiver cards 6 PP FPGA With a long list of tasks this FPGA demands for a high amount of resources Detailed studies for the implementation of the zero suppression for the the L1T called L1PPI have been done in 1 to estimate the amount of logic gates and memory needed on the PP FPGA In table 2 an overview of the estimated resources is given for the For the digital signal connectors a 200 pin 0 643mm pitch connector has been chosen see http www samtec com ftppub pdf QTS PDF and http www samtec com ftppub pdf QSS PDF l0Using a 10 bit ADC is optional for the A RxCard and therefore it needs to be supported by the ROB The further processing will be done on 8 bit resolution implementation of the LCMS algorithm also described in 1 The implementation has been optimized for the Altera APEX20K FPGA architecture but also allows to estimate the resources used in an other FPGA Using the Altera Stratix FPGA devices allows to implement the MAC multiply accumulation operations with the embedded DSP blocks This reduces significantly the LEs logic elements used for the design Functional bl
35. nx VirtexII for Altera Stratix 16 bit adder reg to reg 239 239 32 1 MUX reg to reg 323 216 64 x 8 Distributed RAM 294 32 x 18 M512 Block RAM 242 1k x 9 Block RAM 250 128x18 M4k Block RAM 222 16k x 36 512kbit RAM 212 18x18 Multiplier 105 206 Table 8 Xilinx VirtexII Speed grade 5 second fastest out of 3 compared to Altera Stratiz Speed grade 7 slowest out of 3 12 3 Device choice Several reason have driven the decision to use Altera Stratix devices on the board Migration The migration between devices in the low density device region of the Stratix family allows to have relatively low cost migration to higher density de vices The VirtesII family devices with equivalent size are in the high density region of the family and tend to get very expensive Memory With the three different memory block sizes the memory bits can more efficiently be used in our application DDR SDRAM interface Dedicated read data clock DQS delay circuits for DDR SDRAM 2lhttp waw xilinx com 22 PLL vs DLL PLL are more suitable for clock distribution since they do not suffer from additional jitter after each frequency translation step Cost and speed The slowest speed grade Stratix device is sufficient fast 13 L1 trigger and DAQ interface RO Tx The interface to the DAQ and the L1T is implemented using Gigabit Ethernet run ning on copper To reduce the overhead of the protocol a compact transport header h
36. ock Logic Block Block Block DSP PLL Elements memory memory memory blocks LE 512 bit 4k 4k x 144 Ll trigger ZSupp 6000 60 8 0 60 1 DAQ ZSupp 3000 20 12 0 20 0 L1B 3000 0 12 0 0 1 Synchronization 1000 0 12 0 0 0 ADC clock gen 200 0 0 0 0 2 Total 13200 80 44 0 80 4 Available in 1820 18460 194 82 2 80 6 Available in 1825 25660 224 138 2 80 6 This is the estimated number with the assumption that the processing is done with 160M Hz The zero suppression can also be done on the SyncLink FPGA which reduces the resources needed on the PP FPGA Table 2 Estimation of needed resources on the PP FPGA For sub detectors not con tributing to the L1 trigger the logic resources on the chip are available for other tasks 6 1 I O count To determine the package size of the FPGA a detailed count of the I O is listed in table 10 The number of data signals plus the I O pins used for reference voltage of the SSTL 2 and the reference impedance for the source termination are also included in this calculation The calculated number of I O is supported by several packages and devices of the Altera Stratix FPGAs To allow the migration between different devices the necessary number of I O has to be available by all desired devices Altera 400 Pin 672 Pin 780 Pin Comment Stratix FineLine FineLine FineLine Cyclone BGA BGA BGA Device EP1S10 341 422 not enough I O EP1S20 422 582
37. on each FPGA PP2 FPGA access 0xC0 PP2Chipld Identifier for the chip local address space access on each FPGA PP3 FPGA access OxE0 PP3Chipld Identifier for the chip local address space access on each FPGA 35 Table 16 8 bit address space of the local parallel bus for nCS0 C I2C address definition RxCard I C addr 0 ObOOxxxxx 1 ObO1xxxxx 2 Ob10xxxxx 3 Obl 1xxxxx Table 17 The two highest order bits of the RrCard I2C bus are hardwired on the mother board 36 D Signal tables Option Signal name of pins I O Standard Digital GND Cu plate pwr All Digital 3 3V Cu plate pwr Digital 2 5V Cu plate pwr I C RxSda 1 inout 3 3V LVTTL PC RxScl 1 out 3 3V LVTTL I C RxAddr 2 const 0 or 3 3V A RxCard 16 x 8 bit Data 128 input 3 3V 2 5V LVTTL Clk 16 output 3 3V 2 5V LVTTL Apad TE SOT Data 160 input 3 3V LVTTL Clk 16 output 3 3V LVTTL Oedipus Data 108 pul 3 3V LVTTL Enable 6 input 3 3V LVTTL LoopEn 6 input 3 3V LVTTL PrbsEn 6 input 3 3V LVTTL nLckRef 6 input 3 3V LVTTL Clk 6 input 3 3V LVTTL Total maximal 180 Table 18 Signals on digital signal connector for the RaCard Signal name Number of pins Remark Analog GND 4 Digital 5V 2 Is only used by the O RxCard Analog 5V 2 used for ADC s is
38. onverter on the 5V for generating the 1 5V and 2 5V requires 11AQ 5V Option 3 Distributing only 5V digital on the backplane and generate all other voltages with non isolated switched power supplies requires 15A 5V The total power consumption estimated is 75 Watts per board 19 Physical aspects of the ROB The layout of the board is driven by two major constraints e The A RxCard needs to have a maximum width to allow a reasonable analog circuit layout No other connectors can be allowed on the same panel e All other interfaces and the power supply has to be squeezed on the other side of the board The approach taken is the following The data signals are connected to the front panel For the optical receiver cards the optical fibres take a small space The analog signals are connected with 37 pin DSUB connectors 4 per ROB On the back side the top region is reserved for the power back plane The optical and electrical connectors for the TTC ECS L1T DAQ and Throttle are plugged manually from the back which is accessible since there is no transition module in place 20 FPGA implementation guidelines To allow several groups to work on the software and firmware development for the ROB it is necessary to define the interfaces of the board chips and functional blocks on the chip The timing specification an protocol are kept simple by the use of on chip real dual port FIFOs The development of the FPGA code firmware can be divi
39. r the Apex is shown For details see the specification and application notes on the Altera web site 7 On chip memeory Fast and flexible embedded memory block structure with block sizes of 512bit 4kbit and 512kbit Power and I O Low power consumption due to low core voltage I O Support of a wide range of current signaling standard at its I Os Fast The device allows to process the L1T zero suppression at 160M Hz Therefore 4 data channels can be processed multiplexed on one algorithm processing block 2 nttp waw altera com 21 Termination Termination of the interconnects of the traces on the PCB is possible on the chip This increases significantly the allowed density of fast signals PLL Allows a flexible clock management and replaces clock buffers on the board DSP blocks Embedded multiply accumulate blocks help the to be less critical for speed and reduces significantly the number of needed LEs 12 2 Xilinx VirtexIl The Xilinx VirtexII family is also suitable for the needs of the ROB Devices with the necessary resources are available The architectural differences between Stratix and VirtexII are given by the size of the embedded RAM blocks the width and modularity of the DSP multiplier blocks DDLs instead of PLLs To compare the two device families a table of performance in maximal frequencies is given Function Frequency in MHz Frequency in MHz for Xili
40. rite the data to the data registers According to the width of the data transferred only one two or all four data registers have to be set e Write the whole 32 bit wide address to the 4 address registers e Write to the transfer control register the required command according to defini tion in reftctrlreg to issue the read or write transfer e If it is a data read operation read the data from the data registers that are set after the transfer command has been issued 12 FPGA technology 12 1 Altera The evolution of FPGA technology has driven the devices to higher density faster on chip processing and faster I O capability The development is mostly driven by the telecommunication industry which is also doing multichannel processing on the FPGAs There is nevertheless a major difference on the demand of I O performance For the ROB only single ended 40M Hz 80MHz and 120M Hz interconnect signals are used The standard currently supported by FPGA families are e g 840 Mbps or 3 125Gbps This circumstance should not mislead to the conclusion that these chips are overkilled to use Price investigation for high density FPGA device for the present and the near future show that the most recent devices family will cost less than e g Altera Apex devices This can be explained with the miniaturization of the silicon process to 0 13 m which allows to reduce production cost In an uncomplete list of features the advantages of the Stratix devices ove
41. st strobe 1 BrestStr2 1 Input Broadcast strobe 1 Clock40 1 Input Non de skewed clock Clock40Des1 1 Input De skewed clock 1 Clock40Des2 1 Input De skewed clock 2 DbErrStr 1 Input Double error strobe 1 EvCntHStr 1 Input EvCnt high strobe EvCntLStr 1 Input EvCnt low strobe EvCntRes 1 Input EvCnt reset L1 Accept 1 Input L1 accept LOAcceptLHCb Reset _b 1 Output Chip reset SinErrStr 1 Input Single error strobe TTCReady 1 Input Ready signal TTCSda 1 l Ready signal TTCsel 1 l Ready signal Total 35 Table 21 TTC signals All but the I2C bus signals are connected to the SyncLink FPGA 39 Signal Name Use I O seen from Standard the SyncLink FPGA FEMData lt 3 0 gt In SyncLink 4 3 3V LVTTL FEMDataValid In SyncLink 1 3 3V LVTTL FEMCIk Clock Out 1 3 3V LVTTL FEMRst Reset Out 1 3 3V LVTTL FEMLOAccept Trigger Out 1 3 3V LVTTL FEMFifoFull Status Out 1 3 3V LVTTL FEMScl PC 1 3 3V LVTTL FEMSda PC 1 3 3V LVTTL Total 11 Table 22 FEM signals 40 E Pin out for connectors on the board 193 194 3 3V RxSda 195 196 RxScl RxAddr5 197 198 RxAddr6 3 3V 199 200 3 3V Figure 15 Pin out for the A RaCard signal connector 41 O RxCard Channel Signal Data0 0 Data0 2 Data0 4 Data0 6 Data0 8 Data0 10 Data0 12 Data0 14 RxEr0 LckRef0 ao a ie 3 5 3 3V RxSda
42. t on the board Each PP FPGA uses several independent L1B and L1B controller to store the data during the L1 latency After zero suppression for the L1T and the DAQ the data is linked and encapsulated on the SyncLink FPGA The same FPGA is also used to process the TTCrx ECS and FE emulator information to issue resets synchronize the PP FPGA processing and distribute clocks and L1T decisions The data is sent to the event building network via the read out transmitter RO Tx A more detailed representation of the whole ROB is given in figure 2 for the Velo and in figure 3 for the optical read out To reduce the number of I O pins used on the PP FPGA the data from two synchronization channels are sent to one L1B controller using the full bandwidth of a 16 bit double data rate SDRAM running at 120M Hz The event rate of 40kHz at the DAQ zero suppression allows to use a single common mode and zero suppression for all data on one chip The slow control of the FPGAs is done with a 8 bit wide address and data multiplexed parallel interface This interface is generated from the PCI bridge on the Glue Card and is called the Local Bus of the PLX9030 The ECS interface allows to access the local 32 bit memory space containing registers Two receiver card types are foreseen one for the Velo analog electrical readout and the other for the optical readout A RxCard A RxCard A RxCard O ci O RxCard i ne salle ace DAQ L1 throttle 4
43. to the different data transmission system from the cavern to the counting room For the Velo the receiver card is used to digitize the data transferred over analog copper links 6 and therefore is a mainly analog circuit with pre amplifier line equalizer and ADC This card is called A RxCard The optical receiver card is used by all other sub detectors as IT TT and OT It uses optical receiver and de serializer which results in a mainly digital design except for the optical receiver part This card is called O RxCard The signal connection from the receiver cards to the motherboard is split up into 4 separate connectors The physical placement is chosen such that 2 or 4 mezzanine cards can be inserted giving flexibility for the receiver card design Table 18 shows the number of digital signals on the signal connector for different implementations The connector chosen provide massive copper plate for GND and Vcc connection and ensures very good signal integrity properties The motherboard is designed to allow 64 analog links digitized with 8 or 10 bit for the A RxCard The number of pins needed for the RxCard is driven by the A RxCard In addition to the signal connectors analog power connectors are used to supply the RxCard with all necessary analog power see table 19 All control signals for the O RxCard are assigned to pins of the not used data signals 160 data signals are used for the Velo and only 108 for the O RxCard which minimizes the
44. used for digital 5V for the O RxCard Analog 5V 2 used for ADC s Analog 2 5V 2 used by SERDES Total 10 Table 19 Signals on the power connector for the RaCard 37 Signal Name 1I O seen Comment from the FPGAs EcsAD lt 7 0 gt 8 InOut Multiplexed Addr Data ECSClk 1 In the SyncLink FPGA drives the clock ECSnADS 1 In Address strobe ECSnBlast 1 In Burst Last ECSnCS 1 1 In Chip select ECSWnR 1 tn Write not Read ECSnReady 1 Out Assert by slave when ready ECSnReset 1 In ECS reset goes only to SyncLink FPGA ECSALE 1 Address latch enable not used ECSnBE lt 3 0 gt 4 Byte enable not used ECSnRD 1 Read strobe not used ECSnWR 1 l Write strobe not used Total to SyncLink FPGA 15 Total to PP FPGA 14 no Reset ECSnCSaux0 1 l for RO Tx ECSnCSaux1 1 l for external control interface Table 20 PLX 9030 Local parallel bus used in multiplexed 8 bit mode slave only The given signals are used to access the FPGAs on the board In addition 2 more chip select signals are available 38 Signal Name I O seen Comment from SyncLink FPGA BCnt lt 11 0 gt 12 Input BCnt EvCntL EvCntH BCntRes 1 Input BCnt reset BCntStr 1 Input BCnt strobe Brest lt 7 2 gt 6 Input Broadcast command data BrestStr1 1 Input Broadca

Download Pdf Manuals

image

Related Search

Related Contents

StarTech.com 50 ft Active CL2 In-wall High Speed HDMI Cable - Ultra HD 4k x 2k HDMI Cable - HDMI to HDMI - M/M  Network Camera Camera Management Tool User Manual - AV-iQ  Fiche technique chargeuse VENIERI VF 10.63  AgfaPhoto Optima 103  Belkin Surf Powerline AV  E - Active srl    

Copyright © All rights reserved.
Failed to retrieve file