TY - GEN
T1 - Performance evaluation of multiple lookup tables algorithms for generating CRC on an FPGA
AU - Akagić, Amila
AU - Amano, Hideharu
PY - 2011/8/23
Y1 - 2011/8/23
N2 - A compact architecture of five CRC algorithms based on multiple lookup tables approach is proposed. The focus of this paper is the tradeoff between implementation by using distributed LUTs or BRAM. Our results show that BRAM-based approach is more efficient in terms of area utilization, but LUT based approach allows higher throughput. The proposed architecture has been implemented on Xilinx Virtex 6 LX195T prototyping device, requiring less than 1% of the device resources. Experimental results show that throughput doubles when number of processed bits is doubled. Maximum achieved throughput is 170 Gbps for processing 512 bits at a time with LUT-based approach. We show that in terms of achievable clock speed BRAM-based approach is more efficient when processing 32, 64 and 128 bits at a time, while higher throughput is achieved by LUT-based approach for algorithms that process 256 and 512 bits at a time. In terms of hardware cost, BRAM-based approach without register balancing optimization proved to be most efficient solution.
AB - A compact architecture of five CRC algorithms based on multiple lookup tables approach is proposed. The focus of this paper is the tradeoff between implementation by using distributed LUTs or BRAM. Our results show that BRAM-based approach is more efficient in terms of area utilization, but LUT based approach allows higher throughput. The proposed architecture has been implemented on Xilinx Virtex 6 LX195T prototyping device, requiring less than 1% of the device resources. Experimental results show that throughput doubles when number of processed bits is doubled. Maximum achieved throughput is 170 Gbps for processing 512 bits at a time with LUT-based approach. We show that in terms of achievable clock speed BRAM-based approach is more efficient when processing 32, 64 and 128 bits at a time, while higher throughput is achieved by LUT-based approach for algorithms that process 256 and 512 bits at a time. In terms of hardware cost, BRAM-based approach without register balancing optimization proved to be most efficient solution.
UR - http://www.scopus.com/inward/record.url?scp=80051827951&partnerID=8YFLogxK
UR - http://www.scopus.com/inward/citedby.url?scp=80051827951&partnerID=8YFLogxK
U2 - 10.1109/ISAS.2011.5960941
DO - 10.1109/ISAS.2011.5960941
M3 - Conference contribution
AN - SCOPUS:80051827951
SN - 9781457707179
T3 - Proceedings of 2011 1st International Symposium on Access Spaces, ISAS 2011
SP - 164
EP - 169
BT - Proceedings of 2011 1st International Symposium on Access Spaces, ISAS 2011
T2 - 2011 1st International Symposium on Access Spaces, ISAS 2011
Y2 - 17 June 2011 through 19 June 2011
ER -