high bandwidth Latest Research Papers

Scientific computing is at the core of many High-Performance Computing applications, including computational flow dynamics. Because of the utmost importance to simulate increasingly larger computational models, hardware acceleration is receiving increased attention due to its potential to maximize the performance of scientific computing. Field-Programmable Gate Arrays could accelerate scientific computing because of the possibility to fully customize the memory hierarchy important in irregular applications such as iterative linear solvers. In this article, we study the potential of using Field-Programmable Gate Arrays in High-Performance Computing because of the rapid advances in reconfigurable hardware, such as the increase in on-chip memory size, increasing number of logic cells, and the integration of High-Bandwidth Memories on board. To perform this study, we propose a novel Sparse Matrix-Vector multiplication unit and an ILU0 preconditioner tightly integrated with a BiCGStab solver kernel. We integrate the developed preconditioned iterative solver in Flow from the Open Porous Media project, a state-of-the-art open source reservoir simulator. Finally, we perform a thorough evaluation of the FPGA solver kernel in both stand-alone mode and integrated in the reservoir simulator, using the NORNE field, a real-world case reservoir model using a grid with more than 10 5 cells and using three unknowns per cell.

Download Full-text

Hipernetch: High-Performance FPGA Network Switch

ACM Transactions on Reconfigurable Technology and Systems ◽

10.1145/3477054 ◽

2022 ◽

Vol 15 (1) ◽

pp. 1-31

Author(s):

Philippos Papaphilippou ◽

Jiuxi Meng ◽

Nadeen Gebara ◽

Wayne Luk

Keyword(s):

High Frequency ◽

High Performance ◽

Data Centers ◽

Round Robin ◽

Switching Performance ◽

Wide Range ◽

Crossbar Switches ◽

High Bandwidth ◽

Network Switch ◽

Network Switches

We present Hipernetch, a novel FPGA-based design for performing high-bandwidth network switching. FPGAs have recently become more popular in data centers due to their promising capabilities for a wide range of applications. With the recent surge in transceiver bandwidth, they could further benefit the implementation and refinement of network switches used in data centers. Hipernetch replaces the crossbar with a “combined parallel round-robin arbiter”. Unlike a crossbar, the combined parallel round-robin arbiter is easy to pipeline, and does not require centralised iterative scheduling algorithms that try to fit too many steps in a single or a few FPGA cycles. The result is a network switch implementation on FPGAs operating at a high frequency and with a low port-to-port latency. Our proposed Hipernetch architecture additionally provides a competitive switching performance approaching output-queued crossbar switches. Our implemented Hipernetch designs exhibit a throughput that exceeds 100 Gbps per port for switches of up to 16 ports, reaching an aggregate throughput of around 1.7 Tbps.

Download Full-text

Federated Learning in Edge Computing: A Systematic Survey

Sensors ◽

10.3390/s22020450 ◽

2022 ◽

Vol 22 (2) ◽

pp. 450

Author(s):

Haftay Gebreslasie Abreha ◽

Mohammad Hayajneh ◽

Mohamed Adel Serhani

Keyword(s):

Data Privacy ◽

Edge Computing ◽

Future Research ◽

Open Problems ◽

Systematic Survey ◽

Architecture Framework ◽

Share Data ◽

High Bandwidth ◽

Promising Solution ◽

Cloud Servers

Edge Computing (EC) is a new architecture that extends Cloud Computing (CC) services closer to data sources. EC combined with Deep Learning (DL) is a promising technology and is widely used in several applications. However, in conventional DL architectures with EC enabled, data producers must frequently send and share data with third parties, edge or cloud servers, to train their models. This architecture is often impractical due to the high bandwidth requirements, legalization, and privacy vulnerabilities. The Federated Learning (FL) concept has recently emerged as a promising solution for mitigating the problems of unwanted bandwidth loss, data privacy, and legalization. FL can co-train models across distributed clients, such as mobile phones, automobiles, hospitals, and more, through a centralized server, while maintaining data localization. FL can therefore be viewed as a stimulating factor in the EC paradigm as it enables collaborative learning and model optimization. Although the existing surveys have taken into account applications of FL in EC environments, there has not been any systematic survey discussing FL implementation and challenges in the EC paradigm. This paper aims to provide a systematic survey of the literature on the implementation of FL in EC environments with a taxonomy to identify advanced solutions and other open problems. In this survey, we review the fundamentals of EC and FL, then we review the existing related works in FL in EC. Furthermore, we describe the protocols, architecture, framework, and hardware requirements for FL implementation in the EC environment. Moreover, we discuss the applications, challenges, and related existing solutions in the edge FL. Finally, we detail two relevant case studies of applying FL in EC, and we identify open issues and potential directions for future research. We believe this survey will help researchers better understand the connection between FL and EC enabling technologies and concepts.

Download Full-text

High Bandwidth‐Utilization Digital Holographic Multiplexing: An Approach Using Kramers–Kronig Relations

Advanced Photonics Research ◽

10.1002/adpr.202100273 ◽

2022 ◽

pp. 2100273

Author(s):

Zhengzhong Huang ◽

Liangcai Cao

Keyword(s):

Bandwidth Utilization ◽

High Bandwidth

Download Full-text

Discrete-variable quantum key distribution with homodyne detection

Quantum ◽

10.22331/q-2022-01-03-613 ◽

2022 ◽

Vol 6 ◽

pp. 613

Author(s):

Ignatius William Primaatmaja ◽

Cassey Crystania Liang ◽

Gong Zhang ◽

Jing Yan Haw ◽

Chao Wang ◽

...

Keyword(s):

Quantum Key Distribution ◽

High Speed ◽

Cost Effective ◽

Key Distribution ◽

Continuous Variable ◽

Discrete Variable ◽

Quantum State Preparation ◽

High Bandwidth ◽

The Cost ◽

Selection Of

Most quantum key distribution (QKD) protocols can be classified as either a discrete-variable (DV) protocol or continuous-variable (CV) protocol, based on how classical information is being encoded. We propose a protocol that combines the best of both worlds – the simplicity of quantum state preparation in DV-QKD together with the cost-effective and high-bandwidth of homodyne detectors used in CV-QKD. Our proposed protocol has two highly practical features: (1) it does not require the honest parties to share the same reference phase (as required in CV-QKD) and (2) the selection of decoding basis can be performed after measurement. We also prove the security of the proposed protocol in the asymptotic limit under the assumption of collective attacks. Our simulation suggests that the protocol is suitable for secure and high-speed practical key distribution over metropolitan distances.

Download Full-text

A 25 Gbps VCSEL driving ASIC: an attempt for ultra-high-speed front-end readout applications

Journal of Instrumentation ◽

10.1088/1748-0221/17/01/c01040 ◽

2022 ◽

Vol 17 (01) ◽

pp. C01040

Author(s):

C. Zhao ◽

D. Guo ◽

Q. Chen ◽

N. Fang ◽

Y. Gan ◽

...

Keyword(s):

High Speed ◽

Cmos Technology ◽

Test Results ◽

Eye Diagram ◽

Optical Links ◽

Optical Module ◽

Active Feedback ◽

High Bandwidth ◽

Output Driver ◽

Very High

Abstract This paper presents the design and the test results of a 25 Gbps VCSEL driving ASIC fabricated in a 55 nm CMOS technology as an attempt for the future very high-speed optical links. The VCSEL driving ASIC is composed of an input equalizer stage, a pre-driver stage and a novel output driver stage. To achieve high bandwidth, the pre-driver stage combines the inductor-shared peaking structure and the active-feedback technique. A novel output driver stage uses the pseudo differential CML driver structure and the adjustable FFE pre-emphasis technique to improve the bandwidth. This VCSEL driver has been integrated in a customized optical module with a VCSEL array. Both the electrical function and optical performance have been fully evaluated. The output optical eye diagram has passed the eye mask test at the data rate of 25 Gbps. The peak-to-peak jitter of 25 Gbps optical eye is 19.5 ps and the RMS jitter is 2.9 ps.

Download Full-text

PR_30 Towards Programmable Optical Network Front-end Devices Using Silicon Photonics (1525090-PO)

10.31219/osf.io/szagb ◽

2022 ◽

Author(s):

Shayan Mookherjee

Keyword(s):

Communication Networks ◽

Silicon Photonics ◽

Energy Efficient ◽

National Science Foundation ◽

Optical Network ◽

Project Outcomes ◽

Energy Efficient Communication ◽

Efficient Communication ◽

National Science ◽

High Bandwidth

We study how the performance and utility of high-bandwidth, energy-efficient communication networks can be improved by enabling programmability and user-defined tunability in the optical front-ends using silicon photonics. Summary of a Project Outcomes report of research funded by the U.S. National Science Foundation under Project Number 1525090 (Year 1).

Download Full-text

High-performance packet routing acceleration for cloud systems using high bandwidth memory

IEEE Open Journal of the Communications Society ◽

10.1109/ojcoms.2022.3142249 ◽

2022 ◽

pp. 1-1

Author(s):

Takashi Shimizu ◽

Kazuki Hyoudou ◽

Hiroshi Murakawa ◽

Tomohiro Ishihara

Keyword(s):

High Performance ◽

Packet Routing ◽

Cloud Systems ◽

High Bandwidth

Download Full-text

Design and precision tracking control of a high-bandwidth fast steering mirror for laser beam machining

Precision Engineering ◽

10.1016/j.precisioneng.2021.09.003 ◽

2022 ◽

Vol 73 ◽

pp. 128-139

Author(s):

Jianpeng Zhong ◽

Rina Nishida ◽

Tadahiko Shinshi

Keyword(s):

Laser Beam ◽

Tracking Control ◽

Laser Beam Machining ◽

High Bandwidth ◽

Precision Tracking ◽

Precision Tracking Control

Download Full-text

Adaptively Reduced DRAM Caching for Energy-Efficient High Bandwidth Memory

IEEE Transactions on Computers ◽

10.1109/tc.2022.3140897 ◽

2022 ◽

pp. 1-1

Author(s):

Payman Behnam ◽

Mahdi Nazm Bojnordi

Keyword(s):

Energy Efficient ◽

High Bandwidth

Download Full-text

high bandwidth
Recently Published Documents

TOTAL DOCUMENTS

H-INDEX

Hardware Acceleration of High-Performance Computational Flow Dynamics Using High-Bandwidth Memory-Enabled Field-Programmable Gate Arrays

Hipernetch: High-Performance FPGA Network Switch

Federated Learning in Edge Computing: A Systematic Survey

High Bandwidth‐Utilization Digital Holographic Multiplexing: An Approach Using Kramers–Kronig Relations

Discrete-variable quantum key distribution with homodyne detection

A 25 Gbps VCSEL driving ASIC: an attempt for ultra-high-speed front-end readout applications

PR_30 Towards Programmable Optical Network Front-end Devices Using Silicon Photonics (1525090-PO)

High-performance packet routing acceleration for cloud systems using high bandwidth memory

Design and precision tracking control of a high-bandwidth fast steering mirror for laser beam machining

Adaptively Reduced DRAM Caching for Energy-Efficient High Bandwidth Memory

Export Citation Format

high bandwidthRecently Published Documents

TOTAL DOCUMENTS

H-INDEX

Hardware Acceleration of High-Performance Computational Flow Dynamics Using High-Bandwidth Memory-Enabled Field-Programmable Gate Arrays

Hipernetch: High-Performance FPGA Network Switch

Federated Learning in Edge Computing: A Systematic Survey

High Bandwidth‐Utilization Digital Holographic Multiplexing: An Approach Using Kramers–Kronig Relations

Discrete-variable quantum key distribution with homodyne detection

A 25 Gbps VCSEL driving ASIC: an attempt for ultra-high-speed front-end readout applications

PR_30 Towards Programmable Optical Network Front-end Devices Using Silicon Photonics (1525090-PO)

High-performance packet routing acceleration for cloud systems using high bandwidth memory

Design and precision tracking control of a high-bandwidth fast steering mirror for laser beam machining

Adaptively Reduced DRAM Caching for Energy-Efficient High Bandwidth Memory

high bandwidth
Recently Published Documents