Pouya Haghi
Boston University
H-index: 9
North America-United States
Top articles of Pouya Haghi
Title | Journal | Author(s) | Publication Date |
---|---|---|---|
FPGA-Accelerated Range-Limited Molecular Dynamics | IEEE Transactions on Computers | Chunshu Wu Chen Yang Sahan Bandara Tong Geng Anqi Guo | 2024/3 |
ACiS: smart switches with application-level acceleration | Pouya Haghi | 2023 | |
FASDA: An FPGA-Aided, Scalable and Distributed Accelerator for Range-Limited Molecular Dynamics | Chunshu Wu Tong Geng Anqi Guo Sahan Bandara Pouya Haghi | 2023/11/12 | |
Software-hardware co-design of heterogeneous SmartNIC system for recommendation models inference and training | Anqi Guo Yuchen Hao Chunshu Wu Pouya Haghi Zhenyu Pan | 2023/6/21 | |
Flash: FPGA-accelerated smart switches with GCN case study | P. Haghi W. Krska C. C. Tan T. Geng P.H. Chen | 2023/6 | |
A Survey of Potential MPI Complex Collectives: Large-Scale Mining and Analysis of HPC Applications | arXiv preprint arXiv:2305.19946 | Pouya Haghi Ryan Marshall Po Hao Chen Anthony Skjellum Martin C Herbordt | 2023/5/31 |
Copa use case: Distributed secure joint computation | Rushi Patel Pouya Haghi Shweta Jain Andriy Kot Venkata Krishnan | 2022/5/15 | |
FCsN: A FPGA-Centric SmartNIC Framework for Neural Networks | Anqi Guo Tong Geng Yongan Zhang Pouya Haghi Chunshu Wu | 2022/5/15 | |
The Viability of Using Online Prediction to Perform Extra Work while Executing BSP Applications | Po Hao Chen Pouya Haghi Jae Yoon Chung Tong Geng Richard West | 2022/9/19 | |
Reconfigurable switches for high performance and flexible MPI collectives | Concurrency and Computation: Practice and Experience (CPE) | P. Haghi A Guo Q. Xiong C Yang T. Geng | 2022/3 |
Distributed hardware accelerated secure joint computation on the copa framework | Rushi Patel Pouya Haghi Shweta Jain Andriy Kot Venkata Krishnan | 2022/9/19 | |
Optimized Mappings for Symmetric Range-Limited Molecular Force Calculations on FPGAs | Chunshu Wu Sahan Bandara Tong Geng Anqi Guo Pouya Haghi | 2022/8/29 | |
A framework for neural network inference on fpga-centric smartnics | A. Guo T. Geng Y. Zhang P. Haghi C. Wu | 2022/8 | |
A survey: Handling irregularities in neural network acceleration with fpgas | T. Geng C. Wu C. Tan C Xie A Guo | 2021/9 | |
Workload imbalance in hpc applications: Effect on performance of in-network processing | P. Haghi A. Guo T. Geng A. Skjellum M.C. Herbordt | 2021/9 | |
O⁴-DNN: A Hybrid DSP-LUT-Based Processing Unit With Operation Packing and Out-of-Order Execution for Efficient Realization of Convolutional Neural Networks on FPGA Devices | IEEE Transactions on Circuits and Systems I: Regular Papers | Pouya Haghi Mehdi Kamal Ali Afzali-Kusha Massoud Pedram | 2020/4/29 |
A reconfigurable compute-in-the-network fpga assistant for high-level collective support with distributed matrix multiply case study | Pouya Haghi Anqi Guo Tong Geng Justin Broaddus Derek Schafer | 2020/12/9 | |
AWB-GCN: A graph convolutional network accelerator with runtime workload rebalancing | T. Geng A. Li R. Shi C. Wu T. Wang | 2020/10 | |
FPGAs in the network and novel communicator support accelerate MPI collectives | P. Haghi A. Guo Q. Xiong R. Patel Yang C. | 2020/9 | |
Accelerating MPI collectives with FPGAs in the network and novel communicator support | Qingqing Xiong Chen Yang Pouya Haghi Anthony Skjellum Martin Herbordt | 2020/5 |