Search filters

List of works by Hayden Kwok-Hay So

A Composable Dynamic Sparse Dataflow Architecture for Efficient Event-based Vision Processing on FPGA

scientific article published on 02 April 2024

A Model for Matrix Multiplication Performance on FPGAs

article

A Model for Peak Matrix Performance on FPGAs

article

A Parameterizable Activation Function Generator for FPGA-Based Neural Network Applications

A Reconfigurable Architecture for Real-time Event-based Multi-Object Tracking

scientific article published on 21 April 2023

A Soft Coarse-Grained Reconfigurable Array Based High-level Synthesis Methodology: Promoting Design Productivity and Exploring Extreme FPGA Frequency

A comparison of SAR image speckle filters

A unified hardware/software runtime environment for FPGA-based reconfigurable computers using BORPH

article

A unified hardware/software runtime environment for FPGA-based reconfigurable computers using BORPH

ASIC Design and Verification in an FPGA Environment

article

All-passive pixel super-resolution of time-stretch imaging.

scientific article

An Unified Architecture for Single, Double, Double-Extended, and Quadruple Precision Division

An integrated debugging environment for reprogrammble hardware systems

Architecture Generator for Type-3 Unum Posit Adder/Subtractor

Architecture for quadruple precision floating point division with multi-precision support

scholarly article published July 2016

Area-Efficient Architecture for Dual-Mode Double Precision Floating Point Division

Automatic Soft CGRA Overlay Customization for High-Productivity Nested Loop Acceleration on FPGAs

Automatic system architecture synthesis for FPGA-based reconfigurable computers

scholarly article published December 2009

Computationally Efficient Hyperspectral Data Learning Based on the Doubly Stochastic Dirichlet Process

Configurable Architectures for Multi-Mode Floating Point Adders

DSP48E efficient floating point multiplier architectures on FPGA

Data-driven light field depth estimation using deep Convolutional Neural Networks

scholarly article published July 2016

Deep-learning-assisted biophysical imaging cytometry at massive throughput delineates cell population heterogeneity

scientific article published on 16 September 2020

Design considerations of real-time adaptive beamformer for medical ultrasound research using FPGA and GPU

article published in 2012

Design space exploration for sparse matrix-matrix multiplication on FPGAs

Design space exploration for sparse matrix-matrix multiplication on FPGAs

scholarly article published December 2010

Design space exploration of adaptive beamforming acceleration for bedside and portable medical ultrasound imaging

Direct sigma-delta modulated signal processing in FPGA

scholarly article published 2008

Direct virtual memory access from FPGA for high-productivity heterogeneous computing

Dual-mode double precision / two-parallel single precision floating point multiplier architecture

Dual-mode double precision division architecture

Dynamic power reduction of FPGA-based reconfigurable computers using precomputation

Energy-efficient dataflow computations on FPGAs using application-specific coarse-grain architecture synthesis

Extending BORPH for shared memory reconfigurable computers

article published in 2012

FPGA High-level Synthesis versus Overlay

FPGA Overlays

File system access from reconfigurable FPGA hardware processes in BORPH

article

GraVF: A vertex-centric distributed graph processing framework on FPGAs

High-throughput cellular imaging with high-speed asymmetric-detection time-stretch optical microscopy under FPGA platform

High-throughput time-stretch imaging flow cytometry for multi-class classification of phytoplankton.

scientific article published in December 2016

Improving Usability of FPGA-Based Reconfigurable Computers Through Operating System Support

article

Introduction to the Special Issue on Application-Specific Systems, Architectures and Processors

Large-scale Multi-class Image-based Cell Classification with Deep Learning

scholarly article by Nan Meng et al published 31 October 2018 in IEEE Journal of Biomedical and Health Informatics

Low-Latency <i>In Situ</i> Image Analytics With FPGA-Based Quantized Convolutional Neural Network

scientific article published in 2022

Map-reduce processing of k-means algorithm with FPGA-accelerated computer cluster

scholarly article published June 2014

Medical Ultrasound Imaging: To GPU or Not to GPU?

Message from the ASAP 2015 chairs

Mixed-architecture process scheduling on tightly coupled reconfigurable computers

scholarly article published September 2014

Multi‐ATOM: Ultrahigh‐throughput single‐cell quantitative phase imaging with subcellular resolution

scientific article published on 01 April 2019

NnCore: A parameterized non-linear function generator for machine learning applications in FPGAs

scholarly article published December 2017

OLAF'16

OLAF'17

On IIR-based bit-stream multipliers

Operation scheduling for FPGA-based reconfigurable computers

Quad-level bit-stream signal processing on FPGAs

scholarly article published December 2008

Quantitative Phase Imaging Flow Cytometry for Ultra-Large-Scale Single-Cell Biophysical Phenotyping

scientific article published on 22 April 2019

QuickDough: A rapid FPGA loop accelerator design framework using soft CGRA overlay

RSQP: Problem-specific Architectural Customization for Accelerated Convex Quadratic Optimization

scholarly article

Radio Testbeds Using BEE2

article

Real-time GPU-based adaptive beamformer for high quality ultrasound imaging

Real-time object detection and classification for high-speed asymmetric-detection time-stretch optical microscopy on FPGA

Reducing dynamic power consumption in FPGAs using precomputation

Runtime Filesystem Support for Reconfigurable FPGA Hardware Processes in BORPH

article

Significant papers from the first 25 years of the FPL conference

Sparse Hierarchical Nonparametric Bayesian learning for light field representation and denoising

Taylor Series Based Architecture for Quadruple Precision Floating Point Division

Teaching introductory electrical engineering: Project-based learning experience

scholarly article published August 2012

The First 25 Years of the FPL Conference

Towards FPGA-assisted spark: An SVM training acceleration case study

Towards Flexible Automatic Generation of Graph Processing Gateware

UE-TCAM: An ultra efficient SRAM-based TCAM

Ultra-large-scale single-cell quantitative phase imaging

Ultra-low latency continuous block-parallel stream windowing using FPGA on-chip memory

Universal number posit arithmetic generator on FPGA

Unsupervised tracking with a low computational cost using the doubly stochastic Dirichlet process mixture model

Vertex-Centric Graph Processing on FPGA

Zero-Configuration Identity-Based Signcryption Scheme for Smart Grid

Zero-configuration identity-based IP network encryptor