Search filters

List of works by Minyi Guo

AdaptGear

scientific article published on 04 August 2023

Amanda: Unified Instrumentation Framework for Deep Neural Networks

scientific article published on 17 April 2024

BLAD: Adaptive Load Balanced Scheduling and Operator Overlap Pipeline For Accelerating The Dynamic GNN Training

scientific article published on 30 October 2023

DataFlower: Exploiting the Data-flow Paradigm for Serverless Workflow Orchestration

scientific article published on 07 February 2024

DistSim

scientific article published on 04 August 2023

FaaSFlow: enable efficient workflow execution for function-as-a-service

scientific article published on 22 February 2022

FaaSGraph: Enabling Scalable, Efficient, and Cost-Effective Graph Processing with Serverless Computing

scientific article published on 22 April 2024

FaaSMem: Improving Memory Efficiency of Serverless Computing with Memory Pool Architecture

scientific article published on 24 April 2024

GMLake: Efficient and Transparent GPU Memory Defragmentation for Large-scale DNN Training with Virtual Memory Stitching

scientific article published on 22 April 2024

JUNO: Optimizing High-Dimensional Approximate Nearest Neighbour Search with Sparsity-Aware Algorithm and Ray-Tracing Core Mapping

scientific article published on 22 April 2024

Maximizing the Utilization of GPUs Used by Cloud Gaming through Adaptive Co-location with Combo

scientific article published on 31 October 2023

Not All Resources are Visible

scientific article published on 31 October 2023

OliVe: Accelerating Large Language Models via Hardware-friendly Outlier-Victim Pair Quantization

scientific article published on 16 June 2023

PAC: Preference-Aware Co-location Scheduling on Heterogeneous NUMA Architectures To Improve Resource Utilization

scientific article published on 20 June 2023

VELTAIR: towards high-performance multi-tenant deep learning services via adaptive compilation and scheduling

scientific article published on 22 February 2022