CYCLE
Jie Zhao
Latest
Arfa: an Agile Regime-based Floating-point Optimization Approach for Rounding Errors
Enabling Tensor Language Model to Assist in Generating High-Performance Tensor Programs for Deep Learning
A Holistic Approach to Automatic Mixed-Precision Code Generation and Tuning for Affine Programs
Modeling the Interplay between Loop Tiling and Fusion in Optimizing Compilers Using Affine Relations
Apollo: Automatic Partition-based Operator Fusion through Layer by Layer Optimization
Eiffel: Inferring Input Ranges of Significant Floating-point Errors via Polynomial Extrapolation
SIRIUS: Harvesting Whole-Program Optimization Opportunities for DNNs
Effectively Scheduling Computational Graphs of Deep Neural Networks toward Their Domain-Specific Accelerators
Automatically Generating High-performance Matrix Multiplication Kernels on the Latest Sunway Processor
Parallelizing Neural Network Models Effectively on GPU by Implementing Reductions Atomically
AKG: Automatic Kernel Generation for Neural Processing Units Using Polyhedral Transformations
Optimizing the Memory Hierarchy by Compositing Automatic Transformations on Computations and Data
Flextended Tiles: A Flexible Extension of Overlapped Tiles for Polyhedral Compilation
A Polyhedral Compilation Framework for Loops with Dynamic Data-Dependent Bounds