CYCLE
CYCLE
News
People
Events
Publications
Calendar
Contact
Paper-Conference
Arfa: an Agile Regime-based Floating-point Optimization Approach for Rounding Errors
In
Proceedings of the 33rd ACM International Symposium on Software Testing and Analysis (ISSTA 2024)
Jinchen Xu
,
Mengqi Cui
,
Fei Li
,
Zuoyan Zhang
,
Hongru Yang
,
Bei Zhou
,
Jie Zhao
PDF
Cite
Code
Enabling Tensor Language Model to Assist in Generating High-Performance Tensor Programs for Deep Learning
In
Proceedings of the 18th USENIX Symposium on Operating Systems Design and Implementation (OSDI 2024)
Yi Zhai
,
Sijia Yang
,
Keyu Pan
,
Renwei Zhang
,
Shuo Liu
,
Chao Liu
,
Zichun Ye
,
Jianmin Ji
,
Jie Zhao
,
Yu Zhang
,
Yanyong Zhang
PDF
Cite
Code
A Holistic Approach to Automatic Mixed-Precision Code Generation and Tuning for Affine Programs
In
Proceedings of the 29th ACM SIGPLAN Annual Symposium on Principles and Practice of Parallel Programming (PPoPP 2024)
Jinchen Xu
,
Guanghui Song
,
Bei Zhou
,
Fei Li
,
Jiangwei Hao
,
Jie Zhao
PDF
Cite
Code
Dataset
Apollo: Automatic Partition-based Operator Fusion through Layer by Layer Optimization
In
Proceedings of Machine Learning and Systems (MLSys 2022)
Jie Zhao
,
Xiong Gao
,
Ruijie Xia
,
Zhaochuang Zhang
,
Deshi Chen
,
Lei Chen
,
Renwei Zhang
,
Zhen Geng
,
Bin Cheng
,
Xuefeng Jin
PDF
Cite
Eiffel: Inferring Input Ranges of Significant Floating-point Errors via Polynomial Extrapolation
In
Proceedings of the 38th IEEE/ACM International Conference on Automated Software Engineering (ASE 2023)
Zuoyan Zhang
,
Bei Zhou
,
Jiangwei Hao
,
Hongru Yang
,
Mengqi Cui
,
Yuchang Zhou
,
Guanghui Song
,
Fei Li
,
Jinchen Xu
,
Jie Zhao
PDF
Cite
Code
SIRIUS: Harvesting Whole-Program Optimization Opportunities for DNNs
In
Proceedings of Machine Learning and Systems (MLSys 2023)
Yijin Li
,
Jiacheng Zhao
,
Qianqi Sun
,
Haohui Mai
,
Lei Chen
,
Wanlu Cao
,
Yanfan Chen
,
Zhicheng Li
,
Ying Liu
,
Xinyuan Zhang
,
Xiyu Shi
,
Jie Zhao
,
Jingling Xue
,
Huimin Cui
,
Xiaobing Feng
PDF
Cite
Effectively Scheduling Computational Graphs of Deep Neural Networks toward Their Domain-Specific Accelerators
In
Proceedings of the 17th USENIX Symposium on Operating Systems Design and Implementation (OSDI 2023)
Jie Zhao
,
Siyuan Feng
,
Xiaoqiang Dan
,
Fei Liu
,
Chengke Wang
,
Sheng Yuan
,
Wenyuan Lv
,
Qikai Xie
PDF
Cite
Automatically Generating High-performance Matrix Multiplication Kernels on the Latest Sunway Processor
In
Proceedings of the 51st International Conference on Parallel Processing (ICPP 2022)
Xiaohan Tao
,
Yu Zhu
,
Boyang Wang
,
Jinlong Xu
,
Jianmin Pang
,
Jie Zhao
PDF
Cite
Parallelizing Neural Network Models Effectively on GPU by Implementing Reductions Atomically
In
Proceedings of 31st International Conference on Parallel Architectures and Compilation Techniques (PACT 2022)
Jie Zhao
,
Cédric Bastoul
,
Yanzhi Yi
,
Jiahui Hu
,
Wang Nie
,
Renwei Zhang
,
Zhen Geng
,
Chong Li
,
Thibaut Tachon
,
Zhiliang Gan
PDF
Cite
AKG: automatic kernel generation for neural processing units using polyhedral transformations
In
Proceedings of the 42nd ACM SIGPLAN International Conference on Programming Language Design and Implementation (PLDI 2021)
Jie Zhao
,
Bojie Li
,
Wang Nie
,
Zhen Geng
,
Renwei Zhang
,
Xiong Gao
,
Bin Cheng
,
Chen Wu
,
Yun Cheng
,
Zheng Li
,
Peng Di
,
Kun Zhang
,
Xuefeng Jin
PDF
Cite
»
Cite
×