2020
November
|
Cortex: A Compiler for Recursive Deep Learning Models.
Pratik Fegade, Tianqi Chen, Phil Gibbons, and Todd Mowry.
arXiv preprint.
|
2020
August
|
Redundancy-free computation graphs for graph neural networks.
Zhihao Jia, Sina Lin, Rex Ying, Jiaxuan You, Jure Leskovec, and Alex Aiken.
KDD 2020.
|
2020
March
|
Improving the accuracy, scalability, and performance of graph neural networks with roc.
Zhihao Jia, Sina Lin, Mingyu Gao, Matei Zaharia, and Alex Aiken.
MLSys 2020.
|
2020
February
|
Automating Generation of Low Precision Deep Learning Operators.
Meghan Cowan, Thierry Moreau, Tianqi Chen, and Luis Ceze.
CGO.
|
2019
November
|
TASO: optimizing deep learning computation with automatic generation of graph substitutions.
Zhihao Jia, Oded Padon, James Thomas, Todd Warszawski, Matei Zaharia, and Alex Aiken.
SOSP 2019.
|
2019
September
|
A Hardware-Software Blueprint for Flexible Deep Learning Specialization.
Thierry Moreau, Tianqi Chen, Luis Vega, Jared Roesch, Eddie Yan, Lianmin Zheng, Josh Fromm, Ziheng Jiang, Luis Ceze, Carlos Guestrin, and Arvind Krishnamurthy.
IEEE Micro 39(5).
|
2019
April
|
Beyond data and model parallelism for deep neural networks.
Zhihao Jia, Matei Zaharia, and Alex Aiken.
SysML 2019.
|
2019
April
|
Optimizing DNN Computation with Relaxed Graph Substitutions.
Zhihao Jia, James Thomas, Todd Warzawski, Mingyu Gao, Matei Zaharia, and Alex Aiken.
SysML 2019.
|
2018
December
|
Learning to Optimize Tensor Programs.
Tianqi Chen, Lianmin Zheng, Eddie Yan, Ziheng Jiang, Thierry Moreau, Luis Ceze, Carlos Guestrin, and Arvind Krishnamurthy.
NeurIPS 2018.
|
2018
October
|
TVM: An Automated End-to-End Optimizing Compiler for Deep Learning.
Tianqi Chen, Thierry Moreau, Ziheng Jiang, Lianmin Zheng, Eddie Yan, Meghan Cowan, Haichen Shen, Leyuan Wang, Yuwei Hu, Luis Ceze, Carlos Guestrin, and Arvind Krishnamurthy.
OSDI 2018.
|
2018
July
|
Exploring Hidden Dimensions in Accelerating Convolutional Neural Networks.
Zhihao Jia, Sina Lin, Charles R. Qi, and Alex Aiken.
ICML 2018 (Proceedings of Machine Learning Research).
|
2017
November
|
A Distributed Multi-GPU System for Fast Graph Processing.
Zhihao Jia, Yongkee Kwon, Galen Shipman, Pat McCormick, Mattan Erez, and Alex Aiken.
VLDB 11(3).
|
2016
August
|
XGBoost: A Scalable Tree Boosting System.
Tianqi Chen and Carlos Guestrin.
KDD 2016.
|
2015
December
|
MXNet: A Flexible and Efficient Machine Learning Library for Heterogeneous Distributed Systems.
Tianqi Chen, Mu Li, Yutian Li, Min Lin, Naiyan Wang, Minjie Wang, Tianjun Xiao, Bing Xu, Chiyuan Zhang, and Zheng Zhang.
LearningSys Workshop at Neural Information Processing Systems 2015.
|