Publications

Download BibTeX.

2020
November
PDF Cortex: A Compiler for Recursive Deep Learning Models.
Pratik Fegade, Tianqi Chen, Phil Gibbons, and Todd Mowry.
arXiv preprint.
2020
August
PDF Redundancy-free computation graphs for graph neural networks.
Zhihao Jia, Sina Lin, Rex Ying, Jiaxuan You, Jure Leskovec, and Alex Aiken.
KDD 2020.
2020
March
PDF Improving the accuracy, scalability, and performance of graph neural networks with roc.
Zhihao Jia, Sina Lin, Mingyu Gao, Matei Zaharia, and Alex Aiken.
MLSys 2020.
2020
February
PDF Automating Generation of Low Precision Deep Learning Operators.
Meghan Cowan, Thierry Moreau, Tianqi Chen, and Luis Ceze.
CGO.
2019
November
PDF TASO: optimizing deep learning computation with automatic generation of graph substitutions.
Zhihao Jia, Oded Padon, James Thomas, Todd Warszawski, Matei Zaharia, and Alex Aiken.
SOSP 2019.
2019
September
PDF A Hardware-Software Blueprint for Flexible Deep Learning Specialization.
Thierry Moreau, Tianqi Chen, Luis Vega, Jared Roesch, Eddie Yan, Lianmin Zheng, Josh Fromm, Ziheng Jiang, Luis Ceze, Carlos Guestrin, and Arvind Krishnamurthy.
IEEE Micro 39(5).
2019
April
PDF Beyond data and model parallelism for deep neural networks.
Zhihao Jia, Matei Zaharia, and Alex Aiken.
SysML 2019.
2019
April
PDF Optimizing DNN Computation with Relaxed Graph Substitutions.
Zhihao Jia, James Thomas, Todd Warzawski, Mingyu Gao, Matei Zaharia, and Alex Aiken.
SysML 2019.
2018
December
PDF Learning to Optimize Tensor Programs.
Tianqi Chen, Lianmin Zheng, Eddie Yan, Ziheng Jiang, Thierry Moreau, Luis Ceze, Carlos Guestrin, and Arvind Krishnamurthy.
NeurIPS 2018.
2018
October
PDF TVM: An Automated End-to-End Optimizing Compiler for Deep Learning.
Tianqi Chen, Thierry Moreau, Ziheng Jiang, Lianmin Zheng, Eddie Yan, Meghan Cowan, Haichen Shen, Leyuan Wang, Yuwei Hu, Luis Ceze, Carlos Guestrin, and Arvind Krishnamurthy.
OSDI 2018.
2018
July
PDF Exploring Hidden Dimensions in Accelerating Convolutional Neural Networks.
Zhihao Jia, Sina Lin, Charles R. Qi, and Alex Aiken.
ICML 2018 (Proceedings of Machine Learning Research).
2017
November
PDF A Distributed Multi-GPU System for Fast Graph Processing.
Zhihao Jia, Yongkee Kwon, Galen Shipman, Pat McCormick, Mattan Erez, and Alex Aiken.
VLDB 11(3).
2016
August
PDF XGBoost: A Scalable Tree Boosting System.
Tianqi Chen and Carlos Guestrin.
KDD 2016.
2015
December
PDF MXNet: A Flexible and Efficient Machine Learning Library for Heterogeneous Distributed Systems.
Tianqi Chen, Mu Li, Yutian Li, Min Lin, Naiyan Wang, Minjie Wang, Tianjun Xiao, Bing Xu, Chiyuan Zhang, and Zheng Zhang.
LearningSys Workshop at Neural Information Processing Systems 2015.