A compiler and runtime for mega-kernelizing tensor programs
A universal solution that allows any language model to be deployed natively on a diverse set of hardware backends and native applications.
End-to-end compilation of ML applications with dynamic and irregular control flow and data structure accesses