[API][Backend] Streaming and OpenCL Backends (cornell-zhang#138)

* add sdaccel, aocl for heterocl * fpga * Create codeanalys_openclc.cc * Update target.py * run * can run successfully * Create codegen_opencl.cc * now * all done * Update codegen_sdaccel.cc * Update codegen_sdaccel.cc * modified: python/heterocl/tvm/target.py * new file: samples/ppac/gemm/csrcPrint.py new file: samples/ppac/gemm/data.py new file: samples/ppac/gemm/gemm_ppac.py new file: samples/ppac/gemm/headcode.txt new file: samples/ppac/gemm/ppac_common.py new file: tvm/src/codegen/build_ppac.cc new file: tvm/src/codegen/codegen_rv64_ppac.cc new file: tvm/src/codegen/codegen_rv64_ppac.h * all * remove tvm check code from kernel * opencl-backend * all * fix ppac module build * support ppac MVPb pragma * fix ignoring ppac pragma in cpu backend * opencl-backend * aocl-backend * move ppac codegen to ppac folder; fix argument name with merlinc analyser * discard the new for-loop type; include ppac in hlib * discard some previous changes * Use int64_t as return type of GeMM on ppac * [add] codegenc kernedef + stream init * [add] var_shape_map * [update] kerneldef struct shape * [update] use noderef and restore * [fix] return op * [add] hcl device & kernelstmt printer * [fix] def workaround * [update] stream example * [add] stream expr & stmt ir * [fix] kernel arg location for stream * opt1 * opencl-general * new-version * no bug * a * test+unroll+pipeline * pragma * new * type has fixed * new_test * test_reorder_split_fuse * target * order * simplified by rui * analysis * bug fixed * [delete] all of the code about opencl * [ADD] new opencl back-end including xilinx & intel * fixed __local * fixed data_type for xilinx opencl * add makefile for SDAccel_runtime * add the runtime for sdaccel * create the sdaccel host * fixed the indent problem partly * test the zhang-05 server * add indent to the host.cpp * automatically generate makefile * delete common folder from opencl * add shmat to sdaccel runtime * fixed bug for sdaccel runtime seg fault * fixed the bug of host.cpp multiple * fixed host.cpp multiple bug * fixed endif for makefile * modify sdaccel_sw_emu -> sdaccel_csim * fix the __local and __global for intel opencl back-end * Fix the arbitrary integer precision for aocl * [add] ir visitor & functor for codegen * [add] aocl stream codegen * [add] aocl stream support * [fix] aocl type conversion * [fix] aocl channel syntax * [add] sch.stream_to * [fix] add stream annotation * [add] host device codegen * [add] stream ir mutator * [Add] Interface prag,a for SDx sim * [add] host xcel codegen * [update] build interface * [update] new build interface * [fix] temp update * [update] stream example * [add] rocc-ppac sim * [rm] submodule * [update] rocc ppac hlib * [add] unified sim & kernel updater * re-organize build common util * [update] stream in codegen c * [update] codegen construct for streaming * [update] code post-processing * [fix] test cases * [fix] python compatibility * [update] future * [fix] metaclass * [fix] test import issue
seanlatias · Dec 10, 2019 · 2c75344 · 2c75344
1 parent c124eb9
commit 2c75344
Show file tree

Hide file tree

Showing 142 changed files with 10,417 additions and 572 deletions.
diff --git a/.circleci/config.yml b/.circleci/config.yml
@@ -11,6 +11,7 @@ test: &test
         key: v1.03-libhcl-
     - run: make build-python
     - run: pip install --user pytest 
+    - run: pip install --user future 
     - run: python -m pytest tests
     - run: pip install --user mxnet
     - run: python -m pytest samples

diff --git a/.gitignore b/.gitignore
@@ -16,8 +16,6 @@ tags
 docs/source/samples
 docs/source/tutorials
 soda_*
-*.cpp
-*.h
 out
 
 # Downloaded files

diff --git a/HISTORY b/HISTORY
@@ -0,0 +1,11 @@
+### 2019-12-09 
+  * fixed issue of zc706 simulation 
+    * remove kernel-name variable allocation before KernelDef
+    * change multi-dimension array access to row-major single-dimension access
+    * create local buffer for each on-device variable
+    * updated the `KernelUpdater` class (using position index instead of name)
+    * added `stream_arg_pos` map in `CodeGenC` to facilitate codegen with streaming
+  * fixed test cases 
+    * changed tvm `build` function to support legacy string type target 
+    * fixed opencl aocl data type mismatching issue
+    * fixed kernel def data type conversion issue
diff --git a/Makefile b/Makefile
@@ -12,15 +12,15 @@ build-tvm: build-pkgs
 
 build-hcl: build-tvm
 	cd python; \
-	python setup.py install --user; \
+	python setup.py develop --user; \
   cd ../hlib/python; \
-	python setup.py install --user;
+	python setup.py develop --user;
 
 build-python:
 	cd python; \
-	python setup.py install --user; \
+	python setup.py develop --user; \
   cd ../hlib/python; \
-	python setup.py install --user;
+	python setup.py develop --user;
 
 clean:
 	rm -rf build

diff --git a/Makefile.config b/Makefile.config
@@ -12,6 +12,9 @@ CMAKE_OK = no
 # set whether to use vivado hls runtime
 USE_VIVADO_HLS = 1
 
+# set whether to use sdaccel opencl runtime
+USE_SDACCEL_HLS = 1
+
 # Specify current directory level with respect to CLAY_ROOT
 ifndef LEVEL
 	LEVEL := .

diff --git a/hlib/python/hlib/nn.py b/hlib/python/hlib/nn.py
@@ -32,6 +32,17 @@ def _pad(*indices):
         return data[tuple(index_tuple)]
     return hcl.compute(out_shape, _pad, name='pad')
 
+def conv2d_nchw_imp(Input, Filter, Output, stride=[1,1], padding=[[0,0],[0,0]]):
+    with hcl.for_(0,Output.shape[0]) as n:
+      with hcl.for_(0,Output.shape[1]) as c:
+        with hcl.for_(0,Output.shape[2]) as h:
+          with hcl.for_(0,Output.shape[3]) as w:
+            partial = hcl.scalar(0)
+            with hcl.for_(0,Filter.shape[-2]) as x:
+              with hcl.for_(0,Filter.shape[-1]) as y:
+                partial.v += Input[n][c][h+x][w+y] * Filter[0][0][x][y] 
+            Output[n,c,h,w] = partial
+
 def conv2d_nchw(Input, Filter, name="conv2d", stride=[1,1], padding=[[0,0],[0,0]]):
     out_dtype = Input.dtype
     batch, in_channel, in_height, in_width = Input.shape

diff --git a/hlib/rocc-ppac b/hlib/rocc-ppac
diff --git a/python/heterocl/__init__.py b/python/heterocl/__init__.py
@@ -3,6 +3,7 @@
 from .compute_api import *
 from .dsl import *
 from .types import *
+from .devices import *
 from .nparray import *
 from .debug import hcl_excepthook
 from .tvm.intrin import *

diff --git a/python/heterocl/api.py b/python/heterocl/api.py
@@ -53,7 +53,7 @@ def app2(A, B, C):
         # execute f2
     """
     # set the configurations
-    config.init_dtype = init_dtype
+    config.init_dtype  = init_dtype
     # initialize global variables
     Schedule.stage_ops = []
     Schedule.last_stages = OrderedSet([])
@@ -90,7 +90,7 @@ def placeholder(shape, name=None, dtype=None):
     """
     name = util.get_name("placeholder", name)
     dtype = util.get_dtype(dtype)
-
+    
     if shape == ():
         return Scalar(tvm_api._Var(name, dtype))
     tensor = Tensor(shape, dtype, name)

diff --git a/python/heterocl/debug.py b/python/heterocl/debug.py
@@ -45,6 +45,11 @@ class TensorError(HCLError):
     def __init__(self, msg):
         HCLError.__init__(self, msg, "\33[1;31m[Tensor]\33[0m ")
 
+class DeviceError(HCLError):
+    """A subclass for specifying device related exception"""
+    def __init__(self, msg):
+        HCLError.__init__(self, msg, "\33[1;31m[Device]\33[0m ")
+
 def hcl_excepthook(etype, value, tb):
     """Customized excepthook
 

diff --git a/python/heterocl/devices.py b/python/heterocl/devices.py
@@ -0,0 +1,278 @@
+"""Define HeteroCL device types"""
+#pylint: disable=too-few-public-methods, too-many-return-statements
+from .debug import DeviceError
+from .tools import option_table, model_table
+from future.utils import with_metaclass
+
+class tooling(type):
+    def __getattr__(cls, key):
+        if key in option_table:
+           return cls(key, *option_table[key])
+        else: # unsupported device
+           raise DeviceError("not supported")
+
+class tool(with_metaclass(tooling, object)):
+    """The base class for all device tooling
+
+    mode (sim/impl) is decided by tool configuration
+    e.g. run sw emulation by passing gcc / vivado_hls arg
+    and actual impl by passing sdaccel / aocl arg 
+
+    Parameters
+    ----------
+    types: str
+        Device of device to place data
+    model: str
+        Model of device to place date
+    """
+    def __init__(self, name, mode, kwargs):
+        self.name = name
+        self.mode = mode
+        self.options = kwargs
+
+    def __getattr__(self, entry):
+        return self.mapping[entry] 
+
+    def __call__(self, mode, setting={}):
+        self.mode = mode
+        self.options = setting
+        return self
+
+    def __str__(self):
+        return str(self.name) + "-" + \
+               str(self.mode) + ":\n" + \
+               str(self.options)
+
+    def __repr__(self):
+        return str(self.name) + "-" + \
+               str(self.mode) + ":\n" + \
+               str(self.options)
+
+tool_table = {
+  "aws_f1"      : tool("sdaccel", *option_table["sdaccel"]),
+  "zc706"       : tool("vivado_hls", *option_table["vivado_hls"]),
+  "ppac"        : tool("rocket", *option_table["rocket"]),
+  "stratix10_sx": tool("aocl", *option_table["aocl"]),
+  "llvm"        : tool("llvm", *option_table["llvm"])
+}
+
+class Device(object):
+    """The base class for all device types
+
+    The default data placement is on CPU.
+
+    Parameters
+    ----------
+    types: str
+        Device of device to place data
+    model: str
+        Model of device to place date
+    """
+    def __init__(self, types, vendor, 
+                 model, **kwargs):
+        self.vendor = vendor
+        self.types = types
+        self.model = model
+        self.impls = {"lang": ""}
+        for key, value in kwargs.items(): 
+            self.impls[key] = value
+
+    def __getattr__(self, key):
+        """ device hierarchy """
+        return self.impls[key] 
+
+    def set_lang(self, lang):
+        assert lang in \
+            ["opencl", "hlsc", "c", "opengl", "merlinc", "cuda", "metal"], \
+            "unsupported lang sepc " + lang
+        self.impls["lang"] = lang
+        return self
+
+class CPU(Device):
+    """cpu device with different models"""
+    def __init__(self, vendor, model, **kwargs):
+        if vendor not in ["riscv", "arm", "intel", "sparc", "powerpc"]: 
+            raise DeviceError(vendor + " not supported yet")
+        assert "cpu_" + model in model_table[vendor], \
+            model + " not supported yet"
+        super(CPU, self).__init__("CPU", vendor, model, **kwargs)
+    def __repr__(self):
+        return "cpu-" + self.vendor + "-" + str(self.model) + \
+               ":" + self.impls["lang"]
+
+class FPGA(Device):
+    """fpga device with different models"""
+    def __init__(self, vendor, model, **kwargs):
+        if vendor not in ["xilinx", "intel"]: 
+            raise DeviceError(vendor + " not supported yet")
+        assert "fpga_" + model in model_table[vendor], \
+            model + " not supported yet"
+        super(FPGA, self).__init__("FPGA", vendor, model, **kwargs)
+    def __repr__(self):
+        return "fpga-" + self.vendor + "-" + str(self.model) + \
+               ":" + self.impls["lang"]
+
+class GPU(Device):
+    """gpu device with different models"""
+    def __init__(self, vendor, model, **kwargs):
+        if vendor not in ["nvidia", "amd"]: 
+            raise DeviceError(vendor + " not supported yet")
+        assert "gpu_" + model in model_table[vendor], \
+            model + " not supported yet"
+        super(GPU, self).__init__("GPU", vendor, model, **kwargs)
+    def __repr__(self):
+        return "gpu-" + self.vendor + "-" + str(self.model) + \
+               ":" + self.impls["lang"]
+
+class PIM(Device):
+    """cpu device with different models"""
+    def __init__(self, vendor, model, **kwargs):
+        if model not in ["ppac"]: 
+            raise DeviceError(model + " not supported yet")
+        super(PIM, self).__init__("PIM", vendor, model, **kwargs)
+    def __repr__(self):
+        return "pim-" + str(self.model)
+
+dev_table = {
+  "aws_f1" : [CPU("intel", "e5"), FPGA("xilinx", "xcvu19p")],
+  "zc706" : [CPU("arm", "a9"), FPGA("xilinx", "xc7z045")],
+  "rocc-ppac" : [CPU("riscv", "riscv"), PIM("ppac", "ppac")],
+  "stratix10_sx": [CPU("arm", "a53"), FPGA("intel", "stratix10_gx")]
+}
+
+class env(type):
+    """The platform class for compute environment setups
+    
+     serves as meta-class for attr getting
+     default platform: aws_f1, zynq, ppac
+
+    Parameters
+    ----------
+    host: str
+        Device of device to place data
+    model: str
+        Model of device to place date
+    """
+    def __getattr__(cls, key):
+        if key == "aws_f1":
+            devs = dev_table[key]
+            host = devs[0].set_lang("opencl")
+            xcel = devs[1].set_lang("hlsc")
+        elif key == "zc706":
+            devs = dev_table[key]
+            host = devs[0].set_lang("hlsc")
+            xcel = devs[1].set_lang("hlsc")
+        elif key == "llvm":
+            devs = None 
+            host = None 
+            xcel = None 
+        elif key == "ppac":
+            devs = dev_table["rocc-ppac"]
+            host = devs[0].set_lang("c")
+            xcel = None 
+        else: # unsupported device
+            raise DeviceError("not supported")
+        tool = tool_table[key]
+        return cls(key, devs, host, xcel, tool)
+
+class platform(with_metaclass(env, object)):
+    def __init__(self, name, devs, host, xcel, tool):
+        self.name = name
+        self.devs = devs
+        self.host = host
+        self.xcel = xcel
+        self.tool = tool
+
+        if isinstance(host, CPU):
+            self.cpu = host
+        if isinstance(xcel, FPGA):
+            self.fpga = xcel
+        elif isinstance(xcel, PIM) and \
+             xcel.model == "ppac":
+            self.ppac = xcel
+
+    def __getattr__(self, key):
+        """ return tool options """
+        return self.tool.__getattr__(key)
+
+    def __call__(self, tooling=None):
+        if tooling: # check and update
+            assert isinstance(tooling, tool)
+            self.tool = tooling
+        return self
+
+    def __str__(self):
+        return str(self.name) + "(" + \
+               str(self.host) + " : " + \
+               str(self.xcel) + ")"
+
+    def __repr__(self):
+        return str(self.name) + "(" + \
+               str(self.host) + " : " + \
+               str(self.xcel) + ")"
+
+def device_to_str(dtype):
+    """Convert a device type to string format.
+
+    Parameters
+    ----------
+    dtype : Device or str
+        The device type to be converted
+
+    Returns
+    -------
+    str
+        The converted device type in string format.
+    """
+    if isinstance(dtype, Device):
+        if isinstance(dtype, CPU):
+            return "cpu_" + str(dtype.model)
+        elif isinstance(dtype, FPGA):
+            return "fpga_" + str(dtype.model)
+    else:
+        if not isinstance(dtype, str):
+            raise DeviceError("Unsupported device type format")
+        return dtype
+
+def device_to_hcl(dtype):
+    """Convert a device type to Heterocl type.
+
+    Parameters
+    ----------
+    dtype : Device or str
+        The device type to be converted
+
+    Returns
+    -------
+    Device
+    """
+    if isinstance(dtype, Device):
+        return dtype
+    elif isinstance(dtype, str):
+        device, model = dtype.split("_") 
+        if device == "cpu":
+            return CPU(model)
+        elif device == "gpu":
+            return GPU(model)
+        elif device == "fpga":
+            return FPGA(model)
+        else:
+            raise DeviceError("Unrecognized device type")
+    else:
+        raise DeviceError("Unrecognized device type format")
+
+def get_model(dtype):
+    """Get the model of a given device type.
+
+    Parameters
+    ----------
+    dtype : Device or str
+        The given device type
+
+    Returns
+    -------
+    str
+    """
+    dtype = dtype_to_hcl(dtype)
+    return dtype.types, dtype.model
+
-Original file line number
+Diff line change
@@ Expand Up / @@ -16,8 +16,6 @@ tags @@
     docs/source/samples
     docs/source/tutorials
     soda_*
-    *.cpp
-    *.h
     out
     # Downloaded files
@@ Expand Down @@