AI model compile API Specification

Module contents

mera module

Mera: Public API for Mera ML compiler stack.

mera.deploy module

Mera Deployer classes

mera.deploy.Deployer: alias of MERADeployer

class mera.deploy.MERADeployer(output_dir: str, overwrite: bool = False)

Bases: _DeployerBase

MERA standard deployer using MERA’s compiler stack.

deploy(model: MeraModel, mera_platform: Platform = Platform.SAKURA_2C, build_config={}, target: Target = Target.Simulator, host_arch: str | None = None, mcu_config={}, vela_config={}, **kwargs)

Launches the compilation of a MERA project for a MERA model using the MERA stack.

Parameters:

model – Model object loaded from mera.ModelLoader
mera_platform – MERA platform architecture enum value
build_config – MERA build configuration dict
target – MERA build target
host_arch – Host architecture to deploy for. If unset, the current host platform is used; provide a value to override it.
mcu_config – Dictionary with user overrides for MCU CCodegen tool. The following fields are allowed: suffix, weight_location, use_x86
vela_config – Dictionary with user overrides for MCU Vela tool. The following fields are allowed: enable_ospi, config, sys_config, accel_config, optimise, memory_mode, verbose_all.

Returns:

The object representing the result of a MERA deployment
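
The mcu_config and vela_config dictionaries only accept the fields listed in the parameter descriptions above. A minimal sketch of a pre-flight check that rejects unknown keys before they are passed to deploy() follows. The helper check_overrides and the two constant names are hypothetical and not part of MERA, the example values are purely illustrative, and this reference does not state whether deploy() performs such validation itself.

```python
# Field names copied from the deploy() parameter descriptions.
MCU_CONFIG_FIELDS = frozenset({"suffix", "weight_location", "use_x86"})
VELA_CONFIG_FIELDS = frozenset({
    "enable_ospi", "config", "sys_config", "accel_config",
    "optimise", "memory_mode", "verbose_all",
})


def check_overrides(overrides, allowed, label):
    """Return a copy of `overrides`, raising ValueError on unknown keys.

    Hypothetical helper: not part of the MERA API.
    """
    unknown = sorted(set(overrides) - set(allowed))
    if unknown:
        raise ValueError(f"{label}: unsupported field(s): {', '.join(unknown)}")
    return dict(overrides)


# Illustrative values only; the accepted value types are not documented here.
mcu_config = check_overrides({"suffix": "_v1"}, MCU_CONFIG_FIELDS, "mcu_config")
vela_config = check_overrides({"verbose_all": True}, VELA_CONFIG_FIELDS, "vela_config")
```

The resulting dictionaries could then be passed as the mcu_config and vela_config arguments of deploy().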

mera.deploy_project module

Mera Deploy Project utilities.

class mera.deploy_project.Layout(value)

Bases: Enum

List of possible data layouts

NCHW = 'NCHW': N batches, Channels, Height, Width.

NHWC = 'NHWC': N batches, Height, Width, Channels.
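
To illustrate how the two layouts relate, the sketch below reorders a 4-D shape tuple from one layout to the other. The function convert_shape is hypothetical and not part of MERA; it takes the layout names as the plain strings used for the enum values above.

```python
def convert_shape(shape, src, dst):
    """Reorder a shape tuple from layout `src` to layout `dst`.

    Hypothetical helper, not part of the MERA API. `src` and `dst` are
    layout strings such as 'NCHW' or 'NHWC'.
    """
    if sorted(src) != sorted(dst) or len(shape) != len(src):
        raise ValueError(f"cannot convert shape {shape} from {src} to {dst}")
    # For each axis letter of the destination, pick the size it had in the source.
    return tuple(shape[src.index(axis)] for axis in dst)


nhwc_shape = convert_shape((1, 3, 480, 640), "NCHW", "NHWC")
nchw_shape = convert_shape(nhwc_shape, "NHWC", "NCHW")
```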

class mera.deploy_project.Target(value)

Bases: Enum

List of possible Mera Target values.

IP = ('IP', False, False): Target HW accelerator. Valid for arm and x86 architectures.

Interpreter = ('Interpreter', True, True): Target sw interpretation of the model in floating point. Only valid for x86

InterpreterBf16 = ('InterpreterBf16', True, True): Target sw interpretation of the model in BF16. Only valid for x86

InterpreterHw = ('InterpreterHw', True, False): Target sw interpretation of the model. Only valid for x86

InterpreterHwBf16 = ('InterpreterHwBf16', True, True): Target IP sw interpretation of the model in BF16. Only valid for x86

MCU = ('MCU', False, True)

MERA2Interpreter = ('MERAInterpreter', True, True)

MERAInterpreter = ('MERAInterpreter', True, True)

Quantizer = ('Quantizer', True, True)

Simulator = ('Simulator', True, False): Target sw simulation of the IP model. Only valid for x86

SimulatorBf16 = ('SimulatorBf16', True, True): Target sw simulation of the IP BF16 model. Only valid for x86

VerilatorSimulator = ('VerilatorSimulator', True, False): Target hw emulation of the IP model. Only valid for x86
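
The host restrictions in the Target listing above can be captured in a small lookup. This is only a sketch: the helper is_documented_for is hypothetical, the architecture strings "x86" and "arm" are assumed spellings (the accepted host_arch values are not given in this reference), and targets with no stated restriction are treated as unrestricted.

```python
# Targets whose description above says "Only valid for x86".
X86_ONLY_TARGETS = frozenset({
    "Interpreter", "InterpreterBf16", "InterpreterHw", "InterpreterHwBf16",
    "Simulator", "SimulatorBf16", "VerilatorSimulator",
})


def is_documented_for(target_name, host_arch):
    """Return False only when the listing explicitly rules the pairing out.

    Hypothetical helper, not part of the MERA API.
    """
    if target_name in X86_ONLY_TARGETS:
        return host_arch == "x86"
    # IP is documented for arm and x86; the rest state no restriction.
    return True
```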

mera.deploy_project.is_mera_project(path: str) → bool

Returns whether a provided path is a MeraProject or not

Parameters: path – Path to check for project existence
Returns: Whether the path belongs to a project

mera.mera_deployment module

Mera Deployment classes

class mera.mera_deployment.DeviceTarget(value)

Bases: Enum

List of possible MERA runtime devices for running IP deployments.

INTEL_IA420 = ('Intel IA420', 3): Target device is an Intel IA420 FPGA board.

SAKURA_1 = ('Sakura-1', 1): Target device is an EdgeCortix’s Sakura-1 ASIC.

SAKURA_2 = ('Sakura-2', 5): Target device is an EdgeCortix’s Sakura-2 ASIC.

XILINX_U50 = ('AMD Xilinx U50', 2): Target device is an AMD Xilinx U50 FPGA board.

property code

class mera.mera_deployment.MeraDeployment(plan_loc, target)

Bases: object

get_runner(device_target: DeviceTarget = DeviceTarget.SAKURA_1, device_ids: int | List[int] | None = None, dynamic_output_list: List[str | int] | None = None) → MeraModelRunner

Prepares the model for running with a given target

Parameters:

device_target – Selects the device run target where the IP deployment will be run. Only applicable for deployments with target=IP. See DeviceTarget enum for a detailed list of possible values.
device_ids – When running in a multi card environment, selects the SAKURA device(s) where the deployment will be run. If unset, MERA will automatically select any available card in the system. Only applicable in the case device_target=DeviceTarget.SAKURA_1
dynamic_output_list – Marks certain outputs so that only a dynamic subset of the data is returned. See special get_output_row() function in MeraModelRunner. This feature is only supported when running in IP.

Returns:

Runner object

class mera.mera_deployment.MeraInterpreterDeployment(model_loc)

Bases: object

get_runner(profiling_mode: bool = False, config_dict: Dict = {}, **kwargs) → MeraInterpreterModelRunner

Prepares the Interpreter for running the model.

Parameters: profiling_mode – Enables collection of node execution times.
Returns: Runner object

class mera.mera_deployment.MeraInterpreterModelRunner(int_runner, int_cfg)

Bases: ModelRunnerBase

display_profiling_table()

get_num_inputs() → int

get_num_outputs() → int

Gets the number of available outputs

Returns: Number of output variables

get_output(output_idx: int = 0) → ndarray

Returns the output tensor given an output id index. run() needs to be called before get_output()

Parameters: output_idx – Index of output variable to query
Returns: Output tensor values in numpy format

get_output_row(row_idx: int, output_idx: int = 0) → ndarray

get_outputs() → List[ndarray]

Returns a list of all output tensors. Equivalent to calling get_output() for each index from 0 to get_num_outputs() - 1.

Returns: List of output tensor values in numpy format
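
The relationship between get_outputs(), get_output() and get_num_outputs() can be written out as a loop. A minimal sketch, assuming runner is any object exposing the two documented methods; the name collect_outputs is hypothetical and not part of MERA.

```python
def collect_outputs(runner):
    """Gather every output tensor by index, one get_output() call per output.

    Hypothetical helper, not part of the MERA API. Valid indices run from
    0 up to, but not including, get_num_outputs().
    """
    return [runner.get_output(i) for i in range(runner.get_num_outputs())]
```

As with get_output(), run() needs to have been called on the runner first.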

get_outputs_dict() → Dict[str, ndarray]

get_power_metrics() → PowerMetrics

Gets the power metrics reported from MERA after a run(). Note that power measurement mode might need to be enabled in order to collect and generate such metrics.

Returns: Container with summary analysis of all collected metrics from MERA.

get_runtime_metrics() → dict

Gets the runtime metrics reported from Mera after a run()

Returns: Dictionary of measured metrics

run() → None: Runs the model with the specified input data. set_input() needs to be called before run()

set_input(data: Dict[str, ndarray])

Sets the input data for running

Parameters: data – Input numpy data tensor or dict of input numpy data tensors if the model has more than one input. Setting multiple inputs should have the format {input_name : input_data}

class mera.mera_deployment.MeraInterpreterPrjDeployment(model_loc, prj): Bases: MeraInterpreterDeployment

class mera.mera_deployment.MeraModelRunner(runner, plan)

Bases: ModelRunnerBase

get_input_handle(name: str, as_numpy: bool = True, dtype: str = 'float32')

Gets the zero-copy handler to the specified model input.

Parameters:

name – Name of the input.
as_numpy – Whether to prepare the handle as a numpy array. Defaults to True.
dtype – Viewer data type.

Returns:

Input data handler.

get_input_names() → List[str]

get_num_outputs() → int

Gets the number of available outputs

Returns: Number of output variables

get_output(output_idx: int = 0) → ndarray

Returns the output tensor given an output id index. run() needs to be called before get_output()

Parameters: output_idx – Index of output variable to query
Returns: Output tensor values in numpy format

get_output_handle(name: str, as_numpy: bool = True, dtype: str = 'float32')

Gets the zero-copy handler to the specified model output.

Parameters:

name – Name of the output.
as_numpy – Whether to prepare the handle as a numpy array. Defaults to True.
dtype – Viewer data type.

Returns:

Output data handler.

get_output_names() → List[str]

get_output_row(row_idx: int, output_idx: int = 0) → ndarray

get_outputs() → List[ndarray]

Returns a list of all output tensors. Equivalent to calling get_output() for each index from 0 to get_num_outputs() - 1.

Returns: List of output tensor values in numpy format

get_outputs_dict() → Dict[str, ndarray]

get_power_metrics() → PowerMetrics

Gets the power metrics reported from MERA after a run(). Note that power measurement mode might need to be enabled in order to collect and generate such metrics.

Returns: Container with summary analysis of all collected metrics from MERA.

get_runtime_metrics() → dict

Gets the runtime metrics reported from Mera after a run()

Returns: Dictionary of measured metrics

run() → None: Runs the model with the specified input data. set_input() needs to be called before run()

set_input(data: ndarray | Dict[str, ndarray] | List[ndarray])

Sets the input data for running

Parameters: data – Input numpy data tensor or dict of input numpy data tensors if the model has more than one input. Setting multiple inputs should have the format {input_name : input_data}

set_named_input(name: str, data: ndarray)

Gets the zero-copy numpy handler and copies data to the device.

Parameters: name – Name of the input.
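
As an illustration of feeding inputs one name at a time, the sketch below combines get_input_names() with set_named_input(). The helper feed_named_inputs is hypothetical and not part of MERA; runner is assumed to be any object exposing those two documented methods, and named_data a mapping from input name to array.

```python
def feed_named_inputs(runner, named_data):
    """Copy one array per model input using set_named_input().

    Hypothetical helper, not part of the MERA API.
    """
    expected = runner.get_input_names()
    # Fail early, before any data is copied, if an input has no data.
    missing = [name for name in expected if name not in named_data]
    if missing:
        raise KeyError(f"missing data for input(s): {missing}")
    for name in expected:
        runner.set_named_input(name, named_data[name])
```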

class mera.mera_deployment.MeraPrjDeployment(plan_loc, prj, target): Bases: MeraDeployment

class mera.mera_deployment.MeraTvmModelRunner(rt_mod)

Bases: ModelRunnerBase

get_num_outputs() → int

Gets the number of available outputs

Returns: Number of output variables

get_output(output_idx: int = 0) → ndarray

Returns the output tensor given an output id index. run() needs to be called before get_output()

Parameters: output_idx – Index of output variable to query
Returns: Output tensor values in numpy format

get_outputs() → List[ndarray]

Returns a list of all output tensors. Equivalent to calling get_output() for each index from 0 to get_num_outputs() - 1.

Returns: List of output tensor values in numpy format

get_power_metrics() → PowerMetrics

Gets the power metrics reported from MERA after a run(). Note that power measurement mode might need to be enabled in order to collect and generate such metrics.

Returns: Container with summary analysis of all collected metrics from MERA.

get_runtime_metrics() → dict

Gets the runtime metrics reported from Mera after a run()

Returns: Dictionary of measured metrics

run() → None: Runs the model with the specified input data. set_input() needs to be called before run()

set_input(data: ndarray | Dict[str, ndarray] | List[ndarray])

Sets the input data for running

Parameters: data – Input numpy data tensor or dict of input numpy data tensors if the model has more than one input. Setting multiple inputs should have the format {input_name : input_data}

class mera.mera_deployment.ModelRunnerBase

Bases: object

API for runtime inference of a model.

abstract get_num_outputs() → int

Gets the number of available outputs

Returns: Number of output variables

abstract get_output(output_idx: int = 0) → ndarray

Returns the output tensor given an output id index. run() needs to be called before get_output()

Parameters: output_idx – Index of output variable to query
Returns: Output tensor values in numpy format

abstract get_outputs() → List[ndarray]

Returns a list of all output tensors. Equivalent to calling get_output() for each index from 0 to get_num_outputs() - 1.

Returns: List of output tensor values in numpy format

abstract get_power_metrics() → PowerMetrics

Gets the power metrics reported from MERA after a run(). Note that power measurement mode might need to be enabled in order to collect and generate such metrics.

Returns: Container with summary analysis of all collected metrics from MERA.

abstract get_runtime_metrics() → dict

Gets the runtime metrics reported from Mera after a run()

Returns: Dictionary of measured metrics

abstract run() → None: Runs the model with the specified input data. set_input() needs to be called before run()

abstract set_input(data: ndarray | Dict[str, ndarray] | List[ndarray])

Sets the input data for running

Parameters: data – Input numpy data tensor or dict of input numpy data tensors if the model has more than one input. Setting multiple inputs should have the format {input_name : input_data}
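
The call order implied by ModelRunnerBase (set_input(), then run(), then the output getters) can be sketched as a small helper. This is a minimal sketch rather than part of MERA: the name infer is hypothetical, and runner is assumed to be any object implementing the methods documented above, such as one returned by get_runner().

```python
def infer(runner, data):
    """Run one inference and return all output tensors.

    Hypothetical helper, not part of the MERA API. `runner` only needs
    the ModelRunnerBase methods used below.
    """
    runner.set_input(data)       # must happen before run()
    runner.run()                 # must happen before reading outputs
    return runner.get_outputs()  # list with one entry per model output
```

Because the helper relies only on the abstract interface, it would apply equally to any of the concrete runner classes listed in this module.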

mera.mera_deployment.load_mera_deployment(path: str, target: Target | None = None)

Loads an already built deployment from a directory

Parameters:

path – Directory of a Mera deployment project or full directory of built mera results
target – Selects which target to load when multiple targets were built in the MERA project. Optional when not loading a project or when only a single target was built.

Returns:

Reference to deployment object
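
Loading a built deployment and obtaining a runner from it could be wrapped as follows. This is a sketch under stated assumptions: open_runner is a hypothetical name, and the loader is passed in as an argument (expected to behave like mera.mera_deployment.load_mera_deployment) so that the example does not depend on MERA being installed.

```python
def open_runner(load_deployment, path, target=None, **runner_kwargs):
    """Load a built deployment and return a runner for it.

    Hypothetical helper, not part of the MERA API. `load_deployment` is
    expected to behave like mera.mera_deployment.load_mera_deployment.
    """
    # target may stay None when the path holds a single built target.
    deployment = load_deployment(path, target=target)
    # Extra keyword arguments are forwarded unchanged to get_runner().
    return deployment.get_runner(**runner_kwargs)
```

Which keyword arguments get_runner() accepts depends on the deployment class returned, as described in the class entries above.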

mera.mera_model module

Mera Model classes.

class mera.mera_model.Mera2ModelQuantized(prj, model_name, model_path)

Bases: MeraModel

MeraModel class of a model quantized with MERA2 tools.

class mera.mera_model.MeraModel(prj, model_name, model_path, use_prequantize_input=False, save_model=False)

Bases: object

Base class representing a ML model compatible with MERA deployment project.

get_input_shape(input_name: str | None = None) → Tuple[int]

Utility method to query the shape of an input variable of the model

Parameters: input_name – Specifies which input to get the shape from. If unset, assumes there is only one input.
Returns: A tuple with 4 items representing the shape of the input variable in the model.
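
One use of the returned shape is to work out how many elements an input buffer must hold. A minimal sketch, assuming model is any object with the get_input_shape() method documented above; the helper name input_element_count is hypothetical and not part of MERA.

```python
import math


def input_element_count(model, input_name=None):
    """Return the number of elements in one input tensor of `model`.

    Hypothetical helper, not part of the MERA API.
    """
    shape = model.get_input_shape(input_name)
    # Product of all dimensions, e.g. batch * channels * height * width.
    return math.prod(shape)
```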

property input_desc

class mera.mera_model.MeraModelExecutorch(prj, model_name, model_path)

Bases: MeraModel

Specialization of MeraModel for an ExecuTorch/EXIR ML model.

class mera.mera_model.MeraModelOnnx(prj, model_name, model_path, batch_num, shape_mapping, model_info)

Bases: MeraModel

Specialization of MeraModel for an ONNX ML model.

class mera.mera_model.MeraModelTflite(prj, model_name, model_path, use_prequantize_input)

Bases: MeraModel

Specialization of MeraModel for a TFLite ML model.

class mera.mera_model.ModelLoader(deployer=None)

Bases: object

Utility class for loading and converting ML models into models compatible with MERA

Parameters: deployer (mera.deploy.TVMDeployer) – Reference to a MERA deployer class. If None is provided, information about the model will not be added to the deployment project.

from_executorch(model_path: str, model_name: str | None = None) → MeraModelExecutorch

Converts a PyTorch model in ExecuTorch/EXIR format (.pte) into a compatible model for MERA.

Parameters:

model_path – Path to the PyTorch model file in ExecuTorch format (.pte)
model_name – Display name of the model being deployed. Will default to the stem name of the model file if not provided.

Returns: