High level API

ONNXRunTime.InferenceSession (Type)
(o::InferenceSession)(inputs [, output_names])

Run an InferenceSession on a collection of inputs. The inputs can be either a NamedTuple or an AbstractDict. If output_names is passed, only the outputs whose names it contains are computed.
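
Example (a sketch following the package README; the increment2x3.onnx test model and its "input"/"output" tensor names come from there):

    using ONNXRunTime
    const ORT = ONNXRunTime

    # The test model takes a 2×3 Float32 tensor named "input".
    path = ORT.testdatapath("increment2x3.onnx")
    model = ORT.load_inference(path)

    input = Dict("input" => randn(Float32, 2, 3))
    model(input)              # Dict("output" => ...)
    model(input, ["output"])  # compute only the listed outputs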

ONNXRunTime.load_inference (Method)
load_inference(
    path::AbstractString;
    execution_provider,
    envname,
    logging_level,
    provider_options
) -> InferenceSession

Load an ONNX file at path into an inference session.

Keyword arguments:

  • execution_provider: Either :cpu or :cuda. The latter requires a CUDA-capable GPU, and the CUDA and cuDNN packages must first be imported.
  • envname: Name used for logging purposes.
  • logging_level: Level of diagnostic output. Options are :verbose, :info, :warning (default), :error, and :fatal.
  • provider_options: Named tuple with options passed to the execution provider.

Note: Due to limitations of the C API's CreateEnv function, envname and logging_level can only be set once per process; subsequent attempts to change them are ignored.
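
Example (a sketch; "model.onnx" is a placeholder path, and the provider_options field follows OrtCUDAProviderOptions below):

    using ONNXRunTime: load_inference

    # CPU session with more verbose diagnostics:
    model = load_inference("model.onnx"; logging_level=:info)

    # CUDA session; import the GPU packages first:
    # using CUDA, cuDNN
    # model = load_inference("model.onnx";
    #     execution_provider=:cuda,
    #     provider_options=(; device_id=0))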

ONNXRunTime.release (Method)
release(o::InferenceSession)::Nothing

Release memory allocated to an InferenceSession. This also happens automatically when the object has gone out of scope and the garbage collector deletes it.

However, there is no guarantee when that happens, so it can be useful to manually release the memory. This is especially true when the model has allocated GPU memory, which does not put pressure on the garbage collector to run promptly.

Using the inference session after releasing it is an error.
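
Example (a sketch with a placeholder path and input name):

    using ONNXRunTime: load_inference, release

    model = load_inference("model.onnx")
    try
        model(Dict("input" => randn(Float32, 2, 3)))
    finally
        release(model)  # free native (possibly GPU) memory promptly
    end
    # Using `model` after this point is an error.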


Low level API

ONNXRunTime.CAPI.ONNXTensorElementDataType (Type)
ONNXRunTime.CAPI.ONNXTensorElementDataType

CEnum with possible values:

  • ONNX_TENSOR_ELEMENT_DATA_TYPE_UNDEFINED
  • ONNX_TENSOR_ELEMENT_DATA_TYPE_FLOAT
  • ONNX_TENSOR_ELEMENT_DATA_TYPE_UINT8
  • ONNX_TENSOR_ELEMENT_DATA_TYPE_INT8
  • ONNX_TENSOR_ELEMENT_DATA_TYPE_UINT16
  • ONNX_TENSOR_ELEMENT_DATA_TYPE_INT16
  • ONNX_TENSOR_ELEMENT_DATA_TYPE_INT32
  • ONNX_TENSOR_ELEMENT_DATA_TYPE_INT64
  • ONNX_TENSOR_ELEMENT_DATA_TYPE_STRING
  • ONNX_TENSOR_ELEMENT_DATA_TYPE_BOOL
  • ONNX_TENSOR_ELEMENT_DATA_TYPE_FLOAT16
  • ONNX_TENSOR_ELEMENT_DATA_TYPE_DOUBLE
  • ONNX_TENSOR_ELEMENT_DATA_TYPE_UINT32
  • ONNX_TENSOR_ELEMENT_DATA_TYPE_UINT64
  • ONNX_TENSOR_ELEMENT_DATA_TYPE_COMPLEX64
  • ONNX_TENSOR_ELEMENT_DATA_TYPE_COMPLEX128
  • ONNX_TENSOR_ELEMENT_DATA_TYPE_BFLOAT16
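
Example (a sketch, assuming the values are exported from ONNXRunTime.CAPI and keep the integer values of the ONNX Runtime C API, where UNDEFINED is 0 and FLOAT is 1):

    using ONNXRunTime.CAPI

    Int(ONNX_TENSOR_ELEMENT_DATA_TYPE_FLOAT)                              # 1
    ONNXTensorElementDataType(1) === ONNX_TENSOR_ELEMENT_DATA_TYPE_FLOAT  # true
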
ONNXRunTime.CAPI.OrtCUDAProviderOptions (Method)
OrtCUDAProviderOptions(;
    device_id,
    cudnn_conv_algo_search,
    gpu_mem_limit,
    arena_extend_strategy,
    do_copy_in_default_stream,
    has_user_compute_stream,
    user_compute_stream,
    default_memory_arena_cfg
) -> ONNXRunTime.CAPI.OrtCUDAProviderOptions
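
Example (a sketch; all keywords are optional and the shown values are illustrative, with field meanings taken from the ONNX Runtime C struct of the same name):

    using ONNXRunTime.CAPI

    cuda_options = OrtCUDAProviderOptions(;
        device_id = 0,               # index of the GPU to use
        gpu_mem_limit = 4 * 1024^3,  # memory arena limit in bytes
    )
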
ONNXRunTime.CAPI.OrtLoggingLevel (Type)
ONNXRunTime.CAPI.OrtLoggingLevel

CEnum with possible values:

  • ORT_LOGGING_LEVEL_VERBOSE
  • ORT_LOGGING_LEVEL_INFO
  • ORT_LOGGING_LEVEL_WARNING
  • ORT_LOGGING_LEVEL_ERROR
  • ORT_LOGGING_LEVEL_FATAL
ONNXRunTime.CAPI.OrtMemType (Type)
ONNXRunTime.CAPI.OrtMemType

CEnum with possible values:

  • OrtMemTypeCPUInput
  • OrtMemTypeCPUOutput
  • OrtMemTypeCPU
  • OrtMemTypeDefault
ONNXRunTime.CAPI.CreateAllocator (Method)
CreateAllocator(
    api::ONNXRunTime.CAPI.OrtApi,
    session::ONNXRunTime.CAPI.OrtSession,
    meminfo::ONNXRunTime.CAPI.OrtMemoryInfo
) -> ONNXRunTime.CAPI.OrtAllocator
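
Example (a sketch, assuming an existing api and a loaded session as produced by CreateSession below; CreateCpuMemoryInfo follows the package README):

    meminfo = CreateCpuMemoryInfo(api)
    allocator = CreateAllocator(api, session, meminfo)
    # The allocator is consumed by e.g. SessionGetInputName further down
    # (the index is assumed 0-based, as in the C API):
    name = SessionGetInputName(api, session, 0, allocator)
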
ONNXRunTime.CAPI.CreateSession (Method)
CreateSession(
    api::ONNXRunTime.CAPI.OrtApi,
    env::ONNXRunTime.CAPI.OrtEnv,
    model_path::AbstractString,
    options::ONNXRunTime.CAPI.OrtSessionOptions
) -> ONNXRunTime.CAPI.OrtSession
ONNXRunTime.CAPI.CreateTensorWithDataAsOrtValue (Method)
CreateTensorWithDataAsOrtValue(
    api::ONNXRunTime.CAPI.OrtApi,
    memory_info::ONNXRunTime.CAPI.OrtMemoryInfo,
    data::Vector,
    shape
) -> ONNXRunTime.CAPI.OrtValue

Return a tensor with shape shape that is backed by the memory of data.
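
Example (a sketch, assuming an existing api and CPU memory_info; ONNX Runtime stores tensors row-major while Julia arrays are column-major, so the buffer is permuted first, as in the package README):

    x = randn(Float32, 2, 3)
    tensor = CreateTensorWithDataAsOrtValue(api, memory_info,
                                            vec(permutedims(x, (2, 1))), size(x))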

ONNXRunTime.CAPI.GetApi (Function)
GetApi(
    api_base::ONNXRunTime.CAPI.OrtApiBase
) -> ONNXRunTime.CAPI.OrtApi
GetApi(
    api_base::ONNXRunTime.CAPI.OrtApiBase,
    ort_api_version::Integer
) -> ONNXRunTime.CAPI.OrtApi
ONNXRunTime.CAPI.GetDimensions (Function)
GetDimensions(
    api::ONNXRunTime.CAPI.OrtApi,
    o::ONNXRunTime.CAPI.OrtTensorTypeAndShapeInfo
) -> Vector{Int64}
GetDimensions(
    api::ONNXRunTime.CAPI.OrtApi,
    o::ONNXRunTime.CAPI.OrtTensorTypeAndShapeInfo,
    ndims
) -> Vector{Int64}
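
Example (a sketch, assuming the wrapper exposes the C API's GetTensorTypeAndShape to obtain the OrtTensorTypeAndShapeInfo of an OrtValue):

    info = GetTensorTypeAndShape(api, tensor)
    dims = GetDimensions(api, info)  # e.g. Int64[2, 3]
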
ONNXRunTime.CAPI.GetTensorMutableData (Method)
GetTensorMutableData(
    api::ONNXRunTime.CAPI.OrtApi,
    tensor::ONNXRunTime.CAPI.OrtValue
) -> PermutedDimsArray{T, N, perm, iperm, Array{T1, N1}} where {T, N, perm, iperm, T1, N1}
ONNXRunTime.CAPI.Run (Method)
Run(
    api::ONNXRunTime.CAPI.OrtApi,
    session::ONNXRunTime.CAPI.OrtSession,
    run_options::Union{Nothing, ONNXRunTime.CAPI.OrtRunOptions},
    input_names::Vector{String},
    inputs::Vector{ONNXRunTime.CAPI.OrtValue},
    output_names::Vector{String}
) -> Vector{ONNXRunTime.CAPI.OrtValue}
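
Example (an end-to-end sketch of the low level API, closely following the package README; the increment2x3.onnx test model and its "input"/"output" names come from there):

    using ONNXRunTime.CAPI
    using ONNXRunTime: testdatapath

    api = GetApi()
    env = CreateEnv(api; name="demo")
    so = CreateSessionOptions(api)
    session = CreateSession(api, env, testdatapath("increment2x3.onnx"), so)

    mem = CreateCpuMemoryInfo(api)
    x = randn(Float32, 2, 3)
    input = CreateTensorWithDataAsOrtValue(api, mem,
                                           vec(permutedims(x, (2, 1))), size(x))

    # run_options may be `nothing`, per the signature above:
    outputs = Run(api, session, nothing, ["input"], [input], ["output"])
    y = GetTensorMutableData(api, only(outputs))
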
ONNXRunTime.CAPI.SessionGetInputName (Method)
SessionGetInputName(
    api::ONNXRunTime.CAPI.OrtApi,
    session::ONNXRunTime.CAPI.OrtSession,
    index::Integer,
    allocator::ONNXRunTime.CAPI.OrtAllocator
) -> String
ONNXRunTime.CAPI.into_julia (Method)
into_julia(
    _::Type{T},
    api::ONNXRunTime.CAPI.OrtApi,
    objptr::Ref{Ptr{Nothing}},
    status_ptr::Ptr{Nothing},
    gchandles
) -> Any

Create a Julia object from the output of an API call. Check and release status_ptr.

ONNXRunTime.CAPI.release (Function)
release(api::OrtApi, obj)::Nothing

Release memory owned by obj. The garbage collector should call this function automatically. If it does not, that's a bug that should be reported.

There can, however, be situations of high memory pressure where it helps to call this function manually to release memory earlier. Using an object after releasing it is undefined behaviour.
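
Example (a sketch, reusing an api and a tensor such as the OrtValue created above):

    release(api, tensor)  # free the native memory now rather than waiting for GC
    # `tensor` must not be used after this call.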

ONNXRunTime.CAPI.unsafe_GetTensorMutableData (Method)
unsafe_GetTensorMutableData(
    api::ONNXRunTime.CAPI.OrtApi,
    tensor::ONNXRunTime.CAPI.OrtValue
) -> PermutedDimsArray{T, N, perm, iperm, Array{T1, N1}} where {T, N, perm, iperm, T1, N1}

This function is unsafe, because its output points to memory owned by tensor. After tensor is released, accessing the output becomes undefined behaviour.
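
Example (a sketch of the usual defensive pattern: copy the unsafe view while the tensor is known to be alive, then use only the copy; GC.@preserve is used here on the assumption that it keeps `tensor` rooted for the duration of the call):

    y = GC.@preserve tensor copy(unsafe_GetTensorMutableData(api, tensor))
    # `y` owns its memory and remains valid after `tensor` is released.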
